Page cover

Data Quality

Data quality is a contingent concept - exactly what constitutes quality will be highly dependent on the audience for the data. Further, research shows that 'strange things' such as errors and outliers in the data may actually help users engage with the data by providing an opportunity for them to question the data.

The first step in the data quality process is map out exactly what your version of data quality looks like.

The 4 steps of data quality

Source: A Survey of Data Quality Requirements That Matter in Machine Learning Pipelines

Last updated