Prepare for the course challenge by reviewing the terms and definitions in the glossary. Then, in the quiz, demonstrate your knowledge of the importance of sample size, data integrity, and the connection between data and business objectives. You will also have an opportunity to apply data-cleaning techniques in both spreadsheets and SQL. Finally, you will document, report on, and verify your data-cleaning process and results.

Learning Objectives

  • Describe statistical measures associated with data integrity, including statistical power, hypothesis testing, and margin of error
  • Describe strategies that can be used to address insufficient data
  • Discuss the importance of sample size with reference to sample bias and random samples
  • Describe the relationship between data and related business objectives
  • Define data integrity with reference to types and risks
  • Describe data-cleaning techniques with reference to identifying errors, redundancy, compatibility, and continuous monitoring
  • Demonstrate an understanding of the use of spreadsheets to clean data
  • Describe how SQL can be used to clean large datasets
  • Describe the benefits of documenting the data-cleaning process
  • Discuss the elements and importance of data-cleaning reports
  • Describe the process involved in verifying the results of cleaning data

Course challenge


Reading: Glossary: Terms and definitions

Quiz: Course challenge