Course Outline
Introduction
Overview of Data Cleaning
- Why is Data Cleaning Important?
Case Study: When Big Data Is Dirty
Developing A Thorough Data Cleaning Strategy
Common Data Cleaning Tools
- Drake
- OpenRefine
- Pandas (for Python)
- Dplyr (for R)
Achieving High Data Integrity
- Complete
- Correct
- Accurate
- Relevant
- Consistent
Automating the Data Cleaning Process
Monitoring Your Data Cleaning System
Summary and Conclusion
Requirements
- An understanding of data analytics concepts.
Audience
- Data Scientists
- Data Analysts
- Business Analysts
Testimonials (7)
Examples/exercices perfectly adapted to our domain
Luc - CS Group
Course - Scaling Data Analysis with Python and Dask
The fact of having more practical exercises using more similar data to what we use in our projects (satellite images in raster format)
Matthieu - CS Group
Course - Scaling Data Analysis with Python and Dask
How big data work, data programs, greater knowledge of how our current world works using data
Ozayr Hussain - Vodacom
Course - A Practical Introduction to Data Analysis and Big Data
the clarity with which he explained the entire course, as well as the willingness to return to the syllabus when necessary
Carlos Eloy - AMERICAN EXPRESS COMPANY MEXICO
Course - Data Analytics With R
Machine Translated
I liked that the trainer made sure we all understood and were following the lectures. if we had a problem, he stopped and helped us fix it.
Cesar - AMERICAN EXPRESS COMPANY MEXICO
Course - Data Analytics With R
The practical side of the training.
Patrick - Vodacom PTy Ltd
Course - A Practical Introduction to Data Analysis and Big Data
I really was benefit from the real life practical examples.