Week 12
• Remove duplicates
• Remove irrelevant data
• Standardize capitalization
• Convert data type
• Handle missing values
Remove Duplicates
• When you collect data from several different sources, or
scrape it, you will likely end up with duplicated entries.
• Duplicates skew your data and can distort your results, for
example by counting the same record more than once.
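As a minimal sketch of this step, the plain-Python example below drops repeated records and keeps the first occurrence of each. The function name `remove_duplicates`, the `key_fields` parameter, and the sample records are all hypothetical, chosen for illustration. It assumes two rows are duplicates when they match on every listed field.

```python
def remove_duplicates(rows, key_fields):
    """Return rows with duplicates removed, keeping the first occurrence.

    Two rows count as duplicates when they agree on every field in key_fields.
    """
    seen = set()
    unique_rows = []
    for row in rows:
        # Build a hashable key from the fields that define "the same record".
        key = tuple(row[field] for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique_rows.append(row)
    return unique_rows


# Hypothetical records merged from two sources; the third row repeats the first.
records = [
    {"name": "Alice", "email": "alice@example.com"},
    {"name": "Bob", "email": "bob@example.com"},
    {"name": "Alice", "email": "alice@example.com"},
]

deduplicated = remove_duplicates(records, key_fields=("name", "email"))
print(len(records), "->", len(deduplicated))  # prints "3 -> 2"
```

Choosing the key fields is a judgment call: matching on every field removes only exact copies, while matching on a single identifier such as an email address also removes near-duplicates.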
Remove Irrelevant Data
• Irrelevant data (fields or records that have no bearing on the
question you are asking) will slow down and confuse any
analysis that you want to do.
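One possible way to remove irrelevant data is sketched below in plain Python. It filters out records that fall outside the scope of the analysis and keeps only the fields that matter. The helper `keep_relevant`, its parameters, and the customer records are hypothetical. What counts as "relevant" depends entirely on the question being asked, so the rule used here (UK customers, name and age only) is just an assumed example.

```python
def keep_relevant(rows, relevant_fields, is_relevant):
    """Keep only the rows that pass is_relevant, and only the listed fields."""
    return [
        {field: row[field] for field in relevant_fields}
        for row in rows
        if is_relevant(row)
    ]


# Hypothetical customer records; assume the analysis concerns UK customers' ages.
customers = [
    {"name": "Alice", "age": 34, "country": "UK", "favourite_colour": "blue"},
    {"name": "Bob", "age": 41, "country": "FR", "favourite_colour": "red"},
    {"name": "Chen", "age": 29, "country": "UK", "favourite_colour": "green"},
]

uk_customers = keep_relevant(
    customers,
    relevant_fields=("name", "age"),
    is_relevant=lambda row: row["country"] == "UK",
)
print(uk_customers)
# prints [{'name': 'Alice', 'age': 34}, {'name': 'Chen', 'age': 29}]
```

The original list is left untouched, so the discarded rows and fields remain available if the scope of the analysis changes later.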