Professional Documents
Culture Documents
Data Cleansing: - Vishal Kumar 07IT910 - Karishma Verma 07IT927
Data Cleansing: - Vishal Kumar 07IT910 - Karishma Verma 07IT927
CLEANSING
-Vishal Kumar
07IT910
-Karishma Verma
07IT927
WHAT IS DATA
CLEANSING?
Data cleansing or data scrubbing is the act of
detecting and correcting (or removing) corrupt or
inaccurate records from a record set, table, or
database. Used mainly in databases, the term
refers to identifying incomplete, incorrect,
inaccurate, irrelevant etc. parts of the data and
then replacing, modifying or deleting this dirty
data
WHY DATA CLEANSING ?
After cleansing, a data set will be consistent with
other similar data sets in the system. The
inconsistencies detected or removed may have been
originally caused by different data dictionary
definitions of similar entities in different stores, may
have been caused by user entry errors, or may have
been corrupted in transmission or storage.
DATA QUALITY
High quality data needs to pass a set of quality
criteria. Those include: