Professional Documents
Culture Documents
INFS 6018: Managing Business Intelligence Week 3: Dirty Data and Data Quality
INFS 6018: Managing Business Intelligence Week 3: Dirty Data and Data Quality
– Molyneaux 2002
Demographic Human characteristics data eg age, sex etc
· Absence of Data
· Multipurpose Fields
· Cryptic Data
· Contradicting Data
Right data for Accuracy The data correctly defines the event
what happened
Defined
performance Validity The data fall between acceptable ranges defined by the business
Apples vs. oranges Consistency The data elements are consistently defined and understood
– Integrity – Is the structure of data and relationships among entities and attributes
maintained consistently?