Data Quality Assessment-Sprocket Central Pty LTD

You might also like

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 2

Greetings,

Thank you for sharing the data sets with us. We have conducted a data quality assessment and our
findings are given in the table below. We require your feedback on these in order to fix it.

We recommend NewCustomerList, CustomerDemographic and CustomerAddress to be merged into one


table. Only two tables are sufficient. One for transactions and one for customers.

Data governance standards should be adopted to avoid eliminate data quality issues.

Please feel free to contact me for any clarification.

Issue ID Table Name Name of Detailed Description Date Raised/ Client


Issue Added Feedbac
k
1 Transactions Blank Value 360 rows are blank in 23-Nov-20  
column 'online_order'
2 Transactions Blank Value 197 rows are blank in 23-Nov-20  
'brand', 'product_line',
'product_class',
'product_size',
'standard_cost',
'product_first_sold_dat
e' for product id 0
3 NewCustomerList Blank Value 29 rows are blank in 23-Nov-20  
column 'last_name'
4 NewCustomerList Blank Value 17 rows are blank in 23-Nov-20  
column 'DOB'
5 NewCustomerList Custom 54 rows are formatted 23-Nov-20  
Format as yyyy"-"mm"-"dd.
These values are not
recognised as norrmal
date in column 'DOB'
6 NewCustomerList Text Value 932 rows are formatted 23-Nov-20  
as text values. These
values are not
recognised as norrmal
date in column 'DOB'
7 NewCustomerList Blank Value 29 rows are blank in 23-Nov-20  
column 'job_title'
8 NewCustomerList n/a value 165 rows are filled with 23-Nov-20  
value n/a in column
'job_industry_category'
9 NewCustomerList Hidden 4 columns were 23-Nov-20  
Columns identified with no
header values. The
purpose of these
columns are not clear
10 NewCustomerList Missing Join Customer ID is not 23-Nov-20  
Key available in this table
11 CustomerDemographi Multiple Multiple values are 23-Nov-20  
c values available in In column
'gender' to indicate
same gender. e.g. F,
Female, Femal
12 CustomerDemographi Blank Value 125 rows are blank in 23-Nov-20  
c column 'last_name'
13 CustomerDemographi Custom 3912 rows are 23-Nov-20  
c Format formatted as
yyyy"-"mm"-"dd. These
values are not
recognised as norrmal
date in column 'DOB'
14 CustomerDemographi Blank Value 87 rows are blank in 23-Nov-20  
c column 'DOB'
15 CustomerDemographi Text Value one row is formatted as 23-Nov-20  
c text value. This values is
not recognised as
norrmal date in column
'DOB'
16 CustomerDemographi Blank Value 506 rows are blank in 23-Nov-20  
c column 'job_title'
17 CustomerDemographi Unknown purpose of default 23-Nov-20  
c values column is not clear
18 CustomerAddress Multiple Multiple values are 23-Nov-20  
values available in In column
'state' to indicate same
state. e.g. NSW, New
South Wales

You might also like