Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 1

Dear Lead – (Data Science Team),

Based on the EDA performed on Gala Groceries Sales Data, several insights and patterns have been
uncovered, providing a proper understanding of the dataset and potential key points for further
analysis. However, it is important to note that the EDA is limited by the amount and quality of the data
available.

Here are my findings on the Sample Sales data:

 Fruits and Vegetables are most frequently purchased product categories, spices and herbs are
least purchased product categories
 Low priced products are sold more compared to high priced product
 If we considered ‘payment_type’ column as cash and cashless (i.e cashless being for credit card,
e-wallet, debit card), there are more cashless purchases then cash purchases.
 11th, 16th and 18th hour of the day are top 3 hours when Sales was more. Fruits , Vegetables,
Packed food , Caned food and Baked food is being sold most in these hours.
 There is not much correlation among numerical features , ‘unit_price’ and ‘total’ have high
correlation but this is not a useful correlation as total is calculated using unit_price.

I would recommend on this points for further analysis

 The current sample is only from 1 store and 1 week worth of data, we need data from all the
stores associated with Gala Groceries and data of more duration to better stock items.
 We need more columns or features which adds more of stock-sales point of view than customer
–sales.
 We need to frame the specific problem statement that we want to solve. The current business
problem is too broad, we should narrow down the focus in order to deliver a valuable end
product

Best regards,
Abhishek Kaddipudi

You might also like