EDA On Gala Groceries Sample Sales Data
The EDA performed on the Gala Groceries sales data uncovered several insights and patterns that
give a clearer understanding of the dataset and point to areas for further analysis. However, it
is important to note that the EDA is limited by the amount and quality of the data available.
Fruits and vegetables are the most frequently purchased product categories, while spices and
herbs are the least frequently purchased.
Low-priced products sell more than high-priced products.
If we group the ‘payment_type’ column into cash and cashless (i.e., cashless covering credit card,
e-wallet, and debit card), there are more cashless purchases than cash purchases.
The 11th, 16th, and 18th hours of the day are the top three hours for sales. Fruits, vegetables,
packaged food, canned food, and baked food sell the most in these hours.
There is little correlation among the numerical features. ‘unit_price’ and ‘total’ are highly
correlated, but this is not a useful correlation because total is calculated using unit_price.
The current sample covers only one store and one week of data. To stock items better, we need
data from all the stores associated with Gala Groceries, covering a longer period.
We need more columns or features that reflect a stock-sales point of view rather than a
customer-sales one.
We need to frame the specific problem statement that we want to solve. The current business
problem is too broad; we should narrow the focus in order to deliver a valuable end product.
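For reference, the checks behind the findings above could be reproduced roughly as follows. This is a minimal sketch using only the Python standard library, not the code used for the EDA itself. The column names ‘payment_type’, ‘unit_price’ and ‘total’ come from the dataset; ‘timestamp’, ‘category’ and ‘quantity’, the helper names, and the six sample rows are assumptions made up for illustration and are not values from the Gala Groceries sample.

```python
from collections import Counter
from datetime import datetime
from math import sqrt

# Hypothetical rows for illustration only; NOT taken from the real sample.
# 'timestamp', 'category' and 'quantity' are assumed column names.
rows = [
    {"timestamp": "2022-03-01 11:05:00", "category": "fruit",
     "unit_price": 1.0, "quantity": 2, "total": 2.0, "payment_type": "cash"},
    {"timestamp": "2022-03-01 11:40:00", "category": "vegetables",
     "unit_price": 0.5, "quantity": 4, "total": 2.0, "payment_type": "credit card"},
    {"timestamp": "2022-03-01 16:10:00", "category": "fruit",
     "unit_price": 1.0, "quantity": 1, "total": 1.0, "payment_type": "e-wallet"},
    {"timestamp": "2022-03-01 16:30:00", "category": "baked goods",
     "unit_price": 3.0, "quantity": 1, "total": 3.0, "payment_type": "debit card"},
    {"timestamp": "2022-03-02 11:55:00", "category": "fruit",
     "unit_price": 1.0, "quantity": 3, "total": 3.0, "payment_type": "cash"},
    {"timestamp": "2022-03-02 18:20:00", "category": "spices and herbs",
     "unit_price": 6.0, "quantity": 1, "total": 6.0, "payment_type": "credit card"},
]

# Payment types treated as cashless, as described in the summary.
CASHLESS = {"credit card", "e-wallet", "debit card"}


def payment_group(payment_type):
    """Map a raw payment type to 'cash' or 'cashless'."""
    return "cashless" if payment_type in CASHLESS else "cash"


def top_hours(rows, n=3):
    """Return the n hours of the day with the most transactions."""
    hours = Counter(
        datetime.strptime(r["timestamp"], "%Y-%m-%d %H:%M:%S").hour for r in rows
    )
    return [hour for hour, _ in hours.most_common(n)]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Purchases per category, cash vs cashless split, busiest hours, and the
# unit_price/total correlation (expected to be high, since total is
# derived from unit_price).
category_counts = Counter(r["category"] for r in rows)
payment_counts = Counter(payment_group(r["payment_type"]) for r in rows)
busiest_hours = top_hours(rows)
price_total_corr = pearson(
    [r["unit_price"] for r in rows], [r["total"] for r in rows]
)

print(category_counts.most_common())
print(payment_counts)
print(busiest_hours)
print(round(price_total_corr, 3))
```

With the full dataset, the same counts and correlation could be computed after loading the CSV into a list of dictionaries (for example with csv.DictReader, converting the numeric columns to float first).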
Best regards,
Abhishek Kaddipudi