INFO8095-22S-SEC1-Big Data Analytics - Group 4

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 10

BIG DATA - EXECUTIVE

PRESENTATION

Team Members;
Syed Isabat Hussain Rizvi
Rohit Tiwari
Vivek Chaudhary
Jvalant Pandya
Project analysisTABLE
slide
OF 2
CONTENTS

PROJECT SUMMARY

MODEL-DIMENSIONAL PROJECT SUMMARY OF LEARNINGS

MODEL-CLASSIFICATION MODEL-TEXT
PROJECT SUMMARY

This collaborative Originally, as part of The data was gathered Various models are
project exemplifies The data Preparation, in order to extract 50 created based on the
some of the important the team became distinct types of items, data collected, such
stages of the Big Data completely acquainted date of purchase, as:
Lifecycle. It focuses on with the data; in this store, and sales
Data Preparation, case, Grocery Bills information
Model Planning, and were gathered by the encompassing
Model Construction. team. quantity, transaction,
and price, among Entity Relationship Diagram
other things. "ERD in Star Schema format"
is built to depict how the
dimensional model would
appear for the 'Grocery Bill'
data, ensuring all fields have
names and all Primary and
Foreign Keys are labelled
correctly.
Tables
◦ Sales Table – Sales Table is the main table that contains
“DateKey”, “StoreKey”, “ProductKey”, “ProductDepartment”,
“ProductCost”, “TransactionID”, “DollarSales”, “UnitSold”.
◦ Product Table – The product table contains a unique Product key
"ProductKey", A column with the names of the products
"ProductDescription", a column that defines the product’s
department "ProductDepartment" and the cost of each product
"ProductCost".
◦ Date Table – The Date table consist of a unique Date key
"DateKey", Days "Day", Months“ Month", and Years "Year"
columns.
◦ Store Table – This table have the list of Stores and their store keys.
"StoreKey" and "StoreName".
Store Table Sales Table:
DateKey Date Table
StoreKey
DIMENSIONAL ProductKey
MODEL ProductDepartment Product
ProductCost Table
TransactionID
DollarSales
UnitsSold
CLASSIFICATIO
N MODEL
DECISION TREE

Classification Model
Decision Tree

Yes: 180 VEGETABLES No: 102

Buy FROZEN Buy FRESH

100 80 75 27

People Willingness People Willingness


Preferring to Pay < Following to Pay <
A quick & Market Price Healthy Market Price
Convenient Life-style
Life-style
TEXT MODEL
SUMMARY OF LEARNINGS
◦ Dimensional model – The dimensional model shows that in
total we have four tables' Sales, Product, Date and Store. The
sales table is the main table that has the primary keys of the
other three tables and where they are located. It gives us a full
picture of the dataset and the components in it.
◦ Classification Model – The classification model despite the
decision tree helps in drawing some conclusions from the input
values given. In this case, the calculations help to know more
about the type of eggs customers buy, Enriched or conventional.
◦ Text Model – The text model indicates the Term frequency of
the products. It tells us that the highest sold product is on the left
and the least is on the right.
QUESTIONS?
◦ Selection of attributes for splitting in a decision tree with the help
of an algorithm? Explain?
◦ What is the right strategy to manage a business problem that needs
a type of model?
◦ Provide a comparison between the Dimensional and classification
models?
◦ Which model is the most suitable fit for prediction? Why?
◦ Is a dimensional model more advised to understand data in a
database? If yes, Explain?
◦ Which model will be a good fit for analysts based on different
situations?
THANK YOU

You might also like