Professional Documents
Culture Documents
Wk1 Intro To Data Analytics
Wk1 Intro To Data Analytics
Taufique Hossain
LARGE-SCALE DATA IS EVERYWHERE!
§Expectations
§Gathered data will have value either for the
purpose collected or for a purpose not
envisioned.
Sensor Networks Computational Simulations
WHY DATA MINING AND ANALYTICS? COMMERCIAL VIEWPOINT
Lots of data is being collected
and warehoused
Web data
Yahoo has Peta Bytes of web data
Facebook has billions of active users
purchases at department/
grocery stores, e-commerce
Amazon handles millions of visits/day
Bank/Credit Card transactions
● Draw conclusions
● Improve businesses,
products, and services
Business analytics, is the use of data, information technology, statistical analysis, quantitative
methods, and computer-based models to help managers make better, fact-based decisions.
BUSINESS ANALYTICS:AREAS
Financial Performance
Analytics Analytics
Risk Analytics
Supply Chain
Analytics
Industry-focused
Customer Analytics
Analytics
Good resource
Effective decision
utilization
making
Good market
insights
TYPES OF ANALYTICS
The three main types of analytics based on the workflow and requirements of data analytics:
Descriptive Prescriptive
Analytics Analytics
Predictive
Analytics
Operationalize Discovery
Deliver final reports, briefs, Learn about business
codes, and technical domain and assess
documents. available resources.
6 1
4 3
● Customer preferences
● Location-based purchases
● Manage inventory
● Forecast demands
Weather
Data provides several insights such as identifying products that people tend to buy in a particular season.
Case Study: Amazon
Such predictions help increase sales and reduce shipping, inventory, and
supply chain costs.
Case Study: Amazon
It is the data that is processed, It is the type of data that lacks It is the data type containing
stored, and retrieved in a fixed any specific form or structure. both structured and
format. unstructured data.
Example: Email
Example: Employee details, Example: CSV and JSON
job positions, and salaries. documents
Analyzing Unstructured Data
Unstructured information is
About 80% of business data is text-heavy and contains data
unstructured. such as dates, numbers, and
facts.
The line between unstructured and semi-structured data is not clearly defined.
Unstructured data has some level of structure in it.
Qualitative and Quantitative Data
Qualitative Data
Data in which classification of objects is
based on attributes and properties.
Example: Softness of skin etc.
Quantitative Data
Data can be measured and expressed
numerically.
Example: Your height and shoe size.
Qualitative and Quantitative Data
Qualitative
Nominal data Ordinal data
Data
Unordered data to which an order is Ordered data that is assigned to
assigned in relation to other named categories in a ranked fashion
categories
Example: Grade classification like pass or fail Example: Feedback to a product with 1–5
for student's test results. ranking.
Subgroups of Quantitative Data
It can only take certain values. It can take any value within a
specified range.
Example: The number of students in a
class Example: Share price of a company
Data Levels of Measurement
It is a classification that describes the nature of information within the values assigned to variables.
Ratio
Interval
Ordinal
Nominal
Data Levels of Measurement
M F
• Categories bear no quantitative relationship to one another
• Examples:
- Example: Temperature in centigrade where the distance
between 80 degrees and 100 degrees is same as the
distance between 1000 degrees and 1020 degrees. 80°C - 100°C = 1000°C - 1020°C
Data Levels of Measurement
● Although properties of ratio measurement are similar to the interval level of measurement, the zero in scale makes it
different from the other levels of measurement.
● Ratios are meaningful
● Never fall below zero
Examples:
- monthly sales
- delivery times
Data for Business Analytics