Bsa s01 s02 Ppt-In-class

Business Statistics and Analysis

Gaurav KS

July 30, 2022

e-mail: gauravks@iima.ac.in

Gaurav KS Business Statistics and Analysis July 30, 2022 1 / 125


Outline

Course overview: components, evaluation, policy


What and why of “Business Statistics” and “Analysis”
Real life examples from business
Data versus Information
Measures of data (variable types)
Approaches to collecting data: cross-section, time-series, panel
Example data-set (from the class)
Overview/navigation of Syllabus/Topics with the data-set
Descriptive Statistics



Course Evaluation

Quiz: 20%
Group Assignments: 30%
MidTerm: 25%
EndTerm: 35%

“Business Statistics” and “Analysis”

Objective/Purpose
Using quantitative modeling to help companies/individuals make better
decisions and improve performance.

Business Statistics
Application of statistical tools to business and managerial problems for the
purpose of decision making.

Statistics
Study of numerical data, facts, figures and measurements.
Statistics is used to convert raw numerical data into useful information.

(Data) Analysis
Includes data description, inference, and the search for relationships in
data.

Skills needed as a Business Analyst


Information management skills to manage the data.
Analytics skills and tools to understand the data.
Data-oriented culture to act on data.



Example data:

Objective
Suppose one wants to assess fitness/obesity in a class.

Fitness criteria: Body mass index (BMI)

Survey Questionnaire:
1 Name:
2 Gender:
3 Age:
4 Height (in metres):
5 Weight (in kg):
6 Rate your fitness on a scale of 1 (lowest) to 5 (highest):
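As a rough sketch of how the height and weight answers feed the fitness criterion: BMI is weight in kilograms divided by the square of height in metres. The function name, the two sample respondents, and the cut-off of 30 for obesity (the commonly cited WHO adult threshold) are assumptions made for illustration; they are not part of the slides.

```python
def bmi(weight_kg: float, height_m: float) -> float:
    """Body mass index: weight (kg) divided by height (m) squared."""
    if height_m <= 0:
        raise ValueError("height must be positive")
    return weight_kg / height_m ** 2

# Hypothetical responses (weight in kg, height in metres); not real class data.
responses = [
    {"name": "A", "weight_kg": 70.0, "height_m": 1.75},
    {"name": "B", "weight_kg": 95.0, "height_m": 1.70},
]

OBESE_CUTOFF = 30.0  # assumed threshold (commonly cited WHO adult cut-off)

for r in responses:
    r["bmi"] = bmi(r["weight_kg"], r["height_m"])
    r["obese"] = r["bmi"] >= OBESE_CUTOFF

# Share of the (hypothetical) class classified as obese.
share_obese = sum(r["obese"] for r in responses) / len(responses)
print(f"Share classified as obese: {share_obese:.0%}")
```

Because height and weight are ratio-scale variables, dividing one by (the square of) the other is a meaningful operation; the same would not be true of the 1 to 5 fitness rating.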



Variables and Scales of Measurement

Qualitative versus Quantitative

Qualitative
A variable that is described verbally rather than numerically
Also known as categorical data

Quantitative
A variable that takes meaningful numerical values

Qualitative: Nominal, Binary, Ordinal

Nominal
Variables that store labelled information without any order or quantitative
value
The name “nominal” comes from the Latin word “nomen”, which means
“name”
We cannot perform arithmetic on these data, and they have no meaningful
order to sort by
Also known as categorical data

Examples
Nationality (Indian, German, American)
Relationship status (Single, Live-in, Committed, Complicated, Married, Widowed,
Open)
Gender (Male, Female, Others)
Eye Color (Black, Brown, etc.)

Binary or Dichotomous
A categorical variable that can only take one of two values
For example: Male and Female, True and False, Day and Night, Pass and
Fail, etc.

Ordinal
Variables whose values have a natural ordering by their position on the scale.
Commonly used for observations such as customer satisfaction, happiness, etc.
Considered “in-between” qualitative and quantitative data.

Examples
Feedback, experience, or satisfaction ratings collected by companies on a scale of 1 to 10
Letter grades in an exam (A, B, C, D, etc.)
Ranking of people in a competition (First, Second, Third, etc.)
Economic Status (High, Medium, and Low)
Education Level (Higher, Secondary, Primary)
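
One way to see what a natural ordering allows: ordinal labels can be sorted and a median category is meaningful, even though the distance between levels is not defined. The sketch below uses the Economic Status levels from the examples above; the responses are hypothetical.

```python
import statistics

# Ordered levels, lowest to highest (from the "Economic Status" example).
LEVELS = ["Low", "Medium", "High"]
RANK = {level: i for i, level in enumerate(LEVELS)}

# Hypothetical responses, for illustration only.
responses = ["High", "Low", "Medium", "Low", "High", "Medium", "Low"]

# Sorting by rank is meaningful for ordinal data.
ordered = sorted(responses, key=RANK.get)

# The median category is meaningful; median_low always returns an observed rank.
median_level = LEVELS[statistics.median_low([RANK[r] for r in responses])]

print(ordered)
print(median_level)
```

The integer ranks are used only for ordering. Averaging them would assume equal spacing between levels, which an ordinal scale does not guarantee.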

Quantitative: Discrete and Continuous

Discrete
Discrete data are numerical values determined by counting; they take specific,
separate values, typically whole numbers.
Discrete data refer to individual, countable items.
Synonyms for the word “discrete” are disconnected, separate, and distinct, so
when plotted the values appear as isolated points.
Discrete data are countable (though not necessarily finite) and are usually
whole numbers.

Examples
Total number of students present in a class
Number of employees in a company
The total number of players who participated in a competition
Days in a week

Continuous
Continuous data can take any value within a range, including fractional
values, and are obtained by measuring rather than counting.
Continuous data involve quantities that are not simply countable but require
detailed measurement, such as length, weight, or elapsed time.
When plotted, the possible values form an unbroken line.

Examples
Height of a person
Speed of a vehicle
“Time-taken” to finish the work
Market share price



Quantitative: Continuous: Interval Scale and Ratio Scale

Interval Scale
Can be categorized
Can be ranked
Differences between scale values are equal
No true zero point as the origin

Example
Temperature in degrees Celsius or Fahrenheit

Ratio Scale
Can be categorized
Can be ranked
Differences between scale values are equal
A true zero point as the origin

Ratio scale is essentially an Interval scale with a true zero point.
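
A small worked example of why the true zero matters, assuming temperature in degrees Celsius as the interval scale and kelvin as its ratio-scale counterpart: differences survive the change of scale, but ratios computed on the Celsius values do not describe the physical quantity. The variable names are illustrative.

```python
def celsius_to_kelvin(c: float) -> float:
    """Convert degrees Celsius (interval scale) to kelvin (ratio scale)."""
    return c + 273.15

t1_c, t2_c = 10.0, 20.0

# Differences are meaningful on an interval scale and survive the conversion.
diff_c = t2_c - t1_c
diff_k = celsius_to_kelvin(t2_c) - celsius_to_kelvin(t1_c)

# Ratios need a true zero: 20 degrees C is not "twice as hot" as 10 degrees C.
ratio_c = t2_c / t1_c                                        # 2.0, an artefact of the zero point
ratio_k = celsius_to_kelvin(t2_c) / celsius_to_kelvin(t1_c)  # roughly 1.035

print(diff_c, round(diff_k, 6), ratio_c, round(ratio_k, 3))
```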

Example data:

Survey Questionnaire:
1 Name: Nominal
2 Gender: Nominal/Binary
3 Age: Continuous/Ratio-scale
4 Height (in metres): Continuous/Ratio-scale
5 Weight (in kg): Continuous/Ratio-scale
6 Rate your fitness on a scale of 1 (lowest) to 5 (highest): Ordinal
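
The classification above determines which summaries make sense for each column. The sketch below applies a common rule of thumb (mode for nominal, median for ordinal, mean for ratio-scale variables) to hypothetical responses. The records, field names, and the `summarize` helper are invented for illustration, and Name is left out because it only identifies the respondent.

```python
import statistics

# Measurement scale of each survey field, as classified above.
SCALE = {
    "gender": "nominal",
    "age": "ratio",
    "height_m": "ratio",
    "weight_kg": "ratio",
    "fitness": "ordinal",
}

# Hypothetical responses; not real class data.
rows = [
    {"gender": "F", "age": 24, "height_m": 1.62, "weight_kg": 55.0, "fitness": 4},
    {"gender": "M", "age": 27, "height_m": 1.78, "weight_kg": 82.0, "fitness": 2},
    {"gender": "M", "age": 25, "height_m": 1.70, "weight_kg": 68.0, "fitness": 3},
]

def summarize(field: str):
    """Return a summary statistic appropriate to the field's scale."""
    values = [row[field] for row in rows]
    scale = SCALE[field]
    if scale == "nominal":
        return statistics.mode(values)        # most frequent label
    if scale == "ordinal":
        return statistics.median_low(values)  # middle rank, always an observed value
    return statistics.mean(values)            # interval/ratio: arithmetic mean

for field in SCALE:
    print(field, summarize(field))
```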



Types of Data Collection



Cross-section and Time-series

Cross-section
Data collected by observing many subjects (such as individuals, firms,
countries, or regions) at a single point or period in time

Examples
To measure current obesity levels in a population, we could draw a random
sample of 1,000 people from that population (also known as a cross
section of that population), measure their weight and height, and calculate
what percentage of that sample is categorized as obese
Student grades at the end of the current semester
Household data for the previous year: expenditure on food, unemployment,
income, etc.
Car data: average speed, horsepower, color, etc.



Time-series

A time series is a series of data points indexed (or listed or graphed) in time order.

Examples
India’s monthly inflation for the past 5 years
Height of a person, measured once every month

Note: A time series differs from cross-sectional data because the ordering of the
observations conveys important information.

Cross-section and time-series together: Panel

Panel data (or longitudinal data) combine cross-sectional and time-series
ideas: they track how the same subjects (firms, individuals, etc.) change
over time.
Panel data differ from pooled cross-sectional data across time: panel data
follow the same subjects in different periods, whereas pooled cross-sections
observe different subjects in different periods.
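
The three collection approaches can be sketched as three views of the same records. The firms, years, and revenue figures below are hypothetical and serve only to show the shape of each data type.

```python
# Hypothetical (subject, year, revenue) records; values are invented.
records = [
    ("FirmA", 2020, 10.0), ("FirmA", 2021, 12.0), ("FirmA", 2022, 15.0),
    ("FirmB", 2020, 7.0),  ("FirmB", 2021, 6.5),  ("FirmB", 2022, 8.0),
]

# Cross-section: many subjects, one period.
cross_section = {s: v for s, t, v in records if t == 2021}

# Time series: one subject, observations kept in time order.
time_series = [
    (t, v) for s, t, v in sorted(records, key=lambda r: r[1]) if s == "FirmA"
]

# Panel: the same subjects followed across periods.
panel = {}
for s, t, v in records:
    panel.setdefault(s, {})[t] = v

# Within-subject change is computable only because the same subjects repeat.
growth = {s: obs[2022] - obs[2020] for s, obs in panel.items()}

print(cross_section)
print(time_series)
print(growth)
```

With pooled cross-sections the subjects would differ from year to year, so the last step (change per subject) would not be possible.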



Quiz

Variable quiz: “How’s the Josh?”, Answer

Qualitative
Ordinal

Variable quiz: “Gabbar is back”, Answer

Quantitative
Discrete

Variable quiz: Choose the correct response - 1, Answer

How does ordinal data differ from nominal data?

1 Nominal data is a name, while ordinal data is a number
2 Nominal data only distinguishes, while ordinal data also offers magnitude
information
3 Nominal data can be a name or number, while ordinal data can only be a
number
4 Nominal data can only be a name, while ordinal data can be a name or number

Answer: 2. Nominal data has the identity property and helps to distinguish between
individual data points. Ordinal data has both the identity and magnitude
properties and helps order the data points in a specific way.

Variable quiz: Choose the correct response - 2, Answer

“The sequential list according to which the batsmen in a cricket team
would come out to bat.” Which of the following data types does this
data set belong to?
1 Nominal
2 Ordinal
3 Ratio
4 Interval

Answer: 2. A sequential list has an inherent order and, therefore, is ordinal data.

Variable quiz: Choose the correct response - 3, Answer

A group of 10 people were shown 15 photographs. Each person was
asked to choose their favourite photo, and the choices were recorded.
What is the data type of the recorded data?
1 Nominal
2 Ordinal
3 Ratio
4 Interval

Answer: 1. The choice of a particular photo conveys no order, only
identity. Even if the photos are numbered 1-15, the numbers still represent
identity and not magnitude.

Variable quiz: Choose the correct response - 4, Answer

What is the type of data scale marked on a measuring tape?

1 Integer
2 Ratio
3 Nominal
4 Discrete

Answer: 2. A ratio scale offers identity, magnitude, equidistant points, and a true
zero, and the measuring tape scale has all four properties.

Variable quiz: Choose the correct response - 5, Answer

A researcher doing a blind experiment got the respondent data coded
with numbers in a column named “Respondent-ID”. What data type
is it?
1 Ordinal
2 Continuous
3 Interval
4 Nominal

Answer: 4. The code refers to the identity of the respondent and does not convey any
other information. Even though the “Respondent-ID” is numeric, it is
nominal data.

Variable quiz: Choose the correct response - 6, Answer

What is the data type for rainfall measured in mm?

1 Integer data, numeric
2 Integer data, discrete
3 Ratio scale, continuous
4 Ratio scale, discrete

Answer: 3. Rainfall data can take any value on the scale and is, therefore, continuous.
Rainfall in mm has a true zero (indicating no rainfall at all) and is,
therefore, ratio scale.

Variable quiz: Fill in the blanks

For each of the following variables, indicate whether the variable is
continuous, discrete, ordinal, nominal or binary.
1 Importance of political party affiliation to people (very, somewhat, or not very
important): Ordinal
2 Minutes spent reading yesterday: Continuous
3 Favourite type of book (fiction, nonfiction): Binary
4 Average amount of sleep in the last 7 days: Continuous
5 Number of abnormal cells from a karyotype genetic test: Discrete
6 Weights of adult men in kg: Continuous
7 Country of residence in the UK (England, Scotland, Wales, Northern Ireland):
Nominal
8 Stage of cancer (1,2,3,4,5): Ordinal
9 Number of mistakes in my presentation: Discrete
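
The variable type decides which summaries are meaningful. The sketch below is one way to encode that idea; the function name `summarize`, the `MEANINGFUL` table and the sample values are illustrative assumptions, not part of the course material.

```python
from statistics import mean, median, mode

# Summaries that are meaningful at each measurement level (a common convention).
MEANINGFUL = {
    "nominal": ("mode",),
    "binary": ("mode",),
    "ordinal": ("mode", "median"),
    "discrete": ("mode", "median", "mean"),
    "continuous": ("median", "mean"),
}


def summarize(values, level, order=None):
    """Return only the summaries that make sense for the given level.

    For ordinal data, `order` lists the categories from lowest to highest.
    """
    allowed = MEANINGFUL[level]
    out = {}
    if "mode" in allowed:
        out["mode"] = mode(values)
    if "median" in allowed:
        if level == "ordinal":
            # Rank the categories, then take the middle rank
            # (the upper middle one when the count is even).
            ranks = sorted(order.index(v) for v in values)
            out["median"] = order[ranks[len(ranks) // 2]]
        else:
            out["median"] = median(values)
    if "mean" in allowed:
        out["mean"] = mean(values)
    return out


# Hypothetical responses, for illustration only.
countries = ["England", "Wales", "England"]
importance = ["very", "somewhat", "not very", "somewhat", "somewhat"]
mistakes = [3, 0, 2, 2, 1]

print(summarize(countries, "nominal"))
print(summarize(importance, "ordinal", order=["not very", "somewhat", "very"]))
print(summarize(mistakes, "discrete"))
```

Averaging country names is meaningless, which is why the nominal case reports only the mode, while the count of mistakes supports all three summaries.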



Types of Data Collection



Cross-section and Time-series

Cross-section
Data collected by observing many subjects (such as individuals, firms,
countries, or regions) at a single point or period in time.



Time-series
A time series is a series of data points indexed (or listed or graphed) in time order.

Examples
India’s monthly inflation for the past 5 years.
Height of a person, measured once every month.


Cross-section and time-series together: Panel

Panel data (or longitudinal data) combines the cross-sectional and
time-series ideas: it follows the same subjects (firms, individuals, etc.)
and records how they change over time.
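
A small sketch may make the three layouts concrete. The firms, years and sales figures below are made up for illustration, and the helper names `cross_section` and `time_series` are assumptions of this sketch.

```python
# Hypothetical panel: (firm, year, sales). Values are invented for illustration.
panel = [
    ("A", 2020, 10.0), ("A", 2021, 12.0), ("A", 2022, 15.0),
    ("B", 2020, 7.0), ("B", 2021, 6.5), ("B", 2022, 8.0),
    ("C", 2020, 20.0), ("C", 2021, 21.0), ("C", 2022, 19.5),
]


def cross_section(rows, year):
    """Many subjects observed at one point in time."""
    return {firm: sales for firm, y, sales in rows if y == year}


def time_series(rows, firm):
    """One subject observed repeatedly, returned in time order."""
    ordered = sorted(rows, key=lambda r: r[1])
    return [(y, sales) for f, y, sales in ordered if f == firm]


print(cross_section(panel, 2021))  # all firms in 2021
print(time_series(panel, "B"))     # firm B across the years
```

Fixing the year gives a cross-section, fixing the firm gives a time series, and keeping both dimensions gives the panel.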



Business Applications

Applications

1 Demand forecasting
2 GDP growth
3 Course grading of PRM students
4 Whether the grades across sections are significantly different from each other

A Seven-Step Modeling Process
1 Define the problem
2 Collect and summarize data
3 Develop a model
4 Verify the model
5 Select one or more suitable decisions
6 Present the results to the organization
7 Implement the model and update it over time

Course Overview and Schedule of Sessions

Session 2 & 3: Descriptive Statistics

Objective
To extract meaningful information from data.

How
Descriptive measure or summary statistic
Visuals (Charts)

Descriptive measures
30% of the students in the class are female.
Average height of students is 5.6 ft.

Visuals
Scatter plot
Frequency plot, Histograms
Pie chart
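
The two descriptive measures quoted above can be computed in a few lines. This is a minimal sketch on a made-up class of ten students; the data and field names are assumptions chosen so the results match the figures on the slide.

```python
from collections import Counter
from statistics import mean

# Hypothetical class data (gender, height in feet), for illustration only.
students = [
    ("F", 5.2), ("F", 5.4), ("F", 5.3),
    ("M", 5.8), ("M", 5.7), ("M", 5.9), ("M", 5.6),
    ("M", 5.5), ("M", 6.0), ("M", 5.6),
]

# Frequency table for a categorical variable.
gender_counts = Counter(gender for gender, _ in students)

# Proportion for a categorical variable, average for a continuous one.
share_female = gender_counts["F"] / len(students)
avg_height = mean(height for _, height in students)

print(f"{share_female:.0%} of the students are female")
print(f"Average height is {avg_height:.1f} ft")
```

The frequency table in `gender_counts` is also the input a bar or pie chart would be drawn from.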

Session 4: Linear Transformation and Standardization

Objective
To understand the concept of standardization and how it can help in making
comparisons.
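
As a rough illustration of why standardization helps comparison: a z-score expresses a value as the number of standard deviations it lies from its group mean. The section means, standard deviations and scores below are hypothetical, and the function names are assumptions of this sketch.

```python
from statistics import mean, pstdev


def z_score(x, group_mean, group_sd):
    """Distance of x from the group mean, in standard deviations."""
    return (x - group_mean) / group_sd


def standardize(values):
    """Standardize a whole list; the result has mean 0 and SD 1."""
    m = mean(values)
    s = pstdev(values)  # population SD; statistics.stdev gives the sample SD
    return [z_score(v, m, s) for v in values]


# Hypothetical example: which score is better relative to its own section?
score_a = z_score(80, group_mean=70, group_sd=10)  # 1 SD above the mean
score_b = z_score(75, group_mean=60, group_sd=5)   # 3 SDs above the mean
print(score_a, score_b)

print(standardize([50, 60, 70, 80, 90]))
```

Although 80 is the higher raw score, the 75 stands out more within its own section. Standardization is itself a linear transformation: subtract a constant, then divide by a constant.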

Session 5: Correlation and Covariance

Objective
Introduction to bivariate analysis, the idea behind correlation, and why
correlation does not imply causation.

Correlation

Correlation tells us about association:
how two variables are related to each other

Examples
Temperature and attendance at outdoor events
The age of a car and its value
Years of education and annual earnings
People’s telephone numbers and their IQs
Miles driven and amount of fuel consumed
Amount of smoking and incidence of lung cancer

Correlation: Plot the Data

Correlation as an association
How two variables are related to each other

Correlation: Scatter Plot

From the three scatter plots, what can we say about the relation between the
two variables?

Correlation: Scatter plot inferences

The relation between the two variables:

Correlation Coefficient
Correlation
How two variables are related to each other
Two aspects of (co)relation: Direction and Strength
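
Both aspects can be read off the Pearson correlation coefficient: its sign gives the direction and its magnitude (between 0 and 1) gives the strength of the linear relation. The sketch below computes it from the definition; the car data are invented for illustration.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical data: age of a car (years) and its resale value (thousands).
age = [1, 2, 3, 5, 8]
value = [900, 800, 650, 500, 250]

r = pearson_r(age, value)
direction = "negative" if r < 0 else "positive"
print(f"r = {r:.3f} ({direction})")
```

A value near -1 or +1 indicates a strong linear relation, a value near 0 a weak one. A strong correlation on its own still says nothing about which variable, if either, causes the other.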

Session 6 & 7: Probability and Probability Distribution

Objective
To introduce the notion of probability and to understand how probability can
help in decision making under uncertainty.

Session 8: Conditional Probability, Bayes’ Theorem, and
Their Applications

Objective
To understand conditional probability and Bayes’ Theorem and how they help in
better decision making.
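
A short worked example may help fix the idea: Bayes' Theorem updates a prior probability once new evidence arrives. The quality-control numbers below are hypothetical, and the function name `posterior` is an assumption of this sketch.

```python
def posterior(prior, p_evidence_if_true, p_evidence_if_false):
    """P(hypothesis | evidence) by Bayes' Theorem."""
    # Total probability of seeing the evidence at all.
    evidence = prior * p_evidence_if_true + (1 - prior) * p_evidence_if_false
    return prior * p_evidence_if_true / evidence


# Hypothetical inspection: 1% of items are defective, the test flags 95% of
# defective items and wrongly flags 10% of good items.
p_defective_given_flag = posterior(
    prior=0.01,
    p_evidence_if_true=0.95,
    p_evidence_if_false=0.10,
)
print(f"P(defective | flagged) = {p_defective_given_flag:.3f}")
```

Under these assumed rates most flagged items are still good, because defects are rare to begin with. This is the kind of result that matters when deciding whether a flag should trigger a costly action.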

Session 9 & 10: Normal, Binomial, Poisson and
Exponential Distributions: Applications

Objective
To get familiar with the concept of distributions and how different distributions
can be used in the business world to analyze data.

Session 11: Sampling

Objective
To introduce the different types of probabilistic sampling techniques.

Session 13: Central Limit Theorem

Objective
To understand a useful statistical theorem: for large samples, the distribution
of the sample mean is approximately normal, whatever the shape of the population
distribution (provided its variance is finite).
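
The theorem can be checked with a small simulation. The sketch below draws repeated samples from a skewed (exponential) population and looks at the sample means; the population, sample size, number of samples and seed are all arbitrary choices made for illustration.

```python
import random
from statistics import mean, stdev


def simulate_sample_means(sample_size, n_samples, seed=0):
    """Draw n_samples samples from an Exponential(1) population
    and return the mean of each sample."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    return [
        mean([rng.expovariate(1.0) for _ in range(sample_size)])
        for _ in range(n_samples)
    ]


means = simulate_sample_means(sample_size=50, n_samples=2000)

# Exponential(1) has mean 1 and SD 1, so the theorem predicts sample means
# centred near 1 with spread near 1 / sqrt(50).
print(f"mean of sample means: {mean(means):.3f}")
print(f"SD of sample means:   {stdev(means):.3f}")
print(f"predicted SD:         {1 / 50 ** 0.5:.3f}")
```

Plotting a histogram of `means` would show a roughly bell-shaped curve even though the underlying population is strongly skewed.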

Session 14 & 15: Estimation

Objective
To understand the logic of estimation and how estimation helps in drawing
meaningful conclusions about a population.

Session 16 & 17: Testing of Hypothesis

Objective
To understand how to generate and test hypotheses.

Session 18: ANOVA

Objective
To understand the concept of ANOVA and its real-life applications.

Session 19 & 20: Regression Analysis

Objective
To introduce the concept of regression and how it can be used to make
predictions.
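
As a preview, simple linear regression fits a straight line through the data by least squares and then uses that line to predict. The sketch below implements the textbook formulas directly; the advertising and sales figures are hypothetical, and the function names are assumptions of this sketch.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = my - slope * mx
    return slope, intercept


def predict(x, slope, intercept):
    """Predicted y at a new x, using the fitted line."""
    return intercept + slope * x


# Hypothetical data: advertising spend and sales (same arbitrary units).
spend = [1, 2, 3, 4, 5]
sales = [12, 15, 21, 24, 28]

slope, intercept = fit_line(spend, sales)
print(f"sales = {intercept:.2f} + {slope:.2f} * spend")
print(f"predicted sales at spend 6: {predict(6, slope, intercept):.2f}")
```

The slope is read as the expected change in sales for one extra unit of spend. Predictions far outside the observed range of `spend` should be treated with caution.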

Session Summary

QUESTIONS

THANK YOU
