Professional Documents
Culture Documents
Business Statistics - Session Introduction To Statistics
Business Statistics - Session Introduction To Statistics
Business Statistics - Session Introduction To Statistics
Introduction to Statistics
Session 1:
Introduction to Statistics
Faculty Intro
• Current
– VP & Head, Strategy and Corporate Communication, V-Mart Retail
– Co-founder and Mentor, Liferenew, a fitness and sports events, CSR and employee volunteering, and training solutions
provider to leading corporates like Adobe, Jubilant Foodworks, Tech Mahindra and ST Micro
– Corporate trainer and advisor to CXOs in project management, business intelligence and strategic management
• Professional background
– Sixteen-plus years of corporate experience across research and analytics, brand management and sales, and retail. Ten-plus
years of entrepreneurial and P&L leadership experience in building global delivery centres
– Twelve-plus years of experience as visiting faculty at b-schools; 7+ years of corporate training experience in financial
analysis, storyboarding, business writing, logical thinking and problem-solving, and data analysis skills
– Worked with companies like Religare group, RocSearch, Ranbaxy, Eli Lilly and Wimpy’s
• Academic background
– Dissertation (4th) year in Executive PhD program from IIM Lucknow. 4 papers published in international peer-reviewed
ABDC category journals; several papers presented at international conferences
– PGDM (MBA) from IIM Lucknow, concentration in Marketing and Strategy
– Hotel Management from IHM Lucknow; BA in Pol Science and English
– My alter ego
– Avid trekker and nature enthusiast, Hatha yoga practitioner, amateur chef and travel buff
Agenda for Today!
Hypergeometric distribution
Expected values of Statistics MSTR
P value, z value, t test, F distribution
& MSE under null hypothesis
Kruskal-Wallis test – non-parametric alternative
Standard errors of regression
to one-way ANOVA
estimators
Discrete & continuous random variables
Point Bi-serial correlation
Chebyshev’s theorem
Variance-stabilizing transformations
Negative Binomial distribution
Residual Autocorrelation & Durbin-
Tukey pairwise comparison test
Watson test
Mann-Whitney U test Taguchi methods, R chart, s chart, c
Wilcoxon Signed-Rank test chart
Contingency table analysis – Chi square test Kolmogorov-Smirnov One sample
Spearman rank correlation coefficient test
Poisson distribution
Mortality statistics and survival
analysis
That was ‘Sadistics’, not ‘Statistics’
Perspectives on statistics…
22 Classification of statistics
• The word statistics is derived from the Italian word stato, which means “state”
• Statista refers to a person involved with the affairs of state. Therefore, statistics originally meant the collection of facts
useful to the statista. Statistics in this sense was used in 16th-century Italy and then spread to France, Holland, and
Germany
• There is an evidence of use of some of the principles of statistics by ancient Indian civilization. Some of the techniques
find their mention in Vedic Mathematics. However, the modern statistical methods spread from Italy to France, Holland
and Germany in 16th century
• Theoretical development of modern statistics was during the mid-seventeenth centuries with the introduction of
‘Theory of Probability’ and ‘Theory of Games and Chance’
How did statistics originate? (2/2)
• Many famous problems like ‘the problem of points’ (posed by Chevalier de-Mere), ‘the gambler’s ruin’, etc.
posed by professional gamblers were solved by mathematicians. These solutions laid the foundation to the theory
of probability and statistics. Some of the notable contributors in the development of statistics are: Pascal, Fermat,
James Bernoulli, De-Moivre, Laplace, Gauss Euler, Lagrange, Bayes, Kolmogorov, Karl Pearson and so on. One
of the most significant works in modern times is by Ronald A. Fisher (1890-1962), who is considered to be the
‘Father of Statistics’ by the community of statisticians all over. He applied statistics to diversified fields such as
education, agriculture, genetics, biometry, psychology, etc. He also pioneered ‘Estimation Theory’, ‘Exact
sampling distribution’, ‘Analysis of variance’ and ‘Experimental Design’
• Significant contribution has also been made by Indians in the field of statistics. Prof Prasant Chandra
Mahalanobis, is the first to pioneer the study of statistical science in India. He founded the Indian Statistical
Institute (ISI) in 1931
Defining Statistics… let’s review these definitions
• “Statistics are the classified facts representing the conditions of the people in the state…. specially those facts which
can be stated in number or in table of numbers or in any tabular or classified arrangement”. – Webster
• “By statistics we mean quantitative data affected to a marked extent by multiplicity of causes”. –Yule and Kendall
• “Statistics is a science of estimates and probability”. – Boddington
• “Statistics is a method of decision-making in the face of uncertainty on the basis of numerical data and calculated
risk”. – Prof. Ya-Lun-Chou
• “Statistics may be defined as the science of collection, presentation, analysis and interpretation of data”. – Croxton
and Cowden
• “Statistics is the science and art of handling aggregate of facts– observing, enumerating, recording, classifying and
otherwise systematically treating them”. – Harlow
• “Statistics are measurements, enumerations or estimates of natural phenomenon, usually systematically arranged,
analyzed and presented as to exhibit important inter-relationships among them”. – A.M. Tuttle
Why do we need statistics?
The key distinction between normative (or judgmental) techniques and statistical techniques is of estimate of level of
confidence in decision. Statistical methods are explicit in nature and provide clearly defined measure of error.
On the other hand, normative techniques based on the judgment and rule of thumb, although help in effective
decision-making but fail to specify estimate of error.
Activity 1: Role of Statistics
• Define the variables that you want to study in order to solve a business problem or meet a business objective
• Collect the data from appropriate sources
• Organize the data collected by developing tables
• Visualize the data by developing charts
• Analyze the data by examining the appropriate tables and charts, and other statistical methods to reach conclusions
Session Outline
22 Classification of statistics
APPLIED
• When statistical methods are used, a problem is always formulated in terms of ‘population’ or ‘universe’, which is
defined as all the elements about which conclusions or decisions are to be made. In statistics, there is a specific
meaning to the words population and universe. We shall discuss exact definitions subsequently
o For example, if we want to find customer satisfaction, all our customers represent the population. If
information or data is taken from each and every element of the population, we are dealing with ‘Descriptive
Statistics’. In research vocabulary, such a process is called ‘Census’
o This includes methods for collection, collation, tabulation, summarization and analysis of the data on entire
population. Averages, trends, index numbers, dispersion and skewness, help in summarizing and describing
the main features of the statistical data. This is primarily to present the data in the form easily understandable
to the decision-maker. One example is the national census conducted every 10 years.
Analytical Statistics
• This deals with establishing relationship between two or more variables. This includes methods like correlation
and regression, association of attributes, multivariate analysis, etc., which help establishing relationship between
variables
• This facilitates comparison, interpolation, extrapolation and relationships. In these cases, we require multiple
samples on different populations or same population, for example, sales of a product before and after launch of
promotion campaign.
Inductive Statistics
• Decision-making in most business situations requires estimates about future like trends and forecast. Inductive
statistics include methods that help in generalizing the trends based on the random observations
• This process provides estimation indirectly on the basis of partial data or method of forecasting based on past
data, for example, future share price of a share based on the inflow of funds by FII.
Inferential Statistics
• Another way, in which conclusions or decisions are made, is using a portion of population or sample from the
universe. The sample data is analyzed. Then based on the sample evidence, conclusions are generalized about the
target population
• Exit poll during elections is an example of sample survey. This method is referred to as ‘Statistical Inference’.
Hypotheses and significance tests form an important part of inferential statistics
Applied Statistics
• It is the application of statistical methods and techniques used for solving the real life problems. Quality control,
sample surveys, inventory management, simulations, quantitative analysis for business decision-making, etc.,
form a part of this category.
Session Outline
22 Classification of statistics
Condensation
• Statistics compresses mass of figures to small meaningful information, for example, average sales, BSE index
(SENSEX), growth rate. It is impossible to get a precise idea about the profitability of a business from a record of
income and expenditure transactions. The information of Return on Investment (ROI), Earnings per Share (EPS),
profit margins, etc., however, can be easily remembered, understood and used in decision-making
Comparison
• Statistics facilitates comparing two related quantities for example, Price to Earning Ratio (PE Ratio) of Reliance
Industries stood at 17.5 as compared to the industry figure of 13 showing the confidence of investors
Functions of Statistics (2/3)
Forecast
• Statistics helps in forecast by looking at trends. These are essential for planning and decision-making. Predictions
based on the gut feeling or hunch could be harmful for the business. For example, to decide the refining capacity
for a petrochemical plant, we need to predict the demand of petrochemical product mix, supply of crude, cost of
crude, substitution products, etc., over next 15 to 25 years, before committing an investment
Hypotheses Testing
• Hypotheses are statements about the population parameters based on our past knowledge or information that we
would like to check its validity in the light of current information. Inductive inference about the population based
on the sample estimates involves an element of risk. However, sampling keeps the costs of decision-making low.
Statistics provides quantitative base for testing our beliefs about the population.
Functions of Statistics (3/3)
Forecast
• Statistics present facts precisely in quantitative form. Statement of facts conveyed in exact quantitative terms are
always more convincing than vague utterances. For example, ‘increase in profit margin is less in year 2006 than in year
2005’ does not convey a definite piece of information. On the other hand, statistics presents the information more
definitely like “profit margin is 10% of the turnover in year 2006 against 12% in year 2005”
Expectation
• Statistics provides the basic building block for framing suitable policies. For example, how much raw material should
be imported, how much capacity should be installed, or manpower recruited, etc., depends upon the expected value of
outcome of our present decisions.
Activity 2: Functions of Statistics
Condensation Comparison
Guidelines Condensation Comparison
• Refer to the dataset, ‘Bulbs’
• What is the information provided here?
Testing of
• How can you use this information Forecast Testing of
Forecast Hypotheses
“statistically”? Hypotheses
Preciseness Expectation
Preciseness Expectation
Laws of Statistics
DISTRUST OF STATISTICS
MISUSE OF STATISTICS
Limitations of Statistics
Statistical techniques, because of their flexibility and economy, have become popular and are used in numerous
fields. But statistics is not a cure-all technique and has limitations. It cannot be applied to all kinds of situations and
cannot be made to answer all queries.
available data using appropriate statistical tools. Statistical tools not only have
The word statistics is derived from the Italian word ‘Stato’ which means ‘state’; and
‘Statista’ refers to a person involved with the affairs of state. Thus, statistics originally was
meant for collection of facts useful for affaires of the state, like taxes, land records,
Cont….
Significant contribution has also been made by Indians in the field of
arrangement
Statistical methods are broadly divided into five categories. These are
tools are extensively used in business for time and motion study, consumer behaviour
used in numerous fields. But statistics is not a cure-all technique and has limitations. It cannot
More dangerous than distrust is misuse of statistics to draw convenient conclusions to satisfy
supported by facts, figures, charts, graphs, index numbers, etc. are indeed very
appealing and convincing. They can be used to intimidate opposing views. Hence,
Statistical techniques, because of their flexibility and economy, have become popular and are
and has limitations. It cannot be applied to all kinds of situations and cannot be
figures, charts, graphs, index numbers, etc. are indeed very appealing and
convincing. They can be used to intimidate opposing views. Hence, statistics is open
to manipulation.