Professional Documents
Culture Documents
Basics of Statistics
Basics of Statistics
Basics of Statistics
Review of Statistics:
In statistics we collect data, then we summarize data and take decision based on the summarize and
sometimes further analysis
→Mean
Mean: Mean represents central tendency of the data values, These leads us to the nation “random
variables.”
Suppose you want to invest money in some barrier. This is the only asset you have. If you loss it you will
be ant of the street
A 15 % B 50%
You are very rich. You can easily absorbed the loss.
The random variables can be qualitative or quantitative. We are going to study quantitative random
variables.
Quantitative variables are of two types: Discrete variables and Continuous variable
Discrete Variable:
Discrete random variables assume specific values on the number line which could be both fractional and
whole number. The values can be set an ordinal array. It’s countable.
Continuous Variable:
The continuous random variable assumes values from a continuous interval on the number line. By the
very nature of the random variable there are infinite values that the random variable can assume.
Whatever kind of random variable we are facing we need to know the two aspects of its nature.
The set of values it can assume. This is called the sample space and is denoted by S.
An idea of the likelihood of each value.
Both the piece of information are included in what we call the distribution of random variable.
This is an example of tabular distribution. Here f(x) is the probability mass function. In case of discrete
distribution the sum of the probability masses must always exactly 1.0
Event: Each value the random variable can take can be called an event. Such as x<3 is an event.
Here 1,1,5 & 2 are less than 3.
The probability of an event is equal to the sum of the probabilities of the outcome that define an event.
P(x<3)=0.1+0.2+0.15=0.45
Information of a distribution:
There are two pieces of summarized information we always need to know about any distribution.
1. Mean
2. Standard deviation
Standard Deviation:
Using Excel:
Varianc
e 1.2119
1.10086
SD 3
Cumulative Probability:
We are mostly going to work with normal distribution.
xeN(µ, ő)
xeN(100,25)
Two questions:
1. How to compute or determine probabilities associated with normal distribution?
2. If we already know the probability how to trace the value of random variable?
P(x=a)=0 in continuous distribution where a is a specific value from the sample space.
P(x=18)=0
P(x=12)=0
Z is actually called standard normal distribution, All other distributions are arbitrary distribution but they
can be converted to standard normal distribution.
Standard normal distribution has a mean of 0 and SD of 1
Another question:
Labor cost for P1, W1=$20/hour for P2, W2=$40/hour for P3, W3=$60/hour
Y=GPA.
Covariance (x,y)=
∑ (xi−µx )( yi−µy) will be zero if two variables are independent, otherwise it will
n
have positive or negative value.
Positive value means direct relationship between x and y, and negative value means inverse
relationship.
T is normally distributed
The mean or expected value of T, E(T)=E(t1)+E(t2)+E(t3)
=60+120+180=360 minutes
As T is the sum of random variable t1, t2 & t3, so the mean or expected value of (T) will be sum of mean
of t1, t2 & t3.
Now we have to determine standard deviation but for calculating SD we need to calculate variance first.
Var(T)=Var(t1)+Var(t2)+Var(t3)
=152+302+602=225+900+3600=4725 min2
Standard Deviation ő T =√ VAR(T )=√ 4725 min2 =68.74 min
The answer is
Now what is the chance that a product will take more than 7 hours to produce?
=1-NORM.DIST(420,360,68.74,1)
=0.191371
Note:
t1 t2 t3
=W1* + W2* + W3*
60 60 60
t1 t2 t3
= 20* + 40* + 60*
60 60 60
t 1 2t 2
= + +t3
3 3
Assumption
1. W is normally distributed
2. E(W)=(1/3)E(t1)+(2/3)E(t2)=E(t3)
=(60/3)+(2*120/3)+180
=20+80+180=$280
Now we need to calculate standard deviation and for calculating this we need to calculate
variance
Var(W)=
=((1/3)*15)^2+((2/3)*30)^2+(60)^2
=(5)^2+(20)^2+(60)^2
=25+400+3600
=4025
So the answer is
Summarizing Theory:
1. Y is normally distributed
=(a1őx1)^2+(a2őx2)^2+(a3őx3)^2