Professional Documents
Culture Documents
Application": σ/μ σ=standard deviation,μ=mean
Application": σ/μ σ=standard deviation,μ=mean
Application": σ/μ σ=standard deviation,μ=mean
Standard deviation is a statistic that measures the dispersion of a dataset relative to its mean and is
calculated as the square root of the variance. The standard deviation is calculated as the square root of
variance by determining each data point's deviation relative to the mean.
3. The coefficient of variation (CV) is the ratio of the standard deviation to the mean.
V=σ/μ
where:σ=standard deviation,μ=mean
aa
Application” coefficient of variation (CV) measures data point dispersion around a mean. Representing
the standard deviation to the mean makes CV a valuable resource in comparing variations from one data
series to another. It shows how much data varies in a sample in comparison to the mean of the population.
4. The mean (average) of a data set is found by adding all numbers in the data set and then dividing by the
number of values in the set.
The median is the middle value when a data set is ordered from least to greatest.
The mode is the number that occurs most often in a data set.
l MERITS OF ARITHEMETIC MEAN-l ARITHEMETIC MEAN RIGIDLY DEFINED BY ALGEBRIC FORMULA
l It is easy to calculate and simple to understand
l IT BASED ON ALL OBSERVATIONS AND IT CAN BE REGARDED AS REPRESENTATIVE OF THE GIVEN DATA
l Arithmetic mean can be computed even if the detailed distribution is not known but some of the observation and number of the
observation are known.
l It is least affected by the fluctuation of sampling
DEMERITS OF ARITHMETIC MEAN-l It can neither be determined by inspection or by graphical location
Arithmetic mean cannot be computed for qualitative data like data on intelligence honesty and smoking habit etc
It is too much affected by extreme observations and hence it is not adequately represent data consisting of some extreme
point
l Arithmetic mean cannot be computed when class intervals have open ends
Merits of median--(1) Simplicity:- It is very simple measure of the central tendency of the series.
(2) Free from the effect of extreme values: -
(3) Certainty: - Certainty is another merits is the median. Median values are always a certain specific value in the
series.
(4) Real value: - Median value is real value and is a better representative value of the series compared to arithmetic
mean average, the value of which may not exist in the series at all.
(5) Graphic presentation: - Besides algebraic approach, the median value can be estimated also through the graphic
presentation of data.
(6) Possible even when data is incomplete: - Median can be estimated even in the case of certain incomplete series.
It is enough if one knows the number of items and the middle item of the series.
Demerits of median:
(1) Lack of representative character: -
(2) Unrealistic:- When the median is located somewhere between the two middle values, it remains only an
approximate measure, not a precise value.
(3) Lack of algebraic treatment: - Arithmetic mean is capable of further algebraic treatment, but median is not. For
example, multiplying the median with the number of items in the series will not give us the sum total of the values of
the series.However, median is quite a simple method finding an average of a series. It is quite a commonly used
measure in the case of such series which are related to qualitative observation as and health of the student.
Merits of mode: (1) Simple and popular: - Mode is very simple measure of central tendency. Sometimes, just at the
series is enough to locate the model value. Because of its simplicity, it s a very popular measure of the central
tendency
(2) Less effect of marginal values: -
(3) Graphic presentation:- Mode can be located graphically, with the help of histogram.
(4) Best representative: - Mode is that value which occurs most frequently in the series. Accordingly, mode is the
best representative value of the series.
Demerits of mode: (1) Uncertain and vague: - Mode is an uncertain and vague measure of the central tendency.
(2) Not capable of algebraic treatment: - Unlike mean, mode is not capable of further algebraic treatment.
(3) Difficult: - With frequencies of all items are identical, it is difficult to identify the modal value.
(4) Complex procedure of grouping:- Calculation of mode involves cumbersome procedure of grouping the data.
If the extent of grouping changes there will be a change in the model value.
6. The regression coefficients are a statically measure which is used to measure the average functional
relationship between variables. In regression analysis, one variable is dependent and other is independent. Also, it measures
the degree of dependence of one variable on the other(s).
A=note Write formula
8.
1.Probability can be defined as the ratio of the number of favorable outcomes to the total number of outcomes of
an event.
P(A)=n(A)/n(S)
3.If A and B are two events in a sample space S, then the conditional probability of A given B is defined as P(A|
B)=P(A∩B)P(B), when P(B)>0.
4. A sample space is a collection or a set of possible outcomes of a random experiment. The sample space is
represented using the symbol, “S”.
5. In Probability, the set of outcomes of an experiment is called events. There are different types of events such
as independent events, dependent events, mutually exclusive events, and so on.
a) If the probability of occurrence of an event A is not affected by the occurrence of another event
B, then A and B are said to be independent events.
b) Two events A and B are said to be mutually exclusive events if they cannot occur at the same
time. Mutually exclusive events never have an outcome in common.
Mutually exclusive events
Independent Events
9………..What is correlation? Correlation is a statistical measure that expresses the extent to which two variables are
linearly related (meaning they change together at a constant rate). It's a common tool for describing simple
relationships without making a statement about cause and effect.
Primary Data Secondary Data For
Definition
Primary data are those that are collected for the first time. Secondary data refer to those data that have already been
collected by some other person.
Originality
These are original because these are collected by the These are not original because someone else has collected these
investigator for the first time. for his own purpose.
Nature of Data
These are more reliable and suitable for the enquiry because These are less reliable and less suitable as someone else has
these are collected for a particular purpose. collected the data which may not perfectly match our purpose.
Collecting primary data is quite expensive both in the terms of Secondary data requires less time and money; hence it is
time and money. economical.
No particular precaution or editing is required while using the Both precaution and editing are essential as secondary data
primary data as these were collected with a definite purpose. were collected by someone else for his own purpose.
Discrete variables are countable in a finite amount of time. For example, you can count the change in your pocket. You
can count the money in your bank account. You could also count the amount of money in everyone’s bank accounts. It
might take you a long time to count that last item, but the point is—it’s still countable.
Discrete variables on a scatter plot.