Professional Documents
Culture Documents
Statistics Is Mainly Divided Into The Following Two Categories
Statistics Is Mainly Divided Into The Following Two Categories
Statistics Is Mainly Divided Into The Following Two Categories
The practice or science of collecting and analysing numerical data in large quantities,
especially for the purpose of inferring proportions in a whole from those in a representative
sample
CATEGORIES OF STATISTICS:
Statistics is mainly divided into the following two categories:
Descriptive Statistics.
Inferential Statistics.
Nominal with order: Some nominal data can be sub-categorised in order, such as “cold,
warm, hot and very hot.”
Nominal without order: Nominal data can also be sub-categorised as nominal without
order, such as male and female.
Dichotomous: Dichotomous data is defined by having only two categories or levels, such as
“yes’ and ‘no’.
2. Ordinal scale of measurement
The ordinal scale defines data that is placed in a specific order. While each value is ranked,
there’s no information that specifies what differentiates the categories from each other.
These values can’t be added to or subtracted from.
An example of this kind of data would include satisfaction data points in a survey, where
‘one = happy, two = neutral, and three = unhappy.’ Where someone finished in a race also
describes ordinal data. While first place, second place or third place shows what order the
runners finished in, it doesn’t specify how far the first-place finisher was in front of the
second-place finisher.
Data points on the interval scale have the same difference between them. The difference on
the scale between 10 and 20 degrees is the same between 20 and 30 degrees. This scale is
used to quantify the difference between variables, whereas the other two scales are used to
describe qualitative values only. Other examples of interval scales include the year a car was
made or the months of the year.
HYPOTHESIS: A hypothesis is an assumption, an idea that is proposed for the sake of
argument so that it can be tested to see if it might be true. ... A hypothesis is usually
tentative; it's an assumption or suggestion made strictly for the objective of being tested.
Null Hypothesis: It states a negative statement to support the researcher’s findings that
there is no relationship between two variables.
Alternative Hypothesis: It states that there is a relationship between the two variables of
the study and that the results are significant to the research topic.
MEASURES OF CENTRAL TENDENCY:
The Mean
The mean is the most common measure of central tendency used by researchers and people
in all kinds of professions. It is the measure of central tendency that is also referred to as
the average. A researcher can use the mean to describe the data distribution of variables
measured as intervals or ratios. These are variables that include numerically corresponding
categories or ranges (like race, class, gender, or level of education), as well as variables
measured numerically from a scale that begins with zero (like household income or the
number of children within a family).
A mean is very easy to calculate. One simply has to add all the data values or "scores" and
then divide this sum by the total number of scores in the distribution of data. For example, if
five families have 0, 2, 2, 3, and 5 children respectively, the mean number of children is (0 +
2 + 2 + 3 + 5)/5 = 12/5 = 2.4. This means that the five households have an average of 2.4
children.
The Median
The median is the value at the middle of a distribution of data when those data are
organized from the lowest to the highest value. This measure of central tendency can be
calculated for variables that are measured with ordinal, interval or ratio scales.
Calculating the median is also rather simple. Let’s suppose we have the following list of
numbers: 5, 7, 10, 43, 2, 69, 31, 6, 22. First, we must arrange the numbers in order from
lowest to highest. The result is this: 2, 5, 6, 7, 10, 22, 31, 43, 69. The median is 10 because it
is the exact middle number. There are four numbers below 10 and four numbers above 10.
If your data distribution has an even number of cases which means that there is no exact
middle, you simply adjust the data range slightly in order to calculate the median. For
example, if we add the number 87 to the end of our list of numbers above, we have 10 total
numbers in our distribution, so there is no single middle number. In this case, one takes the
average of the scores for the two middle numbers. In our new list, the two middle numbers
are 10 and 22. So, we take the average of those two numbers: (10 + 22) /2 = 16. Our median
is now 16.
The Mode
The mode is the measure of central tendency that identifies the category or score that
occurs the most frequently within the distribution of data. In other words, it is the most
common score or the score that appears the highest number of times in a distribution. The
mode can be calculated for any type of data, including those measured as nominal variables,
or by name.
For example, let’s say we are looking at pets owned by 100 families and the distribution
looks like this:
Dog: 60
Cat: 35
Fish: 17
Hamster: 13
Snake: 3
The mode here is "dog" since more families own a dog than any other animal. Note that the
mode is always expressed as the category or score, not the frequency of that score. For
instance, in the above example, the mode is "dog," not 60, which is the number of times dog
appears.
BY: ABHICHITHA
THANK YOU
.