Lesson 1 Data Analysis

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 31

DATA ANALYSIS

❖is a process of inspecting, cleansing, transforming and


modeling data with the goal of discovering useful information,
informing conclusion and supporting decision-making,
(Wikipedia)

❖involves sorting through massive amounts of unstructured


information and deriving key insights from it. These insights are
enormously valuable for decision-making at companies of all
sizes.
DATA ANALYSIS PROCESS

1. Data Requirement Gathering


➢ decide what to analyze and how to measure it

2. Data Collection
➢ collect your data based on requirements

3. Data Cleaning
➢ data should be cleaned and error free.
DATA ANALYSIS PROCESS
4. Data Analysis
➢ Use the data analysis tools and software which will
help you understand, interpret, and derive
conclusions based on the requirements

5. Data Interpretation
➢ Express or communicate your data analysis

6. Data Visualization
➢ They often appear in the form of charts and graphs
2 CORE AREAS OF DATA ANALYSIS:

Quantitative Data Qualitative Data


Analysis Methods Analysis Methods

Quantitative Data Analysis Methods


Quantitative data is defined as the value of data in the form of counts
or numbers where each data-set has a unique numerical value
associated with it.
Types of Quantitative Data:

Counter: Count equated with entities.


Measurement of physical objects: Calculating measurement of any
physical thing.
Sensory calculation: Mechanism to naturally “sense” the measured
parameters to create a constant source of information.
Projection of data: Future projection of data can be done using
algorithms and other mathematical analysis tools
Quantification of qualitative entities: Identify numbers to qualitative
information.
Qualitative Data Analysis Methods

Qualitative data analysis is made up of words, observations, images, and


even symbols

Measurement Scales:
✓ Nominal Scale
✓ Ordinal Scale
✓ Interval Scale
✓ Ratio Scale
Measurement Scales:

NOMINAL SCALE numbers serve as “tags” or “labels” only, to identify


or classify an object

Examples:
How would you describe
Could you please select an your behavioral pattern?
option from below to describe E – Extroverted
your hair color. I – Introverted
1 – Black A – Ambivert
2 – Brown
3 – Burgundy What is your gender?
4 – Auburn M- Male
5 – Other F- Female
dichotomous nominal scale
ORDINAL SCALE reports the ranking and ordering of the data
without actually establishing the degree of
variation between them.

Examples:

“How satisfied are you with “How happy are you with
our products?” the customer service?”
1- Totally Satisfied 1- Very Unhappy
2- Satisfied 2- Unhappy
3- Neutral 3- Neutral
4- Dissatisfied 4- Unhappy
5- Totally Dissatisfied 5- Very Unhappy
The Likert scale is a variant of the ordinal scale that is used
to calculate customer or employee satisfaction

The frequency of occurrence – Questions such as “How


frequently do you have to get the phone repaired?”
Very often
Often
Not Often
Not at all
Understanding preferences: If a marketer
conducts surveys to understand which laptop brand do
their respondents do not prefer, they can use the
ordinal scale. Out of the five mentioned laptop brand,
rate the order of preference –
HP _____
Apple _____
Lenovo _____
Dell _____
Acer _____
INTERVAL SCALE scale is a quantitative measurement scale where
there is order, the difference between the two
variables is meaningful and equal, and the
presence of zero is arbitrary.

Examples:
Time is also one of the
most popular interval data
examples measured on an
interval scale where the
values are constant,
known, and measurable.
Temperature
RATIO SCALE helps to understand the ultimate-order, interval,
values, and the true zero characteristic is an essential
factor in calculating ratios.

Examples:

What is your height in feet


and inches?
• Less than 5 feet.
• 5 feet 1 inch – 5 feet 5 inches
• 5 feet 6 inches- 6 feet
• More than 6 feet
How much time do you spend
What is your weight in kgs?
daily watching television?
• Less than 50 kgs
• Less than 2 hours
• 51- 70 kgs
• 3-4 hours
• 71- 90 kgs
• 4-5 hours
• 91-110 kgs
• 5-6 hours
• More than 110 kgs
• More than 6 hours
SUMMARY of Measurement Scales:
POPULATION AND
SAMPLE
POPULATION AND SAMPLE
Population (N) is a collection, or set of individuals, objects, or
measurements whose properties are to be analyzed. It is the totality of
the observation.

Sample (n) is a subset of a population. It is a smaller group representing


the population having identical characteristics from which it was taken.
Slovin’s Formula
Formula: n = sample
𝑵
𝒏= N = population
𝟏 + 𝑵𝒆𝟐 e = marginal error

Marginal error is the probability of committing a mistake


Confidence level is the probability of getting the correct result
SAMPLE (n)
Example: In a population of 5,000 students, how many samples will you
take to get an accurate result of a study?

Solution: N = 5,000 students 𝑵


n=? 𝒏=
𝟏 + 𝑵𝒆𝟐

If e = 1% If e = 5% If e = 10%

5,000 5,000 5,000


𝑛= 𝑛= 𝑛=
1 + (5,000)(0.01)2 1 + (5,000)(0.05)2 1 + (5,000)(0.10)2

𝑛 = 3,333 𝑠𝑡𝑢𝑑𝑒𝑛𝑡𝑠 𝑛 = 370 𝑠𝑡𝑢𝑑𝑒𝑛𝑡𝑠 𝑛 = 98 𝑠𝑡𝑢𝑑𝑒𝑛𝑡𝑠


SAMPLING TECHNIQUES
SIMPLE
RANDOM CONVENIENCE
SAMPLING SAMPLING

CLUSTER PURPOSIVE
SAMPLING SAMPLING

PROBABILITY SAMPLING NON-PROBABILITY


SAMPLING SAMPLING
METHODS
METHOD METHOD
SNOWBALL
SYSTEMATIC SAMPLING
SAMPLING

STRATIFIED QUOTA
RANDOM SAMPLING
SAMPLING
SAMPLING TECHNIQUES

Sampling Techniques Or Sampling Plan is the process of gathering


sampling units from the population.

1. Probability Sampling Technique is a sampling technique wherein each of


the population unit has an equal chance of being included in the sample.

a. Random Sampling is also called “Lottery Sampling” or


“Raffle Sampling”.

Example: electronic raffle system, random picker, etc.


b. Cluster Sampling . The process of randomly selecting intact groups,
not individuals, with the defined population sharing similar
characteristics

c. Systematic Sampling in this technique, elements are numbered


for identification and samples are selected at regular intervals.

Example: If N = 300, e = 10% and n = 75, which of the


300 will be Included in the sample?
𝑁 𝟑𝟎𝟎
Use: 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 = = =𝟒
𝑛 𝟕𝟓

Therefore, from the population, take the


elements tagged as the 4th, 8th, 12th, 16th,…
d. Stratified Sampling in this technique, the set of interests is divided
into groups or aggregates from which the actual sampling is done.

Example: College # of
computation Sample
If e = 5%, n = 370. Using Students
stratified sampling, how CED 900 900 67
× 370
will you get the 370 5000
CEAT 2,000 2000 148
sample from the × 370
5000
5,000population? CAS 1,000 1000 74
× 370
Use: 5000
CBET 1,100 1100 81
× 370
# 𝒐𝒇 𝒔𝒕𝒖𝒅𝒆𝒏𝒕𝒔 𝒑𝒆𝒓 𝒄𝒐𝒍𝒍𝒆𝒈𝒆 × 𝒏 5000
𝑵 N=5,000 n=370
2. Non-Probability Sampling Technique is a sampling technique wherein
each of the population unit has a n equal chance of being included in the
sample.

a. Convenience Sampling in this technique, the samples are simply


those chosen to respond.

b. Purposive Sampling in this technique, the samples are chosen


based on specific objectives.

c. Snowball Sampling in this technique, the researcher can rely on


his initial respondents to refer him to the next respondents

d. Quota Sampling in this technique, the researcher characterizes


the population based on desired features.
DATA COLLECTION
DATA COLLECTION METHODS:
1. Direct or Interview – researcher prepares a set of questions
and respondents will answer verbally and directly.

2. Indirect or Questionnaire – researcher prepares a well- planned,


written questions.
Useful steps in designing a survey:

➢ Determine the goal of your survey


➢ Identify the sample population
➢ Choose an interviewing method
➢ Decide what questions you will ask in what order, and how to
phrase them.
➢ Conduct the interview and collect the information.
➢ Analyze the results by making graphs and drawing conclusions.
Example of Planning and Conducting Surveys:

We want to construct a survey that shows which sports students at


your school like to play the most.

(a) List the goal of the survey

✓ The goal of the survey is to find the answer to the question:


“Which sports do students at your school like to play the most?”

(b) What population sample should we interview?

✓ A sample of the population would include a random sample of the


student population in your school. A good strategy would be to
randomly select students
FOCUS GROUP DISCUSSION

FGD is an in-depth field method that brings together a


small homogeneous group (usually six to twelve persons) to
discuss topics on a study agenda.

General Principles to consider:

➢ Standardization of questions
➢The number of focus groups conducted
➢Number of participants in a group
➢Level of moderator involvement
3. Registration – document and records

4. Observation – collects information on the characteristics of the


units under study by actual measurement.

5. Experimental Method – makes trials and tests; it is used to


describe any process that generates a set of data
Activity #1

Gform for Activity #1 will be posted in our Gclassroom.

You might also like