Download as pdf or txt
Download as pdf or txt
You are on page 1of 40

ENGINEERING

DATA ANALYSIS
MARCELINO C. YU JR.
Instructor
OBTAINING DATA
Lesson 1
TOPIC OBJECTIVES

1 2 3
Students should be Students should be Students should be
able to learn about the able to know about able to solve simple
the Data and the statistical problems
concepts and
methods of data using basic of
principles of
collection. statistics.
engineering data
analysis.
TABLE OF CONTENTS

INTRODUCTION TO METHODS OF
01 ENGINEERING DATA 03 DATA
ANALYSIS COLLECTION

DATA AND DATA BASICS OF


02 COLLECTION 04 STATISTICS
01
INTRODUCTION
TO ENGINEERING
DATA ANALYSIS
INTRODUCTION TO ENGINEERING
DATA ANALYSIS
Engineering Data Analysis (EDA)
– is an indispensable analysis tool for the engineering team that deals
with the methods of gathering, presentation, analysis, and interpretation of
data from facts and information.

– introduces basic statistical techniques, probability, risk analysis, and


predictive modeling, and how they impact engineering and manufacturing
activities in both analytical and forward-looking activities.
INTRODUCTION TO ENGINEERING
DATA ANALYSIS
Steps in the Data Analysis Process
Decide on the
objectives or Summarizing
Pose a Data and Visualizing Optimize and
Question Collection Data Repeat

What to Data Cleaning Data Modeling


Measure and
How to
Measure
02
DATA AND DATA
COLLECTION
DATA AND DATA COLLECTION
Data
– can be defined as the quantitative or qualitative value of a variable.

– an information that has been translated into a form that is efficient for
movement or processing. It is the lowest unit of information from which
other measurements and analysis can be done.

– one of the most important and vital aspect of any research study.
DATA AND DATA COLLECTION
Classification of Data
QUANTITATIVE DATA – are measures of values or counts and
expressed as numbers such as weights, prices, costs, numbers of items
sold, and others.

QUALITATIVE DATA – are data that approximates and characterize


and non-numerical in nature such as employee names, product names,
addresses, tax codes, registration marks etc.
DATA AND DATA COLLECTION
Quantitative Data
o CONTINUOUS DATA – data that can take the form of decimals or
continuous values of varying degrees of precision. A continuous data
usually results from making a measurement of some type.
Example: Height, Weight

o DISCRETE DATA (DISCONTINUOUS DATA) – data whose value cannot


be the form of decimals or whose values are countable. Discrete data
usually result from counting.
Example: Family size, Enrolment size
DATA AND DATA COLLECTION
Qualitative Data
o ATTRIBUTE DATA – data that can be counted for recording and analysis.
▪ NOMINAL DATA – data defined by an operation which allows making
the statements only equality or difference.
Example: Religion (Islam, Christian, etc.), Gender (Male, Female,
LGBTQ)
▪ ORDINAL DATA – date defines affiliation operation whereby
members of a particular group are ranked.
Example: Pain level (low, moderate, severe),
ML Ranking (grandmaster, epic, legend, mythic)
o OPEN DATA – data that is depending on the sample and not given a
specific value on a possible set of respondents or answers.
DATA AND DATA COLLECTION
Data Collection
– is the first step of process of gathering and measuring information on
variables of interest, in an established systematic fashion that enables one
to answer stated research questions, test hypotheses, and evaluate
outcomes.
o INVESTIGATOR or RESEARCHER is a well-trained person who
collects the data.
o RESPONDENTS are the person from whom the data is collected.
DATA AND DATA COLLECTION
Factors to Consider before Collection of Data
❑ Objectives and scope of the enquiry
❑ Sources of information
❑ Quantitative expression
❑ Techniques of data collection
❑ Unit of collection
DATA AND DATA COLLECTION
Sources of Data
EXTERNAL SOURCE – information is collected from outside
agencies. This type of information can be collected by census or
sampling method by conducting survey.
TYPES OF EXTERNAL DATA
• Primary
• Secondary
INTERNAL SOURCE – information collected in many institutions and
department about their regular function for their own internal purpose.
Example: Social Welfare societies
03
METHODS OF
DATA
COLLECTION
METHODS OF DATA COLLECTION
Method of Data Collection
– is the process or styles of getting information through interviews,
questionnaires, observation, experimentations, testing, surveys, and other
methods.

Concerns in Selecting a Method for Collecting Data


❑ Resources must be available
❑ Analysis and reporting
❑ Relevant study
❑ The skill of the evaluator
METHODS OF DATA COLLECTION
Types of Data
PRIMARY DATA – are the fresh and the first-hand information
collected, compiled, and published by organization.
Example: Population census by PSA

SECONDARY DATA – data are second-hand information which are


already collected, compiled, and published by some organization for the
purpose and available for the present study.
Example: Thesis, Economic survey of the country
METHODS OF DATA COLLECTION
PRIMARY DATA
SOURCES OF PRIMARY DATA
Direct Personal Investigation through interview method
Indirect Oral Investigation through enumerators
Investigation through local reporters’ questionnaire
Investigation through mailed questionnaire
Investigation through observation
METHODS OF DATA COLLECTION
PRIMARY DATA
METHODS OF COLLECTING PRIMARY DATA
o Personal Investigation – researcher conducts the survey and collect data
from it.
o Through Investigation – Trained investigators are employed to collect the
data.
o Collection Through Questionnaire – The researcher gets the data from
local representatives that are based upon their own experience.
o Through Telephone – The researchers get information through telephone.
METHODS OF DATA COLLECTION
SECONDARY DATA
SOURCES OF SECONDARY DATA
Published Sources
o International
o Government
o Municipal Corporation
o Institutional/Commercial
Unpublished Sources
METHODS OF DATA COLLECTION
SECONDARY DATA
METHODS OF COLLECTING SECONDARY DATA
o Official – data published by government agencies or other public bodies
such as international organizations as a public good.
o Semi-Official – data which are produced by non-public national sources or
data of an organization which has some but not all of the authority of an
official body.
o Technical and Trade Journals and Newspapers – these are published
articles and news distributed through newspapers, websites and other
platforms.
o Research Organization – these are data from various researchers of
universities and research agencies. Some are available and published for
METHODS OF DATA COLLECTION
Difference Between Primary and Secondary Data
PRIMARY DATA SECONDARY DATA
Real time data Past data
Sure about sources of data Note sure about sources of data
Help to give results/findings Refining the problem
Cheap and no time-consuming
Costly and time-consuming process
process
Cannot identify if data have
Avoids biasness of response data
biasness or no biasness
More flexible Less flexible
METHODS OF DATA COLLECTION
Example:
Suppose we are interested to find the average age of the Engineering students.

We collect the age's data by two methods:


❑ directly collecting from each student himself personally
❑ getting their ages from the university record/registrar

The data collected by the direct personal investigation is called primary data.
The data obtained from the university record/registrar is called secondary data.
04
BASICS OF
STATISTICS
BASICS OF STATISTICS
Statistics
– is a discipline of study dealing with the collection, analysis,
interpretation, and presentation of data.

Statistical Methodology is utilized by:


❑ pollsters who sample our opinions concerning topics ranging from art to
zoology.
❑ business and industry to help control the quality of goods and services that
they produce.
❑ social scientists and psychologists use statistical methodology to study our
behaviors.
BASICS OF STATISTICS
Descriptive Statistics
– is the use of graphs, charts, and tables and the calculation of various
statistical measures to organize and summarize information. It helps to
reduce our information to a manageable size and put it into focus.
EXAMPLE:
The compilation of batting average, runs batted in, runs scored, and number of
home runs for each player, won/lost percentage, and etc., for major league
baseball players.
These statistical measures allow us to compare players, determine whether a
player is having an “off year” or “good year,” etc.
BASICS OF STATISTICS
Inferential Statistics
– it consists of techniques for reaching conclusions about a population
based upon information contained in a sample.

❑ POPULATION – it is the complete collection of individuals, items, or


data under consideration in a statistical study.

❑ SAMPLE – is the portion of the population selected for analysis.


BASICS OF STATISTICS
Population and Sample
POPULATION SAMPLE
All registered voters A telephone survey of 600 registered voters
Households headed by a single The results from questionnaires sent to 2500
parent households headed by a single parent

Data Set
– is consists of elements of a sample.

EXAMPLE:
The number of speeding tickets issued by 75 Nebraska state troopers for the
month of June is recorded. The data set consists of 75 troopers.
BASICS OF STATISTICS
Graphs and Shapes of Distribution
– it is how to summarize a distribution of your data in terms of graphs.

❑ Bar Graph
❑ Pie Chart
❑ Dot Plot
❑ Histogram
BASICS OF STATISTICS
Bar Graph
– a chart that plots data using
rectangular bars or columns
(called bins) that represent the
total amount of observations in
the data for that category.
BASICS OF STATISTICS
Pie Graph
– a type of graph in which a
circle is divided into sectors that
each represent a proportion of
the whole.
BASICS OF STATISTICS
Dot Plot
– is one kind of representation
that can be used in working
with quantitative data or
numerical variable.

Plot each and every point into


the graphs after drawing a
horizontal line and label the
possible values on it, in regular
intervals.
BASICS OF STATISTICS
Histogram
– is similar to a bar graph in the sense
that it uses bars to portray the
frequencies of the possible values of a
data.

The difference is that the bars in a


histogram touch each other which
represents that the values of an
interval/ratio represent an underlying
continuous scale.
BASICS OF STATISTICS
Mode, Median, and Mean
– the center of distribution or the measuring central tendency of a
variables
❑ Mode
❑ Median
❑ Mean
BASICS OF STATISTICS
Mode
– it is the value that occurs most frequently. In other words, mode is the
most common outcome.
BASICS OF STATISTICS
Median
– it is the middle value of your observations when they are order from the
smallest to the largest.
BASICS OF STATISTICS
Mean
– it is the most known as average value. The mean is the sum of all the
values divided by the number of observations.
Without data you’re
just another person
with an opinion.

—EDWARDS DEMING
THANKS!
Do you have any questions?

marcelino.yu@ustp.edu.ph
+63 966 6528 319

www.facebook.com/marcelino.yu

CREDITS: This presentation template was created by Slidesgo,


including icons by Flaticon, and infographics & images by Freepik.

smtg2021

You might also like