Professional Documents
Culture Documents
Data Collection and Analysis: Interpretation and Providing Solution
Data Collection and Analysis: Interpretation and Providing Solution
Data Collection and Analysis: Interpretation and Providing Solution
2
4.1. Data Collection– Quantitative & Qualitative
• After
• developing the instruments
• Pilot and Main (actual)
• determine the data sources
• In case of experimental, analytical or predictive
• Setting the objectives of the design artifacts and later after
the construction of the artifact.
Actual Data Collection
• One should carefully plan the data collection as this is the
departure for execution of the research
• Pre-data collection
• Training of Data Collectors might be crucial
• Supporting letters might be necessary
• Post-data collection like editing of returned questionnaires
3
Cont…
• The data you have collected may be presented using
• Tabular methods
• Graphical methods
• Followed by analysis
• In strict sense in case of quantitative but not in qualitative
• In qualitative you collect- analyze- collect…
• Until “saturation “
• Analysis could start
• by arranging; presentation
• by description (like in NLP and DM)
• By structuring requirements
4
Exploring and Organizing a Data Set
• Look closely at your data and explore various ways of
organising them – detect patterns
• Example : reading test scores of 11 children.
Ruth, 96; Robert, 60; Chuck, 68; Margaret, 88; Tom, 56; Mary, 92; Ralph,
64; Bill, 72; Alice, 80; Adam, 76; Kathy, 84
What do you see? Arrange the data and look for patterns
• Alphabetical arrangement – look for meaning or pattern,
• Observable fact –
• highest score was earned by a girl and that the lowest score was
earned by a boy – although silly and meaningless,
Symmetrical pattern: same sex arrangement- girls vs. boys
• the girls’ scores increase as we proceed through the alphabet,
and the boys’ scores decrease
Every researcher should be able to provide a clear, logical
rationale for the procedure used to arrange and organise
the data. - affect the meaning that those data reveal
Drawing Conclusions from the Data
• Questions from the example
• Why were all the scores of the girls higher than those of the
boys?
• Why is this algorithm working better?
• Why were the intervals between each of the scores
equidistant for both boys and girls?
• Knowledge springs from questions like these
• But must be careful not to make snap judgments
• Even the most thorough research can go astray(wrong) at
the point of drawing conclusions
• The example – might conclude that girls read better than
boys – not thinking carefully
• Reading is a complex and multifaceted skill
Tabular Methods of Data Presentation
• Tabulated data can be more easily understood than facts
• They help facilitate statistical treatment of data
• When data are tabulated, all unnecessary details and
repetitions are avoided.
• Type of tables
• Simple (one way) table: shows one characteristic
• Two-way table: shows two characteristics
• Higher order table: shows three or more characteristics
Tabular method- Frequency distributions
Steps:
• Begin by arranging the data from smallest to largest
• Count values that repeat by making tallies
• Group observations with comparable magnitude
• Stop the classification when you’re sure that the 1st & the last
classes respectively consist the smallest and larges values.
Cont…
• Indicate how many values are included in a class
• Note: If the number of classes k has been fixed, then class width
may be fixed as w = range/k
Graphical Methods Of Data Presentation
• Data in a frequency distribution can be presented graphically
or diagrammatically
• Graphs are the natural choice to represent continuous data
• For discrete or qualitative data, we have
• Pie chart (multiply relative frequency by 3600), Pictogram (use
of pictures) or Bar graph (class limit and Abs. frequency)
• For continuous data, we have
• Histogram (class boundary and abs. freq.), Frequency polygon
(Class mark and abs freq.), Cumulative frequency graph (class
mark and cumulative frequency) – also known as Ogive
Summarizing Data Numerically
• Measures of Central Tendency
• describe the characteristics of a frequency distribution
• We can calculate the
• Mean, Median, and Mode for ungrouped data
• Mean, Median, and Mode for grouped data
• We need to determine how representative the average is as
a description of a given set of data.
• We need to calculate : Range (very simple), Quartile Deviations;
Mean Deviations, Standard Deviations
Estimation and Hypothesis Testing
• Two important problems of statistical inference are:
• Estimation of parameters (point estimation and interval
estimation) and
• Test of hypothesis
Cont…
• Estimation of parameters
• producing sound and reasonable substitute for unknown
parameters of a population (Mean, variance, correlation
coefficient, etc.)
Test of Hypothesis
• The problem in hypothesis testing refers to speculation made
about the value of unknown parameter of a distribution.
• We can test hypothesis based on sample data.
Procedures to follow in Tests of Hypothesis
• The assumption about the parameter is called the null
hypothesis, and is denoted by H o
• The counter hypothesis is known as alternative hypothesis
and is denoted by H1
• There should always be a level of significance α in testing a
hypothesis (the probability of rejecting a hypothesis when it
is actually true)
Test of Associations
• You can also test the relationship between two
variables. We have
• Test of independence
• Testing association between two variables
• Chi square test
• You have know how to find the calculated value
(o e) 2
12
Interpretation and Discussion
• Explain the results in light of previous literatures and theories.
• No clear distinction in case of qualitative
• Involves demonstration and evaluation in case of design
research
• May be required to collect data and analysis with the same
procedures as we have seen before
• Interpreting the data means:
1. Relating the findings to the original research problem and
to the specific research questions and hypotheses.
• Researchers must eventually come full circle to their
starting point – why they conducted a research study in the
first place and what they hoped to discover – and relate
their results to their initial concerns and questions.
13
Cont…
2. Relatingthe findings to preexisting literature, concepts,
theories, and research studies.
• To be useful, research findings must in some way be
connected to the larger picture – to what people already
know or believe about the topic in question.
• Perhaps the new findings confirm a current theoretical
perspective, perhaps they cast doubt on common
“knowledge”, or perhaps they simply raise new questions that
must be addressed before we can truly understand the
phenomenon in question
3. Determining whether the findings have practical significance
as well as statistical significance.
• Statistical significance is one thing; practical significance –
whether findings are actually useful – is something else
altogether.
Cont…
4. Identifying limitations of the study.
• Finally, interpreting the data involves outlining the
weaknesses of the study that yielded them.
• No research study can be perfect, and its imperfections
inevitably cast at least a hint of doubt on its findings. Good
researchers know and they also report the weaknesses along
with the strengths of their research. E.g In design science
Exercise
• Read the following problem descriptions and
1. Craft a candidate general objectives for both.
2. Identify possible Data source
3. Determine candidate Data collection techniques and
procedures
4. Propose Data analysis techniques and procedures
Example 1: problem identification
• Distance learning puts learners in isolation, lack of
observation by teachers and more freedom to learners
• Researchers in distance learning are interested to develop
collaborative tools that supports student interactions
• There is no sufficient, collaboration among tutors is also
necessary for effective distance learning
• No system so far that supports tutors.
16
Cont…
Example 2: Amharic
• Due to the advent of the internet, many Amharic documents
are now available online. Additionally, the popular search
engine, Google, has provided an Amharic interface. However,
to date, no tolerant-retrieval mechanism based on spelling
correction has been employed for Amharic; and even there is
no published prior work regarding spelling correction for the
Amharic language.
Objectives:
• Example 1- To develop a collaborative tool that supports
tutors in distance education.
• Example 2- To develop an Amharic spelling corrector to assist
in the development of tolerant-retrieval Amharic search
systems
17
The Tools Of Research- with respect to
data collection and analysis
• Tools are chosen to facilitate research tasks
• Be careful not to equate tools of research with the
methodology of research
• Research tool = specific mechanism or strategy researcher
uses to collect, manipulate, or interpret data.
• Research methodology = general approach the researcher
takes in carrying out the research project; to some extent,
this approach dictates the particular tools the researcher
selects.
• There are six general tools of research:
• The library and its resources, The computer and its software
• Techniques of measurement, Statistics, The human mind and
• Language
1. The Library and Its Resources
• After selecting research problem, the library is the FIRST
place to clarify the dimension of the problem
• Learn what others have done in the area or in corollary
investigation, receive ideas to sharpen the focus of research
• Catalogue is the heart of the library – books, films, filmstrip,
tapes, phonograph records, maps, pictures, slides, CDs, …
• E.g., ACM Digital Library, UPM Online Database
36
Exploring and Organizing a Data Set
• Before employing any statistical procedure, develop habit of
looking closely at your data and exploring various ways of
organising them – detect patterns
• Example : reading test scores of 11 children.
Ruth, 96; Robert, 60; Chuck, 68; Margaret, 88; Tom, 56;
Mary, 92; Ralph, 64; Bill, 72; Alice, 80; Adam, 76; Kathy, 84
• What do you see? Arrange and look for patterns
Arranging The Data
• Alphabetical arrangement – look for meaning or pattern,
• Observable fact – highest score was earned by a girl and
that the lowest score was earned by a boy – although silly
and meaningless, it’s an observable fact, and it may come in
handy at a future time
• Careful researchers discover everything possible about their data,
whether the information is immediately useful or not
Arranging The Data….
Symmetrical pattern: same sex arrangement- girls vs. boys
• The graph shows dramatic trends –
• the girls’ scores increase as we proceed through the
alphabet, and the boys’ scores decrease
• The researcher should be aware of the dynamics, the
phenomena, that are active within the data, whether those
phenomena are important to the purpose of the research or
not
• Fundamental Guideline for Looking at the Data
• Whatever the researcher does with the data to prepare it for
inspection or interpretation will affect the meaning that
those data reveal
• Therefore, every researcher should be able to provide a clear,
logical rationale for the procedure used to arrange and
organise the data
Drawing Conclusions from the Data
• Questions from the example
• Why were all the scores of the girls higher than those of the
boys?
• Why is this algorithm working better?
• Why were the intervals between each of the scores
equidistant for both boys and girls?
• Knowledge springs from questions like these
• But must be careful not to make snap judgments
• Even the most thorough research can go astray(wrong) at
the point of drawing conclusions
• The example – might conclude that girls read better than
boys – not thinking carefully
• Reading is a complex and multifaceted skill