Professional Documents
Culture Documents
What Is Data Collection: A Definition
What Is Data Collection: A Definition
Before we define what is data collection, it’s essential to ask the question, “What is
data?” The abridged answer is, data is various kinds of information formatted in a
particular way. Therefore, data collection is the process of gathering, measuring, and
analyzing accurate data from a variety of relevant sources to find answers to research
problems, answer questions, evaluate outcomes, and forecast trends and probabilities.
Our society is highly dependent on data, which underscores the importance of collecting
it. Accurate data collection is necessary to make informed business decisions, ensure
quality assurance, and keep research integrity.
During data collection, the researchers must identify the data types, the sources of data,
and what methods are being used. We will soon see that there are many different data
collection methods. There is heavy reliance on data collection in research, commercial,
and government fields.
Before an analyst begins collecting data, they must answer three questions first:
• What methods and procedures will be used to collect, store, and process the
information?
Additionally, we can break up data into qualitative and quantitative types. Qualitative
data covers descriptions such as color, size, quality, and appearance. Quantitative data,
unsurprisingly, deals with numbers, such as statistics, poll numbers, percentages, etc.
What Are the Different Methods of Data Collection?
• Surveys
• Transactional Tracking
• Observation
• Online Tracking
• Forms
Now that you know what is data collection, let's take a look at the different methods of
data collection. While the phrase “data collection” may sound all high-tech and digital, it
doesn’t necessarily entail things like computers, big data, and the internet. Data
collection could mean a telephone survey, a mail-in comment card, or even some guy
with a clipboard asking passersby some questions. But let’s see if we can sort the
different data collection methods into a semblance of organized categories.
Data collection breaks down into two methods. As a side note, many terms, such as
techniques, methods, and types, are interchangeable and depending on who uses them.
One source may call data collection techniques “methods,” for instance. But whatever
labels we use, the general concepts and breakdowns apply across the board whether
we’re talking about marketing analysis or a scientific research project.
The two methods are:
• Primary.
As the name implies, this is original, first-hand data collected by the data researchers.
This process is the initial information gathering step, performed before anyone carries
out any further or related research. Primary data results are highly accurate provided the
researcher collects the information. However, there’s a downside, as first-hand research
is potentially time-consuming and expensive.
• Secondary.
Secondary data is second-hand data collected by other parties and already having
undergone statistical analysis. This data is either information that the researcher has
tasked other people to collect or information the researcher has looked up. Simply put,
it’s second-hand information. Although it’s easier and cheaper to obtain than primary
information, secondary information raises concerns regarding accuracy and
authenticity. Quantitative data makes up a majority of secondary data.
Let’s get into specifics. Using the primary/secondary methods mentioned above, here is
a breakdown of specific techniques.
• Interviews.
• Projective Technique.
• Delphi Technique.
• Focus Groups.
• Questionnaires.
Unlike primary data collection, there are no specific collection methods. Instead, since
the information has already been collected, the researcher consults various data
sources, such as:
• Financial Statements
• Sales Reports
• Retailer/Distributor/Deal Feedback
• Business Journals
• Trade/Business Magazines
• The internet
2.
Interviews
Interviewing people is a standard method used for data collection. However,
the interviews conducted to collect quantitative data are more structured,
wherein the researchers ask only a standard set of questionnaires and
nothing more than that.
There are three major types of interviews conducted for data collection
Surveys/questionnaires
Surveys or questionnaires created using online survey software are playing
a pivotal role in online data collection be is quantitative or qualitative
research. The surveys are designed in a manner to legitimize the behavior
and trust of the respondents. More often, checklists and rating scale type of
questions make the bulk of quantitative surveys as it helps in simplifying
and quantifying the attitude or behavior of the respondents.
Observations
As the name suggests, it is a pretty simple and straightforward method of
collecting quantitative data. In this method, researchers collect quantitative
data through systematic observations by using techniques like counting the
number of people present at the specific event at a particular time and a
particular venue or number of people attending the event in a designated
place. More often, for quantitative data collection, the researchers have a
naturalistic observation approach that needs keen observation skills and
senses for getting the numerical data about the “what” and not about “why”
and ”how.”
Naturalistic observation is used to collect both types of data; qualitative
and quantitative. However, structured observation is more used to collect
quantitative rather than qualitative data.
Three primary document types are being analyzed for collecting supporting
quantitative research data
Qualitative surveys:
To develop an informed hypothesis, many researchers use qualitative
surveys for data collection or to collect a piece of detailed information
about a product or an issue. If you want to create questionnaires for
collecting textual or qualitative data, then ask more open-ended questions.
To answer such questions, the respondent has to write his/her opinion or
perspective concerning a specific topic or issue. Unlike other qualitative
data collection methods, online surveys have a wider reach wherein many
people can provide you quality data that is highly credible and valuable.
• Paper surveys
The paper questionnaires are frequently used for qualitative data collection
from the participants. The survey consists of short text questions, which are
often open-ended. These questions’ motive is to collect as much detailed
information as possible in the respondents’ own words. More often, the
survey questionnaires are designed to collect standardized data hence
used to collect responses from a larger population or large sample size.
• Onlinesurveys
An online survey or a web survey is prepared using a prominent
online survey software and either uploaded in a website or emailed to the
selected sample size with a motive to collect reliable online data. Instead of
writing down responses, the respondents use computers and keyboards to
type their answers. With an online survey questionnaire, it becomes easier
and smoother to collect qualitative data.
In addition to that, online surveys have a wider reach, and the respondent
is not pressurized to answer each question under the interviewer’s
supervision. One of the significant benefits that online surveys offer is that
they allow the respondents to take the survey on any device, be it a
desktop, tablet, or mobile.
Observations:
Observation is one of the traditional qualitative data collection methods
used by researchers to gather descriptive text data by observing people
and their behavior at events or in their natural settings. In this method, the
researcher is completely immersed in watching or seeing people by taking a
participatory stance to take down notes. Aside from taking notes, different
techniques such as videos, photographs, audio recordings, tangible items
like artifacts, and souvenirs are also be used.
4.
Pilot studies can play a very important role prior to conducting a full-
scale research project
A pilot study, also called a 'feasibility' study, is a small scale preliminary study
conducted before any large-scale quantitative research in order to evaluate the
potential for a future, full-scale project.
Pilot studies are a fundamental stage of the research process. They can help
identify design issues and evaluate feasibility, practicality, resources, time, and
cost of a study before the main research is conducted.
It involves selecting a few people and trying out the study on them. It is possible
to save time, and in some cases, money, by identifying any flaws in the
procedures designed by the researcher.
A pilot study can help the researcher spot any ambiguities (i.e. unusual things) or
confusion in the information given to participants or problems with the task
devised.
Sometimes the task is too hard, and the researcher may get a floor effect, because
none of the participants can score at all or can complete the task – all
performances are low. The opposite effect is a ceiling effect, when the task is so
easy that all achieve virtually full marks or top performances and are “hitting the
ceiling”.
This enables researchers to predict an appropriate sample size, budget
accordingly, and improve upon the study design prior to performing a full-scale
project.
Pilot studies also provide researchers with preliminary data so they can gain
insight into the potential results of their proposed experiment.
However, pilot studies should not be used to test hypotheses since the
appropriate power and sample size are not calculated. Rather, pilot studies
should be used to assess the feasibility of participant recruitment or study design.
By conducting a pilot study, researchers will be better prepared to face the
challenges that might arise in the larger study, and they will be more confident
with the instruments they will use for data collection.
In some studies, multiple pilot studies may be needed and qualitative and/or
quantitative methods may be used.
In order to avoid bias, pilot studies are usually carried out on individuals who are
as similar as possible to the target population, but not on those who will be a part
of the final sample.
It is important to conduct a questionnaire pilot study for the following
reasons:
• Check that respondents understand the terminology used in the
questionnaire.
• Check that emotive questions have not been used as they make people
defensive and could invalidate their answers.
• Check that leading questions have not been used as they could bias the
respondent's answer.
• Ensure the questionnaire can be completed in an appropriate time frame
(i.e., it's not too long).
5.
Reliability and validity are closely related, but they mean different things. A
measurement can be reliable without being valid. However, if a measurement is valid, it
is usually also reliable.
Reliability
Reliability refers to how consistently a method measures something. If the same result
can be consistently achieved by using the same methods under the same
circumstances, the measurement is considered reliable.
You measure the temperature of a liquid sample several times under identical conditions. The
thermometer displays the same temperature every time, so the results are reliable.
A doctor uses a symptom questionnaire to diagnose a patient with a long-term medical
condition. Several different doctors use the same questionnaire with the same patient but give
different diagnoses. This indicates that the questionnaire has low reliability as a measure of the
condition.
Validity
Validity refers to how accurately a method measures what it is intended to measure. If
research has high validity, that means it produces results that correspond to real
properties, characteristics, and variations in the physical or social world.
High reliability is one indicator that a measurement is valid. If a method is not reliable, it
probably isn’t valid.
If the thermometer shows different temperatures each time, even though you have
carefully controlled conditions to ensure the sample’s temperature stays the same, the
thermometer is probably malfunctioning, and therefore its measurements are not valid.
If a symptom questionnaire results in a reliable diagnosis when answered at different times and
with different doctors, this indicates that it has high validity as a measurement of the medical
condition.
However, reliability on its own is not enough to ensure validity. Even if a test is reliable,
it may not accurately reflect the real situation.
The thermometer that you used to test the sample gives reliable results. However, the
thermometer has not been calibrated properly, so the result is 2 degrees lower than the true
value. Therefore, the measurement is not valid.
A group of participants take a test designed to measure working memory. The results are
reliable, but participants’ scores correlate strongly with their level of reading comprehension.
This indicates that the method might have low validity: the test may be measuring participants’
reading comprehension instead of their working memory.
Validity is harder to assess than reliability, but it is even more important. To obtain
useful results, the methods you use to collect your data must be valid: the research
must be measuring what it claims to measure. This ensures that your discussion of the
data and the conclusions you draw are also valid.