Business Data Mining Week 1
Data collection is the process of collecting and evaluating information or data from multiple
sources to find answers to research problems, answer questions, evaluate outcomes, and
forecast trends and probabilities. It is an essential phase in all types of research, analysis, and
decision-making, including that done in the social sciences, business, and healthcare.
Accurate data collection is necessary to make informed business decisions, support quality
assurance, and maintain research integrity.
During data collection, researchers must identify the data types, the sources of data, and
the methods to be used. As the following sections show, there are many different data
collection methods. Research, commercial, and government fields all rely heavily on data
collection.
Before an analyst begins collecting data, they must answer three questions:
1. What’s the goal or purpose of this research?
2. What kinds of data are they planning on gathering?
3. What methods and procedures will be used to collect, store, and process the
information?
Additionally, we can break up data into qualitative and quantitative types. Qualitative data
covers descriptions such as color, size, quality, and appearance. Quantitative data,
unsurprisingly, deals with numbers, such as statistics, poll numbers, percentages, etc.
Pragalath EA2252001010013 1
Types of Data Collection Methods
The choice of data collection method depends on the research question being addressed,
the type of data needed, and the resources and time available. Data collection methods
can be categorized into primary and secondary methods.
Primary data collection methods can be divided into two categories: quantitative
methods and qualitative methods.
Quantitative Methods:
Quantitative techniques for market research and demand forecasting usually use
statistical tools. In these techniques, demand is forecasted based on historical data.
These methods of primary data collection are generally used to make long-term
forecasts. Statistical analysis methods are highly reliable as subjectivity is minimal.
Time Series Analysis: A time series is a sequence of values of a variable recorded
at equal time intervals; the pattern those values follow over time is known as a trend.
Using these patterns, an organization can predict the demand for its products and
services over a projected time period.
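As a rough illustration of predicting demand from a time series, the sketch below fits a straight-line trend to equally spaced observations and extends it forward. The function names and the monthly figures are hypothetical, and a straight line is only one of many possible trend models.

```python
def fit_linear_trend(values):
    """Least-squares line through the points (0, v0), (1, v1), ...

    Returns (slope, intercept). Assumes observations are equally spaced in time.
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    mean_t = (n - 1) / 2
    mean_v = sum(values) / n
    cov = sum((t - mean_t) * (v - mean_v) for t, v in enumerate(values))
    var = sum((t - mean_t) ** 2 for t in range(n))
    slope = cov / var
    return slope, mean_v - slope * mean_t


def project(values, periods_ahead):
    """Extend the fitted trend line by the given number of future periods."""
    slope, intercept = fit_linear_trend(values)
    n = len(values)
    return [intercept + slope * (n + k) for k in range(periods_ahead)]


# Hypothetical monthly demand figures, for illustration only.
monthly_demand = [100, 110, 120, 130, 140]
print(project(monthly_demand, 2))
```

Real demand series often show seasonality or irregular swings that a single straight line will not capture, so this is a starting point rather than a forecasting method in its own right.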
Smoothing Techniques: Smoothing techniques can be used in cases where the time
series lacks significant trends. They eliminate random variation from the historical
demand, helping identify patterns and demand levels to estimate future demand.
The most common methods used in smoothing demand forecasting are the simple
moving average and weighted moving average methods.
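The two smoothing methods named above can be sketched as follows. This is a minimal example with hypothetical demand figures; the weights are an assumption chosen for illustration (heavier weight on more recent periods), not values from the course material.

```python
def simple_moving_average(demand, window):
    """Average of the most recent `window` observations."""
    if window < 1 or window > len(demand):
        raise ValueError("window must be between 1 and len(demand)")
    recent = demand[-window:]
    return sum(recent) / window


def weighted_moving_average(demand, weights):
    """Weighted average of the most recent observations.

    `weights` is ordered oldest to newest and is normalised by its sum,
    so it does not have to add up to 1.
    """
    if not weights or len(weights) > len(demand):
        raise ValueError("weights must be non-empty and no longer than demand")
    recent = demand[-len(weights):]
    return sum(w * d for w, d in zip(weights, recent)) / sum(weights)


# Hypothetical historical demand, oldest first.
demand = [120, 130, 125, 140, 135]
print(simple_moving_average(demand, 3))            # forecast for the next period
print(weighted_moving_average(demand, [1, 2, 3]))  # newest period weighted most
```

The simple moving average treats every period in the window equally, while the weighted version lets the analyst decide how strongly recent demand should influence the estimate.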
Barometric Method: Also known as the leading indicators approach, this method is
used to anticipate future trends based on current developments. When
past events are considered to predict future events, they act as leading indicators.
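One way to check whether a series behaves as a leading indicator is to correlate it with demand shifted by a few periods. The sketch below is an illustration under stated assumptions: the function names and both data series are hypothetical, and lagged correlation is only one possible screening technique, not one prescribed by the notes above.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length series with at least two points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


def lagged_correlation(indicator, demand, lag):
    """Correlate indicator[t] with demand[t + lag]."""
    if len(indicator) != len(demand):
        raise ValueError("series must have the same length")
    if lag < 0 or lag > len(demand) - 2:
        raise ValueError("lag leaves too few overlapping points")
    return pearson(indicator[:len(indicator) - lag], demand[lag:])


def best_lag(indicator, demand, max_lag):
    """Lag (0..max_lag) with the strongest absolute correlation."""
    return max(range(max_lag + 1),
               key=lambda lag: abs(lagged_correlation(indicator, demand, lag)))


# Hypothetical data: demand follows the indicator one period later.
indicator = [1, 3, 2, 5, 4, 6]
demand = [10, 12, 16, 14, 20, 18]
print(best_lag(indicator, demand, 2))
```

A strong correlation at a positive lag suggests, but does not prove, that the indicator moves ahead of demand.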
Qualitative Methods:
Qualitative data collection methods are especially useful when historical data is
unavailable or when numbers or mathematical calculations are unnecessary.
Quantitative methods do not provide the motive behind participants’ responses, often
don’t reach underrepresented populations, and require long periods of time to collect
the data. Hence, it is best to combine quantitative methods with qualitative methods.
1. Surveys: Surveys collect data from the target audience and gather insights into their
preferences, opinions, choices, and feedback related to their products and services. Most
survey software offers a wide range of question types.
You can also use a ready-made survey template to save time and effort. Online surveys
can be customized to match the business’s brand by changing the theme, logo, etc. They
can be distributed through several channels, such as email, website, offline app, QR
code, social media, etc.
You can select the channel based on your audience’s type and source. Once the data is
collected, survey software can generate various reports and run analytics algorithms to
discover hidden insights.
A survey dashboard can give you statistics such as response rate and completion rate,
along with demographics-based filters and export and sharing options. Integrating survey
builders with third-party apps can get more out of the effort spent on online real-time
data collection.
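The two dashboard statistics mentioned above can be computed by hand. Definitions vary between survey tools, so the ones used here are assumptions: response rate is taken as surveys started divided by invitations sent, and completion rate as surveys completed divided by surveys started. The function names and counts are hypothetical.

```python
def response_rate(invited, started):
    """Share of invited people who started the survey (assumed definition)."""
    if invited <= 0 or not 0 <= started <= invited:
        raise ValueError("expected 0 <= started <= invited and invited > 0")
    return started / invited


def completion_rate(started, completed):
    """Share of started surveys that were completed (assumed definition)."""
    if started <= 0 or not 0 <= completed <= started:
        raise ValueError("expected 0 <= completed <= started and started > 0")
    return completed / started


# Hypothetical counts for one survey campaign.
invited, started, completed = 200, 80, 60
print(f"Response rate:   {response_rate(invited, started):.0%}")
print(f"Completion rate: {completion_rate(started, completed):.0%}")
```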
Practical business intelligence relies on the synergy between analytics and reporting,
where analytics uncovers valuable insights, and reporting communicates these findings
to stakeholders.
2. Polls: A poll consists of a single question, which may be single-choice or
multiple-choice. Polls are useful when you need to get a quick pulse of the audience’s
sentiments. Because they are short, it is easier to get responses from people.
Like surveys, online polls can be embedded into various platforms. Once the
respondents answer the question, they can also be shown how they compare to others’
responses.
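Showing respondents how their answer compares with everyone else's comes down to tallying the votes as shares. A minimal sketch for a single-choice poll, with a hypothetical function name and made-up votes:

```python
from collections import Counter


def poll_shares(votes):
    """Map each chosen option to its share of all votes cast."""
    total = len(votes)
    if total == 0:
        return {}
    counts = Counter(votes)
    return {option: count / total for option, count in counts.items()}


# Hypothetical votes from a single-choice poll.
votes = ["A", "B", "A", "C", "A", "B", "A", "B"]
shares = poll_shares(votes)

my_answer = "B"
print(f"{shares[my_answer]:.1%} of respondents also chose {my_answer}")
```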
person, the interviewer can go for a telephone interview.
This form of data collection is suitable for only a few respondents. It is too
time-consuming and tedious to repeat the same process if there are many participants.
4. Delphi Technique: In the Delphi method, market experts are provided with the
estimates and assumptions of other industry experts’ forecasts. Experts may reconsider
and revise their estimates and assumptions based on this information. The consensus of
all experts on demand forecasts constitutes the final demand forecast.
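The feedback step of the Delphi method can be sketched as summarising each round of estimates and checking how far apart the experts still are. Everything specific here is an assumption made for illustration: the function names, the expert labels, the figures, and the rule that consensus means the spread is within a chosen fraction of the median. In practice the revisions come from expert judgement, not from code.

```python
from statistics import median


def summarize_round(estimates):
    """Summary fed back to experts: median, lowest and highest estimate."""
    values = sorted(estimates.values())
    if not values:
        raise ValueError("no estimates supplied")
    return {"median": median(values), "low": values[0], "high": values[-1]}


def has_consensus(estimates, tolerance):
    """True when the spread is within `tolerance` (a fraction) of the median.

    This stopping rule is an assumption; studies define consensus differently.
    """
    summary = summarize_round(estimates)
    return summary["high"] - summary["low"] <= tolerance * summary["median"]


# Hypothetical demand forecasts (units) from two Delphi rounds.
round_1 = {"expert_a": 900, "expert_b": 1200, "expert_c": 1500}
round_2 = {"expert_a": 1100, "expert_b": 1200, "expert_c": 1250}

for label, estimates in (("Round 1", round_1), ("Round 2", round_2)):
    print(label, summarize_round(estimates), has_consensus(estimates, 0.15))
```

Once a round reaches consensus, the median of that round is one reasonable candidate for the final demand forecast.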
5. Focus Groups: Focus groups are one example of qualitative data collection in education. In a
focus group, a small group of people, around 8-10 members, discuss the common areas
of the research problem. Each individual provides his or her insights on the issue
concerned.
A moderator regulates the discussion among the group members. At the end of the
discussion, the group reaches a consensus.
b. Interviews: Interviews involve direct interaction between the researcher and the
respondent. They can be conducted in person, over the phone, or through video
conferencing. Interviews can be structured (with predefined questions), semi-structured
(allowing flexibility), or unstructured (more conversational).
e. Focus Groups: Focus groups bring together a small group of individuals who discuss
specific topics in a moderated setting. This method helps in understanding opinions,
perceptions, and experiences shared by the participants.
Internal sources of secondary data:
Financial Statements
Magazines
Sales Report
CRM Software
Executive summaries
External sources of secondary data:
Government reports
Press releases
Business journals
Libraries
Internet
Secondary data collection methods can also involve quantitative and qualitative
techniques. Secondary data is easily available, less time-consuming, and less expensive
than primary data. However, the authenticity of the data gathered cannot be verified
using these methods.
Regardless of the data collection method of your choice, there must be direct
communication with decision-makers so that they understand and commit to acting
according to the results.
Secondary data collection involves using existing data collected by someone else for a
purpose different from the original intent. Researchers analyze and interpret this data to
extract relevant information. Secondary data can be obtained from various sources,
including:
b. Online Databases: Numerous online databases provide access to a wide range of
secondary data, such as research articles, statistical information, economic data, and
social surveys.
e. Past Research Studies: Previous research studies and their findings can serve as
valuable secondary data sources. Researchers can review and analyze the data to gain
insights or build upon existing knowledge.
Word Association
The researcher gives the respondent a set of words and asks them what comes to mind
when they hear each word.
Sentence Completion
Researchers use sentence completion to understand what kind of ideas the respondent
has. This tool involves giving an incomplete sentence and seeing how the interviewee
finishes it.
Role-Playing
Respondents are presented with an imaginary situation and asked how they would act
or react if it were real.
In-Person Surveys
The researcher asks questions in person.
Online/Web Surveys
These surveys are easy to accomplish, but some users may be unwilling to answer
truthfully, if at all.
Mobile Surveys
These surveys take advantage of the growing use of mobile technology.
They rely on mobile devices like tablets or smartphones to conduct
surveys via SMS or mobile apps.
Phone Surveys
No researcher can call thousands of people at once, so they need a third party to handle
the chore. However, many people have call screening and won’t answer.
Observation
Sometimes, the simplest method is the best. Researchers who make direct observations
collect data quickly and easily, with little intrusion or third-party bias. Naturally, it’s
only effective in small-scale situations.
Quality and Accuracy: The choice of data collection technique directly impacts the
quality and accuracy of the data obtained. Properly designed methods help ensure
that the data collected is accurate and relevant to the research questions.
Relevance, Validity, and Reliability: Effective data collection methods help
ensure that the data collected is relevant to the research objectives, valid (measuring
what it intends to measure), and reliable (consistent and reproducible).
Bias Reduction and Representativeness: Carefully chosen data collection
methods can help minimize biases inherent in the research process, such as sampling
bias or response bias. They also aid in achieving a representative sample, enhancing
the findings’ generalizability.
Informed Decision Making: Accurate and reliable data collected through
appropriate methods provide a solid foundation for making informed decisions
based on research findings. This is crucial for both academic research and practical
applications in various fields.
Achievement of Research Objectives: Data collection methods should align with
the research objectives to ensure that the collected data effectively addresses the
research questions or hypotheses. Properly collected data facilitates the attainment
of these objectives.
Support for Validity and Reliability: Validity and reliability are essential to
credible research. The choice of data collection methods can either enhance or
detract from the validity and reliability of research findings. Therefore, selecting
appropriate methods is critical for ensuring the credibility of the research.
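The point about representativeness can be checked in a simple way by comparing each group's share of the sample with its known share of the population. The sketch below assumes the population shares are available from an outside source; the function name, group labels, and figures are hypothetical.

```python
from collections import Counter


def representation_gaps(sample_groups, population_shares):
    """Sample share minus population share for each population group.

    Positive values mean the group is over-represented in the sample,
    negative values mean it is under-represented. Groups that appear in
    the sample but not in `population_shares` are ignored.
    """
    n = len(sample_groups)
    if n == 0:
        raise ValueError("sample is empty")
    counts = Counter(sample_groups)
    return {group: counts.get(group, 0) / n - share
            for group, share in population_shares.items()}


# Hypothetical sample of 10 respondents and assumed population shares.
sample = ["urban"] * 7 + ["rural"] * 3
population = {"urban": 0.5, "rural": 0.5}

for group, gap in representation_gaps(sample, population).items():
    print(f"{group}: {gap:+.0%}")
```

Large gaps are a sign of possible sampling bias and may call for re-sampling or weighting before conclusions are generalised.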
The importance of data collection methods cannot be overstated, as they play a key role
in the research study’s overall success and internal validity.