Professional Documents
Culture Documents
Research Methodology - 2017-18
Research Methodology - 2017-18
Research Methodology - 2017-18
Research Process
Review of
Literature
Review of
Concepts & Analyse
Define Design Data Inter
Theories Formulate Collect
Research Research (Test pret
Hypotheses Data
Problem (including Hypothe- and
Review of & Objectives (Execu-
Sample ses, if Report
Previous tion)
I Design) any)
Research III
Findings V VII
IV VI
II
25
Research Process
Grounded Theory
• Researcher derives a theory with multiple stages of data
collection and refinement and interrelationship of
categories of information.
• Two primary characteristics of this design are the constant
comparison of data with emerging categories, and
theoretical sampling of different groups to maximize the
similarities and differences of information. 42
Case Studies
• Researcher collects information about a single - entity or
phenomenon (‘the case’), bounded by time and activity (event,
programme, process, institution or social group) using different
data collection procedures for a sustained period of time.
• Researcher observes individual - his situation and behaviour as
a total configuration of factors that influence him through time.
• Researcher tries to find out how cumulative experiences at
different stages of life have influenced the individual life.
Phenomenological Studies
• Human experiences are examined through the detailed
descriptions of the people being studied.
• Understanding the "lived experiences" marks phenomenology
as a philosophy (Gandhian philosophy).
• Involves studying a small number of subjects through extensive
and prolonged engagement to develop patterns 43 and
relationships of meaning.
Content Analysis
• Content: a technique for gathering and analyzing content of text.
• It can be words, phrases, sentences, paragraphs, pictures,
symbols, or ideas.
• It can be done quantitatively as well as qualitatively, and computer
programmes can be used to assist the researcher.
• Initial step involves sorting the content into themes, which
depends on the content.
• There are strict limitations on the inferences a researcher can make
with content analysis. Eg., inferences about motivation or intention
can not normally be made, nor can the researcher infer what the
effect of seeing such content would be on a viewer.
• Content analysis is only analysis of what is in the text.
• A researcher cannot use it to prove that newspapers intended, for
example, to mislead the public, or that a certain style of journalism
has a particular effect on public attitudes.
44
Assignment 2: Present the Objectives of any two theses research
RESEARCH DESIGN, REVIEW OF LITERATURE,
FORMULATION OF HYPOTHESES AND OBJECTIVES
Literature Survey
• Once research problem is formulated, a brief summary
(synopsis) should be written.
• Synopsis preparation requires an extensive literature
survey
• For this, abstracting and indexing journals, and published
or unpublished bibliographies, academic journals,
conference proceedings, government reports, books, etc.,
must be reviewed.
• In this process, one source will lead to another.
45
• A good library will help to the researcher at this stage.
Accomplishment of Literature Review
i) It shares the results of other studies that are
closely related to the study being reported.
ii) It relates a study to the larger ongoing dialogue
in the literature about a topic, filling in gaps and
extending prior studies.
iii) It provides a framework for establishing the
importance of the study, as well as benchmark
for comparing the results of a study with other
findings. 46
Suggestions in using literature in a quantitative or
qualitative study
i) In qualitative study, use literature sparingly in the
beginning of the plan so as to convey an inductive design
unless a substantial literature is required at the outset.
ii) Consider appropriate place for literature (in introduction
or separate section or at the end) in a qualitative study.
iii) Use the literature in a quantitative study deductively as a
basis for advancing research questions or hypotheses.
iv) Use literature to introduce the study, to describe related
literature in a separate section, and to compare with
findings in a quantitative study.
v) If a separate review of literature is used, consider
whether it will be described as integrative summaries,
theoretical reviews, or methodological reviews.
vi) A typical practice in dissertation writing is to present an
integrated review. 47
Design Techniques in Literature Use
i) Material to be Included in a Review
• Knowing what to abstract and how to abstract it
quickly is very important when hundreds of studies
are reviewed
• A good summary of a journal article includes the
following points:
(a) Mention the problem being addressed,
(b) State the focus of the study,
(c) State information about sample, population, or
subjects,
(d) Review key results that relate to the study, and
(e) Depending on whether or not the review is a
methodological review, point out technical and
48
methodological flaws in the study.
ii) Priority for Reviewing the Literature
a) Begin with reputed journal articles, national journals, i.e.,
those that report research studies. Journals are placed first
because they are very easy to locate and summarize.
b) Review the books related on the topic. Begin with research
monographs that are summaries of the scholarly literature.
c) Follow this search by reading recent conference papers on a
topic as they report the latest research developments.
d) Dissertation Abstract International (University Microfilms)
can be reviewed. They vary considerably in quality and are
difficult reading materials to locate.
e) Make contact with authors of studies. Seek them out at
conferences. Write or phone or send e-mail to them to ask
whether they know of studies related to the proposed study
and whether they have an instrument that can be used or
modified to measure variables of the present study. 49
iii) A Model for Delimiting the Literature Review
• It is difficult to determine how much literature is to
be reviewed.
• To overcome this problem, consider a literature
review to be composed of five components:
(a) an introduction,
(b) Topic 1 (about the independent variables),
(c) Topic 2 (about the dependent variables),
(d) Topic 3 (studies that address the relationships
between the independent and dependent
variables), and
(e) a summary that captures the most important
studies and major themes in the review. 50
iv) Computer Databases
• Information retrieval has become the next frontier of scientific
development for social scientists. Internet offers world wide database
that can be easily scanned by researchers.
v) Style Manuals and Bibliographic Citation
• Adopt a style manual - acceptable to the audience - in initial planning
process. Common style problems arise in the following instances:
a) When writing in-text references, appropriate form for types of
references and format for multiple citations should be followed.
b) Note whether the end-of-text references have to be alphabetized or
numbered. Also cross-check that each in-text reference is included in
the end-of-text list.
c) Headings are ordered in terms of levels. First, note how many levels of
headings are used. Then, refer to the manual for appropriate format for
the number of levels.
d) Foot notes are used less frequently in scholarly papers today. If foot-
notes are used, consult style manual for their proper placement. Note
whether they go at the bottom of the page or at the end of the paper.
e) Tables and figures have a specific form in each style manual. Note
such aspects as bold lines, titles, and spacing in the examples 51
as
given in the style manual.
(i) Purpose Statement
• Introduction focuses on problem leading to the study, but
purpose statement establishes the direction for research.
• Purpose Statement captures essence of a study in a single
sentence or paragraph. It provides “a specific and accurate
synopsis of overall purpose of study”.
Qualitative Purpose Statement: This implies the assumptions
of qualitative paradigm, like language of qualitative
research and methodology of an emerging design based
on experiences of individuals in a natural setting.
Basic design features of writing this statement are as
follows:
i) Use such words as purpose, intent, and objective as central
controlling idea in a study to describe this statement,
ii) Use words that convey an emerging design because of52the
• Quantitative Purpose Statement: For writing a quantitative
statement, a firm understanding of variables is needed. A
variable is a discrete phenomenon that can be measured or
observed in two or more categories. As the phenomenon
varies (in two or more categories), it is called “variables”.
• Variables may be distinguished by two characteristics:
their temporal order and measurement. In social science
research, there are three types of variables:
• Independent variables that cause, influence and affect
outcomes.
• Dependent variables are the outcomes or results of the
influence of the independent variables.
• Intervening (also called extraneous or mediating) variables:
They intervene between independent and dependent
variables; they are statistically controlled in analyses
through random sampling. (E.g.) Demographic variables
53
like gender, age, income, and class size.
Components of a good quantitative purpose statement
i) Use a statement such as purpose, intent, or objective to
begin the passage,
ii) Identify the theory, model, or conceptual framework to be
tested in the study. Mention specific type of method of
inquiry (survey and experiment) being used in the study,
iii) State whether the independent and dependent variables
will be related or whether two or more groups (as an
independent variable) will be compared in terms of the
dependent variable(s). As a general principal, order the
variables in the relationships or comparison sentence
from the independent to dependent, and
iv) Refer to the unit of analysis in the study (subjects,
population and sample being studied). 54
Hypothesis - Meaning
• Hypothesis means a mere assumption or some
supposition to be proved or disproved.
• A hypothesis as a proposition not known to be definitely
true or false, examined for the sake of determining the
consequences which would follow from its truth.
• A hypothesis is a tentative explanation for which the
evidence necessary for testing is at least potentially
available. (E.g.) Inflation causes a decline in the value of
money of a country.
• A hypothesis is a tentative statement derived from theory
and acts a guide in the conduct of an inquiry. If the
hypothesis is established by testing, it becomes one of
the generalizations of social science. If it does not meet
the test, it is re-examined or rejected. 55
Sources of Hypothesis
• Ideal source of fruitful and relevant hypothesis is a fusion
of two elements, past experience and disciplined
imagination of a scientist.
• Broadly speaking, experience, imagination, observation,
analogy, and theory are the sources of a hypothesis.
i) Experience: A reasonable guess is made to solve a
problem on the basis of the experience of the researcher.
ii) Imagination provides guidance for solving a problem.
iii) Observation of facts previously gathered and current
facts related to it help in formulating a hypothesis.
iv) Analogy is used to compare two subjects, not necessarily
alike, except in the characteristics compared. Whenever a
phenomenon is perceived, the first impulse of the mind is
to connect it with the most nearly similar phenomenon.
• When objects of different classes show same or similar
56
relationship, analogy is used to compare their relationship.
v) Theory is capable of explaining all known facts relating to
the problem under study and suggests further areas of
knowledge to be inquired into.
Other sources of hypothesis are as detailed below:
a) Environment: Sociological, cultural, political and
economic activities followed in a country can be analyzed
to formulate a hypothesis.
b) Popular beliefs and practices followed by people.
c) Logical deduction: If certain conditions prevail, certain
results will be followed.
What difference would it make between gathering data with
a hypothesis and gathering them without it?
• Answer: Hypothesis guides investigator in establishing
direction of an inquiry in which to proceed and assisting
him in collection of relevant data.
• Hypotheses stand in the midpoint of research, when one
looks back to formulation of problem and also looks
57
forward to collection of data.
Research Functions Performed by a Hypothesis
i) It directs the search for data in an orderly way.
ii) It provides basis for separating relevant from
irrelevant observations. It prevents blind search and
indiscriminate gathering of data which may later
prove irrelevant to the problem under study.
iii) It creates an answer where an answer did not exist.
iv) It helps in the development of research techniques.
v) It establishes the relationships between
assumptions and observations.
vi) It is used to predict results.
vii) It leads researchers to make deductions which, if
hypothesis is correct, constitute a new knowledge.
viii) It serves as an important link between theory and
58
investigation.
Attributes of a Good Hypothesis
i) Testability: Hypothesis should be testable. It is possible
to deduce logically certain conclusions from hypothesis,
which can be tested by observation.
ii) Clarity: Hypothesis should not be ambiguous.
Terminology used should be commonly accepted and
understood.
iii) Relevance: It should be relevant in relation to the
problem under study.
iv) Consistent: A hypothesis must be self consistent
without internal contradictions.
v) Simplicity: A hypothesis should be simple without
involving many complexities and assumptions. It has
systematic simplicity or logical compactness.
vi) Adjustment: A hypothesis should fit the facts and also to
the technique of data collection. 59
vii) Fruitful for new discoveries: Hypothesis should
suggest new field of knowledge to be studied.
viii) Power of prediction: A hypothesis should be
formulated to precisely predict the future trends.
ix) A hypothesis does not conflict with any laws of
nature or of mind, which we hold them to be true.
x) Hypothesis should state relationship between
variables, if it happens to be a relational hypothesis.
xi) Hypothesis should be limited in scope and must be
specific. Narrower hypothesis are generally more
testable and researcher should develop such
60
hypothesis.
Types of Hypothesis
i) Working Hypothesis
• Working or trial hypothesis is provisionally adopted to explain certain
facts and to guide a research worker in investigating others.
• Working hypothesis explains relationship between the facts gathered.
• If it fits the fact, it is accepted, otherwise it is revised or a new
hypothesis is developed.
ii) Null Hypothesis
• A hypothesis which is tested for possible rejection is null hypothesis.
E.g., Development of cottage and small scale industries in
Coimbatore District has no bearing on socio-economic upliftment of
people.
iii) Alternative Hypothesis
• It is desirable to formulate several hypotheses in the beginning and
then to select the one that explains the phenomenon most efficiently
or fits the case best. If a hypothesis is introduced at a later stage of
the inquiry, especially in the event of rejection of working hypothesis,
61
it is known as the alternative hypothesis. An alternative hypothesis is,
Points to be considered in the formulation of hypothesis
Pre conceived notions: If researcher starts investigation with
certain pre-conceived notions, the scientific spirit is at once
destroyed and the findings will be misleading.
• Subjective influence should not come into the mind of
researcher
Value judgment: Researcher should avoid value judgment in the
statement of his hypothesis.
• A value judgment refers only to personal or subjective standard,
such as good or bad, happy or unhappy, etc.
Use of future tense: As hypothesis is a statement deduced from
theory, future tense should not be used in stating it. E.g., Null
hypothesis: There will be no correlation between intelligence of
children and their progress in school.
• Instead of using "will be", he should write "is", because a
hypothesis is a tentative explanation that already exists and for
62
which the evidence for testing is potentially available.
Difficulties in Formulation of Hypothesis
i) Lack of clear theoretical back ground:
Researchers do not have a clear cut and definite
theoretical background. They are unable to arrive
at correct conclusions.
ii) Lack of logical back ground: There is a lack of
logical use of theoretical background.
iii) Lack of knowledge of scientific methods: It is
always not possible to have complete information
of and acquaintance with scientific methods of
formulating hypothesis.
These difficulties, however, can be overcome by:
i) Acquiring complete and perfect knowledge on the
principles
63
ii) Stating hypothesis briefly and timely.
Objectives
• In quantitative studies, objectives represent specific
restatements of the purpose of the study.
• Objective is a statement indicating the relationship between two
or more variables in declarative form.
• They are presented as either a comparison between two or more
groups in terms of a dependent variable or as a relationship of
two or more independent and dependent variables.
• In deductive methodological process of quantitative research,
objectives are testable propositions deduced from theory.
• In objectives, independent and dependent variables may be kept
and measured separately.
Assignment No. 3
(i) Formulate hypotheses for a thesis research.
(ii) Make purpose statement for a thesis research.
(iii) Make an abstract of a thesis in one page covering introduction,
64
methodology, results and policy implications.
RESEARCH DESIGN
• Research design: Sequence of those steps taken ahead of
time to ensure that relevant data are collected in a way that
permits objective analysis of different hypotheses and
objectives formulated
• A research design is the plan, structure and strategy of
investigation conceived so as to obtain answers to research
questions and to control “variance”
• Research design is the blue print of the detailed procedures
of testing the hypotheses and analyzing the collected data.
• Decisions regarding what, where, when, how much, by what
means concerning a research study constitute a research
design.
• It includes an outline of what the researcher will do from the
stage of writing hypothesis and its operational implications
65
to the final analysis of data.
The design decisions happen to be in respect of:
i) What is the study about?
ii) Why is the study being made?
iii) Where will the study be carried out?
iv) What type of data is required?
v) Where can the required data be found?
vi) What is the study period?
vii) What will be the sample design?
viii) What techniques of data collection will be
used?
ix) How the data will be analyzed?
x) In what style will the report be prepared?
66
The research design can be split into the following parts:
i) Sampling design: method of selecting items to be observed
for the given study;
ii) Observational design: conditions under which the
observations are to be made;
iii) Statistical design deals with how many items are to be
observed and how the information and data gathered are to be
analyzed; and
iv) Operational design deals with the techniques by which the
procedures specified in the sampling, statistical and
observational designs can be carried out.
In brief, research design must, at least contain
(a) a clear statement of the research problem
(b) procedure and techniques to be used for gathering
information
(c) the population to be studied and
(d) methods to be used in processing and analyzing data. 67
Purpose of Research Design
To provide maximum amount of information relevant to the
problem at a minimum cost.
It is essential in the following ways:
i) It helps to organize the researcher’s ideas in a form where
by it will be possible to look for flaws and inadequacies.
ii) It provides an inventory of what must be done and which
materials have to be collected as a preliminary step.
iii) It is a document that can be given to others for comment.
iv) It answers to the research questions as objectively, validly
and economically as possible.
• It is useful in collecting maximum information with
minimum effort, time and money.
v) It acts as a control mechanism - it enables the researcher
to eliminate the unwanted flaws and inadequacies.
• It helps in controlling variance. 68
Important Concepts Relating to Research Design
i) Variable: A concept which can take on different quantitative
values, e.g., weight, height, income, etc.
a) Dependent variable depends upon or is a consequence of
other variable.
b) Independent variable is causing or influencing or affecting
the outcome of another variable.
c) Extraneous: Independent variables that are not related to
the purpose of the study, but may affect the dependent
variable are extraneous variable.
• Whatever effect is noticed on dependent variable as a result
of extraneous variable(s) is technically described as an
‘experimental error’. 69
Variables can also be continuous and discrete in nature.
a) Continuous variable: Phenomena which can take an
quantitatively different values even in decimal points.
b) Discrete variable: It can be expressed only in integer values.
ii) Control is used when the study is designed for minimising the
effect of extraneous independent variables.
iii) Confounded relationship: When the dependent variable is not
free from the influence of extraneous variable(s), the
relationship between the dependent and independent variables
is said to be confounded by an extraneous variable.
iv) Research Hypotheses: When a prediction or a hypothesized
relationship is tested by scientific methods, it is research
hypothesis.
v) Treatments: Different conditions under which experimental and
control groups are put.
vi) Experiment: Process of examining the truth of a statistical
hypothesis relating to some research problem.
vii) Experimental unit(s): Pre-determined plots or the blocks, where
70
different treatments are used.
Principles of Research Design
i) Principle of Randomisation
• Randomisation is a procedure of allocating experimental units to
specified group with each unit having an equal chance of being
selected as a member of that group.
• This provides protection against the effect of extraneous factors.
ii) Principle of Replication
• It refers to the deliberate repetition of an experiment using a nearly
identical procedure with a different set of subjects in a different setting
and at a different time.
• Statistical accuracy of the experiment is increased through replication
iii) Principle of Local Control
• It refers to amount of balancing, blocking and grouping of the subjects
or the experimental units employed in the experimental design.
• Under this, the extraneous factor, the known source of variability, is
made to vary deliberately over as wide a range as necessary in such a
way that the variability it causes can be measured and eliminated from
71
the experimental error.
Major Steps in Preparing a Research Design
i) Review of literature
• To review all relevant materials connected with the problem.
• To show how the selected problem is related or different
from previous research studies.
• To give better insight into the problem and help in correct
planning.
ii) Sources of Information to be Tapped
• There are two sources, viz., a) Documentary and b) Field
a) Documentary: Published / unpublished documents reports,
statistics, manuscripts, etc. (secondary sources).
b) Field: Direct source and personal sources who have
knowledge on the problem (participant observation,
personal interview, correspondence, questionnaires, etc)
(primary sources). 72
iii) Development of Bibliography
• Building up of reference cards on the problem chosen:
Author name, Initial, (year), “Topic”, publishers name /
Journal name, volume number and page number.
iv) Nature of Study
• Whether it is a statistical study / case study / comparative
study / impact study / experimental study / combination of
all these have to be decided before hand.
v) Objectives of the Study
• It should be compiled in clear cut terms. It is based on the
hypothesis formulated.
vi) Socio-Cultural Context of Study
• Social and cultural settings of study must be reviewed,
which form a backdrop for better understanding of results.
73
vii) Geographical Areas to be Covered
• Physical boundaries of area in which study is to be carried
out, have to be documented.
viii) Time dimension of the Study
• Phasing of activities of the study that will coincide with what
period has to be done.
• Will the subjects (respondents) be relatively free during the
interview to be conducted?
• Whether the time chosen is a normal period?
ix) Assumptions to be made and Limitations
• The assumptions made and the limitations of the study can
be given in clear cut terms.
x) Basis for Selecting the Data
a) sample must represent the universe,
b) sample must be in correct size to draw logical conclusions
c)The sample must be efficient. 74
xi) Techniques of Study involve determination of suitable
techniques for collecting the necessary data.
• In observation method, close attention of researcher is required
• If sample is scattered and literate, questionnaire method is
followed.
xii) Control of Error
a) Suppression of facts causing errors,
b) Cross checking of facts, and
c) Randomizing the observations to minimize error.
xiii) Establish the Reliability and Validity of Test Instruments
• Good research design minimizes bias and maximizes the
reliability of the data.
The ways and means of improving the reliability of the study are:
a) Fewer transformation and translation would lessen the
possibility of distortion, and
b) Do test the facts and measure what is claimed from them. 75
Validity of a Research Design
• Validity means that the researcher’s conclusion is true – that is, it
corresponds to the actual state of the world. It is of four types:
a) Internal Validity indicates the extent to which a study provides
evidence of a cause – effect relationship between the independent
and dependent variables.
b) Construct Validity refers to the extent to which the results support
the theory behind the research.
c) External Validity indicates how well the findings of an experiment
generalize to other situations or populations.
d) Statistical Validity indicates the extent to which data are shown to
be the result of cause – effect relationships rather than accident.
xiv) Scheming the Chapters
a) Chapter outline,
b) Name of each chapter and their arrangements and
c) Main results and discussion should be separate chapter.
d) In general, the optimum number is 5 to 6 chapters. 76
Advantages of Research design
i) Saves lot of researcher’s time
ii) Directs him to prepare himself for executing the
various activities systematically.
iii) Enables resources planning and procurement in
right time. It also ensures a project time schedule.
iv) Better documentation of the activities, while the
project is in progress.
v) Provides satisfaction and sense of success from
beginning to completion of every stage of project.
77
Types of Research Designs
i) Research design in exploratory research studies
• Researcher uses his own imagination and judgment in
obtaining information on the type of things he is looking for
in the unexplored field of study.
• Development of hypothesis for conducting the inquiry is not
necessary.
• Major emphasis is on the discovery of ideas and insights.
ii) Research design in descriptive and diagnostic research
studies
• In descriptive studies, the researcher is interested in
describing on situation of phenomenon under study.
• Data collected are mostly of qualitative nature, e.g., attitude,
behaviour, public opinion, political interest, etc.
• Diagnostic research studies determine frequency with
which something occurs or its association with something 78
iii) Experiment (or) Hypothesis Test Studies
• In hypothesis testing research studies, researcher tests
hypotheses of causal relationships between variables.
• Professor Fisher found that by dividing agricultural fields or
plots into different blocks and by conducting experiments in
each of these blocks, whatever information is collected and
inferences drawn from them, happens to be more reliable.
• Experimental design in social sciences is different from that
of natural sciences wherein the human beings can not be
tested under laboratory conditions.
• Social scientists have designed tests that lead to
reasonable generalizations.
• Experimental group is assumed to be causal (or
independent) variable while a control group is not; the two
groups are then compared in terms of assumed effect (or
79
dependent variable).
Informal Experimental Designs
80
ii) After – Only With Control Design
Assignment No. 4
81
Prepare a research design of your thesis research
CHAPTER V
SAMPLING THEORY AND SAMPLING DISTRIBUTION
• Sampling: Selection of some part of an aggregate or
totality on the basis of which a judgment or inference
about the aggregate or totality is made.
• Researcher selects only a few items from universe for his
study purposes. This is done on the assumption that
sample data will enable him to estimate the population
parameters.
• Items so selected constitute what is technically called a
sample, their selection process or technique is called
sample design and the survey conducted on the basis of
sample is described as a sample survey.
• Sample should be truly representative of population
characteristics without any bias so that it may result in
82
valid and reliable conclusions.
Need for Sampling
Sampling is used for a variety of reasons such as:
i) Sampling can save time and money: A sample study is
usually less expensive than a census study and produces
results at a relatively faster speed.
ii) Sampling enables more accurate measurements as the
sample study is generally conducted by trained and
experienced investigators.
iii) Sampling is inevitable when population contains infinitely
large number of members or units.
iv) Sampling remains the only choice when a test involves
the destruction of the item under study.
v) Sampling enables to estimate sampling errors and helps
in getting information on some characteristics of
population. 83
Definitions
i) Universe / Population: Universe refers to total of items or units
in any field of inquiry, whereas ‘population’ refers to total of
items about which information is desired.
• Units possessing the attributes or characteristics that are the
object of study are called elementary units.
• All units in any field of inquiry constitute ‘universe’, while all
elementary units (on the basis of one characteristic or more)
constitute ‘population’.
• Population is totality of all possible observations of same kind.
• Population or universe can be finite or infinite. Population is
finite, if it consists of fixed number of elements or items or units
so that it is possible to enumerate in its totality, e.g., number of
workers in a factory, population of a city, etc.
ii) Population of Interest: Basic issue is to specify characteristics
of those individuals or things (e.g., companies, stores, etc) from
84
which information is needed to meet the objectives of research.
iii) Unit: Any population / universe should contain some
specifications in terms of current limits, extent and time.
• Eg: Medium farm households in Coimbatore district of Tamil
Nadu in 2017.
• Unit designation: Medium Farm Household.
• Time designation: 2017.
• Place: Coimbatore District of Tamil Nadu State.
iv) Sample: It is subset of the population of interest.
• Information is obtained from or about a subset of population
to estimate about various characteristics of total population.
• Subset of population from or about which information is
obtained should be a representative cross-section of the
total population.
v) Sampling is taking any portion of a population or universe
as a representative of that population or universe. 85
Populations and Samples
• Populations and samples are studied through descriptive
measures like mean, variance and standard deviation.
• Statisticians use the term ‘parameters’ for basic
descriptive measures of population and the term
‘statistics’ for characteristic measures of sample. They
use Greek letters for population parameters and Latin
letters for sample statistics.
Population Parameters Sample Statistics
i) Population mean: µ i) Sample mean: X 2
ii) Population variance: x 2
ii) Sample variance:sx
iii) Population standard iii) Sample standard
deviation: x deviation: s x
vi) Random: Every element in the total population has an
equal probability of being chosen for the sample and each
of these elements is independent of the other.
vii) Attribute: Specific or characteristic trait of an element of
population is called an attribute. E.g., Medium farmers 86 in
Coimbatore District own > 5 acres of dry land.
viii) Variable: The factor that varies across time and units. It is one
which can take different quantitative values.
ix) Bias: Tendency to have an error in the direction is a bias.
x) Confidence level and significance level: Confidence level or
reliability is the expected percentage of times that the actual
value will fall within the stated precision limits.
• If a confidence level is 95 %, there are 95 chances in 100 (or
0.95 in 1) that the sample results represent the true condition of
the population within a specified precision range against 5
chances in 100 (0.05 in 1) that it does not.
• Precision: range within which the answer may vary and still be
acceptable; confidence level indicates that likelihood that the
answer will fall within that range, and significance level
indicates the likelihood that answer will fall outside the range.
• If the confidence level is 95 %, then the significance level will
be (100-95) 5 %. If the confidence level is 99 %, then 87the
significance level is 1 %, i.e., 100-99.
Precision
• Area of normal curve within precision limits for the
specified confidence level constitutes the acceptance
region and the area of the curve outside these limits in
either direction constitutes the rejection regions.
xi) Precision: It is the range within which the population
mean (or other parameter) will lie in accordance with the
reliability specified in the confidence level as a
percentage of the estimate ± or as a numerical quantity.
E.g., if the estimate is Rs. 4000 and the designated precision
is ± 4%. Then the true value should lie in between
Rs. 3840 and Rs. 4160.
• But if we desire that the estimate should not deviate from
the actual value by more than Rs.200 in either direction, in
that case the range would be from Rs.3800 to Rs.4200. 88
xii) Probability
• Probability of a given event is an expression of likelihood
of occurrence of an event.
• Probability is a number ranges from 0 to 1. Zero for event
which cannot occur and 1 for an event certain to occur.
• In sampling theory, the term probability is used as
equivalent to the relative frequency.
• Development of hypothesis is the domain of theory and
inference drawn from the study of a sample is expressed
in terms of probability.
Probability Sample: In this, each individual item has an
equal chance or probability of being included as a part of
the sample.
Non-probability Sample: In this, there is no way of
estimation of probability of individuals being included89
in
the sample.
Frequency
• It is the number of times a particular value occurs.
• Frequency distribution is a presentation of data in which
the frequencies are distributed according to class intervals
or attributes.
• Frequency distribution is also graphically represented as
either of the following: graphs, histograms, frequency
polygon, frequency curve and cumulative frequency curve.
• Frequency distributions are classified under two heads :
i) Observed frequency distribution and
ii) Theoretical or expected frequency distribution
• Observed frequency distributions are based in
observation. e.g. Data on marks of a class are classified in
90
the form of frequency distribution.
Theoretical distributions
• Observed frequency distribution is based on actual observation
• However, it is possible to deduce mathematically what the frequency
distributions of certain population should be.
• Such distributions as expected on the basis of previous experience
or theoretical considerations are known as ‘theoretical distributions’
or ‘probability distributions’.
• For example, if a coin is tossed we expect that as ‘n’ increases we
shall get close to 50 % heads and 50 % tails.
• Based on this expectation, we test whether a given coin is unbiased
or not.
• If a coin is tossed 100 times, we may get 40 heads and 60 tails. This
is our observation and the expectation is 50 % heads and 50 % tails.
• The fact that the probabilities for both heads and tails are 1/2 does
not mean that we must necessarily get 50 % heads and 50 % tails - it
only means that if experiment is carried out a large number of times
we will, on an average, get close to 50 % heads & 50 % tails. 91
• According to Robert Parket, probability distribution consists of a
mutually exclusive and exhaustive compilation of all random events
that can occur for a particular process and probability of each
event’s occurring.
• Any statement of a function associating each of a set of mutually
exclusive and exhaustive events with its probability is a probability
distribution”
• It is a mathematical model that represents the distribution of the
universe, obtained either from a theoretical population or from
actual world; the distribution shows the results of computed
statistics for each sample, if we take many probability samples.
• A table listing all possible values that a random variable can take on
together with the associated probabilities is called a probability
distribution.
• Probability distribution of X, where X is the number of spots
showing when a six sided symmetric die is rolled, is given below.
Probability Distribution for a Single Die
X 1 2 3 4 5 6
P(X) 1/6 1/6 1/6 1/6 1/6 1/6
92
• Sum of all probabilities in a probability distribution is 1, since
one of the values produced by a random experiment must
occur if the experiment is performed.
• A random variable is a numerical quantity whose value is
determined by the outcome of a random (chance) experiment.
• When a random experiment is performed, the totality of
outcomes of the experiment forms a set which is called sample
space (s) of the experiment.
• Let the random experiment be tossing of a coin 2 times.
• Here S = {(T, T) (T,H) (H,T) (H,H)}.
• If we replace T by 0 and H by 1, then the number of heads
obtained in both the trails shall be:
• T, T – 0; T, H – 1; H, T – 1; and H, H - 2
• The sample space S can be written as {0, 1, 2} and here
• P (X=0) = P (T, T) = 1/4
• P (X=1) = P {(T, H) (H,T)} = 1/2
• P (X=2) = P (H, H) = 1/4
93
• Here P (X) = 1/4 + 1/2 + 1/4 = 1
• A function P(X) is called the probability function of the
random variable X.
• Probability distribution is the outcome of the different
probabilities taken by this function of random variable X.
• A random variable can be either discrete or continuous.
• A random variable is said to be discrete, if the set of values
defined by it over the sample space is finite.
• On the other hand, a random variable is 'continuous', if it
can assume any (real) value in an interval.
• If the random variable X is a discrete one, the probability
function P(X) is called ‘Probability Mass Function’ and its
distribution as ‘Discrete Probability Distribution’
• If the random variable X is of continuous type, the
probability function P(X) is called Probability Density
Function (PDF) and its distribution is called ‘Continuous 94
Probability Distribution’.
Sampling Error
• Sample surveys imply the study of small portion of the
population and as such there would be a certain amount
of inaccuracy in the information collected.
• This inaccuracy is called sampling error or error variance.
Sampling frame
Frame error
Sample
Chance error
Response
Response error
Measurement
Surrogate Sample Design
Error
Information Error
Error
Processing Selection Frame
Error Interviewer
Error Error Error
Instrument Population
Error Specification
Error
Response
Response Error Non-Response
Bias Bias
100
TOTAL SURVEY ERROR AND ITS COMPONENTS
Essentials of a Good Sampling
• Sample should be a true representative of universe, i.e.,
possessing all its characteristics.
• No bias in selecting the sample.
• All items should be independent of each other.
• Regulating conditions are same for all the individual items.
• Adequate to allow confidence in the stability of its
characteristics
• Possibility to measure sampling error.
• Aim at homogeneity of items as far as possible.
• Yields desired information with required reliability at
minimum cost
• Greater the accuracy required, larger should be the sample
101
size.
Steps in Developing a Sampling Plan
i) Defining the Population of Interest: specify characteristics of those
individuals or things (e.g. companies, stores, etc) from which
information is needed to meet objectives of research.
• Specified in combination of characteristics: geography, demographic
characteristics, product or service use characteristics or awareness
measures.
ii) Choose Data Collection Method: Telephone interviewing has inherent
advantages and mail interviewing has inherent disadvantages in
regard to sampling.
iii) Choose Sampling Frame: Sampling frame is a list of population
elements or members from whom sampling units are selected.
• In ideal situation, the list is reliable, complete, appropriate and
accurate. Unfortunately, too often, we do not have such a list.
• As sampling frame is a true representative of population, developing
an appropriate sampling frame is more challenging one. 102
Steps in Developing a Sampling Plan
iv) Selection of a Sampling Method depends on study objectives,
financial resources available, time limitations, accuracy of
estimates required, and nature of problem under investigation.
• Sampling methods are grouped under probability sampling
methods and non-probability sampling methods.
• Sample design for a given sample size and for a given cost,
should have a smaller error.
Determining the Sample Design
• All items under consideration in any field of inquiry constitute a
'universe' or 'population'.
• A complete enumeration of all items in population is the census
inquiry in which no element of chance for the omission of any
item is left and highest accuracy is obtained.
• Even the slightest element of error or inaccuracy in census will
103
get larger and larger as the number of observations increases.
Steps in Developing a Sampling Plan . . .
• In census, there is no way of checking for bias or its extent
except through a resurvey or use of sample checks.
• Census involves a great deal of time, money and efforts.
• Hence, only a few items from universe are selected.
• Items selected constitute what is technically called a sample.
• Researcher must decide the way of selecting a sample.
Samples can be either probability or non probability samples.
v) Determine the Sample Size: Process of determining sample
size involves financial, statistical, and managerial issues.
• The larger the sample size, the lesser the sampling error.
• Larger samples cost more money, and the funds available for a
particular project are always limited.
• Determinants of sample size are: budget available, rules of
thumb, number of subgroups to be analyzed and use 104
of
statistical methods to be adopted for accuracy of estimates.
Steps in Developing a Sampling Plan
vi) Development of Operational Procedures for Selecting
Sampling Elements
• Operational procedures in selecting sampling elements
depend upon whether a probability or non-probability
sample is used.
• In probability samples, procedures should be detailed,
clear, and unambiguous and should take all discretion
regarding selection of specific sample elements away
from interviewer.
vii) Execution of the sampling plan
• This final step involves the execution of the operational
sampling plan
• This step also includes adequate checking to make sure
105
that specified procedures are adhered to.
Principles of Sampling
i) Principles of statistical regularity: A group of objects
randomly selected from a population possesses
characteristics of larger groups.
• This law operates only if samples are randomly chosen, i.e.,
every item of the population has equal chance of selection.
ii) Principles of large numbers: Large samples are more stable
in their characteristics than the small ones.
• Other things being equal, larger the sample size, more
accurate the results are to be.
Factors Determining Sample Size
• i) Nature of universe: If items of universe are homogenous, a
small sample can serve the purpose and vice-versa.
• ii) Number of classes proposed: If many class groups and
sub-groups are to be formed, a large sample would be
required because a small sample might not be able to give a
reasonable number of items in each class group. 106
Factors Determining Sample Size
iii) Nature of study: Small sample for studies taken intensively
and continuously. Large sample for a general survey, but a
small sample is considered appropriate in technical surveys.
iv) Types of sampling: A small random sample is apt to be
much superior to a large but badly selected sample.
v) Standard of accuracy and acceptable confidence level:
Larger sample, if high standard of accuracy or level of
precision is to be kept. For doubling the accuracy for a fixed
significance level, sample size has to be increased four fold.
vi) Availability of finance: Large samples result in increasing
the cost of sampling estimates.
vii) Other considerations: Nature of units, size of population,
size of questionnaire, availability of trained investigators, the
conditions under which the sample is being conducted, the
107
time available for completion of the study and so on.
Sampling Methods