Professional Documents
Culture Documents
Field Experimentation
Field Experimentation
FIELD EXPERIMENTATION
LECTURER
Prof. Joseph Sarkodie-Addo
Telephone :0244576813
Email : jaddosak@yahoo.com
TA’s
* IKe (Postgraduate Assistance)-0501542625 (WhatsApp only)
EXPERIMENT
•An experiment is a scientific investigation conducted to obtain a
solution to a problem, to find the cause of the occurrence of an event, or
to provide a better way of doing things.
PRELIMINARY EXPERIMENT
In which case, real solution is expected but it gives an indication for
the next level of study.
CRITICAL
This is where the lead or results obtained at the previous level is
subjected to a closer and more critical study in order to arrive at the
possible solution to the problem.
DEMONSTRATIONAL
This is where after a new technology or the solution gotten or
obtained is shared with relevant stakeholders so that they can
adopt to improve their situation.
FIELD EXPERIMENT
These are conducted on open lands where conditions are
difficult to control.
CONTROLLED CONDITION EXPERIMENT
These are conducted in the greenhouse or glasshouse, where
conditions can be manipulated to achieve better results.
TREATMENT.
This is the procedure whose effect is investigated during
experimentation.
E.g there can be different levels of the same factor or different
factors.
Choice of treatment must be scientific and must be randomly
applied to the experimental material.
NB. The most important fact of experimentation is objectivity.
POPULATION
This is the total number of individuals. Whereas a part of the
population is called a sample.
For the information on the sample to be correctly applied, it must
be
• Large enough
• Random
MEAN
This is the of a data set which is obtained by summing the
individual data set and dividing by the total number of
individuals
MODE
The frequently occurring figure in the data after arranging the
figure in ascending order.
MEDIAN
This is the central placed figure after arranging the data in an
ascending order. When the data set is an even number, we add
the 2 figures in the middle and divide it by 2.
VARIATION
This is the measure of the dispersion variability of a data set on
the greater the variance, the less reliable the data appears.
This is calculated by summing the square of the deviations, the
mean and dividing by the number of observations.
STANDARD DEVIATION
This is a measure of variation which is calculated by finding the
square root of the variance.
The ff a
EXPERIMENTAL MATERIAL
This is the substance to which the treatment is applied.
EXPERIMENTAL UNIT
This is the portion of the experimental material of which data are
taken or measured.
EXPERIMENTAL RESULTS
This is the outcome of the experimental procedure.
Note that in some cases, we can have negative or positive
results.
EXPERIMENTAL ERROR
This is variation that cannot be ascribed to any particular
source.
NB
All experiments are supposed to have errors. But the lower the
error level, the more precise the results.
RANDOMIZATION
This is the process of giving every member of the population
on equal chance to be selected.
STANDARD ERROR
This is obtained by dividing the standard deviation by the square
root of the number of the variability of the data set.
CO-EFFICIENT OF VARIATION
It is obtained by dividing standard deviation by the mean.
Ensuring accuracy in scientific experimentation
• Proper randomization is by giving each number of the population -
sample equal chance of been selected. It cuts across the entire
experiment.
• Select sensible treatment. Make sure your treatment is sensible.
• Refine experiment design. Split plot design is used when a factor is
more important than the other. Factorial designs allow researchers to
look at how multiple factors can affect a dependent variable that is
knowing more than one independent variable.
NB: it is used when there are more than 2 levels of each factor. Refining
your experimental design is all about changing.
• Increase number of replications on blocks.
Normally, people use 4,3 and 5 replications. Tuber size, interaction
of growth media, this will help you get accurate results.
• Be as objective as possible. You cannot estimate a false size of a
certain plant.
Example: one estimating cowpea to grow more than 4m is
impossible.
NB: when you get negative results, find scientific explanations
to back up your reason.
CONDUCTING AND REPORTING UNDERGRADUATE PROJECT
WORK.
1. Settle with supervisor what you will do
2. Nature of project- field, pot, laboratory etc.
3. Choose the best site and prepare land or chemicals etc.
4. Pegging and demarcation of plots
5. Planting. All plots or rep by rep.
6. Initial soil sample analysis.
7. Data – what data to measure?(All possible and relevant. Settle on data before
experiment).
8. Take data at times appointed. Objectiveness supreme.
9. Recording data
Field note book
Pen drive
Laptops
10. Data analysis – ANOVA, other statistical packages.
Where Yi = measurement
µ = sum of population mean
Ei = error
Ti = Treatment
We say that in all the design, the population mean is so large that we
cannot measure it, so it is always represented with zero (0)
Writing the model of CRD design means state the formula don’t just
write 0.
TYPICAL CRD ANOVA TABLE
Source df ss ms F
TMS /
Treatment t–1 ∑Ti2 / r – C.F SS(T) /DF ERROR MS
TOTAL SS –
Error (N – 1) – (t -1) TREATMENT SS SS(E) /DF
Total N–1 Ʃn i2 – CF
• Calculate and complete the ANOVA table and indicate the fertilizer
treatment, significantly affected the yield of the tomato plant and state
your conclusion.
Treatment R1 R2 R3 R4
1 60 64 65 55
2 75 70 66 69
3 74 78 72 68
Solution
We are interested in the total
T1 = 244, T2 = 280 T3 = 292
Grand total = TG = 816.
The second thing to do is to calculate the correction factor (CF)
CF = =
CF = 665856 / 12 = 55488
Ss treatment = ∑fi2 – C.F / r
Ss (treatment) = (244)2 + (280)2 + (292)2 / 4 – 55488
Ss (treatment) = 312
Ss total = 602 + 642 +652 + 552 +752 +702 + 662 +692 +742 +782 +722 +682 -(55488)
Ss total = 55956 – 55488
Ss total = 468 Error ss = 468 – 312 = 156
Source DF SS MS F (VR)
Treatment 3–1 312 312/2 156/17.3 =
(T – 1) ∑FI2 – C.F / R (SST / DF) 9.01
(TMS / ERROR
MS)
Error 11-2 = 9 156 156/9
(N – 1) – (T – 1) SS(TOTAL) - (SSE / DF)
SS(TREATMENT
)
Total 12-1 = 11 468
(N – 1) ƩN I2 – CF
The total treatment was 3 so the df = N -1 = 3 – 1 = 2
To determine if the treatment effect was significant or not, we compare
the F calculated with the F tabulated. Where the F calculated is greater
than F tabulated, we reject the null hypothesis and declare that
treatment effect significant. On the other hand, if the F calculated is
less than F tabulated, we accept the null hypothesis and declare that the
treatment effect was not significant and that any variation among the
treatment means was simply due to chance. The above is the typical
conclusion to give.
F calculated is 9.01 and hence we reject the null hypothesis since F
calculated is greater than F tabulated and declare that the treatment
effect was significant.
Note the ff.
SS (treatment) = ∑fi2 – C.F / r
Ss (error) = ss (total) – ss (treatment)
Ss (total) = ∑n12 – C. F
MS (treatment) = ss (treatment) / (df (treatment)
Ms (error) = ss(error) / df (error)
Work Example 2
the following table shows the result of an experiment which was
conducted in a growth chamber to determine the mycelial growth of a
fungus at the university of Guelph. 4 different treatments were applied at
the same rate.
i) Test the null hypothesis of no difference among the treatment means
and state your conclusions.
ii)determine the best and the worst treatment.
Treatment DOK1 DOK2 DOK3 DOK4
=10339 – 9522.25
Ss(total) = 816.75
Ss(error) = ss(total) – ss(treatment)
Ss(error) = 816.75 – 549.91
Ss(error) = 266.84
Source df Ss ms F
Treatment 4-1=3 549.91 183.30 10.98
Error (20-1) - (4-1) 266.84 16.68
=16
Total 19 816.75
Conclusion
From the table, F tab is 3.13. Since F cal. Is greater than F tab, we reject the
null hypothesis and declare significance.
Whereas, the best treatment was treatment 1 and the worst treatment being
treatment 3 according to the result.
RANDOMIZED COMPLETE BLOCK DESIGN
This is probably the most used in field trials. under the CRD design, it was
stated that is the best and probably the only design sure that there are no
difference in the experimental material. but in real life situation, things differ.
And because of the difference in the experiment material so that any difference
that are measured during experimentation can be fully or wholly attributed to
the different treatment applied. In this case, we put the materials that are similar
into one group and all others into another group.
µ= population Mean
Eij= Error
• A = MEAN – B MEAN OF X
RCBD ANOVA TABLE
Source df ss ms F
1.Calculate and complete the ANOVA table and indicate whether His
assumption of the non homogeneity of the field was supported by the data
presented in the following table.
2. if his assumption was not supported by the data, suggest the design He
should have used.
Solution
We first of all calculate the Treatment totals.
T1= 4.4 + 5.9 +6.0 + 4.1 =20.4
T2= 3.9 + 1.9 + 4.9 +7.1 = 17.8
T3= 4.4 + 4.0 + 4.5 + 3.1 = 16
T4= 6.8 + 6.6 + 7.2 + 6.4 = 27
T5= 6.3 + 4.9 + 5.9 + 7.1 =24.2
T6= 6.4 + 6.3 + 7.7 +6.7 = 27.1
Having calculated the treatment totals, we move on to calculate the block totals.
B1= 4.4 + 3.9 + 4.4 + 6.8 + 6.3 + 6.4 = 32.2
B2= 5.9 + 1.9 + 4.0 + 6.6 + 4.9 + 6.3 = 29.6
B3 = 6.0 + 4.9 + 4.5 + 7.2 + 5.9 + 7.7 = 36.2
B4 = 4.1 + 7.1 + 3.1 + 6.4 + 7.1 + 6.7 = 34.5
The next step to consider, is calculating the Grand total of the treatments and
Grand total of Block.
You need to make sure that the grand total of the treatment is the same as
the grand total of the block.
GT (treatment) = 20.4 + 17.8 + 16 + 27 + 24.2 + 27.1= 132.5
GT (Block) = 32.2 + 29.6 + 36.2 + 34.5 = 132.5
The next thing to calculate for is, the C.F
C.F =
SS (treatment)=∑ti2 – C.F / r
SS (treatment) =
=28.00
SS (Block) =∑bi2 – C.F / t
SS (Block) =
SS (Block) = 4.11
Total SS = ∑
Total SS =
-731.51
Total SS = 50.28
ANOVA TABLE
Source df ss ms F
Treatment 5 28.00 5.6 4.63
Block 3 4.11 1.37 1.13
Error 15 18.17 1.21
Total 23 50.28
F- tab = 3.29
F- calc.=1.13
In conclusion, we accept the null hypothesis because, there was no significant
difference in the experimental materials.
CRD was the design he should have used.
Example 2
Tomato plant given fertilizer treatment were grown in a green house, 8
positions were designed on the greenhouse benches and 5 plants were placed
at each position, each plant was expressed as kg of ripe fruits.
1. Test the null hypothesis of no difference among the fertilizer treatment and
indicate the best treatment.
2. Did the data support blocking, even in the greenhouse?
Position T1 T2 T3 T4 T5
1 1.29 1.33 1.24 1.28 1.31
2 1.10 1.00 1.05 1.03 1.17
3 1.34 1.31 1.41 1.39 1.28
4 1.39 1.31 1.41 1.39 1.28
5 1.30 1.26 1.35 1.36 1.39
6 1.19 1.24 1.17 1.18 1.14
7 1.33 1.35 1.29 1.38 1.32
8 1.00 1.09 1.19 1.14 1.14
Solution
Block Totals.
B1=6.45
B2=5.35
B3=6.73
B4=6.78
B5=6.66
B6=5.92
B7=6.67
B8=5.56
Treatment Totals
T1=9.94
T2=9.89
T3=10.11
T4=10.15
T5=10.03
Grand total of treatment =50.12
Grand total of blocks (position) = 50.12
C.F =
SS (treatment) =
Ss(treatment) =
Ss (treatment) = 0.01
SS (block) =
SS (block) =
= 0.46
SS (Total) = ƩNi2 – CF
= 63.35 - 62.80
= 0.55
SS(error) = TSS-SSB-SST
SS(error) = 0.55 - 0.46 - 0.01
SS(error) = 0.08
ANOVA TABLE
Source df ss ms F-cal F-tab
(0.05)
Treatment 4 0.01 0.003 23.33 2.36
Total 39 0.55
F – tab is 2.36
Since F- cal is < F- tab, we accept the null hypothesis that there are no
differences in the fertilizer treatments. The Block – effect is highly
significant which proves that the experimenter was right in saying that the
position of a plant on the greenhouse bench could affect growth.
The best treatment was treatment 4 which had a value of 10.15
Again, the data shown above, indicates that blocking was necessary or
justifiable.
LATIN SQUARE DESIGN
In CRD, it was stated that it is the only design to use when the
experimenter is sure that the experimental material is
homogenous. Even though this may be correct, in some instances,
difference may occur. where this is so, we carry out the
experiment by putting the materials similar in one group.
In LSD,
no. of rows = number of columns = number of treatments.
C1 C2 C3 C4
R1 A B C D
R2 B C D A
R3 C D A B
R4 D A B C
LATIN SQUARE ANOVA TABLE
source df ss ms F
Treatment t-1
Row t-1
Column t-1
Row Totals
R1=100.43
R2=104.89
R3=110.13
R4=113.57
R5=117.42
Column Totals
C1=106.01
C2=108.4
C3=110.81
C4=108.96
C5=112.26
Treatment Totals.
A= 87.86
B= 94.55
C= 116.81
D= 123.53
E= 123.69
C.F =
SS (treatment) =
=11,943.87
= 228.64
SS (row) =
= 11,943.87
=36.59
SS (column) =
= 11,943.87
= 4.56
Total 24 286.71
Since F – cal > F – tab, we reject the null hypothesis that there are no differences
between the treatments.
We are 95% confident that, the difference between the best and the worst treatment
lies between 5.53 kg and 8.79kg of ripe fruits per plot.
Assignment
the table below shows the yield of wheat in kg per plot. In an experiment
where 4x4 Latin square was used at the university of California in 2009.
i) calculate and complete the ANOVA table and perform an independent t-test
between the second best treatment and the worst treatment.
Total 15 98.54
Second best treatment = TA=48.0
Worst treatment = TD=26.9
T- test
T=
Where
MeanA= mean of second best treatment
MeanD = mean of worst treatment
S.E = Standard Error.
But,
S.E =
=
=1.01
MeanA =
MeanD =
T=
= 5.22
F- tab = 2.447
F- cal. = 5.22
Therefore, since F- cal is greater than F – tab, we reject the null
hypothesis and conclude that there was a significant difference
between the second best and the worst treatment.
C.I =
= 12
= 6.725
= 2.447
S.E = 1.01
C.I = 12 – 6.725 2.447 x 1.01
= 5.275 2.471
We are 95% confident that, the difference between the second best
and worst treatment lies between 2.80kg and 7.75kg of wheat yield
per plot.
Row
F – tab = 4.76
F – cal = 0.21
Column
F – tab = 4.76
F – cal = 2.08
Since the F – cal of both Row and Column are less than their
respective F - tab, we accept the null hypothesis of
insignificance and declare that, the experimenter ought not to
have blocked along both sides ( row and column).
Completely Randomized Design was the best design he should
have used. Therefore, the experimenter was wrong in using
LSD.
Hence the new CRD ANOVA table will be…
Source df ss ms F
Error 12 20.02 1.67
Total 15 98.54
FACTORIAL EXPERIMENT
In the previous discussions, we were dealing with different rates of
the same factor or different sources of that same factor.
But in this experiment, we will be dealing with more than one
factor, hence treatment in this type of experiment are the
combination of different level of different factors.
Factor A a-1 A
Ax B (a-1) (b-1) AB
Total rab-1
Where,
Ay = sum of squares due A.
By = sum of squares due B.
In factorial crops. If the interaction F is significant,
you are not allowed to present the result of the main
effects.
The table below shows the yield of egg- plant in kg per plot after
all fruits from the three replications have been harvested.
Calculate and complete the ANOVA table and state your
conclusions
Nitrogen Phosphorus Rep 1 Rep 2 Rep 3 NP Totals (N) Totals
kg/ha kg/ha
0 38.62 40.26 39.23 118.11
0 220.47
40 32.75 32.36 37.25 102.36
0 55.07 53.86 47.37 156.30
20 306.16
40 48.81 49.62 51.43 149.86
0 40.49 51.94 46.94 139.37
40 277.09
40 46.11 52.31 39.30 137.72
0 58.88 50.43 49.93 159.24
60 316.15
40 61.55 48.49 46.87 156.91
Rep. Totals 382.28 379.27 358.32 1119.87
Phosphorus Totals
P 1 = 573.02
P 2 = 546.85
Calculation of Variances
C.F = = 52254.53
TSS = (38.62)2 + (40.26)2 … (46.87)2 – C.F
= 53612.84 – 52254.53
= 1358.31
Rep SS =
=52297.12 – 52254.53
=42.59
Nitrogen (SS) =
=53178.44 – 52254.53
=923.91
Phosphorus (SS) =
= 52283.07 – 52254.53
= 28.54
(NP) SS =
=53228.06 – C.F – Ny – Py
= 973.53 – 923.91 – 28.54
= 21.08
C.F =
Where r = Reps
A=a
B=b
Sources df SS MS F
Replication R – 1 =2 42.559 21.30 0.87
Nitrogen N–1=3 923.91 311.30 12.74
Phosphorus P–1=1 28.54 28.54 1.17
NxP (n -1) (p-1) =3 21.08 3.69 0.15
Error (r-1) (np -1) 342.19 24.44
=14
Where,
µ = population mean
Replication effect
Error A
factor B
= Interaction AXB
= Sub- plot Error or Error II
What is Research ?
Even though this gives a good impression about the treatment that
was applied, we are limited as to where the differences lie, because
more than one treatment is over used during experimentation. Thus,
we need to explore, to identify which pair of treatment means do
we have significant differences and where we have not.
Where the difference is greater than the critical value, it means the
difference between such pair of treatment is significant and vice
versa.
How to calculate critical value under LSD
Critical value = t αdf x SED
Where,
t αdf = t – tab value
SED = Standard Error
S.E =
=
= 2.131 x 0.778
CV = 1.65
Assuming treatments are:
Treatment Total Treatment
Mean
A = 20.4 5.1
B = 17.8 4.45
C = 16 4.0
D = 27 6.75
E = 24.2 6.05
F = 27.1 6.77
But CV = 1.65
CV = t αdf x
Treatment with the greatest effect (F) = 6.77
Treatment with the least effect (C) = 4.0
The difference between the greatest treatment and
the worse treatment will be
6.77 – 4.0 = 2.77
We now compare the difference with the critical value. Since
the difference is greater than the CV, we declare significant
difference between the treatment.
0 x
a. Independence of error: this will be achieved for you to use the
ANOVA to run the data, if proper randomization was done at
all stages of the experiment.
b.Homogeneity of Variances: the assumption is that, errors
associated with the various treatments are all proper estimates
of true population error variance. In order words, The true
population error variance is being estimated by pulling or
adding all the other variance
c. Experimental error must be normally distributed: this means
that the bulk of the measurement must lie at the mid-point
with few outliners.
DATA TRANSFORMATION
This is the process where the nature of the data set is
changed so that they can conform to the assumptions
under the ANOVA so that we can use the ANOVA
technique to run the data.
Remember that it is only the nature of the data set that
changed.
Three Methods of Data Transformation
1.Square root Transformation.
This is used where the data are counts (number of tillers,
number of leaves, number of branches, number of insects,
number of bacteria colonies, number of nodules etc.)
This transformation is also appropriate when the
measurement consist of small whole numbers. In effect, what
we do is that, before data analysis, every number must
calculate the square root of each data set and run the data on
these values.
This method are normally used by scientist.
1.Logarithm ( log) Transformation: this is used when the treatment
effect is multiplicative instead of additive.
Any log base could be used but most often 10 is used. Base 10
is the simplest among the other bases. When the data to be
transformed includes Zero. 1 is added to every data before
transformation.
a.Positive correlation
b.No correlation
c.Negative correlation
Simple linear regression
This is a dependent – independent relationship.
From the scatter diagrams above, it is so because all the
treatment effect do not lie on a straight line. So we draw
the line that best fits the occasion. This is the regression
line.
We can thus estimate the regression co-efficient, which is
the regression co- efficient and that is the amount of
increase in the dependent variable for a unit increase in
the independent variable.
X Y XY
10 15 150 100
12.5 17 212.5 156.25
15 21 315 225
17.5 20 350 306.25
20 27 540 400
= 75 =100 =1567.5 =1187.5