Visscher Present
Medical Research
Peter M. Visscher
peter.visscher@qimr.edu.au
Overview
• Darwin and Mendel
• Background: population genetics
• Background: quantitative genetics
• GWAS
– Examples
– Analysis
– Statistical power
[Galton, 1889]
Mendelian Genetics
Following a single (or several)
genes that we can directly score
Biometricians Mendelians
qq
QQ
RA Fisher (1918).
Transactions of the
Royal Society
of Edinburgh
52: 399-433.
Population Genetics
• Allele and genotype frequencies
• Hardy-Weinberg Equilibrium
• Linkage (dis)equilibrium
Allele and Genotype Frequencies
Given genotype frequencies, we can always compute allele
frequencies, e.g.,
$p_i = \mathrm{freq}(A_i A_i) + \frac{1}{2} \sum_{j \neq i} \mathrm{freq}(A_i A_j)$
The converse is not true: given allele frequencies we
cannot uniquely determine the genotype frequencies
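A minimal sketch of this asymmetry, assuming genotype frequencies are stored in a dictionary keyed by unordered allele pairs. The function name `allele_frequencies` and the example frequencies are hypothetical. Two different sets of genotype frequencies give the same allele frequencies, which is why the converse fails.

```python
def allele_frequencies(genotype_freqs):
    """Return allele frequencies from genotype frequencies.

    genotype_freqs maps (allele, allele) tuples to frequencies.
    Homozygotes contribute their full frequency to the allele,
    heterozygotes contribute half to each of their two alleles.
    """
    freqs = {}
    for (a1, a2), f in genotype_freqs.items():
        if a1 == a2:
            freqs[a1] = freqs.get(a1, 0.0) + f
        else:
            freqs[a1] = freqs.get(a1, 0.0) + 0.5 * f
            freqs[a2] = freqs.get(a2, 0.0) + 0.5 * f
    return freqs


# Hypothetical example: two populations with different genotype frequencies.
hw_population = {("A1", "A1"): 0.25, ("A1", "A2"): 0.50, ("A2", "A2"): 0.25}
no_heterozygotes = {("A1", "A1"): 0.50, ("A1", "A2"): 0.00, ("A2", "A2"): 0.50}

p_hw = allele_frequencies(hw_population)
p_nohet = allele_frequencies(no_heterozygotes)
print(p_hw, p_nohet)  # both populations have the same allele frequencies
```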
freq(AB) = freq(A)*freq(B)
freq(ABC) = freq(A) * freq(B) * freq(C)
Linkage disequilibrium
When linkage disequilibrium (LD) is present, alleles are no
longer independent: knowing that one allele is in the
gamete provides information on alleles at other loci:
[Figure: example gametes carrying alleles Q1 or Q2 at one locus and M1 or M2 at another]
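One common way to quantify this non-independence is the coefficient D = freq(Q1 M1) - freq(Q1) * freq(M1), which is zero when the two loci are independent. The sketch below estimates D from a list of gametes. The function name `ld_coefficient` and the gamete counts are hypothetical and are not taken from the figure.

```python
from collections import Counter


def ld_coefficient(haplotypes, allele_q="Q1", allele_m="M1"):
    """Estimate D = freq(Q M) - freq(Q) * freq(M) from observed gametes.

    haplotypes is a list of (allele at locus Q, allele at locus M) tuples.
    """
    n = len(haplotypes)
    counts = Counter(haplotypes)
    f_qm = counts[(allele_q, allele_m)] / n
    f_q = sum(c for (q, _), c in counts.items() if q == allele_q) / n
    f_m = sum(c for (_, m), c in counts.items() if m == allele_m) / n
    return f_qm - f_q * f_m


# Hypothetical sample in LD: Q1 mostly travels with M1, Q2 with M2.
in_ld = [("Q1", "M1")] * 4 + [("Q2", "M2")] * 4 + [("Q1", "M2")] + [("Q2", "M1")]
# Hypothetical sample in equilibrium: all four gametes equally common.
in_le = [("Q1", "M1"), ("Q1", "M2"), ("Q2", "M1"), ("Q2", "M2")]

d_ld = ld_coefficient(in_ld)
d_le = ld_coefficient(in_le)
print(d_ld, d_le)
```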
The Decay of Linkage Disequilibrium
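The frequency of the AB gamete is given by
$\mathrm{freq}(AB) = \mathrm{freq}(A)\,\mathrm{freq}(B) + D_{AB}$
where $D_{AB}$ is the linkage disequilibrium between A and B.
[Figure: decay of LD over generations 0 to 100 for recombination fractions c = 0.10, c = 0.01 and c = 0.001]
NB: Gene mapping & GWAS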
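The curves for c = 0.10, 0.01 and 0.001 can be sketched from the standard result that, under random mating, LD shrinks by a factor (1 - c) each generation, so D(t) = (1 - c)^t D(0). The code below assumes that result and reports the fraction of the initial LD that remains. The function name `ld_remaining` is hypothetical.

```python
def ld_remaining(c, generations):
    """Fraction of the initial LD left after a number of generations,
    assuming D(t) = (1 - c)**t * D(0) with recombination fraction c."""
    return (1.0 - c) ** generations


for c in (0.10, 0.01, 0.001):
    trajectory = [round(ld_remaining(c, t), 3) for t in (0, 10, 50, 100)]
    print(f"c = {c}: {trajectory}")
```

With tightly linked loci (small c) most of the LD is still present after 100 generations, which is what makes it useful for gene mapping and GWAS.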
Forces that Generate LD
• Drift (finite population size)
• Selection
• Migration (admixture)
• Mutation
• Population structure (stratification)
Basic model: P = G + E
Genotype ee Ee EE
Contribution of the QTL to the Variance (X)
Genotypes          AA     Aa     aa
Effect, x          a      d      -a
Frequencies, f(x)  p^2    2pq    q^2    (HW proportions)
$\mathrm{Var}(X) = \sum_i (x_i - \mu)^2 f(x_i) = V_{QTL}$
Biometrical model for single diallelic QTL
$V_{QTL} = 2pq[a + (q-p)d]^2 + (2pqd)^2 = V_{A_{QTL}} + V_{D_{QTL}}$
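As a numerical check, the variance computed directly from the genotype effects and Hardy-Weinberg frequencies should equal the sum of the additive and dominance components. The sketch below does this for hypothetical values of p, a and d. The function name `qtl_variance` is an assumption for illustration.

```python
def qtl_variance(p, a, d):
    """Return (total, additive, dominance) variance for a diallelic QTL
    with allele frequency p, genotype effects a, d, -a for AA, Aa, aa,
    and Hardy-Weinberg genotype frequencies."""
    q = 1.0 - p
    effects = (a, d, -a)
    freqs = (p * p, 2 * p * q, q * q)
    mu = sum(x * f for x, f in zip(effects, freqs))
    v_total = sum((x - mu) ** 2 * f for x, f in zip(effects, freqs))
    v_add = 2 * p * q * (a + (q - p) * d) ** 2
    v_dom = (2 * p * q * d) ** 2
    return v_total, v_add, v_dom


# Hypothetical parameter values.
v_total, v_add, v_dom = qtl_variance(p=0.3, a=1.0, d=0.5)
print(v_total, v_add + v_dom)  # the two numbers agree up to rounding error
```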
[Figure: genotypic values -a, d and a for genotypes aa, Aa and AA (Fisher 1918)]
• GWAS works
• Effect sizes are typically small
– Disease: OR ~1.1 to ~1.3
– Quantitative traits: % variance explained <<1%
Effect sizes QT (104 SNPs)
[Figure: histogram of % variance explained, quantitative traits; y-axis: Frequency]
Linear model for single SNP
• Allelic
Additive model
Y = µ+ b*x + e
x = 0, 1, 2 for genotypes aa, Aa and AA
• Genotypic
Additive + dominance model
Y = µ + Gi + e
Gi = genotype group corresponding to
genotypes aa, Aa and AA
Method
• Linear regression
• ANOVA
• (other: maximum likelihood, Bayesian)
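A minimal sketch of both models on simulated data, using only the standard library. The allelic model is fitted by least squares on x = 0, 1, 2. For the genotypic model the fitted values are the three genotype group means. All parameter values, the seed and the function names are hypothetical.

```python
import random


def fit_allelic(x, y):
    """Least-squares fit of Y = mu + b*x + e; returns (mu_hat, b_hat)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b_hat = sxy / sxx
    return mean_y - b_hat * mean_x, b_hat


def fit_genotypic(x, y):
    """Fit Y = mu + G_i + e: the fitted value per genotype is its group mean."""
    groups = {}
    for xi, yi in zip(x, y):
        groups.setdefault(xi, []).append(yi)
    return {g: sum(v) / len(v) for g, v in sorted(groups.items())}


# Hypothetical simulation: allele A has frequency p; genotypes aa, Aa, AA
# (x = 0, 1, 2) have values -a, d, a, plus normal noise with variance 1.
random.seed(1)
p, a, d = 0.4, 0.5, 0.3
x = [(random.random() < p) + (random.random() < p) for _ in range(5000)]
genotypic_value = {0: -a, 1: d, 2: a}
y = [genotypic_value[xi] + random.gauss(0.0, 1.0) for xi in x]

mu_hat, b_hat = fit_allelic(x, y)
group_means = fit_genotypic(x, y)
print("allelic fit (mu, b):", round(mu_hat, 3), round(b_hat, 3))
print("genotype means:", {g: round(m, 3) for g, m in group_means.items()})
```

The allelic model spends one degree of freedom on the SNP and the genotypic model two, so the latter can capture the heterozygote deviating from the midpoint of the two homozygotes.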
Test statistic (allelic model)
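$T = \hat{b} / \sigma(\hat{b}) \sim t_{N-2} \approx N(0,1)$
$T^2 = \hat{b}^2 / \mathrm{var}(\hat{b}) \sim F_{1,N-2} \approx \chi^2_1$
$\mathrm{var}(\hat{b}) = \frac{\sigma_e^2}{N\,\mathrm{var}(x)} = \frac{\sigma_e^2}{N\,2p(1-p)}$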
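The test statistic can be sketched as follows: estimate the slope, estimate the residual variance with N - 2 degrees of freedom, and divide the slope by its standard error. The p-value uses the N(0,1) approximation. The simulated data, the seed and the function name `allelic_test` are hypothetical. The last lines compare the estimated var(b) with the approximation sigma_e^2 / (N 2p(1-p)), which assumes Hardy-Weinberg proportions.

```python
import math
import random


def allelic_test(x, y):
    """Return (b_hat, var_b, t, p_value) for the model Y = mu + b*x + e."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)  # N * var(x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b_hat = sxy / sxx
    mu_hat = mean_y - b_hat * mean_x
    rss = sum((yi - mu_hat - b_hat * xi) ** 2 for xi, yi in zip(x, y))
    sigma2_e = rss / (n - 2)          # residual variance estimate
    var_b = sigma2_e / sxx            # sigma_e^2 / (N var(x))
    t = b_hat / math.sqrt(var_b)
    p_value = math.erfc(abs(t) / math.sqrt(2.0))  # two-sided, normal approximation
    return b_hat, var_b, t, p_value


# Hypothetical simulation with residual variance 1.
random.seed(0)
n, p, b = 5000, 0.3, 0.2
x = [(random.random() < p) + (random.random() < p) for _ in range(n)]
y = [b * xi + random.gauss(0.0, 1.0) for xi in x]

b_hat, var_b, t, p_value = allelic_test(x, y)
approx_var_b = 1.0 / (n * 2 * p * (1 - p))  # sigma_e^2 / (N 2p(1-p)) with sigma_e^2 = 1
print(b_hat, t, p_value)
print(var_b, approx_var_b)
```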
Statistical Power (additive model)
$q^2 = 2p(1-p)[a + d(1-2p)]^2 / \sigma_p^2$
$\lambda = N q^2 / (1 - q^2) \approx N q^2$
Required sample size given type-I (α) and type-II (β) error:
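A sketch of the calculation, assuming the usual normal-approximation form N = (1 - q^2)/q^2 * (z_{1-α/2} + z_{1-β})^2, which follows from setting the non-centrality parameter λ equal to (z_{1-α/2} + z_{1-β})^2. That specific form is an assumption here, as are the function names and the example values.

```python
from statistics import NormalDist


def variance_explained(p, a, d, var_p):
    """q^2: proportion of phenotypic variance explained by the SNP (additive model)."""
    return 2 * p * (1 - p) * (a + d * (1 - 2 * p)) ** 2 / var_p


def ncp(n, q2):
    """Non-centrality parameter lambda = N q^2 / (1 - q^2)."""
    return n * q2 / (1 - q2)


def required_n(q2, alpha, beta):
    """Sample size giving power 1 - beta at two-sided type-I error alpha
    (normal approximation; an assumed form, see the text above)."""
    z = NormalDist().inv_cdf
    return (1 - q2) / q2 * (z(1 - alpha / 2) + z(1 - beta)) ** 2


# Hypothetical example: a SNP explaining 0.1% of the variance,
# a genome-wide threshold of 5e-8 and 80% power.
q2 = 0.001
n_needed = required_n(q2, alpha=5e-8, beta=0.2)
print(round(n_needed), round(ncp(n_needed, q2), 2))
```

Because λ is approximately N q^2, halving the variance explained roughly doubles the sample size needed for the same power.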
[Figure: mean PHENOTYPE with 95% CI by genotype (0, 1, 2) at RS2070959_A; 38% of phenotypic variance explained]
1984