Professional Documents
Culture Documents
Detecting and Genotyping CNV
Detecting and Genotyping CNV
Lachlan Coin
July 2010
Outline
• Population-haplotype approach to CNV
detecting and genotyping
f0 (g) = 1
rm ( g ) f1 ( g ) = log(CN( g )/2)
2
rm ( g )
f 2 ( g ) = (log(CN( g )/2)) 2
(g) = β * f 3 ( g ) = bfrac( g )
bm
2 ( g ) f ( g ) = bfrac ( g ) * (1 bfrac( g ))
bm 4
f ( g ) = bfrac( g ) * (bfrac( g ) 0.5) * (bfrac( g ) 1)
5
0
log 2ratio
-1
-2
-3
1
3
2
2
2
1
2.
3.
2.
2.
2.
3.
1.
q2
3.
4.
3.
q1
q2
p1
q2
p1
p1
p1
q2
q2
p1
chromosome 16
Population cohorts
3/1592 1/6235
(NFBC1966, CoLaus, EGPUT)
B-allele frequency
NGS versus CGH data
NGS data chrom1:350mb-351mb CGH data chrom1:350mb-351mb
NGS vs CGH data
Haplotype structure of deletion
NGS amplification
Depth/coverage
With consistent break-points in
population
Imputation error rate Switch error rate Polyploid phasing and imputation
Conclusions
• Population-haplotype model enables joint
CNV discovery and genotyping using array
data
• Preliminary results indicate this will also
help using NGS data
• Combining information from multiple
platforms improves sensitivity
• Imputation still works for ploidy > 2,
phasing becomes more difficult
Acknowledgements
Evangelos Bellos David Balding (UCL)
Shu-Yi Su Rob Sladek (McGill)
Robin Walters
Julian Asher
Alex Blakemore
Adam de Smith
Phillipe Froguel
Julia El-Sayed Moustafa