Professional Documents
Culture Documents
Microbial Genetics
Microbial Genetics
Metagenomics
Individual genomes
Sequence determination
Comparison with data banks
Soil sample
opcL
LMG 16660 B. m ult. (II) LMG 16668 B. m ult. (II) R 2900 B. cc. (VI) LMG 13544 B. am b. (VIII)
31 11 11
Re-amplification
Sequence analysis
58
47 96
89 98
LMG 17829 B. anth. (VII) R 4183 B. am b. (VIII) LMG 10824 B. viet. (V)
Soil sample
(20-100kb)
Transformation of E. coli
Amylase activity
Metagenomics/Environmental genomics
Traditional Microbial Genomics Environmental Genomics
http://www.asmusa.org/memonly/asmnews/jul00/feature3.html
http://www.asmusa.org/memonly/asmnews/jul00/feature3.html
http://www.asmusa.org/memonly/asmnews/jul00/feature3.html
Sargasso Sea
1,000,000
Minnesota Soil Whale fall community Anaerobic methane oxidation Acid mine drainage Human Arabidopsis Rice
Mouse gut
Worm
Fly
10,000
H.influenza
Yeast
Mycoplasma
100
1995
2000
2005
2007
Goals in metagenomics
Medical: understanding human-microbial interplay Biotech: discovering novel enzymes, streamlining industrial processes
Fundamental sciences: ecology, evolution, biogeography, microbiology Understanding ecosystem functioning and complexity
ecological dominance Sequencing is not exhaustive! Incomplete genomes, half- or totally not assembled Short reads (no assembly) with incomplete genes Huge datasets computing power (or smart programming) needed! Sampling issues: biases, contaminations Before you can understand microbial communities from their genome data, you first need to determine which influencing factors/properties of the organisms/environments might be influencing computational analyses!
Comparative Metagenomics
What is different between communities? What is the same? Can we learn something about how organisms adapt to their environment?
1.6 Mb
3.2 Mb
3.6 Mb
4.5 Mb
Preferred habitats
currently known genomes not very representative clear habitat preferences of microbial clades changes in habitat rather rare ?
Data integration: Reconciling species diversity with functional diversity and habitat (geochemical) data
(collab. P.Hughenholz JGI / N.Pace Boulder CO)
Soil organisms are slow-evolving (+high GC, +large genomes), Surface water organisms evolve rapidly (+low GC, +small genomes).
silica te tem pe ra ture nitra te /phos pha te salinity oxygen sam pling de pth multiple fa ctors
temperature
oxygen
oxygen
Now metagenomics, soon metatranscriptomics, meta-metabolomics Computational integration will be main challenge