• There are 10 times as many microbial cells as human cells in our bodies. We
are constantly learning more about how changes in the composition of
these communities of organisms (microbiomes) correlate with human
health. Studies have shown that disruptions in our microbiomes may
influence the course of particular disease states. The HMP will expand
upon these studies by performing 16S rRNA and metagenomic sequencing
of samples from a healthy population to address questions such as
whether there is a "core" microbiome at individual body sites and
whether variation in the microbiome can be systematically studied. If you
have questions about this aspect of the HMP, please contact us via the
feedback form in the upper right-hand portion of the screen.
What is an accession number?
BLAST is…
• Basic Local Alignment Search Tool
• NCBI's sequence similarity search tool
• supports analysis of DNA and protein databases
• 100,000 searches per day
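To make "local alignment" concrete, here is a minimal sketch of the classic Smith-Waterman scoring recurrence. This is not how BLAST is implemented (BLAST uses a faster seed-and-extend heuristic); it only illustrates what a local alignment score is. The function name and the match, mismatch, and gap values are assumptions chosen for illustration.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Scores are illustrative assumptions, not BLAST defaults.
    """
    cols = len(b) + 1
    prev = [0] * cols          # previous row of the scoring matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * cols
        for j in range(1, cols):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    # The shared core "ACG" is found even though the flanks differ.
    print(smith_waterman_score("TTACGTTT", "GGACGGG"))
```

The floor at zero is what makes the alignment local: a poorly matching flank cannot drag down the score of a well-matching core.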
BLAST
Step 2: Choose the BLAST program
Activity
• Use the HMP database to find the following:
Acidaminococcus sp.
Gram stain: +
Shape: coccus
Oxygen req.: anaerobic
Range: mesophilic
Protein FASTA of conserved hypothetical protein [Acidaminococcus sp. D21]:
MHMLLYVALGGALGSVGRYLVAGSLKGVGGTDFPWHMIAVNTLGCILVGFFVAVLYVKLP
HPRWINLFYWGFIGGFTAFSSFIKEGMHFFLHGEHITGFLYIFLQNMLGMFAAGAAFWLG
KMLL
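A protein FASTA record like the one above can be read with a few lines of Python. This is a minimal sketch: the `parse_fasta` helper is hypothetical, and the `>` header line is an assumption added here because the slide shows only the description and the residues.

```python
def parse_fasta(text):
    """Parse FASTA text into a dict mapping header -> sequence string."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    # Join the wrapped sequence lines of each record into one string.
    return {h: "".join(parts) for h, parts in records.items()}


# Header line is assumed; the residues are copied from the slide.
fasta_text = """>conserved hypothetical protein [Acidaminococcus sp. D21]
MHMLLYVALGGALGSVGRYLVAGSLKGVGGTDFPWHMIAVNTLGCILVGFFVAVLYVKLP
HPRWINLFYWGFIGGFTAFSSFIKEGMHFFLHGEHITGFLYIFLQNMLGMFAAGAAFWLG
KMLL
"""

records = parse_fasta(fasta_text)
protein = next(iter(records.values()))

if __name__ == "__main__":
    for name, seq in records.items():
        print(name, len(seq), "residues")
```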
WGS: assembly FASTA
The first gene from NCBI:
  1 ttataaagtt ttataaagga cgttgaagac tttcaagagt ttttcgttcc gtctttcggt
 61 tgcttcctta tctttcccat tcgagtagtc gtagaagcct ttcttggtct tcacacccag
121 ttcacccttt tcataatgtt cctttaaaag agtggggatc tcatggctgt catcaaggtc
181 cttcatgagg taactcgaga catggtagaa cgtgtcgatc ccgccaaaat ccatggtttc
241 gagcggtcca atgcaggccc agcggaaagc aagtccatat ttcataacag cgtcaatatc
301 ttcagcagaa acaacacctt ttttcaccaa agacagggct tcccgcacga cagccagctg
361 gatgcggttt gcagcaaaac ccaggacatc cttattgaca atgaccggtt tctttccaat
421 ggtgcgggca aggtcccgaa cggcctcagc cacgtcttgg caggtttcgt catttttaat
481 gatttcaata aggagaataa gcgtcggcgg attgaaccag tgcatcccta aaaagcgttc
541 cctgtgcgtc acaaattggg ccagggcatt gatggaaaga cctgatgtat ttgtcgcaag
601 gatcatgtcc gcatcggcca tcttgcagaa agattcatag aacccctctt taatggccat
661 gtcttccgtc acgttttcaa tgacaatatc gagatgggcg atgtcctcca gattggtcgt
721 atagtggatc ttgtcgcggc tcgtttcact gatgagggtc tttgcacgct caagcgtagg
781 ctttctatgg ttccaaaggg tcacatcaaa accataggaa gcaaagatat ccgccatgga
841 atagcccatc gtaccagcgc cggcaatgcc gattcgtttg atctccataa tcaaggctcc
901 tttcaagatc at
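Before a sequence copied from a GenBank record can be pasted into a BLAST query, the position numbers and spaces have to be stripped out. The sketch below does that for the first two rows of the gene above (the remaining rows are left out only to keep the example short) and also computes G+C content and the reverse complement. The helper names are hypothetical.

```python
import re

# First 120 bases of the gene shown above, in GenBank-style layout.
origin_block = """
  1 ttataaagtt ttataaagga cgttgaagac tttcaagagt ttttcgttcc gtctttcggt
 61 tgcttcctta tctttcccat tcgagtagtc gtagaagcct ttcttggtct tcacacccag
"""


def clean_sequence(block):
    """Drop position numbers and whitespace, leaving only the bases."""
    return re.sub(r"[\d\s]", "", block).lower()


def gc_percent(seq):
    """Percentage of bases that are g or c."""
    return 100.0 * sum(base in "gc" for base in seq) / len(seq)


def reverse_complement(seq):
    """Return the sequence of the opposite strand, read 5' to 3'."""
    return seq.translate(str.maketrans("acgt", "tgca"))[::-1]


seq = clean_sequence(origin_block)

if __name__ == "__main__":
    print(len(seq), "bases")
    print(f"G+C: {gc_percent(seq):.2f}%")
    print(reverse_complement(seq)[-10:])
```

The record begins with `tta` and ends with `cat`, which read on the opposite strand are a stop codon (`taa`) and a start codon (`atg`). That would be consistent with a gene annotated on the complementary strand, though the slide does not say so.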
Then run a BLAST search of the obtained gene against the database.
• Use the CMR to find the following:
Staphylococcus aureus
How many strains are there? 14
Select the first one: Staphylococcus aureus MW2
Taxon ID: 196620
No. of protein-coding genes: 2632
G+C %: 32.82%
Sequencing center: Juntendo Univ, NITE
• Bacillus subtilis
How many strains are there? 1
Select the first one: Bacillus subtilis 168
Taxon ID: 224308
No. of protein-coding genes: 4245
A+T %: 56.48%
Sequencing center: Japanese Consortium, European Consortium
Sequence length: 4215606 bp
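The database reports G+C % for one genome and A+T % for the other. The two are complementary (they sum to 100%), so either can be derived from the other when comparing the two organisms. The short sketch below does that conversion and also estimates gene density from the reported gene count and sequence length. The function names are hypothetical; the numbers are the ones recorded above.

```python
def complement_percent(percent):
    """Convert A+T % to G+C % (or the reverse); the two sum to 100."""
    return 100.0 - percent


def genes_per_kb(n_genes, genome_bp):
    """Protein-coding genes per 1,000 bp of sequence."""
    return n_genes / (genome_bp / 1000.0)


# Values recorded in the activity above.
b_subtilis_gc = complement_percent(56.48)      # from A+T %
s_aureus_at = complement_percent(32.82)        # from G+C %
b_subtilis_density = genes_per_kb(4245, 4215606)

if __name__ == "__main__":
    print(f"B. subtilis 168 G+C: {b_subtilis_gc:.2f}%")
    print(f"S. aureus MW2 A+T: {s_aureus_at:.2f}%")
    print(f"B. subtilis 168 gene density: {b_subtilis_density:.3f} genes/kb")
```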
16S rRNA
The biomarker 16S rDNA: identify and classify organisms by variations in the 16S rDNA gene sequence.
[Figure labels: LSU, SSU]
C. perfringens probe set identified in EPA sample 22 (N.Y. Spring)
[Figure: taxonomy tree. Node labels: Bacteria; Gram +, Proteo, Cyan, CFB; High G+C, Bacil-Strep, Clostridium; C.PERFRINGENS, C.BARATI_SUBGROUP, C.BOTULINUM_SUBGROUP, C.CADAVERIS, C.ALGIDICARNIS, C.BUTYRICUM, C.THERMOBUTYRICUM_SUBGROUP, C.AURANTIBUTYRICUM]
[Figure: 16S rDNA map. Position labels: 27, 1492, 420, 469. Probe labels: 5, 6, 7, 8]
Alignment (a dot marks a base identical to the top sequence):
...CGTAAAGCTCTGTCTTTGGGGAAGATAATGACGGTACCCAAGGAGGAAGCCACGGCTAACT... C. perf. str.CPN50
................................................................... C. perf. resistant
................................................................... Clostridium sp. AB&J
................................................................... clone p-4636-2Wa2
................................................................... C. perf. A
................................................................... C. perf rrnA
................................................................... C. perf rrnE
.................................T................................. C. perf rrnD
................................................................... C. perf rrnC
................................................................... C. perf rrnB
................................................................... C. perf rrnF
................................................................... C. perf rrnG
................................................................... C. perf str.13a
................................................................... C. perf str.13b
................................................................... C. perf rrnH
................................................................... C. perf rrnI
................................................................... C. perf rrnJ
................................................................... clone OI1612
................................................................... C. perf. B
................................................................... Swine manure 37-3
................................................................... Swine manure 37-4
Probe properties:
• 25mer exists in 90% of the taxon's seqs
• Ave Diff = 1891
Probe sequences:
TAAAGCTCTGTCTTTGGGGAAGATA
AAAGCTCTGTCTTTGGGGAAGATAA
AAGCTCTGTCTTTGGGGAAGATAAT
AGCTCTGTCTTTGGGGAAGATAATG
tacccaaggaggaagccacggctaa
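The probes listed here are 25-base windows taken from the C. perf. str.CPN50 reference sequence. The sketch below shows one way to enumerate such windows, locate each listed probe in the reference, and expand a dot-notation alignment row back into a full sequence. The helper names are hypothetical, and reading a dot as "same base as the reference" is the usual alignment convention, assumed here rather than stated on the slide.

```python
# Reference row from the alignment (flanking "..." removed).
reference = "CGTAAAGCTCTGTCTTTGGGGAAGATAATGACGGTACCCAAGGAGGAAGCCACGGCTAACT"

# Probe sequences as listed on the slide.
probes = [
    "TAAAGCTCTGTCTTTGGGGAAGATA",
    "AAAGCTCTGTCTTTGGGGAAGATAA",
    "AAGCTCTGTCTTTGGGGAAGATAAT",
    "tacccaaggaggaagccacggctaa",
    "AGCTCTGTCTTTGGGGAAGATAATG",
]


def kmers(seq, k=25):
    """All overlapping windows of length k, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def locate(probe, seq):
    """0-based offset of the probe in seq, ignoring case; -1 if absent."""
    return seq.upper().find(probe.upper())


def expand_dots(row, ref):
    """Replace each '.' in an alignment row with the reference base."""
    return "".join(r if c == "." else c for c, r in zip(row, ref))


positions = [locate(p, reference) for p in probes]

if __name__ == "__main__":
    for probe, pos in zip(probes, positions):
        print(probe, "offset", pos)
    print(len(kmers(reference)), "candidate 25-mers in this window")
```

A probe that spans a position where a taxon differs from the reference (such as the single T in the rrnD row) would not match that taxon exactly, which is why candidate windows are checked against every aligned sequence.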