Professional Documents
Culture Documents
Bio SCIENCE Kuliah Perdana
Bio SCIENCE Kuliah Perdana
Background information
experimental sciences
There is a tendency to look ever deeper in:
Matter e.g. Physics
Universe e.g. Astronomy
Life e.g. Life sciences
8.00E+09
6.00E+09
4.00E+09
Morowitz
2.00E+09
0.00E+00
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
Year
1992
1993
1994
1995
1996
1997
1998
1999
2000
New technologies in
Life Sciences research
cell
Methodology/
Technology
DNA
Genomics
RNA
Transcriptomics
protein
metabolites
Proteomics
Metabolomics
UniversityofAmsterdam
Terminology
DNA
Genome
Genomics
RNA
Transcriptome
Transcriptomics
Protein
Proteome
Proteomics
Metabolite
Metabolome
Metabolomics
Omics impact
Literature
Hypothesis
Publication
Experiment
Data
Databases
(e-)Literature
Hypothesis
Publication
Experiment
Data
cell
Gene
DNA
AAAAAAAAA
mRNA
mRNA translation by
protein synthesis
High-throughput protein-analysis
NH2
Protein
COOH
Protein function:
-prediction by bioinformatics
-proof by laboratory research
function-1
function-n
function-2
Role of bioinformatics
Genomics
RNA
Transcriptomics
protein
metabolites
Proteomics
Metabolomics
Integrative/System Biology
DNA
Bioinformatics
Data integration/fusion
methodology
Data generation/validation
cell
Conclusions
Omics experiments change the face of life sciences
Bioinformatics can be considered to be an essential
enabler and is a form of e-Science
Will help to realize necessary paradigm shift in Life
Science experimentation
Better support of experimentation & optimal use of ICT
infrastructure requires rationalization experimentation
process
Information management essential technology
Bioinformatics can not be decoupled from e-Bio-science
applications
e-Bioscience also has to comprise biomedical applications
DNA
RNA
protein
DNA
genome
RNA
transcriptome
protein
proteome
Omics data
In the Omics era, we see proliferation of
genome/proteome-wide high throughput data
that are available in public archives
Comparative genome sequences
Sequence variation & phenotypes
Epigenetics & chromatin structure
Regulatory elements & gene expression
Protein expression, modification & localization
Protein domain, structure, interaction
Metabolic, signal, regulatory pathways
Drug, toxicogenomics, toxicoproteomics
Illumina Solexa
454 vs Solexa
Homopolymers (AAAAA..)
Read length: 400 bp
Number of reads: 400.000
Per-base cost greater
Novo assembly, metagenomics
Read length: 40 bp
Number of reads: millions
Per-base cost cheaper
Ideal for application requiring short reads: ncRNA
Ancient DNA
DNA mixtures from diverse ecosystems, metagenomics
Resequencing previously published reference strains
Identification of all mutations in an organism
Errors in published literature
Expand the number of available genomes
Comparative studies
Deciphering cells transcripts at sequence level
without knowledge of the genome sequence
Sequencing extremely large genomes, crop plants
Detection of cancer specific alleles avoiding traditional
cloning
Chip-seq: interactions protein-DNA
Epigenomics
Detecting ncRNA
Genetic human variation : SNP, CNV (diseases)
Extreme example:
multiplexing the amplification
of 10 000 human exons using
primers from a programmable
microarray and sequencing
them using NGS.
Epigenetics: beyond the sequence. "The major problem, I think, is chromatin. What
determines whether a given piece of DNA along the chromosome is functioning,
since it's covered with the histones? What is happening at the level of methylation
and epigenetics? You can inherit something beyond the DNA sequence. That's
where the real excitement of genetics is now." (James D. Watson). Chromatin is
defined as the dynamic complex of DNA and histone proteins that makes up
chromosomes.
Epigenetic processes are essential for packaging and interpreting the genome, are
fundamental to normal development and are increasingly recognized as being
involved in human disease. Epigenetic mechanisms include, among other things,
histone modification, positioning of histone variants, nucleosome remodelling, DNA
methylation, small and non-coding RNAs. (Nature, 7 Aug 2008).
Reduced sequencing
error
Increment read length
Developing new
bioinformatic tools
Align: MAQ, SOAP
Assembly: SSAKE
Base caller: PyroBayes
Variant detection: MAQ, GEM
Comparison of the genomes of marine sedimentderived strain (GMY01) with those of terrestrial
origin (GMR22) will provide insight into the
environmental adaptation and evolution of
Streptomyces species.
http://bgiamericas.com
Features Annotation
GMY01 (Count)
GMR22 (Count)
Genome Size
7,907,487 bp
11,637,374 bp ?
CDS
6,420
12,222 ?
tRNA
65
186?
rRNA
12
RBS
5,107
Transmembrane Helix
3,312
Signal Peptide
42
801
Inverted Repeats
204
318
Tandem Repeats
3,777
3,320
Navigate
Organism
Comparative Tools
Help
find
Navigate
Organism
Comparative Tools
J aka Widada
Help
find
Closest neighbors
export to file
Streptomyces
GMR22
display 30
itemssp.
per page
Streptomyces
sp.
GMY01
display 30
items
per page
displaying 1 - 30 of 30
displaying 1 - 30 of 30
Genome I D
Score
Genome Name
Genome I D
Score
Genome Name
100226.1
100226.15
539
528
457427.3
653045.3
518
449
457428.4
227882.1
227882.9
496
494
493
591159.3
645465.3
253839.3
483
448
413
749414.3
463191.3
645465.3
227882.9
455632.3
455632.4
441
410
380
365
361
352
Streptomyces
Streptomyces
Streptomyces
Streptomyces
Streptomyces
Streptomyces
bingchenggensis BCW-1
sviceus ATCC 29083
sp. e14
avermitilis MA-4680
griseus subsp. griseus NBRC 13350
griseus subsp. griseus NBRC 13350
680198.5
443255.8
381
368
463191.3
566461.4
465541.3
367
360
359
591159.3
649189.3
379731.5
227882.1
253839.3
347
339
327
326
307
Streptomyces
Streptomyces
Pseudomonas
Streptomyces
Streptomyces
355249.3
443255.19
467200.3
358
355
337
455632.3
457430.3
328
321
100226.1
647653.3
457430.3
100226.15
457431.3
299
297
297
291
289
Streptomyces
Streptomyces
Streptomyces
Streptomyces
Streptomyces
coelicolor A3(2)
sp. ACTE
roseosporus NRRL 11379
coelicolor A3(2)
roseosporus NRRL 15998
465543.3
457429.3
455632.4
319
319
318
457425.4
591167.3
647653.3
313
299
292
457428.4
379731.4
680198.5
591157.3
286
284
260
248
Streptomyces
Pseudomonas
Streptomyces
Streptomyces
lividans TK24
stutzeri A1501
scabiei 87.22
sp. SPB78
649189.3
457431.3
292
279
889487.3
683219.3
591157.3
277
264
262
Streptomyces sp. S4
Streptomyces sp. SA3_actG
Streptomyces sp. SPB78
566461.4
208964.1
208963.3
355249.3
457429.3
238
235
231
228
221
Streptomyces
Pseudomonas
Pseudomonas
Streptomyces
Streptomyces
653045.3
259
457425.4
683219.3
208963.12
215
214
204
Genome ID Score
Genome Name
Genome ID Score
Genome Name
oti
s
tre
res
cs
st
iive
dat
Oxi
m
Os
GMY01
GMR22
Functional Abundance
Analysis
Chlorobi
Basidiomycota
Lentisphaerae
unclassified (derived from other sequences)
Chrysiogenetes
Nitrospirae
Actinobacteria
Proteobacteria
Bacteroidetes
Streptophyta
Arthropoda
Cyanobacteria
Chordata
Firmicutes
unclassified (derived from Bacteria)
Verrucomicrobia
Acidobacteria
Ascomycota
Spirochaetes
Planctomycetes
Chloroflexi
452319
452311
GMR22 GMY01
GMY01
http://antism
ash.seconda
rymetabolite
s.org/upload/
cdfb620be865-4aabb5154bf7e55abde
6/index.html
GMY01
Number
GMR22
Number
Terpene
Terpene
16
NRPS
NRPS
46
Ectoine
Ectoine
Butyrolactone
Butyrolactone
T1pks
T1pks
67
T1pks-t4pks
T1pks-t4pks
Bacteriocin
Bacteriocin
11
Nrps-butyrolactones
Nrps-butyrolactones
T2pks
T2pks
Nrps-t1pks
Nrps-t1pks
Lantipeptide
Lantipeptide
Siderophore
Siderophore
Other
Other
14
T3pks
T3pks
T4pks
T4pks
15
T4pks-t1pks
T4pks-t1pks
Indole
Hserlactone
T2pks-bacteriocin
Bacteriocinlantipeptide
T3pks-nrps
Oligosaccaridet2pks
Total
28
201
GMR22
16
21
AA homology
97-100%
Marine sediment-derived
Genome size: 7.91 Mb
Total genes 6,420
GMY01
E.14
Wasp symbiont
Genome size: 7.93 Mb
Total genes 6,198