Professional Documents
Culture Documents
Genome Sequencing: Dr. P. Balaji Vysya College, Hosur
Genome Sequencing: Dr. P. Balaji Vysya College, Hosur
Dr. P. Balaji
Vysya College, Hosur
Introduction
At the start of the 21st century the emphasis in molecular biology shifted from the
study of individual genes to the study of entire genomes.
This chance in emphasis as prompted by the development during the 1990s of
methods for sequencing large genomes.
Genome sequencing predates the 1990s that of the phage ф X174, was completed
on 1975 – but it was not until 20 years later, in 1995, that the first genome of a
free-living organism, the bacterium Haemophilus influenzae, was completely
sequenced.
The next 5 years were a watershed with the genome sequences of almost 50 other
bacteria published, along with complete sequences for the much larger genomes of
yeast, fruit fly and humans.
By 2000 the sequencing f bacterial genomes has become routine and projects
aimed at eukaryotic genomes were being approached with much greater confidence
than has been possible just a few years earlier.
Sequencing a genome
• A single chain termination sequencing experiment performed by hand produces
about 400 nucleotides of sequence, and a single run in an automated sequencer
gives about 750bp.
• But the total size of a fairly typical bacterial genome is 4000000 bp and the human
genomes is 3 200 000 000 bp.
• The situation is not hopeless but it requires the use of robotic systems to prepare
the DNA for sequencing and t carry out the chain termination experiments, with the
sequences read by automated sequences that transfer the data directly into a
computer.
• In the factory-style laboratories that undertake these projects the main objective is
to keep the automated sequencers operating at their full capacities.
The largest sequencing initiatives use up to 100 automated sequencers all working
around the clock, representing a theoretical output of 50 000 000 bp per day.
Looked at in these terms, genome sequences are not so daunting after all. In
practice, the generation of sufficient sequence data is one of the more routine
aspects of a genome project.
The first real problem that arises is the need to assemble the thousands or perhaps
millions of individual 750 bp sequences into a contiguous genome sequence.
Strategies for sequence assembly
et
the original strand as a template, and the newly synthesized DNA strands (red)
grow toward each other as synthesis proceeds.
h denature the duplex DNA. Upon lowering of the temperature, the original
parental strands anneal with the primers and are replicated.
o
The daughter strands produced in the first round of amplification also anneal
with primers and are replicated.
In this case, although the daughter duplex molecules are identical in sequence
h 64, 128, 256, 512, 1024, and so forth, doubling with each cycle of
replication.
s
M • For example, starting with a single molecule, 25 rounds of DNA
replication will result in 225 = 3.4 X 107 molecules.
d
of the amplified sequence.
• The optimal Mg2+ concentration varies with each new sequence, but is
usually between 1 and 4 mM.
Increase temperature to
72oC DNA polymerase +
dNTPs
DNA copies vs Cycle number
2500000
2000000
DNA copies
1500000
1000000
500000
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Cycle number
Problems and
Contamination of the reaction mixture of PCR Limitations
even by a single gene
can result into impure product of PCR reaction.
PCR is most useful for the amplification of DNA segment less than 2kb
in length.
There are significant if PCR extends over 30 cycles when error occurs
in every 300basepair of product. High error rate of Taq is because it
lacks 3’ -5’ exo nuclease activity (proof reading).
Application
Classification of organisms
Genotyping
Molecular archaeology
Mutagenesis
Mutation detection
Sequencing
Cancer research
Detection of pathogens
DNA fingerprinting
Drug discovery
Genetic matching
Genetic engineering
Application
o PCR amplification is very useful for generating large quantities of a specific
DNA sequence.
o The principal limitation of the technique is that the DNA sequences at the ends of
the region to be amplified must be known so that primer oligonucleotides can be
synthesized.
o In addition, sequences longer than about 5000 base pairs cannot be replicated
efficiently by conventional PCR procedures, although there are modifications of
PCR that allow longer fragments to be amplified.
o The major advantage of PCR amplification is that it requires only trace amounts
of template DNA.
Application
o Theoretically only one template molecule is required, but in practice the
amplification of a single molecule may fail because the molecule may, by
chance, be broken or damaged.
o The exquisite sensitivity of PCR amplification has led to its use in DNA typing
for criminal cases in which a minuscule amount of biological material has been
left behind by the perpetrator.
blood, chorionic
villus, amniotic 68,719,476,736 copies Gel Analysis,
fluid, semen, hair Restriction
root, saliva Digestion,
Sequencing
Conclusion
The speed and ease of use,
sensitivity,
sensitivity specificity and
robustness of PCR has
revolutionised molecular biology
and made PCR the most widely
used and powerful technique with
great spectrum of research and
diagnostic applications.
Biology of Genetic Engineering
Once recombinant DNA molecules have been constructed in vitro, the
desired sequence can be isolated.
• The type of host cell used for a particular application will depend
mainly on the purpose of the cloning experiment.
• If the aim to isolate a gene for structural analysis, the requirements
may calla for a simple system that is easy to use.
• If a aim is to express the genetic information in a higher eukaryote
such as plant, a more specific system will be required.
• These two aims are not necessarily mutually exclusive; often a simple
primary host is used to isolated a sequence that is then introduced into
a more complex system for expression.
Prokaryotic hosts – E.
coli
An ideal host cell should be easy to handle and propagate, should be
available as a wide variety of genetically defined strains, and should
accept a range of vector.
E. coli has been studied in great detail and many different strains were
isolated by microbial geneticists as they investigated the genetic
mechanisms of this prokaryotic organisms,.
Other fungi that may be used for gene cling experiments include
Aspergillus indulans and Neurospora crassa.
Plant and animal cells may also be used as hosts for gene manipulation
experiments.
Unicellular algae such as Chlamydomonas reinhardtii have all the
advantages of microorganisms plays the structural and functional
organization of plant cells, and their use in genetic manipulation will
increase as they become more widely studied.
Other plant cells (and animal cells) are usually grown as cell cultures,
which are much easier to manipulate than cells in a whole organism.
Plasmids
1. Plasmids are small, circular, double stranded, extra chromosomal DNA’s
present in bacterial cells.
2. They are inherited sharply without the influence of chromosomal DNA.
3. They replicate independently due to the presence of an “Origin of
Replication”.
4. The Plasmids are 1 Kbp – 200 Kbp in size and have limited number of
genes.
5. Most bacteria contain more than one copy of each plasmid (Copy number).
6. The copy number of plasmids usually varies from 1-50.
7. The copy number can be increased by treating bacterial culture with
chloramphenicol
A number of host properties are specified by plasmids, such
as
Antibiotic and Heavy metal resistance
Nitrogen fixation
Pollutant degradation
Bacteriocin production
Toxin production
Colicin factors
The plasmids typically have three important elements:
Classification by phenotype