Professional Documents
Culture Documents
Lec 6 BI 317 Lecture 6 (McStay)
Lec 6 BI 317 Lecture 6 (McStay)
Lec 6 BI 317 Lecture 6 (McStay)
Lecture 6
and
Centre for
Chromosome
Biology
Sanger or Dideoxy sequencing with 1000’s of these automated fluorescent
sequencers at centres such as the Sanger Centre in Cambridge UK were
responsible for the human genome sequence.
Dideoxy nucleotides can be incorporated into DNA by polymerase but
act as chain terminators since the 3’OH group has been replaced by H and
cannot form a phosphodiester bond with an incoming dNTP
Deoxy NTP
3’ OH
Dideoxy NTP 5’
Chain terminator
Fluorescent derivatives of each Dideoxy NTP have been developed
The Sanger method underpinned molecular genetics for three decades. The
method is disadvantaged by the need for gel electrophoresis, making it difficult
to scale up. The most advanced machines could only sequence 96 samples at
a time (30-60kb every 3-4 hrs).
Library preparation
Samples consisting of longer fragments are first sheared into a random library of 100-300 base-pair
long fragments. After fragmentation the ends of the obtained DNA-fragments are repaired, and an A-
overhang is added at the 3'-end of each strand. Afterwards, adaptors which are necessary for
amplification and sequencing are ligated to both ends of the DNA-fragments. These fragments are
then size selected and purified.
Illumina workflow (2)
Cluster Generation
The Cluster Generation is performed on the Illumina cBot. Single DNA-fragments are attached to the flow cell
by hybridizing to oligos on its surface that are complementary to the ligated adaptors. The DNA-molecules are
then amplified by a so called bridge amplification which results in a hundred of millions of unique clusters.
Finally, the reverse strands are cleaved and washed away and the sequencing primer is hybridized to the
DNA-templates.
Illumina workflow (3)
Sequencing
During sequencing the huge amount of generated clusters are sequenced simultaneously. The DNA-
templates are copied base by base using the four nucleotides (ACGT) which are fluorescently-labeled and
reversibly terminated. After each synthesis step, the clusters are excited by a laser which causes
fluorescence of the last incorporated base. After that, the fluorescence label and the blocking group are
removed allowing the addition of the next base. The fluorescence signal after each incorporation step is
captured by a built-in camera, producing images of the flow cell.
2000X
human genome
MiSeq
NextSeq 500
HiSeq 2500
Sequencing without an amplification step could overcome bias in the
amplification step of current NGS platforms (eg Illumina etc)
Target DNA
PacBio reads
Genetic maps rely on the principal that if two mutant phenotypes show a tendancy
to be inherited together they could be expected to be closely linked on the same
chromosome.
Microsatellite DNA also known as short tandem repeats (STRs). (remember 2nd yr lectures )
Assayed by PCR
(rapid)
Marker on
average every
3.5Mb
Marker ~ every Mb
Physical Mapping
Somatic cell hybrids
• URA3 and TRP1 provide positive selection for yeast containing YAC
• Telomeres (TEL) and a centromeres (CEN4) confer stability to the artificial chromosome
A Clone Contig The chromosomal sequence from position A to B is represented by
overlapping DNA inserts in a series of genomic clones (YACs or BACs). Clones with
overlapping inserts are generated by the random fragmentation of DNA(usually by partial
digestion with restriction enzymes) when a genomic DNA library is constructed
Physical Mapping
Bacterial Artificial Chromosome vectors
(BACs)
An STS marker is any known UNIQUE DNA sequence that can easily be assayed by
PCR.
STS markers include polymorphic markers such as microsatellites and many more
non-polymorphic sequences (many obtained from end sequence of BAC clones).
Aligning BAC clones by hybridisation with STS probes
Aligning BAC clones by hybridisation with STS probes
A Clone Contig An example of human clone contig assembly. YAC clones from a
portion of human chromosome 2. Positive typing for an STS marker is indicated by a
closed circle and brackets indicated the absence of an expected STS (YAC instability)
Expressed sequence tags
Other markers were obtained from sequenced cDNA clones and these were known
as expressed sequence tags (ESTs)
Library of
cDNA inserts
Two approaches to genome sequencing
protein sequence
Nucleotide BLAST
Results
TBLASTN
Protein Vs translated
nucleotide database
of Danio rerio
1. Sequencing technologies
NGS (Illumina)
Sanger sequencing