Professional Documents
Culture Documents
Lecture 4: Detecting Mutations and Mutation Consequences Learning Goals
Lecture 4: Detecting Mutations and Mutation Consequences Learning Goals
Lecture 4: Detecting Mutations and Mutation Consequences Learning Goals
Learning Goals:
RFLP Analysis
To look for smaller-scale changes, on the order of one or a few
bases (such as single base substitutions, or small insertions or deletions), it is
necessary to use a technique that allows you to determine the specific
DNA sequence. Sometimes you need to actually sequence the entire
gene, but in some situations, you may be able to use a faster technique,
RFLP analysis.
Restriction Fragment Length Polymorphism (RFLP or riff-lip) analysis is
a molecular biology technique that you can use to detect a subset of
changes to DNA sequences that affect one or a few bases, such as a
single base substitution. In order for this type of analysis to be applicable
for the detection of a particular sequence variant, it needs to either
create or interrupt a restriction site in the DNA sequence. Restriction sites
are specific short DNA sequences that are recognized and cut by
restriction enzymes.
Restriction enzymes (also known as restriction endonucleases) are
found in bacteria, where they serve as a defense mechanism against
viruses. The bacteria use restriction enzymes to cut up viral DNA,
preventing an infection from progressing and the virus from replicating.
Today, biotechnology companies mass produce restriction enzymes for
use in molecular biology labs, where they are commonly used to
manipulate DNA.
Each restriction enzyme recognizes a specific DNA sequence, and
cuts that sequence between specific bases. These restriction sites are
generally between four and eight bases long, and many are palindromic
– the sequence on one strand, as read in the 5’ to 3’ direction, is the same
as the sequence on the other strand in the 5’ to 3’ direction. For example,
the restriction enzyme EcoRI, which comes from the bacterium E. coli, cuts
at the following sequence between the G and the A on each strand:
5’ – GAATTC – 3’
3’ – CTTAAG – 5’
DNA Sequencing
Sanger Sequencing was invented in the 1970s by Fred Sanger, who
had already won a Nobel Prize for figuring out how to determine the
amino acid sequence of a protein (he got a second for DNA
sequencing), and was the way all DNA was sequenced until the
introduction in around 2007 of ‘next-generation’ methods. Sanger
sequencing is still used today when, for example, we need to confirm
results derived from next gen methods.
The Sanger method uses the same enzyme that copies DNA
naturally in cells, DNA polymerase. The trick involves making the copy out
of base pairs that have been slightly altered. Instead of using only the
normal “deoxy” bases (As, Ts, Gs, and Cs) found naturally in DNA, Sanger
also added some so-called “dideoxy bases.” Dideoxy bases have a
peculiar property: DNA polymerase will happily incorporate them into the
growing DNA chain (i.e., the copy being assembled as the complement
of the template strand), but it cannot then add any further bases to the
chain. In other words, the duplicate chain cannot be extended beyond a
dideoxy base.
Imagine a template strand whose sequence is 3’-GGCCTAGTA-5’.
There are many, many copies of that strand in the experiment. Now
imagine that the strand is being copied using DNA polymerase, in the
presence of a mixture of normal A, T, G, and C plus some dideoxy A. The
enzyme will copy along, adding first a C (to correspond to the initial G),
then another C, then a G, and another G. But when the enzyme reaches
the first T, there are two possibilities: either it can add a normal A to the
growing chain, or it can add a dideoxy A. If it picks up a dideoxy A, then
the strand can grow no further, and the result is a short chain that ends in
a dideoxy A (ddA): CCGGddA. If it happens to add a normal A, however,
then DNA polymerase can continue adding bases: T, C, etc. The next
chance for a dideoxy “stop” of this kind will not come until the enzyme
reaches the next T. Here again it may add either a normal A or a ddA. If it
adds a ddA, the result is another truncated chain, though a slightly longer
one: this chain has a sequence of CCGGATCddA. And so it goes every
time the enzyme encounters a T (i.e., has occasion to add an A to the
chain); if by chance it selects a normal A, the chain continues, but in the
case of a ddA the chain terminates there.
Where does this leave us? At the end of this experiment, we have a
whole slew of chains of varying lengths copied from the template DNA;
what do they all have in common? They all end with a ddA.
Now, imagine the same process carried out for each of the other three
bases: in the case of T, for instance, we use a mix of normal A, T, G, and C
plus ddT; the resultant molecules will be either CCGGAddT or
CCGGATCAddT.
Having staged the reaction all four ways—once with ddA, once
with ddT, once with ddG, and once with ddC—we have four sets of DNA
chains: one consists of chains ending in ddA, one with chains ending with
ddT, and so on. We can use gel electrophoresis to sort by size all these
mini-chains. The speed with which a particular mini-chain will travel is a
function of its size: short chains travel faster than long ones. Within a fixed
interval of time, the smallest fragment, in our case a simple ddC, will travel
furthest; the next smallest, CddC, will travel a slightly shorter distance; and
the next one, CCddG, a slightly shorter one still. Now Sanger’s trick should
be clear: by reading off the relative positions of all these mini-chains after
a timed race through our gel, we can infer the sequence of our piece of
DNA: first is a C, then another C, then a G, and so on.
Gel electrophoresis
Simple Sequence Repeats
Preimplantation Genetic Diagnosis (PGD)
Genotype
Phenotype
Synonymous mutation
Non-synonymous mutation
Frameshift mutation
Coding variant
Regulatory variant
Restriction enzyme
Restriction site
Allele
Heterozygous
Homozygous
Locus
Dominant
Recessive
Next generation sequencing
Exome
Loss of function mutation
Haploinsufficiency
Gain of function mutation