Professional Documents
Culture Documents
Jeremy M. Berg, John L. Tymoczko, Gregory J. Gatto, Lubert Stryer - Biochemistry-W. H. Freeman (2015) - 140-160
Jeremy M. Berg, John L. Tymoczko, Gregory J. Gatto, Lubert Stryer - Biochemistry-W. H. Freeman (2015) - 140-160
H
4.1 Nucleic Acids
N N H N
N1 6 5 7 N N
PURINES 8 H H H
2 4 9
3
H N N H N N H2N N N
H H H
Purine Adenine Guanine
NH2 O O
H
H H H H CH3
H
N3 4 5 N N N
PYRIMIDINES FIGURE 4.4 Purines and
2
1 6 pyrimidines. Atoms within bases
H H O N H O N H O N H
N are numbered without primes.
H H H Uracil is present in RNA instead
Pyrimidine Cytosine Uracil Thymine of thymine.
acid, the bases vary from one monomer to the next. Two of the bases of
DNA are derivatives of purine—adenine (A) and guanine (G)—and two of
pyrimidine—cytosine (C) and thymine (T), as shown in Figure 4.4.
Ribonucleic acid (RNA), like DNA, is a long unbranched polymer con-
sisting of nucleotides joined by 39-to-59 phosphodiester linkages (Figure 4.3).
The covalent structure of RNA differs from that of DNA in two respects.
First, the sugar units in RNA are riboses rather than deoxyriboses. Ribose
contains a 29-hydroxyl group not present in deoxyribose. Second, one of the
four major bases in RNA is uracil (U) instead of thymine (T).
Note that each phosphodiester bridge has a negative charge. This negative
charge repels nucleophilic species such as hydroxide ions, which are capable
of hydrolytic attack on the phosphate backbone. This resistance is crucial for
maintaining the integrity of information stored in nucleic acids. The absence
of the 29-hydroxyl group in DNA further increases its resistance to hydrolysis.
The greater stability of DNA probably accounts for its use rather than RNA
as the hereditary material in all modern cells and in many viruses.
NH2 O
2– O – – N N
O O
H2 N H2 NH
P P P N N
O C C
O O O O HO O
O O O N N
NH2
HO OH H
O
P 2–
O
O O
5 -ATP 3 -dGMP
FIGURE 4.9 The Indian muntjac and its chromosomes. Cells from a female Indian muntjac
(right) contain three pairs of very large chromosomes (stained orange). The cell shown is a
hybrid containing a pair of human chromosomes (stained green) for comparison. [(Left) Hugh
Lansdown/Shutterstock. (Right) J.–Y. Lee, M. Koi, E. J. Stanbridge, M. Oshimura, A. T. Kumamoto,
and A. P. Feinberg. Nat. Genet. 7:30, 1994.]
~36°
H
H N
O
N
~20Å width
N
FIGURE 4.11 Watson–Crick model of N N H N
double-helical DNA. (A) Side view.
N O
Adjacent bases are separated by 3.4 Å.
The structure repeats along the helical axis N H
(vertical) at intervals of 34 Å, which H
corresponds to approximately 10 Guanine Cytosine
nucleotides on each chain. (B) Axial view,
H CH3
looking down the helix axis, reveals a O
N H
rotation of 36o per base and shows that N
the bases are stacked on top of one
another [Source: J. L. Tymoczko, J. Berg,
N H N
and L. Stryer, Biochemistry: A Short Course, N N
2nd ed. (W. H. Freeman and Company, N
FIGURE 4.12 Structures of the base pairs O
2013), Fig. 33.11.].
proposed by Watson and Crick. Adenine Thymine
110
TABLE 4.1 Base compositions experimentally determined for a variety of organisms Base stacking
(van der Waal interactions)
Organism A:T G:C A:G
Human being 1.00 1.00 1.56
Salmon 1.02 1.02 1.43
Wheat 1.00 0.97 1.22
Yeast 1.03 1.02 1.67
Escherichia coli 1.09 0.99 1.05
Serratia marcescens 0.95 0.86 0.70
from the surrounding water, whereas the more polar surfaces are exposed to
water. This arrangement is reminiscent of protein folding, where hydropho-
bic amino acids are in the protein’s interior and the hydrophilic amino acids
FIGURE 4.13 A side view of DNA. Base
are on the exterior (Section 2.4). Second, the stacked base pairs attract one
pairs are stacked nearly one on top of
another through van der Waals forces (p. 8), appropriately referred to as another in the double helix. The stacked
stacking forces, further contributing to stabilization of the helix (Figure bases interact with van der Waals forces.
4.13). The energy associated with a single van der Waals interaction is quite Such stacking forces help stabilize the
small, typically from 2 to 4 kJ mol⫺1 (0.5–1.0 kcal mol⫺1). In the double double helix. [Source: J. L. Tymoczko, J.
Berg, and L. Stryer, Biochemistry: A Short
helix, however, a large number of atoms are in van der Waals contact, and
Course, 2nd ed. (W. H. Freeman and
the net effect, summed over these atom pairs, is substantial. In addition, Company, 2013), Fig. 33.13.].
base stacking in DNA is favored by the conformations of the somewhat rigid
five-membered rings of the backbone sugars.
O H O
are further induced to take the A-DNA form because of steric
HO
Base H C-2’ Base hindrance from the 29-hydroxyl group: the 29-oxygen atom would
H H H
HO be too close to three atoms of the adjoining phosphoryl group
H HO H and to one atom in the next base. In an A-form helix, in contrast,
OH
C-3’-endo C-2’-endo the 29-oxygen atom projects outward, away from other atoms.
(A form) (B form) The phosphoryl and other groups in the A-form helix bind fewer
H2O molecules than do those in B-DNA. Hence, dehydration
FIGURE 4.15 Sugar pucker. In A-form
DNA, the C-39 carbon atom lies above the
favors the A form.
approximate plane defined by the four other
sugar nonhydrogen atoms (called C-39 Z-DNA is a left-handed double helix in which backbone
endo). In B-form DNA, each deoxyribose is phosphates zigzag
in a C-29-endo conformation, in which C-29
lies out of the plane. A third type of double helix is left-handed, in contrast with the right-handed
screw sense of the A and B helices. Furthermore, the phosphoryl groups in
the backbone are zigzagged; hence, this form of DNA is called Z-DNA
(Figure 4.16).
Although the biological role of Z-DNA is still under investigation,
Z-DNA-binding proteins have been isolated, one of which is required for
viral pathogenesis of poxviruses, including variola, the agent of smallpox.
The existence of Z-DNA shows that DNA is a flexible, dynamic molecule
whose parameters are not as fixed as depictions suggest. The properties of
A-, B-, and Z-DNA are compared in Table 4.2.
A B Z
Shape Broadest Intermediate Narrowest
Rise per base pair 2.3 Å 3.4 Å 3.8 Å
Helix diameter ~26 Å ~20 Å ~18 Å
Screw sense Right-handed Right-handed Left-handed
Glycosidic bond* anti anti Alternating anti and syn
Base pairs per turn of helix 11 10.4 12
Pitch per turn of helix 25.3 Å 35.4 Å 45.6 Å
Tilt of base pairs from
perpendicular to helix axis 19 degrees 1 degree 9 degrees
*Syn and anti refer to the orientation of the N-glycosidic bond between the base and deoxyribose. In the
anti orientation, the base extends away from the deoxyribose. In the syn orientation, the base is above the
deoxyribose. Pyrimidines can be in anti orientations only, whereas purines can be anti or syn.
112
113
4.2 The Double Helix
(B)
C G
A
C A
G A
G C
T A
U U
A T C G
T A U A
G C G C
G C A U
T A G C
T A G C
FIGURE 4.18 Stem-loop structures.
A T U A
5 T A A A G G 3 5 U U G G U U G C A 3 Stem-loop structures can be formed from
DNA molecule RNA molecule single-stranded DNA and RNA molecules.
114 the local structure but introduce deviations from the standard double-
CHAPTER 4 DNA, RNA, and the Flow helical structure that can be important for higher-order folding and for
of Genetic Information function (Figure 4.19).
Single-stranded nucleic acids can adopt structures that are more com-
plex than simple stem-loops through the interaction of more widely sepa-
rated bases. Often, three or more bases interact to stabilize these structures.
In such cases, hydrogen-bond donors and acceptors that do not participate
in Watson–Crick base pairs participate in hydrogen bonds to form nonstan-
dard pairings. Metal ions such as magnesium ion (Mg2⫹) often assist in the
stabilization of these more elaborate structures. These complex structures
allow RNA to perform a host of functions that the double-stranded DNA
molecule cannot. Indeed, the complexity of some RNA molecules rivals
that of proteins, and these RNA molecules perform a number of functions
that had formerly been thought the exclusive domain of proteins.
(A) G
C A (B)
A C
A C
CG
CG
GC
U U A
UA G C
CG G U
AU U G
GC C G
G
UA AU C G Adenine
U G
AUAA A
CGUA A A
CG GU A A A
GGA A C G Guanine
A G C C U U GC A
C G
A G C G
G U G C
U A C G
C G A UUAAGG 5⬘
U A G UUCA 3⬘
C G
A U
C
C GA
G U A C
A G Cytosine
G U G U
G C U A
G A C G
AA C G
U
AU A
AG U
U A
The three linked C G
A U
nucleotides A U
C G
highlighted in A U
part B A
G C
U FIGURE 4.19 Complex structure of an RNA molecule. A single-stranded RNA molecule
UC U can foldback on itself to form a complex structure. (A) The nucleotide sequence showing
Watson–Crick base pairs and other nonstandard base pairings in stem-loop structures.
(B) The three-dimensional structure and one important long-range interaction between three
bases. In the three-dimensional structure at the left, cytidine nucleotides are shown in blue,
adenosine in red, guanosine in black, and uridine in green. In the detailed projection, hydrogen
bonds within the Watson–Crick base pair are shown as dashed black lines; additional
hydrogen bonds are shown as dashed green lines.
Matthew Meselson and Franklin Stahl carried out a critical test of this hypoth-
esis in 1958. They labeled the parent DNA with 15N, a heavy isotope of nitro-
gen, to make it denser than ordinary DNA. The labeled DNA was generated by
growing E. coli for many generations in a medium that contained 15NH4Cl as (A)
the sole nitrogen source. After the incorporation of heavy nitrogen was com-
plete, the bacteria were abruptly transferred to a medium that contained 14N, the
ordinary isotope of nitrogen. The question asked was: What is the distribution
of 14N and 15N in the DNA molecules after successive rounds of replication?
The distribution of 14N and 15N was revealed by the technique of density- 14N 15N
0.3
0.7
1.0
1.1
1.5
1.9
2.5
(A) (B)
Single-
stranded 1.4
Relative absorbance (260 nm)
1.3
Absorbance
1.2
Melting
Double- 1.1 temperature
helical (Tm )
1.0
220 260 300 60 70 80
Wavelength (nm) Temperature (°C)
FIGURE 4.23 Hypochromism. (A) Single-stranded DNA absorbs light more effectively than
does double-helical DNA. (B) The absorbance of a DNA solution at a wavelength of 260 nm
increases when the double helix is melted into single strands.
116
The ability to melt and reanneal DNA reversibly in the laboratory pro- 117
vides a powerful tool for investigating sequence similarity. For instance, 4.4 DNA Replication
DNA molecules from two different organisms can be melted and allowed to
reanneal, or hybridize, in the presence of each other. If the sequences are
similar, hybrid DNA duplexes, with DNA from each organism contribut-
ing a strand of the double helix, can form. The degree of hybridization is an
indication of the relatedness of the genomes and hence the organisms.
Similar hybridization experiments with RNA and DNA can locate genes in
a cell’s DNA that correspond to a particular RNA. We will return to this
important technique in Chapter 5.
1. The reaction requires all four activated precursors—that is, the deoxy-
nucleoside 59-triphosphates dATP, dGTP, dCTP, and TTP—as well as
Mg2⫹ ion.
2. The new DNA strand is assembled directly on a preexisting DNA template.
DNA polymerases catalyze the formation of a phosphodiester linkage effi-
ciently only if the base on the incoming nucleoside triphosphate is comple-
mentary to the base on the template strand. Thus, DNA polymerase is a
template-directed enzyme that synthesizes a product with a base sequence
complementary to that of the template.
P 3 P P 3 P P P 3
5 5 5
dATP PPi dGTP PPi
G C G C A G C A G
C G T C A C G T C A C G T C A
5 5 5
3 P P P P 3 P P P P 3 P P P P
H2 C 2 Pi H2 C
base base base base
O O
– O PPi O
3′
– O OH O
2– O O O
O – P
P
P O
P O O
O O
O
H2 C
H2 C base base O base base
O
HO
HO 5⬘ 5⬘
Kilobase (kb) Ribosomal RNA is the most abundant of these three types of RNA.
A unit of length equal to 1000 base pairs of Transfer RNA comes next, followed by messenger RNA, which constitutes
a double-stranded nucleic acid molecule (or only 5% of the total RNA. Eukaryotic cells contain additional small RNA
1000 bases of a single-stranded molecule).
molecules that play a variety of roles including the regulation of gene
One kilobase of double-stranded DNA
has a length of 0.34 mm at its maximal
expression, processing of RNA and the synthesis of proteins. We will exam-
extension (called the contour length) and a ine these small RNAs in later chapters. In this chapter, we will consider
mass of about 660 kDa. rRNA, mRNA, and tRNA.
Mg2+
FIGURE 4.27 RNA Polymerase. This large enzyme comprises many subunits, including
b (red) and b9 (yellow), which form a “claw” that holds the DNA to be transcribed. Notice that
the active site includes a Mg2⫹ ion (green) at the center of the structure. The curved tubes
making up the protein in the image represent the backbone of the polypeptide chain. [Drawn
from 1L9Z, pdb.]
RNA product 3⬘ RNA product 3⬘
5′
O O
Direction of strand growth
H2 C 2 Pi H2 C
base base base base
O O
– O OH PPi O OH
– O OH O
2– O O O
O – P
P
P O
3′ P O O
O O
O
H2 C
H2 C base base O base base
O
HO OH
HO OH 5⬘ 5⬘
The synthesis of RNA is like that of DNA in several respects (Figure 4.28).
First, the direction of synthesis is 59 S 39. Second, the mechanism of elon-
gation is similar: the 39-OH group at the terminus of the growing chain
makes a nucleophilic attack on the innermost phosphoryl group of the
incoming nucleoside triphosphate. Third, the synthesis is driven forward by
the hydrolysis of pyrophosphate. In contrast with DNA polymerase, how-
ever, RNA polymerase does not require a primer. In addition, the ability of
RNA polymerase to correct mistakes is not as extensive as that of DNA
polymerase.
All three types of cellular RNA—mRNA, tRNA, and rRNA—are syn-
thesized in E. coli by the same RNA polymerase according to instructions Table 4.4 Base composition
given by a DNA template. In mammalian cells, there is a division of labor (percentage) of RNA
among several different kinds of RNA polymerases. We shall return to these synthesized from a viral
RNA polymerases in Chapter 29. DNA template
DNA template
RNA polymerases take instructions from DNA templates (plus, or coding,
RNA polymerase, like the DNA polymerases described earlier, takes strand of fX174) RNA product
instructions from a DNA template. The earliest evidence was the finding A 25 U 25
that the base composition of newly synthesized RNA is the complement of T 33 A 32
that of the DNA template strand, as exemplified by the RNA synthesized G 24 C 23
from a template of single-stranded DNA from the fX174 virus (Table 4.4). C 18 G 20
The strongest evidence for the fidelity of transcription came from
121
122 base-sequence studies. For instance, the nucleotide sequence of a segment
CHAPTER 4 DNA, RNA, and the Flow of the gene encoding the enzymes required for tryptophan synthesis was
of Genetic Information determined with the use of DNA-sequencing techniques (Section 5.1).
Likewise, the sequence of the mRNA for the corresponding gene was deter-
mined. The results showed that the RNA sequence is the precise comple-
ment of the DNA template sequence (Figure 4.29).
5′ GCG GCG ACG CG CAGUU AAU CCCACAG CCG CCAGU U C CG CU G G CG G CAU 3′ mRNA
3′ CGC CGC TGC GC GTC AA TTAGGG TG TCGGCGG TCA AG G C G A C C G C C G TA 5′ Template strand of DNA
5′ GCG GCG ACG CG CAG TTAATCCCACAG CCG CCAG T TC CG C T G G CG G CAT 3′ Coding strand of DNA
FIGURE 4.29 Complementarity between mRNA and DNA. The base sequence of mRNA
(red) is the complement of that of the DNA template strand (blue). The sequence shown here
is from the tryptophan operon, a segment of DNA containing the genes for five enzymes that
catalyze the synthesis of tryptophan. The other strand of DNA (black) is called the coding
strand because it has the same sequence as the RNA transcript except for thymine (T) in
place of uracil (U).
Consensus sequence
Transcription begins near promoter sites and ends at terminator sites
Not all base sequences of promoter sites are RNA polymerase must detect and transcribe discrete genes from within
identical. However, they do possess common large stretches of DNA. What marks the beginning of the unit to be tran-
features, which can be represented by an
scribed? DNA templates contain regions called promoter sites that
idealized consensus sequence. Each base in
the consensus sequence TATAAT is found in
specifically bind RNA polymerase and determine where transcription
most prokaryotic promoters. Nearly all begins. In bacteria, two sequences on the 59 (upstream) side of the first
promoter sequences differ from this consensus nucleotide to be transcribed function as promoter sites (Figure 4.30A).
sequence at only one or two bases. One of them, called the Pribnow box, has the consensus sequence
TATAAT and is centered at 210 (10 nucleotides on the 59 side of the
first nucleotide transcribed, which is denoted by 11). The other, called
the 235 region, has the consensus sequence TTGACA. The first nucleo-
tide transcribed is usually a purine.
Eukaryotic genes encoding proteins have promoter sites with a TATAAA
consensus sequence, called a TATA box or a Hogness box, centered at about
225 (Figure 4.30B). Many eukaryotic promoters also have a CAAT box
with a GGNCAATCT consensus sequence centered at about 275. The
transcription of eukaryotic genes is further stimulated by enhancer sequences,
which can be quite distant (as many as several kilobases) from the start site,
on either its 59 or its 39 side.
−35 −10 +1
−75 −25 +1
FIGURE 4.30 Promoter sites for transcription in (A) prokaryotes and (B) eukaryotes.
Consensus sequences are shown. The first nucleotide to be transcribed is numbered 11.
The adjacent nucleotide on the 59 side is numbered 21. The sequences shown are those
of the coding strand of DNA.
In E. coli, RNA polymerase proceeds along the DNA template, tran- C
U C
scribing one of its strands until it synthesizes a terminator sequence. This
sequence encodes a termination signal, which is a base-paired hairpin on the U G
newly synthesized RNA molecule (Figure 4.31). This hairpin is formed by G C
5′ AAAAAAAAAAAAAAA 3′
Aminoacyl-tRNA
Coding region NH3⫹
FIGURE 4.32 Modification of mRNA. Messenger RNA in eukaryotes is modified after R Amino acid
transcription. A nucleotide “cap” structure is added to the 59 end, and a poly(A) tail is added H O
at the 39 end.
O OH
Anticodon
1. Three nucleotides encode an amino acid. Proteins are built from a basic set
of 20 amino acids, but there are only four bases. Simple calculations show
that a minimum of three bases is required to encode at least 20 amino acids.
Genetic experiments showed that an amino acid is in fact encoded by a group
of three bases, or codon.
2. The code is nonoverlapping. Consider a base sequence ABCDEF. In an
overlapping code, ABC specifies the first amino acid, BCD the next, CDE
the next, and so on. In a nonoverlapping code, ABC designates the first
amino acid, DEF the second, and so forth. Genetic experiments again
established the code to be nonoverlapping.
3. The code has no punctuation. In principle, one base (denoted as Q) might
serve as a “comma” between groups of three bases.
. . . QABCQDEFQGHIQJKLQ . . .
However, it is not the case. Rather, the sequence of bases is read sequen-
tially from a fixed starting point, without punctuation.
4. The code has directionality. The code is read from the 59 end of the mes-
senger RNA to its 39 end.
5. The genetic code is degenerate. Most amino acids are encoded by more 125
than one codon. There are 64 possible base triplets and only 20 amino acids, 4.6 The Genetic Code
and in fact 61 of the 64 possible triplets specify particular amino acids.
Three triplets (called stop codons) designate the termination of translation.
Thus, for most amino acids, there is more than one code word.
−10 +1
Met Protein
(B) Eukaryotic start signal
FIGURE 4.35 Initiation of protein synthesis. Start signals are required for the initiation of
protein synthesis in (A) prokaryotes and (B) eukaryotes.
(A) DNA
(B) Intron
Displaced strand
mRNA
of DNA
Duplex DNA
gene for the b chain of hemoglobin is interrupted within its amino acid-
coding sequence by a long stretch of of 550 non-coding base pairs and a
short one of 120 base pairs. Thus, the -globin gene is
split into three coding sequences (Figure 4.37). Non-coding Introns
regions are called introns (for intervening sequences),
whereas coding regions are called exons (for expressed
240 120 500 550 250 Base pairs
sequences). The average human gene has 8 introns, and -Globin gene
some have more than 100. The size ranges from 50 to
10,000 nucleotides. FIGURE 4.37 Structure of the b-globin gene.