Professional Documents
Culture Documents
Chemistry of Proteins
Chemistry of Proteins
Chemistry of Proteins
(ABMB)
BCh 152
Chemistry of Proteins
identification.
Amino Acids & Peptides 1
BIOMEDICAL IMPORTANCE more than 20 amino acids, its redundancy
In addition to providing the monomer units from limits the available codons to the 20 L--amino
which the long polypeptide chains of proteins are acids listed in Table 3–1, classified according to
synthesized, the L--amino acids and their the polarity of their R groups. Both one- and three-
derivatives participate in cellular functions as letter abbreviations for each amino acid can be
diverse as nerve transmission and the biosynthesis used to represent the amino acids in peptides
of porphyrins, purines, pyrimidines, and urea. (Table 3–1). Some proteins contain additional
Short polymers of amino acids called peptides amino acids that arise by modification of an
perform prominent roles in the neuroendocrine amino acid already present in a peptide. Examples
system as hormones, hormone-releasing factors, include conver- sion of peptidyl proline and lysine
neuromodula- tors, or neurotransmitters. While to 4-hydroxyproline and 5-hydroxylysine; the
proteins contain only L--amino acids, conversion of peptidyl gluta- mate to -
microorganisms elaborate peptides that contain carboxyglutamate; and the methylation,
formylation, acetylation, prenylation, and
both D- and L--amino acids. Several of these
phosphoryla- tion of certain aminoacyl residues.
peptides are of therapeutic value, including the an-
These modifications extend the biologic diversity
tibiotics bacitracin and gramicidin A and the
of proteins by altering their solubility, stability, and
antitumor agent bleomycin. Certain other
interaction with other proteins.
microbial peptides are toxic. The cyanobacterial
peptides microcystin and nodularin are lethal in
large doses, while small quantities promote the Only L-a-Amino Acids Occur in Proteins
formation of hepatic tumors. Neither hu- mans With the sole exception of glycine, the -
nor any other higher animals can synthesize 10 of carbon of amino acids is chiral. Although some
the 20 common L--amino acids in amounts protein amino acids are dextrorotatory and some
adequate to support infant growth or to maintain levorotatory, all share the absolute configuration of
health in adults. Consequently, the human diet L-glyceraldehyde and thus are L--amino acids.
must contain adequate quantities of these Several free L--amino acids fulfill important roles
nutritionally essential amino acids. in metabolic processes. Examples in- clude
ornithine, citrulline, and argininosuccinate that
PROPERTIES OF AMINO ACIDS participate in urea synthesis; tyrosine in
The Genetic Code Specifies formation of thyroid hormones; and glutamate in
20 L-a-Amino Acids neurotransmitter biosynthesis. D-Amino acids
that occur naturally in- clude free D-serine and
Of the over 300 naturally occurring amino acids, 20 D-aspartate in brain tissue, D-alanine and D-
con- stitute the monomer units of proteins. glutamate in the cell walls of gram- positive
While a nonre- dundant three-letter genetic code bacteria, and D-amino acids in some nonmam-
could accommodate malian peptides and certain antibiotics.
1
Table 1–1. L- -Amino acids present in proteins.
H3C
CH CH CH COO–
2
Leucine Leu [L] H3C + 2.3 9.7
NH
3
CH3
CH2
Isoleucine Ile [I] CH CH COO– 2.3 9.8
+
CH3 NH3
OH NH3+
Threonine Thr [T] – 2.1 9.1 about 13
CH3 CH CH COO
OH NH3+
SH NH3+
–
Methionine Met [M] CH2 CH2 CH COO 2.1 9.3
S CH3 NH3+
HN C CH CH CH COO–
2 2 2
Glutamine Gln [Q] 2.2 9.1
O NH +
3
(continued )
2
Table 1–1. L--Amino acids present in proteins. (continued)
Imino Acid
Proline Pro [P] 2.0 10.6
+ –
N COO
H2
Amino Acids May Have Positive, Negative, Molecules that contain an equal number of ioniz-
or Zero Net Charge able groups of opposite charge and that therefore bear
no net charge are termed zwitterions. Amino acids in
Charged and uncharged forms of the ionizable blood and most tissues thus should be represented as in
COOH and NH3+ weak acid groups exist in solu- A, below.
tion in protonic equilibrium:
NH + NH2
R — COOH R — COO H
O– OH
R —NH3 R — NH2 H R R
O O
While both RCOOH and RNH3 are weak acids, + A B
RCOOH is a far stronger acid than RNH3+. At
physiologic pH (pH 7.4), carboxyl groups exist almost Structure B cannot exist in aqueous solution because at
entirely as RCOO and amino groups predomi- any pH low enough to protonate the carboxyl group
nantly as RNH3+. Figure 3–1 illustrates the effect of the amino group would also be protonated. Similarly,
pH on the charged state of aspartic acid. at any pH sufficiently high for an uncharged amino
O H+ O H+ O H+ O
–
OH OH O O–
group to predominate, a carboxyl group will be present At Its Isoelectric pH (pI), an Amino Acid
as RCOO. The uncharged representation B (above) Bears No Net Charge
is, however, often used for reactions that do not involve
protonic equilibria. The isoelectric species is the form of a molecule that
has an equal number of positive and negative charges
and thus is electrically neutral. The isoelectric pH, also
pKa Values Express the Strengths called the pI, is the pH midway between pKa values on
of Weak Acids either side of the isoelectric species. For an amino acid
The acid strengths of weak acids are expressed as their such as alanine that has only two dissociating groups,
pKa (Table 3–1). The imidazole group of histidine and there is no ambiguity. The first pKa (R COOH) is
the guanidino group of arginine exist as resonance hy- 2.35 and the second pKa (RNH3+) is 9.69. The iso-
brids with positive charge distributed between both ni- electric pH (pI) of alanine thus is
trogens (histidine) or all three nitrogens (arginine) (Fig-
ure 3–2). The net charge on an amino acid—the pK 1 pK 2 2.35 9.69
pl 6.02
algebraic sum of all the positively and negatively 2 2
charged groups present—depends upon the pKa values
of its functional groups and on the pH of the surround- For polyfunctional acids, pI is also the pH midway be-
ing medium. Altering the charge on amino acids and tween the pKa values on either side of the isoionic
their derivatives by varying the pH facilitates the physi- species. For example, the pI for aspartic acid is
cal separation of amino acids, peptides, and proteins
(see Chapter 4). pK 1 pK 2 2.09 3.96
pl 3.02
2 2
121 122
C
120
117
110
120 120
0.1 nm
O H R
0.36 nm
21
22 /
the mobile phase. Proteins elute in inverse order of the fied by affinity chromatography using immobilized sub-
strength of their interactions with the stationary phase. strates, products, coenzymes, or inhibitors. In theory,
Since the net charge on a protein is determined by only proteins that interact with the immobilized ligand
the pH (see Chapter 3), sequential elution of proteins adhere. Bound proteins are then eluted either by compe-
may be achieved by changing the pH of the mobile tition with soluble ligand or, less selectively, by disrupt-
phase. Alternatively, a protein can be subjected to con- ing protein-ligand interactions using urea, guanidine
secutive rounds of ion exchange chromatography, each hydrochloride, mildly acidic pH, or high salt concentra-
at a different pH, such that proteins that co-elute at one tions. Stationary phase matrices available commercially
pH elute at different salt concentrations at another pH. contain ligands such as NAD+ or ATP analogs. Among
the most powerful and widely applicable affinity matri-
Hydrophobic Interaction Chromatography ces are those used for the purification of suitably modi-
fied recombinant proteins. These include a Ni 2+ matrix
Hydrophobic interaction chromatography separates that binds proteins with an attached polyhistidine ―tag‖
proteins based on their tendency to associate with a sta- and a glutathione matrix that binds a recombinant pro-
tionary phase matrix coated with hydrophobic groups tein linked to glutathione S-transferase.
(eg, phenyl Sepharose, octyl Sepharose). Proteins with
exposed hydrophobic surfaces adhere to the matrix via
hydrophobic interactions that are enhanced by a mobile Peptides Are Purified by Reversed-Phase
phase of high ionic strength. Nonadherent proteins are High-Pressure Chromatography
first washed away. The polarity of the mobile phase is
then decreased by gradually lowering the salt concentra- The stationary phase matrices used in classic column
tion. If the interaction between protein and stationary chromatography are spongy materials whose compress-
phase is particularly strong, ethanol or glycerol may be ibility limits flow of the mobile phase. High-pressure liq-
added to the mobile phase to decrease its polarity and uid chromatography (HPLC) employs incompressible
further weaken hydrophobic interactions. silica or alumina microbeads as the stationary phase and
pressures of up to a few thousand psi. Incompressible
matrices permit both high flow rates and enhanced reso-
Affinity Chromatography lution. HPLC can resolve complex mixtures of lipids or
Affinity chromatography exploits the high selectivity of peptides whose properties differ only slightly. Reversed-
most proteins for their ligands. Enzymes may be puri- phase HPLC exploits a hydrophobic stationary phase of
24 /
aliphatic polymers 3–18 carbon atoms in length. Peptide through the acrylamide matrix determines the rate of
mixtures are eluted using a gradient of a water-miscible migration. Since large complexes encounter greater re-
organic solvent such as acetonitrile or methanol. sistance, polypeptides separate based on their relative
molecular mass (Mr). Individual polypeptides trapped
Protein Purity Is Assessed by in the acrylamide gel are visualized by staining with
Polyacrylamide Gel Electrophoresis dyes such as Coomassie blue (Figure 4–4).
(PAGE)
Isoelectric Focusing (IEF)
The most widely used method for determining the pu-
rity of a protein is SDS-PAGE—polyacrylamide gel Ionic buffers called ampholytes and an applied electric
electrophoresis (PAGE) in the presence of the anionic field are used to generate a pH gradient within a poly-
detergent sodium dodecyl sulfate (SDS). Electrophore- acrylamide matrix. Applied proteins migrate until they
sis separates charged biomolecules based on the rates at reach the region of the matrix where the pH matches
which they migrate in an applied electrical field. For their isoelectric point (pI), the pH at which a peptide’s
SDS-PAGE, acrylamide is polymerized and cross- net charge is zero. IEF is used in conjunction with SDS-
linked to form a porous matrix. SDS denatures and PAGE for two-dimensional electrophoresis, which sepa-
binds to proteins at a ratio of one molecule of SDS per rates polypeptides based on pI in one dimension and
two peptide bonds. When used in conjunction with 2- based on Mr in the second (Figure 4–5). Two-dimen-
mercaptoethanol or dithiothreitol to reduce and break sional electrophoresis is particularly well suited for sepa-
disulfide bonds (Figure 4 –3), SDS separates the com- rating the components of complex mixtures of proteins.
ponent polypeptides of multimeric proteins. The large
number of anionic SDS molecules, each bearing a SANGER WAS THE FIRST TO DETERMINE
charge of 1, on each polypeptide overwhelms the
charge contributions of the amino acid functional
THE SEQUENCE OF A POLYPEPTIDE
groups. Since the charge-to-mass ratio of each SDS- Mature insulin consists of the 21-residue A chain and
polypeptide complex is approximately equal, the physi- the 30-residue B chain linked by disulfide bonds. Fred-
cal resistance each peptide encounters as it moves erick Sanger reduced the disulfide bonds (Figure 4–3),
NH
O HN
S
HN S
H
NH
O
O SH
HCOOH C2H5
OH
NH
O
HN
SO2
HN
HS H
NH
O
Figure 4–4. Use of SDS-PAGE to observe successive
purification of a recombinant protein. The gel was
Figure 4–3. Oxidative cleavage of adjacent polypep- stained with Coomassie blue. Shown are protein stan-
tide chains linked by disulfide bonds (shaded) by per- dards (lane S) of the indicated mass, crude cell extract
formic acid (left) or reductive cleavage by -mercap- (E), high-speed supernatant liquid (H), and the DEAE-
toethanol (right) forms two peptides that contain Sepharose fraction (D). The recombinant protein has a
cysteic acid residues or cysteinyl residues, respectively. mass of about 45 kDa.
pH = 3 pH = 10
IEF
SDS
PAGE
Figure 4–5. Two-dimensional IEF-SDS-PAGE. The
gel was stained with Coomassie blue. A crude bacter-
ial extract was first subjected to isoelectric focusing
(IEF) in a pH 3–10 gradient. The IEF gel was then
placed horizontally on the top of an SDS gel, and the
proteins then further resolved by SDS-PAGE. Notice
the greatly improved resolution of distinct polypep-
tides relative to ordinary SDS-
PAGE gel (Figure 4–4).
separated the A and B chains, and cleaved each chain Large Polypeptides Are First Cleaved Into
into smaller peptides using trypsin, chymotrypsin, and Smaller Segments
pepsin. The resulting peptides were then isolated and
treated with acid to hydrolyze peptide bonds and gener- While the first 20–30 residues of a peptide can readily
ate peptides with as few as two or three amino acids. be determined by the Edman method, most polypep-
Each peptide was reacted with 1-fluoro-2,4-dinitroben- tides contain several hundred amino acids. Conse-
zene (Sanger’s reagent), which derivatizes the exposed quently, most polypeptides must first be cleaved into
-amino group of amino terminal residues. The amino smaller peptides prior to Edman sequencing. Cleavage
acid content of each peptide was then determined. also may be necessary to circumvent posttranslational
While the -amino group of lysine also reacts with modifications that render a protein’s -amino group
Sanger’s reagent, amino-terminal lysines can be distin- ―blocked‖, or unreactive with the Edman reagent.
guished from those at other positions because they react It usually is necessary to generate several peptides
with 2 mol of Sanger’s reagent. Working backwards to using more than one method of cleavage. This reflects
larger fragments enabled Sanger to determine the com- both inconsistency in the spacing of chemically or enzy-
plete sequence of insulin, an accomplishment for which matically susceptible cleavage sites and the need for sets
he received a Nobel Prize in 1958. of peptides whose sequences overlap so one can infer
the sequence of the polypeptide from which they derive
(Figure 4–7). Reagents for the chemical or enzymatic
THE EDMAN REACTION ENABLES cleavage of proteins include cyanogen bromide (CNBr),
PEPTIDES & PROTEINS trypsin, and Staphylococcus aureus V8 protease (Table
TO BE SEQUENCED 4–1). Following cleavage, the resulting peptides are pu-
rified by reversed-phase HPLC—or occasionally by
Pehr Edman introduced phenylisothiocyanate (Edman’s SDS-PAGE—and sequenced.
reagent) to selectively label the amino-terminal residue
of a peptide. In contrast to Sanger’s reagent, the
phenylthiohydantoin (PTH) derivative can be removed MOLECULAR BIOLOGY HAS
under mild conditions to generate a new amino terminal REVOLUTIONIZED THE DETERMINATION
residue (Figure 4–6). Successive rounds of derivatization OF PRIMARY STRUCTURE
with Edman’s reagent can therefore be used to sequence
many residues of a single sample of peptide. Edman se- Knowledge of DNA sequences permits deduction of
quencing has been automated, using a thin film or solid the primary structures of polypeptides. DNA sequenc-
matrix to immobilize the peptide and HPLC to identify ing requires only minute amounts of DNA and can
PTH amino acids. Modern gas-phase sequencers can readily yield the sequence of hundreds of nucleotides.
analyze as little as a few picomoles of peptide. To clone and sequence the DNA that encodes a partic-
26 /
S Peptide X Peptide Y
C Peptide Z
N
+
O H NH2 Carboxyl terminal Amino terminal
N portion of portion of
N R peptide X peptide Y
H
R
Figure 4–7. The overlapping peptide Z is used to de-
Phenylisothiocyanate (Edman reagent) duce that peptides X and Y are present in the original
and a peptide protein in the order X Y, not Y X.
SECONDARY STRUCTURE
Peptide Bonds Restrict Possible
Secondary Conformations
Free rotation is possible about only two of the three co-
valent bonds of the polypeptide backbone: the -car- occur in nature. Schematic diagrams of proteins
bon (C) to the carbonyl carbon (Co) bond and the
C to nitrogen bond (Figure 3–4). The partial double- repre-sent helices as cylinders.
bond character of the peptide bond that links Co to the The stability of an helix arises primarily from hy-
-nitrogen requires that the carbonyl carbon, carbonyl drogen bonds formed between the oxygen of the pep-
oxygen, and -nitrogen remain coplanar, thus prevent- tide bond carbonyl and the hydrogen atom of the pep-
ing rotation. The angle about the CN bond is tide bond nitrogen of the fourth residue down the
termed the phi () angle, and that about the CoC polypeptide chain (Figure 5–4). The ability to form the
bond the psi () angle. For amino acids other than maximum number of hydrogen bonds, supplemented
glycine, most combinations of phi and psi angles are by van der Waals interactions in the core of this tightly
disallowed because of steric hindrance (Figure 5–1). packed structure, provides the thermodynamic driving
The conformations of proline are even more restricted force for the formation of an helix. Since the peptide
due to the absence of free rotation of the NC bond. bond nitrogen of proline lacks a hydrogen atom to con-
Regions of ordered secondary structure arise when a tribute to a hydrogen bond, proline can only be stably
series of aminoacyl residues adopt similar phi and psi accommodated within the first turn of an helix.
angles. Extended segments of polypeptide (eg, loops) When present elsewhere, proline disrupts the confor-
can possess a variety of such angles. The angles that de- mation of the helix, producing a bend. Because of its
fine the two most common types of secondary struc- small size, glycine also often induces bends in helices.
ture, the a helix and the β sheet, fall within the lower Many helices have predominantly hydrophobic R
and upper left-hand quadrants of a Ramachandran groups on one side of the axis of the helix and predomi-
plot, respectively (Figure 5–1). nantly hydrophilic ones on the other. These amphi-
pathic helices are well adapted to the formation of in-
The Alpha Helix terfaces between polar and nonpolar regions such as the
hydrophobic interior of a protein and its aqueous envi-
The polypeptide backbone of an helix is twisted by
an equal amount about each -carbon with a phi angle
of approximately 57 degrees and a psi angle of approx-
imately 47 degrees. A complete turn of the helix con-
tains an average of 3.6 aminoacyl residues, and the dis-
tance it rises per turn (its pitch) is 0.54 nm (Figure
5–2). The R groups of each aminoacyl residue in an
helix face outward (Figure 5–3). Proteins contain only
L-amino acids, for which a right-handed helix is by
far the more stable, and only right-handed helices
32 /
R R
N O
R C
COOH
H
H CH2
N
H C H
C
H N O C O
CH3 C O H N
CH2OH
C C
H H
30
15
or other ligand. Other domains may anchor a protein to
55 a membrane or interact with a regulatory molecule that
N modulates its function. A small polypeptide such as
50
345 80 70 triose phosphate isomerase (Figure 5–6) or myoglobin
330 280 (Chapter 6) may consist of a single domain. By contrast,
90
protein kinases contain two domains. Protein kinases
150
185 350 catalyze the transfer of a phosphoryl group from ATP to
145 C
a peptide or protein. The amino terminal portion of the
230 377 polypeptide, which is rich in sheet, binds ATP, while
the carboxyl terminal domain, which is rich in helix,
310 245
110 320
220 260 binds the peptide or protein substrate (Figure 5–8). The
groups that catalyze phosphoryl transfer reside in a loop
300 258 positioned at the interface of the two domains.
205
120 In some cases, proteins are assembled from more
than one polypeptide, or protomer. Quaternary struc-
170
125
ture defines the polypeptide composition of a protein
and, for an oligomeric protein, the spatial relationships
Figure 5–6. Examples of tertiary structure of pro- between its subunits or protomers. Monomeric pro-
teins. Top: The enzyme triose phosphate isomerase. teins consist of a single polypeptide chain. Dimeric
Note the elegant and symmetrical arrangement of al- proteins contain two polypeptide chains. Homodimers
ternating sheets and helices. (Courtesy of J Richard- contain two copies of the same polypeptide chain,
son.) Bottom: Two-domain structure of the subunit of a
while in a heterodimer the polypeptides differ. Greek
letters (, , etc) are used to distinguish different sub-
homodimeric enzyme, a bacterial class II HMG-CoA re-
units of a heterooligomeric protein, and subscripts indi-
ductase. As indicated by the numbered residues, the
cate the number of each subunit type. For example, 4
single polypeptide begins in the large domain, enters
designates a homotetrameric protein, and 22 a pro-
the small domain, and ends in the large domain. (Cour- tein with five subunits of three different types.
tesy of C Lawrence, V Rodwell, and C Stauffacher, Purdue Since even small proteins contain many thousands
University.) of atoms, depictions of protein structure that indicate
the position of every atom are generally too complex to
be readily interpreted. Simplified schematic diagrams
thus are used to depict key features of a protein’s ter-
from water. Other significant contributors include hy-
drogen bonds and salt bridges between the carboxylates
of aspartic and glutamic acid and the oppositely
charged side chains of protonated lysyl, argininyl, and
histidyl residues. While individually weak relative to a
typical covalent bond of 80–120 kcal/mol, collectively
these numerous interactions confer a high degree of sta-
bility to the biologically functional conformation of a
protein, just as a Velcro fastener harnesses the cumula-
tive strength of multiple plastic loops and hooks.
Some proteins contain covalent disulfide (S S)
bonds that link the sulfhydryl groups of cysteinyl
residues. Formation of disulfide bonds involves oxida-
tion of the cysteinyl sulfhydryl groups and requires oxy-
gen. Intrapolypeptide disulfide bonds further enhance
the stability of the folded conformation of a peptide,
while interpolypeptide disulfide bonds stabilize the
quaternary structure of certain oligomeric proteins.
THREE-DIMENSIONAL STRUCTURE
IS DETERMINED BY X-RAY
CRYSTALLOGRAPHY OR BY
NMR SPECTROSCOPY
X-Ray Crystallography
Since the determination of the three-dimensional struc-
ture of myoglobin over 40 years ago, the three-dimen-
sional structures of thousands of proteins have been de-
Figure 5–8. Domain structure. Protein kinases con- termined by x-ray crystallography. The key to x-ray
tain two domains. The upper, amino terminal domain crystallography is the precipitation of a protein under
binds the phosphoryl donor ATP (light blue). The lower, conditions in which it forms ordered crystals that dif-
carboxyl terminal domain is shown binding a synthetic fract x-rays. This is generally accomplished by exposing
peptide substrate (dark blue).
small drops of the protein solution to various combina-
tions of pH and precipitating agents such as salts and
organic solutes such as polyethylene glycol. A detailed
three-dimensional structure of a protein can be con-
tiary and quaternary structure. Ribbon diagrams (Fig- structed from its primary structure using the pattern by
ures 5–6 and 5–8) trace the conformation of the which it diffracts a beam of monochromatic x-rays.
polypeptide backbone, with cylinders and arrows indi- While the development of increasingly capable com-
cating regions of helix and sheet, respectively. In an puter-based tools has rendered the analysis of complex
even simpler representation, line segments that link the x-ray diffraction patterns increasingly facile, a major
carbons indicate the path of the polypeptide back- stumbling block remains the requirement of inducing
bone. These schematic diagrams often include the side highly purified samples of the protein of interest to
chains of selected amino acids that emphasize specific crystallize. Several lines of evidence, including the abil-
structure-function relationships. ity of some crystallized enzymes to catalyze chemical re-
actions, indicate that the vast majority of the structures
MULTIPLE FACTORS STABILIZE determined by crystallography faithfully represent the
TERTIARY & QUATERNARY STRUCTURE structures of proteins in free solution.
Higher orders of protein structure are stabilized primar- Nuclear Magnetic Resonance
ily—and often exclusively—by noncovalent interac-
tions. Principal among these are hydrophobic interac-
Spectroscopy
tions that drive most hydrophobic amino acid side Nuclear magnetic resonance (NMR) spectroscopy, a
chains into the interior of the protein, shielding them powerful complement to x-ray crystallography, mea-
36 /
O Arg 145
NH2 NH
+ OH C
N NH2 NH2
O CH2 O
H H
O C O O
N H C C
N H Tyr 248
H H H
HO OH N C
His 196
– O CH2
O P O
2+
Zn
NH2
NH2 O O His 69
C N
N
Glu 72
N
O N N
H
O P O CH2
Figure 7–3. Two-dimensional representation of a
O– O dipeptide substrate, glycyl-tyrosine, bound within the
active site of carboxypeptidase A.
H H
HO OR
Raising the temperature increases For the reaction A + B P + Q— characteristic changes in free energy, GF,
and GD are
[P][Q] associated with each partial reaction.
Keq (4)
[A][B] E R L
ELRLL GF (8)
and for reaction (5)
ELRLL
E R L GD (9)
(5)
AA
P E R L G G G
E R L F D (8-10)
[P] (6)
K eq
[A]2 For the overall reaction (10), G is the sum of GF and
GD. As for any equation of two terms, it is not possi-
—G0 may be calculated from equation (3) if the con- ble to infer from G either the sign or the magnitude
centrations of substrates and products present at equi- of GF or GD.
librium are known. If G0 is a negative number, Keq Many reactions involve multiple transition states,
will be greater than unity and the concentration of each with an associated change in free energy. For these
products at equilibrium will exceed that of substrates. If reactions, the overall G represents the sum of all of
G0 is positive, Keq will be less than unity and the for- the free energy changes associated with the formation
mation of substrates will be favored. and decay of all of the transition states. Therefore, it is
Notice that, since G0 is a function exclusively of not possible to infer from the overall ΔG the num-
the initial and final states of the reacting species, it can ber or type of transition states through which the re-
provide information only about the direction and equi- action proceeds. Stated another way: overall thermo-
librium state of the reaction. G0 is independent of the dynamics tells us nothing about kinetics.
mechanism of the reaction and therefore provides no
information concerning rates of reactions. Conse- GF Defines the Activation Energy
quently—and as explained below—although a reaction
may have a large negative G0 or G0, it may never- Regardless of the sign or magnitude of G, GF for the
theless take place at a negligible rate. overwhelming majority of chemical reactions has a pos-
itive sign. The formation of transition state intermedi-
ates therefore requires surmounting of energy barriers.
THE RATES OF REACTIONS For this reason, GF is often termed the activation en-
ARE DETERMINED BY THEIR ergy, Eact, the energy required to surmount a given en-
ACTIVATION ENERGY ergy barrier. The ease—and hence the frequency—with
which this barrier is overcome is inversely related to
Reactions Proceed via Transition States Eact. The thermodynamic parameters that determine
how fast a reaction proceeds thus are the GF values for
The concept of the transition state is fundamental to formation of the transition states through which the re-
understanding the chemical and thermodynamic basis action proceeds. For a simple reaction, where means
of catalysis. Equation (7) depicts a displacement reac- ―proportionate to,‖
tion in which an entering group E displaces a leaving
group L, attached initially to R. E
act
(11)
E R L Rate e RT
E R L (7)
Midway through the displacement, the bond between The activation energy for the reaction proceeding in the
R and L has weakened but has not yet been completely opposite direction to that drawn is equal to GD.
severed, and the new bond between E and R is as yet
incompletely formed. This transient intermediate—in
which neither free substrate nor product exists—is NUMEROUS FACTORS AFFECT
termed the transition state, E···R···L. Dotted lines THE REACTION RATE
represent the ―partial‖ bonds that are undergoing for- The kinetic theory—also called the collision theory—
mation and rupture. of chemical kinetics states that for two molecules to
Reaction (7) can be thought of as consisting of two react they must (1) approach within bond-forming dis-
―partial reactions,‖ the first corresponding to the forma- tance of one another, or ―collide‖; and (2) must possess
tion (F) and the second to the subsequent decay (D) of sufficient kinetic energy to overcome the energy barrier
the transition state intermediate. As for all reactions, for reaching the transition state. It therefore follows
that anything which increases the frequency or energy of which can also be written as
collision between substrates will increase the rate of the
reaction in which they participate. ABBP (15)
the corresponding rate expression is
Temperature
the kinetic energy of molecules. As illustrated in Rate [A][B][B] (16)
Figure 8–1, the total num- ber of molecules whose or
kinetic energy exceeds the en- ergy barrier Eact (vertical
bar) for formation of products increases from low (A), Rate [A][B]
2
(17)
through intermediate (B), to
high (C) temperatures. Increasing the kinetic energy of For the general case when n molecules of A react with
molecules also increases their motion and therefore the m molecules of B,
frequency with which they collide. This combination of
more frequent and more highly energetic and produc- nA mB P (18)
tive collisions increases the reaction rate.
the rate expression is
Reactant Concentration
Rate [A]n[B]m (19)
The frequency with which molecules collide is directly
proportionate to their concentrations. For two different Replacing the proportionality constant with an equal
molecules A and B, the frequency with which they col- sign by introducing a proportionality or rate constant
lide will double if the concentration of either A or B is k characteristic of the reaction under study gives equa-
doubled. If the concentrations of both A and B are dou- tions (20) and (21), in which the subscripts 1 and 1
bled, the probability of collision will increase fourfold. refer to the rate constants for the forward and reverse
For a chemical reaction proceeding at constant tem- reactions, respectively.
perature that involves one molecule each of A and B,
Rate 1 k1[A]n[B]m (20)
ABP (12)
the number of molecules that possess kinetic energy Rate1 k 1[P] (21)
sufficient to overcome the activation energy barrier will
be a constant. The number of collisions with sufficient
energy to produce product P therefore will be directly Keq Is a Ratio of Rate Constants
proportionate to the number of collisions between A While all chemical reactions are to some extent re-
and B and thus to their molar concentrations, denoted versible, at equilibrium the overall concentrations of re-
by square brackets. actants and products remain constant. At equilibrium,
the rate of conversion of substrates to products there-
Rate [A][B] (13) fore equals the rate at which products are converted to
Similarly, for the reaction represented by substrates.
Therefore,
k1[A]n[B]m k 1[P] (23)
Energy barrier
and
A B
Number of
molecules
k1 [P]
(24)
k 1 [A]n[B]m
Vmax
0
Low High
Vmax/2
pH
vi
B
Figure 8–2. Effect of pH on enzyme activity. Con-
sider, for example, a negatively charged enzyme (EH ) A Vmax/2
that binds a positively charged substrate (SH+). Shown
is the proportion (%) of SH+ [\\\] and of EH [///] as a K [S]
function of pH. Only in the cross-hatched area do both
the enzyme and the substrate bear an appropriate Figure 8–3. Effect of substrate concentration on the
charge. initial velocity of an enzyme-catalyzed reaction.
Nucleotides 6
BIOMEDICAL IMPORTANCE H H
6 4
C 5 C 5
Nucleotides—the monomer units or building blocks of 1
N 3
N CH
nucleic acids—serve multiple additional functions. They 2
CH
C HC CH
form a part of many coenzymes and serve as donors of H N 4 2 N 6
3 H 1
phosphoryl groups (eg, ATP or GTP), of sugars (eg,
UDP- or GDP-sugars), or of lipid (eg, CDP-acylglyc- Purine Pyrimidine
erol). Regulatory nucleotides include the second mes-
sengers cAMP and cGMP, the control by ADP of ox- Figure 33–1. Purine and pyrimidine. The atoms are
idative phosphorylation, and allosteric regulation of numbered according to the international system.
enzyme activity by ATP, AMP, and CTP. Synthetic
purine and pyrimidine analogs that contain halogens,
thiols, or additional nitrogen are employed for chemo-
therapy of cancer and AIDS and as suppressors of the Nucleosides & Nucleotides
immune response during organ transplantation. Nucleosides are derivatives of purines and pyrimidines
that have a sugar linked to a ring nitrogen. Numerals
PURINES, PYRIMIDINES, NUCLEOSIDES, with a prime (eg, 2 or 3) distinguish atoms of the
& NUCLEOTIDES sugar from those of the heterocyclic base. The sugar in
ribonucleosides is D-ribose, and in deoxyribonucleo-
Purines and pyrimidines are nitrogen-containing hete- sides it is 2-deoxy-D-ribose. The sugar is linked to the
rocycles, cyclic compounds whose rings contain both heterocyclic base via a β-N-glycosidic bond, almost al-
carbon and other elements (hetero atoms). Note that ways to N-1 of a pyrimidine or to N-9 of a purine (Fig-
the smaller pyrimidine has the longer name and the ure 33–3).
larger purine the shorter name and that their six-atom
rings are numbered in opposite directions (Figure
33–1). The planar character of purines and pyrimidines
facilitates their close association, or ―stacking,‖ which NH2 NH O OH
stabilizes double-stranded DNA (Chapter 36). The oxo
and amino groups of purines and pyrimidines exhibit
keto-enol and amine-imine tautomerism (Figure 33–2),
but physiologic conditions strongly favor the amino Figure 33–2. Tautomerism of the oxo and amino
and oxo forms. functional groups of purines and pyrimidines.
286
NUCLEOTIDES / 287
NH2 NH2 O O
HN
HN
O O
N N H2N N N
HO HO HO HO
O O O O
OH OH OH OH OH OH OH OH
Adenosine Cytidine Guanosine Uridine
Mononucleotides are nucleosides with a phosphoryl cleotides. Both therefore exist as syn or anti conformers
group esterified to a hydroxyl group of the sugar. The (Figure 33–5). While both conformers occur in nature,
3- and 5-nucleotides are nucleosides with a phospho- anti conformers predominate. Table 33–1 lists the
ryl group on the 3- or 5-hydroxyl group of the sugar, major purines and pyrimidines and their nucleoside
respectively. Since most nucleotides are 5-, the prefix and nucleotide derivatives. Single-letter abbreviations
―5-‖ is usually omitted when naming them. UMP and are used to identify adenine (A), guanine (G), cytosine
dAMP thus represent nucleotides with a phosphoryl (C), thymine (T), and uracil (U), whether free or pre-
group on C-5 of the pentose. Additional phosphoryl sent in nucleosides or nucleotides. The prefix ―d‖
groups linked by acid anhydride bonds to the phos- (deoxy) indicates that the sugar is 2-deoxy-D-ribose
phoryl group of a mononucleotide form nucleoside (eg, dGTP) (Figure 33–6).
diphosphates and triphosphates (Figure 33–4).
Steric hindrance by the base restricts rotation about Nucleic Acids Also Contain
the -N-glycosidic bond of nucleosides and nu- Additional Bases
Small quantities of additional purines and pyrimidines
NH2 occur in DNA and RNAs. Examples include 5-methyl-
N
cytosine of bacterial and human DNA, 5-hydroxy-
Adenine methylcytosine of bacterial and viral nucleic acids, and
mono- and di-N-methylated adenine and guanine of
N N
CH2
O NH2 NH2
O Ribose
O– O O–
HO P O P O P
HO OH
O O– O N N
HO HO
Adenosine 5-monophosphate (AMP) O O
Nucleoside
Base Base X = Ribose or Nucleotide, Where
Formula X=H Deoxyribose X = Ribose Phosphate
NH2
N N
Adenine Adenosine Adenosine monophosphate
N
A A AMP
N
X
O
H N
N
Guanine Guanosine Guanosine monophosphate
H2N N
G G GMP
N
X
NH2
O
H
N
Uracil Uridine Uridine monophosphate
U U UMP
O N
O
H CH3
N
Thymine Thymidine Thymidine monophosphate
O N
T T TMP
dX
NH2 NH2 O O
CH3
HN HN
N N N N O N O
O O O O O O O O O O O O
P P P P
–O O– –O O– –O O– –
O O
OH OH OH H OH OH OH H
O O N N H2N N N
N N
H H O
CH2 O CH2
5-Methylcytosine 5-Hydroxymethylcytosine O O
–O –O
P O P O
H3C CH3 O OH O
N O CH3 OH
N N
7
Figure 33–9. cAMP, 3,5-cyclic AMP, and cGMP.
HN
N H2N N N
H
Nucleotides Serve Diverse
Dimethylaminoadenine 7-Methylguanine Physiologic Functions
Figure 33–7. Four uncommon naturally occurring Nucleotides participate in reactions that fulfill physio-
pyrimidines and purines. logic functions as diverse as protein synthesis, nucleic
acid synthesis, regulatory cascades, and signal transduc-
tion pathways.
mammalian messenger RNAs (Figure 33–7). These
atypical bases function in oligonucleotide recognition
and in regulating the half-lives of RNAs. Free nu- Nucleoside Triphosphates Have High
cleotides include hypoxanthine, xanthine, and uric acid Group Transfer Potential
(see Figure 34–8), intermediates in the catabolism of Acid anhydrides, unlike phosphate esters, have high
adenine and guanine. Methylated heterocyclic bases of group transfer potential. 0 for the hydrolysis of each
plants include the xanthine derivatives caffeine of cof- of the terminal phosphates of nucleoside triphosphates
fee, theophylline of tea, and theobromine of cocoa (Fig- is about 7 kcal/mol (30 kJ/mol). The high group
ure 33–8). transfer potential of purine and pyrimidine nucleoside
Posttranslational modification of preformed polynu- triphosphates permits them to function as group trans-
cleotides can generate additional bases such as fer reagents. Cleavage of an acid anhydride bond typi-
pseudouridine, in which D-ribose is linked to C-5 of cally is coupled with a highly endergonic process such
uracil by a carbon-to-carbon bond rather than by a as covalent bond synthesis—eg, polymerization of nu-
-N-glycosidic bond. The nucleotide pseudouridylic cleoside triphosphates to form a nucleic acid.
acid arises by rearrangement of UMP of a preformed In addition to their roles as precursors of nucleic
tRNA. Similarly, methylation by S-adenosylmethionine acids, ATP, GTP, UTP, CTP, and their derivatives
of a UMP of preformed tRNA forms TMP (thymidine each serve unique physiologic functions discussed in
monophosphate), which contains ribose rather than de- other chapters. Selected examples include the role of
oxyribose. ATP as the principal biologic transducer of free energy;
the second messenger cAMP (Figure 33–9); adenosine
CH 3-phosphate-5-phosphosulfate (Figure 33–10), the
O 3
sulfate donor for sulfated proteoglycans (Chapter 48)
H3C N and for sulfate conjugates of drugs; and the methyl
N
group donor S-adenosylmethionine (Figure 33–11).
O N
N
CH3
P
Figure 33–8. Caffeine, a trimethylxanthine. The di- Adenine Ribose P O SO 2–
methylxanthines theobromine and theophylline are
similar but lack the methyl group at N-1 and at N-7, re- Figure 33–10. Adenosine 3-phosphate-5-phos-
spectively. phosulfate.
NH2 Table 33–2. Many coenzymes and related
compounds are derivatives of adenosine
monophosphate.
N N NH2
N
COO– CH3 CH2 N Adenine
O
CH CH2 CH S N
+ N
O
NH3+
R O P O CH2
HO OH
O– O
Methionine Adenosine
R'' O OR'
Figure 33–11. S-Adenosylmethionine.
D-Ribose
O O
I HN
HN
6N
O O
N
HO HO O
O
F O HN
O HN 5 8
N
H2N N
2 O H
N
H HO OH
HO H
5-Iodo-2-deoxyuridine 5-Fluorouracil 6-Azauridine 8-Azaguanine
SH SH OH
6 N N
N
N H2N N N N
H H H
6-Mercaptopurine 6-Thioguanine Alloburinol
CH2 N NH2
N NH2
O
O
P
O O
CH2 N
H H
H H
H3C NH
O
H O
T
P
O
H H CH2 N NH2
H H
O N
O
H O
P
CH2 N N
H H
H H
O
O
H O
P
H H
H H
O
3
P
Figure 35–1. A segment of one strand of a DNA molecule in which the purine and pyrimidine bases guanine
(G), cytosine (C), thymine (T), and adenine (A) are held together by a phosphodiester backbone between 2-de-
oxyribosyl moieties attached to the nucleobases by an N-glycosidic bond. Note that the backbone has a polarity
(ie, a direction). Convention dictates that a single-stranded DNA sequence is written in the 5 to 3 direction (ie,
pGpCpTpA, where G, C, T, and A represent the four bases and p represents the interconnecting phosphates).
sides in the sequence of nucleotides on one strand, the dine nucleotide, whereas the other pair, the A–T pair, is
template strand. This is the strand of DNA that is held together by two hydrogen bonds. Thus, the G–C
copied during nucleic acid synthesis. It is sometimes re- bonds are much more resistant to denaturation, or
ferred to as the noncoding strand. The opposite strand ―melting,‖ than A–T-rich regions.
is considered the coding strand because it matches the
RNA transcript that encodes the protein. The Denaturation (Melting) of DNA
The two strands, in which opposing bases are held Is Used to Analyze Its Structure
together by hydrogen bonds, wind around a central axis
in the form of a double helix. Double-stranded DNA The double-stranded structure of DNA can be sepa-
exists in at least six forms (A–E and Z). The B form is rated into two component strands (melted) in solution
usually found under physiologic conditions (low salt, by increasing the temperature or decreasing the salt
high degree of hydration). A single turn of B-DNA concentration. Not only do the two stacks of bases pull
about the axis of the molecule contains ten base pairs. apart but the bases themselves unstack while still con-
The distance spanned by one turn of B-DNA is 3.4 nected in the polymer by the phosphodiester backbone.
nm. The width (helical diameter) of the double helix in Concomitant with this denaturation of the DNA mole-
B-DNA is 2 nm. cule is an increase in the optical absorbance of the
As depicted in Figure 35–3, three hydrogen bonds purine and pyrimidine bases—a phenomenon referred
hold the deoxyguanosine nucleotide to the deoxycyti- to as hyperchromicity of denaturation. Because of the
CH3
O
H H
N
H
O
Thymidine
Minor groove
T S Adenosine
P S A o
P 34 A
S T A S
P P
S G S H
P P
S G C S
Major groove H
O
O
H
Cytosine
H Guanosine
20 A
Figure 35–3. Base pairing between deoxyadenosine
Figure 35–2. A diagrammatic representation of the and thymidine involves the formation of two hydrogen
Watson and Crick model of the double-helical structure bonds. Three such bonds form between deoxycytidine
of the B form of DNA. The horizontal arrow indicates and deoxyguanosine. The broken lines represent hy-
the width of the double helix (20 Å), and the vertical drogen bonds.
arrow indicates the distance spanned by one complete
turn of the double helix (34 Å). One turn of B-DNA in-
cludes ten base pairs (bp), so the rise is 3.4 Å per bp. or DNA-RNA hybrids to be separated at much lower
The central axis of the double helix is indicated by the temperatures and minimizes the phosphodiester bond
vertical rod. The short arrows designate the polarity of breakage that occurs at high temperatures.
the antiparallel strands. The major and minor grooves
are depicted. (A, adenine; C, cytosine; G, guanine; Renaturation of DNA Requires
T, thymine; P, phosphate; S, sugar [deoxyribose].) Base Pair Matching
Separated strands of DNA will renature or reassociate
when appropriate physiologic temperature and salt con-
stacking of the bases and the hydrogen bonding be- ditions are achieved. The rate of reassociation depends
tween the stacks, the double-stranded DNA molecule upon the concentration of the complementary strands.
exhibits properties of a rigid rod and in solution is a vis- Reassociation of the two complementary DNA strands
cous material that loses its viscosity upon denaturation. of a chromosome after DNA replication is a physiologic
The strands of a given molecule of DNA separate example of renaturation (see below). At a given temper-
over a temperature range. The midpoint is called the ature and salt concentration, a particular nucleic acid
melting temperature, or Tm. The Tm is influenced by strand will associate tightly only with a complementary
the base composition of the DNA and by the salt con- strand. Hybrid molecules will also form under appro-
centration of the solution. DNA rich in G–C pairs, priate conditions. For example, DNA will form a hy-
which have three hydrogen bonds, melts at a higher tem- brid with a complementary DNA (cDNA) or with a
perature than that rich in A–T pairs, which have two hy- cognate messenger RNA (mRNA; see below). When
drogen bonds. A tenfold increase of monovalent cation combined with gel electrophoresis techniques that sepa-
concentration increases the Tm by 16.6 °C. Formamide, rate hybrid molecules by size and radioactive labeling to
which is commonly used in recombinant DNA experi- provide a detectable signal, the resulting analytic tech-
ments, destabilizes hydrogen bonding between bases, niques are called Southern (DNA/cDNA) and North-
thereby lowering the Tm. This allows the strands of DNA ern blotting (DNA/RNA), respectively. These proce-
dures allow for very specific identification of hybrids the cell and organism, and it provides the information
from mixtures of DNA or RNA (see Chapter 40). inherited by daughter cells or offspring. Both of these
functions require that the DNA molecule serve as a
There Are Grooves in the DNA Molecule template—in the first case for the transcription of the
information into RNA and in the second case for the
Careful examination of the model depicted in Figure replication of the information into daughter DNA mol-
35–2 reveals a major groove and a minor groove wind- ecules.
ing along the molecule parallel to the phosphodiester The complementarity of the Watson and Crick dou-
backbones. In these grooves, proteins can interact specif- ble-stranded model of DNA strongly suggests that
ically with exposed atoms of the nucleotides (usually by replication of the DNA molecule occurs in a semicon-
H bonding) and thereby recognize and bind to specific servative manner. Thus, when each strand of the dou-
nucleotide sequences without disrupting the base pair- ble-stranded parental DNA molecule separates from its
ing of the double-helical DNA molecule. As discussed in complement during replication, each serves as a tem-
Chapters 37 and 39, regulatory proteins control the ex- plate on which a new complementary strand is synthe-
pression of specific genes via such interactions. sized (Figure 35–4). The two newly formed double-
stranded daughter DNA molecules, each containing
DNA Exists in Relaxed one strand (but complementary rather than identical)
& Supercoiled Forms from the parent double-stranded DNA molecule, are
then sorted between the two daughter cells (Figure
In some organisms such as bacteria, bacteriophages, and 35–5). Each daughter cell contains DNA molecules
many DNA-containing animal viruses, the ends of the with information identical to that which the parent
DNA molecules are joined to create a closed circle with possessed; yet in each daughter cell the DNA molecule
no covalently free ends. This of course does not destroy of the parent cell has been only semiconserved.
the polarity of the molecules, but it eliminates all free 3
and 5 hydroxyl and phosphoryl groups. Closed circles
exist in relaxed or supercoiled forms. Supercoils are intro- THE CHEMICAL NATURE OF RNA DIFFERS
duced when a closed circle is twisted around its own axis FROM THAT OF DNA
or when a linear piece of duplex DNA, whose ends are
fixed, is twisted. This energy-requiring process puts the Ribonucleic acid (RNA) is a polymer of purine and
molecule under stress, and the greater the number of su- pyrimidine ribonucleotides linked together by 3,5-
percoils, the greater the stress or torsion (test this by phosphodiester bridges analogous to those in DNA
twisting a rubber band). Negative supercoils are formed (Figure 35–6). Although sharing many features with
when the molecule is twisted in the direction opposite DNA, RNA possesses several specific differences:
from the clockwise turns of the right-handed double
helix found in B-DNA. Such DNA is said to be under- (1) In RNA, the sugar moiety to which the phos-
wound. The energy required to achieve this state is, in a phates and purine and pyrimidine bases are attached is
sense, stored in the supercoils. The transition to another ribose rather than the 2-deoxyribose of DNA.
form that requires energy is thereby facilitated by the un- (2) The pyrimidine components of RNA differ from
derwinding. One such transition is strand separation, those of DNA. Although RNA contains the ribonu-
which is a prerequisite for DNA replication and tran- cleotides of adenine, guanine, and cytosine, it does not
scription. Supercoiled DNA is therefore a preferred form possess thymine except in the rare case mentioned
in biologic systems. Enzymes that catalyze topologic below. Instead of thymine, RNA contains the ribonu-
changes of DNA are called topoisomerases. Topoisom- cleotide of uracil.
erases can relax or insert supercoils. The best-character- (3) RNA exists as a single strand, whereas DNA ex-
ized example is bacterial gyrase, which induces negative ists as a double-stranded helical molecule. However,
supercoiling in DNA using ATP as energy source. Ho- given the proper complementary base sequence with
mologs of this enzyme exist in all organisms and are im- opposite polarity, the single strand of RNA—as
portant targets for cancer chemotherapy. demonstrated in Figure 35–7—is capable of folding
back on itself like a hairpin and thus acquiring double-
DNA PROVIDES A TEMPLATE FOR stranded characteristics.
(4) Since the RNA molecule is a single strand com-
REPLICATION & TRANSCRIPTION plementary to only one of the two strands of a gene, its
The genetic information stored in the nucleotide se- guanine content does not necessarily equal its cytosine
quence of DNA serves two purposes. It is the source of content, nor does its adenine content necessarily equal
information for the synthesis of all protein molecules of its uracil content.
OLD OLD
5 G C 3
G
Original
C G parent molecule
T A
A T
G C
G C
A
First-generation
A T
daughter molecules
T A
G C
G C
3 5
T A T A
C G C G
C C
Second-generation
daughter molecules
A T A T
A T A T
T A T A
G C G C
Figure 35–5. DNA replication is semiconservative.
G G
During a round of replication, each of the two strands
A T A T of DNA is used as a template for synthesis of a new,
complementary strand.
3 T A 5 3 T A 5
OLT NEW ETW OLD
A A
Figure 35–4. The double-stranded structure of DNA
and the template function of each old strand (dark
complementarity, an RNA molecule can bind specifi-
shading) on which a new (light shading) complemen-
cally via the base-pairing rules to its template DNA
tary strand is synthesized.
strand; it will not bind (―hybridize‖) with the other
(coding) strand of its gene. The sequence of the RNA
molecule (except for U replacing T) is the same as that
of the coding strand of the gene (Figure 35–8).
(5) RNA can be hydrolyzed by alkali to 2,3 cyclic
diesters of the mononucleotides, compounds that can- Nearly All of the Several Species of RNA
not be formed from alkali-treated DNA because of the Are Involved in Some Aspect of Protein
absence of a 2-hydroxyl group. The alkali lability of Synthesis
RNA is useful both diagnostically and analytically.
Those cytoplasmic RNA molecules that serve as tem-
Information within the single strand of RNA is con- plates for protein synthesis (ie, that transfer genetic in-
tained in its sequence (―primary structure‖) of purine formation from DNA to the protein-synthesizing ma-
and pyrimidine nucleotides within the polymer. The chinery) are designated messenger RNAs, or mRNAs.
sequence is complementary to the template strand of Many other cytoplasmic RNA molecules (ribosomal
the gene from which it was transcribed. Because of this RNAs; rRNAs) have structural roles wherein they con-
O
N
NH
5 G
O
O
P
O O
CH2 N
H H
H H
O NH
HO O
P
O
H H CH2 N NH2
H H
O N
O
HO O
P
CH2 N N
H H
H H
O
O
O
HO
P
H H
H H
O
3 HO
P
Figure 35–6. A segment of a ribonucleic acid (RNA) molecule in which the purine and pyrimidine bases—
guanine (G), cytosine (C), uracil (U), and adenine (A)—are held together by phosphodiester bonds between ribo-
syl moieties attached to the nucleobases by N-glycosidic bonds. Note that the polymer has a polarity as indi-
cated by the labeled 3- and 5-attached phosphates.
tribute to the formation and function of ribosomes (the The genetic material for some animal and plant
organellar machinery for protein synthesis) or serve as viruses is RNA rather than DNA. Although some RNA
adapter molecules (transfer RNAs; tRNAs) for the viruses never have their information transcribed into a
translation of RNA information into specific sequences DNA molecule, many animal RNA viruses—specifi-
of polymerized amino acids. cally, the retroviruses (the HIV virus, for example)—are
Some RNA molecules have intrinsic catalytic activ- transcribed by an RNA-dependent DNA polymerase,
ity. The activity of these ribozymes often involves the the so-called reverse transcriptase, to produce a dou-
cleavage of a nucleic acid. An example is the role of ble-stranded DNA copy of their RNA genome. In
RNA in catalyzing the processing of the primary tran- many cases, the resulting double-stranded DNA tran-
script of a gene into mature messenger RNA. script is integrated into the host genome and subse-
Much of the RNA synthesized from DNA templates quently serves as a template for gene expression and
in eukaryotic cells, including mammalian cells, is de- from which new viral RNA genomes can be tran-
graded within the nucleus, and it never serves as either a scribed.
structural or an informational entity within the cellular
cytoplasm. RNA Is Organized in Several
In all eukaryotic cells there are small nuclear RNA Unique Structures
(snRNA) species that are not directly involved in pro-
tein synthesis but play pivotal roles in RNA processing. In all prokaryotic and eukaryotic organisms, three main
These relatively small molecules vary in size from 90 to classes of RNA molecules exist: messenger RNA
about 300 nucleotides (Table 35–1). (mRNA), transfer RNA (tRNA), and ribosomal RNA
Table 35–1. Some of the species of small stable
Loop
RNAs found in mammalian cells.
C G
C G
G C
A U
A U
A U
U G
U G
Stem
C C
G C
U A
U A
U C
U A Messenger RNAs, particularly in eukaryotes, have
C G some unique chemical characteristics. The 5 terminal
G C of mRNA is ―capped‖ by a 7-methylguanosine triphos-
phate that is linked to an adjacent 2-O-methyl ribonu-
cleoside at its 5-hydroxyl through the three phosphates
5 3 (Figure 35–10). The mRNA molecules frequently con-
Figure 35–7. Diagrammatic representation of the tain internal 6-methyladenylates and other 2-O-ribose
methylated nucleotides. The cap is involved in the
secondary structure of a single-stranded RNA molecule
recognition of mRNA by the translating machinery,
in which a stem loop, or ―hairpin,‖ has been formed and
and it probably helps stabilize the mRNA by preventing
is dependent upon the intramolecular base pairing.
the attack of 5-exonucleases. The protein-synthesizing
Note that A forms hydrogen bonds with U in RNA. machinery begins translating the mRNA into proteins
beginning downstream of the 5 or capped terminal.
The other end of most mRNA molecules, the 3-hy-
(rRNA). Each differs from the others by size, function, droxyl terminal, has an attached polymer of adenylate
and general stability. residues 20–250 nucleotides in length. The specific
function of the poly(A) “tail” at the 3-hydroxyl termi-
nal of mRNAs is not fully understood, but it seems that
A. MESSENGER RNA (MRNA) it maintains the intracellular stability of the specific
This class is the most heterogeneous in size and stabil- mRNA by preventing the attack of 3-exonucleases.
ity. All members of the class function as messengers Some mRNAs, including those for some histones, do
conveying the information in a gene to the protein- not contain poly(A). The poly(A) tail, because it will
synthesizing machinery, where each serves as a template form a base pair with oligodeoxythymidine polymers
on which a specific sequence of amino acids is polymer- attached to a solid substrate like cellulose, can be used
ized to form a specific protein molecule, the ultimate to separate mRNA from other species of RNA, includ-
gene product (Figure 35–9). ing mRNA molecules that lack this tail.
DNA strands:
Coding 5 —T G G A A T T G T G A G C G G A T A A C A A T T T C A C A C A G G A A A C A G C T A T G A C C A T G — 3
Template 3 —A C C T T A A C A C T C G C C T A T T G T T A A A G T G T G T C C T T T G T C G A T A C T G G T A C — 5
RNA 5 p A U U G U G A G C G G A U A A C A A U U U C A C A C A G G A A A C A G C U A U G A C C A U G 3
transcript
Figure 35–8. The relationship between the sequences of an RNA transcript and its gene, in which the cod-
ing and template strands are shown with their polarities. The RNA transcript with a 5 to 3 polarity is comple-
mentary to the template strand with its 3 to 5 polarity. Note that the sequence in the RNA transcript and its
polarity is the same as that in the coding strand, except that the U of the transcript replaces the T of the gene.
DNA
5 3
3 5
mRNA
5 3
Protein synthesis on mRNA template
5 3
Figure 35–9. The expression of genetic in-
Ribosome Completed
protein formation in DNA into the form of an mRNA
molecule transcript. This is subsequently translated by
ribosomes into a specific protein molecule.
In mammalian cells, including cells of humans, the The D, TWC, and extra arms help define a specific
mRNA molecules present in the cytoplasm are not the tRNA.
RNA products immediately synthesized from the DNA Although tRNAs are quite stable in prokaryotes, they
template but must be formed by processing from a pre- are somewhat less stable in eukaryotes. The opposite is
cursor molecule before entering the cytoplasm. Thus, true for mRNAs, which are quite unstable in prokary-
in mammalian nuclei, the immediate products of gene otes but generally stable in eukaryotic organisms.
transcription constitute a fourth class of RNA mole-
cules. These nuclear RNA molecules are very heteroge- C. RIBOSOMAL RNA (RRNA)
neous in size and are quite large. The heterogeneous A ribosome is a cytoplasmic nucleoprotein structure
nuclear RNA (hnRNA) molecules may have a molecu- that acts as the machinery for the synthesis of proteins
lar weight in excess of 107, whereas the molecular from the mRNA templates. On the ribosomes, the
weight of mRNA molecules is generally less than 2 mRNA and tRNA molecules interact to translate into a
106. As discussed in Chapter 37, hnRNA molecules are specific protein molecule information transcribed from
processed to generate the mRNA molecules which then the gene. In active protein synthesis, many ribosomes
enter the cytoplasm to serve as templates for protein are associated with an mRNA molecule in an assembly
synthesis. called the polysome.
The components of the mammalian ribosome,
B. TRANSFER RNA (TRNA) which has a molecular weight of about 4.2 106 and a
tRNA molecules vary in length from 74 to 95 nu- sedimentation velocity of 80S (Svedberg units), are
cleotides. They also are generated by nuclear processing shown in Table 35–2. The mammalian ribosome con-
of a precursor molecule (Chapter 37). The tRNA mole- tains two major nucleoprotein subunits—a larger one
cules serve as adapters for the translation of the infor- with a molecular weight of 2.8 106 (60S) and a
mation in the sequence of nucleotides of the mRNA smaller subunit with a molecular weight of 1.4 106
into specific amino acids. There are at least 20 species (40S). The 60S subunit contains a 5S ribosomal RNA
of tRNA molecules in every cell, at least one (and often (rRNA), a 5.8S rRNA, and a 28S rRNA; there are also
several) corresponding to each of the 20 amino acids re- probably more than 50 specific polypeptides. The 40S
quired for protein synthesis. Although each specific subunit is smaller and contains a single 18S rRNA and
tRNA differs from the others in its sequence of nu- approximately 30 distinct polypeptide chains. All of the
cleotides, the tRNA molecules as a class have many fea- ribosomal RNA molecules except the 5S rRNA are
tures in common. The primary structure—ie, the nu- processed from a single 45S precursor RNA molecule in
cleotide sequence—of all tRNA molecules allows the nucleolus (Chapter 37). 5S rRNA is independently
extensive folding and intrastrand complementarity to transcribed. The highly methylated ribosomal RNA
generate a secondary structure that appears like a molecules are packaged in the nucleolus with the spe-
cloverleaf (Figure 35–11). cific ribosomal proteins. In the cytoplasm, the ribo-
All tRNA molecules contain four main arms. The somes remain quite stable and capable of many transla-
acceptor arm terminates in the nucleotides CpCpAOH. tion cycles. The functions of the ribosomal RNA
These three nucleotides are added posttranscription- molecules in the ribosomal particle are not fully under-
ally. The tRNA-appropriate amino acid is attached to stood, but they are necessary for ribosomal assembly
the 3-OH group of the A moiety of the acceptor arm. and seem to play key roles in the binding of mRNA to
OH OH
C C
H H
HC CH
H2N –
H2C 5 O NH2
O
P O
HN O
O P O–
CH3 5
O
O
O– P CH2
O
O
O
CAP
HC CH
H H
2
3 C C O
OCH3 NH
5 O
O CH2 N
O
P
O O
mRNA
O–
HC H H CH
3 C C
OH
P
O
O–
Figure 35–10. The cap structure attached to the 5 terminal of most eukaryotic messen-
ger RNA molecules. A 7-methylguanosine triphosphate (black) is attached at the 5 terminal
of the mRNA (shown in blue), which usually contains a 2-O-methylpurine nucleotide.
These modifications (the cap and methyl group) are added after the mRNA is transcribed
from DNA.
ribosomes and its translation. Recent studies suggest size from 90 to 300 nucleotides and are present in
that an rRNA component performs the peptidyl trans- 100,000–1,000,000 copies per cell.
ferase activity and thus is an enzyme (a ribozyme). Small nuclear RNAs (snRNAs), a subset of these
RNAs, are significantly involved in mRNA processing
and gene regulation. Of the several snRNAs, U1, U2,
D. SMALL STABLE RNA U4, U5, and U6 are involved in intron removal and the
A large number of discrete, highly conserved, and small processing of hnRNA into mRNA (Chapter 37). The
stable RNA species are found in eukaryotic cells. The U7 snRNA may be involved in production of the cor-
majority of these molecules are complexed with pro- rect 3 ends of histone mRNA—which lacks a poly(A)
teins to form ribonucleoproteins and are distributed in tail. The U4 and U6 snRNAs may also be required for
the nucleus, in the cytoplasm, or in both. They range in poly(A) processing.