Professional Documents
Culture Documents
Glossary of Molecular Biology Terminology
Glossary of Molecular Biology Terminology
This glossary is designed to help the reader with the ter- Term Section
minology of molecular biology. Each year, the glossary c-rel X
will be expanded to include new terms introduced in the Calcium phosphate VI
Education Program. The basic terminology of molecu- CAN X
lar biology is also included. The glossary is divided into CAT V
several general sections. A cross-reference guide is in- cDNA II
cluded to direct readers to the terms they are interested cDNA blunting IX
in. The hope is that this addition to the Education Pro- cDNA library preparation IX
gram will further the understanding of those who are cdk V
less familiar with the discipline of molecular biology. cdkI V
CFBβ IX
CROSS-REFERENCE GUIDE Chimeraplasty VIII
Chitosan-DNA VIII
Term Section Chloramphenicol transferase V
Actinomycin D pulse experiments V Chromatography, gel filtration IV
Adeno-associated viral vectors VIII Chromatography, ion exchange IV
Adenoviral vectors VIII Chromatography, hydrophobic IV
ALK X Chromatography, affinity IV
Allele-specific hybridization XI Chromatography, high performance
Allele-specific PCR IV liquid (HPLC) IV
AML-1 X Cis-acting factors V
Amphotropic virus VIII Codon II
Anaplastic lymphoma kinase X Color complementation assay XI
Antisense oligonucleotides VIII Comparative gene hybridization IV
Basic helix-loop-helix proteins V Competitive oligonucleotide hybridization XI
Bcl-1 X Concatamerization VI
Bcl-2 X Cyclin-dependent kinase V
Bcl-3 X Contig VII
Bcl-6 X Cosmid II
β galactosidase V CpG nucleotide II
Branched chain DNA signal Cyclins V
amplification assay II DEAE dextran VI
c-abl X DEK X
c-fos X Dideoxynucleotide (ddN) chain
c-jun X termination sequencing IV
c-myb X Directional cloning IX
c-myc X DNA (deoxyribonucleic acid) II
c-ras X DNA methylases III
DNA microarrays IV
DNA polymerase III
* University of Washington School of Medicine, Division of DNAse footprinting IV
Hematology, Box 357710, Seattle WA 98195-7710 DNAse hypersensitivity site mapping IV
Thermostabile polymerases The prototype polymerase, DNA methylases These enzymes are normally part of a
Taq, and newer versions such as Vent and Tth polymerase bacterial host defense against invasion by foreign DNA.
are derived from microorganisms that normally reside The enzyme normally methylates endogenous (host)
at high temperature. Consequently, their DNA poly- DNA and thereby renders it resistant to a series of en-
merase enzymes are quite stable to heat denaturation, dogenous restriction endonucleases. In recombinant
making them ideal enzymes for use in the polymerase DNA work, methylation finds use in cDNA cloning to
chain reaction. prevent subsequent digestion by the analogous restriction
endonuclease.
RNA polymerase II This enzyme is used by mamma-
lian cells to transcribe structural genes that result in Reverse transcriptase This enzyme, first purified from
mRNA. The enzyme interacts with a number of other retrovirus-infected cells, produces a cDNA copy from
proteins to correctly initiate transcription, including a an mRNA molecule if first provided with an antisense
number of general factors, and tissue-specific and in- primer (oligo dT or a random primer). This enzyme is
duction-specific enhancing proteins. critical for converting mRNA into cDNA for purposes
of cloning, PCR amplification, or the production of spe-
RNA polymerase III This enzyme is used by the cell to cific probes.
transcribe ribosomal RNA genes.
Topoisomerase A homodimeric chromosomal unwind-
Kinases These enzymes transfer the γ-phosphate group ing enzyme that introduces a double-stranded nick in
from ATP to the 5' hydroxyl group of a nucleic acid chain. DNA, which allows the unwinding necessary to permit
DNA replication, followed by religation. Inhibition of topo-
Viral-derived kinases These enzymes are utilized in re- isomerases leads to blockade of cell division, the target of
combinant DNA technology to transfer phosphate groups several chemotherapeutic agents (e.g., etoposide).
(either unlabeled or 32P-labeled) to oligonucleotides or
DNA fragments. The most commonly used kinase is T4 Telomerase A specialized DNA polymerase that pro-
polynucleotide kinase. tects the length of the terminal segment of a chromo-
some. Should the telomere become sufficiently short-
Mammalian protein kinases These enzymes transfer ened (by repeated rounds of cell division), the cell un-
phosphate groups from ATP to either tyrosine, threo- dergoes apoptosis. The holoenzyme contains both a poly-
nine, or serine residues of proteins. These enzymes are merase and an RNA template; only the latter has been
among the most important signaling molecules present characterized, although the gene for the enzymatic ac-
in mammalian cell biology. tivity has recently been cloned.
Chromatography, hydrophobic This methodology Yeast 2-hybrid screens A strategy designed to deter-
separates proteins based on their hydrophobicity. Pro- mine the binding partners for a protein of interest. The
teins preferentially bind to the matrix based on the gene (or a fragment of the gene) representing a protein
strength of this interaction; proteins are then eluted off of interest (the “bait”) is fused in frame to DNA binding
using solvents of increasing hydrophobicity. Separation domain (DBD) of yeast transcription factor and then in-
media include phenyl-sepharose and octyl-sepharose. troduced into a yeast strain. A cDNA library is then con-
structed from the cells in which the bait is normally ex-
Chromatography, affinity This separation method de- pressed, and fused in frame to the activation domain (AD)
pends on using any molecule that can preferentially bind of the same yeast transcription factor. When the library
to a protein of interest. Typical methodologies include is introduced into the yeast expressing the bait/DBD fu-
using lectins (such as wheat germ or concanavalin A) to sion, any yeast cell expressing a cDNA encoding a bind-
bind glycoproteins or using covalently coupled mono- ing partner of the bait protein will have that cDNA/AD
clonal antibodies to bind specific protein ligands. fusion protein bind to the bait/DBD fusion, bringing the
AD and DBD together, thereby creating a fully func-
Chromatography, high performance liquid (HPLC). tional transcription factor that now drives a reporter gene,
A general methodology to improve the separation of allowing the yeast carrying such interacting proteins to
complex protein mixtures. The types of HPLC columns be identified and the cDNA recovered.
available are the same as for conventional chromatogra-
phy, such as those based on size exclusion, hydropho- V. PHYSIOLOGIC GENE REGULATION
bicity, and ionic interaction, but the improved flow rates
resulting from the high pressure system provide enhanced The regulation of gene expression is central to physiol-
separation capacity and improved speed. ogy. Complex organisms have evolved multiple mecha-
nisms to accomplish this task. The first step in protein
Proteomics. The general term used in the study of the expression is the transcription of a specified gene. The
display of all proteins present in cells under defined con- rate of initiation and elongation of this process is the
ditions. By deciphering which proteins are differentially most commonly used mechanism for regulating gene ex-
displayed in tumor cells compared to their normal coun- pression. Once formed, the primary transcript must be
terparts, or in cells stimulated to grow, vs. their quies- spliced, polyadenylated, and transported to the cyto-
cent state, one can determine the proteins that are re- plasm. These mechanisms are also possible points of
sponsible for the cellular phenotype. In essence, regulation. In the cytoplasm, mRNA can be rapidly de-
proteomics is to proteins what genomics is to genes. graded or retained, another potential site of control. Pro-
tein translation next occurs on the ribosome, which can
DNA microarrays (gene expression arrays or gene be free or membrane-associated. Secreted proteins take
chips) Multiple (presently up to tens of thousands) gene the latter course, and the trafficking of the protein through
fragments or oligonucleotides representing distinct genes these membranes and ultimately to storage or release makes
spotted onto a solid support. Theoretically, microarrays up another important point of potential regulation. Indi-
could be used to determine the totality of the genome vidual gene expression is often controlled at multiple lev-
expressed in a given cell under specific growth condi- els, making investigation and intervention a complex task.
tions, if the entire genome were present on the
microarray. At present, gene chips are available that rep- Transcription Transcription is the act of generating a
resent about 1/3 of the human genome. The microarray primary RNA molecule from the double-stranded DNA
is hybridized with a labeled probe (either radioactive or gene. Regulation of gene expression is predominantly
fluoresceinated) representing all the mRNA species in a at the level of regulating the initiation and elongation of
Initiation complex This multi-protein complex forms Trans-acting factors Proteins that are involved in the
at the site of transcription initiation and is composed of transcriptional regulation of a gene of interest.
RNA polymerase, a series of ubiquitous transcription
factors (TF II family), and specific enhancers and/or si- Cis-acting factors These are regions at a gene either
lencers. The proteins are brought together by the loop- upstream, within, or downstream of the coding sequence
ing of DNA strands so that protein binding sites, which that contains sites to which transcriptionally important
may range up to tens of kb apart, can be brought into proteins may bind. Sequences that contain 5 to 25 nucle-
close juxtaposition. Specific protein•protein interactions otides are present in a typical cis-acting element.
then allow assembly of the complex.
Transcription factors Specific proteins that bind to con-
Polyadenylation Following transcription of a gene, a trol elements of genes. Several families of transcription
specific signal near the 3' end of the primary transcript factors have been identified and include helix-loop-he-
(AATAAA) signals that a polyadenine tail be added to lix proteins, helix-turn-helix proteins, and leucine zip-
the newly formed transcript. The tail may be up to sev- per proteins. Each protein includes several distinct do-
eral hundred nucleotides long. The precise function of mains such as activation and DNA-binding regions.
the poly A tail is uncertain but it seems to play a role in
stability of the mRNA and perhaps in its metabolism LCR (locus control region) Cis-acting sites are occa-
through the nuclear membrane to the ribosome. sionally organized into a region removed from the struc-
tural gene(s) they control. Such locus control regions
Splicing The primary RNA transcript contains a num- (LCRs) are best described for the β globin and α globin
ber of sequences that are not part of the mature mRNA. loci. First recognized by virtue of clustering of multiple
These regions are called introns and are removed from DNAse hypersensitive sites, the β globin LCR is required
the primary RNA transcript by a process termed splic- for high level expression from all of the genes and ap-
ing. A complex tertiary structure termed a lariat is formed pears to be critical for their stage-specific developmen-
and the intron sequence is eliminated bringing the cod- tal pattern of expression.
ing sequences (exons) together. Specific sequences
Transcriptional regulation Gene regulation is deter- Actinomycin D pulse experiments The application of
mined by the rate of transcriptional initiation. This usu- actinomycin D to actively metabolizing cells results in
ally results from alteration in the level of activity of trans- the cessation of new RNA transcription. Consequently,
acting proteins, which, in turn, are regulated either by serial determinations of specific RNA levels will allow
the amount of the transcriptionally active protein or by one to calculate the mRNA half life. Should this vary
their state of activation. between control and stimulated conditions, evidence is
garnered that a gene of interest is regulated at the level
Leucine zipper proteins A family of DNA-binding pro- of mRNA stability.
teins that require a dimeric state for activity and that
dimerize by virtue of an alpha helical region that con- Reporter genes In order to determine how a gene pro-
tains leucine at every seventh position. Because 3.4 moter or enhancer works in vitro, that genetic element
amino acids reside in each turn of an alpha helix, the is often linked to a gene for which a simple assay is
occurrence of leucine at every seventh position results readily available and whose regulation is not affected
in a strip of highly hydrophobic residues on one surface by post-transcriptional processes. Such reporter genes
of the alpha helix. Such a domain on one polypeptide include chloramphenicol acetyl transferase, β galactosi-
can intercollate with a similar domain on a second dase, and firefly luciferase. The first is the most com-
polypeptide, resulting in the formation of a stable monly used reporter; however, more recent studies have
homodimer or heterodimer. Examples of the leucine zip- emphasized the use of the latter two reporters, as these
per family include the proto-oncogenes c-jun and c-fos. are more sensitive to minimal changes in promoter or
enhancer activity.
Basic helix-loop-helix proteins These transcriptional
proteins are characterized by two alpha helical regions CAT (chloramphenicol acetyl transferase) The bac-
separated by a loop structure; this domain is involved in terial gene for chloramphenicol resistance, chloram-
protein dimerization. Examples of this family of tran- phenicol acetyl transferase (CAT) is commonly used as
scription factors include E12/E47 of the immunoglobu- a reporter gene for investigating physiologic gene regu-
β galactosidase The presence of β galactosidase activ- Ubiquitin A small molecular weight protein that can be
ity in the cytoplasm of transfected cells can be readily coupled to lysine residues of proteins targeted for de-
detected by its ability to convert a colorless substrate to struction. There are ubiquitin ligases and deubiquinases,
a blue-colored product. This is usually assayed using a which add or remove ubiquitin from proteins, usually in
fluorimeter. a fairly specific manner. Once polyubiquinated, proteins
are subject to destruction by the proteosome.
Luciferase This gene, which is the most recent reporter
gene to be used, has gained increasing acceptance be- Immunoglobulin somatic hypermutation Immunoglo-
cause of its ease of assay and extreme sensitivity. The bulin variable region gene sequences are further diver-
assay is based on the ability of the protein to undergo sified in mature B cells during clonal expansion that
chemiluminescence and transmit light, detected with a occurs following antigen stimulation. Mutations clustered
luminometer. within V regions typically involve nucleotide substitu-
tion, and less frequently small deletions or insertions.
Cyclins A group of proteins that vary in expression This event usually follows immunoglobulin class switch-
throughout the cell cycle. Once a threshold level is at- ing, which by itself does little to alter Ig specificity; only
tained, interaction with specific cellular kinases results the effector functions of the molecule are altered by the
in phosphorylation of critical components of the mitotic change from IgM to IgG, etc.
machinery. Several classes of cyclins (A through E) ex-
ist that regulate different aspects of the cell cycle (G0, VI. EXPRESSION OF RECOMBINANT PROTEINS
G1, S, G2, M). Altered expression of some cyclins is as-
sociated with hematologic malignancy, e.g., t(11;14) in In order to exploit the techniques of recombinant DNA
mantle cell lymphoma leads to over-expression of cyclin research, one must possess a system to manufacture the
D1, a G1 phase cyclin. protein of interest. After identifying the gene encoding
the protein and obtaining a cDNA representation of it
Cdk (cyclin-dependent kinase) A related group of cel- (“cloning”), the cDNA must be placed in a vector ca-
lular kinases, present in virtually all cells, that are regu- pable of driving high levels of RNA transcription in a
lated both positively and negatively by specific phos- host system capable of translating and appropriately
phorylation events and negatively by association with modifying the polypeptide to produce fully functional
other proteins, and are dependent on cyclins, present only protein. And just like obtaining a protein of interest from
during certain phases of the cell cycle (cdk1-activated natural sources, one must purify protein from the final
during G2/M phase, cdk2-G1/S phases, cdk4-G1/S phases, expression system. Because of the nature of the highly
cdk6-G1 phase, cdk7-throughout the cell cycle). engineered systems and high levels of expression, this
latter task is usually considerably easier using recombi-
CdkI (cdk inhibitors) Proteins that inhibit the cdks by nant methods than from natural sources. The methods
stoicheometric combination, arresting cells in G1 phase, used to generate expression vectors are described in Sec-
and include p27, p21 and the p16 Ink 4A family of pro- tion IV, but the methods to purify proteins are discussed
teins. The latter are implicated as tumor suppressor genes, in only a rudimentary way and are beyond the scope of
as their deficiency in mice leads to rapid cellular prolif- this glossary.
eration and a high rate of spontaneous tumor develop-
ment. Moreover, deficiency of p16 family members has Expression vector A plasmid that contains all of the
been associated with numerous types of human tumors, elements necessary to express an inserted cDNA in the
including a fraction of cases of B cell ALL and T cell host of interest. For a mammalian cell host, such a vec-
leukemia. tor typically contains a powerful promoter coupled to
an enhancer, a cloning site, and a polyadenylation sig-
Proteosome A large multiprotein complex designed to nal. In addition, several expression vectors also contain
digest proteins that have been targeted for destruction, a selectable marker gene such as DHFR or NeoR, which
usually based on the presence of multiple sites of aids in the generation of stable cell lines. The plasmid
Transposon Naturally occurring genetic elements that Pseudotyped virus These take advantage of the power-
are naturally easily removed and inserted into the ge- ful expression levels obtainable by murine retroviral
nome, allowing for the recombination of genetic seg- backbones, yet are packaged in an envelope that allows
ments, giving rise to genetic diversity. These same ele- docking and uptake by human target cells. An example
ments can be utilized for gene therapy. is the popular MFG vector that utilizes a murine leuke-
mia retroviral backbone and an amphotropic packaging
VIII. GENE THERAPY cell line to produce infectious particles.
Gene therapy takes many forms. To treat malignancy, it Episomal Episomal refers to gene therapy vectors that
may involve the insertion of an adjuvant substance (such remain free in the target cell without being taken into
as GM-CSF) into tumor cells to generate a tumor vac- the host genome.
cine, transfer of a gene that renders tumor cells suscep-
tible to eradication with an antitumor agent (e.g., herpes Positional variegation Refers to the observation that the
thymidine kinase), or insertion of a gene that makes by- site of vector integration into the genome often results
stander cells resistant to the effects of chemotherapy (e.g., in variable levels of gene expression.
MDR). For gene deficiencies, insertion of the wild type
Bcl-6 A zinc finger transcription factor, expression of ALK (anaplastic lymphoma kinase) A large propor-
which is altered in approximately one-third of diffuse B tion of Ki-1 positive lymphomas are characterized by a
large cell lymphomas as a consequence of 3q27 translo- t(2;5). The breakpoint involves nucleophosmin, a ubiq-
cations. Its target genes are unknown. uitously expressed gene, and ALK. The chimeric mRNA
and protein are thought to be responsible for transfor-
RAR (retinoic acid receptor) The retinoic acid recep- mation. ALK is a member of the insulin receptor family
tor is a member of the steroid hormone group of tran- of transmembrane receptor kinases, which is not nor-
scriptionally active proteins and contains a steroid hor- mally expressed in hematopoietic tissues; the fusion gene
mone-binding domain, a zinc finger DNA-binding do- is no longer membrane bound, which may underlie its
main, and a transcriptional activation domain. RAR is pathogenesis.
located at the t(15;17) present in the majority of cases of
acute promyelocytic leukemia. Its fusion partner in the Evi-1 A transcription factor whose rearrangement in
translocation is termed pml. Normally, RAR forms t(3;21) is implicated as contributing to MDS. Over-
heterodimers with members of the RXR family of tran- expression of evi-1 blocks differentiation in response to
scription factors. hematopoietic growth factors.
p53 Wild-type p53 is a sequence-specific DNA-binding ETO Located on chromosome 8, ETO is involved in t(8:21)
nuclear protein that acts to induce gene expression. Over- of AML type M2. Based on the presence of two zinc-fin-
all, the program of p53-activated genes is associated with ger motifs. ETO possibly encodes a transcription factor,
suppression of cell growth, consistent with our under- but its role in the pathogenesis of AML is unknown.
standing of the mechanisms of anti-oncogenes. Muta-
tions of p53 may not only inactivate its growth-suppres- AML-1 Located on chromosome 21, AML-1 is the fu-
sion function, but can actually generate a genetically sion partner of ETO in t(8:21). The gene is homologous
dominant, functional oncogene. Human tumors associ- to the runt gene of Drosophila and encodes a transcrip-
ated with p53 mutations include those of hematopoietic tion factor. Normal hemopoietic targets include the
tissues (e.g., 20% of myelomas), bladder, liver, brain, CD13, GM-CSF, MPO, IL-3, and the T cell antigen re-
breast, lung, and colon. It is likely the most frequently ceptor promoters. AML-1 binds as a heterodimer,
mutated gene in human cancer. partnered with CBFβ. It is unclear if its mechanism of
action is to enhance aberrant transcription or to blunt
ras This gene encodes a critical signalling intermediate transcription by acting in a dominant negative fashion.
involved in the response to multiple growth factors. There
are several related proteins (Ha-ras, Ki-ras, N-ras). N- CBFβ β Located on chromosome 16, CBFβ is one of the
ras and K-ras are mutated in many cancers, including fusion partners in the inv(16) associated with AML type
45% of myelomas and > 50% of CMML cases. Consti- M4Eo. As with AML-1, it is unclear whether the altered
tutive activation of ras can mimic chronic stimulation transcription factor enhances or blocks transcription.
by the corresponding lineage-specific growth factor.
MLL Located on chromosome 11, MLL (mixed lineage
Hox 11 A homeobox containing transcription factor dis- leukemia) is frequently altered in ALL, 1oAML, and es-
rupted by translocation to the T cell receptor locus [t(10;14)] pecially in AML secondary to the use of topoisomerase
in 10% of cases of T cell ALL/lymphoblastic lymphoma. II inhibitors. MLL is homologous to the trithorax gene
The Hox 11 gene is critical to the development of the spleen of Drosophila and displays many features of a transcrip-
but its role in hematopoiesis is unclear. tion factor and of a DNA methyl transferase.