1992 Shizuya PNAS

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

Proc. Natl. Acad. Sci.

USA
Vol. 89, pp. 8794-8797, September 1992
Biochemistry

Cloning and stable maintenance of 300-kilobase-pair fragments of


human DNA in Escherichia coli using an F-factor-based vector
(electoporaton/phyical mapping/human genome)
HIROAKI SHIZUYA*, BRUCE BIRREN, UNG-JIN KIM, VALERIA MANCINO, TATIANA SLEPAK,
YOSHIAKI TACHIIRI, AND MELVIN SIMONt
Division of Biology, 147-75, California Institute of Technology, Pasadena, CA 91125
Contributed by Melvin Simon, June 30, 1992

ABSTRACT A bacterial coning system for mapping and human DNA inserts >120 kb have not been cloned and
analysis of complex genomes has been developed. The BAC characterized. The BAC system allows us to clone large
system (for bacterial artificial chromosome) is based on Esch- DNA from a variety of complex genomic sources into bac-
erichia colt and its single-copy plasmid F factor. It is capable of teria, where the DNA is stable, easy to manipulate, and
maintaining human genomic DNA fragments of >300 kilobase represents a single foreign DNA source.
pai-. Individual clones of human DNA appear to be main-
tained with a high degree of structural stability in the host, even MATERIALS AND METHODS
after 100 generations of serial growth. Because of high cloning
efficiency, easy manipulation of the cloned DNA, and stable Construction of BAC Vector. A 97-base-pair (bp) synthetic
maintenance of inserted DNA, the BAC system may facilitate oligonucleotide containing nucleotide sequences for 17/SP6
construction of DNA libraries of complex genomes with fuller promoters, two cloning sites (HindIII and BamHIf), and sites
representation and subsequent rapid analysis of complex ge- for rare-cutter restriction enzymes (Not I, Eag I, Xma I, Sma
nomic structure. I, Bgl I, and Sfi I) were inserted into the single Sal I site of
pMBO131 (9). A 400-bp Sac I fragment of bacteriophage A
There is currently underway an intense effort to construct a carrying the cosN site was then inserted into the Sac I site of
high-resolution physical map of each of the human chromo- the plasmid. A 42-bp oligonucleotide having a bacteriophage
somes. Eventually, these maps will be composed of over- P1 loxP site was inserted at a Cla I site between cosN and the
lapping fragments of human DNA and will allow the direct cloning site. The resulting plasmid is called pBAC108L.
acquisition of DNA fragments that correspond to specific Preparation of High Molecular Weight DNA. We prepared
genes. Completion of the physical map requires the avail- human DNA for cloning in agarose from MOLT-4 cells (a
ability of comprehensive libraries of DNA clones in appro- leukemic cell line). Cells were harvested by low-speed cen-
priate vectors. Furthermore, the accuracy and efficiency of trifugation and suspended in phosphate-buffered saline (PBS)
physical mapping increase progressively with the size of the to a final cell density of 10V per ml. An equal volume of
clone firgments in these libraries. Thus, the construction of liquefied 1% low-melting agarose in PBS was mixed with the
libraries using yeast artificial chromosomes (YACs), which cell suspension and then the whole agarose/cell mixture was
permit cloning of fragments of ;500 kilobase pairs (kb), poured into a 7 mm x 7 mm plastic mold as described by
represents a fundamental advance in our ability to generate Birren and Lai (10). After the mold was placed at 0C for 10
physical maps that order DNA over multi-megabase dis- min to solidify, the agarose block was pushed out into 50 ml
tances (1). However, some difficulties have been encoun- of digestion buffer (1% sodium N-lauroylsarcosine/0.5 M
tered with the manipulation of YAC libraries (2-4). Thus, for EDTA, pH 8.0, with proteinase K at 2 mg/ml). Cells were
example, in various libraries a fraction of the clones result digested at 500C for 2 days with one change of an additional
50 ml of lysis buffer, and the digest was dialyzed against four
from co-cloning events; i.e., they include in a single clone changes of 50 ml of 50 mM EDTA, pH 8.0.
noncontiguous DNA fiagments. We describe here a bacterial Partial Digestion of DNA. For HindIII partial digestion,
cloning system that we refer to as BACs, bacterial artificial agarose plugs were first dialyzed extensively against 10 mM
chromosomes. This system may provide a supplement and Tris.HCl/1 mM EDTA, pH 8.0, with 50 mM NaCl, and then
alternative to the YAC system for some applications requir- equilibrated with HindIII digestion buffer (50 mM NaCl/10
ing cloning of large fragments. The BAC system is based on mM Tris'HCl, pH 8.0/10 mM MgCl2) containing nuclease-
the well-studied Escherichia coli F factor. Replication of the free bovine serum albumin (100 jug/ml) at 40C for 30 min.
F factor in E. coli is strictly controlled (5). The F plasmid is Serially diluted HindUl (1-10 units) was added to the plugs
maintained in low copy number (one or two copies per cell), and incubated at 370C for 30 min with gentle shaking. Optimal
thus reducing the potential for recombination between DNA digestion conditions were determined by analyzing the size
fragments carried by the plasmid. Furthermore, F factors range of digested DNA by pulsed-field gel electrophoresis.
carrying inserted bacterial DNA are capable of maintaining Cloning Human DNA into pBAC. DNA ranging from 100 to
fragments as large as 1 megabase pair, suggesting that the F 300 kb was excised from low-melting agarose pulsed-field
factor is suitable for cloning of large DNA fragments (6). gels. The agarose slice was melted at 650C for 5 min and
Other bacterial systems for cloning large DNA have been digested with GELase (Epicentre, Madison, WI) by adding 1
developed. For example, the system based on bacteriophage unit ofenzyme, mixing gently, and incubating at 40"C for 1 hr.
P1 is in use (7). However, the P1 vector has a maximum One to three hundred nanograms of the human DNA was
cloning capacity of 100 kb. A bacterial system based on F
factors has been reported (8). However, in this system,
Abbreviations: BAC, bacterial artificial chromosome; YAC, yeast
artificial chromosome.
The publication costs of this article were defrayed in part by page charge *Permanent address: Department of Biological Sciences, University
payment. This article must therefore be hereby marked "advertisement" of Southern California, Los Angeles, CA 90089.
in accordance with 18 U.S.C. §1734 solely to indicate this fact. tTo whom reprint requests should be addressed.
8794
Biochemistry: Shizuya et al. Proc. Natl. Acad. Sci. USA 89 (1992) 8795
ligated to HindIII-digested pBAC108L (molar ratio of 10 to 1 Sfi I) for potential excision of the inserts. The cloning site is
in pBAC excess) with 400 units of T4 DNA ligase at 16WC flanked by T7 and SP6 promoters for generating RNA probes
overnight. To prevent religation of the vector, HindIII-cut for chromosome walking, and for DNA sequencing of the
pBAC108L was treated with HK phosphatase (Epicentre). inserted segment at the vector-insert junction. The cosN site
Transformation of E. coli DH1OB cells (BRL) was carried out provides a fixed position for specific cleavage with the
by electroporation. Competent cells were prepared and elec- bacteriophage A terminase (14). The loxP site can be utilized
troporation was performed according to the manufacturer's similarly. In this case P1 Cre protein catalyzes the cleavage
instructions (Bio-Rad). After electroporation, cells were in- reaction in the presence of the loxP oligonucleotide (ref. 15
cubated at 370C with gentle shaking for 45 min prior to and unpublished result). These sites (cosN and loxP) allow
spreading on LB plates containing chloramphenicol at 12.5 the convenient generation of ends that can be used for
pg/ml. Plates were incubated at 370C overnight. To identify restriction-site mapping to arrange the clones in an ordered
BAC transformants carrying human inserts, colonies were array. Restriction maps of the individual clones can be
lifted onto nylon filters (Hybond-N, Amersham) for hybrid- determined by indirect end-labeling and subsequent partial
ization with radioactively labeled total human DNA. digestion (14, 16, 17).
Cloning Human DNA in the BAC Vector. The ligated BAC
DNA molecules are expected to be much larger than those
RESULTS traditionally used in bacterial cloning. As one expects, trans-
The BAC Vector. Many bacterial vectors based on high- to formation becomes increasingly more difficult as the size of
medium-copy-number replicons often show structural insta- DNA molecules increases (unpublished observation). We
bility of inserts, deleting or rearranging portions of the cloned have tested several procedures to transform large circular
DNA, particularly DNA inserts of eukaryotic organisms that F'lac DNA (180 kb) into different E. coli strains and found
may contain families of repeated sequences (11-13). There- that standard electroporation gives high-efficiency transfor-
fore, it is difficult to clone and maintain large intact DNA in mation, about 106 transformants per ,ug of DNA (data not
bacteria. However, the F factor not only codes for genes that shown). The frequency of transformation with F'lac DNA is
about 1/40th that found with pBAC plasmid with no insert.
are essential to regulate its own replication but also controls Among E. coli strains commonly used for electroporation, we
its copy number. The regulatory genes include oriS, repE, found a few strains that are capable of transformation by
parA, and parB. The oriS and repE genes mediate the F'lac DNA, and we chose DH1OB as host. The exact nature
unidirectional replication of the F factor while parA and parB of the genetic basis of the competence is not known.
maintain copy number at a level of one or two per E. coli Typically, when 2 1Ld of the ligation mixture containing 50
genome. The BAC vector (pBAG) incorporates these essen- ng of input human DNA was transformed into highly con-
tial genes as well as a chloramphenicol resistance marker and centrated competent cells (108), 100-1000 colonies appeared
a cloning segment (Fig. 1). The cloning segment includes (i) on the plate. Colonies appearing on selective plates were
the bacteriophage A cosN and P1 loxP sites, (ii) two cloning hybridized with whole human DNA to identify clones with
sites (HindIII and BamHI), and (iii) several C+G-rich re- human inserts. Subsequent analysis indicated high variability
striction enzyme sites (Not I, Eag I, Xma I, Sma I, Bgl I, and with respect to insert frequency from experiment to experi-
ment; 10-50% of the transformants carried human inserts,
depending upon the batch of the vector and insert DNA used.
XmaI / SmaI Analysis of the Human Inserts. Although the amount of
NotI / BgII / SfE1 DNA obtained from bacterial cells carrying the single-copy
plasmid is expected to be low, routine minipreps from 1.5 ml
of overnight culture yielded enough DNA for several exper-
iments. Twenty randomly picked clones having human in-
,/, serts were analyzed by pulsed-field gel electrophoresis after
Not I digestion (Fig. 2A). All lanes contain the 6.7-kb vector
band and human insert DNA of various molecular weights.
The size distribution of the inserts in Fig. 2A ranges from 10
to 215 kb, with an average size of 100 kb. Fig. 2C shows an
additional five large clones ranging in size up to 300 kb, which
were obtained from separate transformations. The DNA in
the gel of Fig. 2A was transferred to a nylon filter and probed
with total human DNA. As expected, only the insert hybrid-
ized to the human DNA probe and no additional minor bands
appeared even after prolonged exposure (Fig. 2B). Deletions
occurring within the BACs in the E. coli host would yield
additional minor bands on restriction digestion. These were
not detected by ethidium bromide staining or by the more
sensitive hybridization assay. This suggests that human in-
serts on the BAC vector are structurally quite stable within
the resolution of the pulsed-field gel electrophoresis analysis.
To further examine the structural stability of such large
regions of human DNA cloned in E. coli, the restriction
patterns of DNA taken before and after long-term culture
were compared. Overnight cultures of two clones (125-kb and
repE
175-kb inserts) were diluted and used to inoculate fresh broth
for overnight growth. This process was repeated for 5 days
FIG. 1. Construction of pBAC vector. The plasmid is based on a (about 100 generations). The sizes of the restriction fragments
mini-F plasmid, pMB0131 (9). A detailed description of the con- produced by digestion of the DNA from the final cultures
struction is provided in Materials and Methods. CMR, chloramphen- (Nhe I, Sac I, and Xho I) were compared with those from the
icol resistance marker. original day 1 DNA. No visible difference between the two
87% Biochemistry: Shizuya et al. Proc. NatL Acad. Sci. USA 89 (1992)

A B C

--
3315
291.0
242.5
194.0
-Wab _
145.5

FIG. 2. Analysis of human BAC clones by pulsed-field gel electrophoresis. pBAC with inserts was prepared from small cultures (1.5 ml) of
transformed colonies by a standard alkaline lysis procedure. (A) Ethidium bromide-stained BAC DNA analyzed by pulsed-field gel
electrophoresis. DNA minipreparations were digested with Not I. Separation of the digested DNA was carried out on a Bio-Rad CHEF Mapper'
apparatus for 20 hr at a field strength of 6 V/cm in a 1% agarose gel in 45 mM Tris/45 mM boric acid/1 mM EDTA at 16'C with a linear pulse
time ramping from 5 to 15 sec. (B) Southern blot showing hybridization to radioactive total human DNA probe. DNA from A was transferred
to a nylon filter (Hybond-N, Amersham). DNA hybridization was performed at 42TC in 50%o formamide for 16 hr and the filter was washed three
times with 0.3 M NaCl/0.03 M sodium citrate, pH 7.0, at 470C. (C) Sizing of five large BAC DNA. For separation of DNA fragments of 50-500
kb, switch times are ramped from 7 sec to 45 sec over 29 hr.
samples after digestion and pulsed-field gel electrophoresis the case of YACs, the size of the inserts can be larger than
was observed (Fig. 3). Moreover, Not I-digested DNA from 300 kb. However, a high percentage of YAC clones, partic-
12 individual colonies derived from the day 5 culture showed ularly higher molecular weight inserts, represent chimeric
no visible differences when compared with each other or with molecules. These chimeric clones may hamper the progress
DNA from the same number of colonies from the day 1 of mapping and analysis of chromosomes. To evaluate the
culture (data not shown). Comparable experiments with frequency of co-cloning events among the BAC clones, we
cosmid clones (35- to 45-kb inserts) in the Super-cos or have employed fluorescence in situ hybridization with BAC
Lawrist vectors showed that up to 40% of the clones rear- DNA minipreps. We have thus far analyzed 28 randomly
ranged after a shorter period of serial growth (13). picked human BAC clones (all between 200 and 300 kb) and
We have obtained BAC clones larger than 300 kb; how- identified only one clone that potentially represents a chi-
ever, this is probably not the upper limit for BAC cloning. In meric clone (data not shown). Comparable experiments with
YAC clones carrying large human DNA inserts showed that
BJ25 BJ6 a substantial number of clones mapped to multiple sites,
I-I suggesting that many of the clones were chimeric (2). The
co-cloning frequency in the BAC system appears to be
2 ~ cu
XO
4--
)
cm
XO
-c-
significantly lower than that in the YAC. Another possible
source of chimeric YAC clones is that multiple artificial
-Z

chromosomes may be found in a single yeast cell (3, 4). In the


BAC system, the two genesparA and parB are involved in the
exclusion of extraneous F factors (two BACs in a single cell).
In one experiment we analyzed more than 300 clones and
have not found multiple BACs in a single cell.,

DISCUSSION
Several lines of evidence suggest the utility of the BAC vector
as a general cloning vehicle to construct comprehensive
libraries of higher organisms. (i) Large plasmids can be
transfected to E. coli by electroporation. The transformation
of bacteria by electroporation is 10-100 times more efficient
than yeast spheroplast transformation. This reduces the
amount of DNA necessary for library construction. This
feature is particularly suited to creating specialized libraries
where the source DNA is limited. (ii) Traditional bacterial
colony lifts and hybridization methods can be used for library
screening even though only a single copy of pBAC plasmid
exists in cells. (iii) Unlike YACs, which are linear, BACs with,
FIG. 3. Comparison of restriction fragments of two clones before inserts, like large F' factors, exist as supercoiled circular
and after 100 generations. Two clones carrying BACs of 125 kb plasmids in E. coli. This permits easy isolation and manipu-
(BJ25) and 175 kb (BL6) were grown in 5 ml of LB containing lation of the large DNA in solution with minimal breaking,,
chloramphenicol (12.5 ,ug/ml) and grown at 37°C overnight (day 1). whereas yeast chromosomes are more difficult to isolate
Cultures were diluted 100-fold and 0.1 ml of this dilution was intact since the linear DNA is more susceptible to shear.
inoculated into 500 ml of LB with chloramphenicol (12.5 Pg/ml) and Furthermore, the ability to visualize the cloned DNA by
incubated overnight. Serial cultures were made in this way for an
additional 3 days. Plasmid DNAs were prepared from day 1 and day ethidium bromide staining in pulsed-field gels vastly in-
5 cultures, digested with three different restriction enzymes (Nhe I, creases the speed of structural analysis of the clones. (iv)
Sac I, and Xho I), and analyzed by pulsed-field gel electrophoresis DNA from minipreps is easily reintroduced into bacterial
(day 1, left lane; day 5, right lane). cells by the same electroporation procedure. (v) A set of
Biochemistry: Shizuya et al. Proc. Natl. Acad. Sci. USA 89 (1992) 8797
genes for F' maintenance is sufficient to sustain large foreign Recently, we have begun construction of a BAC library of
DNA in the BACs. (vi) A low frequency of co-cloning in the total human DNA. In addition we have prepared a chromo-
BACs has significant advantage in physical mapping by some 22-specific fosmid library with 7-fold redundancy (un-
"bottom-up" approaches. published results). The library together with fosmids will
One of the important aspects in any cloning system is the augment physical chromosome mapping with YACs and with
genetic stability of cloned DNA. Results showing a stable conventional cosmids.
structure for human DNA in the BAC vector suggests that the
cloned DNA should represent a stable source of specific We thank Drs. M. O'Connor and W. Bender for providing the
fragments. However, more experiments including long-range pMBO131 plasmid prior to publication. This work was supported by
a grant from the Department of Energy, DE-FG03-89ER60891.
restriction-site mapping with BACs will be required to dem-
onstrate that cloned DNA fragments are indeed faithful 1. Burke, D. T., Carle, G. F. & Olson, M. V. (1987) Science 236,
copies of the source DNA. Some clones in any genomic 806-812.
library may be expected to show instability in a particular 2. Green, E. D., Riethman, H. C., Dutchik, J. E. & Olson, M. V.
host because of the recombinogenic activity of certain spe- (1991) Genomics 11, 658-669.
cific DNA sequences. A bacterial system may offer advan- 3. Wada, M., Little, R. D., Abidi, F., Porta, G., Labella, T.,
tages in this respect because the pathways of genetic recom- Cooper, T., Della Valle, G., D'Urso, M. & Schlessinger, D.
(1990) Am. J. Hum. Genet. 46, 95-106.
bination in E. coli have been described in detail, and the use 4. Bellis, M., Gerard, A., Charlieu, J. P., Marcais, B., Brun,
of appropriate bacterial mutants may reduce instability prob- M. E., Viegas-Pequignot, E., Carter, D. A. & Roizes, G. (1991)
lems. For this purpose we have constructed several candidate DNA Cell Biol. 10, 301-310.
repository strains that lack many recombination functions. A 5. Willetts, N. & Skurray, R. (1987) in Escherichia coli and
set of new strains are based on DH1OB. They are capable of Salmonella typhimurium: Cellular and Molecular Biology, ed.
in chief Neidhardt, F. C. (Am. Soc. Microbiol., Washington),
transfection of the large DNA and deficient in host-controlled Vol. 2, pp. 1110-1133.
restriction (hsdMRS), methylation-sensitive restriction 6. Low, K. B. (1987) in Escherichia coli and Salmonella typhi-
(mcrABC, mrr), and various types of general recombination murium: Cellular and Molecular Biology, ed. in chief Neid-
systems (recBC, sbcBC). Among the strains constructed are hardt, F. C. (Am. Soc. Microbiol., Washington), Vol. 2, pp.
HS986, having a recJ mutation, and HS979, having a recA 1134-1137.
deletion mutation, as well as the mutations described above. 7. Sternberg, N. (1990) Proc. Natl. Acad. Sci. USA 87, 103-107.
8. Hosoda, F., Nishimura, S., Uchida, H. & Ohki, M. (1990)
In any cloning system, analysis and fingerprinting of many Nucleic Acids Res. 18, 3863-3869.
clones with large inserts require rapid and convenient meth- 9. O'Connor, M., Pfeifer, M. & Bender, W. (1989) Science 244,
ods. One fingerprinting strategy that takes advantage of the 1307-1312.
vector involves the use of A terminase to generate cohesive 10. Birren, B. & Lai, E., in Pulsed Field Gel Electrophoresis: A
ends at the cosN site, which may then be used as reference Practical Guide (Academic, San Diego), in press.
points to establish a restriction map by partial digestion. The 11. Wyman, A. R., Wolfe, L. B. & Botstein, D. (1985) Proc. Natl.
Acad. Sci. USA 82, 2880-2884.
restriction pattern of each clone can be compared and used 12. Ishiura, M., Hazumi, N., Koide, T., Uchida, T. & Okada, Y. J.
to recognize overlapping clones. Independent labeling of (1989) J. Bacteriol. 171, 1068-1074.
both allows us to carry out the fingerprinting analysis with 13. Kim, U.-J., Shizuya, H., de Jong, P. J., Birren, B. & Simon,
greater speed and accuracy. The other benefit of having cosN M. I. (1992) Nucleic Acids Res. 20, 1083-1085.
on the vector is that the BAC vector can be used as a highly 14. Rackwitz, H. R., Zehetner, G., Murialdo, H., Delius, H., Chai,
efficient cloning vector for DNA of 40-kb average size after J. H., Poustka, A., Frischauf, A.-M. & Lehrach, H. (1985)
packaging and infection as in conventional cosmid cloning. Gene 40, 259-266.
15. Abremski, K., Hoess, R. & Sternberg, R. (1983) Cell 32,
Extensive libraries of fosmid (F-based cosmid) clones are 1301-1311.
readily constructed and offer increased insert stability (13). 16. Rackwitz, H. R., Zehetner, G., Frischauf, A.-M. & Lehrach,
The clones can be easily analyzed and used in physical H. (1984) Gene 30, 195-200.
mapping. 17. Kohara, Y., Akiyama, K. & Isono, K. (1987) Cell 50, 495-508.

You might also like