Professional Documents
Culture Documents
Marshall and Berbee 2010 MBE
Marshall and Berbee 2010 MBE
Abstract
Research article
We use population genetics to detect the molecular footprint of a sexual cycle, of a haploid vegetative state, and of lack of
host specificity in Pseudoperkinsus tapetis, a marine unicellular relative of the animals. Prior to this study, complete life
cycles were not known for any of the unicellular lineages sharing common ancestry with multicellular animals and fungi.
We established the first collection of conspecific cultures of any member from the unicellular opisthokont lineage
ichthyosporea, isolating 126 cultures of P. tapetis from guts of marine invertebrates ranging from clams to sea cucumbers.
© The Author 2010. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please
e-mail: journals.permissions@oxfordjournals.org
2014 Mol. Biol. Evol. 27(9):2014–2026. 2010 doi:10.1093/molbev/msq078 Advance Access publication April 1, 2010
Cryptic Sex and Life History of Pseudoperkinsus · doi:10.1093/molbev/msq078 MBE
determining ploidy has been analysis of the frequency of het-
erozygotes in microsatellite data sampled from populations
(e.g., Oliveira et al. 1998; Brondani et al. 2000; Rynearson and
Armbrust 2000; van der Verde et al. 2001; Weeks et al. 2001).
The lack of heterozygotes from microsatellite data showed,
for example, that a population of Symbiodinium dinoflagel-
lates was haploid (Santos and Coffroth 2003). A similar prin-
ciple can be applied using sequence data from a population
with polymorphisms. Direct sequencing of polymerase chain
reaction (PCR) products from heterozygous diploids produ-
ces sequence chromatograms with double peaks at the seg-
regating sites (Clark 1990). The multicellular animals are
mainly diploid, and the higher fungi are mostly haploid or
dikaryotic. To assess chromosome copy numbers in P. tapetis,
we carefully screened the segregating sites of our sequence
chromatograms for double peaks, indicative of heterozy-
gotes and diploidy.
FIG. 1. Pseudoperkinsus tapetis isolation sites from Vancouver Island,
2015
Marshall and Berbee · doi:10.1093/molbev/msq078 MBE
Table 1. Collection Location, Animal Host and Haplotype for the tion of a 150 ml aliquot of 0.5 l of water surrounding three
34 Sequenced Pseudoperkinsus tapetis Isolates. captive oysters. Ichthyosporean colonies were white and
Animal Host Haplotype raised and consisted of large round cells. We repeatedly
Location (number collected) Numbera Isolate IDb streaked each of the individual colonies from the starting
East 1 Mussel (5) 1* GM8 plates onto fresh media until they were axenic. For isolates
East 2 Mussel (5)c 1* MB4 CG2 and LE7, we observed asexual reproduction and stained
3 BM6, MB5, MB18
4 BM33
the nuclei with 4#,6-diamidino-2-phenylindole (DAPI) ac-
5* BM19 cording to Marshall et al. (2008). These strains are available
6 MB29 at the American Type Culture Collection (ATCC) under
12* MMB5, MMB3, MB13 accession numbers PRA-281/282.
13* BM17
14 MB16
Varnish clam (5)c 5* NO32
DNA Extraction, PCR Amplification, and
10 ON1M Sequencing
12* NO6, ON8, ON50M We extracted DNA from approximately 50–200 ll of cells
13* ON12 scraped from plates with confluent growth using a DNeasy
16 NO16 Plant Mini Kit (QIAGEN Inc., Mississauga, Ontario, Canada).
East 3 Manilla clam (5)d 2 PV3, PV38, VP2
3* PV25, VP7
We amplified and sequenced the SSU, internal transcribed
5* PV23 spacers ITS1 and ITS2 and the 5.8S ribosomal subunit rDNA
2016
Cryptic Sex and Life History of Pseudoperkinsus · doi:10.1093/molbev/msq078 MBE
and were excluded from this study because they were at
most 88.4% identical to P. tapetis over a 725-bp region of
overlap in their ITS regions. These levels of identity were
comparable with the 5.1% interspecific versus 0.51% intra-
specific ITS divergence within a data set of Penicillium
species (Seifert and Samson 2007). Of our 126 putatively
conspecific isolates, we chose 34 isolates for sequencing
of the EFL and HSP70 (table 1). These isolates represented
a selection from all eight animal hosts (table 1 and supple-
mentary table S1, Supplementary Material online). Because
inbreeding can result in elevated levels of homozygosity, we
increased the number of sequences from the EFL locus to 52.
Genetic Analysis
We aligned sequences in ClustalX v1.8 (Thompson et al.
1997) and edited them in MacClade 4.08 (Maddison WP
and Maddison DR 2000). We calculated pairwise nucleotide
diversity, p, and theta per site, hw (Watterson 1975), in
2017
Marshall and Berbee · doi:10.1093/molbev/msq078 MBE
FIG. 3. Polymorphic sites from the amplified regions of the EFL and HSP70 loci shown for each of 24 individuals or unique haplotypes per host
collection. Each individual was assigned 1 of 18 unique haplotypes defined after the EFL and HSP70 sequences were concatenated. Horizontal lines
separate isolates based on their combined haplotype designation. A small ‘‘g’’ denotes a gap, and each gap was coded as one character regardless of
its length. The noncoding region of the HSP70 is shaded in grey. The polymorphic region used to calculate the expected heterozygosity of the EFL is
outlined by a box. Balanced sites were defined as any site where polymorphisms were present in at least three individuals and are marked by an
asterisk above the consensus line. Allele and haplotype numbers correspond to figure 2. Specific collection locations are listed in table S1.
Table 2. Significant (P , 0.05) Indexes of Association (IA) Between HSP70 and EFL Alleles Were Evident When All Individual Isolates Were
Included, But Not When Data Sets Consisted of Unique Haplotypes or Location Corrected Haplotypes.
Allele Definitions Isolates Included Number of Isolates IA P
All sites considered All individuals 24 0.1814 0.015
Alleles reduced by one variable site (fig. 2) All individuals 24 0.1482 0.025
Alleles defined by balanced sites (fig. 2) All individuals 24 0.1328 0.036
All sites considered Location corrected 21 0.0717 0.23
All sites considered Unique haplotypes 18 20.0736 1.0
All sites considered East side individuals 18 0.1949 0.034
All sites considered East side unique haplotypes 12 20.1277 1.0
2018
Cryptic Sex and Life History of Pseudoperkinsus · doi:10.1093/molbev/msq078 MBE
separate chromosomes. We tested for incongruent histories The population recombination parameter (q) is the mea-
between the HSP70 and the EFL using the incongruence length sure of overall covariation of nucleotides at different sites,
difference (ILD) test (Farris et al. 1994) for the data set of 24 and (rr) is a compound value based on the observed num-
individuals. The ILD tests a null hypothesis of clonality by com- ber of chiasmata in Morgans per kilobase (M/kb) during
paring the summed tree length of the observed data with the meiotic division or per meiosis recombination rate (r) mul-
sums after random partitioning. We generated 1,000 trees in tiplied by the frequency of sex (r). We used the HSP70 intron
PAUP* using heuristic searches with random sequence addi- to calculate hw to avoid the effects of selection and codon use
tion and TBR branch swapping limited to 106 rearrangements. bias. We estimated the per meiosis recombination rate (r) as
Our cut-off for significance was P , 0.01, and we excluded in- being between 0.000017 and 0.0048 M/kb based on obser-
variant characters (Cunningham 1997). vations from unicellular eukaryotes (Awadella 2003; Lynch
To map the history of recombination between the 2006). An r value of 103 M/kb corresponds to roughly 1
two loci, we made recombination networks (Huson and chiasma for a 1 Mb chromosome. We did not account for
Kloepper 2005) for the EFL, HSP70, and both regions com- inbreeding nor consider the series of mitotic divisions be-
bined in Splits Tree4 v4.10 using the data set of 24 individ- tween endospore growth and release of new daughter cells.
uals (Huson 1998; Huson and Bryant 2006). To minimize Because we had so few isolates from our west side sites, we
homoplasies and keep the number of segregating sites were unable to confidently test for population connectivity.
within each locus comparable, we only used sequence from However, alleles from both loci were shared by isolates from
the second exon of HSP70 (fig. 3) when the loci were com- both sides of the island, and we assumed that our 24 indi-
2019
Marshall and Berbee · doi:10.1093/molbev/msq078 MBE
heterozygotes at a chosen site within the EFL (fig. 3) should
have been 56.9% based on the frequencies of the alleles in
the individual data set. However, the observed frequency of
heterozygotes among the 52 isolates sequenced for the EFL
was zero. There were also no double peaks at any of the
polymorphic sites within the 34 isolates sequenced for
the HSP70. We therefore conducted all further analyses un-
der the premise that P. tapetis was haploid.
We found six unique haplotypes, one time each, from the
west side of Vancouver Island, and all the other isolates came
from four host collections from three beaches within a 2 km
range on the east side of the island (table 1 and fig. 1). The 12
different haplotypes found on the east side represented
a minimum of 18 independent host colonizations. The num-
ber was a minimum because it was calculated by ignoring
repeated cultures with the same haplotype from the same
host collection under the assumption that genetically iden-
tical cells originated and spread from a single initial coloni-
2020
Cryptic Sex and Life History of Pseudoperkinsus · doi:10.1093/molbev/msq078 MBE
FIG. 5. Inter- and intralocus recombination reconstructed using Splits Tree4. Reticulations representing potential recombination events are
the tree lengths of the combined data sets were not addi- the east side and the intergenic region of the HSP70 had
tive. The C.I., R.I., and lengths calculated in PAUP* for each higher diversity than the degenerate sites from the coding
locus were 0.94, 0.99, and 68 for the HSP70 and 1.0, 1.0, and regions (table 4). Using intergenic and degenerate sites, the
20 for the EFL, respectively. The values for the combined Ne was on the order of 106 to 107 depending on the mutation
data sets were C.I. 5 0.8077, R.I. 5 0.9373, and length 5 rate (table 4).
104. When the two loci were analyzed separately, the ob- Frequency of Sex
served number of steps exceeded the minimum number of Ratios of sexual to asexual generations or frequency of sex,
steps only by 1 (EFL) or 4 (HSP70), but when the loci were r, ranged between 1 sexual cycle for 154–495,000 asexual or
combined, the tree was 21 steps longer than the minimum mitotic cycles with a median value of approximately once
and the ILD test was significant (P 5 0.001). Similarly, the for every 22,700 asexual cycles when q equalled 1 (fig. 6).
combined network for the coding region of the HSP70 and Median values were based on the two less extreme esti-
the EFL had several reticulations (fig. 5a), suggestive of re- mates. Our estimate of q was based on a relatively short
combination. A reticulation was detected when a fragment fragment of DNA, and q can vary dramatically between loci
of the second exon of the HSP70 was analyzed alone (fig. (Awadella 2003) for reasons including chromosomal posi-
5a), although without significant support from the PHI test tion. When we repeated our calculations using a hypothet-
(P 5 0.2), whereas the EFL had no reticulations. The PHI ical high q of 50, the frequency of sex ranged between 1 in 3
test for the combined data was significant (P , 0.01). and 1 in 9,910 asexual generations, with a median value of 1
Reticulations and intralocus recombination were evi-
dent in the entire sequenced region of the HSP70 (PHI:
P , 0.01) (fig. 5b). Both Bootscan and LARD detected three
significant recombination events (P 0.05) but neither Table 3. Estimates of Effective Population Size Times the
could confidently distinguish daughter sequences from Frequency of Sex (Ner)a Based on Several Estimates of Population
the minor parent or identify breakpoints. Both methods Recombination Parameter qb Calculated for the Data Set
suggested OI22B as a major parent, BM6 as a minor parent, Including All 24 Individuals.
and NO6 and CRG9 were two potential daughters. A third Effective Population Size 3
scenario was BM6 as a major parent, OI22B as a minor Frequency of Sex (Nes)a
parent, and LE7 a daughter. Method rb r/uw rb 5 0.000017 rb 5 0.0048
Effective Population Size Using hw Wakeley (1997) 0.285 0.0136 8,380 30
Rm/pairwise 1 0.044 29,400 104
Estimates of q for the HSP70 ranged between 0.285 and 1.6 Hudson (1987) 1.6 0.070 47,100 167
(table 3). Estimates of nucleotide diversity (p) and hw were
NOTE.—The q/hwc ratio was calculated using hw from the HSP70 intron and Ner
higher for the HSP70 than the EFL, although this difference based the upper and lower bounds of per meiosis recombination rate, rb.
was minimized when only 4-fold degenerate sites were con- a
(Ner) is the effective population size, Ne, times the frequency of sex, r. (Ner) is
sidered (data not shown). hw ranged between 0.0126 and calculated based on Ner 5 q/2r.
b
q and r are expressed in Morgans per kilobase (M/kb).
0.0227 using degenerate or intergenic sites (table 4). For c
hw is Watterson’s (1975) estimator of nucleotide diversity expressed per
the EFL, nucleotide diversity was higher on the west side than kilobase.
2021
Marshall and Berbee · doi:10.1093/molbev/msq078 MBE
Table 4. Effective Population Sizes (Ne) Based on Intergenic and 4-fold Degenerate Sites Given an Upper and Lower Bound for Mutation Rate l.
Locus and Sites Used (number of nucleotides) uw Ne Given m 5 0.22 3 1029 Ne Given m 5 2.5 3 1029
HSP70 coding (74) 0.0179 40,680,000 3,580,000
HSP70 intron (524) 0.0227 51,590,000 4,540,000
EFL coding, both sides (155) 0.0126 29,090,000 2,560,000
EFL coding, east side Vancouver Island (155) 0.0110 25,000,000 2,200,000
EFL coding, west side Vancouver Island (155) 0.0176 31,360,000 2,760,000
NOTE.—Nucleotide diversity (hw) was calculated from all 24 individuals.
sexual generation for 418 asexual generations. A data set tapetis that could not be observed in the field or in culture.
based on the expected number of infective individuals The complete absence of heterozygotes at polymorphic
as estimated with a Poisson correction for repeated infec- sites suggested that our P. tapetis population lives in its in-
tions by the same haplotype increased the number of in- vertebrate host primarily as a haploid and that it has a hap-
dividuals by 7 to 31. Because the expected data set included loid asexual cycle. If life cycles in other ichthyosporea are
a higher proportion of identical isolates, the resulting esti- predominantly haploid, Hibbits’ (1978) observations of
mates of the frequency of sex were slightly lower, with a me- cells of E. sexuale undergoing nuclear fusion could be inter-
dian value near 1 sexual cycle for 23,900 asexual cycles preted as fusing haploid cells in the process of forming a zy-
2022
Cryptic Sex and Life History of Pseudoperkinsus · doi:10.1093/molbev/msq078 MBE
we were unable to reject a null hypothesis of recombina- rounds of mitosis per sporangium, the ratio of frequency
tion. The EFL and HSP70 had significantly incongruent his- of meiotic divisions to asexual endospore-to-endospore
tories, an expected consequence of a meiotic stage with generations, would be six times the meiosis:mitosis ratio.
independent chromosome assortment (or recombination For example, in figure 6, with q 5 1.0, the lowest estimate
between loci on the same chromosome). Evidence of intra- of meiosis:mitosis was 1:154, but the ratio of meiosis:asex-
locus recombination in the HSP70 provided more positive ual endospore-to-endospore generations would be 1:25.7.
support for a sexual cycle in our haploid organism where This last ratio would imply that P. tapetis undergoes at least
meiosis would have offered the only opportunity for cross- 25 rounds of sporangium formation for each round of mei-
over between different alleles from a single copy homolo- osis. The change in the units of measure for an asexual
gous gene. We did not find intralocus recombination in the generation would, of course, have no effect on the time
EFL, but this was unsurprising as intralocus recombination between sexual generations or the probability of infection
is less common than interlocus recombination and the of a host by an asexual propagule (fig. 6).
EFL was less variable than the HSP70 (fig. 3), so the prob- Regardless of the margins of error, P. tapetis is most likely
ability of detecting recombination within the EFL was facultatively sexual and can probably undergo unlimited
lower. Although difficult to observe directly, the evidence rounds of asexual reproduction, much like a yeast. Out-
for recombination was not unexpected. Meiosis is likely an crossing rates in yeasts have been estimated at once per
integral process in all extant eukaryotes except for some 3,000–9,000 asexual cycles for S. paradoxus (Tsai et al.
lineages with recent losses (Ramesh et al. 2005). 2008) and once per 50,000 asexual cycles for Saccharomyces
2023
Marshall and Berbee · doi:10.1093/molbev/msq078 MBE
frequency of sex, P. tapetis must pass through several hosts mals, and if other unicellular opisthokonts are haploid as
before mating. If sex was obligate and infection limited to well, then haploid dominant life cycles may have been the
sexually derived resting spores or zygotes, the retention ancestral condition of the metazoan stem lineage.
time necessary before sexual reproduction could equal
or surpass the host lifespan (fig. 6). Supplementary Material
The average level of silent-site pairwise nucleotide diver-
sity (hw) was within the range but lower than the average Supplementary tables S1 and S2 are available at Molecular Bi-
value of 0.057 of other unicellular eukaryotes reported by ology and Evolution online (http://www.mbe.oxfordjournals
Lynch (2006). The effective population size of P. tapetis was .org/).
on the order of 106 to 107, which is typical for unicellular
eukaryotes (Lynch 2006). Effective population sizes are Acknowledgments
based on numbers of individuals expected to contribute We thank S. Otto for helpful discussions on coalescent the-
progeny to the next generation. Because many cells die ory, W. Maddison for Poisson probability calculations, and
and do not contribute to the gene pool, the effective size L. Rieseberg and several anonymous reviewers for com-
is lower than the censused population. Pseudoperkinsus ta- ments on earlier versions of this manuscript. This work
petis can grow and divide for a few days in unsupplemented was supported by the Natural Sciences and Research Coun-
seawater but cells arrest at uni- and binucleate stages. cil of Canada and the Bamfield Marine Sciences Centre.
Reinfection is probably a limiting factor in P. tapetis’s de-
2024
Cryptic Sex and Life History of Pseudoperkinsus · doi:10.1093/molbev/msq078 MBE
Clement M, Posada D, Crandall KA. 2000. TCS: a computer program nant sequences and recombination breakpoints. AIDS Res Hum
to estimate gene genealogies. Mol Evol. 9:1657–1659. Retroviruses. 21:98–102.
Cooper MA, Adam RD, Worobey M, Sterling CR. 2007. Population Martin D, Williamson C, Posada D. 2005. RDP2: recombination
genetics provides evidence for recombination in Giardia. Curr detection and analysis from sequence alignments. Bioinformatics
Biol. 17:1984–1988. 21:260–262.
Cunningham CW. 1997. Can three incongruence tests predict when Maynard Smith J, Smith NH, O’Rourke M, Spratt BG. 1993. How
data should be combined? Mol Biol Evol. 14:733–740. clonal are bacteria? Proc Natl Acad Sci U S A. 90:4384–4388.
Farris JS, Kallersjo M, Kluge AG, Bult C. 1994. Testing significance of McVean G, Awadella P, Fearnhead P. 2002. A coalescent-based
incongruence. Cladistics 10:315–319. method for detecting and estimating recombination from gene
Figueras A, Lorenzo G, Ordas MC, Gouy M, Novoa B. 2000. Sequence sequences. Genetics 160:1231–1241.
of the small subunit ribosomal DNA gene of Pseudoperkinsus Mendoza L, Taylor JW, Ajello L. 2002. The class Mesomycetozoea:
tapetis isolated carpet shell clam in Galicia (NW of Spain). Mar a heterogeneous group of microorganisms at the animal-fungal
Biotechnol. 2:419–428. boundary. Annu Rev Microbiol. 56:315–344.
Fraser C, Hanage WP, Spratt BG. 2007. Recombination and the Ngouanesavanh T, Guyot K, Certad G, Le Fichoux Y, Chartier C,
nature of bacterial speciation. Science 315:476–480. Verdier RI, Cailliez JC, Camus D, Dei-Cas E, Bañuls AL. 2006.
Geiser DM, Pitt JI, Taylor JW. 1998. Cryptic speciation and Cryptosporidium population genetics: evidence of clonality
recombination in the aflatoxin-producing fungus Aspergillus in isolates from France and Haiti. J Eukaryot Microbiol. 53:
flavus. Proc Natl Acad Sci U S A. 95:388–393. S33–S36.
Hibbits J. 1978. Marine Eccrinales (Trichomycetes) found in Oliveira RP, Broude NE, Macedo AM, Cantor CR, Smith CL,
crustaceans of the San Juan Archipelago, Washington. Syesis Pena SDJ. 1998. Probing the population genetic structure of
2025
Marshall and Berbee · doi:10.1093/molbev/msq078 MBE
Swofford DL. 1998. PAUP**: phylogenetic analysis using parsimony Tsai IJ, Bensasson D, Burt A, Koufopanou V. 2008. Population
(*and other methods). Version 4. Sunderland (MA): Sinauer genomics of the wild yeast Saccharomyces paradoxus: quanti-
Associates. fying the life cycle. Proc Natl Acad Sci U S A. 105:4957–4962.
Takishita K, Fujiwara Y, Kawato M, Kakizoe N, Miyazaki M, van der Verde M, During HJ, van de Zande L, Bijlsma R. 2001. The
Maruyama T. 2008. Molecular identification of the ichthyospor- reproductive biology of Polytrichum formosum clonal structure
ean protist ‘‘Pseudoperkinsus tapetis’’ from mytilid mussel and paternity revealed by microsatellites. Mol Ecol. 10:2423–2434.
Adipicola pacifica associated with submerged whale carcasses Wakeley J. 1997. Using the variance of pairwise differences to
in Japan. Mar Biotechnol. 10:13–18. estimate the recombination rate. Genet Res. 69:45–48.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. Watterson GA. 1975. On the number of segregating sites in
1997. The Clustal X windows interface flexible strategies for genetical models without recombination. Theor Popul Biol. 7:
multiple sequence alignment aided by quality analysis tools. 256–276.
Nucleic Acids Res. 24:4876–4882. Weeks AR, Marec F, Breeuwer JAJ. 2001. A mite species that consists
Tibayrenc M, Kjellberg F, Ayala FJ. 1990. A clonal theory of parasitic entirely of haploid females. Science 292:2479–2482.
protozoa: the population structures of Entamoeba, Giardia, Whisler HC. 1968. Developmental control of Amoebidium para-
Leishmania, Naegleria, Plasmodium, Trichomonas, and Trypano- siticum. Dev Biol. 17:562–570.
soma and their medical and taxonomical consequences. Proc Xu J. 2004. The advantage of sex in evolving yeast populations.
Natl Acad Sci U S A. 87:2414–2418. Nature 388:465–468.
2026