Professional Documents
Culture Documents
Oryzae, R. Solanacearum, and Pectobacterium Parmentieri That Carried Up To 13, 12, and 13
Oryzae, R. Solanacearum, and Pectobacterium Parmentieri That Carried Up To 13, 12, and 13
Competition for limited resources and space is an activity between and among populations
of the natural world, including phytopathogenic bacteria. Polymorphic toxins (PT) are
inhibit the growth of neighboring competitor cells. Among these PT, Rearrangement hot
spot toxins (Rhs-toxins) are distinguished by their ubiquity in bacterial genomes, and they
diverse ecological niches. The research presented here aims to establish a comprehensive
database on the abundance, diversity, and potential functions of Rhs-toxins in four major
Within the four genera tested, almost all species carry at least one Rhs-toxin in their
genome and have a variety of predicted functions. A small percentage of species did not
code for any Rhs-toxins in their genome. The majority of species carried anywhere between
three to five toxins in their genomes with the exception of some strains in Xanthomonas
oryzae, R. solanacearum, and Pectobacterium parmentieri that carried up to 13, 12, and 13
Rhs-toxins respectively. Furthermore, we found that some Rhs-toxins are shared among
strain in a species, between species, and even between genera. Understanding the ecological
promising insights into bacterial survival strategies in their natural microbiomes and get us
one step closer toward innovative strategy for plant disease management.
© Copyright 2023 by Andrea Gómez
By
Plant Pathology
APPROVED BY:
This work is dedicated with immense love to my grandmother, Yolanda, or as called her, "Mami
Yoli." She raised me, she waited for me every day to come home after school. She is the hands
that fed me and cared for me when I was sick. She is a part of the beauty in me and in so many
memories. She is the birds, she is the sea, she is a cup of cinnamon tea with a soft voice saying
"China quieres té de candela?”. She is in the picture on my desk, she is the necklace that I wear
when I need luck on an important day. My grandma passed away in April 2020 as a result of
COVID-19, and since that day, nothing has been the same. Every action, every goal, and my entire
future is driven by my desire to make her proud and happy. Life is unpredictable, and things change
all the time, but the love and memories we have for our loved ones are something that life cannot
take away from us. Before I left home to pursue my studies in the United States, I received her last
blessing and heartfelt hug. The last words she said to me, are: "Yo lo único que le pido es que siga
estudiando y que siempre agradezca a Dios por las buenas oportunidades que le brinda, con eso
yo soy feliz." ("All I ask of you is to keep studying and always thank God for the good opportunities
He provides; that will make me happy."). I remind myself of this every day.
I want to express my gratitude to God for giving me the gift of life and the privilege of growing
up in my beautiful country, Ecuador. I'm thankful for the chance to have known my grandmother
and be raised by her. My dear Mami Yoli, you live in my heart and my memories. I hope someday
we can meet again in a beautiful garden filled with jasmine flowers and birds.
To my mother Hercilia, who has always sacrificed everything she has for my well-being and
happiness, thank you for always making me feel that I can achieve everything I want and
supporting me in all my big dreams.
To my father Javier, who keeps me close to home through a phone call and constantly reminds me
how proud he is of me.
To my beloved sisters, Diana, and Estefania, the greatest sources of joy and pride in my life.
To my partner, Roshni, who has supported me with love and care through the challenges and
triumphs of this degree, meeting you has been my biggest blessing. Thank you to the Panwala
family for giving me a second home in this country.
To my guiding light, Dr. Alejandra Huerta, who believes in me and has been my constant support,
from the very first day of this scientific journey.
To my friends Kalina, Valeria, Katie, Ying-Yu, and Anna, for their true friendship that keeps me
grounded.
I want to express my heartfelt gratitude to all of you for your support, patience, guidance, and love.
ii
BIOGRAPHY
Andrea Mariel Gómez Cabrera was born in Guayaquil, Ecuador, raised by her mother, Hercilia,
her grandmother, Yolanda, and her older sisters, Diana, and Estefania. She developed a love for
agriculture at a young age, living on a banana tree plantation for the first eight years of her life.
Her father, who is an agricultural engineer, taught her the significance of this profession and its
wide-reaching impact. Andrea earned a scholarship from the Ecuadorian government to pursue her
2020. In her last year of undergraduate she completed an internship with Dr. Robert Beckstead's
lab at NC State University. There, she developed an interest in microbiology studying the vertical
turkeys. Her experience as an intern in poultry health and a general plant pathology course led
Andrea to decide to pursue a master's degree in plant pathology at North Carolina State University
under the guidance of Dr. Alejandra I. Huerta. As a master's student, she has led important work
on characterizing Rhs-toxin abundance, diversity, and function within four of the most important
bacterial plant pathogens. Her work has generated multiple hypothesis and will be the base for
iii
TABLE OF CONTENTS
Chapter II: Abundance and diversity of Rhs-toxins in four genera of plant pathogenic
bacteria ........................................................................................................................................ 21
Abstract ........................................................................................................................................ 21
Introduction .................................................................................................................................. 22
Materials and methods ................................................................................................................. 26
Results and discussion .................................................................................................................. 28
Cited literature .............................................................................................................................. 55
Chapter III: Concluding Remarks and Future Studies on Rhs-toxin Diversity and Function
in Four Genera of Plant Pathogenic Bacteria ........................................................................... 61
Cited literature ............................................................................................................................... 66
iv
LIST OF TABLES
Table 1 Summary of Rhs-toxins dataset used for abundance and diversity study for four plant
pathogenic bacterial genera ...................................................................................... 69
Table 3 Strains represented in this study with their non-curated Rhs-toxin repertoire.......... 81
Table 4 Strains represented in this study with their curated Rhs-toxins repertoire ............... 92
Table 5 Plant pathogenic genera and their species represented in this study with their
complementary N-terminal clade ........................................................................... 102
Table 6 Rhs-toxin protein length with their corresponding toxic domain and putative
predicted function. .................................................................................................. 122
v
LIST OF FIGURES
Figure 1.1 Organization and structure of a Rhs-toxin system in bacteria ................................ 173
Figure 1.2 Mechanism of Rhs-toxin delivery by the Type 6 secretion system ........................ 174
Figure 2.1 Abundance and diversity of Rhs-toxin sequences in plant pathogenic Xanthomonas
.................................................................................................................................................. .175
Figure 2.2 Extensive variation in protein sequence length among Rhs-toxin repertoires within
Xanthomonas genomes ........................................................................................... 176
Figure 2.3 Nine conserved Rhs-toxin system loci in genomes of Xanthomonas species ........ 177
Figure 2.4 A total of 181 unique Rhs C-Terminal tips in Xanthomonas species ..................... 178
Figure 2.5 Unique, general, and large unknown functional diversity among Rhs-toxins in
Xanthomonas genomes ........................................................................................... 179
Figure 3.1 Abundance and diversity of Rhs-toxin sequences in plant pathogenic Ralstonia
......................................................................................................................................... .….….180
Figure 3.2 Extensive variation in protein sequence length among Rhs-toxin repertoires within
Ralstonia genomes.................................................................................................. 181
Figure 3.3 Three conserved Rhs-toxin system loci in genomes of Ralstonia species.............. 182
Figure 3.4 A total of 76 unique Rhs C-Terminal tips in Ralstonia species.............................. 183
Figure 3.5 Unique, general, and large unknown functional diversity among Rhs-toxins in
Ralstonia species .................................................................................................... 184
Figure 4.1 Abundance and diversity of Rhs-toxin sequences in plant pathogenic Pectobacterium
................................................................................................................................................... 185
Figure 4.2 Extensive variation in protein sequence length among Rhs-toxin repertoires within
Pectobacterium genomes ........................................................................................ 186
Figure 4.3 Five conserved Rhs-toxin system loci in genomes of Pectobacterium species ...... 187
Figure 4.4 A total of 106 unique Rhs C-Terminal tips in Pectobacterium species .................. 188
Figure 4.5 Unique, general, and large unknown functional diversity among Rhs-toxins in
Pectobacterium species .......................................................................................... 189
vi
Figure 5.1 Abundance and diversity of Rhs-toxin sequences in plant pathogenic Dickeya .... 190
Figure 5.2 Extensive variation in protein sequence length among Rhs-toxin repertoires within
Dickeya genomes .................................................................................................... 191
Figure 5.3 Three conserved Rhs-toxin system loci in genomes of Dickeya species................ 192
Figure 5.4 A total of 46 unique Rhs C-Terminal tips in Dickeya species ................................ 193
Figure 5.5 Unique, general, and large unknown functional diversity among Rhs-toxins in Dickeya
species..................................................................................................................... 194
Figure 6 Rhs-toxin abundance and diversity across species and genera of plant pathogenic
bacteria ................................................................................................................... 195
vii
LIST OF SUPPLEMENTAL FIGURES
Figure 2.1 Abundance and diversity of Rhs-toxin sequences in plant pathogenic Xanthomonas...
................................................................................................................................ 196
Figure 2.2 Extensive variation in protein sequence length among Rhs-toxin repertoires within
Xanthomonas genomes ........................................................................................... 197
Figure 3.1 Abundance and diversity of Rhs-toxin sequences in plant pathogenic Ralstonia .. 198
Figure 3.2 Extensive variation in protein sequence length among Rhs-toxin repertoires within
Ralstonia genomes .................................................................................................. 199
Figure 4.1 Abundance and diversity of Rhs-toxin sequences in plant pathogenic Pectobacterium
................................................................................................................................ 200
Figure 4.2 Extensive variation in protein sequence length among Rhs-toxin repertoires within
Pectobacterium genomes ........................................................................................ 201
Figure 5.1 Abundance and diversity of Rhs-toxin sequences in plant pathogenic Dickeya .... 202
Figure 5.2 Extensive variation in protein sequence length among Rhs-toxin repertoires within
Dickeya genomes .................................................................................................... 203
viii
Chapter I: Review of the Literature
Introduction
Bacteria are everywhere, and they can come together to establish a resilient microbial
community in the phytobiome. In these complex microbial communities, bacterial cells can
perform different roles and establish unique relationships with their surrounding biotic and
abiotic environment [1,2]. The individual needs of a species can dominate a microbial
community if the species is environmentally fit and there are environmental conditions that
support that species. Both beneficial and harmful interactions occur among and between bacterial
cells in these microbiomes [3]. For example, Yurtsev et al. (2016) demonstrated that two E. coli
strains can form a symbiotic mutualistic relationship and protect each other from ampicillin and
chloramphenicol inhibition, while Koskiniemi et al. (2013) revealed that Dickeya dadantii cells
can inhibit the growth of neighboring cell using proteinaceous toxins [4,5,6]. A diversity of
antagonistic relationships have been reported for multiple bacterial species in natural
environments such as soils, rhizospheres, and plant tissues [7,8,9,10]. These interactions drive
Horizontal gene transfer (HGT) is one of the evolutionary mechanisms that bacteria employ to
acquire new genetic material through three principal modes: 1) conjugation, when bacteria
transfers genes directly to another cell, 2) transformation, when bacteria take up DNA from their
surroundings; and 3) transduction, when a bacteriophage injects DNA directly into a host
bacterial cell [12]. During conjugation, mobile genetic elements play an important role in the
integration of the new DNA fragments into the host genome, enhancing the transfer of DNA
inside the same cell or between cells [13]. Incorporation of new DNA into a bacterial genome
1
can expand a bacterial cell's metabolic capabilities and influence their survival through the
from the same species, different species, and even different genera via the delivery of
inhibitory molecules such as antibiotics, bacteriocins, and PT. The secretion of these toxins is
facilitated through various bacterial mechanisms. However, their size and function ultimately
determine the mechanism of secretion [16,17]. For PT, -like Rhs-toxins, the type six
secretion system (T6SS) have been shown to play an important role in the secretion [18].
Rhs-toxins typically consist of highly conserved N-terminal regions responsible for toxin
translocation from the producing strain to the target strain. This region is often conserved
among genera and determines Rhs-toxin families [19]. A variable C-terminal tip forms the
second part of the polymorphic Rhs-toxin and contains the killing toxin domain, which is
hypervariable and can vary among and between strains, species, and genus [17]. Each C-
terminal tip toxin has a highly specific cognate immunity protein directly downstream of the
C-terming tip open reading frame, this protects the producing cell from self- killing and
guards against other bacterial cells that might carry the same toxin tip [20]. Meaning that the
competition (Figure 1.1) [19]. Genome mining studies from Zhang et al. in 2012
demonstrated that Rhs-toxins are abundant and diverse among bacteria and that the C-
terminal domains can be shared even between distantly related strains, suggesting that these
toxins may have evolved from an ancient reservoir of shared domains [20]. Multiple studies
have explored the abundance, diversity, and delivery of Rhs-toxins in animal bacterial
2
pathogens, but little is known about their importance and abundance in plant pathogenic
A diversity of microorganisms typically inhabit the above-ground parts of plants, with bacteria
being the second most predominant microbe in the phyllosphere, preceded only by bacteriophage
[25,26]. Foliar bacteria can originate from soil, seeds, and air before adapting to life within the
leaf tissue [27]. Various environmental factors, including UV light, water, and nutrients, can
influence their community interactions at the genus and species levels [27, 28,29]. A previous
study by Massoni et al. (2020) suggested that the aerial parts of plants are dominated by bacteria
with evolutionarily conserved traits that can quickly and successfully adapt to abrupt
environmental changes, both biotic and abiotic [30]. Certain species of foliar bacteria can modify
their surrounding environment to enhance nutrient uptake or can develop endospores to protect
themselves against harsh environmental conditions [30, 31]. Many bacteria can adhere to
surfaces, leading to the formation of bacterial biofilms over time [32]. These biofilms are known
to protect bacterial cells from external factors, host defenses, and the effect of antimicrobial
compounds [33,34]. Leaf architecture and morphology can protect bacterial cells and
communities through rain water accumulation on the leaf surface. These tiny water bodies on the
leaf surface can provide cells with nutrients and water for their survival. Water not only provides
bacteria with life but also induces their dispersal, leading to bacterial cells in more
[35,36,37,38].
3
One bacterial genus that displays a phyllosphere lifestyle is Xanthomonas, which include
many bacterial species that are plant pathogens and causal agents of bacterial spot and bacterial
blight disease on more than 400 different crops, including citrus, tomato, rice, pepper, beans,
peaches, and ornamental plants [39,40]. Xanthomonas spp. are widely distributed throughout
different geographical regions across the world [41]. The genus was first described and named
Bacterium vesicatorium by Doidge in 1914 when it was first isolated on tomatoes in South
Africa [42]. The primary dispersal mode for these pathogens are through seed in nursery
production facilities, where high temperatures, humidity, plant density, and excessive irrigation
create an ideal environment for their pathogen growth and transmission [40,41]. Xanthomonas
can survive epiphytically in the phyllosphere for extended periods of time, and under the right
circumstances form latent infections, promoting the spread of the pathogen [42,43,44].
Soil ecosystems house a large diversity of organisms including bacterial, fungi, nematodes, and
viruses. A single gram of soil can contain up to 10 billion bacterial cells with a species
diversity of 4,000 to 50,000 species [45]. It is estimated that there are approximately 1.5
million fungal species globally and populations in soil can vary from 1 to 10 million individuals
per square meter [45,46]. Moreover, it has been estimated that one gram of soil may harbor
anywhere from 10 million to 1 billion virus particles [47]. Many of these microorganisms can
be found in both the rhizosphere and bulk soil. The bulk soil encompasses the area surrounding
the rhizosphere, and the microbial communities found here tend to be more generalized and less
specialized to those found in the rhizosphere endophytically. The rhizosphere is a narrow soil
region between the plant roots and the bulk soil that is highly influenced by plant root exudates.
Within the rhizosphere, specialized plant-associated microbes like Bacillus, Azotobacter, and
4
other nitrogen-fixing bacteria are abundant [48]. These bacteria possess the ability to capture
inorganic nitrogen and convert it into a biologically available form that plants can absorb and
utilize for their growth. In exchange, the bacteria receive carbohydrates and other essential
nutrients from the plant, this mutually beneficial relationship enhances plant growth and
Not all interactions are beneficial, some plant pathogenic bacteria can overwinter in bulk soil,
often associated with crop residue, and under favorable environmental conditions and a
susceptible plant host colonize the rhizosphere and host. These bacterial pathogens can often
overcome plant defenses and colonize the host, establishing a parasitic relationship that
ultimately leads to plant death [52]. Soil-borne bacterial plant pathogens are diverse and depend
on unique mechanisms to infect their host, employ an array of survival strategies, and can adapt
bacterial pathogens, includes seeds movement, contaminated tools, irrigation water, trade [55].
Ralstonia solanacearum, the causal agent of bacterial wilt, is one of the most destructive
soilborne bacterial plant pathogens worldwide. The disease was first reported in Brazil in 1922
[56]. A century later, this pathogen is still recognized as a major threat to agricultural
production systems largely due to its wide geographic distribution and large host range. among
its most significant hosts are solanaceous, leguminous, and tree species [57]. Under high
humidity, warm weather, acidic soils, and a susceptible host this pathogen infects and
multiplies in the vascular tissue of the plant, blocking the transportation of water and nutrients
through the xylem resulting the symptoms that gives it its name, wilt [58,59,60,61].
Bacterial wilt is a challenging disease to manage. This is largely due to the pathogen's
ability to persist in the soil for extended periods of time and genotypic diversity [62]. Cells of
5
the pathogen can be found in aquatic systems for several years after its introduction [63].
Furthermore, there is no chemical or biological treatment for the disease [54]. Instead,
traditional cultural practices can be applied to manage this disease including crop rotation with
non-hosts and amending the soil to disfavor bacterial viability [64, 65]. The strains in the R.
solanacearum species complex (RSSC) are categorized into different taxonomic levels such as
species, phylotypes, sequevars, races, and biovars. The most recent classification divided the
syzygii [63,64]. Prior to this designation, strains were grouped into four phylotypes (phy) (I, II,
III, and IV) which roughly correlated to a strains geographic origin, Phy II the America, Phy I
Pectobacterium spp. and Dickeya spp., are also soilborne plant pathogens that infect
multiple tuberous crops, leading to soft rot and wilting of plants [66]. Pectobacterium
carotovorum subsp. carotovorum (Pcc) is known for having the broadest host range among the
seventeen species in the genus, however, potatoes are their primary host, and when infected,
potato plants can show wilt, soft rot, and necrotic stem symptoms. [67] Soft-rot symptoms in
tubers are frequently observed after harvest, especially when tubers are bruised or damaged
during storage, these wounds act as infection courts for PCC to enter the host. Moreover,
favorable conditions like elevated moisture levels and moderate temperatures helps with the
dispersion and persistence of this pathogen. [68,69]. Like Pcc, Dickeya spp. also cause soft-rot,
stem wilt, and blight diseases in potatoes, carrots, and cabbage. Infections by D. dadantii can
produce localized symptoms consisting of blotches, blight, and rotting on multiple plant tissues,
including tubers, stems, leaves, and flowers [70]. Pectobacterium and Dickeya are frequently
6
encountered together in agricultural fields, and their co-infection can have a synergistic effect
on the host. In 2021, Hao et al. observed that when D. dianthicola and P. parmentieri were co-
infected, it resulted in heightened disease severity for black leg disease and an increased
Bacterial competition
Bacteria have multiple mechanisms that enable them to detect friends and foes within their
ecological niche (e.g., microbiome), Bacterial competition can be categorized into two primary
cells/species ability to deplete available nutrients from the environment thus blocking access to
producing a molecule that directly damages the competitor cell. Both competition strategies can
have detrimental effects on their target bacterial host making competition a strong factor
pushing bacteria to evolve a diversity of strategies to outcompete their rivals [21,74 ,75].
Numerous studies have shown that bacterial competition plays a vital role in the proliferation
and survival of bacterial pathogens [18, 76, 77], For instance, Pseudomonas aeruginosa and
Burkholderia cepacia, two gram-negative bacterial pathogens responsible for human respiratory
infections, in vitro studies revealed that B. cepacia isolates could outcompete P. aeruginosa
[78,79]. These two pathogens produce various secondary metabolites that enable them to
compete against each other, including the production of antibiotics, like streptomycin, and
proteinaceous toxins like bacteriocins [80]. Bacteriocins have antibacterial properties, similar to
bacteria, and their only difference is that bacteriocins only inhibit closely related strains of
7
bacteria [81,82]. Bacteriocin families can encompass a wide array of proteins varying in size,
mode of action, and specificity [83]. It is predicted that 99% of all bacteria have the capability
to produce at least one bacteriocin, [84]. Similarly, Rearrangement hot spot toxins (Rhs-toxins)
another group of polymorphic toxins are ubiquitous in bacterial genomes but unlike
bacteriocins they are secreted via the T6SS, are not induced by UV irradiation of mitomycin C
[85]. Extensive research is available on the diversity and function of bacteriocins; however,
little is known about the abundance, diversity, and function of Rhs-toxins [83,84,85].
Rhs-toxins were first identified in the 1900s by CW. Hill and colleagues, initially it was
thought that they served as recombination elements in E. coli and named Rhs elements [21,86].
However, Koskiniemi and colleagues elucidated the true function of the Rhs elements to be
proteinaceous toxins used by bacteria for intraspecies competition [6,23]. These toxins have
been shown to correlate with the presence of a vgrG and hcp genes. These two genes linked to
the T6SS, known for its contractile needle capable of puncturing the cell wall of target bacterial
cells, delivering Rhs-toxins that either kill or inhibit bacterial replication [87,88].
Rhs-toxins are one of many strategies employed by bacteria to compete against closely related
strains in a microbiome [89,90,91]. They have been shown to be translocated from the producing
strain into the target cell via the T6SS [92]. Unlike many bacterial secreted proteins, Rhs-toxins
are surprisingly large with an average sequence length of 1,500 amino acids (aa) long, and are
terminal segment responsible for encoding the protein translocation, adhesion, and anchoring of
the toxin produced. For instance, molecular studies conducted with Aeromonas dhakensis have
8
revealed that when the N-terminal region is deleted there is a disruption in toxin secretion. [93,
94]. The second architectural motif that defines Rhs-toxins is a highly conserved ten-amino-acid
motif that separates the N-terminal from the C-terminal tip. The C-terminal is the third and final
component of the polymorphic protein that codes for a hypervariable region containing the toxic
domain which may degrade DNA, RNA, or proteins [19,95,96,97]. Genes that encode Rhs-toxins
are typically located upstream from an immunity protein that protects the bacteria from
autoinhibition. Occasionally, but not always, downstream of the immunity protein, one or more
orphan modules can be found encoding an alternative toxin domain, along with their cognate
immunity proteins. It is believed that each orphan module carries an alternative toxin domain and
thus may target a a unique strain (Figure 1.1 A) [98, 99, 100]. The structure of these Rhs-toxins
and their numerous homologues in bacterial genomes suggest that Rhs-toxins have a propensity
for genetic recombination, allowing for the replacement of the existing C-termina toxin with an
alternative orphan module. Jackson and colleagues (2009) suggest that this recombination plays a
Research shows that polymorphic toxins, which house the unique Rhs-toxin, can be exported
through more than one secretion system [102, 103, 104]. Notably, the Type 6 secretion system
(T6SS) is the only one that has been observed to translocate Rhs-toxins [105]. This secretion
system was originally discovered in Vibrio cholerae and P. aeruginosa [89]. Recent research has
highlighted its essential role in bacterial competition. For example, a competition study involving
Shigella and Salmonella infections in the mouse intestine showed that their interactions rely on a
functional T6SS [106, 107]. This molecular nano weapon comprises a complex assembly of
various proteins, each of them serves a unique and crucial function in facilitating the effective
translocation of the Rhs-toxin through an injectable needle (see Figure 1.2). the protein groups
9
include: a hemolysin co- regulated protein (Hcp), which makes up the main tube and is secreted
along with the toxin and protects it from degradations; a PAAR protein, forms the injectable tip
that carries the toxin outside the bacterial cell; and a valine-glycine repeat protein G (VgrG), acts
as a mediator enabling the toxin to move through this Type VI secretion apparatus [108]. Each of
the three protein groups Hcp, PAAR, and VgrG play a significant role in bacteria-bacteria
[109,110]. While multiple studies have been conducted on the diversity and secretion of Rhs-
toxins in bacterial pathogens affecting humans and animals, there is a knowledge gap on the
Polymorphic toxins are distinguished by their unique structure, featuring a toxin domain in their
C-terminal tip, that is crucial for competition and works to disrupt biological processes within
target cells. They can carry diverse range of enzymatic activities, including Dnases for DNA
cleavage, Rnases causing RNA breakdown, glycosylases that are capable of disrupting and
modifying nucleic acid bases and peptidases that cleavage vital protein targets in other bacterial
cells [17, 92, 105, 116, 117, 118]. According to bioinformatic analysis from Zhang et al. (2012),
nucleases are the most widespread predicted function among toxin domain of polymorphic toxins
[20]. Previous studies on E. coli strain EC869 have demonstrated the functional activity of a C-
terminal domain encoding a potent Dnase capable of degrading genomic DNAA and killing
neighboring cells [119]. Similarly, Rnases can degrade RNA and inhibit cell growth while
performed on a Salmonella enterica Rhs-toxin found that this toxins inhibited protein synthesis in
E. coli through ADP- ribosylation, a chemical modification that can alter the activity and function
10
of proteins in bacterial cells [113,118]. Recently studies have found deaminases encoded in C-
terminal toxin domains, these enzymes can catalyze deamination in nucleic acids or proteins to
make them unfunctional [116,123,124,125]. Interestingly, research indicates that deaminases can
induce mutagenesis in some bacteria and inhibit or kill others [124]. Toxin-domains in
polymorphic toxins groups are diverse and this diversity can be observed even when comparing
closely related bacterial strains, hence the term “polymorphic” [125]. Zhang et al (2012) have
made significant progress in the prediction of numerous toxin domain families. However, the
large abundance, diversity, and predicted function of many C-terminal types remain a challenge.
Many polymorphic toxins, including Rhs-toxins remain unknown. [19,20,96 ,125]. The
remain unexplored. To elucidate the role of Rhs-toxins in the ecology and evolution of plant
pathogenic bacteria and the diseases they cause, we first need to understand the abundance and
this end, my overarching hypothesis is that soilborne plant pathogenic bacteria exhibit a
compared to their foliar counterparts. To test this hypothesis, I proposed the following three
major aims:
Aim 1. Determine the abundance of Rhs-toxins in for the following bacterial genera:
Aim 2. Explore the diversity and predicted function of Rhs-toxins in the genomes of
11
CITED LITERATURE
1. Ghoul, M., & Mitri, S. (2016). The Ecology and Evolution of Microbial Competition.
Trends in microbiology, 24(10), 833–845.
2. Kazmierczak B. I. (2020). The Enemy of my Enemy: Bacterial Competition in the Cystic
Fibrosis Lung. Cell host & microbe, 28(4), 502–504.
3. Maitra, A., and Dill, K. A. (2015). Bacterial growth laws reflect the evolutionary importance
of energy efficiency. Proc. Natl. Acad. Sci. U.S.A. 112, 406–411. doi:
10.1073/pnas.1421138111
4. Perault, A. I., Chandler, C. E., Rasko, D. A., Ernst, R. K., Wolfgang, M. C., & Cotter,
(2020). Host Adaptation Predisposes Pseudomonas aeruginosa to Type VI Secretion
System-Mediated Predation by the Burkholderia cepacia Complex. Cell host & microbe,
28(4), 534–547.e3.
5. Yurtsev, E. A., Conwill, A., & Gore, J. (2016). Oscillatory dynamics in a bacterial cross-
protection mutualism. Proceedings of the National Academy of Sciences, 113(22), 6236-
6241. https://doi.org/10.1073/pnas.1523317113
6. Koskiniemi, S., Lamoureux, J. G., Nikolakakis, K. C., t'Kint de Roodenbeke, C., Kaplan, M.
D., Low, D. A., & Hayes, C. S. (2013). Rhs proteins from diverse bacteria mediate
intercellular competition. Proceedings of the National Academy of Sciences of the United
States of America, 110(17), 7032–7037.
7. Darbandi, A., Asadi, A., Mahdizade Ari, M., Ohadi, E., Talebi, M., Halaj Zadeh, M., Darb
Emamie, A., Ghanavati, R., & Kakanj, M. (2022). Bacteriocins: Properties and potential use
as antimicrobials. Journal of clinical laboratory analysis, 36(1), e24093.
8. Brown, S. P., Le Chat, L., & Taddei, F. (2008). Evolution of virulence: triggering host
inflammation allows invading pathogens to exclude competitors. Ecology letters, 11(1), 44–
51.
9. Passarge J, Hol S, Escher M, Huisman J. (2006). Competition for nutrients and light: stable
coexistence, alternative stable states, or competitive exclusion? Ecol Monographs 76: 57–
72.
10. Vetsigian K, Jajoo R, Kishony R. (2011). Structure and evolution of Streptomyces
interaction networks in soil and in silico. PLoS Biol 9: e1001184.
11. Hirano S, Upper D. (1983) Ecology and Epidemiology of Foliar Bacterial Plant Pathogens,
Annual Review of Phytopathology 1983 21:1, 243-270.
12. Roder H. L., Sorensen S. J., Burmolle M. (2016). Studying bacterial multispecies biofilms:
where to start? Trends Microbiol. 24 503–513. 10.1016/j.tim.2016.02.019.
13. Thomas, C., Nielsen, K. (2005). Mechanisms of, and Barriers to, Horizontal Gene Transfer
between Bacteria. Nat Rev Microbiol 3, 711–721
14. Hibbing, M. E., Fuqua, C., Parsek, M. R., & Peterson, S. B. (2010). Bacterial competition:
surviving and thriving in the microbial jungle. Nature reviews. Microbiology, 8(1), 15– 25.
15. Gilbert S. Gregory, Parker M. Ingrid. (2023) The evolutionary ecology of plant disease.
12
16. R. Niehus, S. Mitri, A.G. Fletcher, K.R. (2015). Foster Migration and horizontal gene
transfer divide microbial genomes into multiple niches Nat. Commun., 6 (2015), p. 8924
17. Rappuoli R, Montecucco C. Guidebook to protein toxins and their use in cell biology.
Oxford; New York: Oxford University Press; 1997.
18. S. Borgeaud, L.C. Metzger, T. Scrignari, M. Blokesch. (2015), The type VI secretion system
of Vibrio cholerae fosters horizontal gene transfer Science, 347 ,pp. 63-67
19. Jackson, A.P., Thomas, G.H., Parkhill, J. et al. (2009). Evolutionary diversification of an
ancient gene family (rhs) through C-terminal displacement. BMC Genomics 10, 584.
20. Zhang, D., de Souza, R. F., Anantharaman, V., Iyer, L. M., & Aravind, L. (2012).
Polymorphic toxin systems: Comprehensive characterization of trafficking modes,
processing, mechanisms of action, immunity and ecology using comparative genomics.
Biology direct, 7, 18.
21. Riley MA, Wertz JE (2002) Bacteriocins: evolution, ecology, and application Annual
Review of Microbiology 56:117–137.
22. Stårsta M, Hammarlöf DL, Wäneskog M, Schlegel S, Xu F, Heden Gynnå A, et al. (2020)
RHS-elements function as type II toxin-antitoxin modules that regulate intra-macrophage
replication of Salmonella Typhimurium. PLoS Genet 16(2): e1008607.
23. Steele, M. I., & Moran, N. A. (2021). Evolution of Interbacterial Antagonism in Bee Gut
Microbiota Reflects Host and Symbiont Diversification. mSystems, 6(3), e00063-21.
24. Steele, M. I., Kwong, W. K., Whiteley, M., & Moran, N. A. (2017). Diversification of Type
VI Secretion System Toxins Reveals Ancient Antagonism among Bee Gut Microbes. mBio,
8(6), e01630-17.
25. Jurėnas, D., Rosa, L. T., Rey, M., Fronzes, R., & Cascales, E. (2021). Mounting, structure
and autocleavage of a type VI secretion-associated Rhs polymorphic toxin. Nature
Communications, 12(1), 1-11.
26. Lu, W., Tan, J., Lu, H. et al. (2021). Function of Rhs proteins in porcine extraintestinal
pathogenic Escherichia coli PCN033. J Microbiol. 59, 854–860.
27. Schroth, M. N., & Hancock, J. G. (1981). Selected topics in biological control. Annual
review of microbiology, 35, 453–476.
28. Hirano SS. (2000). Upper CD. Bacteria in the leaf ecosystem with emphasis on
Pseudomonas syringae-a pathogen, ice nucleus, and epiphyte. Microbiol Mol Biol Rev.
2000 Sep;64(3):624-53. doi: 10.1128/MMBR.64.3.624-653.2000. PMID: 10974129;
PMCID: PMC99007.
29. Indananda, C., Thamchaipenet, A., Matsumoto, A., Inahashi, Y., Duangmal, K., &
Takahashi, Y. (2011). Actinoallomurus oryzae sp. nov., an endophytic actinomycete isolated
from roots of a Thai jasmine rice plant. International journal of systematic and evolutionary
microbiology, 61(Pt 4), 737–741.
30. Parasuraman P, Pattnaik S, Busi S (2019) Phyllosphere microbiome: functional importance
in sustainable agriculture. In: Singh JS, Singh DP (eds) New and future developments in
microbial biotechnology and bioengineering, Elsevier, pp 135—148. Massoni, J., Bortfeld-
13
Miller, M., Jardillier, L., Salazar, G., Sunagawa, S., & Vorholt, J. (2020). Consistent host
and organ occupancy of phyllosphere bacteria in a community of wild herbaceous plant
species. The ISME journal, 14(1), 245–258.
31. Mina D, Pereira JA, Lino-Neto T, Baptista P (2020) Epiphytic and endophytic bacteria on
olive tree phyllosphere: exploring tissue and cultivar effect. Microb Ecol 80:145—157.
32. Beattie GA, Lindow SE (1999) Bacterial colonization of leaves: a spectrum of strategies.
Phytopathology 89:53—359.
33. Parsek MR, Fuqua C (2004) Biofilms 2003: emerging themes and challenges in studies of
surface-associated microbial life. J Bacteriol 186:4427—4440.
34. Vorholt JA (2012) Microbial life in the phyllosphere. Nat Rev Microbiol 10:828—840.
35. Baier R: Conditioning surfaces to suit the biomedical environment: recent progress. J
Biomed Eng. 1982;104:257-271
36. Sivakumar, N., Sathishkumar, R., Selvakumar, G., Shyamkumar, R., & Arjunekumar, K.
(2020). Phyllospheric Microbiomes: Diversity, Ecological Significance, and
Biotechnological Applications. Plant Microbiomes for Sustainable Agriculture, 25, 113–
172.
37. Allard, S. M., Ottesen, A. R., & Micallef, S. A. (2020). Rain induces temporary shifts in
epiphytic bacterial communities of cucumber and tomato fruit. Scientific reports, 10(1),
1765.
38. Jacques, M. A., Arlat, M., Boulanger, A., Boureau, T., Carrère, S., Cesbron, S., Chen, N. W.,
Cociancich, S., Darrasse, A., Denancé, N., Fischer-Le Saux, M., Gagnevin, L., Koebnik, R.,
Lauber, E., Noël, L. D., Pieretti, I., Portier, P., Pruvost, O., Rieux, A., Robène, I., …
Vernière, C. (2016). Using Ecology, Physiology, and Genomics to Understand Host
Specificity in Xanthomonas. Annual review of phytopathology, 54, 163– 187.
39. Potnis, N., Timilsina, S., Strayer, A., Shantharaj, D., Barak, J. D., Paret, M. L., Vallad, G. E.,
& Jones, J. B. (2015). Bacterial spot of tomato and pepper: diverse Xanthomonas species
with a wide variety of virulence factors posing a worldwide challenge. Molecular plant
pathology, 16(9), 907–920.
40. Pohronezny, K. , Stall, R.E. , Canteros, B.I. , Kegley, M. , Datnoff, L.E. and Subramanya,R.
(1992) Sudden shift in the prevalent race of Xanthomonas campestris pv. vesicatoria in
pepper fields in South Florida. Plant Dis. 76, 118–120.
41. Doidge, E.M. (1920) A tomato canker. J. Dep. Agric. Union S. Afr. 1, 718–721.
42. Jones, J.B. , Pohronezny, K.L. , Stall, R.E. and Jones, J.P. (1986) Survival of Xanthomonas
campestris pv. vesicatoria on tomato crop residue, weeds, seeds, and volunteer tomato
plants. Phytopathology, 76, 430–434.
43. Araújo, E.R. , Pereira, R.C. , Moita, A.W. , Ferreira, M.A.S.V. , Café-Fiho, A.C. and
Quezado-Duval, A.M. (2010) Effect of temperature on pathogenicity components of
tomato bacterial spot and competition between Xanthomonas perforans and X. gardneri. III
International symposium on tomato diseases. Acta Hort. 914, 39–42.
44. Lavelle, P. and Spain, A.V. (2001) Soil Ecology. Kluwer Scientific, Amsterdam.
14
45. D.L. Hawksworth, (1991) The fungal dimension of biodiversity: magnitude, significance,
and conservation, Mycological Research, Volume 95, Issue 6, 1991, Pages 641-655,ISSN
0953-7562.
46. Vitorino, L.C., & Bessa, L.A. (2018). Microbial Diversity: The Gap between the Estimated
and the Known. Diversity.
47. Wang, Y., Jiao, P., Guo, W., Du, D., Hu, Y., Tan, X., & Liu, X. (2022). Changes in Bulk
and Rhizosphere Soil Microbial Diversity and Composition Along an Age Gradient of
Chinese Fir (Cunninghamia lanceolate) Plantations in Subtropical China. Frontiers in
microbiology, 12, 777862.
48. Wang, Z., Liu, L., Chen, Q. et al. (2017). Conservation tillage enhances the stability of the
rhizosphere bacterial community responding to plant growth. Agron. Sustain. Dev. 37, 44.
49. Bulen W. A. (1965). Biological Nitrogen Fixation. Science (New York, N.Y.), 147(3655),
310–312.
50. Pankievicz, V.C.S., Irving, T.B., Maia, L.G.S. et al. (2019). Are we there yet? The long
walk towards the development of efficient symbiotic associations between nitrogen-fixing
bacteria and non-leguminous crops. BMC Biol 17, 99
51. Mendes, R., Garbeva, P., & Raaijmakers, J. M. (2013). The rhizosphere microbiome:
significance of plant beneficial, plant pathogenic, and human pathogenic microorganisms.
FEMS microbiology reviews, 37(5), 634–663.
52. Schostag, M., Stibal, M., Jacobsen, C. S., Bælum, J., Taş, N., Elberling, B., Jansson, J.
K., Semenchuk, P., & Priemé, A. (2015). Distinct summer and winter bacterial communities
in the active layer of Svalbard permafrost revealed by DNA- and RNA-based analyses.
Frontiers in microbiology, 6, 399.
53. Menzies, J. D. (1963). Survival of Microbial Plant Pathogens in Soil. Botanical Review,
29(1), 79–122.
54. Narayanasamy, P. (2019). Soilborne Microbial Plant Pathogens and Disease Management,
Volume One: Nature and Biology (1st ed.). CRC Press.
55. Wang, Z., Luo, W., Cheng, S., Zhang, H., Zong, J., & Zhang, Z. (2023). Ralstonia
solanacearum - A soil borne hidden enemy of plants: Research development in management
strategies, their action mechanism and challenges. Frontiers in plant science, 14, 1141902.
56. Wilen CA, Koike ST, Ploeg AT, Tjosvold SA, Bethke JA, Mathews DM, Stapleton JJ.
Revised continuously. UC IPM Pest Management Guidelines: Floriculture and Ornamental
Nurseries. UC ANR Publication 3392. Davis, CA.
57. Takatsu, A., & Lopes, C. A. (1997). Murcha-bacteriana em hortaliças: Avanços científicos e
perspectivas de controle. Horticultura Brasileira, 15, 170– 177.
58. Genin, S., & Boucher, C. (2002). Ralstonia solanacearum: Secrets of a major pathogen
unveiled by analysis of its genome. Molecular Plant Pathology, 3, 111– 118.
59. Caruso, P., Palomo, J. L., Bertolini, E., Alvarez, B., López, M. M., & Biosca, E. G. (2005).
Seasonal variation of Ralstonia solanacearum biovar 2 populations in a Spanish river:
15
recovery of stressed cells at low temperatures. Applied and environmental microbiology,
71(1), 140–148.
60. Hiromichi Yamazaki, Sunao Kikuchi, Tsuguo Hoshina & Takeshi Kimura (1999) Effect of
calcium concentration in nutrient solution before and after inoculation with Ralstonia
solanacearum on resistance of tomato seedlings to bacterial wilt, Soil Science and Plant
Nutrition, 45:4, 1009-1014
61. Raymond O. García, Jim P. Kerns,Lindsey Thiessen, (2019) Ralstonia solanacearum Species
Complex: A Quick Diagnostic Guide, Department of Entomology and Plant Pathology,
North Carolina State University, Raleigh, 27695
62. Safni, I., Cleenwerck, I., De Vos, P., Fegan, M., Sly, L., & Kappler, U. (2014). Polyphasic
taxonomic revision of the Ralstonia solanacearum species complex: proposal to emend the
descriptions of Ralstonia solanacearum and Ralstonia syzygii and reclassify current R.
syzygii strains as Ralstonia syzygii subsp. syzygii subsp. nov., R. solanacearum phylotype
IV strains as Ralstonia syzygii subsp. indonesiensis subsp. nov., banana blood disease
bacterium strains as Ralstonia syzygii subsp. celebesensis subsp. nov. and R. solanacearum
phylotype I and III strains as Ralstonia pseudosolanacearum sp. nov. International
journal of systematic and evolutionary microbiology, 64(Pt 9), 3087– 3103.
63. Paudel, S., Dobhal, S., Alvarez, A. M., & Arif, M. (2020). Taxonomy and Phylogenetic
Research on Ralstonia solanacearum Species Complex: A Complex Pathogen with
Extraordinary Economic Consequences. Pathogens, 9(11).
64. Denny, T. (2007). Plant pathogenic Ralstonia species. In: Gnanamanickam, S.S. (eds) Plant-
Associated Bacteria. Springer, Dordrecht.
65. Czajkowski, R., & M. Pérombelon, M. C. (2011). Control of blackleg and tuber soft rot of
potato caused by Pectobacterium and Dickeya species: A review. Plant Pathology, 60(6),
999-1013.
66. Portier, P., Pédron, J., Taghouti, G., Dutrieux, C., & Barny, M. A. (2020). Updated
Taxonomy of Pectobacterium Genus in the CIRM-CFBP Bacterial Collection: When Newly
Described Species Reveal "Old" Endemic Population. Microorganisms, 8(9), 1441.
67. M. Pérombelon, M. C. (2002). Potato diseases caused by soft rot erwinias: An overview of
pathogenesis. Plant Pathology, 51(1), 1-12.
68. Toth, I. K., Barny, M-A., Brurberg, M. B., Condemine, G., Czajkowski, R. L.,
Elphinstone, J. G., Helias, V., Johnson, S. B., Moleleki, L. N., Pirhonen, M., Rossmann, S.,
Tsror, L., van der Waals, J. E., van der Wolf, J. M., van Gijsegem, F., & Yedidia, I. (2021).
Pectobacterium and Dickeya: Environment to Disease Development. In F. Van Gijsegem, J.
M. van der Wolf, & I. K. Toth (Eds.), Plant Diseases Caused by Dickeya and Pectobacterium
Species (pp. 39-84). Springer.
69. Nelson, S. C. (2009). Bacterial Leaf Blight of Aglaonema. Honolulu, HI: University of
Hawai‘i at Manoa.
16
70. Ge, T., Ekbataniamiri, F., Johnson, S. B., Larkin, R. P., & Hao, J. (2021). Interaction
between Dickeya dianthicola and Pectobacterium parmentieri in Potato Infection under
Field Conditions. Microorganisms, 9(2), 316.
71. Lories, B., Roberfroid, S., Dieltjens, L., De Coster, D., Foster, K. R., & Steenackers, H. P.
(2020). Biofilm Bacteria Use Stress Responses to Detect and Respond to Competitors.
Current biology : CB, 30(7), 1231–1244.e4.
72. Birch, L. C. (1957).The meanings of competition. Am. Nat. 91, 5–18.
73. Case, T. J., & Gilpin, M. E. (1974). Interference competition and niche theory.
Proceedings of the National Academy of Sciences of the United States of America, 71(8),
3073–3077.
74. Carruthers MD, Nicholson PA, Tracy EN, Munson RS Jr (2013) Correction:
Acinetobacter baumannii Utilizes a Type VI Secretion System for Bacterial Competition.
PLOS ONE 8(12): 10.1371/annotation/7aa1688c-56c8-46ca-82ea-f86697f3c4fe.
75. Decoin V, Barbey C, Bergeau D, Latour X, Feuilloley MGJ, Orange N, et al. (2014) A
Type VI Secretion System Is Involved in Pseudomonas fluorescens Bacterial Competition.
PLoS ONE 9(2): e89411.
76. Divine Y. Shyntum, Jacques Theron, Stephanus N. Venter, Lucy N. Moleleki, Ian K.
Toth, and Teresa A. Coutinho (2015) Pantoea ananatis Utilizes a Type VI Secretion System
for Pathogenesis and Bacterial Competition, Department of Microbiology and Plant
Pathology, Faculty of Natural and Agricultural Sciences, and Forestry and Agricultural
Biotechnology Institute (FABI), University of Pretoria, Pretoria 0002, South Africa.
77. Eberl, L., & Tümmler, B. (2004). Pseudomonas aeruginosa and Burkholderia cepacia in
cystic fibrosis: genome evolution, interactions and adaptation. International journal of
medical microbiology : IJMM, 294(2-3), 123–131.
78. Perault, A. I., Chandler, C. E., Rasko, D. A., Ernst, R. K., Wolfgang, M. C., & Cotter, P. A.
(2020). Host Adaptation Predisposes Pseudomonas aeruginosa to Type VI Secretion
System-Mediated Predation by the Burkholderia cepacia Complex. Cell host & microbe,
28(4), 534–547.e3. https://doi.org/10.1016/j.chom.2020.06.019
79. James R, Lazdunski C, Pattus F, eds (1991). Bacteriocins, Microcins and lantibiotics. Vol. 65.
New York: Springer-Verlag. 519 pp.
80. Tagg JR, Dajani AS, Wannamaker LW. (1976). Bacteriocins of gram-positive bacteria.
Bact. Rev: 40:722-56
81. Torreblanca M. Meseguer I, Ventosa A. (1994). Production of halocin is a practically
universal feature of archael halophic rods. Let. Appl. Microbiol. 19: 2015.
82. Ghequire MGK, De Mot R. (2014) Ribosomally encoded antibacterial proteins and peptides
from Pseudomonas. FEMS Microbiol Rev. 38: 523–568. pmid:24923764
83. Pei, T., Kan, Y., Wang, H., Tang, X., Li, H., Yan, S., Cui, Y., Zheng, Y., Luo, H., Liang, X.,
& Dong, T. (2022). Delivery of an Rhs-family nuclease effector reveals direct penetration of
the gram-positive cell envelope by a type VI secretion system in Acidovorax citrulli. MLife,
1(1), 66-78. https://doi.org/10.1002/mlf2.12007.
17
84. Klaenhammer TR. (1988). Bacteriocins of lactic acid bacteria. Biochimie 70:33749
85. Hill, C.W., C.H. Sandt, and D.A. Vlazny (1994)., Rhs elements of Escherichia coli: a family
of genetic composites each encoding a large mosaic protein. Molecular microbiology. 12(6):
p. 865-871. 36.
86. Hill, C.W (1999)., Large genomic sequence repetitions in bacteria: lessons from rRNA
operons and Rhs elements. Research in microbiology, 150(9-10): p. 665-674.
87. Mougous JD, Cuff ME, Raunser S, Shen A, Zhou M, Gifford CA, Goodman AL,
Joachimiak G, Ordoñez CL, Lory S, Walz T, Joachimiak A, Mekalanos JJ. (2006) A
virulence locus of Pseudomonas aeruginosa encodes a protein secretion apparatus. Science.
Jun 9;312(5779):1526-30. doi: 10.1126/science.1128393. PMID: 16763151; PMCID:
PMC2800167.
88. Boyer, F., Fichant, G., Berthod, J., Vandenbrouck, Y., and Attree, I.(2009). Dissecting the
bacterial type VI secretion system by a genome wide in silico analysis: what can be learned
from available microbial genomic resources? BMCGenomics10,104.
89. Aoki SK, Pamma R, Hernday AD, Bickham JE, Braaten BA, Low DA. (2005) Contact-
dependent inhibition of growth in Escherichia coli. Science.19;309(5738):1245-8. doi:
10.1126/science.1115109. PMID: 16109881.
90. Alouf JE, Popoff MR. (2006) The comprehensive sourcebook of bacterial protein toxins. 3.
Amsterdam; Boston: Elsevier Academic Press.
91. Sisto A, Cipriani MG, Morea M, Lonigro SL, Valerio F, Lavermicocca P, (2010). An Rhs-
like genetic element is involved in bacteriocin production by Pseudomonas savastanoi pv.
savastanoi. Antonie Van Leeuwenhoek., 98 (4): 505-517. 10.1007/s10482-010-9468-7.
92. Shneider, M., Buth, S., Ho, B. et al. (2013). PAAR-repeat proteins sharpen and diversify the
type VI secretion system spike. Nature 500, 350–353.
93. Pei, TT., Li, H., Liang, X. (2020). Intramolecular chaperone-mediated secretion of an
Rhs effector toxin by a type VI secretion system. Nat Commun 11, 1865.
94. Zhang D, Iyer LM, Aravind L, (2011). A novel immunity system for bacterial nucleic acid
degrading toxins and its recruitment in various eukaryotic and DNA viral systems. Nucleic
Acids Res, 39 (11): 4532-4552. 10.1093/nar/gkr036.
95. Anantharaman V, Aravind L (2003). New connections in the prokaryotic toxin-antitoxin
network: relationship with the eukaryotic nonsense-mediated RNA decay system. Genome
Biol., 4 (12): R81-10.1186/gb-2003-4-12-r81.
96. Leplae R, Geeraerts D, Hallez R, Guglielmini J, Dreze P, Van Melderen L, (2011):
Diversity of bacterial type II toxin-antitoxin systems: a comprehensive search and functional
analysis of novel families. Nucleic Acids Res., 39 (13): 5513-5525. 10.1093/nar/gkr131.
97. Iyer LM, Zhang D, Rogozin IB, Aravind L (2011) Evolution of the deaminase fold and
multiple origins of eukaryotic editing and mutagenic nucleic acid deaminases from bacterial
toxin systems. Nucleic Acids Res, 39 (22): 9473-9497. 10.1093/nar/gkr691.
98. Jamet, A. & Nassif, X. (2015). New players in the toxin field: polymorphic toxin systems in
bacteria. mBio 6, e00285–00215.
18
99. Ruhe, Z. C., Low, D. A. & Hayes, C. S. (2020).Polymorphic Toxins and Their Immunity
Proteins: Diversity, Evolution, and Mechanisms of Delivery. Annu Rev. Microbiol 74, 497–
520.
100. Whitney, J. C. et al. (2014). Genetically distinct pathways guide effector export through the
type VI secretion system. Mol. Microbiol 92, 529–542.
101. Cianfanelli, F. R., Alcoforado Diniz, J., Guo, M., De Cesare, V., Trost, M., & Coulthurst, S.
J. (2016). VgrG and PAAR Proteins Define Distinct Versions of a Functional Type VI
Secretion System. PLoS pathogens, 12(6), e1005735.
102. Ahmad, S., Tsang, K. K., Sachar, K., Quentin, D., Tashin, T. M., Bullen, N. P., Raunser, S.,
McArthur, A. G., Prehna, G., & Whitney, J. C. (2020). Structural basis for effector
transmembrane domain recognition by type VI secretion system chaperones. eLife, 9,
e62816. https://doi.org/10.7554/eLife.62816
103. Brackmann, M., Nazarov, S., Wang, J., & Basler, M. (2017). Using Force to Punch Holes:
Mechanics of Contractile Nanomachines. Trends in cell biology, 27(9), 623–632.
104. Anderson, M. C., Vonaesch, P., Saffarian, A., Marteyn, B. S., & Sansonetti, P. J. (2017).
Shigella sonnei Encodes a Functional T6SS Used for Interbacterial Competition and Niche
Occupancy. Cell host & microbe, 21(6), 769–776.e3
105. Sana, T. G., Flaugnatti, N., Lugo, K. A., Lam, L. H., Jacobson, A., Baylot, V., Durand, E.,
Journet, L., Cascales, E., & Monack, D. M. (2016). Salmonella Typhimurium utilizes a
T6SS-mediated antibacterial weapon to establish in the host gut. Proceedings of the
National Academy of Sciences of the United States of America, 113(34), E5044–E5051.
106. Silverman, J. M., Agnello, D. M., Zheng, H., Andrews, B. T., Li, M., Catalano, C. E.,
Gonen, T., & Mougous, J. D. (2013). Haemolysin coregulated protein is an exported
receptor and chaperone of type VI secretion substrates. Molecular cell, 51(5), 584–593.
107. Pukatzki, S., McAuley, S. B., & Miyata, S. T. (2009). The type VI secretion system:
translocation of effectors and effector-domains. Current opinion in microbiology, 12(1), 11–
17.
108. Wu, H. Y., Chung, P. C., Shih, H. W., Wen, S. R., & Lai, E. M. (2008). Secretome analysis
uncovers an Hcp-family protein secreted via a type VI secretion system in Agrobacterium
tumefaciens. Journal of bacteriology, 190(8), 2841–2850.
109. Zhou, Y., Tao, J., Yu, H., Ni, J., Zeng, L., Teng, Q., Kim, K. S., Zhao, G. P., Guo, X., &
110. Yao, Y. (2012). Hcp family proteins secreted via the type VI secretion system coordinately
regulate Escherichia coli K1 interaction with human brain microvascular endothelial cells.
Infection and immunity, 80(3), 1243–1251.
111. Fraikin, N., Goormaghtigh, F., & Van Melderen, L. (2020). Type II Toxin-Antitoxin
Systems: Evolution and Revolutions. Journal of bacteriology, 202(7), e00763-19.
112. Ma, J., Sun, M., Dong, W., Pan, Z., Lu, C., & Yao, H. (2017). PAAR-Rhs proteins harbor
various C-terminal toxins to diversify the antibacterial pathways of type VI secretion
systems. Environmental microbiology, 19(1), 345–360.
19
113. Jurėnas, D., Payelleville, A., Roghanian, M., Turnbull, K. J., Givaudan, A., Brillard, J.,
Hauryliuk, V., & Cascales, E. (2021). Photorhabdus antibacterial Rhs polymorphic toxin
inhibits translation through ADP-ribosylation of 23S ribosomal RNA. Nucleic acids
research, 49(14), 8384–8395.
114. Kung, V. L., Khare, S., Stehlik, C., Bacon, E. M., Hughes, A. J., & Hauser, A. R. (2012). An
rhs gene of Pseudomonas aeruginosa encodes a virulence protein that activates the
inflammasome. Proceedings of the National Academy of Sciences of the United States of
America, 109(4), 1275–1280.
115. Proft T: Microbial toxins: molecular and cellular biology. (2005), Norfolk, England: BIOS
Scientific.
116. Aepfelbacher M, Aktories K, Just I: Bacterial protein toxins. (2000).Berlin; New York:
Springer.
117. Dhananjaya, B. L., & D Souza, C. J. (2010). An overview on nucleases (DNase, RNase, and
phosphodiesterase) in snake venoms. Biochemistry. Biokhimiia, 75(1), 1–6.
118. Yuliar, Nion, Y. A., & Toyota, K. (2015). Recent trends in control methods for bacterial
wilt diseases caused by Ralstonia solanacearum. Microbes and environments, 30(1), 1–11
119. Laskowski, M. SR. (1985). Nucleases: historical respectives. In Nucleases (eds. Linn, S. M.
& Roberts, R. J.), pp. 1–21. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press.
120. Hsia, K. C., Li, C. L., & Yuan, H. S. (2005). Structural and functional insight into sugar-
nonspecific nucleases in host defense. Current opinion in structural biology, 15(1), 126–
134.
121. RP Morse, et al., Structural basis of toxicity and immunity in contact-dependent growth
inhibition (CDI) systems. Proc Natl Acad Sci USA 109, 21480–21485 (2012).
122. Proft T. Microbial toxins: molecular and cellular biology. Norfolk, England: BIOS
Scientific; 2005.
123. Harold CS. , RNA and DNA Editing: Molecular Mechanisms and Their Integration into
Biological Systems, 2008Hoboken, New JerseyJohn Wiley & Sons, Inc.
124. Iyer LM, Leipe DD, Koonin EV, Aravind L. Evolutionary history and higher order
classification of AAA+ ATPases. Journal of Structural Biology. 2004;146:11–31.
125. Makarova KS, Wolf YI, Karamycheva S, Zhang D, Aravind L, Koonin EV. Antimicrobial
peptides, polymorphic toxins, and Self-Nonself recognition systems in archaea: an untapped
armory for intermicrobial conflicts. mBio. 2019;10:e00715-19. doi: 10.1128/mBio.00715-
19.
126. de Moraes MH, Hsu F, Huang D, Bosch DE, Zeng J, Radey MC, Simon N, Ledvina
HE, Frick JP, Wiggins PA, Peterson SB, Mougous JD. An interbacterial DNA deaminase
toxin directly mutagenizes surviving target populations. Elife. 2021 Jan 15;10:e62967.
20
Chapter II: Abundance and diversity of Rhs-toxins in four genera of
plant pathogenic bacteria
Abstract
Rearrangement Hot Spot (Rhs) are proteinaceous toxins produced by bacteria that can kill or
inhibit the growth of neighboring competing cells. They are widespread throughout the bacterial
kingdom and play important roles in intra- and interspecies competition. However, little is known
about Rhs abundance, diversity, and function in plant pathogenic bacteria. To this end, we applied
a Hidden Markov Model (HMM) to extract and characterize the Rhs-toxin repertoires in genomes
of foliar and soilborne phytopathogens in: Xanthomonas, Pectobacterium, Dickeya, and Ralstonia.
Among the 343 genomes mined across the four genera, the abundance of Rhs-toxins per genome
varied from 0 to 13, depending on the genera and species. A protein functional analysis of all Rhs-
toxins analyzed in this study identified Rhs-toxins domains clustered into DNases, RNases,
proteases, and deaminases. Interestingly, the killing domain-containing region of some Rhs-toxins,
Rhs-toxin C-termina tips, were shared among and between bacterial genera and species. These
findings provide insights into the distribution and diversity of Rhs-toxins among plant pathogenic
bacteria, which may have implications for understanding niche specification and exclusion of
21
Introduction
The scientific study of bacterial toxins emerged a century ago from the pioneering research of
Klebs and Löffler in the 1880s. They not only identified the bacterial strain Corynebacterium
diphtheriae but also proposed that this bacterium produced a "poison" responsible for causing
diphtheria disease in humans [1]. Subsequently, Roux and Yersinia supported this hypothesis
by cultivating the bacterium, extracting the toxin, and introducing the toxin into guinea pigs,
which resulted in similar symptoms to those seen in diphtheria patients [2]. Currently, there is a
diversity of bacterial toxins known, each with distinct and some with overlapping mechanisms
involved in cell replication, protein synthesis, and DNA replication or stability [3]. One of the
most extensively studied polymorphic toxins are those with an "A-B" architecture, in which two
distinct subunits: "A" and "B" comprise the complete functional unit. The "A" subunit carries
the toxic function of the protein, whereas the "B" subunit determines target cell specificity by
binding to the complementary cell surface (i.e., determines target host). The binding of subunit
“B” to the target cell facilitates the entry of the A-B toxin into the cell, once inside, the "A"
component is released into the target cell cytoplasmic melee to perform its function. Similar
mechanisms of action are observed in the two human bacterial pathogens that cause pulmonary
infections and cholera, Pseudomonas aeruginosa and Vibrio cholera, respectively [3,4].
In 2005, a novel polymorphic toxin, Rearrangement hot spot toxin (Rhs-toxins), was described
in Escherichia coli isolate EC93 [5]. However, back then the function of these genes were
predicted to play a role in genome rearrangement. It was a decade later that Rhs-toxin function
was functionally confirmed in the plant pathogenic bacterium Dickeya dadantii 3937 [6]. Rhs-
the target cell was only achieved when two cells were in direct contact, and termed contact-
22
dependent cell inhibition (CDI). The CDI system, described originally in gram positive
proteins named CdiB work with the type 5 secretion system (T5SS) to secrete and translocating
CdiA to the target cell surface. The CdiA protein encodes the toxin domain that inhibits the
growth of target bacterial cells [7]. The CDI is very much like the “A-B toxins” described
above. Unlike the above-mentioned “A-B toxin” systems the polymorphic toxin is followed by
a small protein that protects the toxin-producing cell from autoinhibition [7]. Aoki et al (2005)
showed both in vitro and in vivo that the C-terminal region of the Rhs-toxins identified in D.
dadantii strain 3937 and uropathogenic E. coli strain 536 encode a similar toxin function but
different nuclease toxin domains and that the cognate immunity protein that followed each C-
terminal tip was unique to each toxin domain [6]. Similarly, to the CDI mechanism, the Rhs
protein family is also known for its antibacterial activity, and it can be found across various
The polymorphic Rhs-toxin system protein family consists of a multi-domain rhs gene that can
encode variable toxin domains. The majority of functional Rhs-toxin system loci in bacterial
genomes have similar structures and architecture. The Rhs-toxin protein has an N-terminal
interaction in the cell cytoplasm of the producing strain, enabling the translocation of the toxin
through the inner and outer membrane; and a C-terminal region demarcated by a conserved ten
amino acid peptide motif (PxxxxDPxGL) [8]. Directly downstream of the C-terminal tip there is
a cognate immunity protein that protects the producing bacterium from autoinhibition and
enables killing of competing cells (Figure 1B) [7,9]. Poole et al (2011) demonstrated that CdiA
and Rhs-toxin families share sequence similarity at the C-terminal, implying homology between
23
the toxin domain [10]. Similarly, to the CdiA systems, additional C-terminal and immunity
protein “genes” known as orphan modules can be found directly downstream of the full
polymorphic Rhs-toxin (Figure 1A). Orphan modules can encode for alternate toxin domains,
and thus, expand the target host range for a bacterial cell. C-terminal tips are always paired with
their cognate immunity protein. Despite lacking a complete N-terminal, orphan modules do
include the previously stated ten-amino-acid peptide motif that demarcates the end of the N-
terminal and beginning of the C-terminal tip; however, they do not have the N-terminal
conserved region [5,7]. In vivo studies performed in S. enterica serovar Typhimurium strain
LT2, confirmed that homologous recombination between an orphan module C-terminal tip and a
full polymorphic Rhs-toxin C-terminal tip can lead to an active protein and may provide the
producing strain with a competitive advantage [11]. Like the CDI system that uses the T5SS to
translocate toxins from the producing cell to the target cell, Rhs-toxins can be translocated by
the T6SS [12,13,14]. The T6SS is an expandable apparatus that injects Rhs-toxins directly from
the producing cell into cytoplasm of the target cell, and often referred to as a micro-weapon.
Composed of 13 to 14 core proteins (i.e., TssA, Tssl, Hcp), the T6SS builds a syringe-like
apparatus in a step-by-step manner to deliver and secrete proteins (Figure 1.2) [13,14]. To
facilitate the export of Rhs-toxins through the T6SS an association with a valine-glycine repeat
protein G (VgrG) is required [15]. The VgrG proteins are predicted to serve as molecular
chaperones for Rhs-toxins during the secretion process. Studies performed in D. dadantii, E.
cloacae, and P. aeruginosa have demonstrated that the deletion of the vgrG gene disrupts the
secretion of Rhs-toxins via the T6SS [16,17,18]. Bioinformatic analysis like the one conducted
by Zhang et al (2011) have begun to elucidate the predicted functional diversity of polymorphic
toxin families, from which Rhs-toxins are only one member [19]. Rhs-toxin are predicted to
24
have a diversity of toxic domains embedded in their C-terminal tips, and thus, perform various
functions including forming pores in the cell membrane of target cells, degrading phospholipids,
breaking down DNA, attacking peptide bonds with amino groups or metals ions (zinc, cobalt,
manganese, nickel, or copper) using water molecules to break chemical bonds (e.g. deaminases
and metallopeptidases, respectively), and disrupting key cellular processes like cell growth and
transcription regulations using ADP-ribosylases [15]. However, the function of many Rhs-toxins
remains unknown. More importantly, the role of Rhs-toxins in niche-specific colonization and
Advancements in genomic and sequencing technologies have enhanced and broadened the
communities [20]. These advancements have enabled fast throughput genome sequencing and
their upload to publicly accessible databases, revealing the biochemical diversity, mechanisms,
structure, and predicted functions of toxin molecules. [21]. One of the most accurate methods to
identify toxin families is through sequence similarity analysis which helps determine sequence
homology between a known and an unknown protein sequence [22]. An initial Rhs-toxin
alignment of full sequences allows for the visualization of the conserved N-terminal and the
hypervariable nature of the C-terminal tips, which is one of the main characteristics of
polymorphic toxins [23] After creating an alignment, novel sequences can be detected using
BLAST and SSEARCH, and other probabilistic models like Hidden Markov Models (HMM)
[24,25, 26].
Numerous studies offer insights into the role of polymorphic toxins like those in the Rhs-toxin
family in human and animal bacterial pathogens [23,27,28]. However, there is little information
25
concerning the diversity and function of Rhs-toxins in bacterial plant pathogens, one of the
most destructive agents of plants that limit maximum attainable yields in many crops. Our
research aim is to build a Rhs-toxin user-friendly database through an online portal that can
serve the scientific community to 1) extract and identify Rhs-toxin in their favorite bacterial
research and tools developed with the data collected and analyzed here, we will better
understand the diversity and function of Rhs-toxins, their ecological role, and evolution. A
long-term goal is to apply the knowledge gained from basic functional studies and ecological
niche theory to develop targeted on-demand disease management tools for biological control
Data sources
Whole genome sequences for Xanthomonas, Ralstonia, Pectobacterium, and Dickeya genera
were downloaded on June 25, 2023, from the National Center for Biotechnology Information
(Supplemental Table 1) Additionally, two genomes from Escherichia coli K-12, and P.
aeruginosa PAO1 were also retrieved and used as outgroups in phylogenetic analysis.
An Average Nucleotide Identity (ANI) analysis for all complete genomes from each of the four
bacterial genera and associated species were generated [29]. The results were used to remove
duplicate bacterial whole genome sequence (WGS) (i.e., 100% similar at the nucleotide level).
Only one representative WGS was chosen at random to mitigate selection bias in Rhs-toxin
26
abundance and diversity analysis [29]. The final WGS used are listed in Supplemental Table 2
with their respective characteristics. A perl script was used to convert nucleotide WGS
GenBank files into protein CDS (Appendix 1). A HMM-based method developed by Ameen et
al. (2021), named Rhs-HMM, was used to extract putative Rhs-toxin protein sequence using a
43 amino acids (aa) sequence that included the conserved 10 aa motif PxxxxDPxGL. All
putative Rhs-toxin sequences were downloaded in FASTA format for each bacterial genus. All
these sequences were passed through the NCBI Conserved Domains database to confirm the
Data Curation
All-putative Rhs-toxin sequence files for each bacterial genera were aligned, visually inspected,
[8,59]. Briefly, protein sequences that only had the 43 aa sequence were excluded from further
study. Custom bash scripts were applied on the final curated Rhs-toxin data files to identify
Rhs-toxin abundance per genome and their protein length (Appendix 1).
N-terminal phylogeny
To determine the number of Rhs-toxin families per genus all Rhs-toxin protein sequences from
each bacterial strain for each genus were aligned and analyzed. The sequence beyond the
demarcated 10 aa motif that demarcates the end of the conserved N-terminal and the start of the
hypervariable C-terminal tip was removed for all sequences for this analysis [8]. Final Rhs N-
terminal aa sequences were aligned using MAFFT [32] and the auto-alignment strategy
alignments were visually inspected in Geneious v10.1.3 for the Rhs core domain motifs
27
“RVxxxxxxxG '' which are typically 776-888 amino acids upstream of the PxxxxDPxGL motif
[8]. This 750-900 bp region is considered the “core region” and is housed within the N-terminal
of the Rhs-toxin architecture. Once identified, the core region was extracted and realigned to
infer a maximum likelihood tree using RAxML v8.2.12, based on a GTR matrix-based model of
amino acid substitution rates and 1,000 bootstrap replicates [33]. The phylogenies were
visualized using the upload tree option of the Tree-Based Alignment Selector (T-BAS, version
2.3) toolkit (https://tbas.hpc.ncsu.edu/) [34,35]. The clades for each genus defined a Rhs family.
Similar to what was described for N-terminal analysis was done for C-terminal tips, except that
any sequence before the 10 aa motif, PxxxxDPxGL, was removed following previously
described methods [59]. The final curated C-terminal tip aa sequences were aligned and used to
run an all vs all blastP search to create a similarity network analysis (SNA). The SNA was
saved in xgmml format and visualized in Cytoscape v3.3.0. The Pfam database was used to
assign predicted functions to all C-terminal tip sequences in this study [36,37,38].
species Each species can have multiple pathovars and together cause vascular foliar diseases on
approximately 400 plant hosts, including agricultural crops and ornamentals [39, 40]. As of
June 2023, 509 complete genome sequences from 22 Xanthomonas species were downloaded
from the NCBI public database. These genomes were sequenced using Illumina, MinION, and
28
PacBio technologies [41]. Among these, the majority of closed genomes belong to X.
campestris, X. citri, and X. oryzae. Strains in these three species have been extensively studied
and sequenced since the 1980s due to their economic importance on agronomic hosts [42,43].
X. campestris causes Black rot disease in brassicas, including cabbage, broccoli, brussels
sprouts, and cauliflower. X. citri is the causal agent of citrus canker, affecting important fruit
crops such as orange, lime, and grapefruit. X. oryzae infects rice causing bacterial blight and
genera studied we had to ensure data accuracy, decrease bias, and eliminate redundancy in
WGS. This was partially accomplished using an Average Nucleotide Identity (ANI) analysis
where identical (100% nucleotide identity), mislabeled, and duplicated genomes were removed
from the original 509 downloaded genomes leaving a total of 207 genomes in the Xanthomonas
database. This was key for our analysis given the recent study that reports that 36% of genomes
from Aeromonas in GenBank were inaccurately categorized or duplicates [48]. Similarly, of the
509 Xanthomonas WGS downloaded from NCBI for this study 59% were duplicates. Prior to
data curation, the Rhs-HMM developed by Ameen et al 2021 retrieved a total of 707 Rhs-toxin
sequences among 20 Xanthomonas species (Supplemental Table 1). Of these, the number of
Rhs-toxin found in one genome ranged from as little as one to as many as 12 toxins. Notably,
did not have any Rhs-toxin sequences according to our Rhs-HMM (Supplemental Table 3,
Supplemental Figure 2.1). The multiple sequence alignment with the 707 Rhs-toxin sequences
enabled visualization of the variation among Rhs-toxin sequence lengths (Supplemental Figure.
2.2). The majority of sequences consisted of the full Rhs-toxin which is defined as toxins with
29
the N-terminal and C-terminal tip. A second group consisted of the orphan C-terminal tips,
while other groups housed several sequences that had incomplete N-terminals or lacked C-
terminal tips entirely. A small group of sequences consisted of the 43 aa motif only. Among all
the Xanthomonas protein sequences only one, from strain AM6, was larger than 2000 aa.
Xanthomonas strain AM6 had no species designation and was isolated from water in the
Shandong province of China [49]. The NCBI conserved domain database identified a TcdB
pore-forming domain within the sequence, known to play a role as an insecticidal toxin [50]. A
total of 258 of the 604 Rhs-toxin sequences were in the range of 1400-1700 aa long. This is in
accordance with previous research that indicates that full-length Rhs-toxins are approximately
1400 aa an above (as depicted in Figure 1.1 B) [11]. Sequences that were approximately half
the size of a full length Rhs-toxin, in the range 400-700 aa were also found. These sequences
have a shorter N-terminal or C-terminal compared to the full-length toxins. Interestingly, some
of these half size Rhs-toxin sequences are flanked by transposable elements (Supplemental
Figure 6). Short Rhs-toxin sequences ranging from 120 to 350 aa were also identified in this
dataset indicating the presence of orphan Rhs C-terminal tips, however, any protein sequence of
<90 aa was considered inconsequential due to size and removed from further analysis. (Figure
1.1 B, Supplemental Figure 2.2). This analysis further excluded 34 genomes that did not have
any predicted Rhs-toxin in their genomes and eight sequences where the only toxins retrieved
matched the 43 aa Rhs HMM motif. The final curated Xanthomonas genomes dataset included
A total of 604 Rhs-toxins remained in the Xanthomonas Rhs-toxin dataset (Supplemental Table
1 and Figure 2.1). Of these 122 were classified as orphan module C-terminal tips because they
30
lacked a corresponding Rhs N-terminal sequence. X. citri, X. prunicola, and X. oryzae were
found to house the highest number of predicted orphan module C-terminal tips, up to five, while
other species had none (Figure. 2.2). An alignment of the 604 Rhs-toxin sequences across 19
length supporting the hypothesis that Rhs-toxins are undergoing strong positive selection.
Furthermore, the high number of orphan module C-terminal tips, reservoirs for toxin diversity,
encoded in the genomes of Xanthomonas spp. indicate extensive competition among the species
Rhs-toxins, categorized as polymorphic toxins, consist of two general components. (Figure 1.1
B). The N-terminal facilitates Rhs-toxin secretion from the producing strain to the target strain,
whereas the C-terminal region carries the active toxin domain [19,58]. Given that rhs genes are
predicted to undergo recombination through the exchange of their C-terminal tips, full length
Rhs-toxin sequences were separated into N-terminal and C-terminal sequences. The N-terminal
was used to group conserved N-terminals into families for each genus, and the C-terminal tip to
determine the predicted killing function. These analyses together provide significant insights
The two different 10 aa motifs RVxxxxxxxG and PxxxxDPxGL (Figure. 1.1 B) were
used to extract the Rhs-core sequence from the Rhs-toxin N-terminal. Any sequence that did not
contain these two motifs was excluded from the N-terminal alignment (<700 aa). Ultimately,
405 Rhs N- terminal sequences from different Xanthomonas species, five sequences from E.
coli, and one from P. aeruginosa were used to generate an Rhs-toxin phylogeny (as detailed in
Supplemental Table 5 and represented in Figure 2.3 A). A total of nine distinct clades, named I-
31
IX, were identified (Figure 2.3 B). Each clade contained a unique set of Rhs-toxin N-terminal
sequences that originated from a diversity of strains within the genera, indicating shared
homology in their secretion mechanism. Many studies have demonstrated that Rhs-toxins are
translocated from the producing strain to the target strain through the T6SS with the help of a
chaperone or adaptor protein that enables their translocation [51,52, 53]. Many of the T6SS
chaperone proteins are directly upstream of the Rhs-toxin N-terminal and are currently
categorized with domains of unknown function (DUF). However, these DUF proteins are linked
to VgrG and Hcp proteins, two important architectural components of the T6SS [54]. This
correlation indicates the involvement of the T6SS in the translocation of Rhs-toxins that have a
To better understand the association between DUF proteins in the secretion of Rhs-
toxins, the upstream regions of the Rhs-toxin proteins were examined to enable the
identification of putative chaperone proteins. Intriguingly, each of the nine families defined by
our phylogenetic tree exhibited a unique Rhs-toxin system locus protein arrangement in the
upstream region of the Rhs- toxin (Figure 2.3 B). This leads us to hypothesize that the well
supported clades are also functionally distinct. Clade I has a hypothetical protein between the
Rhs-toxin N-terminal sequence and a tssl protein, known to be required for T6S activity [55].
Clades II and III have IPT/TIG domain containing proteins upstream of the toxin that are
known to play a role in as transcription factors, however, the chaperone toxin immediately
upstream of the Rhs protein and after the IPT/TIG are different [56]. Both clades IV and VI had
the full Rhs-toxin and their cognate immunity protein flanked by transposases, indicating the
potential for genetic mobility and rearrangements. Toxins in Clade VIII are associated with a
Type II secretion system (T2SS) structural protein that was 300 aa upstream of the Rhs-toxin
32
and followed by a predicted chaperone protein, suggesting that Rhs-toxins may be secreted by
more than one secretion system. Clade IX on the other hand is associated with a T5SS
autotransporter protein, known for playing a role in bacterial interactions, cytotoxicity, and
immunoregulation [57]. All of these Rhs-toxin system loci architectures hint at the importance
the environment may play in regulating the expression and release of Rhs-toxins, future work
on genomic syntony and functional studies will further elucidate the role of these protein in the
secretion and translocation of Rhs-toxins. Steele and colleagues (2021) discovered a group of
Rhs-toxins carried by Apibacter spp. were closely associated with proteins of the Type IX
secretion system (T9SS) and autotransporters. However, functional studies have not confirmed
this observation. This is further evidence that Rhs- Toxins may have alternative secretion and
delivery mechanisms to the T6SS [58]. Despite the diversity in predicted delivery strategies
observed in Xanthomonas spp., the exact mechanisms behind Rhs-toxin translocation remain
unknown, and only future functional characterization will help elucidate if and how Rhs-toxins
To comprehend the diversity of Rhs-toxins C-terminal tips and their killing mechanisms we
focused on the C-terminal tips, which harbor the toxin mode of action in this family of
polymorphic proteins. First, to identify homologous C-terminal tips, a total of 604 Rhs-toxin C-
terminal sequences with their 10-amino-acid peptide motif "PxxxxDPxGL” were used to perform
an all-vs- all BlastP search. C-terminal tips were considered homologous with members of the
same clade with strong bootstrap support. A total of 181 unique C-terminal tips, named rhs1,
rhs2, rhs3…rhs181, were identified within the genus. Of these 64 unique C-terminal tips formed
33
clusters that encompassed X to two tips sequences and the remaining 42 were singletons. Each
of the 64 clusters housed a unique toxin domain family and for the 42 singletons, no toxin
domain was found. Across bacterial species, numerous full-length Rhs-toxins C-terminal tips
shared homology with C-terminal tips from orphan modules (Figure. 2.4 B). These findings
occurring across different bacterial species in the genus and probably across genera in a
microbial community.
To explore the functional diversity of Rhs-toxin in Xanthomonas we used the C-terminal tip
sequences, known to carry the toxic domain, and subjected them to a blast search against the
Pfam database [59]. Out of the 604 C-terminal tip sequences, only 173 were successfully
recognized by Pfam and assigned a toxin domain/function. No predicted function was predicted
for the remaining 431 C-terminal tips (Figure 2.5). To determine if species, pathovar, or plant
host contributed to C- terminal sequence clustering a similarity network analysis (SNA) was
performed and revealed 96 cluster and 85 singletons (Fig. 2.4 B). The Pfam clusters correlated
clustered together based on shared toxin domains, and thus, function. However, our data
revealed instances where C-terminal tip sequences had the same predicted function and may
carry the same predicted toxin domain but were not homologous as seen with clusters l, 27, and
37. Each of those cluster are composed of sequences that encode predicted Tox-REase-7 toxin
domains. This domain is known to be able to identify and cleave DNA at specific target
sequences [23]. However, upon aligning one representative sequence from each cluster, they
were found to share <40% identity (Figure 2.5). We hypothesize that despite sharing the same
34
toxic domain, these sequences might employ distinct mechanisms of action or target specific
molecular pathways, just like the plant-microbe T3SS effectors proteins can be redundant, so
Various toxin domains were found to share a similar function, such as DNases, Rnases,
proteases, and deaminases, and different Rhs-toxin C-terminal tips have been found to be shared
among different species in the genus. For example, toxin domains MafB19-deam and Xoo_2897-
deam are only found in Rhs C-terminal tips rhs4 and rhs12, and their respective clusters: 4 and 12.
The sequences that make us these clusters originated from genomes in X. translucens and X.
oryzae, both of which are pathogens of cereals; wheat and rice, respectively. Clusters rhs1, rhs2,
rhs5, and rhs7 encompassed C-terminal tip sequences from genomes of four species. In contrast,
some clusters were more species-specific; like Cluster rhs4, containing 17 full-length Rhs-toxin
from X. oryzae and X. translucens, both of which infect cereals. Among these 96 clusters, 40
contain anywhere between six to twelve Rhs-toxin C-terminal sequences from different genomes,
while 27 clusters housed only three to four Rhs-toxins from different genomes. Interestingly, 26
clusters were unique to just two species within the Xanthomonas genus, X. euroxanthea, X. oryaze,,
(Figure 2.4 B). Notably, some clusters were exclusively comprised full length Rhs-toxins, like
rhs2, rhs4, and rhs12. Other smaller and less common clusters exclusively contained orphan C-
terminal tips, like rhs18, rhs19, and rhs30. Additionally, we noticed that some orphan module C-
terminal tips shared homology with C-terminals tips of full length Rhs-toxin sequences, clusters
rhs1, rhs7, rhs8, rhs11, among others. This means that the same full length Rhs-toxins C-terminal
tip and orphan modules C-terminal tip can be found in various genomes of different strains across
different species as a full toxin or orphan module (Fig. 2.4 B, Supplemental Table 6). This data
35
supports the natural occurrence of C-terminal displacement given that the same Rhs C-terminal
Ralstonia is a genus of gram-negative bacteria commonly found in soil and water sources in
warm temperate regions. Currently, six species have been identified within this genus: R.
syzygii. While most Ralstonia species are commonly found in soil and considered environmental
pathogens and are the causal agents of bacterial wilt on over 200 plant species. Economically
significant hosts of these pathogens are tomato, eggplant, pepper, banana, potato, and peanuts, as
well as various ornamental plants and trees [60, 61, 62]. Some strains of R. insidiosa, R.
mannitolilytica and R. picketii are considered human pathogens and mostly affect immuno
Because of their soilborne nature and xylem dwelling lifestyle our research objective was to
investigate how pathogen life cycle impacts the abundance and diversity of Rhs-toxins in
various species of Ralstonia. Toward this goal, we downloaded 131 closed WGS. An ANI
analysis excluded 40 duplicated genomes and a total of 338 Rhs-toxin sequences were extracted
from 91 genomes that were mined with the Rhs-HMM. Anywhere between 1-16 Rhs-toxin
sequences were found per Ralstonia genome, with the highest predicted number of Rhs-toxins
found in R. solanacearum strain CFBP2957, a phylotype IIA, sequevar 36 strain isolated from
tomato in Martinique, French Indies [64]. Interestingly, the Rhs model did not detect any full-
36
length Rhs-toxin sequences in the genomes of R. mannitolilytica, R. insidiosa, and R. picketii.
This indicates enrichment of Rhs-toxins in plant pathogenic Ralstonia species compared to the
environmental species. Of note, however, is that the Rhs HMM did capture two sequences in
strain SN82F48 of R. mannitolilytica. One sequence was 27 aa and the other 134 aa long, both
included the 10vaa peptide motif that is characteristic of Rhs-toxins (PxxxxDPxGL), suggesting
that at some point these species may have had Rhs-toxin encoding genes and lost them or have
A sequence alignment allowed visualization of the 338 Rhs-HMM extracted sequences and
revealed that several Rhs-toxin sequences in the dataset had truncated N-terminals, were
missing C-terminal tips, or both, and only consisted of the conserved 43 aa peptide motif.
Furthermore, any predicted C-terminal tip that was <90 aa long was also removed from further
analysis. A total of 294 Rhs-toxin sequences comprised the Ralstonia Rhs-toxin dataset.
Multiple studies have reported that a full Rhs-toxin is 1,400-1,700 aa long [8, 11, 65]. In our
analysis we discovered 1,200 aa long sequences that display a complete architecture and were
also linked to the T6SS. A total of 122 sequences were found to be in the range of 100-350 aa
long. Investigation into the location of these smaller Rhs-toxin sequences in their respective
bacterial genomes revealed that they were positioned downstream of full-length Rhs-toxins.
This observation suggests that these shorter sequences are orphan modules (Figure 1A). A
Ralstonia genomes had fewer predicted orphan C- terminal tips (Figures 2.2 & 3.2).
Additionally, there were numerous predicted Rhs-toxin sequences that ranged in size between
retained the “core region” of the N-terminal and the C-terminal tip. Interestingly these truncated
37
Rhs-toxins were flanked by transposable elements, like clades IV and VI in Xanthomonas,
implying that they could potentially be pseudogenes or undergoing recombination [65]. This
result suggests that Rhs-toxin protein lengths can be extremely variable, and they can differ
To better understand the functional diversity of Rhs-toxin we divided our data set into a file
with Rhs-toxin N-terminal sequences only and one with C-terminal tips. The N-terminal
sequences, which house the “core region” were used to cluster Rhs-toxins into clades (e.g.,
families) (Figure 3.3 A). The "core" region is defined by Jackson et al (2009) as the hyper-
conserved region within the N-terminal of the Rhs-toxins that is found between two conserved
peptide motif RVxxxxxxxG and PxxxxDPxGL. The latter motif also demarcates the start of the
C-terminal tips. The former can be found 750-900 aa upstream of 10 aa motif [8]. Any sequences
that were shorter than 700 aa from the N-terminal to the 10 aa peptide motifs were removed
based on the observations described above. A total of 114 sequences remained after our final
data curation and were used to build the Rhs-toxins phylogeny of Rhs-toxin N-terminal
sequences for Ralstonia. This analysis revealed three primary Rhs-toxins clades (i.e., families),
outgroup E. coli and P. aeruginosa (Figure 3.3 A). Significant diversity was observed among
and between Rhs-N-terminal clades. Clade I housed Rhs-toxins from genomes that
1 and 3 were linked to a Vgr and Tssl protein, respectively, indicating an associated with the
T6SS. A correlation was found between Rhs-toxins N-terminal sequences that were in cluster I
38
and a DUF1795 protein that was directly upstream of the N- terminal and downstream of the
vgr linked gene (Figure 3.3 B). The correlation, between a DUF and a T6SS was not only found
in this genus but in other genera as well. Like in Xanthomonas we hypothesize that this DUF is
directing the Rhs-toxin into one of the several secretion mechanisms. Unlike clade I, clade III
has a DcrB-related protein upstream of N-terminal sequence and downstream of a Tssl protein
(Figure. 3.3 B). Interestingly, DcrB proteins have been found to participate in phage DNA
transport pathways suggesting that they could also be participating in the transport of Rhs-
toxins in bacteria via an alternative mechanism [66]. Cluster II differs structurally from clades 1
and II in that in the upstream of the N-terminal a gspG protein, known for having an active role
in the Type II secretion system (TIISS), is found along with a protein named Type II secretion
system F family protein. These results support our findings in Xanthomonas above where it is
predicted that the Rhs-toxins in clade VIII are secreted via the T2SS.
blastP analysis to identify homologous and unique Rhs-toxin C-terminal tips. A total 294 C-
terminal tip sequences that included the 10 aa peptide motif were used for this analysis.
Sequences exhibiting >85% similarity were designated arbitrarily as rhs-rhs76.) (Figure 3.4 A). A
presence and absence plot of Rhs-C-terminal tips revealed that several C-terminal tip sequences
were shared among and between strain of same species, while 32 C-terminal tip sequences
displayed no similarity to any other sequence in this dataset, and thus, considered Rhs singleton
tips. The C-terminal tips rhs1 and rhs2 were found in 34 of the 86 complete genomes mined.
Interestingly, both toxins were found in genomes from strains from the three plant pathogenic
species (Figure 3.4 A). The C- terminal tip sequence represented in rhs2 was unique to R.
39
Sequence Similarity Network (SNA) with the 294 C-terminal sequences to better understand
their clustering patterns. The SNA results confirmed that C-terminal tips sequences were not
only clustering by sequence homology but also their predicted toxin domain. A total of 44 Rhs-
toxin C-terminal tips clusters and 22 C-terminal tip singletons were found. Clusters rhs1 and
rhs2 housed the majority of the C-terminal tips in our dataset and corresponded to genomes from
strain in the solanacearum species. In contrast, clusters like rhs3, rhs6 and rhs9 housed C-
pseudosolanacerum, Ralstonia syzygii and Ralstonia sp. Notably, certain clusters exclusively
comprised C-terminal tips from full length Rhs-toxins, as observed in clusters rhs3, rhs10-12,
and rhs14. Smaller and less common clusters exclusively composed from orphan module C-
terminal tip were also observed, like rhs13 and rhs43. Additionally, we noticed like in
Xanthomonas, that C-terminal tip sequences from orphan modules shared homology with C-
terminal tips from full length Rhs-toxin sequences, clusters rhs6, rhs9 and rhs26 (Figure 2.4 B).
This means that the same hypervariable tip of the full Rhs-toxins and orphan modules can be
found among various strains and species (Figure 3.4 B, Supplemental Table 6). These results
provide insight on the diversity of Rhs-toxins in Ralstonia species; however, it also shows that a
soilborne plant pathogen has a lower abundance and diversity of Rhs-toxin compared to a foliar
plant pathogen.
A total of 294 C-terminal tip sequences were submitted to Pfam to predict functional domains
that provide insight into function of the Rhs-toxin C-terminal tips in our dataset. A total of 18
distinct toxin domains were predicted for 96 C-terminal tips while the remaining 200 sequences
had no unique protein domain or motif that enable prediction of function. Dnases, Rnases, and
40
Peptidases were among the leading predicted function for the Rhs-toxin C-terminal tips domains
predicted in Ralstonia. Interestingly, the majority of C-terminal tip sequences clustered into rhs2
which encode a GH-E toxin domain that is believed to degrade nucleic acids (Figure 3.4 B). In
addition, clusters rhs3 and rhs5 were found to encode an AHH toxin domain, predicted to also
function as a Dnase. Like clusters rhs11, rhs27, and rhs37 from Xanthomonas that encode a
Tox-Rease-7 domain, without sharing C-terminal tip homology at the aa level, rhs2 and rhs3
from Xanthomonas also share the same toxin domain but not homology. This is further
evidence that different Rhs-toxin can have the same killing domain but are not identical and
Unique toxins and toxin domains, rhs66 from R. wenshanensis, were also identified
through our analysis. This rhs66 did not share homology with any other C-terminal sequences
in our datasets. According to Pfam results, it encodes a CdiA-CT domain associated with the
contact- dependent growth inhibition (CDI) system. This domain is believed to encode
ribonucleases and when expressed can effectively inhibit the growth of neighboring cells
(Figure 3.5) [67]. Another interesting find was the presence of the peptidase_C80 toxin domain
among various C-terminal tips of different strains of R. solanacearum, cluster rhs;11 (Figure.
3.4 B). These toxins are recognized for containing a cysteine protease domain, which degrades
proteins. Similar domains have been found in toxins from bacterial species such as Clostridium
difficile and Vibrio cholera [68]. These findings shed light on diversity and predicted function
of Rhs-toxins in Ralstonia, however, like in Xanthomonas we were only able to predict function
for <33% of Rhs-toxin being the question: What are the remaining 66% of Rhs-toxins doing?
41
Rhs-toxins of Pectobacterium
vegetables. There are 18 recognized species in the Pectobacterium genus, of which 14 are plant
pathogens and the remaining four; P. fontis, P. aquaticum, P. versatile, and P. polonicum are
environmental microbes isolated from water sources [69,70,71]. Together the plant pathogenic
species are known as the soft rot Pectobacteriaceae (SRP) and are the most common and widely
studied causal agent of soft rot of potato [72]. The primary virulence factors of SRP are
pectinolytic enzymes that are secreted by the T2SS and degrade pectin in the middle lamella and
primary plant cell walls, which results in soft, wet, rotted, diseased tissue [73, 74]. The T2SS is a
multi-protein two-step molecular machine composed of 12-15 different proteins that translocate
folded proteins from the periplasm, through the outer membrane, into the extracellular milieu of
bacterial cells. In addition to the delivery of proteases, lipases, and carbohydrates-active enzymes
to the extracellular space of Gram-negative bacteria, they also release toxins [75]. The SRP can
cause multiple diseases of potato, including tuber soft rot, blackleg, aerial stem rot, and lenticel
rot and can be a destructive disease at every stage of potato production, including planting,
harvest, transport, and storage [76]. Potatoes are produced by planting tubers from the previous
year’s crop. Vascular pathogens like SRP cannot be excluded from tubers, thus seed potatoes
harvested from infected plants will carry inoculum into the next growing season [77].
Unlike Xanthomonas and Ralstonia, which largely depend on the T3SS to inject effector
protein directly into the plant host cells, Pectobacterium pathogens largely depend on the T2SS
[78]. Given that the T2SS has been implicated in toxin release and they are linked to a few Rhs-
toxin clusters in our study, our research objective was to investigate the presence of Rhs-toxins in
42
various species of the genus Pectobacterium to understand their function and the role of the T2SS
in the secretion of these toxins. Towards this goal, 95 complete WGS were downloaded from
NCBI on June 25, 2023. An average Nucleotide Identity (ANI) analysis identified 33 duplicated
genomes which were removed from further analysis. The Rhs-HMM was able to extract 255 Rhs-
toxin sequences from the 62 mined complete WGS (Supplemental Table 1) [79]. Anywhere
between 1-14 Rhs-toxin sequences were found per genome for this genus, with the highest
number observed in P. parmentieri Strain QK-5, which was isolated from a solanaceous weed in
Poland and deposited in NCBI in 2019. The Rhs-HMM failed to detect any Rhs-toxins in the
A multiple sequence alignment of 255 Rhs-toxin sequences was used to pinpoint the 10
amino acid motifs that demarcate the N-terminal and C-terminal regions. The alignment enabled
visualization of the variations among Rhs-toxin sequence lengths (Supplemental Figure. 4.2).
Several sequences had truncated N-terminals, lacked a C-terminal tip or consisted of the 43 aa
motif used by Rhs HMM. However, the majority consisted of the full Rhs-toxin or were a
putative orphan module C-terminal tip. To our surprise a few genomes from strains of P.
encoded Rhs-toxin proteins that exceeded 2000 aa in length. These were similar in size to the one
found in Xanthomonas spp. strain AM6, rhs181 (Figure 2.2, Supplemental Figure 4.2). A NCBI
conserved domain database search on these long Rhs-toxins showed that the additional sequence
also contained the RHS repeats that characterizes Rhs-toxin. In addition to the rare long Rhs-
toxin sequences we also found sequences with the typical length range of 1400-1700 aa
(Supplemental Figure 4.2). Sequences spanning 350-700 aa also appeared in the Pectobacterium
43
dataset displaying truncated N-terminals. Of the total number of Rhs-toxin extracted from the
(Supplemental Figure 4.2). Any protein sequence <90 aa long was considered a truncated C-
terminal tip or a Rhs-HMM artifact and removed from further analysis (Figure 1, Supplemental
Figure 4.2).
considerable diversity in sequence length. The average median for Rhs-toxin per Pectobacterium
genome was between two to four, except for P. parmentieri, which had a median of 12 Rhs-
toxins per genome in our study (Figure 4.1). Interestingly, this same species housed the largest
number of predicted orphan module C-terminal tips (Figure. 4.2) while other species carried 0
majority of the toxin repertoire in Pectobacterium encoded full Rhs-toxin sequences from 1000-
1700aa (Supplemental Figure 4.2 and Figure 4.2). This result suggests that Rhs-toxin protein
lengths can be extremely variable, and that some species can encode more Rhs-toxins than others,
To better understand the diversity of Rhs-toxins among various Pectobacterium species, the
curated Rhs-toxin dataset for this genus was divided into two files, as described above. The “core
region” of 165 N-terminal sequences was used to generate a phylogenetic tree that identified four
Rhs-toxins N-terminal families for this genus, named family I, II, III, and V (Figure 4.3).
Interestingly, all six clades had similar Rhs-toxin system loci architecture in that upstream of the
full length Rhs-toxin there was a DUF protein associated with the T6SS. For example, Family II
44
and VI had a DUF1795 flanked by the full-length Rhs-toxin protein and a type VI secretion
protein, however a protein alignment showed that these two DUF were not homologues. Family 1
and IV have a PAAR domain upstream of the Rhs- toxin in addition to more protein linked to the
T6SS. Of all the families in Pectobacterium, the Rhs- toxins that were found in clade III had no
correlation to any type of secretion system and were exceedingly long 2000 aa toxins. In the
upstream region of Rhs-toxin in this clade, a ssDNA-binding protein (SSB1) was found. The
To explore the diversity of Rhs-toxin C-terminal tip killing mechanisms we conducted an all-
vs- all blastP analysis as described above with a total of 255 C-terminal tip sequences from 60
designated arbitrarily as rhs1-rhs106 (Fig. 4.4 A) Each identifier represents a different C-terminal
tip sequence. A C-terminal tip sequence presence-absence plot revealed that several C-terminal
tips are shared between Pectobacterium species, while 42 C-terminal tips sequence had no
similarity to any other toxin, and thus, unique and a singleton (Fig 4.4 A). A SNA was built with
the 255 C-terminal tip sequences to better understand the sharing of C-terminal tips among strain
and species in the genus. The SNA results confirmed that C-terminal sequences were clustering
according to their toxin domain. A total of 64 distinct Rhs-toxin C-terminal tip clusters were
elucidated. The most abundant Rhs C-terminal tip was rhs6, with representative C-terminal tip
and P. carotovorum. Interestingly, this C- terminal tip was only found among one species of the
plant pathogens and was more abundant in the environmental species. Cluster rhs3, rhs7-rhs9,
and rhs11 were formed by C-terminal tips that originated from genomes of four different species
45
parmentieri; P. Polaris; and P. versatile. Twenty- one rhs clusters were composed of 2-3 Rhs
sequences and were dominated by sequences in genomes from a single species (Figure 4.4 B).
Similarly, to Rhs C-terminal tip clusters in Xanthomonas and Ralstonia some clusters in
Pectobacterium also had C-terminal tips associated with both full-length Rhs- toxin and orphan
module C-terminal tips, cluster rhs2-rhs7. (Fig. 4.4 B, Supplemental Table 5).
A blast analysis of 255 C-terminal (CT) sequences against the Pfam database assigned a toxin
domain to 28 of the C-terminal tip rhs clusters from a total of 102 tip sequences while 153 CT
sequences remained undetermined. Putative functions predicted from sequence motifs embedded
in the C-terminal tips include: Dnases, Rnases, peptidases and deaminases (Fig. 4.5). The
majority of species within the Pectobacterium genera encode a GH-E, AHH, and ParB toxin
domains, all described as DNases in the literature [23]. Cluster 11 C-terminal tip sequences were
predicted to encode an unusual Ntox30 toxin domain, which is a known RNase (Fig. 4.4B). Not
surprisingly, a cdiA-CT toxin domain was predicted to be encoded in Rhs-toxin tips that formed
clusters rhs41, rhs49, rhs55, and rhs63 from genomes of P. parmentieri, P. aquaticum, and P.
brasiliense. This toxin domain has the ability to inhibit the growth of neighboring bacterial cells,
a function previously observed in Rhs-toxin of Dickeya dadantii strain 3937, a cousin genus of
Pectobacterium [11]. Another interesting finding is the toxin domain TNT encoded by C-
terminal sequences in P. parmentieri strain HC and P. versatile strain SR1. This toxin domain is
known as the tuberculosis necrotizing toxin and while it is presumed to act as a protease, the
specific mechanisms of action remain unknown [80]. The predicted function for most of the Rhs-
toxin C- terminal tips in Pectobacterium, Ralstonia, and Xanthomonas represent but the tip of the
iceberg with respect to determining their diversity and killing action as only <30% of all
46
identified Rhs- toxin C-terminal tips can be assigned a predicted function and an even smaller
Rhs-toxins of Dickeya
The only Rhs-toxins that have been somewhat functionally characterized in plant pathogenic
bacteria are rhsA and rhsB from D. dadantii and rhs12 and rhs14 from R. solanacearum [18,81].
Dickeya spp., like some Pectobacterium, are plant pathogens that cause blackleg and soft rot of
potato. Together, these two soft rotting genera are known as the SRP and their main virulence
strategy is the deployment of cell-wall-degrading enzymes that breaks down the plant cell wall
leading to maceration and rot of plant tissue. This genus of plant pathogens comprises 12 species
that affect various economically important vegetables [82, 83, 84, 85]. As a closely related
pathogen of Pectobacterium, Dickeya species are worthy of study for the same reason described
above for Pectobacterium. Furthermore, D. dadantii strain 3937 was the model strain in which
the true function of Rhs-toxins was functionally characterized. This strain and family are
furthermore in the order Enterobacteria, used as the first model for the analysis of Rhs-toxin
phylogeny and the determination of Rhs N-terminal families (e.g., clades) [8]. Much effort has
been invested in understanding the functional role of Rhs-toxin in human pathogens like
Salmonella, Escherichia coli, Klebsiella, and Shigella. However, little is known about Rhs-toxin
species from NCBI (Supplemental Table 1). The ANI revealed four duplicated genomes in this
dataset that were removed leaving a total of 31 unique genomes to run the Rhs-HMM. A total of
144 putative Rhs-toxins were extracted from these genomes, from which we noted that the lowest
47
number of Rhs-toxin sequences found per genome among all strains was two and the highest was
nine. The nine Rhs-toxins sequences were traced back to strain ZXC1 of D. fangzhongdai,
however after further sequence analyses three WGS were removed from further analysis. The
Rhs HMM did not detect any Rhs-toxins in bacterial genomes of D. aquatica, D. lacustris, and
D. poaceiphila (Supplemental Fig 5.1). Interestingly, D. aquatica, D. lacustris were both recently
described as pectolytic bacteria isolated from waterways in France. D. poaceiphila was also
recently isolated from sugarcane and described [86]. Visualization of all the Rhs-toxin for the
genus was used to remove any remaining sequence that could further bias our analysis as
described above. This led to a total of 113 Rhs-proteins being retained. After data curation strains
of D. fangzhongdai, D. dadantii, D. chrysanthemi were found to have the highest number of Rhs-
toxin in the genus, whereas D. zeae, D. dianthicola, and D. parazeae had the lowest number with
2 Rhs-toxin per genome (Figure 5.1). According to the literature, Rhs-toxins typically range in
length from 1,400 to 1,700 aa [8,17]. A total of 77 sequences were in this range for this genus.
Sequences spanning 400- 700 aa were also found, just like they were also found in Xanthomonas,
Ralstonia, and a few Pectobacterium species (Figures 2.2, 3.2, 4.2, 5.2). Unlike the other genera,
Dickeya spp. did not have as many Rhs-toxin sequences in this range. This may be an artifact of
the smaller number of genomes in this dataset, and thus, Rhs-sequences retrieved from them.
Twenty-eight sequences were found to be in the range of 100 to 350 amino acids and further
investigation revealed that they were positioned downstream of full-length Rhs proteins
confirming our hypothesis that these are orphan module-C-terminal tips. (Figure 5.2).
48
N-terminal and C-terminal Clusters of Rhs-Toxins
To better understand the functional diversity of Rhs-toxins, N-terminal and C-terminal regions
were divided into two files as described above (Fig. 1). Three major families were identified for
this genus and all of them house Rhs-toxins N-terminal tips from all Dickeya spp. (Fig. 5.3A).
Family I was the smallest clade, comprising 6 Rhs-sequences from D. zeae, D. solani, D.
dadantii, and D. dianthicola. Family II had N-terminal sequences from almost all species, with
the exception of D. dianthicola, while Family III was the largest family and exhibits the largest
species diversity. Closer examination of the genomic loci that encompassed the Rhs-toxin N-
terminal sequences for each clade, revealed that all three clades were associated with the T6SS
and had very similar Rhs-toxin system loci (Figure 5.3 B). Despite their association with the
TS6S protein, the predicted DUF chaperon protein that immediately preceded the Rhs
polymorphic toxin differs for the three females. Family I and III encode proteins annotated as
DUF1795, but a sequence alignment indicates no shared homology. This is the same DUF1795
that was found upstream of the Rhs-toxin in Pectobacterium clades II and VI, and clade I of
Ralstonia (Figures 3.3, 4.3, & 5.3). Suggesting that these Rhs-toxin employ the same
mechanisms to exit the producing cell or are subjected to the same environmental stimuli that
triggers expression. In contrast, family II of Dickeya N-terminal sequences encodes two proteins
annotated as DUF4123 back-to-back upstream of the Rhs-toxin. This same locus architecture was
also observed in family IV of Pectobacterium. Despite having the same annotation, they also
differ in their aa identity. These results correlate with similar observations from Rhs-toxins of
49
An all-vs- all BlastP analysis was performed with 113 C-terminal tips to identify homologous and
unique C- terminal tips (Supplemental Table 6). C-terminal tips exhibiting >85% similarity were
categorized into arbitrary groups and designated as rhs1-rhs46 (Fig. 5.4 A). A C-terminal tip
presence-absence plot revealed that several C-terminal tips were shared among Dickeya strains
and species, while 16 sequences were unique, and thus, considered Rhs singleton tips (Fig. 5.4A).
A SNA of 113 C- terminal sequences confirmed the results from the Rhs-toxin C-terminal tip
sequences rhs1-rhs46 to form 30 clusters and 16 singleton tips (Figure 5.4 A&B). Like all the
other bacterial genera investigated here, the same C-terminal tips sequence was found as both
full-length-C-terminal tips and orphan modules C-terminal sequence cluster 1, 2, 4, and 5 (Figure
5.4 B).
A BlastP analysis of the 113 C-terminal tip sequences against the Pfam database revealed
13 distinct toxin domains among 63 Rhs-toxin sequences. The predominant toxin domain
among rhs clusters for this genus encoded for DNases and RNases. Most species in the genus
were found to possess toxin domains such as AHH, TOX-HNH-EHH, and NUC, all having the
same predicted function, degrading DNA, (DNases). Notably, Rhs-toxins from D. fangzhondai
and D. Zeae were identified to encode the toxic domain Ntox8, a known RNase. Similar to
Pectobacterium and Ralstonia, we also noticed a CdiA-CT toxic domain in D. Zeae that is
Koskiniemi et al. (2013) revealed that two out of the three Rhs proteins found in D. dadantii
strain 3937 had the ability to inhibit the growth of neighboring cells. In our in-silico research,
The Rhs HMM identified 6 rhs-toxins in the genome of D. dadantii strain 3937 (Supplemental
Table 3). Out of these, four were predicted to be DNases just like Koskiniemi and colleagues
50
functionally confirmed. (Supplemental Table 6). A putative function was not assigned to the two
remaining C-terminal tips in strain 3937, however, they are assigned a locus tag. One of the Rhs-
toxin sequences, DDA3937_RS06980, was 1,658 aa long. The second was a C-terminal tip
sequence, DDA3937_RS06995, that was 346 aa long. Visual analysis of the locus where these
two proteins are found in the genome confirmed that the former is a predicted full-length Rhs-
protein and the latter the associated orphan module C-terminal tip.
Rhs-toxins are diverse and can recombine within bacterial genomes to form new active full-
length Rhs-toxin. It is predicted that this high functional diversity among Rhs-toxin in bacterial
genera are acquired through HGT among closely related species [8, 87]. Here we show how both
full-length Rhs-toxins and orphan module C- terminal tips sequences encoding specific toxin
domains are shared among strains of the same species and between species. The findings shared
thus far emphasize the complexity and diversity within Rhs-toxin systems and their secretion
patterns across plant pathogenic bacteria. To add to the complexity, we compared Rhs-toxin
repertoires among bacterial strains, from different species, among the four bacterial genera
investigated in this study. A total of 165, 87, 60, and 31 complete genomes from Xanthomonas,
Ralstonia, Pectobacterium, and Dickeya, respectively, were used in our study to look at the
abundance and diversity of Rhs-toxin in plant pathogenic bacteria. From these a total of 1266
Rhs-toxin sequences were analyzed (Supplemental Table 1). Briefly, 604, 294, 255, and 113
Rhs-toxin were found among Xanthomonas, Ralstonia, Pectobacterium, and Dickeya species,
respectively. Most genomes within a species and genera had at least one Rhs-toxin encoded in
their genomes, with some exceptions. There were 34 genomes among five species of
Xanthomonas that our Rhs-model was not able to detect any Rhs-genes of significance;
51
Ralstonia had 4 genomes among three species; Pectobacterium 2, and Dickeya 3 each from a
different species. The majority of genomes that were predicted to have no Rhs-toxin were traced
back to species that are either environmental soil or water bacteria. Furthermore, our data
indicate that Rhs-toxin are enriched in plant pathogenic bacteria, although additional complete
The genus Xanthomonas has the highest abundance of Rhs-toxins, but this can be an
artifact of the higher number of complete WGS that are available for some of the species in this
genus. The highest number of Rhs-toxins per genome was found in two strains of P.
parmentieri, WC19161 and QK-5, each encoding 13 putative Rhs-toxins. Through this data set,
we also discovered that nearly 50% of the genomes in each species contained anywhere
between one and three orphan modules, with some exception of some genomes. For example,
strain CFBP 1156 of X. hyacinthi is predicted to encode a total of 10 Rhs-toxins in its genome;
Of these, two are full- length Rhs-toxins (~1400-1700aa), while the remaining eight appear to
be orphan module C- terminal tips (Supplemental Table 3 and Fig. 2.2). This Xanthomonas
pathogen is the causal agent of yellow disease of Hyacinthus, Scilla, and other related
ornamental plants and reported across multiple countries in Europe, Australia, and the USA
[88].
The diversity among Rhs-toxin system architecture and the N-termini that make up the
different clades that are associated with the mode of toxin secretion for these four genera is
extensive. Once again, the N-termini from Rhs-toxins of Xanthomonas exhibit the highest
diversity with nine distinct clades that correlated to unique Rhs-toxin system architecture.
Among the nine different clades identified in Xanthomonas, only Clade I was linked to the
T6SS. This clade was largely made up of N-terminal sequences that originated from X.
52
fragarie, X. translucens, and X. oryzae genomes, the causal agents of an angular spot of
strawberries, bacterial streak of wheat, (and other cereals), and bacterial blight and streak of
rice and other weeds, respectably. Clade II of Xanthomonas was the most diverse, with
representative N-terminal sequences from the majority of species in the genus. This same clade
was linked to genes of the T2SS (Fig. 2.3). The appearance of genes associated with structural
components of the T2SS in Rhs-toxin system architecture was unexpected given the extensive
body of literature that reports the T6SS as the mechanism of Rhs-toxin secretion via contact-
dependent manner [89]. Genes associated with the T2SS were also found in Rhs-toxin system
architectures of Ralstonia. However, the T2SS was not associated with Rhs-toxin systems in
Pectobacterium and Dickeya, plant pathogens that cause soft rot in potatoes (Fig. 3.3, 4.3, 5.3).
As such, these two pathogenic genera rely on the T2SS for delivery of cell-wall-degrading
A substantial number of C-terminal sequences were identified among the total 1267
C- terminal tips analyzed in this study. These sequences were clustered into 233 groups based
on sequence similarity (85%). Once again, Xanthomonas exhibited the majority of clusters with
a total of 96, followed by Pectobacterium, Ralstonia, and Dickeya with 44, 64, and 30,
respectively. Many of these clusters were composed of sequences that originated from a single
species, a few species, or multiple species such as Ralstonia clusters 5 and 12-15; Dickeya
cluster 1-15; and Xanthomonas cluster 2, 4, and 5, respectively (Fig. 2.4). Pfam was able to
assign functional toxin domain to <30% of all clusters from each genus leaving many C-
terminal tip sequence functions a mystery. Xanthomonas encoded the highest number of
predicted toxin domains, and a significant portion of these domains are unique to the genus. For
instance, the two deaminases named XOO_2897-deam and MafB19-deam, which are
53
exclusively found in the genomes of X. translucens and X. oryzae, two bacterial pathogens of
Dickeya, but not in Xanthomonas. CdiA-CT are dependent on cell-to-cell contact, such as that
enabled by the T6SS, to inhibit the target bacterial cells [11].We observed a consistent trend
among these four genera, where C-terminal tip sequences were shared among different species.
This observation is interesting, if a target cell has a very similar C-terminal tip it is assumed
they also carry the cognate immunity protein to protect them from autoinhibition. This implies
that two cells that carry the same Rhs-toxins will be immune to the killing function for that
toxin. An all-vs-all BLASTP analysis of all 1267 C-terminal tips from all four genera elucidated
four clusters of Rhs-toxin C-terminal tips that are shared between Xanthomonas and Ralstonia
at >50% identity. Ralstonia and Dickeya shared one C-terminal cluster where the sequence
shared over 90% identity. Another cluster made up of C-terminal tip sequences from Ralstonia
and Pectobacterium showed 60% identity. The majority of shared clusters were found between
Pectobacterium and Dickeya, with 19 clusters exhibiting more than 80% identity (see Figure.
6). These results confirm our hypothesis that Rhs-toxins are found in different genera and
species of plant pathogenic bacteria. However, this data rejects my original hypothesis that
soilborne plant pathogens have greater abundance and diversity than foliar pathogens.
54
CITED LITERATURE
1. Pappenheimer A. M., Jr (1977). Diphtheria toxin. Annual review of biochemistry, 46, 69–94.
2. Kolmer, J. A., & Wayson, N. E. (1912). A Contribution to the Bacteriology of Diphtheria.
The Journal of Infectious Diseases, 11(1), 56-70. https://doi.org/30075051
3. Odumosu, O., Nicholas, D., Yano, H., & Langridge, W. (2010). AB Toxins: A Paradigm
Switch from Deadly to Desirable. Toxins, 2(7), 1612-1645.
4. Vanden Broeck, D., Horvath, C., & De Wolf, M. J. (2007). Vibrio cholerae: cholera toxin.
The international journal of biochemistry & cell biology, 39(10), 1771–1775.
5. Aoki SK, Pamma R, Hernday AD, Bickham JE, Braaten BA, et al. (2005) Contact-dependent
inhibition of growth in Escherichia coli. Science 309: 1245–1248.
6. Aoki, S. K., Diner, E. J., de Roodenbeke, C. T., Burgess, B. R., Poole, S. J., Braaten, B. A.,
Jones, A. M., Webb, J. S., Hayes, C. S., Cotter, P. A., & Low, D. A. (2010). A widespread
family of polymorphic contact-dependent toxin delivery systems in bacteria. Nature,
468(7322), 439–442.
7. Jacob-Dubuisson, F., Guérin, J., Baelen, S., & Clantin, B. (2013). Two-partner secretion: as
simple as it sounds? Research in microbiology, 164(6), 583–595.
8. Jackson AP, Thomas GH, Parkhill J, Thomson NR (2009) Evolutionary diversification of an
ancient gene family (rhs) through C-terminal displacement. BMC Genomics 10: 584.
9. Lin RJ, Capage M, Hill CW (1984) A repetitive DNA sequence, rhs, responsible for
duplications within the Escherichia coli K-12 chromosome. J Mol Biol 177: 1–18.
10. Poole SJ, Diner EJ, Aoki SK, Braaten BA, t'Kint de Roodenbeke C, et al. (2011) Identification
of Functional Toxin/Immunity Genes Linked to Contact-Dependent Growth Inhibition (CDI)
and Rearrangement Hotspot (Rhs) Systems. PLOS Genetics 7(8): e1002217.
11. Koskiniemi S, Garza-Sánchez F, Sandegren L, Webb JS, Braaten BA, et al. (2014) Selection
of Orphan Rhs Toxin Expression in Evolved Salmonella enterica Serovar Typhimurium.
PLOS Genetics 10(3): e1004255. https://doi.org/10.1371/journal.pgen.1004255
12. Russell, A. B., Hood, R. D., Bui, N. K., LeRoux, M., Vollmer, W., & Mougous, J. D. (2011).
Type VI secretion delivers bacteriolytic effectors to target cells. Nature, 475(7356), 343–347.
13. Barret, M., Egan, F., Fargier, E., Morrissey, J. P., & O'Gara, F. (2011). Genomic analysis of
the type VI secretion systems in Pseudomonas spp.: novel clusters and putative effectors
uncovered. Microbiology (Reading, England), 157(Pt 6), 1726–1739.
14. Coulthurst S. (2019). The Type VI secretion system: a versatile bacterial weapon.
Microbiology (Reading, England), 165(5), 503–515.
15. Zoued, A., Brunet, Y. R., Durand, E., Aschtgen, M. S., Logger, L., Douzi, B., Journet, L.,
Cambillau, C., & Cascales, E. (2014). Architecture and assembly of the Type VI secretion
system. Biochimica et biophysica acta, 1843(8), 1664–1673.
16. Boyer, F., Fichant, G., Berthod, J., Vandenbrouck, Y., & Attree, I. (2009). Dissecting the
bacterial type VI secretion system by a genome wide in silico analysis: what can be learned
from available microbial genomic resources? BMC genomics, 10, 104.
17. Whitney, J. C., Beck, C. M., Goo, Y. A., Russell, A. B., Harding, B. N., De Leon, J.
A., Cunningham, D. A., Tran, B. Q., Low, D. A., Goodlett, D. R., Hayes, C. S., & Mougous,
J. D. (2014). Genetically distinct pathways guide effector export through the type VI
secretion system. Molecular Microbiology, 92(3), 529-542.
18. Koskiniemi, S., Lamoureux, J. G., Nikolakakis, K. C., t'Kint de Roodenbeke, C., Kaplan, M.
D., Low, D. A., & Hayes, C. S. (2013). Rhs proteins from diverse bacteria mediate
55
intercellular competition. Proceedings of the National Academy of Sciences of the United
States of America, 110(17), 7032–7037.
19. Pei, T. T., Li, H., Liang, X., Wang, Z. H., Liu, G., Wu, L. L., Kim, H., Xie, Z., Yu, M., Lin,
S., Xu, P., & Dong, T. G. (2020). Intramolecular chaperone-mediated secretion of an Rhs
effector toxin by a type VI secretion system. Nature communications, 11(1), 1865.
20. Zhang, D., Iyer, L. M., & Aravind, L. (2011). A novel immunity system for bacterial nucleic
acid degrading toxins and its recruitment in various eukaryotic and DNA viral systems.
Nucleic acids research, 39(11), 4532–4552.
21. Handelsman J. (2004). Metagenomics: application of genomics to uncultured
microorganisms. Microbiology and molecular biology reviews: MMBR, 68(4), 669–685.
22. Pearson, W. R., & Lipman, D. J. (1988). Improved tools for biological sequence
comparison. Proceedings of the National Academy of Sciences of the United States of
America, 85(8), 2444–2448.
23. Zhang, D., de Souza, R. F., Anantharaman, V., Iyer, L. M., & Aravind, L. (2012).
Polymorphic toxin systems: Comprehensive characterization of trafficking modes,
processing, mechanisms of action, immunity and ecology using comparative genomics.
Biology direct, 7, 18.
24. Altschul, S. F., Madden, T. L., Schäffer, A. A., Zhang, J., Zhang, Z., Miller, W., & Lipman,
D. J. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs. Nucleic acids research, 25(17), 3389–3402.
25. Smith, T. F., & Waterman, M. S. (1981). Identification of common molecular
subsequences. Journal of molecular biology, 147(1), 195–197.
26. Johnson, L. S., Eddy, S. R., & Portugaly, E. (2010). Hidden Markov model speed heuristic
and iterative HMM search procedure. BMC bioinformatics, 11, 431.
27. Zhou, C. E., Smith, J., Lam, M., Zemla, A., Dyer, M. D., & Slezak, T. (2007). MvirDB—a
microbial database of protein toxins, virulence factors and antibiotic resistance genes for bio-
defence applications. Nucleic acids research, 35(suppl_1), D391-D394.
28. Negi, S. S., Schein, C. H., Ladics, G. S., Mirsky, H., Chang, P., Rascle, J. B., ... & Braun, W.
(2017). Functional classification of protein toxins as a basis for bioinformatic screening.
Scientific reports, 7(1), 13940.
29. Goris J, Konstantinidis KT, Klappenbach JA, Coenye T, Vandamme P, Tiedje JM (2007)
DNA- DNA hybridization values and their relationship to whole-genome sequence
similarities. Int J Syst Evol Microbiol 57:81–91.
30. Marchler-Bauer, A., & Bryant, S. H. (2004). CD-Search: protein domain annotations on
the fly. Nucleic acids research, 32(Web Server issue), W327–W331.
31. Marchler-Bauer, A., Lu, S., Anderson, J. B., Chitsaz, F., Derbyshire, M. K., DeWeese-Scott,
C., Fong, J. H., Geer, L. Y., Geer, R. C., Gonzales, N. R., Gwadz, M., Hurwitz, D. I.,
Jackson, J. D., Ke, Z., Lanczycki, C. J., Lu, F., Marchler, G. H., Mullokandov, M.,
Omelchenko, M. V., Robertson, C. L., … Bryant, S. H. (2011). CDD: a Conserved Domain
Database for the functional annotation of proteins. Nucleic acids research, 39, D225–D229.
32. Katoh, K., Rozewicki, J., and Yamada, K. D. (2019). MAFFT online service: multiple
sequence alignment, interactive sequence choice and visualization. Briefings in
Bioinformatics 20: 1160– 1166.
33. Gatto, L., Catanzaro, D., & Milinkovitch, M. C. (2007). Assessing the applicability of the
GTR nucleotide substitution model through simulations. Evolutionary bioinformatics online,
2, 145– 155.
56
34. Carbone, I., White, J. B., Miadlikowska, J., Arnold, A. E., Miller, M. A., Kauff, F., U'Ren, J.
M., May, G., & Lutzoni, F. (2017). T-BAS: Tree-Based Alignment Selector toolkit for
phylogenetic- based placement, alignment downloads and metadata visualization: an example
with the Pezizomycotina tree of life. Bioinformatics (Oxford, England), 33(8), 1160–1168.
35. Carbone, I., White, J. B., Miadlikowska, J., Arnold, A. E., Miller, M. A., Magain, N., U'Ren,
J. M., & Lutzoni, F. (2019). T-BAS Version 2.1: Tree-Based Alignment Selector Toolkit for
Evolutionary Placement of DNA Sequences and Viewing Alignments and Specimen Metadata
on Curated and Custom Trees. Microbiology resource announcements, 8(29), e00328-19.
36. Zhang W., Pengcheng D., Han Z. et al. (2014) Whole-genome sequence comparison as a
method for improving bacterial species definition. J. Gen. Appl. Microbiol., 60, 75–78.
37. Medlar, A. J., Törönen, P., & Holm, L. (2018). AAI-profiler: fast proteome-wide exploratory
analysis reveals taxonomic identity, misclassification, and contamination. Nucleic acids
research, 46(W1), W479–W485.
38. Sonnhammer, E. L., Eddy, S. R., & Durbin, R. (1997). Pfam: a comprehensive database of
protein domain families based on seed alignments. Proteins, 28(3), 405–420.
39. Burkholder, W.H. (1957) Genus II. Xanthomonas Dowson 1939. In: Bergey's Manual of
Determinative Bacteriology, 7th edn. (R.S. Breed, E.G.D. Murray and N.R. Smith, eds), pp.
152–
183. Baltimore, MD: Williams & Wilkins Co.
40. Sharma, A., Gautam, S., & Wadhawan, S. (2014). Xanthomonas. Encyclopedia of Food
Microbiology (Second Edition), 811-817.
41. Sayers, E. W., Bolton, E. E., Brister, J. R., Canese, K., Chan, J., Comeau, D. C., Connor, R.,
Funk, K., Kelly, C., Kim, S., Madej, T., Marchler-Bauer, A., Lanczycki, C., Lathrop, S., Lu,
Z., Thibaud-Nissen, F., Murphy, T., Phan, L., Skripchenko, Y., Tse, T., … Sherry, S. T.
(2022). Database resources of the national center for biotechnology information. Nucleic
acids research, 50(D1), D20–D26.
42. Hayward, A.C. (1993) The hosts of Xanthomonas. In: Xanthomonas ( J.G. Swings and
E.L. Civerolo, eds), pp. 1–119. London: Chapman & Hall.
43. Vauterin, L., Swings, J., Kersters, K., Gillis, M., Mew, T.W., Schroth, M.N., Palleroni,
N.J., Hildebrand, D.C., Stead, D.E., Civerolo, E.L., Hayward, A.C., Maraite, H., Stall, R.E.,
Vidaver, A.K. and Bradbury, J.F. (1990) Towards an improved taxonomy of Xanthomonas.
Int. J. Syst. Bacteriol. 40, 312–316.
44. Bradbury, J.F. (1986) Guide to Plant Pathogenic Bacteria. Slough: CAB International.
45. Williams, P.H. (1980) Black rot: a continuing threat to world crucifers. Plant Dis. 64, 736–
742.
46. Ference, C. M., Gochez, A. M., Behlau, F., Wang, N., Graham, J. H., & Jones, J. B. (2018).
Recent advances in the understanding of Xanthomonas citri ssp. Citri pathogenesis and citrus
canker disease management. Molecular Plant Pathology, 19(6), 1302-1318.
47. White, F. F., & Yang, B. (2009). Host and pathogen factors controlling the rice-
Xanthomonas oryzae interaction. Plant physiology, 150(4), 1677–1686.
48. Figueras, M. J., Beaz-Hidalgo, R., Hossain, M. J., & Liles, M. R. (2014). Taxonomic
Affiliation of New Genomes Should Be Verified Using Average Nucleotide Identity and
Multilocus Phylogenetic Analysis. Genome Announcements, 2(6).
49. Li, T., Li, Y., Ma, X., Dan, X., Huang, X., Li, Q., Lei, S., Zhang, Z., Huang, S., Jiang, W.,
Yu, Y., & He, Y. Q. (2022). Comparative Genomic Analysis of Two
57
Xanthomonas oryzae pv. oryzae Strains Isolated from Low Land and High Mountain Paddies
in Guangxi, China. Frontiers in microbiology, 13, 867633.
50. Yang, G., & Waterfield, N. R. (2013). The role of TcdB and TccC subunits in secretion of
the Photorhabdus Tcd toxin complex. PLoS pathogens, 9(10), e1003644.
51. Jamet, A. and X. Nassif (2015) New players in the toxin field: polymorphic toxin systems in
bacteria. MBio, 2015. 6(3).
52. Günther, P., Quentin, D., Ahmad, S., Sachar, K., Gatsogiannis, C., Whitney, J. C., & Raunser,
S. (2022). Structure of a bacterial Rhs effector exported by the type VI secretion system.
PLoS pathogens, 18(1), e1010182.
53. Cianfanelli, F. R., Alcoforado Diniz, J., Guo, M., De Cesare, V., Trost, M., & Coulthurst, S.
J. (2016). VgrG and PAAR Proteins Define Distinct Versions of a Functional Type VI
Secretion System. PLoS pathogens, 12(6), e1005735.
54. Unterweger D., Kostiuk B., Pukatzki S. (2016). Adaptor proteins of type VI secretion system
effectors. Trends Microbiol. 25, 8–10. 10.1016/j.tim.2016.10.003
55. Lopez, J., Ly, P. M., & Feldman, M. F. (2020). The Tip of the VgrG Spike Is Essential to
Functional Type VI Secretion System Assembly in Acinetobacter baumannii. mBio, 11(1),
e02761-19.
56. Siponen, M. I., Wisniewska, M., Lehtiö, L., Johansson, I., Svensson, L., Raszewski, G.,
Nilsson, L., Sigvardsson, M., & Berglund, H. (2010). Structural Determination of Functional
Domains in Early B-cell Factor (EBF) Family of Transcription Factors Reveals Similarities
to Rel DNA- binding Proteins and a Novel Dimerization Motif. Journal of Biological
Chemistry, 285(34), 25875-25879.
57. Clarke, K. R., Hor, L., Pilapitiya, A., Luirink, J., Paxman, J. J., & Heras, B. (2022).
Phylogenetic Classification and Functional Review of Autotransporters. Frontiers in
Immunology, 13, 921272.
58. Steele, M. I., & Moran, N. A. (2021). Evolution of Interbacterial Antagonism in Bee Gut
Microbiota Reflects Host and Symbiont Diversification. mSystems, 6(3), e00063-21.
59. Sonnhammer, E. L., Eddy, S. R., & Durbin, R. (1997). Pfam: a comprehensive database of
protein domain families based on seed alignments. Proteins, 28(3), 405–420.
60. Ahmed, W., Yang, J., Tan, Y., Munir, S., Liu, Q., Zhang, J., Ji, G., & Zhao, Z. (2022).
Ralstonia solanacearum, a deadly pathogen: Revisiting the bacterial wilt biocontrol practices
in tobacco and other Solanaceae. Rhizosphere, 21, 100479.
61. Kelman, A. (1953). The bacterial wilt caused by Pseudomonas solanacearum. North
Carolina Agricultural Experiment Station Technical Bulletin, 99, 194 p.
62. Yabuuchi E, Kosako Y, Yano I, Hotta H, Nishiuchi Y (1995) Transfer of two Burkholderia
and an Alcaligenes species to Ralstonia gen. Nov.: Proposal of Ralstonia pickettii, Doudoroff
1973) comb. Nov., Ralstonia solanacearum (Smith 1896) comb. Nov. and Ralstonia
eutropha (Davis 1969) comb. Nov. Microbiol Immunol 39(11):897–904
63. Ryan MP, Pembroke JT, Adley CC (2011) Ralstonia. In: Liu D (ed) Molecular detection of
human bacterial pathogens. Taylor & Francis CRC Press, Boca Raton, pp 769–778
64. Remenant, B., Coupat-Goutaland, B., Guidot, A., Cellier, G., Wicker, E., Allen, C., Fegan,
M., Pruvost, O., Elbaz, M., Calteau, A., Salvignol, G., Mornico, D., Mangenot, S., Barbe, V.,
Médigue, C., & Prior, P. (2010). Genomes of three tomato pathogens within the Ralstonia
solanacearum species complex reveal significant evolutionary divergence. BMC genomics,
11, 379.
58
65. Donato, S. L., Beck, C. M., Jensen, S. J., Ruhe, Z. C., Cunningham, D. A., Singleton, I., Low,
D. A., & Hayes, C. S. (2020). The β-encapsulation cage of rearrangement hotspot (Rhs)
effectors is required for type VI secretion. Proceedings of the National Academy of
Sciences, 117(52), 33540-33548.
66. Tutar Y. (2012). Pseudogenes. Comparative and functional genomics, 2012, 424526.
67. Samsonov, V. V., Samsonov, V. V., & Sineoky, S. P. (2002). DcrA and dcrB Escherichia
coli genes can control DNA injection by phages specific for BtuB and FhuA receptors.
Research in microbiology, 153(10), 639–646.
68. Beck, C. M., Morse, R. P., Cunningham, D. A., Iniguez, A., Low, D. A., Goulding, C. W., &
Hayes, C. S. (2014). CdiA from Enterobacter cloacae Delivers a Toxic Ribosomal RNase
into Target Bacteria. Structure, 22(5), 707-718.
69. Sturino JM, Pokusaeva K, Carpenter R. Effective Sequestration of Clostridium difficile
Protein Toxins by Calcium Aluminosilicate. Antimicrob Agents Chemother. 2015
Dec;59(12):7178-83. doi: 10.1128/AAC.05050-14. Epub 2015 Jul 6. PMID: 26149988;
PMCID: PMC4649210.
70. Pédron, J., Bertrand, C., Taghouti, G., Portier, P., & Barny, M. A. (2019). Pectobacterium
aquaticum sp. nov., isolated from waterways. International journal of systematic and
evolutionary microbiology, 69(3), 745–751.
71. Waleron, M., Misztak, A., Waleron, M., Jonca, J., Furmaniak, M., & Waleron, K. (2019).
Pectobacterium polonicum sp. nov. isolated from vegetable fields. International journal of
systematic and evolutionary microbiology, 69(6), 1751–1759.
72. Charkowski A. O. (2018). The Changing Face of Bacterial Soft-Rot Diseases. Annual review
of phytopathology, 56, 269–288.
73. Abbott, D. W., & Boraston, A. B. (2008). Structural biology of pectin degradation by
Enterobacteriaceae. Microbiology and molecular biology reviews: MMBR, 72(2), 301–316.
74. Korotkov, K. V., & Sandkvist, M. (2019). Architecture, Function, and Substrates of the Type
II Secretion System. EcoSal Plus, 8(2), 10.1128/ecosalplus.ESP-0034-2018.
75. Stevenson, W. R., Loria, R., Franc, G. D., & Weingartner, D. P. (Eds.). (2001). Compendium
of potato diseases.
76. Frost, K. E., Groves, R. L., & Charkowski, A. O. (2013). Integrated Control of Potato
Pathogens Through Seed Potato Certification and Provision of Clean Seed Potatoes. Plant
disease, 97(10), 1268–1280.
77. Büttner, D., & He, S. Y. (2009). Type III protein secretion in plant pathogenic bacteria. Plant
physiology, 150(4), 1656–1664.
78. Meyer, R. R., & Laine, P. S. (1990). The single-stranded DNA-binding protein of
Escherichia coli. Microbiological reviews, 54(4), 342–380.
79. Ameen S., Huguet-Tapia J., Huerta I. A. (2021) Diversity, Function, and Role of Rhs
Polymorphic Toxins within the Ralstonia solanacearum Species Complex
80. Pajuelo, D., Tak, U., Zhang, L., Danilchanka, O., Tischler, A. D., & Niederweis, M. (2021).
Toxin secretion and trafficking by Mycobacterium tuberculosis. Nature Communication
81. Huerta, A.I., (2015) Identification and characterization of interstrain chemical weapons in
a major plant pathogen. ProQuest Dissertations Publishing, University of Wisconsin-
Madison.
82. Mansfield, J., Genin, S., Magori, S., Citovsky, V., Sriariyanum, M., Ronald, P., Dow, M.,
Verdier, V., Beer, S. V., Machado, M. A., Toth, I., Salmond, G., & Foster, G. D. (2012).
59
Top 10 plant pathogenic bacteria in molecular plant pathology. Molecular plant pathology,
13(6), 614–629.
83. Pédron, J., Mondy, S., des Essarts, Y. R., Van Gijsegem, F., & Faure, D. (2014). Genomic
and metabolic comparison with Dickeya dadantii 3937 reveals the emerging Dickeya
solani potato pathogen to display distinctive metabolic activities and T5SS/T6SS-related
toxin repertoire. BMC genomics, 15, 283.
84. Samson, R., Legendre, J. B., Christen, R., Saux, M. F., Achouak, W., & Gardan, L. (2005).
Transfer of Pectobacterium chrysanthemi (Burkholder et al. 1953) Brenner et al. 1973 and
Brenneria paradisiaca to the genus Dickeya gen. nov. as Dickeya chrysanthemi comb. nov.
and Dickeya paradisiaca comb. nov. and delineation of four novel species, Dickeya dadantii
sp. nov., Dickeya dianthicola sp. nov., Dickeya dieffenbachiae sp. nov. and
Dickeya zeae sp. nov. International journal of systematic and evolutionary microbiology,
55(Pt 4), 1415–1427.
85. Hussain, M. B., Zhang, H. B., Xu, J. L., Liu, Q., Jiang, Z., & Zhang, L. H. (2008). The
acyl- homoserine lactone-type quorum-sensing system modulates cell motility and virulence
of Erwinia chrysanthemi pv. zeae. Journal of bacteriology, 190(3), 1045–1053.
86. Hugouvieux-Cotte-Pattat, N., Brochier-Armanet, C., Flandrois, J. P., & Reverchon, S.
(2020). Dickeya poaceiphila sp. nov., a plant-pathogenic bacterium isolated from sugar
cane (Saccharum officinarum). International journal of systematic and evolutionary
microbiology, 70(8), 4508–4514.
87. CW Hill, CH Sandt, DA Vlazny, (1994). Rhs elements of Escherichia coli: A family of
genetic composites each encoding a large mosaic protein. Mol Microbiol 12, 865–871.
88. Van Doorn, J., Boonekamp, P. M., & Oudega, B. (1994). Partial characterization of
fimbriae of Xanthomonas campestris pv. hyacinthi. Molecular plant-microbe interactions:
MPMI, 7(3), 334–344.
89. Jurėnas, D., Rosa, L. T., Rey, M., Chamot-Rooke, J., Fronzes, R., & Cascales, E. (2021).
Mounting, structure and autocleavage of a type VI secretion associated Rhs
polymorphic toxin. Nature communications, 12(1), 6998.
60
Chapter III: Concluding Remarks and Future Studies on Rhs-toxin Diversity and
Function in Four Genera of Plant Pathogenic Bacteria
The significant amount of genomic, functional, in-vitro, and in-vivo studies accentuate the
bacteria by both abiotic and biotic factors (e.g., microbes), bacteria have evolved a diversity of
molecular mechanisms to ensure their survival. The diversity of mechanisms a bacterial cell
employs to suppress competitor cell is vast and has been extensively reviewed in the literature
(1,2,3,4). The diversity and redundancy of microbial competition mechanisms provide insight
into the vast number of bacteria and other microbes in a microbial community. This large
microbial diversity in the environment might explain the diversification observed in Rhs-toxins
from pathogenic bacteria in their quest to survive and reach a suitable host. Like the plant-
microbe evolutionary arms-race, bacteria are also in an arms race with other microbes in their
Rhs-toxins have been studied for over 30 years, since CW Hill and colleagues identified
them as accessory elements in E. coli, however, it was only in the last 12 years that in-silico
analysis demonstrated their widespread presence across the bacterial kingdom and their true
function as interspecies bacterial toxins were elucidated [5,8,9,10]. Recent studies have indicated
that deletion of rhs genes are affects a bacterium's competitive abilities, suggesting that these
Rhs-toxin research has centered on bacterial pathogens that infect humans, insects, and
mammals but research on their diversity and role in plant pathogenic bacteria has lagged
61
behind [14,15,16,17]. Many studies that have focused on plant pathogenic organisms have
highlighted the little we know about competition mechanisms in the phytobiome [18,19].
1. What are the relationships between microbes in the soil, rhizosphere, phyllosphere,
This last one has been a goal for more plant pathologist in the last century with very little
success. To achieve biological control of bacterial plant pathogens, we first need to understand
bacterial ecology at a deeper level. A recent study showed that R. solanacearum strain K60
outcompetes two different strains in from the same genus, one in the same species, strain
UW551, and on from a R. pseudosolanacearum a closely related species in tomato stems and
rhizospheres [20,21]. All three strains are capable of infecting the same host but when together
strain K60 has a colonization advantage compared to the other two in tropical conditions.
Research from Ameen et al. in 2021 investigated Rhs-toxin diversity within the Ralstonia
solanacearum species complex, where they found that strain K60 from Ralstonia solanacearum
encodes a higher number of Rhs-toxins than GMI1000 and UW551. These results correlate with
our findings where K60 encodes nine Rhs- toxins, GMI1000 has 6, and UW551 3, all after
curation. Together these findings suggest that due to a higher abundance and a diverse Rhs-
toxins repertoire, strain K60 has great capability to compete and succeed against other closely
related strains.
62
Rhs-toxin abundance and diversity research was limited and incomplete due to scarcity of
high-quality WGS sequencing. Our data is in agreement with published literature that shown
rhs genes tend to appear at the end of contigs, which might contribute to Rhs-toxin abundance
bias [22]. Thus, a complete WGS is essential to understand the true abundance, diversity, and
In this study we hypothesized that soilborne plant pathogens have a higher abundance and
diversity of Rhs-toxins repertoires in comparison to pathogens that affect foliar tissue. To test
this hypothesis, an in- silico analysis using the Rhs-HMM developed by Yaken et al 2021 was
used to mine WGS of four different bacterial genera: Xanthomonas, a foliar pathogen; Ralstonia,
a soilborne vascular pathogen; Pectobacterium, a soilborne sot rot pathogen; and Dickeya, also a
soilborne soft rot pathogen. The data set included a total of 343 genome from these four genera.
Rhs-toxin abundance comparisons between bacterial pathogens based on soil or foliar lifestyles
show that the foliar pathogen, Xanthomonas, had a higher number of Rhs-toxins in the genus
compared to soilborne pathogens. This evidence rejects our primary hypothesis at the genus
level. However further analysis of this dataset comparing species that infect the same host and
have different lifestyles may indicate differently. Among the genus, there are 21 Xanthomonas
species, each can infect one to multiple hosts. A study from Zhang et al in 2019 revealed that
Xanthomonas is one of the most abundant bacterial genera within the seed microbiome [23]. The
mechanisms suggesting that Xanthomonas as a whole may be interaction with a higher diversity
Xanthomonas, like for many bacteria, has been influences by horizontal gene transfer, it is
estimated that 5-25% of Xanthomonas genomes was gained through recombination [24,25,26].
63
This may explain why our data elucidates more sharing of Rhs-toxins gens by multiple
repertoire in comparison to soilborne bacterial genera, we show that each genus is functionally
diverse within species and even between strains in a species. Most genomes mined have between
1-13 Rhs-toxins encoded in their genomes with various protein lengths. We observed that the
majority of genomes encode at least one full-Rhs toxin structure that includes the N-terminal and
C-terminal tip. At least 80% percent of genomes housed at least one orphan module C-terminal
tip, with the highest genome displaying eight putative orphan modules. We were surprised to
find that in each bacterial genera, there were genomes that were predicted to not encode any
orphan modules with the methods we used (Table 2). In Ralstonia there were two genomes with
zero Rhs- toxins, R. picketii and R. insidiosa. The two strains in the species are classified as
environmental bacteria are found in soil and water sources. Both have been reported as pathogens
in humans that are immunocompromised [27,28]. Since these two species are not true plant
pathogens and have a different lifestyle than plant pathogenic species, we believe that their lack
of Rhs-toxins is due the lack of competition with other microbes in the niche. Similar results
were found in D. lacustris and D. aquatica. Strain from both species were isolated from water
systems [29,30,31]
Koskiniemi et al in 2013, suspected that bacteria were share Rhs-toxin domains, our findings
confirm that indeed, Rhs-toxin C-terminal tips sequences, and this, domains are being shared
within, between, and among bacterial genera. A SNA shows that Ralstonia and Xanthomonas
share one Rhs-toxin, both bacteria can colonize the vascular tissue and continue their life cycle
inside the plant [32,33,34]. Rhs-toxin sharing was also observed among the soilborne pathogens.
64
Ralstonia shared one Rhs-toxin with both Pectobacterium and Dickeya. While Pectobacterium
and Dickeya shared 38 Rhs-toxins. Predicted function of for some of the Rhs-toxin C-terminal
tips that are shared among bacterial genera include endonucleases: AHH, LHH, NUC, PARB
and GIY-YIG. These toxin domains break the nucleotide chain into two or more pieces
The secretion mechanism of Rhs-toxins into target cells or the extracellular millue remains
unexplored. Much of the Rhs-toxin literature links Rhs-toxin secretion to the type six secretion
system (T6SS), but recent studies propose, including us, that they may be secreted through other
mechanisms [8,15]. A of Rhs-toxins in Apibacter spp. found that the N-terminal housed SpvB
and TcdB domains instead of a typical T6SS PAAR domain. Interestingly, SpvB and TcdB
are predicted to be linked with a novel pathway of secretion, the type nine secretion (T9SS). The
T9SS was first reported in 2020 by Jana and colleagues in Bacteroidetes. They found that this
secretion system was exporting polymorphic toxins across the outer membrane of bacterial cells
[15,35]. Similarly, most of the N-terminal sequences that formed the nine clades, families, of
Xanthomonas showed nine different Rhs-system loci architectures, Clade I was linked to
the T2SS. This association between Rhs-toxin and the T2SS was also observed in Ralstonia
with Clade II. Here we propose that Rhs-toxin may be secreted via alternative secretion systems
The Rhs-toxin polymorphic system is still not completely understood and with every study
more of their diversity and function is known. However, many questions remain unanswered,
specifically those related to the role of Rhs-toxin in microbial community ecology and pathogen
evolution. This thesis is the first step for many more studies to come that precisely study these
knowledge gaps.
65
CITED LITERATURE
1. Coburn, B., Sekirov, I., & Finlay, B. B. (2007). Type III secretion systems and disease.
Clinical microbiology reviews, 20(4), 535–549.
2. Alteri, C. J., Himpsl, S. D., Pickens, S. R., Lindner, J. R., Zora, J. S., Miller, J. E., Arno, P.
D., Straight, S. W., & Mobley, H. L. (2013). Multicellular bacteria deploy the type VI
secretion system to preemptively strike neighboring cells. PLoS pathogens, 9(9), e1003608.
3. Douzi, B., Filloux, A., & Voulhoux, R. (2012). On the path to uncover the bacterial type II
secretion system. Philosophical transactions of the Royal Society of London. Series B,
Biological sciences, 367(1592), 1059–1072.
4. Grohmann, E., Christie, P. J., Waksman, G., & Backert, S. (2018). Type IV secretion in
Gram-negative and Gram-positive bacteria. Molecular microbiology, 107(4), 455-471.
5. Bucci, V., Nadell, C. D., & Xavier, J. B. (2011). The evolution of bacteriocin production in
bacterial biofilms. The American Naturalist, 178(6), E162-E173.
6. Cornforth, D. M., & Foster, K. R. (2013). Competition sensing: the social side of bacterial
stress responses. Nature Reviews Microbiology, 11(4), 285-293.
7. Inglis, R. F., Gardner, A., Cornelis, P., & Buckling, A. (2009). Spite and virulence in the
bacterium Pseudomonas aeruginosa. Proceedings of the National Academy of Sciences,
106(14), 5703-5707.
8. Koskiniemi, S., Lamoureux, J. G., Nikolakakis, K. C., t'Kint de Roodenbeke, C., Kaplan, M.
D., Low, D. A., & Hayes, C. S. (2013). Rhs proteins from diverse bacteria mediate
intercellular competition. Proceedings of the National Academy of Sciences of the United
States of America, 110(17), 7032–7037.
9. Poole SJ, Diner EJ, Aoki SK, Braaten BA, t'Kint de Roodenbeke C, et al.
(2011) Identification of Functional Toxin/Immunity Genes Linked to Contact-Dependent
Growth Inhibition (CDI) and Rearrangement Hotspot (Rhs) Systems. PLOS Genetics 7(8):
e1002217.
10. Aggarwal, K., & Lee, K. H. (2011). Overexpression of cloned RhsA sequences perturbs the
cellular translational machinery in Escherichia coli. Journal of bacteriology, 193(18), 4869-
4880.
11. Cianfanelli, F. R., Alcoforado Diniz, J., Guo, M., De Cesare, V., Trost, M., & Coulthurst, S.
J. (2016). VgrG and PAAR Proteins Define Distinct Versions of a Functional Type VI
Secretion System. PLoS pathogens, 12(6), e1005735.
12. Alcoforado Diniz, J., & Coulthurst, S. J. (2015). Intraspecies Competition in Serratia
marcescens Is Mediated by Type VI-Secreted Rhs Effectors and a Conserved Effector-
Associated Accessory Protein. Journal of bacteriology, 197(14), 2350–2360.
13. Vazquez-Lopez, J., & Navarro-Garcia, F. (2020). In silico Analyses of Core Proteins and
Putative Effector and Immunity Proteins for T6SS in Enterohemorrhagic E. coli. Frontiers in
cellular and infection microbiology, 10, 195.
14. Koskiniemi S, Garza-Sánchez F, Sandegren L, Webb JS, Braaten BA, et al. (2014) Selection
of Orphan Rhs Toxin Expression in Evolved Salmonella enterica Serovar Typhimurium.
PLOS Genetics 10(3): e1004255.
15. Steele, M. I., & Moran, N. A. (2021). Evolution of Interbacterial Antagonism in Bee Gut
Microbiota Reflects Host and Symbiont Diversification. mSystems, 6(3), e00063-21.
16. Tang, L., Dong, S., Rasheed, N., Wu, H. W., Zhou, N., Li, H., Wang, M., Zheng, J., He, J., &
Chao, W. C. H. (2022). Vibrio parahaemolyticus prey targeting requires autoproteolysis-
66
triggered dimerization of the type VI secretion system effector RhsP. Cell reports, 41(10),
111732.
17. Peleg, A. Y., Hogan, D. A., & Mylonakis, E. (2010). Medically important bacterial–fungal
interactions. Nature Reews Microbiology, 8(5), 340-349.
18. Curtis, M. M., & Sperandio, V. (2011). A complex relationship: The interaction among
symbiotic microbes, invading pathogens, and their mammalian host. Mucosal Immunology,
4(2), 133-138.
19. Goh, H., Veliz Vallejos, D. F., Nicotra, A. B., & Mathesius, U. (2013). The Impact of
Beneficial Plant-Associated Microbes on Plant Phenotypic Plasticity. Journal of Chemical
Ecology, 39(7), 826-839.
20. Huerta, A. I., Milling, A., & Allen, C. (2015). Tropical Strains of Ralstonia solanacearum
Outcompete Race 3 Biovar 2 Strains at Lowland Tropical Temperatures. Applied and
Enronmental Microbiology, 81(10), 3542-3551.
21. Huerta, A.I. (2015). Identification and characterization of interstrain chemical weapons in a
major plantpathogen. ProQuest Dissertations Publishing, University of Wisconsin-Madison.
22. Jackson AP, Thomas GH, Parkhill J, Thomson NR (2009) Evolutionary diversification of an
ancient gene family (rhs) through C-terminal displacement. BMC Genomics 10: 584.
23. Zhang, J. et al. (2019). Insights into endophytic bacterial community structures of seeds
among various Oryza sativa L. rice genotypes. J. Plant Growth Regul. 38, 93–102.
24. Newberry, E. A., Bhandari, R., Minsavage, G. V., Timilsina, S., Jibrin, M. O., Kemble, J.,
Sikora, E. J., Jones, J. B., & Potnis, N. (2019). Independent Evolution with the Gene Flux
Originating from Multiple Xanthomonas Species Explains Genomic Heterogeneity in
Xanthomonas perforans. Applied and environmental microbiology, 85(20), e00885-19.
25. Jackson, R. W., Vinatzer, B., Arnold, D. L., Dorus, S., & Murillo, J. (2011). The influence of
the accessory genome on bacterial pathogen evolution. Mobile genetic elements, 1(1), 55– 65.
26. Bartoli, C., Roux, F., & Lamichhane, J. R. (2016). Molecular mechanisms underlying the
emergence of bacterial pathogens: an ecological perspective. Molecular plant pathology,
17(2), 303–310.
27. Fang, Q., Feng, Y., Feng, P. et al. (2019). Nosocomial bloodstream infection and the
emerging carbapenem-resistant pathogen Ralstonia insidiosa. BMC Infect Dis 19, 334.
28. Nurjadi, D., Boutin, S., Schmidt, K., Ahmels, M., & Hasche, D. (2020). Identification and
Elimination of the Clinically Relevant Multi-Resistant Enronmental Bacteria Ralstonia
insidiosa in Primary Cell Culture. Microorganisms, 8(10).
29. Hugouvieux-Cotte-Pattat, N., Brochier-Armanet, C., Flandrois, J. P., & Reverchon, S.
(2020). Dickeya poaceiphila sp. nov., a plant-pathogenic bacterium isolated from sugar cane
(Saccharum officinarum). International journal of systematic and evolutionary microbiology,
70(8), 4508–4514.
30. Duprey, A., Taib, N., Leonard, S., Garin, T., Flandrois, J. P., Nasser, W., Brochier-Armanet,
C., & Reverchon, S. (2019). The phytopathogenic nature of Dickeya aquatica 174/2 and the
dynamic early evolution of Dickeya pathogenicity. Environmental microbiology, 21(8),
2809–2835.
31. Hugouvieux-Cotte-Pattat, N., Jacot-des-Combes, C., & Briolay, J. (2019). Dickeya lacustris
sp. nov., a water-living pectinolytic bacterium isolated from lakes in France. International
journal of systematic and evolutionary microbiology, 69(3), 721–726.
67
32. Backiyarani, S., Uma, S., Anuradha, C., & Chandrasekar, A. (2022). Application of “omics”
in banana improvement. Omics in Horticultural Crops, 165-191.
https://doi.org/10.1016/B978-0-323-89905-5.00006-9
33. Mori, Y., Inoue, K., Ikeda, K., Nakayashiki, H., Higashimoto, C., Ohnishi, K., Kiba, A., &
Hikichi, Y. (2016). The vascular plant-pathogenic bacterium Ralstonia solanacearum
produces biofilms required for its virulence on the surfaces of tomato cells adjacent to
intercellular spaces. Molecular Plant Pathology, 17(6),
34. Blakely, G., & Murray, N. (2009). DNA Restriction and Modification. Encyclopedia of
Microbiology (Third Edition), 538-549.
35. Jana, B., Salomon, D., & Bosis, E. (2020). A novel class of polymorphic toxins in
Bacteroidetes. Life science alliance, 3(4), e201900631.
68
Table 1. Summary of Rhs-toxins dataset used for abundance and diversity study for four plant pathogenic bacterial genera.
# of
# of Total # of Total # of
Rhs- Total # Total #
# of # of Rhs- N- C-
Genera toxin of of
Genomes species toxins/ terminal terminal
domains clusters singletons
genome sequences sequences
found
Before data curation
Xanthomonas 207 21 707
Ralstonia 91 8 338
Pectobacterium 62 15 491
Dickeya 34 11 144
After data curation
Xanthomonas 165 20 604 412 604 33 96 85
69
Table 2. Bacterial genomes used in this study.
Xanthomonas
Assembly Number Species Strain Source
GCF_009931595.1 X. albilineans Xa-FJ1 NCBI
GCF_007724205.1 X. arboricola SL2098 NCBI
GCF_000972745.1 X. arboricola 17 NCBI
GCF_008761555.1 X. arboricola 15-088 NCBI
GCF_020002335.2 X. arboricola R1 NCBI
GCF_022930885.1 X. arboricola YchA NCBI
GCF_023375075.1 X. arboricola CITA 33 NCBI
GCF_905220715.1 X. arboricola 301 NCBI
GCF_905220745.1 X. arboricola 1314c NCBI
GCF_905220805.1 X. arboricola CFBP 6600 NCBI
GCF_905367745.1 X. arboricola CPBF 765 NCBI
GCF_903989475.2 X. arboricola 3 NCBI
GCF_905367715.1 X. arboricola CPBF 1494 NCBI
GCF_023375035.1 X. arboricola IVIA 1317 NCBI
GCF_023374975.1 X. arboricola IVIA 3978 NCBI
GCF_905220695.1 X. arboricola 1311a NCBI
GCF_905367735.1 X. arboricola CPBF 766 NCBI
GCF_018141705.1 X. arboricola A7 NCBI
GCF_023374995.1 X. arboricola CFBP 1846 NCBI
GCF_905220785.1 X. arboricola CFBP 1159 NCBI
GCF_000225915.1 X. axonopodis F1 NCBI
GCF_003698225.1 X. axonopodis LMG26789 NCBI
GCF_013177355.1 X. axonopodis NCPPB 796 NCBI
GCF_000007145.1 X. campestris ATCC 33913 NCBI
GCF_000070605.1 X. campestris B100 NCBI
GCF_000277895.2 X. campestris NCPPB 4379 NCBI
GCF_000403575.2 X. campestris CN15 NCBI
GCF_001854165.1 X. campestris 85-10 NCBI
GCF_002776735.1 X. campestris CN03 NCBI
GCF_012848175.1 X. campestris NEB122 NCBI
GCF_013388375.1 X. campestris MAFF106181 NCBI
GCF_014841015.1 X. campestris M28 NCBI
GCF_025666215.1 X. campestris CFBP 8444 NCBI
GCF_028749645.1 X. campestris CFBP1371 NCBI
70
Table 2 (continued).
71
Table 2 (continued).
72
Table 2 (continued).
73
Table 2 (continued).
74
Table 2 (continued).
75
Table 2 (continued).
76
Table 2 (continued)
77
Table 2 (continued).
Pectobacterium
Assembly Number Species Strain Source
GCF_029027905.1 P. actinidiae GX-Pa1 NCBI
GCF_003382565.3 P. aquaticum A212-S19-A16 NCBI
GCF_015689195.1 P. aroidearum L6 NCBI
GCF_020181375.1 P. aroidearum LJ2 NCBI
GCF_024498715.1 P. aroidearum AK042 NCBI
GCF_024498755.1 P. aroidearum QJ313 NCBI
GCF_024498835.1 P. aroidearum QJ011 NCBI
GCF_025402815.1 P. aroidearum QJ021 NCBI
GCF_000011605.1 P. atrosepticum SCRI1043 NCBI
GCF_000740965.1 P. atrosepticum 21A NCBI
GCF_018634035.1 P. atrosepticum Green1 NCBI
GCF_001932635.1 P. brasiliense BC1 NCBI
GCF_002068115.1 P. brasiliense SX309 NCBI
GCF_002764035.1 P. brasiliense BZA12 NCBI
GCF_009873295.1 P. brasiliense 1692 NCBI
GCF_016864975.1 P. brasiliense ZLMLSHJ5 NCBI
GCF_016944255.1 P. brasiliense IPO:4132 NAK:239 NCBI
GCF_016944315.1 P. brasiliense IPO:4062 NAK:237 NCBI
GCF_020683105.1 P. brasiliense TS20HJ1 NCBI
GCF_022220705.1 P. brasiliense 130 NCBI
GCF_026723725.1 P. brasiliense 21PCA_AGRO2 NCBI
GCF_025946765.1 P. cacticida CFCC10813 NCBI
GCF_009931215.1 P. carotovorum JR1.1 NCBI
GCF_000023605.1 P. carotovorum PC1 NCBI
GCF_013488025.1 P. carotovorum WPP14 NCBI
GCF_015277635.1 P. carotovorum PCCS1 NCBI
GCF_016415585.1 P. carotovorum 2A NCBI
GCF_016864995.1 P. carotovorum XP-13 NCBI
GCF_020520265.1 P. carotovorum A077-S18-O15 NCBI
GCF_021049225.1 P. carotovorum 25.1 NCBI
GCF_022392325.1 P. carotovorum RC5297 NCBI
GCF_022392805.1 P. carotovorum ZM1 NCBI
GCF_028736315.1 P. carotovorum ZJ-4-2 NCBI
GCF_000769535.1 P. odoriferum BC S7 NCBI
GCF_009931295.1 P. odoriferum JK2.1 NCBI
GCF_000024645.1 P. parmentieri WPP163 NCBI
GCF_000260925.1 P. parmentieri SCC3193 NCBI
78
Table 2 (continued).
79
Table 2 (continued).
80
Table 3. Strains represented in this study with their non-curated Rhs-toxins repertoire.
Xanthomonas
Species Strain # Rhs per genome
X. albilineans Xa-FJ1 5
X. arboricola SL2098 1
X. arboricola 17 4
X. arboricola 15-088 3
X. arboricola R1 3
X. arboricola YchA 4
X. arboricola CITA 33 3
X. arboricola 301 3
X. arboricola 1314c 4
X. arboricola CFBP 6600 2
X. arboricola CPBF 765 3
X. arboricola 3 1
X. arboricola CPBF 1494 1
X. arboricola CPBF 766 2
X. arboricola IVIA 1317 1
X. arboricola IVIA 3978 1
X. arboricola 1311a 1
X. arboricola A7 0
X. arboricola CFBP 1846 0
X. arboricola CFBP 1159 0
X. axonopodis F1 2
X. axonopodis LMG26789 2
X. axonopodis NCPPB 796 3
X. campestris ATCC 33913 2
X. campestris B100 2
X. campestris NCPPB 4379 3
X. campestris CN15 4
X. campestris 85-10 6
X. campestris CN03 3
X. campestris NEB122 5
X. campestris MAFF106181 2
X. campestris M28 3
X. campestris CFBP 8444 5
X. campestris CFBP1371 2
81
Table 3 (continued).
X. campestris 719 2
X. campestris 8284 2
X. campestris 29-5 3
X. campestris 30-1 3
X. campestris GBBC 3077 2
X. campestris CFBP6690 1
X.campestris 576 1
X. campestris 5053 1
X. campestris 10103 1
X. campestris 12112 1
X. campestris 12049 1
X. campestris 16-Oct 1
X. campestris CFBP5824 1
X. campestris BJSJQ20200612 1
X. campestris CN05 1
X. campestris CN07 1
X. campestris 756C 0
X. campestris ICMP 21080 0
X. campestris MAFF302021 0
X. campestris 18048 0
X. campestris GSXT20191014 0
X. campestris CN13 0
X. campestris CFBP1606 0
X. campestris 609 0
X. campestris 13108G 0
X. campestris 792 0
X. campestris 659 0
X. campestris 793 0
X. campestris 4053 0
X. campestris 5057 0
X. campestris 16-8 0
X. campestris 10006 0
X. campestris 42-1 0
X. campestris 33-1 0
X. citri NT17 4
X. citri AW15 5
X. citri FDC 1609 4
X. citri 1566 2
82
Table 3 (continued).
X. citri MSCT 9
X. citri TX160149 4
X. citri CFBP7111 5
X. citri CFBP7112 5
X. citri XcmH1005 7
X. citri XcmN1003 8
X. citri CFBP6166 4
X. citri CFBP6975 5
X. citri CFBP6990 8
X. citri CFBP6991 7
X. citri ISO12C3 5
X. citri LMG7439 6
X. citri 8ra 9
X. citri K2 9
X. citri UnB-XtecTG02-2 11
X. citri DAR73886 4
X. citri DAR72029 4
X. citri CFBP7119 11
X. citri 6165R(delta)tal22B 2
X. citri M12 4
X. citri T21 10
X. citri GZ09 2
X. citri CFBP 2036 7
X. citri CFBP7113 1
X. cucurbitae OH_261 3
X. cucurbitae MI_359 2
X. cucurbitae IL_234 2
X. euroxanthea 1 3
X. euroxanthea CPBF 424 4
X. euvesicatoria LMG930 5
X. euvesicatoria CFBP3836 4
X. fragariae Fap29 3
X. fragariae SHQP01 8
X. fragariae YLX21 10
X. fragariae LMG 703 4
X. fragariae NBC2815 3
X. fragariae PD5205 3
X. hortorum JS749-3 4
X. hortorum ICMP 7383 3
83
Table 3 (continued).
X. hortorum B07-007 2
X. hortorum VT106 2
X. hortorum LM16734 3
X. hortorum jj2001 4
X. hortorum OSU493 4
X. hortorum Oregano 108 7
X. hortorum 305 4
X. hortorum CFBP 498 3
X. hyacinthi CFBP 1156 11
X. oryzae MAI134 1
X. oryzae CFBP7337 1
X. oryzae PXO99A 10
X. oryzae X11-5A 7
X. oryzae CFBP7342 5
X. oryzae YM15 3
X. oryzae CFBP2286 3
X. oryzae CFBP7331 4
X. oryzae CFBP7341 5
X. oryzae NCPPB4346 7
X. oryzae AXO1947 2
X. oryzae PXO211 6
X. oryzae NJ611 11
X. oryzae BB156-2 11
X. oryzae BB151-3 12
X. oryzae BAI23 3
X. oryzae PXO404 6
X. oryzae AUST2013 5
X. oryzae KXO85 7
X. oryzae 0-9 5
X. oryzae ITCCBB0002 4
X. oryzae HGA4 6
X. oryzae YN01 3
X. oryzae NJ01 2
X. oryzae JS49-6 6
X. oryzae YNJC 5
X. perforans LH3 1
X. phaseoli Px100 1
X. phaseoli CFBP412 0
X. phaseoli CFBP6164 0
84
Table 3 (continued).
X. phaseoli CFBP6982 0
X. phaseoli PR1 0
X. phaseoli Xcp25 0
X. phaseoli CHN01 0
X. phaseoli BB013a 0
X. phaseoli CFBP 7423 0
X. prunicola MAI5069 4
X. prunicola MAI5037 7
X. prunicola CIX383 10
X. prunicola CIX249 10
X. prunicola CIX97 3
X. sacchari DD13 4
X. sacchari YT9-19-2 11
X. sacchari LT6-2 5
X. sacchari LT6-16-1 8
X. sacchari JR3-14 5
X. sacchari HR3-46 9
X. sacchari HR1-32 12
X. sacchari DJ16 3
X. theicola CFBP 4691 7
X. translucens P3 1
X. translucens Xtu 4699 1
X. translucens ICMP11055 3
X. translucens LW16 2
X. translucens XtKm33 2
X. translucens XtLr8 2
X. translucens XtKm15 2
X. translucens MAI5034 2
X. translucens CFBP 2055 3
X. translucens CFBP 2541 5
X. translucens LMG 728 8
X. translucens ICMP 16317 4
X. translucens LMG 843 3
X. translucens LMG 727 2
X. translucens Xtu-UPB513 3
X. translucens NCPPB 3711 4
X. translucens XtKm7 1
X. translucens UPB458 1
X. translucens ART-Xtg2 1
85
Table 3 (continued).
86
Table 3 (continued).
R. solanacearum UY031 7
R. solanacearum KACC 10722 5
R. solanacearum UW163 3
R. solanacearum IBSBF1503 2
R. solanacearum YC40-M 2
R. solanacearum KACC 10709 3
R. solanacearum SEPPX05 2
R. solanacearum RS 488 7
R. solanacearum RS 489 7
R. solanacearum T51 5
R. solanacearum SL3175 11
R. solanacearum T117 2
R. solanacearum T98 11
R. solanacearum T78 2
R. solanacearum SL3730 2
R. solanacearum SL2729 2
R. solanacearum SL2064 5
R. solanacearum T95 5
R. solanacearum T60 2
R. solanacearum T42 2
R. solanacearum SL3300 2
R. solanacearum IBSBF 2571 3
R. solanacearum HA4-1 2
R. solanacearum UW386 4
R. solanacearum B2 3
R. solanacearum 204 2
R. solanacearum 202 2
R. solanacearum CIAT_078 3
R. solanacearum FJAT91.F50 2
R. solanacearum FJAT445.F50 2
R. solanacearum FJAT442.F50 2
R. solanacearum FJAT442.F1 2
R. solanacearum FJAT15353.F8 4
R. solanacearum FJAT15353.F50 4
R. solanacearum FJAT15353.F1 4
R. solanacearum FJAT15340.F50 2
R. solanacearum FJAT15304.F6 2
R. solanacearum FJAT15304.F50 2
R. solanacearum FJAT15244.F50 2
87
Table 3 (continued).
R. solanacearum FJAT15244.F1 2
R. solanacearum FJAT1463.F1 2
R. solanacearum FJAT1452.F50 2
R. solanacearum FJAT1303.F50 4
R. solanacearum FJAT454.F50-1 2
R. solanacearum RUN2474 6
R. solanacearum RUN2279 5
R. solanacearum UW763 9
R. solanacearum Rs5 7
R. solanacearum MAFF 301560 2
R. solanacearum 362200 2
R. solanacearum RS24 2
R. solanacearum Bs715 2
R. solanacearum Wj644 2
R. solanacearum K60 15
R. solanacearum UW551 7
R. solanacearum FJAT15244-F8 1
R. solanacearum FJAT91-F8 1
R. solanacearum MAFF 211491 1
R. solanacearum MAFF 241648 1
R. solanacearum MAFF 211479 1
R. solanacearum MAFF 211479 1
R. solanacearum MAFF 311693 1
R. solanacearum PSI07 4
R. solanacearum MolK2 10
R. solanacearum CFBP2957 16
R. syzygii LLRS-1 14
R. syzygii BDBR229 5
R. syzygii UGMSS_Db01 5
R. syzygii R24 1
R. wenshanensis 56D2 5
Ralstonia sp. RS642 5
Ralstonia sp. RS650 8
Ralstonia sp. RS647 2
Pectobacterium
Species Strain # Rhs per genome
P. actinidiae GX-Pa1 3
P. aquaticum A212-S19-A16 3
P. aroidearum L6 6
88
Table 3 (continued).
P. aroidearum LJ2 2
P. aroidearum AK042 3
P. aroidearum QJ313 2
P. aroidearum QJ011 4
P. aroidearum QJ021 5
P. atrosepticum SCRI1043 5
P. atrosepticum 21A 1
P. atrosepticum Green1 4
P. brasiliense BC1 3
P. brasiliense SX309 3
P. brasiliense BZA12 2
P. brasiliense 1692 3
P. brasiliense ZLMLSHJ5 2
P. brasiliense IPO:4132 NAK:239 2
P. brasiliense IPO:4062 NAK:237 3
P. brasiliense TS20HJ1 4
P. brasiliense 130 3
P. brasiliense 21PCA_AGRO2 4
P. cacticida CFCC10813 3
P. carotovorum JR1.1 1
P. carotovorum PC1 2
P. carotovorum WPP14 3
P. carotovorum PCCS1 4
P. carotovorum 2A 4
P. carotovorum XP-13 2
P. carotovorum A077-S18-O15 3
P. carotovorum 25.1 3
P. carotovorum RC5297 7
P. carotovorum ZM1 4
P. carotovorum ZJ-4-2 5
P. odoriferum BC S7 3
P. odoriferum JK2.1 5
P. parmentieri WPP163 8
P. parmentieri SCC3193 12
P. parmentieri IFB5408 12
P. parmentieri IFB5485 12
P. parmentieri IFB5441 12
P. parmentieri HC 9
P. parmentieri WC19161 13
89
Table 3 (continued).
P. parmentieri QK-5 14
P. parvum YT22221 2
P. parvum FN20211 1
P. polaris NIBIO1392 5
P. polaris NIBIO1006 3
P. polaris QK413-1 2
P. punjabense SS95 0
P. quasiaquaticum A477-S1-J17 3
P. quasiaquaticum A398-S21-F17 4
P. versatile F131 4
P. versatile 2-Mar 4
P. versatile 14A 6
P. versatile ECC15 2
P. versatile SR12 2
P. versatile SR1 4
P. versatile A73-S18-O15 4
Pectobacterium sp. PL64 2
Pectobacterium sp. 21LCBS03 5
Pectobacterium sp. F1-1 1
Pectobacterium sp. A5351 0
Dickeya
Species Strain # Rhs per genome
D. aquatica 174/2 0
D. chrysanthemi Ech1591 7
D. dadantii DSM 18020 3
D. dadantii S3-1 6
D. dadantii M2-3 5
D. dadantii FZ06 5
D. dadantii XJ12 6
D. dadantii 3937 6
D. dianthicola 67-19 3
D. dianthicola 16ME22T 2
D. dianthicola LAR.16.03.LID 3
D. fangzhongdai ND14b 6
D. fangzhongdai DSM 101947 4
D. fangzhongdai PA1 6
D. fangzhongdai AP6 3
D. fangzhongdai 643b 6
D. fangzhongdai Onc5 5
90
Table 3 (continued).
D. fangzhongdai ZXC1 9
D. lacustris LMG30899 0
D. parazeae Ech586 2
D. poaceiphila NCPPB 569 0
D. solani RNS 05.1.2A 4
D. solani RNS 08.23.3.1.A 4
D. zeae MS1 5
D. zeae EC1 4
D. zeae MS2 5
D. zeae EC2 6
D. zeae CE1 4
D. zeae JZL7 4
D. zeae PL65 4
D. zeae A586-S18-A17 5
D. zeae MS_2018 6
D. zeae A5272 2
Dickeya. sp. Secpp 1600 4
Outgroup
Species Strain # Rhs per
genome
Escherichia coli K-12 7
Pseudomonas aeruginosa PAO1 1
91
Table 4: Strains represented in this study with their curated Rhs-toxins repertoire.
Xanthomonas
Species Strain # Rhs per genome
X. albilineans Xa-FJ1 5
X. arboricola SL2098 1
X. arboricola 17 4
X. arboricola YchA 3
X. arboricola 1314c 4
X. arboricola 1311a 1
X. arboricola 15-088 3
X. arboricola R1 3
X. arboricola CITA 33 3
X. arboricola 301 2
X. arboricola CPBF 765 2
X. arboricola 3 1
X. arboricola CPBF 1494 1
X. arboricola IVIA 1317 1
X. arboricola IVIA 3978 1
X. axonopodis F1 2
X. axonopodis LMG26789 1
X. axonopodis NCPPB 796 3
X. campestris M28 3
X. campestris GBBC 3077 2
X. campestris 5053 1
X. campestris 16-Oct 1
X. campestris CFBP5824 1
X. campestris ATCC 33913 1
X. campestris B100 2
X. campestris NCPPB 4379 3
X. campestris CN15 3
X. campestris 85-10 6
X. campestris CN03 3
X. campestris NEB122 4
X. campestris MAFF106181 2
X. campestris CFBP 8444 5
X. campestris CFBP1371 1
X. campestris 719 2
92
Table 4 (continued).
X. campestris 8284 1
X. campestris 29-5 3
X. campestris 30-1 3
X. campestris CFBP6690 1
X. campestris 576 1
X. campestris 10103 1
X. campestris 12112 1
X. campestris 12049 1
X. campestris BJSJQ20200612 1
X. citri NT17 4
X. citri AW15 5
X. citri FDC 1609 2
X. citri MSCT 7
X. citri TX160149 3
X. citri CFBP7111 5
X. citri CFBP7112 5
X. citri XcmH1005 5
X. citri XcmN1003 6
X. citri CFBP6166 2
X. citri CFBP6975 3
X. citri CFBP6990 7
X. citri CFBP6991 6
X. citri ISO12C3 4
X. citri LMG7439 5
X. citri 8ra 7
X. citri K2 7
X. citri UnB-XtecTG02-2 11
X. citri DAR73886 4
X. citri DAR72029 4
X. citri CFBP7119 9
X. citri M12 2
X. citri T21 9
X. citri GZ09 2
X. citri CFBP 2036 5
X. cucurbitae OH_261 3
X. cucurbitae MI_359 2
X. cucurbitae IL_234 2
X. euroxanthea 1 3
93
Table 4 (continued).
94
Table 4 (continued).
X. oryzae PXO404 5
X. oryzae AUST2013 3
X. oryzae KXO85 3
X. oryzae 0-9 4
X. oryzae ITCCBB0002 4
X. oryzae HGA4 5
X. oryzae YN01 3
X. oryzae YNJC 2
X. perforans LH3 1
X. prunicola MAI5069 4
X. prunicola MAI5037 6
X. prunicola CIX383 8
X. prunicola CIX249 8
X. prunicola CIX97 3
X. sacchari DD13 4
X. sacchari YT9-19-2 7
X. sacchari LT6-2 4
X. sacchari LT6-16-1 5
X. sacchari JR3-14 5
X. sacchari HR3-46 5
X. sacchari HR1-32 9
X. sacchari DJ16 3
X. sp. GW 4
X. sp. CFBP 8445 3
X. theicola CFBP 4691 7
X. translucens P3 1
X. translucens Xtu 4699 1
X. translucens ICMP11055 3
X. translucens LW16 2
X. translucens XtKm33 2
X. translucens XtLr8 2
X. translucens XtKm15 2
X. translucens MAI5034 2
X. translucens CFBP 2055 3
X. translucens CFBP 2541 5
X. translucens LMG 728 4
X. translucens ICMP 16317 4
X. translucens LMG 843 3
95
Table 4 (continued).
X. translucens Xtu-UPB513 2
X. translucens NCPPB 3711 4
X. translucens XtKm7 1
X. translucens UPB458 1
X. translucens ART-Xtg2 1
X. vasicola NCPPB 1060 6
X. vasicola NCPPB 902 2
X. vasicola NCPPB 2649 4
X. vasicola Xv1601 3
X. vesicatoria LM159 1
Xanthomonas sp. SS 4
Xanthomonas sp. SI 6
Xanthomonas sp. WG16 4
Xanthomonas sp. AM6 7
Xanthomonas sp. CFBP 8443 4
Xanthomonas sp. 2 5
Ralstonia
Species Strain # Rhs per genome
R. mannitolilytica SN83A39 1
R. pseudosolanacearum CQPS-1 2
R. pseudosolanacearum RS 476 6
R. pseudosolanacearum SL1931 1
R. pseudosolanacearum PeaFJ1 2
R. pseudosolanacearum LMG 9673 7
R. pseudosolanacearum Cw717 1
R. pseudosolanacearum Sw698 2
R. pseudosolanacearum RUN2340 6
R. pseudosolanacearum GMI1000 6
R. solanacearum Po82 3
R. solanacearum UY031 5
R. solanacearum KACC 10722 5
R. solanacearum UW163 3
R. solanacearum IBSBF1503 2
R. solanacearum YC40-M 2
R. solanacearum KACC 10709 2
R. solanacearum SEPPX05 2
R. solanacearum RS 488 5
R. solanacearum RS 489 7
96
Table 4 (continued).
R. solanacearum RSCM 1
R. solanacearum T51 5
R. solanacearum SL3175 10
R. solanacearum T117 2
R. solanacearum T98 10
R. solanacearum T78 2
R. solanacearum T12 1
R. solanacearum SL3730 2
R. solanacearum SL2729 2
R. solanacearum SL2064 5
R. solanacearum T101 1
R. solanacearum T95 5
R. solanacearum T82 1
R. solanacearum T60 2
R. solanacearum T42 2
R. solanacearum SL3300 2
R. solanacearum IBSBF 2571 3
R. solanacearum HA4-1 2
R. solanacearum UW386 4
R. solanacearum B2 3
R. solanacearum 204 2
R. solanacearum 202 2
R. solanacearum CIAT_078 3
R. solanacearum FJAT91.F50 2
R. solanacearum FJAT445.F50 2
R. solanacearum FJAT442.F50 2
R. solanacearum FJAT442.F1 2
R. solanacearum FJAT15353.F8 4
R. solanacearum FJAT15353.F50 4
R. solanacearum FJAT15353.F1 4
R. solanacearum FJAT15340.F50 2
R. solanacearum FJAT15304.F6 2
R. solanacearum FJAT15304.F50 2
R. solanacearum FJAT15244.F50 1
R. solanacearum FJAT15244.F1 1
R. solanacearum FJAT1463.F1 2
R. solanacearum FJAT1452.F50 2
R. solanacearum FJAT1303.F50 4
R. solanacearum FJAT91-F8 1
97
Table 4 (continued).
R. solanacearum FJAT15244-F8 1
R. solanacearum FJAT454.F50-1 2
R. solanacearum RUN2474 5
R. solanacearum RUN2279 4
R. solanacearum UW763 9
R. solanacearum Rs5 6
R. solanacearum MAFF 211479 1
R. solanacearum MAFF 211479 1
R. solanacearum MAFF 211491 1
R. solanacearum MAFF 301560 2
R. solanacearum MAFF 241648 1
R. solanacearum MAFF 311693 1
R. solanacearum 362200 2
R. solanacearum RS24 2
R. solanacearum Bs715 2
R. solanacearum Wj644 2
R. solanacearum PSI07 4
R. solanacearum MolK2 9
R. solanacearum CFBP2957 12
R. solanacearum K60 9
R. solanacearum UW551 3
Ralstonia sp. RS647 2
Ralstonia sp. RS642 5
Ralstonia sp. RS650 8
R. syzygii LLRS-1 9
R. syzygii BDBR229 4
R. syzygii UGMSS_Db01 4
R. syzygii R24 1
R. wenshanensis 56D2 4
Pectobacterium
Species Strain # Rhs per genome
P. actinidiae GX-Pa1 2
P. aquaticum A212-S19-A16 3
P. aroidearum AK042 3
P. aroidearum QJ313 2
P. aroidearum QJ011 4
P. aroidearum QJ021 4
P. aroidearum LJ2 2
98
Table 4 (continued).
P. aroidearum L6 6
P. atrosepticum SCRI1043 5
P. atrosepticum 21A 1
P. atrosepticum Green1 4
P. brasiliense BC1 3
P. brasiliense ZLMLSHJ5 2
P. brasiliense IPO:4132 2
P. brasiliense 21PCA_AGRO2 3
P. brasiliense BZA12 2
P. brasiliense SX309 3
P. brasiliense 1692 3
P. brasiliense IPO:4062 3
P. brasiliense 130 3
P. brasiliense TS20HJ1 4
P. cacticida CFCC10813 3
P. carotovorum JR1.1 1
P. carotovorum XP-13 2
P. carotovorum WPP14 3
P. carotovorum A077-S18-O15 3
P. carotovorum 25.1 3
P. carotovorum PCCS1 4
P. carotovorum 2A 4
P. carotovorum ZM1 4
P. carotovorum RC5297 7
P. carotovorum PC1 2
P. carotovorum ZJ-4-2 5
P. odoriferum BC S7 3
P. odoriferum JK2.1 5
P. parmentieri WPP163 8
P. parmentieri SCC3193 12
P. parmentieri IFB5408 12
P. parmentieri IFB5485 11
P. parmentieri IFB5441 11
P. parmentieri WC19161 13
P. parmentieri QK-5 13
P. parmentieri HC 9
P. parvum YT22221 2
P. parvum FN20211 1
P. polaris QK413-1 2
99
Table 4 (continued).
P. polaris NIBIO1006 3
P. polaris NIBIO1392 5
P. quasiaquaticum A477-S1-J17 3
P. quasiaquaticum A398-S21-F17 4
Pectobacterium sp. PL64 2
Pectobacterium sp. F1-1 1
Pectobacterium sp. 21LCBS03 5
P. versatile A73-S18-O15 4
P. versatile ECC15 2
P. versatile SR12 2
P. versatile F131 4
P. versatile 2-Mar 3
P. versatile SR1 4
P. versatile 14A 6
Dickeya
Species Strain # Rhs per genome
D. chrysanthemi Ech1591 6
D. dadantii DSM 18020 3
D. dadantii S3-1 5
D. dadantii M2-3 3
D. dadantii FZ06 5
D. dadantii XJ12 5
D. dadantii 3937 6
D. dianthicola 67-19 2
D. dianthicola 16ME22T 2
D. dianthicola LAR.16.03.LID 2
D. fangzhongdai ND14b 5
D. fangzhongdai DSM 101947 3
D. fangzhongdai PA1 4
D. fangzhongdai AP6 3
D. fangzhongdai 643b 4
D. fangzhongdai Onc5 4
D. fangzhongdai ZXC1 6
D. parazeae Ech586 2
D. solani RNS 05.1.2A 3
D. solani RNS 08.23.3.1.A 4
Dickeya sp. Secpp 1600 4
D. zeae MS1 3
D. zeae EC1 3
100
Table 4 (continued).
D. zeae MS2 4
D. zeae EC2 4
D. zeae CE1 2
D. zeae JZL7 3
D. zeae PL65 2
D. zeae A586-S18-A17 4
D. zeae MS_2018 5
D. zeae A5272 2
Outgroup
Species Strain # Rhs per genome
Escherichia coli K-12 5
Pseudomonas aeruginosa PAO1 1
101
Table 5. Plant pathogenic genera and their species represented in this study with their
complementary N-terminal clade.
Xanthomonas
Species Strain Rhs_locus Clade
X. oryzae MAI134 BVV20_RS16895 I
X. oryzae CFBP7331 ACU16_RS16515 I
X. oryzae BB156-2 EYC55_RS06280 I
X. oryzae NCPPB4346 EYR26_RS16965 I
X. oryzae NJ01 MML47_RS05895 I
X. oryzae YNJC QN060_RS16955 I
X. oryzae PXO404 EBA23_RS08555 I
X. oryzae AUST2013 EBA17_RS16090 I
X. oryzae JS49-6 LL928_RS15920 I
X. oryzae CFBP7331 ACU16_RS16535 I
X. oryzae NCPPB4346 EYR26_RS16995 I
X. oryzae PXO211 ATY44_RS15850 I
X. translucens XtKm33 ISN33_RS07930 I
X. translucens CFBP 2055 KCU58_RS05760 I
X. translucens LW16 F0H32_RS14580 I
X. translucens MAI5034 LTC53_RS14350 I
X. oryzae CFBP7337 EBA11_RS16380 I
X. oryzae AXO1947 AXO1947_RS05255 I
X. oryzae CFBP7342 BE73_RS04950 I
X. oryzae BB156-2 EYC55_RS06235 I
X. oryzae BB151-3 EYC56_RS17670 I
X. oryzae PXO99A PXO_RS07175 I
X. oryzae NCPPB4346 EYR26_RS16955 I
X. translucens XtLr8 ISN38_RS15700 I
X. translucens XtKm15 ISN36_RS12090 I
X. oryzae X11-5A EYR27_RS02355 I
X. oryzae AXO1947 AXO1947_RS21135 I
X. oryzae BB156-2 EYC55_RS06265 I
X. oryzae BB151-3 EYC56_RS17650 I
X. oryzae BAI23 EYC57_RS05720 I
X. oryzae CFBP7341 ACU17_RS24220 I
X. oryzae NJ611 EYC54_RS06110 I
X. translucens UPB458 KFS86_RS08990 I
102
Table 5 (continued).
103
Table 5 (continued).
104
Table 5 (continued).
105
Table 5 (continued).
106
Table 5 (continued).
107
Table 5 (continued).
108
Table 5 (continued).
109
Table 5 (continued).
110
Table 5 (continued).
111
Table 5 (continued).
112
Table 5 (continued).
113
Table 5 (continued).
114
Table 5 (continued).
115
Table 5 (continued).
116
Table 5 (continued).
117
Table 5 (continued).
118
Table 5 (continued).
P. carotovorum 2A JFY74_RS20110 V
P. aroidearum QJ011 L0Y21_RS13090 V
P. brasiliense BC1 NC16_RS12970 V
P. parvum YT22221 NV347_RS12470 V
P. brasiliense SX309 B5S52_RS09345 V
P. quasiaquaticum A477-S1-J17 IG609_RS11695 V
P. brasiliense IPO:4132 NAK:239 H5A38_RS00515 V
P. carotovorum 25.1 LQF52_RS00325 V
P. atrosepticum 21A GZ59_RS20890 V
P. aroidearum QJ011 L0Y21_RS14315 V
P. aroidearum QJ021 N5056_RS13645 V
Pectobacterium sp. 21LCBS03 MYB54_RS08925 V
P. brasiliense BC1 NC16_RS14180 V
P. brasiliense 21PCA_AGRO2 OWC53_RS08265 V
P. versatile 14A EIP93_RS14330 V
P. atrosepticum SCRI1043 ECA_RS14080 V
P. aroidearum L6 I2D83_RS14360 V
P. parmentieri SCC3193 W5S_RS07230 V
P. parmentieri QK-5 IG623_RS07860 V
P. carotovorum ZJ-4-2 PSR30_RS13630 V
P. quasiaquaticum A477-S1-J17 IG609_RS07235 V
P. quasiaquaticum A398-S21-F17 IG605_RS07275 V
P. versatile F131 F131LOC_RS00100 V
P. versatile SR1 LGL96_RS10940 V
P. carotovorum PC1 PC1_RS00250 V
P. versatile 2-Mar OA04_RS22165 V
P. parmentieri SCC3193 W5S_RS12980 V
P. parmentieri IFB5485 C5E21_RS12620 V
P. parmentieri WPP163 PECWA_RS13170 V
P. parmentieri QK-5 IG623_RS12785 V
P. odoriferum JK2.1 EO763_RS00285 V
P. aroidearum L6 I2D83_RS00315 V
P. carotovorum WPP14 HER17_RS00295 V
P. brasiliense IPO:4062 H5A40_RS06580 V
P. parmentieri IFB5441 C5E20_RS13950 V
P. parmentieri WC19161 JBL47_RS00905 V
P. carotovorum ZJ-4-2 PSR30_RS12465 V
P. versatile SR12 LGL95_RS09080 V
119
Table 5 (continued).
120
Table 5 (continued).
121
Table 5 (continued).
122
Table 6. Rhs-toxin protein length with their corresponding toxic domain and putative predicted function.
Xanthomonas
Putative
Rhs toxin Protein
Toxin domain predicted
Species strain Rhs locus Identifier Length function
X. citri LMG7439 XapA_RS12625 rhs1 Tox-REase-7 303 Dnase
X. citri NT17 AMD13_RS09980 rhs1 Tox-REase-7 335 Dnase
Xanthomonas sp. WG16 IG630_RS09960 rhs1 Tox-REase-7 335 Dnase
X. citri DAR72029 H8Z73_RS09560 rhs1 Tox-REase-7 335 Dnase
X. citri AW15 AMD07_RS12525 rhs1 Tox-REase-7 350 Dnase
X. citri TX160149 B7L66_RS12545 rhs1 Tox-REase-7 350 Dnase
X. citri DAR73886 H8Z71_RS12115 rhs1 Tox-REase-7 350 Dnase
X. citri XcmN1003 APY30_RS09720 rhs1 Tox-REase-7 335 Dnase
X. citri MSCT BGK55_RS11855 rhs1 Tox-REase-7 329 Dnase
X. citri CFBP 2036 LPY96_RS13485 rhs1 Tox-REase-7 329 Dnase
X. citri XcmH1005 APY29_RS13305 rhs1 Tox-REase-7 335 Dnase
X. vasicola Xv1601 CXP37_RS23425 rhs1 Tox-REase-7 1042 Dnase
X. campestris NCPPB 4379 KWO_RS19495 rhs1 Tox-REase-7 367 Dnase
X. vasicola NCPPB 2649 NX80_RS03165 rhs1 Tox-REase-7 333 Dnase
X. prunicola MAI5037 M0D47_RS15265 rhs1 Tox-REase-7 1033 Dnase
X. prunicola CIX383 M0D48_RS03750 rhs1 Tox-REase-7 1033 Dnase
X. prunicola CIX249 M0D43_RS15440 rhs1 Tox-REase-7 1033 Dnase
X. vasicola NCPPB 2649 NX80_RS23480 rhs1 Tox-REase-7 1033 Dnase
X. campestris NCPPB 4379 KWO_RS23195 rhs1 Tox-REase-7 1033 Dnase
X. vasicola NCPPB 1060 NX81_RS24565 rhs1 Tox-REase-7 1033 Dnase
X. vasicola Xv1601 CXP37_RS01435 rhs1 Tox-REase-7 333 Dnase
XcvCFBP7112P_R
X. citri CFBP7112 rhs1 Tox-REase-7 325 Dnase
S10110
X. citri UnB-XtecTG02- 2 G3566_RS09615 rhs1 Tox-REase-7 334 Dnase
123
Table 6 (continued).
CFBP 1156 FZ025_RS15345 rhs2 Undetermined 1551 Unknown
X. hyacinthi function
Unknown
X. campestris CFBP 8444 NRY95_RS06510 rhs2 Undetermined 1565
function
XtKm33 ISN33_RS16900 rhs2 Undetermined 1352 Unknown
X. translucens function
Unknown
X. translucens Xtu-UPB513 MZO50_RS18285 rhs2 Undetermined 1323
function
Unknown
X. translucens ICMP 16317 K8O61_RS05725 rhs2 Undetermined 979
function
Unknown
X. campestris 576 JH277_RS08765 rhs2 Undetermined 1594 function
Unknown
X. campestris 16-Oct JH307_RS12435 rhs2 Undetermined 1594
function
Unknown
X. euvesicatoria LMG930 BJD11_RS05975 rhs2 Undetermined 1357
function
Unknown
X. euroxanthea CPBF 424 XTG_RS15535 rhs2 Undetermined 1520
function
Unknown
X. campestris M28 IFJ81_RS09090 rhs2 Undetermined 1329
function
Unknown
X. campestris GBBC 3077 QMY63_RS11040 rhs2 Undetermined 1329
function
Unknown
X. campestris 12049 JH293_RS08480 rhs2 Undetermined 1329
function
Unknown
X. arboricola 1314c LOK39_RS12170 rhs2 Undetermined 1337
function
Unknown
X. campestris 85-10 BHE83_RS09435 rhs2 Undetermined 893
function
Unknown
X. citri DAR73886 H8Z71_RS08810 rhs3 DUF4329 1612
function
Unknown
X. arboricola 1314c LOK39_RS10655 rhs3 DUF4329 1588
function
Unknown
X. hortorum 305 PML25_RS09245 rhs3 DUF4329 1210 function
Unknown
X. hortorum Oregano 108 OEG85_RS05880 rhs3 DUF4329 1550
function
Unknown
X. hortorum jj2001 NMB96_RS19550 rhs3 DUF4329 386
function
Unknown
X. arboricola 1311a LOK40_RS12405 rhs3 DUF4329 1592
function
Unknown
X. citri DAR72029 H8Z73_RS13160 rhs3 DUF4329 1592
function
Unknown
X. citri NT17 AMD13_RS13340 rhs3 DUF4329 1592
function
Unknown
Xanthomonas sp. WG16 IG630_RS13345 rhs3 DUF4329 1592
function
124
Table 6 (continued).
Unknown
X. citri AW15 AMD07_RS10975 rhs3 DUF4329 1592
function
Unknown
CFBP7112 XcvCFBP7112P_ rhs3 DUF4329 538
X. citri function
RS13110
Unknown
X. campestris 85-10 BHE83_RS06255 rhs3 DUF4329 492 function
Unknown
X. citri 8ra FPK90_RS14470 rhs3 DUF4329 459
function
Unknown
X. citri K2 FPL05_RS13800 rhs3 DUF4329 459
function
Unknown
CFBP7119 XcgCFBP7119R_ rhs3 DUF4329 459
X. citri function
RS14225
Unknown
CFBP7111 XcvCFBP7111P_ rhs3 DUF4329 904
X. citri function
RS10715
Unknown
CFBP6990 XcfCFBP6990P_R rhs3 DUF4329 413
X. citri function
S13035
CFBP6991 XcfCFBP6991P_R rhs3 DUF4329 413 Unknown
X. citri function
S13070
X. oryzae MAI134 BVV20_RS16895 rhs4 XOO_2897-deam 1528 Protease
X. oryzae CFBP7331 ACU16_RS16515 rhs4 XOO_2897-deam 1528 Protease
X. oryzae BB156-2 EYC55_RS06280 rhs4 XOO_2897-deam 1528 Protease
X. oryzae NCPPB4346 EYR26_RS16965 rhs4 XOO_2897-deam 1527 Protease
X. oryzae NJ01 MML47_RS05895 rhs4 XOO_2897-deam 1527 Protease
X. oryzae YNJC QN060_RS16955 rhs4 XOO_2897-deam 1527 Protease
X. oryzae HGA4 IZG20_RS13390 rhs4 XOO_2897-deam 811 Protease
X. oryzae PXO404 EBA23_RS08555 rhs4 XOO_2897-deam 1129 Protease
X. oryzae AUST2013 EBA17_RS16090 rhs4 XOO_2897-deam 1527 Protease
X. oryzae JS49-6 LL928_RS15920 rhs4 XOO_2897-deam 1527 Protease
X. oryzae CFBP7331 ACU16_RS16535 rhs4 XOO_2897-deam 1564 Protease
X. oryzae NCPPB4346 EYR26_RS16995 rhs4 XOO_2897-deam 1180 Protease
X. oryzae PXO211 ATY44_RS15850 rhs4 XOO_2897-deam 1564 Protease
X. translucens XtKm33 ISN33_RS07930 rhs4 XOO_2897-deam 1611 Protease
X. translucens CFBP 2055 KCU58_RS05760 rhs4 XOO_2897-deam 1611 Protease
X. translucens LW16 F0H32_RS14580 rhs4 XOO_2897-deam 1156 Protease
X. translucens MAI5034 LTC53_RS14350 rhs4 XOO_2897-deam 1607 Protease
Unknown
X. hortorum jj2001 NMB96_RS19530 rhs5 Undetermined 1598
function
Unknown
X. hortorum JS749-3 BJD10_RS12280 rhs5 Undetermined 1340
function
Unknown
LM16734 XHV734_RS0837 rhs5 Undetermined 1598
X. hortorum function
5
125
Table 6 (continued).
Unknown
X. hortorum ICMP 7383 BI317_RS10180 rhs5 Undetermined 1598
function
Unknown
X. hortorum B07-007 XJ27_RS06820 rhs5 Undetermined 1598
function
Unknown
CFBP 498 CFBP8129_RS1330 rhs5 Undetermined 1598
X. hortorum function
0
Unknown
X. hortorum VT106 DYQ48_RS13035 rhs5 Undetermined 1237
function
Unknown
X. oryzae PXO99A PXO_RS06855 rhs5 Undetermined 1493
function
Unknown
X. oryzae CFBP2286 ACU11_RS25435 rhs5 Undetermined 1493
function
Unknown
X. arboricola 301 P3C56_RS06145 rhs5 Undetermined 1524
function
Unknown
X. prunicola MAI5037 M0D47_RS16575 rhs5 Undetermined 1524
function
Unknown
X. oryzae PXO404 EBA23_RS08200 rhs5 Undetermined 477
function
Unknown
X. oryzae KXO85 EBA20_RS16975 rhs5 Undetermined 477
function
Unknown
X. oryzae JS49-6 LL928_RS15575 rhs5 Undetermined 477
function
Unknown
X. oryzae AUST2013 EBA17_RS16445 rhs5 Undetermined 502
function
Unknown
X. vasicola NCPPB 1060 NX81_RS13995 rhs6 Undetermined 1347 function
Unknown
X. prunicola CIX97 M0D46_RS13530 rhs6 Undetermined 1510
function
Unknown
X. citri ISO12C3 AC612_RS07065 rhs6 Undetermined 1506
function
Unknown
X. citri M12 DGN11_RS06735 rhs6 Undetermined 1318
function
Unknown
X. vasicola NCPPB 2649 NX80_RS11085 rhs6 Undetermined 1502
function
Unknown
X. citri FDC 1609 TP50_RS18950 rhs6 Undetermined 1395
function
Unknown
X. citri CFBP6166 XcfCFBP6166P_RS rhs6 Undetermined 840
function
06935
Unknown
X. citri CFBP6975 XcfCFBP6975P_RS rhs6 Undetermined 840
function
06830
Unknown
X. vasicola NCPPB 1060 NX81_RS17520 rhs6 Undetermined 513
function
Unknown
X. prunicola MAI5069 M0D45_RS05695 rhs6 Undetermined 464
function
Unknown
X. prunicola CIX383 M0D48_RS04930 rhs6 Undetermined 464
function
126
Table 6 (continued).
Unknown
X. prunicola CIX249 M0D43_RS16620 rhs6 Undetermined 464
function
Unknown
X. citri CFBP7112 XcvCFBP711 rhs7 Undetermined 1552
function
2P_RS00780
Unknown
X. citri CFBP6990 XcfCFBP699 rhs7 Undetermined 1576
function
0P_RS00645
Unknown
X. citri CFBP6991 XcfCFBP699 rhs7 Undetermined 1576
function
1P_RS00645
Unknown
X. prunicola MAI5069 M0D45_RS09610 rhs7 Undetermined 1569
function
Unknown
X. prunicola CIX97 M0D46_RS09630 rhs7 Undetermined 1569
function
Unknown
X. axonopodis LMG26789 Xcom_RS02415 rhs7 Undetermined 333
function
X. euvesicatoria LMG930 BJD11_RS24465 rhs7 Undetermined 1096 Unknown
function
Unknown
X. axonopodis F1 XACM_RS19450 rhs7 Undetermined 1557
function
Unknown
X. campestris 85-10 BHE83_RS13365 rhs7 Undetermined 1557
function
Unknown
X. euvesicatoria CFBP3836 XeaCFBP3836p rhs7 Undetermined 853
function
Unknown
X. theicola CFBP 4691 G4Q83_RS22755 rhs7 Undetermined 324
function
UnB-XtecG02- Unknown
X. citri G3566_RS00655 rhs7 Undetermined 913
2 function
X. campestris NEB122 HG421_RS21100 rhs8 Ntox16 1529 Rnase
X. oryzae NJ611 EYC54_RS21870 rhs8 Ntox16 1487 Rnase
X. oryzae BB156-2 EYC55_RS09335 rhs8 Ntox16 1496 Rnase
X. oryzae HGA4 IZG20_RS13790 rhs8 Ntox16 378 Rnase
X. oryzae KXO85 EBA20_RS17030 rhs8 Ntox16 1496 Rnase
X. oryzae PXO99A PXO_RS06285 rhs8 Ntox16 1496 Rnase
X. oryzae PXO404 EBA23_RS08175 rhs8 Ntox16 1496 Rnase
X. oryzae JS49-6 LL928_RS15550 rhs8 Ntox16 1496 Rnase
X. oryzae PXO211 ATY44_RS16160 rhs8 Ntox16 920 Rnase
X. oryzae AUST2013 EBA17_RS16470 rhs8 Ntox16 1003 Rnase
X. oryzae X11-5A EYR27_RS04990 rhs8 Ntox16 214 Rnase
Unknown
X. prunicola MAI5037 M0D47_RS12515 rhs9 Undetermined 1561
function
Unknown
X. prunicola CIX383 M0D48_RS00875 rhs9 Undetermined 1561
function
Unknown
X. prunicola CIX249 M0D43_RS12695 rhs9 Undetermined 1561
function
127
Table 6 (continued).
Unknown
X. cucurbitae OH_261 K6978_RS14005 rhs9 Undetermined 1523
function
Unknown
X. cucurbitae IL_234 K6979_RS05550 rhs9 Undetermined 1523
function
Unknown
X. arboricola 17 XB05_RS25205 rhs9 Undetermined 1541
function
Unknown
X. arboricola YchA LZZ50_RS02835 rhs9 Undetermined 1567
function
Unknown
Xanthomonas sp. 2 H7A87_RS20740 rhs9 Undetermined 410
function
Unknown
X. euroxanthea 1 H7A86_RS00675 rhs9 Undetermined 1548
function
Unknown
X. arboricola CPBF 1494 KHN93_RS00775 rhs9 Undetermined 1548
function
Unknown
Xanthomonas sp. CFBP 8443 NUG20_RS21735 rhs9 Undetermined 1572
function
Unknown
X. oryzae BB156-2 EYC55_RS01340 rhs10 Undetermined 1527
function
Unknown
X. citri UnB- rhs10 Undetermined 1367
G3566_RS22055 function
XtecTG02- 2
Unknown
X. oryzae X11-5A EYR27_RS05005 rhs10 Undetermined 1465
function
Unknown
X. citri MSCT BGK55_RS25635 rhs10 Undetermined 1494
function
Unknown
X. oryzae NJ611 EYC54_RS21145 rhs10 Undetermined 584
function
Unknown
X. vasicola NCPPB 1060 NX81_RS17515 rhs10 Undetermined 851
function
Unknown
X. citri CFBP6166 XcfCFBP6166P_RS rhs10 Undetermined 426
function
12670
Unknown
X. citri CFBP6975 XcfCFBP6975P_RS rhs10 Undetermined 426
function
12645
X. citri ISO12C3 AC612_RS14020 rhs10 Undetermined 426 Unknown
function
Unknown
X. citri M12 DGN11_RS10075 rhs10 Undetermined 426
function
Unknown f
X. translucens XtKm7 ISN30_RS08370 rhs11 Undetermined 1358
unction
Unknown
X. sacchari DD13 NKJ47_RS07105 rhs11 Undetermined 1603 function
Unknown
X. fragariae SHQP01 K4A87_RS04825 rhs11 Undetermined 1489
function
Unknown
X. oryzae BB156-2 EYC55_RS01365 rhs11 Undetermined 1481
function
Unknown
X. oryzae BB151-3 EYC56_RS22600 rhs11 Undetermined 1490
function
128
Table 6 (continued).
Unknown
X. arboricola 301 P3C56_RS09790 rhs11 Undetermined 1487
function
Unknown
X. prunicola CIX383 M0D48_RS04960 rhs11 Undetermined 1530
function
Unknown
X. prunicola CIX249 M0D43_RS16650 rhs11 Undetermined 1530
function
Unknown
X. campestris 85-10 BHE83_RS24020 rhs11 Undetermined 422
function
Unknown
X. sacchari YT9-19-2 NG828_RS06410 rhs11 Undetermined 297
function
Protease
X. oryzae CFBP7337 EBA11_RS16380 rhs12 MafB19-deam 1579
Protease
X. oryzae AXO1947 AXO1947_RS05255 rhs12 MafB19-deam 1579
Protease
X. oryzae CFBP7342 BE73_RS04950 rhs12 MafB19-deam 1579
Protease
X. oryzae BB156-2 EYC55_RS06235 rhs12 MafB19-deam 1579
Protease
X. oryzae BB151-3 EYC56_RS17670 rhs12 MafB19-deam 1579
Protease
X. oryzae PXO99A PXO_RS07175 rhs12 MafB19-deam 1579
Protease
X. oryzae NCPPB4346 EYR26_RS16955 rhs12 MafB19-deam 1178
Protease
X. translucens XtLr8 ISN38_RS15700 rhs12 MafB19-deam 1612
Protease
X. translucens XtKm15 ISN36_RS12090 rhs12 MafB19-deam 1612
Unknown
X. citri UnB-XtecTG02- 2 G3566_RS21935 rhs13 Undetermined 394
function
Unknown
X. citri T21 DGN02_RS22560 rhs13 Undetermined 242
function
Unknown
X. arboricola 17 XB05_RS11070 rhs13 Undetermined 1313
function
Unknown
X. citri CFBP7111 XcvCFBP7111P_RS rhs13 Undetermined 299
function
27080
Unknown
Xanthomonas sp. 2 H7A87_RS06480 rhs13 Undetermined 1330
function
Unknown
X. arboricola CITA 33 KQR53_RS03210 rhs13 Undetermined 241
function
Unknown
X. arboricola SL2098 FPL04_RS16495 rhs13 Undetermined 1324
function
Unknown
X. arboricola 15-088 F6Y24_RS05500 rhs13 Undetermined 1155
function
Unknown
X. arboricola R1 K9U01_RS06725 rhs13 Undetermined 1155
function
Unknown
X. cucurbitae MI_359 K6982_RS13340 rhs14 Undetermined 1509
function
129
Table 6 (continued).
Unknown
X. campestris 29-5 JH291_RS13450 rhs14 Undetermined 1586
function
Unknown
X. citri DAR72029 H8Z73_RS16735 rhs14 Undetermined 1507
function
Unknown
X. citri DAR73886 H8Z71_RS05205 rhs14 Undetermined 1507
function
Unknown
X. citri NT17 AMD13_RS16920 rhs14 Undetermined 1507
function
Unknown
Xanthomonas sp. WG16 IG630_RS16920 rhs14 Undetermined 1507
function
Unknown
X. citri AW15 AMD07_RS05675 rhs14 Undetermined 1507
function
Unknown
X. citri TX160149 B7L66_RS04365 rhs14 Undetermined 1507
function
Unknown
X. fragariae YLX21 OZ429_RS14290 rhs14 Undetermined 859
function
Unknown
X. citri ISO12C3 AC612_RS14010 rhs15 Undetermined 1334
function
Unknown
X. citri T21 DGN02_RS22685 rhs15 Undetermined 1354
function
Unknown
X. campestris 85-10 BHE83_RS06250 rhs15 Undetermined 552
function
Unknown
X. euvesicatoria LMG930 BJD11_RS09255 rhs15 Undetermined 552
function
Unknown
X. prunicola CIX383 M0D48_RS04950 rhs15 Undetermined 488
function
Unknown
X. prunicola CIX249 M0D43_RS16640 rhs15 Undetermined 488
function
Unknown
X. citri UnB-XtecTG02- 2 G3566_RS22010 rhs15 Undetermined 259
function
Unknown
X. citri CFBP6990 XcfCFBP6990P_R rhs15 Undetermined 327
function
S22380
Dnase
X. cucurbitae MI_359 K6982_RS05345 rhs16 HNH 1184
Dnase
X. citri DAR73886 H8Z71_RS14935 rhs16 HNH 1061
Dnase
X. citri DAR72029 H8Z73_RS06720 rhs16 HNH 1061
Dnase
X. citri NT17 AMD13_RS07130 rhs16 HNH 1061
Dnase
Xanthomonas sp. WG16 IG630_RS06660 rhs16 HNH 1061
Dnase
X. citri AW15 AMD07_RS14915 rhs16 HNH 1061
Dnase
X. citri TX160149 B7L66_RS03335 rhs16 HNH 1061
Dnase
X. citri GZ09 LCZ91_RS13550 rhs16 HNH 506
Unknown
X. oryzae X11-5A EYR27_RS02355 rhs17 Undetermined 1600
function
130
Table 6 (continued).
Unknown
X. oryzae BB156-2 EYC55_RS06265 rhs17 Undetermined 1089
function
Unknown
X. oryzae AXO1947 AXO1947_RS2113 rhs17 Undetermined 1563
function
5
Unknown
X. oryzae BB151-3 EYC56_RS17650 rhs17 Undetermined 1565
function
Unknown
X. oryzae BAI23 EYC57_RS05720 rhs17 Undetermined 1567
function
Unknown
X. oryzae CFBP7341 ACU17_RS24220 rhs17 Undetermined 1565
function
Unknown
X. oryzae NJ611 EYC54_RS06110 rhs17 Undetermined 1575
function
Unknown
X. translucens UPB458 KFS86_RS08990 rhs17 Undetermined 1601
function
Unknown
X. hyacinthi CFBP 1156 FZ025_RS00660 rhs18 Undetermined 241
function
Unknown
X. citri MSCT BGK55_RS19730 rhs18 Undetermined 238
function
Unknown
X. citri LMG7439 XapA_RS21055 rhs18 Undetermined 238
function
Unknown
X. citri 8ra FPK90_RS22325 rhs18 Undetermined 238
function
Unknown
X. citri K2 FPL05_RS22220 rhs18 Undetermined 238
function
Unknown
X. citri CFBP7119 XcgCFBP7119R_R rhs18 Undetermined 238
function
S21955
Unknown
X. citri CFBP7112 XcvCFBP7112P_R rhs18 Undetermined 238
function
S06845
UnB-XtecTG02 Unknown
X. citri G3566_RS06390 rhs18 Undetermined 238
function
-2
Unknown
X. hortorum jj2001 NMB96_RS07235 rhs19 Undetermined 303
function
Unknown
X. campestris B100 XCCB100_RS2325 rhs19 Undetermined 303
function
5
Unknown
X. campestris CN15 XCCCN15_RS2137 rhs19 Undetermined 303
function
5
Unknown
X. campestris CN03 XCCCN03_RS2150 rhs19 Undetermined 303
function
5
Unknown
X. campestris 30-1 JH303_RS20925 rhs19 Undetermined 303
function
Unknown
X. campestris BJSJQ20200 JM952_RS21260 rhs19 Undetermined 303
function
612
Unknown
X. campestris 29-5 JH291_RS07000 rhs19 Undetermined 303
function
131
Table 6 (continued).
Unknown
X. fragariae YLX21 OZ429_RS07145 rhs20 Undetermined 1579
function
Unknown
X. vesicatoria LM159 BI313_RS24690 rhs20 Undetermined 1590
function
Unknown
X. euroxanthea CPBF 424 XTG_RS08390 rhs20 Undetermined 1590
function
Unknown
Xanthomonas sp. CFBP 8445 NUG21_RS06315 rhs20 Undetermined 1585
function
Unknown
X. hortorum OSU493 NDY25_RS19930 rhs20 Undetermined 525
function
Unknown
X. hortorum 305 PML25_RS09255 rhs20 Undetermined 525
function
Unknown
X. euvesicatoria CFBP3836 XeaCFBP3836p_R rhs20 Undetermined 1589
function
S05615
Unknown
X. campestris NCPPB 4379 KWO_RS16610 rhs21 Undetermined 1629
function
Unknown
X. citri 8ra FPK90_RS25030 rhs21 Undetermined 1365
function
Unknown
X. citri K2 FPL05_RS24570 rhs21 Undetermined 1365
function
Unknown
X. citri CFBP7119 XcgCFBP7119R_R rhs21 Undetermined 1365
function
S18080
X. oryzae ITCCBB0002 GKO49_RS15840 rhs21 Undetermined 1496 Unknown
function
Unknown
X. oryzae 0-9 GHV42_RS05550 rhs21 Undetermined 1496
function
Unknown
X. oryzae HGA4 IZG20_RS13760 rhs21 Undetermined 1496
function
Unknown
X. vasicola NCPPB 2649 NX80_RS04185 rhs22 Undetermined 1359
function
Unknown
X. campestris 85-10 BHE83_RS26150 rhs22 Undetermined 1545
function
Unknown
X. citri XcmH1005 APY29_RS09475 rhs22 Undetermined 1262
function
Unknown
X. citri XcmN1003 APY30_RS13510 rhs22 Undetermined 1262
function
Unknown
X. citri CFBP 2036 LPY96_RS09700 rhs22 Undetermined 1262
function
Unknown
X. euvesicatoria LMG930 BJD11_RS09260 rhs22 Undetermined 1037
function
Unknown
X. campestris 12112 JH286_RS21150 rhs23 Undetermined 209
function
Unknown
X. cucurbitae OH_261 K6978_RS00120 rhs23 Undetermined 1526
function
Unknown
X. campestris GBBC 3077 QMY63_RS01220 rhs23 Undetermined 1513
function
132
Table 6 (continued).
133
Table 6 (continued).
134
Table 6 (continued).
Unknown
X. arboricola CPBF 765 KHN80_RS22080 rhs32 Undetermined 1452
function
Unknown
X. oryzae CFBP7342 BE73_RS24340 rhs32 Undetermined 1462
function
Unknown
X. oryzae CFBP7341 ACU17_RS24240 rhs32 Undetermined 1462
function
Unknown
X. arboricola 17 XB05_RS25140 rhs32 Undetermined 1455
function
Unknown
X. oryzae CFBP7331 ACU16_RS16875 rhs32 Undetermined 653
function
Unknown
X. citri UnB-XtecTG02- 2 G3566_RS13050 rhs33 Undetermined 415
function
Unknown
X. campestris 30-1 JH303_RS13080 rhs33 Undetermined 513
function
Unknown
X. campestris CN15 XCCCN15_RS0921 rhs33 Undetermined 549
function
5
Unknown
X. campestris CN03 XCCCN03_RS0889 rhs33 Undetermined 549
function
0
Unknown
X. campestris B100 XCCB100_RS2435 rhs33 Undetermined 1250
function
5
Unknown
X. campestris NEB122 HG421_RS19535 rhs34 Undetermined 1600
function
Unknown
X. prunicola MAI5037 M0D47_RS12520 rhs34 Undetermined 368
function
Unknown
X. prunicola CIX383 M0D48_RS00880 rhs34 Undetermined 368
function
Unknown
X. prunicola CIX249 M0D43_RS12700 rhs34 Undetermined 368
function
Unknown
X. hortorum Oregano 108 OEG85_RS00980 rhs34 Undetermined 545
function
Unknown
X. arboricola CPBF 766 KHN05_RS00695 rhs35 Undetermined 1540
function
X. hortorum Oregano 108 OEG85_RS00950 rhs35 Undetermined 1537 Unknown
function
Unknown
X. euroxanthea CPBF 424 XTG_RS00650 rhs35 Undetermined 1560
function
Unknown
X. hortorum Oregano 108 OEG85_RS00960 rhs35 Undetermined 546
function
Unknown
X. hortorum Oregano 108 OEG85_RS00970 rhs35 Undetermined 546
function
Unknown
X. fragariae YLX21 OZ429_RS05525 rhs36 Undetermined 251
function
Unknown
X. hortorum LM16734 XHV734_RS14570 rhs36 Undetermined 1175
function
Unknown
X. arboricola 3 H5027_RS08090 rhs36 Undetermined 1175
function
135
Table 6 (continued).
Unknown
X. arboricola 17 XB05_RS24585 rhs36 Undetermined 1159
function
Unknown
X. citri LMG7439 XapA_RS24475 rhs36 Undetermined 369
function
Dnase
X. citri UnB-XtecTG02- 2 G3566_RS06640 rhs37 Tox-REase-7 1176
Dnase
X. oryzae NCPPB4346 EYR26_RS09400 rhs37 Tox-REase-7 1191
Dnase
X. citri CFBP7111 XcvCFBP7111P_R rhs37 Tox-REase-7 1015
S03540
Dnase
X. vasicola NCPPB 902 NX08_RS17220 rhs37 Tox-REase-7 1052
Dnase
X. citri XcmN1003 APY30_RS17325 rhs37 Tox-REase-7 1186
Unknown
X. citri FDC 1609 TP50_RS15830 rhs38 Undetermined 347
function
Unknown
X. citri CFBP7119 XcgCFBP7119R_R rhs38 Undetermined 1412
function
S09985
Unknown
X. axonopodis F1 XACM_RS08075 rhs38 Undetermined 1412
function
Unknown
X. citri 8ra FPK90_RS10315 rhs38 Undetermined 1412
function
Unknown
X. citri K2 FPL05_RS10375 rhs38 Undetermined 1412
function
Dnase
X. fragariae Fap29 BER93_RS11435 rhs39 Tox-SHH 1530
Dnase
X. fragariae PD5205 PD5205_RS11720 rhs39 Tox-SHH 1530
Dnase
X. fragariae LMG 703 OW158_RS12005 rhs39 Tox-SHH 1018
Dnase
X. translucens ICMP 16317 K8O61_RS11885 rhs39 Tox-SHH 1530
Dnase
X. translucens LMG 843 KFS85_RS06145 rhs39 Tox-SHH 1530
Unknown
X. translucens ICMP 16317 K8O61_RS11900 rhs40 Undetermined 1553
function
Unknown
X. fragariae NBC2815 NBC2815_RS2093 rhs40 Undetermined 1112
function
5
Unknown
X. fragariae Fap29 BER93_RS20935 rhs40 Undetermined 1408
function
Unknown
X. fragariae PD5205 PD5205_RS11680 rhs40 Undetermined 1408
function
Unknown
X. fragariae LMG 703 OW158_RS11970 rhs40 Undetermined 1534
function
Unknown
X. sacchari JR3-14 NG824_RS04165 rhs41 Undetermined 195
function
Unknown
X. sacchari YT9-19-2 NG828_RS17425 rhs41 Undetermined 195
function
136
Table 6 (continued).
Unknown
X. sacchari LT6-2 NG827_RS16975 rhs41 Undetermined 195
function
Unknown
X. sacchari LT6-16-1 NG829_RS16855 rhs41 Undetermined 195
function
Unknown
X. sacchari HR1-32 NG831_RS17630 rhs41 Undetermined 535
function
Unknown
Xanthomonas sp. 2 H7A87_RS08515 rhs42 Undetermined 1589
function
Unknown
X. prunicola MAI5037 M0D47_RS05415 rhs42 Undetermined 1578
function
Unknown
X. prunicola CIX249 M0D43_RS05650 rhs42 Undetermined 1578
function
Unknown
X. theicola CFBP 4691 G4Q83_RS23960 rhs42 Undetermined 576
function
Unknown
X. prunicola CIX383 M0D48_RS15735 rhs42 Undetermined 1536
function
Unknown
X. oryzae CFBP7341 ACU17_RS16415 rhs43 Undetermined 1614
function
Unknown
X. oryzae BB156-2 EYC55_RS06255 rhs43 Undetermined 1614
function
Unknown
X. oryzae CFBP2286 ACU11_RS16075 rhs43 Undetermined 1614
function
Unknown
X. oryzae 0-9 GHV42_RS05875 rhs43 Undetermined 1612
function
Unknown
X. oryzae YN01 IYN96_RS05795 rhs43 Undetermined 1612
function
Unknown
X. cucurbitae OH_261 K6978_RS05695 rhs44 Undetermined 1063
function
Unknown
X. vasicola NCPPB 902 NX08_RS22840 rhs44 Undetermined 1220
function
Unknown
X. citri MSCT BGK55_RS25730 rhs44 Undetermined 448
function
Unknown
X. citri CFBP6990 XcfCFBP6990P_RS rhs44 Undetermined 1044
function
06790
X. citri CFBP6991 XcfCFBP6991P_RS rhs44 Undetermined 1044 Unknown
function
06810
Unknown
X. campestris CFBP5824 JH314_RS16820 rhs45 Undetermined 1576
function
Unknown
X. hortorum OSU493 NDY25_RS04850 rhs45 Undetermined 857
function
Unknown
X. hortorum 305 PML25_RS16215 rhs45 Undetermined 857
function
Unknown
X. fragariae SHQP01 K4A87_RS01775 rhs45 Undetermined 370
function
Unknown
X. fragariae YLX21 OZ429_RS17695 rhs45 Undetermined 921
function
137
Table 6 (continued).
Unknown
X. citri T21 DGN02_RS06730 rhs46 Undetermined 1209
function
Unknown
X. citri CFBP7112 XcvCFBP7112P_R rhs46 Undetermined 1203
function
S16595
Unknown
X. citri LMG7439 XapA_RS24135 rhs46 Undetermined 1203
function
Unknown
X. prunicola MAI5069 M0D45_RS12480 rhs47 Undetermined 1016
function
Unknown
X. vasicola NCPPB 1060 NX81_RS24560 rhs47 DUF4258 292
function
Unknown
X. vasicola Xv1601 CXP37_RS23430 rhs47 DUF4258 344
function
Unknown
X. arboricola 15-088 F6Y24_RS23010 rhs48 COG5529 264
function
Unknown
X. arboricola R1 K9U01_RS02070 rhs48 COG5529 264
function
Unknown
X. arboricola CITA 33 KQR53_RS07875 rhs48 COG5529 264
function
Unknown
X. citri CFBP6990 XcfCFBP6990P_RS rhs49 Undetermined 267
function
22370
Unknown
X. citri CFBP6991 XcfCFBP6991P_RS rhs49 Undetermined 267
function
23120
Unknown
X. citri T21 DGN02_RS22690 rhs49 Undetermined 267
function
Unknown
X. sacchari DJ16 QBE03_RS00945 rhs50 DUF4329 1560
function
Unknown
X. sacchari LT6-16-1 NG829_RS20780 rhs50 DUF4329 1560
function
Unknown
X. sacchari JR3-14 NG824_RS20860 rhs50 DUF4329 1560
function
Dnase
X. oryzae NJ611 EYC54_RS06145 rhs51 LHH 1590
Dnase
X. oryzae BB151-3 EYC56_RS17620 rhs51 LHH 323
Dnase
X. oryzae BB151-3 EYC56_RS17600 rhs51 LHH 1103
Rnase
X. hortorum OSU493 NDY25_RS20705 rhs52 Ntox34 1488
Rnase
X. hortorum 305 PML25_RS10040 rhs52 Ntox34 1488
Rnase
X. hortorum JS749-3 BJD10_RS15725 rhs52 Ntox34 1489
Unknown
Xanthomonas sp. SS HEP74_RS05875 rhs53 Undetermined 1565
function
Unknown
Xanthomonas sp. SI HEP75_RS06045 rhs53 Undetermined 1454
function
Unknown
X. citri T21 DGN02_RS13315 rhs53 Undetermined 1375
function
138
Table 6 (continued).
Unknown
X. sacchari LT6-16-1 NG829_RS20485 rhs54 Undetermined 330
function
Unknown
X. sacchari JR3-14 NG824_RS20515 rhs54 Undetermined 330
function
Unknown
X. sacchari DJ16 QBE03_RS01275 rhs54 Undetermined 330
function
Dnase
X. hortorum ICMP 7383 BI317_RS16880 rhs55 PDDEXK_nuclease 1197
Dnase
X. citri CFBP6975 XcfCFBP6975P_R rhs55 PDDEXK_nuclease 665
S22530
Dnase
X. campestris M28 IFJ81_RS22255 rhs55 PDDEXK_nuclease 1183
Dnase
X. oryzae X11-5A EYR27_RS02345 rhs56 AHH 1512
Dnase
X. oryzae NCPPB4346 EYR26_RS17005 rhs56 AHH 1526
Dnase
X. oryzae PXO404 EBA23_RS08545 rhs56 AHH 1526
Dnase
X. oryzae BB156-2 EYC55_RS24855 rhs56 AHH 320
Unknown
X. campestris ATCC 33913 XCC_RS00685 rhs57 Undetermined 1501
function
Unknown
X. campestris M28 IFJ81_RS00725 rhs57 Undetermined 1501
function
Unknown
X. campestris 8284 JH306_RS00675 rhs57 Undetermined 1501
function
Unknown
X. campestris 10103 JH282_RS00725 rhs57 Undetermined 1501
function
Unknown
X. euvesicatoria CFBP3836 XeaCFBP3836p_R rhs58 Undetermined 1169
function
S16095
Unknown
X. prunicola CIX97 M0D46_RS01270 rhs58 Undetermined 1199
function
Unknown
X. campestris CFBP6690 JH261_RS05475 rhs58 Undetermined 1170
function
Unknown
X. fragariae YLX21 OZ429_RS05545 rhs58 Undetermined 1181
function
Unknown
X. citri MSCT BGK55_RS00640 rhs59 Undetermined 241
function
Unknown
X. citri XcmH1005 APY29_RS00650 rhs59 Undetermined 241
function
Unknown
X. citri XcmN1003 APY30_RS00665 rhs59 Undetermined 241
function
Unknown
X. citri CFBP 2036 LPY96_RS01375 rhs59 Undetermined 241
function
Unknown
X. arboricola 15-088 F6Y24_RS23015 rhs60 Undetermined 265
function
Unknown
X. arboricola CITA 33 KQR53_RS07890 rhs60 Undetermined 265
function
Unknown
X. arboricola R1 K9U01_RS02055 rhs60 Undetermined 1808
function
139
Table 6 (continued).
Unknown
X. sacchari JR3-14 NG824_RS09410 rhs60 Undetermined 1791
function
Unknown
X. CFBP3836 XeaCFBP3836p_ rhs61 Undetermined 1017
function
euvesicatoria RS02340
Unknown
X. prunicola MAI5037 M0D47_RS15275 rhs61 Undetermined 281
function
Unknown
X. prunicola CIX383 M0D48_RS03760 rhs61 Undetermined 281
function
Unknown
X. prunicola CIX249 M0D43_RS15450 rhs61 Undetermined 281
function
Unknown
X. oryzae CFBP7342 BE73_RS27715 rhs62 Undetermined 1442
function
Unknown
X. oryzae BAI23 EYC57_RS05735 rhs62 Undetermined 1547
function
Unknown
X. translucens CFBP 2541 KHA79_RS1198 rhs62 Undetermined 1587
function
0
Unknown
X. fragariae NBC2815 NBC2815_RS118 rhs62 Undetermined 1587
function
35
Unknown
X. arboricola IVIA 3978 KPG65_RS18010 rhs63 Undetermined 1326
function
Unknown
X. citri 8ra FPK90_RS14455 rhs63 Undetermined 1473
function
Unknown
X. citri CFBP7119 XcgCFBP7119R_ rhs63 Undetermined 1473
function
RS14210
X. citri K2 FPL05_RS13785 rhs63 Undetermined 1439 Unknown
function
Unknown
X. translucens LMG 843 KFS85_RS06170 rhs64 DUF2380 1027
function
Unknown
X. translucens Xtu 4699 FD63_RS13525 rhs64 DUF2380 1543
function
Unknown
X. translucens CFBP 2055 KCU58_RS05740 rhs64 DUF2380 1539
function
Unknown
X. translucens ICMP 16317 K8O61_RS11920 rhs64 DUF2380 1539
function
Unknown
X. sacchari JR3-14 NG824_RS14860 rhs65 Undetermined 1572
function
Unknown
X. translucens CFBP 2055 KCU58_RS15120 rhs65 Undetermined 1476
function
Unknown
X. sacchari LT6-16-1 NG829_RS06270 rhs65 Undetermined 1021
function
Unknown
X. translucens LMG 728 KM539_RS06295 rhs65 Undetermined 512 function
Unknown
X. theicola CFBP 4691 G4Q83_RS22720 rhs66 Undetermined 1391
function
Unknown
X. sacchari YT9-19-2 NG828_RS21750 rhs66 Undetermined 1572 function
140
Table 6 (continued).
141
Table 6 (continued).
142
Table 6 (continued).
Unknown
X. translucens LMG 728 KM539_RS10370 rhs80 Undetermined 1524
function
Unknown
X. campestris CFBP 8444 NRY95_RS12615 rhs80 Undetermined 1530
function
Unknown
X. sacchari HR3-46 NG825_RS21015 rhs81 Undetermined 398
function
Unknown
X. sacchari LT6-16-1 NG829_RS11660 rhs81 Undetermined 1577
function
Unknown
X. sacchari YT9-19-2 NG828_RS12090 rhs82 Undetermined 1773
function
Unknown
X. sacchari HR1-32 NG831_RS12155 rhs82 Undetermined 1775
function
Xanthomonas sp. GW HEP73_RS20695 rhs82 Tox-MPTase3 1521 Protease
143
Table 6 (continued).
144
Table 6 (continued).
Unknown
X. prunicola MAI5069 M0D45_RS05685 rhs110 Undetermined 1590
function
Unknown
X. citri CFBP6991 XcfCFBP6991P_R rhs111 Undetermined 1332
function
S13040
Unknown
Xanthomonas sp. SS HEP74_RS13005 rhs112 Undetermined 326
function
Unknown
Xanthomonas sp. CFBP 8443 NUG20_RS15640 rhs113 DUF4329 1556
function
Unknown
X. fragariae SHQP01 K4A87_RS17890 rhs114 Undetermined 237
function
Unknown
X. citri CFBP7119 XcgCFBP7119R_ rhs115 Undetermined 738
function
RS08365
Dnase
X. sacchari HR1-32 NG831_RS21295 rhs116 Tox-REase-9 317
Unknown
X. campestris 5053 JH280_RS00600 rhs117 Undetermined 330
function
Unknown
X. hyacinthi CFBP 1156 FZ025_RS20375 rhs118 Undetermined 1533
function
Unknown
X. hyacinthi CFBP 1156 FZ025_RS10365 rhs119 Undetermined 286
function
Rnase
X. campestris CFBP 8444 NRY95_RS01070 rhs120 Ntox21 329
Unknown
X. oryzae BB151-3 EYC56_RS22565 rhs121 Undetermined 625
function
Xanthomonas sp. SI HEP75_RS15370 rhs122 Undetermined 320 Unknown
function
Unknown
X. theicola CFBP 4691 G4Q83_RS21605 rhs123 Undetermined 1035
function
Unknown
X. theicola CFBP 4691 G4Q83_RS21595 rhs124 Undetermined 425
function
Unknown
X. translucens NCPPB 3711 KHF85_RS19705 rhs125 Undetermined 325
function
Unknown
X. translucens CFBP 2541 KHA79_RS11990 rhs126 Undetermined 1621
function
Xanthomonas sp. SI HEP75_RS09565 rhs127 FHA 342 Protease
Unknown
X. oryzae NJ611 EYC54_RS21155 rhs128 Undetermined 1505
function
X. axonopodis NCPPB 796 XAV_RS01895 rhs129 PMT1 1759 Protease
Unknown
X. fragariae YLX21 OZ429_RS17710 rhs130 Undetermined 565
function
Unknown
Xanthomonas sp. AM6 OCJ37_RS20860 rhs131 Undetermined 1558
function
Unknown
Xanthomonas sp. AM6 OCJ37_RS05755 rhs132 Undetermined 1418
function
Unknown
X. hyacinthi CFBP 1156 FZ025_RS20230 rhs133 Undetermined 229
function
145
Table 6 (continued).
Unknown
X. sacchari DD13 NKJ47_RS07755 rhs134 Undetermined 263
function
Unknown
X. sacchari HR1-32 NG831_RS12145 rhs135 Undetermined 761
function
Unknown
X. oryzae 0-9 GHV42_RS13390 rhs136 Undetermined 1052
function
Unknown
X. sacchari DD13 NKJ47_RS00480 rhs137 Undetermined 329
function
Unknown
Xanthomonas sp. CFBP 8443 NUG20_RS08520 rhs138 Undetermined 336
function
Unknown
X. translucens ICMP11055 NZ30_RS06515 rhs139 Undetermined 1365
function
Protease
X. citri T21 DGN02_RS02115 rhs140 CHAP 1787
Unknown
X. albilineans Xa-FJ1 XaFJ1_RS10230 rhs141 Undetermined 1779
function
Unknown
X. citri CFBP7111 XcvCFBP7111P_ rhs142 Undetermined 1332
function
RS26910
Unknown
Xanthomonas sp. SS HEP74_RS19970 rhs143 Undetermined 1512
function
Unknown
X. albilineans Xa-FJ1 XaFJ1_RS10215 rhs144 Undetermined 652
function
Unknown
Xanthomonas sp. AM6 OCJ37_RS19440 rhs145 Undetermined 1486
function
Dnase
X. citri T21 DGN02_RS06740 rhs146 HNH 648
Unknown
X. hyacinthi CFBP 1156 FZ025_RS14615 rhs147 Undetermined 240
function
Dnase
X. campestris CFBP 8444 NRY95_RS15825 rhs148 HNH 304
Unknown
X. sacchari HR3-46 NG825_RS20620 rhs149 Undetermined 331
function
Unknown
Xanthomonas sp. 2 H7A87_RS05400 rhs150 Undetermined 1587
function
Unknown
X. translucens ICMP11055 NZ30_RS06505 rhs151 Undetermined 1584
function
Unknown
X. sacchari YT9-19-2 NG828_RS10195 rhs152 Undetermined 1510
function
Unknown
X. sacchari DD13 NKJ47_RS02030 rhs153 Undetermined 1527
function
Unknown
X. hyacinthi CFBP 1156 FZ025_RS10335 rhs154 Undetermined 312
function
Unknown
X. hyacinthi CFBP 1156 FZ025_RS09340 rhs155 Undetermined 250
function
Unknown
X. translucens LMG 843 KFS85_RS14540 rhs156 Undetermined 1581
function
Unknown
X. citri ISO12C3 AC612_RS11240 rhs157 Undetermined 233 function
146
Table 6 (continued).
Unknown
Xanthomonas sp. CFBP 8443 NUG20_RS00825 rhs158 Undetermined 332 function
Unknown
X. hortorum JS749-3 BJD10_RS00020 rhs159 Undetermined 562 function
Unknown
X. oryzae BB151-3 EYC56_RS02655 rhs160 Undetermined 1773 function
Unknown
X. citri LMG7439 XapA_RS09500 rhs161 Undetermined 1334
function
Unknown
Xanthomonas sp. GW HEP73_RS22295 rhs162 DUF4157 1539
function
X. euvesicatoria LMG930 BJD11_RS09230 rhs163 Undetermined 493 Unknown
function
Unknown
X. axonopodis NCPPB 796 XAV_RS0 rhs164 Undetermined 604
function
1920
Unknown
Xanthomonas sp. AM6 OCJ37_RS14840 rhs165 Undetermined 300
function
Unknown
X. translucens LMG 728 KM539_RS06285 rhs166 Undetermined 1558
function
Unknown
X. fragariae SHQP01 K4A87_RS01745 rhs167 Undetermined 1807
function
Unknown
X. hyacinthi CFBP 1156 FZ025_RS03295 rhs168 Undetermined 269
function
Unknown
X. citri MSCT BGK55_RS25685 rhs169 Undetermined 316
function
X. arboricola CPBF 766 KHN80_RS03670 rhs170 PT-HINT 1623 Protease
Unknown
X. albilineans Xa-FJ1 XaFJ1_RS11385 rhs171 Undetermined 332
function
Unknown
X. sacchari HR1-32 NG831_RS06820 rhs172 Undetermined 1582
function
Unknown
Xanthomonas sp. SI HEP75_RS10700 rhs173 Undetermined 1515
function
X. theicola CFBP 4691 G4Q83_RS00275 rhs174 Peptidase_C97 427 Protease
Unknown
X. albilineans Xa-FJ1 XaFJ1_RS11545 rhs175 Undetermined 1547
function
Unknown
Xanthomonas sp. SI HEP75_RS20380 rhs176 Undetermined 1482
function
UnB-XtecTG02- Unknown
X. citri G3566_RS22005 rhs177 Undetermined 1421
2 function
Unknown
X. oryzae CFBP7342 BE73_RS10025 rhs178 Undetermined 1334 function
Unknown
X. hyacinthi CFBP 1156 FZ025_RS10385 rhs179 Undetermined 300
function
Unknown
Xanthomonas sp. AM6 OCJ37_RS19420 rhs180 Undetermined 288
function
Unknown
Xanthomonas sp. AM6 OCJ37_RS10125 rhs181 Undetermined 2355
function
147
Table 6 (continued).
Ralstonia
Rhs Putative
Toxin domain Protein
Species strain Rhs locus Toxin predicted
Length
identifier function
Unknown
R. solanacearum YC40-M A3768_RS18470 rhs1 Undetermined 1718
function
R. Unknown
pseudosolanacearum CQPS-1 BC350_RS21085 rhs1 Undetermined 1706
function
Unknown
R. solanacearum HA4-1 CFM90_RS22255 rhs1 Undetermined 1698
function
Unknown
R. solanacearum T117 CJO74_RS17965 rhs1 Undetermined 1716 function
Unknown
R. solanacearum T78 CJO80_RS19245 rhs1 Undetermined 1716
function
Unknown
R. solanacearum T60 CJO81_RS19115 rhs1 Undetermined 1716
function
R. solanacearum T42 CJO83_RS17890 rhs1 Undetermined 1740 Unknown
function
Unknown
R. solanacearum SL3730 CJO90_RS17890 rhs1 Undetermined 1740
function
Unknown
R. solanacearum SL3300 CJO91_RS18780 rhs1 Undetermined 1722
function
Unknown
R. solanacearum UW386 E7Z57_RS20405 rhs1 Undetermined 506
function
Unknown
R. solanacearum 204 G7939_RS19800 rhs1 Undetermined 1706
function
Unknown
R. solanacearum B2 G7968_RS24855 rhs1 Undetermined 370
function
Unknown
R. solanacearum 202 G7969_RS18430 rhs1 Undetermined 1706
function
Unknown
R. solanacearum FJAT454.F50-1 H9X76_RS24320 rhs1 Undetermined 1706
function
Unknown
R. solanacearum FJAT1303.F50 HI793_RS18750 rhs1 Undetermined 418 function
Unknown
R. solanacearum FJAT1452.F50 HI796_RS17655 rhs1 Undetermined 1718
function
Unknown
R. solanacearum FJAT1463.F1 HI799_RS19530 rhs1 Undetermined 1706
function
Unknown
R. solanacearum FJAT15304.F50 HI808_RS17550 rhs1 Undetermined 1706
function
Unknown
R. solanacearum FJAT15304.F6 HI809_RS17565 rhs1 Undetermined 1706
function
Unknown
R. solanacearum FJAT15340.F50 HI811_RS17550 rhs1 Undetermined 1706
function
Unknown
R. solanacearum FJAT15353.F1 HI813_RS18740 rhs1 Undetermined 418
function
Unknown
R. solanacearum FJAT15353.F50 HI814_RS18745 rhs1 Undetermined 418
function
148
Table 6 (continued).
Unknown
R. solanacearum FJAT15353.F8 HI815_RS18735 rhs1 Undetermined 418
function
Unknown
R. solanacearum FJAT442.F1 HI816_RS17665 rhs1 Undetermined 1718
function
Unknown
R. solanacearum FJAT442.F50 HI817_RS17655 rhs1 Undetermined 1718
function
Unknown
R. solanacearum FJAT445.F50 HI819_RS17660 rhs1 Undetermined 1718
function
Unknown
R. solanacearum FJAT91.F50 HI824_RS18695 rhs1 Undetermined 1704
function
Unknown
R. solanacearum FJAT91-F8 HWE47_RS2254 rhs1 Undetermined 1704
function
0
Unknown
R. solanacearum 362200 IMF25_RS24870 rhs1 Undetermined 1698
function
R. Unknown
pseudosolanacearum PeaFJ1 JNO62_RS22440 rhs1 Undetermined 1698
function
Unknown
Ralstonia sp. RS647 LGV81_RS21355 rhs1 Undetermined 1718
function
Unknown
R. solanacearum Bs715 NQ321_RS04685 rhs1 Undetermined 1716 function
Unknown
R. solanacearum Wj644 NQ322_RS19390 rhs1 Undetermined 1704
function
Unknown
R. solanacearum MolK2 PG909_RS09415 rhs1 Undetermined 414
function
R.
pseudosolanacearum CQPS-1 BC350_RS23285 rhs2 GH-E 1522 Dnase
R. solanacearum T117 CJO74_RS23195 rhs2 GH-E 1522 Dnase
R. solanacearum T78 CJO80_RS24500 rhs2 GH-E 1522 Dnase
R. solanacearum T60 CJO81_RS24410 rhs2 GH-E 1522 Dnase
R. solanacearum T42 CJO83_RS22000 rhs2 GH-E 1522 Dnase
R. solanacearum SL3730 CJO90_RS22770 rhs2 GH-E 1522 Dnase
R. solanacearum SL3300 CJO91_RS23830 rhs2 GH-E 1522 Dnase
Unknown
R. solanacearum SL2729 CJO95_RS17870 rhs2 Undetermined 1734
function
R. solanacearum SL2729 CJO95_RS22760 rhs2 GH-E 1522 Dnase
Unknown
R. solanacearum RSCM CYD94_RS25805 rhs2 Undetermined 1471
function
R. solanacearum 204 G7939_RS25520 rhs2 GH-E 1522 Dnase
R. solanacearum B2 G7968_RS19500 rhs2 GH-E 1522 Dnase
R. solanacearum 202 G7969_RS20605 rhs2 GH-E 1522 Dnase
R. solanacearum FJAT454.F50-1 H9X76_RS21700 rhs2 GH-E 1522 Dnase
R. solanacearum FJAT1303.F50 HI793_RS23775 rhs2 GH-E 1522 Dnase
R. solanacearum FJAT1463.F1 HI799_RS25155 rhs2 GH-E 1522 Dnase
149
Table 6 (continued).
150
Table 6 (continued).
R. syzygii Dnase
BDBR229 PG904_RS11850 rhs3 AHH 1524
R. Dnase
pseudosolanacearum GMI1000 RS_RS22605 rhs3 AHH 1518
Unknown
R. solanacearum UW386 E7Z57_RS01595 rhs4 Undetermined 763
function
Unknown
R. solanacearum UW386 E7Z57_RS20465 rhs4 Undetermined 1735
function
Unknown
R. solanacearum RUN2279 HF908_RS21900 rhs4 Undetermined 641
function
Unknown
R. solanacearum RUN2474 HF909_RS21515 rhs4 Undetermined 641
function
Unknown
R. solanacearum FJAT1303.F50 HI793_RS18720 rhs4 Undetermined 1722
function
Unknown
R. solanacearum FJAT15353.F1 HI813_RS18715 rhs4 Undetermined 1722
function
Unknown
R. solanacearum FJAT15353.F50 HI814_RS18715 rhs4 Undetermined 1722
function
Unknown
R. solanacearum FJAT15353.F8 HI815_RS18705 rhs4 Undetermined 1722
function
Unknown
R. syzygii UGMSS_Db01 JK151_RS22250 rhs4 Undetermined 763
function
R. Unknown
pseudosolanacearum LMG 9673 NY025_RS04810 rhs4 Undetermined 642
function
Unknown
R. syzygii BDBR229 PG904_RS23545 rhs4 Undetermined 763
function
R. Unknown
pseudosolanacearum RUN2340 PG907_RS18570 rhs4 Undetermined 641
function
Unknown
R. solanacearum PSI07 RPSI07_RS23215 rhs4 Undetermined 763 function
Unknown
R. solanacearum K60 B7R77_18740 rhs4 Undetermined 1,732
function
R. solanacearum IBSBF 2571 C2124_RS18965 rhs5 AHH 1433 Dnase
R. solanacearum RS 488 CCY86_RS18665 rhs5 AHH 1433 Dnase
R. solanacearum RS 489 CDC59_RS18560 rhs5 AHH 1433 Dnase
R. solanacearum CIAT_078 G8D25_RS02275 rhs5 AHH 1433 Dnase
R. solanacearum CFBP2957 PG903_RS23825 rhs5 AHH 1433 Dnase
R. solanacearum IBSBF1503 RALBFv3_RS16990 rhs5 AHH 1433 Dnase
R. solanacearum Po82 RSPO_RS18555 rhs5 AHH 1433 Dnase
R. solanacearum UY031 RSUY_RS18170 rhs5 AHH 1433 Dnase
R. solanacearum UW163 UW163_RS18365 rhs5 AHH 1433 Dnase
R. solanacearum UW551 B7R79_19675 rhs5 AHH 1433 Dnase
Unknown
R. syzygii LLRS-1 GO998_RS16440 rhs6 Undetermined 778
function
151
Table 6 (continued).
R. solanacearum Unknown
UW763 HF907_RS21545 rhs6 Undetermined 779
function
Ralstonia sp. Unknown
RS642 LGV80_RS25870 rhs6 Undetermined 779
function
Ralstonia sp. Unknown
RS650 LGV82_RS18275 rhs6 Undetermined 779
function
R. solanacearum Unknown
CFBP2957 PG903_RS00780 rhs6 Undetermined 780
function
R. syzygii Unknown
BDBR229 PG904_RS22280 rhs6 Undetermined 779
function
R. solanacearum Unknown
SEPPX05 RSSE_RS25425 rhs6 Undetermined 779
function
R. solanacearum Unknown
K60 B7R77_25580 rhs6 Undetermined 193
function
R. solanacearum Unknown
IBSBF 2571 C2124_RS17055 rhs7 Undetermined 476
function
R. solanacearum Unknown
CIAT_078 G8D25_RS07315 rhs7 Undetermined 461
function
R. solanacearum Unknown
RUN2279 HF908_RS21925 rhs7 Undetermined 674
function
R. solanacearum Unknown
RUN2474 HF909_RS21530 rhs7 Undetermined 669
function
R. Unknown
pseudosolanacearum LMG 9673 NY025_RS04795 rhs7 Undetermined 1383
function
R. Unknown
pseudosolanacearum RUN2340 PG907_RS18555 rhs7 Undetermined 1730
function
Unknown
R. solanacearum IBSBF1503 RALBFv3_RS2265 rhs7 Undetermined 1727
function
5
Unknown
R. solanacearum Po82 RSPO_RS16785 rhs7 Undetermined 476
function
Unknown
R. solanacearum UW163 UW163_RS23205 rhs7 Undetermined 476
function
R. Unknown
pseudosolanacearum RS 476 CDC45_RS20075 rhs8 Undetermined 742 function
R. solanacearum Rs5 HF906_RS14655 rhs8 ParB 745 Dnase
R. solanacearum UW763 HF907_RS21440 rhs8 ParB 742 Dnase
R. solanacearum RUN2279 HF908_RS21940 rhs8 ParB 1702 Dnase
R. solanacearum RUN2474 HF909_RS21545 rhs8 ParB 1726 Dnase
R.
pseudosolanacearum LMG 9673 NY025_RS04770 rhs8 ParB 524 Dnase
R. solanacearum CFBP2957 PG903_RS16825 rhs8 ParB 1699 Dnase
R.
pseudosolanacearum GMI1000 RS_RS19450 rhs8 ParB 742 Dnase
R. solanacearum K60 B7R77_06485 rhs8 ParB 648 Dnase
R. Unknown
pseudosolanacearum RS 476 CDC45_RS18105 rhs9 Undetermined 765
function
152
Table 6 (continued).
153
Table 6 (continued).
Unknown
R. solanacearum T51 CJO82_RS24930 rhs13 Undetermined 249
function
Unknown
R. solanacearum SL3175 CJO92_RS25460 rhs13 Undetermined 209
function
Unknown
R. solanacearum SL2064 CJO98_RS25250 rhs13 Undetermined 249
function
Unknown
R. solanacearum KACC 10722 LBM2029_RS25550 rhs13 Undetermined 249
function
Unknown
R. syzygii LLRS-1 GO998_RS00950 rhs14 DUF2380 776
function
Unknown
R. solanacearum KACC 10709 LBM341_RS13945 rhs14 DUF2380 776
function
R. Unknown
Sw698 NQS37_RS02985 rhs14 DUF2380 776
pseudosolanacearum function
Unknown
R. solanacearum CFBP2957 PG903_RS16705 rhs14 DUF2380 777
function
Unknown
R. solanacearum MolK2 PG909_RS13555 rhs14 DUF2380 777
function
Unknown
R. solanacearum RS 488 CCY86_RS01325 rhs15 Undetermined 782
function
Unknown
R. solanacearum RS 489 CDC59_RS01315 rhs15 Undetermined 783
function
Unknown
R. syzygii LLRS-1 GO998_RS06900 rhs15 Undetermined 781
function
Unknown
Ralstonia sp. RS650 LGV82_RS01545 rhs15 Undetermined 784
function
Unknown
R. solanacearum UY031 RSUY_RS01280 rhs15 Undetermined 782
function
Unknown
R. solanacearum T101 CJO76_RS21760 rhs16 Undetermined 1475
function
Unknown
R. solanacearum T82 CJO79_RS21750 rhs16 Undetermined 1475
function
Unknown
R. solanacearum T12 CJO85_RS21880 rhs16 Undetermined 1475
function
Unknown
R. syzygii LLRS-1 GO998_RS24545 rhs16 Undetermined 1475
function
Unknown
R. solanacearum CFBP2957 PG903_RS22375 rhs16 Undetermined 1474
function
R. Unknown
pseudosolanacearum RS 476 CDC45_RS24230 rhs17 Undetermined 741
function
R. Unknown
pseudosolanacearum RS 476 CDC45_RS24390 rhs17 Undetermined 741
function
R. solanacearum UW763 HF907_RS25250 rhs17 Undetermined 789 Unknown
function
R. Unknown
pseudosolanacearum GMI1000 RS_RS23445 rhs17 Undetermined 741
function
R. Unknown
GMI1000 RS_RS23575 rhs17 Undetermined 741
pseudosolanacearum function
154
Table 6 (continued).
Unknown
R. solanacearum UW763 HF907_RS22995 rhs18 Undetermined 1695
function
Unknown
R. solanacearum MAFF 301560 JK146_RS18420 rhs18 Undetermined 1702
function
Unknown
R. solanacearum RS24 KM864_RS24520 rhs18 Undetermined 1702
function
Unknown
R. solanacearum T98 CJO77_RS11790 rhs19 Undetermined 733
function
Unknown
R. solanacearum SL3175 CJO92_RS11785 rhs19 Undetermined 733
function
Unknown
R. solanacearum CFBP2957 PG903_RS18315 rhs19 Undetermined 734
function
Dnase
R. solanacearum UW763 HF907_RS03305 rhs20 GIY-YIG_SF 775
Dnase
R. solanacearum MolK2 PG909_RS04760 rhs20 GIY-YIG_SF 777
Dnase
R. solanacearum K60 B7R77_09775 rhs20 GIY-YIG_SF 784
Unknown
R. solanacearum T98 CJO77_RS22060 rhs21 Undetermined 1539
function
Unknown
R. solanacearum SL3175 CJO92_RS22080 rhs21 Undetermined 1539
function
Unknown
R. solanacearum PSI07 RPSI07_RS26455 rhs21 Undetermined 1539
function
Unknown
R. solanacearum HA4-1 CFM90_RS25210 rhs22 Undetermined 1482
function
Unknown
R. solanacearum 362200 IMF25_RS21770 rhs22 Undetermined 1482
function
R. Unknown
PeaFJ1 JNO62_RS19340 rhs22 Undetermined 1482
pseudosolanacearum function
Unknown
R. solanacearum RS 488 CCY86_RS16535 rhs23 Undetermined 758
function
Unknown
R. solanacearum RS 489 CDC59_RS16455 rhs23 Undetermined 757
function
Unknown
R. solanacearum UY031 RSUY_RS16195 rhs23 Undetermined 758
function
Unknown
R. wenshanensis 56D2 KOL96_RS09195 rhs24 Undetermined 1887
function
Unknown
Ralstonia sp. RS642 LGV80_RS13235 rhs24 Undetermined 754
function
Unknown
Ralstonia sp. RS650 LGV82_RS12745 rhs24 Undetermined 754
function
R. Unknown
RS 476 CDC45_RS18395 rhs25 Undetermined 1711
pseudosolanacearum function
Unknown
R. solanacearum B2 G7968_RS24875 rhs25 Undetermined 1713
function
R. Unknown
pseudosolanacearum GMI1000 RS_RS17875 rhs25 Undetermined 1711
function
155
Table 6 (continued).
Unknown
Ralstonia sp. RS642 LGV80_RS25765 rhs26 Undetermined 311
function
Unknown
Ralstonia sp. RS650 LGV82_RS26520 rhs26 Undetermined 311
function
Unknown
R. solanacearum K60 B7R77_06540 rhs26 Undetermined 752
function
Unknown
R. solanacearum T98 CJO77_RS05780 rhs27 HI1514 774 function
Unknown
R. solanacearum SL3175 CJO92_RS05780 rhs27 HI1514 774
function
Unknown
R. solanacearum PSI07 RPSI07_RS13970 rhs27 HI1514 774
function
Unknown
R. solanacearum T95 CJO78_RS07765 rhs28 Undetermined 799
function
Unknown
R. solanacearum T51 CJO82_RS07540 rhs28 Undetermined 799
function
Unknown
R. solanacearum SL2064 CJO98_RS07775 rhs28 Undetermined 799
function
Unknown
R. solanacearum KACC 10722 LBM2029_RS07455 rhs28 Undetermined 799
function
R. solanacearum RS 488 CCY86_RS00760 rhs29 Ntox44 775 Rnase
R. solanacearum MolK2 PG909_RS12995 rhs29 Ntox44 776 Rnase
R. solanacearum UY031 RSUY_RS00730 rhs29 Ntox44 775 Rnase
R. solanacearum UW551 B7R79_15660 rhs29 Ntox44 775 Rnase
R. solanacearum T98 CJO77_RS06785 rhs30 Undetermined 750 Unknown
function
R. solanacearum SL3175 CJO92_RS06785 rhs30 Undetermined 750 Unknown
function
Ralstonia sp. RS642 LGV80_RS25775 rhs30 Undetermined 752 Unknown
function
Ralstonia sp. RS650 LGV82_RS18195 rhs30 Undetermined 752 Unknown
function
R. solanacearum RS 488 CCY86_RS17490 rhs31 DUF2778 764 Unknown
function
R. solanacearum RS 489 CDC59_RS17400 rhs31 DUF2778 764 Unknown
function
R. solanacearum UY031 RSUY_RS17095 rhs31 DUF2778 764 Unknown
function
R. solanacearum UW551 B7R79_18395 rhs31 DUF2778 764 Unknown
function
R. solanacearum IBSBF 2571 C2124_RS17000 rhs32 Undetermined 1716 Unknown
function
R. solanacearum CIAT_078 G8D25_RS07370 rhs32 Undetermined 1716 Unknown
function
156
Table 6 (continued).
Unknown
R. solanacearum Po82 RSPO_RS16740 rhs32 Undetermined 1716
function
Unknown
R. solanacearum UW163 UW163_RS23250 rhs32 Undetermined 1716
function
Unknown
R. solanacearum T98 CJO77_RS00965 rhs33 Undetermined 811
function
Unknown
R. solanacearum SL3175 CJO92_RS00965 rhs33 Undetermined 811
function
Unknown
R. solanacearum T98 CJO77_RS07310 rhs34 Undetermined 727
function
Unknown
R. solanacearum SL3175 CJO92_RS07305 rhs34 Undetermined 727
function
Unknown
R. solanacearum CFBP2957 PG903_RS18400 rhs35 NuoF 796
function
R. Unknown
pseudosolanacearum RUN2340 PG907_RS00945 rhs35 Undetermined 801 function
R. solanacearum Rs5 HF906_RS14570 rhs36 Undetermined 743 Unknown
function
R. Unknown
pseudosolanacearum LMG 9673 NY025_RS04740 rhs36 Undetermined 1701
function
R. solanacearum Rs5 HF906_RS22570 rhs37 AHH 1420 Dnase
R. solanacearum K60 B7R77_24635 rhs37 AHH 1418 Dnase
Unknown
R. syzygii LLRS-1 GO998_RS07635 rhs38 Undetermined 727
function
Unknown
R. solanacearum CFBP2957 PG903_RS18480 rhs38 Undetermined 133
function
Unknown
R. solanacearum UW763 HF907_RS23235 rhs39 Undetermined 630 function
R. Unknown
RUN2340 PG907_RS00925 rhs39 Undetermined 787 function
pseudosolanacearum
Unknown
R. syzygii LLRS-1 GO998_RS01050 rhs40 Undetermined 135 function
R. Unknown
RUN2340 PG907_RS19990 rhs40 Undetermined 661 function
pseudosolanacearum
Rnase
R. solanacearum T98 CJO77_RS15880 rhs41 Ntox37 768
Rnase
R. solanacearum SL3175 CJO92_RS15890 rhs41 Ntox37 768
Unknown
R. syzygii LLRS-1 GO998_RS01035 rhs42 DUF4258 760 function
Unknown
R. solanacearum UW763 HF907_RS11975 rhs42 DUF4258 760 function
Unknown
R. solanacearum RUN2279 HF908_RS21860 rhs43 Undetermined 195 function
Unknown
R. solanacearum RUN2474 HF909_RS21460 rhs43 Undetermined 180 function
Unknown
R. syzygii LLRS-1 GO998_RS25125 rhs44 Undetermined 767 function
Unknown
R. solanacearum RUN2474 HF909_RS15140 rhs44 Undetermined 198 function
157
Table 6 (continued).
158
Table 6 (continued).
159
Table 6 (continued).
160
Table 6 (continued).
161
Table 6 (continued).
Unknown
P. parmentieri WPP163 PECWA_RS01740 rhs10 Undetermined 1379
function
Unknown
P. parmentieri WC19161 JBL47_RS20215 rhs10 Undetermined 1379
function
Unknown
P. parmentieri IFB5408 C5E17_RS01945 rhs10 Undetermined 1379
function
P. polaris NIBIO1006 BJJ97_RS21700 rhs11 Ntox30 1774 Rnase
P. aroidearum QJ021 N5056_RS16410 rhs11 Ntox30 1704 Rnase
P. aquaticum A212-S19-A16 DMB82_RS05180 rhs11 Ntox30 1729 Rnase
P. polaris NIBIO1392 BJK05_RS11620 rhs11 Ntox30 1729 Rnase
P. aroidearum LJ2 LCF43_RS05215 rhs11 Ntox30 1772 Rnase
P. atrosepticum Green1 HLB43_RS04945 rhs11 Ntox30 1647 Rnase
Unknown
P. parmentieri IFB5441 C5E20_RS02970 rhs12 Undetermined 150
function
Unknown
P. parmentieri WPP163 PECWA_RS02590 rhs12 Undetermined 150
function
Unknown
P. parmentieri IFB5408 C5E17_RS02930 rhs12 Undetermined 150
function
Unknown
P. parmentieri IFB5485 C5E21_RS03095 rhs12 Undetermined 150
function
Unknown
P. parmentieri WC19161 JBL47_RS19215 rhs12 Undetermined 150
function
Unknown
P. odoriferum BC S7 BCS7_RS18250 rhs13 Undetermined 1279
function
Unknown
P. parvum YT22221 NV347_RS17715 rhs13 Undetermined 2733
function
Unknown
P. parvum FN20211 LOZ86_RS17700 rhs13 Undetermined 2733
function
Unknown
Pectobacterium F1-1 NAL19_RS17595 rhs13 Undetermined 2704
function
sp.
Unknown
P. odoriferum JK2.1 EO763_RS18400 rhs13 Undetermined 2459
function
Unknown
P. atrosepticum SCRI1043 ECA_RS14080 rhs14 Undetermined 1419
function
Unknown
P. parmentieri IFB5441 C5E20_RS12580 rhs14 Undetermined 1329
function
Unknown
P. parmentieri SCC3193 W5S_RS11680 rhs14 Undetermined 1329
function
Unknown
P. parmentieri IFB5485 C5E21_RS11275 rhs14 Undetermined 1329
function
Unknown
P. parmentieri QK-5 IG623_RS11480 rhs14 Undetermined 1329
function
Unknown
P. brasiliense 1692 GT391_RS12385 rhs15 Undetermined 1703
function
Unknown
P. parmentieri WC19161 JBL47_RS21180 rhs15 Undetermined 1777
function
Unknown
P. parmentieri WC19161 JBL47_RS20145 rhs15 Undetermined 693
function
162
Table 6 (continued).
163
Table 6 (continued).
Dnase
P. parmentieri IFB5485 C5E21_RS12620 rhs25 GH-E 1413
Dnase
P. parmentieri WPP163 PECWA_RS13170 rhs25 GH-E 1428
Dnase
P. parmentieri QK-5 IG623_RS12785 rhs25 GH-E 1428
Unknown
P. odoriferum JK2.1 EO763_RS00285 rhs26 Undetermined 1459
function
Unknown
P. aroidearum L6 I2D83_RS00315 rhs26 Undetermined 1451
function
Unknown
P. carotovorum WPP14 HER17_RS00295 rhs26 Undetermined 1451 function
IPO:4062 Unknown
P. brasiliense H5A40_RS06580 rhs26 Undetermined 1451
NAK:237 function
Unknown
P. parmentieri IFB5441 C5E20_RS13950 rhs27 Undetermined 1454
function
P. parmentieri WPP163 PECWA_RS11940 rhs27 GIY-YIG_SF 1348 Dnase
P. parmentieri WC19161 JBL47_RS00905 rhs27 GIY-YIG_SF 1438 Dnase
P. carotovorum ZJ-4-2 PSR30_RS12465 rhs27 GIY-YIG_SF 1437 Dnase
Unknown
P. parmentieri IFB5485 C5E21_RS02065 rhs28 Undetermined 703 function
Unknown
P. parmentieri QK-5 IG623_RS02105 rhs28 Undetermined 703
function
Unknown
P. parmentieri HC GMW39_RS24840 rhs28 Undetermined 151
function
Unknown
P. parmentieri IFB5441 C5E20_RS24400 rhs28 Undetermined 151
function
Pectobacterium sp. PL64 HP572_RS14765 rhs29 dnaA 1748 Dnase
Unknown
P. brasiliense BZA12 CTV95_RS20455 rhs29 Undetermined 1750
function
P. brasiliense SX309 B5S52_RS22765 rhs29 Undetermined 1750 Unknown
function
Unknown
P. carotovorum PCCS1 IQ281_RS00565 rhs29 Undetermined 1677
function
P. carotovorum PC1 PC1_RS16100 rhs30 Ribonuclease 1639 Rnase
P. versatile F131 F131LOC_RS16115 rhs30 Ribonuclease 1721 Rnase
P. polaris NIBIO1392 BJK05_RS11575 rhs30 Ribonuclease 737 Rnase
P. odoriferum JK2.1 EO763_RS17005 rhs30 Ribonuclease 566 Rnase
P. versatile SR12 LGL95_RS09080 rhs31 Ntox15 1429 Rnase
NES2-
P. polaris NIBIO1392 BJK05_RS07590 rhs31 NLS_ChREBP-like 1429 Dnase
Unknown
P. parmentieri IFB5485 C5E21_RS02015 rhs32 Undetermined 701
function
Unknown
P. parmentieri QK-5 IG623_RS02055 rhs32 Undetermined 701
function
Unknown
Pectobacterium sp. 21LCBS03 MYB54_RS04870 rhs33 Undetermined 1757
function
164
Table 6 (continued).
Unknown
P. carotovorum RC5297 F9W95_RS20515 rhs33 Undetermined 1758
function
Protease
P. parmentieri HC GMW39_RS20680 rhs34 TNT 1789
Protease
P. parmentieri HC GMW39_RS24965 rhs34 TNT 130
Unknown
P. aroidearum QJ021 N5056_RS00290 rhs35 Undetermined 1390
function
Unknown
P. brasiliense SX309 B5S52_RS00340 rhs35 Undetermined 1391
function
Unknown
P. parmentieri HC GMW39_RS17190 rhs36 Undetermined 1385 function
Unknown
P. parmentieri IFB5441 C5E20_RS02000 rhs36 Undetermined 1385
function
Unknown
P. parmentieri HC GMW39_RS25060 rhs37 Undetermined 483
function
Unknown
P. parmentieri IFB5441 C5E20_RS24770 rhs37 Undetermined 483
function
P. parmentieri SCC3193 W5S_RS17045 rhs38 Undetermined 1423 Unknown
function
Unknown
P. parmentieri QK-5 IG623_RS02250 rhs38 Undetermined 108
function
P. aroidearum AK042 L0Y25_RS05390 rhs39 LHH 1690 Dnase
P. aroidearum AK042 L0Y25_RS05420 rhs39 LHH 150 Dnase
P. versatile A73-S18-O15 LLE50_RS06165 rhs40 LHH 1712 Dnase
P. versatile 14A EIP93_RS17085 rhs40 LHH 1712 Dnase
P. parmentieri IFB5408 C5E17_RS21815 rhs41 CdiA-CT 530 Rnase
P. parmentieri WC19161 JBL47_RS21230 rhs41 CdiA-CT 224 Rnase
P. parmentieri IFB5408 C5E17_RS13630 rhs42 Tox-URI2 1411 Dnase
P. parmentieri WC19161 JBL47_RS09455 rhs42 Tox-URI2 1411 Dnase
Unknown
Pectobacterium sp. 21LCBS03 MYB54_RS00315 rhs43 Undetermined 1407
function
Unknown
P. brasiliense TS20HJ1 LJQ72_RS00295 rhs43 Undetermined 132
function
P. brasiliense BC1 NC16_RS00275 rhs44 HNHc 1454 Dnase
P. carotovorum A077-S18-O15 LHL03_RS00305 rhs44 HNHc 679 Dnase
Unknown
P. versatile A73-S18-O15 LLE50_RS11935 rhs45 Undetermined 1404
function
Unknown
P. carotovorum RC5297 F9W95_RS04425 rhs45 Undetermined 698
function
Unknown
P. parmentieri IFB5485 C5E21_RS01905 rhs46 Undetermined 1385
function
Unknown
P. parmentieri QK-5 IG623_RS01900 rhs46 Undetermined 1385
function
P. parmentieri IFB5485 C5E21_RS02050 rhs47 Tox-ART-HYD1 675 Protease
P. parmentieri QK-5 IG623_RS02090 rhs47 Tox-ART-HYD1 675 Protease
165
Table 6 (continued).
Dnase
P. polaris QK413-1 KSL88_RS00315 rhs48 Tox-SHH 1453
Dnase
P. polaris NIBIO1392 BJK05_RS16395 rhs48 Tox-SHH 1453
Rnase
P. carotovorum PCCS1 IQ281_RS00550 rhs49 CdiA-CT 634
Rnase
P. brasiliense 21PCA_AGRO2 OWC53_RS17050 rhs49 CdiA-CT 136
Dnase
P. brasiliense IPO:4062 H5A40_RS00715 rhs50 Tox-URI2 1748
NAK:237
Dnase
P. parmentieri IFB5408 C5E17_RS21685 rhs50 Tox-URI2 1760
Unknown
P. parmentieri IFB5485 C5E21_RS23365 rhs51 Undetermined 174
function
Unknown
P. parmentieri QK-5 IG623_RS23555 rhs51 Undetermined 174
function
P. aroidearum QJ011 L0Y21_RS00280 rhs52 Tox-ART-HYD1 1400 Protease
P. brasiliense 130 MBA20_RS00310 rhs52 Tox-ART-HYD1 1401 Protease
Unknown
P. cacticida CFCC10813 OI450_RS15625 rhs53 Undetermined 1622
function
Unknown
P. parmentieri WC19161 JBL47_RS21215 rhs53 Undetermined 145
function
Unknown
P. odoriferum JK2.1 EO763_RS17045 rhs54 Undetermined 1737
function
Unknown
P. carotovorum ZM1 GBN63_RS21300 rhs54 Undetermined 220
function
P. brasiliense 1692 GT391_RS12395 rhs55 CdiA-CT 242 Rnase
P. aquaticum A212-S19-A16 DMB82_RS05195 rhs55 CdiA-CT 539 Rnase
P. parmentieri IFB5408 C5E17_RS12225 rhs56 Ntox30 1334 Rnase
P. parmentieri WC19161 JBL47_RS10785 rhs56 Ntox30 1334 Rnase
P. polaris NIBIO1006 BJJ97_RS05660 rhs57 ADPRTs_Tse2 1395 Dnase
P. carotovorum RC5297 F9W95_RS04415 rhs57 ADPRTs_Tse2 466 Dnase
P. aquaticum A212-S19-A16 DMB82_RS00260 rhs58 Aromatic_hydrox 1425 Dnase
Unknown
P. carotovorum ZM1 GBN63_RS05270 rhs58 Undetermined 1425
function
Unknown
P. parmentieri IFB5485 C5E21_RS23240 rhs59 Undetermined 1758
function
Unknown
P. parmentieri QK-5 IG623_RS23445 rhs59 Undetermined 1762
function
Unknown
P. parmentieri SCC3193 W5S_RS23555 rhs60 Undetermined 127
function
Unknown
P. parmentieri QK-5 IG623_RS02165 rhs60 Undetermined 127 function
Unknown
P. versatile 14A EIP93_RS21970 rhs61 Undetermined 189
function
Unknown
P. carotovorum RC5297 F9W95_RS04435 rhs61 Undetermined 165
function
166
Table 6 (continued).
Unknown
P. carotovorum 2A JFY74_RS18195 rhs62 Undetermined 2125
function
Unknown
P. carotovorum RC5297 F9W95_RS01055 rhs62 Undetermined 2125
function
Rnase
P. parmentieri WPP163 PECWA_RS24630 rhs63 CdiA-CT 186
Rnase
P. parmentieri SCC3193 W5S_RS25910 rhs63 CdiA-CT 186
Dnase
P. actinidiae GX-Pa1 M9782_RS01740 rhs64 NUC 1451
Dnase
P. brasiliense BZA12 CTV95_RS15660 rhs64 Endonuclea_NS_2 1451
Unknown
P. carotovorum WPP14 HER17_RS04950 rhs65 Undetermined 1702
function
Unknown
P. quasiaquaticum A398-S21-F17 IG605_RS04515 rhs66 Undetermined 142
function
Unknown
P. versatile 14A EIP93_RS21980 rhs67 Undetermined 679
function
Unknown
P. carotovorum ZJ-4-2 PSR30_RS16850 rhs68 COG4875 1780
function
Unknown
P. polaris NIBIO1006 BJJ97_RS21685 rhs69 Undetermined 784
function
Unknown
P. versatile SR12 LGL95_RS05075 rhs70 Undetermined 1706
function
P. parmentieri WPP163 PECWA_RS03155 rhs71 AHH 1422 Dnase
P. carotovorum 25.1 LQF52_RS16850 rhs72 LHH 1698 Dnase
Unknown
P. parmentieri IFB5408 C5E17_RS24290 rhs73 Undetermined 293
function
Unknown
P. odoriferum BC S7 BCS7_RS21710 rhs74 Undetermined 197
function
Unknown
P. parmentieri IFB5441 C5E20_RS24670 rhs75 Undetermined 1778
function
Unknown
P. parmentieri HC GMW39_RS16120 rhs76 Undetermined 1115
function
Unknown
P. aroidearum L6 I2D83_RS02700 rhs77 Undetermined 1431
function
Pectobacterium sp. PL64 HP572_RS20700 rhs78 GcvP2 138 Dnase
Unknown
P. brasiliense TS20HJ1 LJQ72_RS00325 rhs79 Undetermined 1392
function
P. carotovorum A077-S18-O15 LHL03_RS16705 rhs80 HNHc 1718 Dnase
Unknown
P. odoriferum JK2.1 EO763_RS23450 rhs81 Undetermined 216
function
Unknown
P. carotovorum XP-13 IHJ55_RS00260 rhs82 Undetermined 1441
function
Unknown
P. atrosepticum SCRI1043 ECA_RS16870 rhs83 Undetermined 1162
function
Unknown
P. polaris QK413-1 KSL88_RS05045 rhs84 Undetermined 1697
function
167
Table 6 (continued).
Unknown
P. brasiliense IPO:4132 H5A38_RS05120 rhs85 Undetermined 1741
function
NAK:239
Unknown
P. actinidiae GX-Pa1 M9782_RS18655 rhs86 Undetermined 1718
function
Unknown
P. brasiliense ZLMLSHJ5 IHJ54_RS15735 rhs87 Undetermined 1622
function
Unknown
P. carotovorum ZJ-4-2 PSR30_RS00285 rhs88 Undetermined 1385
function
Unknown
P. cacticida CFCC10813 OI450_RS00335 rhs89 Undetermined 1759
function
Dnase
P. parmentieri SCC3193 W5S_RS21520 rhs90 NTP_transf_2 1776
Unknown
P. brasiliense 1692 GT391_RS07590 rhs91 Undetermined 1449
function
Unknown
P. carotovorum ZJ-4-2 PSR30_RS14000 rhs92 Undetermined 1448
function
Dnase
P. aroidearum LJ2 LCF43_RS00295 rhs93 ParB 1403
Dnase
P. cacticida CFCC10813 OI450_RS03605 rhs94 DNase_NucA_ 1416
NucB
P. parmentieri IFB5441 C5E20_RS19805 rhs95 Undetermined 1430 Unknown
function
Unknown
P. parmentieri SCC3193 W5S_RS25970 rhs96 Undetermined 118
function
Pectobacterium sp. 21LCBS03 MYB54_RS04895 rhs97 Undetermined 311 Unknown
function
Unknown
P. carotovorum PCCS1 IQ281_RS03370 rhs98 Undetermined 1458
function
Unknown
P. odoriferum BC S7 BCS7_RS16795 rhs99 DUF3990 1433
function
Unknown
P. parmentieri WPP163 PECWA_RS07940 rhs100 Undetermined 1447
function
Unknown
P. parmentieri IFB5408 C5E17_RS24140 rhs101 Undetermined 135 function
Unknown
P. quasiaquaticum A398-S21-F17 IG605_RS04455 rhs102 Undetermined 1735
function
Unknown
P. versatile 2-Mar OA04_RS17325 rhs103 Undetermined 1770
function
P. quasiaquaticum A398-S21-F17 IG605_RS04545 rhs104 Tox-HNH-HHH 1179 Dnase
Unknown
P. brasiliense 21PCA_AGRO2 OWC53_RS17105 rhs105 Undetermined 1769
function
P. versatile SR1 LGL96_RS05030 rhs106 TNT 1715 Protease
Dickeya
Rhs- Putative
Toxin Protein
Species strain Rhs locus toxin predicted
domain Length
identifier function
D. dadantii 3937 DDA3937_RS23790 rhs1 NUC 195 Dnase
168
Table 6 (continued).
Dnase
D. dadantii XJ12 OL445_RS18200 rhs1 NUC 179
Dnase
D. fangzhongdai ZXC1 PQ617_RS07225 rhs1 NUC 1449
RNS Dnase
D. solani D083_RS16290 rhs1 NUC 1448
08.23.3.1.A
Dnase
D. dianthicola 67-19 HGI48_RS08570 rhs1 NUC 1449
Dnase
D. dadantii 3937 DDA3937_RS04095 rhs1 NUC 1434
Dnase
D. dadantii FZ06 MUB29_RS04230 rhs1 NUC 1449
Dnase
D. dadantii DSM 18020 CO076_RS08595 rhs1 NUC 834
Dnase
D. zeae MS_2018 HJ586_RS10085 rhs2 Tox-HNH-EHHH 541
Dnase
D. dadantii 3937 DDA3937_RS06990 rhs2 Tox-HNH-EHHH 461
Dnase
D. dadantii XJ12 OL445_RS16875 rhs2 Tox-HNH-EHHH 461
Dnase
D. fangzhongdai Onc5 K0H75_RS06830 rhs2 Tox-HNH-EHHH 447
Dnase
D. fangzhongdai ZXC1 PQ617_RS14045 rhs2 Tox-HNH-EHHH 1686
Dnase
D. zeae EC2 DWV07_RS06335 rhs2 Tox-HNH-EHHH 1665
Unknown
D. fangzhongdai PA1 B6N31_RS08490 rhs3 Undetermined 1420
function
Unknown
D. fangzhongdai ZXC1 PQ617_RS14855 rhs3 Undetermined 1435
function
Unknown
D. dadantii XJ12 OL445_RS18575 rhs3 Undetermined 1435
function
Unknown
D. zeae A586-S18-A17 LHK94_RS01390 rhs3 Undetermined 1420
function
Unknown
D. dadantii FZ06 MUB29_RS14910 rhs3 Undetermined 1420
function
Unknown
D. zeae A586-S18-A17 LHK94_RS03045 rhs4 Undetermined 190
function
D. dadantii S3-1 KNV89_RS16960 rhs4 Undetermined 1657 Unknown
function
Unknown
Dickeya sp. Secpp 1600 CPA59_RS06965 rhs4 Undetermined 1657
function
D. fangzhongdai AP6 FXN80_RS07265 rhs4 Undetermined 1657 Unknown
function
Unknown
D. fangzhongdai ND14b LH89_RS05140 rhs4 Undetermined 459 function
D. fangzhongdai ND14b LH89_RS22050 rhs5 Ntox8 117 Rnase
D. zeae A5272 FGI00_RS14460 rhs5 Ntox8 649 Rnase
D. dadantii S3-1 KNV89_RS16990 rhs5 Ntox8 637 Rnase
D. zeae A586-S18-A17 LHK94_RS03030 rhs5 Ntox8 563 Rnase
D. fangzhongdai ZXC1 PQ617_RS14085 rhs5 Ntox8 343 Rnase
RNS
D. solani 08.23.3.1.A D083_RS19120 rhs6 Tox-HNH-EHHH 1660 Dnase
169
Table 6 (continued).
170
Table 6 (continued).
171
Table 6 (continued).
172
Figure 1.1. Organization and structure of a Rhs-toxin system in bacteria. (A) Representation
of a Rhs-toxin system showing common flanking upstream and downstream regions to the toxin.
An associated vgr gene (green) that facilitates secretion of the toxin. Directly downstream of the
toxin is an immunity (Imm) (dark blue) protein that protects from self-killing. Occasionally, an
orphan module (grey) is found downstream of the Imm protein, encoding an alternative C-terminal
tip with its Imm protein. (B) The full Rhs-toxin is approximately 1,400 - 1,750 amino acids (aa)
long and consists of two main regions: 1) N-terminal region involved in both signaling and
translocation of the toxin. Within the N-terminal region is a core region flanked by two conserved
motifs (depicted in red) of 10 aa: RvxxxxxxxG and PxxxxDPxGL; and 2) C-terminal region
contains the toxin domain.
173
Figure 1.2. Rhs-toxin delivery by the Type 6 secretion system. The Type Six Secretion System
(T6SS) involves in the secretion of the Rhs-toxin. In the extended assembly phase (left), a
baseplate is formed, comprising components like vgrG, PAAR spike where the Rhs-toxin is
attached. During the contracted and delivery step (middle), a quick discharge process occurs, with
the Hcp-VgrG-PAAR exiting the cell and penetrating the outer membrane of a target cell. This
delivery process involves the transfer of an Rhs-toxin, which may act to disrupt essential functions
and inhibit its proliferation. On the right, the disassembled T6SS often relocates to start a new
assembly process for subsequent secretion events.
174
Figure 2.1. Abundance and diversity of Rhs-toxin sequences in plant pathogenic Xanthomonas. The x-axis show
Xanthomonas species, the y-axis number of Rhs-toxin per genome, and each dot represent a genome color coded by
species. The size of the circle corresponds to the number of Rhs-toxins per genome. The dotted line intercepting the y-
axis shows the genomes with the largest number of Rhs-toxins. Short black lines, interspersed among dots, represent
the median value of total Rhs-toxins found for each species. Data represents 165 genomes across 20 species.
175
Protein Length
1400
1700
2000
350
700
90
Xa
−F
13 J1
1
13 1a
15 14c
−0
88
17
3
CP CIT 301
B A
Figure
CP F 14 33
B 9
CP F 7 4
BF 65
IV 7
I 6
IV A 13 6
IA 17
39
78
SL R1
20
9
Yc 8
LM hA
NC G2 F1
PP 678
B 9
7
10 96
1
12 03
0
2.2.
12 49
16 112
−O
c
29 t
−
30 5
−
50 1
53
57
6
71
9
AT 8 8284
CC 5−
BJ 33 10
SJ 91
Q2
0 B 3
CF 2006100
BP 12
CF 8
4
CFBP1 44
3
CFBP5 71
BP 824
66
9
C 0
GB CN03
BC N1
pseudogenes, respectively.
M 30 5
A 77
NC FF1 M2
PP 061 8
B 81
NE4379
B1
22
CF A 8ra
B W
CF P 2 15
B 0
CF P6 36
1
CFBP6 66
9
CFBP6 75
9
CFBP6 90
9
CFBP7 91
1
CFBP7 11
1
DABP7 12
1
DAR72 19
R7 029
FD 38
C 86
16
09
IS GZ
O1 09
2C
LM 3
G7 K2
43
M9
M 12
SC
Un NT T
B− T 17
Xte X1 T2
cT 601 1
Xc G0 49
m 2
Xc H1 −2
mN 00
1 5
IL 003
M _234
OHI_35
_2 9
CP 61
CF BF 1
BP 424
LM383
G9 6
3
LMFap2 0
NB G 7 9
C 03
SI
CF W SS
B G
AR P 46 16
CF T− 91
X
CFBP 2 tg2
IC BP 055
M 25
IC P 1 41
M 63
P 1
LM110 7
5
LMG 7 5
G 28
8
L 43
NC MA W1
PP I5 6
B 034
37
11
UP P
3
XtKB458
XtKm1
5
XtKm33
m7
X
Xtu Xtu tLr
NC −UP 469 8
9
NCPPB B51
P 1 3
NC PB 060
PP 264
B 9
Xv 902
1
LM601
15
9
citri
fragariae
hortorum
cucurbitae
arboricola
campestris
albilineans
axonopodis
euroxanthea
represents individual strains, y axis represents the protein sequence length in amino acids (aa), and each dot represents
and 1400 aa demarcate the orphan module C-terminal tip from full Rhs-toxins and full Rhs-toxins from putative
toxin; and 4) Any toxin >1900 aa also containing full length toxins. The dashed lines that intercept the y-axis at 350
this represents Rhs-toxins tips and parts of the core region; 3) Any toxin between 700-1900 aa, indicative of a full Rhs-
divided into four categories: 1) Any toxin <350 aa, this represents Rhs-toxin tips; 2) Any toxin between 350-699 aa,
an Rhs-toxin sequence color coded by species. The size of the dot represents the amino acid sequence length that are
repertoires within Xanthomonas genomes. The x axis
Extensive variation in
euvesicatoria
Species
<350aa
>2000aa
sp.
350−699aa
700−1886aa
oryzae
theicola
vasicola
sacchari
176
hyacinthi
perforans
prunicola
vesicatoria
translucens
shades of green were identified, with maximum likelihood. The white cluster correspond to outgroups from E. coli and P.
strip represents their respective species. Branches are supported by >90% with 1,000 bootstraps. (B) Representative
immediately upstream of the Rhs-toxin (purple); Immunity protein/hipothetical immunity proteins (bright green); T6SS-
aeruginosa. Branch labels indicate the Rhs-toxin locus tag from which the core regions where extracted, and the outer
structures of Rhs-toxin system loci from each of the nine different clades: Rhs-toxin (light green); Coding sequence
linked proteins (light blue); T2SS-linked proteins (pink); Transposases (yellow); and proteins not associated with
constructed from 412 Rhs-toxin core region sequences representing 19 species. Nine distinct clades depicted in various
100.0 PA2684
NAG70 RS16640
NAG70 RS11315
NAG70 RS15605
NAG70 RS15595
NAG70 RS00575
NAG70 RS01195
KFS86 RS08990
NBC2815 RS11835
KHA79 RS11980
K8O61 RS11900
OW158 RS12015
NZ30 RS06515
K8O61 RS11885
KFS85 RS06145
OW158 RS12005
BER93 RS11435
PD5205 RS11720
F0H32 RS14615
NZ30 RS06505
KHA79 RS11990
MZO50 RS08935
KM563 RS15915
F0H33 RS14415
ISN38 RS15700
100.0 ISN36 RS12090
NBC2815 RS20935
OW158 RS11970
PD5205 RS11680
BER93 RS20935
K8O61 RS11920
KCU58 RS05740
KFS85 RS06170
FD63 RS13525
F0H32 RS14580
KCU58 RS05760
ISN33 RS07930
LTC53 RS14350
EYR27 RS02355
EYC55 RS06265
EYC56 RS17650
AXO1947 RS21135
EYC57 RS05720
ACU17 RS24220
EYC54 RS06110
EYC57 RS05750
EYC56 RS17635
EYR27 RS02345
EYR26 RS17005
EBA23 RS08545
ACU17 RS16415
EYC55 RS06255
ACU11 RS16075
IYN96 RS05795
GHV42 RS05875
EYC54 RS23225
ACU16 RS16535
ATY44 RS15850
100.0
EYR26 RS16995
EYC55 RS06280
ACU16 RS16515
BVV20 RS16895
EYR26 RS16965
MML47 RS05895
QN060 RS16955
EBA23 RS08555
rhs loci structure of nine clades in Xanthomonas
EBA17 RS16090
LL928 RS15920
IZG20 RS13450
EYC54 RS06145
EYC56 RS17600
ACU16 RS16550
GHV42 RS05860
secretion (grey). Numbers to the left represent approximate full Rhs-toxin sequence length (aa).
IYN96 RS05805
MML47 RS05905
IYN96 RS05820
QN060 RS16945
EYC57 RS05735
BE73 RS27715
PXO RS07175
EYR26 RS16955
EYC56 RS17670
EYC55 RS06235
BE73 RS04950
100.0
AXO1947 RS05255
EBA11 RS16380
FZ025 RS20375
KHN80 RS03670
OCJ37 RS10125
HEP74 RS06905
HEP75 RS07115
100.0
K9U01 RS02055
AD14011 RS21420
NG831 RS12155
NG828 RS12090
NG831 RS12145
NG825 RS08785
NG824 RS09410
NG829 RS11660
XaFJ1 RS11700
XaFJ1 RS10230
XaFJ1 RS10215
GKO49 RS02190
90.0
ATY44 RS24970
PXO RS12220
PXO RS13150
LL928 RS20780
PXO RS13125
PXO RS12195
XAV RS01910
XAV RS01895
K4A87 RS01730
OZ429 RS17725
OZ429 RS17750
OZ429 RS17695
K4A87 RS01745
DGN02 RS02115
BJD10 RS00030
EYC55 RS20925
EYC54 RS02420
EYC56 RS02680
BE73 RS24035
EYC55 RS20950
EYC56 RS02655
100.0
KHA79 RS18900
BE73 RS10025
XeaCFBP3836p RS02340
H7A86 RS06100
NX81 RS24565
KWO RS23195
NX80 RS23480
M0D47 RS15265
M0D48 RS03750
M0D43 RS15440
G4Q83 RS21605
CXP37 RS23425
M0D45 RS12480
BI317 RS16880
H7A87 RS06480
FPL04 RS16495
XB05 RS11070
F6Y24 RS05500
K9U01 RS06725
M0D46 RS01270
OZ429 RS05545
XB05 RS24585
H5027 RS08090
XHV734 RS14570
IFJ81 RS22255
JH261 RS05475
JH264 RS16040
AD14011 RS05040
K6979 RS13135
K6978 RS05695
K6982 RS05345
AMD07 RS26300
XeaCFBP3836p RS16095
XcvCFBP7111P RS03540
G3566 RS06640
NX08 RS22840
EYR26 RS09400
GHV42 RS13390
NX08 RS17220
APY30 RS17325
XcfCFBP6990P RS06790
XcfCFBP6991P RS06810
XcvCFBP7112P RS16595
XapA RS24135
DGN02 RS06730
LPY96 RS16390
APY29 RS16285
APY30 RS06770
BGK55 RS14880
XcvCFBP7111P RS17725
FPK90 RS25350
FPL05 RS24910
XcgCFBP7119R RS24575
H8Z73 RS06720
H8Z71 RS14935
IG630 RS06660
AMD13 RS07130
B7L66 RS03335
100.0
~1,389 aa
~1,306 aa
~1,126 aa
~1,407 aa
~1,415 aa
~1,435 aa
~1,443 aa
~1,400 aa
AMD07 RS14915
~999 aa
KM539 RS12985
NG831 RS05640
100.0
XTG RS08390
BI313 RS24690
XTG RS11135
PML25 RS09245
XeaCFBP3836p RS05615
LZZ50 RS07680
H7A86 RS08655
OEG85 RS14840
LOK39 RS10655
H7A87 RS08515
100.0 JH314 RS16820
H7A87 RS05400
M0D48 RS15735
M0D47 RS05415
M0D43 RS05650
OZ429 RS07145
NDY25 RS04850
PML25 RS16215
100.0
OEG85 RS00950
JH282 RS00725
XCC RS00685
IFJ81 RS00725
96.0
JH306 RS00675
QMY63 RS01220
JH299 RS20680
H7A87 RS00685
K6978 RS00120
XcfCFBP6991P RS00645
XcfCFBP6990P RS00645
HG421 RS19535
XTG RS00650
KHO05 RS00695
XB05 RS25205
B
H7A86 RS00675
LZZ50 RS02835
KHN93 RS00775
M0D45 RS09610
M0D46 RS09630
M0D43 RS12695
M0D47 RS12515
M0D48 RS00875
XcvCFBP7112P RS00780
G3566 RS00655
XACM RS19450
BHE83 RS13365
BJD11 RS24465
HG421 RS10280
LOK40 RS12405
AMD07 RS10975
H8Z71 RS08810
H8Z73 RS13160
IG630 RS13345
AMD13 RS13340
JH277 RS08765
JH307 RS12435
NMB96 RS19530
BJD10 RS12280
BI317 RS10180
CFBP8129 RS13300
XJ27 RS06820
translucens
DYQ48 RS13035
vesicatoria
XHV734 RS08375
EYC54 RS21870
EYC55 RS09335
EYC54 RS21155
EYC56 RS22600
EYC55 RS01340
prunicola
hortorum
100.0
EYC55 RS01365
hyacinthi
EBA20 RS17030
LL928 RS15550
sacchari
EBA23 RS08175
vasicola
PXO RS06285
theicola
EBA17 RS16470
ATY44 RS16160
BE73 RS24340
ACU17 RS24240
GHV42 RS05550
oryzae
GKO49 RS15840
IZG20 RS13760
ACU11 RS25435
PXO RS06855
XcvCFBP7111P RS10715
KWO RS16610
NX81 RS17510
NX80 RS04185
NX81 RS13995
NX80 RS11085
DGN02 RS13315
XcfCFBP6991P RS16435
XcfCFBP6990P RS16400
sp.
DGN02 RS22685
XcvCFBP7111P RS26910
G3566 RS22005
G3566 RS22055
TP50 RS18950
AC612 RS14010
XcfCFBP6991P RS13040
XcfCFBP6990P RS13005
AC612 RS07065
DGN11 RS06735
XcfCFBP6166P RS06935
XcfCFBP6975P RS06830
HG421 RS21100
OEG85 RS05880
JH291 RS13450
M0D47 RS16575
P3C56 RS06145
KPG65 RS18010
LZZ50 RS14280
KHN80 RS22080
XB05 RS25140
EYR27 RS05005
P3C56 RS09790
M0D46 RS13530
M0D43 RS16650
M0D48 RS04960
M0D45 RS05685
Pseudomonas aeruginosa
IFJ81 RS09090
JH293 RS08480
QMY63 RS11040
LOK39 RS12170
XTG RS15535
BJD11 RS05975
BHE83 RS09435
FPL05 RS13785
XcgCFBP7119R RS14210
100.0
FPK90 RS14455
BGK55 RS25635
LPY96 RS09700
APY29 RS09475
APY30 RS13510
XapA RS09500
LCZ91 RS03705
XcgCFBP7119R RS18080
FPL05 RS24570
FPK90 RS25030
BHE83 RS26150
BJD11 RS09260
Xanthomonas species
LPY96 RS06290
APY29 RS05890
APY30 RS17055
H8Z71 RS05205
AMD13 RS16920
IG630 RS16920
H8Z73 RS16735
B7L66 RS04365
AMD07 RS05675
EYR27 RS01770
CFBP8129 RS25130
KP727 RS13905
Escherichia coli
XcgCFBP7119R RS24395
XACM RS08075
FPL05 RS10375
FPK90 RS10315
XcgCFBP7119R RS09985
K4A87 RS04825
OZ429 RS14290
XCCCN15 RS09225
JH303 RS13035
XCCB100 RS24355
euvesicatoria
XCCCN03 RS08915
JH291 RS16890
NKJ47 RS02030
HEP75 RS10700
euroxanthea
NRY95 RS12615
KM539 RS10370
OCJ37 RS19440
axonopodis
HEP73 RS20695
HEP74 RS19970
albilineans
campestris
HEP75 RS20380
cucurbitae
arboricola
K6982 RS13340
K6979 RS05550
K6978 RS14005
LOK39 RS09320
fragariae
BJD10 RS15725
PML25 RS10040
NDY25 RS20705
DYQ48 RS09410
CFBP8129 RS16560
XHV734 RS12125
BI317 RS13745
G4Q83 RS22980
G4Q83 RS22720
XaFJ1 RS11545
92.0 NG825 RS12405
NG828 RS10195
NKJ47 RS07105
NUG21 RS06315
NUG21 RS11915
QBE03 RS00945
NG829 RS20780
NG824 RS20860
NG827 RS06235
citri
QBE03 RS07645
93.0 NG825 RS06490
NG828 RS21750
NG827 RS20835
NG831 RS06820
NG829 RS06270
NG824 RS14860
OCJ37 RS05755
OCJ37 RS05745
HEP73 RS22415
HEP73 RS22295
OCJ37 RS20860
NUG20 RS21735
A NUG20 RS15640
NRY95 RS05470
HEP73 RS22405
HEP75 RS06045
HEP74 RS05875
NRY95 RS06510
FZ025 RS15345
KFS85 RS14540
KCU58 RS15120
KM539 RS06285
NZ30 RS21745
KHF85 RS05665
KHA79 RS05675
ISN38 RS20150
ISN36 RS20150
K8O61 RS05725
MZO50 RS18285
ISN33 RS16900
ISN30 RS08370
LTC53 RS05350
A Presence and absence plot of Rhs C-terminal tips in Xanthomonas
1 0
Strain
CFBP6991
CFBP6990
T21
UnB−XtecTG02−2
YLX21
YT9−19−2
HR1−32
CN03
30−1
CN15
B100
29−5
jj2001
NCPPB 796
BAI23
NBC2815
1
YchA
IL_234
CPBF 1494
17
3
1311a
IVIA 1317
IVIA 3978
SL2098
8284
10103
ATCC 33913
CFBP1371
12112
5053
BJSJQ20200612
CFBP5824
CFBP6690
NCPPB 4379
CPBF 766
LH3
P3
UPB458
Xtu 4699
Xv1601
LM159
XtKm7
301
CFBP2286
MAFF106181
719
GZ09
CFBP7337
AXO1947
XtLr8
XtKm15
Xtu−UPB513
ART−Xtg2
CITA 33
15−088
R1
NCPPB 902
OH_261
CFBP 8443
YN01
YM15
0−9
CFBP7331
CPBF 765
CFBP7341
LMG26789
F1
CIX97
CFBP3836
MAI5069
GW
NEB122
LMG7439
CFBP7112
PD5205
Fap29
LMG 703
ICMP 16317
LMG 843
VT106
B07−007
CFBP 498
ICMP 7383
LM16734
JS749−3
OSU493
305
CFBP 8445
DD13
Xa−FJ1
LMG 728
CFBP 8444
DAR73886
DAR72029
NT17
WG16
AW15
MI_359
TX160149
CFBP7111
YNJC
NJ01
MAI134
LW16
CFBP 2055
LW16
NCPPB4346
HGA4
NCPPB 2649
FDC 1609
M12
CFBP6166
CFBP6975
ISO12C3
NCPPB 1060
X11−5A
CFBP 2541
CFBP7342
LT6−16−1
JR3−14
DJ16
LT6−2
NCPPB 3711
ICMP11055
HR3−46
Oregano 108
CPBF 424
45215
12049
576
GBBC 3077
XtKm33
MAI5034
M28
1314c
LMG930
85−10
SHQP01
XcmH1005
CFBP 2036
XcmN1003
MSCT
JS49−6
AUST2013
KXO85
PXO404
PXO211
PXO99A
ITCCBB0002
SS
SI
CIX383
CIX249
MAI5037
2
CFBP 4691
AM6
K2
8ra
CFBP7119
CFBP 1156
NJ611
BB156−2
BB151−3
rhs1
rhs2
rhs3
rhs4
rhs5
rhs6
rhs7
rhs8
rhs9
rhs10
rhs11
rhs12
rhs13
rhs14
rhs15
rhs16
rhs17
rhs18
rhs19
rhs20
rhs21
rhs22
rhs23
rhs24
rhs25
rhs26
rhs27
rhs28
rhs29
rhs30
rhs31
rhs32
rhs33
rhs34
rhs35
rhs36
rhs37
rhs38
rhs39
rhs40
rhs41
rhs42
rhs43
rhs44
rhs45
rhs46
rhs47
rhs48
rhs49
rhs50
rhs51
rhs52
rhs53
rhs54
rhs55
rhs56
rhs57
rhs58
rhs59
rhs60
rhs61
rhs62
rhs63
rhs64
rhs65
rhs66
rhs67
rhs68
rhs69
rhs70
rhs71
rhs72
rhs73
rhs74
rhs75
rhs76
rhs77
rhs78
rhs79
rhs80
rhs81
rhs82
rhs83
rhs84
rhs85
rhs86
rhs87
rhs88
rhs89
rhs90
rhs91
rhs92
rhs93
rhs94
rhs95
rhs96
rhs97
rhs98
rhs99
rhs100
rhs101
rhs102
rhs103
rhs104
rhs105
rhs106
rhs107
rhs108
rhs109
rhs110
rhs111
rhs112
rhs113
rhs114
rhs115
rhs116
rhs117
rhs118
rhs119
rhs120
rhs121
rhs122
rhs123
rhs124
rhs125
rhs126
rhs127
rhs128
rhs129
rhs130
rhs131
rhs132
rhs133
rhs134
rhs135
rhs136
rhs137
rhs138
rhs139
rhs140
rhs141
rhs142
rhs143
rhs144
rhs145
rhs146
rhs147
rhs148
rhs149
rhs150
rhs151
rhs152
rhs153
rhs154
rhs155
rhs156
rhs157
rhs158
rhs159
rhs160
rhs161
rhs162
rhs163
rhs164
rhs165
rhs166
rhs167
rhs168
rhs169
rhs170
rhs171
rhs172
rhs173
rhs174
rhs175
rhs176
rhs177
rhs178
rhs179
rhs180
rhs181
C-terminal tip identifiers
B 1
2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18 19 20
32
21 22 23 24 25 26 27 28 29 30 31 32
33 3441 35 36 37 38 39 40 41 42 43 44 48 45
46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61
62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77
78 90
91 96
Figure 2.4. A total of 181 unique Rhs C-Terminal tips in Xanthomonas species. (A) Presence
and absence plot of C-terminal toxin tip sequences which carry the active toxin domain. Unique
C-terminal tip identifiers are along the bottom, and on the right side showing Xanthomonas
strains. A blastP analysis was performed on 604 C-terminal tip sequences from 165 genomes to
search for homology; C-terminal tips were considered the same if homologous (>85% similarity).
(B) A similarity network analysis shows 96 clusters and 85 singleton C-terminal tip putative toxin
functions. Node names represent the Rhs C-terminal tip identifiers, are color-coded based on
species, and their size corresponds to the protein length. Small circles represent orphan C-
terminal tips and large circles represent full Rhs-toxins.
178
Figure 2.5. Unique, general, and large unknown functional diversity among Rhs-toxins in
Xanthomonas species. (A) A total of 33 distinct toxin domains from 173 Rhs C-terminal tips were
identified through a Pfam database analysis. Each ring is color coded to represent a different
Xanthomonas species and each source node represents a strain. Target node (gray) represents
identified toxin domain and shape represents putative predicted function: Dnases (circles); Rnases
(diamonds); proteases (squares); and deaminases (hexagon). Edges connecting source and target
nodes are also color-coded based on species. Of the 605 Rhs C-terminal tip sequences analyzed,
431 have undetermined toxin domains (octagon). (B) Alignment and percent identity of
representative protein sequences that are in different clusters but share the same toxin domain
(cluster 1, 27, and 37), supporting hypervariability of C-terminal tips outside of functional
domains.
179
Figure 3.1 Abundance and diversity of Rhs-toxin sequences in plant pathogenic Ralstonia. The x-axis show Ralstonia species,
the y-axis number of Rhs-toxin per genome, and each dot represent a genome color coded by species. The size of the circle
corresponds to the number of Rhs-toxins per genome. The dotted line intercepting the y-axis shows the genomes with the largest
number of Rhs-toxins. Short black lines, interspersed among dots, represent the median value of total Rhs-toxins found for each
species. Data represents 87 genomes across 6 species.
180
Protein length of Curated Rhs−toxins repertoires in Ralstonia
2000
1700
Species
1400 mannitolilytica sp.
Protein Length
pseudosolanacearum syzygii
solanacearum wenshanensis
Protein Length Range
<350aa
700
350−699aa
700−1886aa
350
90
RS −M
G 000
Po 2
M
AT 303 078
M CC 107 0
2
2
1
0
8
2
5
UWT98
40 4
SS R 1
IB F 2 4−1
a 3
CF Bs B2
T1 1
17
UW 63
UW386
UW551
UY763
W 31
_D 24
2
36 204
RS I07
D2
91 F8
LM I1 17
SL 340
24
RU SC 5
SE N24 9
PP 74
C −1
FJ JAT442 .F1
IB H F50
AF 1 09
AF 21 22
M FF 114 9
M FF 416 1
AF 30 48
31 0
M 93
Sw 31
PS 82
SL X05
SL 064
SL 729
SL 175
SL 300
T1 0
RS 2
BD RS6 7
BR 50
RS 9
N2 76
FJ AT 244 F1
45 45. 0
FJ 4.F5 50
RS 488
RS 9
56 1
PS 9
RU S 4 1
00
FJ T1 T_ 7
3
FJ T1 5244 F8
FJ JAT 52.F50
FJ T15 463. 0
FJ 153 4.F 1
FJ AT 40. F6
AT 53 F1
A 4 8
F1 1
FJ AT9 0−1
FJ C P29 5
FJ AT 04. 50
FJ 153 04. 0
FJ 153 53. 0
FJ 535 F50
K
K CC K6
T1
T4
T5
T6
T7
T8
T9
YC j64
−
Pe 967
0
20
Rs
7
A 2 7
A 2 9
F 156
64
64
LL 22
AT 4 .F5
48
b0
CQ3A3
R FJ
69
A IA 5
50
A 1 5
AT 524 .F
FJ AT 3.F
SB 57
B 71
AT 153 F5
AT 153 F5
0
G w7
RUN22
M F 07
M F 14
16
19
ol
37
AT 1−
F
22
F 14 .F
A 1 −
F T 42
SB A
2
2
3
3
.
1 .
R
M
8
SN
M
A
A
K
UG
Strain
Figure 3.2. Extensive variation in protein sequence length among Rhs-toxin repertoires within Ralstonia genomes. The x axis
represents individual strains, y axis represents the protein sequence length in amino acids (aa), and each dot represents an Rhs-toxin
sequence color coded by species. The size of the dot represents the amino acid sequence length that are divided into four categories: 1)
Any toxin <350 aa, this represents Rhs-toxin tips; 2) Any toxin between 350-699 aa, this represents Rhs-toxins tips and parts of the
core region; 3) Any toxin between 700-1900 aa, indicative of a full Rhs-toxin; and 4) Any toxin >1900 aa also containing full length
toxins. The dashed lines that intercept the y-axis at 350 and 1400 aa demarcate the orphan module C-terminal tip from full Rhs-toxins
and full Rhs-toxins from putative pseudogenes, respectively.
181
Figure 3.3. Three conserved Rhs-toxin system loci in genomes of Ralstonia species. (A) A phylogeny was constructed from 144
Rhs- toxin core region sequences representing 6 species. three distinct clades depicted in various shades of pink were identified, with
maximum likelihood. The white cluster correspond to outgroups from E. coli and P. aeruginosa. Branch labels indicate the Rhs-toxin
locus tag from which the core regions where extracted, and the outer strip represents their respective species. Branches are supported
by >90% with 1,000 bootstraps. (B) Representative structures of Rhs-toxin system loci from each of the nine different clades: Rhs-
toxin (light green); Coding sequence immediately upstream of the Rhs-toxin (purple); Immunity protein/hypothetical immunity
proteins (bright green); T6SS-linked proteins (light blue); T2SS-linked proteins (pink); and proteins not associated with secretion
(grey). Numbers to the left represent approximate full Rhs-toxin sequence length (aa).
182
A Presence and absence plot of Rhs C-terminal tips in Ralstonia
1 0 Strain
SL2064
KACC 10722
T51
T95
56D2
Rs5
UGMSS_Db01
BDBR229
SEPPX05
PSI07
FJAT15353.F1
FJAT1303.F50
FJAT15353.F50
FJAT15353.F8
UW386
KACC 10709
Sw698
Cw717
CQPS−1
202
204
Bs715
FJAT1463.F1
FJAT15304.F50
FJAT15304.F6
FJAT15340.F50
FJAT454.F50−1
FJAT91.F50
MAFF 211479
MAFF 211491
MAFF 241648
MAFF 311693
RSCM
SL2729
SL3300
SL3730
T117
T42
T60
T78
Wj644
B2
RS24
MAFF 301560
FJAT91−F8
SN83A39
362200
PeaFJ1
HA4−1
FJAT1452.F50
SL1931
FJAT15244−F8
FJAT15244.F1
FJAT15244.F50
FJAT442.F1
FJAT442.F50
FJAT445.F50
YC40−M
RS647
T12
T101
T82
R24
IBSBF 2571
CIAT_078
Po82
UW163
IBSBF1503
RUN2340
RUN2474
RUN2279
LMG 9673
UY031
RS 488
UW551
RS 489
MolK2
RS 476
GMI1000
UW763
K60
RS650
RS642
LLRS−1
CFBP2957
T98
SL3175
rhs1
rhs2
rhs3
rhs4
rhs5
rhs6
rhs7
rhs8
rhs9
rhs10
rhs11
rhs12
rhs13
rhs14
rhs15
rhs16
rhs17
rhs18
rhs19
rhs20
rhs21
rhs22
rhs23
rhs24
rhs25
rhs26
rhs27
rhs28
rhs29
rhs30
rhs31
rhs32
rhs33
rhs34
rhs35
rhs36
rhs37
rhs38
rhs39
rhs40
rhs41
rhs42
rhs43
rhs44
rhs45
rhs46
rhs47
rhs48
rhs49
rhs50
rhs51
rhs52
rhs53
rhs54
rhs55
rhs56
rhs57
rhs58
rhs59
rhs60
rhs61
rhs62
rhs63
rhs64
rhs65
rhs66
rhs67
rhs68
rhs69
rhs70
rhs71
rhs72
rhs73
rhs74
rhs75
rhs76
C-terminal tip identifiers
B 1 2
3 4 5 6 7 8
9 10 11 12 13 14 15 16 17 18
19 20 21 22 23 24 25 26 27 28 29 30 31
32 43
44
Figure 3.4. A total of 76 unique Rhs C-Terminal tips in Ralstonia species. (A) Presence and
absence plot of C-terminal toxin tip sequences which carry the active toxin domain. Unique C-
terminal tip identifiers are along the bottom, and on the right side showing Ralstonia strains. A
blastP analysis was performed on 294 C-terminal tip sequences from 87 genomes to search for
homology; C-terminal tips were considered the same if homologous (>85% similarity). (B) A
similarity network analysis shows 44 clusters and 32 singleton C-terminal tip putative toxin
functions. Node names represent the Rhs C-terminal tip identifiers, are color-coded based on
species, and their size corresponds to the protein length. Small circles represent orphan C-
terminal tips and large circles represent full Rhs-toxins.
183
Figure 3.5. Unique, general, and large unknown functional diversity among Rhs-toxins in
Ralstonia species. A total of 18 distinct toxin domains from 94 Rhs C-terminal tips were identified
through a Pfam database analysis. Each ring is color coded to represent a different Ralstonia
species and each source node represents a strain. Target node (gray) represents identified toxin
domain and shape represents putative predicted function: Dnases (circles); Rnases (diamonds);
proteases (squares). Edges connecting source and target nodes are also color-coded based on
species. Of the 294 Rhs C-terminal tip sequences analyzed, 200 have undetermined toxin domains
(octagon).
184
Figure 4.1. Abundance and diversity of Rhs-toxin sequences in plant pathogenic Pectobacterium. The x-axis show Pectobacterium
species, the y-axis number of Rhs-toxin per genome, and each dot represent a genome color coded by species. The size of the circle
corresponds to the number of Rhs-toxins per genome. The dotted line intercepting the y-axis shows the genomes with the largest number
of Rhs-toxins. Short black lines, interspersed among dots, represent the median value of total Rhs-toxins found for each species. Data
represents 60 genomes across 14 species.
185
Protein length of Curated Rhs−toxins repertoires in Pectobacterium
Species
actinidiae odoriferum
aquaticum parmentieri
aroidearum parvum
2000 atrosepticum polaris
brasiliense quasiaquaticum
Protein Length
350−699aa
700
700−1886aa
350
>2000aa
90
10 5
8− ar
L6
G A
.1
2A
PC 1
BC 1
IF HC
1
C3 5
BI 06
A3 QK 392
2
:4 NA 12
64
5
19 a1
ZJ 3
−1
−2
2.1
XP 4
IF 08
IF 41
BI 21
31
1
1.1
11
21
13
43
AG 2
TS 309
M J1
RC 1
W 7
W 61
FN 163
12
S7
RI 1
YT 11
AK 37
39
W 193
03
CC HJ
15
SC K−
16
15
LC 7
LJ
13
BC
PC
ZM
SR
04
C1
−1
1
48
−S 13−
21
A_ 169
CS
14
SC en
RO
81
25
3− 2−M
77 −F1
21 J1
NI 10
P
PL
PP
NI 222
F1
F1
−4
J0
J0
J3
10
ZL 0H
52
91
SR
2
2N 2
:2
BS
JK
JR
O
−A
−O
1
AK
CF LS
B5
B5
B5
X−
SX
PP
re
20
:
IP 062 BZ
EC
Q
1−
O
O
K
Q
C1
4
2
1
18
G
−S
2
S1
−S
−S
98
PC
12
77
13
A4
A7
A2
A0
:4
21
O
IP
Strain
Figure 4.2. Extensive variation in protein sequence length among Rhs-toxin repertoires within Pectobacterium genomes. The x
axis represents individual strains, y axis represents the protein sequence length in amino acids (aa), and each dot represents an Rhs-toxin
sequence color coded by species. The size of the dot represents the amino acid sequence length that are divided into four categories: 1)
Any toxin <350 aa, this represents Rhs-toxin tips; 2) Any toxin between 350-699 aa, this represents Rhs-toxins tips and parts of the
core region; 3) Any toxin between 700-1900 aa, indicative of a full Rhs-toxin; and 4) Any toxin >1900 aa also containing full length
toxins. The dashed lines that intercept the y-axis at 350 and 1400 aa demarcate the orphan module C-terminal tip from full Rhs-toxins
and full Rhs-toxins from putative pseudogenes, respectively.
186
Figure 4.3. Five conserved Rhs-toxin system loci in genomes of Pectobacterium species. (A) A phylogeny was constructed from 165
Rhs-toxin core region sequences representing 14 species. Five distinct clades depicted in various shades of orange were identified, with
maximum likelihood. The white cluster correspond to outgroups from E. coli and P. aeruginosa. Branch labels indicate the Rhs-toxin
locus tag from which the core regions where extracted, and the outer strip represents their respective species. Branches are supported by
>90% with 1,000 bootstraps. (B) Representative structures of Rhs-toxin system loci from each of the nine different clades: Rhs-toxin
(light green); Coding sequence immediately upstream of the Rhs-toxin (purple); Immunity protein/hypothetical immunity proteins
(bright green); T6SS-linked proteins (light blue). and proteins not associated with secretion (grey). Numbers to the left represent
approximate full Rhs-toxin sequence length (aa).
187
A Presence and absence plot of Rhs C-terminal tips in Pectobacterium
1 0
Strain
IFB5441
HC
WPP163
JK2.1
ZM1
NIBIO1392
BC1
21PCA_AGRO2
QJ021
LJ2
A212−S19−A16
NIBIO1006
QJ313
AK042
BZA12
GX−Pa1
IPO:4132 NAK:239
21A
25.1
XP−13
JR1.1
QK413−1
SR12
ECC15
130
QJ011
WPP14
IPO:4062 NAK:237
1692
CFCC10813
F131
PC1
44987
SR1
F1−1
FN20211
YT22221
BC S7
ZLMLSHJ5
TS20HJ1
SCRI1043
Green1
PL64
SX309
PCCS1
A077−S18−O15
2A
A477−S1−J17
A398−S21−F17
ZJ−4−2
L6
A73−S18−O15
14A
RC5297
21LCBS03
WC19161
IFB5408
QK−5
IFB5485
SCC3193
rhs1
rhs2
rhs3
rhs4
rhs5
rhs6
rhs7
rhs8
rhs9
rhs10
rhs11
rhs12
rhs13
rhs14
rhs15
rhs16
rhs17
rhs18
rhs19
rhs20
rhs21
rhs22
rhs23
rhs24
rhs25
rhs26
rhs27
rhs28
rhs29
rhs30
rhs31
rhs32
rhs33
rhs34
rhs35
rhs36
rhs37
rhs38
rhs39
rhs40
rhs41
rhs42
rhs43
rhs44
rhs45
rhs46
rhs47
rhs48
rhs49
rhs50
rhs51
rhs52
rhs53
rhs54
rhs55
rhs56
rhs57
rhs58
rhs59
rhs60
rhs61
rhs62
rhs63
rhs64
rhs65
rhs66
rhs67
rhs68
rhs69
rhs70
rhs71
rhs72
rhs73
rhs74
rhs75
rhs76
rhs77
rhs78
rhs79
rhs80
rhs81
rhs82
rhs83
rhs84
rhs85
rhs86
rhs87
rhs88
rhs89
rhs90
rhs91
rhs92
rhs93
rhs94
rhs95
rhs96
rhs97
rhs98
rhs99
rhs100
rhs101
rhs102
rhs103
rhs104
rhs105
rhs106
C-terminal tip identifiers
1 2 3 4 5 6
B
7 8 9 10 11 12 13
14 15 16 17 18 19 20 21 22
23 24 25 26 27 28 29 30 31
32
33 40
41 48
49 56
57 64
Figure 4.4. A total of 106 unique Rhs C-Terminal tips in Pectobacterium species. (A) Presence
and absence plot of C-terminal toxin tip sequences which carry the active toxin domain. Unique
C-terminal tip identifiers are along the bottom, and on the right side showing Pectobacterium
strains. A blastP analysis was performed on 255 C-terminal tip sequences from 60 genomes to
search for homology; C- terminal tips were considered the same if homologous (>85% similarity).
(B) A similarity network analysis shows 64 clusters and 42 singleton C-terminal tip putative
toxin functions. Node names represent the Rhs C-terminal tip identifiers, are color-coded based
on species, and their size corresponds to the protein length. Small circles represent orphan C-
terminal tips and large circles represent full Rhs-toxins.
188
Figure 4.5. Unique, general, and large unknown functional diversity among Rhs-toxins in
Pectobacterium species. A total of 28 distinct toxin domains from 102 Rhs C-terminal tips were
identified through a Pfam database analysis. Each ring is color coded to represent a different
Ralstonia species and each source node represents a strain. Target node (gray) represents identified
toxin domain and shape represents putative predicted function: Dnases (circles); Rnases
(diamonds); proteases (squares); and deaminases (hexagon). Edges connecting source and target
nodes are also color-coded based on species. Of the 255 Rhs C-terminal tip sequences analyzed,
153 have undetermined toxin domains (octagon).
189
Figure 5.1. Abundance and diversity of Rhs-toxin sequences in plant pathogenic Dickeya. The x-axis show Dickeya species, the y-
axis number of Rhs-toxin per genome, and each dot represent a genome color coded by species. The size of the circle corresponds to
the number of Rhs-toxins per genome. The dotted line intercepting the y-axis shows the genomes with the largest number of Rhs-toxins.
Short black lines, interspersed among dots, represent the median value of total Rhs-toxins found for each species. Data represents 31
genomes across 8 species.
190
Protein length of Curated Rhs−toxins repertoires in Dickeya
2000
1700
Species
chrysanthemi parazeae
Protein Length
350
90
L7
3
6
2T
E1
S1
S2
ID
2
P6
65
b
37
3b
72
1
18
0
86
47
1
A
C
2−
17
0
EC
EC
A
J1
PA
00
14
nc
−
−1
02
59
FZ
PL
.2
ZX
JZ
.L
39
E2
52
C
64
20
S3
1.
h5
19
A
M
16
D
O
18
67
h1
.1
3.
03
8−
S_
10
Ec
M
05
3.
Ec
p
SM
6.
S1
16
M
cp
.2
SM
.1
08
6−
N
Se
D
R
LA
58
S
N
A
R
Strain
Figure 5.2. Extensive variation in protein sequence length among Rhs-toxin repertoires within Dickeya genomes. The x axis
represents individual strains, y axis represents the protein sequence length in amino acids (aa), and each dot represents an Rhs-toxin
sequence color coded by species. The size of the dot represents the amino acid sequence length that are divided into four categories: 1)
Any toxin <350 aa, this represents Rhs-toxin tips; 2) Any toxin between 350-699 aa, this represents Rhs-toxins tips and parts of the
core region; 3) Any toxin between 700-1900 aa, indicative of a full Rhs-toxin; and 4) Any toxin >1900 aa also containing full length
toxins. The dashed lines that intercept the y-axis at 350 and 1400 aa demarcate the orphan module C-terminal tip from full Rhs-toxins
and full Rhs-toxins from putative pseudogenes, respectively.
191
Figure 5.3. Three conserved Rhs-toxin system loci in genomes of Dickeya species. (A) A phylogeny was constructed from 113 Rhs-
toxin core region sequences representing 8 species. Five distinct clades depicted in various shades of blue were identified, with maximum
likelihood. The white cluster correspond to outgroups from E. coli and P. aeruginosa. Branch labels indicate the Rhs-toxin locus tag
from which the core regions where extracted, and the outer strip represents their respective species. Branches are supported by >90%
with 1,000 bootstraps. (B) Representative structures of Rhs-toxin system loci from each of the nine different clades: Rhs-toxin (light
green); Coding sequence immediately upstream of the Rhs-toxin (purple); Immunity protein/hypothetical immunity proteins (bright
green); T6SS-linked proteins (light blue). and proteins not associated with secretion (grey). Numbers to the left represent approximate
full Rhs-toxin sequence length (aa).
192
A Presence and absence plot of Rhs C-terminal tips in Dickeya
1 0
Strain
MS_2018
RNS 05.1.2A
643b
PL65
DSM 101947
LAR.16.03.LID
16ME22T
JZL7
CE1
DSM 18020
67−19
RNS 08.23.3.1.A
Ech586
M2−3
XJ12
3937
FZ06
EC1
PA1
Onc5
ZXC1
MS2
EC2
A5272
S3−1
A586−S18−A17
ND14b
Secpp 1600
AP6
MS1
Ech1591
rhs1
rhs2
rhs3
rhs4
rhs5
rhs6
rhs7
rhs8
rhs9
rhs10
rhs11
rhs12
rhs13
rhs14
rhs15
rhs16
rhs17
rhs18
rhs19
rhs20
rhs21
rhs22
rhs23
rhs24
rhs25
rhs26
rhs27
rhs28
rhs29
rhs30
rhs31
rhs32
rhs33
rhs34
rhs35
rhs36
rhs37
rhs38
rhs39
rhs40
rhs41
rhs42
rhs43
rhs44
rhs45
rhs46
C-terminal tip identifiers
B 1 2 3 4
5 6 7 8 9 10
11 12 13 14 15 16
17 18 19 20 21
22 26
27 30
Figure 5.4. A total of 46 unique Rhs C-Terminal tips in Dickeya species. (A) Presence and
absence plot of C-terminal toxin tip sequences which carry the active toxin domain. Unique C-
terminal tip identifiers are along the bottom, and on the right side showing Dickeya strains. A
blastP analysis was performed on 113 C-terminal tip sequences from 31 genomes to search for
homology; C- terminal tips were considered the same if homologous (>85% similarity). (B) A
similarity network analysis shows 30 clusters and 16 singleton C-terminal tip putative toxin
functions. Node names represent the Rhs C-terminal tip identifiers, are color-coded based on
species, and their size corresponds to the protein length. Small circles represent orphan C-
terminal tips and large circles represent full Rhs-toxins.
193
Figure 5.5. Unique, general, and large unknown functional diversity among Rhs-toxins in
Dickeya species. A total of 13 distinct toxin domains from 63 Rhs C-terminal tips were identified
through a Pfam database analysis. Each ring is color coded to represent a different Dickeya species
and each source node represents a strain. Target node (gray) represents identified toxin domain and
shape represents putative predicted function: Dnases (circles); Rnases (diamonds); proteases
(squares). Edges connecting source and target nodes are also color-coded based on species. Of the
113 Rhs C-terminal tip sequences analyzed, 50 have undetermined toxin domains (octagon).
194
Figure 6. Rhs-toxin abundance and diversity across species and genera of plant pathogenic
bacteria (A) A total of 343 genomes from Xanthomonas (green), Ralstonia (pink), Pectobacterium
(brown), and Dickeya (blue) were analyzed. This accounted for 19, 6, 14, and 8 species,
respectively. On the x-axis are different bacterial genera and, on the y-axis, Rhs-toxin count per
genome. Superimposed are median values (horizontal black lines) and total number of Rhs-toxin
(on top) per genera. Each dot represents a genome color coded by genera. (B) An all vs all BlastP
analysis of 1,267 C-terminal sequences tips from all bacterial genera were used to perform a
similarity network analysis. A total of 181, 44, 106, and 46 clusters and singletons C-terminal tip
putative toxin functions were identified for Xanthomonas, Ralstonia, Pectobacterium and Dickeya,
respectively. Node names represent the Rhs C-terminal tip identifiers, are color-coded based on
species, and their size corresponds to the protein length. Small circles represent orphan C-terminal
tips and large circles represent full Rhs-toxins. (C) Shared Rhs C-terminal tips between bacterial
genera. C-terminal tips were grouped if they showed >85% similarity (Ralstonia-Pectobacterium,
Ralstonia-Dickeya, and Pectobacterium-Dickeya) and 50% or more (Xanthomonas-Ralstonia).
195
Supplemental figures
Supplemental Figure 2.1. Abundance and diversity of Rhs-toxin sequences in plant pathogenic Xanthomonas. The x-axis show
xanthomonas species, the y-axis number of Rhs-toxin per genome, and each dot represent a genome color coded by species. The size
of the circle corresponds to the number of Rhs-toxins per genome. The dotted line intercepting the y-axis shows the genomes with the
largest number of Rhs-toxins. Short black lines, interspersed among dots, represent the median value of total Rhs-toxins found for
each species. Data represents 207 genomes across 21 species.
196
Protein Length
1400
1700
2000
350
700
90
Xa
−F
13 J1
1
13 1a
15 14c
−0
88
17
3
CP CIT 301
B A
CP F 14 33
9
CPBF 7 4
6
CPBF 7 5
BF 66
IV 7
I 6
IV A 13 7
IA 17
39
78
SL R1
20
Yc 98
LM hA
NC G2 F1
PP 678
B 9
7
10 96
1
12 03
0
12 49
16 112
−O
c
29 t
−
30 5
−
50 1
53
57
6
71
9
AT 8 8284
CC 5−
BJ 33 10
SJ 91
Q2
0 B 3
CF 2006100
B 1
CF P 8 2
4
CFBP1 44
3
CFBP5 71
BP 824
66
9
CN 0
GB C 03
BC N1
M 30 5
A 77
NC FF1 M2
PP 061 8
B 81
NE4379
B1
22
CF A 8ra
B W
CF P 2 15
B 0
CF P6 36
1
CFBP6 66
9
CFBP6 75
9
CFBP6 90
9
CFBP7 91
1
CFBP7 11
1
DABP7 12
1
DAR72 19
R7 029
FD 38
C 86
16
09
IS GZ
O1 09
2C
LM 3
G7 K2
43
M9
M 12
SC
Un NT T
B− T 17
Xte X1 T2
cT 601 1
Xc G0 49
m 2
Xc H1 −2
mN 00
1 5
IL 003
M _234
OHI_35
_2 9
CP 61
CF BF 1
BP 424
LM383
G9 6
3
LMFap2 0
NB G 7 9
C 03
PD2815
SH 520
QP 5
YL 01
X2
B0 3 1
CF 7−0 05
IC BP 07
M 49
P7 8
3
jj 8
JS 2003
7 1
Or LM 49−3
eg 16
an 73
o 4
OS 108
U4
CF VT 93
BP 10
11 6
56
Strain
AU
S 0−
AXT20 9
O1 13
94
BB BAI 7
1 2
B 51 3
CFB15 −3
6
CFBP2 −2
2
CFBP7 86
B
CF 7 1 P
33
3
CFBP7 37
BP 341
IT 73
CC HG42
BB A4
JS 0002
49
KX −6
NC M O8
PP AI1 5
B4 34
34
N 6
N J01
PX J61
1
PXO21
1
PX O40
O9 4
X1 9A
from full Rhs-toxins and full Rhs-toxins from putative pseudogenes, respectively.
1−
YM5A
YN 15
YN 01
JC
CI LH3
X
CI 249
X3
8
M CIX 3
AI 97
M 0 5
AI 37
50
DD 69
1 3
HRDJ1
6
HR1−3
2
J 3−
LT R3− 46
6− 14
16
YT LT −1
9− 6−2
19
−2
CF
A 2
CFBP 8 M6
BP 44
84 3
45
GW
SI
CF W SS
B G
Protein length of Non−curated Rhs−toxins repertoires in Xanthomonas
AR P 46 16
CF T− 91
X
CFBP 2 tg2
IC BP 055
M 2
IC P 1 541
M 63
P 1
LM110 7
5
LMG 7 5
G 28
84
3
NC MALW1
PP I5 6
B 034
37
11
UP P
3
XtKB458
XtKm1
5
XtKm33
m7
Xtu XtuXtLr
NC −UP 469 8
9
NCPPB B51
PP 10 3
NC B 60
PP 264
B 9
Xv 902
1
LM601
15
9
citri
fragariae
hortorum
cucurbitae
arboricola
campestris
albilineans
axonopodis
euroxanthea
euvesicatoria
Species
<350aa
>2000aa
sp.
350−699aa
700−1886aa
oryzae
theicola
vasicola
sacchari
hyacinthi
perforans
prunicola
vesicatoria
translucens
genomes. The x axis represents individual strains, y axis represents the protein sequence length in amino acids (aa), and each dot
197
containing full length toxins. The dashed lines that intercept the y-axis at 350 and 1400 aa demarcate the orphan module C-terminal tip
tips and parts of the core region; 3) Any toxin between 700-1900 aa, indicative of a full Rhs-toxin; and 4) Any toxin >1900 aa also
into four categories: 1) Any toxin <350 aa, this represents Rhs-toxin tips; 2) Any toxin between 350-699 aa, this represents Rhs-toxins
represents an Rhs-toxin sequence color coded by species. The size of the dot represents the amino acid sequence length that are divided
Supplemental Figure 2.2. Extensive variation in protein sequence length among Rhs-toxin repertoires within Xanthomonas
Protein Length Range
Supplemental Figure 3.1. Abundance and diversity of Rhs-toxin sequences in plant pathogenic Ralstonia. The x-axis show
Ralstonia species, the y-axis number of Rhs-toxin per genome, and each dot represent a genome color coded by species. The size of the
circle corresponds to the number of Rhs-toxins per genome. The dotted line intercepting the y-axis shows the genomes with the largest
number of Rhs-toxins. Short black lines, interspersed among dots, represent the median value of total Rhs-toxins found for each species.
Data represents 91 genomes across 8 species.
198
Protein length of Non−curated Rhs−toxins repertoires in Ralstonia
2000
1700
Species
1400 mannitolilytica sp.
Protein Length
pseudosolanacearum syzygii
solanacearum wenshanensis
Protein Length Range
<350aa
700 350−699aa
700−1886aa
350
90
RS −M
G 000
Po 2
M
AT 303 078
M CC 107 0
2
2
1
0
8
2
5
UW 98
40 4
SS R 1
IB F 2 4−1
a 3
CF Bs 2
T1 1
17
UW163
UW386
UW551
UY763
W 31
_D 24
2
36 204
D2
RS I07
91 F8
LM I1 17
SL 340
24
RU SC 5
SE N24 9
PP 74
C −1
FJ JAT442 .F1
IB H F50
AF 1 09
M FF 114 2
M FF 114 9
M FF 416 1
AF 30 48
31 0
M 93
Sw 31
PS 82
SL X05
SL 064
SL 729
SL 175
SL 300
T1 0
RS 42
BD RS6 7
BR 50
RS 9
N2 76
FJ AT 244 F1
45 45. 0
FJ 4.F5 50
RS 488
RS 9
56 1
PS 9
RU S 4 1
00
FJ T1 T_ 7
3
FJ T1 5244 F8
FJ JAT 52.F50
FJ T15 463. 0
FJ 153 4.F 1
FJ AT 40. F6
AT 53 F1
A 4 8
F1 1
FJ AT9 0−1
FJ CI P29 5
FJ AT 04. 50
FJ 153 04. 0
FJ 153 53. 0
FJ 535 F50
K
K CC K6
T1
T4
T5
T6
T7
T8
T9
YC j64
−
Pe 967
0
20
Rs
7
A 2 2
A 2 7
A 2 9
F 156
64
LL 22
AT 4 .F5
48
b0
CQ3A3
R FJ
69
A A 5
50
A 1 5
AT 524 .F
FJ AT 3.F
SB 57
B 71
AT 153 F5
AT 153 F5
0
G w7
RUN22
M F 07
16
19
ol
37
6
AT 1−
F
22
F 14 .F
A 1 −
F T 42
SB A
2
2
3
3
.
1 .
R
M
8
SN
M
A
A
K
UG
Strain
Supplemental Figure 3.2. Extensive variation in protein sequence length among Rhs-toxin repertoires within Ralstonia genomes.
The x axis represents individual strains, y axis represents the protein sequence length in amino acids (aa), and each dot represents an
Rhs-toxin sequence color coded by species. The size of the dot represents the amino acid sequence length that are divided into four
categories: 1) Any toxin <350 aa, this represents Rhs-toxin tips; 2) Any toxin between 350-699 aa, this represents Rhs-toxins tips and
parts of the core region; 3) Any toxin between 700-1900 aa, indicative of a full Rhs-toxin; and 4) Any toxin >1900 aa also containing
full length toxins. The dashed lines that intercept the y-axis at 350 and 1400 aa demarcate the orphan module C-terminal tip from full
Rhs-toxins and full Rhs-toxins from putative pseudogenes, respectively.
199
Supplemental Figure 4.1. Abundance and diversity of Rhs-toxin sequences in plant pathogenic Pectobacterium. The x-axis show
pectobacterium species, the y-axis number of Rhs-toxin per genome, and each dot represent a genome color coded by species. The size
of the circle corresponds to the number of Rhs-toxins per genome. The dotted line intercepting the y-axis shows the genomes with the
largest number of Rhs-toxins. Short black lines, interspersed among dots, represent the median value of total Rhs-toxins found for each
species. Data represents 62 genomes across 15 species.
200
Protein length of Non−curated Rhs−toxins repertoires in Pectobacterium
Species
actinidiae odoriferum
aquaticum parmentieri
aroidearum parvum
2000 atrosepticum polaris
brasiliense quasiaquaticum
1700
cacticida sp.
carotovorum versatile
Protein Length
1400
Protein Length Range
<350aa
700 350−699aa
700−1886aa
350
90 >2000aa
10 5
BC 1
IF HC
8− ar
L6
C3 5
QJ 2
2A
BI 06
A3 QK 392
2
Gr A
1
NA 12
PC 1
1
64
5
ZJ 3
−1
−2
2.1
0
XP 4
IF 08
IF 41
BI 21
31
1
1
.1
1.1
QJ 1
QJ 1
3
43
AG 2
TS 309
RC 1
W 7
W 61
FN 163
12
S7
1
YT 211
NA 37
W 193
03
CC HJ
ZM
5
−
16
15
LC 7
LJ
04
BC
PC
−S −Pa
C1
−1
13
48
SR
−S 13−
01
02
31
A_ 169
CS
9
SC en
RO
81
23
21
14
3− 2−M
25
O1
77 −F1
21 −J1
NI 10
QK
A
PL
PP
4
4
NI 222
F1
F1
−4
10
ZL H
52
91
SR
32 K:2
BS
JK
JR
−A
−O
O1
AK
CF LS
B5
B5
B5
SX
PP
e
20
K:
IP 062 BZ
EC
20
O
RI
C1
4
GX
1
1
19
18
M
SC
−S
2
S1
−S
98
PC
12
77
A4
A7
41
A2
A0
4
21
O:
O:
IP
Strain
Supplemental Figure 4.2. Extensive variation in protein sequence length among Rhs-toxin repertoires within Pectobacterium
genomes. The x axis represents individual strains, y axis represents the protein sequence length in amino acids (aa), and each dot
represents an Rhs-toxin sequence color coded by species. The size of the dot represents the amino acid sequence length that are divided
into four categories: 1) Any toxin <350 aa, this represents Rhs-toxin tips; 2) Any toxin between 350-699 aa, this represents Rhs-toxins
tips and parts of the core region; 3) Any toxin between 700-1900 aa, indicative of a full Rhs-toxin; and 4) Any toxin >1900 aa also
containing full length toxins. The dashed lines that intercept the y-axis at 350 and 1400 aa demarcate the orphan module C-terminal tip
from full Rhs-toxins and full Rhs-toxins from putative pseudogenes, respectively.
201
Supplemental Figure 5.1. Abundance and diversity of Rhs-toxin sequences in plant pathogenic Dickeya. The x-axis show Dickeya
species, the y-axis number of Rhs-toxin per genome, and each dot represent a genome color coded by species. The size of the circle
corresponds to the number of Rhs-toxins per genome. The dotted line intercepting the y-axis shows the genomes with the largest number
of Rhs-toxins. Short black lines, interspersed among dots, represent the median value of total Rhs-toxins found for each species. Data
represents 34 genomes across 11 species.
202
Protein length of Non−curated Rhs−toxins repertoires in Dickeya
2000
1700
Species
chrysanthemi parazeae
Protein Length
350
90
L7
3
06
2T
E1
S1
S2
ID
2
P6
65
b
37
3b
72
−1
18
0
86
47
1
A
C
2−
17
EC
EC
A
J1
PA
00
14
nc
−1
02
59
FZ
PL
.2
ZX
JZ
.L
39
E2
52
C
64
20
S3
1.
h5
19
A
M
16
D
O
18
67
h1
.1
3.
03
8−
S_
10
Ec
M
05
3.
Ec
p
SM
6.
S1
16
M
cp
.2
SM
.1
08
6−
N
Se
D
R
LA
58
S
N
A
R
Strain
Supplemental Figure 5.2. Extensive variation in protein sequence length among Rhs-toxin repertoires within Dickeya genomes.
The x axis represents individual strains, y axis represents the protein sequence length in amino acids (aa), and each dot represents an
Rhs-toxin sequence color coded by species. The size of the dot represents the amino acid sequence length that are divided into four
categories: 1) Any toxin <350 aa, this represents Rhs-toxin tips; 2) Any toxin between 350-699 aa, this represents Rhs-toxins tips and
parts of the core region; 3) Any toxin between 700-1900 aa, indicative of a full Rhs-toxin; and 4) Any toxin >1900 aa also containing
full length toxins. The dashed lines that intercept the y-axis at 350 and 1400 aa demarcate the orphan module C-terminal tip from full
Rhs-toxins and full Rhs-toxins from putative pseudogenes, respectively.
203
Supplemental Figure 6. Example of Rhs-toxin flanked by transposases. Closer examination of
Rhs-toxins bewteen 400-700 aminoacids long (light green) and its cognate immunity protein
(bright green) depicted the truncation of the N-terminal region flanked by two transposases, one in
the upstream and downstream region (yellow).
204
APPENDICES
205
Appendix A
#! /usr/bin/perl -w
use strict;
use warnings;
use Bio::SeqIO;
use Getopt::Long;
#2008-05-05
#- fork of gb2fasta_pep_loc.pl
#- modified to handle DNA and peptide export
#2008-04-30
#- if there is no locus_tag, look for a gene tag
my $genetcode = 11;
my $use_gene_tag;
my $new_loc;
my $dna;
# my $format = 'fasta';
if($#ARGV<1){
print "Usage: $0 <gb file> <taxa name>\n";
print "This program export CDSs from a genbank file to a fasta file\n";
print "Options:\n\t-code <number>: NCBI genetic code [default=11]
\t-gene: Use the gene tag if the locus_tag is not available [default=OFF]
\t-new <tag>: Apply a new locus tag to each CDS starting form <tag>1
\t-dna: Export DNA sequences instead of proteins\n";
exit;
}
my $ext = '.pep.loc.fasta';
if($dna) { $ext = '.dna.loc.fasta' }
my $sp=$ARGV[1];
206
open OUT,">$ARGV[1]$ext";
my $n = 0;
207
}
}
}
}
else {
print "Can't find a locus_tag for this CDS:\n";
for my $tag ($feat_object->get_all_tags) {
print " tag: ", $tag, "\n";
for my $value ($feat_object->get_tag_values($tag))
{
print " value: ", $value, "\n";
}
}
}
}
}
}
}
##This command searches the file for Rhs proteins using the model
hmmsearch 54_42_Model $x".pep.loc.fasta" > "results_"$x
grep ">>" "results_"$x > "RhsL_"$x
sed 's/>//g' "RhsL_"$x | sed 's/ //g' > "RhsList_"$x
#This command indexes the database (the whole protein genome file)
esl-sfetch --index $x".pep.loc.fasta"
##This command uses the list generated from the pipeline to fetch/present the proteins from the
database and save them into another file
esl-sfetch -f $x".pep.loc.fasta" "RhsList_"$x > $x"_X.fa"
done
208
#!/bin/bash
209