Professional Documents
Culture Documents
2003 Comparative Genomics of The Vitamin B12 PDF
2003 Comparative Genomics of The Vitamin B12 PDF
2003 Comparative Genomics of The Vitamin B12 PDF
Using comparative analysis of genes, operons, and tamin B12 (cyanocobalamin) with different upper axial ligands,
regulatory elements, we describe the cobalamin (vita- are essential cofactors for several important enzymes that cat-
FIG. 1. Biosynthetic pathways for adenosylcobalamin and other tetrapyrrolic cofactors in bacteria. The anaerobic and aerobic
Ado-CBL pathways are characterized by the early and late cobalt insertions, respectively. In bacteria with the anaerobic pathway, cobalt is inserted
into the macrocycle using either the CbiK (as in S. typhimurium) or CbiX chelatases. (as in B. megaterium). S. typhimurium gene names are
underlined and used throughout this work. Similar genes of S. typhimurium and P. denitrificans are arranged within same gray block (see the
introduction for explanation). Various chelatases are in black blocks. The vitamin B12 and cobalt transport routes are shown by lines with arrows.
ond, since the majority of the intermediates of the anaerobic, route is probably mediated via the complexed cobalt ion, which
but not aerobic, pathway have the cobalt ion inserted into the can assume different valence states. In summary, CbiD, CbiG,
macrocycle, the pathways could use enzymes with different and CbiK are specific to the anaerobic route of S. typhimurium,
substrate specificities. CobG from P. denitrificans requires mo- whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are
lecular oxygen to oxidize precorrin 3A and is specific for the unique to the aerobic pathway of P. denitrificans.
aerobic pathway (11). The respective CR oxidation of anaerobic In most bacteria, cobalt and other heavy metal ions are
41150 Cobalamin Biosynthesis in Bacteria
␣ MLO, BJA, SM, PD, SAR, AU, BME CobG CobN ⫹ CobST
RPA, RC ORF663 CobN ⫹ CobST
RS ? CobN ⫹ CobST
 BPS CobG CobN ⫹ ChlID/CobST
RSO CbiG CbiX; CobN ⫹ ChlID
␥ TY, KP, YE CbiG CbiK
PP, PU, PY, PA CobG CobN ⫹ ChIID
B/C BE, BI, HMO, DHA CbiG CbiX
CA, CPE, CB, DF, LMO CbiG CbiK
Act TFU, RK, DI, MT CobG CobN ⫹ ChlID
SX CbiG CbiX; CobN ⫹ ChlID
PI CbiG CbiX; CysGB
Cya PMA, CY, SN, TEL CbiG CbiX
AN CobG ?
with seven TMSs. In two other actinobacteria, Thermobifida chelatase contain the hypothetical gene cobW. This gene is
fusca and Rhodococcus str., we identified one more candidate always located immediately upstream of the cobN chelatase
cobalt transporter, encoded by the B12-regulated gene cbtE, component (Table I). Interestingly, the N-terminal part of
which has six possible TMSs and a histidine-rich loop between CobW has a P-loop nucleotide-binding motif and is similar to
transmembrane segments I and II. the urease/hydrogenase accessory proteins UreG and HypB,
A new member of the B12 regulon in Treponema denticola, which are involved in the GTP-dependent incorporation of Ni2⫹
CbtF, has a predicted signal peptide cleavage site on its N into the metallocenters of target enzymes. In addition, the
terminus. In Brucella melitensis, cbtF lies in one locus with variable loop between the conserved N- and C-terminal do-
Ni2⫹-dependent urease immediately upstream of the cbiMQO mains of CobW contains a histidine-rich motif possibly involved
genes and is thought to be involved in the nickel transport. In in metal binding. Finally, cobW is co-localized with predicted
addition, Rhodobacter capsulatus and F. nucleatum have a cobalt transporters in some genomes. We suggest that CobW is
single cbtF gene, which is B12-regulated in the former genome. required for the cobalt chelation during CBL biosynthesis and
In view of the existence of mixed cobalt/nickel families of trans- it is possible that the histidine-rich region of CobW is used to
porters (see above), CbtF, in conjunction with systems homol- store the cobalt ions within of the cell prior to their delivery to
ogous to CbiMQO, could be involved in cobalt transport. the chelatase complex.
Cobalt Chelatases—Insertion of cobalt ions into CR at early Another predicted member of the B12 regulon, the cfrX gene,
and late stages of the CBL biosynthesis is mediated by different was found within the conserved gene cluster cfrX-cobW-cobN in
cobalt chelatases, termed here as “early” and “late.” There are all CBL-producing ␣-proteobacteria except Sinorhizobium me-
at least two distinct early chelatases, CbiK and CbiX, and one liloti and B. melitensis. CfrX is weakly similar to various ferre-
late chelatase composed of CobN, CobS, and CobT subunits. In doxin proteins including the CBL-related ferredoxin CbiW of
contrast to the CobN component of the late chelatase, which is B. megaterium (35) and could act as an oxidoreductant during
widely distributed in bacteria, the CobS and CobT components cobalt insertion.
were found only in ␣-proteobacteria and Burkholderia Cobalt Reductases—Reduction of the cobalt ion of corrinoids
pseudomallei, where the cobST cluster is always separated is the least studied stage the CBL biosynthesis. It is a prereq-
from other CBL genes and never has an upstream B12 element. uisite for further corrinoid adenosylation. Although the NADH-
Thus, the CobST components of cobalt chelatase are missing in dependent flavoprotein with cobalt reductase activity was pu-
actinobacteria and pseudomonades. Nevertheless, genes simi- rified in P. denitrificans, the gene encoding this activity has not
lar to the Mg-chelatase subunits, namely chlID, were found in been identified (36). In S. typhimurium, however, in vitro stud-
clusters with various CBL genes, most often with cobN. Nota- ies showed that flavodoxin FldA can catalyze the co(II)rrinoid
bly, similar to CobNST, ATP-dependent Mg-chelatase involved reduction when the latter is bound to the adenosyltransferase
in the bacteriochlorophyll biosynthesis consists of three sub- enzyme (37). Using a similarity search, the fldA gene was
units, ChlH, ChlI, and ChlD, and ChlH is a close homolog of found in enterobacteria, Pasteurellaceae, ⑀-proteobacteria, and
CobN (34). The hypothesis that ChlI and ChlD are the missing cyanobacteria as well as in the Bacillus/Clostridium and CFB
components of the late cobalt chelatase was proved by phylo- groups of bacteria. Since FldA was found in some CBL path-
genetic analysis. Indeed, proteins associated with the chloro- way-deficient bacteria, its function is unlikely to be restricted
phyll and CBL biosynthesis form separate branches on the to the B12 synthesis. It is consistent with experimental facts
phylogenetic trees for both CobN/ChlH and ChlI families (data that the FldA protein, shared by several metabolic pathways, is
not shown). Based on these facts, we suggest that, in contrast an essential enzyme in E. coli (38). Nevertheless, FldA was not
to ␣-proteobacteria, the late cobaltochelatase complex of acti- found in bacteria with the aerobic CBL biosynthetic pathway.
nobacteria, pseudomonads, and -proteobacteria consists of the In this work, a candidate cobalt reductase associated with
CobN, ChlI, and ChlD subunits (Table III). the CBL biosynthesis was identified in most bacteria with the
CBL gene clusters of proteobacteria possessing late cobalt aerobic CBL pathway (␣-proteobacteria, burkholderia, pseudo-
Cobalamin Biosynthesis in Bacteria 41155
TABLE IV
Predicted B12-element-mediated regulation of bacterial genes, which are not involved in the CBL biosynthesis and transport
The cases of the predicted CBL-box-mediated regulation in pyrococci are underlined.
Gene Function Genome
monads, and actinobacteria), as well as in several other species and PDU-associated genes and to the single genes (data not
(Table I). The corresponding gene, bluB, was previously de- shown).
scribed in R. capsulatus as a gene of unknown function essen- In summary, adenosyltransferases were found in all B12-
tial for the CBL synthesis (39). We found that bluB orthologs utilizing prokaryotes except two clostridia and three methano-
are predominantly regulated by B12 elements and often genic archaea. However, the latter have only one B12-depend-
co-localized with various CBL genes. The BluB proteins are ent enzyme, a methyl-CBL-dependent methyltransferase, and
similar to various FMN-dependent reductases from the nitro- thus do not require the adenosylated form of CBL. Interestingly,
reductase family, including oxygen-insensitive NADH nitrore- pduO from Archaeoglobus fulgidus and btuR from Geobacter
ductase and NADH-flavin oxidoreductase, that catalyze the metallireducens appear in one putative operon with B12-depend-
electron transfer from NADH to various electron acceptors. The ent methylmalonyl-CoA mutase. Overall, it seems that particular
bluB and fldA genes never co-occur in CBL-producing pro- types of adenosyltransferases are specialized for particular B12-
teobacteria. The majority of these bacteria have only bluB, dependent enzymes or for the de novo CBL biosynthesis.
whereas it is not present in the fldA-containing group of enter- Nucleotide Loop Assembly Pathway—The pathways for the
obacteria. Thus, we propose that BluB functions in cobalt re- lower ligand synthesis of CBL (also known as the nucleotide
duction for the CBL biosynthesis. Moreover, bluB was also loop assembly) are thought to vary between bacterial groups
found in three proteobacteria with incomplete CBL pathways, (42). Some bacterial genomes have neither cobT nor cobC genes
M. flagellatus, A. vinelandii, and Ralstonia eutropha. The CBL required for the synthesis of ␣-ribazole from dimethylbenzimi-
pathways in these bacteria include all genes for the conversion dazole or have only one of these genes. In contrast, two other
of cobyrinic acid to Ado-CBL and are possibly involved in the genes of the nucleotide loop assembly pathway, cobU and cobS,
assimilation of exogenous corrinoids. We believe that predicted are conserved in all CBL-synthesizing bacteria with the excep-
cobalt reductase BluB, which presumably acts on cob(II)yrinic tion of cobU in archaea (see below).
acid a,c-diamide, is necessary for these incomplete pathways as Three Gram-positive bacteria, L. monocytogenes, Clostrid-
well. ium botulinum, and Thermoanaerobacter tengcongensis, lack
ATP:Corrinoid Adenosyltransferases—The active form of co- the cobT gene but have all other genes for nucleotide loop
enzyme B12, Ado-CBL, can be obtained either by de novo syn- assembly, including cobC. Instead, the CBL gene clusters of
thesis or by assimilation of exogenous corrinoids. Both routes these bacteria contain two hypothetical genes, named cblT and
require ATP:corrinoid adenosyltransferase encoded by the cblS, which are not similar to any known protein. However,
btuR gene. BtuR adenosylates either CBL or an intermediate these two genes were found in several other Gram-positive
prior to CBL. The search for btuR in bacterial genomes showed bacteria simultaneously with cobT. The cblTS operon of Clos-
that this widely distributed gene is usually co-localized with tridium perfringes and single cblS genes of D. halfniense and
other CBL genes and sometimes is B12-regulated (Table I). Heliobacillus mobilis are preceded by regulatory B12 elements.
However, among nearly 80 B12-utilizing species, BtuR was not In addition, the B12-regulated CBL operon of B. stearother-
found in 18 genomes. It is known that, besides BtuR, enter- mophilus contains the cblTS genes. The hypothetical protein
obacteria possess two other CBL adenosyltransferases, PduO CblT has five predicted transmembrane segments. These facts
and EutT, which are associated with the CBL-dependent 1,3- allow us to propose a possible role of new CblT and CblS
propanediol dehydratase and ethanolamine ammonia lyase en- proteins in uptake of dimethylbenzimidazole and its subse-
coded by the pdu and eut gene clusters, respectively (40, 41). quent transformation into ␣-ribazole-5P, respectively.
Strikingly, the BtuR, PduO, and EutT adenosyltransferases A case of nonorthologous displacement of the CBL genes was
show no sequence similarity. Homologs of the eutT gene were previously found in archaea, where the bacterial-type nucleoti-
found only within eut gene clusters, strongly suggesting that its dyltransferase CobU is replaced by a new nucleotidyltrans-
only function is in the ethanolamine utilization. In contrast, ferase named CobY (43). Here we extend this analysis using 13
pduO appears to be widely distributed in prokaryotes and, in instead of 6 complete archaeal genomes. All of these genomes
particular, in most BtuR-deficient bacteria, where it would fill appear to lack the cobU gene and possess the cobY gene, thus
the gaps in the CBL pathways (Supplementary Table VI). The confirming the nonorthologous displacement. The only excep-
pduO genes may reside within the pdu and CBL gene clusters tion is Pyrobaculum aerophilum, which lacks both nucleotidyl-
or be single genes. Notably, the phylogenetic tree of the PduO transferases. In 9 of 12 archaeal species, cobY is positionally
family contains distinct branches corresponding to the CBL- linked to other CBL genes.
41156 Cobalamin Biosynthesis in Bacteria
TABLE V
Predicted functions for B12-related genes
Various techniques of genome analysis were used to infer these functional predictions: clustering on the chromosome (C); protein fusion events
(F); co-occurrence phylogenetic profiles (O); analysis of regulatory sites (R); similarity search (S); motif search (M); and prediction of transmem-
brane segments (T). NOD, nonorthologous gene displacement. Asterisks mark new gene names given in this study. Genome abbreviations are listed
in Table I.
Protein Suggested function Genomes Reasons/comments
Here we identified one more case of nonorthologous gene single pduX gene preceded by a B12 element was found in
displacement of CBL genes in archaea. The cobC gene encoding S. coelicolor. Finally, Y. enterocolitica has two copies of pduX,
␣-ribazole-5P phosphatase was found in only two archaea, and one of them is co-localized with the cobD gene. The PduX
S. solfataricus and Thermoplasma. In all other archaea, except protein belongs to the GHMP kinase family and is weakly
A. fulgidus and Halobacterium sp., the hypothetical gene cobZ similar to galactokinase, L-homoserine kinase, and mevalonate
(PF0294 in P. furiousus) was found within the CBL gene clus- kinase. Since the CBL biosynthesis requires L-threonine-3P as
ters, being most often linked with cobS. CobZ is weakly similar a substrate for the CobD aminotransferase, and the positional
to bacterial phosphatidylglycerophosphatases but not to CobC. analysis shoes that PduX is probably CBL-related, we propose
Thus, we predict that CobZ is the missing ␣-ribazole-5P phos- the L-threonine kinase function to PduX.
phatase replacing CobC. In Halobacterium sp., the CBL locus The phylogenetic tree of CobN-related proteins has three
contains the hypothetical gene HSL01294, another predicted main branches (data not shown). The first two branches com-
nonorthologous displacement of CobC, which is weakly similar prise the B12-regulated cobalt chelatases CobN involved in the
to the phosphoglycolate phosphatases from proteobacteria but CBL synthesis and the magnesium chelatases ChlH required
has no orthologs in other archaea. for the bacteriochlorophyll biosynthesis. The third branch,
Among CBL-synthesizing ␣-proteobacteria, only four rhi- named here BtuS, includes hypothetical proteins of unknown
zobacteria, Mesorhizobium loti, S. meliloti, B. melitensis, and function from diverse bacterial and archaeal genomes. In all
A. tumefaciens, lack the cobC gene. Instead, we have found a cases, the btuS genes are clustered with a new gene, named
pair of genes, named cblXY, which is clustered with the cobT btuT, which encodes a hypothetical transporter with four pre-
and cobS genes. Similarly, all CBL-synthesizing actinobacteria dicted TMSs. In addition, the btuST clusters of P. aeruginosa,
lack the cobC gene but have a new hypothetical gene, named R. palustris, and N. europaea are co-localized with hypothetical
cblZ, which is mainly clustered with cobT, cobS, and cobU outer membrane receptor genes encoding proteins homologous
genes. The hypothetical proteins CblX, CblY, and CblZ are not to the vitamin B12 receptor BtuB. Moreover, in P. gingivalis
similar to any known protein; nevertheless, the CblX, contain- and B. fragilis, the btuST genes form a candidate operon with
ing a zinc ribbon motif, comprises a small metal-binding pro- the iron-induced hemoglobin transport genes hmuYR (45). The
tein of about 60 amino acids. Thus, we propose that cblXY and BtuB-like HmuR receptor was recently found to bind hemoglo-
cblZ are nonorthologous replacements of the cobC gene in rhi- bin, hemin, various porphyrins, and metalloporphyrins (46). In
zobacteria and actinobacteria, respectively. archaea, the btuST genes are linked to the btuW gene encoding
Other New Members of the B12 Regulon Related to the CBL a hypothetical transporter with seven predicted TMSs. These
Biosynthesis—The 1,2-propanediol utilization operon pdu of observations allow us to propose that the hypothetical trans-
S. typhimurium includes the pduX gene of unknown function porter BtuT, chelatase BtuS, and homologs of the BtuB/HmuR
(44). Here we found that some Gram-positive bacteria, namely receptors (or BtuW in archaea) are involved in the transport
three clostridia, L. monocytogenes, D. halfniense, and H. mobi- and salvage of various metalloporphyrines rather than in the
lis, contain the pduX genes located within the CBL gene clus- CBL biosynthesis.
ters, usually adjacent to the cobD gene (Table I). Most of these The first gene of the B. megaterium cbi operon, cbiW, encodes
clusters are predicted to be B12-regulated. In addition, the a hypothetical ferredoxin and could be involved in the CBL
Cobalamin Biosynthesis in Bacteria 41157
biosynthesis, possibly acting as oxidoreductant during the ring Pseudomonas species, B. melitensis, B. pseudomallei, S. coeli-
contraction process under anaerobic conditions (35). Homologs color, and cyanobacteria, whereas the CbiG-CbiF fusions were
of cbiW are widely distributed in prokaryotes, but only some of found in the CFB group of bacteria. Thus, we place CbiG
them are clustered with CBL genes and regulated by B12 between CbiH and CbiF on the pathway of CBL biosynthesis
elements. The cbiW genes are co-localized with cobalt trans- (Fig. 1).
porters and CBL biosynthetic genes in Ralstonia solanacearum B12-regulated Genes Not Involved in the CBL Biosynthesis
and B. stearothermophilus and with B12 transport systems in and Transport—Analysis of the regulatory B12 elements in
Chloroflexus aurantiacus and Anabaena sp. In L. interrogans bacterial genomes allowed us to detect B12-regulated genes
and Halobacterium sp., CbiW and the cobalt chelatase CbiX are that are not involved in the CBL biosynthesis. An unexpected
encoded by a single fused gene. Thus, the B12-related ferredox- result was that most of these genes appear to belong to B12-de-
ins CbiW occur only in bacteria with anaerobic CBL pathways pendent metabolic pathways (Table IV).
(see below). First, in some ␣-proteobacteria, actinobacteria, and Bacillus
M. tuberculosis, in contrast to most actinobacteria, lacks the species, as well as in B. fragilis and T. elongatus, B12 elements
cobF gene, but the CBL cluster of this bacterium contains were found upstream of the metE gene encoding the B12-inde-
another gene, named metZ, which is not similar to cobF but is pendent methionine synthase. On the other hand, genes encod-