Professional Documents
Culture Documents
Metagenomics: Novel Enzymes From Non-Culturable Microbes
Metagenomics: Novel Enzymes From Non-Culturable Microbes
Metagenomics: Novel Enzymes From Non-Culturable Microbes
doi: 10.1093/femsle/fnx211
Advance Access Publication Date: 4 October 2017
Minireview
ABSTRACT
In the transition to the post-petroleum economy, there is a growing demand for novel enzymes with high process
performances to replace traditional chemistry with a more ‘green’ approach. To date, microorganisms encompass the
richest source of industrial biocatalysts, but the Earth-living microbiota remains largely untapped by using traditional
isolation and cultivation methods. Metagenomics, which is culture independent, represents a powerful tool for discovering
novel enzymes from unculturable microorganisms. Herein, we summarize the variety of approaches adopted for mining
environmental DNA and, based on a systematic literature review, we provide a comprehensive list of 332 industrially
relevant enzymes discovered from metagenomes within the last three years.
INTRODUCTION cells estimated to be 4–6 × 1030 (Bunge, Willis and Walsh 2014),
inhabiting a widest variety of ecosystems, including the less-
Microbial enzymes are employed in almost all industrial sectors,
hospitable habitats, such as hot springs, nearly saturated salt
from the chemical, pharmaceutical and food industries to the
brines, acid mine waters at pH near zero, deep-sea hydrother-
manufacturing of detergents, textiles, leather, pulp and paper
mal vents and Antarctic ices (Mirete, Morgante and Gonzàlez-
(Demain and Adrio 2008). In 2015, the application of industrial
Pastor 2016). Environmental DNA (eDNA) analysis of microbial
enzymes generated a turnover of about 5 billion USD, a value
communities currently estimates that only from 0.1% to 1% of
that is forecasted to rise to 6.9 billion USD in 2017 and to 10.7
these prokaryotes are culturable by using traditional microbio-
billion USD in 2024 (World Enzyme to 2017, https://www.gmin
logical methods (Culligan et al. 2014a). Also part of eukaryotic
sights.com/industry-analysis/enzymes-market), thanks to the
microbes is unculturable (Hirst, Kita and Dawson 2011), although
increasing demand for novel biocatalysts with high process per-
the extent of this phenomenon is less clear. Magnuson and
formances to replace traditional chemistry. Enzyme-based pro-
Lasure (2002) reported that 10% to 30% of fungi from different
cesses are favored by reduced costs, increased efficiency, im-
soils can be cultivated.
proved product recovery, and reduced use of toxic compounds
In the last two decades, the development of culture-
and harsh conditions. Microorganisms represent the largest pro-
independent methods based on collecting biological material
portion of biomass on Earth, with a total number of prokaryotic
1
2 FEMS Microbiology Letters, 2017, Vol. 364, No. 21
from environmental samples (metagenomics, metatran- stimulates active chitinolytic microbial communities, increas-
scriptomics and metaproteomics) revealed the potential of ing the chance of finding novel chitinases (Cretoiu et al. 2015;
unculturable microorganisms as sources of novel enzymes. Stöveken et al. 2015). Similar approaches were used to identify
Metagenomics is generally considered the most promising novel hemicellulases (Sun et al. 2015; Zhao et al. 2017) and ox-
methodology for identifying innovative biocatalysts from idoreductases (Nagayama et al. 2015) from cellulose-amended
prokaryotic eDNA (Lorenz and Eck 2005), although recent ad- samples or artificially polluted soils. A hormone-sensitive li-
vances in metaproteomics have shown great potential (Wilmes, pase was recovered from an olive oil-fed permafrost-derived mi-
Heintz-Buschart and Bond 2015) leading to the discovery of crobiome (Petrovskaya et al. 2016). Incorporating stable-isotope
novel lipases (Sukul et al. 2017) and cellulases (Speda et al. probing is another tool for innovating target library construc-
2017). Differently from prokaryotic genes that can be directly tion: for example, soil sample incubation with stable-isotope-
transcribed and translated in a suitable heterologous host, the labeled carbohydrates enriched the eDNA of those bacteria ac-
presence of large introns in eukaryotic genes limits metage- tively engaged in degrading plant-derived biomasses (Verastegui
nomic analyses. Consequently, starting from the pioneering et al. 2014).
Target DNA source Library Heterologous Screening Number of Number of Hit Number of Reference
type host approach screened positive frequency characterized
Activity EC number clones clones enzymes
Cellulase/hemicellulase
Endoglucanase 3.2.1.4 Soil Plasmid E. coli Function-based (A) 2.4 × 104 na na 1 Pimentel et al. (2017)
Endoglucanase 3.2.1.4 Hot spring / / Sequence-based (G) / / / 1 Zhao et al. (2017)
sediment
Endoglucanase 3.2.1.4 Ovine rumen BAC E. coli Function-based (A) 1.3 × 104 6 1:2150 1 Cheng et al. (2016)
Endoglucanase 3.2.1.4 Soil Plasmid E. coli Function-based (A) na 1 na 1 Garg et al. (2016)
Endoglucanase 3.2.1.4 Compost Cosmid E. coli Function-based (A) 1.0 × 104 1 1:10 000 1 Meneses et al. (2016)
Endoglucanase 3.2.1.4 Beer lees / / Sequence-based (G) / 23 na 3 Yang et al. (2016)
Endoglucanase 3.2.1.4 Paddy soil Fosmid E. coli Function-based (A) 2.5 × 104 na na 1 Zhou et al. (2016)
Endoglucanase 3.2.1.4 Soil / / Sequence-based (E) / 1 na 1 Hua et al. (2015)
Endoglucanase 3.2.1.4 Compost Fosmid E. coli Function-based (A) 2.1 × 104 3 1:7000 2 Okano et al. (2015b)
Endoglucanase 3.2.1.4 Soil Fosmid E. coli Function-based (A) 1.0 × 104 1 1:10 000 1 Mai et al. (2014)
Endoglucanase 3.2.1.4 Algae Plasmid E. coli Function-based (A) 4.0 × 104 20 1:2000 1 Martin et al. (2014)
Endoglucanase 3.2.1.4 Insect gut Fosmid E. coli Function-based (A) 9.2 × 104 1 1:92 000 1 Lee et al. (2014)
Endoglucanase 3.2.1.4 Soil Plasmid E. coli Function-based (A) 2.4 × 104 1 1:24 000 1 Xiang et al. (2014)
β-Glucosidase 3.2.1.21 Soil Fosmid E. coli Function-based (A) 1.0 × 105 1 1:100 000 1 Matsuzawa and Yaoi (2017)
β-Glucosidase 3.2.1.21 Insect gut Plasmid E. coli Function-based (A) 8.0 × 105 13 1:61 500 1 Gao et al. (2016a)
β-Glucosidase 3.2.1.21 Soil Cosmid E. coli Sequence-based (F) na na na 1 Gomes-Pepe et al. (2016)
β-Glucosidase 3.2.1.21 Bovine rumen Fosmid E. coli Function-based (A) 1.4 × 104 1 1:14 000 1 Ramı́rez-Escudero et al.
(2016)
β-Glucosidase 3.2.1.21 Soil Plasmid E. coli Function-based (A) 5.0 × 104 5 1:10 000 1 Cao et al. (2015)
β-Glucosidase 3.2.1.21 Bovine rumen Fosmid E. coli Function-based (A) 3.0 × 104 45 1:650 1 Loaces et al. (2015)
β-Glucosidase 3.2.1.21 Kusaya gravy Plasmid E. coli Function-based (A 1.0 × 104 7 1:1450 1 Uchiyama, Yaoi and
and B) Miyazaki (2015)
β-Glucosidase 3.2.1.21 Soil Fosmid E. coli Function-based (A) 9.8 × 104 2 1:49 000 2 Bergmann et al. (2014)
β-Glucosidase 3.2.1.21 Soil Plasmid E. coli Function-based (A) 9.0 × 104 4 1:22 500 2 Biver et al. (2014)
β-Glucosidase 3.2.1.21 Bovine rumen Fosmid E. coli Function-based (A) 2.0 × 104 1 1:20 000 1 Li et al. (2014b)
β-Glucosidase 3.2.1.21 Hydrothermal Plasmid E. coli Function-based (A) na na na 1 Schröder et al. (2014)
spring
β-Glucosidase 3.2.1.21 Insect gut Fosmid E. coli Function-based (A) 1.0 × 104 12 1:850 3 Zhang et al. (2014a)
β-Xylanase 3.2.1.8 Insect gut Fosmid E. coli Function-based (A) 1.0 × 104 13 1:750 1 Qian et al. (2015)
β-Xylanase 3.2.1.8 Compost / / Sequence-based (E) / na / 1 Sun et al. (2015)
β-Xylanase 3.2.1.8 Ovine rumen Fosmid E. coli Function-based (A) 1.2 × 104 18 1:650 1 Wang et al. (2015b)
β-Xylosidase/ 3.2.1.37 Bovine rumen Phagemid E. coli Function-based (A) na na na 1 Jordan et al. (2016)
arabinofuranosidase
α-Xylosidase/ 3.2.1.177 Soil Fosmid E. coli Function-based (A) 5.0 × 104 1 1:50 000 1 Matsuzawa et al. (2016)
arabinofuranosidase
β-Xylosidase/ 3.2.1.21 Compost Plasmid E. coli Function-based (A) 3.0 × 104 40 1:750 1 Matsuzawa, Kaneko and
arabinofuranosidase Yaoi (2015)
α-Arabinofuranosidase 3.2.1.55 Insect gut Fosmid E. coli Function-based (A) 4.0 × 104 87 1:450 4 Arnal et al. (2015)
α-Fucosidase 3.2.1.51 Soil Fosmid E. coli Function-based (A) 1.0 × 105 7 1:14 300 7 Lezyk et al. (2016)
Berini et al.
3
Table 1. (Continued)
Target DNA source Library Heterologous Screening Number of Number of Hit Number of Reference
type host approach screened positive frequency characterized
Activity EC number clones clones enzymes
β-Galactosidase 3.2.1.23 Soil Cosmid E. coli and Function-based (B) 7.9 × 104 na na 3 Cheng et al. (2017)
Sinorhizobium
melitoti
α-Galactosidase 3.2.1.23 Hot spring water / / Sequence-based (G) / / / 1 Schröder et al. (2017)
and sediment
β-Galactosidase 3.2.1.23 Soil Plasmid E. coli Function-based (A) 1.3 × 106 6 1:216 700 1 Erich et al. (2015)
β-Galactosidase 3.2.1.23 Marine sediment Plasmid E. coli Function-based (A) na 28 na 1 Li et al. (2015)
β-Galactosidase 3.2.1.23 Hot spring water / / Sequence-based (E) / 1 / 1 Liu et al. (2015b)
β-Galactosidase 3.2.1.23 Soil Plasmid E. coli Function-based (A) 7.0 × 105 1 1:700 000 1 Wang et al. (2014)
β-N- 3.2.1.52 Soil Fosmid E. coli Function-based (A) 1.0 × 105 30 1:3300 2 Nyffenegger et al. (2015)
Acetylhexosaminidase
α-Rhamnosidase 3.2.1.40 Feces Fosmid E. coli Function-based (A) 2.0 × 104 na na 1 Rabausch, Ilmberger and
FEMS Microbiology Letters, 2017, Vol. 364, No. 21
Streit (2014)
Lichenase 3.2.1.73 Soil Plasmid E. coli Function-based (A) 2.0 × 104 1 1:20 000 1 Kim, Oh and Kwon (2014)
(endoglucanase)
Cellulase 3.2.1.- Compost Fosmid E. coli Function-based (A) 6.0 × 103 24 1:250 1 Okano et al. (2014)
Polygalacturonase 3.2.1.15 Soil Plasmid E. coli Function-based (A) 2.0 × 103 9 1:200 1 Sathya, Jacob and Khan
(2014)
Cellobiose epimerase 5.1.3.11 Ovine rumen / / Sequence-based (E) / 71 / 2 Wasaki et al. (2015)
Glycosyl hydrolase with 3.2.1.- Ovine rumen Fosmid E. coli Function-based (A) 1.2 × 105 155 1:750 1 Song et al. (2017)
multifunctional activity
Glycosyl hydrolase with 3.2.1.- Bovine rumen / / Sequence-based (G) / 2597 / 1 Patel et al. (2016)
multifunctional activity
Glycosyl hydrolase with 3.2.1.- Bovine rumen BAC E. coli Function-based (A) na na na 1 Gruninger et al. (2014)
multifunctional activity
Glycosyl hydrolase with 3.2.1.- Compost Fosmid E. coli Function-based (A) 2.5 × 102 5 1:50 1 Sae-Lee and Boonmee
multifunctional activity (2014)
4
β-Xylanase, cellulase, 3.2.1.8, Insect gut Fosmid E. coli Function-based (A) 4.0 × 10 31 1:1300 8 Rashamuse et al. (2017)
α-fucosidase 3.2.1.51
β-Galactosidase, 3.2.1.23, Wheat straw Fosmid E. coli Function-based (A) 4.4 × 104 71 1:600 7 Maruthamuthu et al. (2016)
β-xylosidase, 3.2.1.37,
α-glucosidase 3.2.1.20
Endoglucanase, 3.2.1.4, Sugarcane Fosmid E. coli Function-based (A) 1.0 × 105 7 1:14 300 2 Kanokratana et al. (2015)
endoxylanase 3.2.1.8 bagasse
β-Xylosidase, xylanase, 3.2.1.37, Digester Fosmid E. coli Function-based (A) 9.7 × 103 178 1:54 4 Wang et al. (2015a)
β-glucosidase, cellulase 3.2.1.8,
3.2.1.21
Target DNA source Library Heterologous Screening Number of Number of Hit Number of Reference
type host approach screened positive frequency characterized
Activity EC number clones clones enzymes
Glycosyl hydrolases 3.2.1.- Digester / / Sequence-based (G) / 163 / 6 Wei et al. (2015)
belonging to various
families
β-Glucosidase, 3.2.1.21, Subseafloor / / Sequence-based (G) / 60 / 10 Klippel et al. (2014)
endomannanase, 3.2.1.25, sediments
endoxylanase, 3.2.1.8,
β-xylosidase 3.2.1.37
Cellulase, xylanase 3.2.1.4, 3.2.1.8 Soil Plasmid E. coli Function-based (A) 1.5 × 105 6 1:25 000 3 Mori et al. (2014)
β-Glucosidase, 3.2.1.21 Soil Plasmid E. coli Function-based (A) 5.0 × 105 9 1:55 600 9 Stroobants, Portetelle,
glycosyltransferases Vandenbol (2014)
Amylase
Amylopullulanase 3.2.1.1 Insect gut Fosmid E. coli Function-based (A) 9.2 × 104 1 1:92 000 1 Lee et al. (2016b)
(α-amylase)
α-Amylase 3.2.1.1 Cow dung Plasmid E. coli Function-based (C) 1.0 × 105 200 1:500 1 Pooja et al. (2015)
α-Amylase 3.2.1.1 Submarine ikaite BAC E. coli Function-based (A) 2.8 × 103 3 1:900 3 Vester, Glaring and
column Stougaard (2014)
α-Amylase 3.2.1.1 Feces Fosmid E. coli Function-based (A) 5.0 × 104 8 1:6200 1 Xu et al. (2014)
Chitinase
Chitinase 3.2.1.14 Soil Fosmid E. coli Sequence- (D) and 7.8 × 103 1 1:7800 1 Hjort et al. (2014); Berini
function-based (A) et al. (2017)
Chitinase 3.2.1.14 Soil Fosmid E. coli Function-based (A) 5.0 × 104 15 1:3300 1 Thimoteo et al. (2017)
Chitinase 3.2.1.14, Soil Fosmid E. coli Sequence-based (D) 1.45 × 105 8 1:18 100 1 Cretoiu et al. (2015)
3.5.1.41
Chitinase 3.2.1.14 Pig feces / / Sequence-based (E) / 1 / 1 Liu et al. (2015a)
Chitinase 3.2.1.14, Soil / / Sequence-based(G) / 10 / 1 Stöveken et al. (2015)
3.5.1.41
Chitin deacetylase 3.5.1.41 Sediment Plasmid E. coli Sequence-based (F) 10 1 1:10 1 Liu et al. (2016)
Esterase/lipase
Carboxylesterase 3.1.1.1 Anaerobic Fosmid E. coli Function-based (A) 1.1 × 106 714 1:1500 77a Popovic et al. (2017)
digester, sunken
shipwreck’s tar,
water, soil etc.
Carboxylesterase 3.1.1.1 Marine mud Fosmid E. coli Function-based (A) 4.0 × 104 34 1:1200 1 Gao et al. (2016b)
Carboxylesterase 3.1.1.1 Soil Cosmid E. coli Function-based (A) 8.0 × 104 1 1:80 000 1 Jeon et al. (2016)
Carboxylesterase 3.1.1.1 Gill chamber Fosmid E. coli Function-based (A) 2.7 × 104 10 1:2700 3 Alcaide et al. (2015)
Carboxylesterase 3.1.1.1 Biogas digester Plasmid E. coli Function-based (A) 9.6 × 103 1 1:9600 1 Cheng et al. (2014)
Carboxylesterase 3.1.1.1 Hot vent Fosmid E. coli Function-based (A) 9.6 × 103 120 1:80 3 Placido et al. (2015)
sediment
Berini et al.
5
Table 1. (Continued)
Target DNA source Library Heterologous Screening Number of Number of Hit Number of Reference
type host approach screened positive frequency characterized
Activity EC number clones clones enzymes
Carboxylesterase 3.1.1.1 Seawater Phagemid E. coli Function-based (A) 3.0 × 105 23 1:13 050 5 Tchigvintsev et al. (2015)
Esterase 3.1.1.1 Sediment Fosmid E. coli Function-based (A) 7.2 × 103 10 1:720 1 Zhang et al. (2017)
Esterase 3.1.1.1 Marine sediment Fosmid E. coli Function-based (A) 3.9 × 103 19 1:200 1 De Santi et al. (2016)
Esterase 3.1.1.1 Compost Fosmid E. coli Function-based (A) 2.3 × 104 18 1:1300 1 Lee et al. (2016a)
Esterase 3.1.1.1 Moss Fosmid E. coli Function-based (A) 9.0 × 104 83 1:1100 6 Müller et al. (2017)
Esterase 3.1.1.- Hot spring mud / / Sequence-based (G) / 1 / 1 Zarafeta et al. (2016)
Esterase 3.1.1.1 Glacier soil Fosmid E. coli Function-based (A) 1.0 × 104 5 1:2000 1 De Santi et al. (2015)
Esterase 3.1.1.1 Sediment Fosmid E. coli Function-based (A) 2.0 × 105 1 1.200 000 1 Hu et al. (2015)
FEMS Microbiology Letters, 2017, Vol. 364, No. 21
Esterase 3.1.1.1 Hot spring water, Fosmid E. coli and Function-based (A 8.0 × 103 8 1:2650 2b Leis et al. (2015)
sediment and Thermus for E. coli, B for T.
compost thermophilus thermophilus)
Esterase 3.1.1.1 Hot spring water Fosmid E. coli Function-based (A) 1.2 × 104 6 1:2000 1 López-López et al. (2015)
Esterase 3.1.1.1 Compost Fosmid E. coli Function-based (A) 6.0 × 103 19 1:300 1 Okano et al. (2015a)
Esterase 3.1.1.1 Bovine rumen Fosmid E. coli Function-based (A) 2.8 × 104 3 1:9200 1 Rodrı́guez et al. (2015)
Esterase 3.1.1.1 Soil Plasmid E. coli Function-based (A) 1.0 × 104 3 1:3300 1 Sudan and Vakhlu (2015)
Esterase 3.1.1.1 Sediment Fosmid E. coli Function-based (A) 3.2 × 104 1 1:32 000 1 Seo et al. (2014)
Hormone-sensitive lipase 3.1.1.79 Soil Plasmid E. coli Function-based (A) 1.7 × 105 1 1:170 000 1 Dukunde et al. (2017)
Hormone-sensitive lipase 3.1.1.79 Sediment Fosmid E. coli Function-based (A) 2.0 × 104 12 1:1700 1 Huang et al. (2016)
Hormone-sensitive lipase 3.1.1.79 Permafrost Fosmid E. coli Function-based (A) 5.0 × 103 7 1:700 1 Petrovskaya et al. (2016)
Hormone-sensitive lipase 3.1.1.79 Sediment Fosmid E. coli Function-based (A) 1.1 × 104 7 1:1600 1 Li et al. (2014a)
Hormone-sensitive lipase 3.1.1.79 Sediment Plasmid E. coli Function-based (A) 1.2 × 105 15 1:8000 1 Peng et al. (2014)
Lipase 3.1.1.3 Hot spring Plasmid E. coli Function-based (A) 1.3 × 104 7 1:1900 1 Sahoo et al. (2017)
sediment
Lipase 3.1.1.3 Hot spring water BAC E. coli Function-based (A) 6.8 × 104 4 1:17 000 1 Yan et al. (2017)
Lipase 3.1.1.3 Soil / / Sequence-based (E) / na / 1 Kumar et al. (2017)
Lipase 3.1.1.3 Soil Plasmid E. coli Function-based (A) 2.0 × 103 15 1:130 1 Khan and Kumar (2016)
Lipase 3.1.1.3 Soil Fosmid E. coli Function-based (A) 5.0 × 105 32 1:15 600 2 Martini et al. (2014); Alnoch
et al. (2015)
Lipase 3.1.1.3 Water / / Sequence-based (F) / 777 / 1 Masuch et al. (2015)
Lipase 3.1.1.3 Marine sponge Plasmid E. coli Function-based (A) 6.5 × 103 1 1:6500 1 Su et al. (2015)
Target DNA source Library Heterologous Screening Number of Number of Hit Number of Reference
type host approach screened positive frequency characterized
Activity EC number clones clones enzymes
Lipase 3.1.1.3 Fed-batch reactor Cosmid E. coli Function-based (A) 1.0 × 104 10 1:1000 1 Brault et al. (2014)
Lipase 3.1.1.3 Soil BAC E. coli Function-based (A) 3.0 × 103 6 1:500 1 Pindi, Srinath and
Pavankumar (2014)
Lipase 3.1.1.3 Human oral Plasmid E. coli Function-based (A) 4.0 × 104 20 1:2000 1 Preeti et al. (2014)
microbiome
Esterase/lipase 3.1.1.- Soil Fosmid E. coli Function-based (A) 4.2 × 103 30 1:140 2 Pereira et al. (2015); Maester
et al. (2016)
3
Esterase/lipase 3.1.1.- Mud flat Plasmid E. coli Function-based (A) 3.0 × 10 9 1:300 2 Kim et al. (2015)
Esterase/lipase 3.1.1.- Soil Fosmid E. coli Function-based (A) 1.4 × 104 1 1:14 000 1 O’Mahony et al. (2015)
Esterase/lipase/ 3.1.1.- Bovine rumen Fosmid E. coli Function-based (A) 2.4 × 104 14 1:1700 5 Privé et al. (2015)
phospholipase
Esterase/lipase 3.1.1.- Water BAC E. coli Sequence-based (F) / / / 1 Fang et al. (2014)
Thioesterase 3.1.2.- Activated sludge Plasmid E. coli Function-based (A) 1.0 × 104 1 1:10 000 1 Sánchez-Reyez et al. (2017)
Glucuronoyl esterase 3.1.1.- Marine sediment Fosmid E. coli Function-based (A) 1.7 × 102 1 1:170 1 De Santi, Willassen and
Williamson (2016)
Sulfatase, 3.1.6.-, 2.8.2.1 Soil and rumen Plasmid E. coli Function-based (A) 1.3 × 106 14 1:89 000 13c Colin et al. (2015)
sulfotransferase
Feruloyl esterase 3.1.1.73 Hindgut Fosmid E. coli Function-based (A) 4.0 × 104 7 1:5700 6 Rashamuse et al. (2014)
symbiont
Phospholipase 3.1.1.- Hydrothermal Fosmid E. coli Function-based (A) 1.8 × 104 7 1:2600 1 Fu et al. (2015)
(patatin-like) vent
Phosphodiesterase 3.1.1.- Coalfield Fosmid E. coli Function-based (A) na 1 na 1 Singh et al. (2015a)
Wax ester synthase 2.3.1.75 Soil Fosmid E. coli Function-based (A) 3.3 × 105 155 1:2100 1 Kim et al. (2016a)
Oxidoreductase
Dioxygenase 1.13.11.- Soil Fosmid E. coli Function-based (B) 1.5 × 105 62 1:2400 1 dos Santos et al. (2015)
Dioxygenase 1.13.11.- Soil Plasmid E. coli Function-based (A) na 1 na 1 Wang et al. (2015c)
Dioxygenase 1.13.11.- Soil / / Sequence-based (G) / 510 / 4 Chemerys et al. (2014)
Alcohol dehydrogenase 1.1.1.1 Soil Plasmid E. coli Sequence- (D) and 1.2 × 102 23 1:5 2 Itoh et al. (2014)
function-based (A)
Alcohol dehydrogenase 1.1.1.1 Soil and compost Plasmid E. coli Sequence- (D) and 2.0 × 103 40 1:50 2d Itoh, Kariya and Kurokawa
function-based (A) (2014)
d-Amino acid oxidase 1.4.3.3 Soil Plasmid E. coli Sequence-based (F) 3.2 × 104 1 1:32 000 1 Ou et al. (2015)
Monooxygenase, 1.14.-, Soil Cosmid E. coli and P. Function-based (B) 2.2 × 105 29 1:7600 2 Nagayama et al. (2015)
dioxygenase 1.13.11.- putida
β-Carotene 1.14.99.36 Human gut Fosmid E. coli Function-based (B) 2.3 × 104 53 1:450 1 Culligan et al. (2014b)
monooxygenase microbiome
4
Glucose dehydrogenase 1.1.99.10 Hay infusion Phagemid E. coli Function-based (A) 2.0 × 10 13 1:1550 1 Basner and Antranikian
(2014)
Berini et al.
7
Table 1. (Continued)
Target DNA source Library Heterologous Screening Number of Number of Hit Number of Reference
type host approach screened positive frequency characterized
Activity EC number clones clones enzymes
Aldehyde dehydrogenase 1.2.1.3 Hot spring water / / Sequence-based (E) / na / 1 Chen et al. (2014)
and soil
Mercuric reductase 1.16.1.1 Brine pool / / Sequence-based (G) / 1 / 1 Sayed et al. (2014)
Multicopper oxidase 1.10.- Coalbed Fosmid E. coli Function-based (C) 4.6 × 104 24 1.1900 1 Strachan et al. (2014)
Bilirubin oxidase (laccase) 1.3.3.5 Activated sludge Fosmid E. coli Function-based (A) 1.0 × 105 1 1:100 000 1 Kimura and Kamagata
(2016)
Protease
Serine protease 3.4.21.- Tannery sludge Plasmid E. coli Function-based (A) 1.0 × 104 1 1:10 000 1 Devi et al. (2016)
Serine protease 3.4.21.- Hot spring Plasmid E. coli Function-based (A) 9.0 × 103 1 1:9000 1 Singh et al. (2015b)
sediment
Subtilisin-like serine 3.4.21.62 Underground Fosmid E. coli Function-based (A) 2.1 × 104 23 1:900 2 Apolinar–Hernández et al.
protease water (2016)
Cysteine protease 3.4.22.- Ovine rumen Plasmid E. coli Sequence-based (G) na na na 1 Faheem et al. (2016)
Metalloprotease 3.4.- Sludge Fosmid E. coli Function-based (A) 2.8 × 104 2 1:14 000 1 Morris and Marchesi (2015)
FEMS Microbiology Letters, 2017, Vol. 364, No. 21
Phosphatase/phytase
Phytase 3.1.3.- Bovine rumen / / Sequence-based (G) / 1 / 1 Mootapally et al. (2016)
Phytase 3.1.3.- Peat soil na na Sequence-based (F) na 4 na 1 Tan et al. (2016a)
Phytase 3.1.3.- Fungus garden na na Sequence-based (F) na 11 na 1 Tan et al. (2016b)
Phytase 3.1.3.- Groundwater na na Sequence-based (F) na 1 na 1 Tan et al. (2015)
Phytase 3.1.3.- Soil Fosmid E. coli Function-based (B) 1.4 × 104 28 1:500 1 Tan et al. (2014)
Alkaline phosphatase 3.1.3.1 Sediment Fosmid E. coli Function-based (C) 8.0 × 104 6 1:13 350 1 Lee et al. (2015)
Other
Amine transferase 3.6.1.- Hot spring / / Sequence-based (G) / 3 / 3 Ferrandi et al. (2017)
β-Agarase 3.2.1.81 Soil Fosmid E. coli Function-based (A) 1.0 × 105 3 1:33 350 1 Mai, Su and Zhang (2016)
Penicillin G acylase 3.5.1.11 Sediment Cosmid E. coli Function-based (B) 7.0 × 103 1 1:7000 1 Zhang et al. (2014b)
Epoxide hydrolase 3.3.2.8 Hot spring / / Sequence-based (G) / 2 / 2 Ferrandi et al. (2015)
Nitrilase 3.5.5.1 Water / / Sequence-based (G) / 1 / 1 Sonbol, Ferreira and Siam
(2016)
Exonuclease 3.1.11.1 Soil Plasmid E. coli Function-based (B) 2.7 × 103 1 1:2700 1 Silva-Portela et al. (2016)
Rhodanese (sulfur 2.8.1.1 Soil Plasmid E. coli Function-based (A) 8.5 × 103 5 1:1700 1 Bhat et al. (2015)
transferase)
The list was created by searching Pubmed database (accession on 17 July 2017) with the following query: (metagenom∗[Title/Abstract]) AND (enzyme[Title/Abstract]) AND (‘2014/01/01’[Date—Publication]: ‘2017/03/31’[Date—
Publication]) (where ‘enzyme’ was substituted with the name of each enzymatic class considered). The results were manually checked in order to select only those publications dealing with enzymes fulfilling three criteria:
(i) identification by metagenomics, (ii) heterologous expression and at least partial purification (thus confirming the activity of the putative hit), (iii) biochemical and/or functional and/or structural characterization.
For functional screening: A = phenotypical detection; B = heterologous complementation; C = induced gene expression (see also Fig. 1). For genetic screening: D = PCR amplification of metagenomic library DNA; E = PCR
amplification of eDNA; F = in silico analysis of metagenomic library DNA; G = in silico analysis of shotgun eDNA (see also Fig. 1). na = data not available.
a
28 from anaerobic digester, 21 from water sample, 11 from sunken shipwreck’s tar, 7 from soil and sediment, 5 from compost, 4 from eukaryote-associated microbiome, 1 from paper mill.
b
1 from sediment, 1 from compost.
c
10 from soil, 3 from rumen.
d
From compost.
transcription, heterologous protein misfolding, poor secretion range either sequentially or in parallel (Katzke et al. 2017). Recent
and inclusion body formation (Binda et al. 2013; Lam and Charles examples (Table 1) include an esterase discovered in a fos-
2015). On average, only 30%–40% of environmental bacterial mid library propagated in E. coli and in Thermus thermophilus
genes could be efficiently expressed in E. coli, a value dropping to (Leis et al. 2015), two dioxygenases from a cosmid metage-
7% for high G + C DNA (Gabor, Alkema and Janssen 2004). Ran- nomic library in Pseudomonas putida (Nagayama et al. 2015) and
dom insertion of bidirectional promoters into the library (Kim a β-galactosidase from a library in Sinorhizobium meliloti (Cheng
et al. 2016b) and addition of heterologous σ -factors for increas- et al. 2017). Interestingly, Bouhajja et al. (2017) identified novel
ing transcription proficiency (Gaida et al. 2015) were recently ex- toluene monooxygenase genes by cloning eDNA in three par-
perimented to increase E. coli performance. A different strategy allel hosts, i.e. E. coli, Cupriavidus metallidurans and Edaphobacter
is using alternative expression systems, such as Agrobacterium, aggregans.
Bacillus, Rhodococcus, Streptomyces and Pseudomonas, along with Another critical factor is the sensitivity of commonly used
a few archaea (Liebl et al. 2014). Multihost expression strate- function-based screening methods, which might be too low
gies were proposed too, using shuttle vectors with a broad host to make gene expression easily detectable (Gabor, Alkema
10 FEMS Microbiology Letters, 2017, Vol. 364, No. 21
and Janssen 2004; Ekkers et al. 2012). Classical phenotypical and hits are selected based on the emitted fluorescence or ab-
detection (A in Table 1, Fig. 1), applied for decades in screen- sorbance (Mair, Gielen and Hollfelder 2017). This method al-
ing microbial isolates and based on identifying specific phe- lowed screening over a million clones per day, recovering both
notypic traits associated with the activity of interest, remains slow and fast biocatalysts, promiscuous enzymes or those en-
the election method in metagenomic, too (260 enzymes). Phe- coded by rare members of the microbial community. Through
notypical detection on agar plates is still the most widely this approach, Colin et al. 2015 identified novel sulfatases and
used approach, especially when searching for hydrolytic en- phosphotriesterases from soil and rumen metagenomic libraries
zymes (such as lipases) forming easy-to-detect degradation (Table 1).
haloes (Placido et al. 2015; Popovic et al. 2017) on appro- In addition to phenotypical detection, two other functional
priate substrates. Alternative methods were based on high- screening approaches merit mention. Heterologous comple-
throughput assays in liquid culture (Zottig, Meddeb-Mouelhi and mentation (B in Table 1, Fig. 1) relies on the selection of clones
Beauregard 2016) or high-performance thin-layer chromatog- that have acquired the capability to grow under selective con-
raphy (Rabausch et al. 2013). High-throughput screening (HTS) ditions, such as in the presence of a specific substrate given as
with multiple substrates, either used in parallel (Ufarté et al. the sole carbon source. It was used to detect a phytase in E. coli
2016) or as a mixture (Maruthamuthu et al. 2016), significantly clones grown on phytate as the sole phosphorus source (Tan et al.
increased hit rate. Proper selection of the screening substrate 2014), as well as for the identification of a thermostable peni-
is highly recommended. Recently, novel hemicellulases were cillin G acylase by selecting E. coli clones growing in the pres-
discovered also thanks to a new generation of versatile multi- ence of penicillin G (Zhang et al. 2014b). Induced gene expression
ple chromogenic substrates mimicking insoluble plant cell wall (C in Table 1, Fig. 1) includes substrate-induced gene expression
components (Maruthamuthu et al. 2016). To minimize redun- (SIGEX) (Uchiyama et al. 2005), product-induced gene expression
dancy (hits showing the same activities/sequences), Ufarté et al. (PIGEX) (Uchiyama and Miyazaki 2010) and metabolite-regulated
(2016) suggested multistep screening of several small libraries expression (METREX) (Williamson et al. 2005), which are all based
originating from different samples instead of focusing on only on the use of operon-trap green fluorescent protein (GFP) expres-
one large library from a unique source. A recent frontier in func- sion vectors: positive clones, co-expressing the desired enzyme
tional metagenomics is using micro- and picodroplet techniques activity and the GFP upon exposure to a target substrate (SIGEX),
for eDNA HTS: each single clone is encapsulated in water-in-oil product (PIGEX) or quorum-sensing signal molecule (METREX)
droplets together with a suitable substrate and lysis reagents, can be isolated by fluorescence-assisted cell sorting. By using
Berini et al. 11
PIGEX, a periplasmic α-amylase was discovered in a clone from of enzymes from metagenomes patented in the last decade.
a cow dung metagenomic library that fluoresced on maltose as Additionally, Table 3 indicates some successful cases of
substrate (Pooja et al. 2015). By-products from a wide variety of metagenome-sourced enzymes that were marketed. Herein, we
enzymes (e.g. cellulases, lipases and lyases) may be detected us- get insight into the activities and potential applications of few
ing a single cell-based reporter system in which GFP expression (the most interesting ones from our point of view) among the 332
is activated by the presence of phenyl compounds (Choi et al. metagenome-sourced enzymes recently discovered (Table 1, Fig.
2014), as in the case of a psychrophilic alkaline phosphatase 2D). Although the majority of them could be considered useful
from tidal flat sediment (Lee et al. 2015). variants of already known protein families, some of the biocat-
Alternatively to functional metagenomics, genetic screening alysts reported in Table 1 belong to previously uncharacterized
is gene expression independent. It is traditionally based on the enzyme families or subfamilies, with unconventional structures
use of PCR with primers specific for conserved regions of the and active site architectures pointing to novel catalytic mecha-
genes being targeted, which for enzymes are usually the cat- nisms.
alytic domains (Ekkers et al. 2012; Coughlan et al. 2015). It can
Table 2. Examples of metagenome-sourced enzymes patented in the last decade and their suggested applications.
Patent
Enzyme Source Patent number filling date Suggested applications
Cellulase/hemicellulase
Xylanase Hot spring EP2990482 A1 2014 Biofuel production from
lignocellulosic biomasses
Cellobiohydrolase Hot spring EP2980212 A1 2013 Biofuel production from
lignocellulosic biomasses
Cellulase Bovine WO2014142529 A1 2013 Biofuel production from
rumen lignocellulosic biomasses; fiber,
detergent, feed, food, pulp, paper
production
Among other industrially relevant GHs, a handful of α- tritional potential (Cretoiu et al. 2015). In 2014, we identified
amylases (Table 1, Fig. 2D) with a potential for applications in a chitobiosidase by combining genetic and functional screen-
starch processing and detergent, food and pharmaceutical pro- ings applied to a suppressive soil metagenome, now under de-
duction were discovered by functional screening of metage- velopment as a potential biocontrol agent thanks to its an-
nomic libraries from cow and pygmi loris dung (Xu et al. 2014; tifungal activity (Hjort et al. 2014; Berini et al. 2016, 2017).
Pooja et al. 2015), Greenland submarine ikaite columns (Vester, In 2015, the PCR-based analysis of a library from a chitin-
Glaring and Stougaard 2014) and Hermetia illucens gut microflora amended agricultural soil led us to discover a novel halophilic
(Lee et al. 2016b), this last being stable also in polar organic sol- chitinase, with the potential to be exploited for the treat-
vents and non-ionic detergents. ment and valorization of seafood wastes (Cretoiu et al. 2015).
Chitinases, which are active on chitin—the second most An interesting enzyme for its potential application in pro-
abundant biopolymer after lignocellulose—recently attracted ducing chitosan from chitin is the chitin deacetylase that
attention for the production of chitin derivatives (chitosan has been identified in Arctic Ocean deep-sea sediments (Liu
and chitooligosaccharides) with high pharmaceutical and nu- et al. 2016).
Berini et al. 13
Table 3. Examples of metagenome-sourced enzymes commercialized bed bacterial community (Strachan et al. 2014) and activated
by BASF Enzymes LLC (https://www.basf.com). sludge-associated microbiome (Kimura and Kamagata 2016).
More recently, we reported on the isolation and characterization
Commercial
of the first metagenome-sourced acidobacterial MCO, with high
name Enzyme class Applications
tolerance to salt and heat (Ausec et al. 2017).
Luminase Xylanase Pulp biobleaching in paper
production Proteases
Fuelzyme α-Amylase Fuels and industrial-use alcohols
production Five metagenomic studies led to the identification of novel pro-
Pyrolase 160, Cellulase Secondary oil and gas recovery teases, one of the most widely used industrial classes of en-
Pyrolase 200 zymes in detergent, leather and food processing, peptide and
Phyzyme XP Phytase Additive for livestock feed drug synthesis, brewing and wastewater treatment (Table 1,
Fig. 2D). Among them, two serine proteases from Yucatán under-
Oxidoreductases CONCLUSIONS
Oxidoreductases represent a heterogeneous group of enzymes Almost unheard of only 10/15 years ago in enzyme biotech-
with multiple applications in pharmaceutical and food in- nology, metagenomics is now rapidly developing as a mean
dustries and in bioremediation. Recently discovered oxidore- of encrypting novel biocatalysts from environmental samples.
ductases (Table 1, Fig. 2D) include five soil-sourced dioxyge- Technical challenges still exist, though, that are connected
nases with bioremediation potential (Chemerys et al. 2014; dos to screening procedures and heterologous expression of
Santos et al. 2015) and the first metagenome-sourced d-amino metagenome-derived enzymes, limiting the great potential
acid oxidase with potential application in the biosynthesis of the of functional metagenomics. Advances in HTS and NGS pave
antibiotic intermediate of 7-aminocephalosporanic acid from the way to make metagenomics more efficient and efficacious
cephalosporin C (Ou et al. 2015). Belonging to this enzymatic in discovering potential industrially valuable biocatalysts.
class are multi-copper oxidases (MCOs), enzymes with a broad We envisage that very soon, metagenome approaches will be
range of activity on both phenolic and non-phenolic substrates, mature enough to substantially impact biobased industrial
which attract attention due to their involvement in degrading production. The experience on how to build and screen metage-
lignocellulose biomasses. Table 1 includes two MCOs from coal- nomic libraries is becoming a common shared knowledge in
14 FEMS Microbiology Letters, 2017, Vol. 364, No. 21
the scientific community aiming at developing novel biocat- Berini F, Caccia S, Franzetti E et al. Effects of Trichoderma viride
alysts to replace chemical processes. A proof of this is the chitinases on the peritrophic matrix of Lepidoptera. Pest
list of the 332 metagenome-sourced enzymes identified and Manag Sci 2016;72:980–9.
characterized in the past three years. Scientists and industries Berini F, Presti I, Beltrametti F et al. Production and characteriza-
may select from this pool enzymes that may be translated to tion of a novel antifungal chitinase identified by functional
industrial development for an incredible variety of industrial screening of a suppressive-soil metagenome. Microb Cell Fact
applications. Interestingly, only very few metagenome-sourced 2017;16:16.
enzymes have already undergone optimization through di- Bhat A, Riyaz-Ul-Hassan S, Srivastava N et al. Molecular cloning
rected evolution strategies, and this means that there is room of rhodenase gene from soil metagenome of cold desert of
for further improving their protein backbones to fulfill industrial North-West Himalayas: sequence and structural features of
requirements. the rhodanese enzyme. 3 Biotech 2015;5:513–21.
Binda E, Marcone GL, Berini F et al. Streptomyces spp. as efficient
expression system for a d,d-peptidase/d,d-carboxypeptidase
ACKNOWLEDGEMENTS
Culligan EP, Sleator RD, Marchesi JR et al. Metagenomics and Fu L, He Y, Xu F et al. Characterization of a novel thermostable
novel gene discovery: promise and potential for novel thera- patatin-like protein from a Guaymas basin metagenomic li-
peutics. Virulence 2014a;5:399–412. brary. Extremophiles 2015;19:829–40.
Culligan EP, Sleator RD, Marchesi JR et al. Metagenomic identifi- Gabor EM, Alkema WB, Janssen DB. Quantifying the accessibil-
cation of a novel salt tolerance gene from the human gut mi- ity of the metagenome by random expression cloning tech-
crobiome which encodes a membrane protein with homol- niques. Environ Microbiol 2004;6:879–86.
ogy to a brp/blh-family β-carotene 15,15’-monooxygenase. Gaida SM, Sandoval NR, Nicholaou SA et al. Expression of het-
PLoS One 2014b;9:e103318. erologous sigma factors enables functional screening of
De Santi C, Altermark B, Pierechod MM et al. Characteriza- metagenomic and heterologous genomic libraries. Nat Com-
tion of a cold-active and salt tolerant esterase identified by mun 2015;6:7045.
functional screening of arctic metagenomic libraries. BMC Gao G, Wang A, Gong BL et al. A novel metagenome-derived gene
Biochem 2016;17:1. cluster from termite hindgut: encoding phosphotransferase
De Santi C, Ambrosino L, Tedesco P et al. Identification and char- system components and high glucose tolerant glucosidase.
Kanokratana P, Eurwilaichtr L, Pootanakit K et al. Identifica- Li L, Li G, Cao LC et al. Characterization of the cross-linked en-
tion of glycosyl hydrolase from a metagenomic library of zyme aggregates of a novel β-galactosidase, a potential cat-
microflora in sugarcane bagasse collection site and their alyst for the synthesis of galacto-oligosaccharides. J Agr Food
cooperative action on cellulose degradation. J Biosci Bioeng Chem 2015;63:894–901.
2015;119:384–91. Li PY, Ji P, Li CY et al. Structural basis for dimerization and catal-
Katz M, Hover BM, Brady SF. Culture-independent discovery of ysis of a novel esterase from the GTSAG motif subfamily
natural products from soil metagenomes. J Ind Microbiol Biot of the bacterial hormone-sensitive lipase family. J Biol Chem
2016;43:129–41. 2014a;289:19031–41.
Katzke N, Knapp A, Loeschcke A et al. Novel tools for the func- Li Y, Liu N, Yang H et al. Cloning and characterization of a new
tional expression of metagenomic DNA. Methods Mol Biol β-glucosidase from a metagenomic library of rumen of cattle
2017;1539:159–96. feeding with Miscanthus sinensis. BMC Biotechnol 2014b;2:14–
Kellner H, Luis P, Portetelle D et al. Screening of a soil metatran- 85.
scriptomic library by functional complementation of Saccha- Liebl W, Angelov A, Juergensen J et al. Alternative hosts
Martini VP, Glogauer A, Müller-Santos M et al. First co-expression Okano H, Kanaya E, Ozaki M et al. Structure, activity, and stabil-
of a lipase and its specific foldase obtained by metagenomics. ity of metagenome-derived glycoside hydrolase family 9 en-
Microb Cell Fact 2014;13:171. doglucanase with an N-terminal Ig-like domain. Protein Sci
Maruthamuthu M, Jiménez DJ, Stevens P et al. A multi-substrate 2015b;24:408–19.
approach for functional metagenomics-based screening for Okano H, Ozaki M, Kanaya E et al. Structure and stability of
(hemi)cellulases in two wheat straw-degrading microbial metagenome-derived glycoside hydrolase family 12 cellulase
consortia unveils novel thermoalkaliphilic enzymes. BMC Ge- (LC-CelA) a homolog of Cel12A from Rhodothermus marinus.
nomics 2016;17:86. FEBS Open Bio 2014;4:936–46.
Masuch T, Kusnezowa A, Nilewski S et al. A combined bioinfor- Ou Q, Liu Y, Deng J et al. A novel d-amino acid oxidase from a
matics and functional metagenomics approach to discover- contaminated agricultural soil metagenome and its charac-
ing lipolytic biocatalysts. Front Microbiol 2015;6:1110. terization. Anton Leeuw 2015;107:1615–23.
Matsuzawa T, Kaneko S, Yaoi K. Screening, identification, Patel AB, Patel AK, Shah MP et al. Isolation and characterization
and characterization of a GH43 family β-xylosidase/α- of novel multifunctional recombinant family 26 glycoside hy-
Ramı́rez-Escudero M, Del Pozo MV, Marı́n-Navarro J et al. Strachan CR, Singh R, Vanlnsberghe D et al. Metagenomic scaf-
Structural and functional characterization of a ruminal fold enable combinatorial lignin transformation. P Natl Acad
β-glycosidase defines a novel subfamily of glycoside hydro- Sci USA 2014;111:10143–8.
lase family 3 with permuted domain topology. J Biol Chem Stroobants A, Portetelle D, Vandenbol M. New carbohydrate-
2016;291:24200–14. active enzymes identified by screening two metagenomic li-
Rashamuse K, Ronneburg T, Sanyika W et al. Metagenomic min- braries derived from the soil of a winter wheat field. J Appl
ing of feruloyl esterases from termite enteric flora. Appl Mi- Microbiol 2014;117:1045–55.
crobiol Biot 2014;98:727–37. Su J, Zhang F, Sun W et al. A new alkaline lipase obtained from
Rashamuse K, Sanyika Tendai W, Mathiba K et al. Metagenomic the metagenome of marine sponge Ircinia sp. World J Microb
mining of glycoside hydrolases from the hindgut bacterial Biot 2015;31:1093–102.
symbionts of a termite (Trinervitermes trinervoides) and the Sudan AK, Vakhlu J. Isolation and in silico characterization
characterization of a multimodular β-1,4-xylanase (GH11). of novel esterase gene with β-lactamase fold isolated
Biotechnol Appl Biochem 2017;64:174–86. from metagenome of north western Himalayas. 3 Biotech
Vester JK, Glaring MA, Stougaard P. Discovery of novel enzymes Xing MN, Zhang XZ, Huang H. Application of metagenomic tech-
with industrial potential from a cold and alkaline environ- niques in mining enzymes from microbial communities for
ment by a combination of functional metagenomics and cul- biofuel synthesis. Biotechnol Adv 2012;304:920–9.
turing. Microb Cell Fact 2014;13:72. Xu B, Yang F, Xiong C et al. Cloning and characterization of a
Wang M, Lai GL, Nie Y et al. Synergistic function of four novel novel α-amylase from a fecal microbial metagenome. J Mi-
thermostable glycoside hydrolases from a long-term en- crobiol Biot 2014;24:447–52.
riched thermophilic methanogenic digester. Front Microbiol Yan W, Li F, Wang L et al. Discovery and characterization of a
2015a;6:509. novel lipase with transesterification activity from hot spring
Wang Q, Luo Y, He B et al. Characterization of a novel xylanase metagenomic library. Biotechnol Rep (Amst) 2017;14:27–33.
gene from rumen content of Hu sheep. Appl Biochem Biotech Yang C, Xia Y, Qu H et al. Discovery of new cellulases from the
2015b;177:1424–36. metagenome by a metagenomics-guided strategy. Biotechnol
Wang SD, Guo GS, Li L et al. Identification and characteriza- Biofuels 2016;9:138.
tion of an unusual glycosyltransferase-like enzyme with Zarafeta D, Moschidi D, Ladoukakis E et al. Metagenomic mining