Professional Documents
Culture Documents
Bioinformatics Analysis of Metagenomics Data of Biogas-Producing Microbial Communities in Anaerobic Digesters: A Review
Bioinformatics Analysis of Metagenomics Data of Biogas-Producing Microbial Communities in Anaerobic Digesters: A Review
Bioinformatics Analysis of Metagenomics Data of Biogas-Producing Microbial Communities in Anaerobic Digesters: A Review
a
Department of Chemical and Biomolecular Engineering, National University of Singapore, 4 Engineering Drive 4, S117576 Singapore, Singapore
b
NUS Environmental Research Institute, National University of Singapore, 1 Create Way, Create Tower #15-02, S138602 Singapore, Singapore
Keywords: Complex microbial communities in anaerobic digestion (AD) system play a vital role in the production of biogas.
Anaerobic digestion An in-depth understanding of the microbial compositions, diversity/similarity, metabolic networks, functional
Microbial communities gene patterns, and relations between biodiversity and system functions at the genome level could help to op-
Bioinformatics timize microbial productivity and contribute to enhancement of AD process. The study of microbial communities
Metagenomics
has been revolutionized in recent years with the development of high-throughput sequencing technologies.
Pyrosequencing
Artificial neural network
Analysis of high-throughput sequencing data and a suitable bioinformatics analysis approach therefore plays a
very critical role in the investigation of microbial metagenome. The present article reviews the overall procedure
of processing metagenomics data of microbial communities for revealing metagenomics characterization using
bioinformatics approaches. This includes (1) introduction of application case summary, (2) DNA extraction and
high-throughput pyrosequencing, (3) processing metagenomics data using function-based bioinformatics plat-
forms and tools, and (4) several specific bioinformatics analysis of anaerobic microbial communities. Key
findings on anaerobic digestion via bioinformatics analysis are summarized. Limitations and future potential of
bioinformatics approaches for analysis of metagenomics information of microbial communities are also dis-
cussed, with the hope of promoting its further development. Finally, a big-data-based precision fermentation
platform using artificial neural network is proposed for integrating the bioinformatics data of microbial com-
munities with performance of anaerobic digesters to facilitate the usage of huge metagenomics data.
⁎
Corresponding author.
E-mail address: chelohkc@nus.edu.sg (K.-C. Loh).
https://doi.org/10.1016/j.rser.2018.10.021
Received 10 May 2018; Received in revised form 27 July 2018; Accepted 17 October 2018
Available online 02 November 2018
1364-0321/ © 2018 Elsevier Ltd. All rights reserved.
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
Fig. 1. Number of annual journal publications from 2008 to 2017 on “pyr- 2008, consistent with the statistical results in Fig. 1. Since then,
osequencing” and “anaerobic” (based on search results from (A) Scopus data- bioinformatics analysis based on metagenomics sequencing has been
base and (B) Web of Science database). Trend lines are added via linear fitting used by many researchers to investigate microbial communities in
of the data. anaerobic digesters under various operating conditions. Nevertheless,
much of the related studies are diverse, scattered and not consolidated,
conditions. For instance, it is now known that there is a clear correla- let alone identify the state-of-the-art. An appropriate review in this
tion between taxonomic and functional genes patterns of anaerobic topic seeks to fill this gap.
microorganisms in biogas-producing digesters [12]. Moreover, it has This paper provides a comprehensive technical explanation of the
been established that both taxonomic and functional patterns can be procedure of using bioinformatics approaches to analyze metagenomics
influenced by environmental variables such as digester configuration, data of microbial communities in biogas-producing digesters (Fig. 2),
feedstock components, temperature, organic loading rate (OLR), hy- with the hope of improving biogas production. A summary of the ap-
draulic retention time (HRT), and free ammonia concentration [13–15]. plication case was first presented in Section 2. This was followed by a
Additionally, based on the analysis of functional genes by metage- general introduction for the DNA extraction and high-throughput se-
nomics studies and network-based approaches, corresponding meta- quencing methods. A comprehensive review of bioinformatics analysis
bolic pathways can be estimated, consequently pointing to the identi- of anaerobic microbial communities, including characterization of
fication of the actual dominant metabolic pathways and mechanisms in taxonomic compositions, alpha and beta diversity analysis, taxonomic
the biogas digesters [16,17]. pattern differences and similarities, multivariate statistical analysis and
Notwithstanding, how to effectively understand the underlying microbial gene functional patterns analysis follows in Section 3. The
biological features of microbial communities in anaerobic digesters function-based bioinformatics platforms and software tools are also
from large-scale sequencing data and how to use this knowledge to discussed. Taking advantage of genome sequence information, the
maximize the biogas production are still big challenges, due to the latest progress related to anaerobic digestion operations made with
extremely complex microbial communities, immature gene sequence bioinformatics approaches is reviewed in Section 4. Finally, the lim-
analysis technology (e.g. standard procedure, platform and software itations and prospects of bioinformatics approaches for analysis of mi-
tools, storage space, and process speed), costly deep sequencing of crobial metagenomics are alluded to in Section 5, which also discusses a
microbial metagenome [18], interpretation of microbial diversity data big-data-based artificial intelligence system via artificial neural net-
containing methodological flaws [19], and fragmented analysis of works for integrating information from various aspects of digester
various microbial communities. Recently, Ju and Zhang [20] reported a configuration, operation parameters, feedstock characteristics, process
rigorous experimental design and bioinformatics analysis procedure for performance, and microbial communities.
the application of metagenomics in environmental sciences and bio-
technology, with the aim to facilitate more extensive application of 2. Summary of application case and metagenomics data collection
metagenomics in the microbe-environment interactions. Although the of microbial communities
metagenomics sequencing has been conducted on various physical en-
vironments (e.g. soil [21], sea [22], hydrogen bioreactor [23], and 2.1. Application case summary and analysis
petroleum hydrocarbon contaminated site [24]), the first metagenomics
analysis of microbial communities in biogas digesters was reported in Partial case summary of using bioinformatics methods to investigate
111
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
Fig. 3. Statistical analysis of 82 research papers based on (A) feedstock, (B) country and (C) operation temperature.
microbial diversity and dynamics in biogas-producing digesters is pre- good repeatability for a digestate sample. Digestate may be rinsed with
sented in Table S1 of the Supplemental material. Eighty two typical phosphate buffer solution and then stored at −20 °C for long-term
research papers in Table S1 in the past decade were statistically ana- storage. Prior to DNA extraction, the digestate could be centrifuged for
lyzed in terms of feedstock, country and operation temperature, results 10 min and the supernatant should be decanted. Then, a certain amount
of which are presented in Fig. 3. (0.3–0.5 g) of wet weight sludge material is used to extract the DNA of
As shown in Fig. 3A, the most dominant feedstock was food waste, microbial communities. Table 1 summarizes the frequently-used DNA
covering approximately 30% of total feedstocks, followed by sludge extraction methods of microbial communities in anaerobic digesters. As
(22.8%), manure (20.3%), agricultural and horticulture waste (15.2%). shown in Table 1, the most frequently-used method for DNA extraction
Country-based analysis (Fig. 3B) indicated that the most active coun- has been using commercial kits, amounting to more than 90% among
tries (> 3%) involving the microbial community analysis in biogas- all the related studies. CTAB (cetyltrimethylammonium bromide) based
producing digesters were China, Denmark, Korea, Germany, USA, UK, DNA extraction method (6%) is another method for DNA extraction. In
Singapore, Russia and Canada. Meanwhile, analysis on basis of AD terms of commercial kits, more than 60% products are provided by
temperature demonstrated that percentage of mesophilic AD was 2.5- MoBio Laboratories (43%, USA) and MP Biomedicals (25%, USA), fol-
fold higher than that of thermophilic AD (Fig. 3C), which can be at- lowed by OMEGA (6%, USA), Macherey-Nagel (6%, Germany), Preci-
tributed to the fact that thermophilic AD consumed more energy for sion System Science (6%, Japan), and Intron biotechnology (2%,
maintaining higher temperature. However, thermophilic AD was re- Korea). Taken together, the USA has the overwhelming share of the
ported to have several advantages, such as preventing foaming by re- DNA kits market. Upon completion of extraction, the quality of the
ducing extracellular polymers [25], enhancing easily-degradable matter extracted DNA can be checked by determining its absorbance at 260 nm
digestion [26], and allowing higher OLR and methane productivity and 280 nm using equipment such as NanoDrop 1000 (Thermo Fisher
[27]. It is foreseeable that more of thermophilic AD will be conducted Scientific, Waltham, MA) [38]. The typical purity and concentration of
in the future. In addition, it is also noteworthy that several interesting the extracted DNA are ~ 1.9 (Abs 260 nm/Abs 80 nm) and ~ 50 ng/uL,
development trends have been observed over the past decade, from respectively [39]. Finally, the purified DNA can be stored in TE (Tris-
operating single-stage digesters to two or three-stage digesters [28–30], EDTA) buffer at −20 °C until the next step.
from single-country research to multinational cooperative research
[12,31–33], and from simple analysis targets like microbial composi- 2.3. Frequently-used DNA sequencing methods
tions to complex analysis targets like interaction mechanisms
[16,34–37]. Currently, next-generation sequencing (NGS) technologies are able
to quickly and economically perform qualitative and quantitative ana-
2.2. DNA extraction, verification and storage lysis of the microbial communities in various AD systems. The most
frequently-used DNA sequencing method is based on the Roche GS FLX
DNA extraction is a prerequisite for performing bioinformatics 454 pyrosequencing platform [65], amounting in excess of 90% of the
analysis of metagenomics data of microbial communities. The quality of surveyed studies. The rest of the NGS techniques are based on Illumina
DNA would be greatly affected by the different sampling methods [20]. (Solexa) sequencing platform [29,30], ABI SOLiD™ short-read DNA
Consequently, the appropriate choice of sampling method of anaerobic sequencing platform [35], ABI analysis reagents (Big Dye terminator
digestate is essential. In order to reduce the experimental errors, it is mix) coupled with Applied Biosystems 3130xl Genetic Analyzer), and
recommended to use the same sampling method in the whole experi- Ion PGM™ Hi-Q™ sequencing kit (Thermo Fisher Scientific) coupled
ment. At the same time, repeated trials need to be carried out to obtain with Ion PGM™ sequencer operated by Torrent Suite™ software [75].
112
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
Table 1
Frequently-used DNA extraction methods of microbial communities in anaerobic digesters.
DNA extraction methods (percentages) Product brands Refs.
MoBio PowerSoil DNA extraction kit (43%) MoBio Laboratories, CA, USA [29,30,38–56]
Fast DNA SPIN kit for soil (25%) MP Biomedicals, Illkirch, France [57]
MP Biomedicals, Australia [58]
MP Biomedicals, Germany [59,60]
MP Biomedicals, USA [61–66]
Q-Bio gene, Australia [67]
Q-Bio gene, Carlsbad, CA, USA [68]
E.Z.N.A Soil DNA kit (6%) OMEGA, USA [69–71]
NucleoSpin Tissue kit + NucleoSpin soil kit (6%) Macherey-Nagel, Germany [37,72,73]
Automated nucleic acid extractor (kit) (6%) Magtration System 6GC, Precision System Science, Chiba, Japan [74–76]
CTAB (cetyltrimethylammonium bromide) based DNA extraction method (6%) – [34,35,77]
I-genomic BYF DNA extraction kit (2%) Intron biotechnology, Korea [78]
DNA extraction kit (2%) Felix bio-tech, USA [79]
ZR soil microbe DNA kit (2%) Zymo Research, Orange, CA, USA [80]
PCR reaction mix (2%) Clontech, Mountain View, CA, USA [81]
Currently, although the 454 pyrosequencing technique is increasing in bioinformatics platforms and tools in microbial metagenomics-based
its use rapidly, the reference data for temporal profiling of anaerobic studies are summarized in Table 2. Although the surveyed literature
microbial communities is still limited [82]. Thus, more related studies pool might be incomplete and slightly biased, information in Table 2
on fundamental analysis of microbial metagenome and establishment of can still shed light on some general trends in the literature and provide
storage platform should be conducted in detail. an example of the usefulness of bioinformatics analysis to analyze a
microbial community.
113
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
Table 2
Frequently-used bioinformatics platforms and tools in microbial metagenomics-based studies.
Platform/software Programme/module Function Refs
114
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
2% of the total 231 OTUs. This indicated that the changed parameter
(HRT) was strongly linked to the variation of bacterial dominance.
Table 3
Equation and index meaning of parameters for alpha diversity analysis.
Index Equation Note Ref.
Chao1 index Schao1 and Sobs are the estimated and observed OUTs number, [101]
respectively; n1 and n2 are the number of OTUs with 1 and 2
n (n 1)
Schao1 = Sobs + 1 1 sequences, respectively.
2(n2+ 1)
ACE index ni is the number of OTUs containing i sequences; Srare is the number [102]
of OTUs containing ‘abund’ number of sequences or less; Sabund is the
S n1
SACE = Sabund + rare + 2
ACE (for ACE 0.8); number of OTUs containing more than ‘abund’ number of sequences;
CACE CACE
‘abund’ is threshold value of dominant OTU.
Srare n1 2
SACE = Sabund + + ACE (for ACE 0.8).
CACE CACE
n1
CACE = 1
Nrare
abund
Nrare = i ni
i= 1
abund
2 Srare i = 1 i(i 1)ni
ACE = max 1, 0
CACE Nrare (Nrare 1)
abund
2 2
Nrare (1 CACE) i = 1 i(i 1)ni
ACE = max ACE 1+ , 0
Nrare (Nrare CACE)
Shannon- Wiener Sobs is actually observed OTUs number; ni is the number of sequences [103]
index Sobs in No. i of OTU; N is total number of sequences.
ni ni
Hshannon = ln
i= 1
N N
Simpson index Sobs is actually observed OTUs number; ni is the number of sequences [104]
Sobs in No. i of OTU; N is total number of sequences.
i=1 ni (ni 1)
Dsimpson =
N(N 1)
Good's coverage n1 is the number of OTUs with 1 sequence; N is total number of [105]
n sequences.
C=1 1
N
115
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
this analysis and sufficient sequence depth was achieved at these levels.
Nevertheless, the numbers of OTUs at the 99–100% levels were still
rising, signifying much more undetected diversity at this level [97].
Fig. 7. Schematic diagram of multi-level species taxonomy of the anaerobic microbial community using Krona [111].
116
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
Fig. 8. Taxonomic classification of the dominant bacterial groups from a three-stage digester at the genus levels (data adapted from ref. [30] with permission/license
granted by the publisher).
found that the most dominant populations (Fig. 8) in the first stage respectively, indicating that different bacteria were selectively enriched
(hydrolysis), the second stage (acidogenesis) and third third stage in different stages of the anaerobic digestion process.
(methanogenesis) were Trichococcus, Aminobacterium and Levilinea, In this same three-stage digester, co-digestion of food waste and
Fig. 9. Statistical analysis of the differences between relative abundance (Data adapted from ref. [99] with permission/license granted by the publisher).
117
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
Table 4
Frequently-used multivariate analysis techniques for microbial communities.
Multivariate analysis Raw-data-based data sets Distance-based data sets
a
Unconstrained ordination refers to an ordination only using data of species compositions.
b
Constrained ordination refers to an ordination using data of species compositions and environmental parameters.
118
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
119
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
microbial genomes on three dimensions (genes, genomes and functions) sequencing was recently used to investigate the methanogens and
by using several tools like KEGG [127], InterPro [128], Pfam [129], and syntrophic bacterial species that were difficult to isolate and cultivate,
Gene Ontology [130]. Hitherto, albeit numerous bioinformatics ap- among microbial communities in biological biogas upgrading systems
proaches have been developed for gene functional annotations, part of with external hydrogen [140–143]. It has been found that addition of
them such as single genome gene prediction tools are not well suited for external hydrogen can selectively enrich both hydrogenotrophic me-
complex metagenomics datasets of anaerobic microbial communities thanogens and homoacetogenic species such as Acetobacterium woodii
due to the heterogeneous sequence composition, length and errors and Moorella thermoacetica that produce acetate from H2 and CO2
[131,132]. Undeniably, powerful gene annotation tools (e.g. IMG/M + [144,145]. More similar examples of using bioinformatics methods to
KEGG) can still be widely applied to locate protein coding regions, investigate the microbial dynamics were listed in Table S1 in the
which is one of the essential issues in microbial metagenomics studies. Supplementary material. In each case, the feedstock, AD operation
Such an example was the study by Campanaro et al. [84], which made conditions, reactor performance and major findings related to the mi-
use of bioinformatics tools (e.g. KEGG, COG and SEED) to profile crobial communities were provided.
functional interactions between microbial species involved in the AD
process and identify key microbial genomes encoding enzymes involved 4.2. Progress on understanding of microbial function and connectivity
in the specific metabolic pathways. Moreover, MEGAN can also be used
for functional analysis of multiple metagenomes based on the SEED Well understanding of the microbial function and connectivity is a
hierarchy and the KEGG pathways [86]. Thereby, it is expected that prerequisite to control and engineer the AD process with the aim to
further studies performed under different operational conditions (e.g. optimize its efficiency [12]. In recent years, lots of progress towards
different substrate and temperature) will allow, in the near future, an functional characterization and interactions of key microbial species in
enrichment of the anaerobic microbial genome database and corre- biogas-producing digesters has been made with the help of bioinfor-
sponding applications via the increasingly powerful software and al- matics platforms and tools (Section 3.7). For instance, metagenomics
gorithms. analysis and functional characterization of the biogas microbiome using
high throughput shotgun sequencing and a novel binning strategy were
4. Key findings on anaerobic digestion via bioinformatics analysis performed to disclose nearly one million genes and extract 106 mi-
crobial genomes [84]. As a result, several key microbial genomes en-
Many researchers have successfully applied bioinformatics analysis coding enzymes involved in metabolic pathways including amino acids
on microbial metagenomics from anaerobic digesters to identify fermentation, fatty acids degradation, carbohydrates utilization and
dominant bacteria and methanogenic archaea [12,84,133], monitor syntrophic acetate oxidation were identified [84]. Utilizing functional
microbial community shift [76,98,134], explore methanogenesis annotation methodology, consolidated with the KEGG Orthology da-
pathway [42,43], and elucidate correlations between digester perfor- tabase, a new uncultured archaeon associated with Methanomassilii-
mance, microbial community structures and pertinent environmental coccales was identified and was putatively having a methylotrophic
variables [33,135,136]. To help researchers in anaerobic digestion field methanogenic pathway [84]. Similarly, Campanaro et al. [146] eluci-
quickly know about the latest developments, the progress made mainly dated the microbial ecology in twelve thermophilic and mesophilic full-
on AD optimization, resilience and robustness, metabolic mechanisms, scale biogas plants via a genome-centric metagenomic approach. In this
microbial modeling and control as well as antibiotic resistance genes study, 132 Metagenome-Assembled Genomes (MAGs) were identified
analysis via bioinformatics approaches are summarized. using Prodigal (v2.6.2) software and their abundance profiles in dif-
ferent samples were analyzed with FOAM software [146]. The anno-
4.1. Progress on optimized AD processes tation study provided a clue for the identification of the most abundant
core members of the anaerobic digestion microbiome [84]. The net-
Identification of predominant bacteria and methanogenic archaea work analysis shed light on species-species associations and allowed
was most frequently conducted during the optimized AD processes via investigation of syntrophic interactions between core members
various enhancing strategies such as optimization of operational para- [16,146]. Furthermore, by investigating the comprehensive biochem-
meters [137], supplementation of additives [138], pretreatment of ical pathways in multiple samples and its link with different aspects of
substrates [139] and structure optimization of digesters [29,30], etc. information (e.g. environmental factors, genomes abundance profile
The identification results contributed a lot to explanation of the en- and predicted functional pathways), this study [146] revealed im-
hancing mechanisms of the diverse techniques. For instance, during portant relationships existing between the high-abundant core species
activated carbon enhanced anaerobic digestion of food waste, Zhang (e.g. acetate utilizers) of the thermophilic digesters, their metabolic
et al. [138] concluded that operation stability of anaerobic digestion pathways and the crucial operational parameters including feedstock,
process in lab- and pilot-scale digesters were greatly enhanced by ac- temperature, VFA concentration and methane yield, thus provided a
tivated carbon supplementation, and demonstrated that the abundance promising widely used strategy for the future studies to predict the
of the predominant phyla Firmicutes, Elusimicrobia and Proteobacteria functional role of microbes participating in anaerobic decomposition of
were selectively enhanced by 1.7–2.9 times due to supplementation of organic wastes.
activated carbon using pyrosequencing of 16 S rRNA gene. In another In terms of microbial connectivity, progress in connectivity of mi-
example of optimized anaerobic digestion of oil palm biomass by op- crobial communities during the anaerobic digestion process was re-
timizing of total solids (TS) contents, feedstock to inoculum (F:I) ratios cently reviewed by De Vrieze and Verstraete [147], who concluded that
and carbon to nitrogen (C:N), Suksong et al. [137] demonstrated that a richer knowledge base on microbial communication in anaerobic di-
the highest methane yield was achieved at TS content of 16%, C:N ratio gesters through quorum sensing and nanowires should be further es-
of 30:1 and F:I ratio of 2:1. The enhancement of AD process was at- tablished. To this end, methods to detect centers of concentrated ac-
tributed to the selective enrichment of bacteria Ruminococcus sp. and tivity and the highly active and well-connected species with a central
Clostridium sp. and enriched archaea Methanoculleus sp. [137]. Re- role in the anaerobic digestion process have to be firstly optimized
cently, by pyrosequencing analysis, it was also found that different [147]. In addition, light sheet fluorescence microscopy was recently
microbial communities in terms of hydrolyzing bacteria, acidogenic reported as a promising tractable tool for examining microbial com-
bacteria and methanogenic archaea, corresponding to hydrolysis, munities [148]. However, most of the current studies are based on
acidogenesis and methanogenesis, respectively, were selectively en- eukaryotic cell tracking experiments; the further studies on examining
riched in the three separate chambers of the three-stage anaerobic di- more and different microbial species and engineered strains that allow
gesters [29,30]. Additionally, the high throughput 16S rRNA amplicon delineation of how specific genetic factors affect microbial connectivity
120
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
remain to be conducted. community dynamics modeling and how the physicochemical and op-
erational parameters controlled these dynamics. For instance, to facil-
4.3. Progress on exploration of methanogenesis pathway itate rational control and intervention in microbial communities,
Hanemaaijer et al. [152] introduced systematically the modeling ap-
Sufficient understanding of methanogenesis pathways could sig- proaches for microbial community studies from metagenomics to in-
nificantly contribute to treating wastes and producing biogas with high ference of the community structure. They focused on two types of
efficiency through AD process. In this aspect, metagenomic sequencing models, a mathematical model that could integrate existing physiolo-
analysis coupled with radioisotopic analysis has been successfully ap- gical, genomic, and physicochemical information with metagenomics
plied to reveal the dominant methanogenic pathways in biogas reactors data and a simpler coarse-grained model that could solve the inference
treating either sludge or manure [12]. The results exemplified a dis- problem from the experimental data [152]. The former model has been
crepancy between the metabolic patterns revealed by metagenomics mainly studied and used due to its capability of maximizing information
analysis and metabolic pathways determined by radioisotopic analysis. content and predictive power. The latter model has received re-
Specifically, the aceticlastic methanogenic pathway was dominant in markably little attention, albeit it can potentially be optimized to be
radioisotopic experiments, while the hydrogenotrophic methanogenic more detailed genome-scale stoichiometric models that can act as het-
pathway was dominant in metagenomics analysis [12]. Similarly, stu- erogeneous data integrators. Similarly, Succurro et al. [153] provided
dies in metabolic pathways and networks of microbial communities an overview of mathematical models of natural microbial ecosystems
using bioinformatics analysis are available in the optimized AD sys- and emphasized that the choice of specific problem scale was a critical
tems. Zhang et al. [16] tracked the changes in biometabolic pathways point in the development of a theoretical description of a microbial
and the microbial network using the KEGG pathway analysis and community. A recent expert consensus is built that it is time to develop
taxonomic tree analysis in an optimized anaerobic digester for food more extensively hypothesis-driven methods, metagenomic microbial
waste by incorporating activated carbon. The KEGG pathway analysis interaction simulators (e.g. MetaMIS [154]) and mathematical in-
revealed that the propanoate metabolic pathway that converted pro- tegration tools (e.g. MetaTopics [155]) to advance the field of commu-
panoic acid to acetic acid became more prominent in activated carbon nity dynamics modeling from a purely descriptive representation to a
enhanced AD digesters. The biometabolic pathways of lipid metabolism sound biogeochemical theory [153,156–159]. More recently, a critical
and methane metabolism were also found to enhance [16]. Microbial review on mathematical modeling of microbial ecosystems was pub-
network analysis revealed that activated carbon promoted the pro- lished by Succurro and Ebenhöh [160], who focused on two major
liferation of syntrophic microbial species like Geobacter and Methano- classes of mathematical methods (constraint-based stoichiometric
saeta, forming a highly intensive syntrophic microbial network [16]. models and differential equation models) and introduced their recent
Recently, Jiang et al. [149] examined how a pyrosequencing technique applications to the study of microbial communities. The advantage of
using a fragment of the methyl Co-A reductase gene (mcrA) and a these two models is that they are deterministic approaches that can
radiolabelling technique using 14C labelled sodium acetate can be ap- effectively provide a macroscopic representation of biological systems
plied to investigate the relationship between total ammonia nitrogen from microscopic behaviors [160]. To go a step further, an integration
(TAN) and the dominant methanogen pathway by quantifying the modeling method was illustrated in order to understand emergent
percentage of methane formation through acetoclastic and syntrophic patterns in microbial systems and their dynamics regulated by diverse
acetate oxidation pathways in anaerobic digesters. The ratio between spatio-temporal phenomena [160]. Taken together, it was re-
14
CO2 and 14CH4 in high TAN (11.1 g/kg wet weight) digesters ranged commended that modelers and experimentalists should work together
from 2.1 to 3.0, showing 68–75% of methane was produced via the from the conceptual phases of the experimental design to guarantee
hydrogenotrophic route; whereas in low ammonia (0.2 g/kg wet exact integration of theory and experiments [153]. In this way, more
weight) digesters the ratio was 0.1–0.3, revealing 9–23% of methane common interdisciplinary languages can be developed that can help
was produced by the hydrogenotrophic route [149]. A quantitative unravel the mechanisms among the complex microbial interactions.
correlation between TAN and the dominant methanogen pathway can Knowledge obtained from the modeling of microbial communities
be further optimized and utilized to manipulate the acetoclastic and provides the possibility of optimizing the AD process by regulating the
syntrophic methanogenesis. During the acclimation to extremely high microorganisms. In this light, some progress in microbial community
ammonia levels (10 g NH4+-N /L) in continuous AD process, 16 S rRNA structure regulation has been made. Feedstock composition, digester
gene sequencing revealed a dramatic microbiome change throughout configuration, operating parameters (e.g. temperature), and environ-
the ammonia acclimation process [150]. A critical point of NH4+-N ment conditions are the leading driving factors for community structure
concentration for a clear shift from aceticlastic to complex and flexible changes [161]. For instance, in the anaerobic digesters of pig manure,
pathways was found to be 5 g/L [151]. At extreme ammonia levels the major genus at pH 7.0 was Methanocorpusculum whereas that was
(> 7 g NH4+-N /L), Clostridium ultunense, a syntrophic acetate oxi- Methanosarcina at both pH 6.0 and 8.0, indicating that predominant
dizing bacteria, alongside with hydrogenotrophic methanogen Metha- species in a microbial community can be controlled through pH [162].
noculleus spp. increased significantly, indicating strong hydro- Moreover, there has also been study focusing on developing early traffic
genotrophic methanogenic activity [150]. Additionally, effect of light system to avoid system failure through using alkalinity as an in-
temperature (55–65 °C) and hydraulic retention time (2–4 days) on dicator of process stability in an anaerobic digestion system [163–165].
methanogenesis pathways was studied in anaerobic digesters treating However, as imaging the complex three-dimensional multi-species
with waste activated sludge [58]. Stable isotopic signatures (δ13C) communities still presents considerable technical challenges, how such
analysis showed that elevated temperature stimulated syntrophic information can be used sufficiently for current AD applications is still
acetate oxidation [58]. Community analysis using 16S rRNA pyr- under development.
osequencing evidenced the dominance of Methanosarcina at 55–60 °C,
whereas a further increase to 65 °C led to loss of Methanosarcina, 4.5. Progress on identification and analysis of antibiotic resistance genes
alongside with reduced methane yield and an accumulation of VFAs
[58]. And the similar trend was also applied to the case of reducing the In order to properly define the risks posed by land application of
HRT to 2 days [58]. anaerobic digestate, high-throughput sequencing-based metagenomics
approaches have been successfully utilized to detect the antibiotic re-
4.4. Progress on modeling and control of microbial community dynamics sistance genes (ARGs) in biogas digesters. Lee et al. [166] characterized
the diversity and amounts of ARGs in representative organic wastes
In recent years, there has also been some progress made towards (manure, sludge and food waste-recycling wastewater) and found that
121
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
Fig. 12. A big-data-based precision fermentation platform using artificial neural network.
total relative amount of ARGs were highest in manure, followed by resulted in the increase in depth of sequencing, which has led to a
sludge and food waste-recycling wastewater. Diversity and mechanisms critical issue about metagenomics data storage. For instance, the 454
of ARGs were considerably different between substrates [166]. In a pyrosequencing technology can generally produce up to 1000,000 reads
recent study, Luo et al. [33] found that the total abundance of ARGs in per sequencing run (equivalent to around 0.7 GB), while HiSeq. 2500
digesters fed with manure and industrial wastes varied from 7 × 10-3 to can generate a much greater output (600–1000 GB per run) [172,173].
1.1 × 10-1 copy of ARGs/copy of 16 S rRNA gene. The samples obtained Further analysis of raw sequences will enlarge the amount of data by
from thermophilic biogas digesters exhibited a lower total abundance of ten to twenty times each analysis time [173]. Such a huge amount of
ARGs [33], which showed the superiority of thermophilic AD for ARGs data has become a challenge for bioinformatics analysis [132]. More-
removal [167,168]. More recently, by using Illumina MiSeq sequencing over, the NGS-based metagenome research on anaerobic microbial
and high throughput fluorescent quantitative PCR, antibiotic resistant communities is still in its infancy. Analysis of more microbial meta-
bacteria (ARB) and ARGs from pig manure to fields were monitored genomes newly sequenced is still required to provide statistical support
[169]. The results indicated that 83 ARGs and 3 transposons genes were for the findings derived from the previous studies, which also leads to
detected. Relative abundance of Macrolide-Lincosamide-Streptogramin the necessity for a larger storage space. Thus, a standard storage plat-
and tetracycline resistance genes reduced after anaerobic digestion. form should be established and used to investigate metadata of anae-
Nevertheless, the relative abundance of aminoglycoside, sulfa, florfe- robic microbial communities in the future research.
nicol, amphenicol and chloramphenicol resistance genes were enriched Secondly, many bioinformatics tools and methods for data proces-
by 52–270 times [169]. Long-term application of biogas residue con- sing have been borrowed from the fields of data mining, artificial in-
taminated with antibiotics in fields increased the number of ARB and telligence and statistical methods [174]. However, the characteristics of
the relative abundance of ARGs in the vegetable soil [169]. Hence, to microbial genomic data may differ significantly from those of the ori-
reduce the spreading of ARGs in the agricultural environment, more ginal data for which the tools were developed [174]. Though many
effective prevention and control measures remain to be explored. In computational methods have been successfully applied for genomic
another research performed in anaerobic digesters fed with microwave- data analysis, some challenges still exist. Currently, assembly and bin-
pretreated sewage sludge, most ARGs concentration tended to decrease ning of super-large scale of sequences remain an insurmountable hurdle
in pretreatment but enriched after AD [170]. Therefore, the microwave [83], which can be attributed to the limited computing capability and
pretreatment is a very promising technology that can reduce the specific internal storage capability. Furthermore, there exists a paradox
abundance of ARGs in bio-fertilizers made from anaerobic digestate. In between data processing speed and accuracy due to the fact that
addition, for enhancement characterization and quantification of ARGs taxonomic binning and assembly generally need several hours to sev-
in environmental metagenomes, an open online analysis pipeline, eral days [173]. Thus, on the premise of high enough accuracy, more
ARGs-OAP, was developed consisting of a database termed SARG ver- efforts are needed to develop new powerful bioinformatics methods and
sion 2.0 [171]. The new SARG database contains sequences not only effectively integrate supercomputing technology into current proces-
from CARD and ARDB databases, but also sequences from the latest sing tools to rapidly complete the analysis on microbial genomic data.
protein collection of the NCBI-NR database [171]. ARGs-OAP and the Thirdly, the relatively high cost of sequencing has led to the fact
database can be accessed through http://smile.hku.hk/SARGs, which that some metagenomics studies were performed lacking repeated trials
offer huge potential in respect to more comprehensive identification [20], which could compromise the data reproducibility, making it hard
and analysis of ARGs. to confirm whether the observed differences were significant or merely
artefacts of the analytical techniques. Therefore, sequencing in tripli-
cate is recommended for a more reliable data analysis. According to a
5. Limitations and prospects of bioinformatics analysis on
recent study from Campanaro et al. [175], reliability of the 16S rRNA
microbial metagenomics
amplicon sequencing results could be biased by two main effects, which
were the failure of universal primers to match all the 16S rRNA targets
Metagenome analyses and metatranscriptome analyses of biogas-
and the limited number of hypervariable regions (e.g. V4) studied.
producing microbial communities are rapidly developing in many
Thus, to improve the evaluation of abundance for crucial taxonomic
countries (Fig. 3B). However, there are several technical bottlenecks
groups, use of multiple marker genes and analysis at transcriptional
that need to be addressed before bioinformatics analysis on microbial
level are recommended [175]. Moreover, to date, most bioinformatics
metagenomics become a welcome tool.
analysis depended on sequence comparison against reference databases.
Firstly, the rapid development of sequencing techniques has
122
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
However, practices to use current databases for bioinformatics analysis application) that should be overcome before microbial communities can
can be problematic considering the potential incompleteness of data- be finely regulated for the enhanced and commercial biogas production
bases. Therefore, updated databases with better quality data are needed process. The present review indicates that most of the current chal-
to support stable and effective analysis of anaerobic microbial meta- lenges can be overcome by substantial collaboration among scholars
genomics information. from fields of biological sciences, environmental engineering, chemical
Fourthly, metagenomics techniques provide deeper insights into engineering and computer science. Nevertheless, more research ded-
microbial communities, but have been generally used for descriptive ication is needed to integrate these methods and tools as an essential
and explanatory approaches to reply the initial ‘who is there?’ question strategy for the development of molecular microbiology for industrial
[132]. The challenges that exist between lab-, pilot- and field-scale data applications. Herein, a big-data-based precision fermentation platform
and extrapolation are still existing. In fact, there is still a major gap using artificial neural network, specifically, is likely to be at the fore-
between what we can observe from the microbial communities via front of these developments. Hence, it is possible to maximize the
bioinformatics approaches and what species and gene functions we can methane yield in a commercial biogas plant using the precision fer-
manipulate, due to the fact that a microbial ecosystem is an extremely mentation platform coupling with versatile bioinformatics approaches
complex network of dynamic spatio-temporal interactions among mi- in the near future.
croorganisms as well as between microorganisms and the ambient en-
vironment [176]. To fill this gap, more insights should be provided into Acknowledgements
how the genetic properties and the dynamic connectivity between in-
dividual microorganisms generate their dynamic activities. Further- We greatly thank the reviewers for their review and constructive
more, concentrations and fluxes of nutrients within the microbial comments on this manuscript. This research project was funded by the
communities should also be investigated thoroughly. National Research Foundation, Prime Minister's Office, Singapore under
Last but not least, the current depth of mining on metagenomics its Campus for Research Excellence and Technological Enterprise
data of anaerobic microorganisms, mainly focusing on straightforward (CREATE) Programme. The authors greatly thank Associate Professor
sequence similarity searches, might not be satisfactory since microbial Yen Wah Tong from NUS for his help in facilitating some of the re-
genomes ought to be the fundamental basis for microbial ecology search. Le Zhang would like to thank the China Scholarship Council
[177]. For deeper data mining, the combination of metagenomics-based (CSC) for financial support. For all copyrighted figures including
bioinformatics methods with key approaches in biotechnology (e.g. T- Figs. 5, 6, 8, 9, 10 and 11, the permission attained from the publishers
RFLP, FISH, PCR-DGGE and qPCR [178]) is highly recommended for adaptation and reuse in the present work has been acknowledged.
[5,23]. Even so, there still may be a long way to reach the ideal point
where the digester performance can be finely regulated based on real- Appendix A. Supplementary material
time analysis of metagenomics information of anaerobic microbial
communities. To fulfill such endeavors and continue to provide in- Supplementary data associated with this article can be found in the
sightful solutions, a big-data-based precision fermentation platform online version at doi:10.1016/j.rser.2018.10.021.
using artificial neural network is proposed (Fig. 12). Through an arti-
ficial intelligence (AI) workflow, the huge data (e.g. elemental com- References
ponents of feedstock, operational parameters, and metagenomics data
of microbial communities) derived from substantial previous AD prac- [1] Kougias PG, Angelidaki I. Biogas and its opportunities-a review. Front Environ Sci
tices can be formulated into convolutional neural networks, which can Eng 2018;12:1–12.
[2] European Commission, Energy Strategy and Energy Union. A policy framework for
be further optimized into valuable networks such as deep Q network climate and energy in the period from 2020 to 2030 [COM(2014) 15], 〈https://
using the experimental data as training information. In the valuable eur-lex.europa.eu/legal-content/EN/TXT/?Uri=COM%3A2014%3A15%3AFIN〉;
network, with new feedstock components input, a best combination of 2014 [Accessed 20 July 2018].
[3] Tabassum MR, Xia A, Murphy JD. Potential of seaweed as a feedstock for renew-
controlling parameters, digester type and microbial community in- able gaseous fuel production in Ireland. Renew Sustain Energy Rev
formation can be recommended to facilitate the practical AD opera- 2017;68:136–46.
tions. Although there are only a few research reports [179–181] based [4] Anyaoku CC, Baroutian S. Decentralized anaerobic digestion systems for increased
utilization of biogas from municipal solid waste. Renew Sustain Energy Rev
on the implementation of AI approaches in the past decade, the im- 2018;90:982–91.
plementation of AI in the simulation, control and optimization of [5] Ju F, Lau F, Zhang T. Linking microbial community, environmental variables, and
anaerobic digestion possesses great potential in the near future, parti- methanogenesis in anaerobic biogas digesters of chemically enhanced primary
treatment sludge. Environ Sci Technol 2017;51:3982–92.
cularly with the fast development of novel and hybrid AI approaches.
[6] Wang P, Wang H, Qiu Y, Ren L, Jiang B. Microbial characteristics in anaerobic
For instance, a novel algorithm for designing minimal microbial com- digestion process of food waste for methane production-a review. Bioresour
munities with desired metabolic capacities was developed [182]. Spe- Technol 2018;248:29–36.
cifically, this algorithm can help researchers identify minimal sets of [7] Hassa J, Maus I, Off S, Pühler A, Scherer P, Klocke M, et al. Metagenome, meta-
transcriptome, and metaproteome approaches unraveled compositions and func-
microbial species that can collectively provide the enzymatic capacity tional relationships of microbial communities residing in biogas plants. Appl
required to synthesize a group of desired target metabolites from a Microbiol Biotechnol 2018:1–19.
predefined set of available substrates [182]. This finding may present a [8] Kim E, Lee J, Han G, Hwang S. Comprehensive analysis of microbial communities
in full-scale mesophilic and thermophilic anaerobic digesters treating food waste-
first step towards the goal that various microbial communities can be recycling wastewater. Bioresour Technol 2018;259:442–50.
manipulated for the anaerobic digesters’ optimization and industrial [9] Keegan KP, Glass EM, Meyer F. MG-RAST, a metagenomics service for analysis of
applications. microbial community structure and function. Microb Environ Genom
2016:207–33.
[10] Namiki T, Hachiya T, Tanaka H, Sakakibara Y. MetaVelvet: an extension of Velvet
6. Conclusions assembler to de novo metagenome assembly from short sequence reads. Nucleic
Acids Res 2012;40:e155.
[11] Laserson J, Jojic V, Koller D. Genovo: de novo assembly for metagenomes. J
In the past decade, considerable research on metagenomics of Comput Biol 2011;18:429–43.
biogas-producing microbial communities has provided insights into [12] Luo G, Fotidis IA, Angelidaki I. Comparative analysis of taxonomic, functional, and
their diversity and shift. Many new applications of bioinformatics metabolic patterns of microbiomes from 14 full-scale biogas reactors by metage-
nomic sequencing and radioisotopic analysis. Biotechnol Biofuels 2016;9:51.
technology on anaerobic digestion have emerged in the recent years,
[13] Rincón B, Borja R, González J, Portillo M, Sáiz-Jiménez C. Influence of organic
which offer huge potential in the optimized biogas industries with loading rate and hydraulic retention time on the performance, stability and mi-
higher efficiency and sustainability. There are still several challenges crobial communities of one-stage anaerobic digestion of two-phase olive mill solid
(e.g. data storage, data processing technique, data reliability and data residue. Biochem Eng J 2008;40:253–61.
123
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
[14] De Vrieze J, Saunders AM, He Y, Fang J, Nielsen PH, Verstraete W, et al. Ammonia [41] Bibby K, Viau E, Peccia J. Pyrosequencing of the 16S rRNA gene to reveal bacterial
and temperature determine potential clustering in the anaerobic digestion mi- pathogen diversity in biosolids. Water Res 2010;44:4252–60.
crobiome. Water Res 2015;75:312–23. [42] Li A, Chu Y, Wang X, Ren L, Yu J, Liu X, et al. A pyrosequencing-based metage-
[15] Regueiro L, Spirito CM, Usack JG, Hospodsky D, Werner JJ, Angenent LT. nomic study of methane-producing microbial community in solid-state biogas re-
Comparing the inhibitory thresholds of dairy manure co-digesters after prolonged actor. Biotechnol Biofuels 2013;6:3.
acclimation periods: Part 2–correlations between microbiomes and environment. [43] Wong MT, Zhang D, Li J, Hui RKH, Tun HM, Brar MS, et al. Towards a metage-
Water Res 2015;87:458–66. nomic understanding on enhanced biomethane production from waste activated
[16] Zhang J, Mao L, Zhang L, Loh K-C, Dai Y, Tong YW. Metagenomic insight into the sludge after pH 10 pretreatment. Biotechnol Biofuels 2013;6:38.
microbial networks and metabolic mechanism in anaerobic digesters for food [44] Ghanimeh SA, Saikaly PE, Li D, El-Fadel M. Population dynamics during startup of
waste by incorporating activated carbon. Sci Rep 2017;7:11293. thermophilic anaerobic digesters: the mixing factor. Waste Manag
[17] Ye L, Zhang T, Wang TT, Fang ZW. Microbial structures, functions and metabolic 2013;33:2211–8.
pathways in wastewater treatment bioreactors revealed using high-throughput [45] Röske I, Sabra W, Nacke H, Daniel R, Zeng A-P, Antranikian G, et al. Microbial
sequencing. Environ Sci Technol 2012;46:13244–52. community composition and dynamics in high-temperature biogas reactors using
[18] Genitsaris S, Monchy S, Denonfoux J, Ferreira S, Kormas KA, Sime-Ngando T, et al. industrial bioethanol waste as substrate. Appl Microbiol Biotechnol
Marine microbial community structure assessed from combined metagenomic 2014;98:9095–106.
analysis and ribosomal amplicon deep-sequencing. Mar Biol Res 2016;12:30–42. [46] Sun R, Xing D, Jia J, Zhou A, Zhang L, Ren N. Methane production and microbial
[19] Krakat N, Anjum R, Demirel B, Schröder P. Methodological flaws introduce strong community structure for alkaline pretreated waste activated sludge. Bioresour
bias into molecular analysis of microbial populations. J Appl Microbiol Technol 2014;169:496–501.
2017;122:364–77. [47] Martínez MA, Romero H, Perotti NI. Two amplicon sequencing strategies revealed
[20] Ju F, Zhang T. Experimental design and bioinformatics analysis for the application different facets of the prokaryotic community associated with the anaerobic
of metagenomics in environmental sciences and biotechnology. Environ Sci treatment of vinasses from ethanol distilleries. Bioresour Technol
Technol 2015;49:12628–40. 2014;153:388–92.
[21] Alvarez-Silva MC, Álvarez-Yela AC, Gómez-Cano F, Zambrano MM, Husserl J, [48] Lu X, Rao S, Shen Z, Lee PK. Substrate induced emergence of different active
Danies G, et al. Compartmentalized metabolic network reconstruction of microbial bacterial and archaeal assemblages during biomethane production. Bioresour
communities to determine the effect of agricultural intervention on soils. PloS One Technol 2013;148:517–24.
2017;12:e0181826. [49] Tuan NN, Chang Y-C, Yu C-P, Huang S-L. Multiple approaches to characterize the
[22] Badhai J, Ghosh TS, Das SK. Taxonomic and functional characteristics of microbial microbial community in a thermophilic anaerobic digester running on swine
communities and their correlation with physicochemical properties of four geo- manure: a case study. Microbiol Res 2014;169:717–24.
thermal springs in Odisha, India. Front Microbiol 2015;6:1166. [50] Ghasimi DS, Lier JB, Zandvoort MH, Kreuk M, Tao Y. Microbial population dy-
[23] Kumar GR, Chowdhary N. Biotechnological and bioinformatics approaches for namics during long-term sludge adaptation of thermophilic and mesophilic se-
augmentation of biohydrogen production: a review. Renew Sustain Energy Rev quencing batch digesters treating sewage fine sieved fraction at varying organic
2016;56:1194–206. loading rates. Biotechnol Biofuels 2015;8:171.
[24] Mukherjee A, Chettri B, Langpoklakpam JS, Basak P, Prasad A, Mukherjee AK, [51] Cho S-K, Kim D-H, Quince C, Im W-T, Oh S-E, Shin SG. Low-strength ultra-
et al. Bioinformatic approaches including predictive metagenomic profiling reveal sonication positively affects methanogenic granules toward higher AD perfor-
characteristics of bacterial response to petroleum hydrocarbon contamination in mance: implications from microbial community shift. Ultrason Sonochem
diverse environments. Sci Rep 2017;7:1108. 2016;32:198–203.
[25] Suhartini S, Heaven S, Banks CJ. Comparison of mesophilic and thermophilic [52] Yun YM, Kim DH, Cho SK, Shin HS, Jung KW, Kim HW. Mitigation of ammonia
anaerobic digestion of sugar beet pulp: performance, dewaterability and foam inhibition by internal dilution in high‐rate anaerobic digestion of food waste lea-
control. Bioresour Technol 2014;152:202–11. chate and evidences of microbial community response. Biotechnol Bioeng
[26] Riggio S, Hernandéz-Shek M, Torrijos M, Vives G, Esposito G, Van Hullebusch E, 2016;113:1892–901.
et al. Comparison of the mesophilic and thermophilic anaerobic digestion of spent [53] Westerholm M, Crauwels S, Houtmeyers S, Meerbergen K, Van Geel M, Lievens B,
cow bedding in leach-bed reactors. Bioresour Technol 2017;234:466–71. et al. Microbial community dynamics linked to enhanced substrate availability and
[27] Munk B, Guebitz GM, Lebuhn M. Influence of nitrogen-rich substrates on biogas biogas production of electrokinetically pre-treated waste activated sludge.
production and on the methanogenic community under mesophilic and thermo- Bioresour Technol 2016;218:761–70.
philic conditions. Anaerobe 2017;46:146–54. [54] Azizi A, Kim W, Lee JH. Comparison of microbial communities during the anae-
[28] Li W, Loh K-C, Zhang J, Tong YW, Dai Y. Two-stage anaerobic digestion of food robic digestion of Gracilaria under mesophilic and thermophilic conditions. World
waste and horticultural waste in high-solid system. Appl Energy 2017;209:400–8. J Microbiol Biotechnol 2016;32:158.
[29] Zhang J, Loh K-C, Lee J, Wang C-H, Dai Y, Tong YW. Three-stage anaerobic co- [55] Westerholm M, Crauwels S, Van Geel M, Dewil R, Lievens B, Appels L. Microwave
digestion of food waste and horse manure. Sci Rep 2017;7:1269. and ultrasound pre-treatments influence microbial community structure and di-
[30] Zhang J, Loh K-C, Li W, Lim JW, Dai Y, Tong YW. Three-stage anaerobic digester gester performance in anaerobic digestion of waste activated sludge. Appl
for food waste. Appl Energy 2017;194:287–95. Microbiol Biotechnol 2016;100:5339–52.
[31] Luo G, Angelidaki I. Analysis of bacterial communities and bacterial pathogens in a [56] Li N, Xue Y, Chen S, Takahashi J, Dai L, Dai X. Methanogenic population dynamics
biogas plant by the combination of ethidium monoazide, PCR and Ion Torrent regulated by bacterial community responses to protein-rich organic wastes in a
sequencing. Water Res 2014;60:156–63. high solid anaerobic digester. Chem Eng J 2017;317:444–53.
[32] Luo G, De Francisci D, Kougias PG, Laura T, Zhu X, Angelidaki I. New steady-state [57] Yu Z, Wen X, Xu M, Huang X. Characteristics of extracellular polymeric substances
microbial community compositions and process performances in biogas reactors and bacterial communities in an anaerobic membrane bioreactor coupled with
induced by temperature disturbances. Biotechnol Biofuels 2015;8:3. online ultrasound equipment. Bioresour Technol 2012;117:333–40.
[33] Luo G, Li B, Li L-G, Zhang T, Angelidaki I. Antibiotic resistance genes and corre- [58] Ho D, Jensen P, Batstone D. Effects of temperature and hydraulic retention time on
lations with microbial community and metal resistance genes in full-scale biogas acetotrophic pathways and performance in high-rate sludge digestion. Environ Sci
reactors as revealed by metagenomic analysis. Environ Sci Technol Technol 2014;48:6468–76.
2017;51:4069–80. [59] Ziganshin AM, Liebetrau J, Proeter J, Kleinsteuber S. Microbial community
[34] Schlüter A, Bekel T, Diaz NN, Dondrup M, Eichenlaub R, Gartemann K-H, et al. The structure and dynamics during anaerobic digestion of various agricultural waste
metagenome of a biogas-producing microbial community of a production-scale materials. Appl Microbiol Biotechnol 2013;97:5161–74.
biogas plant fermenter analysed by the 454-pyrosequencing technology. J [60] Lebuhn M, Hanreich A, Klocke M, Schlüter A, Bauer C, Pérez CM. Towards mo-
Biotechnol 2008;136:77–90. lecular biomarkers for biogas production from lignocellulose-rich substrates.
[35] Wirth R, Kovács E, Maróti G, Bagi Z, Rákhely G, Kovács KL. Characterization of a Anaerobe 2014;29:10–21.
biogas-producing microbial community by short-read next generation DNA se- [61] Solli L, Håvelsrud OE, Horn SJ, Rike AG. A metagenomic study of the microbial
quencing. Biotechnol Biofuels 2012;5:41. communities in four parallel biogas reactors. Biotechnol Biofuels 2014;7:146.
[36] Güllert S, Fischer MA, Turaev D, Noebauer B, Ilmberger N, Wemheuer B, et al. [62] Razaviarani V, Buchanan ID. Anaerobic co-digestion of biodiesel waste glycerin
Deep metagenome and metatranscriptome analyses of microbial communities af- with municipal wastewater sludge: microbial community structure dynamics and
filiated with an industrial biogas fermenter, a cow rumen, and elephant feces re- reactor performance. Bioresour Technol 2015;182:8–17.
veal major differences in carbohydrate hydrolysis strategies. Biotechnol Biofuels [63] Ziganshina EE, Belostotskiy DE, Ilinskaya ON, Boulygina EA, Grigoryeva TV,
2016;9:121. Ziganshin AM. Effect of the organic loading rate increase and the presence of
[37] Sträuber H, Lucas R, Kleinsteuber S. Metabolic and microbial community dynamics zeolite on microbial community composition and process stability during anae-
during the anaerobic digestion of maize silage in a two-phase process. Appl robic digestion of chicken wastes. Microb Ecol 2015;70:948–60.
Microbiol Biotechnol 2016;100:479–91. [64] Belostotskiy DE, Ziganshina EE, Siniagina M, Boulygina EA, Miluykov VA,
[38] Bareither CA, Wolfe GL, McMahon KD, Benson CH. Microbial diversity and dy- Ziganshin AM. Impact of the substrate loading regime and phosphoric acid sup-
namics during methane production from municipal solid waste. Waste Manag plementation on performance of biogas reactors and microbial community dy-
2013;33:1982–92. namics during anaerobic digestion of chicken wastes. Bioresour Technol
[39] Lee S-H, Kang H-J, Lee YH, Lee TJ, Han K, Choi Y, et al. Monitoring bacterial 2015;193:42–52.
community structure and variability in time scale in full-scale anaerobic digesters. [65] Sun L, Liu T, Müller B, Schnürer A. The microbial community structure in in-
J Environ Monit 2012;14:1893–905. dustrial biogas plants influences the degradation rate of straw and cellulose in
[40] Zhang H, Banaszak JE, Parameswaran P, Alder J, Krajmalnik-Brown R, Rittmann batch tests. Biotechnol Biofuels 2016;9:128.
BE. Focused-Pulsed sludge pre-treatment increases the bacterial diversity and re- [66] Oosterkamp MJ, Méndez-García C, Kim C-H, Bauer S, Ibáñez AB, Zimmerman S,
lative abundance of acetoclastic methanogens in a full-scale anaerobic digester. et al. Lignocellulose-derived thin stillage composition and efficient biological
Water Res 2009;43:4517–26. treatment with a high-rate hybrid anaerobic bioreactor system. Biotechnol Biofuels
124
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
125
L. Zhang et al. Renewable and Sustainable Energy Reviews 100 (2019) 110–126
[126] Markowitz VM, Chen IMA, Chu K, Szeto E, Palaniappan K, Pillay M, et al. IMG/M 4 simulator based on microbial community profiles. BMC Bioinforma 2016;17:488.
version of the integrated metagenome comparative analysis system. Nucleic Acids [155] Yan J, Chuai G, Qi T, Shao F, Zhou C, Zhu C, et al. MetaTopics: an integration tool
Res 2013;42:D568–73. to analyze microbial community profile by topic model. BMC Genom 2017;18:962.
[127] Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, et al. KEGG for [156] Hanemaaijer M, Olivier BG, Röling WF, Bruggeman FJ, Teusink B. Model-based
linking genomes to life and the environment. Nucleic Acids Res 2007;36:D480–4. quantification of metabolic interactions from dynamic microbial-community data.
[128] Hunter S, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, et al. InterPro: PloS One 2017;12:e0173183.
the integrative protein signature database. Nucleic Acids Res 2008;37:D211–5. [157] Cardona C, Weisenhorn P, Henry C, Gilbert JA. Network-based metabolic analysis
[129] Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, et al. Pfam: and microbial community modeling. Curr Opin Microbiol 2016;31:124–31.
the protein families database. Nucleic Acids Res 2013;42:D222–30. [158] Song HS, Cannon WR, Beliaev AS, Konopka A. Mathematical modeling of micro-
[130] Consortium GO. The Gene Ontology (GO) database and informatics resource. bial community dynamics: a methodological review. Processes 2014;2:711–52.
Nucleic Acids Res 2004;32:D258–61. [159] Larsen PE, Gibbons SM, Gilbert JA. Modeling microbial community structure and
[131] Hoff KJ. The effect of sequencing errors on metagenomic gene prediction. BMC functional diversity across time and space. FEMS Microbiol Lett 2012;332:91–8.
Genom 2009;10:520. [160] Succurro A, Ebenhöh O. Review and perspective on mathematical modeling of
[132] Jünemann S, Kleinbölting N, Jaenicke S, Henke C, Hassa J, Nelkner J, et al. microbial ecosystems. Biochem Soc Trans 2018;46:403–12.
Bioinformatics for NGS-based metagenomics and the application to biogas re- [161] Chen Y, Lan S, Wang L, Dong S, Zhou H, Tan Z, et al. A review: driving factors and
search. J Biotechnol 2017;261:10–23. regulation strategies of microbial community structure and dynamics in waste-
[133] Bassani I, Kougias PG, Treu L, Angelidaki I. Biogas upgrading via hydro- water treatment systems. Chemosphere 2017;174:173–82.
genotrophic methanogenesis in two-stage continuous stirred tank reactors at me- [162] Zhou J, Zhang R, Liu F, Yong X, Wu X, Zheng T, et al. Biogas production and
sophilic and thermophilic conditions. Environ Sci Technol 2015;49:12585–93. microbial community shift through neutral pH control during the anaerobic di-
[134] Jang HM, Kim JH, Ha JH, Park JM. Bacterial and methanogenic archaeal com- gestion of pig manure. Bioresour Technol 2016;217:44–9.
munities during the single-stage anaerobic digestion of high-strength food was- [163] Argyropoulos A. Soft sensor development and process control of anaerobic di-
tewater. Bioresour Technol 2014;165:174–82. gestion [Ph.D. thesis]. University of Exeter; 2013http://hdl.handle.net/10871/
[135] Koo T, Shin SG, Lee J, Han G, Kim W, Cho K, et al. Identifying methanogen 15068.
community structures and their correlations with performance parameters in four [164] Bai X, Li Z, Wang X, He X, Cheng S, Bai X, et al. Online measurement of alkalinity
full-scale anaerobic sludge digesters. Bioresour Technol 2017;228:368–73. in anaerobic co-digestion using linear regression method. Int J Agric Biol Eng
[136] Hidaka T, Tsushima I, Tsumori J. Comparative analyses of microbial structures 2017;10:176–83.
and gene copy numbers in the anaerobic digestion of various types of sewage [165] Amha YM, Anwar MZ, Brower A, Jacobsen CS, Stadler LB, Webster TM, et al.
sludge. Bioresour Technol 2018. https://doi.org/10.1016/j.biortech.2017.12.097. Inhibition of anaerobic digestion processes: applications of molecular tools.
[137] Suksong W, Kongjan P, Prasertsan P, Imai T, Sompong O. Optimization and mi- Bioresour Technol 2018;247:999–1014.
crobial community analysis for production of biogas from solid waste residues of [166] Lee J, Shin SG, Jang HM, Kim YB, Lee J, Mo Y. Characterization of antibiotic
palm oil mill industry by solid-state anaerobic digestion. Bioresour Technol resistance genes in representative organic solid wastes: food waste-recycling
2016;214:166–74. wastewater, manure, and sewage sludge. Sci Total Environ 2017;579:1692–8.
[138] Zhang L, Zhang J, Loh K-C. Activated carbon enhanced anaerobic digestion of food [167] Tian Z, Zhang Y, Yu B, Yang M. Changes of resistome, mobilome and potential
waste-Laboratory-scale and pilot-scale operation. Waste Manag 2018;75:270–9. hosts of antibiotic resistance genes during the transformation of anaerobic diges-
[139] Zhang J, Li W, Lee J, Loh K-C, Dai Y, Tong YW. Enhancement of biogas production tion from mesophilic to thermophilic. Water Res 2016;98:261–9.
in anaerobic co-digestion of food waste and waste activated sludge by biological [168] Jang HM, Shin J, Choi S, Shin SG, Park KY, Cho J, et al. Fate of antibiotic re-
co-pretreatment. Energy 2017;137:479–86. sistance genes in mesophilic and thermophilic anaerobic digestion of chemically
[140] Agneessens LM, Ottosen LDM, Voigt NV, Nielsen JL, de Jonge N, Fischer CH, et al. enhanced primary treatment (CEPT) sludge. Bioresour Technol 2017;244:433–44.
In-situ biogas upgrading with pulse H2 additions: the relevance of methanogen [169] Pu C, Liu H, Ding G, Sun Y, Yu X, Chen J, et al. Impact of direct application of
adaption and inorganic carbon level. Bioresour Technol 2017;233:256–63. biogas slurry and residue in fields: In situ analysis of antibiotic resistance genes
[141] Bassani I, Kougias PG, Treu L, Porté H, Campanaro S, Angelidaki I. Optimization of from pig manure to fields. J Hazard Mater 2018;344:441–9.
hydrogen dispersion in thermophilic up-flow reactors for ex situ biogas upgrading. [170] Tong J, Liu J, Zheng X, Zhang J, Ni X, Chen M, et al. Fate of antibiotic resistance
Bioresour Technol 2017;234:310–9. bacteria and genes during enhanced anaerobic digestion of sewage sludge by
[142] Kougias PG, Treu L, Benavente DP, Boe K, Campanaro S, Angelidaki I. Ex-situ microwave pretreatment. Bioresour Technol 2016;217:37–43.
biogas upgrading and enhancement in different reactor systems. Bioresour [171] Yin X, Jiang XT, Chai B, Li L, Yang Y, Cole JR, et al. ARGs-OAP v2.0 with an
Technol 2017;225:429–37. expanded SARG database and hidden Markov models for enhancement char-
[143] Mulat DG, Mosbæk F, Ward AJ, Polag D, Greule M, Keppler F, et al. Exogenous acterization and quantification of antibiotic resistance genes in environmental
addition of H2 for an in situ biogas upgrading through biological reduction of metagenomes. Bioinformatics 2018:1–8.
carbon dioxide into methane. Waste Manag 2017;68:146–56. [172] Oulas A, Pavloudi C, Polymenakou P, Pavlopoulos GA, Papanikolaou N, Kotoulas
[144] Schuchmann K, Müller V. Autotrophy at the thermodynamic limit of life: a model G, et al. Metagenomics: tools and insights for analyzing next-generation sequen-
for energy conservation in acetogenic bacteria. Nat Rev Microbiol cing data derived from biodiversity studies. Bioinform Biol Insights 2015:9.
2014;12:809–21. [BBI.S12462].
[145] Angelidaki I, Treu L, Tsapekos P, Luo G, Campanaro S, Wenzel H, et al. Biogas [173] Dröge J, McHardy AC. Taxonomic binning of metagenome samples generated by
upgrading and utilization: current status and perspectives. Biotechnol Adv next-generation sequencing technologies. Brief Bioinform 2012;13:646–55.
2018;36:452–66. [174] Tabish M, Azim S, Hussain MA, Rehman SU, Sarwar T, Ishqi HM. Bioinformatics
[146] Campanaro S, Treu L, Kougias PG, Luo G, Angelidaki I. Metagenomic binning approaches in studying microbial diversity. Management of microbial resources in
reveals the functional roles of core abundant microorganisms in twelve full-scale the environment. Dordrecht: Springer; 2013. p. 119–40.
biogas plants. Water Res 2018;140:123–34. [175] Campanaro S, Treu L, Kougias PG, Zhu X, Angelidaki I. Taxonomy of anaerobic
[147] De Vrieze J, Verstraete W. Perspectives for microbial community composition in digestion microbiome reveals biases associated with the applied high throughput
anaerobic digestion: from abundance and activity to connectivity. Environ sequencing strategies. Sci Rep 2018;8:1926.
Microbiol 2016;18:2797–809. [176] Widder S, Allen RJ, Pfeiffer T, Curtis TP, Wiuf C, Sloan WT, et al. Challenges in
[148] Parthasarathy R. Monitoring microbial communities using light sheet fluorescence microbial ecology: building predictive understanding of community function and
microscopy. Curr Opin Microbiol 2018;43:31–7. dynamics. ISME J 2016;10:2557–68.
[149] Jiang Y, Banks C, Zhang Y, Heaven S, Longhurst P. Quantifying the percentage of [177] Hiraoka S, Yang C, Iwasaki W. Metagenomics and bioinformatics in microbial
methane formation via acetoclastic and syntrophic acetate oxidation pathways in ecology: current status and beyond. Microbes Environ 2016;31:204–12.
anaerobic digesters. Waste Manag 2018;71:749–56. [178] Tolvanen KE, Karp MT. Molecular methods for characterizing mixed microbial
[150] Tian H, Fotidis IA, Mancini E, Treu L, Mahdy A, Ballesteros M, et al. Acclimation to communities in hydrogen-fermenting systems. Int J Hydrogen Energy
extremely high ammonia levels in continuous biomethanation process and the 2011;36:5280–8.
associated microbial community dynamics. Bioresour Technol 2018;247:616–23. [179] Palaniswamy D, Ramesh G, Sivasankaran S, Kathiravan N. Optimising biogas from
[151] Yang Z, Wang W, He Y, Zhang R, Liu G. Effect of ammonia on methane production, food waste using a neural network model. In: Proceedings of the institution of civil
methanogenesis pathway, microbial community and reactor performance under engineers-municipal engineer. Thomas Telford Ltd; 2016. p. 221–9.
mesophilic and thermophilic conditions. Renew Energy 2018;125:915–25. [180] Nair VV, Dhar H, Kumar S, Thalla AK, Mukherjee S, Wong JW. Artificial neural
[152] Hanemaaijer M, Röling WF, Olivier BG, Khandelwal RA, Teusink B, Bruggeman FJ. network based modeling to evaluate methane yield from biogas in a laboratory-
Systems modeling approaches for microbial community studies: from metage- scale anaerobic bioreactor. Bioresour Technol 2016;217:90–9.
nomics to inference of the community structure. Front Microbiol 2015;6:213. [181] Antwi P, Li J, Boadi PO, Meng J, Shi E, Deng K, et al. Estimation of biogas and
[153] Succurro A, Moejes FW, Ebenhöh O. A diverse community to study communities: methane yields in an UASB treating potato starch processing wastewater with
integration of experiments and mathematical models to study microbial consortia. backpropagation artificial neural network. Bioresour Technol 2017;228:106–15.
J Bacteriol 2017;199. [e00865-16]. [182] Eng A, Borenstein E. An algorithm for designing minimal microbial communities
[154] Shaw GTW, Pao YY, Wang D. MetaMIS: a metagenomic microbial interaction with desired metabolic capacities. Bioinformatics 2016;32:2008–16.
126