EMBO J 2019 E101011

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/329320009

The nature of the biological material and the irreproducibility problem in


biomedical research

Article  in  The EMBO Journal · January 2019


DOI: 10.15252/embj.2018101011

CITATIONS READS

3 368

1 author:

George Papamokos
University of Ioannina
35 PUBLICATIONS   398 CITATIONS   

SEE PROFILE

All content following this page was uploaded by George Papamokos on 11 June 2019.

The user has requested enhancement of the downloaded file.


Commentary

The nature of the biological material and


the irreproducibility problem in
biomedical research
George V Papamokos1,2,3,4

Biomedical research has a reproducibility Yet, the role of the unique nature of adopted by biological material upon
problem since various crucial landmark biological material itself has not been infinitesimal changes in the microenviron-
papers could not be independently repro- discussed analytically for its potential role in ment which support that this is an opportu-
duced. While there are many causes related the irreproducibility problem. I suggest that nity to produce new science. I will also
to statistical analysis, methodology or insuf- irreproducibility may also be a combined introduce methodology bias as a new term
ficient reporting of experimental details, result of the complexity of the biological for the discussion of reproducibility.
this commentary argues that the complexity material and of our finite ability to deal with
of biological material itself is, until now, a it, provided that all other factors are Perception of biology by Ernst Mayr
largely ignored source of irreproducibility. addressed (Eisner, 2018). Irreproducibility in
By discussing examples from evolutionary carefully selected experiments can also be Ernst Mayr, in his classic work “The Growth
biology, intrinsically disordered proteins and interpreted to the end that our current meth- of Biological Thought”, epitomized the
current biomedical research, it contends ods, theories and techniques are either philosophy of research in biology. He argued
that some results are irreproducible because applied ipsilaterally or are insufficient to that the complexity of biological systems
we do not have the knowledge, the tools or analyse and describe functions, properties inevitably places biology beyond the limits
the analytical ability to understand biologi- and unknown or missing parts of the biolog- of the natural sciences as the diversity of
cal complexity and how it can give rise to ical material. biological systems is based on rules that are
different results. Instead of casting irrepro- To support this argument, I will first substantially different from the laws of
ducible research out as bad or sloppy present the perception of the science of physics. Mayr believed that these rules
science, they should serve as an inspiration biology by Ernst Mayr, one of the leading cannot be described by methodological
for pioneering research not just to develop evolutionary biologists of the past century reductionism and can only approached prob-
such tools but also to attempt to explore who described the evolutionary process as abilistically, and that mathematics, although
what lies beneath our current inability to unpredictable or, at best, only statistically fundamental in physics, can contribute less
deal with complexity. predictable (Mayr, 1961). Subsequent recent to biology.
advancements either challenge the impossi- In his landmark paper Cause and Effect
The EMBO Journal (2019) e101011 ble or probabilistic approach of repeatability in Biology (Mayr, 1961), Mayr described
and predictability at certain levels, or biology as consisting of two different schools

R
eproducibility is a central dogma of suggest that predictable biochemical of thought: functional biology aims to
science as expressed by Karl Popper: changes in evolutionary biology do not have answer the question “How?” while evolu-
“. . . for we have seen that non-repro- a predictable molecular basis. A second tionary biology attempts to answer “Why?”
ducible single occurrences are of no signifi- argument is the discovery of a whole new To this end, he used a warbler on his
cance to science”. However, the life sciences kind of proteins, intrinsically disordered summer place in New Hampshire as an
do have an irreproducibility problem for proteins (IDPs) (Dyson & Wright, 2005), that example to distinguish between proximate
published research articles: during the past are key factors in health, disease, evolution causes that are linked to functional biology
years, evidence of irreproducible results has and now possible drug targets. Lastly, two and ultimate causes linked to evolutionary
been accumulating, and various method- key examples at the level of eukaryotic cells biology. Thus, the physiological condition of
ological causes have been identified (Eisner, (Hines William et al, 2014) and at the organ- the bird—its sensing of photoperiodicity and
2018) and extensively reviewed (see Eisner, ismal level (Lithgow et al, 2017) that were temperature—is the proximate cause of
2018 and references therein). unable to account for different states migration, while the reduced availability of

1 Biomedical Division, The Institute of Molecular Biology and Biotechnology, FORTH-ITE, Heraklion, Crete, Greece E-mail: george.papamokos@gmail.com
2 Laboratory of Biological Chemistry, Medical School, University of Ioannina, Ioannina, Greece
3 Laboratory of Soft Matter, Department of Physics, University of Ioannina, Ioannina, Greece
4 Department of Physics and School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
DOI 10.15252/embj.2018101011

ª 2019 The Author The EMBO Journal e101011 | 2019 1 of 4


The EMBO Journal Irreproducibility and complexity George V Papamokos

food during winter and its genetic disposi- analysis that inevitably yielded errors. For specifically, they tested whether high-alti-
tion are the ultimate causes of migration. Mayr, unpredictability and the unrepeatabil- tude taxa have convergently evolved
In the same paper, Mayr concluded that ity in biology were therefore almost a increases in Hb-O2 affinity and whether this
“. . . causality in biological systems is not dogma. evolutionary process is linked to parallel
predictive or at best is only statistically Recent findings in molecular biology amino acid substitutions (Natarajan et al,
predictive”. challenge this dogma as well as the various 2016). The authors showed that predictable
Two years earlier, at the 1959 Cold examples of convergent evolution in biol- changes in biochemical phenotype do not
Spring Harbor Symposium on Quantitative ogy. Adaptive convergent evolution entails have a predictable molecular basis, since
Biology, he had questioned not only the the independent emergence of similar or only a few changes were attributable to
contribution of mathematics to biology, but identical traits in multiple lineages that parallel amino acid substitutions at key resi-
also argued that applying mathematics in undergo the same ecological stimulus (Zou dues, while the majority of them were attri-
evolution is fundamentally and methodolog- & Zhang, 2015), At the molecular level, butable to nonreplicated substitutions and/
ically wrong: “I am impressed by the proteins exhibit convergence too. When dif- or parallel substitutions at sites that are not
uniqueness, by the unpredictability, and by ferent ancestral amino acids are substituted considered “key residues”.
the unrepeatability of evolutionary events. by the same descendant amino acid along While Mayr’s claim about the value of
Let me end this discussion with the provoca- independent evolutionary lineages, conver- mathematics proved wrong, his observation
tive question: Is it not perhaps a basic error gent substitution can be identified. When (Mayr, 1988) about the complexity of biolog-
of methodology to apply such a generalizing the same ancestral amino acid is changed to ical systems is critical: at the molecular
technique as mathematics to a field of a newer amino acid along independent level, biomacromolecules and macro-
unique events, as is organic evolution?” lineages, a parallel substitution takes place molecules of inanimate material do not dif-
Eventually, his claim that the “generalizing (Zou & Zhang, 2015). Another example is fer and show similar physicochemical
technique” of mathematics is not applicable to the observation that many insects that feed behaviour—this is evident from various
biology and evolution was refuted by other on Apocynaceae plants evolved indepen- studies especially for simple models. But
scientists (Crow, 2009). Today, fields such as dently the ability to avoid the toxic effects of biomacromolecules possess unique proper-
bioinformatics, computational biology or cardenolides, chemicals produced by the ties. In a living organism, they are hierarchi-
molecular modelling indeed show that mathe- same plants. Zhen et al analysed the alpha cally ordered to form new entities:
matics can be applied to analyse and under- subunit of the sodium pump, Na+, K+- biomolecules to cells, cells to tissues, tissues
stand biological evolution; during Mayr’s time, ATPase (ATPa) which is the protein target to organs and organs to fully functional
it was the human inability to perform highly for cardenolides in a broad range of taxa systems. When a fully functional system is
complex calculations and the lack of suffi- and found evidence of parallel changes and assembled, new unique functions and prop-
ciently powerful computers that prevented the duplications that explain the shift to avoid erties appear that can only be found in living
application of sophisticated mathematical toxicity. Their findings support the hypothe- organisms: response to external stimuli,
modelling in biology. By way of example, sis that adaptation follows evolutionary metabolism, growth, differentiation and
computational phylogenetics, which is a part paths that minimize negative pleiotropy (See replication (Mayr, 1988).
of systematics, Mayr’s field, demonstrates Zhen et al, 2012; Zou & Zhang, 2015). The last experiment described above
how human limitations were enhanced by This raises an important question in (Natarajan et al, 2016), along with Mayr’s
computer science—software and hardware. evolutionary biology: Is evolution indeed critical observation, show that adaptive
More generally, computational science unpredictable and unrepeatable or are our changes may be consequences of various
enhanced our ability to solve complex prob- knowledge, theories and methods still insuf- paths at the molecular level (different muta-
lems by orders of magnitude and encouraged ficient? There is a fundamental difference tions with different mechanisms), which
scientists to develop new methodologies that though: if unpredictability and unrepeatabil- converge to a common endpoint. They show
none would have conceived before, simply ity are properties of the evolutionary that the biological material has the neces-
because the technology was not available. process, then these properties must be sary complexity to create different processes
Last, but not least, we have entered the era of systematically verified. If these are unveri- upon the same stimulus for reasons not yet
big data which requires mathematics and fied properties that appear occasionally and fully understood. It raises a fascinating ques-
bioinformatics to make sense of the huge in conflict, we must examine whether unpre- tion: Is there a way to simulate reversible
amount of information. dictability and unrepeatability fall into our routes from the endpoint to the initial states
In general, back in the 1950s, as biology current human inability to predict. This of living organisms that evolved?
was maturing into an autonomous research inability might be a temporary and can be
field, many scientists and philosophers, such revoked in the future. The latter would be a Intrinsically disordered proteins
as Michael Scriven, accepted that, in evolu- clear sign that more research is needed.
tionary biology, “satisfactory explanation of So far, recent works support that evolu- Intrinsically disordered proteins (IDPs) are a
the past is possible even when prediction of tion is unpredictable at the molecular level. class of proteins that do not adopt a defined
the future is impossible”. Indeed, applying For instance, Natarajan et al studied the secondary structure; they possess flat energy
the laws and theories on complex problems predictability of genetic adaptation by exam- landscapes; and their spatial representation
in biology, psychology, anthropology, ining the molecular basis of convergence in resembles a polypeptidic sequence fluctuating
history, cosmology, economics and quantum haemoglobin function of 56 avian taxa with between various conformations such as
physics increasingly required statistical different altitudinal range limits—more extended coils or collapsed globules. (Dyson &

2 of 4 The EMBO Journal e101011 | 2019 ª 2019 The Author


George V Papamokos Irreproducibility and complexity The EMBO Journal

Wright, 2005). Despite their dynamic nature, monitored (The last two sentences are corro- Gordon J. Lithgow was one of the
they are multifunctional and can exhibit vari- borating comments of an unknown authors of a 2000 paper describing that a
ous interactions depending on cell-signalling reviewer). Mayr’s view that mathematics drug-like molecule could extend an animal’s
and regulatory networks. Their primary struc- cannot be applied to biology can also be lifespan (Lithgow et al, 2017 and refs 1,2
ture is dominated by charged, polar, small interpreted as a methodology bias. therein). This finding could not be repro-
hydrophilic amino acids, and they are either The result of this methodology bias is duced by other laboratories, and the reason
entirely disordered or a part of the sequence that we are missing a lot of information on is still unknown (Lithgow et al, 2017). The
constitutes an intrinsically disordered region IDPs, which has a severe effect on our abil- authors spent 4 years and studied more than
(IDR). ity to gain knowledge in living organisms. 100,000 worms to systematically test ageing
IDPs are subject to post-translational modi- Immanuel Kant, in his “The Critique of interventions in the nematode Caenorhabdi-
fications, which adds more complexity to IDPs’ Judgment” written in 1790, described an tis elegans and eliminated many causes of
functionality. Interestingly, IDPs are highly organism as “. . . a whole which result from variability. Yet, even when performing iden-
abundant in nature and their abundance the functioning of the parts, while the parts, tical experiments in a single laboratory, they
increases with the complexity of the organism in turn, depend on the functioning of the observed that some cohorts of worms could
from bacteria to archaea to eukaryota: more whole”. In fact, the removal of a part may fall into one of two modes of ageing: short-
than half of all eukaryotic proteins are esti- be lethal for the organism and for part. Our lived or long-lived. The reason is not under-
mated to embody at least regions of intrinsic lack of knowledge on IDPs is similarly a lack stood, and the team is focusing on molecular
disorder (IDRs). IDP functions play a central of knowledge on the functionality of an differences that may account for this dif-
role in many diseases: cancer, neurogenerative important part that prevents us from under- ference within the same strain. They state
diseases, cardiovascular diseases, type II standing and predicting the function of the that this phenomenon would have been
diabetes and acquired immunodeficiency whole living system. undiscoverable if they had not eliminated all
syndrome, while there are numerous findings other sources of variability.
that they are also part of the evolutionary Unexplained irreproducibility in cells In both examples, irreproducibility
process (Uversky et al, 2014). For example, and living organisms results spurred deeper analysis at the molec-
the IDP protein a-synuclein is linked to ular level that proved or may prove very
Parkinson’s disease, dementia with Lewy From the wealth of literature on irrepro- fruitful. Both examples also show that
bodies, Alzheimer’s disease and Down’s ducibility, I here discuss two papers that biological material from cells to living organ-
syndrome. highlight and emphasize the sensitivity of isms has the property to exhibit different
Unfortunately, a lot of information about the biological material to environmental states at almost identical conditions.
IDPs was lost or ignored. The standard factors no matter how minute these are (Bis-
method to determine the structure of a sell, 2013). The first example describes how What do cases A, B and C share?
protein has been X-ray crystallography, but cells from the same human breast cell line
many proteins were not investigated but from different sources respond dif- These cases highlight the complexity of the
because they failed to crystallize; since these ferently to the same assay. Second, studying biological material and a critical conse-
were considered negative results, this infor- the role of glucose uptake in cancer progres- quence: biological material can either
mation was not published. Moreover, many sion, Bissell and collaborators showed that produce the same result following various
proteins had parts missing in their crystal either changes in media glucose levels or molecular routes, or the same stimulus may
structure because these were comprised of the cells’ shape, when media was kept have different results. How can this conse-
intrinsically disordered regions that failed to constant, changed the nature of the metabo- quence affect science and irreproducible
crystallize. These parts were ignored until lites and the metabolic pathways. A third research? Suppose that a living system,
recently, which is an example of methodol- example from a collaborator of hers showed examined at the molecular level, can exhibit
ogy bias: the use of a method that can give that they could not reproduce their own various states in health, disease or evolution:
reliable results (X-ray crystallography) experiments using nonmalignant human S1, S2 and S3. Starting from either of these
provided that the material under study breast cell lines obtained from an investiga- states, it can converge to a state denoted as
adopts a specific state or condition (well- tor with cells obtained from a cell bank. S4. Suppose now that three independent
folded protein). However, if the material Subsequent analysis revealed that the laboratories, starting from state S4, explore
cannot adopt this state or condition owing to cultured cells had drifted and revealed the reasons for this reaction and reveal states
unknown properties (IDPs, IDRs), it leads to crucial information in cell cycle regulation S1, S2 and S3, respectively. Although these
methodology bias and misleading results, of the drifted cells. results may appear as irreproducible, they
wrong analysis and lost information. The same researcher and her collabora- manifest a chance for further research, new
Another example of methodology bias is tors provided another excellent example of knowledge and the missing factor of biologi-
irreproducibility of experiments involving how to track down and eliminate causes of cal complexity.
enzymes that are caused by differences in irreproducibility: working side by side on The current inability of our methods and
purification procedures and assay methods. the same tumour biopsy, they found that approaches to deal with the complexity of
Moreover, molecules that inhibit a specific even small differences in cell isolation— the biological material and its functions
enzyme may initiate multiple downstream vigorous stirring versus prolonged gentle should indeed inspire scientific progress to
effects, because they can also inhibit other rocking—resulted in irreproducible research replace earlier views based on ignorance
enzymes the activity of which is not (Hines William et al, 2014). with solid knowledge based on new

ª 2019 The Author The EMBO Journal e101011 | 2019 3 of 4


The EMBO Journal Irreproducibility and complexity George V Papamokos

findings, methods and approaches. Failures irreproducible research. On the contrary, it Eisner DA (2018) Reproducibility of science: fraud,
to reproduce experiments should therefore shows that when dealing with complex impact factors and carelessness. J Mol Cell
not be considered a catastrophe for science material, we need to share data; describe Cardiol 114: 364 – 368
and purged from the literature. Science has our work in detail; improve our methods, Hines William C, Su Y, Kuhn I, Polyak K, Bissell
made enormous progress through failures practices and protocols; increase our collab- Mina J (2014) Sorting out the FACS: a devil in
and errors: for example, the Rayleigh–Jeans oration and our interdisciplinary research; the details. Cell Rep 6: 779 – 781
law, the classical approach to describe and deepen our perception of the field. Lithgow G, Driscoll M, Phillips P (2017) A long
black-body radiation, resulted in the ultravi- Questionable practices on a material we only journey toward reproducible results. Nature
olet catastrophe. As the law predicted that know it partially, on a material that can 548: 387 – 388
cool objects should radiate in the visible and produce multiple results by default, do not Mayr E (1961) Cause and effect in biology. Science
ultraviolet regions of the electromagnetic serve science; they just serve career goals. 134: 1501 – 1506
spectrum, darkness could not exist. Max Instead, they demonstrate how sloppy or Mayr E (1988) Toward a new philosophy of biology:
Planck eventually brought darkness back biased research incubates irreproducible observations of an evolutionist. Cambridge, MA:
with the introduction of the quantization of research and builds expensive castles in the Harvard University Press
energy. sand. This kind of research is fundamentally Natarajan C, Hoffmann FG, Weber RE, Fago A,
different from the research and its results I Witt CC, Storz JF (2016) Predictable
Envoy described in this article. We must distin- convergence in hemoglobin function has
guish between bad science and honest fail- unpredictable molecular underpinnings. Science
Many methods and tools that are applied to ures that indicate the need for further 354: 336 – 339
analyse biological material were originally research. Uversky VN, Dave V, Iakoucheva LM, Malaney P,
designed to study materials of the inanimate Metallo SJ, Pathak RR, Joerger AC (2014)
world. The biological material, however, has Conflict of interest Pathological unfoldomics of
unique properties that are not predictable The authors declare that they have no conflict of uncontrolled chaos: intrinsically disordered
from its constituting parts. Are these meth- interest. proteins and human diseases. Chem Rev 114:
ods and techniques sufficient to reveal the 6844 – 6879
secrets of life in health and disease? References Zhen Y, Aardema ML, Medina EM, Schumer M,
Do selected irreproducible results imply Bissell M (2013) Reproducibility: the risks of the Andolfatto P (2012) Parallel Molecular
the need to improve existing and to invent replication drive. Nature 503: 333 – 334 Evolution in an Herbivore Community. Science
new methods, techniques, theories and Crow JF (2009) Mayr, mathematics and the study 337: 1634 – 1637
approaches? This commentary presents of evolution. J Biol 8: 13 Zou Z, Zhang J (2015) Are convergent and parallel
evidence and argues that we need to look Dyson HJ, Wright PE (2005) Intrinsically amino acid substitutions in protein evolution
out of our current scientific toolbox. This unstructured proteins and their functions. Nat more prevalent than neutral expectations? Mol
must not be perceived as an excuse for Rev Mol Cell Biol 6: 197 – 208 Biol Evol 32: 2085 – 2096

4 of 4 The EMBO Journal e101011 | 2019 ª 2019 The Author


View publication stats

You might also like