Professional Documents
Culture Documents
Addressing The Challenges of High Throughput Cancer Tissue Proteomics For Clinical Application - ProCan
Addressing The Challenges of High Throughput Cancer Tissue Proteomics For Clinical Application - ProCan
Figure 1. ProCan flywheel. The principal objective of ProCan is to build a body of insight and knowledge about cancer. This is an iterative process where
each cycle, driven by a single question of unmet clinical need, enables a body of knowledge to be created and the proteomic landscape of cancer to
be expanded. Around the wheel, there are loci of activities, each of which contain many challenges and decision points, especially those related to the
practicalities of performing high-quality science in a biobank-scale context. As the landscape grows, pan-cancer analyses become possible, offering a
new and unique perspective that we expect will enable insights not obtained through other means.
high-quality proteomics data to existing stores of clinical and The key to clinical application of tissue-based proteomics is to
molecular information (Figure 1). adapt to the practical requirements of clinical workflow. In the
The ProCan program is structured as a series of individual discovery phase, a major implication for ProCan is to prioritize
hypothesis-driven research studies focused on individual cancer analysis of formalin-fixed paraffin-embedded (FFPE) over fresh-
types that combine to form a pan-cancer knowledge-base over the frozen (FF) tissue samples where possible. Our experience with
course of the program. Operational decision-making has been FFPE tissue proteomics is consistent with other reports show-
shaped by the practical requirements of clinical research and ing that high-quality data can be generated and that the scale and
high-throughput proteomic studies performed at biobank-scale. scope of quantifiable proteins is comparable with FF tissues.[7]
Here, we describe major challenges we have identified and our However, there may be differences in the proteomic profiles of
approach to addressing some of these. FFPE and FF samples that require definition of distinct classi-
fiers for the two sample types. Furthermore, in order to ensure
throughput, we are focusing on analysis of unmodified peptides.
1.1. Study Design However, re-analysis of publicly available ProCan data by us or
others will enable questions currently out of scope to be explored
Individual studies that comprise the majority of ProCan research in the future.
are conducted in collaboration with expert clinical or research
groups. Priority research collaborations are based on the avail-
ability of tissue sample collections that are richly annotated with 1.2. Sample Preparation
clinical data, and ideally have other ‘omic data available for inte-
grated analysis with proteomics (Figure 2). A major considera- Cancer tissue samples are a complex mix of malignant, reac-
tion in study design is to ensure that the hypotheses to be tested tive, and normal elements and diagnostic interpretation relies
address areas of “unmet clinical need” relevant to specific can- on morphological assessment by a specialist pathologist. To in-
cers. In this respect, input from domain-expert collaborators at terpret data from proteomics, which is tissue disruptive, it is im-
the stage of project planning and monitoring is critical. In the fu- perative that the components of a sample submitted for MS are
ture, biomarker-seeking sub-studies integrated into clinical trials known. Isolation of cancer cells from tissue by microdissection
may provide opportunities to facilitate these interactions. is a strategy to directly assess their proteomic features. However,
A challenge in conducting a large number of collaborative this is a very labor-intensive process that is not well suited to
projects across institutions and jurisdictions has been to de- high-throughput, or clinical application. It is also likely that a pro-
velop systems to manage regulatory processes, including human teomic profile from the tumor microenvironment will be infor-
research ethics/institutional review board approvals, materials mative and is important to capture.
transfer, and other inter-institutional agreements. The time and The strategy that ProCan is using to address these issues
resource cost of this aspect is substantial. An advantage is that ex- is to prepare adjacent sections from tissue blocks for matched
perience in this area has been advanced by the conduct of large histopathologic review and proteomics analysis. A technical chal-
international collaborative studies over the past two decades, and lenge that this has created in the case of FF samples is to
our experience indicates a high level of familiarity with relevant develop efficient methods for removal of optimal cutting tem-
issues as well as dedicated expertise available in many institu- perature compound prior to MS. In the future, it may be-
tions to support researchers dealing with these challenges. come possible to deconvolute a tissue proteomic profile to
Figure 2. Schematic of ProCan matrix. ProCan aims to survey the proteomic landscape of all types of cancer via a series of individual collaborative projects
(represented here as radial panels). A typical project is designed to address a clinically relevant question by analyzing a cohort of cancer samples that
has been collected by a collaborating research group and annotated with clinical data, including response to treatment and survival. Wherever possible,
a matched section of the tumor—immediately adjacent to tissue processed for mass spectrometry—is analyzed by histopathology review. Cohorts for
which various combinations of other ‘omic data are available (indicated here by colored segments; WGS, whole genome sequencing; WES, whole exome
sequencing) are prioritized, so that correlation between proteomics and other ’omics can be addressed. The use of a single proteomics technology
across all ProCan projects will facilitate a pan-cancer overview of the human cancer tissue proteome.
estimate the contribution of different tissue elements, as has time. ProCan research is largely based on a variant of label-free
been achieved to a degree with transcriptomic data.[8] ProCan is LC-MS/MS that is particularly well suited to comparative
curating a large-scale database of digitized whole slide images analysis of a large number of tissue samples; Sequential
anticipating that it will become possible for machine learning Window Acquisition of all THeoretical mass spectra/Data-
techniques to enable more direct integration of histopathology Independent Acquisition (SWATH/DIA).[11] DIA/SWATH gives
and tissue proteomic profiles. close to complete detection of peptides from tryptic digestion
After samples are collected, tissue lysis and digestion protocols of complex samples, and its applicability to comparative can-
must be rapid, efficient, reproducible, and broadly applicable to cer tissue analysis has been demonstrated.[9] It does rely on
tissues of different kinds and from different source laboratories. availability of relevant spectral reference libraries, and our
In addition, the methodology should be adaptable for integration strategy is to use two-dimensions of chromatography from
of robotics to facilitate throughput where possible. ProCan has pooled samples to make libraries representative of all samples
instituted the use of pressure-cycling technology (barocyclers) to analyzed.
achieve consistent lysis and digestion of tissue samples.[9] In the There are many practical considerations in the design and
establishment phase, we have made considerable effort to refine operation of a biobank-scale LC-MS/MS facility. A key for Pro-
protocols that achieve shorter processing times while improving Can is to use standardized approaches to SWATH/DIA data
digestion and peptide yield, and maintaining reproducibility and capture, and to give priority to time- and cost-saving modifi-
sample quality. For example, simultaneous Lys-C and tryptic di- cations, including use of readily available reagents and com-
gestion is used in an accelerated manner as an aid to sample ponents such as commercially available capillary columns. In
throughput.[10] addition, the entire system requires constant monitoring and
quality control (QC). ProCan has enabled continuous QC by
selected use of simple (bovine serum albumin) and complex
1.3. Raw Data Acquisition (HEK293 cells) digests coupled to automated monitoring sys-
tems and protocols. These external controls are in addition to
LC-MS/MS is a highly sensitive, mature technology capable synthetic peptide standards that have been selected to span the
of identifying and providing relative quantitation of many retention time of the experiment and are included in every
thousands of proteins in a tissue sample in a 1 to 2 h run injection.
2. Concluding Comments [3] P. Janiaud, S. Serghiou, J. P. A. Ioannidis, Cancer Treat. Rev. 2019, 73,
20.
Sustained effort in cancer tissue–based proteomics research [4] J. Marquart, E. Y. Chen, V. Prasad, JAMA Oncol. 2018, 4, 1093.
is expected to provide substantial clinical benefit. Success will [5] G. T. Gibney, L. M. Weiner, M. B. Atkins, Lancet Oncol. 2016, 17,
require input from a broad range of stakeholders, including e542.
cancer consumers, cancer biologists, oncologists, patholo- [6] C. Bais, B. Mueller, M. F. Brady, R. S. Mannel, R. A. Burger, W. Wei,
gists, proteomicists, software engineers, data scientists, health K. M. Marien, M. M. Kockx, A. Husain, M. J. Birrer, N. R. G. On-
cology/Gynecologic Oncology Group, J. Natl. Cancer Inst. 2017, 109,
economists, regulators, and research funding agencies. Our
djx066.
experience to date with ProCan indicates that a high level [7] O. J. Gustafsson, G. Arentz, P. Hoffmann, Biochim. Biophys. Acta
of community support exists, and that clinical application of 2015, 1854, 559.
high-throughput proteomics will be achievable. [8] K. Yoshihara, M. Shahmoradgoli, E. Martinez, R. Vegesna, H. Kim,
W. Torres-Garcia, V. Trevino, H. Shen, P. W. Laird, D. A. Levine, S.
L. Carter, G. Getz, K. Stemke-Hale, G. B. Mills, R. G. Verhaak, Nat.
Acknowledgements Commun. 2013, 4, 2612.
[9] T. Guo, P. Kouvonen, C. C. Koh, L. C. Gillet, W. E. Wolski, H. L. Rost,
ProCan is supported by the Australian Cancer Research Foundation, the G. Rosenberger, B. C. Collins, L. C. Blum, S. Gillessen, M. Joerger, W.
Cancer Institute New South Wales (NSW) (2017/TPG001, REG171150), Jochum, R. Aebersold, Nat. Med. 2015, 21, 407.
the NSW Ministry of Health (CMP-01), the University of Sydney, the [10] N. Lucas, A. B. Robinson, M. Marcker Espersen, S. Mahboob, D.
National Breast Cancer Foundation (IIRS-18-164), the Cancer Council
Xavier, J. Xue, R. L. Balleine, A. deFazio, P. G. Hains, P. J. Robinson,
NSW (IG 18-01), Ian Potter Foundation, the Commonwealth of Australia
J. Proteome. Res. 2019, 18, 399.
through the Medical Research Futures Fund (MRFF-PD), and the Na-
[11] C. Ludwig, L. Gillet, G. Rosenberger, S. Amon, B. C. Collins, R. Aeber-
tional Health and Medical Research Council of Australia (GNT1047070,
GNT1170739). sold, Mol. Sys. Biol. 2018, 14, e8126.
[12] M. Ali, S. A. Khan, K. Wennerberg, T. Aittokallio, Bioinformatics 2018,
34, 1353.
[13] M. D. Wilkinson, M. Dumontier, I. J. Aalbersberg, G. Appleton, M.
Conflict of Interest Axton, A. Baak, N. Blomberg, J. W. Boiten, L. B. da Silva Santos, P. E.
The authors declare no conflict of interest. Bourne, J. Bouwman, A. J. Brookes, T. Clark, M. Crosas, I. Dillo, O.
Dumon, S. Edmunds, C. T. Evelo, R. Finkers, A. Gonzalez-Beltran, A.
J. Gray, P. Groth, C. Goble, J. S. Grethe, J. Heringa, P. A. t Hoen, R.
Hooft, T. Kuhn, R. Kok, J. Kok, et al., Sci. Data 2016, 3, 160018.
Keywords [14] B. H. Yoon, S. K. Kim, S. Y. Kim, Genom. Informat. 2017, 15, 19.
cancer, data analysis, data-independent acquisition, proteomics, sequen- [15] C. R. Kinsinger, J. Apffel, M. Baker, X. Bian, C. H. Borchers, R. Brad-
tial window acquisition of all theoretical mass spectra–mass spectrometry shaw, M. Y. Brusniak, D. W. Chan, E. W. Deutsch, B. Domon, J. Gor-
man, R. Grimm, W. Hancock, H. Hermjakob, D. Horn, C. Hunter,
P. Kolar, H. J. Kraus, H. Langen, R. Linding, R. L. Moritz, G. S.
Received: April 30, 2019
Omenn, R. Orlando, A. Pandey, P. Ping, A. Rahbar, R. Rivers, S. L.
Revised: July 11, 2019
Published online: Seymour, R. J. Simpson, D. Slotta, et al., Mol. Cell. Proteom. 2011, 10,
O111.015446.
[16] C. R. Jimenez, H. Zhang, C. R. Kinsinger, E. C. Nice, Clin. Proteom.
2018, 15, 4.
[1] S. Loibl, L. Gianni, Lancet 2017, 389, 2415. [17] R. J. Chalkley, M. J. MacCoss, J. D. Jaffe, H. L. Rost, Mol. Cell. Proteom.
[2] S. A. Cook, A. V. Tinker, BioDrugs 2019, 33, 255. PMID: 30895466. 2019, 18, 1.