Professional Documents
Culture Documents
Methods Guide For Cancer Research: For Research Use Only. Not For Use in Diagnostic Procedures
Methods Guide For Cancer Research: For Research Use Only. Not For Use in Diagnostic Procedures
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 2
Illumina genomic analysis techniques for cancer researchers
Take your research to the next level
The introduction of NGS has transformed the way scientists study biological systems. With clear benefits, such as
reduced time and cost compared to legacy technologies and the capacity to scale from small studies to population-
level throughput, NGS opens the door to a broad range of research capabilities.
Deep sequencing provides the sensitivity to detect low-frequency molecular events, uncovering the somatic variants
behind the tumor. These studies can be instrumental to understanding how changes at the DNA, RNA, protein, or
cellular level contribute to tumor initiation, growth, and metastasis. Single-cell techniques allow researchers to go
beyond bulk measurements to understand how different cells within the tumor microenvironment promote or inhibit
cancer progression.
NGS has already expanded our knowledge of cancer as a disease of the genome. More recent techniques are poised
to shed even more light within oncology. With the advantages of speed, sensitivity, and scale, NGS can take your
cancer research to the next level.
In addition to NGS, microarrays are a valuable tool for variant detection, from discovery applications to routine
screening. Powered by widely adopted Infinium technology, Illumina microarrays provide the trusted data quality
needed to accelerate research. The scalable, multi-sample format supports labs conducting large studies or
institutions processing high sample volumes, making it easy to keep pace with demand.
NGS-based workflows
Library prep
Library prep kits are available for a range of applications, including exome, transcriptome, and whole-genome
sequencing (WGS).
Sequencing
Illumina offers a full portfolio of sequencing platforms, from the benchtop iSeq™ 100 System to the production-scale
NovaSeq™ 6000 System, that deliver the right level of speed, capacity, and cost for various laboratory or sequencing
center requirements. Illumina has pioneered major advances in sequencing simplicity, flexibility, and platform
performance. Experiments that used to require complex workflows now use simple push-button workflows.
Data analysis
Illumina cancer research workflows using the DRAGEN pipelines provide user-friendly data analysis tools that are
easily accessible through the web with BaseSpace Sequence Hub or onboard the NextSeq 1000 and NextSeq 2000
Systems for push-button analysis. Additionally, BaseSpace Sequence Hub enables integration of many workflow
steps, including library prep planning and sample management, run set-up and chemistry validation, and real-time
data monitoring and data transfer to computing and analysis modules.
Microarray workflows
Illumina offers ready-to-use microarrays with expertly-designed content. BeadChips can be processed and scanned
using Illumina systems as part of a rapid and automation-compatible DNA-to-data workflow.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 3
Methods
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 4
Method 1: Single-cell sequencing
Understand the molecular drivers of cancer at single-cell resolution.
Key benefits over bulk sequencing:
Analysis,
Tissue Single-cell isolation
Specimens Sequencing visualization, and
preparation and library prep
interpretation
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 5
Potential applications of single-cell cancer studies
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 6
Single-cell RNA Sequencing (scRNA-Seq)
This guide presents an overview of scRNA-Seq in cancer research. For a complete overview of single-cell sequencing
techniques, download the single-cell eBook www.illumina.com/single-cell-rna-sequencing.
• Tissue preparation from solid tumors can be difficult, as the goal is to obtain viable, individualized fresh cells
• For RNA-Seq, this difficulty can translate into artifacts, as tissue manipulation can lead to transcriptional changes
• For cancer research methods, isolation of nuclei only has been employed, which enables preparation of fixed
tissues and reduces dependence on cell viability and integrity
• In general, after dissociation, fluorescence-activated cell sorting (FACS) to isolate single cells has been used,
especially with tumor samples
Advances in microfluidics technologies have enabled high-throughput single-cell profiling, where researchers can
cost-effectively examine hundreds to tens of thousands of cells per experiment. This guide only covers these high-
throughput technologies; for low-throughput options, consult the single-cell eBook.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 7
Step 4 Sequencing
The number of cells being evaluated varies depending on the experimental study design. For illustrative purposes,
this guide uses 5000 cells per experiment.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 8
Step 5 Single-cell analysis and visualization
After the single-cell sequencing run is complete, downstream analysis can be performed. Generally, the analysis
pipeline for single-cell sequencing experiments involves three phases: primary analysis (base calling), secondary
analysis (demultiplexing, alignment, and genetic characterization), and tertiary analysis (data visualization and
interpretation). There is no one correct way to carry out an analysis pipeline for single-cell sequencing experiments.
Many approaches and software programs are available for each step in the pipeline. The research objective, single-
cell isolation platform, and general lab considerations will largely determine the specific pipeline used. For more
information around analysis, consult the single-cell eBook.
2 Demultiplexing
Secondary analysis:
If the samples were multiplexed for sequencing,
Demultiplexing
resulting read files are demultiplexed prior to
(if applicable)
downstream analyses.
3 Sequence alignment
Sequence alignment The reads are mapped and aligned to a
reference genome.
5 Genetic characterization
Initial genetic Quality-controlled data sets are analyzed for
characterization genomic variants, gene expression, chromatin
accessibility, protein expression, etc.
6 Data visualization
Tertiary analysis: Multidimensional data plots enable the
Data visualization and interpretation clustering of cells and identification of
subpopulations.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 9
Downstream tertiary analysis and visualization solutions
There are many options for single-cell tertiary analysis tools, including open-source analysis tools developed
by academic labs in popular programming languages like R and Python, ‘plug-and-play’ packages that allow
researchers to use preconfigured analysis workflows, and commercial offerings. The tools chosen will depend on the
research goals and experimental objectives.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 10
Method 2: Spatial transcriptomics analysis
Perform multiomic profiling of tumors within the native morphological structure and context.
Key benefits over single parameter methods:
• Preserve spatial context for gene expression studies without the need for tissue dissociation
• Combine molecular profiling with immunofluorescence or histochemical staining and imaging on the same sample
• Obtain results rapidly with streamlined, automation-compatible workflows
Nanostring GeoMx Digital Spatial Profiler
Prepare Prepare
Specimens Image Sequence Data visualization
samples libraries
Prepare Prepare
Specimens Image Sequence Data visualization
samples libraries
FF or FFPE 10x Genomics Visium Spatial Gene Expression NovaSeq 10x Genomics Space
tumor samples 6000 System Ranger and Loupe
+ normal Browser
NextSeq 1000
and
NextSeq 2000
Systems
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 11
Potential applications of spatial genomics analysis cancer studies
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 12
Step 1 Sample preparation
Spatial analysis is performed on tissue sections mounted on slides. Illumina recommends following the respective
protocol for either the 10x Genomics Visium Spatial Gene Expression or Nanostring GeoMx DSP for sample
preparation. Options are available for targeted gene expression analysis.
Step 2 Imaging
Illumina recommends following the respective manufacturer’s protocol for library preparation.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 13
Step 4 Recommended sequencing systems
Step 5 Visualization
Illumina recommends using the respective analysis software for each manufacturer.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 14
Method 3: Assay for Transposase-Accessible Chromatin
Sequencing (ATAC-Seq)
Determine chromatin accessibility across the genome, without prior knowledge of regulatory
elements.
Key benefits over traditional methods for assessing chromatin accessibility, such as chromatin immunoprecipitation
sequencing (ChIP-Seq), formaldehyde-assisted isolation of regulatory elements sequencing (FAIRE-Seq), or DNase I
hypersensitive sites sequencing (DNase-Seq):
• Avoid sensitive enzymatic digestion or rigorous validation of antibodies by using the Tn5 transposase
• Obtain results rapidly, with a streamlined workflow that can be completed in < 3 hours
• Interrogate precious samples with input requirements as low as 500-50,000 cells
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 15
Potential applications of ATAC-Seq cancer studies
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 16
Step 1 Sample preparation
ATAC-Seq is performed on intact cells or nuclei, not isolated genomic DNA. Illumina recommends input requirements
for ATAC-Seq of ≥ 50,000 cells or nuclei. While lower inputs are possible (as few as 500 cells), they may cause issues
with low library complexity.
Illumina recommends following the protocol outlined in ATAC-seq: A Method for Assaying Chromatin Accessibility
Genome-Wide. IDT for Illumina Nextera DNA Unique Dual Indexes can be incorporated during PCR amplification to
generate a sequencing-ready library.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 17
Step 4 Primary and secondary analysis
Illumina recommends using the BWA Aligner app on BaseSpace Sequence Hub for primary sequencing data analysis
and the third-party Genrich or MACS2 software applications for secondary analysis.
Primary analysis: alignment
Software Application Input
Burrows Wheeler Aligner (BWA)
Alignment to reference genome FASTQ files
Bowtie2
Secondary analysis: peak calling
Software Application Input
Genrich
Analysis of alignment files to call peaks of
Model-based Analysis of ChIP- Output files from alignment application
significant enrichment
Seq (MACS2)
Step 5 Visualization
Illumina recommends using the IGV app on BaseSpace Sequence Hub or UCSC Genome Browser for visualizing
called peaks in genomic context.
Application Application Input
Any project can be used as input, including BAM,
Integrative Genomics Viewer (IGV)
Visual exploration of genomic data VCF, BED, BW, and BEDGRAPH
UCSC Genome Browser Multiple file types, including BAM, BED, and more
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 18
Method 4: Bulk epitope and nucleic acid sequencing (BEN-Seq)
Perform simultaneous bulk protein and gene expression profiling.
Key benefits over single parameter methods:
• Detect multiple proteins more efficiently than traditional methods, such as Western blotting
• Achieve similar levels of accuracy for protein measurement as flow cytometry with added benefit of quantifying
RNA expression
Isolate Prepare
Prepare samples Sequence Analyze data
TotalSeq ADTs libraries
Stain cells with BioLegend TotalSeq antibodies, lyse, and Illumina library prep NovaSeq DESeq2
isolate antibody-derived tags (ADTs) kits 6000 System BaseSpace App
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 19
Potential applications of BEN-Seq cancer studies
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 20
Step 1 Sample preparation and antibody staining
Illumina recommends following the BioLegend BEN-Seq protocol to prepare samples and stain cells with TotalSeq
antibodies for protein analysis. Illumina recommends input requirements of 8K cells. RNA and protein associated
libraries (ADTs) are separated using streptavidin-magnetic beads loaded with complementary oligonucleotides,
according to the protocol.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 21
Step 4 Data analysis and visualization
Illumina recommends using the DESeq2 app on BaseSpace Sequence Hub for data analysis and the third-party
R ComplexHeatmap module software for data visualization.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 22
Method 5: TCR (T-cell Receptor) sequencing
Characterize TCR diversity to understand immune function and responsiveness.
Key benefits over traditional methods for TCR characterization, such as PCR and flow cytometry:
• Target all three complementary determining regions (CDRs) for comprehensive coverage of the TCR region
• Detect T-cell clones at far greater sensitivity with fewer false positive and false negative rates
• Interrogate precious samples with input requirements as low as 25 ng of RNA
Analysis,
Nucleic acid
Specimens Library prep Sequencing visualization, and
extraction
interpretation
Easy: Fresh QIAGEN AllPrep DNA AmpliSeq for Illumina NovaSeq MiXCR Immune
and RNA FFPE Kit Repertoire Plus, TCR 6000 System Repertoire Analyzer
Difficult: FF and FFPE beta Panel BaseSpace App
NextSeq 1000 and
AmpliSeq for Illumina NextSeq 2000
TCR beta-SR Panel Systems
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 23
Potential applications of immune profiling cancer studies
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 24
Step 1 Recommended extraction methods
There are several DNA and RNA extraction methods that can be used with FF samples. For FFPE samples, Illumina
recommends the QIAGEN AllPrep DNA/RNA FFPE Kit. In Illumina internal studies, this kit extracts DNA and RNA with
high-quantity and -quality in the same workflow.
Illumina recommends using the MiXCR Immune Repertoire Analyzer app on BaseSpace Sequence Hub for TCR
repertoire sequencing data analysis.
Software Application Input
MiXCR Immune Repertoire V-(D)-J segment mapping, alignment, mutation Any sequencing data type with any level of TCR
Analyzer analysis coverage
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 25
Method 6: WGS
Identify cancer-driving genetic events beyond protein-coding variants.
Key benefits over targeted or exome sequencing
• Obtain a complete view of the mutational profile of a tumor sample, including noncoding and structural variants
• Identify mutational signatures linked to cancer progression
• Differentiate driver from passenger mutations
DNA Expression
Specimens Library prep Sequencing
extraction analysis
DRAGEN
FF and FFPE QIAGEN AllPrep DNA Illumina DNA Prep NovaSeq 6000 DRAGEN Bio-IT
Kit System Platform
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 26
Potential applications of WGS cancer studies
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 27
Step 1 Recommended extraction methods
There are several DNA extraction methods that can be used with FF samples. For FFPE samples, Illumina recommends
the QIAGEN AllPrep FFPE Kit. In Illumina internal studies, this kit gives a high quantity and quality of DNA.
Illumina recommends using the DRAGEN pipelines on BaseSpace Sequence Hub or on a DRAGEN Server to obtain
somatic variant calls and gene expression data. When using BaseSpace Sequence Hub, users can monitor runs in
real time while securely streaming data directly from the instruments into the ecosystem for push-button analysis.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 28
Method 7: Mutational profiling
Identify tumor-specific peptides that may be capable of inducing an immune response.
Key benefits over traditional methods such as qPCR, Sanger sequencing, and small NGS panels
DRAGEN
FF or FFPE QIAGEN AllPrep Tumor and normal NovaSeq DRAGEN Platform Multiple
tumor samples DNA and RNA DNA: 6000 System methods
+ normal FFPE Kit Illumina DNA Prep DNA: Somatic Pipeline
with Enrichment with NextSeq 1000
exome oligos and RNA: RNA-Seq Pipeline
NextSeq 2000
Tumor RNA: Illumina Systems
RNA Prep with
Enrichment
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 29
Potential research applications of neoantigen prediction
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 30
Step 1 Recommended extraction methods
There are several DNA and RNA extraction methods that can be used with FF samples. For FFPE samples, Illumina
recommends the QIAGEN AllPrep DNA/RNA FFPE Kit. In Illumina internal studies, this kit extracts DNA and RNA with
high-quantity and -quality in the same workflow.
Product Illumina DNA Prep with Enrichment Illumina RNA Prep with Enrichment
Most important to me Minimal input requirements, fewest number of steps, and high uniformity of coverage
Minimum input requirements 10 ng 10 ng
Total library prep time ~6.5 h 9.5 ha
Sample type FF and FFPE compatible FF and FFPE compatible
Sample index sets 384 unique dual indexes 384 indexes availableb
a. Turnaround time for 24 samples with 3-plex enrichment
b. Up to 192 unique dual indexes are currently supported; 384 indexes will be available later in 2021
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 31
Step 4 Secondary analysis
Illumina recommends using the DRAGEN pipelines on BaseSpace Sequence Hub, a DRAGEN Server, or onboard
the NextSeq 1000 and NextSeq 2000 Systems to obtain somatic variant calls and gene expression data. When
using BaseSpace Sequence Hub, users can monitor runs in real time while securely streaming data directly from the
instruments into the ecosystem for push-button analysis.
Before neoantigen prediction, users must use the somatic variant calls and gene expression data for human
leukocyte antigen (HLA) typing, peptide processing, and major histocompatibility complex (MHC)-binding prediction.
There are many options for the user in each of these steps. For a review of these options and instructions for use,
read “Best practices for bioinformatic characterization of neoantigens for clinical utility” at https://genomemedicine.
biomedcentral.com/articles/10.1186/s13073-019-0666-2.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 32
Method 8: Methylation arrays
Interrogate methylation status across the genome of cancer cells at single-nucleotide resolution.
Key benefits over other methods:
• Comprehensive genome-wide coverage including: CpG islands, CHH sites, enhancers, open chromatin,
transcription factor binding sites, and more
• High-throughput research capabilities at minimal cost per sample
• User-friendly, streamlined workflow with high assay reproducibility and support for FFPE samples
Sample and
Specimens Scanning Analysis
BeadChip prep
Infinium Mouse
Methylation BeadChip
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 33
Potential applications of methylation array cancer studies
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 34
Step 1 Sample preparation
Illumina recommends an input requirement of 250 ng genomic DNA. FFPE samples are supported.
Illumina recommends following the Infinium MethylationEPIC BeadChip protocol for analysis of human samples and
the Infinium Mouse Methylation BeadChip protocol for supporting studies with mouse models of cancer.
Application Description
Calculates methylation levels (beta values) and analyzes differential methylation levels between
GenomeStudio Software
experimental groups. Single-site resolution data can be visualized as line plots, bar graphs, scatter plots,
Methylation Module
histograms, dendrograms, box plots, or heat maps.
For Research Use Only. Not for use in diagnostic procedures. M-GL-00029 v01 35
Illumina • 1.800.809.4566 toll-free (US) • +1.858.202.4566 tel • techsupport@illumina.com • www.illumina.com
For Research Use Only. Not for use in diagnostic procedures.
© 2021 Illumina, Inc. All rights reserved. All trademarks are the property of Illumina, Inc. or their respective owners. For specific trademark
information, see www.illumina.com/company/legal.html. M-GL-00029 v01