Professional Documents
Culture Documents
Artigo Nature
Artigo Nature
Similar to the evolution in species, the approximately 1014 cells in the about the times when these lesions arise during somatic evolution and
human body are subject to the forces of mutation and selection1. This where the boundary between normal evolution and cancer progression
process of somatic evolution begins in the zygote and only comes to should be drawn.
rest at death, as cells are constantly exposed to mutagenic stresses, Sequencing of bulk tumour samples enables partial reconstruction of
introducing 1–10 mutations per cell division2. These mutagenic forces the evolutionary history of individual tumours, based on the catalogue
lead to a gradual accumulation of point mutations throughout life, of somatic mutations they have accumulated3,14,15. These inferences
observed in a range of healthy tissues5–11 and cancers12. Although these include timing of chromosomal gains during early somatic evolution16,
mutations are predominantly selectively neutral passenger mutations, phylogenetic analysis of late cancer evolution using matched primary
some are proliferatively advantageous driver mutations13. The types and metastatic tumour samples from individual patients17–20, and tem-
of mutation in cancer genomes are well studied, but little is known poral ordering of driver mutations across many samples21,22.
1
European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK. 2European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.
3
Wellcome Sanger Institute, Cambridge, UK. 4The Francis Crick Institute, London, UK. 5Broad Institute of MIT and Harvard, Cambridge, MA, USA. 6Big Data Institute, University of Oxford, Oxford,
UK. 7University of Cambridge, Cambridge, UK. 8University of Toronto, Toronto, Ontario, Canada. 9Vector Institute, Toronto, Ontario, Canada. 10Molecular and Medical Genetics, Oregon Health &
Science University, Portland, OR, USA. 11The University of Texas MD Anderson Cancer Center, Houston, TX, USA. 12German Cancer Research Center (DKFZ), Heidelberg, Germany. 13Heidelberg
University, Heidelberg, Germany. 14University of Ljubljana, Ljubljana, Slovenia. 15NorthShore University HealthSystem, Evanston, IL, USA. 16Cancer Research UK Cambridge Institute, University of
Cambridge, Cambridge, UK. 17Simon Fraser University, Burnaby, British Columbia, Canada. 18Vancouver Prostate Centre, Vancouver, British Columbia, Canada. 19University of Melbourne,
Melbourne, Victoria, Australia. 20Walter and Eliza Hall Institute, Melbourne, Victoria, Australia. 21University of Leuven, Leuven, Belgium. 22Weill Cornell Medicine, New York, NY, USA. 23New York
Genome Center, New York, NY, USA. 24University of California Santa Cruz, Santa Cruz, CA, USA. 25Ontario Institute for Cancer Research, Toronto, Ontario, Canada. 26University of California, Los
Angeles, CA, USA. 27Peter MacCallum Cancer Centre, Melbourne, Victoria, Australia. 28Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA. 29Department of
Pathology, Massachusetts General Hospital, Boston, MA, USA. 30Harvard Medical School, Boston, MA, USA. 31Dana-Farber Cancer Institute, Boston, MA, USA. 32Indiana University, Bloomington,
IN, USA. 33The University of Chicago, Chicago, IL, USA. 34University of Cologne, Cologne, Germany. 35University of Helsinki, Helsinki, Finland. 36University of Glasgow, Glasgow, UK. 37A list of
members and their affiliations appears at the end of the paper. 38Oxford NIHR Biomedical Research Centre, Oxford, UK. 39A list of members and their affiliations appears in the Supplementary
Information. 40These authors contributed equally: Moritz Gerstung, Clemency Jolly, Ignaty Leshchiner, Stefan C. Dentro, Santiago Gonzalez. 41These authors jointly supervised this work: Paul T.
Spellman, David C. Wedge, Peter Van Loo. *e-mail: moritz.gerstung@ebi.ac.uk; peter.vanloo@crick.ac.uk
Density
2 copies per cell
chromosomes
0.2 1 copy per cell
<1 copy per cell 0
0
Bi-allelic gain 1 2 3 4 5 6 7 8 10 12 14 16 20 X Y
Major allele Minor allele Structural variant
Genomic position (chromosome)
6
number
Molecular time Copy
Asynchronous: < 80% co-occurring
0
SA542034, 90 years old, Lymph–BNHL, ploidy = 2.2
Clonal allelic status of point mutations Mono-allelic gain CN LOH Bi-allelic gain
Density
of t(3p;5q), part
Clonal Mutation on 1 copy per cell, Unbalanced t(3p;5q): +5q, –3p t(3p;5q)
either on amplified or retained allele 0 of WGD Zygote
1 2 3 4 5 6 7 8 9 10 12 14 161820 X Y 0
Subclonal Mutation on <1 copy per cell Genomic position (chromosome) 1 2 3 4 5 6 7 8 10 12 14 16 20 X Y
Genomic position (chromosome)
Sample
n = 149 Near diploid
CNS−Medullo (n = 119) 400
WGD synchronous asynchronous
Breast−AdenoCA (n = 189) n = 542 n = 347
No. of affected
300
chromosomes
20
Samples
Bladder−TCC (n = 23) 1 5 10 15 X
Near diploid 10
Ovary–AdenoCA, n = 113 synchronous
Stomach−AdenoCA (n = 65) Molecular WGD 200 7
n = 468 6
Ovary−AdenoCA (n = 112) time uninformative 5
WGD n = 129 4
Sample
Lymph−BNHL (n = 104) Late 100
clonal 3
Eso−AdenoCA (n = 94) 0 2
1.0
Near diploid uninformative
Panc−AdenoCA (n = 223)
d
ed
n = 1,143
ve
ct
er
Bone−Osteosarc (n = 35)
pe
bs
0.5
Ex
O
Uterus−AdenoCA (n = 50)
1 5 10 15 X
Kidney−CCRCC (n = 90)
0 Kidney–CCRCC, n = 111
g h
Head−SCC (n = 53) Early WGD
Rel. frequency
0.4
Proportion
ColoRect−AdenoCA (n = 56) clonal 0.3 0.15
Freq. 0.2 0.10
Sample
Biliary−AdenoCA (n = 25)
0.1 0.05
Liver−HCC (n = 302) of gain
0 0
100%
W D ync
N un c
as c
c
N un f
N D s inf
D in
Lung−SCC (n = 48) 10% 0 100
D n
D yn
yn
G sy
G s
W a
Lung−AdenoCA (n = 36) Latency of second gain (%)
D
G
W
Skin−Melanoma (n = 103)
1 5 10 15 X
Kidney−PapRCC (n = 33) Chromosome
Fig. 1 | Timing clonal copy number gains using allele frequencies of point in Supplementary Table 1. d, Heat maps representing molecular timing
mutations. a, Principles of timing mutations and copy number gains based on estimates of gains on different chromosome arms (x axis) for individual
whole-genome sequencing. The number of sequencing reads reporting point samples (y axis) for selected tumour types. e, Temporal patterns of two
mutations can be used to discriminate variants as early or late clonal (green or near-diploid cases illustrating synchronous gains (top) and asynchronous
purple, respectively) in cases of specific copy number gains, as well as clonal gains (bottom). f, Left, distribution of synchronous and asynchronous gain
(blue) or subclonal (red) in cases without. b, Annotated point mutations in one patterns across samples, split by WGD status. Uninformative samples
sample based on VAF (top), copy number (CN) state and structural variants have too few or too small gains for accurate timing. Right, the enrichment
(middle), and resulting timing estimates (bottom). LOH, loss of heterozygosity. of synchronous gains in near-diploid samples is shown by systematic
c, Overview of the molecular timing distribution of copy number gains across permutation tests. g, Proportion of copy number segments (n = 90,387) with
cancer types. Pie charts depict the distribution of the inferred mutation time secondary gains. Error bars denote 95% credible intervals. ND, near diploid.
for a given copy number gain in a cancer type. Green denotes early clonal gains, h, Distribution of the relative latency of n = 824 secondary gains with available
with a gradient to purple for late gains. The size of each chart is proportional to timing information, scaled to the time after the first gain and aggregated per
the recurrence of this event. Abbreviations for each cancer type are defined chromosome.
The PCAWG Consortium has aggregated whole-genome sequenc- in a single cell, which gives rise to a lineage of cells bearing the same
ing data from 2,658 cancers4, generated by the ICGC and TCGA, and mutation. If that chromosomal locus is subsequently duplicated, any
produced high-accuracy somatic variant calls, driver mutations, and point mutation on this allele preceding the gain will subsequently be
mutational signatures4,23,24 (Methods and Supplementary Information). present on the two resulting allelic copies, unlike mutations succeeding
Here, we leverage the PCAWG dataset to characterize the evolution- the gain, or mutations on the other allele. As sequencing data enable the
ary history of 2,778 cancer samples from 2,658 unique donors across measurement of the number of allelic copies, one can define categories
38 cancer types. We infer timing and patterns of chromosomal evolu- of early and late clonal variants, preceding or succeeding copy number
tion and learn typical sequences of mutations across samples of each gains, as well as unspecified clonal variants, which are common to all
cancer type. We then define broad periods of tumour evolution and cancer cells, but cannot be timed further. Lastly, we identify subclonal
examine how drivers and mutational signatures vary between these mutations, which are present in only a subset of cells and have occurred
epochs. Using clock-like mutational processes, we map mutation timing after the most recent common ancestor (MRCA) of all cancer cells in
estimates into approximate real time. Combined, these analyses allow the tumour sample (Supplementary Information).
us to sketch out the typical evolutionary trajectories of cancer, and map The ratio of duplicated to non-duplicated mutations within a gained
them in real time relative to the point of diagnosis. region can be used to estimate the time point when the gain hap-
pened during clonal evolution, referred to here as molecular time,
which measures the time of occurrence relative to the total num-
Reconstructing the life history of tumours ber of (clonal) mutations. For example, there would be few, if any,
The genome of a cancer cell is shaped by the cumulative somatic aber- co-amplified early clonal mutations if the gain had occurred right
rations that have arisen during its evolutionary past, and part of this after fertilization, whereas a gain that happened towards the end of
history can be reconstructed from whole-genome sequencing data3 clonal tumour evolution would contain many duplicated mutations14
(Fig. 1a). Initially, each point mutation occurs on a single chromosome (Fig. 1a, Methods).
Fraction of point
Timing category: Clonal (early) Clonal (late) Clonal (unsp.) Subclonal NA
1
800
between tumour types, whereas within tumour types, different chro-
ph SCC
My mph HL
My eloid– CLL
Ov yeloid–M L
Bo Bon BenigC
Bre e−O e−Ep n
ast steo ith
Bre BreAden sarc
rvix bu CIS
S− −G C
Co CN CNS edullM
loR S− −O o
ect Pilo ligo
o− deno tro
Kid ney–CCR C
ne Ch CC
ap C
C
ary id DS
de PN
StoTissu −Leio ma
ch Lipo yo
UteThy–Aden sarc
–A no A
de CA
Ce −Lo −D A
Ce den CA
CNCNS –SC A
CA
CA
CA
rin
elo AM
oC
rus de oC
ast ast oC
rvix oC
Ad C
y–P RC
RC
HC
− C
M B
600
m
As
Ly −BN
–A –M
no
–A lar
no
no
o
mosomes typically show similar distributions (Fig. 1c, Extended Data
n e −T
oc
−
L en
en
So issu elan
−
er–
de
Bla-Ade
nd
d
Ad
Liv
A
−A
–A
A
M
ma e−
–E
st−
−
−
e
Figs. 1, 2, Supplementary Information). In glioblastoma and medul-
e
y
So kin−
400
nc
ng
nc
n
iar
M
Es
Pro
n
Pa
Lu
Pa
Bil
S
ftT
ft
200 loblastoma, a substantial fraction of gains occurs early in molecular
0
time. By contrast, in lung cancers, melanomas and papillary kidney
1 cancers, gains arise towards the end of the molecular timescale. Most
type
0
tumour types, including breast, ovarian and colorectal cancers, show
b Preferentially early/clonal NS relatively broad periods of chromosomal instability, indicating a very
>50
variable timing of gains across samples.
10
Odds ratio
KDBAP1
EB 1
SP H1
NFMT2C1
KN 1
ARPTENA
AP 4
FB NFB
M 2M
CDG1
AR MAPIDH1
FOF3B1
AX 5C
AL 4
XW 1
CT ER A
1
SM ARCH1
PBRAFL
SM T2D
OB
PT AP1
VR 1
BTIN1
KMID1A
V C
NOETDM
KDGFRA
DD TA3
B S
PIKRAS3
APXA1
AR ID2
A OP
RBP
L2
T 2
NR 7
KETRX
S X3X
CDNNBT
HG 3K
CR RM
K EN
AD
CA
CH
B H
AC 3 R
K P5
2
3C
E 2
B
S AT
E2
M
timed within the first 10% of molecular time, which suggests that they
T
A
T
Driver mutation arise very early in a patient’s lifetime. In the case of trisomy 7, typically
c TP53 d less than 3 out of 600 single nucleotide variants (SNVs) on the whole
>50
chromosome precede the gain (Extended Data Fig. 3a, b). On the basis
of driver mutations
contributing 50%
30
No. of genes
10
Odds ratio
al
late
0.1
nsp
arl
lon
al (
l (u
)
oC (24)
Pro Head CA )
Lu ph− ma ( )
−A BNH 17)
Liv noCA 97)
Sk −Ade CC )
CN −TC 14)
So iliary −GB (10)
−L oCA )
Ute ct−Ad noCA 1)
my (8)
−A oCA 58)
Me noCA 3)
o− enoC 108)
ma Lung CC )
7)
bc
Bla deno L (22
no (30
o (24
2
(32
(62
iss Ade M (1
(4
(9
(2
o(
na
n
na
Su
er CA (
(
(1
C
A
(
Clo
Clo
Bre −Ade CA
n
eio
H
−S
en
Lym an
−
er−
de
en
n
S
d
l
Ad
dd
Ov c−Ad
−A
−
ue
ng
ast
rus
c
ary
Es
e
n
ftT
B
Pa
Preceding mutation
67%
reconstructed order
1 Add Aggregate 36%
2
Mutation
probabilities pairwise 66%
3 of pairwise data 63%
ordering into ranking 81%
n 28%
30%
86%
Indeterminate order of Unambigously ordered
pair of mutations,
mutations, e.g. both (+ties; not shown) 0 2 4 6 8
before WGD e.g. clonal before subclonal Relative rank (prevalence)
e Aggregated ranking f g
ColoRect–AdenoCA, n = 58 Panc–Endocrine, n = 79 CNS–GBM, n = 41 Prevalence*
Prevalence* EGFR Odds 27%
Prevalence* –16 Odds 47%
–11
–10 late 88%
APC Odds 79% late 70% TP53 > 10 27%
Mutation, preferential timing (early, intermediate/variable, late)
Fig. 3 | Aggregating single-sample ordering reveals typical timing of neuroendocrine cancer (Panc–Endocrine) (f) and glioblastoma (CNS–GBM) (g).
driver mutations. a, Schematic representation of the ordering process. Probability distributions show the uncertainty of timing for specific
b–d, Examples of individual patient trajectories (partial ordering relationships), events in the cohort. Events with odds above 10 (either earlier or later) are
the constituent data for the ordering model process. e–g, Preferential ordering highlighted. The prevalence of the event type in the cohort is displayed as a bar
diagrams for colorectal adenocarcinoma (ColoRect–AdenoCA) (e), pancreatic plot on the right.
Aggregating the clonal status of all driver point mutations over occur after tumours have accumulated several driver mutations,
time reveals an increased diversity of driver genes mutated at and many chromosomal gains and losses are typically late. These
later stages of tumour development: 50% of all early clonal driver results are in agreement with the classical APC-KRAS-TP53 pro-
mutations occur in just 9 genes, whereas 50% of late and subclonal gression model of Fearon and Vogelstein35, but add considerable
mutations occur in approximately 35 different genes each, a nearly detail.
fourfold increase (Fig. 2d). Consistent with previous studies of indi- In many cancer types, the sequence of events during cancer progres-
vidual tumour types31–34, these results suggest that, in general, the sion has not previously been determined in detail. For example, in
very early events in cancer evolution occur in a constrained set of pancreatic neuroendocrine cancers, we find that many chromosomal
common drivers, and a more diverse array of drivers is involved in losses, including those of chromosomes 2, 6, 11 and 16, are among
late tumour development. the earliest events, followed by driver mutations in MEN1 and DAXX
(Fig. 3c, f). WGD events occur later, after many of these tumours have
reached a pseudo-haploid state due to widespread chromosomal
Relative timing of somatic driver events losses. In glioblastoma, we find that the loss of chromosome 10, and
Although timing estimates of individual events reflect evolutionary driver mutations in TP53 and EGFR are very early, often preceding
periods that differ from one sample to another, they define in part early gains of chromosomes 7, 19 and 20 (Fig. 3d, g). Mutations in the
the order in which driver mutations and copy number alterations have TERT promoter tend to occur at early to intermediate time points,
occurred in each sample (Fig. 3a–d). As confirmed by simulations, whereas other driver mutations and copy number changes tend to
aggregating these orderings across samples defines a probabilistic be later events.
ranking of lesions (Fig. 3a), recapitulating whether each mutation Across cancer types, we typically find TP53 mutations among the
occurs preferentially early or late during tumour evolution (Extended earliest events, as well as losses of chromosome 17 (Supplementary
Data Figs. 4, 5, Supplementary Information). Information). WGD events usually have an intermediate ranking, and
In colorectal adenocarcinoma, for example, we find APC mutations most copy number changes occur later. Losses typically precede gains,
to have the highest odds of occurring early, followed by KRAS, loss and consistent with the results above, common drivers typically occur
of 17p and TP53, and SMAD4 (Fig. 3b, e). Whole-genome duplications before rare drivers.
SNVs
0 0
C>A C>G C>T T>A T>C T>G C>A C>G C>T T>A T>C T>G undirected temporal variability over several orders of magnitude
b SA557034, Breast−AdenoCA, 82 years old
(Fig. 4c, d, Extended Data Fig. 7). In addition, several signatures dem-
7% APOBEC (SBS2&13)
Clonal 64% APOBEC (SBS2&13) Subclonal 85% defective DSBR onstrate distinct temporal trends. As one may expect, signatures of
3,500 32% defective DSBR 70 (SBS3)
exogenous mutagens are predominantly active in the early clonal
SNVs
SNVs
(SBS3)
0 0 stages of tumorigenesis. These include tobacco smoking in lung adeno-
C>A C>G C>T T>A T>C T>G C>A C>G C>T T>A T>C T>G
carcinoma (signature SBS4, median fold change 0.43, IQR 0.31–0.72),
c Late clonal consistent with previous reports37,38, and ultraviolet light exposure in
≥100
melanoma (SBS7; median fold change 0.16, IQR 0.09–0.43). Another
10
strong decrease over time is found for a signature of unknown aetiol-
Fold change
1 ogy, SBS12, which acts mostly in liver cancers (median fold change 0.22,
0.1 IQR 0.06–0.41). In chronic lymphoid leukaemia, there was a 20-fold
relative decrease in mutations associated with somatic hypermuta-
≤0.01
Early clonal tion (SBS9; median fold change 0.05, IQR 0.02–0.43) from clonal to
subclonal stages.
3 Stinu g
S e
1 ID - S tc.
- C > ng
-A B m
ck G
- A 13 SM
S1 D SM
- P ew f.
-R 7
- 6
SB 1 - - S BS V
S5 C mo 12
D 17
m DBS8
ID ing S3
DBUV
4* 8 BS9
M HE4
R J
o R 0
B 1
8
ID B UV
DB D OS
SB - S ?
SB EC
DBacc SB S4 .
SB -lik
la in
B f
SB S5 ch de
8 BS
S1 BS
-M - S
PO S4
S3
2 EC
S U
lo Tp
ob - D S de
1 e
pG ki
ok B
-
PO -
-S
SBBS
-T 3
&1
S
S1
9 BS
S
S2
S2
DB
S6
S2 S
SBS13) increases in many cancer types from the early to late clonal
DB
SB
Subclonal
SB
d ≥100 stages (median fold change 2.0, IQR 0.8–3.6), as does a newly described
signature SBS38 (median fold 3.6, IQR 1.8–11). Signatures of defective
10
Fold change
mismatch repair (SBS6, 14, 15, 20, 21, 26 and 44) increase from clonal
1
to subclonal stages (median fold 1.8, IQR 1.2–3.0).
0.1
≤0.01
Clonal Chronological time estimates
The molecular timing data presented above do not measure the
f.
–4 DAP tin ng
S2S3 ID SBS1 e
ID SB- U .
SBS4 .
SBTp g
4* B OB um
- C S1G
S1 4 - S SSM
- CSmBS M
SB et f.
2 SV
SBBS 1 - SS 2
pG ok 17
lo B 6
DBk-lik8
- S D 3 S30
SB &1 5 - ch S4J
m SB - U 0
S1
- A DB 28
PODBS4
SBEC6
DB 22
DBS9
- T ID18 DB 7
8 - R S3
S6 3 - Plaewi 0
R V
S2 BScco S HES
S?
C
m
c
ok R V
de
> in
in de
S ID - 1
c S
B S
M -U
-N O
- MS1 E
S7per
B
g
D
SBhy
S
at
SB
-
S5
om
SB D ba
mutations are acquired per year in each sample was constant, the
1
S1
SB
o
-S
DBSB
DB
S9
*SBS6,14,15,20,21,26,44
SB
SB
Latency (year)
Rate TP53*, FBXW7 SMAD2, SMAD4, PTEN CNA: –8p, Sigs:
rate
CpG>TpG mutations
400 increase CNA: –17p, +8q, –18 +20q, +7p, +20q11.21, –4q35.1, DBS: 3
MRCA 20 5x increase Signatures:
1× Sigs: +13q, +2q, –14q, Sigs: DBS: 8, DBS: 4 SBS:
300
7.5× ID: 1, 2, SBS: 1, 5, 18 ID: 1, 2, SBS: 1, 5, 6–44, 10, 18, 28 SBS: 17, 40 40
10×
200 10 20×
WGD 20×
100 –5 years Last detectable subclone
0
0 Fertilized WGD (38% samples)
t− S sa C
Ad −G rc
St P LymLiv en BM
h− d BN C
te L Adeno HL
Skidney−Cen CC
e g e R C
Paas −A lan CC
−E d no a
N e S e
ftTiss −A M TCC
O ssue−Ldendul C
va e ip o lo
deom rc
no yo
ec CN teoSC A
acc− h− HC A
Ki us un noCA
K ne Ad −S CA
Br un −M ChRC A
B HeadocoCA
So C ladd d− rinA
ry −L o CA
A
MRCA Diagnosis
nc −A e m
oR Osix− oC
oman p er oC
L in y− C C
n en C
C
So tT so S− r− C
−A ei sa
o
o
0 10 30 50 70 egg –3.3 years (1.9, 9.7) –0.4 years (0.1, 1.8) 67 years,
C ne ervden
e
e
d − g
d
Bo C−A
A
Age − (57, 74)
b
ry
i u
Lung squamous cell carcinoma
lia
E
Bi
U
ol
f
c Preferentially early Variable/constant Late Subclonal
14 Drivers: CDKN2A Drivers: NFE2L2, TP53, KMT2D, Sigs:
12
Latency (year)
h R PN
C erv −S A
K su Pa noCrc
nc A eno LL
Lu L ste HC o
oR CAd −SCrc
So Cast−AdenoC A
yo
Ki ye nd oC A
−A e n M
om ary p −T a
ac −A h-B CC
ry Ad GB A
So Kid nc−−Li As A
ftT ne Adpos tro
os m O A
−C M e
O ect NS no C
U idn e−L pR A
Pa hy− Ad −C o
C ead noCA
Bo −M−SCC
ne Liv ed C
ng un os C
ru − io C
Pr Ly NS−noCC
C
ftT N Ad no A
Pasue PilonoCA
dn lo oc A
h− de NH
St ili m er m
ey id– rin
T t− ph lig
e − e C
M E en C
va − − C
H de oC
is S− e C
N ix C
−O er− ul
te ey e C
C de RC
C
is y− e a
− g a
s− CC m
Br so d o
Ly dd ano
− d
Bl Me
Diagnosis
E
Sk
ol
68 years,
C
To our knowledge, our study presents the first large-scale genome-wide Fertilized WGD (40% samples)
Diagnosis
67 years,
reconstruction of the evolutionary history of cancers, reconstructing egg –4 years (2.3, 11) MRCA (58, 74)
–0.6 years (0.1, 2.8)
both early (pre-cancer) and later stages of 38 cancer types. This is facili-
tated by the timing of copy number gains relative to all other events Fig. 6 | Typical timelines of tumour development. a–d, Timelines
representing the length of time, in years, between the fertilized egg and the
in the genome, through multiplicity and clonal status of co-amplified
median age of diagnosis for colorectal adenocarcinoma (a), squamous cell lung
point mutations. However, several limitations exist (Supplementary
cancer (b), ovarian adenocarcinoma (c) and pancreatic adenocarcinoma (d).
Information). Perhaps most importantly, molecular timing is based
Real-time estimates for major events, such as WGD and the emergence of the
on point mutations and is therefore subject to changes in mutation
MRCA, are used to define early, variable, late and subclonal stages of tumour
rate. Notably, healthy tissues acquire point mutations at rates not too evolution approximately in chronological time. The range of chronological
dissimilar from those seen in cancers, particularly when considering time estimates according to varying clock mutation acceleration rates is shown
only endogenous mutational processes, and furthermore, some tissues as well, with tick marks corresponding to 1×, 2.5×, 5×, 7.5×, 10× and 20×. Driver
are riddled with microscopic clonal expansions of driver gene mutations and copy number alterations (CNA) are shown in each stage
mutations5–9,11. This is direct evidence that the life history of almost according to their preferential timing, as defined by relative ordering.
every cell in the human body, including those that develop into cancer, Mutational signatures (Sigs) that, on average, change over the course of tumour
is driven by somatic evolution. evolution, or are substantially active but not changing, are shown in the epoch in
Together, the data presented here enable us to draw approximate which their activity is greatest. DBS, double base substitution; SBS, single base
timelines summarizing the typical evolutionary history of each cancer substitutions. Where applicable, lesions with a known timing from the literature
type (Fig. 6, Supplementary Information for all other cancer types). are annotated; dagger symbols denotes events that were found to have a
different timing; asterisk symbol denotes events that agree with our timing.
These make use of the qualitative timing of point mutations and copy
number alterations, as well as signature activities, which can be inter-
leaved with the chronological estimates of WGD and the appearance
of the MRCA. suggesting an epistatic fitness landscape canalizing the first steps of
It is remarkable that the evolution of practically all cancers displays cancer evolution. Over time, as tumours evolve, they follow increasingly
some level of order, which agrees very well with, and adds much detail diverse paths driven by individually rare driver mutations, and by copy
to, established models of cancer progression35,46. For example, TP53 number alternations. However, none of these trends is absolute, and
with accompanying 17p deletion is one of the most frequent initiating the evolutionary paths of individual tumours are highly variable, show-
mutations in a variety of cancers, including ovarian cancer, in which it ing that cancer evolution follows trends, but is far from deterministic.
is the hallmark of its precancerous precursor lesions47. Furthermore, Our study sheds light on the typical timescales of in vivo tumour
the list of typically early drivers includes most other highly recurrent development, with initial driver events seemingly occurring up to
cancer genes, such as KRAS, TERT and CDKN2A, indicating a preferred decades before diagnosis, demonstrating how cancer genomes are
role in early and possibly even pre-cancer evolution. This initially con- shaped by a lifelong process of somatic evolution, with fluid bound-
strained set of genes broadens at later stages of cancer development, aries between normal ageing processes5–11 and cancer evolution.
32. Gibson, W. J. et al. The genomic landscape and evolution of endometrial carcinoma Computer Science, Princeton University, Princeton, NJ, USA. 44Korea University, Seoul, South
progression and abdominopelvic metastasis. Nat. Genet. 48, 848–855 (2016). Korea. 45Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
Competing interests R.B. owns equity in Ampressa Therapeutics. G.G. receives research funds
from IBM and Pharmacyclics and is an inventor on patent applications related to MuTect,
Code availability ABSOLUTE, MutSig, MSMuTect and POLYSOLVER. I.L. is a consultant for PACT Pharma. B.J.R. is
The core computational pipelines used by the PCAWG Consortium for a consultant at and has ownership interest (including stock and patents) in Medley
Genomics. All other authors declare no competing interests.
alignment, quality control and variant calling are available to the public
at https://dockstore.org/search?search=pcawg under the GNU General Additional information
Public License v3.0, which allows for reuse and distribution. Analysis Supplementary information is available for this paper at https://doi.org/10.1038/s41586-019-
1907-7.
code presented in this study is available through the GitHub reposi- Correspondence and requests for materials should be addressed to M.G. or P.V.L.
tory https://github.com/PCAWG-11/Evolution. This archive contains Reprints and permissions information is available at http://www.nature.com/reprints.
Extended Data Fig. 1 | Summary of all results obtained for colorectal clonal and late clonal stages, per patient. Green and purple indicate,
adenocarcinoma (n = 60) as an example. a, Clustered heat maps of mutational respectively, a signature decrease and increase in late clonal from early clonal
timing estimates for gained segments, per patient. Colours as indicated in mutations. Inactive signatures are coloured white. e, As in d but for clonal
main text: green represents early clonal events, purple represents late clonal. versus subclonal stages. Blue indicates a signature decrease and red an
b, Relative ordering of copy number events and driver mutations across all increase in subclonal from clonal mutations. f, Typical timeline of tumour
samples. c, Distribution of mutations across early clonal, late clonal and development. Similar result summaries for all other cancer types can be found
subclonal stages, for the most common driver genes. A maximum of 10 driver in the Supplementary Information (pages 46–77).
genes are shown. d, Clustered mutational signature fold changes between early
Article
Extended Data Fig. 2 | Comparison of methods used for timing of individual copy number gains. a, b, Pairwise comparison of the three approaches for timing
individual copy number gains. c, Comparison using simulated data, showing high concordance.
Extended Data Fig. 3 | Early copy number gains in brain cancers. a, Three n = 34 GBM samples with trisomy 7. c, Medulloblastoma example with
illustrative examples of glioblastoma with trisomy 7. The red arrow depicts the isochromosome 17q. d, Distributions of SNVs on 17q in n = 95 samples with
expected VAF cluster of point mutations preceding trisomy 7, which usually isochromosome 17q; 74 out of 95 samples have less than 1 SNV preceding the
contains less than three SNVs. b, Distributions of the number of SNVs isochromosome.
preceding trisomy 7 and total number of mutations on chromosome (chr) 7 in
Article
Extended Data Fig. 5 | Correlation between the league model and Bradley– mutations and copy number events. Axes indicate the ordered events observed
Terry model ordering. Direct comparison for each tumour type of the league in the respective tumour types. Correlation is quantified by Spearman’s rank
and Bradley–Terry models for determining the order of recurrent somatic correlation coefficient. A total of n = 756 ordered events are shown.
Extended Data Fig. 6 | Examples of mutation spectrum changes across b, Three examples of tumours with substantial changes between mutation
tumour evolution. a, Three examples of tumours with substantial changes spectra of clonal (top) and subclonal (bottom) time points.
between mutation spectra of early (top) and late (bottom) clonal time points.
Article
Extended Data Fig. 7 | Overview of early-to-late clonal and clonal-to- each signature. Signatures are split into three categories: (1) clock-like,
subclonal signature changes across tumour types. a, b, Pie charts comprising the putative clock signatures 1 and 5; (2) frequent, which are
representing signature changes per cancer type for early-to-late clonal signatures present in ten or more cancer types; and (3) cancer-type specific,
signature changes (a) and clonal-to-subclonal signature changes (b). which are in fewer than ten cancer types and are often limited to specific
Signatures that decrease between early and late are coloured green; signatures cohorts.
that increase are purple. The size of each pie chart represents the frequency of
Extended Data Fig. 8 | See next page for caption.
Article
Extended Data Fig. 8 | Age-dependent mutation burden and relapse samples d, Mutation rate inferred from cancer as in b and from selected normal tissue
indicate near-normal CpG>TpG mutation rate in cancer, with moderate sequencing studies of n = 140 normal haematopoietic stem cells, n = 1 normal
acceleration during carcinogenesis. a, Across all cancer samples, a skin sample, n = 182 samples from normal endometrium, and n = 445 normal
predominantly linear accumulation of CpG>TpG mutations (scaled to copy colonic crypts; error bars denote the 95% confidence interval. e, Median
number) is observed over time, as measured by the age at diagnosis. b, Cancer- fraction of mutations attributed to linear age-dependent accumulation, based
specific analysis of the CpG>TpG mutation burden as a function of age at on estimates from b and the age at diagnosis for each sample. Error bars denote
diagnosis for n = 1,978 samples of 34 informative cancer types. The dotted line the 95% credible interval. f, g, CpG>TpG mutations per gigabase for ovarian
denotes the median mutations per year (that is, not offset), and shading cancer (f) and breast cancer (g) samples with matched primary and relapse
denotes the 95% credible interval of a hierarchical Bayesian linear regression samples. h, Increase in CpG>TpG mutation rate inferred from paired primary
model across all data points. Slope and intercepts are drawn for each cancer and relapse samples for six cancer types. Bars denote the range of the rate
type from a gamma distribution, respectively; inference was done by increase for different scenarios of copy number evolution, assuming ploidy
Hamiltonian Monte Carlo sampling. c, Maximum a posteriori estimates of changes have occurred prior (upper value) or posterior (lower value) to the
rate and offset for 34 cancer types with 95% credible intervals as defined in b. branching between primary and relapse sample.
Extended Data Fig. 9 | Real-time estimates indicate long latencies for some CpG>TpG mutations (y axis) as a function of the mutation time estimate of
samples caused by the absence of early mutations. a, Time of WGD for n = 571 WGD (x axis). Colours and fit as in c. Early molecular timing is thus caused by a
individual patients, split by tumour type with an estimated mutation rate depletion of early CpG>TpG mutations, rather than an inflation of late
increase of 5×, except for ovary–adenocarcinoma (7.5×) and CNS (2.5×). Error CpG>TpG mutations. e, Estimated median WGD latency of n = 571 patients as in
bars represent 80% confidence intervals, reflecting uncertainty stemming a for fixed (x axis) versus patient specific rate increases, depending on the
from the number of mutations per segment and onset of the rate increase. Box observed CpG>TpG mutation burden, allowing for a higher (up to 10×)
plots demarcate the quartiles and median of the distribution with whiskers mutation rate increase in samples with more mutations (y axis). Error bars
indicating 5% and 95% quantiles. b, Scatter plots showing the time of diagnosis denote the IQR. f, Timing of subclonal diversification using CpG>TpG
(x axis) and inferred time of WGD (y axis) with error bars as in a. c, Scatter plot of mutations in n = 1,953 individual patients. Box plots and error bars for data
early (co-amplified) CpG>TpG mutations (y axis) as a function of the mutational points as in a. g, Comparison of the median duration of subclonal
time estimate of WGD (x axis). The black line denotes a nonlinear loess fit with diversification per cancer type assuming branching and linear phylogenies.
95% confidence interval. Colours define the cancer type as in a. d, Total
12345656762589
5653
17425
C$(Ek('"%
!"#$"%&'()*+, A(m&"-
-&('!#&(#./&'()*+,n4(o=pqrs
03&('0!&(4)5$$")%1 ' 22& /
($2!6()!#'4$.$7$(/8()59()&(5!'.7$):;)$82!6$# ('4('84"$("4/&"#(&"!&"4/
$"!($"%:<8'()$"82&($""3&('0&4)!7$4$=>'()?08 &"#()@#$($&7A7$4/)497$(:
1(&($($4
<&77(&($($4&7&"&7/=4"8$2()&(()8775$"%$(2 &!"($"()8$%'7%"#=(&.77%"#=2&$"(B(=C()# 4($":
"D& "8$2#
t ;)B&4(&2!7 $E*F+8&4)B!$2"(&7%'!D4"#$($"=%$6"& &#$4("'2.&"#'"$(82&'2"(
t > (&(2"("5)()2&'2"(5(&9"82#$($ "4(&2!7 5)()() &2 &2!75& 2&'#!&(#7/
t; ) (&($($4&7((*+'#>3G5)()()/&"H(5H$##
IFJKLMNOONFLPQRPRLRSNTJULVQLUQRMWXVQULRNJQJKLVKLFYOQZLUQRMWXVQLONWQLMNO[JQ\LPQMSFX]TQRLXFLPSQL^QPSNURLRQMPXNF_
t >#4$ !($"8&7746&$&( ((#
t >#4$ !($"8&"/& '2!($" 44($"='4)& ((8"2&7$(/&"#&#`'(2"(82'7($!742!&$"
t >8 '77#4$!($"8() (&($($4&7!&&2($"47'#$"%4"(&7("#"4/*:%:2&"+().&$4($2&( *:%:% $"488$4$"(+
>3G6&$&($"*:%:(&"#&##6$&($"+& 4$&(#($2&( 8'"4(&$"(/*:%:4"8$#"4$"(6&7+
t <"'7 7)/!()$(($"%=()(((&($($4*:%:a=P=W+5$()4"8$#"4$"(6&7=884($E=#% 88#2&"#b6&7'"(#
cXdQLbLdYJTQRLYRLQ\YMPLdYJTQRLeSQFQdQWLRTXPYVJQ_
t <f&/$&"&"&7/$=$"82&($""()4)$48!$&"#C&964)&$"C"(&7 (($"%
t <)$&4)$4&7&"#42!7B#$%"=$#"($8$4&($"8()&!!!$&(7678((&"#8'77!($"%8'(42
t @($2&( 8884($E *:%:)"gU=A&"gW+=$"#$4&($"%)5()/54&74'7&(#
ITWLeQVLMNJJQMPXNFLNFLRPYPXRPXMRLhNWLVXNJNiXRPRLMNFPYXFRLYWPXMJQRLNFLOYFKLNhLPSQL[NXFPRLYVNdQ_
18(5&&"#4#
A7$4/$"82&($"&.'(&6&$7&.$7$(/842!'(4#
G&(&4774($" G& (&&"#2(&#&(&54774(#82u"("&($"&7&"4k"2"($'2*uk+4"($'222.'$"%4'(2 8(5&
!&49&% #$%"#./()ukG&(&#$"&($"%"(:;)%"&7H!'!47$.&$ &"#'($7$($ '"#7/$"%()$ 8(5&)&6
."7&#'"#()kA-6v!" '47$4"& ()wn6('w!&49&%&"#&&6&$7&.7&()((!,DD555:6(':.$:n()#&(&
4774($" 8(5&'#$"()$88(='4)& ukH!4$8$4!(&7'$"(8&4=&&6&$7&.7'!"x'((4"(&4(y6(':.$:
G&(&&"&7/$ ;)A>jk59875 B4'($"%4jk1&7$%"2"(=z&"#6&$&"(H4&77$"% 8(5&&!&49&%#& B4'(&.7G49($2&% &"#
&6&$7&.7&(,)((!,DD#49(:%D&4){7&.7:6&7':9/5#|!4&5%?&4)C#|8$7:u"#$6$#'&78(5&42!""(&&
8775 ,fj>HC@C6q:}o:oH~G@--6q::>@x6r:q:rosG< 2&($413m598756r:q:rvpHrA7&(/!' 6q:}:~&4&(3%
6r::pf0>116~:qrp%& 6r:r:&m@C&"6r:qA$"#76r::}>f1n-
;@D&f.>6r:16>f>pqrHqHpq#0&"%pqrHqvHrv
f&9A$"(pqrHrpHppC';4(6r:r:~C'1@6r:q41C'<u3pqr~HrqHpnBkpqrH~Hpom>k@3;6p:r:p>33nm>06pqr~36rp
m&$&"(f>C6pqr}G4rp13mHC%6pqr}C&/p1mHC@0k@6pqr}G4rpG<6pqrG4r:
>"&7/$4#!"(#$"()$ ('#/$&6&$7&.7()'%)()%$()'.!$(/)((!,DD%$()'.:42DA>jkHrrD@67'($":;)$&4)$6
4"(&$" 76&"(8(5&&"#&"&7/$59875 & '.2#'7=$"47'#$"%4#8($2$"%4!/"'2.%&$"=!$"(2'(&($" &"#
2'(&($" $%"&('=&7H($2($2$"%=&"#67'($"&/7&%'2#7&"&7/$=& 577& 4$!((%"&(()8$%' !"(#:
<2&"'4$!('($7$E$"%4'(2&7%$()2 8(5&()&(&4"(&7(()&4).'("(/(#4$.#$"!'.7$)#7$(&('=8(5&2'(.2&#&6&$7&.7(#$(D6$5:
j ("%7/"4'&%4##!$($"$"&422'"$(/!$(/*:%:k$(l'.+:1()3&('0&4)%'$#7$" 8'.2$(($"%4#? 8(5&88'()$"82&($":
0
G&(&
12345656762589
5653
17425
A7$4/$"82&($"&.'(&6&$7&.$7$(/8#&(&
>772&"'4$!(2'($"47'#&#&(&&6&$7&.$7$(/(&(2"(:;)$ (&(2"()'7#!6$#()8775$"%$"82&($"=5)&!!7$4&.7,
H>44 $"4#='"$x'$#"($8$=5.7$"98!'.7$47/&6&$7&.7#&(&(
H>7$(88$%' ()&()&6& 4$&(#&5#&(&
H>#4$!($"8&"/($4($" "#&(&&6&$7&.$7$(/
jk1 2&($4&"#%27$"6&$&"(4&77=2'(&($"&7$%"&('='.47"&74"('4($"=(&"4$!(&.'"#&"4=!7$44&77&"#()4#&(&%"&(#./()
ukD;k>A&"H4&"4>"&7/$8j)7k"2 "($'2&&6&$7&.78#5"7&#&()((!,DD#44:$4%4:%D7&DA>jk:>##$($"&7$"82&($""
&44 $"%()#&(&=$"47'#$"%&5$7=4&".8'"#&()((!,DD#4:$4%4:%D!4&5%D#&(&D:u"&44#&"45$()()#&(&&44 !7$4$ 8()uk&"#;k>
!`4(=2(274'7&=47$"$4&7&"# !4$2"#&(&&$"&"!"($5)$4)# "(x'$&44 &!!6&7:;&44 !("($&77/$#"($8$4&($"$"82&($"=
'4)& %27$"&777 &"#'"#7/$"% x'"4$"%#&(&=&4)5$77"#(&!!7/(();k>G&(&>44 22$((*G>+6$&#.k&A*)((!,DD
#.%&!:"4.$:"72:"$):%6D&&D5%&:4%${!&%|7%$"+8&44 (();k>!($"8()#&(&(=&"#(()ukG&(&>44 2!7$&"4n88$4*G>n)((!,DD
$4%4:%D#&4+8()uk!($":u"&##$($"=(&44 2&($4$"%7"'47($#6&$&"(#$6#82;k>#"=&4)5$77&7"#(.(&$"#.k&A
&'()$E&($":>77'7(!"(#$"()$ ('#/=$"47'#$"%($2$"%($2&( 84!/"'2.%&$"=&7($2($2&( 8jkG&"#C0>=& 577& 2'(&($"
$%"&('&4($6$($=&&6&$7&.7&()((!,DD555:/"&!:%D1/"&!,/"r~rsvs:
->7$78('#$2'4$(#"$447"()(!'#$"/# $% "
(6"5)"()#$47'$"%&($6:
1&2!7 $E j42!$7#&"$"6"(/82&(4)#('2'D"2&75)74&"4%"2 $"()ukG&(&#$"&($"%"(:C(&2!7 4&282
(&(2"(H"&6=!$2&/4&"4=.'(()5& 2&77"'2.8#"5$()2'7($!7 &2!7 8!$2&/=2(&(&($4&"#D4'"(
('2':n'$"47'$"4$($&5,*$+2&(4)#('2'&"#"2&7!4$2"!&$*$$+&2$"$2&7(847$"$4&78$7#&"#*$$$+
4)&&4($&($"8('2'&"#"2&75)7%"2 '$"%u77'2$"&l$1x!&$#H"# x'"4$"%&#:j4774(#%"2#&(&82
p=ov~#"=!"($"%&77uk&"#;k>#"()&(2(()4$($&&(()($28()8$"&7#&(&8E$"&'('2"pqr~:
G&(&B47'$" >B8((2.
x'&7$(/& '&"4=#&(&82r}#"5B47'##& '"'&.7:0&" 8#&(&B47'$" $"47'##$"&#x'&(46&%=
$& $"46&%&4 ()%"2=6$#"484"(&2$"&($"$" &2!7 &"#B4 $6 x'"4$"%*8B&2!7=()'%)oH
B%'&"$"+:l/!2'(&(#&"# &2!7 5$()"2&74"(&2$"&($"5B47'##84)"7%$4&7$"8"4 $"()$ ('#/=& #4$.#$"
()1'!!72"(&/C()#:
0!7$4&($" u" #(6&7'&(()!82&"48&4)8()2'(&($"H4&77$"%!$!7$" &"##(2$"&"$"(%&($" (&(%/=5!82#&7&%H
4&7#! x'"4$"%6&7$#&($"B!$2"(:j 74(#&!$7((8v!"(&($6('2'D"2&7!&$="5)$4)5&"()()
4!$!7$"=(%()5$()& (8rq&##$($"&72&($46&$&"(H4&77$"%!$!7$" 4"($.'(#./22.8()13m&77$"%j9$"%
k'!:n6&77=() "$($6$(/&"#!4$$"8()4""' 2&($46&$&"(4&775s*usq,ooHso+&"#s*usq,}rHss+
!4($67/813m:<2&($4$"#7="$($6$(/&"#!4$$"5q*v~H}p+&"#sr*}vHs+!4($67/:0%&#$"%1m=5
($2&(() "$($6$(/8()2%$"%&7%$()2(.sq8('4&77%"&(#./&"/"4&77!4$$"5& ($2&(#& s}:H()&(
$=s}:81m $"()2%#1m4&77H()&6&"& 4$&(#4!/"'2.4)&"%.&7&"4#!&("&&"%2"(:
;)&44'&4/8$"8"4 $"()$ ('#/5& & #'$"% $2'7&($" &"#./&!!7/$"%()#$88"(&7%$()2 8()($2$"%84!/
"'2.%&$" *@B("##G&(&<$%'p+=& 577& (5#$88"(&7%$()2 8()(2!&7#$"%8#$62'(&($" *@B("##G&(&
<$%'+:
0&"#2$E&($" 3D>H;)$B!7&(/('#/#$#"(4"(&$"&&"#2$E&($" (!
f7$"#$"% 3D>H;)$B!7&(/('#/#$#"(4"(&$"&.7$"##&"&7/$
C&($&7?B!$2"(&7/(2 C()#
"D& u"676#$
$"(
" ) ('#/ "D& u"676#$
$"(
" ) ('#/
12345656762589
55653
17
t >"($.#$ t )uAHx
t @'9&/($44777$" t <754/(2(/
t A&7&"(7%/ t C0uH.&#"'$2&%$"%
t >"$2&7&"#()%&"$2
t l'2&"&4)!&($4$!&"(
t 7$"$4&7#&(&
l'2&"&4)!&($4$!&"(
7425
A7$4/$"82&($"&.'(('#$ $"676$"%)'2&"&4)!&($4$!&"(
A!'7&($"4)&&4($($4 AG&(2
$"(H./H!&($"(47$"$4&7#&(&&!6$##$
$"@
" B("##G&(&;&.7r
8()2&9!&!8()A>jk4"($'2:
%&!)$4&77/=()4)($"47'##r=~s2&7 *+&"#r=ros82&7 *~+=5$()&2&"&% 8/
&*&"%=rHsq
/&+:
$"%!!'7&($"&"4(/H#$88"($&(# $"%7"'47($#!7/2!)$2 *13A+=()&"4(/#$($.'($"5& )&6$7/
5$%)(#(5&# #"8@'!&"#4"(*}} 8((&7+8775#.
./@
/ &(>$&" *r+=& B!4(#87&%4"($.'($"
82@'!&"=3()>2$4&"&"#>'(&7$&"!`4(:j j4"7$#&(#)$(!&()7%/#4$!($" 8()('2'&2!7=
'$"%()uGHqHv('2'$(4"(77#64&.'7&/:n6&77=()A>jk#&(& (42!$ vo# o $($"4(('2'(/!:j)$7()
2(422"('2'(/! &$"47'##$ $"(
" )#&(&(=()$#$($.'($"# "(2&(4)()7&($6!!'7&($"$"4$#"4=
7&%7/#'(
(#
$88"4 &2"%4"($.'($"%ukD;k>%'! $"" " '2. x'"4#:
04'$(2"( A&($"(54'$(#../(
/)!&($4$!&($"%4"( 8775$"%74&7!(47:
@()$46$%)( ;)@()$46$%)(8()A>jk!(475& '"#(&9".
22$((
./(
/);k>A%&2n88$4&"#()@()$4&"#k6"&"4
8()uk:@&4)$"#$6$#'&7uk&"#;k>!`4(()&(4"($.'(##&(&( (A
>jk)&#()$5"74&7
&&"%2"(8()$46$%)(&"#%'7&(/&7$%"2"(:
3(()&(8'77$"82&($"
"(
" )&!!6&78() ('#/!(472'(&7..! 6$##$ $"(
" )2&"'4$!(: