Professional Documents
Culture Documents
Asynchrony Version 2
Asynchrony Version 2
15 Abstract
16 Seasonal influenza viruses create a persistent global disease burden by evolving to
17 escape immunity induced by prior infections and vaccinations. New antigenic variants
18 have a substantial selective advantage at the population level, but these variants are
19 rarely selected within-host, even in previously immune individuals. We find that the
20 temporal asynchrony between within-host virus exponential growth and
21 antibody-mediated selection can limit within-host antigenic evolution. If selection for
22 new antigenic variants acts principally at the point of initial virus inoculation, where
23 small virus populations encounter well-matched mucosal antibodies in previously
24 infected individuals, there can exist protection against reinfection that does not
25 regularly produce observable new antigenic variants within individual infected hosts.
26 Our results explain how virus antigenic evolution can be highly selective at the global
27 level but nearly neutral within hosts. They also suggest new avenues for improving
28 influenza control.
29 Introduction
30 Antibody-mediated immunity exerts evolutionary selection pressure on the antigenic
31 phenotype of seasonal influenza viruses (Archetti & Horsfall, 1950; Hensley et al., 2009).
32 Influenza virus infections and vaccinations induce neutralizing antibodies that can
33 prevent re-infection with previously encountered virus antigenic variants, but such
34 re-infections nonetheless occur (Clements et al., 1986; Javaid et al., 2020; Memoli et al.,
35 2019). At the human population level, accumulation of antibody-mediated immunity
36 creates selection pressure favoring antigenic novelty. Circulating antigenic variants
24
37 typically go rapidly extinct following the population-level emergence of a new antigenic
38 variant, at least for A/H3N2 viruses (Smith et al., 2004).
39 New antigenic variants like those that result in antigenic cluster transitions (Smith
40 et al., 2004) and warrant updating the composition of seasonal influenza virus vaccines
41 are likely to be produced in every infected host. Seasonal influenza viruses have high
42 polymerase error rates (on the order of 10−5 mutations/nucleotide/replication
43 (Nobusawa & Sato, 2006)), reach large within-host virus population sizes (as many as
44 1010 virions (Perelson et al., 2012)), and can be altered antigenically by single amino
45 acid substitutions in the hemagglutinin (HA) protein (Koel et al., 2013; Linderman
46 et al., 2014).
47 In the absence of antibody-mediated selection pressure, de novo generated antigenic
48 variants should constitute a tiny minority of the total within-host virus population.
49 Such minority variants are unlikely to be transmitted onward or detected with current
50 next-generation sequencing (NGS) methods. But selection pressure imposed by the
51 antibody-mediated immune response in previously exposed individuals could promote
52 these variants to sufficiently high frequencies to make them easily transmissible and
53 NGS detectable. The potential for antibody-mediated antigenic selection can be readily
54 observed in infections of vaccinated mice (Hensley et al., 2009) and in virus passage in
55 eggs in the presence of immune sera (Davis et al., 2018).
56 Surprisingly, new antigenic variants are rarely observed in human seasonal influenza
57 virus infections, even in recently infected or vaccinated hosts (Debbink et al., 2017; Dinis
58 et al., 2016; Han et al., 2019; Javaid et al., 2020; Leonard et al., 2016; McCrone et al.,
59 2018; Valesano et al., 2019) (Fig. 1A,B). These observations contradict existing models
60 of within-host influenza virus evolution (Luo et al., 2012; Volkov et al., 2010), which
61 model strong within-host antibody selection and therefore predict that new antigenic
62 variants will be at consensus or fixation in detectable reinfections of previously immune
63 hosts. This raises a fundamental dilemma. If within-host antibody selection is strong,
64 why do new antigenic variants appear so rarely? If this selection is weak, how can there
65 be protection against reinfection and resulting strong population-level selection?
66 We hypothesized that influenza virus antigenic evolution is limited by the asynchrony
67 between virus diversity and antibody-mediated selection pressure. Antibody immunity
68 at the point of transmission in previously infected or vaccinated individuals should
69 reduce the initial probability of reinfection (Le Sage et al., 2020); secretory IgA
70 antibodies on mucosal surfaces (sIgA) are likely to play a large role (Wang et al., 2017,
71 see Appendix 2). But if viruses are not blocked at the point of transmission and
72 successfully infect host cells, an antibody-mediated recall response takes multiple days
73 to mount (Coro et al., 2006; Lam & Baumgarth, 2019) (see detailed review in the
74 appendix, section 2). Virus titer—and virus shedding (Lau et al., 2010)—may peak
75 before the production of new antibodies has even begun, leaving limited opportunity for
76 within-host immune selection. If antigenic selection pressure is strong at the point of
77 transmission but weak during virus exponential growth, new antigenic variants could
78 spread rapidly at the population level without being readily selected during the
79 timecourse of a single infection. Moreover, prior work has established tight population
80 bottlenecks at the point of influenza transmission (McCrone et al., 2018; Xue & Bloom,
81 2019). With a tight transmission bottleneck and weak selection during virus
82 exponential growth, antigenic diversity generated during any particular infection will
25
83 most likely be lost, slowing the accumulation of population-level antigenic diversity.
84 We used a mathematical model to test the hypothesis that realistically timed
85 antibody-mediated immune dynamics slow within-host antigenic evolution. We found
86 three key results: (1) antibody neutralization at the point of inoculation can protect
87 experienced hosts against reinfection and explain new antigenic variants’ population
88 level selective advantage. (2) If successful reinfection occurs, the delay between virus
89 replication and the mounting of a recall antibody response renders within-host antigenic
90 evolution nearly neutral, even in experienced hosts. (3) It is therefore reasonable that
91 substantial population immunity may need to accumulate before new antigenic variants
92 are likely to observed in large numbers at the population level, whereas effective
93 within-host selection would predict that they should be readily observable even before
94 they proliferate.
95 Model overview
96 Our model reflects the following biological realities: 1. Seasonal influenza virus
97 infections of otherwise healthy individuals typically last 5–7 days (Suess et al., 2012); 2.
98 In influenza virus-naive individuals, it can take up to 7 days for anti-influenza virus
99 antibodies to start being produced (Wrammert et al., 2008), effectively resulting in no
100 selection (Fig. 1A); 3. In previously infected (“experienced”) individuals, sIgA
101 antibodies can neutralize inoculated virions before they can infect host cells; (Wang
102 et al., 2017) 4. However, if an inoculated virion manages to cause an infection in an
103 experienced individual, it takes 2–5 days for the infected host to mount a recall
104 adaptive immune response, including producing new antibodies (Coro et al., 2006;
105 Zuccarino-Catania et al., 2014) (see Appendix Section 2 for further discussion of
106 motivating immunology). Importantly, this contrasts with previous within-host models
107 of virus evolution, which have assumed that antibody-mediated neutralization of virions
108 during virus replication is strong from the point of inoculation onward and is the
109 mechanism of protection against reinfection (Luo et al., 2012; Volkov et al., 2010), and
110 reflects new animal model evidence of sterilizing antibody immunity Le Sage et al.,
111 2020. We discuss existing models and hypotheses for the rarity of population level
112 influenza antigenic variation in section 7 of the Appendix.
26
observed HA variant within-host prob. new variant prob. new variant
1.0
A polymorphism B frequencies C
1.08
at 1% D at consensus
80 non-ant. 25 ridge 100
10 °3
site
10 ° 1
site
ridge
0.8
number of infections
non-ant.
selection strength ±
20
number of variants
6
10 °2
none
60 10°2
10 °1
0.6
10 °3
10 °5
15
4
40 10°4
0.8 10 0.4
20 2
5 0.2 10°6
0 0 0.00
match mismatch none 0.0 0.1 0.2 0.3 0.4 0.5 0.0
0 1 0.2 2 30.4 00.6 1 0.8
2 3 1.0
vaccination status variant frequency time of recall response tM (days)
0.6 no recall constant recall recall response at response at 48h,
E response F response G 48h H variant inoculated
1.0 1.0 1.0 1.0
108
virions, cells
102
0.6 0.6 0.6 0.6
100
0.4 0.4 0.4 0.4
variant frequency
0.2
10°2
°6
100.0 0.0 0.0 0.0
0.0
0 2 0.5
4 6 0.2
1.0
8 0.0
0 2 0.5
4 0.4 6 1.0
8 0.0
0 2 0.6 0.5
4 6 1.0
8 0.0
0.8
0 2 0.5
4 6 1.0
8
time (days)
target new variant NGS detection influenza-like
cells virus limit parameters
old variant recall response analytical new
virus active variant frequency
27
113 We constructed a model of within-host influenza virus evolution that can be
114 parameterized to reflect different hypothesized immune mechanisms, different host
115 immune statuses, and different durations of infection. In the model, virions Vi of
116 antigenic type i infect target cells C, replicate, possibly mutate to a new antigenic type
117 j at a rate µij , and decay at a rate dv . We model the innate immune response implicitly
118 as depletion of infectible cells. We model the antibody-mediated immune response as an
119 increase k in the virion decay rate in the presence of well-matched antibodies. To model
120 partial antibody cross reactivity, we scale k by a parameter ci ∈ [0, 1]; ci reflects the
121 binding strength of the host’s best-matched antibodies to antigenic type i. So in the
122 presence of an antibody-mediated response, virions Vi of type i decay at a rate dv + ci k.
123 To assess the importance of transmission bottlenecks, initial virus diversity, and sIgA
124 antibody neutralization in virus evolution, we modeled the point of transmission as a
125 random sample of within-host virus diversity from a transmitting host, potentially
126 thinned at the point of transmission by antibodies in the recipient host. Virions then
127 compete to found the infection. Inoculated virions are Poisson-distributed with a mean
128 v, so if variant i has within-host frequency fi , the number of variant i virions inoculated
129 is Poisson-distributed with mean vfi .
130 The virions then encounter antibodies, which we interpret as sIgA but can be
131 understood to be any antigen-specific antibody-mediated protection that precedes cell
132 infection; each virion of variant i is independently neutralized with a probability κi .
133 This probability depends upon the strength of protection against homotypic reinfection
134 κ and the sIgA cross immunity between variants σij , (0 ≤ σij ≤ 1). So if a host with
135 antibodies to variant j is challenged with variant i, those virions will be neutralized
136 with a probability κi = κσij . For simplicity, we assume the same homotypic protection
137 level across all variants and hosts, though in practice there may be variation in the
138 immunogenicity of individual variants and in the strength of responses generated by
139 individual hosts.
140 The model is continuous-time and stochastic: cell infection, virion production, virus
141 mutation, and virion decay are stochastic events that occur at rates governed by the
142 current state of the system, with exponentially distributed (memoryless) waiting times.
143 The system is approximately well-mixed: we track counts of virions and cells without
144 an explicit account of space within the upper respiratory tract. We treat infected and
145 dead or removed cells implicitly.
146 Parameterized as in Table 1, the model captures key features of influenza infections:
147 rapid peaks approximately 2 days post infection and slower declines with clearance
148 approximately a week post infection, with faster clearance in experienced hosts.
149 We give a full mathematical description of the model in the Methods.
150 We then explored the between-host and population-level implications of our within-host
151 dynamics using a simple transmission chain model and an SIR-like population-level
152 model, which we describe in the Methods.
28
153 Results
154 Realistically-timed immune kinetics limit otherwise rapid
155 adaptation during exponential growth
156 In our model, sufficiently strong antibody neutralization during the virus exponential
157 growth period can potentially stop replication and block onward transmission, but this
158 mechanism of protection results in detectable new antigenic variants in each homotypic
159 reinfection, since the infection is terminated rapidly unless it generates a new antigenic
160 variant that substantially escapes antibody neutralization (Fig. 2).
161 If there is antibody neutralization throughout virus exponential growth and it is not
162 sufficiently strong to control the infection, this facilitates the establishment of new
163 antigenic variants: variants can be generated de novo and then selected to detectable
164 and easily transmissible frequencies (Fig. 1C,D,F, sensitivity analysis in Fig. A3). We
165 term selection on a replicating within-host virus population “replication selection”.
166 Virus phenotypes that directly affect fitness independent of immune system interactions
167 are likely to be subject to replication selection.
168 Adding a realistic delay to antibody production of two days post-infection (Lam &
169 Baumgarth, 2019) curtails antigenic replication selection. There is no focused
170 antibody-mediated response during the virus exponential growth phase, and so the
171 infection is dominated by old antigenic variant virus (Fig. 1C,D,G, sensitivity analysis
172 in Fig. A3). Antigenic variant viruses begin to be selected to higher frequencies late in
173 infection, once a memory response produces high concentrations of cross-reactive
174 antibodies. But by the time this happens in typical infections, both new antigenic
175 variant and old antigenic variant populations have peaked and begun to decline due to
176 innate immunity and depletion of infectible cells, so new antigenic variants remain too
177 rare to be detectable with NGS (Fig. 1G).
178 We find that replication selection of antigenic novelty to detectable levels becomes likely
179 only if infections are prolonged, and virus antigenic diversity and antibody selection
180 pressure therefore coincide (Fig. 1G, see also Fig. 7). This can explain existing
181 observations: within-host adaptive antigenic evolution can be seen in prolonged
182 infections of immune-compromised hosts (Xue et al., 2017), and prolonged influenza
183 infections show large within-host effective population sizes (Lumby et al., 2020).
29
195 thus a population level selective advantage for new antigenic variants—without
196 observable within-host antigenic selection in typical infections of experienced hosts.
30
239 infections their transmission advantage (population-level selection) but may also
240 increase the rate at which these new antigenic variant infections arise (inoculation
241 selection).
242 There is some suggestive evidence of differential survival of particular (not necessarily
243 antigenic) influenza genetic variants at the point of transmission from experiments in
244 ferrets (Moncla et al., 2016; Wilker et al., 2013). But as Lumby et al., 2018 point out in
245 a reanalysis of those experiments, it difficult empirically to distinguish selection that
246 occurs at the point of transmission from selection that occurs during replication in the
247 donor host because of the challenges associated with sampling the small virus
248 populations present at the earliest stages of infection. Here we define inoculation
249 selection as selection on the bottlenecked virus population that establishes infection in
250 the recipient host before any virus replication has taken place in that host.
31
281 on four quantities: 1. δτ , the product of the replication fitness difference δ = k(cw − cm )
282 and the time under replication selection τ . This determines the degree to which the new
283 variant is promoted by replication selection prior to transmission (increasing fmt ). 2.
284 κw , the neutralization probability for the old variant, which must be large enough to
285 reduce competition for the final bottleneck. 3. v/b, the ratio of the sIgA bottleneck size
286 v to the cell infection (final) bottleneck size b. This determines the degree of
287 competition to found the infection, and thus sets maximum potential strength of
288 inoculation selection when κw is large: a vb -fold improvement over drift for small b. 4.
289 1 − κm , how likely the new variant is to avoid neutralization at the point of
290 transmission; this scales down the maximum replication selective strength set by v/b (to
291 zero if the new variant is neutralized with certainty; κm = 1). When δτ is small and κw ,
292 v/b, and 1 − κm are large, new variant survival is most facilitated by inoculation
293 selection. When the opposite is true, replication selection is most important. And there
294 are parameter regimes in which both replication and and inoculation selection provide a
295 substantial improvement over drift (Fig. 4, see Appendix section 4.6 for mathematical
296 intuition for these results).
297 At realistic parameter values and assuming all individuals develop well-matched
298 antibodies to previously encountered antigenic variants, only ∼1 to 2 in 104 inoculations
299 of an experienced host results in a new antigenic variant surviving the bottleneck
300 (Fig. 3E,H, Fig. 4). This rate is likely to be an overestimate due to the factors
301 mentioned above. Moreover, it is only about 2- to 3-fold higher than the rate of
302 bottleneck survival in naive hosts, where new antigenic variant infections should
303 occasionally occur via neutral stochastic founder effects. In short, even in the presence
304 of experienced hosts, antigenic selection is inefficient and most generated antigenic
305 diversity is lost at the point of transmission. Because of these inefficiencies, new
306 antigenic variants can be generated in every infected host without producing explosive
307 antigenic diversification at the population level.
32
A no detectable infection Bwithdetectable infection Cwithdetectable infection D distribution of outcomes
de novo new variant inoculated new variant
1.0
5
10
109
107
virions, cells
no detectable
0.8 104 infection
105 detectable infection with
de novo new variant
102
100
0.4
101
10°2
frequency
0.2
10°4
100
0
°6
100.0
0.0
0 2 4 60.2 8 0.2 0 0.4 2 4 6 0.6 8 0.4 0 0.82 4 6 1.0
8 0.6 1 3 10
0.8 50 100 2001.0
time (days) bottleneck
33
A
0.8
proportion of inocula
0.75
0.6
0.50
0.4
neither
0.25 old variant
0.2 new variant
both
0.00
0.0
0.0 1 3 10 50 1000.2200 1
0.4 3 10 50 100 200
0.6 1 0.8 3 10 50 100 200 1.0
virions encountering IgA
(v)
E F G H
£10°4 £10°4
prob. new variant survives bottleneck
∑w
10°2 1.0 ∑m
2.5
8 0.25 implied
by zm
0.75 0.75 ∑w
per inoculation
susceptibility
6
new variant
1
0.6 1.5
10°4
4
0.4 1.0
b = 10, ∑m = 0.95
10°5
b = 10, ∑m = 0.99
2 0.2 0.5
b = 1, ∑m = 0.95 multiplicative
b = 1, ∑m = 0.99 sigmoid
10°6
0 0.0 0.0
0.0 0.5 1.0 1 100 200 0 1 2 3 4 5 0 1 2 3 4 5
mutant neutralization ratio of mucus bottleneck distance between distance between
prob. (∑m) to final bottleneck (v/b) host memory and old variant host memory and old variant
34
∑m = 0 ∑m = 0.5 ∑m = 0.99
1.00
10
drift
inoc
repl
both
10°2
v = 10
0.8
probability of new variant transmission pnv
10°4
100
0.6
v = 100
10°2
°4
100.4
100
v = 1000
100.2
°2
10°4
0.0
0.0
0 1 0.2
2 3 0 0.4 1 2 0.6 3 0 0.8
1 2 1.0
3
35
A naive population B immediate recall response C mucosal antibodies and D mucosal antibodies and
immediate recall response realstic (48h) recall response
0.30
0.25
0.20
frequency
0.15
0.10
0.05
0.00
E F G H
0.05
probability density
0.04
0.03
0.02
0.01
0.00
°0.2 0.0 0.2 °0.2 0.0 0.2 °0.2 0.0 0.2 °0.2 0.0 0.2
antigenic change antigenic change antigenic change antigenic change
36
308 Inoculation selection produces realistically noisy between-host
309 evolution
310 To investigate the between-host consequences of adaptation given weak replication
311 selection, tight bottlenecks, and possible inoculation selection, we simulated
312 transmission chains according to our within-host model, allowing the virus to evolve in
313 a 1-dimensional antigenic space (Bedford et al., 2012; Smith et al., 2004) until a
314 generated antigenic mutant becomes the majority within-host variant. When all hosts
315 in a model transmission chain are naive, antigenic evolution is non-directional and
316 recapitulates the distribution of within-host mutations (Fig. 5A). When antigenic
317 selection is constant throughout infection and even a moderate fraction of hosts are
318 experienced, antigenic evolution is unrealistically strong: the virus evolves directly away
319 from existing immunity and large-effect antigenic changes are observed frequently
320 (Fig. 5B,C). When the model incorporates both mucosal antibodies and realistically
321 timed recall responses, major antigenic variants appear only rarely and the overall
322 distribution of emerged variants better mimics empirical observations (Fig. 5D); most
323 notably, the phenomenon of quasi-neutral diversification within an antigenic cluster
324 seen in Figures 1 and 2 of Smith et al., 2004. A simple analytical model (see Methods)
325 that assumes generated antigenic mutants fix according to their replication and
326 inoculation selective advantages also displays this behavior (Fig. 5B–H).
A B C
£10°3 £10°4
3.0
∑m 0.16 R0 R0
1.0 0.75 1.5 1.5
2.5
0.14 2 2
0.9
2.5 2.5
per capita reinoculations
0.8 0.99
new variant infections
0.12
per capita probability
2.0
per inoculation
0.10
0.6 1.5
0.08
0.04
0.2 0.5
0.02
37
327 Epidemic dynamics can alter rates of inoculation selection
328 We used an epidemic-level model to study the consequences of individual-level
329 inoculation selection for population-level antigenic selection. If inoculation selection is
330 efficient, an intermediate initial fraction of immune hosts maximizes the probability
331 that a new antigenic variant infection is created during an epidemic (Fig. 6.) This is
332 due to a trade-off between the frequency of naive or weakly immune “generator” hosts
333 who can propagate the epidemic and produce new antigenic variants through de novo
334 mutation, and the frequency of strongly immune “selector” hosts who, if inoculated, are
335 unlikely to be infected, but can facilitate the survival of these new antigenic variants at
336 the point of transmission. As selector host frequency increases, epidemics become rarer
337 and smaller, eventually decreasing opportunities for evolution, but moderate numbers of
338 efficient selectors can substantially increase the rate at which new antigenic variants
339 reach within-host consensus (Fig. 6B,C).
340 Discussion
341 Any explanation of influenza virus antigenic evolution—and why it is not even
342 faster—must explain why population level antigenic selection is strong, as evidenced by
343 the typically rapid sequential population-level replacement of old antigenic variants
344 upon the emergence of a new major antigenic variant, but within-host antigenic
345 evolution is difficult to observe.
346 We hypothesized that antibodies present in the mucosa at the time of virus exposure
347 can effectively block transmission, but have only a small effect on viral replication once
348 cells become productively infected. Antigenic selection after successful infection
349 therefore begins with the mounting of a recall response 48–72 hours post infection. In
350 this case, selection pressure can be strong at the point of transmission, but subsequently
351 weak until after the period of virus exponential growth. This mechanistic paradigm
352 reconciles strong but not perfect sterilizing homotypic immunity with rare observations
353 of mutants in successfully reinfected experienced hosts.
38
369 model-based comparison between the role of selection at the point of transmission
370 versus selection during replication in the promotion of antigenic mutants, to show that
371 immunologically plausible mechanisms could make the former more salient than the
372 latter, and to connect that finding to the rarity of observable new antigenic variants in
373 homotypically reinfected human hosts. We discuss the relationship between this
374 paradigm and those put forward in previous modeling studies at further length in
375 section 7.5 of the Appendix.
39
411 Point of transmission
412 A key proposal of our study is that population-level antigenic selection and homotypic
413 protection are mediated by antibody neutralization (likely sIgA) at the point of
414 transmission. Currently, empirical evidence for antibody protection at the point of
415 transmission is mostly indirect. Most of this evidence comes from human and animal
416 challenge studies (Clements et al., 1986; Le Sage et al., 2020; Memoli et al., 2019). In
417 these studies, individuals who are challenged with the same antigenic variant sometimes
418 appear to have sterilizing immunity, but failing that develop detectable infections. The
419 study by Memoli et al., 2019 is notable for having used likely artificially large
420 inocula–106 or 107 TCID50 . Despite this, two of the challenge subjects had neither
421 detectable virus nor seroconversion. Similarly, ferret experiments Le Sage et al., 2020
422 have shown that many ferrets develop sufficiently sterilizing immunity to prevent the
423 virus from ever being detected, while some ferrets showed briefly detectable infections
424 that were then cleared. While we cannot rule out a powerful immediate cellular
425 response that was differentially evaded in the various subjects, we believe that our
426 model, coupled with existing understanding of the timing of cellular responses and the
427 speed of influenza virus replication, provides a more parsimonious explanation.
428 Another limitation of our study is that, while we put forward mucosal sIgA as a
429 biologically documented potential mechanism of immune protection at the point of
430 inoculation that would not lead to strong selection during early viral replication, no
431 modeling study can establish such a mechanism without empirical investigation. Our
432 study reveals that such an empirical investigation would be of substantial scientific
433 value.
434 We model neutralization at the point of transmission as a binomial process. Each virion
435 is independently neutralized with a probability κi that depends on its antigenic
436 phenotype and the host’s immune history. As we discuss appendix section 5.2, this
437 assumption of independence may be violated in practice. Careful experiments are
438 required to develop a more realistic model of neutralization at the point of transmission.
439 Moreover, individual variation in immune system properties and complex effects of host
440 immune history Fonville et al., 2014; Lee et al., 2019 mean that even a pair of hosts who
441 have both been previously exposed to the currently circulating variant may exert
442 different selection pressures at the point of transmission. Modeling neutralization as a
443 series of independent events that depend only on host history and virus phenotype is a
444 baseline: it allows us to establish that antigenic selection at the point of transmission is
445 possible and show what its consequences might be. But a more realistic model will be
446 required to predict the selective pressures imposed by real hosts with real immune
447 histories on real virions.
448 Finally, while we believe neutralization at the point of transmission is a crucial
449 mechanism of protection against detectable reinfection for influenza, this may not be
450 true for all RNA viruses. Some, such as measles and varicella, have long incubation
451 periods even in naive hosts and induce reliable, long-lasting immune protection against
452 detectable reinfection. This may be because they replicate slowly enough that they
453 cannot “outrun” the adaptive response as influenza can. Neither shows influenza
454 virus-like patterns of clocklike immune escape, suggesting that (1) escape mutants may
455 be less available and (2) that adaptive response acts on a small population and is
40
456 forceful.
494 Implications
495 Our study has a number of implications for the study and control of influenza viruses.
41
496 Importance of host heterogeneity
497 Experienced hosts are undoubtedly heterogeneous in their immunity to a given
498 influenza variant (Lee et al., 2019), so the overall population average protection against
499 homotypic reinfection with variant i, zi , is in fact an average over experienced hosts.
500 Our model implies that the degree of neutralization difference between ancestral variant
501 virions and new antigenic variant virions at the point of transmission strongly affects the
502 probability of inoculation selection. Hosts with more focused immune responses—highly
503 specific antibodies that neutralize old antigenic variant virions well and new antigenic
504 variant virions poorly—could be especially good inoculation selectors and important
505 sources of population-level antigenic selection. Hosts who develop less specific memory
506 responses, such as very young children (Neuzil et al., 2006), could be less important.
507 Similarly, immune-compromised hosts are excellent replication selectors (Lumby et al.,
508 2020; Xue et al., 2017), and so their role in the generation of antigenic novelty and their
509 impact on overall population level diversification rates both deserve further study.
42
541 within-host scale, it would be surprising to observe “backward” antigenic changes and
542 noisy evolution at higher scales (Fig. 5A–D). In fact, there may be analogous
543 “neutralizing” population dynamics at higher scales as well, and those may also be
544 necessary for explaining population level noisiness. But whatever happens at higher
545 scales, within-host replication selection would create a strongly directional bias in what
546 population-level diversity is introduced (Fig. 5A–D), and introduces many small
547 forward antigenic changes. Inoculation selection does not necessarily do this.
43
584 to this argument. A/H3N2 viruses accumulate genetic mutations at a similar pace in
585 swine and humans, but antigenic evolution is much slower in swine (De Jong et al.,
586 2007; Lewis et al., 2016). Slaughter for meat means that pig population turnover is
587 high. It follows that the frequency of experienced hosts rarely becomes sufficient to
588 facilitate the appearance of observable new antigenic variants.
589 That antigenic emergence tracks immunity accumulation rather than diversity
590 accumulation suggests that A/H3N2 evolution is indeed selection-limited, not
591 diversity-limited. And yet due to tight transmission bottlenecks and weak replication
592 selection, this selection limitation renders much generated antigenic diversity invisible
593 to surveillance.
604 Conclusion
605 The asynchrony between within-host virus diversity and antigenic selection pressure
606 provides a simple mechanistic explanation for the phenomenon of weak within-host
607 selection but strong population-level selection in seasonal influenza virus antigenic
608 evolution. Measuring or even observing antibody selection in natural influenza
609 infections is likely to be difficult because it is inefficient and consequently rare.
610 Theoretical studies are therefore essential for understanding these phenomena and for
611 determining which measurable quantities will facilitate influenza virus control. Our
612 study highlights a critical need for new insights into sIgA neutralization and IgA
613 responses to natural influenza virus infection and vaccination and shows that cross-scale
614 dynamics can decouple selection and diversity, introducing randomness into otherwise
615 strongly adaptive evolution.
616 Methods
617 Model notation
618 In all model descriptions, X += y and X −= y denote incrementing and decrementing
619 the state variable X by the quantity y, respectively, and X = y denotes setting the
620 variable X to the value y. Ẋ denotes the rate of event X.
44
621 Within-host model overview
622 The within-host model is a target cell-limited model of within-host influenza virus
623 infection with three classes of state variables:
624 • C: Target Cells available for the virus to infect. Shared across all virus variants.
625 • Vi : Virions of virus antigenic variant i
626 • Ei : Binary variable indicating whether the host has previously Experienced
627 infection with antigenic variant i (Ei = 1 for experienced individuals; Ei = 0 for
628 naive individuals).
629 New virions are produced through infections of target cells by existing virions, at a rate
630 βCVi . Infection eventually renders a cell unproductive, so target cells decline at a rate
631 f βC V̄ , where V̄ is the total number of virions of all variants. The model allows
632 mutation: a virion of antigenic variant i has some probability µij of producing a virion
633 of antigenic variant j when it reproduces.
634 Virions have a natural per-capita decay rate dv . A fully active specific
635 antibody-mediated immune response to variant i increases the virion per-capita decay
636 rate for variant i by a factor k (assumed equal for all variants). The degree of activation
637 of the antibody response during an infection is given by a function M (t), where t is the
638 time since inoculation.
639 We use a parameter cij to denote the protective strength of antibodies raised against a
640 strain j against a different strain i. cii = 1 by definition and cij = 0 indicates complete
641 absence of cross-protection. So if host has antibodies against a strain j but not against
642 the infecting strain i, a fully active antibody response raises the virion decay rate for
643 strain i by cij k. If there are multiple candidate forms of cross protection cij and cik , we
644 choose the strongest. We typically assume that cij = cji .
645 We assume that an antibody immune response is raised whenever the host has
646 experienced a prior infection with a partially cross-reactive strain. For notational ease,
647 we define the host’s strongest cross-reactivity against strain i, ci by:
648 So an antibody response is raised during an infection with strains i, j, ... whenever one
649 of ci , cj , ... > 0.
650 For this study, we consider a two-antigenic variant model with an ancestral “old
651 antigenic variant” virions Vw and novel “new antigenic variant” virions Vm , though the
652 model generalizes to more than two variants.
C+ : C += 1
C− : C −= 1
Vw+ : Vw += 1
(2)
Vw− : Vw −= 1
Vm+ : Vm += 1
Vm− : Vm −= 1
45
653 The events occur at the following rates:
Ċ − = f βC(Vw + Vm )
! "
Ċ + = pC C 1 − C/Cmax
V̇w+ = βCVw rw (1 − µ)
# $
V̇w− = Vw dv + cw kM (t) (3)
# $
V̇m+ = βC Vm rm + Vw rw µ
# $
−
V̇m = Vm dv + cm kM (t)
654 where M (t) is a minimal model of a time-varying antibody response given by:
%
1, t > tM
M (t) = (4)
0, otherwise
655 For simplicity, the equations are symmetric between old antigenic variant and new
656 antigenic variant viruses, except that we neglect back mutation, which is expected to be
657 rare during a single infection, particularly before mutants achieve large populations.
658 The parameters rw and rm allow the two variants optionally to have distinct within-host
659 replication fitnesses; for all results shown, we assumed no replication fitness difference
660 (rw = rm = r), unless otherwise stated. A de novo antibody response raised to a
661 not-previously-encountered variant i can be modeled by setting Ei = 1 at a time
662 tiN ≥ tM post-infection.
663 We characterize a virus variant i by its within-host basic reproduction number Ri0 , the
664 mean number of new virions produced before decay by a single virion at the start of
665 infection in a naive host:
βCmax ri
Ri0 ≡ (5)
dv
666 When parametrizing our model, we fixed the within-host basic reproduction number
667 Ri0 , the initial target cell population Cmax , the virus reproduction rate ri , and the
668 shared virion decay rate dv . We then calculated the implied β according to (5).
669 Another useful quantity is the within-host effective reproduction number Ri (t) of
670 variant i at time t: the mean number of new virions produced before decay by a single
671 virion of variant i at a given time t post-infection.
βC(t)ri
Ri (t) ≡ (6)
dv + ci kM (t)
672 Note that Ri0 is Ri (0) in a naive host, and that if Ri < 1, the virus population will
673 usually decline.
674 The distribution of virions that encounter sIgA antibody neutralization depends on the
675 mean mucosal bottleneck size v (i.e. the mean number of virions that would pass
46
676 through the mucosa in the absence of antibodies) and on the frequency of new antigenic
677 variant fmt = fm (tinoc ) in the donor host at the time of inoculation tinoc . nw old
678 antigenic variant virions and nm new antigenic variant virions encounter sIgA. The total
679 number of virions ntot = nw + nm is Poisson distributed with mean v and each virion is
680 independently a new antigenic variant with probability fmt and otherwise an old
681 antigenic variant. The principle of Poisson thinning then implies:
# $
nm ∼ Poisson vfmt
# $ (7)
nw ∼ Poisson v(1 − fmt )
682 Note that since fmt is typically small the results should also hold for a binomial model of
683 nw and nm with a fixed total number of virions encountering sIgA antibodies: vtot = v.
684 We then model the sIgA bottleneck—neutralization of virions by mucosal sIgA
685 antibodies. Each virion of variant i is independently neutralized with a probability κi .
686 This probability depends upon the strength of protection against homotypic reinfection
687 κ and the sIgA cross immunity between variants σ, (0 ≤ σ ≤ 1):
κw = κ max{Ew , σEm }
(8)
κm = κ max{Em , σEw }
688 Since each virion of strain i in the inoculum is independently neutralized with
689 probability κi , then given nw and nm , the populations that compete the pass through
690 the cell infection bottleneck xw and xm are binomially distributed:
xw ∼ Binomial(nw , κw )
(9)
xm ∼ Binomial(nm , κm )
691 At this point, the remaining virions are sampled without replacement to determine
692 what passes through the cell infection bottleneck, b, with all virions passing through if
693 xw + xm ≤ b, so the final founding population is hypergeometrically distributed given
694 xw and xm .
695 If κw = κm = 0, fmt is small, and v is large, this is approximated by a binomially
696 distributed founding population of size b, in which each virion is independently a new
697 antigenic variant with (low) probability fmt and is otherwise an old antigenic variant, or
698 by a Poisson distribution with a small mean of fmt b ( 1 new antigenic variants:
47
699 When there is mucosal antibody neutralization, the variant’s survival probability can be
700 reduced below this (inoculation pruning) or promoted above it (inoculation promotion),
701 depending upon parameters. There can be inoculation pruning even when the variant is
702 more fit (neutralized with lower probability, positive inoculation selection) than the old
703 antigenic variant (see Fig. 3E).
48
Table 1: Model parameters, default values, and sources/justifications
49
707 Selection and drift within hosts
708 In this section, we derive analytical expressions for the within-host frequency of the new
709 variant over time in an infected host (Fig. 7, the probability distribution of the time of
710 first de novo mutation to produce a surviving new variant lineage, and the approximate
711 probability of replication selection to frequency x by time t given our parameters.
dfm
= fm (1 − fm )δ(t) (12)
dt
715 where δ(t) is the fitness advantage of the new antigenic variant over the old antigenic
716 variant at time t (see appendix section 3 for a derivation).
717 If the variant is neutral in the absence of antibodies, then δ(t) = k(cw − cm ) if t > tM
718 and δ(t) = 0 otherwise. If the new antigenic variant has emerged by tM , then at a time
719 t > tM :
eδ(t−tM )
fm (t) = −1 (13)
eδ(t−tM ) + fM −1
dfm 2
= fm (1 − fm )δ(t) + µR0 dv (1 − 2fm + fm ) (14)
dt
723 which for δ *= 0 yields:
A exp(δt) + µg0
fm (t) ≈
A exp(δt) + µg0 − δ
(15)
f0 µg0
A = (µg0 − δ) −
1 − f0 f0
1
+ µg0 t − 1
(16)
1−f0
fm (t) ≈ 1
1−f0
+ µg0 t
50
726 Distribution of first mutation times
727 In our stochastic model, new variant lineages that survive stochastic extinction are
728 produced by de novo mutation according to a continuous-time, variable-rate Poisson
729 process. The cumulative distribution function for the time of the first successful
730 mutation, te , depends on the mutation rate µ and the per-capita rate at which old
731 antigenic variant virions are produced, gw (t) = rw βC(t). It also depends on psse , the
732 probability that the generated new antigenic variant survives stochastic extinction.
733 Denoting the new variant per-capita virion production rate gm (t) = rm βC(t), we
734 calculate psse using a branching process approximation (Ball et al., 2016).
gm (t)
psse = (17)
gm (t) + dv + kM (t) max{Em , cEw }
737 It follows that the CDF of surviving new variant mutation times is:
738 This expression is exact for any given realization of the stochastic model if the realized
739 values of the random variables Vw (t), C(t), and gw (t) are used. In practice, we mainly
740 use it to get a closed form for the approximate new variant CDF by making the
741 approximations that C(t) ≈ Cmax early in infection. This yields approximations for
742 gw (t), gm (t), and Vw (t):
! "
b exp (g − d )t t ≤ tM
0 v
Vw (t) ≈ (22)
! " ! "
b exp (g0 − dv )tM exp (g0 − dv − cw k)(t − tM ) t > tM
743 The resultant approximate solution for the CDF of new variant mutation times agrees
744 well with simulations (Fig. 8).
745 The slightly earlier simulated mutation times in the immediate recall response case
746 (Fig. 8A) would only make replication selection more likely in that case than our
747 analytical approximation suggests.
51
748 Required mutation time for variant to reach a given frequency
749 By inverting the within-host replicator equation, we can also calculate the time t∗ (x, t)
750 by which a new variant must emerge if it is to reach at least frequency x by time t. We
751 show (see appendix section 9.4 for derivation) that there are two candidate values for t∗ ,
752 depending on whether the time that the new variant first emerges (te ) is before or after
753 the onset of the antibody response (tM ):
754 If te ≤ tM :
+ ! " ,
1−x
ln x
exp δ(t − tM ) + 1 − ln b
t∗− (x, t) = (23)
g0 − dv
755 If te > tM :
! 1−x "
ln − ln(b) + δt − cw ktM
t∗+ (x, t) ≈ x
(24)
g 0 − d v − cw k + δ
756 It may be that t∗+ (x, t) > t and t∗− (x, t) > t. This indicates that the mutant will be at
757 frequency x if it emerges at t itself. In that case, we therefore have t∗ (x, t) = t. So
758 combining:
t− (x, t) t− (x, t) < t and t− (x, t) ≤ tM
∗ ∗ ∗
t∗ (x, t) = t∗+ (x, t) t∗+ (x, t) < t and t∗− (x, t) > tM (25)
t otherwise
759 Finally, it is worth noting that in the case of a complete escape mutant (cm = 0,
760 δ = cw k), the approximate expression for t∗+ is exactly equal to the equivalent
761 approximate expression for t∗− :
! 1−x "
ln − ln(b) + cw k(t − tM )
∗
t (x, t) ≈ x
(26)
g0 − dv
(27)
∗ (a,t))
prepl (a, t) = P (te < t∗ (a, t)) = 1 − e−Λ(t
767 This analytical model agrees well with simulations (Fig. 9). We use it used to calculate
768 the heatmaps shown in Fig. 1, with the C(t) ≈ Cmax early infection approximations that
769 give us a closed form for Λm (x).
52
- t∗
770 When t∗ > tM , the integral 0 λm (x)dx can be evaluated piecewise, first from 0 to tM ,
771 and then from tM to t∗ . This can also be used to find evaluate the probability density
772 function (PDF) of first mutation times, as needed.
773 Finally, note that for te < tM , the expression for prepl in practice depends only on
774 δ = (cw − cm )k, not on cw , cm and k separately. In the absence of an antibody response,
775 the mutant is generated with near certainty by 1 day post-infection (P (te < 1) ≈ 1,
776 Fig. 8B). So when tM ≥ 1, values for prepl calculated with cw = 1 cm = 0, and δ = k—as
777 in (Fig. 1C,D)—will in fact hold for any cw , cm , and k that produce that fitness
778 difference δ.
779 When Rw 0 < 1 early in infection, the probability of replication selection depends on the
780 probability of generating an escape mutant before the infection is extinguished:
Rw (0)
prepl ≈ bµpsse (28)
1 − Rw (0)
789 The parameter V50 sets the scaling, and reflects virus population size at which there is a
790 50% chance of successful transmission to a naive host. We used a default value of
791 V50 = 108 virions. In addition to this probabilistic model, we also consider an alternative
792 threshold model in which a transmission attempt occurs with certainty if Vtot is greater
793 than a threshold θ and does not occur otherwise.
795 Summing pcib (xw , xm ) over the possible values of xw and xm weighted by their joint
796 probabilities yields the variant’s overall probability of successful onward transmission
53
797 psurv (fmt , v, b, κ, c):
0
psurv = P (xm , xw )pcib (xw , xm , b) (31)
xm ,xw
798 P (xw , xm ) is the product of the probability mass functions for xw and xm :
804 If fmt is small, there will almost always be only one such virion (xm = 1). That new
805 antigenic variant’s probability of surviving the cell infection bottleneck depends upon
806 how many old antigenic variant virions are present after neutralization, xw :
! xw "
xw − b + 1 b
pcib (xw , 1, b) = 1 − !xwb+1" = 1 − = (34)
b
xw + 1 xw + 1
807 Summing over the possible xw , we obtain a closed form for the unconditional
808 probability pcib (κw , v, b) (see section 9.5 for a derivation):
b 0 w̄j #
b−1
b $
pcib (κw , v, b) = (1 − e −w̄
) + e−w̄ 1− (35)
w̄ j=0
j! j+1
809 where w̄ = v(1 − fmt )(1 − κw ) is the mean number of old antigenic variant virions
810 present after sIgA neutralization. If b = 1, this reduces the just the first term, and it is
811 approximately equal to just the first term when w̄ is large, since e−w̄ becomes small.
812 It follows that there is an approximate closed form for the probability that a variant
813 survives the final bottleneck:
814 We use this expression to calculate the analytical variant survival probabilities shown in
815 the main text, and to gain conceptual insight into the strength of inoculation selection
816 relative to neutral processes (see appendix section 4.5).
54
817 Neutralization probability and probability of no infection
818 A given per-virion sIgA neutralization probability κi implies a probability zi that a
819 transmission event involving only virions of variant i fails to result in an infected cell.
820 Some transmissions fail even without sIgA neutralization; this occurs with probability
821 exp(−v). Otherwise, with probability 1 − exp(−v), ni virions must be neutralized by
822 sIgA to prevent an infection. We define zi as the probability of no infection given
823 inoculated virions in need of neutralization (i.e. given ni > 0).
824 The probability that!there are no" remaining virions of variant i after mucosal
825 neutralization is exp −v(1 − κi ) . So we have:
! "
exp −v(1 − κi ) − exp(−v)
zi = (37)
1 − exp(−v)
826 This can be solved algebraically for κi in terms of zi , but it is more illuminating first to
827 find the overall probability of no infection given inoculation pno and then solve:
! "
pno = exp −v(1 − κi )
ln(pno ) (38)
κi = 1 +
v
828 An infection occurs with probability (1 − exp(−v))(1 − zi ) (the probability of at least
829 one virion needing to be neutralized times the conditional probability that it is not) so:
830 This yields the same expression for κi in terms of zi as a direct algebraic solution of 37.
831 For moderate to large v, 1 − exp(−v) approaches 1 so pno approaches zi and κi
832 approaches 1 + ln(zv
i)
. This reflects the fact that for even moderately large v, it is almost
833 always the case that ni > 0: a least one virion must be neutralized to prevent infection.
834 In those cases, zi can be interpreted as the (approximate) probability of no infection
835 given a transmission event (i.e. as pno ).
836 Note also that seeded infections can also go stochastically extinct; this occurs with
837 approximate probability R10 . At the start of infection, if there is no antibody response,
838 R0 is large (5 to 10) stochastic extinction probabilities should be low ( 15 to 10
1
), and
839 equal in immune and naive hosts. We have therefore parametrized our model in terms
840 of zi , the probability that no cell is ever infected, as that probability determines to
841 leading order the frequency with which immunity protects against detectable reinfection
842 given challenge.
55
848 results for two candidate models: a multiplicative model used in a prior modeling and
849 empirical studies of influenza evolution (Asaduzzaman et al., 2018; Boni et al., 2006),
850 and a sigmoid model as parametrized from data on empirical HI titer and protection
851 against infection (Coudeville et al., 2010).
852 In the multiplicative model, the probability z(i, x) of no infection with variant i given
853 that the nearest variant to i in the immune history is variant x is given by:
. /d(i,x)
z1
z(i, x) = z0 (40)
z0
854 where z0 is the probability of no infection given homotypic reinfection, d(i, x) is the
855 antigenic distance in antigenic clusters between i and x, and z1 is the probability of no
856 infection given d(i, x) = 1.
857 In the sigmoid model:
1
z(i, x) = 1 − ! " (41)
b ln(T (i,x))−a
1+e
858 where a = 2.844 and b = 1.299 (α and β estimated in (Coudeville et al., 2010)) and
859 T (i, x) is the individual’s HI titer against variant i (Coudeville et al., 2010). To convert
860 this into a model in terms of z0 and z1 we calculate the typical homotypic titer T0
861 implied by z0 and the n-fold-drop D in titer per unit of antigenic distance implied by
862 z1 , since units in antigenic space correspond to a n-fold reductions in HI titer for some
863 value n (Smith et al., 2004). We calculate T0 by plugging z0 and T0 into equation 41
864 and solving. We calculate D by plugging z1 and T1 = T0 D−1 into equation 41 and
865 solving. We can then calculate T (i, x) as:
56
875 Within-host simulations (Figs. 1, 3)
876 To evaluate the relative probabilities of replication selection and inoculation selection
877 for antigenic novelty and to check the validity of the analytical results, we simulated 106
878 transmissions from a naive host to a previously immune host. The transmitting host
879 was simulated for 10 days, which given the selected model parameters is sufficient time
880 for almost all infections to be cleared. Time of transmission was randomly drawn from
881 the time period during which the infected host had a transmissible virus population and
882 an inoculum of fixed size was drawn from the within-host virus population at that time.
883 Variant counts in the inoculum were Poisson distributed with probability equal to the
884 variant frequency within the transmitting host at the time of transmission. We
885 simulated the recipient host until the clearance of infection and found the maximum
886 frequency of transmissible variant: that is, the variant frequencies when the
887 transmission probability was greater than 5 × 10−2 (probabilistic model) or when the
888 virus population was above the transmission threshold θ (threshold model). For
889 Fig. 3D, we defined an infection with an emerged new antigenic variant as an infection
890 with a maximum transmissible new variant frequency of greater than 50%.
57
919 subsequent infection, according to the within-host model described above. If the
920 recipient host developed a transmissible infection, it became a new donor host. If not,
921 we continued to simulate contacts and possible transmissions for the donor host until
922 recovery. If a donor host recovered without transmitting successfully, the chain was
923 declared extinct and a new chain was founded.
924 We iterated this process until the first phenotypic change event—a generated or
925 transmitted minority phenotype becoming the new majority phenotype. We simulated
926 1000 such events for each model and examined the observed distribution of phenotypic
927 changes compared to the mutation kernel.
928 For the results shown in Fig. 5, we set the population distribution of immune histories
929 as follows: 20% −0.8, 20% −0.5, 20% 0.0, and the remaining 40% of hosts naive. This
930 qualitatively models the directional pressure that is thought to canalize virus evolution
931 (Bedford et al., 2012) once a cluster has begun to circulate.
ns
0
si (0)psurv (κm (i), κw (i), v, b, fmt ) (44)
i=1
950 where κw (i) and κm (i) are the mucosal antibody neutralization probabilities for the old
951 antigenic variant and the new antigenic variant associated with susceptibility class i,
952 and si (0) = SiN(0) is the initial fraction of individuals in susceptibility class i.
953 We then considered a well-mixed population with frequency-dependent transmission,
954 where infected individuals from all susceptibility classes are equally infectious if
955 infected. Using an existing result from epidemic theory (Magal et al., 2018), we
956 calculated R∞ , the average fraction of individuals who are infected if an epidemic
58
957 occurs in such a population. During such an epidemic, each individual will on average
958 be inoculated (challenged) R0 R∞ times, where R0 is the population-level basic
959 reproduction number (Miller, 2012). We can then calculate the probability that a new
960 variant transmission chain is started in an arbitrary focal individual:
ns
0
R0 R ∞ si (0)psurv (κm (i), κw (i), v, b, fmt ) (45)
i=1
59
980 Github repositories and journal websites. We independently verified that reported
981 antigenic site and antigenic ridge amino acid substitutions were correctly indicated,
982 determined the number of subjects with no NGS-detectable antigenic amino acid
983 substitutions, and produced figures from the aggregated data.
60
A B
1.0
1 ± f0
12 µ £ 103
9 µ £ 102
6
variant frequency fm
µ £ 101
0.75 3 µ £ 100
0.8
0.5
0.25
0.6
0
C D
100
0.4
10°1
variant frequency fm
10°2
°3
100.2
10°4
10°5
0.0
0.0
0 1 0.2 2 3 0.4 4 0 0.6 1 2 0.8 3 1.0
4
time since first new variant (days)
61
A immediate recall response B delayed recall response
1.0 analytical
simulated
0.8
cumulative probability
0.6
0.4
0.2
0.0
0.0 0.5 1.0 1.5 2.0 0.0 0.5 1.0 1.5 2.0
emergence time emergence time
62
one percent consensus
100
probability for k = 0.5
10°2
10°4
100
probability for k = 1.0
10°2
10°4
100
probability for k = 2.0
10°2
10°4
100
probability for k = 4.0
10°2
10°4
100
probability for k = 8.0
10°2
10°4
0 1 2 3 0 1 2 3
tM tM
63
996 References
997 Archetti, I., & Horsfall, F. L. (1950). Persistent antigenic variation of influenza A
998 viruses after incomplete neutralization in ovo with heterologous immune serum.
999 Journal of Experimental Medicine, 92 (5), 441–462.
1000 Asaduzzaman, S. M., Ma, J., & van den Driessche, P. (2018). Estimation of
1001 cross-immunity between drifted strains of influenza A/H3N2. Bulletin of
1002 Mathematical Biology, 80 (3), 657–669.
1003 Baccam, P., Beauchemin, C., Macken, C. A., Hayden, F. G., & Perelson, A. S. (2006).
1004 Kinetics of influenza A virus infection in humans. Journal of Virology, 80 (15),
1005 7590–7599.
1006 Ball, F., Britton, T., & Neal, P. (2016). On expected durations of birth–death processes,
1007 with applications to branching processes and SIS epidemics. Journal of Applied
1008 Probability, 53 (1), 203–215.
1009 Bazinet, A. L., Zwickl, D. J., & Cummings, M. P. (2014). A gateway for phylogenetic
1010 analysis powered by grid computing featuring GARLI 2.0. Systematic Biology,
1011 63 (5), 812–8. https://doi.org/10.1093/sysbio/syu031
1012 Bedford, T., Rambaut, A., & Pascual, M. (2012). Canalization of the evolutionary
1013 trajectory of the human influenza virus. BMC Biology, 10 (1), 38.
1014 Bedford, T., Riley, S., Barr, I. G., Broor, S., Chadha, M., Cox, N. J., Daniels, R. S.,
1015 Gunasekaran, C. P., Hurt, A. C., Kelso, A., Et al. (2015). Global circulation
1016 patterns of seasonal influenza viruses vary with antigenic drift. Nature,
1017 523 (7559), 217.
1018 Bonhoeffer, S., Lipsitch, M., & Levin, B. R. (1997). Evaluating treatment protocols to
1019 prevent antibiotic resistance. Proceedings of the National Academy of Sciences,
1020 94 (22), 12106–12111.
1021 Boni, M. F., Gog, J. R., Andreasen, V., & Feldman, M. W. (2006). Epidemic dynamics
1022 and antigenic evolution in a single season of influenza A. Proceedings of the
1023 Royal Society B: Biological Sciences, 273 (1592), 1307–1316.
1024 Burke, D. F., & Smith, D. J. (2014). A recommended numbering scheme for influenza A
1025 HA subtypes. PLoS ONE, 9 (11), e112302.
1026 https://doi.org/10.1371/journal.pone.0112302
1027 Canini, L., Holzer, B., Morgan, S., Dinie Hemmink, J., Clark, B.,
1028 sLoLa Dynamics Consortium, Woolhouse, M. E., Tchilian, E., & Charleston, B.
1029 (2020). Timelines of infection and transmission dynamics of H1N1pdm09 in
1030 swine. PLoS pathogens, 16 (7), e1008628.
1031 Cao, P., & McCaw, J. M. (2017). The mechanisms for within-host influenza virus
1032 control affect model-based assessment and prediction of antiviral treatment.
1033 Viruses, 9 (8), 197.
1034 Caton, A. J., Brownlee, G. G., Yewdell, J. W., & Gerhard, W. (1982). The antigenic
1035 structure of the influenza virus A/PR/8/34 hemagglutinin (H1 subtype). Cell,
1036 31 (2), 417–427.
1037 Clements, M., Betts, R., Tierney, E., & Murphy, B. (1986). Serum and nasal wash
1038 antibodies associated with resistance to experimental challenge with influenza A
1039 wild-type virus. Journal of Clinical Microbiology, 24 (1), 157–160.
1040 Coro, E. S., Chang, W. W., & Baumgarth, N. (2006). Type I IFN receptor signals
1041 directly stimulate local B cells early following influenza virus infection. Journal
1042 of Immunology, 176 (7), 4343–4351.
64
1043 Coudeville, L., Bailleux, F., Riche, B., Megas, F., Andre, P., & Ecochard, R. (2010).
1044 Relationship between haemagglutination-inhibiting antibody titres and clinical
1045 protection against influenza: Development and application of a Bayesian
1046 random-effects model. BMC Medical Research Methodology, 10 (1), 18.
1047 Davenport, F. M., & Hennessy, A. V. (1956). A serologic recapitulation of past
1048 experiences with influenza A; antibody response to monovalent vaccine. Journal
1049 of Experimental Medicine, 104 (1), 85–97.
1050 Davis, A. K., McCormick, K., Gumina, M. E., Petrie, J. G., Martin, E. T., Xue, K. S.,
1051 Bloom, J. D., Monto, A. S., Bushman, F. D., & Hensley, S. E. (2018). Sera from
1052 individuals with narrowly focused influenza virus antibodies rapidly select viral
1053 escape mutations in ovo. Journal of Virology, 92 (19), e00859–18.
1054 De Jong, J., Smith, D. J., Lapedes, A., Donatelli, I., Campitelli, L., Barigazzi, G.,
1055 Van Reeth, K., Jones, T. C., Rimmelzwaan, G., Osterhaus, A., Et al. (2007).
1056 Antigenic and genetic evolution of swine influenza A (H3N2) viruses in europe.
1057 Journal of Virology, 81 (8), 4315–4322.
1058 Debbink, K., McCrone, J. T., Petrie, J. G., Truscon, R., Johnson, E., Mantlo, E. K.,
1059 Monto, A. S., & Lauring, A. S. (2017). Vaccination has minimal impact on the
1060 intrahost diversity of H3N2 influenza viruses. PLoS Pathogens, 13 (1), e1006194.
1061 https://doi.org/10.1371/journal.ppat.1006194
1062 Dinis, J. M., Florek, N. W., Fatola, O. O., Moncla, L. H., Mutschler, J. P.,
1063 Charlier, O. K., Meece, J. K., Belongia, E. A., & Friedrich, T. C. (2016). Deep
1064 sequencing reveals potential antigenic variants at low frequencies in influenza A
1065 virus-infected humans. Journal of Virology, 90 (7), 3355–3365.
1066 Ferguson, N. M., Galvani, A. P., & Bush, R. M. (2003). Ecological and immunological
1067 determinants of influenza evolution. Nature, 422 (6930), 428.
1068 Fonville, J. M., Wilks, S., James, S. L., Fox, A., Ventresca, M., Aban, M., Jones, T. C.,
1069 Le, N., Pham, Q., Et al. (2014). Antibody landscapes after influenza virus
1070 infection or vaccination. Science, 346 (6212), 996–1000.
1071 Frensing, T., Kupke, S. Y., Bachmann, M., Fritzsche, S., Gallo-Ramirez, L. E., &
1072 Reichl, U. (2016). Influenza virus intracellular replication dynamics, release
1073 kinetics, and particle morphology during propagation in mdck cells. Applied
1074 Microbiology and Biotechnology, 100 (16), 7181–7192.
1075 Ghafari, M., Lumby, C. K., Weissman, D., & Illingworth, C. J. R. (2020). Inferring
1076 transmission bottleneck size from viral sequence data using a novel haplotype
1077 reconstruction method. bioRxiv.
1078 Gog, J. R. (2008). The impact of evolutionary constraints on influenza dynamics.
1079 Vaccine, 26, C15–C24.
1080 Grenfell, B. T., Pybus, O. G., Gog, J. R., Wood, J. L., Daly, J. M., Mumford, J. A., &
1081 Holmes, E. C. (2004). Unifying the epidemiological and evolutionary dynamics of
1082 pathogens. Science, 303 (5656), 327–332.
1083 Hadjichrysanthou, C., Cauët, E., Lawrence, E., Vegvari, C., de Wolf, F., &
1084 Anderson, R. M. (2016). Understanding the within-host dynamics of influenza A
1085 virus: from theory to clinical implications. Journal of The Royal Society
1086 Interface, 13 (119), 20160289.
1087 Han, A. X., Maurer-Stroh, S., & Russell, C. A. (2019). Individual immune selection
1088 pressure has limited impact on seasonal influenza virus evolution. Nature Ecology
1089 & Evolution, 3 (2), 302.
65
1090 Hartfield, M., & Alizon, S. (2014). Epidemiological feedbacks affect evolutionary
1091 emergence of pathogens. The American Naturalist, 183 (4), E105–E117.
1092 Hartfield, M., & Alizon, S. (2015). Within-host stochastic emergence dynamics of
1093 immune-escape mutants. PLoS Computational Biology, 11 (3), e1004149.
1094 Hensley, S. E., Das, S. R., Bailey, A. L., Schmidt, L. M., Hickman, H. D.,
1095 Jayaraman, A., Viswanathan, K., Raman, R., Sasisekharan, R., Bennink, J. R.,
1096 Et al. (2009). Hemagglutinin receptor binding avidity drives influenza A virus
1097 antigenic drift. Science, 326 (5953), 734–736.
1098 Hoft, D. F., Lottenbach, K. R., Blazevic, A., Turan, A., Blevins, T. P., Pacatte, T. P.,
1099 Yu, Y., Mitchell, M. C., Hoft, S. G., & Belshe, R. B. (2017). Comparisons of the
1100 humoral and cellular immune responses induced by live attenuated influenza
1101 vaccine and inactivated influenza vaccine in adults. Clin. Vaccine Immunol.,
1102 24 (1), e00414–16.
1103 Hu, Y., & Li, T. (2009). Highly accurate tau-leaping methods with random corrections.
1104 Journal of Chemical Physics, 130 (12), 03B618.
1105 Iwasa, Y., Michor, F., & Nowak, M. A. (2004). Evolutionary dynamics of invasion and
1106 escape. Journal of Theoretical Biology, 226 (2), 205–214.
1107 Javaid, W., Ehni, J., Gonzalez-Reiche, A. S., Carreno, J. M., Hirsch, E., Tan, J.,
1108 Khan, Z., Kriti, D., Ly, T., Kranitzky, B., Barnett, B., Cera, F., Prespa, L.,
1109 Moss, M., Albrecht, R. A., Mustafa, A., Herbison, I., Hernandez, M. M., Pak, T.,
1110 . . . van Bakel, H. (2020). Real time investigation of a large nosocomial influenza
1111 a outbreak informed by genomic epidemiology. medRxiv.
1112 https://doi.org/10.1101/2020.05.10.20096693
1113 Jones, E., Oliphant, T., Peterson, P., Et al. (2001). SciPy: Open source scientific tools
1114 for Python. http://www.scipy.org/
1115 Katoh, K., Misawa, K., Kuma, K., & Miyata, T. (2002). MAFFT: a novel method for
1116 rapid multiple sequence alignment based on fast Fourier transform. Nucleic
1117 Acids Research, 30 (14), 3059–3066. https://doi.org/10.1093/nar/gkf436
1118 Kennedy, D. A., & Read, A. F. (2017). Why does drug resistance readily evolve but
1119 vaccine resistance does not? Proceedings of the Royal Society B: Biological
1120 Sciences, 284 (1851), 20162562.
1121 Kimura, M. (1968). Evolutionary rate at the molecular level. Nature, 217 (5129),
1122 624–626.
1123 Koel, B. F., Burke, D. F., Bestebroer, T. M., van der Vliet, S., Zondag, G. C.,
1124 Vervaet, G., Skepner, E., Lewis, N. S., Spronken, M. I., Russell, C. A., Et al.
1125 (2013). Substitutions near the receptor binding site determine major antigenic
1126 change during influenza virus evolution. Science, 342 (6161), 976–979.
1127 https://doi.org/10.1126%2Fscience.1244730
1128 Koel, B. F., van Someren Greve, F., Vigeveno, R. M., Pater, M., Russell, C. A., &
1129 de Jong, M. D. (2019). Disparate evolution of virus populations in upper and
1130 lower airways of mechanically ventilated patients. bioRxiv, 509901.
1131 Koelle, K., Cobey, S., Grenfell, B. T., & Pascual, M. (2006). Epochal evolution shapes
1132 the phylodynamics of interpandemic influenza A (H3N2) in humans. Science,
1133 314 (5807), 1898–1903. https://doi.org/10.1126/science.1132745
1134 Koelle, K., Kamradt, M., & Pascual, M. (2009). Understanding the dynamics of rapidly
1135 evolving pathogens through modeling the tempo of antigenic change: Influenza
1136 as a case study. Epidemics, 1 (2), 129–137.
66
1137 Koelle, K., & Rasmussen, D. A. (2015). The effects of a deleterious mutation load on
1138 patterns of influenza A/H3N2’s antigenic evolution in humans. eLife, 4, e07361.
1139 Krammer, F. (2019). The human antibody response to influenza A virus infection and
1140 vaccination. Nature Reviews Immunology, 1.
1141 Krammer, F., & Palese, P. (2019). Universal influenza virus vaccines that target the
1142 conserved hemagglutinin stalk and conserved sites in the head domain. Journal
1143 of Infectious Diseases, 219 (Supplement_1), S62–S67.
1144 Kucharski, A. J., & Gog, J. R. (2011). Influenza emergence in the face of evolutionary
1145 constraints. Proc. R. Soc. Lond. B: Biol. Sci.
1146 Kucharski, A. J., Lessler, J., Read, J. M., Zhu, H., Jiang, C. Q., Guan, Y.,
1147 Cummings, D. A., & Riley, S. (2015). Estimating the life course of influenza A
1148 (H3N2) antibody responses from cross-sectional data. PLoS Biology, 13 (3),
1149 e1002082.
1150 Lam, J. H., & Baumgarth, N. (2019). The multifaceted B cell response to influenza
1151 virus. Journal of Immunology, 202 (2), 351–359.
1152 Lau, L. L., Cowling, B. J., Fang, V. J., Chan, K.-H., Lau, E. H., Lipsitch, M.,
1153 Cheng, C. K., Houck, P. M., Uyeki, T. M., Peiris, J. M., Et al. (2010). Viral
1154 shedding and clinical illness in naturally acquired influenza virus infections.
1155 Journal of Infectious Diseases, 201 (10), 1509–1516.
1156 Le Sage, V., Jones, J. E., Kormuth, K. A., Fitzsimmons, W. J., Nturibi, E.,
1157 Padovani, G. H., Arevalo, C. P., French, A. J., Avery, A. J., Manivanh, R.,
1158 McGrady, E. E., Bhagwat, A. R., Lauring, A. S., Hensley, S. E., &
1159 Lakdawala, S. S. (2020). Pre-existing immunity provides a barrier to airborne
1160 transmission of influenza viruses. bioRxiv.
1161 https://doi.org/10.1101/2020.06.15.103747
1162 Lee, J. M., Eguia, R., Zost, S. J., Choudhary, S., Wilson, P. C., Bedford, T.,
1163 Stevens-Ayers, T., Boeckh, M., Hurt, A., Lakdawala, S. S., Hensley, S. E., &
1164 Bloom, J. D. (2019). Mapping person-to-person variation in viral mutations that
1165 escape polyclonal serum targeting influenza hemagglutinin. bioRxiv, 670497.
1166 Leonard, A. S., McClain, M. T., Smith, G. J., Wentworth, D. E., Halpin, R. A., Lin, X.,
1167 Ransier, A., Stockwell, T. B., Das, S. R., Gilbert, A. S., Et al. (2016). Deep
1168 sequencing of influenza A virus from a human challenge study reveals a selective
1169 bottleneck and only limited intrahost genetic diversification. Journal of Virology,
1170 90 (24), 11247–11258.
1171 Lessler, J., Riley, S., Read, J. M., Wang, S., Zhu, H., Smith, G. J., Guan, Y.,
1172 Jiang, C. Q., & Cummings, D. A. (2012). Evidence for antigenic seniority in
1173 influenza A (H3N2) antibody responses in southern China. PLoS Pathogens,
1174 8 (7), e1002802.
1175 Lewis, N. S., Russell, C. A., Langat, P., Anderson, T. K., Berger, K., Bielejec, F.,
1176 Burke, D. F., Dudas, G., Fonville, J. M., Fouchier, R. A., Et al. (2016). The
1177 global antigenic diversity of swine influenza A viruses. eLife, 5, e12217.
1178 Linderman, S. L., Chambers, B. S., Zost, S. J., Parkhouse, K., Li, Y., Herrmann, C.,
1179 Ellebedy, A. H., Carter, D. M., Andrews, S. F., Zheng, N.-Y., Et al. (2014).
1180 Potential antigenic explanation for atypical H1N1 infections among middle-aged
1181 adults during the 2013–2014 influenza season. Proceedings of the National
1182 Academy of Sciences, 111 (44), 15798–15803.
67
1183 Lumby, C. K., Nene, N. R., & Illingworth, C. J. (2018). A novel framework for inferring
1184 parameters of transmission from viral sequence data. PLoS Genetics, 14 (10),
1185 e1007718.
1186 Lumby, C. K., Zhao, L., Breuer, J., & Illingworth, C. J. R. (2020). A large effective
1187 population size for within-host influenza virus infection. eLife, 9 (e56915).
1188 https://doi.org/10.7554/eLife.56915
1189 Luo, S., Reed, M., Mattingly, J. C., & Koelle, K. (2012). The impact of host immune
1190 status on the within-host and population dynamics of antigenic immune escape.
1191 Journal of The Royal Society Interface, 9 (75), 2603–2613.
1192 Magal, P., Seydi, O., & Webb, G. (2018). Final size of a multi-group SIR epidemic
1193 model: Irreducible and non-irreducible modes of transmission. Mathematical
1194 Biosciences, 301, 59–67.
1195 Matrosovich, M. N., Matrosovich, T. Y., Gray, T., Roberts, N. A., & Klenk, H.-D.
1196 (2004). Neuraminidase is important for the initiation of influenza virus infection
1197 in human airway epithelium. Journal of Virology, 78 (22), 12665–12667.
1198 McCrone, J. T., Woods, R. J., Martin, E. T., Malosh, R. E., Monto, A. S., &
1199 Lauring, A. S. (2018). Stochastic processes constrain the within and between
1200 host evolution of influenza virus. eLife, 7, e35962.
1201 Memoli, M. J., Han, A., Walters, K.-A., Czajkowski, L., Reed, S., Athota, R.,
1202 Angela Rosas, L., Cervantes-Medina, A., Park, J.-K., Morens, D. M., Et al.
1203 (2019). Influenza A reinfection in sequential human challenge: Implications for
1204 protective immunity and universal vaccine development. Clinical Infectious
1205 Diseases, ciz281.
1206 Miller, J. C. (2012). A note on the derivation of epidemic final sizes. Bulletin of
1207 Mathematical Biology, 1–17.
1208 Moncla, L. H., Zhong, G., Nelson, C. W., Dinis, J. M., Mutschler, J., Hughes, A. L.,
1209 Watanabe, T., Kawaoka, Y., & Friedrich, T. C. (2016). Selective bottlenecks
1210 shape evolutionary pathways taken during mammalian adaptation of a 1918-like
1211 avian influenza virus. Cell Host & Microbe, 19 (2), 169–180.
1212 Neuzil, K. M., Jackson, L. A., Nelson, J., Klimov, A., Cox, N., Bridges, C. B., Dunn, J.,
1213 DeStefano, F., & Shay, D. (2006). Immunogenicity and reactogenicity of 1 versus
1214 2 doses of trivalent inactivated influenza vaccine in vaccine-naive 5–8-year-old
1215 children. Journal of Infectious Diseases, 194 (8), 1032–1039.
1216 Nobusawa, E., & Sato, K. (2006). Comparison of the mutation rates of human influenza
1217 A and B viruses. Journal of Virology, 80 (7), 3675–3678.
1218 Ohta, T. (1992). The nearly neutral theory of molecular evolution. Annual Review of
1219 Ecology and Systematics, 23 (1), 263–286.
1220 Pawelek, K. A., Huynh, G. T., Quinlivan, M., Cullinane, A., Rong, L., & Perelson, A. S.
1221 (2012). Modeling within-host dynamics of influenza virus infection including
1222 immune responses. PLoS Computational Biology, 8 (6), e1002588.
1223 Perelson, A. S., Rong, L., & Hayden, F. G. (2012). Combination antiviral therapy for
1224 influenza: Predictions from modeling of human infections. Journal of Infectious
1225 Diseases, 205 (11), 1642–1645.
1226 Perron, G. G., Inglis, R. F., Pennings, P. S., & Cobey, S. (2015). Fighting microbial
1227 drug resistance: A primer on the role of evolutionary biology in public health.
1228 Evolutionary Applications, 8 (3), 211–222.
1229 Petrova, V. N., & Russell, C. A. (2018). The evolution of seasonal influenza viruses.
1230 Nature Reviews Microbiology, 16 (1), 47.
68
1231 Quinlivan, M., Nelly, M., Prendergast, M., Breathnach, C., Horohov, D., Arkins, S.,
1232 Chiang, Y.-W., Chu, H.-J., Ng, T., & Cullinane, A. (2007). Pro-inflammatory
1233 and antiviral cytokine expression in vaccinated and unvaccinated horses exposed
1234 to equine influenza virus. Vaccine, 25 (41), 7056–7064.
1235 Recker, M., Pybus, O. G., Nee, S., & Gupta, S. (2007). The generation of influenza
1236 outbreaks by a network of host immune responses against a limited set of
1237 antigenic types. Proceedings of the National Academy of Sciences, 104 (18),
1238 7711–7716.
1239 Russell, C. A., Fonville, J. M., Brown, A. E., Burke, D. F., Smith, D. L., James, S. L.,
1240 Herfst, S., Van Boheemen, S., Linster, M., Schrauwen, E. J., Katzelnick, L.,
1241 Mosterín, A., Kuiken, T., Maher, E., Neumann, G., Osterhaus, A. D. M. E.,
1242 Kawaoka, Y., Fouchier, R. A. M., & Smith, D. J. (2012). The potential for
1243 respiratory droplet–transmissible A/H5N1 influenza virus to evolve in a
1244 mammalian host. Science, 336 (6088), 1541–1547.
1245 Russell, C. A., Jones, T. C., Barr, I. G., Cox, N. J., Garten, R. J., Gregory, V.,
1246 Gust, I. D., Hampson, A. W., Hay, A. J., Hurt, A. C., Et al. (2008). The global
1247 circulation of seasonal influenza A (H3N2) viruses. Science, 320 (5874), 340–346.
1248 Sigal, D., Reid, J. N., & Wahl, L. M. (2018). Effects of transmission bottlenecks on the
1249 diversity of influenza a virus. Genetics, 210 (3), 1075–1088.
1250 Smith, D. J., Lapedes, A. S., de Jong, J. C., Bestebroer, T. M., Rimmelzwaan, G. F.,
1251 Osterhaus, A. D., & Fouchier, R. A. (2004). Mapping the antigenic and genetic
1252 evolution of influenza virus. Science, 305 (5682), 371–376.
1253 Stamatakis, A. (2014). RAxML version 8: A tool for phylogenetic analysis and
1254 post-analysis of large phylogenies. Bioinformatics, 30 (9), 1312–3.
1255 https://doi.org/10.1093/bioinformatics/btu033
1256 Strelkowa, N., & Lässig, M. (2012). Clonal interference in the evolution of influenza.
1257 Genetics, 192 (2), 671–682. https://doi.org/10.1534/genetics.112.143396
1258 Suess, T., Remschmidt, C., Schink, S. B., Schweiger, B., Heider, A., Milde, J.,
1259 Nitsche, A., Schroeder, K., Doellinger, J., Braun, C., Et al. (2012). Comparison
1260 of shedding characteristics of seasonal influenza virus (sub) types and influenza
1261 A (H1N1) pdm09; Germany, 2007–2011. PloS ONE, 7 (12), e51653.
1262 Tsang, T. K., Cowling, B. J., Fang, V. J., Chan, K.-H., Ip, D. K., Leung, G. M.,
1263 Peiris, J. M., & Cauchemez, S. (2015). Influenza A virus shedding and infectivity
1264 in households. Journal of Infectious Diseases, 212 (9), 1420–1428.
1265 Valesano, A. L., Fitzsimmons, W. J., McCrone, J. T., Petrie, J. G., Monto, A. S.,
1266 Martin, E. T., & Lauring, A. S. (2019). Influenza b viruses exhibit lower
1267 within-host diversity than influenza a viruses in human hosts. BioRxiv, 791038.
1268 Varble, A., Albrecht, R. A., Backes, S., Crumiller, M., Bouvier, N. M., Sachs, D.,
1269 García-Sastre, A., Et al. (2014). Influenza A virus transmission bottlenecks are
1270 defined by infection route and recipient host. Cell Host & Microbe, 16 (5),
1271 691–700.
1272 Volkov, I., Pepin, K. M., Lloyd-Smith, J. O., Banavar, J. R., & Grenfell, B. T. (2010).
1273 Synthesizing within-host and population-level selective pressures on viral
1274 populations: The impact of adaptive immunity on viral immune escape. Journal
1275 of The Royal Society Interface, 7 (50), 1311–1318.
1276 Wang, Y.-Y., Harit, D., Subramani, D. B., Arora, H., Kumar, P. A., & Lai, S. K.
1277 (2017). Influenza-binding antibodies immobilise influenza viruses in fresh human
1278 airway mucus. European Respiratory Journal, 49 (1), 1601709.
69
1279 Weis, J. F., Baeten, J. M., McCoy, C. O., Warth, C., Donnell, D., Thomas, K. K.,
1280 Hendrix, C. W., Marzinke, M. A., Mugo, N., Matsen IV, F. A., Et al. (2016).
1281 Prep-selected drug resistance decays rapidly after drug cessation. AIDS, 30 (1),
1282 31.
1283 Welkers, M. R., Pawestri, H. A., Fonville, J. M., Sampurno, O. D., Pater, M.,
1284 Holwerda, M., Han, A. X., Russell, C. A., Jeeninga, R. E., Setiawaty, V., Et al.
1285 (2019). Genetic diversity and host adaptation of avian H5N1 influenza viruses
1286 during human infection. Emerging Microbes & Infections, 8 (1), 262–271.
1287 Wikramaratna, P. S., Sandeman, M., Recker, M., & Gupta, S. (2013). The antigenic
1288 evolution of influenza: drift or thrift? Philosophical Transactions of the Royal
1289 Society B: Biological Sciences, 368 (1614), 20120200.
1290 Wiley, D., Wilson, I., & Skehel, J. (1981). Structural identification of the
1291 antibody-binding sites of Hong Kong influenza haemagglutinin and their
1292 involvement in antigenic variation. Nature, 289 (5796), 373.
1293 Wilker, P. R., Dinis, J. M., Starrett, G., Imai, M., Hatta, M., Nelson, C. W.,
1294 OConnor, D. H., Hughes, A. L., Neumann, G., Kawaoka, Y., Et al. (2013).
1295 Selection on haemagglutinin imposes a bottleneck during mammalian
1296 transmission of reassortant H5N1 influenza viruses. Nature Communications, 4,
1297 2636.
1298 Wrammert, J., Smith, K., Miller, J., Langley, W. A., Kokko, K., Larsen, C.,
1299 Zheng, N.-Y., Mays, I., Garman, L., Helms, C., Et al. (2008). Rapid cloning of
1300 high-affinity human monoclonal antibodies against influenza virus. Nature,
1301 453 (7195), 667.
1302 Xue, K. S., & Bloom, J. D. (2019). Reconciling disparate estimates of viral genetic
1303 diversity during human influenza infections. Nature Genetics, 51 (9), 1298–1301.
1304 https://doi.org/https://doi.org/10.1038/s41588-019-0349-3
1305 Xue, K. S., & Bloom, J. D. (2020). Linking influenza virus evolution within and
1306 between human hosts. Virus evolution, 6 (1), veaa010.
1307 Xue, K. S., Stevens-Ayers, T., Campbell, A. P., Englund, J. A., Pergam, S. A.,
1308 Boeckh, M., & Bloom, J. D. (2017). Parallel evolution of influenza across
1309 multiple spatiotemporal scales. eLife, 6, e26875.
1310 https://doi.org/10.7554/eLife.26875
1311 Yu, G., Smith, D. K., Zhu, H., Guan, Y., & Lam, T. T.-Y. (2017). ggtree: An r package
1312 for visualization and annotation of phylogenetic trees with their covariates and
1313 other associated data. Methods in Ecology and Evolution, 8 (1), 28–36.
1314 https://doi.org/10.1111/2041-210X.12628
1315 Zinder, D., Bedford, T., Gupta, S., & Pascual, M. (2013). The roles of competition and
1316 mutation in shaping antigenic and genetic diversity in influenza. PLoS
1317 Pathogens, 9 (1), e1003104.
1318 Zuccarino-Catania, G. V., Sadanand, S., Weisel, F. J., Tomayko, M. M., Meng, H.,
1319 Kleinstein, S. H., Good-Jacobson, K. L., & Shlomchik, M. J. (2014). CD80 and
1320 PD-L2 define functionally distinct memory B cell subsets that are independent
1321 of antibody isotype. Nature Immunology, 15 (7), 631.
70
1322 Acknowledgments
1323 We thank Daniel B. Cooney, Andrea L. Graham, Joseph Gibson, Katelyn M. Gostic,
1324 Alvin X. Han, Chadi M. Saad-Roy, and Edward C. Schrom for helpful discussions. We
1325 thank Christopher J. Illingworth, Daniel B. Weissman, and an anonymous reviewer for
1326 helpful comments on a prior version of this manuscript.
1327 We thank the GISAID Initiative and the influenza surveillance and research groups who
1328 openly shared the genetic sequence data used in this work (a table with accession
1329 numbers and associated labs is available online in the project materials and data
1330 repositories on Github and OSF, see below for links).
71
1 Appendix
2 Asynchrony between virus diversity and antibody
3 selection limits influenza virus evolution
4
16 Contents
17 1 Summary 2
24 8 Further implications 41
1
26 1 Summary
27 This appendix contains additional information for the paper “Asynchrony between virus
28 diversity and antibody selection limits influenza virus evolution”.
29 We provide a detailed review of virological and immunological evidence supporting our
30 model of the interaction between influenza viruses and the (human) host immune
31 system (Section 2). We give a more detailed mathematical analysis of our within-host
32 model, deriving the approximate and exact analytical results that are used in the main
33 text and provide intuition to explain model behavior and limitations (Section 3). We do
34 the same for our model of the point of transmission, and assess which hosts are most
35 likely to act as inoculation selectors and which mutants are most likely to be
36 inoculation selected (Section 4).
37 We next discuss parameter uncertainties and report the results of a sensitivity analysis
38 for our within-host models (Section 5). We report an additional empirical analysis
39 showing that new variants often emerge polyphyletically: they appear simultaneously
40 on multiple branches of the influenza virus phylogeny. We discuss how this is more
41 easily explained in light of our model than in light of previous models (Section 6).
42 We then review these prior studies and models of within-host influenza virus antigenic
43 evolution and general pathogen immune escape in depth. We argue that they are
44 insufficient to explain within-host influenza virus antigenic evolution and that
45 inoculation selection accompanied by a realistically-timed antibody response is the most
46 likely competing hypothesis (Section 7). We elaborate on the conclusions that can be
47 drawn from our study and its implications for influenza virus control and for future basic
48 research (Section 8). We conclude by providing complete step-by-step mathematical
49 derivations of lemmas and approximations from elsewhere in the text (Section 9).
2
67 irrespective of the HA antigenic phenotype of the newly infecting virus. By contrast,
68 anti-HA antibodies are essential for inoculation selection because they recognise
69 previously encountered antigenic variants. This provides a competitive advantage to
70 new, distinct antigenic variants. The new variants with strongest competitive advantage
71 are those with large antigenic effects; they have the lowest probability to be neutralized
72 by any cross-reactive anti-HA sIgAs.
3
111 variants.
112 The peak of influenza virus replication occurs in the first 24–48 hours post infection
113 (Hadjichrysanthou et al., 2016). Despite the presence of several immunological routes
114 for generation of influenza virus-specific antibodies, the timing of the serological
115 immune response is always later than the time of peak of virus titer (at least in typical
116 infections of otherwise healthy individuals), thus limiting the opportunity for replication
117 selection during the course of infection.
4
154 The dynamics of mucosal immunity with age are also likely to have impact on the
155 selector phenotype of previously exposed hosts. In older individuals, multiple previous
156 exposures will eventually lead to preferential activation of recall responses for antibody
157 production to new variants as a result of original antigenic sin (Davenport & Hennessy,
158 1956), immune backboosting (Fonville et al., 2014), or antigenic seniority (Lessler et al.,
159 2012).
160 Even if the recall responses result in backboosting and higher levels of sIgA, these
161 antibodies are unlikely to be well-matched to the new variant or succeeding variants, as
162 continuous re-exposure favours responses to conserved epitopes (Krammer, 2019;
163 Krammer & Palese, 2019) rather than receptor-binding site epitopes suspected to be
164 most important for immune escape.
165 As a result of the limited ability to generate novel immune responses and the
166 phenomenon of antigenic seniority, an elderly individual who has been exposed multiple
167 times in the past is likely to exert very weak inoculation selection to newly infecting
168 viruses. By contrast, a young person recently exposed to influenza virus is more likely
169 to act as a strong selector, due to the availability of mucosal sIgA antibodies and the
170 more limited impact of original antigenic sin. However, due to the acute nature of
171 influenza virus infection, very young children are likely to require more than one
172 exposure before they begin to develop highly specific antibody responses to influenza
173 virus infection able to exert inoculation selection (Neuzil et al., 2006).
174 Most current efforts for characterizing humoral immunity to influenza virus infections
175 are focused on antibody responses in the serum. However, there is limited
176 understanding of the relationship between antibodies in the serum measured via HI and
177 the levels of mucosal antibodies, able to exert inoculation selection. Better
178 characterisation of the dynamics of mucosal immunity with age and across individuals is
179 needed to understand the implications of inoculation selection to influenza virus
180 evolution depending on the host population structure.
5
194 governed by the following replicator equation (see section 9 for an explicit derivation):
dfm #2 3 2 3$
= fm (1 − fm ) gm (t) − gw (t) − dm (t) − dw (t) (A1)
dt
195 If the new variant is neutral in the absence of antibody-mediated selection,
196 rw = rm = r, and therefore gm (t) = gw (t). It follows that despite changing target cell
197 populations and virion population sizes, the variants compete in a manner independent
198 of Vtot , provided each di (t) is independent of Vtot . We have competition only through
199 the respective death rates of old variant and new variant virions. These may be affected
200 by antibody-mediated neutralization of virions.
+ , + ,
201 If the fitness difference δ(t) = gw (t) − gm (t) − dm (t) − dw (t) is a constant δ, this
202 differential equation has a closed-form solution:
eδt
fm (t) = δt (A2)
e + f0−1 − 1
212 if te > tM :
0 t < te
fm (t) = δ(t−t ) (A4)
eδ(t−te )e+f (te )−1 −1 te ≤ t
m e
213 where δ = k(cw − cm ) is the fitness difference between the variants due to the recall
214 adaptive response.
6
215 3.2 Mutant frequency over time given ongoing de novo
216 mutation
217 Thus far we have neglected mutation the first mutation of interest (i.e. the first that
218 produces a new variant lineage that evades stochastic extinction). In fact, with
219 symmetric mutation between the two types of interest, we have:
dVw + ,
= Vw (1 − µ)gw (t) − dw (t) + µgm (t)Vm
dt (A5)
dVm + ,
= Vm (1 − µ)gm (t) − dm (t) + µgw (t)Vw
dt
220 The rate of fitness-irrelevant mutation is assumed to be equal for the two types, and to
221 preserve antigenic type (w or m). The rate of lethal or deleterious mutants is assumed
222 equal for the two types and is captured in g.
223 It can be shown (see section 9) that the equation for dfm
dt
in this case is:
dfm # $
2 2
= fm (1 − fm )(αm − αw ) + µ gw (t) − 2gw (t)fm + gw (t)fm − gm (t)fm (A6)
dt
When gw (t) = gm (t) = g(t):
dfm # $
= fm (1 − fm ) dm (t) − dw (t) + µg(t)(1 − 2fm ) (A7)
dt
224 When fm ( 0.5, the net effect of symmetric mutation on fm is to increase it at rate ≈ µ
225 (note that the symmetric mutation term becomes zero at fm = 1/2). In other words, it
226 is roughly equivalent to a case with no back mutation at all:
dVw + ,
= Vw (1 − µ)gw (t) − dw (t)
dt (A8)
dVm + ,
= Vm gm (t) − dm (t) + µgw (t)Vw
dt
227 in that case, we have:
dfm #2 3 2 3$ 2
= fm (1−fm ) gm (t)−(1−µ)gw (t) − dm (t)−dw (t) +µgw (t)(1−2fm +fm ) (A9)
dt
228 In either case, we mainly study a situation in which gw (t) = gm (t) = g(t) at all times
229 (though note that g(t) varies in time), dw (t) = dm (t) = d(t) = dv prior to the onset of
230 the adaptive immune response, and dw (t) = dv + k, dm (t) = dv + ck after the onset of
231 the adaptive immune response.
If fm is large relative to µ, the mutant growth term Vm g(t) is large relative to the de
novo mutation term µg(t)Vw . We can see this from the fact that Vm = fm Vtot ≥ fm Vw ,
with equality if and only if fm = 0. It follows that if fm > µ > 0, then:
7
232 But if the first mutation to produce a new variant is sufficiently late and there is little
233 or no positive selection, we do not necessarily have fm , µ, so ongoing mutation cannot
234 be ignored in the dynamics of fm . In that case, we can consider either the
235 µg(t)(1 − 2fm + fm 2
) term in equation A9 or the µg(t)(1 − 2fm ) term in equation A7.
236 We can approximate g(t) ≈ g0 = g(0) = R0 dv early in infection, since for small t,
237 C(t) ≈ Cmax and therefore g(t) = rβC(t) ≈ g0 .
238 With this approximation for g(t), the one-way mutation case has an analytical solution
239 for δ *= 0:
A exp(δt) + µg0
fm (t) ≈
A exp(δt) + µg0 − δ
(A10)
f0 µg0
A= (µg0 − δ) −
1 − f0 f0
1
+ µg0 t − 1
(A11)
1−f0
fm (t) ≈ 1
1−f0
+ µg0 t
dfm
≈ µg0 (A12)
dt
245 and therefore:
246 We can also see this by inspecting the other two expressions for fm (t) and seeing that
247 they are quasi-linear for small µ and small t. This further confirms that it is not crucial
248 to decide whether symmetric or one-way mutation is more realistic, as they will be
249 functionally the same from the point of view of practical mutant frequencies in the
250 absence of positive selection.
251 These approximations break down once C(t) ( Cmax and therefore gw (t) ( g0 ; fewer
252 wild-type replications are occurring, and rate of ongoing mutation therefore falls. So a
253 reasonable approximation for the maximum value of fm (t) in the absence of positive
254 selection is given by fm (tpeak ). For a mutation rate of 0.33 × 10−5 , typical values of
255 fm (tpeak ) without selection range from 10−5 to 2 × 10−4 , depending on R0 and tpeak .
256 We use the expression in A10 to plot the analytical predicted mutant frequencies show
257 in Fig. 1 and to calculate the transmission frequencies for the analytical model of
258 observed phenotypes in 5.
8
259 3.4 Consequences of ongoing de novo mutation
260 Ongoing mutation enhances possibilities for both replication and inoculation selection.
261 It truncates the left tail of the distribution of fm (tM ) and the distribution of fmt (the
262 mutant frequency at the time of transmission). Whenever tM is sufficiently late that we
263 have reached Vw ( µ1 before tM , fm (tM ) should be on the order of the inverse mutation
264 rate, or larger.
265 The consequence is that probabilities of replication selection and inoculation selection
266 should both be somewhat higher than those estimated based on the time to the first
267 non-extinct de novo mutant lineage, as in Fig 1. This further strengthens the case for
268 the importance of an adaptive response at 48 hours or later in explaining the absence of
269 replication selection.
280 the virus peaks once Cmax 1 − R10 target cells have been consumed (assuming
281 negligible
# target $ cell replenishment during the timecourse of infection), and
282
r
C
f max
1 − R0 viral replication events have occurred (this is a slight overestimate due
1
9
299 mechanism can be invoked, such as inoculation selection with replication selection only
300 occurring 2–3 days post-infection.
314 The approximations use the Poisson approximation to the binomial and the
315 approximation ex ≈ 1 + x when |x| ( 1.
316 The unconditional probability of replication selection prepl is the expectation in q of
317 prepl (q): Eq (prepl ). But since prepl is approximately linear in q, the linearity of
318 expectation implies:
325 Note that Iwasa et al., 2004 have previously derived this approximate result for the
326 probability that a subcritical replicator mutates to a supercritical one prior to going
327 extinct in the context of analyzing tumor cell mutations. They use a more rigorous
328 generating function approach.
10
329 4 Point of transmission analysis
330 4.1 Distinction between selection and drift at the point of
331 transmission
332 In our model, mucosal antibodies acting at the point of transmission provide a
333 mechanism for population level selection pressure: some protection of experienced hosts
334 against infection with old variant virus, and worse protection against infection with new
335 variant virus. In particular, it provides a mechanism that produces selection pressure
336 for new antigenic variants while predicting that reinfections with old variant
337 viruses—without observable new variant viruses—should still be observed. Prior models
338 (see section 7) predict that in fully immune hosts, any observable reinfections will have
339 new antigenic variant viruses at consensus.
340 Antibodies at the point of transmission appear to play a key role in population-level
341 influenza virus evolution: they produce the selection pressure that allows new variants
342 to spread more reliably and thus rapidly from host to host than old variants. But they
343 may play a second role as well: promoting of new variants in frequency from the low
344 frequencies at which they are typically generated to high frequencies at which they can
345 be observed and reliably transmitted onward. This inoculation selection takes on the
346 role of previously held by replication selection in explaining why inoculations of
347 experienced hosts might produce new variant infections at a higher rate (per
348 inoculation) than inoculations of naive hosts.
11
364 Whether inoculation selection improves mutant survival relative to pdrift depends on the
365 ratio of v/b, and on the mutant’s probability of avoiding neutralization 1 − κm (main
366 text Figs. 3F, 4, Fig. A1, and sec. 4.5 below).
367 4.3 Expression for and properties of the new variant survival
368 probability
369 As shown in the Methods, the probability that a new variant survives the cell infection
370 bottleneck of size b is well approximated for small fmt by:
371 pinoc is the probability that at least one mutant survives the sIgA bottleneck:
b 0 w̄j #
b−1
b $
pcib (κw , v, b) = (1 − e−w̄ ) + e−w̄ 1−
w̄ j=0
j! j+1
375 where w̄ = v(1 − fmt )(1 − κw ). If b = 1, this expression reduces to the first term.
382 except the last in the summation is negative, since b > j + 1 for j < b − 1. The
383 last term is zero (the last term is included for notational reasons, so that the
384 formula is correct with b = 1). This implies that the first term is always ≥ pcib ,
385 with equality if and only f b = 1.
386 • If κm = σκw , then pinoc is decreasing in κw for 0 < σ ≤ 1. This is as expected: the
387 more likely the new variant is to be neutralized, the smaller pinoc becomes. It
388 suffices to show that dpdκinoc
w
< 0 for 0 < σ ≤ 1. We find:
dpinoc
= −e−vfmt (1−σκw ) (vfmt σ) (A20)
dκw
12
390 It is sometimes useful to write this as:
dpinoc
= vfmt σ(pinoc − 1) (A21)
dκw
391 • pcib is decreasing in w̄ (see section 9.7 for derivation). This is intuitive: the less
392 competition in the form of (expected numbers of) un-neutralized old variant
393 virions, the more likely a surviving mutant is to pass through the cell infection
394 bottleneck.
395 • pcib is increasing in κw and fmt and decreasing in v. Since v, 1 − κw , and 1 − fmt
396 are all ≥ 0, it is clear that w̄ is decreasing in κw and fmt and increasing in v. It
397 follows that pcib is increasing in κw and fmt and decreasing in v. In all cases, this
398 reflects the parameters’ effect on the mean amount of competition for the mutant
399 from old variant virions for the cell infection bottleneck.
400 • pinoc is increasing in fmt , since a larger fmt implies an vfmt (1 − κm ) term that is
401 larger in absolute value, so e−vfmt (1−κm ) becomes smaller and
402 pinoc = 1 − e−vfmt (1−κm ) becomes larger.
410 4.5 The relative importance of selection and drift at the point
411 of inoculation
412 Consider psurv / pdrift . This ratio quantifies the degree to which sIgA neutralization at
413 the point of transmission facilitates (or hinders) mutant survival of the final bottleneck
414 relative to a purely neutral founder effect.
415 For realistically small b and fmt , we can use the approximations pdrift ≈ bfmt and
416 pinoc ≈ vfmt (1 − κm )
417 Then we have
pinoc pcib v
psurv /pdrift = ≈ (1 − κm )pcib (κw , v, b) (A22)
pdrift b
13
422 • Conversely, if we hold κm constant, increasing κw makes inoculation selection
423 more effective relative to drift (this follows from the fact that pcib is increasing in
424 κw , see section 4.3.1 above).
425 • Since pcib is increasing in fmt (section 4.3.1), psurv /pdrift is increasing in fmt . That
426 is, inoculation selection is more efficient relative to drift when promoting higher
427 frequency mutants, all else equal.
428 • psurv /pdrift is decreasing in b (see section 9.9 for a derivation). In other words,
429 when the cell infection bottleneck is wider, inoculation selection is less important
430 relative to drift. The large bottleneck gives the mutant a reasonably good chance
431 of surviving even in the absence of any neutralization of competing old variant
432 virions.
433 For a bottleneck of size b = 1, we can also see that inoculation selection becomes more
434 efficient relative to drift as v increases, since:
v
psurv /pdrift ≈ (1 − κm )pcib
b ! "
1 − exp −v(1 − fmt )(1 − κw )
= v(1 − κm )
v(1 − fmt )(1 − κw )
1 1 − κm ! "
= [1 − exp −v(1 − fmt )(1 − κw ) ]
1 − fmt 1 − κw
1 1 − κm
psurv /pdrift ≈ (v(1 − fmt )(1 − κw ))
1 − fmt 1 − κw
= (1 − κm )v
441 This provides intuition for the effects seen in main text Fig. 4, in which inoculation
442 selection improves survival relative to drift by a factor of vb for a full escape mutant, and
443 by a factor scaled down by 1 − κm for a partial escape mutant.
Moreover, if we let fmt = fd for the drift case and fmt = fr for the selective case, the
approximate ratio becomes
fr
psurv /pdrift ≈ (1 − κm )v
fd
444 Our replicator equation implies that this ratio ffdr will increase in expectation when
445 more replication selection occurs prior to transmission (larger δτ ), explaining the
14
446 increasing ratio of the selective variant survival relative to drift observed in Fig. 4 when
447 replication selection is permitted prior to transmission.
448 Note that we did not integrate over the emergence times to obtain pnv ; instead, we
449 derived principles that take the initial frequency as a given.
462 Consider the endpoint at κw = 1. pcib = 1, pinoc ≈ fmt v(1 − κm ) = fmt v(1 − σ), and
463 w̄ = v(1 − fmt )(1 − κw ) = 0.
+ ,
464 So dκw = −v(1 − fmt )e
dpcib −w̄
S (w̄) − S(w̄) can be evaluated, and it evaluates to 0 except
%
466 Case 1: b > 1. For b > 1, it follows from the above that
dpsurv dpinoc
= 0 − pcib (A24)
dκw dκw
15
478 Case 2: b = 1. When the bottleneck is 1, limw̄→0 dpdw̄cib = − 12 . So in this case, the
479 derivative at κw = 1 may be positive or negative, depending on whether:
dpcib
pinoc + pcib (vfmt σ)(pinoc − 1) > 0 (A25)
dκw
1
(1 − fmt )pinoc + fmt σpinoc > fmt σ
2
1
(1 − fmt )pinoc + σpinoc > σ
2fmt
1
(1 − fmt )pinoc > σ(1 − pinoc )
2fmt
1 − fmt pinoc
>σ
2fmt 1 − pinoc
When pinoc is small, we have approximately pinoc ≈ fmt v(1 − σ) and 1 − pinoc ≈ 1, so:
1 − fmt
v(1 − σ) > c
2
1 − fmt σ
v > (A26)
2 1−σ
481 In other words, with extremely small cell infection bottlenecks like those observed for
482 influenza viruses, fully immune hosts are the best selectors provided that the number of
483 virions v encountering IgA antibodies is sufficiently large given the degree of escape
484 achieved. Less immune escape (larger sIgA cross immunity σ) necessitates a larger v to
485 make fully immune hosts the best selectors. Since fmt ( 1, a rule of thumb is that
486
2σ
v > 1−σ .
487 The intuition is that larger v means more competition for the cell infection bottleneck
488 among virions that reach IgA, and thus a greater opportunity for the mutant’s selective
489 advantage to be realized, but that this only works provided that this advantage is large
490 enough so that the mutant is not itself at too large a risk of being neutralized.
16
495 5.1 Strength of selection
496 A crucial parameter in our model is δ: the magnitude of the fitness advantage of the
497 new variant over the old variant during viral replication in an infected individual. One
498 possible objection to our analysis here is that the antibody mediated virion
499 neutralization rate k is low enough or the antigenic similarity between the variants is
500 great enough to make the fitness difference δ = k(cw − cm ) small. In that case,
501 replication selection to consensus will be rare even if tM is very small, and the adaptive
502 response is mounted immediately (main text Fig. 1C,D). Despite uncertainty about
503 both k and cw − cm , k is likely to be high, perhaps extremely so. And given sufficiently
504 high k and a homotypically reinfected host (cw = 1), even moderate immune escape
505 (0 , cm < 1) produces a substantial fitness difference. Moreover, assuming small k
506 early in infection grants our basic hypothesis: antibodies mediated selection is weak
507 early in infection, neutralization during viral replication is not the mechanism of
508 protection against reinfection, and there is asynchrony between antigenic diversity and
509 meaningful antigenic selection.
510 5.1.1 Antibody-mediated virion neutralization rate (k) may be very high
511 Parameter estimates for antibody-mediated virion neutralization in the presence of
512 substantial well-matched antibody correspond to values of k that are extremely high.
513 For instance, by fitting a single-variant within-host model to data, Cao and McCaw
514 (Cao & McCaw, 2017) estimate antibody neutralization rates between 0.4 and 0.8 per
515 virion-day per pg/mL of antibody, and antibody concentrations of over 100 pg/mL by
516 day 6 in an infection of a naive host. This corresponds to a k of 40 to 80 in our more
517 phenomenological model.
518 For tM = 0 (constant immunity) if k is sufficiently large that the old variant virus has
519 an initial R(0) < 1 (which occurs if k > dv (R0 − 1)), then all visible reinfections will be
520 mutant infections. At R0 < 10 with dv = 4, this corresponds to a k of 40, the lower end
521 of the Cao and McCaw estimates.
522 A k of this magnitude has several implications that support our hypotheses of
523 asynchrony between diversity and selection. Such a k would suffice to drive R(0) below
524 1. A sufficiently early activation of a recall antibody response would then imply that all
525 observable infections of experienced hosts would be mutant infections (Fig. 2). In
526 intermediately immune hosts, there could be a substantial fitness advantage
527 δ = (cw − cm )k for new variant viruses over old variant viruses (where cm < cw are the
528 cross immunities to the host memory variant for the old variant and the new variant,
529 respectively).
530 With large k and tM = 0, intermediately immune hosts who cannot block transmission
531 could easily have R(0) ≈ 1 for the old variant; this would make them excellent at
532 generating and selecting for mutant before an infection is cleared.
533 A final reason why a high k supports the hypotheses of this paper is that antigenic
534 evolution is extremely rapid if a sufficiently strong recall antibody responses becomes
535 active after viral replication begins but before the infection peaks. New antigenic
536 variants should be generated with near-certainty by virus exponential growth well before
537 the infection’s peak—roughly when the number replications that have occurred is at or
538 above the order of magnitude of the inverse mutation rate (Fig. 8). Suppose a strong
17
539 (R(te ) < 1) recall response is mounted at that point te . The mutant then has target cell
540 resources on which to grow (since C(te ) , Cpeak ) and experiences negligible replication
541 competition from the old variant (since the old variant population is not growing but in
542 fact is shrinking). If not lost stochastically, it should therefore emerge to detectable and
543 transmissible levels. We do not observe this. This suggests that if k is large, not only
544 must the antibody response not be immediate (tM > 0), but it must also be late enough
545 enough to make this effect unlikely: it must happen either just before or after the peak
546 of infection. Evidence suggests that this is indeed the case. Infections peak by 36–48
547 hours post-infection, and antibody responses only begin 48–72 hours post infection.
18
580 5.3 Bottleneck sizes
581 Replication selection may improve mutant bottleneck survival relative to neutral drift
582 (see Fig. 3F, Fig. A1, and section 4). For this to be true, a non-antigenic bottlenecking
583 must follow IgA antibody neutralization, with a ratio v/b , 1
584 Evidence suggests that a non-antigenic bottleneck does occur after the sIgA bottleneck
585 because bottlenecks measured in vaccinated and non-vaccinated hosts are of comparable
586 size (indistinguishable from one), suggesting that founding virion population sizes are
587 cut down to small numbers even in the absence of IgA antibodies, though this would
588 also be consistent with the case of v = 1 (i.e. IgA antibodies must neutralize on average
589 a single virion to prevent infection, and this virion, if not neutralized or otherwise lost,
590 uniquely founds the infection).
591 Furthermore, an empirical study of adaptation of an avian influenza virus to a naive
592 mammalian host (ferrets) found that as the virus adapted, measured transmission
593 bottlenecks in naive hosts became tighter (Moncla et al., 2016), while a re-analysis of
594 that data found that bottlenecks were tight throughout (Lumby et al., 2018). The fact
595 that bottlenecks do not appear to loosen (and may tighten) with adaptation for better
596 transmission and replication is further evidence, albeit circumstantial, that when an
597 influenza virus is well-adapted to its host and replicates rapidly, the first virion or first
598 few virions to infect a cell will be the ancestor of the vast majority of progeny viruses.
19
620 5.5 Threshold versus probabilistic transmission
621 In the simulation results shown in the main text (Figs. 1, 2, 3, 5) we use a probabilistic
622 model of transmission: the donor host’s probability of inoculating the recipient at a
623 given point during the infection is proportional to donor viral load in virions, Vtot (t).
624 Because influenza virus infections spike and decline so quickly, this should be roughly
625 equivalent to a simple threshold model, in which transmission can occur (equiprobably)
626 any time Vtot is above some threshold θ and never when it is below θ. We plot
627 equivalent figures here and show that adopting a threshold model does not qualitatively
628 change the results.
20
mucus bottleneck v: 1 mucus bottleneck v: 10 mucus bottleneck v: 25
cell infection bottleneck b: 1 cell infection bottleneck b: 1 cell infection bottleneck b: 1
sIgA cross immunity
10°2 (æ = ∑m/∑w )
0% 90%
10°3 50% 99%
10°4
10°5
10°6
mucus bottleneck v: 5 mucus bottleneck v: 50 mucus bottleneck v: 125
cell infection bottleneck b: 5 cell infection bottleneck b: 5 cell infection bottleneck b: 5
10°2
probability mutant present
after final bottleneck
10°3
10°4
10°5
10°6
mucus bottleneck v: 10 mucus bottleneck v: 100 mucus bottleneck v: 250
cell infection bottleneck b: 10 cell infection bottleneck b: 10 cell infection bottleneck b: 10
10°2
10°3
10°4
10°5
10°6
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00
Fig. A1. Probability that a new variant is present after cell infection (final)
bottleneck as a function of cross immunity and degree of competition for
the final bottleneck. Probability shown as a function of probability of no old variant
infection (zw ), degree of cross immunity between mutant and new variant σ = κm /κw ,
mucus bottleneck size v, and final bottleneck size b. Gray dotted line indicates probability
that a new variant survives the transmission bottleneck in a host who is naive both to
the old variant and to the new variant (i.e. drift). fmt = 9 × 10−5 , a typical value for a
naive transmitting host in stochastic simulations
21
A naive population B immediate recall response C mucosal antibodies and D mucosal antibodies and
immediate recall response realstic (48h) recall response
0.30
0.25
0.20
frequency
0.15
0.10
0.05
0.00
E F G H
0.05
probability density
0.04
0.03
0.02
0.01
0.00
°0.2 0.0 0.2 °0.2 0.0 0.2 °0.2 0.0 0.2 °0.2 0.0 0.2
antigenic change antigenic change antigenic change antigenic change
22
A detectable infections C new variant infections E new variant infections
per inoculation per inoculation per detectable infection
1.0 100 100
1.0 more de novo
new variants
10°1 more inoculated 10°1
0.8 new variants
0.8
immediate recall response
10°2 10°2
0.6
0.6
0.8 10°3 10°3
0.4
0.4 10°4 10°4
B D F
1.0 100 100
1.0
0.4
10°2 10°2
0.6
0.6
10°3 10°3
0.2
0.4
0.4 10°4 10°4
10°6 10°6
0.0
0.0 1 3 100.2 50 10.4 3 10 50
0.6 1 0.8 3 10 50 1.0
bottleneck
23
100
per 100 detectable infections
new variant infections
10
0
1 3 10 50 0.0 0.5 1.0 5 10 15 0 200 400
bottleneck V50 £109 R0 rw
100
per 100 detectable infections
new variant infections
10
0
0 1 2 3 2 4 6 8 5 10 15 0.6 0.8 1.0
µ £10°5 dV k c
100
per 100 detectable infections
new variant infections
10
0
0.0 0.5 1.0 6 7 8 9 0.5 1.0 0.7 0.8 0.9 1.0
tM tN Cmax £109 zw
100
per 100 detectable infections
more common
inoculated new variants
10 more common
0
0.6 0.8 0 20 40 0.00 0.25 0.50 0.75 1.00
zm/zw v/b
Fig. A4. Sensitivity analysis: parameter values versus rate of new variant
infections per 100 detectable infections of an experienced host, given an
unrealistically early recall response. Parameters randomly varied across the ranges
given in table 2, with tM varied between 0 and 1. Each point represents a parameter set;
the rate of new variant infections per hundred 100 detectable infections is estimated from
50 000 simulated inoculations of an experienced host. A new variant infection is defined
as one in which the new variant reached a transmissible frequency of at least 1% at any
point in the infection. Points are colored according to whether new variant infections
were more frequently caused by de novo generated (purple) or inoculated (green) new
variant viruses
.
24
646 5.7 Threshold versus proportional transmission
647 As described in the Methods, simulation figures shown in the paper use a model of
648 transmission in which probability of transmission to a naive host is proportional to viral
649 load.
650 Since influenza virus population sizes grow and shrink rapidly around the within-host
651 peak, however, this model should be qualitatively similar to a threshold model in which
652 transmission occurs with certainty if the total viral load is above a certain threshold θ
653 and does not occur if it is below that threshold. To confirm this, we here show
654 corresponding simulation results based on a threshold model, with the threshold set
655 equal to V50 from the proportional model.
25
100
per 100 detectable infections
new variant infections
10
0
1 3 10 50 0.0 0.5 1.0 5 10 15 0 200 400
bottleneck V50 £109 R0 rw
100
per 100 detectable infections
new variant infections
10
0
0 1 2 3 2 4 6 8 10 20 0.6 0.8 1.0
µ £10°5 dV k c
100
per 100 detectable infections
new variant infections
10
0
2 3 4 6 7 8 9 0 2 4 0.7 0.8 0.9 1.0
tM tN Cmax £109 zw
100
per 100 detectable infections
more common
inoculated new variants
10 more common
0
0.6 0.8 0 20 40 0.00 0.25 0.50 0.75 1.00
zm/zw v/b
Fig. A5. Sensitivity analysis: parameter values versus rate of new variant
infections per 100 detectable infections of an experienced host given a realistic
(48 hours or more post-infection) recall response. Parameters randomly varied
across the ranges given in table 2, with tM varied between 2 and 4.5. Each point represents
a parameter set; the rate of new variant infections per hundred 100 detectable infections
is estimated from 50 000 simulated inoculations of an experienced host. A new variant
infection is defined as one in which the new variant reached a transmissible frequency
of at least 1% at any point in the infection. Points are colored according to whether
new variant infections were more frequently caused by de novo generated (purple) or
inoculated (green) new variant viruses.
.
26
656 6 Polyphyletic antigenicity altering substitutions
657 6.1 Background
658 Amino acid substitutions associated with antigenic “cluster transitions” have reached
659 fixation in parallel—and near synchronously in time—in genetically distinct
660 co-circulating lineages. This has occurred both in A/H1N1 and in A/H3N2 seasonal
661 influenza viruses.
662 Existing “mutation-limited” or “diversity-limited” model explanations of why
663 observable influenza virus antigenic novelty is rare despite continuous genetic evolution
664 are less convincing in light of this synchronous polyphyly (see section 7). In particular,
665 a number of models that large effect antigenicity-altering substitutions can only emerge
666 in specific genetic contexts (Gog, 2008; Koelle & Rasmussen, 2015; Kucharski et al.,
667 2015), and that this constraint limits the rate of population-level antigenic change.
668 Synchronous, polyphyletic cluster transitions cast doubt on these claims. If major
669 antigenic changes appeared infrequently due to the rarity of “jackpot” scenarios of a
670 large-effect mutant in a favorable genetic background (Koelle & Rasmussen, 2015), it
671 would be surprising to see two or three distinct and synchronous jackpots after years of
672 none. So while genetic background may play an important role in determining which
673 lineages are fittest, synchronous polyphyly suggests that it is unlikely to set the clock of
674 antigenic evolution. Rather, it suggests that accumulating population immunity may be
675 crucial in setting the clock, and that there may even be a rising generation rate of
676 observable antigenic novelty as a cluster ages, rather than a fixed rate (see section
677 Population immunity sets the clock of antigenic evolution of the main text).
678 To show that large effect antigenic changes occur near-synchronously in multiple
679 backgrounds, we assessed evolutionary relationships and built phylogenetic trees for
680 known recent polyphyletic antigenicity-altering substitutions that have emerged in
681 A/H1N1 and A/H3N2. Our analysis rules out reassortment as an explanation for the
682 apparent polyphlyly, thus verifying that the branches represent distinct, independent,
683 but simultaneous de novo lineages.
27
698 reconstructed tree using Garli 2.01 with model parameters matching the RAxML
699 reconstruction (500,000 generations) (Bazinet et al., 2014). It was computationally
700 intractable to run Garli on the full phylogeny of A/H3N2 2012–2014 (n = 10107). We
701 visualized phylogenetic trees using Figtree (http://tree.bio.ed.ac.uk/software/figtree)
702 and ggtree (Yu et al., 2017)
703 We mapped amino acid substitutions onto branches using custom Python scripts
704 available in the project Github. All numbering complies with H1/H3 numbering scheme
705 (Burke & Smith, 2014).
28
Table A2: Substitutions that characterize the co-circulating K140E-defined lineages
29
Amino acid at position 140
E
K
Other
D35NK145R
K140E
E273K
R188K
E68G
I47K
K82R
2
K140E N244S
T193K
R188M
A189T
S36N
K140E
Y94H P270S
V128A
A128T
3
K73R
0.01
Fig. A6. Phylogeny of A/H1N1 seasonal viruses for the period 1999 to 2008.
Branch tip colour indicates the amino acid identity at position 140. Co-circulating
lineages defined by the K140E fixation are highlighted. Scale bar indicates the number
of nucleotide substitutions per site. Tree rooted to A/New Caledonia/20/1999
30
726 6.4 A/H3N2 2008-2011
727 In the period 2008-2011, the K158N substitution was only detected once in 2009 before
728 fixing in combination with N189K in the same year in two distinct co-circulating
729 A/H3N2 lineages. N189K was not detected before fixing as a K158N/N189K double
730 substitution. The combination of the substitutions resulted in an antigenic phenotype
731 switch from Wisconsin/2005-like to the Perth/2009-like and Victoria/2009-like
732 phenotypes respectively. Residues 158 and 189 are both located in antigenic site B
733 adjacent to the receptor binding site, with substitutions at both residues characterized
734 as cluster-transition substitutions in multiple historic A/H3N2 cluster transitions (Koel
735 et al., 2013). The HA-defined evolutionary trajectories of viruses from the
736 Perth/2009-like lineage and Victoria/2009-like lineages from their most recent common
737 ancestor are characterized by distinct sets of substitutions (table A3, Fig. A7). Both
738 lineages acquired substitutions at previously characterized antigenic sites (Koel et al.,
739 2013), but only share the T212A substitution.
Table A3: Substitutions that characterize the co-circulating genetic / antigenic clades
defined by K158N and N189K
31
Amino acid at positions 158 and 189
158N/189K
158K/189N
Other
S199A
E62V
1
N145S
N278KQ33R
N312S T46I
A198S
Q57H
T48A R42R
N312S
S45N
N145SN144D
K158N, N189K T212A V223I
T212A R142G
N133D
V213A
2
E50K
P162S
I260M
R261Q
0.004
Fig. A7. Phylogeny of A/H3N2 seasonal viruses for the period 2008 to 2011.
Branch tip colour indicates the amino acid identity at position 158 and 189. Co-circulating
lineages defined by the K158N/N189K fixation are highlighted. Scale bar indicates the
number of nucleotide substitutions per site. Tree rooted to A/Brisbane/10/2007.
32
740 6.5 A/H3N2 2012–2014
741 In 2014 two independent substitution at position 159, F159S and F159Y, fixed in
742 distinct A/H3N2 lineages co-circulating globally (Fig. A8). The S159Y substitution was
743 detected sporadically in 2012/2013 before fixing in 2014 to define the A/H3N2 clade
744 3C.2a, which circulated as the dominant clade globally for the next three years. The
745 F159S substitution was not detected in 2012–2013 before reaching fixation in 2014 to
746 define clade 3C.3a, which continued to circulate globally at low frequencies.
747 Residue 159 is located in antigenic site B. The S159Y substitution in combination with
748 Y155H and K189R substitutions previously resulted in the transition of the A/H3N2
749 Bangkok/79-like antigenic phenotype to Sichuan/87-like phenotype (Koel et al., 2013).
750 The two lineages have independent mutational trajectories for their HA gene away from
751 their most recent common ancestor, excluding both acquiring the N225D substitution
752 prior to F159X-fixation, with changes acquired changes at the major antigenic epitopes
753 (Table A4, Fig. A8) (Koel et al., 2013).
Table A4: Substitutions that characterize the co-circulating genetic / antigenic clades
defined by F159Y and F159S
33
Amino acid at position 159
1
F159S
A138S N225D
R142G T128A
R261L
R142K
2
F159Y
N225D L3I, Q311H, N144S K160T
Fig. A8. Phylogeny of A/H3N2 seasonal viruses for the period 2012 to 2014.
Branch tip colour indicates the amino acid identity at position 159. Co-circulating
lineages defined by the F159X fixation are highlighted. Scale bar indicates the number
of nucleotide substitutions per site. Tree is rooted to A/Perth/16/2009.
34
754 7 Prior theoretical studies
755 In this section, we discuss how our work fits into the substantial existing theoretical
756 literature on influenza virus evolutionary dynamics.
757 A number of theoretical studies over the last 20 years have addressed the tempo and
758 structure of seasonal influenza virus antigenic evolution, but only two have addressed
759 within-host influenza virus antigenic evolution (Luo et al., 2012; Volkov et al., 2010).
760 Most have focused on population level dynamics (Bedford et al., 2012; Ferguson et al.,
761 2003; Gog, 2008; Koelle et al., 2006; Koelle et al., 2009; Koelle & Rasmussen, 2015;
762 Recker et al., 2007; Strelkowa & Lässig, 2012; Wikramaratna et al., 2013; Zinder et al.,
763 2013).
764 Our model suggests revisions to the theoretical understanding of within-host and
765 point-of-transmission evolutionary dynamics. The revised paradigm has implications for
766 the population-level models—most notably, it suggests that the rate of population-level
767 antigenic diversification may not be constant over time.
35
796 events and therefore select upon on a larger, more diverse array of potential pathogen
797 variants. Prophylactic interventions may limit the number of replications before
798 clearance and thereby limit opportunities for diversification. Like the influenza-specific
799 models discussed, this general model does not address the question of how evolution can
800 be rare in symptomatic or otherwise observable infections, where substantial antigenic
801 diversity can be generated.
36
837 7.3 Fitness costs for antigenic mutants
838 Previous population-level studies have argued that non-antigenic fitness costs associated
839 with antigenic substitutions (Gog, 2008; Kucharski & Gog, 2011) or deleterious
840 mutational load on the influenza virus genome (Koelle & Rasmussen, 2015) are
841 necessary to explain the pace of influenza virus antigenic evolution in the human
842 population. These models introduce population-level antigenic novelty at rates 5–10
843 times higher than predicted by inoculation selection, and at equal rates early and late
844 during the circulation of a particular antigenic variant.
845 There are empirical reasons to doubt that influenza viruses are in fact antigenic
846 diversity-limited due to fitness costs. As discussed in section 6), we find multiple
847 instances of polyphyletic cluster transitions: a cluster transition amino acid substitution
848 arises and proliferates on two or more branches of the virus phylogeny at this same
849 time. This is inconsistent with the hypothesis that antigenic variants are either
850 intrinsically unfit (deleterious substitutions) or incidentally unfit (poor background)
851 prior to the moment that they proliferate at the population level (Gog, 2008; Koelle &
852 Rasmussen, 2015; Kucharski et al., 2015), since the cluster transition mutant can reach
853 surveillance-detectable levels even before substantial population immunity has
854 accumulated (suggesting that strong antigenic selection is not required to overcome
855 intrinsic deleteriousness), and it can fully emerge when the time is ripe against multiple
856 independent genetic backgrounds, suggesting background may not suffice to constrain
857 diversification earlier in a cluster’s circulation.
858 In the absence of meaningful replication selection, the population level rate of antigenic
859 diversification is the average rate at which new variants survive all transmission
860 bottlenecks to found or co-found infections. In an inoculation selection regime, that rate
861 is unlikely to surpass 2 in 104 inoculations (Fig. 6A). We obtain that estimate assuming
862 no within-host replication cost for antigenic mutants, weak cross immunity between
863 mutant and old variant viruses, and many experienced hosts who reliably neutralize old
864 variant virions at the point of transmission. Actual rates of new variant survival are
865 almost certainly lower. First, we have assumed that a new antigenic variant can be
866 produced by a single amino acid substitution (accessible via a single nucleotide
867 substitution), however a recent detailed study of the antigenic evolution of A/H3N2
868 viruses shows that some new variants require more than one substitution (Koel et al.,
869 2013). Second, some (possibly many) previously infected hosts will not possess
870 well-matched antibodies due to original antigenic sin (Davenport & Hennessy, 1956),
871 antigenic seniority (Lessler et al., 2012), immune backboosting (Fonville et al., 2014), or
872 other sources individual-specific variation in antibody production (Lee et al., 2019).
873 These factors individually, and particularly in combination, which have not been
874 modelled in this study mean that the above 2 in 104 transmission estimate is likely to
875 be unrealistically high. By comparison, existing population level models that invoke
876 fitness costs introduce population-level antigenic diversity at much higher rates. One
877 study (Koelle & Rasmussen, 2015) introduces new antigenic variants at a rate of 7.5 per
878 104 transmissions; another (Kucharski & Gog, 2011) introduces new population-level
879 mutations at a continuous rate of 6.8 × 10−4 mutations per infected individual per day,
880 which should produce new variant infections by 2 days post infection in at least 1 of
881 every 103 infected hosts:
37
! "
1 − exp −6.8 × 10−4 ∗ 2 ≈ 0.0013
38
921 branching parameter Rw (0), see section 3.6). In this case, homotypic reinfection is truly
922 binary: it results in an undetectable infection that is rapidly cleared or it results in a
923 visible infection dominated by an escape mutant. This is how influenza viruses have
924 been hypothesized to behave in previous models of immune escape (Luo et al., 2012),
925 but it is contradicted by recent empirical work (McCrone et al., 2018; Xue et al., 2017)
926 and by challenge studies (Clements et al., 1986), which suggest that detectable
927 reinfection of experienced hosts frequently occurs without observable immune escape.
39
963 circulated for multiple years (Smith et al., 2004). This suggests that intermediately
964 immune hosts do not generate and select for new antigenic variants as readily as the
965 model in Volkov et al., 2010 implies.
966 Our proposed model of immune selection can explain why intermediately immune hosts
967 may not reliably select for new antigenic variants. A realistically timed recall response
968 makes replication selection unlikely. Rates of inoculation selection are limited in all
969 hosts due by low transmitting host mutant frequencies fmt . Intermediately immune
970 hosts who possesses sIgA that are not well-matched to the old variant may additionally
971 have a relatively low value of κw and therefore be little better than a naive host in
972 promoting new variant survival at the point of transmission.
2 3 2 3
δ(t) = gm (t) − gw (t) − dm (t) − dw (t) (A27)
981 Typically gm (t) = gw (t) and so δ = k(cw − cm ). Here, gm < gw . For instance, we could
982 have rm = qrw , q < 1 and therefore gm = qgw . In that case:
983 We estimate g to be order 20 early in infection and declining subsequently. k may well
984 be order 10. So it is very plausible that a weakly deleterious mutant (q = 0.95, for
985 instance) could still undergo substantial replication selection early in infection if it
986 offered enough immune escape (c = 0.70) and antibodies were present (tM small).
987 The effect could be particularly extreme in intermediately immune hosts if immunity
988 drops superlinearly with antigenic distance. If the old variant is neutralized at a rate
989 kcw = 4 but the mutant is barely neutralized at all cm ≈ 0, it could easily overcome
990 even substantial within-host deleteriousness.
991 In the absence of replication selection, even weak within-host deleteriousness should
992 lead mutants to be rapidly purged (purifying selection should be efficient for phenotypes
993 subject to replication selection) (Sigal et al., 2018), reducing fmt . Within-host
994 deleteriousness would therefore reduce the rate at which new variants survive
995 bottlenecks, since psurv and pdrift both decrease as fmt decreases (see section 4.4).
996 A final possibility is that antigenic mutants could be extremely deleterious in the
997 absence of compensatory mutations: q small enough that δ < 0 even when t > tM (i.e.
998 the old variant is more fit within-host than the new variant even in the presence of a
999 well-matched adaptive immune response). This scenario is unlikely given the strength of
40
1000 recall responses, but it would make inoculation selection and founder effects even more
1001 important for population-level antigenic diversification.
bRw (0)
fmt bpsse > µ psse
1 − Rw (0)
or simply:
Rw (0)
fmt > µ (A29)
1 − Rw (0)
1013 the left-hand side comes from equation A17, which gives an probability that a mutant
1014 with Rm (0) > 1 is inoculated. The right-hand side comes from equation 28, which gives
1015 the probability that an escape mutant is generated in a declining but replicating virus
1016 population before that population goes extinct. psse , which occurs on both sizes and
1017 cancels, is the probability that a mutant survives stochastic extinction. We have also
1018 ignored the fact the the right-hand side should have a term representing the probability
1019 that inoculation selection does not occur, but as we are considering cases when both
1020 inoculation selection and replication selection are rare, that term is approximately one.
1021 On average, fmt > µ due the asymmetry between forward and back mutation rates
1022 when the mutant is rare. It follows that fmt > µ 1−Rw (0) should hold when (A) the
Rw (0)
1023 mutant not is too deleterious in the absence of positive selection (since this reduces fmt )
1024 or (B) Rw (0) is not close to 1 (since then many copies are made on average before the
1025 virus goes extinct). Alternatively, if b is sufficiently large—on the order of
1026
1
fmt
—selection on inoculated diversity will be the dominant mode of evolution simply
1027 because most inocula will include mutants.
1028 Selection on inoculated diversity may be important in many host-pathogen systems, not
1029 just in influenza virus antigenic evolution. Some anti-microbial resistance in bacteria,
1030 for instance, is acquired through the uptake of preexisting plasmids, rather than
1031 through de novo mutation (Perron et al., 2015). Whether drug resistance emerges in an
41
1032 individual treated host will depend on whether any of the bacteria inoculated bear the
1033 needed plasmid.
42
1074 wider than occurs in humans, to use the terminology of main text Fig. 3A). This could
1075 produce a very large value of v, the number of virions encountering IgA, and potentially
1076 a large value of b, the final (cell infection) bottleneck as well—in the presence of
1077 sufficiently many virions, early cell infections might be sufficiently simultaneous as to
1078 produce have observably diverse within-host populations. All of this should tend to
1079 facilitate new variant survival and promotion to observable levels, but also to improve
1080 chances, relative to humans, that at least some old variant virions could survive
1081 alongside the new variant. Second, inoculations were also carried out until a successful
1082 inoculation could be achieved. This further improves the chances of observing an escape
1083 mutant selected at the point of transmission. Finally, vaccinated mice in the passage
1084 chain were inoculated 10–21 days post-vaccination, meaning IgA levels could be higher
1085 than the memory baseline. This should at minimum strengthen inoculation selection
1086 and perhaps even permit replication selection.
1087 One detail in supplementary table 1 of the mouse passage study (Hensley et al., 2009) is
1088 particularly relevant. In the passaging of virus stock #3, two mutants were observed for
1089 the first time in the second vaccinated mouse in the chain, but neither at fixation. Both
1090 remained present for 5 further passages in experienced mice without either being lost or
1091 fixing. This suggests a much wider final bottleneck than occurs in naturally-infected
1092 humans (McCrone et al., 2018). It also suggests that replication selection is unlikely to
1093 have been at all strong if present at all: over repeated exponential growth phases of two
1094 days with wide bottlenecks, even very small fitness differences can be easily amplified,
1095 so the more fit escape mutants should have fixed if they were subject to replication
1096 selection. Inoculation selection might have fixed them too—one of the two did fix, while
1097 the other was purged, in the 8th host. But if cell infection bottlenecks are wide and
1098 neutralization probabilities are low for both viruses, as appears to have been the case in
1099 these passage experiments, inoculation selection can favor a mutant but allow for a long
1100 intermittent period of coexistence.
1101 An example calculation illustrates this: if v = 1000, b = 10, fmt = 0.5, κm /κw > 0.9,
1102 and κw < 0.8 the mutant has an inoculation-selective advantage, but the expected
1103 mutant frequency at after the next transmission event approximately 0.58 or less.
1104 Letting w̄ = E(xw ) and m̄ = E(xm ):
1105 And since w̄, m̄ , b, the hypergeometric sampling is approximately binomial, and
1106 therefore the expected frequency of mutant is just m̄+
m̄
w̄
≈ 0.58
1107 While it is difficult to compare fitness differences during replication to fitness differences
1108 in mucosal neutralization, a very small fitness difference of δ = 0.5 should raise the
1109 fitter type from frequency 0.5 to frequency 0.73 over the course of two days.
43
1114 become sufficiently weak that true drift dominates as a force for introducing new
1115 antigenic variants. In such a regime, we expect the influenza viruses to evolve gradually,
1116 fail to cause large epidemics, and possibly diversify, not unlike influenza B viruses
1117 (Bedford et al., 2015). In particular, slow spread enables a kind of immunological niche
1118 partitioning: if population immune histories are not spatially uniform because
1119 epidemiological spread is slow relative to antigenic diversification, multiple variants that
1120 are each locally favored in distinct areas could co-circulate and form separate lineages,
1121 as has happened with B/Victoria and B/Yamagata.
1122 When antigenic jump size approaches the maximum size possible, such that all
1123 population immunity disappears each time there is an antigenic cluster transition
1124 (Smith et al., 2004), we expect influenza viruses to follow a strongly clock-like pattern
1125 with high-amplitude oscillations in case numbers. They would cause massive
1126 punctuated epidemics during jump years and then in the next year either select for a
1127 new variant or go extinct. There would be huge booms followed by one or more years of
1128 bust. In between, at intermediate degrees of escape, a punctuated but somewhat less
1129 clocklike pattern—such as the pattern observed in nature for A/H3N2 viruses (Smith
1130 et al., 2004)—becomes possible.
44
1156 We propose a previously unexplored consequence of this argument: successful
1157 establishment of a mutant lineage at the population level requires that the mutant
1158 lineage be exported to a new host subpopulation where susceptible hosts are common.
1159 This is rare but potentially selective sampling event.
1160 When a new variant lineage emerges at the population level, it is likely to be
1161 surrounded by many propagating old variant lineages due to the generator-selector
1162 dynamic discussed in the main text (see main text Fig. 6B,C). Moreover, most mutant
1163 lineage generation events will occur as an epidemic is peaking, since that is when the
1164 most inoculations occur. The consequence is that most generated population-level
1165 mutant lineages will encounter severe competition for susceptible hosts (especially if we
1166 consider realistic, spatial host contact networks rather than well-mixed epidemic
1167 models). So a generated mutant lineage is unlikely to account for many cases—or even
1168 necessarily more than one case—in the epidemic in which it emerges.
1169 Many local influenza virus epidemics are likely to be evolutionary dead ends with no
1170 cases exported that establish chains of transmission in other locations. Because only a
1171 minority of the human population is likely to travel to or from the site of any particular
1172 epidemic, the majority of virus diversity generated within each epidemic is likely to be
1173 lost in between-epidemic bottlenecks. Conditional on being exported, new variant
1174 lineages could have competitive advantages over old variant lineages because they
1175 spread should spread more rapidly and go stochastically extinct less easily due to their
1176 higher Re . It follows that there is analogous dynamic to inoculation selection at the
1177 population level: mutant lineages are rarely exported from one sub-population (host,
1178 epidemic) to another, but conditional on being exported, they have an advantage in any
1179 potential competition with exported old variant lineages.
1180 The rate of proliferation of mutants at the population level, then, may be limited by the
1181 rarity of early generation of mutant lineages during an epidemic (analogous to the rarity
1182 of very early within-host de novo generation of mutants) and by the rarity of successful
1183 mutant lineage exportation when generation is not early (analogous to the rarity of
1184 mutant virions surviving the transmission bottleneck between hosts).
1185 We aim to explore this argument with a formal mathematical model in future work.
dfm #2 3 2 3$
= fm (1 − fm ) gm (t) − gw (t) − dm (t) − dw (t)
dt
Derivation. We note that:
Vm (t)
fm (t) :=
Vtot (t)
45
dfm Vtot V̇m − Vm (V̇m + V̇w )
= 2
dt Vtot
2
= αm fm − αm fm − αw fm (1 − fm )
= αm fm (1 − fm ) − αw fm (1 − fm )
= fm (1 − fm )(αw − αm )
#2 3 2 3$
= fm (1 − fm ) gm (t) − gw (t) − dm (t) − dw (t)
1191 !
1194 Derivation. Given symmetric mutation, V̇i = Vi αi (t) + µgj (t)Vj where
1195 αi (t) = (1 − µ)gi (t) − di (t)
1196 Substituting in to the previous derivation yields:
dfm Vtot (αm Vm + µgw (t)Vw ) − Vm (αm Vm + µgw (t)Vw + αw Vw + µgm (t)Vm )
= 2
dt Vtot
# $
= αm fm + µgw (t)(1 − fm ) − fm αm fm + µgw (t)(1 − fm ) + αw (1 − fm ) + µgm (t)fm
2 2
= αm fm + µgw (t)(1 − fm ) − αm fm − µgw (t)fm (1 − fm ) − αw fm (1 − fm ) − µgm (t)fm
! 2
"
= αm fm (1 − fm ) − αw fm (1 − fm ) + µ gw (t)(1 − fm ) − gw (t)fm (1 − fm ) − gm (t)fm
# $
2 2
= fm (1 − fm )(αm − αw ) + µ gw (t) − 2gw (t)fm + gw (t)fm − gm (t)fm
dfm 2 3
= fm (1 − fm ) dm (t) − dw (t) + µg(t)(1 − 2fm )
dt
1198 !
46
1199 9.3 Replicator equation with one-way mutation
1200 Derivation. To find the case with one-way mutation, we simply let all µgm (t)Vm terms
1201 from the symmetric mutation replicator equation be zero. This yields:
dfm # $
2
= fm (1 − fm )(αm − αw ) + µgw (t) 1 − 2fm + fm (A31)
dt
1202 !
t∗ (x, t) = t∗+ (x, t) t∗+ (x, t) < t and t∗− (x, t) > tM
t otherwise
where: + ! " ,
1−x
ln x
exp δ(t − tM ) + 1 − ln b
t∗− (x, t) =
g0 − dv
! 1−x "
ln − ln(b) + δt − cw ktM
t∗+ (x, t) ≈ x
g 0 − d v − cw k + δ
Neglecting ongoing forward mutation the mutant must emerge at some frequency fe in
order to reach frequency x by time t:
ψ(t)
≥x
ψ(t) + fe−1 − 1
1207 fe−1 is determined by the number of old variant virions at time te . Early in infection,
1208 the old variant population grows near-exponentially at a rate G0 = g0 − dv . There are
1209 two cases: either time of new variant emergence (first mutation to produce a new
1210 variant lineage that evades stochastic extinction) occurs before the antibody response is
1211 present (te ≤ tM ) or it emerges once the antibody response is already present (te > tM ).
47
1212 9.4.1 Case 1: te ≤ tM
! "
1213 It follows that if te ≤ tM , fe−1 ≈ b exp (g0 − dv )te = b exp(G0 te ).
. /
G 0 te 1−x
be ≤ ψ(t) +1
x
1214 Taking the natural log of both sides and subtracting off ln b from both sides:
5. / 6
1−x
G0 te ≤ ln ψ(t) +1 − ln b
x
! "
ln ψ(t) 1−x
x
+ 1 − ln b
te ≤
G0
! "
1215 This establishes a value for t∗ when te < tM . In that case, ψ(t) = exp δ(t − tM ) , and:
+ ! " ,
1−x
ln x
exp δ(t − tM ) + 1 − ln b
t∗− (x, t) = (A33)
g0 − dv
2 3
fe−1 = b exp [G0 tM ] exp G1 (te − tM ) = b exp [cw ktM + G1 te ] (A34)
1220 The expression from A32 no longer yields a closed form solution; it is now
! a "
1221 transcendental equation, because ψ(t) now contains te terms: ψ(t) = exp δ(t − te ) .
1222 To deal with this, we make the approximation:
ψ(t)
fm (t) =
ψ(t) + fe−1
1223 This approximation is excellent unless the initial frequency fe is large, and it always
1224 yields an underestimate of fm (t), . This is desirable; we would like to be pessimistic
1225 about the probability of replication selection given early tM and early te > tM to avoid
1226 biasing ourselves in favor of our hypothesis (namely, that replication selection is
1227 unrealistically common if tM is early).
1228 Letting fm (t) = x, the desired frequency at t, we wish to solve:
ψ(t)
x≤
ψ(t) + fe−1
1229 We find:
48
fe−1 x + ψ(t)x ≤ ψ(t)
1−x
fe−1 ≤ ψ(t)
x
! "1−x
b exp (cw ktM + G1 te ) ≤ exp δ(t − te )
x
. /
1−x
ln(b) + cw ktM + G1 te ≤ δt − δte + ln
x
. /
1−x
te (G1 + δ) ≤ ln − ln(b) + δt − cw ktM
x
! 1−x "
ln x
− ln(b) + δt − cw ktM
te ≤
G1 + δ
1231 Note that since we used an underestimate of fm (t), this approximate t∗ is a lower bound
1232 for the true t∗ ; it may be that the new variant can actually emerge later and still
1233 successfully be replication-selected to the desired frequency x.
1234 Finally, it may be that t∗+ (x, t) > t and t∗− (x, t) > t. This indicates that the mutant will
1235 be at at least frequency x if it emerges at t itself. In that case, we therefore have
1236 t∗ (x, t) = t.
1237 Combining:
t− (x, t) t− (x, t) < t and t− (x, t) ≤ tM
∗ ∗ ∗
t∗ (x, t) = t∗+ (x, t) t∗+ (x, t) < t and t∗− (x, t) > tM (A36)
t otherwise
1238 !
b 0 w̄j #
b−1
b $
pcib (κw , fmt , wb) = E(pcib (xw , 1, b)) = (1 − e−w̄ ) + e−w̄ 1−
w̄ j=0
j! j+1
49
1241 where w̄ = v(1 − fmt )(1 − κw ) is the mean number of old variant virions present after
1242 IgA neutralization.
Derivation.
b
pcib (xw , 1, b) =
xw + 1
∞
0 w̄k 1
E(pcib (xw , 1, b)) = b e−w̄
k=0
k! k+1
0∞
b w̄k+1 −w̄
= e
w̄ k=0
(k + 1)!
∞
b 0 w̄j −w̄
= e
w̄ j=1 j!
We know that: ∞
0 λj
−λ
e + e−λ = 1
j=1
j!
∞
0 λj
e−λ = 1 − e−λ
j=1
j!
So substituting in:
∞
b 0 w̄j −w̄ b
e = (1 − e−w̄ )
w̄ j=1 (j)! w̄
1244 This is exact when b = 1, but in other cases pcib (xw , 1, b) is more properly given by
1245 max{ xwb+1 , 1}, since whenever we have xw + 1 < b, a new variant survives with certainty.
1246 So for each such xw < b − 1 we need to add on correction term equal to 1 − xwb+1 ,
weighted by the probability of that xw takes on that value: e−w̄ w̄xw ! .
xw
1247
1248 Summing from xw = 0 to xw = b − 1 and factoring out e−w̄ gives us the complete
1249 correction term:
w̄j # b $
b−1
0
−w̄
e 1−
j=0
j! j+1
1250 Adding that to the expression derived above completes the derivation.
1251 !
50
1252 9.6 pcib = 1 only if there is no competition
1253 pcib < 1 if κw < 1 and fmt < 1, and pcib = 1 if κw = 1 or fmt = 1.
1254 Derivation. This is most easily seen by rewriting pcib as an infinite sum:
+0
b−1
w̄j
∞
0 w̄j b ,
pcib (κw , v, b) = e −w̄
1+ (A37)
j=0
j! j=b
j! j + 1
If w̄ = v(1 − fmt )(1 − κw ) = 0, we have pcib = 1, as expected. Since v > 0, this occurs
either when there are no old variant virions (fmt = 1) or when the old variant is
neutralized with probability 1 (κw = 1). Otherwise, w̄ > 0 and we therefore have:
+0
b−1
w̄j
∞
0 w̄j b , + 0 w̄j 0 w̄j ,
b−1 ∞
pcib = e −w̄
1+ <e −w̄
1+ 1 =1 (A38)
j=0
j! j=b
j! j + 1 j=0
j! j=b
j!
1255 The last step uses the fact that the Poisson distribution is a proper probability
1256 distribution, and therefore sums to 1. !
Derivation. For b = 1,
dpcib e−w̄ (w̄ − ew̄ + 1)
=
dw̄ w̄2
1260 This is negative for w̄ > 0, as ex > x + 1 for x > 0.
1261 That pcib is decreasing in w̄ for b > 1 can be seen by writing:
b−2
0 0∞
w̄j w̄j b
S(w̄) = + (A41)
j=0
j! j=b−1 j! j + 1
51
1263 We begin by showing that S % (w̄) < S(w̄). We differentiate A40 with respect to w̄:
b−1
0 ∞
0
% w̄j−1 w̄j−1 b
S (w̄) = 0 + +
j=1
(j − 1)! j=b (j − 1)! j + 1
b−2 ∞
(A42)
0 w̄j 0 w̄j b
= +
j=0
j! j=b−1 j! j + 2
1264 The first b − 1 terms in S % (w̄) are equal to the corresponding terms in equation A41 for
1265 S(w̄). Each subsequent term is smaller in S % (w̄) than in S(w̄), since j+2
b b
< j+1 for
1266 b, j > 0. So we have established that S (w̄) < S(w̄) for b > 1.
%
dpcib + ,
= −e−w̄ S(w̄) + e−w̄ S % (w̄) = e−w̄ S % (w̄) − S(w̄) < 0 (A43)
dw̄
1268 since e−w̄ > 0 and S % (w̄) − S(w̄) < 0.
1269 It follows that pcib is decreasing in w̄. !
+0
b−1
w̄j
∞
0 w̄j b ,
−w̄
pcib (κw , v, b) = e 1+
j=0
j! j=b
j! j + 1
+0
b
w̄j 0∞
w̄j b + 1 ,
pcib (κw , v, b + 1) = e −w̄
1+ (A44)
j=0
j! j=b+1
j! j + 1
+0
b−1
w̄j 0∞
w̄j b + 1 ,
−w̄
=e 1+
j=0
j! j=b
j! j + 1
+0
∞
w̄j (b + 1) − b ,
−w̄
pcib (b + 1) − pcib (b + 1) = e
j=b
j! j+1
+0∞
w̄j 1 , (A45)
= e−w̄
j=b
j! j + 1
>0
1276 since this is a sum of positive terms, multiplied by a positive number.
52
1277 !
psurv (b) v
≈ (1 − κm )pcib
pdrift (b) b
v + 0 w̄j 0 w̄j b ,
b−1 ∞
= (1 − κm )e−w̄ 1+
b j=0
j! j=b
j! j + 1
+0
b−1
w̄j 1
∞
0 w̄j 1 ,
−w̄
= v(1 − κm )e +
j=0
j! b j=b
j! j + 1
Similarly:
psurv (b + 1) v +0b
w̄j 0∞
w̄j b + 1 ,
−w̄
≈ (1 − κm )e 1+
pdrift (b + 1) b+1 j=0
j! j=b+1
j! j + 1
v + 0 w̄j 0 w̄j b + 1 ,
b−1 ∞
= (1 − κm )e−w̄ 1+
b+1 j=0
j! j=b
j! j + 1
b−1
0 0 w̄j 1 ∞
−w̄ w̄j 1
= v(1 − κm )e +
j=0
j! b + 1 j=b j! j + 1
0 w̄j b−1 . /
psurv (b + 1) psurv (b) 1 1
− ≈ v(1 − κm )e−w̄ − <0
pdrift (b + 1) pdrift (b) j=0
j! b+1 b
1282 That the expression is negative follows from the fact that all terms in the sum are
negative, since b+1 < 1b for b ≥ 1 while w̄j! is always positive. The terms multiplying the
1 j
1283
pinoc = (1 − e−vfmt )
w̄ = v(1 − fmt ) ≈ v
53
e−w̄ ≈ 0
b
pcib ≈
v
b
psurv ≈ (1 − e−vfmt )( )
v
1290 And since −vfmt is small, we can use the linear approximation to the exponential:
b
psurv ≈ vfmt = bfmt
v
1291 !
54
1292 References
1293 Archetti, I., & Horsfall, F. L. (1950). Persistent antigenic variation of influenza A
1294 viruses after incomplete neutralization in ovo with heterologous immune serum.
1295 Journal of Experimental Medicine, 92 (5), 441–462.
1296 Asaduzzaman, S. M., Ma, J., & van den Driessche, P. (2018). Estimation of
1297 cross-immunity between drifted strains of influenza A/H3N2. Bulletin of
1298 Mathematical Biology, 80 (3), 657–669.
1299 Baccam, P., Beauchemin, C., Macken, C. A., Hayden, F. G., & Perelson, A. S. (2006).
1300 Kinetics of influenza A virus infection in humans. Journal of Virology, 80 (15),
1301 7590–7599.
1302 Ball, F., Britton, T., & Neal, P. (2016). On expected durations of birth–death processes,
1303 with applications to branching processes and SIS epidemics. Journal of Applied
1304 Probability, 53 (1), 203–215.
1305 Bazinet, A. L., Zwickl, D. J., & Cummings, M. P. (2014). A gateway for phylogenetic
1306 analysis powered by grid computing featuring GARLI 2.0. Systematic Biology,
1307 63 (5), 812–8. https://doi.org/10.1093/sysbio/syu031
1308 Bedford, T., Rambaut, A., & Pascual, M. (2012). Canalization of the evolutionary
1309 trajectory of the human influenza virus. BMC Biology, 10 (1), 38.
1310 Bedford, T., Riley, S., Barr, I. G., Broor, S., Chadha, M., Cox, N. J., Daniels, R. S.,
1311 Gunasekaran, C. P., Hurt, A. C., Kelso, A., Et al. (2015). Global circulation
1312 patterns of seasonal influenza viruses vary with antigenic drift. Nature,
1313 523 (7559), 217.
1314 Bonhoeffer, S., Lipsitch, M., & Levin, B. R. (1997). Evaluating treatment protocols to
1315 prevent antibiotic resistance. Proceedings of the National Academy of Sciences,
1316 94 (22), 12106–12111.
1317 Boni, M. F., Gog, J. R., Andreasen, V., & Feldman, M. W. (2006). Epidemic dynamics
1318 and antigenic evolution in a single season of influenza A. Proceedings of the
1319 Royal Society B: Biological Sciences, 273 (1592), 1307–1316.
1320 Burke, D. F., & Smith, D. J. (2014). A recommended numbering scheme for influenza A
1321 HA subtypes. PLoS ONE, 9 (11), e112302.
1322 https://doi.org/10.1371/journal.pone.0112302
1323 Canini, L., Holzer, B., Morgan, S., Dinie Hemmink, J., Clark, B.,
1324 sLoLa Dynamics Consortium, Woolhouse, M. E., Tchilian, E., & Charleston, B.
1325 (2020). Timelines of infection and transmission dynamics of H1N1pdm09 in
1326 swine. PLoS pathogens, 16 (7), e1008628.
1327 Cao, P., & McCaw, J. M. (2017). The mechanisms for within-host influenza virus
1328 control affect model-based assessment and prediction of antiviral treatment.
1329 Viruses, 9 (8), 197.
1330 Caton, A. J., Brownlee, G. G., Yewdell, J. W., & Gerhard, W. (1982). The antigenic
1331 structure of the influenza virus A/PR/8/34 hemagglutinin (H1 subtype). Cell,
1332 31 (2), 417–427.
1333 Clements, M., Betts, R., Tierney, E., & Murphy, B. (1986). Serum and nasal wash
1334 antibodies associated with resistance to experimental challenge with influenza A
1335 wild-type virus. Journal of Clinical Microbiology, 24 (1), 157–160.
1336 Coro, E. S., Chang, W. W., & Baumgarth, N. (2006). Type I IFN receptor signals
1337 directly stimulate local B cells early following influenza virus infection. Journal
1338 of Immunology, 176 (7), 4343–4351.
55
1339 Coudeville, L., Bailleux, F., Riche, B., Megas, F., Andre, P., & Ecochard, R. (2010).
1340 Relationship between haemagglutination-inhibiting antibody titres and clinical
1341 protection against influenza: Development and application of a Bayesian
1342 random-effects model. BMC Medical Research Methodology, 10 (1), 18.
1343 Davenport, F. M., & Hennessy, A. V. (1956). A serologic recapitulation of past
1344 experiences with influenza A; antibody response to monovalent vaccine. Journal
1345 of Experimental Medicine, 104 (1), 85–97.
1346 Davis, A. K., McCormick, K., Gumina, M. E., Petrie, J. G., Martin, E. T., Xue, K. S.,
1347 Bloom, J. D., Monto, A. S., Bushman, F. D., & Hensley, S. E. (2018). Sera from
1348 individuals with narrowly focused influenza virus antibodies rapidly select viral
1349 escape mutations in ovo. Journal of Virology, 92 (19), e00859–18.
1350 De Jong, J., Smith, D. J., Lapedes, A., Donatelli, I., Campitelli, L., Barigazzi, G.,
1351 Van Reeth, K., Jones, T. C., Rimmelzwaan, G., Osterhaus, A., Et al. (2007).
1352 Antigenic and genetic evolution of swine influenza A (H3N2) viruses in europe.
1353 Journal of Virology, 81 (8), 4315–4322.
1354 Debbink, K., McCrone, J. T., Petrie, J. G., Truscon, R., Johnson, E., Mantlo, E. K.,
1355 Monto, A. S., & Lauring, A. S. (2017). Vaccination has minimal impact on the
1356 intrahost diversity of H3N2 influenza viruses. PLoS Pathogens, 13 (1), e1006194.
1357 https://doi.org/10.1371/journal.ppat.1006194
1358 Dinis, J. M., Florek, N. W., Fatola, O. O., Moncla, L. H., Mutschler, J. P.,
1359 Charlier, O. K., Meece, J. K., Belongia, E. A., & Friedrich, T. C. (2016). Deep
1360 sequencing reveals potential antigenic variants at low frequencies in influenza A
1361 virus-infected humans. Journal of Virology, 90 (7), 3355–3365.
1362 Ferguson, N. M., Galvani, A. P., & Bush, R. M. (2003). Ecological and immunological
1363 determinants of influenza evolution. Nature, 422 (6930), 428.
1364 Fonville, J. M., Wilks, S., James, S. L., Fox, A., Ventresca, M., Aban, M., Jones, T. C.,
1365 Le, N., Pham, Q., Et al. (2014). Antibody landscapes after influenza virus
1366 infection or vaccination. Science, 346 (6212), 996–1000.
1367 Frensing, T., Kupke, S. Y., Bachmann, M., Fritzsche, S., Gallo-Ramirez, L. E., &
1368 Reichl, U. (2016). Influenza virus intracellular replication dynamics, release
1369 kinetics, and particle morphology during propagation in mdck cells. Applied
1370 Microbiology and Biotechnology, 100 (16), 7181–7192.
1371 Ghafari, M., Lumby, C. K., Weissman, D., & Illingworth, C. J. R. (2020). Inferring
1372 transmission bottleneck size from viral sequence data using a novel haplotype
1373 reconstruction method. bioRxiv.
1374 Gog, J. R. (2008). The impact of evolutionary constraints on influenza dynamics.
1375 Vaccine, 26, C15–C24.
1376 Grenfell, B. T., Pybus, O. G., Gog, J. R., Wood, J. L., Daly, J. M., Mumford, J. A., &
1377 Holmes, E. C. (2004). Unifying the epidemiological and evolutionary dynamics of
1378 pathogens. Science, 303 (5656), 327–332.
1379 Hadjichrysanthou, C., Cauët, E., Lawrence, E., Vegvari, C., de Wolf, F., &
1380 Anderson, R. M. (2016). Understanding the within-host dynamics of influenza A
1381 virus: from theory to clinical implications. Journal of The Royal Society
1382 Interface, 13 (119), 20160289.
1383 Han, A. X., Maurer-Stroh, S., & Russell, C. A. (2019). Individual immune selection
1384 pressure has limited impact on seasonal influenza virus evolution. Nature Ecology
1385 & Evolution, 3 (2), 302.
56
1386 Hartfield, M., & Alizon, S. (2014). Epidemiological feedbacks affect evolutionary
1387 emergence of pathogens. The American Naturalist, 183 (4), E105–E117.
1388 Hartfield, M., & Alizon, S. (2015). Within-host stochastic emergence dynamics of
1389 immune-escape mutants. PLoS Computational Biology, 11 (3), e1004149.
1390 Hensley, S. E., Das, S. R., Bailey, A. L., Schmidt, L. M., Hickman, H. D.,
1391 Jayaraman, A., Viswanathan, K., Raman, R., Sasisekharan, R., Bennink, J. R.,
1392 Et al. (2009). Hemagglutinin receptor binding avidity drives influenza A virus
1393 antigenic drift. Science, 326 (5953), 734–736.
1394 Hoft, D. F., Lottenbach, K. R., Blazevic, A., Turan, A., Blevins, T. P., Pacatte, T. P.,
1395 Yu, Y., Mitchell, M. C., Hoft, S. G., & Belshe, R. B. (2017). Comparisons of the
1396 humoral and cellular immune responses induced by live attenuated influenza
1397 vaccine and inactivated influenza vaccine in adults. Clin. Vaccine Immunol.,
1398 24 (1), e00414–16.
1399 Hu, Y., & Li, T. (2009). Highly accurate tau-leaping methods with random corrections.
1400 Journal of Chemical Physics, 130 (12), 03B618.
1401 Iwasa, Y., Michor, F., & Nowak, M. A. (2004). Evolutionary dynamics of invasion and
1402 escape. Journal of Theoretical Biology, 226 (2), 205–214.
1403 Javaid, W., Ehni, J., Gonzalez-Reiche, A. S., Carreno, J. M., Hirsch, E., Tan, J.,
1404 Khan, Z., Kriti, D., Ly, T., Kranitzky, B., Barnett, B., Cera, F., Prespa, L.,
1405 Moss, M., Albrecht, R. A., Mustafa, A., Herbison, I., Hernandez, M. M., Pak, T.,
1406 . . . van Bakel, H. (2020). Real time investigation of a large nosocomial influenza
1407 a outbreak informed by genomic epidemiology. medRxiv.
1408 https://doi.org/10.1101/2020.05.10.20096693
1409 Jones, E., Oliphant, T., Peterson, P., Et al. (2001). SciPy: Open source scientific tools
1410 for Python. http://www.scipy.org/
1411 Katoh, K., Misawa, K., Kuma, K., & Miyata, T. (2002). MAFFT: a novel method for
1412 rapid multiple sequence alignment based on fast Fourier transform. Nucleic
1413 Acids Research, 30 (14), 3059–3066. https://doi.org/10.1093/nar/gkf436
1414 Kennedy, D. A., & Read, A. F. (2017). Why does drug resistance readily evolve but
1415 vaccine resistance does not? Proceedings of the Royal Society B: Biological
1416 Sciences, 284 (1851), 20162562.
1417 Kimura, M. (1968). Evolutionary rate at the molecular level. Nature, 217 (5129),
1418 624–626.
1419 Koel, B. F., Burke, D. F., Bestebroer, T. M., van der Vliet, S., Zondag, G. C.,
1420 Vervaet, G., Skepner, E., Lewis, N. S., Spronken, M. I., Russell, C. A., Et al.
1421 (2013). Substitutions near the receptor binding site determine major antigenic
1422 change during influenza virus evolution. Science, 342 (6161), 976–979.
1423 https://doi.org/10.1126%2Fscience.1244730
1424 Koel, B. F., van Someren Greve, F., Vigeveno, R. M., Pater, M., Russell, C. A., &
1425 de Jong, M. D. (2019). Disparate evolution of virus populations in upper and
1426 lower airways of mechanically ventilated patients. bioRxiv, 509901.
1427 Koelle, K., Cobey, S., Grenfell, B. T., & Pascual, M. (2006). Epochal evolution shapes
1428 the phylodynamics of interpandemic influenza A (H3N2) in humans. Science,
1429 314 (5807), 1898–1903. https://doi.org/10.1126/science.1132745
1430 Koelle, K., Kamradt, M., & Pascual, M. (2009). Understanding the dynamics of rapidly
1431 evolving pathogens through modeling the tempo of antigenic change: Influenza
1432 as a case study. Epidemics, 1 (2), 129–137.
57
1433 Koelle, K., & Rasmussen, D. A. (2015). The effects of a deleterious mutation load on
1434 patterns of influenza A/H3N2’s antigenic evolution in humans. eLife, 4, e07361.
1435 Krammer, F. (2019). The human antibody response to influenza A virus infection and
1436 vaccination. Nature Reviews Immunology, 1.
1437 Krammer, F., & Palese, P. (2019). Universal influenza virus vaccines that target the
1438 conserved hemagglutinin stalk and conserved sites in the head domain. Journal
1439 of Infectious Diseases, 219 (Supplement_1), S62–S67.
1440 Kucharski, A. J., & Gog, J. R. (2011). Influenza emergence in the face of evolutionary
1441 constraints. Proc. R. Soc. Lond. B: Biol. Sci.
1442 Kucharski, A. J., Lessler, J., Read, J. M., Zhu, H., Jiang, C. Q., Guan, Y.,
1443 Cummings, D. A., & Riley, S. (2015). Estimating the life course of influenza A
1444 (H3N2) antibody responses from cross-sectional data. PLoS Biology, 13 (3),
1445 e1002082.
1446 Lam, J. H., & Baumgarth, N. (2019). The multifaceted B cell response to influenza
1447 virus. Journal of Immunology, 202 (2), 351–359.
1448 Lau, L. L., Cowling, B. J., Fang, V. J., Chan, K.-H., Lau, E. H., Lipsitch, M.,
1449 Cheng, C. K., Houck, P. M., Uyeki, T. M., Peiris, J. M., Et al. (2010). Viral
1450 shedding and clinical illness in naturally acquired influenza virus infections.
1451 Journal of Infectious Diseases, 201 (10), 1509–1516.
1452 Le Sage, V., Jones, J. E., Kormuth, K. A., Fitzsimmons, W. J., Nturibi, E.,
1453 Padovani, G. H., Arevalo, C. P., French, A. J., Avery, A. J., Manivanh, R.,
1454 McGrady, E. E., Bhagwat, A. R., Lauring, A. S., Hensley, S. E., &
1455 Lakdawala, S. S. (2020). Pre-existing immunity provides a barrier to airborne
1456 transmission of influenza viruses. bioRxiv.
1457 https://doi.org/10.1101/2020.06.15.103747
1458 Lee, J. M., Eguia, R., Zost, S. J., Choudhary, S., Wilson, P. C., Bedford, T.,
1459 Stevens-Ayers, T., Boeckh, M., Hurt, A., Lakdawala, S. S., Hensley, S. E., &
1460 Bloom, J. D. (2019). Mapping person-to-person variation in viral mutations that
1461 escape polyclonal serum targeting influenza hemagglutinin. bioRxiv, 670497.
1462 Leonard, A. S., McClain, M. T., Smith, G. J., Wentworth, D. E., Halpin, R. A., Lin, X.,
1463 Ransier, A., Stockwell, T. B., Das, S. R., Gilbert, A. S., Et al. (2016). Deep
1464 sequencing of influenza A virus from a human challenge study reveals a selective
1465 bottleneck and only limited intrahost genetic diversification. Journal of Virology,
1466 90 (24), 11247–11258.
1467 Lessler, J., Riley, S., Read, J. M., Wang, S., Zhu, H., Smith, G. J., Guan, Y.,
1468 Jiang, C. Q., & Cummings, D. A. (2012). Evidence for antigenic seniority in
1469 influenza A (H3N2) antibody responses in southern China. PLoS Pathogens,
1470 8 (7), e1002802.
1471 Lewis, N. S., Russell, C. A., Langat, P., Anderson, T. K., Berger, K., Bielejec, F.,
1472 Burke, D. F., Dudas, G., Fonville, J. M., Fouchier, R. A., Et al. (2016). The
1473 global antigenic diversity of swine influenza A viruses. eLife, 5, e12217.
1474 Linderman, S. L., Chambers, B. S., Zost, S. J., Parkhouse, K., Li, Y., Herrmann, C.,
1475 Ellebedy, A. H., Carter, D. M., Andrews, S. F., Zheng, N.-Y., Et al. (2014).
1476 Potential antigenic explanation for atypical H1N1 infections among middle-aged
1477 adults during the 2013–2014 influenza season. Proceedings of the National
1478 Academy of Sciences, 111 (44), 15798–15803.
58
1479 Lumby, C. K., Nene, N. R., & Illingworth, C. J. (2018). A novel framework for inferring
1480 parameters of transmission from viral sequence data. PLoS Genetics, 14 (10),
1481 e1007718.
1482 Lumby, C. K., Zhao, L., Breuer, J., & Illingworth, C. J. R. (2020). A large effective
1483 population size for within-host influenza virus infection. eLife, 9 (e56915).
1484 https://doi.org/10.7554/eLife.56915
1485 Luo, S., Reed, M., Mattingly, J. C., & Koelle, K. (2012). The impact of host immune
1486 status on the within-host and population dynamics of antigenic immune escape.
1487 Journal of The Royal Society Interface, 9 (75), 2603–2613.
1488 Magal, P., Seydi, O., & Webb, G. (2018). Final size of a multi-group SIR epidemic
1489 model: Irreducible and non-irreducible modes of transmission. Mathematical
1490 Biosciences, 301, 59–67.
1491 Matrosovich, M. N., Matrosovich, T. Y., Gray, T., Roberts, N. A., & Klenk, H.-D.
1492 (2004). Neuraminidase is important for the initiation of influenza virus infection
1493 in human airway epithelium. Journal of Virology, 78 (22), 12665–12667.
1494 McCrone, J. T., Woods, R. J., Martin, E. T., Malosh, R. E., Monto, A. S., &
1495 Lauring, A. S. (2018). Stochastic processes constrain the within and between
1496 host evolution of influenza virus. eLife, 7, e35962.
1497 Memoli, M. J., Han, A., Walters, K.-A., Czajkowski, L., Reed, S., Athota, R.,
1498 Angela Rosas, L., Cervantes-Medina, A., Park, J.-K., Morens, D. M., Et al.
1499 (2019). Influenza A reinfection in sequential human challenge: Implications for
1500 protective immunity and universal vaccine development. Clinical Infectious
1501 Diseases, ciz281.
1502 Miller, J. C. (2012). A note on the derivation of epidemic final sizes. Bulletin of
1503 Mathematical Biology, 1–17.
1504 Moncla, L. H., Zhong, G., Nelson, C. W., Dinis, J. M., Mutschler, J., Hughes, A. L.,
1505 Watanabe, T., Kawaoka, Y., & Friedrich, T. C. (2016). Selective bottlenecks
1506 shape evolutionary pathways taken during mammalian adaptation of a 1918-like
1507 avian influenza virus. Cell Host & Microbe, 19 (2), 169–180.
1508 Neuzil, K. M., Jackson, L. A., Nelson, J., Klimov, A., Cox, N., Bridges, C. B., Dunn, J.,
1509 DeStefano, F., & Shay, D. (2006). Immunogenicity and reactogenicity of 1 versus
1510 2 doses of trivalent inactivated influenza vaccine in vaccine-naive 5–8-year-old
1511 children. Journal of Infectious Diseases, 194 (8), 1032–1039.
1512 Nobusawa, E., & Sato, K. (2006). Comparison of the mutation rates of human influenza
1513 A and B viruses. Journal of Virology, 80 (7), 3675–3678.
1514 Ohta, T. (1992). The nearly neutral theory of molecular evolution. Annual Review of
1515 Ecology and Systematics, 23 (1), 263–286.
1516 Pawelek, K. A., Huynh, G. T., Quinlivan, M., Cullinane, A., Rong, L., & Perelson, A. S.
1517 (2012). Modeling within-host dynamics of influenza virus infection including
1518 immune responses. PLoS Computational Biology, 8 (6), e1002588.
1519 Perelson, A. S., Rong, L., & Hayden, F. G. (2012). Combination antiviral therapy for
1520 influenza: Predictions from modeling of human infections. Journal of Infectious
1521 Diseases, 205 (11), 1642–1645.
1522 Perron, G. G., Inglis, R. F., Pennings, P. S., & Cobey, S. (2015). Fighting microbial
1523 drug resistance: A primer on the role of evolutionary biology in public health.
1524 Evolutionary Applications, 8 (3), 211–222.
1525 Petrova, V. N., & Russell, C. A. (2018). The evolution of seasonal influenza viruses.
1526 Nature Reviews Microbiology, 16 (1), 47.
59
1527 Quinlivan, M., Nelly, M., Prendergast, M., Breathnach, C., Horohov, D., Arkins, S.,
1528 Chiang, Y.-W., Chu, H.-J., Ng, T., & Cullinane, A. (2007). Pro-inflammatory
1529 and antiviral cytokine expression in vaccinated and unvaccinated horses exposed
1530 to equine influenza virus. Vaccine, 25 (41), 7056–7064.
1531 Recker, M., Pybus, O. G., Nee, S., & Gupta, S. (2007). The generation of influenza
1532 outbreaks by a network of host immune responses against a limited set of
1533 antigenic types. Proceedings of the National Academy of Sciences, 104 (18),
1534 7711–7716.
1535 Russell, C. A., Fonville, J. M., Brown, A. E., Burke, D. F., Smith, D. L., James, S. L.,
1536 Herfst, S., Van Boheemen, S., Linster, M., Schrauwen, E. J., Katzelnick, L.,
1537 Mosterín, A., Kuiken, T., Maher, E., Neumann, G., Osterhaus, A. D. M. E.,
1538 Kawaoka, Y., Fouchier, R. A. M., & Smith, D. J. (2012). The potential for
1539 respiratory droplet–transmissible A/H5N1 influenza virus to evolve in a
1540 mammalian host. Science, 336 (6088), 1541–1547.
1541 Russell, C. A., Jones, T. C., Barr, I. G., Cox, N. J., Garten, R. J., Gregory, V.,
1542 Gust, I. D., Hampson, A. W., Hay, A. J., Hurt, A. C., Et al. (2008). The global
1543 circulation of seasonal influenza A (H3N2) viruses. Science, 320 (5874), 340–346.
1544 Sigal, D., Reid, J. N., & Wahl, L. M. (2018). Effects of transmission bottlenecks on the
1545 diversity of influenza a virus. Genetics, 210 (3), 1075–1088.
1546 Smith, D. J., Lapedes, A. S., de Jong, J. C., Bestebroer, T. M., Rimmelzwaan, G. F.,
1547 Osterhaus, A. D., & Fouchier, R. A. (2004). Mapping the antigenic and genetic
1548 evolution of influenza virus. Science, 305 (5682), 371–376.
1549 Stamatakis, A. (2014). RAxML version 8: A tool for phylogenetic analysis and
1550 post-analysis of large phylogenies. Bioinformatics, 30 (9), 1312–3.
1551 https://doi.org/10.1093/bioinformatics/btu033
1552 Strelkowa, N., & Lässig, M. (2012). Clonal interference in the evolution of influenza.
1553 Genetics, 192 (2), 671–682. https://doi.org/10.1534/genetics.112.143396
1554 Suess, T., Remschmidt, C., Schink, S. B., Schweiger, B., Heider, A., Milde, J.,
1555 Nitsche, A., Schroeder, K., Doellinger, J., Braun, C., Et al. (2012). Comparison
1556 of shedding characteristics of seasonal influenza virus (sub) types and influenza
1557 A (H1N1) pdm09; Germany, 2007–2011. PloS ONE, 7 (12), e51653.
1558 Tsang, T. K., Cowling, B. J., Fang, V. J., Chan, K.-H., Ip, D. K., Leung, G. M.,
1559 Peiris, J. M., & Cauchemez, S. (2015). Influenza A virus shedding and infectivity
1560 in households. Journal of Infectious Diseases, 212 (9), 1420–1428.
1561 Valesano, A. L., Fitzsimmons, W. J., McCrone, J. T., Petrie, J. G., Monto, A. S.,
1562 Martin, E. T., & Lauring, A. S. (2019). Influenza b viruses exhibit lower
1563 within-host diversity than influenza a viruses in human hosts. BioRxiv, 791038.
1564 Varble, A., Albrecht, R. A., Backes, S., Crumiller, M., Bouvier, N. M., Sachs, D.,
1565 García-Sastre, A., Et al. (2014). Influenza A virus transmission bottlenecks are
1566 defined by infection route and recipient host. Cell Host & Microbe, 16 (5),
1567 691–700.
1568 Volkov, I., Pepin, K. M., Lloyd-Smith, J. O., Banavar, J. R., & Grenfell, B. T. (2010).
1569 Synthesizing within-host and population-level selective pressures on viral
1570 populations: The impact of adaptive immunity on viral immune escape. Journal
1571 of The Royal Society Interface, 7 (50), 1311–1318.
1572 Wang, Y.-Y., Harit, D., Subramani, D. B., Arora, H., Kumar, P. A., & Lai, S. K.
1573 (2017). Influenza-binding antibodies immobilise influenza viruses in fresh human
1574 airway mucus. European Respiratory Journal, 49 (1), 1601709.
60
1575 Weis, J. F., Baeten, J. M., McCoy, C. O., Warth, C., Donnell, D., Thomas, K. K.,
1576 Hendrix, C. W., Marzinke, M. A., Mugo, N., Matsen IV, F. A., Et al. (2016).
1577 Prep-selected drug resistance decays rapidly after drug cessation. AIDS, 30 (1),
1578 31.
1579 Welkers, M. R., Pawestri, H. A., Fonville, J. M., Sampurno, O. D., Pater, M.,
1580 Holwerda, M., Han, A. X., Russell, C. A., Jeeninga, R. E., Setiawaty, V., Et al.
1581 (2019). Genetic diversity and host adaptation of avian H5N1 influenza viruses
1582 during human infection. Emerging Microbes & Infections, 8 (1), 262–271.
1583 Wikramaratna, P. S., Sandeman, M., Recker, M., & Gupta, S. (2013). The antigenic
1584 evolution of influenza: drift or thrift? Philosophical Transactions of the Royal
1585 Society B: Biological Sciences, 368 (1614), 20120200.
1586 Wiley, D., Wilson, I., & Skehel, J. (1981). Structural identification of the
1587 antibody-binding sites of Hong Kong influenza haemagglutinin and their
1588 involvement in antigenic variation. Nature, 289 (5796), 373.
1589 Wilker, P. R., Dinis, J. M., Starrett, G., Imai, M., Hatta, M., Nelson, C. W.,
1590 OConnor, D. H., Hughes, A. L., Neumann, G., Kawaoka, Y., Et al. (2013).
1591 Selection on haemagglutinin imposes a bottleneck during mammalian
1592 transmission of reassortant H5N1 influenza viruses. Nature Communications, 4,
1593 2636.
1594 Wrammert, J., Smith, K., Miller, J., Langley, W. A., Kokko, K., Larsen, C.,
1595 Zheng, N.-Y., Mays, I., Garman, L., Helms, C., Et al. (2008). Rapid cloning of
1596 high-affinity human monoclonal antibodies against influenza virus. Nature,
1597 453 (7195), 667.
1598 Xue, K. S., & Bloom, J. D. (2019). Reconciling disparate estimates of viral genetic
1599 diversity during human influenza infections. Nature Genetics, 51 (9), 1298–1301.
1600 https://doi.org/https://doi.org/10.1038/s41588-019-0349-3
1601 Xue, K. S., & Bloom, J. D. (2020). Linking influenza virus evolution within and
1602 between human hosts. Virus evolution, 6 (1), veaa010.
1603 Xue, K. S., Stevens-Ayers, T., Campbell, A. P., Englund, J. A., Pergam, S. A.,
1604 Boeckh, M., & Bloom, J. D. (2017). Parallel evolution of influenza across
1605 multiple spatiotemporal scales. eLife, 6, e26875.
1606 https://doi.org/10.7554/eLife.26875
1607 Yu, G., Smith, D. K., Zhu, H., Guan, Y., & Lam, T. T.-Y. (2017). ggtree: An r package
1608 for visualization and annotation of phylogenetic trees with their covariates and
1609 other associated data. Methods in Ecology and Evolution, 8 (1), 28–36.
1610 https://doi.org/10.1111/2041-210X.12628
1611 Zinder, D., Bedford, T., Gupta, S., & Pascual, M. (2013). The roles of competition and
1612 mutation in shaping antigenic and genetic diversity in influenza. PLoS
1613 Pathogens, 9 (1), e1003104.
1614 Zuccarino-Catania, G. V., Sadanand, S., Weisel, F. J., Tomayko, M. M., Meng, H.,
1615 Kleinstein, S. H., Good-Jacobson, K. L., & Shlomchik, M. J. (2014). CD80 and
1616 PD-L2 define functionally distinct memory B cell subsets that are independent
1617 of antibody isotype. Nature Immunology, 15 (7), 631.
61