Professional Documents
Culture Documents
Running Performance, - VO, and Running Economy: The Widespread Issue of Endogenous Selection Bias
Running Performance, - VO, and Running Economy: The Widespread Issue of Endogenous Selection Bias
Running Performance, - VO, and Running Economy: The Widespread Issue of Endogenous Selection Bias
DOI 10.1007/s40279-017-0789-9
REVIEW ARTICLE
123
N. T. Borgen
123
Endogenous Selection Bias in the Literature on Running Performance
123
N. T. Borgen
3 Endogenous Selection Bias To see why, we have to consider what factors influence
the probability of being included in the population of
_ 2max and RE
3.1 Inverse Association Between VO interest (sub-02:15:00 marathon). Consider the empirically
based but simplified example in Fig. 1a where RE and
Let us consider a hypothetical study (for simplicity, costs _ 2max are determined independently, both RE and
VO
and availability are no issue here). Say that we defined the _ 2max affect race performance (RP), and race perfor-
VO
population of interest as male runners with marathon times mance affects the probability of being an elite athlete. In
below 02:15:00, and that we have drawn a random sample this example, VO _ 2max and RE are marginally independent,
from this population of 50 male athletes where we have meaning that knowing an individual’s level of VO _ 2max in
(perfectly) measured RE and VO _ 2max . For these 50 hypo- the full population (elites and non-elites) does not provide
_ 2max and
thetical athletes, we find a correlation between VO any information about the individual’s level of RE (no
RE of 0.25. The correlation is significant and the confi- correlation between VO _ 2max and RE). However, condi-
dence interval is fairly narrow, and we can accordingly tioning on elite status (or restricting the population of
generalize the effect to the population. interest to elites) will induce a spurious inverse relationship
From a merely descriptive point of view, this correlation between VO _ 2max and RE, assuming that the path coefficient
is valid in the sense that the correlation would have been p is negative and the path coefficient c is positive [4].
very similar had we used data on the entire population of Linear path modeling is a useful tool to see why two
sub-02:15:00 male marathoners. Our hypothetical study marginally independent variables (RE and VO _ 2max in
lets us conclude that athletes with high VO_ 2max on average
Fig. 1a) may become dependent if we condition on a
are less efficient (i.e., a higher oxygen cost) and those common outcome of these variables (elite status in Fig. 1a)
having exceptional RE on average have lower VO _ 2max , [41]. If, in a full population sample, we estimate the
which is similar to the conclusions that could be drawn standardized regression coefficient (b) ^ of RE on VO _ 2max
based on the studies in the literature conditional on elite status, we identify
[20, 28, 30, 38, 50, 51].2
However, observed associations consist of both causal cpd2
b^RE;VO2max elite ¼ : ð1Þ
and various noncausal (i.e., spurious) components [10].3 1 p2 d 2
Although the observed inverse association can be gener- Since the true causal effect is 0 (i.e., no correlation),
alized to the population, it is nevertheless at least partly Eq. (1) reveals that conditioning on elite status induces
spurious. It means that the elite marathoners with the bias. This is generally known as Berkson’s paradox [4],
lowest level of VO _ 2max typically have, on average, better endogenous selection bias [10], or collider bias [40].
RE than the elite marathoners with the highest level of Restricting the analysis to a subsample of elite athletes
_ 2max , but that we should not expect to see a detoriation
VO amounts to conditioning on elite status, and hence results in
of an individual’s RE if he/she increased his/her VO _ 2max bias for the same reason [10].4 This is often known as
(through for instance interval training). sample selection bias [41] or sample truncation bias [10].
It may seem counterintuitive at first that restricting the
sample to elite athletes induces a spurious inverse associ-
2 ation between RE and VO _ 2max , but it is actually straight-
Statistical generalization from a sample to a population (of interest)
depends on assumptions such as random sampling. When using forward. Consider the situation where it is known that an
convenience samples, which are commonly used in the literature, the individual is an elite marathoner (sub-02:15:00 marathon)
statistical significance of the correlations may be misleading [5]. With and that this individual has only a mediocre RE compared
convenience samples, not all elites or highly trained individuals are
to other elite marathoners. What could then be inferred
equally likely to be included in the sample, and the study participants
are likely to be more alike with regard to for instance training about his/her VO_ 2max ? It is likely exceptional, because he/
principles (e.g., amount of high-intensity interval training) than the she is unlikely to run a sub-02:15:00 marathon with a
participants would have been had they been selected through a _ 2max . It is not that a high
probability sample. I suspect that this will lead to P values that are too
mediocre RE and a mediocre VO
small and that the uncertainty in the results is underestimated. _ 2max leads to poor RE (causal component), but that
VO
However, the literature routinely report P values without any individuals that become elite athletes despite being
discussion. To explain their sampling procedure, and discuss any
potential bias, researchers should consider using guidelines for
reporting observational studies, for instance the Strengthening the 4
Linear path models allow us to calculate coefficients under the
Reporting of Observational Studies in Epidemiology (STROBE) assumption of linearity and homogeneous effects (no interactions).
statement [53]. Restricting the analysis to elite athletes is the same as adding a control
3
Following the counterfactual model of causality, causal effects are for elite athletes and interactions between elite athletes and all
defined as contrasts between potential outcomes [33]. independent variables.
123
Endogenous Selection Bias in the Literature on Running Performance
a b
V̇O2max V̇O2max
π π
δ δ
RP Elite λ α RP Elite
γ γ
RE RE
Fig. 1 Hypothesized associations between maximal oxygen uptake independent in the population (no arrow between VO _ 2max and RE).
_ 2max ), running economy (RE), race performance (RP), and being an
(VO Controlling for RP or its descendant Elite induces a spurious association
elite athlete (Elite) in the entire population. The single-headed arrows between VO_ 2max and RE, because RP is a common outcome of VO _ 2max
_ 2max affects RP),
represent direct effects from causes to effects (e.g., VO and RE (i.e., collider variable). b An example where (1) an increase in
while bidirectional arrows indicate that two variables have one or more _ 2max is hypothesized to impair RE (a) and (2) the association
VO
causes in common. The Greek letters are path coefficients between pairs between VO_ 2max and RE is confounded (k). Controlling for RP or its
of variables, and are in this article interpreted as correlations (e.g., p is descendant Elite still induces a spurious association between VO _ 2max
the correlation between VO _ 2max and RP). a A simplified example where
and RE
_
both VO2max and RE are assumed to affect RP but are marginally
inefficient must necessarily have other traits that compen- illustrate the amount of bias, we need some assumptions. In
sate, such as a very high VO_ 2max (non-causal component). Fig. 2a, the correlation between RE and VO _ 2max in the full
Note that this endogenous selection bias occurs not population is constrained to be zero. Additionally, the sizes of
simply because the sample is restricted to a subgroup of the the correlations between VO _ 2max and race performance and
population in itself, but because it is restricted in a specific between RE and race performance are constrained to be equal.
way. For instance, restricting the sample to individuals The implication of this latter assumption is that Fig. 2a
with a VO _ 2max above 70 ml kg1 min1 will not lead to illustrates an upper bound for the bias, as further discussed in
endogenous selection bias as long as all individuals in the Appendix S2 of the ESM. Appendix S3 of the ESM provides a
population of interest (individuals with VO _ 2max above supplementary data simulation.
1 1
70 ml kg min ) have an equal chance of being sampled Figure 2a illustrates that the amount of bias is a function
regardless of whether they are elites or not. In that case, of two aspects. First, the bias is greatest in cases where
and assuming that VO _ 2max and RE are truly independently _ 2max and RE explain most of the variation in race per-
VO
determined, there would be no reason to expect that those formance (i.e., stronger correlations between VO _ 2max and
with a relatively low VO_ 2max have a higher RE than those race performance and between RE and race performance).
_
with high VO2max . Thus, the problem occurs when condi- Second, a more elite sample will lead to more bias, which
tioning on the collider variable elite status or, similarly, is intuitive. In a sample of the best 100 marathoners in the
when the sample is restricted to elites only. world, those with (relatively) low VO _ 2max must have
Endogenous selection bias is not about whether the exceptional RE. However, in a sample that consists of all
effects are different for elites and non-elites. Endogenous but the slowest 10% of the population, those with low
selection bias is about how spurious associations are _ 2max could very well have poor RE. That said, the bias
VO
introduced in the data because of (for instance) sample may be substantial even if we include all but the slowest
restriction, resulting in erroneous correlations in the sub- 25% of runners (see the top 75% line in Fig. 2a).
group studied. Thus, the results are not even valid for the After accounting for body mass, the correlation between
subgroup studied: they are statistical artifacts. RE and VO _ 2max in samples of highly trained individuals
ranges from about 0.25 to 0.30 [51]. In Fig. 2a, we see that
3.2 The Size of the Spurious Inverse Relationship among the top 1% and 25% of the runners, the estimated
correlation is about 0.25–0.30 (upper bound) when VO _ 2max
The simple formula in Eq. (1) demonstrates that the associa- and RE explain about 50% of the variation in race per-
tion between VO _ 2max and RE becomes inverse after condi- formance in the population.5 Thus, for the entire
tioning on race performance when working with population
data, and, intuitively, the same holds when restricting the 5
sample to elite athletes only. Figure 2a illustrates the amount The coefficient of determination (R2 ) can be calculated by summing
the squared semipartial correlations, which in this case is identical to
of bias we may expect in settings where (1) the amount of the pairwise correlations (Fig. 1a): R2 ¼ p2 þ c2 . Since p and c are
_ 2max and RE explains is
variation in race performance that VO constrained to be equal, p and c can be calculated from the
1
varied and (2) the sample selectivities differ. However, to figure using ðR2 12Þ2 .
123
N. T. Borgen
a b
0.80 0.00
Estimated correlation
Estimated correlation
0.60 -0.20
0.40 -0.40
0.20 -0.60
0.00 -0.80
0.00 0.20 0.40 0.60 0.80 1.00 -0.80 -0.60 -0.40 -0.20 0.00
2
R True (causal) correlation
Fig. 2 Illustration of the size of endogenous selection bias (see with the top 1, 25, 50, 75, and 99% race performances, as well as for
Appendix S1 in the ESM for the Stata code). a The figure is the entire popoulation. b The figure is constructed by generating a set
constructed by generating a set of hypothetical full populations where of hypothethical full populations where the correlation between
the correlation between RE and VO _ 2max is constrained to be zero, _ 2max and race performance varies between 0 and 0.75 (x-axis).
VO
while the amount of variation in race performance (R2 ) that RE and From each of these populations, the correlation between VO _ 2max and
_ 2max explain varies between 0 and 0.98 (x-axis). The sizes (but not
VO race performance is estimated (y-axis) for individuals in the
signs) of the correlations between RE and race performance and subpopulations with the top 1, 25, 50, 75, and 99% race performances,
between VO_ 2max and race performance are constrained to be equal. as well as for the entire popoulation. Replacing VO _ 2max with RE
From each of these populations, the correlation between RE and produces the same results, only with different signs
_ 2max is estimated (y-axis) for individuals in the subpopulations
VO
correlation between RE and VO _ 2max to be spurious, RE and illustrated the amount of bias we may expect given some
_ 2max may only need to account for approximately 50%
VO simplified assumptions. In the next section, I show that the
of the variation in race performance, which, based on the effects of RE and VO_ 2max on race performance may be
literature, is plausible [11, 26, 32, 47]. This demonstrates biased for the same reason.
that restricting the sample to elite athletes has the potential
to substantially bias the findings in the literature. _ 2max on Race
3.3 The Effects of RE and VO
Thus far, the discussion has relied on the simplified Performance
model where RE and VO _ 2max are assumed to be marginally
independent (Fig. 1a). However, we could also expect RE Many studies that have used elite samples have failed to
and VO _ 2max to be marginally dependent (Fig. 1b). First, find a significant effect of RE and VO _ 2max on race per-
elite runners have both higher VO _ 2max and higher RE than formance [18, 28, 30]. Consider the study of 32 competi-
good runners [43]. This may indicate that some unobserved tive Kenyan runners, in which Mooses et al. [28] found
background factors (e.g., genetics) affect both VO _ 2max and neither a significant effect of RE on IAAF score
(r ¼ 0:01) nor a significant effect of VO _ 2max on IAAF
RE, as suggested by the curved dotted line in Fig. 1b.
Second, we have also seen several explanations for why a score (r ¼ 0:29) [28].6 A study of Olympic trials qualifiers
_ 2max may actually impair RE [30, 38, 51]. Thus, also found a nonsignificant correlation between VO _ 2max
high VO
Fig. 1b includes a direct effect of VO_ 2max on RE. and race performance (r ¼ 0:21) [30].
In this more complex example, the path coefficients However, restricting the sample to elite athletes not only
become more complicated (for a general example, see induces an inverse spurious association between RE and
Pearl [41]), but the bias is no less present. The correlation _ 2max , it also biases the effects of RE and VO
VO _ 2max on race
between VO _ 2max and RE in an elite sample would be equal performance. Conditioning on elite status, which is a
to the causal effect a and some bias caused by (1) unob- descendant of the outcome variable race performance,
served confounding (k) and (2) endogenous selection bias.
Section 3.1 explained why restricting the sample to
6
contain only elite athletes may result in a spurious inverse This study also found an inverse relationship between RE and
_ 2max (r ¼ 0:42), which, as discussed in Sect. 3.1, is at least
VO
association between VO _ 2max and RE, while Sect. 3.2 has
partially spurious.
123
Endogenous Selection Bias in the Literature on Running Performance
induces a spurious association between the predictor vari- The amount of attenuation in studies by Mooses et al.
able and all unmeasured causes of elite status. [28] and Morgan and Daniels [30] is difficult to predict, as
To keep things simple, let us first consider what happens we do not know the true correlation between race perfor-
if we condition on elite status in a regression of race per- mance and elite status (d); nor do we know if RE and
formance on VO _ 2max in a population sample where VO _ 2max _ 2max are exogenous (i.e., the no-confounding assump-
VO
is hypothesized to be exogenous. Based on Fig. 1a, we tion). Sample sizes of 32 [28] and 22 [30] also mean that
identify the following [41]: the point estimates are imprecise. However, assuming that
RE and VO _ 2max are exogenous, correlations of 0:01,
pð1 d2 Þ
b^RP;REelite ¼ ð2Þ 0.29, and 0:21 in an elite sample (top 1%) would be
1 p 2 d2
expected if the true correlations were about 0:05; 0.7,
In this particular example, this means that the estimate is and 0:6. Although this is only speculation based on
biased towards zero, as d [ 0. To see the estimated effect assumptions such as the no-confounding assumption and
of RE, p simply needs to be replaced by c. Restricting the taking the point estimates at face value, it suggests that we
sample to a subsample of elite athletes amounts to condi- could expect large bias when regressing race performance
tioning on race performance, and hence results in bias for on predictors in a sample that consists of elite athletes.
the same reason. Some studies have used longitudinal data to investigate
Figure 2b illustrates the size of the endogenous selection how change in VO _ 2max relates to change in race perfor-
bias in some settings, based on the model in Fig. 1a (see mance [19, 52]. The next section discusses how the within
Appendix S3 in the ESM for a supplementary data simu- estimator removes endogenous selection bias but also has
lation). If there is no bias, we should expect the value on some drawbacks.
the x-axis (the true correlation between VO _ 2max and race
performance) to perfectly match the value on the y-axis
(the estimated correlation), as is the case when the corre- 4 Why Longitudinal Data May Not Be
lation is estimated in the entire population. If the value of the Solution
the y-axis is smaller than the value on the x-axis, as is the
case in all the subpopulations, then the effect of VO _ 2max is 4.1 Within-Subject Variance as a Solution
underestimated.
There are two important take-away points from Fig. 2b. Let us consider the question of what level of RE individual
First, we see that the amount of attenuation bias depends on i1 would have if he/she had a VO_ 2max of 70 ml kg1 min1
the sample selectivity. When comparing results from 1 1
rather than 65 ml kg min . Since we could only
studies of elite athletes, highly-trained runners, recreational observe the actual VO_ 2max (65 ml kg1 min1 ) and not the
runners, and untrained runners, it may be tempting to, for
counterfactual one (70 ml kg1 min1 ), we have a missing
example, conclude that VO _ 2max matters more for untrained
data problem. In the cross-sectional case, we solve this
than elites. In fact, cross-sectional studies often suggest missing data problem by comparing individual i1 with
that using a homogeneous sample of elite athletes in itself _ 2max of
another individual i2 who has a VO
could explain the failure to find a significant relationship of 1 1
70 ml kg min under the assumption that these two
RE and/or VO _ 2max to race performance [20, 28, 30].
individuals are equal in all other relevant aspects. How-
However, Fig. 2b illustrates that the amount of attenuation ever, when using a sample consisting of elites, the indi-
is a function of the sample selectivity, and that even if the vidual i2 is not otherwise identical to i1 , as he/she likely has
(causal) correlation is identical for all runners, one may a lower RE. This is why the cross-sectional comparison
obtain quite different results in different samples simply breaks down, as explained in Sect. 3.
because of endogenous selection bias. However, if we have repeated observations of each
Second, the amount of attenuation bias depends on the individual (i.e., longitudinal data), then we can compare
true (causal) correlation, and the bias is zero when the true individual i1 at time t1 with the same individual at time t2 .
correlation is zero. Thus, unlike the case of the inverse Under the assumption that the bias is invariant over time
relationship between RE and VO _ 2max , adjusting for a
and there is no selective attrition, the fixed effects model
descendant of the outcome variable would only generate takes account of bias caused by both confounding and
bias if there is a marginal association (confounding or endogenous selection [3, 6, 54], and we can identify causal
causal effect) between VO _ 2max and race performance. This effects. Although not motivated by endogenous selection
can also be seen by replacing p with 0 in Eq. (2), meaning bias, some studies have used this methodology to investi-
that the numerator reduces to 0, the denominator reduces to gate the association between VO_ 2max and race performance,
1, and the estimated association is 0. finding no association [18, 52].
123
N. T. Borgen
4.2 Small Sample Size and Noisy Measures (prior) race performance is either (1) the common outcome
of two variables (VO _ 2max and RE) or (2) the outcome
Despite the fact that the fixed effects estimator is a solution variable or a descendant of the outcome variable (race
to endogenous selection bias, the use of within-subject performance), then the sample restriction induces bias in
variation has major drawbacks that render it basically the analysis.
ineffective in the literature. For instance, studies of elite The main conclusions of this review can be summarized
athletes have demonstrated that heavy training may not as follows. First, I have demonstrated that the inverse
change VO _ 2max at all or may change it only marginally relationship between RE and VO _ 2max that many studies find
[18, 22]. Thus, by discarding all between-subject variation, [28, 30, 50, 51] is likely spurious. Second, I have demon-
we utilize only a fraction of the variance in the data set. strated that endogenous selection bias may substantially
The implication is likely large standard errors and impre- attenuate the effects of predictors on race performance,
cise results. which may explain why some studies that use elite samples
Imprecise results are especially problematic given the _ 2max [18, 28, 30] and
fail to find significant effects of VO
small sample sizes in the literature [18, 51, 52], and RE [15, 28]. Third, I have shown that a more elite sample
accordingly their low statistical power. Adding individual will lead to more bias, but that the bias may be substantial
fixed effects will likely not increase statistical power, even in samples of recreational runners. Fourth, I have
which means that the correlations need to be very large for argued that using within-subject variation is problematic.
this design to be able to detect any significant effects Given the small sample sizes in the literature, the fact that
[3, 14].7 This casts some doubt on the studies that find no _ 2max changes only marginally in elite athletes [18, 22],
VO
effect of RE and VO _ 2max using longitudinal data [18, 52].
and the problem of measurement error [17, 48], null find-
Additionally, RE and VO _ 2max would likely differ ings [18] are not surprising.
depending on factors such as altitude [28], time of year Studies in the literature provide many interesting find-
[22], and running surface [48, 49], and they would be ings; for example, the relationship between anthropometric
measured with some degree of error [17, 47]. For instance, variables and RE in Mooses et al. [28].8 Like many other
the typical measurement error of RE is shown to be about studies, Mooses et al. [28] also contribute by describing
2.4% [48]. Although this amount of measurement error is key characteristics of elite distance runners. However,
of little concern in a cross-sectional case, it will likely caution should be exercised when estimating correlations
substantially attenuate the estimates when using a within- between RE, VO _ 2max , and race performance in observa-
subject estimator. For example, because VO _ 2max changes tional samples of elite runners.
very little in elite athletes, the within-subject variation we The challenge of estimating causal effects using a ran-
observe may to a large extent be caused by random mea- dom sample of observational data from a population is in
surement error (i.e., noise) [17], and because the mea- itself formidable [33], and having a (convenience) sample
surement error is random, it is accordingly not related to of elite or well-trained athletes magnifies the challenge.
race performance. In sum, null findings may not be that Although eliminating all sources of bias, such as mea-
surprising in studies that evaluate longitudinal data surement error, is difficult to achieve, the bias caused by
[18, 52]. endogenous selection in this particular literature is so
fundamental that it cannot be ignored. Finding a good
solution to the problem of endogenous selection bias is
5 Conclusions difficult, mainly because elite runners by definition are
rare, but also because they are unlikely to participate in
In this article, I have provided a critical review of the randomized trials (which would eliminate endogenous
literature that investigates the associations between RE, selection bias).
_ 2max , and elite running performance. Studies in this
VO Nevertheless, perhaps the best solution is to conduct
literature routinely use samples of highly trained individ- _ 2max
experiments to investigate the effects of changes in VO
uals, which inevitably results in endogenous selection bias. and RE and how these relate to changes in race
The crux of the problem is that restricting the analysis
sample to a population of interest amounts to conditioning 8
This association is also biased by endogenous selection, but the bias
on whatever increases the probability of being in the group
is likely small and the findings accordingly informative. Given the
of interest, such as prior race performance. If, for instance, following model: anthropometric variable ! RE ! RP ! elite, then
the elite variable is a descendant of the outcome variable RE, and
7
This means that studies which estimate within-subject correlations restricting the sample to elites amounts to conditioning on the
with small sample sizes and find significant effects most likely outcome variable. However, since the effects of RE on elite are less
exaggerate the correlation. than the effects of RP on elite, the bias would most likely be small.
123
Endogenous Selection Bias in the Literature on Running Performance
123
N. T. Borgen
25. Joyner MJ, Coyle EF. Endurance exercise performance: the 43. Pollock ML. Submaximal and maximal working capacity of elite
physiology of champions. J Physiol. 2008;586(1):35–44. distance runners. Part I: cardiorespiratory aspects. Ann N Y Acad
26. Maughan RJ, Leiper JB. Aerobic capacity and fractional utilisa- Sci. 1977;301(1):310–22.
tion of aerobic capacity in elite and non-elite male and female 44. Ramsbottom R, Williams C, Fleming N, et al. Training induced
marathon runners. Eur J Appl Physiol Occup Physiol. physiological and metabolic changes associated with improve-
1983;52(1):80–7. ments in running performance. Br J Sports Med.
27. Marcora SM, Staiano W, Manning V. Mental fatigue impairs 1989;23(3):171–6.
physical performance in humans. J Appl Physiol. 45. Rothman KJ, Greenland S, Lash TL. Case-control studies. In:
2009;106(3):857–64. Rothman KJ, Greenland S, Lash TL, editors. Modern epidemi-
28. Mooses M, Mooses K, Haile DW, et al. Dissociation between ology. Philadelphia: Wolters Kluwer; 2008. p. 111–27.
running economy and running performance in elite Kenyan dis- 46. Saltin B, Larsen H, Terrados N, et al. Aerobic exercise capacity at
tance runners. J Sports Sci. 2015;33(2):136–44. sea level and at altitude in Kenyan boys, junior and senior runners
29. Morgan DW, Baldini FD, Martin PE, et al. Ten kilometer per- compared with Scandinavian runners. Scand J Med Sci Sports.
formance and predicted velocity at VO _ 2max among well-trained 1995;5(4):209–21.
male runners. Med Sci Sports Exerc. 1989;21(1):78–83. 47. Saunders PU, Pyne DB, Telford RD, et al. Factors affecting
30. Morgan DW, Daniels JT. Relationship between VO _ 2max and the running economy in trained distance runners. Sports Med.
aerobic demand of running in elite distance runners. Int J Sports 2004;34(7):465–85.
Med. 1994;15(07):426–9. 48. Saunders PU, Pyne DB, Telford RD, et al. Reliability and vari-
31. Morgan DW, Pate R. Could the correlation between maximal ability of running economy in elite distance runners. Med Sci
oxygen uptake and economy be spurious? Med Sci Sports Exerc. Sports Exerc. 2004;36(11):1972–6.
2004;36(2):345. 49. Smoliga JM. What is running economy? A clinician’s guide to
32. McLaughlin JE, Howley ET, Bassett DR Jr, et al. Test of the key concepts, applications and myths. Br J Sports Med.
classic model for predicting endurance running performance. 2017;51(10):831–2.
Med Sci Sports Exerc. 2010;42(5):991–7. 50. Tartaruga MP, Brisswalter J, Peyré-Tartaruga LA, et al. The
33. Morgan SL, Winship C. Counterfactuals and causal inference: relationship between running economy and biomechanical vari-
methods and principles for social research. 2nd ed. Oxford: ables in distance runners. Res Q Exerc Sport. 2012;83(3):367–75.
Oxford University Press; 2015. 51. Shaw AJ, Ingham SA, Atkinson G, et al. The correlation between
34. Midgley AW, McNaughton LR, Wilkinson M. Is there an optimal running economy and maximaloxygen uptake: cross-sectional
training intensity for enhancing the maximal oxygen uptake of and longitudinal relationships in highly trained distance runners.
distance runners? Sports Med. 2006;36(2):117–32. PLoS One. 2015;10:e0123101.
35. Noakes TD, Myburgh KH, Schall R. Peak treadmill running 52. Vollaard NB, Constantin-Teodosiu D, Fredriksson K, et al. Sys-
velocity during the VO _ 2max test predicts running performance. tematic analysis of adaptations in aerobic capacity and submax-
J Sports Sci. 1990;8(1):35–45. imal energy metabolism provides a unique insight into
36. Noakes TD. The central governor model of exercise regulation determinants of human aerobic performance. J Appl Physiol.
applied to the marathon. Sports Med. 2007;37(4):374–7. 2009;106(5):1479–86.
37. Ogueta-Alday A, Rodrı́guez-Marroyo JO, Garcı́a-López J. 53. Von Elm E, Altman DG, Egger M, et al. The Strengthening the
Rearfoot striking runners are more economical than midfoot Reporting of Observational Studies in Epidemiology (STROBE)
strikers. Med Sci Sports Exerc. 2014;46(3):580–5. statement: guidelines for reporting observational studies. Int J
38. Pate RR, Macera CA, Bailey SP, et al. Physiological, anthropo- Surg. 2014;12(12):1495–9.
metric, and training correlates of running economy. Med Sci 54. Winship C, Mare RD. Models for sample selection bias. Annu
Sports Exerc. 1992;24(10):1128–33. Rev Sociol. 1992;18:327–50.
39. Paavolainen L, Häkkinen K, Hämäläinen I, et al. Explosive- 55. Wang CY, Haskell WL, Farrell SW, et al. Cardiorespiratory fit-
strength training improves 5-km running time by improving ness levels among US adults 20–49 years of age: findings from
running economy and muscle power. J Appl Physiol. the 1999–2004 National Health and Nutrition Examination Sur-
1999;86(5):1527–33. vey. Am J Epidemiol. 2010;171(4):426–35.
40. Pearl J. Causality: models, reasoning, and inference. 2nd ed. 56. Wilber RL, Pitsiladis YP. Kenyan and Ethiopian distance run-
Cambridge: Cambridge University Press; 2009. ners: what makes them so good? Int J Sports Physiol Perform.
41. Pearl J. Linear models: a useful ‘‘microscope’’ for causal analysis. 2012;7(2):92–102.
J Causal Inference. 2013;1(1):155–70.
42. Pitsiladis Y, Wang G, Wolfarth B, et al. Genomics of elite
sporting performance: what little we know and necessary
advances. Br J Sports Med. 2013;47(9):550–5.
123