Professional Documents
Culture Documents
2021 Cartón-Llorente Absolute Reliability and Agreement Between Stryd and RunScribe Systems For The Assessment of Running Power
2021 Cartón-Llorente Absolute Reliability and Agreement Between Stryd and RunScribe Systems For The Assessment of Running Power
2021 Cartón-Llorente Absolute Reliability and Agreement Between Stryd and RunScribe Systems For The Assessment of Running Power
Abstract
The advent of portable power meters has revolutionized training in cycling, allowing an accurate field-based assessment
of athletes. In a similar way, researchers have recently developed low-cost gait analysis equipment to assess running
power in a more natural environment. The purpose of this study was to evaluate the absolute reliability of two different
power meters and the agreement between these two wearable devices (i.e., Strydä and RunScribeä) for measuring
power during treadmill running. About 49 endurance runners performed a running protocol on a treadmill at self-
selected comfortable velocities. Power output (W) was measured using the Strydä and RunScribeä systems, which
were attached to the same shoe. The absolute reliability, based on coefficient of variation, was 0.32 6 0.29% for Strydä
and 1.68 6 1.49% for RunScribeä, while the standard error of the mean were 0.3 6 0.2 W and 2.6 6 2.5 W for Strydä
and RunScribeä, respectively. Data from both devices showed significant correlations (r = 0.783, p \ 0.001) and the
ICC (r = 0.855) reported an almost perfect reliability. Bland–Altman plots revealed no heteroscedasticity of error
(r2 = 0.030), although a moderate systematic bias (212.3 6 26.6 W), and wide limits of agreement (39.8–64.3 W) were
found. Considering the increased popularity of using power meter devices in running, scientists, coaches, athletes, and
general users should be aware that data from these devices are reliable, but not interchangeable, due to the variation
shown for running power output data.
Keywords
Endurance, runners, sensors, wearables, power meters
Table 3. Descriptive values and comparative analysis of power output during running from the Strydä and RunScribeä systems.
Power output (W) 238.9 6 37.7 251.2 6 42.0 0.002 0.783 ( \ 0.001) 0.855 (0.707–0.924)
Discussion
This study aimed at determining the absolute reliability
and evaluating the agreement between the Strydä and
RunScribeä systems for measuring power during run-
ning on a treadmill at a comfortable self-selected velo-
city (11.7 6 1.3 km/h). The major findings of the current
work were twofold: (i) both systems provided reliable
power output data during running and, (ii) despite
power data from both devices showing an almost per-
fect association and no heteroscedasticity of error, using
the systems interchangeably may not be recommended
due to the inconsistency found between systems.
Both the Strydä and RunScribeä systems showed a
high absolute reliability with a mean CV lower than
Figure 2. Pearson’s correlation analysis between RunScribe 3% and a SEM less than 3 W, especially the Strydä
and Stryd’s power output data achieved during a 3-min treadmill system with a CV ranging between 0.03% and 0.6%
running protocol. across participants and a SEM of 0.3 6 0.2 W. For the
SEE: standard error of estimate. RunScribeä system, the CV ranged between 0.1% and
Circles indicate individual data of the crossing point between average 3.2% and the SEM was 2.6 6 2.5 W. These results seem
values for each device.
to be supported by a recent study by Cerezuela-Espejo
et al.,12 which also found that the Strydä system is
more reliable than RunScribeä. However, the high
variability found for RunScribeä in the aforemen-
tioned study did not match the results of the current
study, likely due to their comparison between varying
conditions (i.e., treadmill and track). Despite these
methodological differences, the current results for both
devices are consistent, as they are in line with results
from several previous studies27–29 which assessed the
reliability of analogous cycling power meters27,28 (CV
1%–3.05% and SEM ~1 6 5 W) or PowerTap Hub29
(CV 1.7%–2.7% and SEM 2.9 6 3.3 W). Furthermore,
the CV and SEM found in these studies were similar to
results from the current study, highlighting the absolute
reliability of the Strydä and RunScribeä systems to
assess power output during running on a treadmill at a
Figure 3. Bland-Altman plots for the measurement of power comfortable velocity.
output during running at a self-selected comfortable speed for Regarding the comparative analysis between Strydä
the Strydä and RunScribeä systems. and RunScribeä systems for power assessment, an
The plot includes the mean difference (dotted line) and 95% limits of almost perfect association was observed between data.
agreement (dashed lined), along with the regression line (solid line).
Furthermore, heteroscedasticity of error was not found
(r2 = 0.030), thus the concurrence between both systems
power output during running at comfortable self- was strong. Despite these results, the limits of agreement
selected speeds obtained from the Strydä system as between the two devices varied widely, and a large ran-
compared to the RunScribeä system. Limits of dom error (26.55 W) was detected, so the comparison of
Cartón-Llorente et al. 5
data should be made with caution. Since previous stud- publication of this article: This study was funded by
ies on spatiotemporal parameters during running have the University of San Jorge (Universidad San Jorge.
shown that longer recording intervals resulted in smaller Villanueva de Gállego, Zaragoza, Spain).
systematic bias, random errors, and narrower limits of
agreement,30 the differences in power assessment are ORCID iDs
also expected to decrease for longer recording periods. Antonio Cartón-Llorente https://orcid.org/0000-
However, in the aforementioned work,30 it was stated 0001-5551-6037
that the variability in the spatiotemporal parameters of Diego Jaén-Carrillo https://orcid.org/0000-0003-
running can be accurately calculated within 25–30 steps 0588-0871
(i.e., approximately 10 s), so the 1-min intervals selected Noel Marcen-Cinca https://orcid.org/0000-0002-
should fit the purpose of this study. 2840-9882
In addition, some methodological differences Felipe Garcı́a-Pinillos https://orcid.org/0000-0002-
between systems could also play a role in this discre- 7518-8234
pancy. The Strydä system uses a sampling rate of
1000 Hz, whereas the RunScribeä system’s rate is only References
500 Hz. In addition, although both devices base their 1. Bourdon PC, Cardinale M, Murray A, et al. Monitoring
power output calculations on kinematic data, the algo- athlete training loads: consensus statement. Int J Sports
rithms applied by each system may not be exactly the Physiol Perform 2017; 12(Suppl 2): S2161–s2170.
same. Given that there is not a unique definition of run- 2. Barnes KR and Kilding AE. Running economy: mea-
ning power,31 different methods for calculating power surement, norms, and determining factors. Sports Med
output in running exist.32,33 Therefore, the power val- Open 2015; 1(1): 8.
ues obtained with each device may vary based on the 3. Luedke LE, Heiderscheit BC, Williams DS, et al. Influ-
ence of step rate on shin injury and anterior knee pain in
assumptions made by each system. Unfortunately, the
high school runners. Med Sci Sports Exerc 2016; 48(7):
specific algorithms have not been disclosed by the com-
1244–1250.
panies, thus making their comparisons challenging. 4. Mujika I. The alphabet of sport science research starts
Some limitations need to be considered for a proper with Q. Int J Sports Physiol Perform 2013; 8(5): 465–466.
interpretation of these results. The reliability was estab- 5. Borresen J and Lambert MI. The quantification of train-
lished based on within-subject variation (CV), so the ing load, the training response and the effect on perfor-
current analysis might not be able to explain the slight mance. Sports Med 2009; 39(9): 779–795.
inherent differences between runs performed by the 6. Mujika I. Quantification of training and competition
same subject on different days. Notwithstanding these loads in endurance sports: methods and applications. Int
limitations, the current study provides positive out- J Sports Physiol Perform 2017; 12(Suppl 2): S29–S217.
comes for the absolute reliability of the Strydä and 7. Sanders D, Myers T and Akubat I. Training-intensity
distribution in road cyclists: objective versus subjective
RunScribeä devices to measure power output during
measures. Int J Sports Physiol Perform 2017; 12(9):
running on a treadmill at a self-selected velocity, with 1232–1237.
the Strydä system reporting more reliable power data 8. Sanders D, Taylor RJ, Myers T, et al. A field-based
compared to the RunScribeä device. Furthermore, a cycling test to assess predictors of endurance perfor-
good level of agreement between devices was observed mance and establishing training zones. J Strength Cond
with no heteroscedasticity of error, but the large ran- Res 2020; 34(12): 3482–3488.
dom error detected prevents their interchangeable use 9. Passfield L, Hopker JG, Jobson S, et al. Knowledge is
for running power output assessment. power: issues of measuring training and performance in
Considering that the use of power meter devices for cycling. J Sports Sci 2017; 35(14): 1426–1434.
competing and training is becoming more popular, 10. Norris M, Anderson R and Kenny IC. Method analysis
of accelerometers and gyroscopes in running gait: a sys-
users like scientists, athletes, and coaches should know
tematic review. Proc IMechE Part P: J Sports Engineer-
that the Strydä and RunScribeä systems showed an
ing and Technology 2014; 228(1): 3–15.
adequate reliability for running power assessments, 11. Garcia-Pinillos F, Latorre-Roman PA, Ramirez-Cam-
with the Strydä system exhibiting more reliable data pillo R, et al. How does the slope gradient affect spatio-
for running power evaluation. temporal parameters during running? Influence of
athletic level and vertical and leg stiffness. Gait Posture
Declaration of conflicting interests 2019; 68: 72–77.
12. Cerezuela-Espejo V, Hernandez-Belmonte A, Courel-Iba-
The author(s) declared no potential conflicts of interest nez J, et al. Are we ready to measure running power?
with respect to the research, authorship, and/or publi- Repeatability and concurrent validity of five commercial
cation of this article. technologies. Eur J Sport Sci. Epub ahead of print April
2020. DOI: 10.1080/17461391.2020.1748117.
13. Hollis CKR, Resch JE and Hertel J. Gait mechanics as
Funding
measured by a wearable sensor while running at two
The author(s) disclosed receipt of the following finan- speeds on different surfaces. Sports Biomech 2019; 7: 1–
cial support for the research, authorship, and/or 11.
6 Proc IMechE Part P: J Sports Engineering and Technology 00(0)
14. Garcia-Pinillos F, Roche-Seruendo LE, Marcen-Cinca 24. Hopkins WG, Marshall SW, Batterham AM, et al. Pro-
N, et al. Absolute reliability and concurrent validity of gressive statistics for studies in sports medicine and exer-
the stryd system for the assessment of running stride cise science. Med Sci Sports Exerc 2009; 41(1): 3–13.
kinematics at different velocities. J Strength Cond Res. 25. Koo TK and Li MY. A guideline of selecting and report-
Epub ahead of print May 2018. DOI: 10.1519/ ing intraclass correlation coefficients for reliability
jsc.0000000000002595. research. J Chiropr Med 2016; 15(2): 155–163.
15. Koldenhoven RM and Hertel J. Validation of a wearable 26. Landis JR and Koch GG. The measurement of observer
sensor for measuring running biomechanics. Digit Bio- agreement for categorical data. Biometrics 1977; 33(1):
mark 2018; 2(2): 74–78. 159–174.
16. Garcia-Pinillos F, Latorre-Roman PA, Roche-Seruendo 27. Bouillod A, Pinot J, Soto-Romero G, et al. Validity, sen-
LE, et al. Prediction of power output at different running sitivity, reproducibility, and robustness of the powertap,
velocities through the two-point method with the Stryd() stages, and garmin vector power meters in comparison
power meter. Gait Posture 2019; 68: 238–243. with the SRM device. Int J Sports Physiol Perform 2017;
17. Austin CL, Hokanson JF, McGinnis PM, et al. The rela- 12(8): 1023–1030.
tionship between running power and running economy in 28. Nimmerichter A, Schnitzer L, Prinz B, et al. Validity and
well-trained distance runners. Sports 2018; 6(4): 142. reliability of the garmin vector power meter in laboratory
18. Aubry RL, Power GA and Burr JF. An assessment of and field cycling. Int J Sports Med 2017; 38(6): 439–446.
running power as a training metric for elite and recrea- 29. Bertucci W, Duc S, Villerius V, et al. Validity and relia-
tional runners. J Strength Cond Res 2018; 32(8): 2258– bility of the PowerTap mobile cycling powermeter when
2264. compared with the SRM Device. Int J Sports Med 2005;
19. van Dijk H and van Megen R. The secret of running: 26(10): 868–873.
maximum performance gains through effective power 30. Garcia-Pinillos F, Latorre-Roman PA, Ramirez-Cam-
metering and training analysis. Maidenhead: Meyer & pillo R, et al. Minimum time required for assessing step
Meyer Sports, 2017. variability during running at submaximal velocities. J
20. Schieb DA. Kinematic accommodation of novice tread- Biomech 2018; 80: 186–195.
mill runners. Res Q Exerc Sport 1986; 57(1): 1–7. 31. Williams KR and Cavanagh PR. A model for the calcula-
21. Lavcanska V, Taylor NF and Schache AG. Familiariza- tion of mechanical power during distance running. J Bio-
tion to treadmill running in young unimpaired adults. mech 1983; 16(2): 115–128.
Hum Mov Sci 2005; 24(4): 544–557. 32. Fukunaga T and Matsuo A. Effect of running velocity
22. Hopkins WG. Measures of reliability in sports medicine on external mechanical power output. Ergonomics 1980;
and science. Sports Med 2000; 30(1): 1–15. 23(2): 123–136.
23. Atkinson G and Nevill AM. Statistical methods for asses- 33. Schepens B, Willems PA, Cavagna GA, et al. Mechanical
sing measurement error (reliability) in variables relevant power and efficiency in running children. Pflugers Arch
to sports medicine. Sports Med 1998; 26(4): 217–238. 2001; 442(1): 107–116.