Professional Documents
Culture Documents
Biomedical Signal Processing and Control: Lydia Stappenbeck, Ben Barsties v. Latoszek, Ben Janotte, Bernhard Lehnert
Biomedical Signal Processing and Control: Lydia Stappenbeck, Ben Barsties v. Latoszek, Ben Janotte, Bernhard Lehnert
a r t i c l e i n f o a b s t r a c t
Article history: Objective: The purpose of the study was to explore the extent to which different Praat versions affect the
Received 26 August 2019 reproducibility of results performing Acoustic Voice Quality Index (AVQI) and Acoustic Breathiness Index
Received in revised form (ABI).
28 December 2019
Method: Seven Praat versions were selected and categorized into three groups based on hierarchical
Accepted 8 March 2020
cluster analysis. The differences/distances, diagnostic accuracy, and concurrent validity were evaluated
Available online 16 March 2020
among the three groups. Group three had just one Praat version. This version was found after two month
to have a computation bug in smoothed cepstral peak prominence (i.e., an important measure for AVQI
Keywords:
Praat
and ABI) before a new update removed this bug. For the analyses, a previous database of 218 German
Acoustic Voice Quality Index voice samples and auditory-perceptual judgment results were used.
Acoustic Breathiness Index Results: The AVQI and ABI results between group 1 and 2 (p = 0.53, and p = 0.62, respectively) demonstrated
Open source software no significant differences. However, the results between these two groups and group 3 yielded significant
differences for AVQI and ABI (all p < 0.00001). The concurrent validity for AVQI (r = 0.84 to 0.86) and
ABI (r = 0.84 to 0.85) were strong in all three groups. The diagnostic accuracy of both indices was also
sufficient for group one and two but low in group three particularly in sensitivity (AVQI: 23% and ABI:
9%, respectively).
Conclusion: AVQI and ABI are two robust measurements in the evaluation of voice quality. However,
caution is warranted using updates in open source software such as Praat for patient care or research.
© 2020 Elsevier Ltd. All rights reserved.
https://doi.org/10.1016/j.bspc.2020.101938
1746-8094/© 2020 Elsevier Ltd. All rights reserved.
2 L. Stappenbeck, B. Barsties v. Latoszek, B. Janotte et al. / Biomedical Signal Processing and Control 59 (2020) 101938
Table 2
Descriptive results of perceived and acoustic voice quality evaluations from the 218 voice samples.
Vocally-healthy subjects 0.32 0.30 0.12 0.22 1.03 0.65 2.29 0.93
Voice-disordered subjects 1.23 0.86 0.94 0.86 3.41 2.76 3.72 2.25
Table 3
Descriptive statistics of differences between Praat versions group 1 and group 2 (as represented by version 5.3.57 minus 6.0.48).
Descriptive Differences
Table 4
Descriptive statistics of differences between Praat versions group 1 and group 3 (as represented by version 5.3.57 minus 6.0.46).
Descriptive Differences
Table 5
Descriptive statistics of differences between Praat versions group 2 and group 3 (as represented by version 6.0.48 minus 6.0.46).
Descriptive Differences
Fig. 3. Bland-Altman-plot with marginal histogram of AVQI values in Praat version 5.3.57 and version.6.0.48 (group 1 and 2).
sample t-test confirmed the results of the descriptive statistics of Barsties v. Latoszek et al. [22], the outcomes of sensitivity and speci-
Table 3, namely that no significant differences existed between ficity showed the following results: group 1 (sensitivity = 72% and
the AVQI and ABI results between group 1 and 2 (p = 0.53, and specificity = 90%), group 2 (sensitivity = 71% and specificity = 90%),
p = 0.62, respectively). However, the results of the paired sample and group 3 (sensitivity = 23% and specificity = 100%).
t-test between these two groups and group 3 yielded significant The AUC of ABI showed slightly higher results than AVQI but also
differences for AVQI and ABI (all p < 0.00001) and confirmed the a small variation that ranged between 91.5% to 92.8% was reported
findings of the results in Tables 4 and 5. The Bland-Altman-plot for in all three groups. The three groups also showed an excellent dis-
group 1 and 2 shows that differences are centered about 0.25 for criminatory power of ABI in differentiating between normal and
the results of AVQI (Fig. 3). These absolute differences rarely exceed breathy voices (see Fig. 6). At the predefined threshold at 3.42 by
0.25. Differences between group 1 and 2 are centered about 0.5 for Barsties v. Latoszek et al. [22], the outcomes of sensitivity and speci-
the results of ABI Fig. 4). ficity showed the following results: group 1 (sensitivity = 72% and
specificity = 95%), group 2 (sensitivity = 70% and specificity = 95%),
and group 3 (sensitivity = 9% and specificity = 100%).
3.3. Diagnostic accuracy
The AUC of AVQI varied slightly among the three groups 3.4. Criterion-related concurrent validities
between 88.4% and 89.7. An excellent discriminatory power of AVQI
in differentiating between normal and hoarse voices was confirmed The Spearman rank-order coefficients of the expert panel’s judg-
in all three groups (see Fig. 5). At the predefined threshold at 1.85 by ments for overall voice quality and the AVQI results of all groups
L. Stappenbeck, B. Barsties v. Latoszek, B. Janotte et al. / Biomedical Signal Processing and Control 59 (2020) 101938 5
Fig. 4. Bland-Altman-plot with marginal histogram of ABI values in Praat version 5.3.57 and version.6.0.48 (group 1 and 2).
Fig. 5. Receiver Operator Characteristic curve of AVQI in Praat version group 1 (green Fig. 6. Receiver Operator Characteristic curve of ABI in Praat version group 1 (green
line), group 2 (red line) and group 3 (blue line). line), group 2 (red line) and group 3 (blue line).
shown an additional finding about a slight variance of AVQI and Declaration of Competing Interest
ABI results between group 1 and 2. Although these differences of
AVQI and ABI outcomes between group 1 and 2 remained sta- The authors declare that they have no known competing finan-
tistically irrelevant, there must be an unresolved bug introduced cial interests or personal relationships that could have appeared to
somewhere between Praat v.6.0.06 and v.6.0.21 because the out- influence the work reported in this paper.
comes of AVQI and ABI can vary between about 0.25 to 0.5 for
single cases. This variation can have consequences to the clini-
cal interpretation in individual cases, particularly between normal References
voice quality to abnormal voice quality. Thus, it seems as if the
[1] B. Barsties, M. De Bodt, Assessment of voice quality: current state-of-the-art,
AVQI script v. 03.01 and the ABI script v. 01.01 provide stable out- Auris Nasus Larynx 42 (2015) 183–188, http://dx.doi.org/10.1016/j.anl.2014.
comes despite possible computation bugs within the performing 11.001.
software. [2] P.H. Dejonckere, P. Bradley, P. Clemente, G. Cornut, L. Crevier-Buchman, G.
Friedrich, P. Van De Heyning, M. Remacle, V. Woisard, Committee on
This research exemplifies how a vivid community of users helps Phoniatrics of the European Laryngological Society (ELS), A basic protocol for
identify prominent problems within open source software to be functional assessment of voice pathology, especially for investigating the
solved possibly quickly and thoroughly. It also shows that slight efficacy of (phonosurgical) treatments and evaluating new assessment
techniques. Guideline elaborated by the Committee on Phoniatrics of the
differences can only be detected by a systematic analysis such as European Laryngological Society (ELS), Eur. Arch. Otorhinolaryngol. 258
the present study. Even though small margins of AVQI and ABI (2001) 77–82.
outcomes between group 1 and 2 had statistically no influence [3] P.H. Dejonckere, L. Crevier-Buchman, J.P. Marie, M. Moerman, M. Remacle, V.
Woisard, et al., Implementation of the European Laryngological Society (ELS)
between the Praat versions of group 1 and 2, there is no guar-
basic protocol for assessing voice treatment effect, Rev. Laryngol. Otol. Rhinol.
antee that future updates of Praat versions possibly might have 124 (2003) 279–283.
an influence. Currently, there is no independent control system [4] G. Friedrich, P. Dejonckere, The voice evaluation protocol of the European
Laryngological Society (ELS) – first results of a multicenter study,
available to assess these effects as disclosed in the present study.
Laryngorhinootologie 84 (2005) 744–752.
Therefore, medical professional users are well advised to verify [5] P. Boominathan, J. Samuel, R. Arunachalam, R. Nagarajan, S. Mahalingam,
variations in AVQI and ABI outcomes by using a new update of a Multi parametric voice assessment: sri ramachandra university protocol,
Praat version. For example, a defined voice sample of voice record- Otolaryngol. Head Neck Surg. 66 (2014) 246–251, http://dx.doi.org/10.1007/
s12070-011-0460-y.
ings could be pretested on the new Praat version in comparison to [6] B. Barsties v. Latoszek, Y. Maryn, E. Gerrits, M. De Bodt, A meta-analysis:
a previous Praat version. The option to have recourse to all pre- acoustic measurement of roughness and breathiness, J. Speech Lang. Hear.
viously published Praat versions is most helpful to review one’s Res. 61 (2018) 298–323.
[7] Y. Maryn, P. Corthals, P. Van Cauwenberge, N. Roy, M. De Bodt, Toward
own clinical and research results of AVQI and ABI. For a potential improved ecological validity in the acoustic measurement of overall voice
pretest among Praat versions, we uploaded, under consideration of quality: combining continuous speech and sustained vowels, J. Voice 24
ethical permission and data privacy, natural voice samples evalu- (2010) 540–555, http://dx.doi.org/10.1016/j.jvoice.2008.
[8] B. Barsties, Y. Maryn, The improvement of internal consistency of the acoustic
ating the outcomes of AVQI and ABI values (http://www2.medizin. voice quality index, Am. J. Otolaryngol. 36 (2015) 647–656, http://dx.doi.org/
uni-greifswald.de/hno/index.php?id=470). Furthermore, we rec- 10.1016/j.amjoto.2015.04.012.
ommend using the Praat version which was used in the validation [9] B. Barsties, Y. Maryn, External validation of the acoustic voice quality index
version 03.01 with extended representativity, Ann. Otol. Rhinol. Laryngol. 125
studies of each language to ensure that the Praat version has
(2016) 571–583, http://dx.doi.org/10.1177/0003489416636131.
no significant influence on the outcome of AVQI and ABI. In [10] B. Barsties v. Latoszek, Y. Maryn, E. Gerrits, M. De Bodt, The Acoustic
summary, caution is warranted for medical professionals using Breathiness Index (ABI): a multivariate acoustic model for breathiness, J. Voice
31 (2017) 511.e11–511.e27, http://dx.doi.org/10.1016/j.jvoice.2016.11.017.
updates in open source software (e.g., Praat) for patient care or
[11] Y. Maryn, M. De Bodt, N. Roy, The acoustic voice quality index: toward
research. improved treatment outcomes assessment in voice disorders, J. Commun.
Disord. 43 (2010) 161–174, http://dx.doi.org/10.1016/j.jcomdis.2009.12.004.
[12] B. Barsties, Y. Maryn, The acoustic voice quality index. Toward expanded
measurement of dysphonia severity in German subjects, HNO 60 (2012)
715–720, http://dx.doi.org/10.1007/s00106-012-2499-9.
5. Conclusion
[13] V. Reynolds, A. Buckland, J. Bailey, et al., Objective assessment of pediatric
voice disorders with the acoustic voice quality index, J. Voice 26 (672) (2012)
AVQI and ABI results can be altered by Praat versions. Therefore, e1–e7, http://dx.doi.org/10.1016/j.jvoice.2012.02.002.
[14] Y. Maryn, M. De Bodt, B. Barsties, et al., The value of the acoustic voice quality
it is recommended (a) to use Praat versions of previous valida-
index as a measure of dysphonia severity in subjects speaking different
tion studies which tested AVQI and ABI outcomes or (b) to verify languages, Eur. Arch. Otorhinolaryngol. 271 (2014) 1609–1619, http://dx.doi.
with test signals and compare the outputs with outputs of earlier org/10.1007/s00405-013-2730-7.
releases for all functions of Praat. These test signals can be own [15] Y. Maryn, H.T. Kim, J. Kim, Auditory-perceptual and acoustic methods in
measuring dysphonia severity of Korean speech, J. Voice 30 (2016) 587–594,
recordings of the users or voice samples from the University of http://dx.doi.org/10.1016/j.jvoice.2015.06.011.
Greifswald with the reports of correct AVQI and ABI results. [16] K. Hosokawa, B. Barsties, T. Iwahashi, et al., Validation of the acoustic voice
quality index in the Japanese language, J. Voice 31 (2017) 260.e1–260.e9,
http://dx.doi.org/10.1016/j.jvoice.2016.05.010.
[17] V. Uloza, T. Petrauskas, E. Padervinskis, et al., Validation of the acoustic voice
quality index in the Lithuanian language, J. Voice 31 (2017) 257.e1–257.e11,
Funding http://dx.doi.org/10.1016/j.jvoice.2016.06.002.
[18] E. Kankare, v. Barsties, B. Latoszek, Y. Maryn, et al., The acoustic voice quality
None. index version 02.02 in Finnish speaking population, Logoped Phon Vocol.
(2019), http://dx.doi.org/10.1080/14015439.2018.1556332, In Press.
[19] G.H. Kim, Y.W. Lee, I.H. Bae, H.J. Park, S.G. Wang, S.B. Kwon, Validation of the
acoustic voice quality index in the Korean language, J. Voice (2019), http://dx.
doi.org/10.1016/j.jvoice.2018.06.007, In Press.
CRediT authorship contribution statement [20] K. Hosokawa, B. Barsties v. Latoszek, T. Iwahashi, M. Iwahashi, S. Iwaki, C.
Kato, M. Yoshida, H. Sasai, A. Miyauchi, N. Matsushiro, H. Inohara, M. Ogawa,
Y. Maryn, The acoustic voice quality index version 03.01 for the
Lydia Stappenbeck: Methodology, Software, Writing - original Japanese-speaking population, J. Voice 33 (2019) 125.e1–125.e12, http://dx.
draft preparation. Ben Barsties v. Latoszek: Project administration, doi.org/10.1016/j.jvoice.2017.10.003.
Methodology, Conceptualization, Writing - reviewing & editing. [21] J. Delgado Hernandez, N.M. Leon Gomez, A. Jimenez, et al., Validation of the
acoustic voice quality index version 03.01 and the acoustic breathiness index
Ben Janotte: Data curation. Bernhard Lehnert: Formal analyses, in the Spanish language, Ann. Otol. Rhinol. Laryngol. 127 (2018) 317–326,
Writing and reviewing, Supervision. http://dx.doi.org/10.1177/0003489418761096.
L. Stappenbeck, B. Barsties v. Latoszek, B. Janotte et al. / Biomedical Signal Processing and Control 59 (2020) 101938 7
[22] B. Barsties v. Latoszek, B. Lehnert, B. Janotte, Validation of the acoustic voice function index for voice pathology screening, Eur. Arch. Otorhinolaryngol. 276
quality index version 03.01 and acoustic breathiness index in German, J. Voice (2019) 1737–1745, http://dx.doi.org/10.1007/s00405-019-05433-5.
34 (2020) 157e17–157e25. [35] T. Pommée, Y. Maryn, C. Finck, D. Morsomme, The acoustic voice quality
[23] T. Pommée, Y. Maryn, C. Finck, D. Morsomme, Validation of the acoustic voice index, version 03.01, in French and the voice handicap index, J. Voice (2018),
quality index, version 03.01, in French, J. Voice (2018), http://dx.doi.org/10. http://dx.doi.org/10.1016/j.jvoice.2018.11.017, In Press.
1016/j.jvoice.2018.12.008, In Press. [36] I. Priss, B. Barsties v. Latoszek, U. Jäger-Priss, B. Lehnert, Questionnaire for the
[24] M. Englert, B. Barsties v. Latoszek, Y. Maryn, M. Behlau, Validation of the assessment of the voice self-concept in a neurological practice : applicability
acoustic voice quality index, version 03.01, to the Brazilian Portuguese for the identification of patients with high consultation needs, Nervenarzt 90
language, J. Voice (2019), In Press. (2019) 601–608, http://dx.doi.org/10.1007/s00115-018-0642-x.
[25] K. Hosokawa, B. Barsties v. Latoszek, C.A. Ferrer, T. Iwahashi, M. Iwahashi, S. [37] M. Englert, L. Lima, A.C. Constantini, v. Barsties, B. Latoszek, Y. Maryn, M.
Iwaki, C. Kato, M. Yoshida, M. Umatani, A. Miyauchi, N. Matsushiro, H. Inohara, Behlau, Acoustic Voice Quality Index – AVQI for brazilian portuguese
M. Ogawa, Y. Maryn, Acoustic breathiness index for the Japanese-speaking speakers: analysis of different speech material, Codas 31 (2019), e20180082,
population: validation study and exploration of affecting factors, JSLHR 66 http://dx.doi.org/10.1590/2317-1782/20182018082.
(2019) 2617–2631, http://dx.doi.org/10.1044/2019 JSLHR-S-19-0077. [38] M. Englert, L. Lima, M. Behlau, Acoustic voice quality index and acoustic
[26] Y. Maryn, D. Weenink, Objective dysphonia measures in the program Praat: breathiness index: analysis with different speech material in the Brazilian
smoothed cepstral peak prominence and acoustic voice quality index, J. Voice Portuguese, J. Voice (2019), http://dx.doi.org/10.1016/j.jvoice.2019.03.015, In
29 (2015) 35–43, http://dx.doi.org/10.1016/j.jvoice.2014.06.015. Press.
[27] C. Batthyany, Y. Maryn, I. Trauwaen, E. Caelenberghe, J. van Dinther, A. [39] M. Hirano, Psycho-acoustic evaluation of voice, in: G.E. Arnold, F. Winckel,
Zarowski, F. Wuyts, A case of specificity: how does the acoustic voice quality B.D. Wyke (Eds.), Disorders of Human Communication 5. Clinical Examination
index perform in normophonic subjects? Appl. Sci. 9 (2019) 2527, http://dx. of Voice, Springer Verlag, Vienna, Austria, 1981, pp. 81–84.
doi.org/10.3390/app9122527. [40] R Core Team, R: A Language and Environment for Statistical Computing, R
[28] B. Barsties v. Latoszek, N. Ulozaitė-Stanienė, Y. Maryn, T. Petrauskas, V. Uloza, Foundation for Statistical Computing, Vienna, Austria, 2017, https://www.R-
The influence of gender and age on the acoustic voice quality index and project.org/. (Accessed 15 February 2017).
dysphonia severity index: a normative study, J. Voice 33 (2019) 340–345, [41] H. Wickham, ggplot2: Elegant Graphics for Data Analysis, Springer-Verlag,
http://dx.doi.org/10.1016/j.jvoice.2017.11.011. New York, 2009.
[29] B. Barsties v. Latoszek, B. Lehnert, Internal validation of the acoustic voice [42] X. Robin, N. Turck, A. Hainard, N. Tiberti, F. Lisacek, J.-C. Sanchez, M. Müller,
quality index version 03.01 and acoustic breathiness index, pROC: an open-source package for R and S+ to analyze and compare ROC
Laryngorhinootologie 97 (2018) 630–635, http://dx.doi.org/10.1055/a-0596- curves, BMC Bioinformatics 12 (2011) 77, http://dx.doi.org/10.1186/1471-
7819. 2105-12-77.
[30] V. Uloza, B. Barsties v. Latoszek, N. Ulozaite-Staniene, T. Petrauskas, Y. Maryn, [43] Z. Du, Y. Hao, reportROC: An Easy Way to Report ROCAnalysis. R Package
A comparison of Dysphonia Severity Index and Acoustic Voice Quality Index Version 3.4, 2019, https://CRAN.R-project.org/package=reportROC. (Accessed
measures in differentiating normal and dysphonic voices, Eur. Arch. 29 July 2019).
Otorhinolaryngol. 275 (2018) 949–958, http://dx.doi.org/10.1007/s00405- [44] B. Lehnert, BlandAltmanLeh: (Slightly Extended) Bland-altman Plots. R
018-4903-x. Package Version 0.3.1, 2019, https://CRAN.R-project.org/
[31] J.M. Lee, N. Roy, E. Peterson, R.M. Merrill, Comparison of two multiparameter package=BlandAltmanLeh. (Accessed 29 July 2019).
acoustic indices of dysphonia severity: the acoustic voice quality index and [45] S.C. Johnson, Hierarchical clustering schemes, Psychometrika 32 (1967)
cepstral spectral index of dysphonia, J. Voice 32 (2018) 515.e1–515.e13, 241–254.
http://dx.doi.org/10.1016/j.jvoice.2017.06.012. [46] J.M. Bland, D.G. Altman, Statistical methods for assessing agreement between
[32] B. Barsties v. Latoszek, N. Ulozaitė-Stanienė, T. Petrauskas, V. Uloza, Y. Maryn, two methods of clinical measurement, Lancet 1 (1986) 307–310.
Diagnostic accuracy of dysphonia classification of DSI and AVQI, [47] F.E. Harrell, J.C. Slaughter, Biostatistics for Biomedical Research, 2019, https://
Laryngoscope 129 (2019) 692–698, http://dx.doi.org/10.1002/lary.27350. hbiostat.org/doc/bbr.pdf. (Accessed 30 June 2019).
[33] P. Bottalico, J. Codino, L.C. Cantor-Cutiva, K. Marks, C.J. Nudelman, J. [48] L.G. Portney, M.P. Watkins, Foundations of Clinical Research, Applicaions to
Skeffington, R. Shrivastav, M.C. Jackson-Menaldi, E.J. Hunter, A.D. Rubin, Practice, 2nd ed., Prentice Hall, Upper Saddle River, New Jersey, 2000.
Reproducibility of voice parameters: the effect of room acoustics and [49] L.R. Frey, C.H. Botan, P.G. Friedman, et al., Investigating Communication: An
microphones, J. Voice (2018), http://dx.doi.org/10.1016/j.jvoice.2018.10.016, Introduction to Research Methods, Prentice Hall, Englewood Cliffs, New
In Press. Jersey, 1991, pp. 40.
[34] N. Ulozaite-Staniene, T. Petrauskas, V. Šaferis, V. Uloza, Exploring the
feasibility of the combination of acoustic voice quality index and glottal