Professional Documents
Culture Documents
Human Mods 2
Human Mods 2
Noise Bruit
Mechanisms
(1) Neural reflex to the hyperstimulation of the auditory system
Nonaka et al. (1997) Neuroscience Research (2003) International Congress Series 1240.
Physiological relationship between vocalization and audition Greater activation of laryngeal and abdominal muscles with increasing loudness of the auitory stimulation
Decerebrate cats
Houde et al. 1998, Eliades et al. 2003, Hashimoto et al. 2003 (neurosciences)
Mechanisms
(1) Neural reflex to the hyperstimulation of the auditory system
Acoustic reflex
Reduces the ear sensitivity to high intensity levels Attenuation of high frequencies
(Margolis 1975, Al-Azazi 200)
Mechanisms
(2) Audio-phonation loop
Regulation of voice parameters from the auditory feedback
Noise
Automatism Servomechanism
Auditory feedback
Egan, 1972 Fairbanks, 1954 Korn 1954 Lane, Tranel, & Sisson 1970 Tonkinsson 1994
Mechanisms
(2) Audio-phonation loop
Very short reaction times
Neural reflex
Noise
Bauer, Mittal, Larson, & Hain (2006) (loudness) Leydon, Bauer, & Larson (2003) (pitch)
Auditory feedback
Mechanisms
(2) Audio-phonation loop
Regulation that cannot be completely inhibited
Pick, Siegel, Fox, Garber, & Kearney (1989) Siegel et al. (1974)
Noise
Auditory feedback
Mechanisms
(2) Audio-phonation loop
Regulation observed in babies and children
Siegel et al. (1976) Garber et al. (1980) Amazi et al. (1982)
Auditory feedback
Noise
As well as in animals
Sinnott et al. (1975) Manabe et al. (1998)
Different paths
3
1-Air (external)
2-Bone conduction
3-Tendons and muscles conduction
Individual HRTF
Important attenuation above 2kHz Enhancement btw 700 and 1200Hz (~F1)
Von Bekesy (1960), Hood (1962) Maurer et al. (1990), Prschmann (2000) Stenfelt et al. (2002)
Occlusion effect
Mechanisms
(3) Social regulation (active) communicative adaptation
Noise
Junqua (1993)
Mechanisms
(3) Communicative adaptation
Hyper & Hypo theory, Lindblom (1990)
Situation perturbed
intelligibility
Mechanisms
(3) Communicative adaptation
Distance Noise
Lombard speech Traunmller et al. (2000) Kim (2005) Junqua (1992-1999) Garnier et al. (2006-2010) Cooke et al. (2006-2010) Patell and Schell (2008) Picheny et al. (1985, 1986)
Lindblom (1992) Dodane et al. (2006) Burnham et al. Uther et al. (2007)
Mechanisms
(3) Communicative adaptation
Distance Noise
Lombard speech
Foreign people
Mechanisms
(3) Communicative adaptation
In fact, many other speech modifications (developed in parts 3 and 4):
Global acoustic and articulatory modifications that may be related to voice intensity
Mechanisms
(3) Communicative adaptation
Global acoustic and articulatory modifications that may be related to voice intensity
Fundamental frequency (f0) Spectral slope
Correlation established
(Schulmann 1989, Geumann 1999, 2001)
Speech rate
Mechanisms
(3) Communicative adaptation
Linguo-specific modifications that are not directly related to voice intensity (developed in Part 4)
Segment recognition / discrimination F2, F3, Lip rounding, spreading, Speech segmentation Pauses Syllable lengtening Intonation patterns Information highlighting Contrasts in intensity, f0,
Mechanisms
(3) Communicative adaptation Influence of these modifications on
Automatic Speech Recognition (--)
Junqua 1993, Bond
DModifs not related to voice intensity greater or significant only in interactive task
(S1) No interaction
Articulatory modifs
-visible (lips)
-non/less visible (tongue) (S3) Audiovisual interaction
different mechanisms considered, different protocols -Noisy conditions -Sound immersion methods -Speech production tasks
Amazi et al. 1982 [8] Bond et al. 1989 [31] Boril et al. 2005 [32] Castellanos et al. 1996 Davis et al. 2006 [62] Dejonckere et al. 1983 Dieroff et al. 1966 [71] Dohalska et al. 2000 [73] Egan 1972
[80] [65] [46]
Bruit de conversations Bruit rose 25 bruits de voiture (issus de CAR2E) , 4 bruit blancs entre 62-125Hz, 75-300Hz, 220-1120Hz et 840-2500Hz Bruit blanc Bruit blanc, Bruit de conversations Bruit blanc Bruit denfants dans une maternelle Bruit de traffic ferroviaire Bruit blanc large bande, basse frquences, hautes frquences et mdium Bruit blanc Bruit blanc et bruit de conversations Bruit blanc Bruit blanc et bruit de conversations
90 dB SPL 95 dB 90 dB SPL 85dB SPL 80 dB SPL 30, 50, 70, 90 dB 85-90 dB Non prcis 20 120 dB par pas de 10 dB 100-105 dB (A) 85dB SPL 85dB SPL 85dB SPL 80dB SPL 40, 50, 60, 70, 80, 90 dB 95 dB (A) 71, 75, 79 et 83 dB Non prcis 60, 70 et 80 dB SPL 66 dB (A), 79 dB (A) 80 dB SPL 90 dB SPL 80, 90, 100 dB 60, 80, 100, 110 dB 60, 70, 80 dB SPL 70, 80, 90 dB SPL 30, 70-78, 74, 78-85, 87 dB 90 dB 68, 70, 82, 85 dB SPL 30, 70-78, 74, 78-85, 87 dB 65 95 dB SPL 60, 70, 80, 90, 100 SPL 35, 45, 55, 65 dB(A) 31, 39, 61, 65, 75 dB(A) 80, 90, 100 dB 65, 75, 85 dB (C) 80 dB
Junqua 1993 [171] Kadiri 1998 [176] Kim et al. 2005 Korn 1954 [185] Lamprecht 1988
[191] [179]
Bruit blanc et bruit de conversations Bruit blanc de mme enveloppe spectrale quun bruit de conversations Bruit blanc Bruit blanc Bruit de voiture tournant au ralenti, roulant 35 et 55 miles /h avec la vitre ouverte ou ferme
[244]
Lane et al. 1970 [194] Lee et al. 2004 [204] Mixdorff et al. 2006 Mokbel 1992 [245] Papon 2006
[265]
Bruit de conversations Bruit de voiture roulant 90 et 130 km/h, 4 bruits blanc : large bande, filtr par un passe bas 1kHz, 1.5 kHz ou par un passe haut 1.7 kHz Bruit de conversations Bruit blanc Bruit blanc filtr par un passe-bas 3.5 kHz
[305]
Pick et al. 1989 [271] Pisoni et al. 1985 [272] Schultz-Coulon et al. 1976 Siegel et al. 1974 [315] Sinnott et al. 1975
[318]
Bruit blanc Bruit de mme enveloppe spectrale quun bruit de conversations Bruit dans la bande 200-500Hz, Bruit dans la bande 8-16kHz Bruit dune classe calme, bruit de ventilation, bruit de classe bruyante, bar avec de la musique forte Bruit rose
Sodersten et al. 2005 [322] Stanton 1988 [327] Ternstrm et al. 2002
[346]
bruit blanc stationnaire, maternelle, bar avec de la musique forte, bruit de ventilation Bruit dune classe calme, bruit de ventilation, bruit de classe bruyante, bar avec de la musique forte Chorale Bruit rose
[360]
Bruit blanc de mme enveloppe spectrale quun bruit de conversations Salle rverbrante, cantine, salle informatique, voiture, chaufferie
[361]
Noisy conditions
Noises of interest :
Factories (Borsuk et Klajman, 1967, Pruszewicz et al. 1974) Cars (Egan 1972, Mokbel 1992, Boril et Polack 2005, Lee et al. 2004) Train stations (Dohalska et al. 2000) Conversations, Crowds (Ternstrm et al. 2002, 2006, Zeiliger et al. 1994,
Sodersten et al. 2005, Davis et al. 2006, Garnier et al. 2007)
Loud music (Ternstrm et al. 2002, 2006) Preschool (Sodersten et al. 2002, Dieroff et al. 1966)
Noisy conditions
Spectral envelops
Broadband noises
White gaussian noise (Egan 1972, all
reference studies, )
Pink noise (Frank et al. 2003) Ventilation noise (Ternstm et al. 2006)
Noise enhanced in medium or high frequencies Band-pass / High-pass filtered broadband noise
From 60 to 105 dB (SPL, A, C / at ear) ?? Different limitations depending on the country Linearity of the Lombard effect
Garnier et al. (2006)
Noise levels
Noisy conditions
Spectral envelops and energetic masking
Broadband noises
White gaussian noise (Egan 1972, all
reference studies, )
Pink noise (Frank et al. 2003) Ventilation noise (Ternstm et al. 2006)
Types of noise
Temporal fluctuations
Music, small number of talkers important fluctuations Broadband noise, large number of talkers stationnary
Informational masking
Number of talkers Same language as target speech Reverse
Noisy conditions
Noise levels
From 60 to 105 dB dB SPL, A, C Calibration at speakers ear Different limitations depending on the country dB A
Cocktail-party noise White noise
Noise level
Noise level
Amazi et al. 1982 [8] Bond et al. 1989 [31] Boril et al. 2005 [32] Castellanos et al. 1996 Davis et al. 2006 [62] Dejonckere et al. 1983 Dieroff et al. 1966 [71] Dohalska et al. 2000 [73] Egan 1972
[80] [65] [46]
Casque avec retour de sa propre voix Casque avec Casque Casque Casque, Haut-parleurs Casque In situ In situ Casque Casque Casque Casque Casque avec et sans retour de sa propre voix 72 dB
Junqua 1993 [171] Kadiri 1998 [176] Kim et al. 2005 Kim 2005 [180] Korn 1954
[185] [179]
Casque, Haut-parleurs Casque Casque Casque avec retour de sa propre voix In situ Haut-parleurs Casque Haut-parleurs Casque Casque Casque
[305]
Lamprecht 1988 [191] Lane et al. 1970 [194] Lee et al. 2004
[204]
Nonaka et al. 1997 [256] Papon 2006 [265] Pick et al. 1989
[271]
Pisoni et al. 1985 [272] Schultz-Coulon et al. 1976 Siegel et al. 1974 [315] Sinnott et al. 1975 [318] Sodersten et al. 2005 Stanton 1988 [327] Ternstrm et al. 2002
[346] [322]
Casque Casque avec retour de sa propre voix Casque Haut-parleurs Casque Haut-parleurs Haut-parleurs Casque Casque + protections auditives
Ternstrm et al. 2006 [345] Tonkinson 1994 [353] Frank et al. 2003
[97]
[360]
[361]
Casque
~ok
Articulation, duration
ok
-denoising algorithms
Pabon 2006
+ other parameters (f0, spectrum centroid, F1) . Not always just an offset
TF
voice Significant effect on some parameters only Does not compensate for the headphone effect
Recorded Noise
Source signal
(Noise)
Recorded signal
Denoised signal
(Speech + Noise)
(Speech)
Estimated TF
Ternstrm et al. 2002
Ok, under the condition that the speaker does not move too much
dsignation dimages, puis contage dune histoire propos de ces images lecture lecture
[46]
[148]
Phrases quilibres phontiquement, chiffres et nombres, commandes, dates, heures, etc. 13 phrases quilibres phontiquement 10 phrases quilibres issus des phrases de 1969 Texte denviron 30s 44 phrases de 7 12 syllabes Passage dun texte 9 voyelles en contexte h-d, puis en contexte d-d et s-s. Logatomes incopors dans des phrases types. 49 mots (chiffres, lettres, commandes) 50 phrases quilibres, nombres, 82 logatomes de type C(C)VC 50 mots (commandes) dont 4 monosyllabiques, 36 bisyllabiques, 6 trisyllabiques et 4 quadrisyllabiques. 10 phrases quilibres issus des phrases de 1969 Voyelle /a/ tenue Phrase porteuse avec un mot cible changeant (commande aronautique) 13 chiffres, 26 lettres, 10 numros de tlphone, 20 phrases Phrases phontiquement quilibres 12 mots (chiffres et commandes) Mots du jeu Pictionnary, mots construits et non existants 15 mots aronautiques (chiffres et commandes) Texte
[2]
Castellanos et al. 1996 Davis et al. 2006 [62] Dejonckere et al. 1983 Dieroff et al. 1966 [71] Dohalska et al. 2000 Egan 1972 [80] Hofler 1984 [149] Junqua 1992
[173]
lecture lecture
Auditeur passif 2.5 m non oui Non prcis non non non
[65]
[73]
Auditeur passif 2.5 m oui Retour de lauditeur par le biais dune ardoise non non non Oui, systme dardoise non non non
Lane et al. 1970 [194] Lee et al. 2004 [204] Mixdorff et al. 2006 Mokbel 1992 [245] Nonaka et al. 1997 Papon 2006 [265] Pick et al. 1989 [271] Pisoni et al. 1985
[272] [256]
lecture Lecture Phonation induite par stimulation lectrique lecture Parole spontane Lecture Lecture
Siegel et al. 1974 [315] Sinnott et al. 1975 [318] Sodersten et al. 2005
[322]
non non Retour de la comprhension d auditeur (invisible) via un vumtre non Retour de la comprhension d auditeur (invisible) via un vumtre Retour de la comprhension d auditeur (invisible) via un vumtre non non
lecture lecture
lecture
6 textes de 90 s chacun
Chant Lecture
12 passages
Speaker
Experimenter