Professional Documents
Culture Documents
Hearing A Mistuned Harmonic in An Otherwise Periodic Complex PDF
Hearing A Mistuned Harmonic in An Otherwise Periodic Complex PDF
Hearing A Mistuned Harmonic in An Otherwise Periodic Complex PDF
tone a)
William Morris Hartmann
Physics
Department,MichiganStateUniversity,
E. Lansing,Michigan48824andInstitutdeRecherche
et
CoordinationAcoustique/Musique,
31, rueSaint-Merri,F-75004,Paris,France
StephenMeAdams
") andBennettK. Smith
InstitutdeRechercheet CoordinationAcoustique/Musique,
31, rueSaint-Merri,F- 75004,Paris,France
1712 J.Acoust.
Soc.Am.88(4),October
1990 0001-4966/90/101712-13500.80© 1990Acoustical
Society
ofAmerica 1712
their relativesaliencedependsuponthe amountof mistun- justthelevelof thesinetoneforhisownconvenience.
When
inganduponthestimulus duration. Someofthesecues,such thelistenerwassatisfied with hismatch,heagainpressed the
as a shift of the low pitch or roughness, do not seemto be greenbutton,and the trial with a particularfundamental
relatedto the segregation of a mistunedharmonicasa dis- frequencyandmistunedharmonicwasfinished.At theend
tinct entity.Therefore,whilethedetectionofinharmonicity, of a trial, the listenerreceivedfeedback,a whitelamp if his
as revealedin discriminationexperimentsmay be of some match was within ñ 0.8 harmonic number of the correct
interestin itself,the datafrom theseexperiments cannotbe value,a redlampif hismatchwaslowerthanthisrangeanda
applieduncriticallyto thequestionof segregation of entities. blue lamp if higher.
This conclusionwas also reached by Moore et al. Each experimentalrun consistedof 12 or 16 trials
(1986) whentheyrepeatedtheir 1985experiments, thistime (matches)and on eachtrial a differentharmonicof the com-
askinglisteners to ignoreall othercuesandto respondposi- plextonewasmistuned.The mistuning,measured asa per-
tivelyonlywhentheyweresurethat theyheardout the mis- centage of theharmoniccomponent frequency, wasthesame
tunedharmonic.Not surprisingly, the resultsof thissecond for eachmistunedharmonicthroughoutthecourseof a run.
seriesof experiments differedfrom thefirst.Thresholdswere Runs typicallylasted10 min after which the listenercould
higherand the dependence upon mistunedharmonicnum- come out of the sound-treated room and rest.
ber wasmuch lessapparent.
The goal of the experimentsdescribedin the present B. Listeners
article was the same as in the discriminationexperiments The three authors B, S, and K servedas listeners.They
describedabove,namely,to probethe limits of a listener's rangedin agefrom 33-47 years;all weremaleswith normal
ability to hear out a mistunedharmonic.The method,how- hearingaccordingto their own reports,and all wereexperi-
ever,was different.The presentexperimentsattemptedto encedin psychoacoustical
tasks.They had musicalexperi-
ensurethat a positiveresponse occurredonly whenthe lis- enceandcouldsingtheharmonicseriesup to the tenthhar-
tenerindeedheardout themistunedharmonicby askingthe monic (jumping octavesof course) or, in one case,the
listenerto matchthe pitchof the segregated
component.If eighth.
thelistenercorrectlymatchedthefrequencyof themistuned
harmonic, then that was taken asevidencethat the mistuned
harmonicwassegregated becausethe probabilityof making C. Stimuli
a correctmatchby chancewassmall.
Mainly, thestimulifor thisexperimentwerea subsetof
It was expectedthat this procedurewould give results
thoseusedby Mooreet al. (1986). The nominalfundamen-
that are more reliable than either of the experimentsby
tal frequency was200or 400or 800Hz, andthedurationof
Moore et al. This alsomade it possibleto extendthe mea-
thecomplextoneswas410or 50 ms.Therewere16harmon-
surementsto higher harmonic numbers(higher than the
icsof equalamplitudein thecomplextonespectrum, except
6th), where interestingeffectswere expected.Further, we
thattherewereonly 12whenthefundamentalfrequencywas
hopedto eliminatethe considerableintersubjectvariability
800 Hz. The low-frequency harmonicswerecandidates for
seenby Moore and his colleagues.
mistuning:thefirst 16for a fundamentalof 200 Hz, thefirst
12 for fundamentalsof 400 and 800 Hz. The proportional
I. METHODS frequencydeviationfor mistunedharmonicswasconstant
fora givenrunandvariedparametricallybetween0.5% and
A. Procedure
8%. All the harmonics,includingthe mistunedharmonic,
The listener was seated in a double-walled sound-treat- startedat a positive-going
zerocrossing(sinephase).The
edroom(SolunaSN 1). Beforehim wasa response boxwith complextonehada standard levelof 60dB SPLpercompo-
threepushbuttons,threelamps,andthreepotentiometers. nent;for low-levelrunsthelevelwasreducedto 30 dB SPL
When the listenerpushedthe greenbutton,the computer per component. Both equalizedand nonequalized tones
choserandomlya particularfrequencyfor the fundamental wereused(seetheAppendix).Toneswereturnedonandoff
of thecomplextone(randomizedaccordingto a rectangular with a raised-cosine envelopeof 10-msduration,and the
distribution with a width of ñ 10% about the nominal fun- delaypreceding eachtonewas300msfor 410-mstonesand
damental)anda particularharmonicof the complextoneto 200 msfor 50-mstones.The matchingsinetonehad a dura-
be mistuned.When the listenerthenpressedthe yellowbut- tion of 300 ms, and it wasdelayedsimilarlyto the complex
ton, he heardthe complextone;whenhe pressedthe orange tones.
16
20 ß
18
z
o
:• 12 16
10
14
z
z
3:8 o
12
6 T
10
2
6
0o 2 4. 6 8 10 12 14 16 4
MISTUNED HARMONIC
2
0 50
'• 12
+•+o
4O
10
3O
Z
-r
(_•
8 20
10
'• 6
o ø
o
00 • 4• ' 6• , 8• ß10• ß12I , 14I , 16r
4 o/0 oo o Mistuned Hormonic Number
•2 o/ooo
/., .o., o,.o.... , .... F[G. 4. Hit ratesaveragedoverthe threelistenersfor a nominalfundamen-
00 2 4 6 8 10 12 14 16 tal frequencyof 200 Hz, a levelof 60 dB percomponentanda toneduration
MISTUNED HARMONIC of 410 ms. Equalizedand unequalizedstimulihavebeenaveragedbecause
no importantdifferencebetweentheseconditionscouldbe found.The pa-
rameteristhepercentage of mistuning,4%, 2%, or 1%, aslabeled,andthe
symbol0 indicates0.5% mistuning.
1716 J. Acoust. Soc. Am., VoL 88, No. 4, October 1990 Hartmann et al.: Mistuned harmonic 1716
easyto applythisideato the 16thharmonicbecause itsonly parameter.The horizontalaxis givesthe frequencyof the
neighboris the 15th.At thesefrequenciesthecriticalband- mistuned harmonic, not the harmonic number. Therefore,
widthis about500Hz. Usingthe rule (PlompandSteene- data pointsfor a fundamentalof 400 Hz (triangles3are
ken, 1968) that roughness for two sinetonesdecreases as spacedtwiceaswidelyaspointsfor 200 Hz (circles).
theirspacing increases
beyond1/4 criticalbandwidth( 125 The closedcircles,indicating200Hz, showthepeculiar
Hz), oneexpectsthat 0.5% mistuning(3000 and 3216Hz) nonmonotonic behaviornearthe 15thharmonicasexpected.
might soundsomewhatrougherthan 4% mistuning(3000 Hit ratesfor the other fundamentalfrequencies
seemto de-
and 3328 Hz). creasemore regularly asthe mistunedharmonicnumberin-
Our conclusionis that our experimentcan successfully creases. The closed diamonds for 800 Hz show the consider-
measurethe thresholdmistuningrequiredfor a listenerto able difficulty of this condition;only mistunedsecondand
hearout a mistunedharmonicin a 200-Hz complextoneand third harmonicscouldbe reliably detected,evenat 4% mis-
that this thresholdis roughlya constantpercentage of the tuning.Figure7 showssimilarresultsfor 200- and 400-Hz
mistunedharmonicfrequencyas the mistunedharmonic fundamentalsand2% mistuning.The plotsin Fig. 8 for 1%
numberincreasesfrom 2-11. Beyondmistunedharmonic mistuning
havethesametrendasthosein Fig.7, butall hit
number11, the thresholdfor segregation increases
rapidly. rates are reduced.
We interpretthe low thresholdsactuallyobservedfor har- Figures6-8 invitecomparison withthreshold graphsby
monicnumbers15 and 16 asresultingfrom localroughness Mooreetal. (1986) for fundamentalfrequencies of 100,200,
andnot indicativeof segregationof a singlecomponentonly. and 400 Hz, where the horizontal axis was chosento repre-
sent mistunedharmonicnumber. If one imaginesthat the
B. Effect of fundamental frequency segregation of mistunedharmonicsdependsmainly upon
The effectof thefundamentalfrequencyon theabilityof auditoryfilteringwitha bankofconstant-Qfilters,thenthat
listenersto hear out a mistunedharmonicwas studiedby way of plottingthe data makesgoodsense.Sucha plot
comparinghit ratesfor givenfractionalmistunings,without shouldproducea universalcurvein that the threshold(or
reducingthe resultsto thresholdplots.The threelisteners hit rate) shouldbe independent of fundamental frequency,
eachdid tenrunswith4% mistuningfor eachof threefunda- i.e.,independent of thefrequencyspacing of theharmonics.
mentalfrequencies, 200, 400, and 800 Hz. The averageover On the other hand, if oneimaginesthat the segregation of
listenersis shownin Fig. 6, with fundamentalfrequencyasa mistunedharmonics dependsmainlyuponneuralsynchrony
andis limitedasneuralsynchronydisappears for increasing
frequency,thenthe kind of plot that we havechosen,with
frequency ashorizontalaxis,is morelikelyto leadto a uni-
versalcurve.The datain Figs.6-8 supportthe latter inter-
pretation.Theysuggest thatsynchrony vanishes ratherab-
100 ruptly as frequency
increases
from 2 to 3.5 kHz. This
80
loo
7O
90
6O
8o
,5O
70
4O
60
3O
50
20
40
10
30
%,
J 0.8 •.6 2.4 3.2 4.0 4.8 20
MistunedHarmonicFrequency(kHz) 10
1717 J. Acoust. Soc. Am., VoL 88, No. 4, October 1990 Hartmann ot a/.: Mistuned harmonic 1717
frequencylimit for usefulsynchronyis lower than the upper The figure showsthat it was more difficult to hear out
limit of 5 kHz basedupon physiologicaldata in cat (John- the mistuned fundamental at the low level, a fact that finds a
son, 1974). Our limiting value does seem consistentwith trivial explanationin termsof equal-loudness contours.At
Plomp's(1967) psychephysicallimit baseduponthe beat- 30 dB a 200-Hz tonecan barelybe heard.
ing of mistunedconsonances.
For instance,one doesnot The figure also showsthat there were a few mistuned
hear beats between sine tones at 2000 and 3000 Hz, even harmonicsthatcouldbeheardoutbetterat thelowersignal
though beatsat a 2 to 3 ratio are easilydetectedin lower levelthan at the highersignallevel.Thesewerethe 4th and
frequencyranges.Unfortunately,the data of Moore et al. 5th. For example,hit ratesfor 1% mistuningof the4th har-
(1986) do not contributeon this point becausethey go no monicaveraged 60% at 60 dB percomponent but 86% at 30
higherthan 2.4 kHz. dB percomponent
(410-mstone),a difference
of -- 26%.3
This peculiarleveleffectfor the mistuned4th harmonicwas
C. Effect of signal level magnifiedfor 50-msduration (no figureshown), where the
The overalllevelof the signalcanbe expectedto affect differenceaveraged-- 70%. A similarlevel effectfor mis-
the spreadof the neuralexcitationpatternalongtenetopic tuned4th harmonicsin brief toneshasalsoappearedwith
coordinates. If the ability to segregate
a mistunedharmonic great strengthin discriminationexperiments(Hartmann,
dependscriticallyuponthe spatialdistributionof excitation 1986).
for the mistunedharmonicrelativeto neighboringharmon- By contrast,the detectionof mistunedharmonicsin the
ics,onemightseea leveldependence in thehit rate.To test vicinityof the 7th harmonicwaseasierfor the higherlevel
thisidea,thelistenersdid runsat a levelof 30 dB percompo- tone.For 410-mstonesand 1% mistuningthe differencein
nent to be comparedwith the standardrunsat a levelof 60 averagehit ratesfor a mistuned7th wasabout20% favoring
dB per component.Comparisonswere made for a funda- the 60-dB levelover 30 dB. A similarplot madefor a funda-
mentalfrequencyof 200 Hz andsignaldurationsof 410 and mental frequencyof 400 Hz and duration of 410 ms showed
50 ms, and for a fundamental of 400 Hz at a duration of 410 againthat hit ratesfor the 5th harmonicwerehigherat 30 dB
ms. thanat 60 dB, but that hit ratesfor highermistunedharmon-
The resultsof theexperimentfor 200 Hz and410 msare ic numberwerehigherat 60 dB.
shownin Fig. 9. This figureincludesall the availabledata, Theseresultscan be comparedwith the predictionsof
from blocksof runs with 1% and 2% mistuningand for anymodelin whichdetectionisbaseduponoverlapof exci-
equalizedand unequalizedspectra(See the Appendix), a tationpatternscreatedby neighboring harmonies.If excita-
total of eightblocksin all. The scatterplot in Fig. 9 showsa tion patternsexpand with increasinglevel (Whitfield,
differenceof hit rates,specificallythe hit rate at the 60-dB 1967), thenthe comparison leadsto a paradoxicalresult.
level minus the hit rate at the 30-dB level. In those cases Considerthemistuned7th harmonic:We findexperimental-
where hit rates for both levels were 100% or 0%, no data ly that it is more easily detectedat 60 dB than at 30 dB.
point wasplotted.
100
100
80 o
oo o
60
o o o o
40 o o oO O
o ooo o o
2O
0 OO O•OOOOO
60: 0
o8o • oo
o--o•-o-o--o o o
5O OOOo• o o o
-20
40
o oo o oooõ
o o oo
-40 o õo o o
o
30
-60
20
-80
lO
-1000 ß • ß 4I ß • ß 8I I 10I " 12I • 14I I 16I
•'.o..... 0.8 Mistuned Harmonic Number
Mistuned Hormonic Frequency(KHz)
FIG. 9. Each circlerepresents
the differencebetweenthe hit rate in runs
wheretheIcyelwas60dB/component
andcorrespondingrunswherethe
FIG. 8. SameasFig.7 exceptthatthemistuning
was1%.Thefigureshows levelwas30dB/component.
Whenthedifference
ispositive,
performance
that hit rates are similar for 200 and 400 Hz. Circles at 3.0 and 3.2 kHz wasbetterat the60-dBlevel.Somecircleshavebeenshiftedlaterallysothat
shouldpresumably
beignoredin thecomparison
with the400-Hz data. no circlesoverlapentirely.
1718 J. Acoust. Sec. Am., Vol. 88, No. 4, October 1990 Hartmann eta/.: Mistuned harmonic 1718
Presumably,thismeansthat detectionof the mistuned7th is Our data showed that the hit rates for 50-ms tones were
aidedbyincreasingthewidthof theexcitationpatterns;
at 30 lessthan those for 410-ms tones. This tended to be true for all
dB, theamountof overlapissuboptimal.If thisargumentis mistunedharmonicnumbers;it wasespeciall:/true for the
accepted,thenthe predictionfor the mistuned4th harmonic low-frequencyharmonics1 and 2. Such a reductionis of
is clear. Near the 4th harmonicthe excitationpatternsareparticularinterestif one supposes that the detectionof a
evenmorewidelyseparated(on a physiologicalscale) than mistunedharmonicdependsupon time-dependentfeatures
in the vicinityof the 7th. Therefore,oneexpectsthat again
of the waveform.The elemental temporal feature in the
detectionperformancewill be betterfor increasedlevel.Ex-physicaldescriptionof the stimulusis the dephasingof a
perimentally,however,thereversebehaviorisobserved. Ac-mistunedharmonicwith respectto otherharmonics.A tone
tually, to explainour data would requirethat the relevant
of brief durationlimits the amountof &phasing that can
aspectof the excitationpatternshouldbroadenalongthe occur,andthislimitationisgreatestif themistunedharmon-
tonotopiccoordinate astheleveldecreases.
Thisparadoxical ic hasa lowfrequency. For example,if thesecond harmonic
resultisan exampleof theconsiderable
difficultyinvolvedin of 200 Hz is mistunedby 2%, then that harmonicis de-
developinga comprehensive model for the segregation of phasedat an 8-Hz rate,for a total of 144degafter 50 ms.Hit
mistuned harmonics. ratesfor the firstseveralharmonicswerelow enoughto per-
mit an estimateofa dephasingthreshold.The resultwas 171
D. Effect of duration degwith a standarddeviationof 54 deg,with no particular
The dependence of performanceon tonedurationcan dependenceon level. Apparently, the auditory systemcan
givecluesto the process involvedin segregating
a mistuned work with rather small valuesof the dephasing,approxi-
harmonic.If, for instance,the processwereidenticalto fre- matelyequalto half a cycle.
quencydiscrimination, thenonewouldnotexpecta signifi- Although a dephasingthresholdof half a cyclehasan
cantdurationdependence, solongasdurationsare longer intuitiveappeal,it isnotclearthattheconceptofa dephasing
than about50 ms.This is sobecausefrequencydifference thresholdis very useful.The data of Moore et al. (1986) for
limensat intermediatefrequencies do not changemuchas 410 and 50 msdo not showa particularlylargeeffectat low
tonedurationis increasedbeyond50 ms. harmonic numbers.For all the listeners,the thresholdsfor
thesetwo durationsare roughly parallel as the harmonic
numberisincreased.To explainthisresult,onemightinvoke
I. Hit rates and thresholds two limitationsimposedby shortdurations,a dephasing li-
The two experimentsby Moore et al. (1985) usedfour mitationfor low-frequencyharmonicsand a spectralresolu-
durations: 50, 110, 410, and 1610 ms. The different dura- tion limitation for higherharmonics.
tionsledto differentfunctionswhenthethresholdmistuning
was plotted against the mistuned harmonic number, n.
2. The perceived delay of the mistuned harmonic
When the cues were unrestricted, the thresholds for 410 and
1610 ms decreasedwith increasingn. This behavioris ex- A peculiareffectoccurswhena mistunedharmonicex-
pectedif thedetectionof inharmonicityis mediatedby beats perimentis run at short durationssuchas 50 ms. Listeners
or roughnessandif thekeyelementin detectionis therate of hear the mistunedharmonicsegregatedfrom the complex
this kind of modulation. A constantrate, and henceconstant tone,but the mistunedharmonicemergesfrom the complex
sensationaccordingto this explanation,correspondsto a tone only after a delay. The effectis striking. It was noticed
decreasingpercentagethreshold(decreasingas I/n) as n by the threelistenersin this study.It wasnoticedby all the
increases.If this explanationis correctthen one would ex- listenersin Hartmann's 1985study.Listenerscannotagree
pectthat, for tonedurationsas shortas 50 ms, whereit is whetherthe mistunedharmonicpersistsafterthe endof the
hardto detectbeatsor roughness,the thresholdsshouldnot complextoneor whetherit endstogetherwith the complex
decreaseas a functionof n. The data from the 1985 study tone.They are all agreedthat the mistunedharmonicseems
agreeonthispoint.For 50-mstonestherewasno systematic to start later than the complextone.
tendencyfor thresholdsto decreaseasthe mistunedharmon- ßThe perceiveddelay is consistentwith a two-part idea
ic number increased. aboutauditory organization:The first part is that segrega-
One further expectsthat, if the listenersin the 1986 tion of a tonefrom a complexrequiresinformationand that
studyby Moore et al. were successful in ignoringbeatsor integrationis the defaultoperationin the absenceof infor-
roughness,as they were instructedto do, then the tendency mation.The secondpart is that informationaboutmistuning
for thresholdsto decreasewith increasingn shoulddisap- is acquiredovertime, possiblyas the mistunedharmonicis
pear,evenfor durationsof 410 and 1610ms. This too was dephasedwith respectto neighboringharmonics.A corre-
borneout by the data, as shownin Fig. 2 of that article. spondingtwo-partideain the contextof streamsegregation
Our own experimentson durationdependence useda wasproposedby Bregman(1978).
fundamentalfrequencyof 200 Hz, and levelsof 60 dB per
componentand 30 dB per component.There were two dura- E. Segregation versus resolution
tions, 50 and 410 ms. These conditions are a subsetof the The ability of listenersto hear out a harmonicin a peri-
conditionsusedby Mooreet at., permittinga directcompari- odic complextone was studiedin experimentsby Plomp
son.ListenersB and $ did I0 runswith mistunings of I%, ( 1976pp. 2ff;seealsoPlomp, 1964andPlompandMimpen,
2%, and 4%. 1968). The experimentaltechniquewas straightforward:
1719 d. Acoust.Soc. Am., Vol. 88, No. 4, October 1990 Hartmannot al.: Mistunedharmonic 1719
The listenerhearda complextonewith a dozenharmonics.
Then, the listenerheardtwo sinetones;onesinetonehad a
frequencyequalto oneof the harmonics,the othersinetone
had a frequencythat washalfwaybetweentwo harmonics.
Thelistener's taskwastochoose whichsinetonewaspresent
as a harmonicin the complextone.The data showedthat 4o
listenerscouldeasilyhearout harmonieswith low harmonic o
listenerin a segregationexperiment.
Curiously,it is possibleto gain informationaboutaudi-
tory frequency resolution from our mistuned harmonic
matchingexperiments.But the informationdoesnot come
from the hit rate; the information comes from the misses.
that s.d.'s were much less than this value for matches to
Figure 3 clearlyshowsthat, whena listenermakesincorrect
matchesneara low harmonic(n = 1-5) the matchingfre- harmonics1-4, but a few s.d.'sapproachedthis value for
quenciesare not randomlyscattered;instead,they tend to harmonic5. The datacanbedisplayedmoreconveniently in
lineupatcorrect harmonic values.Thisshows thatlistener• relativeterms,dividing all thes.d.'sby theassignedharmon-
arehearingouttheharmonicthattheyarematching,evenif ic number.A scatterplotshowingall thedataisgivenin Fig.
it is the wrong(i.e., not mistuned)harmonic.This ideacan 10.For reference,the top of the graphisat 6%, equivalentto
a musical semitone.4 Also shown is a curve that is the line
bemadequantitativeby measuringthewidthof thescatterof
matchesaround harmonic values. If the width is small, then 0.289/n for randommatching.
that is evidence that the listener heard out the harmonic as a The collection of dataon themisses was,by nature,op-
resolvedcomponent. portunisticand not systematic.Nonetheless,the data do in-
To do the scatteranalysis,we first assignedeachmissto dicatea dramaticchangeat aboutthe 5th harmonic.This is
a bin, labeled by the harmonic number closestto the miss. in nearagreementwiththeresultsof PlompandMirapenfor
For instance, matches at harmonic numbers 4.3 or 3.6 were hearingout the harmonics of a 200-Hz fundamental.
They
assigned
to thebinfor the4th harmonic.A separate
analysis actuallyconcluded that the 5th wasthe highestharmonic
that could be resolved.
wasdonefor eachlistenerandeachexperimental condition,
i.e., for each block often matches.If there were lessthan four But althoughour data can be used,as above,to learn
matchesin a bin, then that bin was dropped.If there were aboutfrequency
resolution,
theprincipalpurpose
of thissec-
four or more matches in a bin, then the unbiased standard tion of the paperis to emphasizethe considerable difference
deviationof the matcheswascomputed,aboutthe (possibly betweenresolutionandsegregation. In particular,thereisno
nonintegral)mean. contradiction
whenPlompandMirapenfindthat thehigh-
A convenient reference is the value of the standard devi- est resolvable harmonic of 200 Hz is the 5th and when we
ation for randommatching.For example,a randommatch findthatlisteners
cansuccessfully
segregate
a 10thharmon-
in the 4th harmonic bin is equally likely to be anywhere ic that hasbeenmistunedby only 1%.
between 3.5 and 4.5 harmonic numbers. The value of the
IlL PITCH SHIFTS
standarddeviationfor randommatchesis thusequalto the
squarerootof theintegralofx 2between-- 1/2 and + 1/2, As notedabove,thematchingdatarevealed
pitchshifts.
i.e., the squareroot of 1/12, namely0.289. The data showed The pitch of a sinetone (the mistunedharmonic) embedded
1720 J. Acoust. Soc. Am., VoL 88, No. 4, October 1990 Hartmann eta/.: Mistuned harmonic 1720
lO.
8. 8.
6. 6.
4.
2. 2.
o.-- O.
•2o •2.
-•. -4.
•6.
8.
in a complextoneisdifferentfromthe.pitchin isolation.As a 6.
result,the frequencyof the isolatedmatchingtonethat best
matchesthe pitch of the mistunedharmonicis not equal to
4.
the frequencyof the mistunedharmonic;it is shifted.There
havebeenpreviouseffortsto measurethe pitchesof thecom-
2.
ponentsof a complextone (Terhardt, 1971;Houtsma,198!;
Petersetal., 1983).The mistunedharmonictechnique isone
way to makeharmoniesstandout from the complexsothat O.
theirpitchescanbematched.
The data to be presentedhere were takenfrom those •2.
pitch matchesthat were foundto be "hits." Therefore,for
any mistuned harmonic, the standard deviation of the
matcheswas within the 2.5% self~consistency window. We
further requiredthat at leastfour matchessurvivethe self-
consistency testto calculatea meanpitch shift. Becauseof
theserequirements, it wasnot possible
to finda pitchshiftfor
every mistunedharmonic number in every condition. The
shift itself was calculatedas a percentagedeviation of the
matchingfrequencyfrom the actual frequencyof the mis- 10o 2 4 6 8 10 12 14 16
tuned harmonic. Results for listenersB, S, and K are shown MISTUNED HARMONIC
in Figs.11-13,respectively,
for thecaseofa 200-Hz funda-
mental, 60 dB per componentand 410-msduration. Open
symbolsin the figuresshowshiftsfor runswith positivemis- FIG. 13.Sameas Fig. I l but for lislenerK.
1721 J. Acoust.Soc. Am., Vol. 88, No. 4, October 1990 Hartmann eta/.: Mistuned harmonic 1721
tunings;closedsymbolsare for runswith negativemistun- closedsymbolscorresponding to negativemistuning.For
ings.On runswith negativemistuning,only the 12 lowest 65% of thosepoints,the pitchshiftis largerthan the stan-
harmonicswerecandidatesfor mistuning. darddeviation.This is evidencethat the negativepitchshift
The figuresshowthat positivemistuningsusuallyled to is genuine.
positivepitchshifts;thistendency isverystrong.The figures Althoughthe Terhardtalgorithmfor pitchappearsto
alsoshowthat negativemistunings tendedto producenega- fail, it still is quitepossiblethat the pitch shiftsobservedin
tive pitchshifts.The effectfor negativemistuningsis not so ourexperiment resultfromeffectsthataretonotopically lo-
strongnorconsistent astheshiftforpositivemistunings, but cal.If thisisso,thenthepitchshiftsshouldbesensitive to the
it is clearlypresentin the data.In otherterms,mostof the separation betweenadjacentharmonics, or spectraldensity.
datashowa formof"overshoot"effect.Whenthemistuning To test this idea, we studiedthe pitch shiftsobtainedfor
is positive,the pitch is shiftedto a still morepositivevalue, nominalfundamental frequencies of 400 and 800Hz. Only
and,whenthemistuningisnegative,thepitchbecomes even positivemistuningswere used.The amountof significant
more negative. datawaslimited;pitchshiftswerenormallygreaterthanthe
The pitchmodelof Terhardt (1979) andhiscolleagues standard deviations for harmonies 3, 5, 6, and 7 for 400 Hz
(Terhardt et aL, 1982) includesan algorithmfor the calcula- but only for the third harmonic for 800 Hz. Becausethe
tion of the spectralpitchesof the harmonics.The algorithm spectraldensityis smaller for an 800-Hz fundamentalthan
has the flavor of a place theory and doesnot requirethe for 400 Hz, oneexpectssmallerpitchshiftsfor the 800-Hz
partialsof a tone to be strictly harmonic.It is, therefore, fundamental in anygivenfrequencyregion.Thisexpectation
applicableto our stimuli.Accordingto the algorithm,the wasborneout by thedata:The shiftfor the third harmonicof
spectralpitchesare shiftedbecauseof partial maskingef- 800 Hz was 1.8%, considerablylessthan the shift for the
fects.For example,thespectralpitchof thefundamentalof a sixth harmonic of 400 Hz, which was 4.8%.
complextoneisshifteddownwardbecause theexcitationfor Similarly,onemight expectthe pitchshift to decrease
the fundamentalis maskedfrom aboveby higherharmonics, with decreasingsignallevel becausea smallersignallevel
mainlythe second.The predictionsof the algorithm,calcu- should lead to less interaction between the mistuned har-
lated from the formulasgivenin Terhardt et al. (1982), are monicand its neighbors.To testthis idea, the three listeners
shownin Fig. 14. The figureshowspredictionsfor -- 4% did runswith a 200-Hz fundamental,a mistuningof 2%, and
and 4% mistuning;the predictionsfor other valuesof mis-
tuning actually used in our experimentslie between the
curvesfor -- 4% and4%. The algorithmpredictsa shiftfor
zero mistuning(harmoniccomplex), approximatelymid-
way betweencurvesfor -- 4% and 4%, althoughPeterset
aL (1983) did not find such shifts.
Comparingthe predictedshiftswith the observedshifts
showsthat the algorithmcorrectlypredictsthe trendof the
data when the mistuningis positive.When the mistuningis
negative,however,the algorithmfails completely.Experi-
mentally,negativemistuningusuallyleadsto negativepitch
shifts. By contrast,the algorithm predictspositivepitch
shiftsfor negativemistuning.Figure14showsthat, for har-
monics2-7, the predictedpitchshiftsfor negativemistun-
ingsareevenmorepositivethanthepredictedpitchshiftsfor
positivemistunings. This is a resultof theupwardspreadof
masking.For the third harmonic, for example,the major
contributionto the pitch shiftisthe secondharmonic.When
the third harmonicis mistunednegatively,it comescloserto
the secondharmonic,and sothe pitch shiftbecomeslarger
(morepositive).
• The factthatthealgorithmfailssobadly
for negativemistuningsuggests that thereis somethingquite
wrongwith the ideathat pitch shiftsare mainly determined
by partial masking.
The reliabilityof pitchshiftdatacanbeassessed by com- I I I I [ I I I I I I I I I I I
1722 J. Acoust. Soc. Am., Vol. 88, No. 4, October 1990 Hartmann eta/.: Mistuned harmonic 1722
a durationof 410 ms,usingthreedifferentsignallevels.Per teetedat 60 than at 30 dB, but a mistuned4th harmonic was
component, theselevelswere60, 30, and 12 dB SPL. The moreeasilydetectedat 30 thanat 60 dB. If thedetectionof a
goaloftheexperiment wastocompare pitchshiftsfora given mistunedharmonicrequiresan overlapbetweenexcitation
mistuned harmonic across the different levels. The matches patternsassociatedwith the mistunedharmonic and its
thatpassed theacceptance testspermitted36comparisons of neighbor (s), and if excitationpatternwidthsincreasewith
60 dB with 30dB,and 15comparisons of 30dB with 12dB. increasing level,thentheleveleffect,if any,shouldbeoppo-
Of these51comparisons therewereonlynineeases thatwere siteto the onethat we observed.Experimentswith tonesof
inconsistentwith the rule that lower signallevelslead to briefdurationshowedthat theamountof waveformdephas-
smallerpitchshifts. ing requiredto segregate a harmoniccanbe small.
An interestingvirtueof the matchingmethodis that the
IV. CONCLUSIONS data usedto determinesegregationperformancecan alsobe
The abilityof a listenerto segregate
a mistunedharmon- usedto studya quitedifferentmatter, namely,the pitchesof
ie from a complextonebackgroundis an exampleof audi- thespectralcomponents of a complextone.Our experiments
tory organizationbaseduponsteady-state cues.We began foundthat there are pitch shiftsand that thesetend to be
ourstudyof thiseffectwiththeintentionof improvingonthe exaggerated versionsof the mistuning;i.e., positivemistun-
dataof Mooreandhiscolleagues, byusinga matchingexper- ingleadsto positivepitchshiftandnegativemistuningleads
imentinsteadof a discrimination experiment.We foundthat to negativepitch shift.The negativeshiftsare smallerand
thematchingexperiment hasa numberof advantages, but it lessstablethan the positiveshifts,but they are undeniably
real.By contrast,a modelwherebypitchshiftsarecausedby
is not perfect.First, we wererequiredto discardasartificial
successful matches for harmonies above the 12th. We attrib- partial maskingof excitationstrengthpredictsthat pitch
uted theseto localized roughnessor beats and not to the shiftsshouldbe positivefor bothpositiveand negativemis-
tunings.
segregation of a mistunedharmonic.Thisproblemoccurred
for a fundamentalfrequencyof 200 Hz; it did not occurfor
fundamentals of 400 and 800 Hz. Second,usingthe match- ACKNOWLEDGMENTS
ing techniquedid not eliminatethe largeintersubjectvari-
ability that seemsto be characteristicof mistunedharmonic The workof WMH wassupported bytheNationalInsti-
detectionexperiments. tutesof Health and by the US-France CooperativeScience
The matchingtechnique enabledusto extendmeasure- Programof the National ScienceFoundationand the Centre
mentsto higherfrequencies, andthisrevealedan important National de la RechercheScientifique.
absolutefrequencylimit, between2.2 and 3.5 kHz, for the
segregation ofa mistunedharmonic.We understandthisre-
APPENDIX: EXPERIMENT DETAILS
sult by supposing that segregation
is mediatedby the detec-
tion of asynchronyin neuralfibersstimulatedby both the This Appendixincludesfurther detailson the experi-
mistunedharmonicand neighboring harmonies.The high- mentalproceduresand the analysis.
frequency barrierthenreflectsa limitonencoding byneural
synchrony asfrequency increases.
For frequencies below2 kHz, segregation thresholds,
expressed as a percentof mistunedharmonicnumber,ap- 1. Equal,zed stimuli
pearedto be roughly independentof mistunedhartsonic Mostof thepitchmatching
experiments,
whichprovide
number,neglectingspecialeffectsfor the fundamentaland data for this article, were done twice. In the first set, the
individual differences. To some extent this result reflects our electricalsignalsentto the phoneshad 16 harmonicswith
dataanalysismethod.Our criteriafor a "hit" includeda self- equalamplitudes.For the secondset,the harmonicampli-
consistency
window that was a constantpercentageof the tudesof theelectricalsignalwereadjustedsothattheampli-
mistuned harmonic number. Had we chosen a window with tudesof theacoustical signal,asmeasured witha spectrum
a constantfrequencywidth for all mistunedharmonicsthen analyzerandflat-platecoupler,wereall equal.Eachsetcom-
segregation thresholds,expressed as a percentof mistuned prisedten matchesby a listenerto eachmistunedharmonic
harmonicnumber,wouldhaveincreased for increasing
mis- for each percentageof mistuning.Although some data
tunedharmonicnumber.However,thismethodological ar- pointsfor individuallistenersweredifferentfor thetwo stim-
bitrarinessis unlikelyto havehadan importanteffecton our ulussets,therewaslittle agreement amongthedatafor dif-
conclusions because,when detectionperformancedeterio- ferentlisteners with respectto potentialeffects of equaliza-
ratesat high frequency,it deterioratesdramatically.With tion.Because therewasnosystematic effectof equalization,
eitherwindowcriterion,the resultsshowdearly that segre- we combined the data for the two stimulus sets. Data from a
gationthresholdsdo not decreaseasthe mistunedharmonic third setof tenmatches,
with 16unequalized harmonics of
number increases.By contrast, earlier thresholds from whichonlythefirst12mightbemistuned, werealsoaddedto
Moore et al. (1983), where detectioncould be basedupon the setwhenit wasdiscoveredthat theydid not differin any
any cue,did decreasewith increasingharmonicnumber. importantway from the dataof the othertwo sets.Increas-
Further experimentsfound that segregationperfor- ingtheamountof datain thiswayhadtheeffectof smooth-
manceshowedparadoxicalbehaviorasa functionof overall ing the data for individuallisteners,a desiredresultfor our
signallevel. A mistuned7th harmonic was more easilyde- purposes.
723 d. Acoust.Soc. Am.,Vol. 88, No. 4, October 1990 Hartmanneta/.: Mistunedharmonic 1723
2. Success by persistence grationofauditoryentities,"inAuditoryFunction,
editedbyG. M. Edel-
man,W. E. Gall, and W. M. Cowan (Wiley, New York), pp. 623-645.
In a matchingexperimentof our kind, misleadingre- Houtsma,A. J. M. (1981). "Noise-inducedshiftsin the pitch of pure and
suitsmaybeobtainedifs listenerpersistentlymakesa partic- complextones,"J. Acoust.Soc.Am. 70, 1661-1668.
Johnson,D. H. (1974). "The response
of singleauditory-nervefibersin the
ular match. If, for example, the listener always matchesthe
cat to singletones;synchronyand averagerate," Ph.D. thesis,MIT,
third harmonicof thecomplextone,then,on the trialswhere Cambridge,MA.
the third harmonic is actually mistuned,the listenerwill Klein, M. A., and Hartmann W. M. (1981). "Binaural edgepitch," 1.
score100% correct,evenif he cannotidentify the third har- Aconst. Soc. Am. 70, 51-61.
Marin, C. M. H., McAdams, S., and Lienard, J-S. (1989). "The role of
monicasthecorrectanswer.Thiseffectisa potentialsource beatsin concurrentsoundsegregation basedon frequencymodulation
of error. We discoveredonly one caseof such persistent cues,"in Proceedings of the 5th Annualmeetingof theInternationalSo-
matching,listenerK at 200 Hz, 60 dB/eomponentand0.5% cietyfor Psychophysics, Ca*sis,France,pp. 120-125.
mistuning.Here, it wasnecessary to correctthe hit rate for MeAdams,S. (1984a). "Spectralfusion,spectralparsingandtheformation
of auditoryimages,"Ph.D. thesis,StanfordUniversity.
the mistunedthird by subtractingthe rather large percen- MeAdams,S. (1984b). "The auditoryimage:A metaphorfor musicaland
tageof matchesto the mistunedthird harmonicwhen har- psychological researchon auditoryorganization,"in Cognitioe Processes
monics 1,2,4, and 5 were mistuned. in the Perception of.4rt, editedby R. Crozierand A. Chapman(North
Holland, Amsterdam), pp. 183-187.
MeAdams,S. (1989). "Segregation of concurrentsounds.I: Effectsof fre-
To testouranalysis
method,
wereversed
theorderofoperations:
Wefirst quencymodulationcoherence,"J'.Acoust.Soc.Am. 86, 2148-2159.
foundthehit-rateplotsfor eachpercentage of mistuning,
averaged over Moore, B.C. J. ( 1982). An Introductionto thePsychologyof Hearing ( Aca-
the threelisteners,
givenasFig. 4, andtheninterpolatedfor the 60% hit demic, London), 2nd ed.
rateto finda singlethreshold
plot.That plotwascompared withtheplot Moore,B.C. J., Peters,R. W., andGlasberg,B. R. (1985). "Thresholdsfor
shownin Fig. 5. For eachof the 16mistunedharmonics,thedifference
was thedetection ofinharmonicity in complextones,"J. Acoust.Soc.Am. 77,
nevergreaterthan 0.3 percentage points,exceptfor the mistuned7th, 1861-1867.
wherethe differencewas0.4 points. Moore, B.C. J., Peters,R. W., and Ginsberg,B. R. (1986). "Thresholdsfor
Thatlisteners
canidentify
frequency
ranges
wherebeating
takesplaceina hearingmistunedpartialsas separatetonesin harmoniccomplexes,"J.
broadbandcontextwas recentlyshownby Marin et al. (1989). Acoust. Soc. Am. 80, 479-483.
Theimprovement
indetecting
some
mistuned
harmonics
asthesignal
lev- Moore,B.C. J., andGlasberg,B. R. (1989). "Differencelimensfor phasein
el is decreaseddid not continueindefinitely.Runs done at a level of 15 normal and hearing-impairedlisteners,".I. Acoust.Soc.Am. 86, 1351-
dB/componcnt showedhit ratesthat were alwayslessthan the ratesfor 60 1365.
dB/component. Moorer, J. A., Chauveau,A., Abbolt, C., Eastty,P., and Lawson,1. (1979).
4Data for the s.d. of the missescome from ten-run blocks as follows: five "The 4C machine,"Cornput.Music J. 3, 16-24.
unequalized
at0.5% mistuning;fiveunequalizedandtwoequalized at 1%; Peters,
R. W.,Moore,B.C.J.,andGlasberg,
B.R. (1983)."Pitchofcom-
five uncqualizedand threeequalizedat 2%; five unequalizedand one ponentsof a complextone,"J. Acoust.Soc.Am. 73, 924-929.
equalizedat 4%. PIomp,R. (1964). "The earasa frequencyanalyzer,"J. Acoust.So:. Am.
5Thereisnoeasywayto explain
whythepredicted
pitchshiftbecomes 36, 1628-1636.
largerforpositivemistuning
thanfornegativewhenthemistunedharmon- PIomp,R. (1967). "Beatsof mistunedconsonances,"
J. Acoust.Soc.Am.
ic numberis greaterthan 8. Terhardt'salgorithmis semiempirical
and 42, 462-474.
owesat leastasmuchto experimental dataasto modelbuilding. PIomp,R., andMirapen,A.M. (1968). "The earasa frequencyanalyzer
II," 1. Acoust. Soc. Am. 43, 764--767.
Bregman,A. S. (1978). "Auditory streamingis cumulative,"J. Exp. Plomp,R., and Steeneken,H. J. M. (1968). "Interferencebetweentwo sim-
Psych.:Human Percep.Perform.4, 380-387. ple tones,"J. Acoust.Soc.Am. 43, 883-884.
Bregman,A. S., and Pinker,S. (1978). "Auditory streamingand the build- PIomp, R. (1976)..4spectsof Tone Sensation--/! Psychophysicai Study
ing of timbre," Can. J. Psych.32, 19-31. (Academic, London).
Gardner, R. B., and Darwin, C. 1. (1986). "Grouping of vowelharmonics Rasch, R. A. (1978}. "The perceptionof simultaneousnotesas in poly-
by frequencymodulation:Absenceof effectson phonereit categoriza- phonie music," Acustica40, 21-33.
tion," Percept.Psychophys. 40, 183-187. Terhardt,E. ( 1971}. "Pitch shiftsof harmonics,an explanationof the oc-
Hartmann,W. M. (1985). "Perceptualentitiesfromcomplexinharmonic laveenlargementphenomenon," Proc.7th ICA 3, 621-624.
tones," Assn. for Researchin Otolaryngology8th Meeting, pp. 168 Terhardt,E. (1979). "Calculatingvirtualpitch,"Hear.Res.1, 155-182.
(airs). Terhardt,E., Stoll,G., and Sweewann,M. ( 1982}. "Algorithmfor extrac-
Hartmann,W. M. (1986). "Pitchand the perceptualorganizationof com- tion of pitch and pitch saliencefrom complextonal signals,"J. Aconst.
plexspectra,"J. Acoust.So:. Am. Suppl.1 79, S65. Soc. Am. 71, 679-688.
Hartmann, W. M. (1988). "Pitch perceptionand the segregation
and inte- Whitfield, L C. (1967). The.4uditoryPathway(Arnold, London).
1724 J. Acoust. Soc. Am., Vol. 88, No. 4, October 1990 Hadmann et al.: Mistuned harmonic 1724