Professional Documents
Culture Documents
Surround Sound Psychoacoustics
Surround Sound Psychoacoustics
Criteria
Criteria for the design
for the matrix and
of matrix
design of and discrete surround-sound systems
discrete surround-sound systems
M ichael Gerzon
by Michael
Mathematical Institute, University of Oxford
Mathematical Institute.
v---€)
in a horizontal direction,
direction, and and itit isis byby no no by an omnidirectional microphone microphone (see (see
means evident that this this is
is the
the best
best way
way of of Fig. 1). remaining directional
I). The remaining directional infor- infor-
mation at low frequencies
frequencies reaching reaching the the
---e---
difference of
listener is the difference of thethe waveforms
waveforms
at the two ears, which is is the
the velocity
velocity of of
field along the
the sound field the ear-axis
ear-axis (see (see
Fig. I).
1). This is thethe information
information picked picked
(a) up by a sideways-pointing
sideways-pointing velocity velocity or or
figure-of-eight microphone.
microphone.
The fixed-head theories
theories thus thus assume
assume
that the information picked picked up up by by an an
omnidirectional and by aa sideways-facing
sideways-facing
velocity microphone is is all
all that
that isis available
available quadraphonic
Fig. 2. Low-frequency quadraphonic
to the brain. The assumption
assumption that that nono useuse isis available to
localization information available to the
the
T made of amplitude differences
differences at at the
the two
two ears.
A ears amounts to assuming assuming that that com-com-
(b) information:
Omnidirectional information:
ponents of the velocity microphone
microphone infor- infor- U=
0 L8s + LF
= L Lr + RpRF + RR8b
mation that are 90° out out of of phase
phase with
with the the x-velocity information:
omnidirectional information are are notnot used
used X Real(- L 8B +
X = RealC—Z. y LFLF + Rp
Rp -— R R11s))
Omnidirectional and
Fig. 1. Omnidirectional and velocity
velocity in deducing the direction of of sounds.
sounds. The The y-velocity information:
microplwnes
microphones (picture
(picture b) receiving
receiving the
the "moving head" theories assume assume that that thethe YY = Real(L8B + LF Lr -- RF
RF -- RR8s))
same low frequency
frequency information
information as
as the
the velocity microphone information may may For "live" sounds we mustmust have
have
human hearing system (picture a).
system (picture a). point in any direction,
direction, but still still assume
assume 02 =
()1 = i(X + JfJ.
t(..¥22+ yz ).
1974
485
Wireless World.
World, December 1974
iideal.
deal, BMX
BMX & & TMX
However,
However, it is possible to analyse the QMX
QMX with fullfullbandwith
w ith bandwith
directional energy field
field into omnidirectional
omnidirectional — discrete
discrete
and vector components
components analogous to those
used for the sonnd amplitude field at low
the sound z 1eQ
frequencies. If one assumes that the
effect of head movement is
head movement is nsed the
by the
used by ...Q
hla:: 1!50
k1
A
~
brain, these sound energy components ~/
y
can be be used to estimate the probable 0
0 120 ~-
subjective mid-
mid- and high-frequency
high-frequency sound
~
w
direction. For a sound reproduced through through >
several speakers, this this direction may be 8a:: 90
v
d.
rear
sound from that speaker. Calculations
using this theory indicate that various
four-speaker sound reproduction systems
mid-high frequency sound localiza-
-'
:::>
~
()
30
00
front
w 30
30 60 S900 120
120 150
■
150 IE180
back
the mid-high
give the localiza- INTENDED
I NTENDED DIRECTION
D IRECTION
tions shown in in Fig. which agrees well
Fig. 4, which
Tetrahedra/loudspeaker with experimental data26 26•
. Fig. 4. Perceived localization vs intended
Fig. 3. Tetrahedral loudspeaker layout
shown embedded inin a cube. Note that if the number of channels direction of sounds in degrees, according
equals the number of speakers (as for frequency theory of this
the mid-high frequency
to the
"discrete"
"discrete" and QMX via via four speakers), paper, for
for various systems via a square of
then phantom inter-speaker sounds are speakers as in Fig.
Fig. 2. Triangles indicate
independently by the author10,, Gibson drawn toward the nearest nearest speaker. speaker positions. QMX
Q,MX data only applies
al 27 , Eargle28,, Madsen (unpublished)
et aF, Cooper3311-• 32
Cooper 32
has called this the "detent" for a full bandwidth system. Compare
5 effect, but it is not significant for his BMX
CooperS,
and Cooper , is capable of correct low Figs 19 and 20 of reference 26.
with Figs
frequency results, as is the four-channel (two-channel) or TMX (three-channel)
5 " pull" by the speakers
A similar "pull"
QMX system system~ and the tetrahedral with- with- systems. A
systems:
height system of the author6'• '100'•29 29
,, which is found for tetrahedral
tetrahedral with-height repro- recording, we can reinforce the sense of
is reproduced via the the speaker layout of duction (Fig.
(Fig. 3), butbut not
not when a cube of direction created; to the author's know-
Fig. 3. It is also possible to to design a speakers is used. been done in
yet been
ledge, this has not yet
decoder for discrete recordings so as to The ratio of the length length of the above- surround-sound recordings.
satisfy all low-frequency requirements. defined energy vector vector to to the total re-
It is well known that velocity micro- produced energy should ideally be unity; Reverberation to aid localization
phones give an exaggerated bass for very very iu
in practice the larger it is
the larger is the better It isis possible to to locate sounds more
close sounds. BecauseBecause the ears use velocity defined the sound image—itimage- it is this that accurately in a moderately reverberant
microphone information to localize sounds, makes TMX better better than two-channel room than when there is no reverberation.
close loudspeakers modify the directional BMX. Although the mechanism is not under-
effect at the ears. In particular, 90° out- This mid-high frequency theory holds stood, it is found that correctly recorded
of-phase velocity components caused by only so long as the the ears do not have too reverberation
reverberation also aids sound localization
phase shifts are converted to phase great a directionality in their response during reproduction34 34 ,
, although poor arti-
differences between the the ears. This causes to sounds. The data of Sivian & & ficial reverberation makes the sound
the very phase-shifted
very low frequencies of phase-shifted White1717 and Rolls19 19
on the ear's image moremore indistinct. The author has
sounds to be rotated around the listener. listener. directionality show that above about computed the the distribution of reverberation
reverberation
This effect has been been observed by BauerBauer 5kHz aa new theory is needed. energy around the listener given by various
et alaP300 via twotwo speakers, but can be re- recording techniques34 3
4,, and it is is found
moved electronically. The degree of the the Localization above 5KHz localization
that the most accurate sound localization
effect is inversely proportional to
is inversely to loud- In 1907, Rayleigh1111 found that that when is obtained when the energy is uniformly uniformly
speaker distance. the head was stationary the ability to
the distributed, and not concentrated too
distributed,
1
Statistical methods may may be used to distinguish front from rear relied entirely much in any one direction.
•much
apply the above theory to listeners not on .high frequencies. This has been con-
high frequencies. Thus if a surround-sound system is is to
placed in in the centre of the loudspeaker firmed by Stevens & Newman Ne·.vman1414 and work optimally, it must be capable of
15
15
layout. The details are involved, but give Roffler
Roffier & Buder ,
& Butler , who showed that the
showed capturing all nuances of reverberant sound
results somewhat similar to the mid-high ears could localize sounds in the plane and of reproducing these
these uniformly around
frequency theory of sound localization of symmetry of the human head quite
the human listener. Certain popular commercial
the listener.
described next. accurately despite the the two ears receiving matrix systems assign the original sound
the same sound waveform!waveform] This ability the two available channels in such
field to the
89
Mid-high frequency localization disappeared when the pinnae were masked. a discontinuous manner manners. that these
' 9 that
Above 700Hz, the wavelength of sound Conversely, many workers have found criteria cannot be satisfied. "Variable
is sufficiently small that the phase re- that dummy head recordings (which in- matrix" or "logic" decoders, which work
lationships between
between the loudspeakers are corporate the effect of the pinnae's by pushing the whole sound field towards
no longer of primary importance in sound acoustic obstruction) give good spatial those directions in which the sound is
localization. Under these conditions, what localization when reproduced either via momentarily strongest, clearly cannot
matters is the directional behaviour of the headphones or via loudspeakers with the reproduce those nuances of reverberation
33
energy field around the listener. It is pinnae masked33 .• Perhaps using the ulti- needed by the ears to localize sounds.
possible to show that, because of the microphone technique,
mate "purist" microphone The· "detent" effect of discrete repro-
The"
positive nature
nature of energy (in the mathemati- Edmund Rolls of Oxford University has duction (Fig. 4) 4) also prevents uniformly
cal sense), one can only exactly recreate recordings using
made similar recordings using micro- distributed reverberation.
the energy field of a live sound source the ears of real
phones inside the real heads!
loudspeakers
through a small number of loudspeakers loc~lization mechanism is
The pinnae localization is Acknowledgment
sound happens to be at the position
if the sound not well understood, but appears to to rely by
This article is a revision of a paper by
of one of these. Thus at mid and high high on the fact that sounds from each Michael Gerzon given at the 1974 Festival
not all of the ear's localization
frequencies, not direction arrive inside the listener's ear du Son, Paris. (Published in French in
in French
mechanisms can be satisfied in in a practical with a distinctive colouration. Thus, if Journees d'Etudes 1974
Conferences des Joumees 1974
reproduction system. can reproduce that colouration in a
we can Son- Editions Radio.)
du Festival du Son—Editions
486
Wireless World, December 1974
1974
References
Abbreviations JAES
Abbreviations
Audio
Audio Engineering
JAES and and JASA
Engineering Society,
JASA mean
Society, and
mean Journal
and Journal
Journal of
Journal of
of the
the
of the
the
Integrated injection logic
Integrated injection logic
Acoustical
Acoustical Society Society of ofAmerica,
America, respectively.
respectively.
1.I. Fletcher,
Fletcher, H. H. Stereophonic
Stereophonic sound sound film film system-
system-
general
general theory, JASA. vol.
theory,JASA. vol. 13,
13, 1941,
1941, pp. pp. 88-99.
88-99.
2.
2. Blumlein,
Blumlein, A. A. D.
D. British
British Patent
Patent 394.325
394,325 (1931).(1931).
3.
3. de de Boer,Boer, K. K. Stereophonic
Stereophonic sound sound production,
production , The development of new
Philips
Philips Tech.Tech. Rev.,
Rev.• vol.
vol. 5,5, 1940,
1940, pp. pp. 107-44,
107-44.
new techniques in driving one
one input
input of two other gates. Re-
4.
4. Shoner,
ShQrter, G. G. Four-channel
Four-channel stereo,stereo, Wireless
Wireless World,
World, circuit integration has apparently been arranging the interface gives (b) in the the
vol.
vol. 78, 78, 1972,
1972, pp. pp. 2-5,
2-5, 54-7.
54-7. See See alsoalso Wireless
Wireless concentrated in in the field of m.o.s.
m.o.s. devices,
devices, drawing, which can be further simplified
World Annual
World Annual, 1975,1915, pp. 84-9,
84-9. and the amount of information appearing by replacing thethe base resistor by an active
5.
5. Cooper,
Cooper, D. D. H.H. & & Shiga,
Shiga. T. T. Discrete-matrix
Discretc·matr•x multi- multi·
channel
channel stereo,stereo, T/UrS,
JAES, vol.
voL 20,20, 1972,
1972, pp. pp. 346-60.
346-60.
in the technical press about m.o.s. m.o.s. has current source and by substituting a multi- multi-
6-6. Gerzon,
Gerzon, M. M. A. A. Periphony:
Periphony: with-height
with-height sound sound tended to obscure the latest arrival arrival on the collector transistor for those with common
reproduction,
reproduction, JAES, JAES, vol.voL 21,21, 1973,
1973, pp. pp. 2-10.
2-10. bipolar logic field—integrated
field-integrated injection bases. The result is (c), where the input
7. Scheibcr,
7. Scheiber, P. P. Analyzing
Analyzing phase-amplitude
phase-amplitude matrices, matrices, logic (i(i22.l.
.l. for short). Its characteristics emitter is termed the injector, the whole
JAES, vol. vol. 19,
19, 1971,
1971 , pp.
pp. 835-9.
835-9.
are impressive
impressive and it seems set to take over
8. Fellgctt,
8. FeUgett, P. P. B.
B. Perspectives
Perspectives for for surround
surround sound,sound, circuit being contained within the area
Hi-Fi Sound Annual, Annual, 1974. from conventional t.t.l. circuitry when pack- pack- of a t.t.l. multi-emitter input transistor.
9. Fellgett,
9. Fellgett, P. P. B,B. Japanese
Japanese regular regular matrix, matrix, Hi-Fi
HI-Fi ing density and low power dissipation are The combining of the two base emitter
News, Dec.,
News, Dec., 1972.
1972. the essential requirements of a system. junctions of the interface gives protection
10.
10. Gerzon,
Oerzon, M. M. A.A. Principles
Principles of of quadraphonic
quadraphonic re- ro- As a result of the elimination of passive
cording
cording (in two two parts),
parts), Studio
Studio Sound,
Sound, Aug. Aug. & & Sept.,
Sept., against the effect, when junction voltages
1970.
1970. components in the basic gate and aareduc- on different chips differ, of one gate
11.*
11.* Strutt,
Strutt, J. J. W.
W. (Lord
(Lord Rayleigh).
Rayleigh). On On our o ur perception
perception tion in the number of of devices per gate, up monopolizing the current output from from the
of
of soundsound direction,
direction, Phil.
Phil. Mag.,
Mag. , vol. vol. 13, 13, 1907.
1907, nn, pp. to 3000 gates can be fabricated in one previous gate, starving others connected
214-32. chip—an
chip- an increase by by a factor of ten over
12.* Strutt, J. W. O ur perception of the direction of aa
12.' Strutt, J. W. Our perception of the direction of in parallel.
source of
source of sound,
sound, Nature,
Nature, vol.vol. 14,
14, 1876,
1876, pp. pp. 32,
32, 33.
33. t.t.l. chips. The speed of i22.l. .1. is lower than The basic gate can operate at a current
13.* Strutt,
13.'" Struu, J. J. W.
W. Acoustical
Acoustical observations—I.Phil.
observations- l ,Phl/. that of t.t.l. (delay around 30ns instead of of around InAlnA and a logic swing of 0.6V,
of0.6V,
Mag.,
Mag.,l871, 1877, pp. pp. 456. 457.
456.457. 110ns)
Ons) but the speed-power product is only which means interface
interface circuits are needed
14.
14. Stevens,
Stevens, S. S. S.
S. && Newman,
Newman, E. E. B. B. Localization
Localization of of about 0.4pJ or less for ii21.l.,
actual
actual sources sources of of sound,
sound, Amer. A mer. J. J. Psycho!.,
Psycho/., vol. vol.
.l., compared between ij22.I.
.1. and other logic systems or
48,
48, 1936,
1936, pp. pp. 297-306.
297-306. with lOOpJ.
IOOpJ . Cost is lower than in i.es i.cs linear devices. Variations of voltage and
15.
15. Roffler,
Romer, S. S. K.
K. && Butler,
Butler, R. R. A, A. Factors
Factors that that influ-
inllu- using the m.o.s. technology, particularly current can be obtained for different
ence the
ence the localization
localization of of sound
sound in the the vertical
vertical plane,
plane, so as the same chip can coutain contain both applications.
JASA, vol.
JASA, vol. 43,
43, 1968.
1968, pp.
pp. 1255-9.
1255-9.
digital and analogue circuits.
16. Harwood,
16. Harwood, H. H. D.D. Stereophonic
Stereophonic image image sharpness,
sharpness, The new logic family can be used in a
Wireless World,
Wireless World, vol.vol. 74,
74, 1968,
1968, pp.pp. 207-11.
207-1 I. The circuit takes the form of a radically similar range of work as other i.s.i.
J.s.i. systems.
17. Sivian,
17. Sivian, L. J. J. && White,
White, S. S. D.D. JASA,
JASA, vol. vol. 4,
4, 1933,
1933, rationalized direct-couplcd-transistor-logic
direct-coupled-transistor-logic It was originated by Philips at Eindhoven,
pp. 296-8.
pp. (d.c.t.1.)
(d.c.t.l.) element. In the diagram at (a), a Netherlands, and and at about the same time,
18. Wiener,
18. Wiener, F. M, M. OnOn thethe diffraction
diffraction of of a progressive
progressive
typical d.c.t.1.
d.c.t.l. gale
gate (on the left) is shown
sound
sound wave wave by by the
the human
human head. head, JASA, JA.SA, vol. vol. 19.
19,
but independently, by by IBM at Boblingen.
1947, pp.
1947, pp. 143-6.
19.
19. Rolls,
RoUs, E. (private
(private communication).
communication).
20. Clark,
20. Clark, H. H. A. A. M.,
M., Dutton,
Dutton, G. G. F. F. & & Vanderlyn,
Vanderlyn,
P.
P. B. B. TheThe stereosonlc
stereosonic recording
recording & & reproducing
reproducing
system, I.R.E.
system, J.R.E. Trans,
Trans. on on Audio,
Audio, 1957, 1957, pp. pp. 96-111.
96-111.
21. Bauer,
21. Bauer, B. B. B. B. Phaser
Phasor analysis
analysis of of some
some stereo-
stereo-
phonic
phonic phenomena,-LdSVI.
phenomena,JASA. vol. vol. 33.
33. 1961,
1961, pp. pp. 1536-9.
1536-9.
22.
22. Makita,
Makita, Y. Y. OnOn thethe directional
directional localization
localization of of
sound
sound in in the
the stereophonic
stereophonic sound sound field,field, EBUEBU Review,
Review, Tr,
part A
part A no. 73, 1962, pp. 102-8. 102- 8.
23.
23. Leakey,
Leakey, D. D. M.M. SomeSome measurements
measurements on on the
the
elfects of
effects of interchannel
interchannel intensity
intensity and and time time difference
difference
in two-channel
two-channel sound sound systems,
systems, JASA, JASA, vol. vol. 31,
31, 1959,
1959,
pp. 977-87.
977~87.
24.
24. Tager,
Tager, P. P. G.
G. Some
Some features
features of of physical
physical structure
structure
of
of acoustic fields of stereophonic
acoustic fields stereophonic systems,systems, JSMPTE,JSMPTE, X
vol.
vol. 76, 76, 1967, pp. pp. 105-10.
I05-10.
25. Fellgett,
25. Fellgett, P. P. B.B. Directional
Directional information
information in in repro-
repro-
duced sound,
duced sound, Wireless
Wireless World,
World, vol. vol. 78, 78, 1972,
1972, pp.pp.
413-7.
413-7.
26. Kohsaka,
26. Kohsaka, O., 0., Satoh,
Satoh, E. E. & & Nakayama.
Nakayama, T. T.
Sound-image localization in multichannel matrix
Sound-image matrix
reproduction, JAES, vol. 20,1972,
reproduction, JA ES, vol. 20, 1972, pp. 542-8. 542-8.
27. Gibson,
27. Gibson, J. J. J,,
J., Christensen,
Christensen, R. R. M. M. & & Limberg,
Limberg, o- o-
A.
A. L. L, R. R . Compatible
Compatible f.m. f.m. broadcasting
broadcasting of of pano-
pano-
ramic sound,sound,JAES,JAES, vol. 20, 1972, pp. 816-22, 816-22.
28.
28. Eargle,
Eargle, J. M. M. Multichannel
Multichannel stereo stereo matrix
matrix systems:
systems: (a)
an
an overview,
overview. JAES,JA.ES, vol.
vol. 19,
19, .11971,
971, pp. pp. 552-9.
552-9. : a)
29.
29. Gerzon,
Gerzon, M. A. A. Experimental
Experimental tetrahedral
tetrahedral record-record-
ing
ing (in (in three
three parts),
parts), Studio
S tudio Sound,
S ound, Aug.,Aug., Sept.Sept. & & OoL
Oct.
1971.
1971. .
30.
30. Bauer,
Bauer, B. 8. B.,
8., Gravereaux,
Gravereaux, D. D . W. W. & & Gust,
Gust, A. A. J.
J.
Compatible stereo-quadraphonic
Compatible stereo-quadraphonic (SQ) (SQ) recordrecord system,
system,
THjES,
JAES, vol. 19, I971,p. 1971, p. 641.
64 1.
31. Cooper,
31. Cooper, D. D. H.H. Proposal
Proposal for for QMX QM.X discrete^
discrete/
matrix
matrix carrier-channel
carrier-channel disc, disc, privately
privately circulated
circulated
report, July
report, July 15,1972.
15, 1972. Injector
injector -o
32.
32. Cooper,
Cooper, D. D. H.,
H., Shiga,
Sruga, T. T. & & Takagi,
Takagi, T. T. QMX
QMX O
carrier-channel
carricr-<:hannel disc, JAES, vol. 21, 1973, pp.614-24.
disc,JAES, pp.614-24.
33.
33. Sennheiser
SennheiSet Kunstkopf-Stereofonie,
Kunstkopf-Stereofonie, 45 45 revyrain.
rev/ min.
record,
record, SennheiserSennheiser Electronic,
Electronic, 3002 3002 Bissendorf
Bissendorf
(1973). .
34.
34. Gerzon,
Gerzon, M. M. A.A. Recording
Recording techniques
techniques for for multi-
multi·
channel
channel stereo, stereo, British
Brillsh Kinemalography,
Kinematography, Sound Sound & & X"
Television,
Televisio11, vol. vol. 53, 1971, pp.274-9.
((C)
c)
*Refs 11-13
*Refs 11- 13 are
are in:
in: Lord
Lord Rayleigh,
Rayleigh, Scientific
Scientific
Papers, Dover
Papers, Dovet Publications,
Publications, New
New York,
York.
(b)
(b)