Professional Documents
Culture Documents
The Effect of Diffuse Reflections On Spa PDF
The Effect of Diffuse Reflections On Spa PDF
1. Introduction
Diffusive architectural surfaces are widely used to evenly distribute sound throughout
rooms and eliminate echoes. To distribute reflected energy throughout a performance
venue, the acoustic designer is primarily concerned with the spatial diffusion a reflect-
ing surface produces. However, for a given listener position, other perceptually rele-
vant effects of a diffusive surface are the temporal diffusion and frequency spectrum of
its reflection. Additionally, individual diffusers have their own sound character
(Kleiner et al., 1992), and diffusers may create effects associated with reduced prefer-
ence, such as decreased reverberance and loudness (Ryu and Jeon, 2008).
This paper further investigates the perceptual effect of diffusers by attempting
to objectively quantify the effect of temporal diffusion on spatial impression by testing
listeners’ abilities to distinguish the positions of laterally separated acoustic sources.
The results are relevant to room acoustic designers who would like to maintain precise
localizability of individual sources in a group, or, on the other hand, achieve a specific
level of blend between sources through the application of diffuse architectural surfaces.
Localizing sound in a room is a complex auditory task. Subsequent to the
direct sound, a multitude of reflections arrive at the ears from different directions and
with various delays, levels, and frequency contents. Despite the additional sound waves
arriving at the ears, and in the absence of a very delayed strong reflection, listeners
perceive only one source, and are able to estimate the arrival direction of the direct
sound relative to their own position, even when the reverberant energy is greater than
that of the direct sound (Hartmann, 1983).
While the direct sound is important for localization, the reflected sound waves
contribute to spatial impression, which is comprised of at least two distinct effects. The first
is a broadening of the apparent width of a sound source, which is primarily determined by
early lateral reflections. The second is the listener’s sense of being enveloped by sound,
which is determined primarily by late-arriving reflections (Barron and Marshall, 1981;
a)
Author to whom correspondence should be addressed.
EL370 J. Acoust. Soc. Am. 133 (5), May 2013 C 2013 Acoustical Society of America
V
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 84.253.223.176 On: Fri, 01 Nov 2013 06:02:34
Robinson et al.: JASA Express Letters [http://dx.doi.org/10.1121/1.4798648] Published Online 8 April 2013
Bradley and Soulodre, 1995). In addition to the aforementioned contribution to the per-
ceived source width, early reflections naturally amplify a source and may add a certain
sound color because of interfering direct and reflected sound (Lokki et al., 2011).
Due to difficulties in objectively measuring subjective percepts like apparent
source width and envelopment, an objective test was developed as a proxy. In this test,
spatial discrimination was measured by iteratively asking subjects to distinguish which
of three sound signals differed from the other two, while the spatial separation of two
sources was adaptively modified, to determine the threshold of discrimination with var-
ious reflection types.
Fig. 1. (A) A schematic representation of the surfaces used in the image source model. The complete binaural
simulation was generated by adding the early reflections from these surfaces to measured late reverberation. (B)
The listening test setup. In each trial, the left talker in Condition B was adaptively positioned to change the sep-
aration distance angle. The receiver position is indicated by “R” and sources are indicated as “M” and “F” for
male and female, respectively. The male and female voice positions were randomly switched at each trial.
J. Acoust. Soc. Am. 133 (5), May 2013 Robinson et al.: Diffusers and spatial discrimination EL371
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 84.253.223.176 On: Fri, 01 Nov 2013 06:02:34
Robinson et al.: JASA Express Letters [http://dx.doi.org/10.1121/1.4798648] Published Online 8 April 2013
2 kHz was measured using techniques described by Robinson et al. (2009). These
impulse responses were processed to include only the reflection (Robinson and Xiang,
2010), scaled to normal audio frequencies (Robinson and Xiang, 2013), and normal-
ized to contain the same total energy as a specular reflection. The center panel of
Fig. 2 illustrates that these reflections are slightly more spread in time than specular
responses and have a less even frequency response. In the Simulated Diffusive case, a
reflection was generated to emulate the envelope and temporal spreading present in the
measured diffuse response, while maintaining a flat frequency response. The response
for the simulated diffuse reflection was generated by multiplying white Gaussian noise
with an envelope with the shape of the probability density function of a gamma distri-
bution of the desired length and shape. A similar method has been applied by Siltanen
et al. (2012). In this case, the reflection response was temporally diffused over approxi-
mately 16 ms. The resulting reflection impulse response was whitened by iteratively
multiplying the magnitude response by its inverse in the frequency domain until the
deviation from a flat spectrum was smaller than 60.1 dB. For Specular reflections,
step two was omitted, resulting in ideal reflections that were identical to the direct
sound with the exception of amplitude and direction of arrival. The Measured and
Simulated diffuse responses were normalized to contain energy equal to the specular
reflection. This resulted in three early reflection conditions: Specular, from the image
source model; Measured Diffuse, which included spectral coloration and temporal
spreading; and Simulated Diffuse, which included temporal spreading but not spectral
coloration.
The third step transformed the treated echogram into a binaural impulse
response. The reflections in the echogram were individually convolved with head
related transfer functions (HRTFs) for the appropriate arrival directions, and added to
create the early part of spatialized binaural impulse responses. The HRTF measure-
ment procedure can be found in Pulkki et al. (2010).
Finally, in the fourth step, measured late reverberation was added. Late rever-
beration was measured with a Genelec 1029A loudspeaker at the center of the stage at
12 m distance from the receiving position in the Helsingin Konservatorio concert hall.
An impulse response was recorded with a G.R.A.S. 3-D intensity vector probe with
capsule pairs arranged in X, Y, and Z directions. These were processed with Spatial
Fig. 2. (Color online) The frequency response of the first 200 ms of the left ear from the simulated binaural
impulse response. The upper panel shows specular reflections, the middle panel shows an impulse response uti-
lizing the reflection response measured from a diffuser, and the lower panel shows a simulated diffusive surface
response.
EL372 J. Acoust. Soc. Am. 133 (5), May 2013 Robinson et al.: Diffusers and spatial discrimination
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 84.253.223.176 On: Fri, 01 Nov 2013 06:02:34
Robinson et al.: JASA Express Letters [http://dx.doi.org/10.1121/1.4798648] Published Online 8 April 2013
Impulse Response Rendering (Pulkki and Merimaa, 2006) into a total of 24 virtual
loudspeakers surrounding the listener, using the same HRTF set as used for the early
reflections. The binaural reverberation was then joined to the simulated early part of
the binaural impulse response with a sinusoidal fade-in duration of 80 ms beginning at
60 ms after the direct sound. The procedure was repeated for all simulated source posi-
tions. The reverberant level was adjusted such that it was clearly audible without intro-
ducing artifacts during fade-in, and produced a smooth decay from the early reflections.
For further details on the complete simulation process, see Lokki et al. (2011).
Figure 2 illustrates example spectrograms including direct sound, reflections, and late
reverberation from the first 200 ms of the impulse responses for all three surface condi-
tions. The resulting impulse responses had similar acoustical parameters (see Table 1)
to measured responses reported by Lokki et al. (2012), and informal listening revealed
them to be similar enough to be considered natural and realistic.
Table 1. Calculated acoustical parameters for the first 6 source positions in the simulated concert hall. Listed
parameters are Early Decay Time (EDT), Reverberation Time (T30), Clarity index (C50), and Inter-aural Cross
Correlation coefficient (IACC). Parameters are averaged from 500 to 2000 Hz.
Position Spec. S.Dif M.Dif Spec. S.Dif M.Dif Spec. S.Dif M.Dif Spec. S.Dif M.Dif
1 1.84 1.78 1.75 2.08 2.08 2.05 0.54 0.32 0.22 0.42 0.42 0.43
2 1.66 1.59 1.58 2.06 2.05 2.02 0.09 0.06 0.59 0.42 0.42 0.40
3 1.71 1.64 1.65 2.06 2.06 2.02 0.44 0.28 0.36 0.28 0.29 0.29
4 1.68 1.62 1.6 2.06 2.05 2.01 0.46 0.35 0.35 0.27 0.28 0.32
5 1.69 1.63 1.64 2.06 2.06 2.01 0.12 0.01 0.05 0.28 0.29 0.29
6 1.72 1.64 1.64 2.06 2.05 2.01 0.20 0 0.08 0.26 0.28 0.30
Mean 1.72 1.65 1.64 2.06 2.06 2.02 0.31 0.15 0.47 0.32 0.33 0.34
J. Acoust. Soc. Am. 133 (5), May 2013 Robinson et al.: Diffusers and spatial discrimination EL373
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 84.253.223.176 On: Fri, 01 Nov 2013 06:02:34
Robinson et al.: JASA Express Letters [http://dx.doi.org/10.1121/1.4798648] Published Online 8 April 2013
the center as the test progressed. The position of the second talker was determined by
an adaptive one-up, one-down procedure per Levitt (1971), with progressively smaller
step sizes to determine the 50% detection threshold for differences in the separation of
two simultaneous sources. One of the talkers was male and the other female; whether
the male was on the right or left was determined randomly each trial. The male and
female talkers simultaneously recited a different randomly selected sentence from the
speech corpus, which changed with each trial. Listeners were presented with a graphi-
cal user interface with three play buttons. Two of these buttons were assigned to play
condition B and the third was assigned to play condition A. In 5% of the randomly
chosen trials, all of the buttons were assigned to play the same signal. Subjects were
allowed to listen to each signal as many times as they liked before deciding which sig-
nal was different from the others, or if they were all the same. The test continued until
eight reversals were recorded or the listener reached a minimum separation difference
angle of 2.4 on 4 consecutive trials.
Fig. 3. The mean separation difference discrimination angle and 95% confidence intervals for a simulated con-
cert hall. The results for the Specular case are significantly different from the results for both the Diffusive cases
[F(2, 36) ¼ 10.42, p < 0.01]. Number of subjects ¼ 13.
EL374 J. Acoust. Soc. Am. 133 (5), May 2013 Robinson et al.: Diffusers and spatial discrimination
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 84.253.223.176 On: Fri, 01 Nov 2013 06:02:34
Robinson et al.: JASA Express Letters [http://dx.doi.org/10.1121/1.4798648] Published Online 8 April 2013
Acknowledgments
P.R.’s research was supported by a U.S. Fulbright grant, funded by Finland’s Center for
International Mobility. Additional funding was provided by The Academy of Finland,
project no. 257099 and the European Research Council grant agreement no. 203636.
Thanks are also due to the listening test participants for their volunteered time and effort.
J. Acoust. Soc. Am. 133 (5), May 2013 Robinson et al.: Diffusers and spatial discrimination EL375
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 84.253.223.176 On: Fri, 01 Nov 2013 06:02:34
Robinson et al.: JASA Express Letters [http://dx.doi.org/10.1121/1.4798648] Published Online 8 April 2013
Gabriel, K., and Colburn, H. S. (1981). “Interaural correlation discrimination: I. bandwidth and level
dependence,” J. Acoust. Soc. Am. 69, 1394–1401.
Hartmann, W. (1983). “Localization of sound in rooms,” J. Acoust. Soc. Am. 74, 1380–1391.
Kleiner, M., Svensson, U., and Dalenb€ ack, B. (1992). “Auralization of QRD and other diffusing surfaces
using scale modeling,” in 93rd AES Convention (Audio Engineering Society, New York), p. 1–13.
Levitt, H. (1971). “Transformed up-down methods in psychoacoustics,” J. Acoust. Soc. Am. 49, 467–477.
Lokki, T., P€ atynen, J., Kuusinen, A., and Tervo, S. (2012). “Disentangling preference ratings of concert
hall acoustics using subjective sensory profiles,” J. Acoust. Soc. Am. 132, 3148–3161.
Lokki, T., P€ atynen, J., Tervo, S., Siltanen, S., and Savioja, L. (2011). “Engaging concert hall acoustics is
made up of temporal envelope preserving reflections,” J. Acoust. Soc. Am. 129, EL223–EL228.
Morimoto, M., and Iida, K. (1995). “A practical evaluation method of auditory source width in concert
halls,” J. Acoust. Soc. Jpn. 16, 59–69.
Perrott, D. (1984). “Concurrent minimum audible angle: A re-examination of the concept of auditory spa-
tial acuity,” J. Acoust. Soc. Am. 75, 1201–1206.
Pulkki, V., Laitinen, M., and Sivonen, V. (2010). “HRTF measurements with a continuously moving
loudspeaker and swept sines,” in Audio Engineering Society Convention (Audio Engineering Society,
New York), Vol. 128, p. 8090.
Pulkki, V., and Merimaa, J. (2006). “Spatial impulse response rendering II: Reproduction of diffuse sound
and listening tests,” J. Audio Eng. Soc. 54, 3–20.
Robinson, P., and Xiang, N. (2010). “On the subtraction method for in-situ reflection and diffusion coeffi-
cient measurements,” J. Acoust. Soc. Am. 127, EL99–EL104.
Robinson, P., and Xiang, N. (2013). “Construction and evaluation of a 1:8 scale model binaural manikin,”
J. Acoust. Soc. Am. 133, EL162–EL168.
Robinson, P., Xiang, N., and D. Antonio, P. (2009). “Measuring the uniform diffusion coefficient:
Synthesized aperture goniometer measurements,” Proc. Meet. Acoust. 6, 015003.
Ryu, J., and Jeon, J. (2008). “Subjective and objective evaluations of a scattered sound field in a scale
model opera house,” J. Acoust. Soc. Am. 124, 1538–1549.
Siltanen, S., Lokki, T., Tervo, S., and Savioja, L. (2012). “Modeling incoherent reflections from rough
room surfaces with image sources,” J. Acoust. Soc. Am. 131, 4606–4614.
EL376 J. Acoust. Soc. Am. 133 (5), May 2013 Robinson et al.: Diffusers and spatial discrimination
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 84.253.223.176 On: Fri, 01 Nov 2013 06:02:34
View publication stats