Professional Documents
Culture Documents
Coursework 2: Digital Audio Principles - Dither
Coursework 2: Digital Audio Principles - Dither
Coursework 2: Digital Audio Principles - Dither
Mark Thompson
The two prime operations which facilitate the analogue to digital conversion (ADC) of a signal are
sampling and quantisation – generally occurring in that order [Lipshitz, Vanderkooy, 2004: 205]. The
fundamental theory behind the sampling operation (first outlined by Harry Nyquist, 1928) observes
the faultless reconstruction of the original analogue signal. This can be explained mathematically as
sampling is a linear process, and an inverse Fourier transform of the sampled spectrum in the
frequency domain will output the original input signal [Widrow, et al, 1996]. The implementation of
the Nyquist-Shannon sampling theorem is, however, dependent on a number of idealised conditions
[Jerri, 1997: 1565]. Conversely, the quantisation of the analogue signal is a non-linear operation, and
results in an inherent quantisation error, as the finite bit-per-sample resolution cannot precisely
represent the infinite resolution of the analogue signal voltage [Lipshitz, Vanderkooy, 2004: 205].
Examination of how this quantisation error presents during PCM-based ADC is necessary to assess
the implementation of subtractive and non-subtractive dither, and the relative efficacy of the types
of dither noise utilised.
Figure 1: Uniform quantisation "staircase" functions. a) Mid-tread function, and b) mid-riser function - ∆ represents the
quantisation interval [Wannamaker, et al, 2000: 500]
In 1956, Widrow developed a statistical theory of quantisation, relating the operation of quantisers
as a discretization of amplitude through probability density functions, much as sampling is a
discretization of time [Widrow, et al, 1996]. The operation of quantising involves assigning binary
values closest to the input signal’s amplitude at the positions specified during sampling [Lipshitz,
Vanderkooy, 1984: 108]. The underlying problem associated with quantisation, is that values at the
∆ ∆
∆
( )
√
Maximum possible error is from ± ∆/2, the RMS value of the quantisation error can be derived by:
( ) ∆ √
∆ ∆
√ √
∆
√ √
( ) ( ) ( √ )
∆ √
( √ )
( ) (√ )
The effect of the quantisation error is largely determined by the input analogue signal. Where audio
signals are loud and complex, quantisation error manifests at the noise floor as low-level white
noise, independent of the signal [Lipshitz, Vanderkooy, 1984: 108]. For low-level, relatively simple
signals, quantisation errors also occur as low level noise (subject to modulation), however the
artefacts introduced are correlated with the input signal [Lipshitz, Vanderkooy, 1984: 108]. This
correlation with the input signal results in artefacts appearing as harmonic and intermodulation
distortion [Lipshitz, Vanderkooy, 2004: 205], (see figure 1d) to which the human ear is more
sensitive than white noise. This can be explained as the operation of the cochlear and neural
pathways can be functionally described by a series of Fourier transforms in the analysis of complex
sounds [Altes, 1978: 178].
The usefulness of dither was first realised in the removal of contouring effects (produced due to
quantisation error) from PCM video [Lipshitz, Vanderkooy, 1984: 107]. In the early 1960’s, Roberts
applied pseudo-random noise to the transmission of digital television pictures. He noted that by
adding noise prior to quantisation and subtracting the same dither signal from the output, the
resultant quantisation error of the system was independent from the input signal [Lipshitz, et al,
1992: 363]. Roberts derived an expression for the mean-square error (E) of PCM systems, which is
comprised of variance (V) and deviation (D) [Roberts, 1962: 148]. Variance refers to the apparent
white noise of a signal, defined by variance from the mean value of the output. Deviation refers to
the deviation of the mean output ( ) from the input ( ), or a measure of the signal-correlation of
the quantising noise [Schuchman, 1964: 164].
( ) ( )
Introduction of the dither signal reduced the deviation during Roberts’ testing, producing better
perceived picture quality despite an increase in the total noise (E) [Schuchman, 1964: 164]. A
doubling in efficiency of each PCM channel was also observed, where the resolution could be
reduced from 6 to 3 bit resolution without degradation of the input signal [Roberts, 1962: 154]. The
issue with this subtractive dither (SD) operation is that it requires the same pseudo-random dither
signal to be summed prior to quantisation, as well as prior to DAC for subtraction of the dither.
Synchronisation of the noise generators or transmission of the original dither signal is not always
practical and added digital processing or re-quantising of the signal would not satisfy the conditions
required for a complete subtraction of the dither [Wannamaker, et al, 2000: 501].
Non-subtractive dither (NSD) (figure 4) does not subtract the dither signal after quantisation, and
the practical implications of SD do not apply. The issue with NSD quantisation systems arises in the
additional errors (aside from the quantisation error) which also comprise a portion of the total error
of the system [Wannamaker, et al, 2000: 502]. Wannamaker [et al, 2000: 502-504] mathematically
explained a number of theorems inherent with NSD systems, most significantly the improbability for
the total error to be uniformly distributed or statistically independent of the input signal. Where
conditions do allow an input independent total error, the resultant error always exceeds the total
quantisation error associated with SD quantisation systems [Wannamaker, et al, 2000: 504]. Despite
these comparative drawbacks, the use of NSD is far more desirable than dither-less systems.
Figure 4: System block diagrams and relative differential equations for a) subtractive dither system, and b) non-
subtractive dither system [Wannamaker, et al, 2000: 501]
The overall application of dither with regards to digital PCM systems cannot be addressed
thoroughly without consideration for types of dither noise used. The influence of specific type of
dither noise utilised is most significant at quantisation resolutions of less than 6 bits [Rabiner and
Johnson, 1972: 1487]; at these bit depths, variations in statistical non-linearities with the input signal
and noise modulation is most audible. In listening tests published by Wannamaker, et al, [2000: 514]
rectangular or uniform probability density function (RPDF) dither, with a width equal to the
quantisation interval, succeeded in eliminating the quantisation related distortion, however resulted
in noticeable noise modulation. Triangular-PDF dither with amplitude of twice the quantisation
interval is recommended by the author in the same listening test, as both distortion and noise
modulation are removed [Wannamaker, et al, 2000: 514]. The required amplitude does however
reduce the SNR of the system (by approximately 4.8dB) as the quantisation noise is increased
[Lipshitz, Vanderkooy, 2004: 208]. In addition to standard white spectrum TPDF dither, a high-pass
spectrum TPDF dither can be implemented such that the applied noise is less audible as it is in the
upper ranges of the audible threshold [Wannamaker, et al, 2000: 510]. Dither of RPDF and TPDF
types are sensitive to their amplitudes in relation to the quantisation intervals however, non-integer
values of the given amplitudes will result in a failure to prevent distortion or noise modulation
[Lipshitz, Vanderkooy, 2004: 211].
Broadly speaking, the application of dither in modern, high resolution conversion using PCM is
standard. Regarding experimental evaluations, the case for applying dither to quantisation in
extremely low bit depth systems is not always confirmed. For example Rabiner and Johnson’s [1972:
1487] experimental evaluation confirmed that the addition of dither to PCM speech signals at
quantisation resolutions of 2 and 3 bits resulted in reduced speech intelligibility due to the noise of
the dither. Mathmatically, for a 1 bit resolution system, a TPDF dither signal would be undesirable
[Lipshitz, Vanderkooy, 2004: 208], as the width of the dither would account for the entire input
range with amplitude of 2 LSBs (figure 5). Aside from specific exceptions, the use of non-subtracting
TPDF dither as a solution to non-linear quantisation noise is generally recommended [Wannamaker,
et al, 2000: 514] and should be applied for any quantisation or re-quantisation operation for PCM
systems [Lipshitz, Vanderkooy, 2004: 210].