Professional Documents
Culture Documents
A Statistical Theory of Signal Coherence
A Statistical Theory of Signal Coherence
Melvin J. Hinich
Applied Research Laboratories
The University of Texas at Austin
Austin, TX 78713-8029, USA
(512) 835-3278 (Voice) 835-3278 (Fax)
hinich@mail.la.utexas.edu
ABSTRACT
A periodic signal can be perfectly predicted far into the future since it
perfectly repeats every period. There is always some variation in the
waveform over time for signals which are labeled as periodic but which
are not truly deterministic. A formal definition is presented in this paper
for such a varying periodic signal and the properties of such a class of
signals are exploited. A measure called a signal coherence function of the
amount of random variation in each Fourier component of the signal is
defined and its statistical properties are developed. This signal coherence
function is very different from the coherence function between two
stationary signals. The method is applied to a digitized record of an
acoustic signal generated by a boat in a bay in the Baltic Sea south of
Stockholm, Sweden.
1. INTRODUCTION
A periodic signal can be perfectly predicted far into the future since it
perfectly repeats every period. Nature does not produce perfect periodic
signals. There is always some variation in the waveform over time for
signals which are labeled as periodic but which are not truly
deterministic. A formal definition is presented in this paper for such a
varying periodic signal that is called a randomly modulated periodicity
with period T. The properties of such a class of signals are exploited.
K /2
x (t ) = K 1
[
k= K / 2
k + uk (t )]exp(i 2 fk t ) for fk = k/T (1.1)
1) Periodic block stationarity: The joint distribution of {u0 (t1 ),..., uK / 2 (tn )} is
the same as the joint distribution of {u0 (t1 + T ),..., u K / 2 (tn + T )} for all
2) Finite dependence: {u k1 }
( s1 ),..., ukr (sm ) and {u k1 }
(t1 ),..., ukr (tn ) are
independent if sm + D < t1 for some D and any set of k1 ,..., kr , s1 ,..., sm and
t1 ,..., tn .
K /2 K /2
s (t ) = K 1
k= K /2
k exp(i 2 f k t ) and u (t ) = K 1
k= K /2
uk (t ) exp(i 2 fk t ) . (1.2)
Condition 1) implies that cu (t1 , t2 ) = Eu (t1 + T )u (t2 + T ) = Eu (t1 )u (t2 ) if t1- t2<
T but the equality does not necessarily hold when t1 and t2 are in
2
different frames, that is when t1- t2>T. Consequently the stochastic
term is not cyclostationary (Gardner and Franks, 1975).
Even if the uk (t ) are all jointly stationary random processes, u (t ) is not
{
in general stationary since the joint distribution of uk1 (t1 ),..., ukr (tn ) is not }
invariant to a shift of the time origin. For example the covariance
function of uk (t )sin(2 ft ) is a function of t if uk (t ) is a covariance
2
x (t ) = [1 +
k =1
u1k (t )]cos ( 2 fk t ) + [1 + u2 k (t ) ]sin ( 2 fk t )
3
Assume at first that T is known. The highest harmonic frequency is fK =
K/2T. If the signal is sampled at the frequency 2 fK there are K discrete-
time samples x(tn) in an interval of length T time units. To simplify
1
Amplitude
0
1 10 19 28 37 46
-1
Time
-2
-3
-4
-5
notation the time unit will be suppressed and let t denote tn and T = K
from now on.
A common approach to processing signals with a periodic structure is
to segment the observations into frames of length T so that there is
exactly one waveform in each sampling frame. The term sampling frame
is used in this paper in order to match the terminology used in the
speech processing literature. The waveform in frame m is slightly
different from that in frame m +1 due to variation in the vocal tract.
To further simplify notation set the time origin at the start of the first
frame. Then the start of the mth frame is m = (m 1)T where m=1, M.
4
{ x( m ), ..., x ( m + T 1) :: k = 1,..., K }. This joint distribution is the same for
2. SIGNAL COHERENCE
T 1
X (r ) =
t= 0
x((m 1)T + t ) exp( i 2 f r t ) (2.1)
5
T 1 T /2
=
t= 0
T 1 [
k= t / 2
k + uk (t ) ]exp(i 2 kt / T ) exp( i 2 rt / T )
T 1
= r + U (r ) where U (r ) =
t= 0
u (t ) exp( i 2 fr t ) .
The joint distribution of { x( m ), ..., x ( m + T 1)} is the same for each frame.
These observations are random variables before the data is observed and
thus in keeping with standard statistical notation the index m is not used
to subscript the complex valued random variables X(r) and U(r).
The variability of the complex Fourier amplitude X(r) about its mean r
depends on the complex variance of U(r) and the covariances of U(r1) and
U(r2) , E[U (r1 )U ( r2 )] = u (r1 , r2 ) , where
T 1 T 1
u (r1 , r2 ) =
t1 = 0 t2 = 0
cu (t1 , t2 ) exp(i 2 (r1t1 r2 t2 ) / T ) and (2.2)
exp(i 2 r / T )
T 1 T 1
u2 (r ) = =0 t= 0
cu (t , t + ) , which is of the order O(T). For
u2 (r ) = T T [S u ( fr ) + O (1/ T )]
T 1
=0
(1 / T )cu ( ) exp(i 2 rr / T ) , which equals
The larger the value of u (r ) , the greater the variability of that component
6
from frame to frame. If r = 0 and u (r ) 0 , then that component is
r
2
x (r ) = 2
(2.4)
r + u2 (r )
7
In the signal plus noise representation of {x(t )} the signal-to-noise ratio
(r ) = r u 2 (r ) fr . Thus x2 (r ) = (r ) / [ (r ) + 1] is a
2
is for frequency
x2 ( r )
(r) = (2.5)
1 x2 ( r )
A signal coherence value of 0.44 yields a SNR of 0.24 which is 6.2 dB.
This result will be referred to in Section 5.
The estimation of the signal characteristics is addressed in the next
section.
for each m=1, ,M . The sample mean for each t=0, ,T-1
M
x (t ) = M 1 x( m + t ) (3.1)
m=1
8
To simply exposition assume from now on that M is much larger than D
and thus M 1 x2 (t ) is negligible.
Let X (r ) denote the rth component of the DFT of ( x (0),..., x (T 1)) . It then
asymptotically complex normal with mean zero and finite variance, which
implies that X (r ) is a consistent estimator of r with an error of
y ( m + t ) = x( m + t ) x (t ) , (3.2)
and let Ym(r) denote the rth DFT component of ( y ( m ),..., y ( m + T 1)) . It
follows that Ym(r) is the rth DFT component of (u ( m ),..., u ( m + T 1)) plus
variance u2 (r ) :
M
)
u2 (r ) = M 1 Ym (r ) .
2
(3.3)
m=1
Since the us are D dependent, the Ys are as well. Once again using
)
Theorem 27.5, the distribution of u2 (r ) is approximately normally
defined by
9
2
X (r )
x (r ) = 2
. (3.4)
X (r ) + u2 (r )
OP ( M 1/ 2 ) .
It is vital that the frame length used in the analysis be an integer
multiple of the fundamental period of the signal. If not, then the
measured signal coherence will be biased towards zero. For example,
suppose that the signal is exp i 2 ( t / T + ) and the frame length used is
T+1. The value of the sinusoid at the relative time t in the (m+1)st frame
10
If the residual signal is weakly stationary then it has been shown in
Section 2 that for each r , u2 ( r ) = T [S u ( fr ) + O (1/ T )] and r = 0 . Thus
4 SIMULATION RESULTS
T = 100 .
the noise standard deviation equals the cosines amplitude. Only 100
frames were used for this table in order to match the number of frames
used in the real data analysis presented in the next section. The result of
a run with 5,000 frames and the same parameter settings produced
almost the same signal coherence values.
11
Signal Coherence for Randomly Modulated Sinusoids
.01 1.000
0.1 0.995
0.5 0.895
1.0 0.710
2.0 0.447
4.0 0.243
The signal coherence function and the associated power spectrum were
estimated using a signal from a hydrophone at a depth of 25 meters in a
shallow bay in the Baltic Sea as part of an experiment conducted by the
author and Dr.s Leif Persson and John Robinson at the Swedish
National Defense Establishment (FOA) research facility near Stockholm.
A fast workboat powered by a Volvo Penta four cylinder diesel engine
with two counter rotating propellers was driven towards a bottom
mounted hydrophone at a fixed speed of 12 knots and constant heading
starting at a distance of 420 meters from the hydrophone. The data
record used in this paper lasted 102.4 seconds. The acoustic signal from
the boat plus ambient water noise was recorded and digitized using FOA
equipment on shore. An anti-aliasing filter was applied to the signal prior
to digitizing and the signal was sampled at a rate of 12.5 kHz.
12
The digitized data was high passed filtered by taking the DFT of the
entire N = 1.28 106 observations in the file, zeroing out the bins
associated with the band 0 - 10 Hz and then using the DFT to transform
back into the time domain. This filtered record was then segmented into
102 non-overlapping frames of T = 12,500 observations yielding a
resolution bandwidth of 1 Hz for the given sampling rate.
A plot of the spectrum for the band 10 - 95 Hz is shown in Figure 2 and
a plot of the signal coherence function are shown in Figure 3.
14
12
10
6
Spectrum dB
0
10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95
-2
-4 Frequency Hz
-6
-8
13
The main peaks in the spectrum are at 12 Hz (4.12 dB) and its first two
harmonics 24 Hz (11.47 dB) and 48 Hz (7.29 dB) but the largest peak is
at 24 Hz. The signal coherence function has peaks around those
frequencies but the largest value for
x (r ) is 0.44 at 50 Hz. From the
result presented at the end of Section 2, this coherence value has a SNR
value of 6.2 dB.
The value at 24 Hz is 0.384, which is the second largest value. Although
the throttle was held at a fixed level during the run there was some
variation in propeller rotation speed as the boat moved through the
water. The magnitudes of the signal coherence values at the frequencies
where the spectrum peaks provides a measure of the variation in these
frequencies over time. The spectral peaks confirm what we know, the
signal has a mean periodic structure due to the motor and fuel pump,
which are the major noise sources in this experiment. The size of the
0.5
0.45
0.4
0.35
Signal Coherence
0.3
0.25
0.2
0.15
0.1
0.05
0
10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95
Frequency Hz
14
signal coherence peaks are surprising since they show that the "periodic"
components of the spectrum are highly modulated even though the
motor speed was fixed. The modulation is not due to the ambient noise
since the sound level from other ships was low during the time the runs
were made and the broadband water noise spectrum was small relative
to the acoustic motor signal.
The program that I wrote to compute the spectrum and the signal
coherence computes more statistics than have been presented so far. In
order to determine the correlation distance, correlations are computed
for each frequency in the band over adjacent frames. The correlations for
the data used here were computed for 15 adjacent frames. The
correlation between two adjacent frames for the peak at 24 Hz is 0.87.
The correlation across frames linearly decreases to become statistically
insignificant for a span of six frames. The other bands had insignificant
correlation values. These results provide an estimate of the across frame
dependence of less than one milsecs.
Acknowledgments
I thank Dr. Leif Persson and Dr. John Robinson for their
encouragement to explain this new idea. I also thank Dr. Geoffrey
Edelson and the referees for their constructive criticisms and suggestions
for improving this work.
REFERENCES
Brillinger, D. (1975), Time Series, Data Analysis and Theory, New York:
Holt, Rinehart, and Winston.
15
Carter, G.C., C.H. Knapp, and A.H. Nuttall (1973), "Estimation of the
magnitude-squared coherence function via overlapped Fast Fourier
Transform Processing, "IEEE Trans. On Audio Electroacoustics AU-21,
337-344.
Jenkins, G.M. and D.G. Watts (1968), Spectral Analysis and its
Applications, San Francisco: Holden Day.
16