Professional Documents
Culture Documents
06824044
06824044
cepstrum analysis
Yeou-Jiunn Chen, Aaron Raymond Ang See and
Shih-Chung Chen
The power cepstrum-based parameters for steady-state visually evoked
potential (SSVEP) is proposed. To precisely represent the characteristics of frequency responses of a visually stimulated electroencephalography (EEG) signal, power cepstrum analysis is adopted to estimate
the parameters in low-dimensional space. To represent the frequency
responses of SSVEP, the log-magnitude spectrum of an EEG signal
is estimated by fast Fourier transform. Subsequently, the discrete
cosine transform is applied to linearly transform the log-magnitude
spectrum into the cepstrum domain, and then generate a set of coefcients. Finally, a Bayesian decision model with a Gaussian mixture
model is adopted to classify the responses of SSVEP. The experimental
results demonstrated that the proposed approach was able to improve
performance compared with previous approaches and was suitable
for use in brain computer interface applications.
Introduction: Motor neuron disease weakens the muscles that consummates into loss of voice and voluntary controls of the patients various
limbs, hence detaching them from the outside world [1]. To address
the situation, the augmentative and alternative communication (AAC)
system could actually help patients with neuromuscular impairments
to communicate with other people or devices by sending messages or
commands. Thus, various types of interfaces based on voices or gestures
have been widely used in AAC systems [2, 3]. However, these interfaces
were not suitable for neuromuscular impairments. Recently, the brain
computer interfaces (BCIs) are used to design communication systems
by using electroencephalography (EEG) analysis [47]. It sends messages or commands to the external world through direct measurement
of brain activity, thus it is a suitable interface for neuromuscular impairments. Therefore, to develop a suitable BCI is benecial for patients
with neuromuscular impairments so as to communicate with other
people or devices.
For BCI approaches, to determine the users response, the steady-state
visually evoked potential (SSVEP)-based BCI applies visual stimulus at
specic frequencies and then the electrical activity of the brain is
measured [4]. SSVEPs are advantageous in BCIs because of their excellent signal-to-noise ratio and relative immunity to artefacts. Therefore, to
achieve the best performance of BCIs, many researchers have focused
on selecting stimulator devices and stimuli properties [5]. However, in
practice, the frequency response of a user not only contains many
noises but also a reduction in the level of measured electrical activity.
The accuracy would be greatly reduced, thus to develop a sensitive
parameter would improve the performance of the SSVEP-based BCI.
To actually classify the frequency responses, the coherence and nonstationary signal analysis methods have been used in BCIs [6]. These
approaches are computationally complex and cannot be applied to realtime systems. Another approach, the thresholding method, uses the spectral magnitude estimated by fast Fourier transform (FFT) to classify the
frequency responses [7]. As the threshold is dependent on specic users,
it is difcult to apply to other users. Furthermore, the frequency
responses of a user are usually not only noisy but also lower than the
threshold. Thus, it would greatly reduce the accuracy of the systems
in real-time applications.
To efciently reduce computational time, the parameter dimensions
should be reduced without losing the precision of representing the
characteristics for frequency responses. For data compression, the discrete cosine transform (DCT) had been successfully used to compress
EEG data [8]. DCT then could efciently represent the characteristics
of signals in low-dimensional space. In addition, the cepstrum coefcient derived from the log-magnitude spectrum has been widely used
in speech technologies [9]. It can preserve most information in its spectral envelope. As the log-magnitude spectrum contains the major information of frequency responses, the integrating DCT and cepstrum
would be able to effectively represent the signicant information of
the log-magnitude spectrum in low-dimensional space.
In this Letter, a SSVEP-based BCI using a power cepstrum is proposed to precisely represent the characteristics of frequency responses
in low-dimensional space. To precisely represent the characteristics of
the frequency responses, the power cepstrum analysis integrating FFT
and DCT is proposed to derive coefcients in low-dimensional space.
N
1
X (k) = log
x(n)ei2pk(n/N )
(1)
n=0
p
1
m+ k
X (m) cos
2
M
m=1
M
1
(2)
= arg max
vi
P (v i )
P(C |vi )
P(C)
(3)
As the denominator P(C ) is a constant term for all classes and the prior
probability P(i) is treated as a uniform distribution, (3) can be rewritten
as
(4)
J
wj N C mj , sj
(5)
j=1
where wj, j and j are the mixture weight, mean and the standard
variance associated with the jth Gaussian
J component. wj is subject to
the following constraint: wj 0 and
j=1 wj = 1. These parameters
of the GMMs are estimated by the expectation maximisation
algorithm [10].
Experimental results: To evaluate the proposed approach, ve subjects
(four males and one female) aged between 21 and 23 participated in this
study. The frequency responses were stimulated using a liquid crystal
display screen and the visual stimulator (as shown in Fig. 1) ickering
at ve frequencies from 6 to 10 Hz with 1 Hz increments. Then, the
EEG signals were measured using electrodes placed at the Oz, A1 and
A2 (ground) in accordance with the international EEG 10-20 system.
The sampling rate and the frame size were set to be 1 kHz and 1 s,
respectively. Subsequently, using 4096 points FFT, the log-magnitude
spectrum can be estimated and band limited from 5 to 11 Hz. The
number of cepstrum coefcients used range from 6 to 10 coefcients.
As each subject had 60 epochs for each specic frequency, the
number of mixtures in GMM was set to be two mixtures.
go forward
8 Hz
turn left
7 Hz
turn right
9 Hz
dive
elevate
10 Hz
6 Hz
95.42
91.67
87.08
97.50
91.25
95.00
91.67
86.67
97.08
92.08
95.00
92.92
88.33
93.75
92.50
95.42
94.17
85.83
95.00
92.50
accuracy, %
95
90
85
80
proposed approach
log magnitude spectrum
75
thresholding method
70
8
frequency, Hz
10