Professional Documents
Culture Documents
4 Brain Activity During Motor Imagery and Mental Arithmetic Deep Learning Approach For EEG
4 Brain Activity During Motor Imagery and Mental Arithmetic Deep Learning Approach For EEG
Min-Kyung Jung
Graduate School
Korea University
August 2023
Brain Activity During Motor Imagery
and Mental Arithmetic: Deep Learning
Approach for EEG
by
Min-Kyung Jung
_____________________________________
Graduate School
Korea University
August 2023
ii
The thesis of Min-Kyung Jung has been approved
by the thesis committee in partial fulfillment of
the requirements for the degree of
Master of Engineering
June 2023
__________________________
Committee Chair: Dong-Joo Kim
__________________________
Committee Member: Jae-Ho Han
__________________________
Committee Member: Hakseung Kim
iii
Brain Activity during Motor Imagery and Mental
Arithmetic: Deep Learning Approach for EEG
by Min-Kyung Jung
ABSTRACT
i
graph neural network (GNN) model was proposed that achieves high classification
performance using spatial brain activity and functional connectivity, and the explanation of
the trained model operation based on explainable artificial intelligence was examined. The
GNN utilized connectivity between the left centroparietal-left frontal regions and the
frontoparietal connection as an important feature for mental arithmetic classification. The
findings of this thesis are expected to provide valuable insights to future researchers who
aim to advance brain-computer interface technology based on a thorough investigation of
brain activity.
ii
동작 상상 및 암산 동안의 뇌 활동에 대한 연구:
딥러닝 기반의 뇌파 분석을 통해
정민경
뇌공학과
지도교수: 김 동 주
국문 초록
iii
방법에 대해 논하고자 한다.
인공지능
iv
PREFACE
Jung, Min-Kyung, et al. "Phase Transition in previous Motor Imagery affects Efficiency of
Motor Imagery based Brain-computer Interface." 2021 9th International Winter Conference
on Brain-Computer Interface (BCI). IEEE, 2021.
Jung, Min-Kyung, et al. "Interpretability of Hybrid Feature Using Graph Neural Networks
from Mental Arithmetic Based EEG." 2023 11th International Winter Conference on Brain-
Computer Interface (BCI). IEEE, 2023.
v
TABLE OF CONTENTS
ABSTRACT ........................................................................................................................... i
국문 초록 .............................................................................................................................iii
PREFACE.............................................................................................................................. v
REFERENCES .................................................................................................................... 41
vii
LIST OF TABLES
Table 1.1 Detailed comparison of major control signals used in brain-computer interface
systems .................................................................................................................................. 4
viii
LIST OF FIGURES
Figure 2.1 ERSP of all subjects (use STFT, window size 2 seconds, update period 0.04
seconds). (A) ERSP of selected trials, (B) ERSP of non-selected trials.............................. 15
Figure 2.2 Comparison of the MI classification accuracy between the total-based and upper
-2σ selection-based model ................................................................................................... 19
Figure 2.3 ERSP Difference plot (A) ERSP Difference plot of selected left class trials, (B)
ERSP Difference plot of selected right class trials, (C) ERSP Difference plot of non-
selected left class trials, (D) ERSP Difference plot if non-selected right class trials .......... 21
Figure 3.1. Model performance trained on both datasets. (A) is about connectivity (CON)
and (B) is about connectivity and PSD features (CON-PSD). Red, green, and blue donate
accuracy, recall, and precision, respectively ....................................................................... 30
Figure 3.2. Circular graphs depicting the connectivity of envelope correlation between
nodes, and topographic visualization of each PSD band. (A), (B), and (C) were obtained
from 20, 21, 24 subjects in CON-PSD dataset. The upper and lower rows represented each
baseline (BL), mental arithmetic (MA). Delta, theta, alpha, beta, and gamma PSD bands
were colored purple, blue, green, orange, and red, respectively ......................................... 32
Figure 3.3. Average of subgraphs that found as important in task classification by the
GNNExplainer in (A) subject 20, (B) subject 21, and (C) subject 24 ................................. 34
Figure 2.1 ERSP of all subjects (use STFT, window size 2 seconds, update period 0.04
seconds). (A) ERSP of selected trials, (B) ERSP of non-selected trials.............................. 15
ix
Part I:
Introductory Materials
1
CHAPTER 1. Overview of Brain-Computer Interface
Brain-computer interface (BCI) technology refers to the direct connection between the
human brain and computer, enabling the control of computers through
electroencephalography (EEG), and it can be considered as the ultimate interface between
humans and computers [1]. BCI technology is broadly categorized as human-computer
interface (HCI) technology, and it is also called brain-machine interface (BMI) because it
allows the manipulation of machines such as wheelchairs and robots through EEG. BCI
technology analyzes brain activity to understand the intentions of humans or animals, and
applies it to control robots or computers. The implementation of BCI technology involves
acquiring EEG through devices and then processing the signals to analyze them before
issuing commands to the input and output devices [2].
2
1.2 Electroencephalography
The BCI technology is classified into invasive and non-invasive methods depending on
the location of the EEG measurement, and into evoked signals and spontaneous signals
depending on the EEG measurement paradigm, as shown in Table 1.1 [3]. Invasive methods
in BCI involve the insertion of a microchip into the scalp to measure brainwaves, while non-
invasive methods measure brainwaves externally through helmets or headset devices.
Although non-invasive methods are easier to use, they are susceptible to signal noise,
making accurate measurement difficult. Evoked signal methods require training to induce
the appearance of specific EEG, while spontaneous signal methods analyze EEG to
recognize simple thoughts or actions, enabling the direct transmission of user intent to a
machine. [4]. If evoked-based BCI is like using Morse code to communicate, then the pattern
recognition-based BCI can be compared to having a simple conversation over the phone.
Even though it involves training the user to generate specific EEG patterns, if the training is
consistent with the user's intention, it falls into the category of EEG recognition.
3
Table 1.1 Detailed comparison of major control signals used in brain-computer
interface systems
EEG used in the analysis of BCI are obtained by measuring the subtle signals of the
brain surface, which are synthesized by the electrical signals generated by numerous neurons
in the brain. EEG signals vary spatiotemporally depending on brain activity, measurement
state, and brain function, and show frequencies of 1-50 Hz and amplitudes of approximately
10-200 uV. All analysis methods that can be applied to time series are available for brain
wave analysis [5], including 1) time analysis: mean, standard deviation, autocorrelation
function, cross-correlation function, etc., 2) frequency analysis: power spectrum, etc., 3)
time-frequency analysis: short-time Fourier transform (STFT), wavelet transform, Wigner
distribution, etc., 4) statistical models: autoregressive, etc., and 5) spatiotemporal analysis:
principal component analysis (PCA), independent component analysis (ICA), etc. In
addition, with the advancement of signal processing, machine learning and deep learning
methodologies are actively being utilized in the analysis of EEG [6, 7].
4
1.3 Motor Imagery
5
1.4 Mental Arithmetic
6
1.5 Thesis Objectives
BCI research has been primarily focused on the ultimate goal of manipulating external
machines solely through thoughts, thus often involving mental simulations without actual
physical actions [19, 20]. Unlike mapping brain responses to external sensory-motor stimuli,
the mapping of brain regions activated by such tasks remains a central challenge in BCI
research due to the distributed nature of brain networks involved in cognitive tasks and
limitations of various imaging techniques [21]. The high temporal resolution of EEG is
inherently suitable for mapping the dynamic engagement of brain regions during mental
processes, leading to increased interest in EEG within recent BCI research [22].
Representative BCI tasks include MI and MA, which predominantly elicit brain activation
around the central area and the prefrontal cortex covering the forehead, respectively.
Moreover, recent advancements in data analysis methodologies have led to the active
application of deep learning approaches in EEG-BCI research, achieving high performance.
However, deep learning algorithms exhibit characteristics that make it difficult to directly
examine the inferential process, resembling a black box. Consequently, various attempts are
being made to overcome these limitations and interpret the functioning of models. In this
thesis, deep learning algorithms were applied for MI and MA classification, and through
additional analysis, the operating mechanisms of the models were investigated to examine
brain activity during the tasks. The contributions of this thesis can be summarized as follows:
2) Confirming the advantages of integrating the analysis of spatial neural activity and
functional connectivity during classification tasks for BCI.
3) Investigating the functioning of the trained models and discussing brain activity during
mental tasks.
7
Part II:
Main Study
8
CHAPTER 2. Phase Transition in previous Motor Imagery
affects Efficiency of Motor Imagery based Brain-computer
Interface
The contents of this chapter are approved for publications in: 2021 9th International Winter
Conference on Brain-Computer Interface (BCI).
DOI: 10.1109/BCI51272.2021.9385321
2.1 Summary
9
2.2 Introduction
10
2.3 Materials and Methods
The open-access MI dataset from BCI competition IV 2b [30] was used for this study.
The dataset includes EEG recordings from 9 subjects during binary MI task (i.e., left hand
and right hand). The training data, which consisted of the first 2 sessions and the 3rd session,
comprised 240 trials without visual feedback and 160 trials with visual feedback,
respectively. The validation data consisted of 2 sessions of EEG data that comprised a total
of 320 trials. EEG data were recorded at a 250 Hz sampling rate from 3 bipolar (C3, Cz, and
C4) and 3 monopolar EOG channels. Details of data acquisition protocol are stated in [30].
The actual recorded data were re-referenced using common average reference (CAR)
and filtered with bandpass of 1 to 40 Hz (FIR filter). The baseline set at -3 to -1 seconds was
removed due to contamination by beep sound and rejected trials was eliminated. The
preprocessing of EEG signals data was conducted using MATLAB R2018b (MathWorks,
MA, USA) and EEGLAB toolbox [31].
In order to reflect the change in the state of the brain, the dataset was configured as
follows. In the data after signal processing, the window size is 2.25 seconds, and the update
period is composed of 0.25 seconds. Window size and update period were selected
empirically to match the data balance in learning the model. The EEGNet [32] used in this
study does not perform feature extraction because it uses the temporal signal as it is, and
details are described in Model and training part. Two classifications were performed using
the dataset. The first classifies whether a cue exists in the window, and the second classifies
whether the left or right in the 3~7 seconds MI section.
11
Model and Training
EEGNet was used for both cue detection to detect changes in brain state and MI
classification to distinguish MI of both hands. EEGNet is a model that can be trained with
minimal pre-processing of EEG signals, and the model can be trained with relatively little
trial compared to other artificial intelligence models for EEG analysis.
Model was trained with Adam [33] optimizers and label smoothing cross-entropy loss.
Cosine annealing learning rate scheduler [34] and early stopping [35] were also used. A 5-
fold cross-validation was performed. The EEG signals for the construction (i.e., training and
testing data) of the subject-specific model were collected only within a single subject.
Data Selection
Before MI was performed, data was selected in the following way to determine the
effect of changes in brain state on the classification performance of the model. Using the
softmax values of the model that detected changes in brain states, the difference between the
mean softmax values before and after cue is calculated to compare the distribution across
the entire trial. In order to calculate the optimal threshold of the data selection algorithm
proposed in this study, the criteria of 0 σ(median), 1σ (top 84.1%) and 2σ (top 97.2%) are
used.
12
Spectral Perturbation
The event-related spectral perturbation (ERSP) [36] analysis was performed to confirm
the differences appearing in the three channels. ERSP is a time-frequency analysis of EEG
signals and is an analysis method used to observe the frequency change over time. In this
study, a short-time Fourier transform (STFT) [37] is used to perform time-frequency analysis.
The calculated STFT is analyzed in two ways. First, frequency analysis according to cue was
conducted. Using the grand average on the estimated STFT, we analyzed the change in
power appearing in all subjects. Second, in order to compare the MI of the right hand and
the left hand, ERSP is calculated by the following formula.
(2.1)
Through this calculated ERSP difference, it is possible to observe the change in spectral
power during MI in each channel. The following formula was applied equally in three
channels.
Statistical Analysis
Non-parametric analysis was used in this study due to the small cohort size (each dataset
evaluated 5 times). The differences in the classification accuracy between the models were
assessed by the Wilcoxon rank-sum test. Statistical significance was based on p < 0.05.
ERS
Statistical analysis was performed using SPSS 24 (IBM Corp., Armonk, NY, United States).
13
2.4 Results
In our study, we verified that the brain state transition is detected according to the
change of time (Figure 2.1). Table 2.1 presents the detection accuracy for each of the 5 folds
and the overall performances was 72.42 ± 3.12%. To identify these transition, we performed
supplementary analysis. When trials exceed certain threshold, change of ERSP appeared
before and after cue, as shown in Figure 2.2. Especially, before cue high δ band power was
appeared, after cue the power was decreased 9.95%. Unlike, ERSP of trial that cannot exceed
threshold No noticeable change has emerged at δ band Units.
14
Figure 2.1 ERSP of all subjects (use STFT, window size 2 seconds, update period 0.04 seconds). (A) ERSP of selected
trials, (B) ERSP of non-selected trials
15
Table 2.1 Cue detection accuracy
16
Model Performance Improvement Through Optimal Threshold
17
Table 2.2 Classification performance of selected-base model MI
18
Figure 2.2 Comparison of the MI classification accuracy between the total-based and
upper -2σ selection-based model
19
Difference of Each Class
Based on these results, right and left EEG signal of a top 97.8% trial was compared. As
shown Figure 2.3, When left and right MI were performed, C3 and C4 channel showed
different aspects respectively. When imagining the left-hand MI, the power appears to be
10.3 % larger in the α band on the C3 channel and 5.8 % greater in the β band. On the other
hand, right hand MI can see 8.2 percent larger than the high β band of C4. However, in both
hemispheres of the unselected trials, there is no difference in spectral power.
20
Figure 2.3 ERSP difference plot (A) ERSP difference plot of selected left class trials,
(B) ERSP difference plot of selected right class trials, (C) ERSP difference plot of non-
selected left class trials, (D) ERSP difference plot if non-selected right class trials
21
2.5 Discussion and Conclusion
Further, by rejecting trials that had no difference between the state before and after cue,
notable increase in classification performance was achieved (Figure 2.2). The rejected trials
could not distinguish between right and left MI ERSP (Figure 2.3). In other words, there is
no significant difference in both hemispheres of brain activity pattern between the right-hand
and the left-hand MI, which could indicate that the subject was not sufficiently prepared to
proceed with the MI task. In this context, selecting appropriate (i.e., well-prepared) trials
might be improving the performance of the classification models.
Some limitations should be mentioned; this study was not able to analyze the overall
changes in brain activity patterns since the EEG was measured using only three electrodes
(C3, C4, and Cz). Second, though the change in delta band power before and after the cue
was observed, it is unclear how this change could affect the MI. Follow-up investigation
using multidimensional MI-BCI dataset is warranted.
22
CHAPTER 3. Interpretability of Hybrid Feature Using Graph
Neural Networks from Mental Arithmetic Based EEG
The contents of this chapter are approved for publications in: 2023 11th International Winter
Conference on Brain-Computer Interface (BCI).
DOI: 10.1109/BCI57258.2023.10078648
3.1 Summary
An increased cognitive load could lead to increased human error and decreased
performance in workplaces [40, 41], and temporary attention impairment [42]. Given that
an adequate level of attention and cognitive capacity is crucial for problem-solving [43],
monitoring and evaluation of cognitive load could be helpful in a wide variety of situations.
The network organization of the brain can be understood via graph theory [51]. With
graph neural network (GNN), a deep learning method designed to be applied to graphs,
recent studies have reported notable classification performance in emotion recognition [52],
identifying reading tasks [53], and mental arithmetic tasks [54]. Nonetheless, the existing
GNN models do not offer an explanation of brain regions and spectral components that were
more relevant in model output.
This study aims to utilize a GNN for the identification of the mental arithmetic (MA)
task. In this endeavor, the connectivity and spectral information were extracted from EEG
recordings during MA task and subjected to a state-of-the-art GNN model. Further, the
GNNExplainer was applied to grant the model an explainability on its output. The methods
described in this study are expected to yield high classification performance, and specify the
brain regions and EEG bands that were relevant to MA tasks.
24
3.3 Materials and Methods
Dataset Description
An open-access EEG dataset that contains recordings of two mental states, i.e., MA and
baseline (BL) were assessed and subjected to analyses [55]. The dataset comprises twenty-
nine healthy subjects (14 males, 15 females) with an average age of 28.5 +- 3.7 years. The
EEG recordings were acquired from 30 electrodes placed according to the international 10-
5 system at a sampling frequency of 200 Hz. The MA experiment consisted of 3 sessions,
each comprised of 20 trials. MA and BL trials were presented randomly 10 times in each
session. During MA tasks, subjects were instructed to repeatedly perform simple arithmetic
calculations. Each subject underwent a total of 60 trials.
Preprocessing
All data processing was conducted with MATLAB R2020a (MathWorks, Natick, MA,
USA) and BBCI toolbox [56]. First, the raw EEG signal was re-referenced using the
common average reference. The recordings of each trial were segmented into 10-seconds
epochs that contained a task period. The dataset was filtered by a band-pass of 1 to 40 Hz
(fifth-order Butterworth filter). Electrooculography (EOG) channels were rejected for
analysis.
25
EEG Connectivity and Power Spectral Density
Edge and node features were extracted from EEG raw signals, because the GNN model
receives graph-type data. For computing edge features, connectivity was utilized. Then, for
extracting node features, the node strength and power spectral density were used. The
connectivity was derived by the envelope correlation method to generate a connectivity
matrix that represents the relationship of inter-node [57]. To compute envelope correlation,
analytic signals were obtained by applying the Hilbert transform, and the mean of Pearson
correlation between orthogonalizing values of two signals was calculated. The inter-node
relations were evaluated via adjacency matrices that represent connectivity, distance, and
degree information. In addition, the node strength was used as node feature. Power spectral
density values of delta (1-4 Hz), theta (4-8 Hz), alpha (8-12 Hz), beta (12-30 Hz), and gamma
(30-40 Hz) frequency bands were utilized to derive node features. Ultimately, each graph
yielded 435 edges above the main diagonal of the square matrix, and the dimension of node
features was 30 * 6.
Experiments
GNN model was trained by optimizing the negative log-likelihood loss function using
Adam [33] and backpropagation. The 5-fold cross-validation method is followed to evaluate
the performance of the trained model; the 5-folds average was calculated to evaluate the final
model performance. To avoid possible overfitting problems, data augmentation was
executed to increase the size of the training dataset size to ten times. Augmented data was
generated by adding Gaussian noise with μ = 0, and σ = 0.001 to the original data. As for the
model evaluation, only the non-augmented original test dataset was used. Experiments were
conducted once for each CON and CON-PSD features.
26
Classification using Graph Neural Networks
To identify and classify the MA task-related brain activity patterns, this study utilized
hierarchical graph pooling with a structure learning model (HGP-SL) [58]. HGP-SL is the
state-of-the-art method for enhancing the effectiveness of GNN, and has shown notable
performance on open datasets for graph classification (e.g., Mutagenicity [59], PROTEINS
[60], etc.). HGP-SL enhances GNN by 1) graph pooling that effectively preserves a subset
of informative nodes, the 2) structure learning that performs graph structure learning for the
pooled subgraph.
p = ( i ) = (I −(D ) A ) H
i i
−1
i i
1
(3.1)
, where Ai and Hi were the adjacent and node representations matrices, and || · ||1 performs
L1 norm row-wisely. Di denotes the diagonal degree matrix of Ai, and Ii the identity matrix.
Subsequently, the pooled subgraph is generated by selecting a pooling ratio that of high-
scored nodes. Edge and node features of the subgraph were imported as of selected nodes.
The other characteristic of HGP-SL, i.e., structure learning, derives edges by evaluating
attention score. for nodes that disengage from each other after pooling, the graph structure
information vanishes. To avoid this problem, the HGP-SL employs an attention mechanism
that permutes the similarity between preserved nodes and can be formulated as follows:
(
Ei ( p, q ) = a H i ( p,:) H i ( q,:) ) + A ( p, q )
i (3.2)
, where σ(·) is the activation function and || represents the concatenation operation. Hi (p, :)
and Hi (q, :) indicate the p-th and q-th row of matrix Hi, which denote the representations of
node vp and vq, respectively.
Next, the similarity score was normalized to better compare the similarity between
27
edges. Normalization was conducted via sparemax function that returns the Euclidean
projection of input onto the probability simple, and can be formulated as follows:
Si ( p, q ) = sparsemax ( Ei ( p, q ) ) (3.3)
, where [x]+ = max [61], and τ(·) was the threshold function that returned a threshold
according to the procedure. Thus, sparsemax preserves the values above the threshold, and
the other values will be truncated to zeros.
Graph level of hierarchical representation value was extracted from each layer by
readout method that performed global mean pooling and global max pooling. The last layer
(e.g., multi-layer perceptron (MLP) and softmax layer) estimates the label using the
concatenated value of representation after the HGP-SL layer.
GNNExplainer
The GNNExplainer identified a subgraph structure that has a crucial role in trained
GNN’s classification [62]. The primary objective for the GNNExplainer is to generate a
subgraph that explains the decision. This process is maximizing the mutual information
between the subgraph and the computation graph. The secondary objective was the subgraph
needs to be minimal. The GNNExplainer is literally the combination of prediction loss and
edge size loss. The GNNExplainer was performed in the following order. In first, the
computation graph which is the k-hops neighbor for node classification, or the entire graph
for graph classification was extracted. Then, a mask for each edge in the computation graph
was initialized. And then, a neural network that learns the mask with loss was constructed.
Lastly, the mask was used, to reduce the computation graph to a minimal graph.
28
3.4 Results
Model Performance
29
Figure 3.1 Model performance trained on both datasets. (A) is about connectivity (CON) and (B) is about connectivity and
PSD features (CON-PSD). Red, green, and blue donate accuracy, recall, and precision, respectively
30
PSD and Envelope Connectivity Pattern
During MA trials, certain trends were observed in connectivity and PSD. As for the
connectivity, the connectome within frontal channels (AFp1, AFp2, AFF1h, AFF2h, AFF5h,
AFF6h, F3, F4, F7, and F8) was increased or maintained, as illustrated in Figure 3.2. In
addition, the connection within the parieto-occipital lesion (P4, P8, PPO2h, POO2, Pz,
POO1, PPO1h, P3, and P7) was stronger than baseline. In terms of PSD, theta-band power
at the frontal electrode had increased, whereas beta-band power had decreased in the central
region.
31
Figure 3.2 Circular graphs depicting the connectivity of envelope correlation between nodes,
and topographic visualization of each PSD band. (A), (B), and (C) were obtained from 20,
21, 24 subjects in CON-PSD dataset. The upper and lower rows represented each baseline
(BL), mental arithmetic (MA). Delta, theta, alpha, beta, and gamma PSD bands were colored
purple, blue, green, orange, and red, respectively.
32
Explanation of Model using GNNExplainer
Certain subgraph patterns that derived by the GNNExplainer, such as Figure 3.3A, C in
BL, and Figure 3.3B in MA, were observed in all subjects. Two prominent patterns were 1)
from CCP5h to the frontal regions (AFF1h, AFp1, AFF5h, and F3), and 2) from AFF5h to
POO1.
33
Figure 3.3 Average of subgraphs that found as important in task classification by the
GNNExplainer in (A) subject 20, (B) subject 21, and (C) subject 24
34
3.5 Discussion and conclusion
Previous studies regarding the connectivity during MA tasks reported that the
connectivity of the frontal and the left parietal regions are associated with cognitive
workload, and active involvement of left parietal area during MA [63-65]. The findings of
this study are consistent with previous studies; the connection within the frontal region was
sustained or increased, and the parieto-occipital lesion (P4, P8, PPO2h, POO2, Pz, POO1,
PPO1h, P3, and P7) was included in the connectivity. Further, there was an incremental
tendency in the theta band power of the frontal region, similar to a previous MA study which
suggested that high frontal theta activities are related to cognitive control and concentration
[63]. In addition, there was a tendency that the beta band power in the central cortex
decreased, which is suspected to associated with mental workload [63, 66].
Limitations of this study should be mentioned. First, this research was performed in an
offline environment with retrospective data. Second, the adding PSD information resulted
35
in significant increase in classification performance, however the explainability measure
utilized in this study was not applicable to PSD features. Future studies with similar design
should aim to validate the proposed methods, preferably with a real-time, prospective design
in a larger cohort.
36
Part III:
Conclusions
37
CHAPTER 4. Concluding Remarks
In the study, the potential for enhancing the understanding of spatial neural activity and
brain functional connectivity patterns during MI and MA tasks was demonstrated through
the use of deep learning-based EEG-BCI applications.
Chapter 3 presented a GNN model that can reasonably classify spatial neural activity
and functional connectivity during MA. Additionally, an explainable artificial intelligence
(XAI) technique was used to investigate the model's explanation. The input to the GNN
model, consisting of PSD and connectivity patterns, showed features related to attention and
mental workload, indicating that it contained rich features for classification. Moreover, GNN
XAI identified the frontoparietal connection from the centroparietal region to the left frontal
region as an important feature for MA classification. The findings of this study indicate that
the connectivity patterns reported here could be important in MA classification, and that the
connectivity analysis by “explainable” GNN could be beneficial in investigating brain
activity patterns during cognitively demanding tasks.
38
4.2 Limitations and Suggestions
Several shared limitations of the studies introduced in this thesis should be mentioned.
First, the study designs and data analysis were conducted in offline environments. To apply
BCI system in real-life device control, an online system that has low computational
complexity, enables real-time analysis, and having simplified measurement systems is
required. Second, the results of the above-mentioned studies may have insufficient statistical
significance due to the small number of subjects. Hence, validation of the results with
additional data is required to ensure statistical reliability. Third, all participants in the
experiment are healthy. Although a major objective of the BCI application is to help patients
with motor disabilities whose brain activity might be different from healthy individuals,
external validation with data from patients with neuromotor disabilities is required to
compensate such differences. Future works will validate the results with more sample size
data from motor disabled patients.
39
4.3 Conclusions
This thesis investigates deep learning methods for MI and MA classification systems.
For MI classification, a methodology is proposed to quantitatively reject trials based on
changes that occur before and during mental tasks. The proposed method achieved higher
performance on a relatively small dataset, contrary to the common belief that the
performance of deep learning models improves with more data. Based on the absence of
MRCPs in the rejected data, it can be inferred that the lack of changes in EEG signals with
respect to cues indicates decreased subject concentration. For MA classification, a GNN
model is developed that reasonably classifies spatial neural activity and functional
connectivity. The developed model achieves high performance by integrating node and edge
features for analysis. In addition, XAI verifies that the GNN model highlights features
related to attention and working memory for the purpose of classification. The investigated
methods in all processes of MI and MA BCI hold promise for enhancing the practical utility
of high performance BCIs.
40
REFERENCES
45