Actandloc
Abstract—WiFi-based human sensing technology has unique properties such as widespread availability, resistance to light interference, overcoming terrain limitations, and respecting user privacy. It has been utilized in various fields. One interesting application is simultaneous human activity recognition and localization, where the location and activity of a person standing between WiFi transmitters can be predicted using received WiFi signals and machine learning models. At the receiver, the channel estimation process calculates the channel state information (CSI), which serves as the input features for machine learning models. However, CSI dataset collection is labor-intensive and affected by hardware requirements and timing synchronization between transmitters. Therefore, the dataset is not publicly released in some studies. Other studies used a cable for timing between transmitters, which limits the practicality in real-world scenarios. In this study, we release the first public dataset with wireless timing synchronization, namely WiARL-UIT, for simultaneous human activity recognition and localization. The dataset has 1500 CSI samples for 2 activities at 3 locations. For CSI dataset collection, we have developed a framework consisting of two software-defined radio (SDR) devices (USRP B200 Mini) transmitting the IEEE 802.11n WiFi protocol. Finally, we have evaluated the performance of various machine learning models in terms of accuracy, precision, recall, and F1-score. As a result, the best accuracy is 97.0% with the CNN model, showing the feasibility of the research direction of simultaneous human activity recognition and localization. The dataset and source code are publicly shared at https://github.com/F4tt/WiARL-UIT-Dataset.

Index Terms—Activity Recognition, WiFi Human Sensing, Indoor Localization, Machine Learning, CSI Fingerprints.

I. INTRODUCTION
WiFi technology is now widely popular and virtually always present in our lives. Transmitted WiFi signals are multipath faded and can be affected by people standing on their propagation paths. WiFi-based human sensing applications exploit the signal variations in amplitude and phase to predict the information contained in received signals. Currently, WiFi-based human sensing is extensively utilized in various fields, including activity recognition [1]–[10], indoor localization [11]–[14], and healthcare applications [15]–[17]. This success is attributed to WiFi’s unique properties, such as widespread availability, resistance to light interference, overcoming terrain limitations, and respecting user privacy.
For smart city applications, awareness of user activity and location is a crucial factor. There are several interesting studies on the potential for simultaneous human activity recognition and localization using WiFi signals [18], [19]. These studies successfully demonstrate that it is possible to predict features of activities and features of locations within the same CSI samples. However, there are some limitations. In study [18], since the dataset is not publicly available, it is challenging to conduct additional research on the subject. The study [19] has publicly released a dataset with 1394 CSI samples covering various activities and locations. The WiFi transmitter and receiver are connected by a cable, which provides perfect timing synchronization. However, the sync cable makes the setup impractical for real-world scenarios. Therefore, in this paper, we propose a framework for collecting WiFi CSI samples for the simultaneous task of activity recognition and localization. The proposed framework consists of software-defined radio USRP devices with wireless timing synchronization. Using this framework, we have collected a dataset of 1500 CSI samples for various locations and activities, then made it publicly available.
The main contributions of our research can be summarized as follows:
• Developing an SDR-based framework with wireless timing synchronization to obtain WiFi channel state information for simultaneous human activity recognition and indoor localization.
• Collecting a dataset named WiARL-UIT with 1500 samples for various locations and activities. The dataset is publicly shared at https://github.com/F4tt/WiARL-UIT-Dataset.
• Evaluating the performance of various deep learning models on the WiARL-UIT dataset in terms of accuracy, precision, recall, and F1-score. The highest achieved accuracy is 97.0% with the CNN model, showing the feasibility of the research direction. The source code is also shared at the link above.
Fig. 1. Application of simultaneous activity recognition and localization for smart-home scenarios.

II. RELATED WORK
A. Human Sensing Using WiFi Signals
Recently, many studies have been dedicated to the research direction of human sensing using variations in WiFi signal waveforms [1]–[10]. Based on the channel state information received from channel estimation, many interesting ideas have been implemented to predict the behavior of humans standing on the propagation path of the WiFi signals. For example, the study [1] demonstrates the possibility of WiFi-based human recognition for various activities such as stumbling, slipping, fainting, sitting down, and standing up. This study highlights the practicality of human action recognition in the medical field. Similarly, in [3], WiFi signals are used to detect human hand gestures with 96% accuracy. This can be an interesting idea to apply to smart city scenarios. Moreover, the studies [3], [4] analyze the ability to recognize user keystrokes under conditions of small signal amplitudes. Finally, human localization is also studied in [11]–[14] with impressive results.

B. Simultaneous Human Activity Recognition and Indoor Localization
Although there have been many studies on human recognition based on WiFi signals, there has not been much research on the joint task of simultaneous activity recognition and localization. Fig. 1 illustrates the application of simultaneous activity recognition and localization for smart-home scenarios.
The first study on this topic is [18], which aims to precisely identify actions taken despite location changes. A system of eight wireless devices is deployed at two locations: a laboratory and an apartment. All devices are synchronized by wireless signals. The best results were achieved when recognizing the "stand" action at different locations, with an accuracy of 98% in the laboratory and 100% in the apartment.
The second study [19] has shown the feasibility of identifying both actions and locations from Channel State Information (CSI) samples. A 1D Convolutional Neural Network model was trained to distinguish the positions and actions of volunteers with a pre-prepared dataset. The results achieved were 88.13% for location recognition and 95.68% for action recognition. This study demonstrates that it is possible to separate features of actions and features of locations within the same CSI sample.
To advance the research direction, we implemented an SDR-based framework for collecting a WiFi-based CSI dataset, analyzed the dataset with various machine learning models, and publicly shared the collected dataset.

C. Publicly Available Datasets
In recent years, several datasets have been publicly released for research in human recognition using WiFi signals, such as [2], [9], [19], [20]. Because collecting CSI datasets is labor-intensive, these public datasets are valuable data sources for researchers to evaluate ML models for human activity recognition or indoor localization. However, for simultaneous human activity recognition and localization, the public dataset in [19] is unique. In the study [19], the dataset was constructed with 6 actions performed sequentially at 16 different locations in indoor environments, with a total of 1394 data samples. What is special about this dataset is that each CSI sample contains information about both action and location, making it suitable for future models that simultaneously recognize human activities and perform indoor localization.
To further contribute to the field of human sensing based on WiFi signals in general, and to simultaneous human activity recognition and indoor localization in particular, we have publicly released the WiARL-UIT dataset in this study.

III. METHODOLOGY
A. Recognition of Human Activity and Indoor Localization Simultaneously using CSI
This topic builds on wireless sensing, specifically WiFi sensing. WiFi sensing is a sensing technology that detects physical or biological phenomena and converts that information into electrical signals that can be analyzed for various purposes. WiFi sensing uses WiFi, also known as wireless local area networks (WLAN), to identify and interpret presence and motion. WiFi sensing detects and interprets motion through two stages:
• Pre-processing and initial filtering: The raw data provided by WiFi devices (referred to as "CSI" or "channel state information") is filtered to remove movements caused by pets and other non-human sources.
• Higher-level processing: Artificial intelligence (AI) and machine learning algorithms then analyze the data for advanced applications, such as interpreting breathing patterns.
In the first stage, CSI (Channel State Information) can be utilized for recognition tasks by analyzing the variations in the wireless channel caused by the influence of the human body between the transmitter and receiver devices. CSI is a signal metric captured in an orthogonal frequency-division multiplexing (OFDM) transmission system that describes the amplitude and phase changes across multiple subcarrier frequencies as
wireless signals travel from a transmitter to a receiver. The
OFDM system is modeled as:
$$y = Cx + n \quad (1)$$
where x and y are complex vectors denoting the transmitted and received signals, respectively, n is a noise vector, and C is the channel-information matrix. CSI is collected for each subcarrier, and each value is a complex number with a real component $C_r^{(s)}$ and an imaginary component $C_i^{(s)}$. Denoting the subcarrier index as s, we can calculate the amplitude from raw CSI,
$$A^{(s)} = \sqrt{\left(C_i^{(s)}\right)^2 + \left(C_r^{(s)}\right)^2} \quad (2)$$
and the phase as well,
$$\phi_s = \operatorname{atan2}\left(C_i^{(s)}, C_r^{(s)}\right) \quad (3)$$
Fig. 2. Broadcast and Receive systems
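As a rough numerical illustration of Eqs. (1)-(3), the sketch below builds a synthetic received signal and recovers the per-subcarrier amplitude and phase. It is not the authors' channel estimator. It assumes a diagonal channel matrix C (one complex gain per subcarrier), a known all-ones transmitted symbol x, and a simple least-squares division y / x. The helper names `estimate_csi` and `amplitude_and_phase` are hypothetical, and 52 subcarriers is taken from the figure quoted in Section IV-B.

```python
import numpy as np


def estimate_csi(y, x):
    """Per-subcarrier least-squares estimate, assuming a diagonal C in y = Cx + n."""
    return y / x


def amplitude_and_phase(csi):
    """Apply Eq. (2) and Eq. (3) to a complex CSI vector."""
    amplitude = np.sqrt(csi.imag ** 2 + csi.real ** 2)  # Eq. (2)
    phase = np.arctan2(csi.imag, csi.real)              # Eq. (3)
    return amplitude, phase


# Synthetic data only: these values are illustrative, not taken from WiARL-UIT.
rng = np.random.default_rng(0)
num_subcarriers = 52
c_true = rng.normal(size=num_subcarriers) + 1j * rng.normal(size=num_subcarriers)
x = np.ones(num_subcarriers, dtype=complex)  # assumed known transmitted symbol
n = 0.01 * (rng.normal(size=num_subcarriers) + 1j * rng.normal(size=num_subcarriers))
y = c_true * x + n                           # Eq. (1) with a diagonal C

csi = estimate_csi(y, x)
amplitude, phase = amplitude_and_phase(csi)
```

Eq. (2) and Eq. (3) coincide with `np.abs(csi)` and `np.angle(csi)`, so either form could be used when preparing amplitude or phase features.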
In the next stage, we train machine learning models on a dataset for the problem of simultaneous human activity recognition and localization. The idea for simultaneous recognition is as follows: given predefined labels during the dataset collection process, as shown in Fig. 5, we simultaneously differentiate information about the location (L) and information about the action (A) recorded in the CSI samples to determine which label a CSI sample belongs to. Unlike traditional action recognition or location recognition problems, here we require simultaneous accuracy in both L and A. Suppose that, when comparing a CSI sample with a predetermined label, the result shows similarity in L but the comparison for A returns "not similar"; we then conclude that the CSI sample does not belong to that label.

B. Proposed Platform for WiFi CSI Collection
With two universal software radio peripherals (USRPs), we implement the standard IEEE 802.11 protocol to collect CSI samples. Each USRP mainly consists of a motherboard (USRP B200-Mini) and a WiFi antenna (VERT-2450), which is used to broadcast or receive WiFi signals under the control of GNU Radio*. A view of the USRP is shown in Fig. 2, and the details are listed below. The assembly diagram is shown in Fig. 3.
• USRP B200-Mini: A software-defined radio (SDR) device with a wide frequency range from 70 MHz to 6 GHz and a user-programmable Xilinx Spartan-6 XC6SLX75 FPGA, in which the IEEE 802.11a/g/p protocol can be embedded to send and receive WiFi packets for CSI samples.
• VERT-2450 antennas: Broadcast or receive WiFi signals under the control of GNU Radio*.
• Computer and USB cables: Control the B200-Mini when set in the same local area network as the B200-Mini.
Fig. 3. System Framework

IV. DATA COLLECTION
A. Dataset
To collect the WiARL-UIT dataset, we assigned a volunteer to execute 2 actions, "HandUp" and "ToTheLeft", at 3 locations, "Right", "Mid", and "Left", within the WiFi coverage area, as described in Fig. 4. This yielded 6 labels, as depicted in Fig. 5. For every label, we collected 250 CSI samples, totaling 1500 CSI samples for the dataset. This dataset fulfills all the necessary criteria for our proposed model's objective of simultaneous recognition of both position and action. As discussed in subsection II-C, there is a publicly available dataset in [19] that aligns with the requirements for simultaneous recognition. That dataset offers the advantage of numerous labels, up to 96 (6 actions × 16 locations). However, the CSI samples per label are limited (15 samples). To contribute to this field, we curated the WiARL-UIT dataset with 6 labels but increased the number of CSI samples for each label to 250. The aim was to reduce overfitting and improve model accuracy.

B. CSI Fingerprint Analysis
We have simulated CSI fingerprint samples corresponding to different labels, as stated in Fig. 5. We have presented 6 samples in Fig. 6, each corresponding to a different label. The x-axis in the figure represents the number of packets, while the y-axis represents the amplitude of the CSI. Each CSI sample consists of 52 subcarrier series, where 52 is the number of orthogonal frequency division multiplexing (OFDM) [15] sub-carriers that carry data in parallel in the WiFi protocol. Fig. 6 shows that CSI is not too different despite the differences in location and action.
In Fig. 7 and Fig. 8, we have provided examples of samples that perform the same action in different locations. Additionally, Fig. 9 represents samples where we perform two different actions at the same location. Each sample with an x-axis
Fig. 6. CSI Fingerprints of 6 labels corresponding to the locations and actions
mentioned in Fig.5
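The label layout described in Section IV-A and the joint-matching rule of Section III-A can be sketched as follows. This is a minimal illustration under stated assumptions: the integer index given to each (location, action) pair is hypothetical and need not match the numbering in Fig. 5, and the helper names `encode`, `decode`, and `belongs_to_label` are introduced here for illustration only.

```python
from itertools import product

ACTIONS = ("HandUp", "ToTheLeft")
LOCATIONS = ("Right", "Mid", "Left")
SAMPLES_PER_LABEL = 250

# Hypothetical joint-label encoding: one class index per (location, action) pair.
LABELS = {pair: index for index, pair in enumerate(product(LOCATIONS, ACTIONS))}
INDEX_TO_LABEL = {index: pair for pair, index in LABELS.items()}


def encode(location, action):
    """Map a (location, action) pair to its joint class index."""
    return LABELS[(location, action)]


def decode(index):
    """Map a joint class index back to its (location, action) pair."""
    return INDEX_TO_LABEL[index]


def belongs_to_label(pred_location, pred_action, label):
    """A sample matches a label only if BOTH location and action agree."""
    location, action = label
    return pred_location == location and pred_action == action


total_samples = len(LABELS) * SAMPLES_PER_LABEL
```

With 2 actions and 3 locations this gives 6 joint labels and 6 × 250 = 1500 samples, matching the dataset size stated in the paper. A prediction that gets the location right but the action wrong is rejected, which is the behavior Section III-A describes.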
V. MODELS PERFORMANCE
To obtain the results, we used the collected WiARL-UIT
dataset to train machine learning models. Here
we propose using both complex neural network models such