Download as pdf or txt
Download as pdf or txt
You are on page 1of 40

Revolutionizing Future Connectivity: A

Contemporary Survey on AI-empowered


Satellite-based Non-Terrestrial Networks in 6G
Shadab Mahboob and Lingjia Liu
Wireless@Virginia Tech, Bradley Department of Electrical and Computer Engineering,
Virginia Polytechnic Institute of Technology, Blacksburg, VA, 24060 USA
{mshadab,ljliu}@vt.edu

Abstract—Non-Terrestrial Networks (NTN) are expected to (AR), Virtual Reality (VR), Tactile Internet, Holographic Type
arXiv:2303.01633v2 [cs.NI] 9 Jun 2023

be a critical component of 6th Generation (6G) networks, Communication (HTC), remote health and surgery, etc. require
providing ubiquitous, continuous, and scalable services. Satellites
extremely high throughput, low latency, high reliability, and
emerge as the primary enabler for NTN, leveraging their
ubiquity at the same time which cannot be met with current
extensive coverage, stable orbits, scalability, and adherence to
technological standards [4]. Consequently, the next-generation
international regulations. However, satellite-based NTN presents
global wireless standard, namely, 6th Generation (6G) has
unique challenges, including long propagation delay, high Doppler
shift, frequent handovers, spectrum sharing complexities, and become the current research focus for the industry and research
intricate beam and resource allocation, among others. The community [5].
integration of NTNs into existing terrestrial networks in 6G
introduces a range of novel challenges, including task offloading,6G is expected to provide an extremely high data rate (peak
network routing, network slicing, and many more. To tackle data rate up to 1 Tbps and user-experienced data rate up to
10 Gbps, around 100 times higher than 5G), very low latency
all these obstacles, this paper proposes Artificial Intelligence
(AI) as a promising solution, harnessing its ability to capture(in the order of µs), high reliability (around 100 times better
intricate correlations among diverse network parameters. We than 5G) and extreme coverage to support the diverse set of
begin by providing a comprehensive background on NTN and
AI, highlighting the potential of AI techniques in addressing future applications [6]–[8]. Due to the limited coverage area
and geographical constraints, it is not possible to guarantee
various NTN challenges. Next, we present an overview of existing
works, emphasizing AI as an enabling tool for satellite-based ubiquitous connectivity with existing terrestrial-only network
NTN, and explore potential research directions. Furthermore, weinfrastructures. Non-Terrestrial Networks (NTNs), networks
discuss ongoing research efforts that aim to enable AI in satellite-
involving space and aerial platforms, can provide us with
based NTN through software-defined implementations, while also
discussing the associated challenges. Finally, we conclude by multicast opportunities over very large areas as well as can serve
providing insights and recommendations for enabling AI-driven users even in remote areas or during times of natural calamities
satellite-based NTN in future 6G networks. [4]. Furthermore, the launching and maintenance costs for
Index Terms—Non-Terrestrial Networks (NTN), Space-Air- satellites have significantly decreased as they are deployed at
Ground Integrated Networks (SAGIN), Artificial Intelligence (AI),
lower heights (typically around 600 km). These satellites can
Machine Learning (ML), Deep Learning (DL), 5G-Advanced, 6G,
Satellite, Beam-hopping, Handover, Spectrum sharing, Doppler provide much higher throughput and lower latency compared
shift, Resource allocation, Computational offloading, Network to legacy satellites and potentially can support different use
routing, Network slicing, Channel estimation, Security, Open cases of 6G. So NTN is considered to become one of the
Radio Access Network (O-RAN), RAN Intelligent Controller major technological enablers of future 6G networks visioning
(RIC). connectivity anywhere and anytime [4], [9]–[11]. Tech giants
such as SpaceX Starlink, Amazon Kuiper, and OneWeb have
I. I NTRODUCTION already begun to invest billions of dollars in this field, reflecting
The Third Generation Partnership Project (3GPP) has already its massive potential for future growth [12].
started the standardization towards the 5th Generation (5G)- Although NTN presents numerous potential benefits for
Advanced in Release 17 and 18 to facilitate its worldwide the development of future 6G networks, it also entails
deployment [1], [2]. 5G-Advanced provides much higher data several challenges that need to be addressed, primarily due
rates, lower latency, increased capacity, and more efficient to the unique characteristics of its mobility and propagation
spectrum utilization than any of its predecessors. It supports environments [13], [14]. Due to the long distances between
a wide range of applications encompassing all 5G use the space-borne Base Stations (BS) and the ground User
cases such as Ultra Reliable Low Latency Communications Equipment (UE), the propagation delay is usually higher in
(URLLC), massive Machine Type Communication (mMTC), NTN environments. Additionally, high-speed air or space-borne
and enhanced Mobile Broadband (eMBB) communication with platforms necessitate modifications to existing handover and
different Key Performance Indicator (KPI) requirements [3]. paging protocols, as well as introduce a significant Doppler shift
Nevertheless, future applications such as Augmented Reality in carrier frequencies. The large path loss also increases the

1
minimum power requirement for reliable transmission, initiating research challenges for the integration of NTN into future 6G
the need for novel beam and resource allocation strategies. networks. [35] specifically focuses on the integrated Space-Air-
Spectrum sharing in the same frequency band with existing Ground Integrated Network (SAGIN) in 6G while discussing
terrestrial or other services also requires further study in order to the above topics in the context of NTNs. [18] presents the real
avoid interference between terrestrial and non-terrestrial users. system prototypes along with the general overview discussion
Even though currently there are some stand-alone satellite on NTNs. [19] presents the challenges from the aspects of
network deployments, the ultimate goal is the convergence of different communication layers to provide better insights for
terrestrial and non-terrestrial environments for extreme network addressing these issues.
performance in 6G. [4]. This potential integrated environment Likewise, AI has been acting as a driving force for various
requires efficient computing, routing, and slicing algorithms applications in wireless environments, especially in the last
for meeting the expected KPI requirements of 6G. couple of decades; many surveys have been published on these
Artificial intelligence (AI) is currently having a profound topics recently [37]–[40]. To facilitate the potential of AI in the
and revolutionary impact on a multitude of industries, including 5G-Advanced and 6G environments several research articles
but not limited to healthcare, military, transportation, and and surveys are in the literature [28], [41]–[43]. Some relevant
e-Commerce [25]. AI encompasses a wide array of smart surveys are also published focusing on different aspects of
machines, while Machine Learning (ML) is a popular subset AI-enabled 6G like pervasive network intelligence [44], green
of AI that allows machines to learn from large amounts of data communications [45], privacy [46], [47], network access and
and make decisions without the need for explicit programming routing [48]. As NTNs are expected to be integrated into the
[26]. Deep Learning (DL) is a special subset of ML that existing terrestrial environment for the development of 6G
studies Artificial Neural Networks (ANNs) which contain networks, it is clear that AI is expected to play a crucial role in
more than one hidden layer, often implemented to simulate the this process. To unleash the full potential of AI to enable NTN
human brain [27]. DL is currently being leveraged in various in 6G, we need to have a clear understanding of the potential
applications, such as computer vision, speech recognition, issues of NTN, we can gain insight into what AI tools can be
and bioinformatics, outperforming human-level performance useful down the road to resolve those issues.
in these particular domains. The cellular domain is still in There have been a few research articles capturing the key
its infancy in terms of AI integration [28] compared to other aspects of AI as an enabling technology for NTN in 6G in
fields due to the complex and dynamic nature of wireless the recent past. In [18], [19], a short discussion on important
networks. As an integral part of 6G networks, challenges applications of AI/ML in satellite-based NTN communication
associated with NTN deployment provides an enticing field for 6G is provided along with the general discussion on
for AI applications. However, while deploying algorithms in a NTN. In [21], several potential AI approaches for sustainable
real environment, practical implementation difficulties may integrated Terrestrial and Non-Terrestrial Networks (TNTN)
arise to provide reliable vertical connectivity between the with a focus on maritime networking are discussed in a concise
ground and space networks. To reach optimal performance, manner. In [15], a brief discussion of ML approaches to
theoretical advancements in communication system design tackle different potential problems associated with integrated
must be complemented by appropriate AI solutions for NTN TNTNs is presented. In [16], it provides a short discussion
integration into 6G. on ML approaches for a limited number of issues related to
next-generation mega-satellite networks. In [22], a compact
A. Related Works discussion on different ML and DL techniques at various layers
The possibility of potential integration of NTNs into 5G- of the Open Systems Interconnection (OSI) model for NTN
Advanced [14] and future 6G networks to support various integration into existing 5G infrastructures is presented. Even
future high-demanding use cases has attracted significant though the above-mentioned works attempt to capture the role
attention from the research community in recent times. This of AI in future 6G networks for enabling integrated TNTN
emerging area of research has spurred numerous investigations environments, they are generally brief and do not provide a
to address the unique challenges and opportunities posed comprehensive overview of works in this particular domain.
by NTN integration. [29] discusses the potential integration In [17], the potential role of AI techniques in the provision
aspects for satellites, which is an integral part of NTNs, into of NTN-based Intelligent Internet of Things (IoT) services is
future communication networks. [30] presents a summary of discussed; they do not focus on cellular environments for future
3GPP efforts towards supporting NTNs in the 5G-Advanced integrated TNTN 6G networks. In [20], reviews of potential AI
networks, and in [31], the critical issues related to current approaches for both broadcasting and communication satellites
5G specifications to enable 5G-NTN are investigated and are provided. However, they do not focus on the issues
insights are provided about potential solutions. Furthermore, related to NTN integrated 6G networks, rather only focus
many surveys and research articles such as [13], [18], [19], on general satellite communication. In [23], a comprehensive
[32]–[36] are published, which delve into various aspects of review of the control approaches like coverage, spectrum,
state-of-art research directions in this domain. These articles interference, and mobility management required by NTN
aim to provide a comprehensive overview of the architectural platforms that are solved using Reinforcement Learning (RL)
evolution, technological enablers, potential use cases, and open formulations is presented, but they do not focus on other AI

2
TABLE I: Related papers on AI approaches for Satellite-based NTNs in 6G.
Ref. Pub. Background Discussion on AI-Enabled NTN in 6G
year NTN challenges AI relevance Research thrusts 6G perspective Current efforts Practical challenges
2019 Short
[15]
2019 Short Short
[16]
2020 Comprehensive, but Short and only covers
[17] only covers IoT IoT applications
applications
2021 Short Complete
[18]
2021 Short uncategorized Does not cover
[19] AI for NTN
2021 Comprehensive, but
[20] does not cover the 6G
perspective
2022 Short Short
[21]
2023 Short
[22]
2023 Comprehensive, but Only covers RL
[23] only covers RL
2023 Comprehensive, but
[24] does not cover the 6G
perspective

approaches related to prediction and estimation. A very recent 1) We provide a relevant background discussion on NTNs
comprehensive survey paper on ML and DL applications on focusing on the challenges associated with it to deploy in
satellite communications is published [24]. However, they do 6G networks and various AI approaches along with their
not discuss the current research efforts from the integrated 6G relevance to addressing different challenges in NTN.
perspective and the potential challenges of applying ML and 2) We provide a systematic survey of existing and relevant
DL techniques in this domain. research works in each research thrust to organize the
The list of related articles along with the key features is current research progress in these fields. This helps us to
provided in Table I. get an insight into the current status and potential future
research scopes of different relevant research fields in
this domain.
B. Contribution 3) We summarize the current AI testbeds for satellite
As discussed in section I-A, most of the existing networks and potential integration efforts to current 5G
articles either focus on a discussion of architecture and software-defined testbeds for implementing integrated
challenges associated with NTN or AI approaches for wireless satellite-terrestrial integrated networks.
communications from a broader point of view. Although some 4) We provide a discussion on various practical
research articles also discuss the potential research scopes for complications associated with applying AI approaches
AI-powered NTNs to some extent, those discussions are either to NTN as future open issues. This helps us access
generally not very comprehensive or do not capture the role of the maximum potential of AI techniques while being
AI in NTN integrated 6G networks in a complete manner. Also, mindful of the practical constraints of NTN integration
the current research efforts and practical complications related into next-generation wireless networks.
to AI-empowered NTN-integrated 6G networks are not covered. 5) We provide a discussion on insights and recommendations
In this survey article, we aim to present a comprehensive on various aspects of applying AI techniques to satellite-
survey on various AI approaches to address different unique based NTNs for future 6G networks.
challenges posed by NTN. To provide our readers with a
C. Paper Organization
better understanding, we also provide a necessary relevant
background discussion on NTN and its challenges for realizing The rest of the paper is organized as follows. In Section II,
in 6G and various AI approaches along with their relevance we provide a compact overview discussion of NTN discussing
to address different challenges in NTN. We also provide its platforms, use cases, architecture, and characteristics; discuss
an overview discussion on current research efforts and the potential challenges associated with its deployment in 6G. In
general complications related to realizing AI approaches in real Section III, we introduce different types of AI approaches to
integrated TNTN environments in 6G. The main contributions provide a brief overview of relevant AI techniques to solve
of this article can be summarized as follows: various challenges associated with NTNs. We then summarize

3
Definitions Supervised Learning
Introduction
Target Frequency Band Machine Learning
Unsupervised Learning
Propagation Delay
Use Cases
DeepDeep Learning
Learning
Reinforcement Learning
Propagation Loss Background on
NTN
General Architecture
Moving Base Stations Beam Beam
Computational
Beam Hopping
Hopping Hopping
Offloading
Large Coverage Area Fundamental AI and Its
Characteristics Relevance to NTN
Challenges Handover Optimization Network Routing

Channel Estimation
Challneges in NTN
Doppler shift Estimation Network Slicing
Mobility Management
Current Research
Thrusts
Doppler Shift Estimation
Spectrum Sharing Channel Estimation

ML Testbeds
Resource Management
Others: Security and
Resource Allocation
Current Progress Traffic Predicition
Spectrum Sharing DeepTestbeds
OAI-SDR Learning

Network Procedures Problem Definition, Conventional Approach, AI solutions

Challenges
Limited Onboard Beam Beam
Aging of Information General Learning
Capability Interrelated Issues
Hopping Hopping
Approaches

Additional Comm.
Security Aspects Recurrent Learning
Overheads Insights and Control Feedback Design
Architectures
Recommendations

Environmental Scalability Issue


Conditions Online Implementation Distributed Learning
Models

Scarcity of Quality Conclusions


Lack of Convergence Enabling O-RAN-based Development in
Data
RIC Miniaturization

Hyperparameter Lack of Generalization


Settings Energy Efficiency Secured System Design

Fig. 1: Structure of the paper.

the existing AI approaches to address various NTN challenges in future 6G networks in Section VII. We illustrate the structure
categorizing them into different NTN research thrusts in of the paper showing the major components in Figure 1 for
Section IV. Furthermore, We summarize the current research better understanding. We also provide the list of acronyms in
efforts from the industrial and research community to enable Table II for the convenience of the readers.
AI applications for enabling satellite-based NTN in future
II. BACKGROUND ON NTN
6G networks in Section V. We also discuss the technical
challenges associated with the integration of AI to NTNs in To understand the role of AI in enabling NTNs in 6G, we
Section VI. Finally, we provide a discussion on insights and provide a concise background discussion on NTNs and the
recommendations for enabling AI-enabled satellite-based NTN challenges associated with NTNs to realize them in 6G in this
section. First, we familiarize the readers with various space and

4
TABLE II: List of acronyms.
Acronyms Definitions Acronyms Definitions
3GPP Third Generation Partnership Project MADRL Multi-Agent Deep Reinforcement Learning
5G 5th Generation MAP Maximum A Posteriori
5G-ALLSTAR 5G AgiLe and flexible integration of SaTellite And MARL Multi-Agent Reinforcement Learning
cellulaR MC Monte Carlo method
5G-EMUSAT 5G New Radio EMUlation over SATellite MDP Markov Decision Process
6G 6th Generation MEMS Micro-Electro-Mechanical Systems
AC Actor-Critic MEO Medium Earth Orbit
ACO Ant-Colony based Optimization MiM Man-in-the-Middle
AI Artificial Intelligence MIRSAT MultI-layer awaRe SDN-based testbed for SAtellite-
ANN Artificial Neural Networks Terrestrial networks
AoA Angle of Arrival ML Machine Learning
AoD Angle of Departure MLP Multi-Layer Perceptron
AR Augmented Reality MMSE Minimum Mean Square Error
ATM Asynchronous Transfer Mode mMTC massive Machine Type Communication
BS Base Station MNL Minimum Network Load
CCI Co-Channel Interference MSQ Maximum Signal Quality
CG Coordinate Graph MST Maximum Service Time
CNN Convolutional Neural Network mULC massive Ultra-reliable low-Latency Communication
CSD Cyclo-Stationary Detection NGEO/NGSO Non-Geostationary Earth Orbit
CSI Channel State Information NN Neural Network
CU Central Unit NTN Non-Terrestrial Networks
DDPG Deep Deterministic Policy Gradient OAI OpenAirInterface
DDQN Double Deep Q-Learning Network OFDM Orthogonal Frequency Division Multiplexing
DL Deep Learning O-RAN Open Radio Access Network
DN Deconvolutional Network OSI Open Systems Interconnection
DoS Denial-of-Service OSPF Open Shortest Path First
DP Dynamic Programming PCA Principal Component Analysis
DPG Deterministic Policy Gradient PG Policy Gradient
DQN Deep Q-Learning Network PNN Probabilistic Neural Network
DRL Deep Reinforcement Learning PSD Power Spectral Density
DRN Deep Residual Network PSO Particle Swarm Optimization
DRQN Deep Recurrent Q-Learning Network QoE Quality of Experience
DU Distributed Unit QoS Quality of Service
ED Energy Detection RC Reservoir Computing
eMBB enhanced Mobile Broadband RAN Radio Access Network
ELM Extreme Learning Machine RIC RAN Intelligent Controller
ESA European Space Agency RL Reinforcement Learning
ESN Echo-State Network RMS Root Mean Square
EVD Eigen Value-based Detection RNN Recurrent Neural Network
FCNN Fully Connected Neural Network RSRP Reference Signal Received Power
FlexRIC Flexible RIC RSRQ Reference Signal Received Quality
GA Genetic Algorithm RU Radio Unit
GAN Generative Adversarial Network SA Simulated Annealing
GNN Graph Neural Network SAGIN Space-Air-Ground Integrated Networks
GEO/GSO Geostationary Earth Orbit SARSA State-Action-Reward-State-Action
GNSS Global Navigation Satellite System SDN Software Defined Network
GPS Global Positioning System SDR Software Defined Radio
GRU Gated Recurrent Unit SINR Signal to Interference and Noise Ratio
HAPS High Altitude Platform System SL Supervised Learning
HARQ Hybrid Automatic Repeat Request SNR Signal to Noise Ratio
HIBS High-altitude International Mobile Base Station SoC System-on-Chip
HTC Holographic Type Communication SVM Support Vector Machine
IC Integrated Circuit TDD Time Division Duplexing
ICI Inter-Carrier Interference TNTN integrated Terrestrial and Non-Terrestrial Network
IoT Internet of Things UE User Equipment
IP Internet Protocol UL Unsupervised Learning
ISL Inter-Satellite Link ULBC Ultra-reliable low Latency Broadband Communication
ITU International Telecommunication Union uMBB ubiquitous Mobile BroadBand
KPI Key Performance Indicator URLLC Ultra Reliable Low Latency Communication
LEO Low Earth Orbit V2X Vehicle-to-Everything
LSM Liquid State Machine VR Virtual Reality
LSTM Long-Term Short Memory VSAT Very Small Aperture Terminal

air-borne NTN components along with the general architectures unique characteristics of NTNs which pose new challenges
and use cases in 6G. We clarify that we focus on satellite-based for integrating them into existing terrestrial networks for 6G.
NTN while discussing NTNs for the rest of the paper due to Depending on the nature of these challenges, we present
their critical role in enabling 6G with ubiquitous coverage, the current research trends in this domain in section IV by
predictable trajectory, and scalability. Then we emphasize on combining them with the AI techniques discussed in section

5
35000
km
GEO
Satellite
Spaceborne
platforms
10000
km
MEO
Satellite
2000
km

LEO 500 km
Satellite
platforms
Airborne

20 km

HAPS

Airships 10 km

Terrestrial 
BS
Urban Rural Remote Isolated
Ground
 

Fig. 2: An illustration of different NTN components in 6G.

III. • Geostationary Earth Orbit (GEO or GSO) Satellites:


These satellites have an orbital period of 24 hours which is
A. Definition the same as the time required for the Earth to complete a
full rotation on its axis. As a result, these satellites appear
Non-Terrestrial Network (NTN) refers to any network stationary from the ground and are named Geostationary
operating through the air or space-borne vehicle(s) for Earth Orbit (GEO or GSO) Satellites. These satellites
communication [30]. This definition implies that two distinct orbit on the Earth’s equatorial plane at an altitude of about
types of NTN platforms (space-borne and air-borne) can be 35,786 km to maintain this orbital period. Due to this high
utilized for NTN at different heights which is illustrated in altitude, it has an extremely large beam footprint (typically
Figure 2. the diameter ranges from 200 to 1000 km) covering a
1) Space-borne platforms: Space-borne platforms, such as pretty wide area. However, it also incurs an extremely long
satellites, are deployed in space for communication [30]. They propagation delay (typically around 270 ms) [49] which
move around the Earth in specific orbits with varying angular makes it infeasible for low-latency communications. These
velocities, relying on gravity to provide the necessary centripetal satellites have been used in broadcasting services for a
force to maintain their orbits. The orbital period of a satellite very long time, but are not very suitable for low-latency
refers to the time required for the satellite to complete one emerging applications.
full revolution around the Earth. Due to differences in orbital • Non-Geostationary Earth Orbit (NGEO or NGSO)
periods, some satellites may not be visible to ground observers Satellites: As the name suggests, these satellites orbit
all the time. To characterize this, another term is used to denote around the Earth at a period lower than 24 hours, so
the duration of direct visibility for a satellite. This is known they are not stationary with respect to a ground observer.
as the horizon time, which refers to the maximum duration As the orbital period is smaller, their angular velocity is
during which the satellite is within the line of sight of a given also higher but the altitude is lower compared to GEO
ground station or receiver. Depending on their mobility with satellites. Depending on the heights, they can be divided
respect to the Earth, satellites can be classified into two broad into two categories: Low Earth Orbit (LEO) and Medium
categories: Geostationary (GEO/GSO) and Non-Geostationary Earth Orbit (MEO) satellites. Typically they are deployed
(NGEO/NGSO) Earth Orbit satellites. We discuss these two at a height ranging from 200 to 2000 km for LEO and
types of satellites below and summarize their key features in 2000 to 25000 km for MEO satellites. The horizon time is
Table III.

6
much smaller for NGEO satellites due to smaller orbital bandwidth with moderate latency requirements, for example,
periods, for example, the LEO satellites deployed at a multimedia applications; (2) mMTC: low power and bandwidth
height of around 500-600 km with an orbital period of and no strict delay requirements, for example, IoT; and (3)
1.5-2 hours can have a horizon time of 5-10 minutes URLLC: low latency and high-reliability requirements, for
depending on channel conditions. Due to smaller heights, example, remote medical surgery. However, future applications
these satellites have a smaller beam footprint (diameter such as AR, VR, Tactile Internet, HTC, intelligent transport
ranges from 5 to 500 km) with a much smaller propagation and automation, multi-sense communication, global ubiquitous
delay (typically around 20 ms for LEO satellites and 94 connectivity, etc. require extremely high throughput, low
ms for MEO satellites) [49] compared to GEO satellites. latency, high reliability, and ubiquity at the same time which
With their proximity to Earth and lower cost of launch and cannot be met with current 5G standards [4]. Based on the
maintenance, these satellites, especially the LEO satellites, characteristics, these new applications are classified into three
have gained significant attention in recent years. Their more new groups,
reduced propagation delay and path loss make them an 1) Ubiquitous MBB (uMBB): High throughput and extreme
attractive choice for facilitating high-speed data transfer coverage requirements, combining both eMBB and mMTC.
and real-time communication, so as to transform the future Examples: Digital twins, pervasive intelligence, global
6G connectivity. ubiquitous connectivity, etc.
2) Ultra-reliable low Latency Broadband Communication
TABLE III: Key features of different types of satellites. (ULBC): High throughput and low latency requirements,
Attribute GEO MEO LEO combining both eMBB and URLLC. Examples: HTC, AR,
Orbital height (km) 35786 2000-25000 200-2000 VR, Tactile Internet, multi-sense experiences, etc.
Typical diameter of 1000 500 100 3) massive Ultra-reliable low-Latency Communication
beam footprint (km)
Propagation delay (ms) 270 94 12-25
(mULC): Extreme coverage and low latency requirements,
Orbital period (hours) 24 12 1.5-2 combining both mMTC and URLLC. Example: Vehicle-to-
Horizon time 24 hours 1-2 hours 5-10 minutes Everything (V2X), intelligent transport and automation, etc.
The principal strength of NTNs lies in their extreme coverage.
2) Air-borne platforms: High Altitude Platform Systems Due to its extreme coverage, satellites can reach underserved
(HAPS) refer to air-borne platforms that can be used for or unserved areas such as islands, remote locations, ships,
wireless communication. Airships, balloons, and airplanes are airplanes, etc. where terrestrial communication is either difficult
the most prominent types of air-borne platforms in NTN. They or impossible to some extent. In times of natural disaster,
are viewed as air-borne counterparts of terrestrial base stations terrestrial links can be unavailable, in which case users
serving as High-altitude International Mobile Base Stations can benefit from the reliable backup of non-terrestrial links.
(HIBS) [50]. They usually operate at the stratosphere region This ensures resilient and robust communication with global
with an altitude of around 20 km and a beam footprint size connectivity which is considered to be one of the main features
with a diameter of several km. Despite it having a lot smaller of future 6G networks. With the advancements in antenna
propagation delay compared to space-borne platforms, it has techniques and miniaturization, high throughput satellites are
some additional challenges related to stabilization on air and also deployed in low earth orbits. Furthermore, the considerably
refueling. low latency for LEO satellite systems makes the satellite
While both satellites and airborne platforms can be utilized useful even for low-latency applications. Combining all these,
in the development of NTNs, satellites are often considered satellites are expected to be one of the major driving forces
more critical for discussions related to NTNs. This is due toward revolutionizing the future 6G applications extensively.
to their global coverage, stable and predictable orbits, high They can potentially serve any potential use case with low
scalability, and the existence of international regulations that latency, high throughput, and large coverage requirements in
govern satellites. As such, satellite networks comprise a 6G.
significant portion of future NTN-enabled communication
networks. Therefore, for the purposes of this article, we will C. General Architecture
primarily focus on satellite-enabled NTNs in the context of The general architecture for a satellite-based NTN comprises
6G communication technology. of following components as shown in Figure 3 [49]:
1) Satellite: Satellite is the key component of this
B. Use Cases in 6G architecture. It carries the payload between the UE and
NTNs are anticipated to be a major component of 6G the ground station. In the case of a transparent payload, it
communication systems, providing a wide array of vertical works as a simple relay that transmits the payload after
services, such as transport, health, energy, automotive, public RF filtering, frequency conversion, and amplification
safety, and many more. The International Telecommunication to the ground station (or UE). Conversely, in the case
Union (ITU) has identified three major categories of of a regenerative payload, it processes the payload after
applications for 5G that are based on network performance and modulation and coding on top of these actions, so it works
user Quality of Experience (QoE): (1) eMBB: extremely high like a BS that needs onboard processing capabilities.

7
Satelllite 
ISL

Service 
Link
Feeder 
Link
Next Data Network
Generation
UE Gateway Ground  Core
Terminal

Fig. 3: General communication architecture for satellite-based NTN.

2) Gateway: Gateway refers to the ground station that 3) C-Band (4-8 GHz): Primarily used for satellite television
connects NTN to the public data network. In the case of broadcasting.
a transparent payload, the ground terminal needs to be 4) X-Band (4-8 GHz): Primarily used in military
equipped with a terrestrial base station. In the case of communications.
a regenerative payload, the ground terminal only relays 5) Ku-Band (12-18 GHz): Primarily used for satellite
the received information to the core networks. broadcasting services.
3) User Equipment (UE): User equipment is either 6) Ka-Band (26-40 GHz): This frequency band is used
handheld or Very Small Aperture Terminal (VSAT) for high-speed data transmission, including broadband
within the coverage area of the satellite. internet access via communication satellites.
4) Feeder Link: Feeder link connects a satellite to the However, as per 3GPP, currently, two frequency bands (S
gateway. and Ka-band) are targeted in particular for integrated TNTN
5) Service Link: Service link connects UEs to the serving environments considering performance and regulatory concerns
satellite. [49]. The two target frequency bands are:
6) Inter-Satellite Links (ISLs): ISLs provide connectivity • S-Band: The downlink frequency band is 2170-2200 MHz
between multiple satellites deployed in NTN so that a and the uplink frequency band is 1980-2010 MHz.
payload can be delivered to other cells. • Ka-band: The downlink frequency band is 19.7-21.2 GHz
and the uplink frequency band is 29.5-30 GHz.
D. Fundamental Characteristics of NTN 2) Propagation Delay: Propagation delay is the time
duration taken for a signal to reach its destination. For
NTN presents us with several potentially promising use cases
communication signals, we can calculate the propagation delay
for the next generation of wireless networks as discussed in
for a signal by using the equation: t = dc where d is the
section II-B. However, as can be seen from section II-A, it also
distance between the source and destination and c = 3 × 108
has a number of unique characteristics due to the large distances
m/s is the speed of light. Considering the speed of light as
between satellites and ground transceivers, the high mobility
a constant, we observe the propagation delay for a signal
of NGEO satellites, and the proposed frequency range for
is proportional to the propagation distances. Satellites are
operation. In this subsection, we will delve into these features
located very far from the surface of the earth as discussed
of NTN and discuss their impact on network performances and
in section II-A. Consequently, the propagation delay is going
procedures.
to be extremely large for NTNs. The GEO, MEO, and LEO
1) Target Frequency Band: The allowable frequency range satellites can have a one-way propagation delay of about 270,
of operation is 0.5-100 GHz [51]. Traditionally, six major 94, and 20 ms respectively as shown in Table III. These values
frequency bands within this range are used in satellite are much larger, especially for GEO and MEO cases compared
communications, which are listed below: to conventional terrestrial networks, which generally have a
1) L-Band (1-2 GHz): Global Positioning System (GPS) very negligible propagation delay of around a few µs [49]. This
carriers and satellite mobile phones, e.g. Iridium, use extended propagation delay has an effect on different network
this band. procedures and performances for communication systems.
2) S-Band (2-4 GHz): This frequency band is used for 3) Propagation Loss: The propagation loss, or path loss,
weather radar, marine radar, etc. satellite communications. refers to the reduction in power density that an electromagnetic

8
signal experiences as it travels through space. The most compared to GEO satellites as discussed in section II-A, so
significant component of this path loss is the free space path they do not appear static from the earth’s surface. Due to the
loss, which is proportional to the distance between the source dynamic nature of NGEO satellites, they turn into moving
and destination and the frequency of the signal [52]. For NTNs, base stations in case of regenerative payloads. Due to this
this free space path loss is much higher (around 60-120 dB) high-speed movement of NGEO satellites, different mobility
than it is for terrestrial networks, due to the greater distances issues arise for NTN platforms.
between satellites and the use of higher carrier frequencies. 5) Coverage Area: One of the most important features
In fact, the Ka-band is not suitable for GEO satellites, as it of the satellites is the large beam footprint associated with
does not meet the minimum link budget for them. The basic them due to their long distances from the earth’s surface. This
path loss component also includes shadow fading [53], as with enables the network coverage of very large areas compared
traditional terrestrial networks. to the coverage area of terrestrial counterparts. It provides
us with ubiquitous network coverage including remote, even
isolated areas. However, this also creates the necessity for
200
modifications in existing timing and synchronization procedures
195 2 GHz-GEO for conventional terrestrial networks. The cell area is much
190
larger, so the UEs situated at the farthest side of the cells
Average Propagation Loss (dB)

experience a larger delay compared to the UEs situated closer


185 20 GHz-LEO-1400 km to the satellites [49]. So the timestamps for different network
180 procedures need to be modified according to the distances of
the users as we will see in the next subsection.
175 20 GHz-LEO-550 km

170 E. Challenges Associated with NTN


165 2 GHz-LEO-1400 km
NTN offers a range of unique features due to the large
160 distances between the transceivers and the high mobility of
NGEO satellites, as outlined in section II-D. These features
155 2 GHz-LEO-550 km
open up possibilities for new use cases, taking advantage of the
150 extensive coverage offered by the satellites. The high mobility
10 20 30 40 50 60 70 80 90
Elevation Angle (degree) of the satellites also allows for the deployment of satellites
across the globe to provide global network coverage. However,
Fig. 4: Propagation loss for satellites at different heights and NTN also presents a number of new challenges that must be
with different carrier frequencies. tackled due to these characteristics, which are discussed in
detail below:
In addition to that, there is attenuation due to atmospheric 1) Channel Estimation: In wireless communications,
gases that depends on frequency, elevation angle, altitude above Channel State Information (CSI) refers to the information
sea level, and water vapor density [54]. Another important which represents the state of a communication channel between
component is attenuation due to rain and fog, which is typically the transmitter(s) and the receiver(s); the process of obtaining
significant for frequencies above 6 GHz [55]. Additionally, this information is known as channel estimation. By having
scintillation corresponds to rapid fluctuation in amplitude and access to CSI, it is possible to adjust transmissions to the
phase of the propagating signal in the ionosphere (for Sub-6 current channel conditions, which is essential for achieving
GHz) and troposphere (for above 6 GHz) [56]. Depending on reliable communication with high data rates in multi-antenna
different scenarios, either flat fading based on ITU two-state systems with effective channel resources and interference
model [57] or fast fading [49] can be considered. The average management. There are numerous advanced approaches, such as
propagation loss for different types of satellites in different Maximum Likelihood estimation and Minimum Mean Square
frequency bands is illustrated in Figure 4. This high path loss Error (MMSE) estimation, for effective channel estimation
necessitates the need for efficient power allocation strategies in traditional terrestrial cellular networks. Nevertheless, these
in NTN. methods are not well-suited for NTN, particularly for LEO
4) Moving Base Stations: As discussed in section II-C, for satellites due to the inherent time-variant nature of the satellite
regenerative payloads, satellites can be used as base stations for communication channels. LEO satellites usually move from
improved network performances. The terrestrial base stations horizon to horizon in approximately 5-10 minutes, so a UE
are located at fixed locations. As the GEO satellites do not remains within the coverage of a specific LEO satellite for a
change their relative positions with respect to the ground very short time period. Furthermore, the propagation delay for
terminal, they appear static in nature with respect to the earth’s satellite networks, especially in the case of GEO satellites, is
surface, so the scenario is similar to terrestrial ones. However, considerably larger (250 ms RTT) in comparison to general
the scenario is very different for NGEO satellites where they terrestrial networks. Therefore, the CSI estimated by the LEO
need to maintain a lower height and higher angular velocity satellites can be outdated [58]. Because of these reasons, the

9
CSI estimation in NTN necessitates new efficient techniques received signal needs to meet the minimum RSRP requirement.
in addition to the traditional terrestrial estimation methods. That means the transmitted signal power needs to be much
2) Mobility Management: Since an NGEO satellite operates higher (typically at least 10 times the terrestrial transmitted
at a lower altitude, the coverage area of each NGEO satellite signals) than terrestrial signals. This poses a great obstacle for
is smaller than that of a GEO satellite. Typically around 5- traditional UEs as they have power limitations. Furthermore,
20, NGEO satellites form complex mega-constellations to the target frequency bands as discussed in section II-D for
sustain global coverage across the earth. The NGEO satellite NTN are limited. To support a large number of satellite UEs,
needs to move at a much higher speed than the earth’s this spectrum resource appears to be scarce in NTN systems.
rotational speed (can be up to around 7.8 km/s) to get the So efficient resource (spectrum and power) allocation strategies
necessary centripetal force to move around the earth at that are needed for integrating NTN into terrestrial networks.
low altitude. As a result, these satellites typically orbit around 5) Spectrum Sharing: As discussed in section II-D, the S-
the earth pretty fast (usually within around 2-10 hours) as Band and Ka-Band are the target bands for NTN. On top of
discussed in section II-A. This quick orbital motion poses a this limited spectrum allocation, we have interference from
great challenge for integrating NGEO satellites into traditional terrestrial users in these bands. In S-Band, we already have
wireless communication systems. Due to the smaller orbital existing terrestrial communication from 4G LTE devices. With
period, any specific terrestrial UE can be only visible to an the advent of mm-wave technology, terrestrial communication
NGEO satellite for a very short span of time, typically several is also using the Ka-band in 5G. So the satellite users will
minutes. So the UE needs to undergo multiple handovers within suffer from co-channel interference with the terrestrial users in
a short span of time interval regardless of its mobility [14]. both bands. To avoid this interference, we have to come up with
If the satellite covers an area using multiple spot beams, the efficient spectrum-sharing techniques to put the interference
scenario is worse because the spot beam is much smaller below a certain threshold ensuring proper decoding of the
compared to a total coverage area of an NGEO satellite. received signals.
So the UEs need to through multiple (beam) handovers 6) Effect on Network Procedures: Timing advances ensure
within a few minutes, even when they are stationary, for synchronous uplink transmissions for all UEs. The UEs can
seamless continuation of data sessions. This frequent handover be located at different distances from the gNB, so there is a
phenomenon in NGEO satellite networks creates a lot of differential propagation delay between different UEs. If the
overhead in communication channels, leading to an overall uplink reception is not synchronized, the gNB needs to make
degradation in network performance. sure the allocation of resource blocks to a specific UE does
3) High Doppler Shift: Doppler shift is the shift in the not include the resource blocks already in use by other UEs,
signal frequency due to the motion of the transceivers. In which is inefficient in terms of resource allocation. Due to
the case of NGEO satellites, satellites are moving at a very the long propagation delay, the TA is much larger than the
high speed under a specific constellation. Due to this relative transmission time slots in NTN compared to NR. Also due
motion between UEs and satellites, Doppler shift happens in to the mobility of LEO satellites, the delay is time-varying
the original signal frequency. Due to frequency offsets, UEs and TA needs dynamic updates for proper uplink alignments.
tune to different carrier frequencies than the original carrier The other processes affected by the long propagation delay are
frequencies. So the frequency synchronization is lost, and the Random Access, Hybrid Automatic Repeat Requests (HARQ)
UEs may interfere with the other users. This is known as Inter- procedures, etc [49]. These procedures need to be modified
Carrier Interference (ICI) between multiple UEs. Generally, properly to compensate for the long propagation delay.
even for the high mobility scenarios in terrestrial networks, the
frequency shift is pretty negligible, and so is the Doppler shift. F. Network Aspects
However, the frequency offset is pretty significant in NTN due On top of all these challenges, integration into existing
to the much higher speed of the NGEO satellites. The Doppler terrestrial networks comes with several open research issues
shift value mainly depends on the carrier frequency and height to be addressed. Computational offloading, which involves
of the satellites. For NGEO satellites operating at Ka-Band, transferring the computational burden to satellite networks for
the Doppler shift can go from 225 kHz to even 720 kHz [49] supporting devices with low computing power, particularly in
depending on the heights. This can cause significant ICI among IoT applications, gets complicated due to extended propagation
NTN users which requires efficient strategies for compensation delay and highly mobile NGEO satellites. Network routing has
of the Doppler effect. been studied for a long time, and network slicing has been
4) Resource Management: Spectrum and power are the discussed since the implementation of 5G. However, with the
two fundamental resources for any communication system. emergence of NTN, the integration of terrestrial networks calls
In NTN, the allocation of these two resources becomes an for research in this area with new effective strategies. The ever-
even more complex problem due to the high path loss and changing network topology of mobile NTN platforms makes it
limited spectrum availability. As discussed in section II-D, challenging to solve these problems in a complex environment.
the path loss associated with Non-Terrestrial Networks is
much higher compared to terrestrial networks. To correctly Key Takeaways: We note that satellite-based NTNs can be
decode the transmitted symbols from the received signals, the extremely useful to provide ubiquitous connectivity, service

10
continuity, and extreme reliability for diverse future 6G requires some initial set of parameters and initialization that
applications. Nevertheless, the extreme nature of the satellite need to be carefully tuned to achieve expected performances.
networks, e.g., long distance between transceivers, high 2) Training Phase: After setting up the preliminary model
mobility for NGEO satellites, spectrum sharing with existing with initialization, the most important phase – training begins.
services, and high propagation loss, etc. impose a highly The training data is provided as input to the initial model.
challenging environment to address for the research community. Typically the raw data collected for a specific problem may
These challenges also open a new door for AI applications not be properly structured to be used for the model. Moreover,
to move toward the future 6G revolution. In the following these data may contain redundant and unnecessary information
section, we discuss how AI can be incorporated so that we can which is not beneficial for learning the model. Consequently,
address the issues for potential TNTN integration for future data needs to be preprocessed in a suitable manner to have
6G networks. good performance. The features also need to be chosen in such
a way that they can capture the correlation for empowering
III. AI AND ITS R ELEVANCE TO NTN C HALLENGES the learning process. The output of the model is fetched
AI refers to the simulation of human intelligence processes for performance evaluation. Based on the feedback from the
(e.g. visual reception, speech recognition, computer vision, etc.) evaluators, the model is adapted to improve its performance.
by machines, especially computer systems. This human-level This whole learning process is known as ’training’.
cognitive ability is achieved through either some predefined The training can be either offline or online. In the case of
algorithms or learning from data-based approaches [25]. Many offline training, training data is generated in the pre-training
practical systems are very diverse and complex. The rule-based phase all at once and can be used to train the model. In this case,
approaches are not very feasible for these systems because of training continues until some predefined number of iterations or
an enormous number of scenario possibilities. As a result, the some constraints are met. In the case of online training, training
learning-based approaches show a lot more promise compared data is generated in an incremental manner instead of being
to predefined approaches in these types of real systems. As our generated all at once. This specifically suits the fast-changing
focus for this paper is mostly on NTN which has an extremely environment like wireless networks and provides benefits in
complex and time-variant topology, we focus on learning-based terms of scalability, adaptability, and real-time learning.
approaches when we consider AI. In this section, we give an 3) Testing Phase: After the training, we have a trained ML
overview of these approaches to get an intuition of how these model based on our provided data. This model can be used to
approaches can be useful in solving NTN issues discussed in later evaluate in the real environment. Similarly, as training data,
the next section. testing data can be generated and preprocessed for evaluation.
The performance evaluator provides the accuracy of the model
A. Machine Learning (ML) using the testing data as inputs. This whole process is known
Machine Learning (ML) is a special subset of AI approaches as ’testing’. In the case of offline learning, the testing phase
where machines learn algorithms to perform a task by starts once the training is done. On the other hand, in the case
generalizing from past experiences or historical data without of online training, the testing is generally executed in a parallel
being explicitly programmed for it [59]. The performance manner with training.
of human intelligence processes can be improved with each
iteration evolving through new feature extractions in ML B. Deep Learning (DL)
approaches. Generally speaking, each ML approach has three In complex real-world problems, feature extraction can turn
distinctive features, namely task, performance measure, and out to be extremely challenging using generic ML models.
experience [60]. A machine is first assigned to learn to perform There may be hundreds of parameters that need to be learned
a specific task. It starts with a model with an initial set of and the outputs may not be linearly correlated to the inputs. So
random parameters. Then at every iteration, the model is general ML models may not provide satisfactory performance
recalculated based on some performance measures, essentially in learning these problems. To facilitate mapping outputs to
representing the learning process. Thus utilizing experiences, it inputs, Neural Networks (NNs) [61] are widely used in ML
can learn how to perform the task properly which is the main frameworks. With the availability of a large amount of data,
goal of ML approaches. NNs have emerged as a key technology to be used in ML in the
A generic ML model works in three phases utilizing various recent past. The learning process can be largely benefited from
components [26] - Pre-Training Phase, Training Phase, and the introduction of NNs to deal with complicated large-scale
Testing Phase. We discuss the fundamental components of problems. This learning process involving NNs to estimate
these three phases as shown in Figure 5 below: the models is known as Deep Learning (DL) [27] which is a
1) Pre-Training Phase: The Pre-Training Phase includes the special important subset of ML.
choice of learning approach along with the necessary model NNs are inspired by the biological neural networks in the
initialization. The selection and design of the learning approach brain, more specifically the nervous system. To mimic the
greatly depend on the nature of the problem for the learning operation of the brain, the NNs are composed of multiple layers
systems. We show in the next few subsections different learning where each layer consists of multiple neurons followed by an
strategies for different problems. Each ML model generally activation function. Generally, the neurons in one layer are

11
Choice of Selecting values
Model
learning for model
initialization
approach parameters

Pre-Training Phase

Training Data
Training Data Preliminary ML Performance
model evaluation
Model
Training Phase Adaptation

Training
After
Environment/
Source

Trained ML Performance
Testing Data
Testing Data model evaluation

Testing Phase

Fig. 5: Generic ML model.

connected to the neurons in the adjacent layers. The connecting for mapping inputs to outputs. As the output label is clearly
edges have weights that represent the relationship between the defined, the model can improve its performance by comparing
neurons. Each layer output can be viewed as some intermediate its predicted outputs with the actual outputs [67]. Depending on
decisions which eventually result in the final output values. The the type of output labels, SL problems can be broadly classified
weights are generally trained through a number of iterations into two categories: regression and classification problems.
using backpropagation algorithms [62]. Generally, the cost In regression, the output labels are continuous, whereas, in
function associated with the model to calculate the difference classification, the output labels correspond to distinct classes.
between the predicted and actual outputs is not very simple,
so we use different numerical methods like Gradient Descent ML Approaches: There are a number of SL algorithms
[63], Stochastic Gradient Descent [64], Mini-Batch Stochastic to train the model. Regression [68] is a statistical method
Gradient Descent [65], Newton’s method [66] etc. and so on that investigates the relationship between a dependent (target)
to estimate the gradients of the cost function with respect variable to one or more independent (given) variables. In
to corresponding weights. At each iteration, the weights are this method, the functional mapping between inputs and
updated by an amount based on these calculated gradients and outputs is estimated by minimizing the error between the
a predefined learning rate. As we move towards the gradient predicted and actual outputs. Linear regression [69] focuses on
descent direction, it helps us to reduce the cost at every iteration. regression problems, whereas logistic regression [70] focuses on
In this manner, we can map the inputs to outputs through NNs. classification problems. Decision tree is used in classification
problems by forming a tree-like structure to learn the best
C. Learning Paradigms split at every node level based on a statistical measure
Depending on how an algorithm is being trained and on the like information gain [71]. The classification starts at the
basis of the availability of the output for training, learning root node and traverses down along the branches based on
approaches can be classified mainly into three categories: intermediate decisions till the leaf nodes which represent
Supervised Learning (SL), Unsupervised Learning (UL), and the final classification decisions. Naive Bayes Model [72]
Reinforcement Learning (RL). A short overview of different is a form of a simple probabilistic classifier that uses the
types of learning approaches is shown in Figure 6. These Bayesian Theorem to decide the classes under the strong
approaches are discussed below: assumption of feature independence. It is very useful, especially
1) Supervised Learning (SL): In an SL model, a training in high-dimensional classification problems. Support Vector
dataset containing a set of features as inputs and corresponding Machine (SVM) [73] is another important type of classifier
current outputs is provided to the model. The model with an that decides the splitting hyperplane between different classes
initial set of parameters is trained through a number of iterations by maximizing the distances between the nearest data point

12
Naïve Model Based Methods:
Bayes
Dynamic Programming
Monte Carlo Method
Reinforcement
Learning
Linear
Regression Model Free Methods:

Q Learning
SARSA
Policy Gradient
Decision Supervised Machine
Tree Learning Learning
K-means
K-means clustering
clustering
Support
Vector
Machine Unsupervised K-nearest
K-nearest neighbors
neighbors
Learning

Logistic
Principal
Principal Component
Component
Regression
Analysis
Analysis

Fig. 6: Taxonomy of ML approaches.

(in both classes) and the hyperplane. generate final classification results. A counterpart of CNN is
DL Approaches: Different DL approaches are also proposed the Deconvolutional Network (DN) [79] which takes the classes
in the literature to tackle complicated SL problems effectively. as inputs and generates CNN input features by comparing them
Perceptron [74] is one of the first NN architectures that have with actual CNN inputs.
been proposed. It is a single-layer NN which can do binary Another important type of NN is Recurrent Neural Networks
classification like logistic regression. The main difference is (RNN) [80] with back loops. So the neurons in a layer are
to introduce a simple activation function (step function) as a not only connected to previous layer neurons but also can be
first step to more complex and advanced architectures. The connected to the neurons from the subsequent layers. (Figure 9)
simplest multi-layer NN architecture is the Fully Connected This allows it to capture temporal correlation among different
Neural Networks (FCNN) (Figure 7). This is also known as layers and can be useful where decisions from past iterations
Multi-Layer Perceptron (MLP). It has multiple hidden layers or samples can influence current ones. However, they suffer
between the input and output layers without any back loops. from vanishing gradient issues due to long-term temporal
As the name suggests, all the neurons between two adjacent dependencies [81]. To tackle this issue more sophisticated
layers are connected to each other. Extreme Learning Machine architectures like Gated recurrent units (GRU) [82] and Long-
(ELM) [75] is a very special type of NNs where the neurons Term Short Memory (LSTM) [83] with special memory cells
are randomly connected and the training is done one-shot and gates are introduced. Reservoir computing (RC) [84] is
using least square fits. Another different type of NN is the a low training complexity RNN framework for computation
Deep Residual Network (DRN) [76] with extra connections where the inputs are fed into a fixed and non-linear system,
passing input from one layer to a later layer as well as the known as a reservoir, and then mapped into outputs from the
next layer. There is also Probabilistic Neural Networks (PNN) reservoir neurons. Liquid State Machines (LSMs) are examples
[77] which can recognize the underlying pattern and generate of RCs where the neurons are randomly connected receiving
the probability distribution function for different classes. time-varying inputs. Echo-State Networks (ESNs) are also
Convolutional Neural Networks (CNN) [78] is an important a type of RC that uses a sparsely connected hidden layer
type of NNs that can take multidimensional inputs like images (reservoir) with typically 1% connectivity. The connectivity
and classify them with great accuracy by discovering spatial and weights of hidden neurons are fixed and randomly assigned.
features (Figure 8). The CNNs are composed of convolutional 2) Unsupervised Learning (UL): In UL, a raw unlabeled
layers and subsequent pooling layers. The convolutional layers dataset is provided to discover existing patterns and features
divide the whole input into smaller blocks and scan through [85] using some statistical learning approach. This is very
them to learn the different features. The idea is to exploit useful when the data is not labeled. The algorithms find the
the high correlation among neighboring cells with reduced underlying structure of the data and predict the outputs by
complexity. A pooling layer is used to simplify this extraction adapting the model. Here the classes are not explicitly stated,
process by getting rid of redundant features. Often CNN is so the classes need to be generated based on the distribution of
accompanied by an FCNN to take care of nonlinearity and input features in multi-dimensional spaces. It can be even used

13
16@48x48 1x256
16@16x16
8@64x64

1x10

Convolution Max-Pool Dense


Input Layer ∈ ℝ⁴ Hidden Layer ∈ ℝ⁶ Hidden Layer ∈ ℝ⁶ Output Layer ∈ ℝ¹

Fig. 7: Fully Connected Neural Network (FCNN). Fig. 8: Convolutional Neural Network (CNN).

Input Layer ∈ ℝ⁸ Hidden Layer ∈ ℝ⁶ Hidden Layer ∈ ℝ⁴ Hidden Layer ∈ ℝ⁶ Output Layer ∈

Fig. 9: Recurrent Neural Network (RNN). Fig. 10: Autoencoder.

for generating labeled data to transform the original problem networks (GAN) [93] consist of any two networks, with one
into an SL problem, which is usually easier to solve. generating data (generative network) and the other judging
ML Approaches: There are a number of unsupervised the generated data (discriminating network). The prediction
learning algorithms in the literature. K-means Clustering [86] accuracy of the discriminating network is then used to evaluate
divides all the data points into K clusters in which each data the error for the generating network. This creates a form of
point belongs to the cluster having the nearest mean. The mean competition between the discriminator and the generator to get
of the data points in a particular cluster defines the center of the better in their corresponding tasks. We can also use ensemble
cluster. Principal Component Analysis (PCA) [87] is primarily learning methods [94] comprising multiple learning methods
used for dimensionality reduction of a high dimensional dataset. for better performances.
It reduces the number of correlated features converting them 3) Reinforcement Learning (RL): In RL, an agent learns to
into a set of uncorrelated features, which are also termed behave in a particular environment by performing thousands of
as principal components, using orthogonal transformation of actions and getting rewards or penalties based on those actions
basis vectors. Reducing the dimensions of inputs also reduces [95]. This behavior (formally known as policy) is defined by
the number of features to be learned. KNN [88] algorithm the set of actions the agent learns from its experiences. The
determines the k-nearest neighbors for all the data points of environment is defined by some mathematical models, the most
an unknown feature vector whose class is to be identified. common one is the Markov Decision Process (MDP) [96]. Here
DL Approaches: Generally speaking, autoencoders [89] are the feedback is neither provided using explicit labels like in
used to help reduce the noise in data. In an autoencoder, first, we SL nor the model is learned like in UL, but the behavior of an
encode a high dimensional input, then decode it to reconstruct agent is learned through the rewards or penalties based on the
the input at the output again (Figure 10). The intermediate set of actions taken by going from one state to another with a
hidden layer neurons represent a compressed representation transition probability. The goal is to find out the optimum policy
of the inputs getting rid of irrelevant and noisy components. so that the total reward can be maximized (or the total penalty
Some other variations of this architecture are variational [90], can be minimized) over a horizon of future time intervals given
noisy [91], and sparse [92] autoencoders. Generative adversarial the current state of the agent. In Figure 11, we show a generic

14
structure of an RL framework as an MDP model. this approach, instead of an iterative approach for updating
Q values in the Q-table, a NN is used to estimate the Q
function value approximately. To prevent a large overestimation
New Reward of action values, another DL framework is introduced on
New State Environment top of DQN for a fair evaluation of policies in Double
Deep-Q-Network (DDQN) [108], [109]. In Dueling Deep-
Q-Network [110] method, both state and action values are
Action
separately estimated. As the expected value function may
be overestimated as the expected value does not capture
the complete probability distribution of random variables,
Distributional Deep-Q-Network [111] is considered to update
Current State the Q function value based on its distribution. In the case
Agent of continuous action spaces, DL aided DPG method, known
Current Reward
as Deep Deterministic Policy Gradient (DDPG) Q-Learning
[112] provides better results. To deal with partial observable
environments, Deep Recurrent Q-Networks (DRQN) [113] by
Fig. 11: MDP model. introducing an LSTM layer in the FCNN architecture of DQN.

ML Approaches: Depending on whether an RL model is D. Synergy between AI and NTN


explicitly created or not, RL can be fundamentally divided Like many other fields, NTN is expected to be a
into two major categories: model-based and model-free RL major advancement in the realm of AI applications [16].
methods [97]. In the model-based methods, the transition More precise and pragmatic analytical models with reduced
probabilities between different states are assumed to be known, overhead consumption, and efficient algorithms with a lower
whereas, in model-free methods, these probabilities are learned computational complexity are the primary catalysts for the
through iterations. Dynamic Programming (DP) [98] is the deployment of AI-enabled NTN in next-generation wireless
most popular model-based method used in practice. However, networks. In the preceding sections, we give a concise overview
model-free methods like Monte Carlo (MC) method [99] are of NTN and AI to introduce these two crucial aspects of this
most commonly used due to their flexibility and practicality article. Now, we motivate our readers by outlining the primary
in real systems. Q-learning [100] is one of the most popular motivating forces behind combining AI and NTN for future
model-free methods where ”Q” refers to the expected rewards wireless networks.
for an action taken in a given state over the time horizon, 1) Complex Task Automation: In NTNs, the complexity of
known as the value function. Another important counterpart tasks and procedures involved in communication networks is
of Q-learning is State-Action-Reward-State-Action (SARSA) significantly heightened. These tasks encompass a wide range
learning method where the agent learns the optimal policy in an of operations, including resource allocation, channel estimation,
online fashion [101]. This Q-learning is extended to the context modulation, coding, and the intricacies of satellite management
of stochastic games [102] involving multiple agents in [103], control. Attempting to perform these tasks manually is not
which is also known as Multi-Agent Reinforcement Learning only challenging but often unfeasible. The complexity involved
(MARL) method. Another approach for model-free RL is to in optimizing network performance for satellites, in particular,
learn the policy directly instead of learning the value functions, renders manual operations insufficient. Moreover, these tasks
which is known as Policy Gradient (PG) method [104]. If the require meticulous precision to ensure uninterrupted service
policy gradient is estimated in a deterministic fashion, it is and mitigate potential hazards. However, the advent of ML
called Deterministic Policy Gradient (DPG) method [105]. To and DL approaches has brought forth the ability to automate
have the benefits of both approaches, the Actor-Critic (AC) these multifaceted tasks by leveraging feedback obtained from
method is proposed in [106] where the critic estimates the the network. By harnessing ML and DL, not only can accurate
value function and the actor updates the policy gradient in the actions be executed, but complex chains of procedures can
direction suggested by the critic. also be automated seamlessly without the need for human
DL Approaches: General RL approaches without NNs intervention.
can work well for small-scale experiments. However, when 2) Tractable Solutions: The deployment of next-generation
the action or state space is really large, the computation NTNs is more complex than any other previous-generation
complexity exponentially increases. This phenomenon is quite cellular network due to its multifaceted architecture. For
common in practical systems like communication networks. To instance, the integration of satellite networks introduces a
estimate the value functions or policies with RL frameworks significant number of additional parameters to consider for
with large state or action spaces, DL approaches can be optimum network performance [21]. However, this can result
very useful. This learning approach is also known as Deep in computationally intractable solutions for practical networks,
Reinforcement Learning (DRL). The most popular and simplest even if the solutions are computationally tractable, they may
DRL approach is Deep Q-Network (DQN) [107] method. In be very inefficient. Resource management in TNTN networks

15
is a prime example of this, as resource optimization in TNTN users. This might lead to a large overhead in communication
networks often turns into non-convex optimization problems, channels, resulting in a decrease in the overall throughput of
where only suboptimal or heuristic solutions can be obtained the network. AI can be used to reduce the control overhead of
using numerical techniques [114], [115]. Fortunately, DL NTNs significantly. For example, to calculate the Doppler shift,
techniques can approximate complicated functions with the help the UEs must be provided with the latest ephemeris information
of NNs, as discussed in section III. As a result, complicated of the satellites [118]. However, this would cause an immense
network functionalities can be characterized with NNs and overhead and a decline in the achievable data rate for the UEs.
resource management issues can be solved in a tractable manner Alternative DL techniques can be employed to estimate the
[116], [117]. Doppler shift without requiring any ephemeris information
3) Data-Driven Decision Making: Although probabilistic from the satellites [119]. This leads to a significant decrease in
and deterministic models can be used to model NTN transmission overhead over communication channels, resulting
functionalities, these models are often derived using very in superior network throughput.
strong assumptions to get the general closed-form expressions, 7) Real-time Implementation: Network optimization and
resulting in significant deviations in performances in management decisions in NTNs usually require real-time
simulations compared to real networks. In contrast, ML models implementation, usually in the order of milliseconds to tens of
are obtained based on data, which means different scenarios milliseconds. Consequently, complex algorithms cannot be used
are taken into account during training, without the need for to obtain these real-time decisions. In most cases, the algorithms
making any assumptions. For instance, different propagation become either heuristic or offline. To have an online adaptable
loss models for different scenarios are adopted, but these models approach, AI techniques can be considered as a suitable option.
are often accompanied by certain assumptions that can lead For example, an online DRL-based approach can be used to
to significant discrepancies in real networks. In the case of obtain resource management decisions in real-time and ensure
NTNs, the situation is even worse due to the introduction of proper utilization of available resources in NTNs [120]. This
more path-loss components, such as atmospheric attenuation, is particularly valuable in latency-sensitive decision-making,
scintillation, etc. AI approaches, on the other hand, have the such as scheduling, handover decisions, etc.
potential to capture real scenarios with more precision than 8) Leveraging CSI: In communication networks, CSI is
theoretical models. fed back to the BS from the UE to assist in selecting
4) Adaptability and Learning: AI algorithms can adapt different schemes - such as modulation, channel coding, etc.
to changing network conditions and learn from experience. - for improved network performance. Leveraging this data,
Through ML techniques, AI can continually improve its which contains the general state of the channel, different ML
performance, optimize network operations, and adapt to approaches can be benefited. For example, RL approaches can
evolving user demands. By leveraging AI techniques such use this data to train models. This implies that we do not
as RL and predictive modeling, NTNs can adaptively allocate need to modify the information segments sent from the UE to
resources, optimize network parameters, and proactively detect the BS for deploying these RL schemes, but rather can rely
and mitigate faults. AI enables NTNs to dynamically respond on feedback already existing in the communication networks.
to changing network conditions, enhance operational efficiency, This again illustrates the capability of AI to integrate into
and ensure uninterrupted service delivery. The ability to learn traditional communication networks without any additional
from data and make intelligent decisions without human overhead costs.
intervention empowers NTNs to continually improve their
performance, optimize resource utilization, and deliver reliable Key Takeaways: The data-driven ML and DL approaches are
connectivity in complex and evolving environments. the major AI technologies for empowering satellite-based NTNs
5) Reduced Computation Complexity: Obtaining optimal for the next-generation 6G networks. Due to their inherent
algorithms for various challenges in NTNs can be a daunting capability of capturing practical scenarios with real-time
task. Even if such algorithms are derived for complex systems, tractable solutions, different learning paradigms, such as SL,
their computational complexity often renders them impractical UL, and RL can be extremely beneficial in addressing various
for real-world implementation. This complexity arises from challenges associated with future NTN-empowered 6G networks.
the vast number of variables that govern different network Consequently, there have been a lot of research activities to deal
procedures in NTNs. However, data-driven AI techniques offer with these challenges in the literature. In the following section,
a promising solution by reducing the dimensions of high- we explore various current research thrusts for incorporating
dimensional data through feature learning. Particularly, DL AI into NTN in greater detail to get insight into potential
approaches have demonstrated remarkable effectiveness in research scopes.
extracting implicit features from complex systems. As a result,
these techniques prove highly valuable in addressing the diverse IV. C URRENT R ESEARCH T HRUSTS
challenges encountered in NTN environments. AI is considered to be one of the major driving forces
6) Reduced Transmission Overhead: In some cases, for empowering next-generation NTNs. To unleash the great
traditional methods heavily rely on the exchange of information potential of AI in this field, exploring potential research thrusts
between various network participants, such as satellites and of AI-NTN integration is extremely important. The scarce

16
network resources, high mobility, and complex and time- non-convexity of the problem, obtaining a globally optimal
varying hierarchical network topology give rise to different solution with efficient algorithms gets difficult. In [115], the
unique challenges in realizing NTNs for future wireless steepest gradient descent algorithm is chosen to get the sub-
networks. Conventional optimization and estimation approaches optimal solution using the optimal set of precoding vectors.
are not always feasible for practical deployment in real Some heuristic iterative approaches are also proposed in [114],
networks. Various data-driven AI techniques are being explored [130], [131] to tackle these non-convex problems in a practical
by researchers due to their inherent capability of learning the and feasible manner. Different meta-heuristic approaches like
surrounding environment and providing superior performances Genetic Algorithm (GA) [132], Simulated Annealing (SA)
in practical scenarios. In this section, we discuss the current Algorithm [133], Particle Swarm Optimization (PSO) [134],
research thrusts for AI applications into NTNs: and combined metaheuristic approaches like GA-SA [135]
have been considered to generate suboptimal solutions with a
A. Beam Hopping reduced amount of computational complexity.
Modern communication satellites form multiple beams to The main challenge in designing a beam-hopping pattern
support a large number of users over a large area through in an optimization framework lies in the large search space
spatial multiplexing into different NTN cells. Each satellite associated with an optimal solution. The size of the search
can effectively reuse the allocated spectrum with very low space for finding out an optimal beam hopping pattern
co-channel interference as well as provide strong signals at the scales exponentially with the number of beams in the
ground user terminals with relatively low transmission power satellite networks. Modern satellites can have hundreds to
using beamforming techniques. However, due to the high cost thousands of beams depending on their coverage area, so
and low availability of onboard processing computing resources the computational complexity becomes pretty high, and the
in satellite systems, mostly simple fixed beam allocation computation time becomes pretty large to find out the exact
policies are used in traditional satellite communication. These solutions. The low-complexity suboptimal solutions using
strategies lack the flexibility to adapt to the temporal and iterative and metaheuristic approaches achieving satisfactory
spatial variation of traffic demands in real satellite networks. performances in real networks are not very abundant. In this
Beam hopping is a technique for allocating beams in a flexible context, the DL approaches turn out to be a suitable alternative
manner so that these changes can be addressed efficiently. It for this problem.
refers to a procedure for activating different beams according In [116], [117], an SL approach is considered by forming
to the current demands of an NTN cell covered by those labeled datasets with beam hopping patterns as outputs and
beams, so effectively hopping the set of active beams from one channel matrix, transmission power, and traffic demand as
combination to another [121]. In Figure 12, a simple beam- inputs. First, a mixed integer linear problem formulation for
hopping scenario is depicted, where we have different NTN matching the offered capacity to traffic demands is reduced to
cells with varying demands. We classify the cells into three a simple linear programming problem. A training dataset is
different categories, e.g., high, medium, and low, based on generated using conventional optimization algorithms and a DL
their traffic demands. In the first scenario, the low-demand model is trained on this dataset by considering beam hopping
NTN cells, e.g. cell 9, have less number of active beams than patterns as labels. Furthermore, the optimization framework
high-demand NTN cells, e.g. cell 6, even lesser than moderate- can be potentially transformed into an RL problem to capture
demand NTN cells, e,g, cell 1, 2, 5 or cell 13. However, due to the optimal beam-hopping pattern in a learning environment. In
mobility or change in traffic patterns, the traffic demand in cell [122], [123] the transmission delay is minimized considering
13 reduces and in cell 5 increases. As a result, we can see the the power and beam allocation constraints using a DRL
intensity of the beams also changes accordingly in these two approach. The state space consists of the average transmission
cells at a later time, and a new beam-hopping pattern emerges. delay and the buffer length with beam hopping pattern as
The key question of beam hopping is to find out which beams actions and the negative Hadamard product of the current states,
need to be activated when and for how long while maximizing the negative of total queuing delay as the reward function. In
the network performances given the capacity constraints [128]. [124], a combined DRL-metaheuristic approach is considered
This can be effectively formulated as an optimization problem to optimize both the throughput and delay fairness while
considering different network performance metrics such as at the same time designing different reward functions for
system throughput, delay, fairness, etc. as the objective(s) the two cases. In [125], [126], a network consisting of real-
along with power and spectrum constraints. In [129], a time and non-real-time traffic is considered. A multi-objective
convex optimization framework with an objective to match the problem minimizing the transmission delay for real-time traffics,
system capacity to traffic demand along with power allocation maximizing the throughput for non-real-time traffics as well as
constraints is considered. This yields a close-form solution overall delay fairness is considered. Individual reward functions
giving insights into resource allocation policies for maximizing are designed to capture each of the goals. In [127], a cooperative
network performances from different perspectives. However, multi-agent framework is considered to dynamically allocate
from the perspective of real networks, the convex objective the power and bandwidth to illuminating beams optimizing
function is not very realistic, so the results are not applicable throughput and delay fairness using a DDQN. In table IV,
to real networks in a straightforward manner. Assuming the we summarize the AI approaches for beam-hopping in NTN.

17
Low capacity beams

High capacity beams

Medium capacity beams

(a) Scenario 1 (b) Scenario 2

Fig. 12: A simple beam hopping example in satellite-based NTN.

TABLE IV: Summary of AI approaches for beam-hopping in satellite-based NTN.


Reference Target Optimization Objectives Learning Approach DL Comments on Models
Tool
Throughput Tx Delay Delay Capacity- SL UL RL RL Model DL Model
Fairness Demand
Ratio
[116] FCNN
[117] FCNN
[122] DQN FCNN
[123] DQN CNN
[124] Proximal Policy FCNN
Optimization
[125] DQN CNN
[126] DQN CNN
[127] DDQN FCNN

As traffic demand changes with time, recursive architectures needs to undergo multiple handovers within a short span of
such as RNN, ESN, etc. should be also explored to design the time interval regardless of its mobility status for seamless
NN for DL architectures used to address beam-hopping issues. continuation of the data sessions [14]. This frequent handover
Also, distributed learning architectures can be useful to design
phenomenon in LEO satellite networks creates a lot of overhead
efficient beam-hopping schemes. in communication channels and results in overall degradation
in network performances. Moreover, due to lower altitudes, the
B. Handover Optimization coverage area of an LEO satellite is much smaller compared
In order to maintain an orbital path around the Earth at to a GEO satellite. Typically a large number of LEO satellites
a lower altitude, an LEO satellite needs to move at a much are needed to maintain global coverage across the Earth
higher velocity (around 7.8 km/s) compared to a GEO satellite. with complex constellations. In the case of an ultra-dense
So these satellites orbit around the Earth typically within constellation of LEO satellites (like Starlink), each UE is
2 hours [49]. Due to the smaller orbital period, any LEO generally covered by multiple satellites, so the UE can choose
satellite remains visible to a ground UE for only several the best one from the list of suitable candidate LEO satellites.
minutes which poses a great challenge for integrating these This problem can be potentially solved in an optimization
satellites into the traditional terrestrial networks. The UE

18
LEO 4

LEO 2 LEO 1: Moving out of coverage


LEO 3
LEO 2: More network load,
LEO 1 more service time, bad channel
condition

LEO 3: Less network load, less


service time, moderate channel
condition

LEO 4: Currently not in the


horizon, will provide excellent
channel condition with moderate
network load and best service
time

UE that needs
handover

(a) Phase 1

LEO 4

LEO 2 LEO 3
LEO 1: Moved out of coverage

LEO 2: More network load, less


service time, bad channel
condition
LEO 1
LEO 3: Moved out of coverage

LEO 4: Great channel condition,


best service time, moderate
network load

UE that needs
handover

(b) Phase 2
Fig. 13: A typical handover scenario in LEO satellite-based NTN.

19
TABLE V: Summary of AI approaches for handover optimization in satellite-based NTN.
Reference Target Optimization Objectives Learning Approach DL Comments on Models
Tool
Signal Network Service Tx Delay SL UL RL RL Model DL Model
Quality Load Time
[136] Q-Learning
[137] Online Q-
Learning
[138] Multi-Agent Q-
Learning
[139] DQN CNN
[140] CNN
[141] Auction based FCNN
approach
(Game theory)
[142] DDQN FCNN
[143] Successive FCNN
DQN

framework jointly considering different handover decision handover criteria like Quality of Service (QoS), service time,
criteria. etc. [145], [146]. A bipartite graph matching problem between
In traditional terrestrial communication networks, a UE the satellites and the users [147] is also considered in the
chooses to attach to a BS based on periodic signal power literature to provide the optimal handover decision for satellites.
and quality measurements, such as Reference Signal Received In addition, a network flow-based cost minimization approach
Power (RSRP), and Reference Signal Received Quality (RSRQ) is considered in [148] by weighting each edge as the QoS
for the link between the BS and the UE. Moreover, load perceived by the user. A handover strategy based on a potential
balancing is also important to ensure no BS gets overloaded game in a bipartite graph is considered in [149]. Different
or underloaded as a result of initial attachment or handover heuristic algorithms are also proposed to solve the problem
procedures. However, for LEO satellite networks, choosing a [150]–[152]. A dynamic optimization problem is considered to
satellite BS merely based on signal measurements and network be solved based on forecasting in [153]. Channel reservation
load information is not enough due to the limited visibility is also associated to design an efficient handover algorithm
time of these satellites. So the UE also needs to take the while balancing the load for satellites in [154].
potential service time into account before attaching to a satellite. An RL framework can be naturally adopted for solving
In Figure 13, a simple general handover scenario involving this problem considering the handover criteria as states and
multiple LEO satellites and a single UE is shown. Here in UEs as agents who act by selecting a suitable LEO satellite
phase 1 (Figure 13a), the UE is connected to an LEO satellite, and collecting a reward based on the network performances.
indicated as LEO 2, and it needs an immediate handover to In [136], only the overall signal quality of the network is
some other neighboring satellite covering the UE, either to maximized using the RL approach without considering any
LEO 1 or LEO 3 as it will soon lose the coverage of LEO 2. other criteria. In [137], [139], a multi-objective optimization
As it is shown in Figure 13a and 13b, LEO 2 has more network problem considering satellite load and signal quality constraints
load and bad channel condition, but offers more service time; is solved using the DRL approach. In real networks, we have
LEO 3 has less network load, moderate channel condition, but a large number of UEs; the handover decision for one UE
offers less service time. Furthermore, a new satellite, LEO can affect another UE, so the handover problem needs to be
4 becomes available for providing coverage to the UE with solved in a cooperative manner. In [138], a MARL framework
excellent channel conditions, great service time with moderate is considered where multiple UEs cooperatively optimize the
network load. So even for a simple case involving 4 LEO number of handovers in the whole network considering different
satellites, the handover decision is not straightforward for a handover criteria. In [140], using graph matching, a database of
single UE. So finding a suitable handover strategy for a UE optimum handover decisions in satellite networks is produced
jointly considering all handover criteria becomes a complicated and later it is used to predict handover decisions using a CNN
problem to be solved. model. Advanced DL architectures like Auction based DL [141],
Different simple greedy strategies, like Maximum Service DDQN [142], Successive DQN [143], etc. are also considered to
Time (MST), Maximum Signal Quality (MSQ), or Minimum provide optimal handover decisions. In table V, we summarize
Network Load (MNL) [144] are adopted to solve the problem in the AI approaches for handover optimization in NTN involving
a simple heuristic manner but none of these approaches provide LEO satellites. However, as all the system models consider
the optimal solution. The satellite handover scenario can be the agents located at the UE side, it does not comply with
also modeled as a directed graph between different satellites the current standards where the handover decision is generally
for a single user where the weights can be set by different controlled by the BSs (satellites in this case). Furthermore,

20
the distributed multi-agent learning architectures give rise to than one frequency position in Orthogonal Frequency Division
stability issues in real implementations. The handover criteria Multiplexing (OFDM) carrier in a 5G integrated NTN system.
also need to be carefully investigated to provide the agents These different theoretical approaches can estimate the
with the necessary information to learn the mobility behavior Doppler shift with a certain accuracy in different scenarios.
of the environment. These issues need to be resolved in an However, the methods are generally very cumbersome due
efficient manner for future research work in this domain. to the complexity associated with the orbital mechanics of
the satellites. Most of these methods come with simplifying
C. Doppler Shift Estimation assumptions to keep the approach feasible for practical systems,
As the LEO satellites move around the Earth typically at a thereby affecting accuracy. Moreover, due to the constant high-
very high speed, both the satellite and ground user transceivers speed movement of LEO satellites, the wireless environment
experience a large Doppler effect due to their relative velocity. associated with it becomes time-variant. The computation
If the transmitter moves towards (or away from) the receiver, complexity increases more to model these temporal variations
the emitted signal from the transmitter may take less (or more) using traditional estimation approaches. Additionally, the UEs
time to reach the receiver depending on the direction of the may need the ephemeris information of the satellites to compute
movement, hence the frequency of the signal increases (or the Doppler shift associated with its motion, which creates
decreases). This shift in signal frequency due to the motion of large additional overheads in the communication channels. To
the transmitter, the receiver, or both refers to the Doppler shift. characterize this Doppler effect, ML-based algorithms seem
If the original frequency is f0 , the Doppler shift due to the to appear as potential practical alternatives to the research
motion of transceivers towards some specific direction with community.
some specific relative velocity can be given by: In wireless communication systems, due to the mobility of
the transceivers, the channel between the transceivers changes
v significantly resulting in received signal power variation and
δf = f0 × × cos(θ)
c Doppler shift. So, intuitively, the CSI of this channel should
Here contain information about the Doppler shift. This idea has been
v = The relative velocity of the transceiver already explored in terrestrial networks to generate a model
θ = The angle between the direction of the transceiver and the using ML [119], [155], [156]. The ground truth values or the
direction of the propagating signal labels are usually generated using the ephemeris information.
For LEO satellites, due to high mobility, This frequency Different channel characteristic variables like Rician K factor,
offset is pretty significant (48 kHz with a center frequency azimuth Angle of Arrival (AoA) width, mean azimuth AoA
of 2GHz [49]). Due to these frequency offsets, UEs tune to and channel estimation errors are generated randomly, and
some different carrier frequencies from their originally assigned averaged Power Spectral Density (PSD) is used as inputs with
carrier frequencies. This may lead to ICI between multiple some preprocessing to a multi-layered FCNN to estimate the
UEs as discussed in section II-E. Doppler shift in [155]. In [119], RSRP values mapped from
There have been significant efforts to characterize the an ambiguity reducer are used to generate the weights for an
Doppler effect for LEO satellites since the launching of MLP. In [156], different time and frequency domain signals
communication satellites. In [158], an equation for Doppler shift with various modulation schemes, delay profiles, and Signal
is derived for the simple case of LEO satellites with circular to Noise Ratio (SNR) have been used as inputs to a hybrid
orbits in the equatorial plane and ground observing points on CNN-LSTM model to estimate the Doppler shifts. In NTN, the
the equator. In [159], the Doppler shift is analytically derived research in this domain is still at the early stage The estimated
assuming the trajectory of the satellite with respect to the earth CSI is used as input to a CNN model to estimate the Doppler
by a great circle arc and the speed of the satellite as constant. shift in [157]. In the future, other potentially efficient SL
In [118], the Doppler shift is characterized by considering a models can be also explored to generate the real-time accurate
new orbit generator using different orbital parameters through a Doppler shift in an online manner. In table VI, we summarize
rigorous analysis. UEs with Global Navigation Satellite System the AI approaches for Doppler shift estimation in NTN. Even
(GNSS) can get the global positioning of satellites and estimate though the DL techniques are found to be useful in estimating
the amount of Doppler shift needed to be addressed for the Doppler shift using channel parameters, Doppler shift can
next transmission slot [160]. However, this increases the cost be also estimated by analyzing the predictable trajectory of
and complexity which may not be feasible for ground UEs [58]. the satellites. Complexity analysis is required to justify the
Additionally, The GNSS signals are weak, not ubiquitous, and applicability of these DL architectures replacing the state of
susceptible to interference and spoofing. Recently, there have art methods in real systems.
been also efforts to estimate the Doppler shift in LEO satellite
systems using various other approaches, such as stochastic D. Spectrum Sharing
geometry [161], Maximum A Posteriori (MAP) [162], algebraic In traditional communication systems, satellite, and terrestrial
solutions [163], two-stage estimators consisting of time-varying cellular networks generally occupy different frequency bands,
Burg spectral analyzer and alpha-beta filter [164] etc. In [165], so they do not interfere with each other. However, the satellites
the Doppler shift is estimated using reference signals in more in the new integrated TNTN environment for 6G are expected

21
TABLE VI: Summary of AI approaches for Doppler shift estimation in satellite-based NTN.
Reference Input to the ML model Learning Approach DL Comments on Models
Tool
SL UL RL RL Model DL Model
[155] Averaged PSD FCNN
[156] Modulation scheme, delay profiles, SNR CNN-LSTM
[119] RSRP FCNN
[157] CSI CNN

TABLE VII: Summary of AI approaches for spectrum sharing in TNTN.


Reference Problem Insight Learning Approach DL Comments on Models
Tool
Spectrum Spectrum Spectrum SL UL RL RL Model DL Model
Sensing Occupancy Access
Prediction
[166] CNN-BiLSTM
[167] CNN
[168] CNN-LSTM
[169] CNN-LSTM
[170] Modified Q-
Learning
[171] SVM-CNN
[172] MADDPG FCNN

to use the same S and Ka-Band as discussed in section II-D.


This improves the overall spectral efficiency of the integrated
networks as well as provides a better QoE for the users. Satellite
However, as both the satellite and the ground network use the
same frequency band, the signals produced by them interfere
with each other, i.e. cause Co-Channel Interference (CCI) to
each other. In Figure 14, a simple spectrum-sharing scenario in
the downlink channel in an integrated TNTN network is shown.
The satellite user is connected to a satellite and the downlink
channel is indicated using the green link. There are three more Desired 
terrestrial BSs using the same channel as the satellite provide Signal
CCI to the satellite user (indicated by red links).
In TNTN, the spectrum-sharing phenomenon needs more
attention because we have a hierarchical network scenario
consisting of non-terrestrial and terrestrial BSs as shown
in Figure 3. To support this complex topology in a single
framework, we need to come up with efficient spectrum-
Interfering 
sharing strategies causing low interference to the users [173]. Signal
In conventional spectrum sharing methods, we use efficient
Satellite 
frequency reuse, leveraging directional antennas, adaptive UE
power control, etc. methods to mitigate the effect of CCI. Terrestrial
However, traditional four-color frequency reuse can effectively BS

reduce the level of interference at the expense of more spectrum.


beamforming approach can reduce the interference greatly, but
that too comes at the cost of increasing complexity.
Fig. 14: General spectrum sharing scenario in TNTN.
To tackle this situation, a process called spectrum sensing is
introduced in cognitive radio networks, where the unlicensed
users can sense the occupancy status of the target band using good performance but with more computational complexity
some radio sensing method [174]. The popular spectrum (CSD and EVD). For these reasons, ML has been adopted
sensing methods are Energy Detection (ED) [175], Cyclo- for spectrum sharing to capture the correlation with a reduced
Stationary Detection (CSD) [176], Eigen Value-based Detection computational complexity which can be extended to integrated
(EVD) [177] etc. However, these methods either are simple satellite-terrestrial network scenarios [173].
with poor performance in low SNR scenarios (ED) or provide Different intelligent learning approaches are adopted to

22
tackle the spectrum-sharing problem for next-generation TNTN linear and non-convex due to objective function nonlinearity
networks [178]. In [166], a spectrum-sharing strategy is and complex constraints involving Signal to Interference and
developed for LEO satellites from the GEO satellite spectrum Noise Ratio (SINR) [188]. Furthermore, the carrier assignment
historical occupancy data using a CNN-BiLSTM model. Here indicator variables result in a mixed-integer programming
the LEO satellite users are considered as unlicensed secondary problem [188]. Hence, no optimal solution can be determined
users and the GEO satellite users are considered as the licensed using the known methods of convex optimization with low
primary users. In [168], a CNN-LSTM-based spectrum sensing computation complexity. Instead, suboptimal and metaheuristic
method is introduced for satellites to capture the spatial and approaches are proposed, which tackle parts of the problem
temporal correlation effectively for spectrum occupancy of separately and then iteratively tune the parameters [188].
satellite systems. In [169], a CNN-LSTM model is introduced Different suboptimal approaches are adopted to optimize
to predict the frequency assignment for satellites based on resource allocations [189]–[193] for satellite systems. However
historical data. In [170], a modified Q-Learning algorithm to reduce computation complexity several heuristic [194]
is used in an RL setup for the adaptive selection of access and metaheuristic approaches like GA [195], PSO [196] are
and modulation schemes for NGSO satellites in an NGSO- explored to reach the suboptimal solutions within a shorter
GEO system. In [171], an SVM model is first used for low computation time.
complexity spectrum sensing, then a CNN-based spectrum To tackle this resource allocation issue in real satellite
prediction model based on historical data is developed. In networks in a practical manner, ML approaches are being
[172], a cooperative Multi-Agent Deep Reinforcement Learning started to be adopted by the research community. A DL
(MADRL) framework is considered for bandwidth management framework is combined with conventional optimization
in a game-theoretic model minimizing inter-beam interference. algorithms to overcome the computation complexity issue of
In [167], a CNN-based spectrum reconstruction method from the conventional approach in [185], [186] by reducing the
incomplete data is discussed for satellite networks. In table feature space. A model-free DRL framework is adopted for
VII, we summarize the AI approaches for spectrum sharing in power allocation of high throughput satellites in [182]. A Q-
TNTN. However, these spectrum sensing decisions need to be learning-based long-term capacity allocation algorithm in an RL
in real-time to increase the overall throughput of the secondary framework is introduced for a heterogeneous satellite network
users; this means the conventional LSTM architectures need to in [179]. In [120], an Actor-Critic and Critic Only based RL
be replaced by efficient low-complexity ESN architectures to framework is considered for optimal resource allocation for
tackle this in an online manner. Furthermore, spatial spectrum LEO satellite networks. Different advanced RL frameworks
sharing scenarios need to be also considered along with the like DRL [180], [181], [197], Multi-objective DRL [184]
state of art temporal spectrum sharing scenarios leveraging the and MADRL [183] are also proposed to solve the resource
benefits of future 3D SAGIN networks. allocation issue for satellites. In table VIII, we summarize the
AI approaches for resource allocation in TNTN. As both power
E. Resource Allocation and spectrum are equally important and scarce resources for
Power and spectrum are the two fundamental resources NTNs, new DL architectures need to be explored to jointly
for any type of wireless network, and NTN is also not an allocate these resources in an efficient manner for NTNs.
exception. The spectrum allocation is typically performed
by the assignment of carriers with equal width from the F. Computation Offloading
allocated spectrum for that service. Hence, the number of One of the most important applications of satellite-terrestrial
assigned carriers and their positions are optimized to achieve integrated networks is enhancing the computation capabilities
good signal quality with the minimum resources. Often, the of existing terrestrial network architectures leveraging satellites.
carrier assignment is achieved by the orthogonal splitting of With traditional terrestrial networks, supporting a diverse
the spectrum resources, which is also known as frequency set of new applications like AR, VR, etc. with high data
reuse. However, the strict orthogonality of the frequency bands processing and extremely low latency requirement can get very
cannot be always achieved to achieve better spectral efficiency. challenging. Generally, terrestrial BSs are deployed sparsely
In case of lack of orthogonality of spectrum resources used by due to high infrastructure and maintenance costs. Due to
different transceivers can also introduce CCI. The interfering resource constraints, in case of the high demand for data
signal can be effectively suppressed by increasing transmission processing for these types of applications, the BSs need to
power for the original signals. However, as power is also a offload the computation tasks to the terrestrial cloud via the
scarce resource, we cannot increase the transmission power satellites [207]. However, due to longer propagation delay, the
indefinitely and increasing transmission power will result in a latency requirements set by the applications are difficult to
decrease in energy efficiency. For better resource utilization, a be met [208]. Nevertheless, due to the emergence of LEO
more robust radio resource management needs to be designed satellites with comparatively low propagation delay, the overall
by controlling both power and spectrum resources [187]. delay is considerably reduced. Also instead of acting as relays,
Generally, an optimization framework can be considered to the satellite can now also do the processing works acting as
optimize the system performance with bandwidth and power edge-servers. So we can consider a three-level hierarchical
constraints. In most cases, such optimization problems are non- architecture comprising of ground UEs connected to terrestrial

23
TABLE VIII: Summary of AI approaches for resource allocation in satellite-based NTN.
Reference Target Objective Learning Approach DL Comments on Models
Tool
Spectral Efficiency Energy Efficiency SL UL RL RL Model DL Model
[120] AC
[179] Q-learning
[180] DQN FCNN
[181] DQN CNN
[182] DQN FCNN
[183] MARL
[184] DQN Ensembles of
FCNN
[185] FCNN
[186] FCNN

TABLE IX: Summary of AI approaches for task offloading in TNTN.


Reference Problem Insight Learning Approach DL Comments on Models
Tool
Energy Tx Delay Computational SL UL RL RL Model DL Model
Consumption Resources
[198] DDPG FCNN
[199] Model-Free FCNN
[200] Value Iteration, FCNN
DQN, DDQN
[200] Dueling DDQN FCNN
[201] MARL
[202] DQN based FCNN
MARL
[203] LSTM
[204] DQN FCNN
[205] Distributed
FCNN (For
solving
Optimization)
[206] Distributed
FCNN (For
solving
Optimization)

BSs, LEO satellites, and terrestrial cloud as shown in Figure 15 are some predefined models highly dependent on different
where the terrestrial BSs can offload the computational tasks to network states causing a large overhead in networks. Moreover,
LEO satellites and to terrestrial clouds via the LEO satellites. they usually converge to the solutions after a large number of
iterations causing high computational complexity.
The main challenges in task offloading problems lie in
meeting the delay constraints for low-latency applications To tackle these issues, different ML approaches are proposed
while minimizing the energy consumption for the satellites. in the literature to solve task offloading problems. In [199], a
So this can be formulated as an optimization problem to DRL-based task offloading framework dependent on channel
come up with an efficient offloading approach for integrated state information is proposed. A similar DRL-based framework
TNTN architecture. Such an optimization problem is solved is also considered in [214] with additional consideration of the
using different conventional approaches like 3D hypergraph dynamic queue condition in satellites. In [200], both DQN and
matching [209], game theory [210], stochastic approach [211], DDQN are explored to solve the task offloading problem in a
efficient algorithms [212] in the existing literature. In [213], a decentralized manner. DDPG algorithm is considered to solve
joint optimization framework comprising task offloading and the optimization problem in a DQN framework in [198] while
resource allocation is also considered in an integrated satellite- taking the potential security issues into account. An LSTM
terrestrial environment. Although these algorithms work well in model is used to solve the task offloading problem while
theory for particular scenarios, in real networks, the feasibility considering channel conditions and energy dynamics in [203].
of these algorithms is compromised due to different issues. A DL-based caching strategy is considered in satellite edge
Some of these works do not consider the cooperation among networks in [204]. As we have multiple satellites in the real
terrestrial cloud and LEO satellite servers which result in networks, to improve the overall system performances, different
sub-optimal approaches [209], [210]. Also, these approaches multi-agent architectures are considered both in a distributed

24
LEO Satellites

Offloading
Offloading

Offloading

Terrestrial BSs

Terrestrial UEs
Terrestrial Cloud

Fig. 15: Task offloading in satellite-terrestrial integrated networks.

[205] and cooperative environment [201], [202]. Distributed environment in [226]. However, the dynamics of satellite-
architectures for generating discrete offloading decisions in a terrestrial networks are very different from traditional terrestrial
supervised manner are also considered in [206]. In table IX, networks due to their highly dynamic network topology, link
we summarize the AI approaches for task offloading in TNTN. status, and traffic conditions. Depending on the instantaneous
As the delay constraints vary with network traffic types, the network topology, a static-dynamic combined routing scheme is
offloading decisions need to be derived taking network traffic considered in [227]. An ant-colony-based optimization (ACO)
types into account. Potential research works can show how framework is considered in [228]. In [229], a Kalman filter-
computational offloading can be done for various network based Wolf Colony Optimization algorithm is used to solve
traffics and show superior network performances. the local optimal solution issue in [228]. An improved ACO
framework is considered to find out the optimal set of links
G. Network Routing with multiple network constraints in [230]. A Coordinate
In wireless networks, depending on the traffic and channel Graph (CG) model-based network routing approach for three-
conditions, the network traffics are routed to different paths dimensional TNTN is considered in [231]. In [232], minimum
among different network nodes so that the overall network flow maximum residual path-based network flow algorithm
performance can be improved. In any network with static is used to find out the optimal network routing path for
channel and traffic conditions, this routing problem can be satellites. In [233], a 3-dimensional network mapping using
transformed into the well-known shortest path problem and hyperbolic geometry is considered for integrated satellite-
solved by Dijkstra’s algorithm [223]. Here the network nodes terrestrial networks.
can be considered as nodes in the graph and the edges can To cope with the dynamic environment in integrated satellite-
represent the links between different nodes. The weights of the terrestrial networks, different DL architectures are proposed
edges can be defined based on the target network performance in the literature. In [218], fuzzy logic is used to evaluate task
metrics like delay, jitter, throughput, packet loss, etc. However, requirements to improve the CNN output for optimal path
the topology of the real satellite-terrestrial integrated networks allocation. The network routing optimization problem can be
(shown in Figure 16) are very complex and dynamic due to put into an RL framework. In [215], a speed-up Q-learning
hierarchical network architecture and uncertain channel and algorithm is used to find out the optimal routing strategy for
traffic conditions, respectively. So simple Djikstra’s algorithm TNTN. A similar Q-learning-based RL framework is also
cannot be directly applied to meet the performance requirements considered to solve the routing problem for LEO satellites
in these networks. in [216]. To tackle the complexity issue, a DRL framework
In real satellite networks, Asynchronous Transfer Mode is used to generate optimal routing strategies in TNTN [222]
(ATM) routing is introduced in [224]. The well-known Open and LEO satellite networks [216]. FCNN [219], CNN [15],
Shortest Path First (OSPF) [225] protocol-based Internet etc. architectures are used to solve the routing problem in a
Protocol (IP) routing is also adapted to the dynamic satellite supervised manner. Other ML frameworks like GNN [221] and

25
Satellites

Space-Air
Segment

Satellite-
Ground Links Terrestrial
Segment
Ground
Ground Terminal
Terminal

UEs
UEs
Terrestrial BS

Satellite
Terrestrial BS
Terrestrial Gateway Terrestrial
Links Links
Terrestrial
Terrestrial
Gateway
Gateway

Fig. 16: Network routing in satellite-terrestrial integrated networks.

TABLE X: Summary of AI approaches for network routing in TNTN.


Reference Target Optimization Objective Learning Approach DL Comments on Models
Tool
Throughput Tx Error Rate SL UL RL RL Model DL Model
Delay/jitter
[215] Q-Learning
[216] Q-Learning
[217] Distributed Q-
Learning
[218] Fuzzy-CNN
[15] CNN
[219] FCNN
[220] ELM
[221] Graph Neural
Network (GNN)
+ FCNN
[222] DDPG LSTM

ELM [220] are also considered to solve the routing problem radio resources as per the demand of the users belonging to
in NTNs. In table X, we summarize the AI approaches for the slices. Slicing is useful for wireless networks as each
network routing in TNTN. Recursive NN architectures need to slice can share the same physical network infrastructures
be also explored for capturing the temporal behavior in network while receiving necessary radio resources for guaranteeing
routing decisions. Furthermore, the channels are extremely a minimum level of service to the users. Also, network slicing
dynamic and time-varying in the case of TNTNs; the channel provides the flexibility to switch users between slices with
conditions can be also considered in the learning criteria of different amounts of allocated resources responding to changes
RL frameworks. in traffic conditions. Integrated TNTN networks appear to be an
excellent candidate for applying the concept of network slicing
H. Network Slicing
due to their diversified traffic patterns. In fact, different use
Network slicing refers to the process of virtually partitioning cases like mMTC and eMBB applications can be extensively
the physical network into different network slices corresponding benefited through the network slicing in these networks. In
to different service requirements. The slices are allocated with

26
Figure 17, a simple network slicing scenario is shown. Here solved by the DDPG algorithm. Here both the actor and critic
the network consists of a satellite and a terrestrial BS which networks are FCNNs consisting of four layers. Distributed
form 3 slices in a combined manner. Slice 1 is for high-priority learning architectures can be potentially explored in future
users, they share network resources from the satellite and the works for real network implementations.
terrestrial BS (depicted by green links). Slice 2 is for users with
I. Channel Estimation
low latency requirements, the terrestrial BS provides resources
to the users (shown by red links). Slice 3 is for the remote Channel estimation is an important aspect of NTNs for both
users who can only be served by the satellite (shown by blue comprehensive network planning and interference management
links). like other wireless networks. It refers to the method of
estimating the effects of the channel of the transmitted signal
in a wireless environment. Typically the channel effect is
embedded in an information block, called CSI in modern
Satellite
communication systems. There are some conventional ways
of estimating CSI like MMSE or LS method, but they are
computationally expensive and not always suitable for real
networks. It is also very challenging to get timely CSI
information in NTN due to long propagation delays and fast-
changing propagation environments.
Therefore, ML-based methods are increasingly being adopted
by the research community and vendors, as a promising
alternative for channel prediction. This channel estimation
can potentially be turned into an SL problem by considering
channel features such as distance, time delay, received power,
azimuth AoA and Departure (AoD), elevation angle, Root
Terrestrial 
Mean Square (RMS) Delay Spread, and frequency as inputs
BS and CSI as output labels. In [240], the reciprocity property of
the downlink and uplink channels in Time Division Duplexing
Slice 3 Slice 2 (TDD) systems is considered. So the downlink channel is
estimated from uplink CSI using an LSTM-based DL model.
Slice 1
In [241], CSI is estimated from historical CSI data using a
CNN-LSTM model. However, as channel estimation is a near-
real-time process, low-complexity NNs such as ESNs need to
Fig. 17: Network slicing in satellite-terrestrial integrated be explored for realistic implementations.
networks. J. Other Research Thrusts
The other research thrusts to address NTN challenges using
In a general network slicing framework, a composite
AI techniques can be in traffic prediction and steering [242]–
utility function consisting of different network performance
[246], security and jamming issues [247]–[256], etc.
characteristics like average throughput and other costs like
slice reconfiguration cost, resource reservation cost, etc. is Key Takeaways: As evidenced by the above discussion, various
formulated as an objective function that needs to be minimized. RL techniques are used to examine network optimization
The constraints are generally the minimum service level problems such as handover, beam, and resource allocation,
requirements depending on the type of services for particular task offloading, network routing, and network slicing, while
slices. To ensure real-time implementation, simple heuristic SL techniques are employed to tackle estimation problems,
approaches are tested on real platforms [234]–[236]. However, such as Doppler shift, channel state, and spectrum sensing.
these approaches do not guarantee optimal performance UL techniques have not been extensively covered in the
guarantee. To tackle this issue, different AI-based approaches literature due to the ambiguity and difficulty of applying them
seem promising as discussed in [237]–[239]. in real networks. To further illustrate the interrelations between
In [237], a general Radio Access Network (RAN) slicing different NTN challenges and AI techniques, we present Figure
problem is considered where the objective function is a 18.
weighted function of bandwidth and spectrum consumption
satisfying QoS and inter-slice isolation constraints. In a simple V. AI-NTN I NTEGRATION : C URRENT S TATUS
2-slice satellite-terrestrial integrated network, different DL In the previous section, we discuss how AI can be beneficial
architectures are tested. In [238], an ML approach similar to the for us in resolving potential NTN issues for the next-generation
meta-heuristic ACO approach is considered to realize network 6G networks. In this section, how AI can be applied to real
slicing in a TNTN environment. An air-ground integrated systems to resolve the challenges associated with NTNs. We
network is considered in a DRL framework in [239], later begin our discussion by discussing the current ML testbeds for

27
Handover

Channel Estimation
Beam and Resource
Allocation
Doppler Shift NTN
Estimation Challenges Network Routing

Spectrum Sharing / Network Routing


Sensing

Task Offloading

Supervised Reinforcement
Learning ML Learning

Fig. 18: Relationship diagram different AI techniques and NTN challenges.

satellites. Then we discuss how AI can be potentially applied to the heterogeneity and variability of the networks and meet the
future 6G networks by utilizing the RAN Intelligent Controller strict application requirements [264]. Even though there has
(RIC) embedded in the Open Radio Access Network (O-RAN) been a significant amount of research on addressing different
framework [257] to overcome the inflexibility of the monolithic issues in 5G cellular networks, an open interface for the
cellular networks. Finally, we provide a discussion on current deployment of AI algorithms is required. O-RAN offers a
research efforts toward realizing the Software Defined Radio general framework for the deployment of AI algorithms in 5G-
(SDR) based 5G-NTN platform development in the O-RAN Advanced networks [265]. It achieves this by facilitating an
framework. open interface that enables the exchange of network KPIs and
control information between the RAN and the AI controller.
A. ML Testbeds for Satellite Networks This integration allows for the implementation of a closed-loop
MultI-layer awaRe SDN-based testbed for SAtellite- control framework for the RAN using different AI approaches
Terrestrial networks (MIRSAT) testbed [258] provides a [266], [267]. As a potential integral part of future 6G networks,
Software Defined Network (SDN) based experimentation NTNs are expected to be deployed in the O-RAN framework
platform for testing network slicing algorithms on NGSO to leverage AI capabilities effectively.
constellations. The European Space Agency (ESA) has In 5G, a base station, namely gNB has multiple functional
numerous completed projects focusing on the applicability splits, namely:
of AI techniques in satellite networks such as MLSAT [259]
1) Central Unit (CU): responsible for higher layers such as
and SATAI [260]. There are also several other ongoing projects
non-real-time link and network layer functionalities.
of ESA focusing on AI-satellite issues like AI integrated 5G-
2) Distributed Unit (DU): responsible for lower layer such as
Satellite testbed [261], AI-based interference detection [262],
near real-time link and upper PHY layer functionalities.
AI-based signal processing [263], etc. All these testbeds show
3) Radio Unit (RU): responsible for low PHY layer
promises for AI to be an integral part of future satellite-
functionalities.
terrestrial integrated networks.
A new central controller entity called RIC is introduced in the
B. AI-NTN Integration through O-RAN-based RIC O-RAN architecture which can provide network monitoring
Traditional 5G networks with little or no reconfiguration and control functionalities in near real-time and non-real-time
capabilities suffer from a wide variety of challenges to satisfy through external and internal applications, called xApps and

28
Open
A1 R1
Applying AI API Near real-time Non-real-time Applying AI
using xApps RIC RIC using rApps

E2 O1

Uu FH F1 NG Next
UE RU DU CU Generation
Core

Ground/ On
Ground On the air
the air
deployment deployment
deployment

Fig. 19: Various architectural deployments for NTN in the O-RAN framework for regenerative payload.

rApps respectively, for the purpose of network optimization. C. Current Research Efforts for AI-NTN Integration through
Evidently, these xApps and rApps can provide us with an O-RAN-based RIC
effective way of deploying AI algorithms extracting network To facilitate the integration of NTN into 6G networks,
KPIs, and sending control commands for optimizing network there have been already some technical advancements and
performance. experimental research works towards developing real prototypes
To realize the NTN architecture in the O-RAN framework, for testing and evaluation of proof-of-concept methods.
the satellites can be used for either transparent or regenerative OpenAirInterface (OAI) is an open-source 3GPP compliant
payloads as discussed in Section II-C. In the case of SDR-based protocol stacks that are widely used across the
regenerative payloads, where the NTN platforms work as research community for experimentation with 5G networks
BSs, there are multiple options for potential O-RAN-based [268]. As specified in the O-RAN framework, OAI protocol
NTN deployment. There can be three different architectural stacks adopt the notion of RIC by enabling service-oriented
deployments for NTN gNBs in the regenerative architecture: controllers using an efficient Software Development Kit, called
1) RU in the space/air, CU and DU on the ground, Flexible RIC (FlexRIC) [269]. This RIC provides an interface
2) Both RU and DU in the space/air, CU on the ground, for applying AI algorithms in order to optimize the network
3) CU, DU, and RU in the space/air. performances through xApps as discussed before. This enables
The non-real-time which does not need to consider latency us to perform experiments for testing diverse AI approaches
requirements, is expected to be deployed on the ground for optimizing the performances of real 5G networks.
considering power, onboard capability, and mobility constraints. OAI has been adopted for developing experimental
However, the near real-time RIC needs to be close to DUs to prototypes with 5G-NTN adaptations due to its efficient and
provide near real-time control functionalities which provide flexible design and structure [270]. Currently, there are several
two different options for its deployments with corresponding research projects on 5G-NTN that are leveraging OAI protocol
pros and cons. The near real-time RIC should be on the ground stacks to perform experiments with NTN adaptations for both
when only RU is in the air, whereas it should be also in the air in-lab validation and over-the-satellite testing. 5G AgiLe and
in the other two cases. There is a clear trade-off between the flexible integration of SaTellite And cellulaR (5G-ALLSTAR)
latency and power, mobility, and onboard capability constraints. [271] and 5G New Radio EMUlation over SATellite (5G-
If the near-real-time RIC is in the air, the latency for control EmuSat) [272] project developed a 5G-NTN platform with
commands will be low, whereas the cost will be high for necessary PHY and MAC layer 5G-NR adaptations on top of
hosting it in the air. In Figure 19, the potential framework OAI 5G protocol stacks and a satellite-channel emulator for
for AI-Enabled NTN deployment in O-RAN framework as in-lab validation. 5G-EmuSat even has also demonstrated its
specified in [257] is illustrated. Depending on the deployment over-the-satellite capability by having direct access to a UE
scenarios of the near-real-time RIC, the xApps can be also using a satellite channel. 5G Space Communications lab also
deployed also in the air or on the ground, and so do the AI has performed in-lab validation experiments extending OAI-
control algorithms. 4G protocol stacks for NTN along with ISL implementation

29
using SDR [273]. Two current ongoing projects focusing on in the feedback. For NTNs, as we know from Section II-D, the
GEO and LEO satellites, named 5G-GOA [274] and 5G-LEO propagation delay is extremely high due to the long distance
[275] respectively, are currently working on implementing between the transmitter and the receiver. Thus the feedback
necessary 3GPP NTN adaptations extending from 5G protocol exchange time intervals are quite high compared to terrestrial
stacks of OAI. Even though current implementations are mostly environments which hampers the online training approach
for demonstration purposes, integration of NTN into OAI 5G greatly. Furthermore, to adapt to the highly time-varying
protocol stacks paves the way for deploying AI algorithms environment, AI models usually need to send an appropriate
through xApps in the future. chain of control commands to the network components. Due to
high propagation delay, these actions may get outdated resulting
Key Takeaways: Currently, there are some deployed ML testbeds in degraded network performances.
specifically designed for satellite networks. Moreover, O-RAN
is envisioned to unleash the great potential of AI in enabling C. Additional Communication Overheads
the future 6G networks via satellite-based NTNs by addressing On top of long propagation delay, non-terrestrial platforms
various challenges associated with it. Nevertheless, both O-RAN also have limited bandwidth due to the scarcity of spectrum
and NTN standardization aspects are still in the development resources and ensure no additional interference to the licensed
process, and different SDR-based 5G protocol stacks, such as services. As the generic RL frameworks depend on the feedback
OAI, are being incorporated with NTN adaptations. received from the environment, the additional overhead
VI. AI-NTN I NTEGRATION : C HALLENGES introduced by the network parameters results in undesirable
network resource consumption. Even though CSI feedback in
NTNs come with an intrinsic set of challenges when it comes
5G networks contains a set of network parameters, this may not
to deploying AI models. Even though there is a significant
be enough for all different network problems. This additional
decrease in the launching and maintenance cost of various
communication overhead puts an additional burden on the
NTN platforms, especially satellites, cost optimization is still
limited spectrum of resources allocated for the non-terrestrial
one of the major limiting factors of realizing NTNs for 6G
platforms.
communication on a large scale. With that being the case, these
platforms have limited power, spectrum, and computational D. Security Aspects
resources which limits the performance of the AI models. The
Applying AI in NTNs introduces more vulnerability to
unavoidable long propagation delay along with the complex
various security attacks by introducing new attack surfaces
and time-varying nature of the NTN environment introduces
and less transparency. Adversarial attacks, data poisoning, and
additional challenges for AI models to be trained and deployed
model evasion involving the manipulation of input data to
in real-time. In this section, we discuss these open research
AI models can cause degradation in network performance
issues to get an insight into designing an efficient AI-based
and reliability. Denial-of-Service (DoS) attacks can cause
non-terrestrial system with robust and superior performance.
interruptions in crucial network operations by overwhelming
A. Limited Onboard Capability the network with too much resource consumption. While
The performance of AI algorithms highly depends on carrying the control information between the AI controller
the availability of the computational resources and data and the network, an attacker can intercept and possibly modify
processing capabilities of the AI hardware blocks. Most non- this information, known as Man-in-the-Middle (MiM) attack,
terrestrial platforms such as satellites have a very limited resulting in a degradation in network performances. Therefore,
amount of computational resources due to cost optimization. the AI models need to be designed in such a way that they can
More computational capability typically means more power detect and mitigate these attacks while maintaining the overall
consumption as well as more physical space which increases network performance.
the overall maintenance cost for the non-terrestrial platforms. A
E. Environmental Conditions
lot more advancements in miniaturization and power efficiency
are necessary to ensure the adapt AI models and algorithms into NTN platforms, especially satellites are generally deployed
non-terrestrial platforms with adequate onboard capabilities. in pretty hostile environments with extreme radiation, extreme
temperatures, and other extreme environmental conditions. The
B. Aging of Information computational hardware for AI models is very susceptible
The long propagation delay is a great challenge in the to radiation as they are built on customized circuitry. High
way of AI-based NTN deployment. Online RL frameworks radiation can compromise the performance of the AI models
are very promising for solving different NTN challenges due by introducing errors in their hosting hardware platforms. Also,
to their inherent capability of adapting to fast time-varying in space, these circuitry elements need to withstand extreme
environments as we show in the previous sections. However, temperatures for a long period of time. To ensure the proper
the performance of these algorithms is highly dependent on performance of these models, the hosting hardware components
the feedback received from the environment. As the network are required to be more advanced and have rigorous testing to
changes very rapidly, the feedback has to be real-time or near ensure extreme environmental tolerance, which increases the
real-time to ensure the integrity of the information embedded cost of the satellite operation.

30
F. Scalability Issue of these parameters for any given problem to provide the
RL frameworks can be naturally applied to address a variety best performance. In fact, tuning these parameters to provide
of NTN problems with control objectives as we discuss in satisfactory performance for a particular problem is not any
the previous section. However, for large-scale 6G satellite- straightforward process, but rather dependent on empirical
terrestrial integrated wireless networks, the complex network speculations. This means the training process is not a one-time
topology entails high dimensional state and action spaces that event, but rather a trial-and-error process that involves multiple
can lead to high computational complexity for RL models attempts to determine the most suitable parameters. Moreover,
[276]. In the case of MARL frameworks, the state space grows depending on the nature of the problem, it can be challenging
exponentially with the increase in the number of agents [277], to determine possible candidates for the parameters to begin
making this approach infeasible for large-scale real networks. with. This results in a very uncertain and time-consuming
Although DRL approaches can be helpful in reducing the state training process. For NTNs, these issues are more severe due
space [278], more research is needed to effectively address to their high network complexity resulting in a more complex
this challenge in order to successfully deploy RL approaches set of hyperparameters.
in TNTNs.
J. Lack of Generalization
G. Lack of Convergence
An important challenge to be addressed when applying the As data-driven ML approaches are used to train ML
distributed RL framework in real networks for solving various algorithms, it can be difficult to generalize these algorithms
important NTN challenges like handover optimization is to deal to different scenarios. Trained models are able to capture the
with its uncertainty in convergence [103]. In this framework, characteristics of the training data, but this does not always
multiple agents try to optimize their goals based on the rewards guarantee successful performance with test data due to the
received from the environment. In a competitive environment, varying nature of NTNs. A model trained for a specific scenario
when all the agents are attempting to maximize their long-term may not be successful in another, and may not be able to
returns, they may take conflicting actions, resulting in a non- adapt to different NTN scenarios. Even if the model has not
stationary environment with no convergence to an optimum encountered certain scenario features during training, it is
state [277]. As a result, no optimum policy can be obtained for desirable to have a model that is generalizable and performs
the system as a whole. As highlighted in [138], this convergence well in any context. Developing such models is one of the
issue has limited the number of UEs (agents) that can be biggest challenges of the NTN domain due to their high network
considered in the simulation environment, thus hindering the complexity. As there are no theoretical performance bounds for
potential of this approach. these empirical ML models, unpredictable performance drops
can occur while deploying in the real environment.
H. Scarcity of Quality Data
All ML approaches are data-driven, so the availability of Key Takeaways: The cost-limited on-board computation, highly
suitable training data is of paramount importance for the dynamic environmental conditions, and long propagation delay
improved performance of these methods. However, in satellite- introduce a diverse set of challenges to realize the AI-enabled
terrestrial integrated networks, the generation of quality data NTN environment for future 6G networks. These challenges
can be sometimes very costly and inefficient, even impossible need to be addressed with efficient solutions to ensure superior
at times due to spectrum and intermittent connectivity network performances in real NTN deployments.
constraints. Due to this inherent data generation issue, applying
different ML approaches can get extremely challenging. VII. I NSIGHTS AND R ECOMMENDATIONS
Additionally, the data distribution and characteristics in non-
terrestrial environments may differ significantly from terrestrial In this comprehensive study, we delve into the realm of
environments, requiring careful consideration during model NTNs and their relationship with AI techniques, establishing a
training and adaptation. As a result, the training procedure can solid background for our exploration. We explore the synergy
be greatly hampered resulting in performance degradation of between NTNs and AI, highlighting how these two domains
these approaches in real networks. intersect and complement each other. Moving forward, we shift
our focus to the current research thrusts in the field, examining
I. Complicated Hyperparameter Settings ongoing efforts to bring these concepts to fruition in real-
The complexity of satellite-terrestrial networks, such as their world networks. While highlighting these advancements, we
topology and time-varying nature, can make traditional ML also address the potential challenges that must be overcome to
approaches less effective. As a result, DL methods have become realize the full potential of NTNs in the context of future 6G
increasingly popular due to their powerful feature extraction networks. Within this section, we provide an in-depth discussion
capabilities through NNs. The performance of any NN is reliant of valuable insights and recommendations for leveraging
on the hyperparameter settings, such as the number of layers, various AI techniques in the context of satellite-based NTNs.
activation functions, number of neurons in a layer, and learning These insights aim to guide and inform the implementation
rate. However, there is no way of deriving an optimal set and integration of AI in shaping the future of 6G networks.

31
A. Interrelated Issues behavior inherent in these NTN problems. Unlike feed-forward
Section IV sheds light on the interconnected nature of the networks, recurrent architectures possess the ability to capture
various issues encountered in NTNs. It is crucial to recognize and process temporal dependencies within the problem domain.
that addressing a singular problem can serve as an initial step By leveraging RNNs or other similar architectures, we can
toward resolving larger, more complex challenges inherent effectively model and solve the corresponding NTN challenges
in TNTNs. However, when transitioning these solutions into in a more comprehensive and accurate manner.
real-world networks, it becomes imperative to acknowledge
and account for the intricate interdependencies among various E. Online Implementation
aspects. An illustrative example of such interrelations lies in
the dynamic nature of network load status following a user’s One major limitation of the current works in the domain
attachment to a satellite. In this scenario, integrating resource is the limited consideration given to online implementation
allocation strategies into the handover decisions can yield and the associated computational complexity when designing
enhanced network performance. By considering the broader algorithms for various control operations in NTNs. This
context and understanding how different aspects influence one oversight poses a significant hurdle to the practical application
another, we can develop more holistic and effective approaches of these algorithms in real NTNs as many control decisions
for real-world NTN implementations. in NTN systems must be made in real time, and the use
of complex deep feed-forward NNs becomes impractical.
B. General Learning Approaches To address this challenge, exploring alternative options
Upon examining the insights presented in section IV, it becomes imperative. One such option involves investigating
becomes evident that SL and RL approaches take center stage low-complexity architectures such as ESNs and ELMs or
in addressing the diverse array of challenges faced by satellite- combining them with traditional feedforward-NNs. These low-
based NTNs in future 6G networks, primarily due to the complexity architectures offer a more viable solution for online
availability of real-world data and feedback mechanisms within implementation, enabling the deployment of DL algorithms in
existing networks. The nature of UL approaches presents a real NTN networks in a timely and efficient manner.
unique set of challenges in the context of highly dynamic
and time-varying NTNs. Understanding and capturing the F. Distributed Learning Models
intrinsic behavioral patterns within such networks prove to
be particularly hard. However, it is important to note that UL In the context of integrated satellite-terrestrial networks,
approaches still hold the potential in deriving the distribution of the adoption of distributed learning models can significantly
crucial network parameters that may not be readily accessible enhance scalability. These models involve distributing the
in real networks. These derived parameters can play a pivotal training and inference processes of machine learning algorithms
role in addressing various NTN challenges. across multiple computing nodes, resulting in accelerated
computation and improved efficiency. Various distributed
C. Control Feedback Design approaches, such as data parallelism, model parallelism,
One of the major motivating factors for implementing ensemble learning, and federated learning, offer promising
feedback-based learning, such as RL methods, in NTNs, is solutions to address the diverse challenges faced by NTNs
the inherent feedback system of the current cellular networks. in extended network environments [279]. By leveraging these
CSI information is readily available for the BSs which can distributed approaches, NTN systems can effectively harness
be helpful in network optimization approaches. However, with the power of parallel computing and collaborative learning to
the emergence of NTNs, new challenges arise, necessitating overcome constraints and achieve optimal performance.
the efficient design of feedback mechanisms to minimize
the overall overhead while improving network performance. G. Enabling O-RAN-based RIC
This consideration is crucial, as AI approaches for addressing
various issues may require similar types of feedback. The Currently, there are some ongoing research efforts focused on
utilization of combined feedback can prove highly beneficial developing SDR-based prototypes for NTNs with adaptations
in optimizing network performance and achieving efficient to OAI 4G and 5G protocol stacks, as discussed in Section
resource allocation, thus enhancing the overall effectiveness of V-C. However, to fully unlock the potential of AI in NTN for
AI algorithms in NTNs. future 6G networks, the integration of the RIC into these
implementations is crucial. This integration is particularly
D. Recurrent Learning Architectures important given that the immense benefits of AI in addressing
Presently, the majority of DL algorithms deployed to NTN deployment challenges for future 6G networks are
tackle the time-varying nature of NTNs in beam-hopping, demonstrated in section IV but current 5G networks lack a
resource allocation, network slicing, etc. rely on feed-forward dedicated interface for applying AI algorithms. By enabling the
NNs. While these architectures have proven successful in O-RAN framework with RIC, the deployment of AI algorithms
computer vision applications such as image detection and in real NTN networks can be efficiently performed, paving the
classification, they may not effectively capture the temporal way for advanced capabilities and improved performance.

32
H. Development in Miniaturization VIII. C ONCLUSION
The limited availability of computational resources currently NTN is considered the driver of ubiquitous, reliable, and
poses a challenge to the onboard capability of satellites, scalable 6G wireless networks. It adds new dimensions to
especially when deploying AI algorithms. However, the the existing traditional terrestrial communication systems by
miniaturization of satellite components and equipment has providing connections to remote and isolated areas subject
emerged as a solution to this issue. By reducing the weight to geographical constraints and offloading the primary links
and size of equipment, miniaturization enables the integration during traffic peaks. However, diverse unique challenges
of more powerful processors and larger memory devices within are accompanied by the deployment of NTN in existing
the limited space available on satellites. This advancement in communication systems. The long propagation delay, high
computational resources greatly facilitates the deployment of AI Doppler effect, spectrum sharing, complicated resource
algorithms, unlocking new possibilities for satellite applications. allocation, and fast and frequent handover are the major
Achieving miniaturization in satellite technology requires problems associated with NTN deployment. Integration to
innovations in material science, efficient Integrated Circuit (IC) existing terrestrial networks presents a set of new problems
design, advancements in IC fabrication technologies, System- such as task offloading, network routing, network slicing, etc.
on-Chip (SoC) integration, and Micro-Electro-Mechanical to be addressed in an efficient manner. The convergence of
Systems (MEMS) design, among others. The development of AI and NTN allows for the building of sustainable AI-based
miniaturization is particularly crucial for NTNs, as it enhances Non-Terrestrial Networks addressing many of these challenges.
the onboard capability of satellites and enables the realization Depending on the characteristics of the problem at hand, various
of advanced technologies and functionalities in space-based learning approaches can be employed. When dealing with
systems. prediction and estimation problems, SL techniques appear to
be a more suitable choice. On the other hand, for tasks involving
closed-loop control, RL techniques show greater promise. By
I. Energy Efficiency tailoring the learning approach to the specific problem, we
The launching and maintenance of satellites require can effectively leverage the strengths of each technique and
substantial power consumption, which imposes limitations on achieve optimal results.
the onboard capability of satellites. Consequently, efficient However, the integration of AI into NTNs presents
energy system design becomes a critical criterion for NTNs. certain challenges that need to be addressed. Both the
To address this, various aspects need to be considered, industry and research community are collaborating to ensure
including lightweight component design, advanced power the successful implementation of AI-based NTNs in next-
management techniques, efficient power conversion, optimized generation wireless networks. This includes the establishment
propulsion system design, effective energy storage systems, of ML testbeds specifically designed for satellite networks and
etc. By focusing on these factors, satellite systems can achieve the adaptation of SDR-based OAI 4G/5G protocol stacks for
higher energy efficiency, which is essential for the successful NTN applications. In order to realize satellite-based NTNs
deployment of advanced AI algorithms. The performance of in future 6G networks, several practical challenges must be
these algorithms relies on the availability of computational overcome. These challenges include addressing the constraints
resources, making energy efficiency a crucial aspect to of cost-limited onboard capabilities, managing the highly
maximize the satellite’s capabilities within the given power time-varying nature of satellite networks, and mitigating the
constraints. effects of long propagation delays. It is important to consider
these interconnected issues and develop joint solutions to
enhance overall network performance. Furthermore, exploring
J. Secured System Design low-complexity and distributed learning architectures that
incorporate efficient control feedback mechanisms is essential
As highlighted in section VI, security concerns in NTNs
for enabling real-time, online implementation. Additionally,
can be highly significant, introducing new attack vectors and
ensuring the secure, compact, and energy-efficient design of
vulnerabilities. NTNs are susceptible to a range of security
NTN platforms is integral to the successful deployment of
attacks, including adversarial attacks, data poisoning, DoS
satellite-based NTNs in the 6G era.
attacks, Fuzzy attacks, MiM attacks, and more. These attacks
have the potential to severely impact network performance and
R EFERENCES
compromise the integrity and confidentiality of data. To address
these challenges, it is essential to design efficient intrusion [1] I. Rahman, S. M. Razavi, O. Liberg, C. Hoymann, H. Wiemann,
detection and prevention systems specifically tailored for secure C. Tidestav, P. Schliwa-Bertling, P. Persson, and D. Gerstenberger,
“5G evolution toward 5G Advanced: An overview of 3GPP releases 17
NTNs. By continuously monitoring a set of relevant network and 18,” Ericsson Technol. Rev., vol. 2021, no. 14, pp. 2–12, 2021.
parameters and detecting anomalies in the network’s behavioral [2] X. Lin, “An overview of 5G Advanced evolution in 3GPP release 18,”
patterns, mitigation techniques can be promptly deployed to IEEE Commun. Stand. Mag., vol. 6, no. 3, pp. 77–83, 2022.
[3] J. Pang, S. Wang, Z. Tang, Y. Qin, X. Tao, X. You, and J. Zhu, “A new
ensure optimal network performance and safeguard against 5G radio evolution towards 5G-Advanced,” Science China Information
potential degradation caused by security breaches. Sciences, vol. 65, no. 9, p. 191301, 2022.

33
[4] W. Jiang, B. Han, M. A. Habibi, and H. D. Schotten, “The road towards [25] Y. Lu, “Artificial intelligence: a survey on evolution, models, applications
6G: A comprehensive survey,” IEEE open j. Commun. Soc., vol. 2, pp. and future trends,” J. Manag. Anal., vol. 6, no. 1, pp. 1–29, 2019.
334–366, 2021. [Online]. Available: https://doi.org/10.1080/23270012.2019.1570365
[5] W. Chen, X. Lin, J. Lee, A. Toskala, S. Sun, C. F. Chiasserini, and [26] J. G. Carbonell, R. S. Michalski, and T. M. Mitchell, “1 - an
L. Liu, “5G-Advanced towards 6G: Past, present, and future,” 2023. overview of machine learning,” in Mach. Learn., R. S. Michalski,
[6] “6G: The next horizon,” Huawei, Nov. 2021, white paper. [Online]. J. G. Carbonell, and T. M. Mitchell, Eds. San Francisco (CA):
Available: https://www.huawei.com/en/huaweitech/future-technologies/ Morgan Kaufmann, 1983, pp. 3–23. [Online]. Available: https:
6g-the-next-horizon //www.sciencedirect.com/science/article/pii/B9780080510545500054
[7] “6G – connecting a cyber-physical world,” Ericsson, Feb. 2022, [27] S. Dong, P. Wang, and K. Abbas, “A survey on deep learning
white paper. [Online]. Available: https://www.ericsson.com/en/ and its applications,” Comput. Sci. Rev., vol. 40, p. 100379, 2021.
reports-and-papers/white-papers/a-research-outlook-towards-6g [Online]. Available: https://www.sciencedirect.com/science/article/pii/
[8] “Samsung 6G white paper: The next hyper-connected experience S1574013721000198
for all,” Samsung, 2020, white paper. [Online]. Available: https: [28] C.-X. Wang, M. D. Renzo, S. Stanczak, S. Wang, and E. G. Larsson,
//research.samsung.com/next-generation-communications “Artificial intelligence enabled wireless networking for 5G and beyond:
[9] M. Z. Chowdhury, M. Shahjalal, S. Ahmed, and Y. M. Jang, “6G wireless Recent advances and future challenges,” IEEE Wireless Commun. Mag.,
communication systems: Applications, requirements, technologies, vol. 27, no. 1, pp. 16–23, 2020.
challenges, and research directions,” IEEE open j. Commun. Soc., vol. 1, [29] H. Al-Hraishawi, H. Chougrani, S. Kisseleff, E. Lagunas, and
pp. 957–975, 2020. S. Chatzinotas, “A survey on nongeostationary satellite systems: The
[10] P. Yang, Y. Xiao, M. Xiao, and S. Li, “6G wireless communications: communication perspective,” IEEE Commun. Surveys Tuts., vol. 25,
Vision and potential techniques,” IEEE Netw., vol. 33, no. 4, pp. 70–75, no. 1, pp. 101–132, 2023.
2019. [30] X. Lin, S. Rommer, S. Euler, E. A. Yavuz, and R. S. Karlsson, “5G
[11] Z. Zhang, Y. Xiao, Z. Ma, M. Xiao, Z. Ding, X. Lei, G. K. Karagiannidis, from Space: An Overview of 3GPP Non-Terrestrial Networks,” IEEE
and P. Fan, “6G wireless networks: Vision, requirements, architecture, Commun. Stand. Mag., vol. 5, no. 4, pp. 147–153, 2021.
and key technologies,” IEEE Veh. Technol. Mag., vol. 14, no. 3, pp. [31] G. Araniti, A. Iera, S. Pizzi, and F. Rinaldi, “Toward 6G non-terrestrial
28–41, 2019. networks,” IEEE Netw., vol. 36, no. 1, pp. 113–120, 2022.
[12] X. Lin, S. Cioni, G. Charbit, N. Chuberre, S. Hellsten, and J.-F. [32] F. Rinaldi, H.-L. Maattanen, J. Torsner, S. Pizzi, S. Andreev, A. Iera,
Boutillon, “On the path to 6G: Embracing the next wave of low earth Y. Koucheryavy, and G. Araniti, “Non-terrestrial networks in 5G &
orbit satellite access,” IEEE Commun. Mag., vol. 59, no. 12, pp. 36–42, beyond: A survey,” IEEE access, vol. 8, pp. 165 178–165 200, 2020.
2021. [33] G. Geraci, D. Lopez-Perez, M. Benzaghta, and S. Chatzinotas,
[13] M. Giordani and M. Zorzi, “Non-Terrestrial Communication in the 6G “Integrating terrestrial and non-terrestrial networks: 3D opportunities
Era: Challenges and Opportunities,” CoRR, vol. abs/1912.10226, 2019. and challenges,” IEEE Commun. Mag., pp. 1–7, 2022.
[Online]. Available: http://arxiv.org/abs/1912.10226 [34] A. Vanelli-Coralli, A. Guidotti, T. Foggi, G. Colavolpe, and G. Montorsi,
[14] M. Bacco, F. Davoli, G. Giambene, A. Gotta, M. Luglio, M. Marchese, “5G and beyond 5G non-terrestrial networks: trends and research
F. Patrone, and C. Roseti, “Networking challenges for non-terrestrial challenges,” in 2020 IEEE 3rd 5G World Forum (5GWF), 2020, pp.
networks exploitation in 5G,” in 2019 IEEE 2nd 5G World Forum 163–169.
(5GWF), 2019, pp. 623–628.
[35] S. Zhang, D. Zhu, and Y. Wang, “A survey on space-aerial-terrestrial
[15] N. Kato, Z. M. Fadlullah, F. Tang, B. Mao, S. Tani, A. Okamura, and
integrated 5G networks,” Comput. Netw., vol. 174, p. 107212, 2020.
J. Liu, “Optimizing space-air-ground integrated networks by artificial
[Online]. Available: https://www.sciencedirect.com/science/article/pii/
intelligence,” IEEE Wireless Commun. Mag., vol. 26, no. 4, pp. 140–147,
S1389128619314045
2019.
[36] X. Zhu and C. Jiang, “Integrated satellite-terrestrial networks toward
[16] B. A. Homssi, K. Dakic, K. Wang, T. Alpcan, B. Allen, S. Kandeepan,
6G: Architectures, applications, and challenges,” IEEE Internet Things
A. Al-Hourani, and W. Saad, “Artificial intelligence techniques for next-
J., vol. 9, no. 1, pp. 437–461, 2021.
generation mega satellite networks,” arXiv preprint arXiv:2207.00414,
2022. [37] A. Feriani and E. Hossain, “Single and multi-agent deep reinforcement
learning for AI-enabled wireless networks: A tutorial,” IEEE Commun.
[17] E. T. Michailidis, S. M. Potirakis, and A. G. Kanatas, “AI-inspired
Surveys Tuts., vol. 23, no. 2, pp. 1226–1252, 2021.
non-terrestrial networks for IIoT: Review on enabling technologies and
applications,” IoT, vol. 1, no. 1, p. 3, 2020. [38] M. Elsayed and M. Erol-Kantarci, “AI-enabled future wireless networks:
[18] M. M. Azari, S. Solanki, S. Chatzinotas, O. Kodheli, H. Sallouha, Challenges, opportunities, and open issues,” IEEE Veh. Technol. Mag.,
A. Colpaert, J. F. Mendoza Montoya, S. Pollin, A. Haqiqatnejad, vol. 14, no. 3, pp. 70–77, 2019.
A. Mostaani, E. Lagunas, and B. Ottersten, “Evolution of non-terrestrial [39] D. C. Nguyen, P. Cheng, M. Ding, D. Lopez-Perez, P. N. Pathirana, J. Li,
networks from 5G to 6G: A survey,” IEEE Commun. Surveys Tuts., A. Seneviratne, Y. Li, and H. V. Poor, “Enabling AI in future wireless
vol. 24, no. 4, pp. 2633–2672, 2022. networks: A data life cycle perspective,” IEEE Commun. Surveys Tuts.,
[19] O. Kodheli, E. Lagunas, N. Maturo, S. K. Sharma, B. Shankar, J. F. M. vol. 23, no. 1, pp. 553–595, 2020.
Montoya, J. C. M. Duncan, D. Spano, S. Chatzinotas, S. Kisseleff, [40] C. Li, Z. Cao, and Y. Liu, “Deep AI enabled ubiquitous wireless sensing:
J. Querol, L. Lei, T. X. Vu, and G. Goussetis, “Satellite communications A survey,” ACM Comput. Surv. (CSUR), vol. 54, no. 2, pp. 1–35, 2021.
in the new space era: A survey and future challenges,” IEEE Commun. [41] K. B. Letaief, W. Chen, Y. Shi, J. Zhang, and Y.-J. A. Zhang, “The
Surveys Tuts., vol. 23, no. 1, pp. 70–109, 2021. roadmap to 6G: AI empowered wireless networks,” IEEE Commun.
[20] F. Fourati and M.-S. Alouini, “Artificial intelligence for satellite Mag., vol. 57, no. 8, pp. 84–90, 2019.
communication: A review,” Intell. Converged Networks, vol. 2, no. 3, [42] R. Shafin, L. Liu, V. Chandrasekhar, H. Chen, J. Reed, and J. C.
pp. 213–243, 2021. Zhang, “Artificial intelligence-enabled cellular networks: A critical path
[21] S. Saafi, O. Vikhrova, G. Fodor, J. Hosek, and S. Andreev, “AI-aided to beyond-5G and 6G,” IEEE Wireless Commun. Mag., vol. 27, no. 2,
integrated terrestrial and non-terrestrial 6G solutions for sustainable pp. 212–217, 2020.
maritime networking,” IEEE Netw., vol. 36, no. 3, pp. 183–190, 2022. [43] S. Jere, Y. Song, Y. Yi, and L. Liu, “Distributed learning meets 6G:
[22] R. Giuliano and E. Innocenti, “Machine learning techniques for non- A communication and computing perspective,” IEEE Wirel. Commun.,
terrestrial networks,” Electron., vol. 12, no. 3, p. 652, 2023. vol. 30, no. 1, pp. 112–117, 2023.
[23] T. Naous, M. Itani, M. Awad, and S. Sharafeddine, “Reinforcement [44] X. Shen, J. Gao, W. Wu, M. Li, C. Zhou, and W. Zhuang, “Holistic
learning in the sky: A survey on enabling intelligence in NTN-based network virtualization and pervasive network intelligence for 6G,” IEEE
communications,” IEEE Access, vol. 11, pp. 19 941–19 968, 2023. Commun. Surveys Tuts., vol. 24, no. 1, pp. 1–30, 2022.
[24] A. Bhattacharyya, S. M. Nambiar, R. Ojha, A. Gyaneshwar, [45] B. Mao, F. Tang, Y. Kawamoto, and N. Kato, “AI models for green
U. Chadha, and K. Srinivasan, “Machine learning and deep communications towards 6G,” IEEE Commun. Surveys Tuts., vol. 24,
learning powered satellite communications: Enabling technologies, no. 1, pp. 210–247, 2022.
applications, open challenges, and future research directions,” Int. J. [46] Y. Sun, J. Liu, J. Wang, Y. Cao, and N. Kato, “When machine learning
Satell. Commun. Netw., vol. n/a, no. n/a, 2023. [Online]. Available: meets privacy in 6G: A survey,” IEEE Commun. Surveys Tuts., vol. 22,
https://onlinelibrary.wiley.com/doi/abs/10.1002/sat.1482 no. 4, pp. 2694–2724, 2020.

34
[47] Z. M. Fadlullah, B. Mao, and N. Kato, “Balancing QoS and security [72] K. M. Leung, “Naive bayesian classifier,” Polytech. Univ. Dept. Comput.
in the edge: Existing practices, challenges, and 6G opportunities with Sci./Finance Risk Eng., vol. 2007, pp. 123–156, 2007.
machine learning,” IEEE Commun. Surveys Tuts., vol. 24, no. 4, pp. [73] M. A. Hearst, S. T. Dumais, E. Osuna, J. Platt, and B. Scholkopf,
2419–2448, 2022. “Support vector machines,” IEEE Intell. Syst. Appl., vol. 13, no. 4, pp.
[48] F. Tang, B. Mao, Y. Kawamoto, and N. Kato, “Survey on machine 18–28, 1998.
learning for intelligent end-to-end communication toward 6G: From [74] F. Rosenblatt, “The perceptron: a probabilistic model for information
network access, routing to traffic control and streaming adaption,” IEEE storage and organization in the brain.” Psychol. Rev., vol. 65, no. 6, p.
Commun. Surveys Tuts., vol. 23, no. 3, pp. 1578–1598, 2021. 386, 1958.
[49] 3GPP, “Study on New Radio (NR) to support non-terrestrial [75] G.-B. Huang, Q.-Y. Zhu, and C.-K. Siew, “Extreme learning machine:
networks,” 3rd Generation Partnership Project (3GPP), Technical theory and applications,” Neurocomputing, vol. 70, no. 1-3, pp. 489–501,
Report (TR), TR 38.811, Sept. 2020, V15.4.0. [Online]. Available: 2006.
https://www.3gpp.org/ftp//Specs/archive/38 series/38.811/ [76] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image
[50] L. C. Alexandre, A. Linhares, G. Neto, and A. C. Sodre, “High-altitude recognition,” in Proc. IEEE Conf. comput. vision pattern recognit., 2016,
platform stations as IMT base stations: Connectivity from the pp. 770–778.
stratosphere,” Comm. Mag., vol. 59, no. 12, p. 30–35, dec 2021. [77] D. F. Specht, “Probabilistic neural networks,” Neural netw., vol. 3, no. 1,
[Online]. Available: https://doi.org/10.1109/MCOM.001.2100477 pp. 109–118, 1990.
[51] N. Cassiau, G. Noh, S. Jaeckel, L. Raschkowski, J.-M. Houssin, [78] Y. LeCun, Y. Bengio et al., “Convolutional networks for images, speech,
L. Combelles, M. Thary, J. Kim, J.-B. Dore, and M. Laugeois, “Satellite and time series,” handbook brain theory and neural netw., vol. 3361,
and terrestrial multi-connectivity for 5G: Making spectrum sharing no. 10, p. 1995, 1995.
possible,” in 2020 IEEE Wireless Commun. Netw. Conf. Workshops [79] M. D. Zeiler, D. Krishnan, G. W. Taylor, and R. Fergus,
(WCNCW), 2020, pp. 1–6. “Deconvolutional networks,” in 2010 IEEE Comput. Soc. Conf. comput.
[52] H. Friis, “A note on a simple transmission formula,” Proc. IRE, vol. 34, vision pattern recognit. IEEE, 2010, pp. 2528–2535.
no. 5, pp. 254–256, 1946. [80] L. R. Medsker and L. Jain, “Recurrent neural networks,” Des. Appl.,
[53] J. Salo, L. Vuokko, H. El-Sallabi, and P. Vainikainen, “Shadow fading vol. 5, pp. 64–67, 2001.
revisited,” in 2006 IEEE 63rd Veh. Tech. Conf., vol. 6, 2006, pp. 2843– [81] R. Pascanu, T. Mikolov, and Y. Bengio, “On the difficulty of training
2847. recurrent neural networks,” in Int. Conf. mach. learn. Pmlr, 2013, pp.
[54] Attenuation by atmospheric gases and related effects, International 1310–1318.
Telecommunications Union, Recommendation ITU-R P.676-13, [82] K. Cho, B. Van Merriënboer, D. Bahdanau, and Y. Bengio, “On the
Aug. 2022. [Online]. Available: https://www.itu.int/rec/R-REC-P. properties of neural machine translation: Encoder-decoder approaches,”
676-13-202208-I/en arXiv preprint arXiv:1409.1259, 2014.
[55] Attenuation due to clouds and fog, International Telecommunications [83] S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural
Union, Recommendation ITU-R P.840-8, Aug. 2019. [Online]. comput., vol. 9, no. 8, pp. 1735–1780, 1997.
Available: https://www.itu.int/rec/R-REC-P.840-8-201908-I/en [84] B. Schrauwen, D. Verstraeten, and J. Van Campenhout, “An overview
[56] Ionospheric propagation data and prediction methods required of reservoir computing: theory, applications and implementations,” in
for the design of satellite networks and systems, International Proc. of the 15th Eur. Symp. Artif. Neural Netw.. p. 471-482 2007, 2007,
Telecommunications Union, Recommendation ITU-R P.531-14, pp. 471–482.
Aug. 2019. [Online]. Available: https://www.itu.int/rec/R-REC-P. [85] T. Hastie, R. Tibshirani, J. Friedman, T. Hastie, R. Tibshirani, and
531-14-201908-I/en J. Friedman, “Unsupervised learning,” elements statist. learn.: Data
[57] Propagation data required for the design systems in the land mining, inference, and prediction, pp. 485–585, 2009.
mobile-satellite service, International Telecommunications Union, [86] D. Steinley, “K-means clustering: a half-century synthesis,” Brit. J.
Recommendation ITU-R P.681-11, Aug. 2019. [Online]. Available: Math. Statist. Psychol., vol. 59, no. 1, pp. 1–34, 2006.
https://www.itu.int/rec/R-REC-P.681-11-201908-I/en [87] H. Abdi and L. J. Williams, “Principal component analysis,” Wiley
[58] 3GPP, “Solutions for NR to support non-terrestrial networks interdisciplinary rev.: comput. statist., vol. 2, no. 4, pp. 433–459, 2010.
(NTN),” 3rd Generation Partnership Project (3GPP), Technical [88] T. Cover and P. Hart, “Nearest neighbor pattern classification,” IEEE
Report (TR), TR 38.821, Jan. 2020, V16.0.0. [Online]. Available: trans. inf. theory, vol. 13, no. 1, pp. 21–27, 1967.
https://www.3gpp.org/ftp//Specs/archive/38 series/38.821/ [89] P. Baldi, “Autoencoders, unsupervised learning and deep architectures,”
[59] A. L. Samuel, “Some studies in machine learning using the game of in Proc. of the 2011 Int. Conf. on Unsupervised and Transfer Learning
checkers,” IBM J. Res. Dev., vol. 3, no. 3, pp. 210–229, 1959. Workshop - Volume 27, ser. UTLW’11. JMLR.org, 2011, p. 37–50.
[60] T. Mitchell, Machine Learning, 1st ed. New York, NY: McGraw-Hill, [90] D. P. Kingma and M. Welling, “Auto-encoding variational bayes,” arXiv
1997. [Online]. Available: https://www.cs.cmu.edu/∼tom/mlbook.html preprint arXiv:1312.6114, 2013.
[61] C. M. Bishop, “Neural netw. appl.” Review of scientific instruments, [91] P. Vincent, H. Larochelle, Y. Bengio, and P.-A. Manzagol, “Extracting
vol. 65, no. 6, pp. 1803–1832, 1994. and composing robust features with denoising autoencoders,” in Proc.
[62] R. Rojas, “The backpropagation algorithm,” in Neural netw. Springer, 25th Int. Conf. mach. learn., 2008, pp. 1096–1103.
1996, pp. 149–182. [92] M. Ranzato, C. Poultney, S. Chopra, and Y. Cun, “Efficient learning
[63] S. Boyd, S. P. Boyd, and L. Vandenberghe, Convex optimization. of sparse representations with an energy-based model,” Adv. neural inf.
Cambridge, UK: Cambridge univ. press, 2004. process. syst., vol. 19, 2006.
[64] S.-i. Amari, “Backpropagation and stochastic gradient descent method,” [93] A. Creswell, T. White, V. Dumoulin, K. Arulkumaran, B. Sengupta,
Neurocomputing, vol. 5, no. 4-5, pp. 185–196, 1993. and A. A. Bharath, “Generative adversarial networks: An overview,”
[65] J. Konečnỳ, J. Liu, P. Richtárik, and M. Takáč, “Mini-batch semi- IEEE signal process. mag., vol. 35, no. 1, pp. 53–65, 2018.
stochastic gradient descent in the proximal setting,” IEEE J. Sel. Topics [94] X. Dong, Z. Yu, W. Cao, Y. Shi, and Q. Ma, “A survey on ensemble
Signal Process., vol. 10, no. 2, pp. 242–255, 2015. learning,” Frontiers Comput. Sci., vol. 14, no. 2, pp. 241–258, 2020.
[66] J. J. Moré and D. C. Sorensen, “Newton’s method,” Argonne Nat.Lab., [95] L. P. Kaelbling, M. L. Littman, and A. W. Moore, “Reinforcement
IL (USA), Tech. Rep., 1982. learning: A survey,” J. artif. intell. res., vol. 4, pp. 237–285, 1996.
[67] I. Muhammad and Z. Yan, “Supervised machine learning approaches: [96] M. L. Puterman, “Markov decision processes,” Handbooks operations
A survey.” ICTACT J. Soft Comput., vol. 5, no. 3, 2015. res. manage. sci., vol. 2, pp. 331–434, 1990.
[68] R. J. Freund, Regression Analysis: Statistical Modeling of a Response [97] R. Sutton and A. Barto, “Reinforcement learning: An introduction,”
Variable, 2nd ed. Hoboken, NJ: John Wiley & Sons, 2006. IEEE Trans. Neural Netw., vol. 9, no. 5, pp. 1054–1054, 1998.
[69] A. Gross and J. Smuda, Linear Regression. New York, NY: Springer, [98] R. Bellman, “Dynamic programming,” Sci., vol. 153, no. 3731, pp.
2017. 34–37, 1966.
[70] R. E. Wright, Logistic regression. Washington DC: Amer. Psychol. [99] N. Metropolis and S. Ulam, “The monte carlo method,” J. Amer.
Assoc., 1995. statistical assoc., vol. 44, no. 247, pp. 335–341, 1949.
[71] C. Kingsford and S. L. Salzberg, “What are decision trees?” Nat. [100] C. J. Watkins and P. Dayan, “Q-learning,” Mach. Learn., vol. 8, no. 3,
Biotechnol., vol. 26, no. 9, pp. 1011–1013, 2008. pp. 279–292, 1992.

35
[101] G. A. Rummery and M. Niranjan, On-line Q-learning using [124] X. Hu, L. Wang, Y. Wang, S. Xu, Z. Liu, and W. Wang, “Dynamic beam
connectionist systems. Cambridge, UK: Univ. Cambridge, Dept. Eng. hopping for DVB-S2X GEO satellite: A DRL-powered GA approach,”
Cambridge, UK, 1994, vol. 37. IEEE Commun. Lett., vol. 26, no. 4, pp. 808–812, 2022.
[102] L. S. Shapley, “Stochastic games,” Proc. Nat. acad. sci., vol. 39, no. 10, [125] Y. Zhang, X. Hu, R. Chen, Z. Zhang, L. Wang, and W. Wang,
pp. 1095–1100, 1953. “Dynamic beam hopping for DVB-S2X satellite: A multi-objective deep
[103] J. Hu and M. P. Wellman, “Nash Q-learning for general-sum stochastic reinforcement learning approach,” in 2019 IEEE Int. Conf. Ubiquitous
games,” J. mach. learn. res., vol. 4, no. Nov, pp. 1039–1069, 2003. Comput. Commun. (IUCC) Data Sci. Comput. Intell. (DSCI) Smart
[104] R. S. Sutton, D. McAllester, S. Singh, and Y. Mansour, “Policy gradient Comput., Netw. Services (SmartCNS), 2019, pp. 164–169.
methods for reinforcement learning with function approximation,” Adv. [126] X. Hu, Y. Zhang, X. Liao, Z. Liu, W. Wang, and F. M. Ghannouchi,
neural inf. process. syst., vol. 12, 1999. “Dynamic beam hopping method based on multi-objective deep
[105] D. Silver, G. Lever, N. Heess, T. Degris, D. Wierstra, and M. Riedmiller, reinforcement learning for next generation satellite broadband systems,”
“Deterministic policy gradient algorithms,” in Int. Conf. mach. learn. IEEE Trans. Broadcast., vol. 66, no. 3, pp. 630–646, 2020.
Pmlr, 2014, pp. 387–395. [127] Z. Lin, Z. Ni, L. Kuang, C. Jiang, and Z. Huang, “Dynamic beam pattern
[106] V. Konda and J. Tsitsiklis, “Actor-critic algorithms,” Adv. neural inf. and bandwidth allocation based on multi-agent deep reinforcement
process. syst., vol. 12, 1999. learning for beam hopping satellite systems,” IEEE Trans. Veh. Technol.,
[107] V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. vol. 71, no. 4, pp. 3917–3930, 2022.
Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski et al., [128] Q. Zhao, Y. Hu, Z. Pang, and D. Ren, “Beam hopping for LEO satellite:
“Human-level control through deep reinforcement learning,” nature, vol. Challenges and opportunities,” in 2022 Int. Conf. Culture-Oriented Sci.
518, no. 7540, pp. 529–533, 2015. Technol. (CoST), 2022, pp. 319–324.
[108] H. Hasselt, “Double Q-learning,” Adv. neural inf. process. syst., vol. 23, [129] J. Choi and V. Chan, “Optimum power and beam allocation based on
2010. traffic demands and channel conditions over satellite downlinks,” IEEE
Trans. Wireless Commun., vol. 4, no. 6, pp. 2983–2993, 2005.
[109] H. Van Hasselt, A. Guez, and D. Silver, “Deep reinforcement learning
with double Q-learning,” in Proc. AAAI Conf. artif. intell., vol. 30, no. 1, [130] X. Alberti, J. M. Cebrian, A. Del Bianco, Z. Katona, J. Lei, M. A.
2016. Vazquez-Castro, A. Zanus, L. Gilbert, and N. Alagha, “System capacity
optimization in time and frequency for multibeam multi-media satellite
[110] Z. Wang, T. Schaul, M. Hessel, H. Hasselt, M. Lanctot, and N. Freitas,
systems,” in 2010 5th Adv. Satell. Multimedia Syst. Conf. 11th Signal
“Dueling network architectures for deep reinforcement learning,” in Int.
Process. Space Commun. Workshop, 2010, pp. 226–233.
Conf. mach. learn. PMLR, 2016, pp. 1995–2003.
[131] R. Alegre-Godoy, N. Alagha, and M. A. Vázquez-Castro, “Offered
[111] M. G. Bellemare, W. Dabney, and R. Munos, “A distributional capacity optimization mechanisms for multi-beam satellite systems,” in
perspective on reinforcement learning,” in Int. Conf. mach. learn. 2012 IEEE Int. Conf. Commun. (ICC), 2012, pp. 3180–3184.
PMLR, 2017, pp. 449–458.
[132] A. Paris, I. Del Portillo, B. Cameron, and E. Crawley, “A genetic
[112] T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, algorithm for joint power and bandwidth allocation in multibeam satellite
and D. Wierstra, “Continuous control with deep reinforcement learning,” systems,” in 2019 IEEE Aerosp. Conf., 2019, pp. 1–15.
arXiv preprint arXiv:1509.02971, 2015.
[133] G. Cocco, T. de Cola, M. Angelone, Z. Katona, and S. Erl, “Radio
[113] M. Hausknecht and P. Stone, “Deep recurrent Q-learning for partially resource management optimization of flexible satellite payloads for
observable MDPs,” in 2015 aaai fall Symp. ser., 2015. DVB-S2 systems,” IEEE Trans. Broadcast., vol. 64, no. 2, pp. 266–280,
[114] R. Alegre, N. S. Alagha, and M. A. Vázquez-Castro, “Heuristic 2018.
algorithms for flexible resource allocation in beam hopping multi-beam [134] N. Pachler, J. J. G. Luis, M. Guerster, E. Crawley, and B. Cameron,
satellite systems,” in 29th AIAA International Communications Satellite “Allocating power and bandwidth in multibeam satellite systems using
Systems Conference (ICSSC-2011), 2011. particle swarm optimization,” in 2020 IEEE Aerosp. Conf., 2020, pp.
[115] G. Zheng, S. Chatzinotas, and B. Ottersten, “Generic optimization of 1–11.
linear precoding in multibeam satellite systems,” IEEE Trans. Wireless [135] A. I. Aravanis, B. Shankar M. R., P.-D. Arapoglou, G. Danoy, P. G.
Commun., vol. 11, no. 6, pp. 2308–2320, 2012. Cottis, and B. Ottersten, “Power allocation in multibeam satellite
[116] L. Lei, E. Lagunas, Y. Yuan, M. G. Kibria, S. Chatzinotas, and systems: A two-stage multi-objective optimization,” IEEE Trans.
B. Ottersten, “Deep learning for beam hopping in multibeam satellite Wireless Commun., vol. 14, no. 6, pp. 3171–3182, 2015.
systems,” in 2020 IEEE 91st Veh. Tech. Conf. (VTC2020-Spring), 2020, [136] M. Chen, Y. Zhang, Y. Teng, B. Liu, and L. Zhang, “Reinforcement
pp. 1–5. learning based signal quality aware handover scheme for LEO satellite
[117] ——, “Beam illumination pattern design in satellite networks: Learning communication networks,” in Human Centered Comput.: 5th Int. Conf.
and optimization for efficient beam hopping,” IEEE Access, vol. 8, pp. , HCC 2019, Čačak, Serbia, August 5–7, 2019, Revised Selected Papers
136 655–136 667, 2020. 5. Springer, 2019, pp. 44–55.
[118] S. Amiri and M. Mehdipour, “Accurate doppler frequency shift [137] H. Xu, D. Li, M. Liu, G. Han, W. Huang, and C. Xu, “QoE-driven
estimation for any satellite orbit,” in 2007 3rd Int. Conf. Recent Advances intelligent handover for user-centric mobile satellite networks,” IEEE
Space Technol., 2007, pp. 602–607. Trans. Veh. Technol., vol. 69, no. 9, pp. 10 127–10 139, 2020.
[119] T. Kim, K. Ko, I. Hwang, D. Hong, S. Choi, and H. Wang, “RSRP- [138] S. He, T. Wang, and S. Wang, “Load-aware satellite handover strategy
based doppler shift estimator using machine learning in high-speed based on multi-agent reinforcement learning,” in GLOBECOM 2020 -
train systems,” IEEE Trans. Veh. Technol., vol. 70, no. 1, pp. 371–380, 2020 IEEE Global Commun. Conf., 2020, pp. 1–6.
2021. [139] J. Wang, W. Mu, Y. Liu, L. Guo, S. Zhang, and G. Gui, “Deep
[120] W. Usaha and J. A. Barria, “Reinforcement learning for resource reinforcement learning-based satellite handover scheme for satellite
allocation in LEO satellite networks,” IEEE Trans. Syst., Man, Cybern., communications,” in 2021 13th Int. Conf. Wireless Commun. Signal
Part B (Cybern.), vol. 37, no. 3, pp. 515–527, 2007. Process. (WCSP), 2021, pp. 1–6.
[121] Y. Li, Y. Fan, S. Liu, L. Liu, and W. Yang, “Overview of beam hopping [140] C. Zhang, N. Zhang, W. Cao, K. Tian, and Z. Yang, “An AI-Based
algorithms in large scale LEO satellite constellation,” in 2021 IEEE Optimization of Handover Strategy in Non-Terrestrial Networks,” in
20th Int. Conf. Trust, Secur. Privacy Comput. Commun. (TrustCom), 2020 ITU Kaleidoscope: Industry-Driven Digit. Transformation (ITU
2021, pp. 1345–1351. K), 2020, pp. 1–6.
[122] Y. Han, C. Zhang, and G. Zhang, “Dynamic beam hopping resource [141] S. Jung, M.-S. Lee, J. Kim, M.-Y. Yun, J. Kim, and J.-H. Kim,
allocation algorithm based on deep reinforcement learning in multi- “Trustworthy handover in LEO satellite mobile networks,” ICT Express,
beam satellite systems,” in 2021 3rd Int. Academic Exchange Conf. Sci. vol. 8, no. 3, pp. 432–437, 2022.
Technol. Innov. (IAECST), 2021, pp. 68–73. [142] D.-F. Wu, C. Huang, Y. Yin, S. Huang, M. W. A. Ashraf, Q. Guo,
[123] X. Hu, S. Liu, Y. Wang, L. Xu, Y. Zhang, C. Wang, and W. Wang, and L. Zhang, “LB-DDQN for handover decision in satellite-terrestrial
“Deep reinforcement learning-based beam hopping algorithm in integrated networks,” Wireless Commun. Mobile Comput., vol. 2021,
multibeam satellite systems,” IET Commun., vol. 13, no. 16, pp. pp. 1–11, 2021.
2485–2491, 2019. [Online]. Available: https://ietresearch.onlinelibrary. [143] H. Liu, Y. Wang, and Y. Wang, “A successive deep Q-learning based
wiley.com/doi/abs/10.1049/iet-com.2018.5774 distributed handover scheme for large-scale LEO satellite networks,” in

36
2022 IEEE 95th Veh. Tech. Conf. :(VTC2022-Spring). IEEE, 2022, CoRR, vol. abs/2108.07757, 2021. [Online]. Available: https:
pp. 1–6. //arxiv.org/abs/2108.07757
[144] E. Papapetrou, S. Karapantazis, G. Dimitriadis, and F.-N. Pavlidou, [166] X. Ding, L. Feng, Y. Zou, and G. Zhang, “Deep learning aided spectrum
“Satellite handover techniques for LEO networks,” Int. J. Satell. prediction for satellite communication systems,” IEEE Trans. Veh.
Commun. Netw., vol. 22, no. 2, pp. 231–245, 2004. [Online]. Available: Technol., vol. 69, no. 12, pp. 16 314–16 319, 2020.
https://onlinelibrary.wiley.com/doi/abs/10.1002/sat.783 [167] X. Ding, L. Feng, J. Cheng, and G. Zhang, “Spectrum reconstruction
[145] Z. Wu, F. Jin, J. Luo, Y. Fu, J. Shan, and G. Hu, “A graph-based satellite via deep convolutional neural networks for satellite communication
handover framework for LEO satellite communication networks,” IEEE systems,” IEEE Trans. Commun., vol. 70, no. 9, pp. 5989–6001, 2022.
Commun. Lett., vol. 20, no. 8, pp. 1547–1550, 2016. [168] X. Ding, T. Ni, Y. Zou, and G. Zhang, “Deep learning for satellites based
[146] X. Lv, S. Wu, A. Li, J. Jiao, N. Zhang, and Q. Zhang, “A weighted spectrum sensing systems: A low computational complexity perspective,”
graph-based handover strategy for aeronautical traffic in LEO SatCom IEEE Trans. Veh. Technol., pp. 1–6, 2022.
networks,” IEEE Netw. Lett., vol. 4, no. 3, pp. 132–136, 2022. [169] Z. Ren, J. Jin, W. Li, R. Wen, and Y. Zhan, “Frequency prediction and
[147] S. Zhang, A. Liu, and X. Liang, “A multi-objective satellite handover assignment among SatComs networks: A CNN- LSTM approach,” in
strategy based on entropy in LEO satellite communications,” in 2020 2022 IEEE/CIC Int. Conf. Commun. China (ICCC Workshops), 2022,
IEEE 6th Int. Conf. Comput. Commun. (ICCC), 2020, pp. 723–728. pp. 106–111.
[148] S. Zhang, A. Liu, C. Han, X. Ding, and X. Liang, “A network-flows- [170] Z. Ren, J. Jin, W. Li, and Y. Zhan, “Intelligent action selection for
based satellite handover strategy for LEO satellite networks,” IEEE NGSO networks with interference constraints: A modified Q-learning
Wireless Commun. Lett., vol. 10, no. 12, pp. 2669–2673, 2021. approach,” IEEE Trans. Aerosp. Electron. Syst., vol. 58, no. 3, pp.
[149] Y. Wu, G. Hu, F. Jin, and J. Zu, “A satellite handover strategy based 2231–2242, 2022.
on the potential game in LEO satellite networks,” IEEE Access, vol. 7, [171] M. Jia, X. Zhang, J. Sun, X. Gu, and Q. Guo, “Intelligent resource
pp. 133 641–133 652, 2019. management for satellite and terrestrial spectrum shared networking
[150] J. Miao, P. Wang, H. Yin, N. Chen, and X. Wang, “A multi-attribute toward B5G,” IEEE Wireless Commun. Mag., vol. 27, no. 1, pp. 54–61,
decision handover scheme for LEO mobile satellite networks,” in 2019 2020.
IEEE 5th Int. Conf. Comput. Commun. (ICCC), 2019, pp. 938–942. [172] X. Hu, X. Liao, Z. Liu, S. Liu, X. Ding, M. Helaoui, W. Wang, and F. M.
[151] M. Rahman, T. Walingo, and F. Takawira, “Adaptive handover scheme Ghannouchi, “Multi-agent deep reinforcement learning-based flexible
for LEO satellite communication system,” in AFRICON 2015, 2015, satellite payload for mobile terminals,” IEEE Trans. Veh. Technol.,
pp. 1–5. vol. 69, no. 9, pp. 9849–9865, 2020.
[152] W. Zhaofeng, H. Guyu, Y. Seyedi, and J. Fenglin, “A simple real-time [173] C. Zhang, C. Jiang, J. Jin, S. Wu, L. Kuang, and S. Guo, “Spectrum
handover management in the mobile satellite communication networks,” sensing and recognition in satellite systems,” IEEE Trans. Veh. Technol.,
in 2015 17th Asia-Pacific Netw. Oper. Manage. Symp.(APNOMS), 2015, vol. 68, no. 3, pp. 2502–2516, 2019.
pp. 175–179. [174] J. Mitola and G. Maguire, “Cognitive radio: making software radios
[153] Y. Li, W. Zhou, and S. Zhou, “Forecast based handover in an extensible more personal,” IEEE Pers. Commun., vol. 6, no. 4, pp. 13–18, 1999.
multi-layer LEO mobile satellite system,” IEEE Access, vol. 8, pp. [175] A.-A. A. Boulogeorgos, N. D. Chatzidiamantis, and G. K. Karagiannidis,
42 768–42 783, 2020. “Energy detection spectrum sensing under RF imperfections,” IEEE
[154] Y. Liu, X. Tang, Y. Zhou, J. Shi, M. Qian, and S. Li, Trans. Commun., vol. 64, no. 7, pp. 2754–2766, 2016.
“Channel reservation based load aware handover for LEO satellite [176] P. Semba Yawada and A. J. Wei, “Cyclostationary detection based on
communications,” in 2022 IEEE 95th Veh. Tech. Conf. : (VTC2022- non-cooperative spectrum sensing in cognitive radio network,” in 2016
Spring), 2022, pp. 1–5. IEEE Int. Conf. Cyber Technol. Automat., Control, Intell. Syst. (CYBER),
[155] E. Yoon, S. Kwon, U. Yun, and S.-Y. Kim, “Doppler spread estimation 2016, pp. 184–187.
based on machine learning for an OFDM system,” in Wireless Commun. [177] Y. Zeng and Y.-C. Liang, “Eigenvalue-based spectrum sensing
Mobile Comput., 2021. algorithms for cognitive radio,” IEEE Trans. Commun., vol. 57, no. 6,
[156] T. Ngo, B. T. Kelley, and P. Rad, “Deep learning based prediction pp. 1784–1793, 2009.
of doppler shift for mobile communications,” in 2021 Telecoms Conf. [178] E. Biglieri, “An overview of cognitive radio for satellite
(ConfTELE), 2021, pp. 1–6. communications,” in 2012 IEEE 1st AESS Eur. Conf. Satell.
[157] S. Kim, J. Park, and C. Lee, “CNN-based doppler shift estimation for Telecommun. (ESTEL), 2012, pp. 1–3.
low earth orbit satellites,” in 2022 37th Int. Tech. Conf. Circuits/Syst., [179] C. Jiang and X. Zhu, “Reinforcement learning based capacity
Comput. Commun. (ITC-CSCC), 2022, pp. 1–3. management in multi-layer satellite networks,” IEEE Trans. Wireless
[158] M. Katayama, A. Ogawa, and N. Morinaga, “Carrier synchronization Commun., vol. 19, no. 7, pp. 4685–4699, 2020.
under doppler shift of the nongeostationary satellite communication [180] B. Zhao, J. Liu, Z. Wei, and I. You, “A Deep Reinforcement Learning
systems,” in [Proc.] Singapore ICCS/ISITA ‘92, 1992, pp. 466–470 Based Approach for Energy-Efficient Channel Allocation in Satellite
vol.2. Internet of Things,” IEEE Access, vol. 8, pp. 62 197–62 206, 2020.
[159] I. Ali, N. Al-Dhahir, and J. Hershey, “Doppler characterization for LEO [181] X. Hu, S. Liu, R. Chen, W. Wang, and C. Wang, “A deep reinforcement
satellites,” IEEE Trans. Commun., vol. 46, no. 3, pp. 309–313, 1998. learning-based framework for dynamic resource allocation in multibeam
[160] Z. Chenggong, C. Xi, and H. Zhen, “A comprehensive analysis on satellite systems,” IEEE Commun. Lett., vol. 22, no. 8, pp. 1612–1615,
doppler frequency and doppler frequency rate characterization for GNSS 2018.
receivers,” in 2016 2nd IEEE Int. Conf. Comput. Commun. (ICCC), [182] N. Dai, D. Zhou, M. Sheng, and J. Li, “Deep reinforcement learning
2016, pp. 2606–2610. based power allocation for high throughput satellites,” in 2021 IEEE
[161] T. A. Khan and M. Afshang, “A stochastic geometry approach to doppler 94th Veh. Tech. Conf. (VTC2021-Fall), 2021, pp. 1–5.
characterization in a LEO satellite network,” in ICC 2020 - 2020 IEEE [183] X. Li, H. Zhang, W. Li, and K. Long, “Multi-agent DRL for user
Int. Conf. Commun. (ICC), 2020, pp. 1–6. association and power control in terrestrial-satellite network,” in 2021
[162] J. Lin, Z. Hou, Y. Zhou, L. Tian, and J. Shi, “Map estimation based IEEE Global Commun. Conf. (GLOBECOM), 2021, pp. 1–5.
on doppler characterization in broadband and mobile LEO satellite [184] P. V. R. Ferreira, R. Paffenroth, A. M. Wyglinski, T. M. Hackett,
communications,” in 2016 IEEE 83rd Veh. Tech. Conf. (VTC Spring), S. G. Bilén, R. C. Reinhart, and D. J. Mortensen, “Multiobjective
2016, pp. 1–5. reinforcement learning for cognitive satellite communications using
[163] N. H. Nguyen and K. Doğançay, “Algebraic solution for stationary deep neural network ensembles,” IEEE J. Sel. Areas Commun., vol. 36,
emitter geolocation by a LEO satellite using doppler frequency no. 5, pp. 1030–1041, 2018.
measurements,” in 2016 IEEE Int. Conf. Acoust., Speech Signal [185] S. Nie, J. M. Jornet, and I. F. Akyildiz, “Deep-learning-based resource
Process.(ICASSP), 2016, pp. 3341–3345. allocation for multi-band communications in CubeSat networks,” in
[164] M. Pan, J. Hu, J. Yuan, J. Liu, and Y. Su, “An efficient blind doppler shift 2019 IEEE Int. Conf. Commun. Workshops (ICC Workshops), 2019, pp.
estimation and compensation method for LEO satellite communications,” 1–6.
in 2020 IEEE 20th Int. Conf. Commun. Technol.(ICCT), 2020, pp. 643– [186] T. S. Abdu, S. Kisseleff, L. Lei, E. Lagunas, J. Grotz, and S. Chatzinotas,
648. “A deep learning based acceleration of complex satellite resource
[165] X. Lin, Z. Lin, S. E. Löwenmark, J. Rune, and R. S. Karlsson, management problem,” in 2022 30th Eur. Signal Process. Conf.
“Doppler shift estimation in 5G new radio non-terrestrial networks,” (EUSIPCO), 2022, pp. 1092–1096.

37
[187] S. Kisseleff, E. Lagunas, T. S. Abdu, S. Chatzinotas, and B. Ottersten, [208] R. Xie, Q. Tang, Q. Wang, X. Liu, F. R. Yu, and T. Huang, “Satellite-
“Radio resource management techniques for multibeam satellite systems,” terrestrial integrated edge computing networks: Architecture, challenges,
IEEE Commun. Lett., vol. 25, no. 8, pp. 2448–2452, 2021. and open issues,” IEEE Netw., vol. 34, no. 3, pp. 224–231, 2020.
[188] L. Kuang, X. Chen, C. Jiang, H. Zhang, and S. Wu, “Radio resource [209] L. Zhang, H. Zhang, C. Guo, H. Xu, L. Song, and Z. Han, “Satellite-
management in future terrestrial-satellite communication networks,” aerial integrated computing in disasters: User association and offloading
IEEE Wireless Commun. Mag., vol. 24, no. 5, pp. 81–87, 2017. decision,” in ICC 2020-2020 IEEE Int. Conf. Commun. (ICC). IEEE,
[189] T. S. Abdu, S. Kisseleff, E. Lagunas, and S. Chatzinotas, “Power and 2020, pp. 554–559.
bandwidth minimization for demand-aware GEO satellite systems,” in [210] Y. Wang, J. Yang, X. Guo, and Z. Qu, “A game-theoretic approach
2021 IEEE Global Commun. Conf. (GLOBECOM), 2021, pp. 1–6. to computation offloading in satellite edge computing,” IEEE Access,
[190] T. S. Abdu, E. Lagunas, S. Kisseleff, and S. Chatzinotas, “Carrier and vol. 8, pp. 12 510–12 520, 2019.
power assignment for flexible broadband GEO satellite communications [211] S. Huang, G. Li, E. Ben-Awuah, B. O. Afum, and N. Hu, “A stochastic
system,” in 2020 IEEE 31st Annu. Int. Symp. Pers., Indoor Mobile mixed integer programming framework for underground mining
Radio Commun., 2020, pp. 1–7. production scheduling optimization considering grade uncertainty,” IEEE
[191] T. S. Abdu, S. Kisseleff, E. Lagunas, and S. Chatzinotas, “Limits of Access, vol. 8, pp. 24 495–24 505, 2020.
smart radio resource assignment in GEO satellite communications,” in [212] J. Gao, L. Zhao, and X. Shen, “Service offloading in terrestrial-satellite
2021 IEEE Wireless Commun. Netw. Conf. (WCNC), 2021, pp. 1–7. systems: User preference and network utility,” in 2019 IEEE Global
[192] P. Gu, R. Li, C. Hua, and R. Tafazolli, “Dynamic cooperative spectrum Commun. Conf. (GLOBECOM). IEEE, 2019, pp. 1–6.
sharing in a multi-beam LEO- GEO co-existing satellite system,” IEEE [213] J. Kim, T. Kim, M. Hashemi, C. G. Brinton, and D. J. Love, “Joint
Trans. Wireless Commun., vol. 21, no. 2, pp. 1170–1182, 2022. optimization of signal design and resource allocation in wireless
[193] T. Salih Abdu, L. Lei, S. Kisseleff, E. Lagunas, S. Chatzinotas, D2D edge computing,” in IEEE INFOCOM 2020-IEEE Conf. Comput.
and B. Ottersten, “Precoding-aided bandwidth optimization for high Commun. IEEE, 2020, pp. 2086–2095.
throughput satellite systems,” in 2021 IEEE 4th 5G World Forum [214] T. Chen, J. Liu, Q. Ye, W. Zhuang, W. Zhang, T. Huang, and Y. Liu,
(5GWF), 2021, pp. 13–17. “Learning-based computation offloading for IoRT through Ka/Q-band
[194] P. Angeletti and R. De Gaudenzi, “Heuristic radio resource management satellite–terrestrial integrated networks,” IEEE Internet Things J., vol. 9,
for massive MIMO in satellite broadband communication networks,” no. 14, pp. 12 056–12 070, 2021.
IEEE Access, vol. 9, pp. 147 164–147 190, 2021. [215] Y. Yin, C. Huang, D.-F. Wu, S. Huang, M. Ashraf, and Q. Guo,
[195] A. Paris, I. Del Portillo, B. Cameron, and E. Crawley, “A genetic “Reinforcement learning-based routing algorithm in satellite-terrestrial
algorithm for joint power and bandwidth allocation in multibeam satellite integrated networks,” Wireless Commun. Mobile Comput., vol. 2021,
systems,” in 2019 IEEE Aerosp. Conf., 2019, pp. 1–15. 2021.
[216] X. Wang, Z. Dai, and Z. Xu, “LEO satellite network routing algorithm
[196] N. Pachler, J. J. G. Luis, M. Guerster, E. Crawley, and B. Cameron,
based on reinforcement learning,” in 2021 IEEE 4th Int. Conf. Electron.
“Allocating power and bandwidth in multibeam satellite systems using
Technol.(ICET), 2021, pp. 1105–1109.
particle swarm optimization,” in 2020 IEEE Aerosp. Conf., 2020, pp.
[217] Y. HUANG, W. Shufan, K. Zeyu, M. Zhongcheng, H. HUANG,
1–11.
W. Xiaofeng, A. J. TANG, and X. CHENG, “Reinforcement learning
[197] B. Deng, C. Jiang, H. Yao, S. Guo, and S. Zhao, “The next generation
based dynamic distributed routing scheme for mega LEO satellite
heterogeneous satellite communication networks: Integration of resource
networks,” Chinese J. Aeronaut., 2022.
management and deep reinforcement learning,” IEEE Wireless Commun.
[218] F. Wang, D. Jiang, Z. Wang, Z. Lv, and S. Mumtaz, “Fuzzy- CNN
Mag., vol. 27, no. 2, pp. 105–111, 2020.
based multi-task routing for integrated satellite-terrestrial networks,”
[198] S. Sthapit, S. Lakshminarayana, L. He, G. Epiphaniou, and C. Maple, IEEE Trans. Veh. Technol., vol. 71, no. 2, pp. 1913–1926, 2021.
“Reinforcement learning for security-aware computation offloading in [219] D. Liu, J. Zhang, J. Cui, S.-X. Ng, R. G. Maunder, and L. Hanzo, “Deep
satellite networks,” IEEE Internet Things J., vol. 9, no. 14, pp. 12 351– learning aided routing for space-air-ground integrated networks relying
12 363, 2021. on real satellite, flight, and shipping data,” IEEE Wireless Commun.
[199] D. Zhu, H. Liu, T. Li, J. Sun, J. Liang, H. Zhang, L. Geng, and Y. Liu, Mag., vol. 29, no. 2, pp. 177–184, 2022.
“Deep reinforcement learning-based task offloading in satellite-terrestrial [220] Z. Na, Z. Pan, X. Liu, Z. Deng, Z. Gao, and Q. Guo, “Distributed
edge computing networks,” in 2021 IEEE Wireless Commun. Netw. Conf. routing strategy based on machine learning for LEO satellite network,”
(WCNC). IEEE, 2021, pp. 1–7. Wireless Commun. Mobile Comput., vol. 2018, 2018.
[200] N. Waqar, S. A. Hassan, A. Mahmood, K. Dev, D.-T. Do, and [221] M. Liu, J. Li, and H. Lu, “Routing in small satellite networks: A GNN-
M. Gidlund, “Computation offloading and resource allocation in MEC- based learning approach,” arXiv preprint arXiv:2108.08523, 2021.
enabled integrated aerial-terrestrial vehicular networks: A reinforcement [222] Z. Tu, H. Zhou, K. Li, G. Li, and Q. Shen, “A routing optimization
learning approach,” IEEE Trans. Intell. Transp. Syst., vol. 23, no. 11, method for software-defined SGIN based on deep reinforcement
pp. 21 478–21 491, 2022. learning,” in 2019 IEEE Globecom Workshops (GC Wkshps), 2019,
[201] Z. Qin, H. Yao, T. Mai, D. Wu, N. Zhang, and S. Guo, “Multi-agent pp. 1–6.
reinforcement learning aided computation offloading in aerial computing [223] E. W. Dijkstra, “A note on two problems in connexion with graphs,” in
for the internet-of-things,” IEEE Trans. Services Comput., 2022. Edsger Wybe Dijkstra: His Life, Work, and Legacy, 2022, pp. 287–290.
[202] Y. Lyu, Z. Liu, R. Fan, C. Zhan, H. Hu, and J. An, “Optimal computation [224] M. Werner, “A dynamic routing concept for ATM-based satellite personal
offloading in collaborative LEO- IoT enabled mec: A multi-agent deep communication networks,” IEEE J. Sel. Areas Commun., vol. 15, no. 8,
reinforcement learning approach,” IEEE Trans. Green Commun. Netw., pp. 1636–1648, 1997.
2022. [225] J. T. Moy, OSPF: anatomy of an Internet routing protocol. Boston,
[203] B. Mao, F. Tang, Y. Kawamoto, and N. Kato, “Optimizing computation MA: Addison-Wesley Professional, 1998.
offloading in satellite- UAV-served 6G IoT: A deep learning approach,” [226] L. Wood, A. Clerget, I. Andrikopoulos, G. Pavlou, and W. Dabbous,
IEEE Netw., vol. 35, no. 4, pp. 102–108, 2021. “IP routing issues in satellite constellation networks,” Int. J. Satell.
[204] J. Zhang, X. Zhang, P. Wang, L. Liu, and Y. Wang, “Double-edge Commun., vol. 19, no. 1, pp. 69–92, 2001.
intelligent integrated satellite terrestrial networks,” China Commun., [227] Y. Lv, C. Xing, N. Xu, X. Han, and F. Wang, “Research of adaptive
vol. 17, no. 9, pp. 128–146, 2020. routing scheme for LEO network,” in 2019 IEEE 5th Int. Conf. Comput.
[205] H. Li, C. Chen, C. Li, L. Liu, and G. Gui, “Aerial computing offloading Commun. (ICCC), 2019, pp. 987–992.
by distributed deep learning in collaborative satellite-terrestrial networks,” [228] E. Sigel, B. Denby, and S. Le Hégarat-Mascle, “Application of ant colony
in 2021 13th Int. Conf. Wireless Commun. Signal Process. (WCSP). optimization to adaptive routing in a LEO telecommunications satellite
IEEE, 2021, pp. 1–6. network,” in Annales des télécommun., vol. 57, no. 5/6. PRESSES
[206] Q. Tang, Z. Fei, and B. Li, “Distributed deep learning for cooperative POLYTECH. ROMANDES, 2002, pp. 520–539.
computation offloading in low earth orbit satellite networks,” China [229] S. Liu, D. Wu, and L. Zhang, “A routing model based on multiple-
Commun., vol. 19, no. 4, pp. 230–243, 2022. user requirements and the optimal solution,” IEEE Access, vol. 8, pp.
[207] T. Lv, W. Liu, H. Huang, and X. Jia, “Optimal data downloading by 156 470–156 483, 2020.
using inter-satellite offloading in LEO satellite networks,” in 2016 IEEE [230] N. Zhao, X. Long, and J. Wang, “A multi-constraint optimal routing
Global Commun. Conf. (GLOBECOM). IEEE, 2016, pp. 1–6. algorithm in LEO satellite networks,” Wireless Netw., pp. 1–12, 2021.

38
[231] J. Tao, Z. Na, and N. Zhang, “Time-varying graph model for LEO strategies in GEO SATCOM scenario,” Int. J. Satell. Commun. Netw.,
satellite network routing,” in 2022 9th Int. Conf. Dependable Syst. vol. 40, no. 2, pp. 96–119, 2022.
Appl.(DSA). IEEE, 2022, pp. 486–491. [252] I. Ashraf, M. Narra, M. Umer, R. Majeed, S. Sadiq, F. Javaid, and
[232] R. Kucukates and C. Ersoy, “High performance routing in a LEO N. Rasool, “A deep learning-based smart framework for cyber-physical
satellite network,” in Proc. Eighth IEEE Symp. Comput. Commun. ISCC and satellite system security threats detection,” Electron., vol. 11, no. 4,
2003, 2003, pp. 1403–1408 vol.2. p. 667, 2022.
[233] S. Lv, H. Li, J. Wu, H. Bai, X. Chen, Y. Shen, J. Zheng, R. Ding, H. Ma, [253] N. Moustafa, I. A. Khan, M. Hassanin, D. Ormrod, D. Pi, I. Razzak, and
and W. Li, “Routing strategy of integrated satellite-terrestrial network J. Slay, “DFSat: Deep federated learning for identifying cyber threats
based on hyperbolic geometry,” IEEE Access, vol. 8, pp. 113 003– in IoT-based satellite networks,” IEEE Trans. Ind. Inform., 2022.
113 010, 2020. [254] N. Koroniotis, N. Moustafa, and J. Slay, “A new intelligent satellite
[234] Y. Drif, E. Chaput, E. Lavinal, P. Berthou, B. Tiomela Jou, deep learning network forensic framework for smart satellite networks,”
O. Grémillet, and F. Arnal, “An extensible network slicing Comput. Elect. Eng., vol. 99, p. 107745, 2022.
framework for satellite integration into 5G,” Int. J. Satell. Commun. [255] M. Abdrabou and T. A. Gulliver, “Physical layer authentication for
Netw., vol. 39, no. 4, pp. 339–357, 2021. [Online]. Available: satellite communication systems using machine learning,” IEEE Open
https://onlinelibrary.wiley.com/doi/abs/10.1002/sat.1387 J. Commun. Soc., 2022.
[235] T. Ahmed, A. Alleg, R. Ferrus, and R. Riggio, “On-demand network [256] G. Oligeri, S. Sciancalepore, S. Raponi, and R. Di Pietro, “Past-AI:
slicing using SDN/NFV-enabled satellite ground segment systems,” in Physical-layer authentication of satellite transmitters via deep learning,”
2018 4th IEEE Conf. Netw. Softwarization Workshops (NetSoft). ieee, IEEE Trans. Inf. Forensics and Secur., vol. 18, pp. 274–289, 2022.
2018, pp. 242–246. [257] “O-RAN: Towards an open and smart RAN,” O-RAN ALLIANCE, Oct.
[236] C. Suzhi, W. Junyong, H. Hao, Z. Yi, Y. Shuling, Y. Lei, W. Shaojun, 2018, white paper. [Online]. Available: https://www.o-ran.org/resources
and G. Yongsheng, “Space edge cloud enabling network slicing for 5G [258] M. Minardi, T. X. Vu, L. Lei, C. Politis, and S. Chatzinotas, “Virtual
satellite network,” in 2019 15th Int. Wireless Commun. Mobile Comput. network embedding for NGSO systems: Algorithmic solution and SDN-
Conf. (IWCMC), 2019, pp. 787–792. Testbed validation,” IEEE Trans. Netw. Service Manage., pp. 1–1, 2022.
[237] L. Lei, Y. Yuan, T. X. Vu, S. Chatzinotas, M. Minardi, and [259] “MLSAT - machine learning and artificial intelligence for satellite
J. F. M. Montoya, “Dynamic-adaptive AI solutions for network slicing communication,” European Space Agency (ESA), 2020, Accessed:
management in satellite-integrated B5G systems,” IEEE Netw., vol. 35, May 31, 2023,. [Online]. Available: https://artes.esa.int/projects/mlsat
no. 6, pp. 91–97, 2021. [260] “SATAI - machine learning and artificial intelligence for satellite
[238] W. Wu, C. Zhou, M. Li, H. Wu, H. Zhou, N. Zhang, X. S. Shen, and communication,” European Space Agency (ESA), 2020, Accessed:
W. Zhuang, “AI-native network slicing for 6G networks,” IEEE Wireless May 31, 2023,. [Online]. Available: https://artes.esa.int/projects/satai
Commun. Mag., vol. 29, no. 1, pp. 96–103, 2022. [261] “ANChOR - data-driven network controller and orchestrator
[239] T. K. Rodrigues and N. Kato, “Network slicing with centralized for real-time network management,” European Space Agency
and distributed reinforcement learning for combined satellite/ground (ESA), 2021, Accessed: May 31, 2023,. [Online]. Available:
networks in a 6G environment,” IEEE Wireless Commun. Mag., vol. 29, https://artes.esa.int/projects/anchor
no. 1, pp. 104–110, 2022. [262] “SkyMon PIA - SkyMon predictive interference analysis,” European
[240] Y. Zhang, A. Liu, P. Li, and S. Jiang, “Deep learning ( DL)-based Space Agency (ESA), 2021, Accessed: May 31, 2023,. [Online].
channel prediction and hybrid beamforming for LEO satellite massive Available: https://artes.esa.int/projects/skymon-pia
MIMO system,” IEEE Internet Things J., vol. 9, no. 23, pp. 23 705– [263] “Spaice - satellite signal processing techniques using a Commercial Off-
23 715, 2022. The-Shelf AI chipset,” European Space Agency (ESA), 2022, Accessed:
[241] Y. Zhang, Y. Wu, A. Liu, X. Xia, T. Pan, and X. Liu, “Deep May 31, 2023,. [Online]. Available: https://artes.esa.int/projects/spaice
learning-based channel prediction for LEO satellite massive MIMO [264] L. Bonati, M. Polese, S. D’Oro, S. Basagni, and T. Melodia, “Open,
communication system,” IEEE Wireless Commun. Lett., vol. 10, no. 8, programmable, and virtualized 5G networks: State-of-the-art and the
pp. 1835–1839, 2021. road ahead,” Comput. Netw., vol. 182, p. 107516, 2020.
[242] Y. Zhang, S. Qu, and K. Wen, “A short-term traffic flow forecasting [265] M. Polese, L. Bonati, S. D’Oro, S. Basagni, and T. Melodia,
method based on chaos and RBF neural network,” Syst. Eng., vol. 11, “Understanding O-RAN: Architecture, interfaces, algorithms, security,
01 2007. and research challenges,” IEEE Commun. Surveys Tuts., 2023.
[243] L. Yang, X. Gu, and H. Shi, “A noval satellite network traffic prediction [266] H. Lee, Y. Jang, J. Song, and H. Yeon, “O-RAN AI/ML workflow
method based on GCN-GRU,” in 2020 Int. Conf. Wireless Commun. implementation of personalized network optimization via reinforcement
Signal Process. (WCSP), 2020, pp. 718–723. learning,” in 2021 IEEE Globecom Workshops (GC Wkshps), 2021, pp.
[244] N. Li, L. Hu, Z.-L. Deng, T. Su, and J.-W. Liu, “Research on GRU 1–6.
neural network satellite traffic prediction based on transfer learning,” [267] N. Salhab, R. Rahim, R. Langar, and R. Boutaba, “Machine learning
Wireless Pers. Commun., vol. 118, no. 1, pp. 815–827, 2021. based resource orchestration for 5G network slices,” in 2019 IEEE
[245] F. Zhu, L. Liu, and T. Lin, “An LSTM-based traffic prediction algorithm Global Commun. Conf. (GLOBECOM), 2019, pp. 1–6.
with attention mechanism for satellite network,” in Proc. of the 2020 [268] F. Kaltenberger, G. d. Souza, R. Knopp, and H. Wang, “The
3rd Int. Conf. Artif. Intell. Pattern Recognit., 2020, pp. 205–209. OpenAirInterface 5G New Radio implementation: Current status and
[246] Z. Liu, W. Li, J. Feng, and J. Zhang, “Research on satellite network roadmap,” in WSA 2019; 23rd Int. ITG Workshop Smart Antennas, 2019,
traffic prediction based on improved GRU neural network,” Sensors, pp. 1–5.
vol. 22, no. 22, p. 8678, 2022. [269] R. Schmidt, M. Irazabal, and N. Nikaein, “FlexRIC: An SDK for
[247] C. Han, L. Huo, X. Tong, H. Wang, and X. Liu, “Spatial anti-jamming next-generation SD-RANs,” in Proc. 17th Int. Conf. Emerg. Netw.
scheme for internet of satellites based on the deep reinforcement learning Experiments Technol., ser. CoNEXT ’21. New York, NY, USA:
and stackelberg game,” IEEE Trans. Veh. Technol., vol. 69, no. 5, pp. Assoc. Comput. Machinery, 2021, p. 411–425. [Online]. Available:
5331–5342, 2020. https://doi.org/10.1145/3485983.3494870
[248] C. Han, A. Liu, H. Wang, L. Huo, and X. Liang, “Dynamic anti-jamming [270] S. Kumar, A. K. Meshram, A. Astro, J. Querol, T. Schlichter, G. Casati,
coalition for satellite-enabled army IoT: A distributed game approach,” T. Heyn, F. Völk, R. T. Schwarz, A. Knopp et al., “OpenAirInterface as
IEEE Internet Things J., vol. 7, no. 11, pp. 10 932–10 944, 2020. a platform for 5G-NTN research and experimentation,” in 2022 IEEE
[249] C. Han, A. Liu, L. Huo, H. Wang, and X. Liang, “Anti-jamming routing Future Netw. World Forum (FNWF). IEEE, 2022, pp. 500–506.
for internet of satellites: a reinforcement learning approach,” in ICASSP [271] J. Kim, G. Casati, A. Pietrabissa, A. Giuseppi, E. C. Strinati, N. Cassiau,
2020-2020 IEEE Int. Conf. Acoust., Speech Signal Process.(ICASSP). G. Noh, H. Chung, I. Kim, M. Thary et al., “5G-ALLSTAR: An
IEEE, 2020, pp. 2877–2881. integrated satellite-cellular system for 5G and beyond,” in 2020 IEEE
[250] H. Xu, Y. Cheng, J. Liang, and P. Wang, “A jamming recognition Wireless Commun. Netw. Conf. Workshops (WCNCW). IEEE, 2020,
algorithm based on deep neural network in satellite navigation system,” pp. 1–6.
in China Satell. Navigation Conf. Springer, 2020, pp. 701–711. [272] “5G-EMUSAT:5G New Radio emulation over satellite,” 5GMETEROS,
[251] S. Arif, A. J. Hashmi, W. Khan, and R. Kausar, “A smart reactive 2022, Accessed: May 23, 2023,. [Online]. Available: https://5gmeteors.
jamming approach to counter reinforcement learning-based antijamming eurescom.eu/open-calls/1st-open-call-summary/5g-emusat/

39
[273] O. Kodheli, A. Astro, J. Querol, M. Gholamian, S. Kumar, N. Maturo,
and S. Chatzinotas, “Random access procedure over non-terrestrial
networks: From theory to practice,” IEEE Access, vol. 9, pp. 109 130–
109 143, 2021.
[274] S. Kumar, A. Astro, O. Kodheli, J. Querol, S. Chatzinotas, T. Schlichter,
G. Casati, T. Heyn, F. Völk, S. Kaya, R. T. Schwarz, A. Knopp,
A. Kapovits, and F. Kaltenberger, “5G-NTN GEO-based in-lab
demonstrator using OpenAirInterface5G,” in ASMS/SPSC 2022, 11th
Adv. Satell. Multimedia Sys. Conf. .17th Signal Process. Space Commun.
Workshop, 6-8 September 2022, Graz, Austria, IEEE, Ed., Graz, 2022.
[275] “5G-LEO - OpenAirInterface™ extension for 5G satellite links,”
2022, Accessed: May 23, 2023,. [Online]. Available: https:
//artes.esa.int/projects/5gleo
[276] G. Dulac-Arnold, N. Levine, D. J. Mankowitz, J. Li, C. Paduraru,
S. Gowal, and T. Hester, “Challenges of real-world reinforcement
learning: definitions, benchmarks and analysis,” Mach. Learn., vol.
110, no. 9, pp. 2419–2468, 2021.
[277] L. Canese, G. C. Cardarilli, L. Di Nunzio, R. Fazzolari, D. Giardino,
M. Re, and S. Spanò, “Multi-agent reinforcement learning: A review of
challenges and applications,” Appl. Sci., vol. 11, no. 11, p. 4948, 2021.
[278] Y. Li, “Deep reinforcement learning: An overview,” arXiv preprint
arXiv:1701.07274, 2017.
[279] J. Verbraeken, M. Wolting, J. Katzy, J. Kloppenburg, T. Verbelen,
and J. S. Rellermeyer, “A survey on distributed machine learning,”
ACM Comput. Surv., vol. 53, no. 2, mar 2020. [Online]. Available:
https://doi.org/10.1145/3377454

40

You might also like