Professional Documents
Culture Documents
2018 Blind Radio Tomography
2018 Blind Radio Tomography
Abstract—From the attenuation measurements collected by SLFs are instrumental in a number of problems, including
a network of spatially distributed sensors, radio tomography radio tomographic imaging [3], channel-gain cartography [4],
constructs spatial loss fields (SLFs) that quantify absorption and device-free passive localization [2], [5], [6]. Specifically,
of radiofrequency waves at each location. These SLFs can be
used for interference prediction in (possibly cognitive) wireless the absorption mapped by SLFs allows one to discern objects
communication networks, for environmental monitoring or in- in space, thus enabling radio tomographic imaging. The latter
trusion detection in surveillance applications, for through-the- is of interest in environmental monitoring for surveillance or
wall imaging, for survivor localization after earthquakes or fires, intrusion detection [7]. Relative to existing alternatives using
and so on. The cornerstone of radio tomography is to model camera sensors, radio tomographic imaging features lower
attenuation as the bidimensional integral of the SLF of interest
scaled by a weight function. Unfortunately, existing approaches hardware costs and benefits from the ability of radio frequency
(i) rely on heuristic assumptions to select the weight function; and waves to penetrate physical structures such as trees or build-
(ii) are limited to imaging changes in the propagation medium ings. The latter characteristic also renders radio tomography
or they require a separate calibration step with measurements in appealing for through-the-wall imaging [8], [9], which finds
free space. The first major contribution in this paper addresses (i) multiple military and civilian applications including security
by means of a blind radio tomographic approach that learns the
SLF together with the aforementioned weight function from the and responding to emergency situations. For example, these
attenuation measurements. This challenging problem is tackled techniques may enable the police or emergency services to
by capitalizing on contemporary kernel-based learning tools locate persons in burning buildings, survivors in rescue op-
together with various forms of regularization that leverage prior erations, or kidnappers in hostage situations. Similarly, SLFs
knowledge. The second contribution addresses (ii) by means of a are also useful in channel-gain cartography, where the goal is
novel calibration technique capable of imaging static structures
without separate calibration steps. Numerical tests with real and to predict the channel attenuation for links between arbitrary
synthetic measurements validate the efficacy of the proposed pairs of locations where no sensors are deployed [4]. Channel-
algorithms. gain maps obtained from SLFs solve the classical problem
Index Terms—radio tomography, tomographic imaging, of predicting the interference inflicted to receivers that never
channel-gain cartography, kernel-based learning. transmit, as necessary in cognitive radio and for unlicensed
access to television broadcasting systems [10]–[13], where
the non-collaborative nature of primary users precludes any
I. I NTRODUCTION direct form of channel estimation between secondary trans-
mitters and primary receivers. Further applications of these
Tomographic imaging enjoys extensive popularity and channel-gain maps include network planning or interference
widespread usage in natural sciences, notably in medical imag- management in cellular networks.
ing [1]. The principles underpinning tomographic methods The fundamental principle underlying radio tomography is
have been carried over to construct spatial loss fields (SLFs), that closely located radio links exhibit similar shadowing due
which are maps quantifying the attenuation experienced by to the presence of common obstructions. This correlation is
electromagnetic waves in radio frequency bands at every related to the geometry of the propagation environment by
spatial position [2]. To this end, pairs of collaborating sensors the model in [14], [15], which prescribes that the attenuation
deployed across the area of interest estimate the attenuation due to shadowing is proportional to the line integral of a
introduced by the channel between them. Different from bidimensional SLF. Inspired by this model, [2], [3], [16]
traditional techniques, radio tomography relies on incoherent proposed various techniques for radio tomographic imaging.
measurements, meaning that no phase information is available. Since these techniques avoid calibration issues by estimating
This simplification saves the costs incurred by the accurate the difference between the SLF at consecutive time instants
synchronization necessary to acquire phase differences among instead of the SLF itself, they reveal the location of changes
waveforms received at different sensors. in the propagation medium but are unable to image static
structures. Similarly, [9] builds on the arguments in [17] to
This work was supported by NSF grants 1343248, 1442686, 1508993,
1509040 as well as the FRIPRO TOPPFORSK grant WISECART 250910/F20 replace the SLF with an indicator function of the voxels that
from the Research Council of Norway. Parts of this work have been presented contain objects in motion and therefore also suffers from this
at the IEEE Global Conference on Signal and Information Processing, Greater limitation. In contrast, the scheme in [7] estimates the SLF
Washington, D.C. (USA), Dec. 7-9, 2016. Code for the simulations is available
at the author’s websites. directly and therefore can image static structures, but involves
D. Romero was with the Dept. of ECE and Digital Tech. Center, Univ. of a separate calibration stage where these structures are absent.
Minnesota, USA. He is now with the Dept. of Information and Communication On the other hand, existing radio tomography approaches
Technology, Univ. of Agder, Norway. E-mail: daniel.romero@uia.no. D. Lee
and G. B. Giannakis are with the Dept. of ECE and Digital Tech. Center, approximate the aforementioned line integral by a discretized
Univ. of Minnesota, USA. E-mails: {leex6962,georgios}@umn.edu. version of the bidimensional integral of the SLF scaled by a
'LJLWDO2EMHFW,GHQWL¿HU763
;,(((3HUVRQDOXVHLVSHUPLWWHGEXWUHSXEOLFDWLRQUHGLVWULEXWLRQUHTXLUHV,(((SHUPLVVLRQ
6HH KWWSZZZLHHHRUJSXEOLFDWLRQV VWDQGDUGVSXEOLFDWLRQVULJKWVLQGH[KWPO IRU PRUH LQIRUPDWLRQ
IEEE TRANSACTIONS ON SIGNAL PROCESSING (EARLY ACCESS) 2
function that quantifies the impact of the absorption at each and it ∈ RT its t-th column. 0T is the T × T all-zero
spatial point on the shadowing attenuation of each link. In [2], matrix. Superscript (·) stands for transposition, ⊗ for the
[3], [9], [16], such a weight function is the indicator of an el- Kronecker product, and || · || for the Euclidean norm. The
lipse with foci at the transmitter and receiver positions and can vectorization of an M × N matrix X := [x1 , . . . , xN ] is given
be used to account for attenuation introduced by obstructions by vec(X) := [x
1 , . . . , xN ] .
inside the first Fresnel ellipsoid. Building upon these works,
[7] proposes a more sophisticated weight function that assigns II. BACKGROUND AND P ROBLEM S TATEMENT
a higher weight to obstructions that lie close to the straight path
This section introduces the radio tomographic model and
between transmitter and receiver. However, since the choice
formulates both the non-blind and blind radio tomography
of these weight functions relies on heuristic arguments, one
problems. Although the exposition builds on the framework
expects that the performance of radio tomographic methods
in [7], the proposed approaches readily carry over to the
can be improved upon adopting weight functions that capture
frameworks in [2], [3], [16] and in [9] just by replacing the
the actual propagation phenomena more accurately.
SLF here with the difference of consecutive SLFs or with an
A different body of literature applies radio tomography
indicator function of voxels containing moving objects.
for channel-gain cartography. Remarkably, [18] applies the
radio tomographic model in [15] to motivate a linear time- Consider a bidimensional geographical area indexed by the
evolution model for channel-gain cartography. A more explicit closed and convex set A ⊂ R2 . After averaging out the effects
application of radio tomography is reported in [19], where the of small-scale fading, the power gain between a transmiter
weight function is constant on its elliptic support. located at x ∈ A and a receiver located at x ∈ A is given in
To sum up, all tomography works so far adopt a heuris- dB units by
tically selected weight function. In contrast, the main con- g(x, x ) = gTX (x) + gRX (x )
tribution of this paper comprises three blind estimators that
− γ 0 10 log10 ||x − x ||2 − s(x, x ) (1)
simultaneously learn the weight function and the SLF from
the sensor measurements, therefore suppressing the need for where gTX (x) (resp. gRX (x )) is the combined gain of the
heuristic considerations. The learned weight function reveals power amplifier (low-noise amplifier) and transmit (receive)
insightful information about propagation in the medium of antenna, assumed omnidirectional for simplicity, of the sensor
interest and opens the door to validating models such as at x ∈ A (x ∈ A); γ 0 is the pathloss exponent, and s(x, x )
those in [17] and [7]. As noted in [3], the non-blind radio is the attenuation due to shadow fading. All other constant
tomography problem, where the weight function is given factors have been absorbed into gTX (x) and gRX (x ). The
and only the SLF has to be estimated, is intrinsically ill- radio tomographic model in [7], which generalizes that in [14]
posed. Hence, the blind problem addressed here is “doubly” and [15], prescribes that
challenging, since besides the SLF one has to estimate the
weight function. To cope with these difficulties, the weight s(x, x ) = w(x, x , x̃)f (x̃)dx̃ (2)
function is estimated here through kernel-based learning, a A
framework that is extensively popular due to its simplicity, where f : A → R+ is the SLF and w : A × A × A → R+ is
universality, and because it leads to computationally efficient the weight function. Whereas f (x̃) represents the absorption
algorithms [20], [21]. On the other hand, the SLF is estimated at location x̃ ∈ A, the weight w(x, x , x̃) quantifies the impact
using three alternative regularization criteria. of the absorption at x̃ on the attenuation between x and x .
The second contribution is a novel calibration technique Typically, function w confers a greater weight w(x, x , x̃) to
that simultaneously estimates the antenna gains and path loss those locations x̃ lying closer to the line segment between x
exponent together with the SLF and weight function. Different and x and its selection is described later in this section.
from existing alternatives, the proposed method can image Equation (2) models how the nature and spatial distribution
static structures and does not need a separate calibration stage of obstructions in the propagation medium affect the attenua-
where obstacles are removed from the propagation medium. tion between each pair of locations. Its relevance is twofold:
Therefore, the present work constitutes a significant step first, as mentioned in Sec. I, f represents absorption across
forward in most applications involving tomographic imaging, space and therefore it can be used for imaging; see Sec. V
channel-gain cartography, and device-free localization. for examples. Second, if both w and f are known, the gain
The rest of the paper is organized as follows. Sec. II between any two points x and x can be recovered through
reviews the radio tomography model, states the problem, and (1) and (2), which enables channel-gain cartography.
defines the calibrated and uncalibrated scenarios. The novel The goal of radio tomography is to estimate f . To this
blind algorithm is then derived for the calibrated scenario in end, N sensors at locations {x1 , . . . , xN } ⊂ A collabora-
Sec. III, and extended to the uncalibrated scenario in Sec. IV. tively obtain channel-gain measurements. Specifically, at time
Numerical tests with synthetic as well as real measurements t = 1, . . . , T , sensors n(t) and n (t) measure g(xn(t) , xn (t) )
are provided in Sec. V. Finally, Sec. VI summarizes the main e.g. through pilot sequences, where n(t), n (t) ∈ {1, . . . ,
conclusions. N } ∀t. These measurements can be expressed as ǧ t =
Notation: Sets are represented with calligraphic letters, g(xn(t) , xn (t) ) + t , t = 1, . . . , T , where t stands for
whereas bold uppercase (lowercase) letters denote matrices measurement error. It is instructive to consider first that
(column vectors). IT represents the T × T identity matrix {gTX (xn )}N N
n=1 , {gRX (xn )}n=1 , and γ 0 are known. In such a
IEEE TRANSACTIONS ON SIGNAL PROCESSING (EARLY ACCESS) 3
calibrated scenario, it follows from (1) that {ǧ t }Tt=1 contain where β > 0 is selected by the user and ζ β (x, x , x̃) :=
the same information as {št }Tt=1 , where max1/2 β 2 , (x − x̃2 + x̃ − x 2 )2 − x − x 22 . Sim-
ilar to (6), if x̃ is out of the first Fresnel zone, then (7)
št :=gTX (xn(t) ) + gRX (xn (t) ) (3)
prescribes a zero weight. Otherwise, if x̃ lies inside a smaller
− γ 0 10 log10 ||xn(t) − xn (t) ||2 − ǧ t ellipse with foci at x and x and minor axis length β, then
=s(xn(t) , xn (t) ) − t . w(x, x , x̃) equals the reciprocal of the area of that ellipse.
Finally, if x̃ lies outside of the smaller ellipse but inside the
Thus, the fusion center may use {št }Tt=1 rather than {ǧ t }Tt=1 .
first Fresnel zone, then w(x, x , x̃) equals the reciprocal of the
Sec. IV will deal with the uncalibrated scenario, where
area of the smallest ellipse containing x̃ and having x and x
{gTX (xn )}N N
n=1 , {gRX (xn )}n=1 or γ 0 are unknown.
as foci. Although w in (7) is intuitively more accurate than
So far, works on radio tomography have focused on the
its predecessors, the rationale behind its selection is heuristic
non-blind problem, where one estimates f given w as well as
and may not accurately capture real propagation phenomena.
the measurements and radio locations {(xn(t) , xn (t) , št )}Tt=1 .
This idea is reinforced by noting that (7), as well as (6), is
The rest of this section describes the selection of w for this
discontinuous on the boundary of the first Fresnel zone.
problem and formulates its blind counterpart.
To bypass this need for heuristically selecting w, the goal of
The radio tomographic model originally proposed in [14, eq.
this paper is to learn w from the data {(xn(t) , xn (t) , št )}Tt=1 .
(4)] and [15, eq. (9)] is expressed in terms of a line integral as
However, since f is generally unknown, the blind radio
x
1 tomography problem involves learning w and f given just the
s(x, x ) = f (x̃)dx̃ (4)
x − x 2 x measurements {(xn(t) , xn (t) , št )}Tt=1 .
and can be viewed as a special case of (2) upon setting
x III. B LIND RADIO TOMOGRAPHY ESTIMATORS
1
w(x, x , x̃) = δ(||x̃ − x̌||2 )dx̌ (5) As explained in Sec. II, existing radio tomography schemes
x − x 2 x estimate f from the measurements {(xn(t) , xn (t) , št )}Tt=1
where δ stands for the Dirac delta. after setting w based on heuristic arguments. In contrast, the
Informally, w in (5) is a
function assigning a weight 1/ x − x 2 to the point x̃ if present section proposes three estimators that obtain both f
it lies on the line segment between x and x , and zero other- and w from {(xn(t) , xn (t) , št )}Tt=1 . To this end, Secs. III-A,
wise. Therefore, this model only accounts for the attenuation III-B, and III-C formulate the estimation problem as a generic
introduced by obstacles obstructing the line of sight. optimization program that can accommodate different forms
However, objects that do not obstruct the line of sight can of prior information through regularization. Subsequently,
still introduce attenuation if they lie close to it. The approach Sec. III-D proposes three solvers for different regularizers and
in [3] and [16] captures this effect by assigning the weight Sec. III-E introduces a computationally efficient approxima-
1/ x − x 2 to all locations x̃ lying within an ellipse with tion for large measurement records.
foci at x and x through the function
⎧ A. Blind radio tomography as a function estimation problem
⎪
⎨0 if x − x̃2 + x̃ − x 2
In the radio tomography literature, the integral in (2) is
w(x, x , x̃) := > x − x 2 + λ/2, (6)
⎩
⎪
approximated as
1/ x − x 2 otherwise
L
where λ > 0 is selected by the user. The weight function in s(x, x ) c w(x, x , x̃l )f (x̃l ) (8)
(6) is referred to as normalized ellipse function in [7], where λ l=1
is set to the carrier wavelength so that w(x, x , x̃) for fixed x
and x becomes the indicator of the first Fresnel zone, defined where {x̃l }Ll=1 is a grid of points in A and c is a constant that
by {x̃ ∈ R2 : x − x̃2 + x̃ − x 2 ≤ x − x 2 + λ/2}. can be set to unity without loss of generality by absorbing
Despite capturing propagation effects more accurately than any scaling factor in f . It can be recognized from (8) that the
(5), function (6) still assigns the same weight across all x̃ shadowing value s(x, x ) depends on f only through its values
within the first Fresnel zone, regardless of the distance from x̃ at the grid points. Thus, it suffices to estimate the L entries of
to the direct path. Since the attenuation introduced by an obsta- the real-valued vector f := [f (x̃1 ), . . . , f (x̃L )] rather than
cle is expected to be a decreasing function of this distance, [7] the function f (x) for all x. On the other hand, finding w is
proposes the following inverse area elliptical function1 more challenging since s(x, x ) may be evaluated at arbitrary
real-valued coordinate pairs (x, x ) and, therefore, w(x, x , x̃)
w(x, x , x̃) := (7) needs to be known for all x, x ∈ A. Therefore, estimating w
⎧
⎪
⎨0 if x − x̃2 + x̃ − x 2 > x − x 2 + λ/2, does not boil down to a vector estimation problem; instead, w
4 must be estimated as a function on A3 ⊂ R6 .
⎪ otherwise
⎩ πζ (x, x , x̃) x − x 2 + ζ 2 (x, x , x̃) Intuitively, the number of measurements required to estimate
β 2 β w with a target accuracy depends on the size of its domain, in
1 The disagreement between (7) and [7, eq. (19)] owes to typographical
this case A3 . Conversely, for a given number of measurements,
errors in [7] and to the fact that the problem [7, eq. (20)] is solved here in one expects that the larger the domain of w is, the lower the
closed form. quality of its estimate will be. For this reason, the rest of this
IEEE TRANSACTIONS ON SIGNAL PROCESSING (EARLY ACCESS) 4
section presents two techniques to reduce the aforementioned the larger B, the larger the number of measurements required
problem of estimating a function on A3 into the problem of to maintain the estimation performance. Therefore, the size
estimating a function on a smaller domain by exploiting the of this region must be increased only if a sufficiently large
known structure of w. It is worth noting that, despite enhancing number of measurements is given.
estimation performance, these techniques are not necessary for The two techniques introduced in this section are applied
the methods in this paper, which can handle in principle any next to simplify (8). To this end, apply the re-parameterization
weight function defined on A3 . prescribed by the first technique to obtain the shadowing
The first technique relies on the assumption that w depends attenuation of the t-th measurement from (8), which yields
on {x, x , x̃} only through φ1 (x, x ) := ||x − x ||2 and L
φ2 (x, x , x̃) := ||x − x̃||2 + ||x̃ − x ||2 , which respectively de- s(xn(t) , xn (t) ) w(φ(xn(t) , xn (t) , x̃l ))f (x̃l ) (11)
note the length of the line of sight from x to x and path going l=1
through the intermediate point x̃. In other words, such a weight
function can be expressed as2 w(x, x , x̃) = w(φ(x, x , x̃)), after absorbing c in f . The second technique, which
where φ(x, x , x̃) := [φ1 (x, x ), φ2 (x, x , x̃)] . Besides being confines the support of w to B, allows a reduction
intuitively reasonable, this assumption is satisfied by all weight in the number of summands in (11) by disregarding
functions in the literature; cf. Sec. II. For example, the weight those with φ(xn(t) , xn (t) , x̃l ) ∈/ B since they result in
function in (6) satisfies w(x, x , x̃) = w(φ(x, x , x̃)) for w φ(xn(t) , xn (t) , x̃l ) = 0. For the t-th measurement define
with φ := [φ1 , φ2 ] . Similarly, the weight function in (7) can as the set comprising the indices of the Lt grid points x̃l
be expressed as w(x, x , x̃) = w(φ(x, x , x̃)) for for which φ(xn(t) , xn (t) , x̃l ) is in B. With this notation, (11)
⎧ becomes
⎪
⎪ 0, if φ 2 > φ1 + λ/2
⎨ Lt
2 2 s(xn(t) , xn (t) ) w(φt,l )f (x̃it,l ) (13)
w(φ) := min Ω(φ1 , φ2 ), Ω φ1 , φ1 + β (10)
⎪
⎪
⎩ l=1
otherwise
where φt,l := φ(xn(t) , xn (t) , x̃it,l ), l = 1, . . . , Lt .
where Ω(φ1 , φ2 ) := 4/(πφ2 φ22 − φ21 ). Thus, it is reason- In short, expression (13), which was obtained through the
able to seek an estimate of w among the class of functions re-parameterization and support confinement techniques in
satisfying this assumption, thereby reducing the problem of this section, will prove decisive to lower the computational
estimating a function of 6 variables to that of estimating a complexity and improve the estimation performance of the
function of only 2. More formally, the sought w(φ) will be estimators proposed in the rest of the paper.
defined for φ ∈ B0 := {φ ∈ R2 : 0 ≤ φ1 ≤ D, φ1 ≤ φ2 ≤
2D} ⊂ R2 , where D := supx,x ∈A ||x − x ||2 is the diameter
B. Function estimation via kernel-based learning
of A and the condition φ1 ≤ φ2 follows from the triangle
inequality. Since B 0 ⊂ R2 , this re-parameterization of w is Before formulating the blind radio tomography problem,
expected to significantly reduce the number of measurements this section reviews kernel-based learning, which is one of the
needed to attain a target estimation accuracy. most prominent frameworks for non-linear function estimation
The second technique to reduce the size of the domain of due to its simplicity, good performance, low computational
w relies on the fact that this function is expected to take complexity, and universality, in the sense that any continuous
significant values only on a small region of the space, the function vanishing at infinity can be learned with arbitrary
rest being close to zero. For example, one may assume along accuracy under general conditions; see e.g. [21].
the lines of (9) and (10) that w(x, x , x̃) only takes non-zero Kernel-based methods seek function estimates within large
values within a certain ellipsoid with foci at x and x , such as classes of functions termed reproducing kernel Hilbert spaces
the Fresnel ellipsoid. In terms of the re-parameterization in the (RKHSs) and defined as
previous paragraph, such an ellipsoid can be expressed as the ∞
set of x̃ such that φ2 (x, x , x̃) ≤ φ1 (x, x ) + λ/2, implying H := w(φ) = αi κ(φ, φi ) : αi ∈ R; φ, φi ∈ B ∀i .
that one may confine the support of w to the reduced set i=1
B := {φ ∈ R2 : 0 ≤ φ1 ≤ D, φ1 ≤ φ2 ≤ 2D, φ2 ≤ In this expression, κ : B × B → R denotes a reproducing ker-
φ1 + λ/2} ⊂ B 0 . Besides this ellipsoid, the user can select nel, which is a function satisfying two properties [20]: (i) it is
further regions B or ellipsoids. Although w is expected to symmetric, meaning that κ(φ1 , φ2 ) = κ(φ2 , φ1 ), ∀φ1 , φ2 ∈
take small values off the Fresnel ellipsoid, these values may B; and (ii) it is positive definite, meaning that:
be estimated by selecting a larger domain set, even B = B 0 .
However, this operation comes with a caveat: as expected, I I
αi αi κ(φi , φi ) ≥ 0,
2 Althoughthe symbol w is used to represent both functions w(x, x , x̃) i=1 i =1
and w(φ), there is no ambiguity since the former function takes 6 scalar
arguments whereas the latter takes 2. ∀I > 0, {αi }Ii=1 ⊂ R, {φi }Ii=1 ⊂ B.
IEEE TRANSACTIONS ON SIGNAL PROCESSING (EARLY ACCESS) 5
alternating minimization approach where, at step [S1], one Input: {(xn(t) , xn (t) , št )}T
t=1 , μf , μw , κ.
minimizes (P2) with respect to α for fixed f and, at step 1: For every t = 1, . . . , T , obtain
[S2], one minimizes (P2) with respect to f for fixed α. More Lt in (12).
φt,l := φ(xn(t) , xn (t) , x̃it,l ), l = 1, . . . , Lt .
specifically, in [S1], one obtains the (k +1)-st iterate α[k +1], 2: Form K as described in Sec. III-C.
k = 0, 1, . . . , as
3: Initialize f [0].
1 4: For k = 0, 1, . . . until convergence
α[k + 1] = arg min š − A[k]α2 + μw α Kα (20)
α T [S1] α update
Compute A[k] = (IT ⊗ f [k])Ψ K.
where A[k] := (IT ⊗ f [k])Ψ K ∈ RT ×L̃ depends on the Obtain α[k + 1] via (21).
[S2] f update
k-th iterate f [k]. From the first-order optimality conditions of
Compute B[k + 1] := T
t=1 (it ⊗ α [k + 1]Kt Ψt ).
(20), its minimizer can be found in closed form as
Obtain f [k + 1] via subroutines in Tables II, III, or IV.
Lt
−1 5: Set ŵ(φ) = T l=1 αt,l [k + 1]κ(φ, φt,l ).
α[k + 1] = A [k]A[k] + μw T K A [k]š. (21) t=1
Return function ŵ(x, x , x̃) := ŵ(φ(x, x , x̃)) and vector f [k].
Input: B[k + 1], š, μf , f [k]. Input: f [k], γ x [k], γ y [k], dx [k], dy [k], B[k + 1], š, μf , ν, Lx .
1: For l = 1, 2, . . . , L [S2–1] Update dual variables:
s̃l [k] := š − B −l [k]f−l [k]. 1: γ x [k + 1] = γ x [k] + ν(Dx f [k] − dx [k])
This regularizer adds the absolute differences of function The complexity of the algorithm in Table I is dominated by
values at grid points located consecutively along a row or a the inversion of the L̃×L̃ matrix in (21), which requires O(L̃3 )
column. Thus, this regularizer promotes sparsity in the local operations. Applications demanding high-resolution estimates
differences and therefore promotes constant regions in the of f or w, and hence requiring large TT , may therefore incur
estimate of F . For ρ(f ) = TV(F), expression (23) becomes prohibitive complexity since L̃ := t=1 Lt increases with T .
To bypass such a bottleneck, this section presents a technique
1 to approximate the solution to (P1) for large T at affordable
f [k + 1] = arg min š − B[k + 1]f 22 + μf TV(F).
f T computational complexity.
(27) Observe that the size of the aforementioned L̃ × L̃ matrix is
determined by the number of terms in the sum of (16), which,
Efficiently solving (27) is challenging since the TV regular- as dictated by the representer theorem, equals the number of
izer is not differentiable. TV problems are generally solved Lt ,T
different vectors {φt,l }l=1,t=1 where w is evaluated in the
either through algorithms based on the iterative shrinkage- objective of (P1). Thus, the size of such a matrix would be
thresholding algorithm (ISTA) [29], or through the alternating reduced if w in (P1) were only evaluated at a reduced set of
direction method of multipliers (ADMM) [30], [31]. Unfortu- vectors {φ̄l }L̄
l=1 , where L̄ L̃. To this end, approximate
nately, ISTA-based algorithms for TV problems are typically w(φt,l ) ≈ w(φ̄r(t,l) ) in (P1), where r(t, l) is such that φ̄r(t,l)
complicated by proximal operations requiring inner loops with
is the best approximation of φt,l in {φ̄l }L̄l=1 , to obtain
additional iterative methods such as the gradient projection
(GP) algorithm. To circumvent this challenge, this works T Lt 2
1
pursues an ADMM approach. min št − w(φ̄r(t,l) )f (x̃it,l )
To simplify notation, express (26) in terms of f as w∈H,f ∈RL T t=1 l=1
TV(f ) := Dx f 1 + Dy Pf 1 , where P is a permutation (P1’) +μw w2H + μf ρ(f ).
matrix such that Pf = vec(F ), whereas Dx := ILy ⊗
ΔLx ∈ RLy (Lx −1)×L and Dy := ILx ⊗ ΔLy ∈ RLx (Ly −1)×L In this case, applying the representer theorem shows that the
IEEE TRANSACTIONS ON SIGNAL PROCESSING (EARLY ACCESS) 8
1 1
The second set is obtained after assembling the structure, and 0.9 0.9
5 5
is used to estimate f for a postulated w. Unfortunately, in 0.8 0.8
0.6 0.6
building, to calibrate the system. To circumvent the limita- 15 15
0.5 0.5
tions of these approaches, this section proposes estimators 0.4 0.4
w, from the uncalibrated data {(xn(t) , xn (t) , ǧ t )}Tt=1 , where 25 0.2 25 0.2
30 30
G = {gTX (x1 ), . . . , gTX (xN ), gRX (x1 ), . . . , gRX (xN )} and the
0.9
5 5
0.4
0.8
proposed algorithm will estimate these 2N gains. However, 10
0.35
10 0.7
if for example all sensors are of the same manufacturer and 0.3
0.6
15 0.25 15
model, one may assume that gTX (x1 ) = . . . = gTX (xN ) = γ 1 0.5
0.2
and gRX (x1 ) = . . . = gRX (xN ) = γ 2 . In this case, G contains 20
0.15
20 0.4
just two elements, which implies that only two gains have to 25 0.1
25
0.3
30 0.1
higher for a given T . Adding t to both sides of (1) yields 5 10 15 20 25 30
0 30
5 10 15 20 25 30
(c) (d)
ǧ = Ωγ 1 − φ1 γ 0 − š (30)
Fig. 1: (a) True SLF F; sensor locations are marked with red
where φ1 := 10 log10 ([φ1 (xn(1) , xn (1) ), . . . , crosses. (b-d) Estimated F̂ using the algorithm in Table V
φ1 (xn(T ) , xn (T ) )] ), ǧ := [ǧ 1 , . . . , ǧ T ] , γ 1 := (σ12 = 5 × 10−2 , σ22 = 7 × 10−2 , L̄ = 2, 500, T = 3, 000)
[γ 1 , . . . , γ C ] , and Ω ∈ R T ×C
is a matrix whose t-th with (b) Tikhonov regularization (Q = I900 , μf = 10−4 ,
row has ones at columns c1 and c2 , where c1 and c2 are μw = 2 × 10−2 ); (c) 1 -norm regularization (μf = 2 × 10−4 ,
such that gTX (xn(t) ) = γ c1 and gRX (xn (t) ) = γ c2 . It follows μw = 9 × 10−5 ); (d) TV regularization (μf = 4 × 10−5 ,
from (30) that š = Ωγ 1 − φ1 γ 0 − ǧ = Ω̃γ − ǧ, where μw = 3 × 10−4 , ρ = 1 × 10−3 ).
Ω̃ := [−φ1 , Ω] and γ = [γ 0 , γ 1 , . . . , γ C ] . Then, (P2’) can
be reformulated as
V. N UMERICAL TESTS
1
(P2”) min ǧ − Ω̃γ + (IT ⊗ f )Ψ RK̄ᾱ22 This section demonstrates the benefits of the proposed
γ,ᾱ,f T
algorithms through numerical tests with both synthetic and
+ μw ᾱ K̄ᾱ + μf ρ(f ).
real measurements.
Minimizing (P2”) with respect to γ, one obtains
A. Tests with synthetic measurements
1
min P ⊥
Ω̃
(ǧ + (IT ⊗ f )Ψ RK̄ᾱ)22
ᾱ,f T This section illustrates the effectiveness of the proposed
+ μw ᾱ K̄ᾱ + μf ρ(f ) (31) estimators in exploiting prior information through the three
regularizers in Sec. III-D. The test setup comprises a square
area A = [0.5, 30.5]2 over which the grid {x̃l }900 l=1 :=
where P ⊥ Ω̃
:= IT − Ω̃(Ω̃ Ω̃)−1 Ω̃ . Comparing (P2’) with {1, . . . , 30}2 of L = 900 points is defined. All experiments
(31), it follows that the algorithm in Table V can be em- adopt the weight function in (10) with λ = 0.39 and β = 1.1,
ployed to solve (31) if one replaces š with −P ⊥ Ω̃
ǧ, A[k] whereas a certain f will be specified per experiment. A total
with ĀΩ̃ [k] := P ⊥ (IT ⊗ f
[k])Ψ
R K̄, and B[k] with of N = 80 sensors were deployed uniformly at random
⊥
T Ω̃
B̄ Ω̃ [k] := P Ω̃ t=1 (it ⊗ ᾱ [k]K̄t Rt Ψt ). outside of objects over A at positions {xn }N n=1 . The t-th
From (P2’) and (31), it follows that the price to be paid measurement is obtained by the n(t)-th and n (t)-th sensors,
for working with uncalibrated data is that the information where n(t) and n (t) are drawn per t uniformly at random
along the column span of Ω̃ is neglected. The dimension of without replacement from {1, . . . , N }. To focus on the impact
this column span therefore embodies the uncertainty in the of regularization, the effects of calibration are not accounted
calibration. As expected, the larger this dimension, the larger for in this section, where {gTX (xn )}N N
n=1 , {gRX (xn )}n=1 , and
the T required to attain a target estimation performance. Since γ 0 are assumed known, implying that the fusion center uses the
this dimension increases with C, it is important to keep the “shadowing” measurements {št }Tt=1 . The latter are generated
cardinality of G as low as possible. To this end, one can assign as št := s(xn(t) , xn (t) ) − t , where s(xn(t) , xn (t) ) is
multiple sensors to the same γ c0 ∈ G. Although some error is obtained through (11) and {t }Tt=1 are independent zero-mean
incurred if the gains of these sensors are not exactly γ c0 , the Gaussian random variables with variance σ2 = 10−2 unless
overall effect of reducing C may pay off. otherwise stated.
IEEE TRANSACTIONS ON SIGNAL PROCESSING (EARLY ACCESS) 10
1 1
0.7 0.9
0.9 0.9
5 5 5 5
0.8
0.8 0.6 0.8
0.7
10 0.7 10 10 0.7 10
0.5
0.6
0.6 0.6
15 0.4 15 0.5
15 0.5 15 0.5
0.4
0.4 0.3 0.4
20 20 20 20
0.3
0.3 0.3
0.2
0.2 0.2
25 0.2 25 25 25
0.1
0.1 0.1 0.1
30 30 0 30 30 0
0 5 10 15 20 25 30 0 5 10 15 20 25 30
5 10 15 20 25 30 5 10 15 20 25 30
Algorithm in Table V was tested with a Gaussian kernel regions and sharp edges on Fig. 3a. The estimates for the three
κ(φ, φ ) = exp −(1/2)(φ − φ ) diag−1 {σ12 , σ22 }(φ − φ ) regularizers are depicted in Figs. 3b, 3c, and 3d. As expected,
and for the three regularizers in Sec. III-D. To cope with the the TV-regularizer leads to the best estimate since it promotes
large number of variables, the support confinement technique the kind of structure present in f .
in Sec. III-A is applied with B equal to the first Fresnel To illustrate the influence of the number of measurements,
zone. To simplify computations, the technique in Sec. III-E is the estimated SLF in the setup of Fig. 3d is depicted in Fig. 4
applied by drawing the vectors {φ̄l }L̄ l=1 uniformly at random for different values of T . As expected, the quality improves for
Lt ,T
and without replacement from {φt,l }l=1,t=1 . larger T . A visually satisfying estimate is already obtained for
The rest of the section presents three simulations illustrating T = 1, 800, which corresponds to T /N = 22.5 measurements
the importance of appropriately capturing prior information per sensor on average. However, the needed T will be dictated
through the regularizers. Comparisons between the proposed by the specific application and how quality is quantified there.
estimator and non-blind alternatives will be provided later. To corroborate the ability of the proposed algorithms for
The
3 first experiment adopts the smooth function f (x) = learning w(φ1 , φ2 ), Fig. 5 compares the true w(φ1 , φ2 ) with
cf j=1 exp(−x − x̄j 22 /σf,j 2
), where x̄1 = [15, 13] , x̄2 = ŵ(φ1 , φ2 ) obtained through the TV-regularized estimator in
[7, 20] , x̄3 = [20, 20] , σf,1
2
= 13, σf,22 2
= σf,3 = 5, and the same setting as Fig. 3. Note that such functions are only
cf is a constant ensuring that maxl f (x̃l ) = 1. Fig. 1a defined for φ2 ≥ φ1 since the triangle inequality imposes
depicts the values over the bidimensional grid of this function, that φ2 (x, x , x̃) ≥ φ1 (x, x ) for all (x, x , x̃). The fit is
whereas Figs. 1b, 1c, and 1d depict its estimates for the three satisfactory except in the vicinity of the points where w
regularizers. As expected, the SLF estimated through Tikhonov is discontinuous or non-smooth. These singularities are a
regularization is the most satisfactory in this case since it consequence of the simplifications adopted in [7] to postulate
promotes smooth estimates, a property present in f . (10). However, it is reasonable to expect that a function w
In the second experiment, f was set to the sparse function accurately capturing the actual physics must be continuous and
on Fig. 2a. The estimated SLFs for the three regularizers are smooth. Hence, these discontinuities are just model artifacts,
displayed in Figs. 2b, 2c, and 2d. In this case, the Tikhonov and the fitting error in those regions is thus not a limitation.
regularizer does not yield a good estimate since f is not The next experiment investigates the robustness of the
smooth. In contrast, the 1 -norm regularizer leads to the most proposed algorithms against measurement noise t . The nor-
accurate estimate since it exploits the sparsity of f . malized error f̂ − f 2 /f 2 averaged over sensor locations
In the third experiment, f was set to the function with flat and realizations of {t }Tt=1 will be used to quantify estimation
IEEE TRANSACTIONS ON SIGNAL PROCESSING (EARLY ACCESS) 11
1 1
5 0.9 5
0.9
0.7
0.8
0.8 1
E ˆ − 2 / 2
10 10 0.7
0.6 0.6
0.7
15 15 0.5
0.6 0.4
20 20
0.3
0.5
0.5
25 25 0.2
0.4
0.1
0.4
30 30 0
5 10 15 20 25 30 5 10 15 20 25 30
(a) (b) 0.3
1
0.9
0.9
5 5
0.8
0.8
0.2 −4
0.7 10
0.7 10 10−3 10−2 10−1
(σ)
10
0.6
0.6
15 15 0.5
0.5
30 0
30 0
regularization (setting of Fig. 2c); (diamond) TV regularization
5 10 15 20 25 30 5 10 15 20 25 30
(setting of Fig. 3d).
(c) (d)
0.04
0.03
0.02
0.02
0.01
500 1000 1500 2000 2500 3000
0
L̄
19 19.1 19.2 19.3 19.4 19.5 19.6 19.7 19.8 19.9 20
attenuation ŝ(x, x ) is obtained through (8) by replacing w This suggests adopting a performance metric quantifying error
and f with their estimates. For comparison purposes, s(x, x ) between s(x, x ) and ŝ(x, x ), such as the following normal-
IEEE TRANSACTIONS ON SIGNAL PROCESSING (EARLY ACCESS) 12
20 20
0.85 0.85
15 15
0.8 0.8
[ft]
[ft]
10 0.75 10 0.75
0.7 0.7
5 5
0.65 0.65
0.6 0.6
5 10 15 20 5 10 15 20
[ft] [ft]
(a) (b)
20 20
0.65
0.85
0.6
15 15
0.8
0.55
[ft]
[ft]
10 10 0.75
0.5
Fig. 8: Testbed configuration. 0.7
0.45
5 5
0.65
0.4
(Tikhonov) φ1 = 13.5
(Tikhonov) φ1 = 14 calibration stage into (1). The performance metric is
(Tikhonov) φ1 = 14.5
1 (Tikhonov) φ1 = 15 E{ t∈E (ǧ t − ĝ(xn(t) , xn (t) ))2 }
(1 -norm) φ1 = 13.5 NMSE :=
(1 -norm) φ1 = 14 E{ t∈E ǧ 2t }
0.8 (1 -norm) φ1 = 14.5
(1 -norm) φ1 = 15 where the expectation is taken over realizations of E.
(TV) φ1 = 13.5
ŵ(φ1 , φ2 )
20 20 20
0.8
0.85
0.85
0.75
15 15 15
0.8
0.8 0.7
[ft]
[ft]
[ft]
10 10 0.75 10
0.65
0.75
0.7
0.6
5 5 5
0.7 0.65
0.55
0.6
5 10 15 20 5 10 15 20 5 10 15 20
[ft] [ft] [ft]
Fig. 11: Estimated F̂ via the algorithm in Sec. IV (L̄ = 2, 000) with (a) Tikhonov regularization (σ12 = 8 × 10−2 , σ22 =
3.5 × 10−1 , μf = 3.2 × 10−1 , μw = 5 × 10−3 ); (b) 1 -norm regularization (σ12 = 8 × 10−2 , σ22 = 3.5 × 10−1 , μf = 3 × 10−3 ,
μw = 1.3 × 10−1 ); and (c) TV regularization (σ12 = 0.1, σ22 = 3.8 × 10−1 , μf = 9.5 × 10−3 , μw = 3.3 × 10−3 , ρ = 1 × 10−3 ).
[14] N. Patwari and P. Agrawal, “NeSh: A joint shadowing model for links
in a multi-hop network,” in Proc. IEEE Int. Conf. Acoust., Speech, Sig.
Process., Las Vegas, NV, Mar. 2008, pp. 2873–2876.
[15] P. Agrawal and N. Patwari, “Correlated link shadow fading in multi-hop
wireless networks,” IEEE Trans. Wireless Commun., vol. 8, no. 9, pp.
4024–4036, Aug. 2009.
[16] J. Wilson and N. Patwari, “Radio tomographic imaging with wireless
networks,” IEEE Trans. Mobile Comput., vol. 9, no. 5, pp. 621–632, D. Romero (M’16) received his M.Sc. and Ph.D.
2010. degrees in Signal Theory and Communications from
[17] N. Patwari and J. Wilson, “Spatial models for human motion-induced the University of Vigo, Spain, in 2011 and 2015,
signal strength variance on static links,” IEEE Trans. Inf. Forensics respectively. From Jul. 2015 to Nov. 2016, he was a
Security, vol. 6, no. 3, pp. 791–802, Sep. 2011. post-doctoral researcher with the Digital Technology
[18] E. Dall’Anese, S.-J. Kim, and G. B. Giannakis, “Channel gain map Center and Department of Electrical and Computer
tracking via distributed kriging,” IEEE Trans. Veh. Technol., vol. 60, Engineering, University of Minnesota, USA. In Dec.
no. 3, pp. 1205–1211, 2011. 2016, he joined the Department of Information and
[19] D. Lee, S.-J. Kim, and G. B. Giannakis, “Channel gain cartography for Communication Technology, University of Agder,
cognitive radios leveraging low rank and sparsity,” IEEE Trans. Wireless Norway, as an associate professor. His main research
Commun., vol. 16, no. 11, pp. 5953–5966, Nov. 2017. interests lie in the areas of machine learning, opti-
[20] B. Schölkopf and A. J. Smola, Learning with Kernels: Support Vector mization, signal processing, and communications.
Machines, Regularization, Optimization, and Beyond, MIT Press, 2002.
[21] C. Carmeli, E. De Vito, A. Toigo, and V. Umanita, “Vector valued
reproducing kernel Hilbert spaces and universality,” Analysis Appl., vol.
8, no. 1, pp. 19–61, 2010.
[22] C. M. Bishop, Pattern Recognition and Machine Learning, Information
Science and Statistics. Springer, 2006.
IEEE TRANSACTIONS ON SIGNAL PROCESSING (EARLY ACCESS) 15
Donghoon Lee (S’12) received his B.Eng. and Georgios B. Giannakis (Fellow’97) received his
M.Eng. degrees in electrical engineering from Ko- Diploma in Electrical Engr. from the Ntl. Tech. Univ.
rea University in Seoul, Korea in 2010 and 2013, of Athens, Greece, 1981. From 1982 to 1986 he
respectively. Currently, he is working towards the was with the Univ. of Southern California (USC),
Ph.D. degree with the Department of Electrical and where he received his MSc. in Electrical Engineer-
Computer Engineering (ECE) at the University of ing, 1983, MSc. in Mathematics, 1986, and Ph.D. in
Minnesota (UMN), Twin Cities. From 2010 to 2011, Electrical Engr., 1986. He was with the University of
he worked for the Korea Institute of Science and Virginia from 1987 to 1998, and since 1999 he has
Technology (KIST) in Seoul, Korea. been a professor with the Univ. of Minnesota, where
His current research interests include statistical he holds an Endowed Chair in Wireless Telecom-
signal processing and machine learning, with appli- munications, a University of Minnesota McKnight
cations to wireless communications and networking. Presidential Chair in ECE, and serves as director of the Digital Technology
Center.
His general interests span the areas of communications, networking and
statistical learning - subjects on which he has published more than 400 journal
papers, 700 conference papers, 25 book chapters, two edited books and two
research monographs (h-index 128). Current research focuses on learning from
Big Data, wireless cognitive radios, and network science with applications to
social, brain, and power networks with renewables. He is the (co-) inventor of
32 patents issued, and the (co-) recipient of 9 best journal paper awards from
the IEEE Signal Processing (SP) and Communications Societies, including
the G. Marconi Prize Paper Award in Wireless Communications. He also
received Technical Achievement Awards from the SP Society (2000), from
EURASIP (2005), a Young Faculty Teaching Award, the G. W. Taylor Award
for Distinguished Research from the University of Minnesota, and the IEEE
Fourier Technical Field Award (inaugural recipient in 2015). He is a Fellow
of EURASIP, and has served the IEEE in a number of posts, including that
of a Distinguished Lecturer for the IEEE-SP Society.