Professional Documents
Culture Documents
Wu Tip2023
Wu Tip2023
Wu Tip2023
21 tion. To solve the proposed LRTCFPan model, we develop an 3), acquire a low spatial-resolution multispectral (LR-MS) 39
22 alternating direction method of multipliers (ADMM)-based al- image while capturing higher spatial information into a gray- 40
23 gorithm. Comprehensive experiments at reduced-resolution (i.e.,
24 simulated) and full-resolution (i.e., real) data exhibit that the
scaled panchromatic (PAN) image through another sensor. 41
25 LRTCFPan method significantly outperforms other state-of-the- Pansharpening refers to the spatial-spectral fusion of the LR- 42
26 art pansharpening methods. The code is publicly available at: MS image and the corresponding PAN image, aiming to yield 43
28 Index Terms—Low-rank tensor completion (LRTC), Dynamic LRTCFPan model, the whole procedure is depicted in Fig. 1. 45
29 detail mapping (DDM), Tubal rank, Alternating direction me- Different methodologies have recently been developed to 46
30 thod of multipliers (ADMM), Pansharpening, Super-resolution.
address the pansharpening problem. The most classical cate- 47
31
gory is the component substitution (CS)-based methods. Some 48
34
35
H igh-resolution multispectral (HR-MS) remote sensing
images play a crucial role in many practical applications,
e.g., change detection [1], target recognition [2], and classifica-
[6] method, the Gram-Schmidt adaptive (GSA) [7] method,
the band-dependent spatial-detail (BDSD) [8] method, and the
51
52
This research is supported by NSFC (Grant Nos. 12171072, 12271083), stituted with the PAN image. Generally, the CS-based methods 56
Key Projects of Applied Basic Research in Sichuan Province (Grant are appealing for their reduced computational burden, but they 57
No. 2020YJ0216), Natural Science Foundation of Sichuan Province (Grant
No. 2022NSFSC0501), and National Key Research and Development Program inevitably cause severe spectral distortion [10]. Another widely 58
of China (Grant No. 2020YFA0714001). (Corresponding authors: Ting-Zhu used category is the multi-resolution analysis (MRA)-based 59
Huang; Liang-Jian Deng.) methods. These methods inject the spatial details extracted 60
Z. C. Wu, T. Z. Huang, L. J. Deng and J. Huang are with
the School of Mathematical Sciences, University of Electronic Science from the PAN image via multi-scale decomposition into the 61
and Technology of China, Chengdu, Sichuan, 611731, China (e-mails: upsampled LR-MS image. The instances of this class are the 62
wuzhch97@163.com; tingzhuhuang@126.com; liangjian.deng@uestc.edu.cn; “à-trous” wavelet transform (ATWT) [11] method, the additive 63
huangjie uestc@uestc.edu.cn).
J. Chanussot is with Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, wavelet luminance proportional (AWLP) [12] method, and 64
LJK, 38000 Grenoble, France, also with the Aerospace Information Research the smoothing filter-based intensity modulation (SFIM) [13] 65
Institute, Chinese Academy of Sciences, Beijing 100045, China (e-mail: method. Compared with CS methods, the MRA methods are 66
jocelyn.chanussot@grenoble-inp.fr).
G. Vivone is with the Institute of Methodologies for Environmental Analy- characterized by higher spectral coherence while reducing spa- 67
sis, CNR-IMAA, 85050 Tito Scalo, Italy (e-mail: gemine.vivone@imaa.cnr.it). tial preservation. Overall, both the CS and MRA methods have 68
2
80 lot of computational resources and training data [25], which better characterize the spatial structure information of the 127
81 severely limits their computational efficiency, generalization PAN image. Within such a regularizer, we also explore a 128
82 ability, and model interpretability. new procedure for estimating injection coefficients. 129
85 ally realizing a trade-off between performance and efficiency. the LRTC-based framework, aiming for better completion 132
86 The variational methods are characterized by high general- and global characterization. 133
87 ization and model interpretability [24]. These methods, e.g., The remainder of the paper is organized as follows. The 134
88 [25], [30]–[36], consider the pansharpening problem as an ill- notations and preliminaries are introduced in Section II. The 135
89 posed inverse problem constructing the link among the LR-MS related works and the proposed model are described in Sec- 136
90 image, the PAN image, and the underlying HR-MS image, thus tion III. The proposed algorithm is provided in Section IV. The 137
91 formulating an optimization model. The promising results have numerical experiments are performed in Section V. Finally, the 138
92 been generated by adopting traditional image super-resolution conclusion is drawn in Section VI. 139
95 e.g., [24], [36]. However, due to the coupling of the ill-posed A. Notations 141
96 blurring and downsampling problems, many super-resolution Scalars, vectors, matrices, and tensors are denoted by low- 142
97 models either exhibit the unnecessary solving complexity for ercase letters, e.g., a, lowercase bold letters, e.g., a, upper- 143
98 decoupling, e.g., [24], or result in the unintuitive mixture of case bold letters, e.g., A, and calligraphic letters, e.g., A, 144
99 unfolding-based and tensor-based modeling, e.g., [39]. respectively. For a third-order tensor A ∈ RI1 ×I2 ×I3 , we 145
100 In this paper, we propose a novel variational pansharpening employ A(:, :, i) or A(i) for its i-th frontal slice, A(i, j, :) 146
101 method, i.e., the low-rank tensor completion (LRTC)-based for its (i, j)-th tube, and A(i, j, k) or ai,j,k for its (i, j, k)-th 147
102 framework with the deblurring regularizer, called LRTCFPan. element. TheqP Frobenius norm of A ∈ RI1 ×I2 ×I3 is defined as 148
104 Firstly, we formulate a new ISR degradation model, thus Fourier transformation (DFT) on all the tubes of A. Relying 150
105 theoretically decoupling and converting the original pansharp- upon the MATLAB command, we have Ā = fft(A, [ ], 3). 151
106 ening problem into the LRTC-based framework, which directly Conversely, A can be obtained from Ā via the inverse DFT 152
107 eliminates the downsampling operator before regularization. along each tube, i.e., A = ifft(Ā, [ ], 3). 153
108 Secondly, motivated by both the high-pass modulation (HPM)
109 scheme and the local similarity of remote sensing images, we B. Preliminaries 154
110 develop a new local-similarity-based dynamic detail mapping
111 (DDM) regularizer, which is imposed on the LRTC-based For clarity, we provide some definitions and theorems, and 155
112 framework to dynamically capture the high-frequency infor- briefly introduce the LRTC basics. 156
113 mation of the PAN image. Furthermore, the low-tubal-rank Definition II.1 (Tensor convolution (t-Conv)). Given a third- 157
114 characteristic is investigated, and the low-tubal-rank prior is order tensor A ∈ RI1 ×I2 ×I3 and a convolution kernel tensor 158
115 introduced for better completion and global characterization. B ∈ Rm×m×I3 , where set {B(i) }Ii=1
3
indicates various kernels 159
116 Under the ADMM framework, the proposed LRTCFPan model along the spectral dimension. Then, the t-Conv between A and 160
117 is efficiently solved. Extensive experiments confirm the supe- B yields a tensor A • B ∈ RI1 ×I2 ×I3 , whose i-th frontal slice 161
118 riority of the proposed LRTCFPan method over other classical is defined by 162
123 izer for pansharpening. Such a strategy directly eliminates [40]). Let A ∈ RI1 ×I2 ×I3 be a third-order tensor, then it can 165
124 the downsampling operator and provides a valuable per- be factorized as 166
In particular, the inverse DFT S = ifft(S̄, [ ], 3) gives the rank priors, such as the Tucker rank [45], the multi-rank [46], 205
following equation and the fibered rank [44]. Mathematically, the general rank- 206
I3
minimization tensor completion model is formulated as 207
1 X
S(i, i, 1) = S̄(i, i, k), min rank(X ) s.t. PΩ (X ) = Y, (1)
I3 X
k=1
where X is the underlying tensor, Y is the observed tensor,
179 where S̄(:, :, k) is the singular value matrix of the k-th frontal
Ω is the index set indicating available entries, and PΩ (·) is
180 slice of Ā. That is, rankt (A) = max (rankm (A)).
the projection function keeping the entries of X in Ω while
181 Definition II.3 (Tensor singular value [43]). Given a third- forcing all the other values to zeros, i.e.,
182 order tensor A ∈ RI1 ×I2 ×I3 , then the singular values of A (
are defined as the diagonal elements of S(i, i, 1), where S is xi1 ,i2 ,··· ,iN , if (i1 , i2 , · · · , iN ) ∈ Ω,
183 (PΩ (X ))i1 ,i2 ,··· ,iN :=
184 provided by the t-SVD A = U ∗ S∗V H . 0, otherwise.
185 Therefore, rankt (A) is equivalent to the number of non- Remark II.1. According to the requirements of the projection 208
186 zero tensor singular values of A, and its non-convex approxi- function in (1), variables X and Y must have the same size, 209
187 mation can be given via the following Definition II.4. and their elements in the set Ω must be numerically equivalent. 210
192 where S̄ = fft(S, [ ], 3), in which S is provided by the t-SVD Y ∈ Rh×w×S , and the PAN image P ∈ RH×W . Additionally, 217
193 A = U ∗ S∗V H , t is the rankt (A), and is a small positive H = h × r and W = w × r hold, where r is the scale factor. 218
197 of tensor Y ∈ RI1 ×I2 ×I3 , a closed-form minimizer of regarded as the degraded version of the underlying HR-MS 221
X 2 the single image super-resolution problem [47], [48], there also 224
198 is given by the t-SVT as Proxτ (Y), which is defined by exists an acknowledged and widely used degradation model for 225
248 Two common options for defining the coefficient are G = 1 the following rank-minimization problem 281
256 As previously described, the coupled formulation between unstable computational accuracy and hypersensitivity, which 288
257 blurring and downsampling typically causes two drawbacks: 1) are explained by the nonuniqueness of Yb and the oversensi- 289
258 the unnecessary solving complexity for decoupling, and 2) the tivity of P
bLP for different low-pass filters. Moreover, although 290
259 inconsistency in modeling form. To alleviate these limitations, Y is originally adopted to approximate the low-frequency
b 291
260 we consider developing a new ISR degradation model by in- information of X , the chaotic relationship is inevitably caused 292
261 vestigating the downsampling operator. As illustrated in Fig. 3, owing to Y = (X • B + N1 ) ↓r . To address these deficiencies, 293
262 the “nearest” downsampling ↓r can actually be refined into a we consider directly computing the low-frequency information 294
263 two-stage operator, i.e., “nearest” sampling and decimation, of X by X • B and developing a novel strategy for estimating 295
264 and the former is a sampling mode for the LRTC problem. the coefficient. Resultantly, we have 296
271 Consequently, the new ISR degradation model can be repre- tails, we conduct model (8) on each image patch to learn more 302
272 sented as Y = (X •B+N1 ) ↓r , which assumes that the LR-MS accurate coefficients (see Fig. 4), thus completely forming the 303
273 image is the blurred, noisy, then downsampled version of the local-similarity-based DDM regularizer. Equipped with such a 304
274 underlying HR-MS image. When the LR-MS image is further regularizer, the rank-minimization model (6) is improved as 305
275 processed, the degradation model can be equivalently rewritten min rank(X • B + N1 ) + λ1 kX − X • B − Dk2F
X ,X •B+N1
276 as the following projection-based form (9)
s.t. PΩ (X • B + N1 ) = Y ↑r,0 .
PΩ (X • B + N1 ) = Y ↑r,0 , (5) 306
Singular values
Singular values
Singular values
60 60 60
316 of the optimal CP-rank approximation cannot be assured [59]. 40 40 40
317 Moreover, since the X • B + N1 for pansharpening is merely 20 20 20
+ λ2 kX klt ΥE3 := (P
b • B) ↓r −E3 . (18)
s.t. PΩ (X • B + N1 ) = Y ↑r,0 . Ultimately, coefficients Gnew , i = 1, 2, · · · , S, can be esti-
(i)
345
(11) mated by 346
364 subproblem can be given as solution of the i-th problem is given by 373
2
2
η1
Λ1
η2
Λ2
(i) −1
(i) (i) ‡
min
X − Q+
+
X −R+
X ←F Σ ./ η3 F(B ) · F(B ) + η1 + η2
X 2
η1 F 2
η2
F
2 (23) (27)
η3
X • B − Z + Λ3
.
+ with 374
2
η3
F (i) (i) (i) (i)
Σ =η1 F(Q ) + η2 F(R ) − F(Λ1 ) − F(Λ2 )
365 According to the modulation transfer function (MTF)-matched (28)
(i)
366 filters [60], the B(i) , i = 1, 2, · · · , S, can be configured with + η3 F(Z(i) ) − F(Λ3 ) · F(B(i) )‡ ,
367 different blurring kernels [36]. Accordingly, we can rearrange
368 problem (23) as the frontal slice-based expression, i.e., where F(·) and F −1 (·) are the 2-D fast Fourier transform 375
(FFT) and its inverse operator, respectively, and ‡ denotes the 376
(i)
2
S
η1 X
(i) Λ complex conjugate. 377
min
X − Q(i) + 1
X 2 i=1
η1
2) T -subproblem: Similarly, the T -subproblem is 378
F
(i)
2 min λ2 kZ − T k2F + λ3 kT klt + ι(T ). (29)
S
η2 X
(i) Λ T
+
X − R(i) + 2
(24)
2 i=1
η2
Based on Theorem 2 and the definition of indicator function 379
F
S
(i) 2
ι(T ), we have 380
η3 X
(i) (i) (i) Λ3
+
X ⊗ B − Z +
,
2 η3
i=1
F
T ← PΩ Prox λ3 (Z) + Y ↑r,0 ,
c (30)
2λ2
369 which is equivalent to c
where Ω indicates the complementary set of Ω. 381
(i) 2
S 3) Q-subproblem: By fixing the other estimated directions
X η1
(i) (i) Λ1
382
min
X − Q +
X 2 η
for alternating, we obtain the Q-subproblem as 383
i=1
1
F
2
(i) 2
η1
Λ1
η2
(i)
(i) Λ2
min kQklt +
X − Q+
. (31)
+
X − R +
(25) Q 2
η1
F
2
η2
F Based on Theorem 2 again, we can immediately get 384
(i) 2
!
η3
(i)
(i) (i) Λ3
Λ1
+
X ⊗ B − Z + .
2
η3
Q ← Prox1 X + . (32)
F
η1 η1
7
EXP [61] PRACS [9] C-GSA [62] BDSD-PC [63] AWLP [12] GLP-CBD [60] GLP-FS [64] MF-HG [65]
GT 18’TIP [32] 19’IF [35] 19’CVPR [25] RR [66] CDIF [49] BAGDC [51] LRTCFPan GT
0.15
0.1
EXP [61] PRACS [9] C-GSA [62] BDSD-PC [63] AWLP [12] GLP-CBD [60] GLP-FS [64] MF-HG [65]
0.05
18’TIP [32] 19’IF [35] 19’CVPR [25] RR [66] CDIF [49] BAGDC [51] LRTCFPan GT
Fig. 6. The fusion results on the reduced-resolution Guangzhou dataset (source: GF-2). The first two rows: the visual inspection of the ground-truth (GT)
image and the close-ups of the fused images. The last two rows: the residual maps using the GT image as a reference.
2
To validate the superiority of the proposed LRTCFPan
2
X − R + Λ2
,
η2
406
min λ1 kR − Z − DkF + (33) method, we conduct comprehensive numerical experiments on
R 2
η2
F 407
2λ1 (Z + D) + η2 X + Λ2 and the Rio dataset (source: WV-3). The scale factors for all 410
R ← . (34)
2λ1 + η2 the datasets are 4, i.e., r = 4. Numerically, all experimental 411
387 5) Z-subproblem: The Z-subproblem is data are pre-normalized into [0, 1]. All the experiments are 412
2 η3
X • B − Z + Λ3
of RAM and an Intel(R) Core(TM) i5-4590 CPU: @3.30 GHz. 414
min λ1 kR − Z − DkF +
Z 2
η3
F (35) For each sensor, e.g., GF-2, QB, and WV-3, S + 1 low-pass 415
(i.e., the blurring kernels of the MS image), and the (·)LP 417
388 Correspondingly, the closed-form solution is given by (i.e., the blurring kernel of the PAN image). According to 418
2λ1 (R − D) + 2λ2 T + η3 X • B + Λ3 [60], the kernels designed to match the modulation transfer 419
Z ← . (36)
2(λ1 + λ2 ) + η3 functions (MTFs) of MS and PAN sensors are advisable. 420
389 Under the ADMM framework, the Lagrangian multipliers More specifically, these S + 1 blurring kernels are assumed 421
391 The solving pseudocode for the proposed LRTCFPan model FS [64], MF-HG [65], 18’TIP [32], 19’IF [35], 19’CVPR [25], 427
392 is summarized in Algorithm 1. RR [66], CDIF [49], and BAGDC [51]. It is worth remarking 428
that the source codes of the competitors are available at either 429
393 B. Computational Complexity Analysis the website2 or the authors’ homepages. The hyper-parameters 430
TABLE I
T HE QUALITY METRICS ON 82 IMAGES WITH A PAN SIZE OF 256 × 256 FROM THE REDUCED - RESOLUTION G UANGZHOU DATASET ( SOURCE : GF-2).
(B OLD : BEST; U NDERLINE : SECOND BEST )
TABLE II
T HE QUALITY METRICS ON 42 IMAGES WITH A PAN SIZE OF 256 × 256 FROM THE REDUCED - RESOLUTION I NDIANAPOLIS DATASET ( SOURCE : QB).
(B OLD : BEST; U NDERLINE : SECOND BEST )
440 in synthesis (ERGAS) [69], and the Q2n [70], are adopted. the simulated HR-MS image, the simulated LR-MS image, 452
441 When evaluated at full-resolution (i.e., real) data, the quality and the simulated PAN image can be considered as the blurred 453
442 with no reference (QNR) [71] metric, which consists of the and downsampled versions of the underlying HR-MS image, 454
443 spectral distortion index (i.e., Dλ ) and the spatial distortion the real LR-MS image, and the real PAN image, respectively. 455
444 index (i.e., Ds ), is employed. Since the ISR degradation model Y = (X •B+N1 ) ↓r assumes 456
that the real LR-MS image is the blurred and downsampled 457
EXP [61] PRACS [9] C-GSA [62] BDSD-PC [63] AWLP [12] GLP-CBD [60] GLP-FS [64] MF-HG [65]
GT 18’TIP [32] 19’IF [35] 19’CVPR [25] RR [66] CDIF [49] BAGDC [51] LRTCFPan GT
0.1
0.09
0.08
0.07
0.06
0.05 EXP [61] PRACS [9] C-GSA [62] BDSD-PC [63] AWLP [12] GLP-CBD [60] GLP-FS [64] MF-HG [65]
0.04
0.03
0.02
0.01
18’TIP [32] 19’IF [35] 19’CVPR [25] RR [66] CDIF [49] BAGDC [51] LRTCFPan GT
Fig. 7. The fusion results on the reduced-resolution Rio dataset (source: WV-3). The first two rows: the visual inspection of the ground-truth (GT) image
and the close-ups of the fused images. The last two rows: the residual maps using the GT image as a reference.
465 Figs. 6-7. Compared with the GT image, Fig. 6 reveals that LRTCFPan method can recover the correct direction of the 503
466 the GLP-CBD, the GLP-FS, the CDIF, and our LRTCFPan shadow of the car. Therefore, the effectiveness and superiority 504
467 methods obtain the better performance from both spectral of the LRTCFPan method are corroborated at full resolution. 505
491 whereas the LR-MS image (or the recovered image of the 1) Parameter Analysis: In Algorithm 1, seven hyperparam- 526
492 EXP method) is the spectral reference. According to Fig. 8, eters are theoretically involved, including the regularization 527
493 although many compared approaches, e.g., the PRACS, the parameters (i.e., λ1 , λ2 , and λ3 ), the penalty parameters (i.e., 528
494 AWLP, the GLP-FS, the MF-HG, and the 19’IF, obtain clearer η1 , η2 , and η3 ), and the blocksize of the block-based DDM 529
495 details, the inferior spectral fidelity is caused. Moreover, the regularizer. Among them, λ3 and η1 control the low-tubal-rank 530
496 C-GSA, the GLP-CBD, the 18’TIP, and the CDIF methods properties of X • B + N1 and X , respectively. Empirically, 531
497 generate abnormal colors, structures, or artifacts. In contrast, λ3 and η1 can be pre-determined within a small range, e.g., 532
498 the LRTCFPan and the BDSD-PC methods show the better {10−4 , 10−3 , 10−2 , 10−1 }. Similarly, the blocksize can also be 533
499 trade-off between spatial sharpening and spectral consistency. selected from {8 × 8, 10 × 10}, showing promising results in 534
500 From Fig. 9, we can observe that only the C-GSA, the BDSD- almost all the experiments. Afterwards, the remaining param- 535
501 PC, the 19’IF, and the LRTCFPan methods can reconstruct the eters, i.e., λ1 , λ2 , η2 , and η3 , are searched by jointly reaching 536
502 right shape and color of the acquired car. Especially, only the the optimal SAM, SCC, ERGAS, and Q2n metrics. For 537
10
TABLE III
T HE QUALITY METRICS ON 15 IMAGES WITH A PAN SIZE OF 256 × 256 FROM THE REDUCED - RESOLUTION R IO DATASET ( SOURCE : WV-3).
(B OLD : BEST; U NDERLINE : SECOND BEST )
TABLE IV
T HE QUANTITATIVE RESULTS FOR ALL THE COMPARED METHODS ON ( A ) 15 IMAGES FROM THE FULL - RESOLUTION G UANGZHOU DATASET ( SOURCE :
GF-2) AND ( B ) 15 IMAGES FROM THE FULL - RESOLUTION I NDIANAPOLIS DATASET ( SOURCE : QB). T HE SIZE OF THE PAN IMAGE IS 400 × 400.
(B OLD : BEST; U NDERLINE : SECOND BEST )
538 brevity, Fig. 10 presents the performance of varying λ1 , λ2 , Rio (sensor: WV-3) datasets. For a better presentation, the 550
539 η2 , and η3 on the reduced-resolution Guangzhou data (source: maximum number of iterations is empirically set to 200. 551
540 GF-2). Obviously, λ1 = 5 × 10−2 , λ2 = 1.8 × 101 , η2 = 8.1, In any considered case, the value of the objective function 552
541 and η3 = 1.8 are the best parameters for configuration. By becomes stable as the iteration number increases, implying 553
542 adopting the same strategy on different datasets, all parameter the numerical convergence behavior of Algorithm 1. 554
545 norm of Definition II.4 is non-convex, the convergence of the model, we further conduct the ablation study of model (12) 556
546 proposed ADMM-based LRTCFPan algorithm cannot be the- on the reduced-resolution Guangzhou image (sensor: GF-2). 557
547 oretically guaranteed. As depicted in Fig. 11, we numerically The following three sub-models are generated to independently 558
548 illustrate the algorithm convergence on the reduced-resolution verify the contributions of the two low-tubal-rank priors and 559
549 Guangzhou (sensor: GF-2), Indianapolis (sensor: QB), and the proposed local-similarity-based DDM regularizer. 560
11
Submodel-I: Submodel-III:
min kX klt + λ1 kX • B − T k2F + λ2 kT klt
min kX klt + λ1 kX − X • B − Dk2F + λ2 kX • B − T k2F X ,T
X ,T
s.t. PΩ (T ) = Y ↑r,0 .
s.t. PΩ (T ) = Y ↑r,0 ,
After all optimal parameter configurations are satisfied, the 561
Submodel-II: quantitative results of these models are reported in Table VII. 562
s.t. PΩ (T ) = Y ↑r,0 , better, implying the remarkable effectiveness of the regularizer. 565
12
566 Moreover, two low-tubal-rank priors also realize incremental whose augmented Lagrangian function is 573
571 is usually involved, e.g., [24], leading to the following con- N1 ) ↓r is employed, we only need to consider the augmented 575
1 2 1
min kZ ↓r −YkF s.t. Z = X • B, (38) L(X , T ) = kX • B − T k2F + ι(T ). (40)
X ,Z 2 2
13
TABLE V TABLE VI
T HE QUALITY METRICS FOR 42 IMAGES FROM THE FULL - RESOLUTION T HE HYPER - PARAMETER SETTINGS OF THE PROPOSED MODEL FOR
R IO DATASET ( SOURCE : WV-3). T HE SIZE OF THE PAN IMAGE IS DIFFERENT CASES . (R: R EDUCED RESOLUTION ; F: F ULL RESOLUTION )
400 × 400. (B OLD : BEST; U NDERLINE : SECOND BEST )
Dataset Case λ1 λ2 λ3 η1 η2 η3 Blocksize
Method Dλ Ds QNR Time[s]
R 0.05 18 10−4 10−4 8.1 1.8 8×8
0.004 ± 0.001 ± ± 0.06 Guangzhou
EXP [61] 0.105 0.019 0.892 0.019 F 1.00 50 10−4 10−4 2.1 4.7 10 × 10
PRACS [9] 0.018 ± 0.013 0.054 ± 0.035 0.928 ± 0.040 0.51 R 0.11 65 10−4 10−4 1.1 8.7 8×8
C-GSA [62] 0.044 ± 0.038 0.075 ± 0.064 0.887 ± 0.086 0.94 Indianapolis
F 0.40 75 10−1 10−3 2.1 6.7 10 × 10
BDSD-PC [63] 0.020 ± 0.011 0.044 ± 0.021 0.937 ± 0.029 0.14 R 0.14 56 10−4 10−4 4.2 8.3 8×8
AWLP [12] 0.051 ± 0.057 0.058 ± 0.072 0.898 ± 0.101 0.57 Rio
F 1.10 36 10−4 10−4 6.2 3.3 10 × 10
GLP-CBD [60] 0.065 ± 0.084 0.046 ± 0.037 0.894 ± 0.100 119.49
GLP-FS [64] 0.045 ± 0.047 0.056 ± 0.064 0.904 ± 0.091 0.26
10
4
14000 104
3 5
MF-HG [65] 0.053 ± 0.050 0.064 ± 0.055 0.889 ± 0.087 0.18 2.5 12000
4
8000
3
19’IF [35] 0.087 ± 0.043 0.096 ± 0.048 0.828 ± 0.080 55.94 1.5
1
6000 2
4000
19’CVPR [25] 0.016 ± 0.006 0.046 ± 0.012 0.939 ± 0.016 73.26 0.5
2000
1
0
RR [66] 0.062 ± 0.052 0.086 ± 0.077 0.861 ± 0.103 102.93 0 0
-0.5 -2000 -1
CDIF [49] 0.028 ± 0.009 0.048 ± 0.016 0.926 ± 0.018 182.18 0 50 100
Iteration
150 200 0 50 100
Iteration
150 200 0 50 100
Iteration
150 200
BAGDC [51] 0.060 ± 0.055 0.048 ± 0.048 0.898 ± 0.088 4.60 (a) (b) (c)
LRTCFPan 0.022 ± 0.013 0.022 ± 0.027 0.956 ± 0.036 153.51 Fig. 11. The curves of the objective function values on the reduced-resolution
Ideal value 0 0 1 - (a) Guangzhou (sensor: GF-2), (b) Indianapolis (sensor: QB), and (c) Rio
(sensor: WV-3) datasets.
3 2
SAM 10 150
SCC Traditional ISR Model Traditional ISR Model
2 ERGAS Our ISR Model
1 Our ISR Model
Q4 8
1 Best point
100
Runtime (s)
0
Runtime (s)
6
0
-1
SAM 4
-1 50
SCC
-2 ERGAS
-2 Q4 2
Best point
-3 -3 0 0
5 10-8 5 10-5 5 10-2 5 101 1.8 10-5 1.8 10-2 1.8 101 1.8 104 1 50 100 150 200 250 300 162 322 642 1282 2562
Parameter Parameter 2
Iteration Spatial Size of LR-MS image
1
-3 -2
8.1 10-3 8.1 100 8.1 103 8.1 106 1.8 10-3 1.8 100 1.8 103 1.8 106
Parameter 2
Parameter 3
unfolding-based and tensor-based modeling, e.g., [39]. 586
(c) (d) 5) Applicable Scope: Since the proposed LRTCFPan model 587
them with the same range of values, the obtained indexes are post-processed istic of numerous multispectral images. For such a statistical 590
by zero-mean normalization, i.e., (index − Mean(index))/Std(index). analysis, all simulated experimental data, i.e., 82 Guangzhou 591
Moreover, the means and the standard deviations of the SAM, the SCC, the
ERGAS, and the Q4 are provided for four subfigures, i.e., (a) 2.341 ± 0.467; images (sensor: GF-2), 42 Indianapolis images (sensor: QB), 592
0.947 ± 0.017; 2.897 ± 0.596; 0.812 ± 0.069, (b) 14.913 ± 9.950; and 15 Rio images (sensor: WV-3), are employed. According 593
0.588 ± 0.298; 19.503 ± 9.366; 0.267 ± 0.362, (c) 3.515 ± 3.335; to Fig. 13, the corresponding multispectral images demonstrate 594
0.904 ± 0.120; 13.638 ± 11.367; 0.494 ± 0.438, and (d) 10.008 ± 8.247;
0.500 ± 0.405; 17.518 ± 10.917; 0.358 ± 0.394. specific low-rank characteristics. Consequently, the applicabil- 595
577 Under the ADMM algorithm framework, the proposed ISR numerical experiments, only the traditional CS, MRA, and 598
578 degradation model avoids the computational complexity (i.e., variational pansharpening methods are involved. To compre- 599
2
579 O(HW S)) of solving 21 kZ ↓r −YkF . As depicted in Fig. 12, hensively demonstrate the performance, we further compare 600
580 the computational times are reduced. Moreover, since the the proposed LRTCFPan model with the CNN-based DCFNet 601
581 downsampling operator ↓r is eliminated by the tensor comple- method [73] on all reduced-resolution data, i.e., 82 Guangzhou 602
582 tion step, the matrixization of ↓r is not included in the resulting images (sensor: GF-2), 42 Indianapolis images (sensor: QB), 603
583 model. Consequently, the proposed LRTCFPan model can be and 15 Rio images (sensor: WV-3). Particularly, the pretraining 604
584 formulated in the tensor-based form, which is more physically datasets of the DCFNet model for the GF-2, QB, and WV- 605
585 intuitive than the matrix-based modeling or the mixture of 3 cases are the Beijing (sensor: GF-2), Indianapolis (sensor: 606
14
TABLE VII
T HE QUANTITATIVE RESULTS OF THE ABLATION EXPERIMENT ON THE REDUCED - RESOLUTION G UANGZHOU DATA ( SOURCE : GF-2).
(B OLD : BEST; U NDERLINE : SECOND BEST )
ISR
Low-Rank Prior Low-Rank Prior Local-Similarity-Based
Configuration Degradation PSNR SSIM SAM SCC ERGAS Q4
for X • B + N1 for X DDM Regularizer
Model
EXP [61] ! % % % 29.3053 0.8016 2.4860 0.9429 3.1620 0.8360
Submodel-I ! % ! ! 35.0918 0.9150 2.0104 0.9802 1.6582 0.9359
Submodel-II ! ! % ! 34.9260 0.9109 2.0373 0.9794 1.6971 0.9327
Submodel-III ! ! ! % 29.9401 0.7899 2.5329 0.9494 2.9143 0.8332
LRTCFPan ! ! ! ! 35.1550 0.9155 2.0089 0.9803 1.6470 0.9364
Ideal value - - - - +∞ 1 0 1 0 1
TABLE VIII
T HE QUALITY METRICS OF DIFFERENT METHODS ON THE REDUCED - RESOLUTION G UANGZHOU ( SENSOR : GF-2), I NDIANAPOLIS ( SENSOR : QB), AND
R IO ( SENSOR : WV-3) DATASETS . (B OLD : BEST; U NDERLINE : SECOND BEST )
The HR-MS image X The blurred image X B The blurred and noisy image X B + N 1 VI. C ONCLUSIONS 621
Tubal-rank approximation
(a) (b) (c) spatial information from the PAN image to the underlying HR- 629
Fig. 13. The statistics of the approximation of the tubal rank on different two low-tubal-rank constraints are simultaneously imposed. 631
simulated datasets, including (a) 82 Guangzhou images (sensor: GF-2), (b)
42 Indianapolis images (sensor: QB), and (c) 15 Rio images (sensor: WV-3).
To regularize the proposed model, we developed an efficient 632
The standard deviation of Gaussian noise is 0.01. ADMM-based algorithm. The numerical experiments demon- 633
R EFERENCES 635
609 WV-3 case, the DCFNet method is significantly superior to target detection and recognition in multiband imagery: A unified ML 641
detection and estimation approach,” IEEE Trans. Image Process., vol. 642
610 the LRTCFPan method, which is reasonable provided that the 6, no. 1, pp. 143–156, 1997. 1 643
611 Rio dataset is included in the training data of the former. [3] P. Zhong and R. Wang, “Learning conditional random fields for 644
612 Furthermore, when applied to the Indianapolis dataset (testing classification of hyperspectral images,” IEEE Trans. Image Process., 645
vol. 19, no. 7, pp. 1890–1907, 2010. 1 646
613 images), the DCFNet method does not exhibit the advantage [4] P. X. Zhuang, Q. S. Liu, and X. H. Ding, “Pan-GGF: A probabilistic 647
614 over the LRTCFPan method, even if the former is pretrained method for pan-sharpening with gradient domain guided image filtering,” 648
615 on the Indianapolis dataset. Instead, the DCFNet method is Signal Process., vol. 156, pp. 177–190, 2019. 1 649
[5] P. Kwarteng and A. Chavez, “Extracting spectral contrast in landsat the- 650
616 inferior to the LRTCFPan method on the Guangzhou dataset matic mapper image data using selective principal component analysis,” 651
617 owing to its limited generalization ability. Consequently, the Photogramm. Eng. Remote Sens., vol. 55, no. 1, pp. 339–348, 1989. 1 652
618 superior algorithm robustness and generalization capability of [6] W. Carper, T. Lillesand, and R. Kiefer, “The use of intensity-hue- 653
saturation transformations for merging SPOT panchromatic and multi- 654
619 the LRTCFPan method are mainly demonstrated, which may spectral image data,” Photogramm. Eng. Remote Sens., vol. 56, no. 4, 655
620 endow such a method with more practical significance. pp. 459–467, 1990. 1 656
15
657 [7] B. Aiazzi, S. Baronti, and M. Selva, “Improving component substitution [30] C. Ballester, V. Caselles, L. Igual, J. Verdera, and B. Rouge, “A 734
658 pansharpening through multivariate regression of MS + Pan data,” IEEE variational model for P+XS image fusion,” Int. J. Comput. Vis., vol. 735
659 Trans. Geosci. Remote Sens., vol. 45, no. 10, pp. 3230–3239, 2007. 1 69, no. 1, pp. 43–58, 2006. 2 736
660 [8] A. Garzelli, F. Nencini, and L. Capobianco, “Optimal MMSE pan [31] Y. Y. Jiang, X. H. Ding, D. L. Zeng, Y. Huang, and J. Paisley, “Pan- 737
661 sharpening of very high resolution multispectral images,” IEEE Trans. sharpening with a hyper-Laplacian penalty,” in Int. Conf. Comput. Vision 738
662 Geosci. Remote Sens., vol. 46, no. 1, pp. 228–236, 2007. 1 (ICCV), 2015, pp. 540–548. 2 739
663 [9] J. Choi, K. Yu, and Y. Kim, “A new adaptive component-substitution- [32] T. Wang, F. Fang, F. Li, and G. Zhang, “High-quality Bayesian 740
664 based satellite image fusion by using partial replacement,” IEEE Trans. pansharpening,” IEEE Trans. Image Process., vol. 28, no. 1, pp. 227– 741
665 Geosci. Remote Sens., vol. 49, no. 1, pp. 295–309, 2010. 1, 7, 8, 9, 10, 239, 2018. 2, 7, 8, 9, 10, 11, 12, 13 742
666 11, 12, 13 [33] L. J. Deng, G. Vivone, W. H. Guo, M. Dalla Mura, and J. Chanussot, 743
667 [10] G. Vivone, L. Alparone, J. Chanussot, M. Dalla Mura, A. Garzelli, G. A. “A variational pansharpening approach based on reproducible Kernel 744
668 Licciardi, R. Restaino, and L. Wald, “A critical comparison among Hilbert space and Heaviside function,” IEEE Trans. Image Process., 745
669 pansharpening algorithms,” IEEE Trans. Geosci. Remote Sens., vol. 53, vol. 27, no. 9, pp. 4330–4344, 2018. 2 746
670 no. 5, pp. 2565–2586, 2014. 1, 4 [34] A. M. Teodoro, J. M. Bioucas-Dias, and M. A. Figueiredo, “A con- 747
671 [11] M. J. Shensa, “The discrete wavelet transform: Wedding the a trous and vergent image fusion algorithm using scene-adapted Gaussian-mixture- 748
672 Mallat algorithms,” IEEE Trans. Signal Process., vol. 40, no. 10, pp. based denoising,” IEEE Trans. Image Process., vol. 28, no. 1, pp. 451– 749
673 2464–2482, 1992. 1 463, 2018. 2 750
674 [12] X. Otazu, M. Gonzalezaudicana, O. Fors, and J. Nunez, “Introduction [35] L. J. Deng, M. Y. Feng, and X. C. Tai, “The fusion of panchromatic and 751
675 of sensor spectral response into image fusion methods. Application to multispectral remote sensing images via tensor-based sparse modeling 752
676 wavelet-based methods,” IEEE Trans. Geosci. Remote Sens., vol. 43, and hyper-Laplacian prior,” Inf. Fusion, vol. 52, pp. 76–89, 2019. 2, 7, 753
677 no. 10, pp. 2376–2385, 2005. 1, 7, 8, 9, 10, 11, 12, 13 8, 9, 10, 11, 12, 13 754
678 [13] J. G. Liu, “Smoothing filter-based intensity modulation: A spectral [36] Z. C. Wu, T. Z. Huang, L. J. Deng, G. Vivone, J. Q. Miao, J. F. 755
679 preserve image fusion technique for improving spatial details,” Int. J. Hu, and X. L. Zhao, “A new variational approach based on proximal 756
680 Remote Sens., vol. 21, no. 18, pp. 3461–3472, 2000. 1 deep injection and gradient intensity similarity for spatio-spectral image 757
681 [14] J. F. Yang, X. Y. Fu, Y. W. Hu, Y. Huang, X. H. Ding, and J. Paisley, fusion,” IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens., vol. 13, pp. 758
682 “PanNet: A deep network architecture for pan-sharpening,” in Int. Conf. 6277–6290, 2020. 2, 6 759
683 Comput. Vision (ICCV), 2017, pp. 5449–5457. 2 [37] Q. Wei, N. Dobigeon, and J. Y. Tourneret, “Fast fusion of multi-band 760
684 [15] J. F. Hu, T. Z. Huang, L. J. Deng, T. X. Jiang, G. Vivone, and J. Chanus- images based on solving a Sylvester equation,” IEEE Trans. Image 761
685 sot, “Hyperspectral image super-resolution via deep spatiospectral Process., vol. 24, no. 11, pp. 4109–4121, 2015. 2, 3 762
686 attention convolutional neural networks,” IEEE Trans. Neural Netw. [38] R. W. Dian, S. T. Li, A. J. Guo, and L. Y. Fang, “Deep hyperspectral 763
687 Learn. Syst., pp. 1–15, 2021. 2 image sharpening,” IEEE Trans. Neural Netw. Learn. Syst., vol. 29, no. 764
688 [16] Z. Zhu, J. Hou, J. Chen, H. Zeng, and J. Zhou, “Hyperspectral image 99, pp. 1–11, 2018. 2 765
689 super-resolution via deep progressive zero-centric residual learning,” [39] R. W. Dian and S. T. Li, “Hyperspectral image super-resolution via 766
690 IEEE Trans. Image Process., vol. 30, pp. 1423–1438, 2020. 2 subspace-based low tensor multi-rank regularization,” IEEE Trans. 767
691 [17] T. Huang, W. Dong, J. Wu, L. Li, X. Li, and G. Shi, “Deep hyperspectral Image Process., vol. 28, no. 10, pp. 5135–5146, 2019. 2, 3, 13 768
692 image fusion network with iterative spatio-spectral regularization,” IEEE [40] M. E. Kilmer, K. Braman, N. Hao, and R. C. Hoover, “Third- 769
693 Trans. Comput. Imaging, vol. 8, pp. 201–214, 2022. 2 order tensors as operators on matrices: A theoretical and computational 770
694 [18] L. J. Deng, G. Vivone, C. Jin, and J. Chanussot, “Detail injection-based framework with applications in imaging,” SIAM J. Matrix Anal. Appl., 771
695 deep convolutional neural networks for pansharpening,” IEEE Trans. vol. 34, no. 1, pp. 148–172, 2013. 2, 3, 5 772
696 Geosci. Remote Sens., vol. 59, no. 8, pp. 6995–7010, 2020. 2, 4 [41] M. E. Kilmer and C. D. Martin, “Factorization strategies for third-order 773
697 [19] P. Addesso, G. Vivone, R. Restaino, and J. Chanussot, “A data-driven tensors,” Linear Alg. Appl., vol. 435, no. 3, pp. 641–658, 2011. 3 774
698 model-based regression applied to panchromatic sharpening,” IEEE [42] Z. Zhang, G. Ely, S. Aeron, N. Hao, and M. Kilmer, “Novel methods for 775
699 Trans. Image Process., vol. 29, pp. 7779–7794, 2020. 2 multilinear data completion and de-noising based on tensor-SVD,” in 776
700 [20] Z. R. Jin, L. J. Deng, T. J. Zhang, and X. X. Jin, “BAM: Bilateral acti- IEEE Conf. Comput. Vision Pattern Recognit. (CVPR), 2014, pp. 3842– 777
701 vation mechanism for image fusion,” in ACM Int. Conf. on Multimedia 3849. 3 778
702 (ACM MM), 2021. 2 [43] C. Lu, J. Feng, Y. Chen, W. Liu, Z. Lin, and S. Yan, “Tensor robust 779
703 [21] Y. D. Wang, L. J. Deng, T. J. Zhang, and X. Wu, “SSconv: Explicit principal component analysis with a new tensor nuclear norm,” IEEE 780
704 spectral-to-spatial convolution for pansharpening,” in ACM Int. Conf. Trans. Pattern Anal. Mach. Intell., vol. 42, no. 4, pp. 925–938, 2020. 3 781
705 on Multimedia (ACM MM), 2021. 2 [44] Y. B. Zheng, T. Z. Huang, X. L. Zhao, T. X. Jiang, T. H. Ma, and T. Y. 782
706 [22] X. Y. Fu, W. Wang, Y. Huang, X. H. Ding, and J. Paisley, “Deep Ji, “Mixed noise removal in hyperspectral image via low-fibered-rank 783
707 multiscale detail networks for multiband spectral image sharpening,” regularization,” IEEE Trans. Geosci. Remote Sens., vol. 58, no. 1, pp. 784
708 IEEE Trans. Neural Netw. Learn. Syst., vol. 32, no. 5, pp. 2090–2104, 734–749, 2020. 3, 5 785
709 2021. 2 [45] Y. B. Zheng, T. Z. Huang, T. Y. Ji, X. L. Zhao, T. X. Jiang, and T. H. 786
710 [23] X. Lu, J. Zhang, D. Yang, L. Xu, and F. Jia, “Cascaded convolutional Ma, “Low-rank tensor completion via smooth matrix factorization,” 787
711 neural network-based hyperspectral image resolution enhancement via Appl. Math. Model., vol. 70, pp. 677–695, 2019. 3 788
712 an auxiliary panchromatic image,” IEEE Trans. Image Process., vol. 30, [46] X. L. Zhao, W. H. Xu, T. X. Jiang, Y. Wang, and M. Ng, “Deep plug- 789
713 pp. 6815–6828, 2021. 2 and-play prior for low-rank tensor completion,” Neurocomputing, vol. 790
714 [24] Z. C. Wu, T. Z. Huang, L. J. Deng, J. F. Hu, and G. Vivone, “VO+Net: 400, pp. 137–149, 2020. 3 791
715 An adaptive approach using variational optimization and deep learning [47] K. Zhang, W. Zuo, and L. Zhang, “Deep plug-and-play super-resolution 792
716 for panchromatic sharpening,” IEEE Trans. Geosci. Remote Sens., vol. for arbitrary blur kernels,” in IEEE Conf. Comput. Vision Pattern 793
717 60, pp. 1–16, 2022. 2, 3, 4, 12 Recognit. (CVPR), 2019, pp. 1671–1681. 3 794
718 [25] X. Y. Fu, Z. H. Lin, Y. Huang, and X. H. Ding, “A variational pan- [48] Q. Song, R. Xiong, D. Liu, Z. Xiong, F. Wu, and W. Gao, “Fast image 795
719 sharpening with local gradient constraints,” in IEEE Conf. Comput. super-resolution via local adaptive gradient field sharpening transform,” 796
720 Vision Pattern Recognit. (CVPR), 2019, pp. 10265–10274. 2, 7, 8, 9, IEEE Trans. Image Process., vol. 27, no. 4, pp. 1966–1980, 2018. 3 797
721 10, 11, 12, 13 [49] J. L. Xiao, T. Z. Huang, L. J. Deng, Z. C. Wu, and G. Vivone, “A 798
722 [26] F. Fang, F. Li, C. Shen, and G. Zhang, “A variational approach for pan- new context-aware details injection fidelity with adaptive coefficients 799
723 sharpening,” IEEE Trans. Image Process., vol. 22, no. 7, pp. 2822–2834, estimation for variational pansharpening,” IEEE Trans. Geosci. Remote 800
724 2013. 2 Sens., 2022. 3, 7, 8, 9, 10, 11, 12, 13 801
725 [27] X. He, L. Condat, J. M. Bioucas-Dias, J. Chanussot, and J. Xia, “A [50] L. Loncan, L. B. Almeida, J. M. Bioucasdias, X. Briottet, et al., 802
726 new pansharpening method based on spatial and spectral sparsity priors,” “Hyperspectral pansharpening: A Review,” IEEE Geosci. Remote Sens. 803
727 IEEE Trans. Image Process., vol. 23, no. 9, pp. 4160–4174, 2014. 2 Mag., vol. 3, no. 3, pp. 27–46, 2015. 4 804
728 [28] H. A. Aly and G. Sharma, “A regularized model-based optimization [51] H. Lu, Y. Yang, S. Huang, W. Tu, and W. Wan, “A unified pansharpening 805
729 framework for pan-sharpening,” IEEE Trans. Image Process., vol. 23, model based on band-adaptive gradient and detail correction,” IEEE 806
730 no. 6, pp. 2596–2608, 2014. 2 Trans. Image Process., vol. 31, pp. 918–933, 2022. 4, 7, 8, 9, 10, 11, 807
731 [29] C. Chen, Y. Li, W. Liu, and J. Huang, “SIRF: Simultaneous satellite 12, 13 808
732 image registration and fusion in a unified framework,” IEEE Trans. [52] F. L. Hitchcock, “The expression of a tensor or a polyadic as a sum of 809
733 Image Process., vol. 24, no. 11, pp. 4213–4224, 2015. 2 products,” J. Math. Phys., vol. 6, no. 1-4, pp. 164–189, 1927. 5 810
16