Professional Documents
Culture Documents
Shannon Theory On General Probabilistic Theory
Shannon Theory On General Probabilistic Theory
Abstract
This paper is an effort to extend Shannon theory to general probabilistic theory.
1
A state ω and a mesurement M defines the probablity dsitribution
cl conv S = S.
2
our concern: the size |X | of the set X of messages and the average success
probability Psucc of decode:
1 X
Psucc = hω(x), mx i .
|X |
x∈X
Note m0 does not appear none of figures of merits. Therefore, M = (mx )x∈X ,
which is subject to the restriction
X
mx ≥ 0, mx ≤ u, (2)
x∈X
N (F ) := sup |X | × Psucc
X ,ω(·),M
X
= sup hω(x), mx i. (3)
X ,ω(·),M x∈X
3
This also can be written as
X
N (F ) = sup hω, mω i,
M
ω∈ F
where the supremum taken over all M ’s such that mω 6= 0 at most finite subset
of F . (Then the sum in the RHS is well-defined.)
Then X
N (F ) ≤ sup hω, mω i + hη, G(M )i
M
ω∈ F
Therefore,
N (F ) ≤ inf ϕ(η), (4)
η≥0
where
X
ϕ(η) := sup hω, mω i + hη, G(M )i
M
ω∈ F
X
= hη, ui + sup hω − η, mω i
M
ω∈ F
hη, ui , if η ≥ ω, ∀ω ∈ F,
= .
∞ otherwise
Below we show the inequality (4) saturates and inf in the RHS can be re-
placed by min using Lemma 15. To this end, we check the hypothesis of the
lemma : (i) existence of a decoder M1 with G(M1 ) > 0 and (ii) N (F ) is finite.
First, consider M1 such that
u − m1 , ω = ω0 ,
mω =
0, otherwise.
Recall ”≥” is proper in the sense that there is an interior point m01 of the positive
cone. Also, there is a > 0 with au ≥ m01 . Thus, if m1 = a1 m01 , M1 satisfies (2)
and G(M1 ) = m1 > 0. Thus (i) is verified.
Second, N (F ), being positive, is bounded from below. To show N (F ) is
bounded from above, we use (4) and the assumption that B ∗ is finite dimen-
sional: let (ξi )li=1 be the maximal set of linearly independent vectors inf F .
Then expand ω ∈ F is written as
l
X l
X
ω= ai ξi ≤ max |ai | ξi .
i
i=1 i=1
4
Since the map ω → maxi |ai | is continuous and F is comapct, this can be
bounded by amax > 0 from above. Therefofe,
l
X
ω ≤ amax ξi ,
i=1
and
l
X l
X
N (F ) ≤ ϕ(amax ξi ) = amax hξi , ui = amax l < ∞,
i=1 i=1
verifying (ii).
Thus, N (F ) is rewritten as follows:
N (F ) = min{hη, ui ; c ≥ 0, ∀ω ∈ F ω ≤ η, η ≥ 0}
= min{c; c ≥ 0, ∀ω ∈ F ω ≤ cω0 , ω0 ∈ S}.
where Bε1 (ω) is the closed ε−ball centered at ω with respect to the norm k·k1 .
The former and the latter is a GPT version of and the Renner’s max-relative
entropy and the ε-smoothed max relative entropy, respectively [2][3].
and
ε ε ε
2Dmax (ωkω0 ) ≤ p2Dmax (ω1 kω0 ) + (1 − p)2Dmax (ω2 kω0 ) , (7)
ε ε ε
Dmax (ωkω0 ) ≤ pDmax (ωkω0,1 ) + (1 − p)Dmax (ωkω0,2 ). (8)
0
Proof. It suffices to show the last two inequality, since Dmax = Dmax .
0 1
To show (7), To show this, observe, for any ωi ∈ Bε (ωi ) (i = 1, 2),
ω 0 := pω10 + (1 − p)ω20
= p{(ω10 − ω1 ) + (ω1 − ω)}
+ (1 − p) {(ω20 − ω2 ) + (ω2 − ω)} + ω
= p (ω10 − ω1 ) + (1 − p) (ω20 − ω2 ) + ω
5
is an element of pBε1 (0) + (1 − p)Bε1 (0) + ω = Bε1 (ω). If ωi0 ≤ 2λi ω0 (i = 1, 2) in
addition, then
ω 0 ≤ p2λ1 + (1 − p)2λ2 ω0 .
so that
p2λ−λ1 + (1 − p)2λ−λ2 = 1.
Then
= 2λ ω0 .
Also, ω 00 ∈ Bε1 (ω) due to convexity of Bε1 (ω). Thus, to each λi > Dmax (ωi00 kω0,1 ),
there is δ > 0 such that
ε
Dmax (ωkω0 ) ≤ Dmax (ω 00 kω0 )
≤λ
00 00
= − log2 (p2−Dmax (ω1 kω0,1 ) + (1 − p)2−Dmax (ω2 kω0,2 ) ) + δ
00 00
≤ −p log2 2−Dmax (ω1 kω0,1 ) − (1 − p) log2 2−Dmax (ω2 kω0,1 ) + δ
= pDmax (ω100 kω0,1 ) + (1 − p)Dmax (ω200 kω0,2 ) + δ
ε ε
≤ pDmax (ωkω0,1 ) + (1 − p)Dmax (ωkω0,2 ) + δ
and that δ & 0 as maxi=1,2 {λi − Dmax (ωi00 kω0,1 )} & 0. Thus, taking this limit,
(8) is proved.
N (F ) = min{c; c ≥ 0, ∀ω ∈ F ω ≤ cω0 , ω0 ∈ S}
= inf max 2Dmax (ωkω0 ) ,
ω0 ∈S ω∈F
Define also
log2 N ε (F ) := inf max Dmax
ε
(ωkω0 ), (9)
ω0 ∈S ω∈F
6
4 An upper bound to capacity
Motivated by the consideration of the asymptotic efficiency of message sending
by the (poosiblly) noisy channels, let us consider a sequence of systems Sn
(n = 1, 2, · · · ) with the underlying ordered vector space B n with the unit element
un , the set Mn of all the measurements, and the set S n of all states. Let X n
be the set of messages to be send.
Then a channel is represented by a convex subset F n of S n , which phys-
ically corresponds to the set of states of output system of the (parallel use
of) channel(s). Let X n be the set of messages to be send, ω n (·) be a map
x ∈ X n → ω n (x) ∈ F n representing an encoder, and M n be a tuple of effects
M n = {mnx }x∈X n with
X
mnx ≤ un ,
x∈X n
a n ∞ 1 X n
C ({F }n=1 ) : = sup { R; lim n|
hω (x), mnx i ≥ a, |X n | = b2nR c}.
n n
{X ,ω (·),M }n=1n ∞ n→∞ |X n x∈X
0
These two definitions are equivalent indeed. Let C denote the RHS of the above
alternative definition. Since this alternative definition put no restriction on
0
|X n |, C ≥ C.
On the other hand, for any ε > 0, there is a sequence {X1n , ω1n (·), M1n }∞
n=1
and nε such that
1 0
∀n ≥ nε , log2 |X1n | ≥ C − ε.
n
0 0
If |X1n | > b2n(C −ε) c, we only have to reduce the seze of |X1n | to b2n(C −ε) c:
Align elements of X1n so that hω1n (x), mn1,x i is decreasing, let X n be the set of
0
−ε)
the first b2n(C c- th elements of X1n , and let ω n (x) = ω1n (x) and mnx = mn1,x .
0
−ε)
Then |X n | = b2n(C c and the average success probability does not decrease:
1 X n 1 X n
n
hω (x), mnx i ≥ n| hω1 (x), mn1,x i.
|X | n
|X 1 n
x∈X x∈X
7
Obviously,
lim C a = sup C a
a↓0 a>0
1 X n
= sup sup{R ; lim hω (x), mnx i ≥ a, |X n | = b2nR c}
{X n ,ω n (·),M n }∞ a>0 n→∞ |X n |
n=1 n x∈X
= C.
and
sup{c; ∃a ∈ A, (c, a) ∈ Z c } = inf{c; ∀a ∈ A, (c, a) ∈ Z}
respectively, we have (11).
8
where we had used the above lemma. For any ω 0 (·) with ω 0 (x) ∈ Bε1 (ω(x)), and
cω0 with ω 0 (x) ≤ cω0 ,
X X
hω(x), mx i ≤ hω 0 (x), mx i + ε|X |
x∈X x∈X
X
≤c hω0 , mx i + ε|X |
x∈X
= c + ε|X |.
Thus, taking the infimum about c and ω0 , we have the asserted inequality.
Lemma 5
( )
a 1 X
n
C ≥ lim sup log2 hω (x), mnx i n
− a|X | , (13)
n→∞ n X n ,ω n (·),M n
x∈X n
and
( )
a 1 X
n
C ≤ lim sup log2 hω (x), mnx i n
− (a − δ) |X | , ∀ δ ∈ (0, a),
n→∞ n X n ,ω n (·),M n
x∈X n
(14)
where |X n | = 2nR .
Proof. To show (13), observe
1 X n
Ca ≥ sup { R; ∀n, hω (x), mnx i ≥ a}.
{X n ,ω n (·),M n }∞ |X n | n
n=1 x∈X
(The RHS is well-defined, since the set { R; ∀n, |X1n | x∈X n hω n (x), mnx i ≥ a}
P
is not empty: for any channel, 0 (which means |X n | = 1 for all n) is an element
of this set.) Rearrange the terms of the inequality
1 X n
hω (x), mnx i ≤ 1 + a
|X n | n x∈X
to obtain X
|X n | ≥ hω n (x), mnx i − a|X n |.
x∈X n
Therefore,
1 X n
sup {|X n |; hω (x), mnx i ≥ a}
X n ,ω n (·),M n |X n | n
x∈X
1 X
≥ sup {|X n |; n hω n (x), mnx i ≥ a}
X n ,ω n (·),M n |X | n
x∈X
( )
X X
n n n n n n
≥ sup hω (x), mx i − a|X | ; hω (x), mx i − a|X | ≥ 0
X n ω n (·),M n x∈X n x∈X n
X
n
= sup { hω (x), mnx i − a|X | }. n
X n ω n (·),M n x∈X n
9
By |X n | = 2nR taking limn→∞ of both ends, we have the asserted inequality.
To show (14), consider {X n ω n (·), M n } such that limn→∞ |X1n | x∈X n hω n (x), mnx i ≥
P
or equivalently,
X δ nk
hω nk (x), mnx k i − ( a − δ) |X nk | ≥ |X |.
2
x∈X nk
Theorem 6
1 1
C a ≤ lim lim log2 N ε (F n ) = lim lim inf ε
sup Dmax (ω n kω0n ),
n
ε↓0 n→∞ ε↓0 n→∞ n ω0n ∈S n ω n ∈F n
1 1
C ≤ lim lim log2 N ε (F n ) = lim lim inf ε
sup Dmax (ω n kω0n ), (15)
ε↓0 n→∞ n ε↓0 n→∞ n ω0n ∈S n ω n ∈F n
Proof. By (12),
( )
X
a−δ n n
log2 N (F ) ≥ sup log2 hω (x), mnx i n
− (a − δ) |X | ,
X n ,ω n (·),M n x∈X n
10
Lemma 7 If g ∈ Gn and η ∈ (B n )∗ ,
kg(η)k1 = kηk1 .
kg ∗ (m)k = sup hω, g ∗ (m)i = sup hg(ω), m)i = sup hg(ω), m)i = kmk .
ω∈S n ω∈S n g(ω)∈S n
Similarly,
(g −1 )∗ (m)
=
(g ∗ )−1 (m)
= kmk. Thus, kmk ≤ 1 is equivalent to
kg ∗ (m)k ≤ 1. Therefore,
kg(η)k1 = sup hg(η), mi = sup hη, g ∗ (m)i = sup hη, g ∗ (m)i = kηk1 .
m:kmk≤1 m:kmk≤1 m:kg ∗ (m)k≤1
∀ω ∈ S n ∃ξ ∈ Bε (ω), ξ ≤ N ε (S n )ωm .
(N ε (S n ) + δ)ω 0 − ξg ≥ 0.
Therefore,
Z Z
0
0 ≤ N ε (S n ) g(ωm )dµn (g) − ξg dµn (g)
g∈Gn g∈Gn
Z
= N ε (S n )ωm − ξg dµn (g).
g∈Gn
11
Also,
Z
Z
n n
ω − ξg dµ (g)
=
(ω − ξg )dµ (g)
g∈Gn 1 g∈Gn 1
Z
n
≤ kω − ξg k1 dµ (g) ≤ ε.
g∈Gn
Thus
kηk1 := sup hη, m0 i = 2 sup hη, mi.
m0 :−u≤m0 ≤u m:0≤m≤u
Proof. Since ”≤” is trivial by definition, we prove ”≥”. By the previus lemma,
there is η+ ≥ 0 with
η ≤ η+
and kη+ k1 = 2 kηk1 = hη+ , ui. Combined with
η+ ≤ kη+ k1 N (S n )ωm ,
we have
η ≤ 2 kηk1 N (S n )ωm .
Also, by Lemma 9, there is ξ ∈ (B n )∗ with kξ − ωk1 ≤ ε satisfies
ξ ≤ N ε (S n )ωm .
12
If η := ω − ξ,
ω = ξ + η ≤ (N ε (S n ) + 2 kηk1 N (S n )) ωm
≤ (N ε (S n ) + 2εN (S n )) ωm .
Therefore, by definitin of N (S n ),
N (S n ) ≤ N ε (S n ) + 2εN (S n ).
6 Quantum channels
Let the system Sn is the quantum system with underlying Hilbert space H⊗n ,
and the measurement set Mn be the set of all the positive operator valued
measures over finite sets. In this case, the bound (15) is tight. as is shown
below. In particular, if channel F n is the convex closure of F ⊗n ( F ⊂ S),
or equivalentlly, the repeted use of memoryless channel with separable signal
states, (15) equals celebrated Holevo’s bound.
13
By (15),
1 ε
C ≤ lim inf sup lim D (ω n kω0n )
ε↓0 {ω0n ∈S n }∞ ∞
n=1 {ω n ∈F n }n=1
n→∞ n max
1 ε
= sup inf sup lim Dmax (ω n kω0n )
ε:ε>0 {ω0n ∈S n }∞
n=1 {ω n ∈F n }∞ n→∞ n
n=1
1 ε
≤ inf sup sup lim D (ω n kω0n )
{ω0n ∈S n }∞ ∞
n=1 {ω n ∈F n }n=1 ε:ε>0
n→∞ n max
= inf sup D(({ω n } k {ω0n }) , (16)
{ω0n ∈S n }∞ ∞
n=1 {ω n ∈F n }n=1
inf inf {λ; lim fn (λ, an ) = 0} = inf {λ; lim min fn (λ, an ) = 0} (19)
{an }∞
n=1
λ n→∞ λ n→∞ an ∈An
and
sup inf {λ; lim gn (λ, an ) = 0} = inf {λ; lim sup gn (λ, an ) = 0}. (20)
{an }∞ λ n→∞ λ n→∞ an ∈An
n=1
14
Proof. By (10),
∃{an }∞
n=1 , lim fn (λ, an ) = 0 ⇔ lim min fn (λ, an ) = 0.
n→∞ n→∞ an ∈An
proving ⇒.
Next, by (11),
∀{an }∞
n=1 , lim gn (λ, an ) = 0 ⇔ lim sup gn (λ, an ) = 0.
n→∞ n→∞ an ∈An
⇐ is shown by observing
1
where ε := 2 limn→∞ supan ∈An gn (λ, an ) > 0, we have
1
= lim sup gn (λ, an ) > 0.
2 n→∞ an ∈An
15
By (19) and (20), (17) is rewritten to
( )
X
n n
n nλ n
C = inf λ; lim sup min n n
p (ω )tr ω − 2 ω0 + = 0
n→∞ pn ω0 ∈S
ω n ∈F n
( )
X
pn (ω n )tr ω n − 2nλ ω0n + = 0
= inf λ; lim min
n n
sup
n→∞ ω0 ∈S pn
ω n ∈F n
n nλ n
= inf λ; lim min
n n
sup tr ω − 2 ω0 + = 0 ,
n→∞ ω0 ∈S ω n ∈F n
where to derive the second identity, we had used Fan’s minimax theorem:
which is (18).
If F n represents repeated use of the memoryless channel represented by F ,
or equivalently F n is the convex closure of F ⊗n , the RHS of (18) equals so-called
Holevo bound of the quantum-classical channel,
C({F n }∞ n ∞
n=1 ) ≥ C({F }n=1 ).
C({F n }∞ n ∞
n=1 ) = C({F }n=1 ) = inf sup D(ωkω0 ).
ω0 ∈S ω∈F
16
References
[1] J. M. Borwein and D. Zhuang, On Fan’s minimax theorem, Math. Program-
ming, Vol. 34, 232–234 (1986).
[2] Datta, Nilanjana. ”Min-and max-relative entropies and a new entanglement
monotone.” IEEE Transactions on Information Theory 55.6 (2009): 2816-
2826.
[3] Nilanjana Datta, Renato Renner: Smooth Renyi Entropies and the Quantum
Information Spectrum, IEEE Transactions on Information Theory, vol. 55,
pp. 2807-2815, 2009 (Doi: 10.1109/TIT.2009.2018340)
[4] M. Hayashi and H. Nagaoka, General formulas for capacity of classical-
quantum channels. IEEE Trans. Info. Theory, 49:1753, 2003.
[5] Ogawa, Tomohiro, and Hiroshi Nagaoka. ”Strong converse to the quantum
channel coding theorem.” IEEE Transactions on Information Theory 45.7
(1999): 2486-2489.
[6] Andreas Winter, Coding Theorems of Quantum Information Theory, Ph.D.
dissertation, Uni Bielefeld, 1999. (https://arxiv.org/abs/quant-ph/9907077)
sup f (M ).
M ∈Ω,G(M )≥0
Therefore,
sup f (M ) ≤ inf ϕ(η), (21)
M ∈Ω,G(M )≥0 η≥0
where
ϕ (η) := sup f (M ) + hη, G (M )i .
M ∈Ω,G(M )≥0
The inequality (21) is called weak duality. Remark that neither f , G, nor Ω need
to be convex for the weak duality to hold. However, for the inequality (21) to
17
saturate, these conditions are necessary. The following lemma gives conditions
for strong duality, or the saturation of the inequality.
Lemma 15 (Theorem 1, Section 8.6 of [?] ) Suppose that f , G and Ω are con-
vex. Suppose also there exists an M1 such that G (M1 ) > 0 and sup {f (M ) ; G (M ) ≥ 0, M ∈ Ω}
is finite. Then
sup f (M ) = min ϕ (η) ,
M ∈Ω,G(M )≥0 η≥0
hη0 , G (M0 )i = 0
18