Professional Documents
Culture Documents
Stochastic Processes in Classical and Quantum Physics and Engineering, Parthasarathy, 2023
Stochastic Processes in Classical and Quantum Physics and Engineering, Parthasarathy, 2023
Harish Parthasarathy
Professor
Electronics & Communication Engineering
Netaji Subhas Institute of Technology (NSIT)
New Delhi, Delhi-110078
First published 2023
by CRC Press
4 Park Square, Milton Park, Abingdon, Oxon, OX14 4RN
and by CRC Press
6000 Broken Sound Parkway NW, Suite 300, Boca Raton, FL 33487-2742
© 2023 Manakin Press
CRC Press is an imprint of Informa UK Limited
The right of Harish Parthasarathy to be identified as the author of this work has been
asserted in accordance with sections 77 and 78 of the Copyright, Designs and Patents Act
1988.
All rights reserved. No part of this book may be reprinted or reproduced or utilised in any
form or by any electronic, mechanical, or other means, now known or hereafter invented,
including photocopying and recording, or in any information storage or retrieval system,
without permission in writing from the publishers.
For permission to photocopy or use material electronically from this work, access www.
copyright.com or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood
Drive, Danvers, MA 01923, 978-750-8400. For works that are not available on CCC
please contact mpkbookspermissions@tandf.co.uk
Trademark notice: Product or corporate names may be trademarks or registered
trademarks, and are used only for identification and explanation without intent to infringe.
Print edition not for sale in South Asia (India, Sri Lanka, Nepal, Bangladesh, Pakistan or
Bhutan).
British Library Cataloguing-in-Publication Data
A catalogue record for this book is available from the British Library
ISBN: 9781032405391 (hbk)
ISBN: 9781032405414 (pbk)
ISBN: 9781003353553 (ebk)
DOI: 10.4324/9781003353553
Typeset in Arial, BookManOldStyle, CourierNewPS, MS-Mincho, Symbol,
TimesNewRoman and Wingdings
by Manakin Press, Delhi
2
Preface
Preface
This book covers a wide range of problems involving the applications of
stochastic processes, stochastic calculus, large deviation theory, group represen
tation theory and quantum statistics to diverse fields in dynamical systems, elec
tromagnetics, statistical signal processing, quantum information theory, quan
tum neural network theory, quantum filtering theory, quantum electrodynam
ics, quantum general relativity, string theory, problems in biology and classical
and quantum fluid dynamics. The selection of the problems has been based
on courses taught by the author to undergraduates and postgraduates in elec
tronics and communication engineering. The theory of stochastic processes has
been applied to problems of system identification based on time series models of
the process concerned to derive time and order recursive parameter estimation
followed by statistical performance analysis. The theory of matrices has been
applied to problems such as quantum hypothesis testing between two states in
cluding a proof of the quantum Stein Lemma involving asymptotic formulas for
the error probabilities involved in discriminating between a sequence of ordered
pairs of tensor product states. This problem is the quantum generalization of
the large deviation result regarding discriminating between to probability dis
tributions from iid measurements. The error rates are obtained in terms of the
quantum relative entropy between the two states. Classical neural networks are
used to match an inputoutput signal pair. This means that the weights of the
network are trained so that it takes as input the true system input and repro
duces a good approximaition of the system output. But if the input and output
signals are stochastic processes characterized by probability densities and not
by signal values, then the network must take as input the input probability
distribution and reproduce a good approximation of the output probability dis
tribution. This is a very hard task since a probability distribution is specified
by an infinite number of values. A quantum neural network solves this problem
easily since Schrodinger’s equation evolves a wave function whose magnitude
square is always a probability density and hence by controlling the potential
in this equation using the input pdf (probability distribution function) and the
error between the desired output pdf and that generated by Schrodinger’s equa
tion, we are able to track a time varying output pdf successfully during the
training process. This has applications to the problem of transforming speech
data to EEG data. Speech is characterized by a pdf and so is EEG. Further the
speech pdf is correlated to the EEG pdf because if the brain acquires a disease,
the EEG signal will get distorted and correspondingly the speech signal will also
get slurred. Thus, the quantum neural network based on Schrodinger’s equation
can exploit this correlation to get a fairly good approximation of the EEG pdf
given the speech pdf. These issues have been discussed in this book. Other prob
lems include a proof of the celebrated Lieb’s inequality which is used in proving
convexity and concavity properties of various kinds of quantum entropies and
quantum relative entropies that are used in establishing coding theorems in
quantum information theory like what is the maximum rate at which classical
information can be transmitted over a Cq channel so that a decoder with error
probability converging to zero can be constructed. Some of the inequalities dis
v
3
vi
4
chapter 2
2.1 String theory and large deviations
2.2 Evaluating quantum string amplitudes using functional integration
2.3 LDP for Poisson fields
2.4 LDP for strings perturbed by mixture of Gaussian and Poisson noise
chapter 3
ordinary, partial and stochastic differential equation models for biological
systems,quantum gravity, Hawking radiation, quantum fluids,statistical signal
processing, group representations, large deviations in statistical physics
3.1 A lecture on biological aspects of virus attacking the immune system
3.2 More on quantum gravity
3.3 Large deviation properties of virus combat with antibodies
3.4 Large deviations problems in quantum fluid dynamics
3.5 Lecture schedule for statistical signal processing
3.6 Basic concepts in group theory and group representation theory
3.7 Some remarks on quantum gravity using the ADM action
3.8 Diffusion exit problem
3.9 Large deviations for the Boltzmann kinetic transport equation
3.10 Orbital integrals for SL(2, R) with applications to statistical image pro
cessing.
3.11 Loop Quantum Gravity (LQG)
vii
Brief Contents
ix
Detailed Contents
1. Stability Theory of Nonlinear Systems Based on Taylor
Approximations of the Nonlinearity 1–10
1.1 Stochastic Perturbation of Nonlinear Dynamical Systems:
Linearized Analysis 1
1.2 Large Deviation Analysis of the Stability Problem 3
1.3 LDP for Yang-Mills Gauge Fields and LDP for the Quantized
Metric Tensor of Spacetime 4
1.4 String in Background Fields 6
2. String Theory and Large Deviations 11–14
2.1 Evaluating Quantum String Amplitudes Using
Functional Integration 11
2.2 LDP for Poisson Fields 12
2.3 LDP for Strings Perturbed by Mixture of Gaussian and
Poisson Noise 13
3. Ordinary, Partial and Stochastic Differential Equation Models for
Biological Systems, Quantum Gravity, Hawking Radiation,
Quantum Fluids, Statistical Signal Processing,
Group Representations, Large Deviations in Statistical Physics 15–60
3.1 A lecture on Biological Aspects of Virus Attacking
the Immune System 15
3.2 More on Quantum Gravity 23
3.3 Large Deviation Properties of Virus Combat with Antibodies 25
3.4 Large Deviations Problems in Quantum Fluid Dynamics 31
3.5 Lecture Schedule for Statistical Signal Processing 34
3.6 Basic Concepts in Group Theory and Group Representation Theory 37
3.7 Some Remarks on Quantum Gravity Using the ADM Action 38
3.8 Diffusion Exit Problem 39
3.9 Large Deviations for the Boltzmann Kinetic Transport Equation 41
3.10 Orbital Integrals for SL(2,R) with Applications to
Statistical Image Processing 42
3.11 Loop Quantum Gravity (LQG) 44
3.12 Group Representations in Relativistic Image Processing 45
3.13 Approximate Solution to the Wave Equation in the Vicinity of
the Event Horizon of a Schwarchild Black-hole 48
3.14 More on Group Theoretic Image Processing 51
3.15 A Problem in Quantum Information Theory 58
3.16 Hawking Radiation 59
xi
4. Research Topics on Representations 61–74
4.1 Remarks on Induced Representations and Haar Integrals 61
4.2 On the Dynamics of Breathing in the Presence of a Mask 66
4.3 Plancherel Formula for SL(2,R) 69
4.4 Some Applications of the Plancherel Formula for a Lie Group to
Image Processing 71
5. Some Problems in Quantum Information Theory 75–86
5.1 Quantum Information Theory and Quantum Blackhole Physics 81
5.2 Harmonic Analysis on the Schwarz Space of G = SL(2,R) 83
5.3 Monotonicity of Quantum Entropy Under Pinching 84
6. Lectures on Statistical Signal Processing 87–106
6.1 Large Deviation Analysis of the RLS Algorithm 88
6.2 Problem Suggested
6.3 Fidelity Between Two Quantum States 90
6.4 Stinespring’s Representation of a TPCP Map, i.e,
of a Quantum Operation 92
6.5 RLS Lattice Algorithm, Statistical Performance Analysis 93
6.6 Approximate Computation of the Matrix Elements of
the Principal and Discrete Series for G = SL(2,R) 94
6.7 Proof of the Converse Part of the Cq Coding Theorem 95
6.8 Proof of the Direct part of the Cq Coding Theorem 100
6.9 Induced Representations 102
6.10 Estimating Non-uniform Transmission Line Parameters and
Line Voltage and Current Using the Extended Kalman Filter 104
6.11 Multipole Expansion of the Magnetic Field Produced
by a Time Varying Current Distribution 105
7. Some Aspects of Stochastic Processes 107–126
7.1 Problem Suggested by Prof. Dhananjay 121
7.2 Converse of the Shannon Coding Theorem 122
8. Problems in Electromagnetics and Gravitation 127–128
9. Some More Aspects of Quantum Information Theory 129–140
9.1 An Expression for State Detection Error 129
9.2 Operator Convex Functions and Operator Monotone Functions 131
9.3 A Selection of Matrix Inequalities 133
9.4 Matrix Holder Inequality 138
10. Lecture Plan for Statistical Signal Processing 141–182
10.1 End-Semester Exam, Duration: Four Hours, Pattern Recognition 143
xii
10.2 Quantum Stein’s Theorem 153
10.3 Question Paper, Short Test, Statistical Signal Processing 154
10.4 Question Paper on Statistical Signal Processing, Long Test 157
10.5 Quantum Neural Networks for Estimating EEG pdf from
Speech pdf Based on Schrodinger’s Equation 166
10.6 Statistical Signal Processing 173
10.7 Monotonocity of Quantum Relative Renyi Entropy 178
11. My Commentary on the Ashtavakra Gita 183–186
12 LDP-UKF Based Controller 187–192
13. Abstracts for a Book on Quantum Signal Processing Using
Quantum Field Theory 193–200
14. Quantum Neural Networks and Learning Using Mixed State
Dynamics for Open Quantum Systems 201–208
15. Quantum Gate Design, Magnet Motion, Quantum Filtering,
Wigner Distributions 209–238
15.1 Designing a Quantum Gate Using QED with a Control
c-number Four Potential 209
15.2 On the Motion of a Magnet Through a Pipe with a Coil Under
Gravity and the Reaction Force Exerted by the Induced Current
Through the Coil on the Magnet 211
15.3 Quantum Filtering Theory Applied to Quantum Field Theory for
Tracking a Given Probability Density Function 219
15.4 Lecture on the Role of Statistical Signal Processing in General
Relativity and Quantum Mechanics 221
15.5 Problems in Classical and Quantum Stochastic Calculus 223
15.6 Some Remarks on the Equilibrium Fokker-Planck Equation for
the Gibbs Distribution 226
15.7 Wigner-distributions in Quantum Mechanics 228
15.8 Motion of An Element having Both an Electric and a Magnetic
Dipole Moment in an External Electromagnetic Field Plus a Coil 236
16. Large Deviations for Markov Chains, Quantum Master
Equation, Wigner Distribution for the Belavkin Filter 239–248
16.1 Large Deviation Rate Function for the Empirical
Distribution of a Markov Chain 239
16.2 Quantum Master Equation in General Relativity 241
16.3 Belavkin Filter for the Wigner Distirbution Function 246
xiii
17. String and Field Theory with Large Deviations, Stochastic
Algorithm Convergence 249–260
17.1 String Theoretic Corrections to Field Theories 249
17.2 Large Deviations Problems in String Theory 250
17.3 Convergence Analysis of the LMS Algorithm 252
17.4 String Interacting with Scalar Field, Gauge Field and
a Gravitational Field 253
xiv
Chapter 1
Chapter 1
Stability Theory
Stability theory of
ofNonlinear
Systems based
nonlinear systems Based on
on
Taylor
Taylor Approximations
approximations of of
the Nonlinearity
the nonlinearity
dx(t)/dt = f (x(t)), t ≥ 0
δx(t) = exp(tA)δx(0), t ≥ 0
where
A = f � (x0 )
is the Jacobian matrix of f at the equilibrium point x0 and hence it follows that
this linearized system is asymptotically stable under small perturbations in all
1
2 Stochastic Processes in Classical and Quantum Physics and Engineering
10CHAPTER 1. STABILITY THEORY OF NONLINEAR SYSTEMS BASED ON TAYLO
where w(t) is stationary Gaussian noise with mean zero and autcorrelation
Rw (τ ) = E(w(t + τ )w(t))
Let δx(t) be a small perturbation around x0 (t) so that x(t) = x0 (t) + δx(t)
solves the noisy problem. Then, assuming w(t) to be small and δx(t) to be of
the ”same order or magnitude” as w(t), we get approximately upto linear orders
in w(t), the linearized equation
where
Φ(t) = exp(tA)
Assuming δx(0) to be a nonrandom initial perturbation, we get
E(δx(t)) = Φ(t)δx(0)
Then
r
r
Φ(t) = exp(−tλ(k))Ek
k=1
and we get
r
r f ∞ f ∞
, Px = T r(Ek .Rw (t1 − t2 ).Em )exp(−λ(k)t1 − λ(m)t2 )dt1 dt2
k,m=1 0 0
A sufficient condition for this to be finite is that |Rw (t)| remain bounded in
which case we get when
that
r
r
Px ≤ B (Re(λ(k) + λ(m)))−1 ≤ 2Br2 /λR (min)
k,m=1
where
λR (min) = mink Re(λ(k))
Let PE (x) denote the probability of a sample path of the process that starts at
x0 end ends at the stop time τE = t at which the process first hits the boundary
S. Then, from basic LDP, we know that the approximate probability of such a
path is given by
exp(−E−1 infy∈S V (x0 , y, t))
where
f t
V (x, t) = infu:dx(s)/ds=f (x(s))+g(x(s))u(s),0≤s≤t,x(0)=x0 u(s)2 ds/2
0
4 12CHAPTERStochastic Processes
1. STABILITY in Classical
THEORY OF and Quantum Physics
NONLINEAR SYSTEMSand Engineering
BASED ON TAYLO
and
V (x0 , y, t) = infx:x(0)=x0 ,x(t)=y V (x0 , x, t)
The greater the probability of the process hitting S, the lesser the mean value
of the hitting time. This is valid with an inverse proportion. Thus, we can say
that
τE ≈ exp(V /E), E → 0
where
V = infy∈S,t≥0 V (x0 , y, t)
More precisely, we have the LDP
Note that τE = exp(V /E) for a given path x implies that the probability of x
PE (x) = exp(−V /E). This is because of the relative frequency interpretation of
probability given by Von Mises or equivalently because of ergodic theory: If the
path hits the boundary N (E) times in a total of N0 trials, with the time taken
to hit being τE for each path, then the probability of hitting the boundary is
N (E)/N . However, since it takes a time τE for each trial hit, it follows that the
number of hits in time T is T /τE . Thus if each trial run in the set of N trials is
of duration T , then the probability of hit is
PE = N (E)/N = T /(N.τE )
τE ≈ exp(V /E)
1
Note that the time derivatives N ' , N a of N and N a do not appear in L. QGTR
is therefore described by a constrained Hamiltonian with the constraints being
P N = ∂L/∂N ' = 0,
H = ∂L/∂N = 0
H(N )ψ = 0∀N
where
H(N ) = N.Hd3 x
Dµ = ∂µ + Γµ
denote the spinor gravitational covariant derivative. Then the YangMills field
tensor is given by
Fµν = Dµ Aν − Dν Aµ + [Aµ , Aν ]
√
SY M (A) = g µα g νβ T r(Fµν Fαβ ) −gd4 x
Problem: Derive the canonical momentum fields for this curved spacetime
YangMills Lagrangian and hence obtain a formula for the canonical Hamilto
nian density.
6 Stochastic Processes in Classical and Quantum Physics and Engineering
14CHAPTER 1. STABILITY THEORY OF NONLINEAR SYSTEMS BASED ON TAYLO
are
where δ is a small perturbation parameter and ((η ab )) = diag[1, −1] The prob
lem is to derive the correction to string propagator caused by the nonlinear
perturbation term. The propagator is
Dµν (τ, σ|τ 1 , σ 1 ) =< 0|T (X µ (τ, σ).X ν (τ 1 , σ 1 ))|0 >
and we see that it satisfies the following fundamental propagator equations: let
x = (τ, σ), x1 = (τ 1 , σ 1 ), ∂0 = ∂τ , ∂1 = ∂σ
Then, 1
∂0 Dµν (x|x1 ) = δ(x0 − x 0 ) < [X µ (x), X ν (x1 )] >
µ
+ < T (X,0 (x).X ν (x1 )) >
µ
=< T (X,0 (x).X ν (x1 ) >
and likewise
1 µ
∂02 Dµν (x|x1 ) = δ(x0 − x 0 ) < [X,0 (x), X ν (x1 )] >
Stochastic Processes in Classical and Quantum Physics and Engineering 7
1.4. STRING IN BACKGROUND FIELDS 15
µ
+ < T (X,00 (x).X ν (x� )) >
� µ
= δ(x0 − x 0 ) < [X,0 (x), X ν (x� )] >
+∂12 Dµν (x|x� ) + δ. < T (F µ (X, X,a X,b )(x).X ν (x� )) >
or equivalently
η ab ∂a ∂b D(x|x� ) =
δ. < T (F µ (X, X,a X,b )(x).X ν (x� )) >
� µ
+δ(x0 − x 0 ) < [X,0 (x), X ν (x� )] > − − −(1)
Now consider the canonical momenta
µ ν
Pµ (x) = ∂L/∂X,0 = gµν (X(x))X,0 (x)
ν
−2Hµν (X(x))X,1 (x)
Using the canonical equal time commutation relations
so that
ν
[X µ (τ, σ), X,1 (τ, σ � )] = 0
and
[X µ (τ, σ), Pν (τ, σ � )] = iδµν δ(σ − σ � )
we therefore deduce that
ρ
[X µ (τ, σ), gνρ (X(τ, σ � ))X,0 (τ, σ � )] = iδµν δ(σ − σ � )
or equivalently,
ν
[X µ (τ, σ), X,0 (τ, σ � )] = ig µν (X(τ, σ))δ(σ − σ � )
to get
µ
η ab X0,ab =0
µ
η ab δX,ab = δ.F µ (X0 , X0' , X0'' )
and solve this to express δX µ as the sum of a classical part (ie a particular
solution to the inhomogeneous equation) and a quantum part (ie the general
solution to the homogeneous equation). The quantum component is
2
δXqµ (τ, σ) = −i (aµ (n)/n)exp(in(τ − σ))
n=0
where
[aµ (n), aν (m)] = nη µν δ[n + m]
Note that in particular, DX0µ (x) = 0 and hence the approximate corrected
propagator equation becomes
if we do not attach the perturbation tag δ to the F term, ie, we do not regard
F as small, then the unperturbed Bosonic string X0 satisfies
and then the corrected propagator Dµν satisfies the approximate equation
DDµν (x|x' ) =
µ
+δ.F,1ρ (X0 , X0' , X0'' )(x) < T (δX ρ (x)δX ν (x' )) >0
µ
+δ.F,2ρa (X, X ' , X '' )(x) < T (δX,a
ρ
(x).δX ν (x' )) >0
µ ρ
+δ.F,3ρab (X, X ' , X '' )(x) < T (δX,ab (x).δX ν (x' )) >0
−iη µν δ 2 (x − x' )
provided that we assume that the metric is a very small perturbation of that
of Minkowski flat spacetime. Here, < . >0 denotes the unperturbed quantum
average, ie, the quantum average in the absence of the external field Hµν and
also in the absence of metric perturbations of flat spacetime. We write
so
< T (δX µ (x).δX ν (x' )) >0 =
L
− (θ(τ − τ ' )/n)exp(−in(τ − τ ' − σ + σ ' ))
n>0
L
− (θ(τ ' − τ )/n).exp(−in(τ ' − τ − σ ' + σ))
n>0
µ
< T (δX,0 (x).δX ν (x' )) >0 =
L
− (θ(τ − τ ' )/n)(−in)exp(−in(τ − τ ' − σ + σ ' ))
n>0
L
− (θ(τ ' − τ )/n).(in)exp(−in(τ ' − τ − σ ' + σ))
n>0
L
=i (θ(τ − τ ' )exp(−in(τ − τ ' − σ + σ ' ))
n>0
L
−i (θ(τ ' − τ )exp(−in(τ ' − τ − σ ' + σ))
n>0
µ
< T (δX,1 (x).δX ν (x' )) >0 =
10 Stochastic Processes in Classical and Quantum Physics and Engineering
18CHAPTER 1. STABILITY THEORY OF NONLINEAR SYSTEMS BASED ON TAYLO
�
− (θ(τ − τ � )/n)(in)exp(−in(τ − τ � − σ + σ � ))
n>0
�
− (θ(τ � − τ )/n).(−in)exp(−in(τ � − τ − σ � + σ))
n>0
�
= −i (θ(τ − τ � )exp(−in(τ − τ � − σ + σ � ))
n>0
�
+i (θ(τ � − τ )exp(−in(τ � − τ − σ � + σ))
n>0
µ
= − < T (δX,0 (x).δX ν (x� )) >0
Note that we have the alternate expression
µ
< T (δX,1 (x).δX ν (x� )) >0 =
∂1 D0µν (x|x� )
because differentiation w.r.t the spatial variable σ commutes with the time or
dering operation T . Further,
µ
< T (δX,00 (x).δX ν (x� )) >0
µ
=< T (δX,11 (x).δX ν (x� )) >0
= ∂12 D0µν (x|x� )
µ
< T (δX,01 (x).δX ν (x� )) >0
µ
= ∂1 < T (δX,0 (x).δX ν (x� )) >0
�
= ∂1 [i (θ(τ − τ � )exp(−in(τ − τ � − σ + σ � ))
n>0
�
−i (θ(τ � − τ )exp(−in(τ � − τ − σ � + σ))] =
n>0
�
i (θ(τ − τ � )(in)exp(−in(τ − τ � − σ + σ � ))
n>0
�
−i (θ(τ � − τ )(−in)exp(−in(τ � − τ − σ � + σ)) =
n>0
�
− (θ(τ − τ � )n.exp(−in(τ − τ � − σ + σ � ))
n>0
�
− (θ(τ � − τ )n.exp(−in(τ � − τ − σ � + σ))
n>0
Chapter 2
Chapter 2
String Theory and Large
String theory and Deviations
large
deviations
we have
L
Pr (σr ) = X,τ (σr ) = a(n)exp(−inσr )
n=0
This equation constrains the values of a(n). We thus define the numbers Prn
by
L
Pr (σr ) = Prn exp(−inσr ) − − − (1)
n
and the numbers {Prn } characterize the momentum of the external line attached
at the point σr . Note that for reality of the momentum, we must have Prn =
;
Pr,−n if we assume that the Prn s are real and therefore
L
Pr (σr ) = 2 Prn cos(nσr )
n>0
19
11
12 Stochastic Processes in Classical and Quantum Physics and Engineering
20 CHAPTER 2. STRING THEORY AND LARGE DEVIATIONS
Note that Pr (σ) is the momentum field corresponding to the rth external line
;
and this is given to us. Thus, the Prn s are determined by the equation
L
Pr (σ) = 2 Prn cos(nσ)
n>0
H0 = (1/2)P µ Pµ + (1/2)∂1 X µ ∂1 Xµ
In the presence of a vector potential field Aµ (σ), the charged string field has the
Hamiltonian density H with Pµ replaced by Pµ + eAµ . Write down the string
field equations for this Hamiltonian and assuming that the vector potential is a
weak Gaussian field, evaluate the rate function of the string field X µ .
Thus, J
−a
log(ME (E f )) = λ(x)(exp(f (Ex)) − 1)dx
Stochastic Processes in Classical and Quantum Physics and Engineering 13
2.3. LDP FOR STRINGS PERTURBED BY MIXTURE OF GAUSSIAN AND POISSON N
�
= �−n λ(�−1 x)(exp(f (x)) − 1)dx
Then,
lim�→0 �n .log(M� (�−a f ))
�
= λ(∞) (exp(f (x)) − 1)dx
assuming that
λ(∞) = lim�→0 λ(x/�)
exists. More generally, suppose we assume that this limit exists and depends
only on the direction of the vector x ∈ Rn so that we can write
then we get for the GartnerEllis limiting logarithmic moment generating func
tion
lim�→0 �n .log(M� (�−a f ))
�
= λ(∞, x̂)(exp(f (x)) − 1)dx = Λ(f ¯ )
where W k are independent white Gaussian noise processes and Nk are indepen
dent Poisson processes. Assume that the rate of Nk is λk /�. Calculate then the
LDP rate functional of the string process X µ . Consider now a superstring with
Lagrangian density
¯ α ∂α ψ
L = (1/2)∂α X µ ∂ α Xµ + ψρ
Assume that Bµν (X) and Jα (X) are small Gaussian fields. Calculate the joint
rate function of the Bosonic and Fermionic string fields X µ , ψ.
Chapter 3
Chapter 3
Ordinary, Partial and
ordinary,Stochastic Differential
partial and
stochasticEquation Models for
differential
equation
Biologicalmodels
Systems,forQuantum
biological systems,quantum
Gravity, Hawking Radiation,
gravity,
QuantumHawking
Fluids,radiation,
Statistical
quantum fluids,statistical
Signal Processing, Group
signal processing, group
Representations, Large
representations, large
Deviations in Statistical
deviations in statistical
Physics
physics
23
15
16 24CHAPTERStochastic ProcessesPARTIAL
3. ORDINARY, in Classical
ANDandSTOCHASTIC
Quantum Physics and Engineering EQUATI
DIFFERENTIAL
write Eext , Bext for the externally applied electromagnetic field and Eint , Bint
for the internally generated field, then we have
where �
Aint (t, r) = (µ/4π) J(t − |r − r� |/c, r� )d3 r� /|r − r� |,
� t
Φint (t, r) = −c2 divAint dt
0
The total electromagnetic field within the blood fluid is the sum of these two
components:
E = Eext + Eint , B = Bext + Bint
We have control only over Eext , Bext and can use these control fields to ma
nipulate the blood fluid velocity appropriately so that the virus does not enter
into the lung region. Let rk (t) denote the position of the k th virus particle. It
satisfies the obvious differential equation
with initial conditions rk (0) depending on where on the surface of the body
the virus entered. If the virus is modeled as a quantum particle with nucleus
at the classical position rk (t) at time t, then the protein molecules that are
stuck to this virus surface have relative positions (rkj , j = 1, 2, ..., M } and their
joint quantum mechanical wave function ψk (t, rkj , j = 1, 2, ..., M ) will satisfy
the Schrodinger equation
M
�
ih∂t ψk (t, rkj , j = 1, 2, ..., M ) = (−h2 /2mkj )�2rkj ψk (t, rkj , j = 1, 2, ..., M )
j=1
M
�
+ ek Vk (t, rkj − rk (t))ψk (t, rkj , j = 1, 2, ..., M )
j=1
where Vk (t, r) is the binding electrostatic potential energy of the viral nucleus.
There may also be other vector potential and scalar potential terms in the above
Schrodinger equation, these potentials coming from the prevalent electromag
netic field. Some times it is more accurate to model the entire swarm of viruses
as a quantum field rather than as individual quantum particles but the result
ing dynamics becomes more complicated as it involves elaborate concepts from
quantum field theory. Suppose we have solved for the joint wave functions ψk
of the protein blobs on the k th virus. Then the average position of the lth blob
on the k th viral surface is given by
�
< rkl > (t) = rkl ψk (t, rkj , j = 1, 2, ..., M )ΠM 3
j=1 d rkj , 1 ≤ k ≤ M
18 Stochastic Processes in Classical and Quantum Physics and Engineering
26CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUATI
We consider the time at which these average positions of the lth protein blob on
the k th viral surface hits the boundary S = ∂V of a particular cell. This time
is given by
τ (k, l) = inf {t ≥ 0 :< rkl > (t) ∈ S}
At this time, replication of this protein blob due to cell ribosome action begins.
Let λ(n)dt denote the probability that in the time interval [t, t + dt], n protein
blobs will be generated by the ribosome action on a single blob. Let Nkl (t)
denote the number of blobs due to the (k, l)th one present at time t. Then for
where λ(kl, r, n)dt is the probability that in time [t, t + dt], n blobs will be
generated from r blobs. Clearly
λ(klr, n) = rλ(n − r)
because each protein blob generates blobs indepndently of the others and hence
the probability of two blobs generating more than one each is zero. In an
other discrete time version of blob replication we can make use of the theory of
branching processes.
Let X(n) denote the number of blobs at time n. At time n + 1, each blob
splits into several similar blobs, each splitting being independent of the others.
let ξ(n, r) denote the number of blobs into which the rth blob splits. Then, we
have the obvious relation
X(n)
�
X(n + 1) = ξ(n, r)
r=1
Hence, if
Fn (z) = E(z X(n)
is the moment generating function of X(n) and if
is the moment generating function of the number of blobs into which each blob
splits, then we get the recursion
The effect of quantum gravity on the motion of the virus globules: Since
the size of the virus is on the nano/quantum scale, the effects of gravity on it
motion will correspondingly be quantum mechanical effects. Consider the ADM
action for quantum gravity obtained as the integral
�
a �
S[qab , N, N ] = L(qab , qab , N, N a )d4 x
where qab are the spatial components of the metric tensor in the embedded
3dimensional space at each time t and N, N a appear in the Lagrangian but
not their time derivatives. The qab are six canonical position fields and the
corresponding canonical momentum fields are
�
P ab = ∂L/∂qab
and these turn out to be fourth degree polynomials in the momenta but highly
nonlinear in the position fields. Let eia denote a tetrad basis for the three
dimensional embedded manifold. Thus,
The metric relative to this tetrad basis is therefore the identity δij and the group
of transformations of this tetrad that preserve the metric is therefore SO(3)
which can be regarded as SU (2) using the adjoint map. Note that SU (2) is in
fact the covering group of SO(3). The spin connection involves representing the
spatial connection in the SU (2)spin Lie algebra representation. The form of
this connection can in terms of the spatial connection Γabc can be derived from
the fact that the corresponding covariant derivative Db should annihilate the
tetrad eia . Thus if Γjb
i
denotes the spin connection, then
or equivalently,
Γijb = −eja (ea,b
i
− Γcba eic ) = −eaj eia:b
where a double dot denotes the usual spatial covariant derivative. Equivalently,
we can lower the index i using the identity SO(3) metric and write
Thus, the spin connection Γija is antisymmetric w.r.t (ij) and hence can be
expressed as
Γija = Γka (τk )ij = Γka (τk )ij
where τi , i = 1, 2, 3 is the standard basis for the Lie algebra of SO(3) which
satisfy the usual angular momentum commutation relations:
[τi , τj ] = �(ijk)τk
Note that (τi )jk = �(ijk) upto a proportionality constant. We observe that the
spatial connection coefficients of the embedded 3D manifold are
and the spatial curvature tensor is constructed out of these in the usual way.
Our aim is to develop the calculus of general relativity in a form that resembles
the calculus of the YangMills nonAbelian gauge theories. This means that we
must introduce a nonAbelian SO(3) connection Γ = Γi τi so that the curvature
tensor has the representation
R = dΓ + Γ ∧ Γ = [d + Γ, d + Γ]
Γk = Γak dxa
or equivalently,
Γ = Γk τk = Γka τk dxa
and hence
= (Γm k l b
a,b − Γa Γb �(klm))τm dx ∧ dx
a
m
= Rba τm dxb ∧ dxa
where
m
Rba = (1/2)(Γm m k l
a,b − Γb,a ) − Γa Γb �(klm)
where
q µν = qab X,a
µ ν
X,b
µ
T µ = X,0 = N µ + N.nµ = N a X,a
µ
+ N.nµ
where N a , N satisfy the orthogonal relations
gµν (T µ − N a X,a
µ ν
)X,b = 0,
gµν nµ nν = 1
We can equivalently write
g µν = qab X,a
µ ν
X,b + N 2 (X,µ0 − N a X,a
µ ν
).(X,0 − N b X,b
ν
)
δg µν = δqab X,a
µ ν
X,b
µ
+2N.δN.(X,0 − N a X,a
µ ν
).(X,0 − N b X,b
ν
)
−N 2 δN a X,a
µ ν
(X,0 − N b X,b
ν
)
µ
−N 2 ((X,0 − N a X,a
µ ν
)X,b δN b
Given the metric quantum fluctuations, the corresponding quantum fluctuations
in the equation of motion of the virus particle will be
µ
d2 δX µ /dτ 2 + 2Γαβ (x)δX α dX α /dτ
dynamics of the virus but since the virus size is also of the atomic dimensions,
such a quantum probabilistic description is more accurate.
A cruder model for the virus motion would be to express its trajectory as
the sum of a classical and a quantum trajectory in accordance with Newton
Heisenberg equation of motion with a force that is the sum of a classical com
ponent and a purely quantum fluctuating component:
with r0 (t) a purely classical trajectory and δr(t) a quantum, ie, operator valued
trajectory, we get
d2 r0 (t)/dt2 = f0 (r0 (t)),
d2 δr(t)/dt2 = f0� (r0 (t))δr(t) + δf (r0 (t))
Here δf (r) is an operator valued field and f0 (r) is a classical cnumber field.
Expressing the first order perturbed trajectory in state variable form gives us
or equivalently � �
d δr(t)
dt δv(t)
� �� �
0 I δr(t)
=
f0� (r0 (t)) 0 δv(t)
� �
0
+
δf (r0 (t))
Now we can also include damping terms in this model by allowing the force to
be both position and velocity dependent:
d2 r0 (t)+δr(t))/dt2 = f0 (r0 (t)+δr(t), r0� (t)+δr� (t))+δf (r0 (t)+δr(t), r�0 (t)+δr� (t))
d2 δr(t)/dt2 = f0,1 (r0 (t), r�0 (t))δr(t) + F0,2 (r0 (t), r�0 (t))δr� (t) + δf (r0 (t), r0� (t))
After introducing the state vector as above, and solving the linearized equations
using the usual state transition matrix obtained in terms of the Dyson series
expansion, we get
� t
δr(t) = Φ(t, s)δf (r0 (s), r0� (s))ds
0
Stochastic Processes in Classical and Quantum Physics and Engineering 23
3.2. MORE ON QUANTUM GRAVITY 31
L
+E((1 − X(t)λ(r)dt)z X(t) )
r
So writing
F (t, z) = E(z X(t)
we get
where L
φ(z) = λ(r)z r
r
and hence we derive the following differential equation for the evolution of the
generating function of the number of particles at time t:
Note that,
J
qab = eia eib , det(q) = det(e)2 , det(q) = det(e) = det((eia ))
Db (Eia ) = 0
follows from
Db eia = 0
24 Stochastic Processes in Classical and Quantum Physics and Engineering
32CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUAT
we get
eai Fai = �(a1 a2 a3 )�(i1 i2 i3 )eai11 eai22 eai33
= 6.det((eai )) = 6.det(e)−1
The factor of 6 comes because of the identity
�(i1 i2 i3 )�(i1 i2 i3 ) = 6
More generally,
ebi Fai = �(aa2 a3 )�(i1 i2 i3 )eib1 eai22 eai33
= �(aa2 a3 )�(ba2 a3 )det(e)−1 = δab .6.det(e)−1
Thus,
Fai = 6.det(e)−1 eia
Taking the inverse gives
or equivalently,
Eia = 6.Fia
We can derive another alternate expression for Fia or equivalently for Eia . Let
Then,
eib Gai = �(aa2 a3 )�(ba2 a3 )det(e) = 6δba .det(e)
Thus
Gai = 6.eai det(e) = 6.Eia
In particular,
Da Eia = 0
or equivalently,
√
( qeai ),a − Γjia Eja = 0
Note that this is the same as
√ j a
q(eai:a − Γia ej ) = 0
ie
Da eia = 0
Substituting for the spin connection
j
Γia = Γjia = Γak (τk )ji = Γka �(kji)
we get
a
Ei,a − Γak �(kji)Eja = 0 − − − (1)
In other words, the covariant derivative of Eia w.r.t the spin connection Γka
vanishes. This result is of fundamental importance because it states that these
position fields Eia which are called the Ashtekar position fields, are covariantly
flat w.r.t the spin connection. We wish to express the SO(3) spin connection
Γka in terms of the Ashtekar canonical position fields Eia . To this end, we note
that (1) can be expressed as
Γka �(kji) =
Γabc = (1/2)((eia eib ),c + (eai eci ),b − (eci ebi ),a )
Jv (t, r) = −D�n(t, r)
and hence the conservation law for the virus number is given by
divJv + ∂t n = g
or equivalently,
∂t n(t, r) − �2 n(t, r) = g(t, r)
26 Stochastic Processes in Classical and Quantum Physics and Engineering
34CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUATI
Now let us take the drift current also into account caused by the presence of
charge on the virus body interacting with the external electric field/ Let v(t, r)
denote the velocity of the fluid in which the viurs particles are immersed. Then,
if E(t, r) is the electric field and µ the virus mobility, we have that the total
drift plus diffusion current density is given by
J = nµE − D�n
and its conservation law is given by
divJ + ∂t n = g
or equivalently,
∂t n − D�2 n + µndivE + µE.�n = g
Let q denote the charge carried by a single virus particle. Then the virus charge
density is given by
ρ = qn
and by Gauss’ law,
divE = −ρ/�
so our equation of continuity for the virus particles is given by
∂t n − D�2 n + (µq/�)n2 = g
This is the pde satisfied by the virus number density. If g(t, r) is the sum of a
random Gaussian field and a random Poisson field both of small amplitude, then
the problem is to calculate the rate functional of the field n(t, r). Using this
rate functional, we can predict the probability of the rare event that the virus
concentration in the fluid will exceed the critical concentration level defined by
the solution to the above pde without the generation term by an amount more
than a given threshold. Note that the mobility/drift term for the current is
obtained as follows: let v(t) denote the velocity of a given virus particle and m
its mass, γ the viscous damping coefficient. Then, the equation of motion of the
virus particle is given by
mv � (t) = −γv(t) + qE(t)
and at equilibrium, v � = 0 so that
v(t) = qE(t)/γ = µE(t), µ = q/γ
Now let us consider the motion of the virus particle taking into account random
fluctuating forces. Its equation of motion is
mdv(t) = −γv(t) + qE(t) + σ.dB(t)
where B is standard three dimensional Brownian motion. The solution to this
stochastic differential equation is given by
� � t
v(t) = exp(−γ(t − s)/m)(q/m)E(s)ds + σ exp(−γ(t − s)/m)dB(s)
0
Stochastic Processes in Classical and Quantum Physics and Engineering 27
3.3. LARGE DEVIATION PROPERTIES OF VIRUS COMBAT WITH ANTIBODIES35
Note that the pdf p(t, v) of v(t), ie, of a single virus particle satisfies the Fokker
Planck equation
p(t, v)d3 v gives us the probability that the virus particle will have a velocity in
the range d3 v around v. n(t, r)p(t, v)d3 rd3 v is the number of particles having
position in the range d3 r around r and having a velocity in the range of d3 v
around v.
MATLAB simulations for virus dynamics based on a stochastic differential
equation model:
[a] Simulation of blood velocity flow: The heart pumping force field per unit
volume can be represented as
f (t, r) = f0 (t)K(r)
where f0 (t) is a periodic function and K(r) is a point spread function that is
a smeared version of the Dirac δfunction at the point r0 , the position of the
heart. The equation of motion of a virus particle in this force field is given by
dr(t) = v(t)dt, mdv(t) = −γv(t)dt + f (t, r(t))dt + σdB(t) − −(1)
The corresponding pdf is p(t, r, v) in position space. If n0 is the total number
of virus particles in the fluid, then n0 p(t, r, v)d3 rd3 v equals the total number of
virus particles in the position range d3 r and having velocity in the range d3 v
centred around (r, v). It satisfies the FokkerPlanck equation
Simulation of blood flow in the body: The force exerted by the heart is
localized around the position of the heart and as such this force density field
can be represented as
f (t, r) = f0 (t)K(r)
where f0 (t) is a periodic function of time and K(r) is a smeared version of
the Dirac δfunction centred around the heart’s position. The fluiddynamical
equations to be simulated are
div(ρ.v) + ∂t ρ = g
where ρ(t, r) is the virus density and g(t, r) is the rate of generation of virus
mass per unit volume. Assuming an equation of state
p = F (ρ)
we get a set of four pde’s for the four fields ρ, v by replacing �p with F � (ρ)�ρ.
Standard CFD (Computational fluid dynamics) methods can be used to dis
cretize this dynamics and solve it in MATLAB.
The spread of virus from a body within a room: Let C(t) denote the concen
tration of the virus at a certain point within the room. It increases due to the
presence of a living organism carrying the virus and decreases due to the pres
ence of ventilation, medicine etc. other effects. Thus, the differential equation
for its dynamics can be expressed in the form
where I(t) denotes the rate of increase due to the external living organism and
X(t) denotes the other factors like medicine, ventilation etc. Formally, we can
solve this equation to give
� t
C(t) = Φ(t, 0))C(0) + Φ(t, s)I(s)ds
0
where � t
Φ(t, s) = exp(− β(X(τ ))dτ )
s
This dynamical equation can be generalized in two directions, one, by including
a spatial dependence of the virus concentration and describing the dynamics
using a partial differential equation, and two, by including stochastic terms.
Let C(t, r) denote the virus concentration in the room at the spatial location r
and at time t. The virus diffuses from a region of higher concentration to a region
of lower concentration in such a way that if D denotes the diffusion coefficient,
the virus diffusion current density, ie, number of virus particles flowing per unit
area per unit time at (t, r) is given by
J(t, r) = −D.�C(t, r)
Stochastic Processes in Classical and Quantum Physics and Engineering 29
3.3. LARGE DEVIATION PROPERTIES OF VIRUS COMBAT WITH ANTIBODIES37
Let X(t, r) denote the external factors at the point r at time t. Its effect on the
decrease in the virus concentration at the point r� at time t can be described
via an ”influence function” F (r� , r� ) which usually decreases with increase in the
spatial distance |r� − r|. So we can to a good degree of approximation, describe
the concentration dynamics as
�
∂t C(t, r) = I(t, r) − divJ(t, r) − F (r, r� )β(X(t, r� ))C(t, r� )d3 r�
�
= I(t, r) + D�2 C(t, r) − F (r, r� )β(X(t, r� )C(t, r� )d3 r�
This equation can be solved easily using Fourier transform methods. More
generally, we can consider a situation when the influence of external agents
located at different points r1 , ..., rn takes place via nonlinear interactions so
Volterra interaction terms are added to the dynamics to give
∂t C(t, r) =
��
= I(t, r)+D�2 C(t, r)− Fn (r, r1 , ..., rn )β(X(t, r1 ), ...,
n≥1
X(t, rn ))C(t, r1 )...C(t, rn )d3 r1 ...d3 rn
Of course, this equation does not describe memory in the action of external
agents. To take into account such memory effects, we further modify it to
∂t C(t, r) =
LJ
− Fn (r, r1 , ..., .rn )β(r1 , ..., rn , X(t−|r−r1 |/v, r1 ), ..., X(t−|r−rn |/v), rn )
n≥1
Remark: Consider the wave equation pde with source and nonlinear terms:
J
∂t2 F (t, r) − c2 �2 F (t, r) + δ. H(τ, r, F (t − τ, r' ))dτ d3 r' = s(t, r)
where
∂t2 F0 (t, r) − c2 �2 F0 (t, r) = s(t, r),
J
∂t2 F1 (t, r) 2 2
− c � F1 (t, r) = − H(τ, r, F0 (t − τ, r' ))dτ d3 r'
J
−1
F1 (t, r) = (4π) H(τ, r' , F0 (t − |r − r' |/c − τ, r'' ))dτ d3 r' d3 r''
It is clear from this expression therefore that delay terms coming from the finite
speed of propagation of the virus effects can be described using second order
partial differential equations in spacetime like the wave equation with source
terms.
div(v) = 0
where L is the nonlinear NavierStokes vector operator:
We can write
v = curlψ, divψ = 0, Ω = curlv = −�2 ψ
and then get
∂t ψ = F (ψ) + h
where now F is a nonlinear functional of the entire spatial vector field ψ and
h = −G.g = −G.curlf is the modified force field. Here F (ψ) is a functional of
ψ at different spatial points but at the same time t which is why after spatial
discretization, it can be regarded as a nonlinear state variable system. Ideally
speaking, we should therefore write
where D is the region of three dimensional space over which the fluid flows.
Note that
F (ψ) = −G.curl(�2 ψ × curlψ) + ηρ−1 �2 ψ
Quantization: Introduce an auxiliary field φ(t, r) and define an action functional
� � �
S[ψ, φ] = φ.partialt ψd rdt − φ.F (ψ)d rdt − φ.hd3 rdt
3 3
Note that � �
3
φ.∂t ψd rdt = (1/2) (φ.∂t ψ − ψ.∂t φ)d3 rdt
� �
φ.F (ψ)d3 rdt = − (Gφ).curl(�2 ψ × curlψ)d3 rdt
32 Stochastic Processes in Classical and Quantum Physics and Engineering
40CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUATI
�
= curl(Gφ).(�2 ψ × curlψ)d3 rdt
Pψ = δS/δ∂t ψ = φ/2,
Pφ = δS/δ∂t φ = −ψ/2
Then the Hamiltonian is given by
�
H = (Pψ .∂t ψ + Pφ .∂t φ)d3 r − S
� �
3
= φ.F (ψ)d rdt + φ.hd3 rdt
dψ/dt = δH/δPψ = 0
and likewise for φ. Note that these equations imply that ψ(t, r) = ψ(r) is time
independent and hence Pψ varies linearly with time.
Pψ = ∂L/∂∂t ψ = iψ ∗ ,
Pψ ∂ t ψ + P ψ ∗ ∂ t ψ ∗ − L =
a�ψ ∗ .�ψ
and hence the second quantized Hamiltonian density of the free Schrodinger
particle is
� � �
H = Hd3 x = a �ψ ∗ .�ψ.d3 x = −a ψ ∗ �2 ψ.d3 x
a = h2 /2m
Stochastic Processes
3.4. LARGE in Classical PROBLEMS
DEVIATIONS and Quantum IN
Physics and Engineering
QUANTUM 33
FLUID DYNAMICS41
or equivalently,
[ψ(t, r), ψ ∗ (t, rf )] = hδ 3 (r − rf )
We expand the wave operator field ψ(t, r) in terms of the stationary state energy
functions of the first quantized Schrodinger Hamiltonian operator
−ar2 + V
to get
ψ(t, r) = a(n)un (r)exp(−iω(n)t), ω(n) = E(n)/h
n
Actually, since the electronic field is Fermionic, the above commutation relations
should be replaced with anticommutation relations and hence all the above
commutators should actually be anticommutators. This picture does not show
the existence of the positron which is because the Schrodinger equation is a
nonrelativistic equation. If we use the Dirac relativistic wave equation, then
positrons automatically appear.
Assuming h to be a ”small” classical random field, we can ask about the statistics
of ψ(t, r) in a given quantum state ρ, ie for things like the quantum moments
T r(ρ.ψ(t1 , r1 )...ψ(tN , rN ))
This will be a functional of h(s, .), s ≤ max(t1 , .., tN ) and hence this moment
will be a classical random variable and we can ask about the large deviation
properties of this moment after replacing h by Eh with E → 0.
[14] Large deviations for Poisson process driven stochastic differential equa
tions. Let N (t), t ≥ 0 be a Poisson process with rate λ. Consider the scaled
process N� (t) = �.N (t/�). Then N� is a Poisson process whose jumps are in
units of � and whose rate is λ/�. Consider the stochastic differential equation
dX(t) = F (X(t))dt + dN� (t), t ≥ 0
The rate function of the process N� , � → 0 is evaluated as follows:
� T
−1
Λ(f ) = �.logE[exp(� f (t)dN� (t))]
0
� T
= λ. (exp(f (t)) − 1)dt
0
so the corresponding rate function is
� T � T
IN (X) = supf ( f (t)X � (t) − λ. (exp(f (t)) − 1)dt
0 0
where
J(u) = supv (uv − λ.(exp(v) − 1))
and then the rate function of the solution X(t) to the sde over the time interval
[0, T ] is given by
� T
IX (X) = J(X � (t) − F (X (t)))dt
0
More generally, consider a spatial Poisson field N (t, dξ) with a rate measure of
λ.dF (ξ), ie
E(N (t, E)) = λ.tF (E)
where E takes values in the Borel sigma field of Rn . X(t) is assumed to satisfy
the sde �
dX(t) = F (X(t))dt + g(X(t), u(t), ξ)N (dt, dξ)
where u(t) is an input signal field. Now scale the Poisson in both time and
amplitude to obtain the following sde satisfied by X(t):
�
dX� (t) = F (X� (t))dt + � g(X� (t), u(t), ξ).N (dt/�, dξ), t ≥ 0
We require to calculate the rate function of the process X� over the time interval
[0, T ]. To this end, we consider first a more general situation: Let N (dξ) be
a spatial Poisson field on Rn which we write formally as N (dx1 , ...dξn ). We
assume a constant rate field measure, ie,
Nδ1 ...δn (dξ1 , ..., dξn ) = δ1 ...δn N (dξ1 /δ1 , ..., dξn /δn )
Then, �
Mδ1 ...δn (f ) = Eexp( f (ξ1 , ..., ξn )Nδ1 ...δn (dξ1 , ..., dξn ) =
�
= exp(λ. (exp(δ1 ...δn .f (δ1 ξ1 , ...δn ξn )) − 1)dξ1 ...dξn )
�
= exp(λ.(δ1 ...δn )−1 (exp(δ1 ...δn .f (ξ1 , ...ξn )) − 1)dξ1 ...dξn )
from which we get for the scaled logarithmic moment generating function
T r(Cπ(g)) = 0∀g ∈ G
and hence since the set of linear combinations of π(g), g ∈ G equals Mn (C), it
follows that
T r(CX) = 0∀X ∈ Mn (C)
from which we deduce that C = 0. The claim is completely proved.
Note: This problem was taken from V.S.Varadarajan, ”Lie groups, Lie alge
bras and their representations”, Springer.
�µ nν = nν,µ − Γσµν nσ
µ ν ν
X,a X,b nν,µ = X,b nν,a
ν
= −nν X,ab
since
ν
nν X,b =0
µ ν σ
X,a X,b Γµν nσ
µ ν
= X,a X,b Γσµν nσ
µ ν
= (1/2)X,a X,b (qσµ,ν + qσν,µ − qµν,σ )nσ
µ ν
X,a X,b qσµ,ν =
µ µ
X,a qσµ,b = qσa,b − X,ab qσµ
Thus,
µ ν
X,a X,b qσµ,ν nσ = qσa,b nσ
σ
= qσa,b (X,0 − N c X,c
σ
)
= qσa,b X,σ0 − N c (qca,b − qσa X,bc
σ
)
Stochastic Processes in Classical and Quantum Physics and Engineering 39
3.8. DIFFUSION EXIT PROBLEM 47
Likewise,
µ ν
X,a X,b qσν,µ nσ = qσb,a nσ
µ ν
X,a X,b qµν,σ nσ
µ ν σ
= X,a X,b qµν,σ (X,0 − N c X,c
σ
)
µ ν
= X,a X,b qµν,0 − N c X,a
µ ν
X,b qµν,c
µ ν
= qab,0 − qµν (X,a X,b ),0 − N c (qab,c − qµν (X,a
µ ν
X,b ),c )
ν µ
= qab,0 − qaν X,b0 − qµb X,a0 − N c qab,c + N c qaν X,bc
ν
+ N c qµb X,ac
µ
Now
ν ν
qaν X,b0 = qab,0 − qaν,0 X,b
µ µ
qµb X,a 0 = qab,0 − qµb,0 X,a
Alternately,
ν ν
qaν X,b0 = qa0,b − qaν,b X,0
µ µ
qµb X,a 0 = qb0,a − qµb,0 X,0
Now,
gµν (X,µ0 − N a X,a
µ ν
)X,b =0
gives
q0b = N a qab
So we can substitute this expression for qa0 in the above equations.
or equivalently,
H(x, p) = py − L(x, y), p = ∂y L(x, y)
and
V (t, x, y) = S(t, x, y)
Then
V (x, y) = inft≥0 V (t, x, y) = inft≥0 S(t, x, y) = S(x, y)
We note that it is hard to compute the exact value of E(τ (�)) for � > 0 be
cause this involves computing the Green’s function of the diffusion generator.
However, computing the limiting value of �.logE(τ (�)) as � → 0 is simpler as it
involves only computing the stationary solution of the HamiltonJacobi equa
tion.
Stochastic Processes in Classical and Quantum Physics and Engineering 41
3.9. LARGE DEVIATIONS FOR THE BOLTZMANN KINETIC TRANSPORT EQUATIO
w(p + q, q)f (t, r, p + q) ≈ w(p, q)f (t, r, p) + (q, �p )(w(p, q)f (t, r, p))
≈ (q, �p )(w(p, q)f (t, r, p)) + (1/2)T r(qq T �p �Tp )(w(p, q)f (t, r, p))
Now letting � �
q.w(p, q)d3 q = A(p), qq T w(p, q)3 q = B(p)
divp (A(p)f (t, r, p)) + (1/2)T r(�p �Tp B(p)f (t, r, p))
which is the FokkerPlanck equation that one encounters in diffusion processes
described in the form of stochastic differential equations driven by Brownian
motion.
The large deviation problem: If the scattering probability function w(p, q)
has small random fluctuations, the we can ask what would be the deviation
in the resulting solution to the particle distribution function f ? Of course, to
answer this question, we must expand the solution f in powers of a perturbation
parameter tag δ that is attached the random vector and matrix valued functions
A(p), B(p). Specifically, assuming w(p, q) to be of order δ, we have that A(p)
and B(p) are random functions respectively of orders δ and δ 2 .
42 Stochastic Processes in Classical and Quantum Physics and Engineering
50CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUAT
G = KAN
where ( )
cos(θ) −sin(θ)
K = {u(θ) : 0 ≤ θ < 2π}, u(θ) =
sin(θ) cos(θ)
A = {diag(a, a−1 ) : a ∈ R − {0}}
( )
1 n
N ={ : n ∈ R}
0 1
Define ( )
0 1
H = diag(1, −1), X =
0 0
( )
0 0
Y =
1 0
Then,
[H, X] = 2X, [H, Y ] = −2Y, [X, Y ] = H
Consider the following element in the universal enveloping algebra of G:
Ω = 4XY + H 2 − 2H
Then,
Noting that
G = KN A
we can write
G/A = KN
and hence � �
f (xhx−1 )dx = f (unhn−1 u−1 )dudn
G/A K×N
�
= f¯(nhn−1 )dn
N
where �
f¯(x) = f (uxu−1 )du
K
Now let
nhn−1 − h = M, n ∈ N, h ∈ A
It is clear that M is strictly upper triangular since n is unipotent, ie, upper
triangular with ones on the diagonal. Then, for fixed h,
(Simply write � �
1 m
n= ,m ∈ R
0 1
and note that dn = dm and
� �
1 −m
n−1 =
0 1
h = diag(a, 1/a)
and observe that � �
−1 0 (1/a − a)m
nhn −h=
0 0
) Note that for a = exp(t) and h = diag[a, 1/a], we have
This gives
� � �
−1 −1
f (nhn )dn = f (M + h)(dn/dM )dM = Δ(h) f (M + h)dM
N n n
where n is the Lie algebra of N and consists of all strictly upper triangular real
matrices. Finally, using Weyl’s integration formula,
� � � �
2 −1
f (x)dx = dh|Δ(h)| f (xhx )dx = |Δ(h)|f (M + h)dhdM
G A G/A A×n
4452CHAPTERStochastic Processes
3. ORDINARY, in Classical
PARTIAL AND andSTOCHASTIC
Quantum Physics and Engineering EQUATI
DIFFERENTIAL
�
= |exp(t)−exp(−t)|f (a(t)+sX)dtds, a(t) = diag(exp(t), exp(−t)) = exp(tH)
R×R
parallel displacements of the connection Aja around loops in the graph and the
connection terms Aja integrated over edges by SO(3) holonomy group elements.
In other words, the whole programme of quantizing the EinsteinHilbert action
using such discretized graphs becomes simply that of quantizing a function of
SO(3) holonomy group elements, one element associated with each edge of a
graph and a holonomy group element corresponding to the curvature obtained
by parallely transporting the connection around a loop.
= I(f, g)
Thus, I(f, g) is an invariant function of two image fields defined on G. I can be
evaluated using the above integration formula. Note that this formula is true
for any function χ on G, not necessarily a character. Now consider
�
J(f, g) = f (x)g(y)χ(yx−1 )dxdy
G×G
�
= f (u−1 x).g(u−1 y)χ(yx−1 )dxdy
46 Stochastic Processes in Classical and Quantum Physics and Engineering
54CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUATI
�
= f (x)g(y)χ(uyx−1 u−1 )dxdy = J(f, g)
χ(uxu−1 ) = χ(x)∀u, x ∈ G
with c a constant that equals ±1. Note also that H varies over the Cartan
subalgebra a and since G is a compact group, it follows that α(H) is pure
imaginary for all α ∈ Δ and H ∈ a. Now, Weyl’s integration formula gives for
any function f on G, �
χ(f ) = f (g)χ(g)dg =
�
C. |Δ(h)|2 f (xhx−1 )χ(h)dhdx
A×G/A
�
= C. ¯
Δ(h)F f (h)χ(h)dh
A
on the one hand and on the other, using the fact that
L
Δ(H) = E(s).χsn,n−1,...,1 (H)
s∈W
where
χn−1,...,0 (H) = exp(inθ1 + ... + i2θ2 + iθ1 )
we get
DΔ(0) = |W |Πn≥r>s≥1 (r − s)
and hence
(Du)(0) = χ(0)(DΔ(0)) = d(χ).|W |.Πr>s (r − s)
where
d(χ) = χ(0)
is the dimension of the irreducible representation. Equating these two expres
sions, we obtain Weyl’s dimension formula
Πr>s (mr − ms )
d(χ) = d(m1 , ..., mn ) =
Πr>s (r − s)
48 Stochastic Processes in Classical and Quantum Physics and Engineering
56CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUATI
This argument works for the compact group SU (n) where a dominant integral
weight is parametrized by an n tuple of nonnegative integers (m1 ≥ m2 ≥ ... ≥
m1 ≥ 0). In the general case of a compact semisimple Lie group G, we use
Weyl’s character in the form
�
�(s)exp(s.(λ + ρ)(H))
χ(H) = s∈W
Δ(H)
�
Δ(H) = �(s).exp(s.ρ(H)) = Πα∈Δ+ (exp(α(H)/2) − exp(−α(H)/2))
s∈W
Then define
D = Πα∈Δ+ ∂(Hα )
We get
DΔ(0) = |W |.Πα∈Δ+ ρ(Hα )
Thus, on the one hand,
Equating these two expressions gives us Weyl’s dimension formula for the irre
ducible representation of any compact semisimple Lie group corresponding to
the dominant integral weight λ in terms of the roots:
Πα∈Δ+ (λ + ρ)(Hα )
d(χ) = d(λ) =
Πα∈Δ+ ρ(Hα )
or equivalently,
where
µ = mc/h
with h denoting Planck’s constant divided by 2π. Now,
√
−g = (g00 g11 g22 g33 )1/2 = r2 sin(θ)/c3
−(c2 /r2 )r2 (sin(θ)ψ,θ )),θ − (c2 /r2 sin2 (θ))(r2 sin(θ)ψ,φφ + µ2 r2 sin(θ)ψ = 0
This equation can be rearranged as
where
1 ∂ ∂ 1 ∂2
L2 = −( sin(θ) + )
sin(θ) ∂θ ∂θ sin2 (θ) ∂φ2
is the square of the angular momentum operator in nonrelativistic quantum
mechanics. Thus, we obtain using separation of variables, the solution
�
ψ(t, r, θ, φ) = fnl (t, r)Ylm (θ, φ)
nlm
ψ(t, r, θ, φ)
�
= [a(n, l, m)gnl (r)Ylm (θ, φ)exp(iω(nl)t)+a(n, l, m)∗ ḡnl (r)Y¯lm (θ, φ)exp(−iω(nl)t)]
nlm
or
dr/dt = ±α(r) = ±(1 − 2m/r), m = GM/c2
The solution is �
rdr/(r − 2m) = ±t + c
so that
2m.r + 2mln(r − 2m) = ±t + c
The change in time t when the particle goes from 2m+0 to 2m−0 is purely imag
inary an is given by δt = 2mπi since ln(r − 2m) = ln|r − 2m| + i.arg(r − 2m)
with arg(r − 2m) = 0 for r > 2m and = π for r < 2m. Thus, a photon
wave oscillating at a frequency ω has its amplitude changed from exp(iωt) to
exp(iω(t + 2mπi)) = exp(iωt − 2mπω). The corresponding ratio of the proba
bility when it goes from just outside to just inside the critical radius is obtained
by taking the modulus square of the probability amplitude and is given by
P = exp(−4mπω). The energy of the photon within the blackhole is hω (where h
is Planck’s constant divided by 2π) and hence by Gibbsian theory in equilibrium
statistical mechanics, its temperature T must be given by P = exp(−hω/kT ).
Therefore
hω/kT = 4πmω
or equivalently,
T = 4πmk/h = 8πGM k/hc2
This is known as the Hawking temperature and it gives us the critical temper
ature of the interior of the blackhole at photons can tunnel through the critical
radius, ie the temperature at which the blackhole can radiate out photons.
We have
d/ds(u(θ)a(t)n(s)) = u(θ)a(t)n(s)X
so
X ∗ = ∂/∂s
Then,
Now,
n(s)−1 Hn(s) = exp(−s.ad(X))(H) = H − s[X, H] = H − 2sX
So
H ∗ − 2sX ∗ = ∂/∂t
Finally,
Problem: Suppose I(f, g) is the invariant functional for a pair of image fields
f, g defined on SL(2, R). If f changes by a small random amount and so does
g, then what is the rate functional of the change in I(f, g) ? Suppose that for
each irreducible character χ of G, we have constructed the invariant functional
ˆ denote the set of all the irreducible
Iχ (f, g) for a pair of image fields f, g. Let G
characters of G. Consider the Plancherel formula for G:
�
f (e) = χ(f )dµ(χ)
Ĝ
Note that (f, g) → Iχ (f, g, h) is not an invariant. We define the Fourier trans
form of the class function ψ by
�
ψ̂(χ) = ψ(h)χ(h−1 )dh
G
54 Stochastic Processes in Classical and Quantum Physics and Engineering
62CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUATI
Then, we have �
δe (g) = χ(g)dµ(χ),
(Note that we are assuming G to be a unimodular group. In the case of SL(2, R),
this anyway true), we have
�
f (g) = T r(πχ (h−1 g))f (h)dhdµ(χ)
�
= T r(fˆ(πχ )πχ (g))dµ(χ)
Ĝ
and hence, �
Iχ (f, g, h) = f (x)g(y)χ(yx−1 h)dxdy
G×G
�
= f (x)g(y)T r(πχ (y)πχ (x−1 )πχ (h))dxdy
but, �
ψ̂(χ) = ψ(g)T r(πχ (g −1 ))dg =
ˆ χ ))
T r(ψ(π
We have, � �
ψ(h)πχ (h)dh = ψ0 (h)πχ (h−1 )dh
= ψˆ0 (πχ )
Stochastic Processes in Classical and Quantum Physics and Engineering 55
3.14. MORE ON GROUP THEORETIC IMAGE PROCESSING 63
j
= ψ(h)χ(y −1 h−1 yx)dh.dµ(χ)
j
= ψ(h)χ1 (h, x, a)dµ(χ)
where
j j
−1 −1
χ1 (h, x, a) = a(y)χ(y h yx)dy = a(y)χ(h−1 yxy −1 )dy
G G
Remark: SL(2, R) under the adjoint action, acts as a subgroup of the Lorentz
transformations in a plane, ie, in the t−y −z space. Indeed, represent the space
time point (t, x, y , z) by
( )
t + z x + iy
Φ(t, x, y , z) =
x − iy t − z
where
A ∈ SL(2, R)
Clearly, by taking determinants, we get
� � �2 �
t2−x2−y −z 2
= t2 − x2 − y 2 − z 2
and then choosing A = u(θ) = exp(θ(X − Y )), a(v) = exp(vH), n(s) = exp(sX)
successively gives us the results that
for the case A = u(θ), which corresponds to a rotation of the x − z plane around
the y axis by an angle 2θ. The case A = a(v) gives
or equivalently,
x → x + st, t → t + sx
where Kx is the infinitesimal generator of Lorentz boosts along the x axis while
Xy is the infinitesimal generator of rotations around the y axis. Specifically,
Kx = L�x (0), Xy = Ry� (0)
The characters of the discrete series of SL(2, R). Let m be a nonpositive integer.
Let em−1 be a highest weight vector of a representation πm of sl(2, R). Then
Define
π(Y )r em−1 = em−1−2r , r = 0, 1, 2, ...
Then
π(H)em−1−2r = π(H)π(Y )r em−1 = ([π(H), π(Y )r ] + π(Y )r π(H))em−1
= (m − 1 − 2r)π(Y )r em = (m − 1 − 2r)em−1−2r , r ≥ 0
Thus, considering the module V of SL(2, R) defined by the span of em−2r , r ≥ 0,
we find that
� exp((m − 1)t)
T r(π(a(t)) = T r(π(exp(tH)) = exp(t(m − 1 − 2r)) =
1 − exp(−2t)
r≥0
exp(mt)
=
exp(t) − exp(−t)
Now,
π(X)em−1−2r = π(X)π(Y )r em−1 = [π(X), π(Y )r ]em−1
= (π(H)π(Y )r−1 + .... + π(Y )r−1 π(H))em−1
= [(−2(r−1)π(Y )r−1 +π(Y )r−1 π(H)−2(r−2)π(Y )r−1
+π(Y )r−1 π(H)+...+π(Y )r−1 π(H)]em−1
= [−r(r − 1) + r(m − 1)]π(Y )r−1 em−1 = r(m − r)em+1−2r
Also note that
π(Y )em−1−2r = em−3−2r , r ≥ 0
Problem: From these expressions, calculate T r(π(u(θ)) where u(θ) = exp(θ(X−
Y )). Show that it equals exp(imθ)/(exp(iθ) − exp(−iθ)). In this way, we are
able to determine all the discrete series characters for SL(2, R).
58 Stochastic Processes in Classical and Quantum Physics and Engineering
66CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUATI
exp(−hω/kT ) = exp(−ωΔ)
T = h/kΔ
Let
D(ρ|σ) = T r(ρ.(log(ρ) − log(σ)))
the relative entropy between two states ρ, σ defined on the same Hilbert space.
We know that it is nonnegative and that if K is a quantum operation, ie, a
CPTP map, then
D(ρ|sigma) ≥ D(K(ρ)|K(σ))
Consider three subsystems A, B, C and let ρ(ABC) be a joint state on these
three subsystems. Let ρm = IB /d(B) the completely mixed state on B. IB is
the identity operator on HB and d(B) = dimHB . Let K denote the quantum
operation corresponding to partial trace on the B system. Then consider the
inequality
= D(ρ(AC)|ρ(AC)) = 0
Evaluating gives us
or equivalently,
−H(ABC) + H(AC) + log(d(B)) ≥ 0
Now, let K denote partial tracing on the Csubsystem. Then,
gives
−H(ABC) + H(AC) + log(d(B)) ≥ D(ρ(AB)|Im ⊗ ρ(A))
60 Stochastic Processes in Classical and Quantum Physics and Engineering
68CHAPTER 3. ORDINARY, PARTIAL AND STOCHASTIC DIFFERENTIAL EQUATI
A(r) − B(r)(dr/dt)2 = 0
say where �
F (r) = A(r)/B(r)
Let F (r) have a zero at the critical radius r0 so that we can write
F (r) = G(r)(r − r0 )
where G(r0 ) = 0. Then, the time taken for the photon to go from r0 + 0 to
r0 − 0 is given by
� r0 +0
Δ= dr/((r − r0 )G(r)) = G(r0 )−1 lim�↓0 (log(�) − log(−�))
r0 −0
= −iπ.G(r0 )−1
so that the ratio change in the probability amplitude of a photon pulse of fre
quency ω is the quantity exp(−ωπ.G(r0 )−1 ) and equating this to the Gibbsian
probability exp(−hω/kT ) gives us the Hawking temperature as
T = h.G(r0 )/πk
Chapter 4 Chapter 4
Research
Research Topics on Topics on
Representations
represenations,
Using the Schrodinger equation to track the time varying probability density
of a signal. The Schrodinger equation naturally generates a family of pdf’s as
the modulus square of the evolving wave function and by controlling the time
varying potential in which the Schrodinger evolution takes place, it is possible
to track a time varying pdf, ie, encode this time varying pdf into the form the
weights of a neural network. From the stored weights, we can synthesize the
time varying pdf of the signal.
gives �
f (x)h(x−1 y)dx = 0∀y ∈ G∀h ∈ V
G
69
61
62 Stochastic Processes in Classical and Quantum Physics and Engineering
70 CHAPTER 4. RESEARCH TOPICS ON REPRESENATIONS,
or �
ˆ
h(x)π(n.x−1 )dx = χ(n)−1 h(π)
or
ˆ
π(n)h(π) ˆ
= χ(n−1 )h(π), n∈N
Define �
Pπ,N = π(n)dn
N
Assume without loss of generality that π is a unitary representation of G. Then,
Pπ,N is the orthogonal projection onto the space of all those vectors w in the
representation space of π for which π(n)w = w∀n ∈ N . Then the above equation
gives on integrating w.r.t n ∈ N ,
ˆ
Pπ,N h(π) = 0, h ∈ V, χ �= 1
and
ˆ
Pπ,N h(π) ˆ
= h(π), h ∈ V, χ = 1
on using the Schur orthogonality relations for the characters of N . We therefore
have The equation �
f (x)h(x−1 y)dx = 0
G
gives us �
f (x)h(x−1 y)π(y −1 )dxdy = 0
G
or �
f (x)h(z)π(z −1 x−1 )dxdz = 0
or
ˆ ˆ(π) = 0, h ∈ V
h(π)f
It also implies �
f (x)h(x−1 y)π(y)dxdy = 0
or �
f (x)h(z)π(xz)dxdz = 0
or
fˆ(π)∗ ĥ(π)∗ = 0
which is the same as the above. Suppose on the other hand that f satisfies
�
f (xny)dn = 0, x, y ∈ G − − − (1)
N
Stochastic Processes ON
4.1. REMARKS in Classical
INDUCED and REPRESENTATIONS
Quantum Physics and Engineering 63
AND HAAR INTEGRALS71
which implies �
f (z)π(z −1 xn)dzdn = 0
G×N
This implies
fˆ(π)π(x)Pπ,N = 0, x ∈ G
It also implies �
f (xny)π(x−1 )dxdn = 0
which implies �
f (z)π(nyz −1 )dzdn = 0
or equivalently,
Pπ,N π(y)fˆ(π) = 0, y ∈ G
In particular, (1) implies
h1 (xn) = h1 (x), x ∈ G, n ∈ N
or �
f (x)h1 (z)π(z −1 x)dxdz = 0
or equivalently,
ˆ 1 (π)fˆ(π)∗ = 0
h
for all unitary representations π of G and for all functions h on G. Now,
� �
ˆ 1 (π) = h(xn)π(x−1 )dxdn = h(z)π(nz −1 )dzdn = P.h(π)
h ˆ
for all unitary representations π of G and for all functions h on G. This condition
is the same as (2) if we observe that fˆ(π) can be replaced by fˆ(π)∗ because
f (x) being orthogonal to the matrix elements of U (x) is equivalent to f (x−1 )
being orthogonal to the same because for compact groups, a finite dimensional
representation is equivalent to a unitary representation which means that U is
an appropriate basis is a unitary representation. Thus, we have proved that
a necessary and sufficient condition that a function f be orthogonal to all the
matrix elements of the representation of G induced by the identity character χ0
on N is that f be a cusp form, ie, it should satisfies
�
f (xny)dn = 0∀x, y ∈ G
N
Then, � �
h1 (xn0 ) = h(xn0 n)χ(n)dn = h(xn)χ(n−1
0 n)dn
N
Note that this condition fails for noncompact groups since the integral dn on
N cannot be normalized in general. Then, the condition for f to be orthogonal
to U is that J
f (x)U (x−1 )dx = 0 − − − (4)
G
and this condition is equivalent to
J
f (x)h1 (xy)dx = 0
or equivalently,
ˆ ˆ(π)∗ = 0 − − − (5)
Pχ,π,N .h(π).f
for all functions h on G and for all unitary representations π of G where
J
Pχ,π,N = P = χ(n)π(n)dn
N
fˆ(π)π(x)P = 0, x ∈ G − − − (6)
fˆ(π)h(π)P
ˆ = 0 − − − (7)
for all functions h on G as follows by multiplying (5) with h(x−1 ) and integrating
over G. This condition is the same as
ˆ ∗ fˆ(π)∗ = 0
Pχ,π,N h(π)
ˆ ˆ(π)∗ = 0 − − − (8)
Pχ,π,N h(π)f
being ejected from the nose and mouth of the person per unit time. Then, we
have a simple conservation equation
We can ask the question of determining the statistics of the first time τc at which
the CO2 concentration nc (t) exceeds a certain threshold Nc . Sometimes there
can be a chemical reaction between the CO2 trapped in the volume with some
other gases. Likewise other gases within the volume can react and produce CO2 .
If we take into account these chemical reactions, then there will be additional
terms in the above equation describing the rate of change of concentration of
CO2 . For example, consider a reaction of the form
according to Arrhenius’ law of mass action. After time t, therefore, nc (0)−nc (t)
molecules of CO2 have reacted with (nc (0) − nc (t))b/a moles of X. Thus,
equations (1) and (2) have to be jointly solved for nc (t) and nX (t). This can be
reduced to a single differential equation for nc (t):
dnc (t)/dt = −K.nc (t)a (nX (0) − (nc (0) − nc (t))b/a)b − − − (3)
and hence
dnc (t)/dt = −Knc (t)a nX (t)b + (K1 r/p)nU (t)p nV (t)q + f (nc (t), t)
where
dnX (t)/dt = (−Kb/a)nc (t)a nX (t)b , dnU (t)/dt = −K1 nU (t)p nV (t)q ,
nXj (t) = nXj (0) − (aj /a0 )(nc (0) − nc (t)), j = 1, 2, ..., n,
nYj (t) = nYj (0) + (bj /a0 )(nc (0) − nc (t)), j = 1, 2, ..., m
Stochastic Processes in Classical and Quantum Physics and Engineering 69
4.3. PLANCHEREL FORMULA FOR SL(2, R) 77
where
Δ(θ) = (exp(iθ) − exp(−iθ)), Δ(t) = exp(t) − exp(−t)
Now �
� 2π �
|m|Θm (f ) = ΔL (t)Ff,L (t) |m|Θm (a(t))dt
modd 0 modd
� �
+ ΔB (θ)Ff,B (θ). |m|Θm (u(θ))dθ
R modd
70
78 Stochastic Processes
CHAPTER in Classical
4. RESEARCH and Quantum
TOPICS Physics and Engineering
ON REPRESENATIONS,
Now, � �
|m|Θm (a(t)) = |m|exp(−|mt|)/(exp(t) − exp(−t))
modd modd
� �
mexp(−mt) = (2k + 1)exp(−(2k + 1)t)
m≥1,modd k≥0
�
m.exp(−mt) = −d/dt(1 − exp(−t))−1 = exp(−t)(1 − exp(−t))−2 , t ≥ 0
m≥0
So
�
(2k+1)exp(−(2k+1)t) = exp(−t)(2exp(−2t)(1−exp(−2t))−2 +(1−exp(−2t))−1 )
k ≥0
= exp(−t)(1+exp(−2t))/(1−exp(−2t))2 = (exp(t)+exp(−t))/(exp(t)−exp(−t))2
Thus,
� �
|m|exp(−|mt|) = 2 m.exp(−m|t|) = tanh(t) = w1 (t)
modd m≥0
say.
� �
|m|Θm (u(θ)) = m.exp(imθ)/(exp(iθ) − exp(−iθ)) = w2 (θ)
modd modd
say. Then, the Fourier transform of Ff� ,B (θ) (ie Fourier series coefficients)
�
Fˆf� ,B (m) = Ff� ,B (θ)exp(imθ)dθ =
�
− Ff,B (θ)(im)exp(imθ)dθ = −imFˆf,B (m)
So
�
−imΘm (f ) = sgn(m)(−imFˆf,B (m)) − im exp(−|mt|)Ff,L (t)dt
�
= sgn(m)Fˆf� ,B (m) − im exp(−|mt|)Ff,L (t)dt
Now, �
�
F̂f� ,B (m) = Ff� ,B (0) −i w1 (t)Ff,L (t)dt
m
Now, �
Ff,B (θ) = (exp(iθ) − exp(−iθ)) f (xu(θ)x−1 )dx
G/B
This formula was generalized to all semisimple Lie groups and also to padic
groups by HarishChandra.
or equivalently, �
f (e) = χ(f )dµ(χ)
Ĝ
72 Stochastic Processes in Classical and Quantum Physics and Engineering
80 CHAPTER 4. RESEARCH TOPICS ON REPRESENATIONS,
where �
ψ1 (x) = χ(ψ 0 )χ(x)dµ(χ), ψ 0 (x) = ψ(x−1 )
so that �
χ(ψ ) = 0
ψ(x−1 )χ(x)dx
where �
fˆ(πχ ) = f (x)πχ (x−1 ))dx
Now, � �
−1
πab (x yx)dx = πac (x−1 )πcd (y)πdb (x)dx
�
= πcd (y) π̄ca (x).πdb (x)dx = d−1 πcd (y)δ(c, d)δ(a, b)
�
= d(χ)−1 χ(ψ 0 )χ(g)
χ∈Ĝ
� �
= d(χ)−1 χ(g) ψ(x)χ̄(x)dx
ˆ
χ∈G
Is this result true for an arbitrary semisimple Lie group when we replace d(χ) by
the Plancherel measure/formal dimension on the space of irreducible characters
? In other words, is the result
�
δg (x) = χ(g)χ̄(x)dµ(χ)
Ĝ
true ?
Chapter 5
Chapter 5
Some Problems in Quantum
Information
Some problems Theory
in quantum
information theory
A problem in quantum information theory used in the proof of the coding the
orem for Cq channels : Let S, T be two nonnegative operators with S ≤ 1, ie,
0 ≤ S ≤ 1, T ≥ 0. Then
or equivalently,
((T +S)−1/2 −1)T +T ((T +S)−1/2 −1) ≤ T +((T +S)−1/2 −1)T ((T +S)−1/2 −1)
83
75
7684CHAPTERStochastic
5. SOMEProcesses in Classical
PROBLEMS and Quantum
IN QUANTUM Physics and Engineering
INFORMATION THEORY
is a minimum. Now,
C(T ) = 1 + T r((B − A)T )
It is clear that this is a minimum iff
T = {B − A ≤ 0} = {A − B ≥ 0}
= min0≤s≤1 exp(φ(s|A|B))
where
φ(s|A|B) = log(T r(A1−s B s ))
To obtain the minimum in the upper bound, we set the derivative w.r.t s to
zero to get
d/ds(T r(A1−s B s ) = 0
or
T r(log(A)A1−s B s ) = T r(A1−s B s log(B))
It is not clear how to solve this equation for s. Now define
Now,
T r(A⊗n(1−s) B ⊗ns ) = (T r(A1−s B s ))n
Therefore, we can write
limsupn→∞ n−1 log(Pn (e)) ≤
p(i)| < ei |fj > |2 = P (i, j), q(j)| < ei |fj > |2 = Q(i, j)
�
= p(i) < ei |fj > (< fj |ei > − < fj |T |ei >)
i,j
�
=1− p(i) < ei |fj > t(j, i)
i,j
and likewise,
� �
T r(σT ) = < ei |σ|fj > t(j, i) = q(j) < ei |fj > t(j, i)
i,j i,j
where
t(j, i) =< fj |T |ei >
So,
�
T r(ρ(1 − T )) + T r(σT ) = 1 + t(j, i)(q(j) − p(i)) < ei |fj > t(j, i)
i,j
Now,
� � �
| p(i) < ei |fj > t(j, i)|2 ≤ p(i)|ei |fj > |2 . p(i)|t(j, i)|2
i,j i,j i,j
�
= p(i)|t(j, i)|2
i,j
Thus, �
T r(ρ(1 − T )) ≥ 1 − p(i)|t(j, i)|2
i,j
�
T r(σT ) = q(j) < ei |fj > t(j, i)
i,j
So �
T r(σT )2 ≤ q(j)|t(j, i)|2
i,j
Stochastic Processes in Classical and Quantum Physics and Engineering 79
87
and likewise,
� �
T r(σT ) = q(j)| < fj |T |ei > |2 ≥ a(i, j)| < fj |T |ei > |2
i,j i,j
where
a(i, j) = min(p(i), q(j))
Now,
| < ei |1 − T |fj > |2 + < ei |T |fj > |2 ≥ (1/2)| < ei |1 − T |fj > + < ei |T |fj > |2
T r(ρ⊗n (1 − Tn )) + T r(σ ⊗n Tn ) ≥
where I1 is the rate function under P n of the random family n−1 log(P n /Qn ).
Evaluating the moment generating function of this r.v gives us
Then, �
infx≤0 I1 (x) = sups≥0 − log( P (ω)1−s Q(ω)s ))
ω
�
= −infs≥0 log( P (ω)1−s Q(ω)s ))
ω
�
− p(i) < ei |fj > |2 (log(q(j)) + log(| < ei |fj > |2 ))
i,j
= D(P |Q)
M (λ) = E[exp(λX)]
Stochastic Processes in Classical and Quantum Physics and Engineering 81
5.1. QUANTUM INFORMATION THEORY AND QUANTUM BLACKHOLE PHYSICS8
µ = Λ� (0)
Further,
Λ�� (λ) ≥ 0∀λ
ie Λ is a convex function. Hence Λ� (λ) is increasing for all λ. Now suppose
x ≥ µ. Consider
ψ(x, λ) = λ.x − Λ(λ)
Since
∂λ2 ψ(x, λ) = −Λ�� (λ) < 0
it follows that ∂λ ψ(x, λ) is decreasing or equivalently that ψ(x, λ) is concave in
λ. Suppose λ0 is such that
∂λ ψ(x, λ0 ) = 0
Then it follows that for a fixed value of x, ψ(x, λ) attains its maximum at
λ0 . Now ∂λ ψ(x, λ0 ) = 0 and ∂λ ψ(x, λ) is decreasing in λ. It follows that
∂λ ψ(x, λ) ≤ 0 for λ ≤ λ0 . Hence, ψ(x, λ) is decreasing for λ ≥ λ0 . Also for
λ < λ0 , ∂λ ψ(x, λ) ≥ 0 (since ∂λ ψ(x, λ0 ) = 0 and ∂λ ψ(x, λ) is decreasing for all
λ) which means that ψ(x, λ) is increasing for λ < λ0 . Now
∂λ ψ(x, 0) = x − µ ≥ 0
ψ(x, λ) ≤ ψ(x, 0) = x, ∀λ ≤ 0
Thus,
I(x) = supλ∈R ψ(x, λ) = supλ≥0 ψ(x, λ)
in the case when x ≥ µ. Consider now the case when x ≤ µ.
where ψ = ψ(t, r), π = π(t, r), the canonical position and momentum fields are
functions of only time and the radial coordinate.
Stochastic Processes in Classical and Quantum Physics and Engineering 83
5.2. HARMONIC ANALYSIS ON THE SCHWARZ SPACE OF G = SL(2, R)91
where d(k)' s are the formal dimensions of the discrete series while c(mn, µ)
are determined from the wave packet theory for principal series, ie, from the
asymptotic behaviour of the principal series matrix elements. The asymptotic
behaviour of the principal and discrete matrix elements is obtained from the
differential equations
we get on noting that the restriction of the Casimir operator to R.H is given by
d2
(Ωf )(a(t)) = ΔL (t)−1 ΔL (t)f (a(t))
dt2
where
ΔL (t) = et − e−t
as follows by applying both sides to the finite dimensional irreducible characters
of SL(2, R) restricted to R.H on noting that the restriction of such a character is
given by (exp(mt)−exp(−mt))/(exp(t)−exp(−t)), we get a differential equation
of the form
d2
fmn (a(t), µ) − qmn (t)fmn (a(t), µ) = −µ2 fmn (a(t), µ)
dt2
where qmn (t) is expressed in terms of the hyperbolic Sinh functions. From
this equation, the asymptotic behaviour of the matrix elements fmn (a(t), µ) as
|t| → ∞ may be determined. It should be noted that the discrete series modules
can be realized within the principal series modules and hence the asymptotic be
haviour of the matrix elements fmn (a(t), µ) will determine that of the principal
series when µ is nonintegral and that of the discrete series when µ is integral.
84 Stochastic Processes in Classical and Quantum Physics and Engineering
92CHAPTER 5. SOME PROBLEMS IN QUANTUM INFORMATION THEORY
This means that if we consider the U (g)module V = U (g)fm and the map
T : π(U (g))em → V defined by T (π(a)em ) = a.fm , then it is clear that T is a
well defined linear map with kernel
We have �
ρ1 = T r2 (|ψ >< ψ|) = p(i)|ψi >< ψi |
i
Stochastic Processes in Classical and Quantum Physics and Engineering 85
5.3. MONOTONICITY OF QUANTUM ENTROPY UNDER PINCHING 93
��
ρ2 = T r1 (|ψ >< ψ| = p(i)p(j) < ψj |ψi > |i >< j|
i
Now let σ1 , σ2 be two mixed states and let p > 0, p + q = 1. Consider the
spectral decompositions
� �
σ1 = p(i)|ei >< ei |, σ2 = q(i)|fi > fi |
i i
Then, �
pσ1 + qσ2 = (pp(i)|ei >< ei | + qq(i)|fi >< fi |)
i
Then
0 ≤ D(ρ|σ) = T r(ρ.log(ρ)) − T r(ρ.log(σ))
86 Stochastic Processes in Classical and Quantum Physics and Engineering
94CHAPTER 5. SOME PROBLEMS IN QUANTUM INFORMATION THEORY
� �
= −H(ρ) − T r(ρ.Pi .log(Pi ρ.Pi )Pi ) = −H(ρ) − T r(Pi ρ.Pi .log(Pi ρ.Pi )
i i
� �
= −H(ρ) − T r(Pi ρ.Pi .log( Pj ρ.Pj )) = −H(ρ) + H(σ)
i j
Thus,
H(σ) ≥ H(ρ)
which when stated in words, reads, pinching increases the entropy of a state.
Remark: We’ve made use of the following facts from matrix theory:
� �
log( Pi .ρ.Pi ) = Pi .log(ρ)Pi
i i
and use
log(1 − z) = −z − z 2 /2 − z 3 /3 − ...
combined with Pj Pj = Pi δ(i, j) to get
�
log( Pi ρ.Pi ) = −A − A2 /2 − A3 /3 − ...
i
where
� � �
A= Pi (1 − ρ)Pi , A2 = Pi (1 − ρ)2 Pi , ..., An = − Pi (1 − ρ)n Pi
i i i
so that �
log(1 − A) = Pi .log(ρ)Pi
i
A particular application of this result has been used above in the form
��
H( p(i)p(j) < ψj |ψi > |i >< j|)
i
��
≤ H( p(i)p(j) < ψj |ψi >< k|i >< j|k > |k >< k|)
i,k
� �
= H( p(k)|k >< k|) = H({p(k)}) = − p(k).log(p(k))
k k
Chapter 6
Chapter 6
Lectures on Statistical
Lectures onSignal Processing
Statistical
Signal Processing
[1] Large deviation principle applied to the least squares problem of estimating
vector parameters from sequential iid vector measurements in a linear model.
V (n)� s are iid and have the pdf pV . Then θ is estimated based on the measure
ments X(n), n = 1, 2, ..., N using the maximum likelihood method as
N
�
θ̂(N ) = argmaxθ N −1 log(pV (X(n) − H(n)θ))
n=1
Consider now the case when H(n) is a matrix valued stationary process in
dependent of V (.). Then, the above maximum likelihood estimate is replaced
by
�
θ̂(N ) = argmaxθ [ΠN N
n=1 pV (X(n) − H(n)θ)]pH (H(1), ..., H(N ))Πj=1 dH(j)
In the special case when the H(n)� s are iid, the above reduces to
N
� �
θ̂(N ) = argmaxθ N −1 log( pV (X(n) − Hθ)pH (H)dH)
n=1
The problem is to calculate the large deviation properties of θ̂(N ), namely the
rate at which P (|θ̂(N ) − θ| > δ) converges to zero as N → ∞.
95
87
88 Stochastic Processes in Classical and Quantum Physics and Engineering
96 CHAPTER 6. LECTURES ON STATISTICAL SIGNAL PROCESSING
N
� N
�
θ(N ) = [ H(n)T H(n)]−1 [ H(n)T X(n)]
n=1 n=1
N
� N
�
R(N ) = H(n)T H(n), r(N ) = H(n)T X(n)
n=1 n=1
Then,
R(N +1) = R(N )+H(N +1)T H(N +1), r(N +1) = r(N )+H(N +1)T X(N +1)
Then,
R(N +1)−1
= R(N )−1 −R(N )−1 H(N +1)T (I+H(N +1)H(N +1)T )−1 H(N +1)R(N )−1
so that
where K(N +1) is a Kalman gain matrix expressible as a function of R(N ), H(N +
1). Our aim is to do a large deviation analysis of this recursive algorithm. Define
δθ(N ) = θ(N ) − θ
Then we have
δθ(N + 1) = δθ(N ) + K(N + 1)(X(N + 1) − H(N + 1)θ − H(N + 1)δθ(N ))
We assume that the noise process V (.) is small and perhaps even nonGaussian.
This fact is emphasized by replacing V (n) with �.V (n). Then, the problem
is to calculate the relationship between the LDP rate function of δθ(N ) and
δθ(N + 1).
Stochastic Processes in Classical and Quantum Physics and Engineering 89
6.2. PROBLEM SUGGESTED BY MY COLLEAGUE PROF. DHANANJAY97
M ξ �� (t) = F (t) − M g
where F (t) is the z component of the force exerted by the magnetic field gener
ated by the current induced in the coil upon the magnet. Now, the flux of the
magnetic field generated by the magnet through the coil is given by
�
Φ(t) = n0 Bz (t, X, Y, Z)dXdY dZ
2 ,h≤z≤h+b
X2 +Y 2 ≤RT
where
Bz (t, X, Y, Z) = ẑ.B(t, X, Y, Z),
B(t, X, Y, Z) = curlA(t, R − ξ(t)ẑ)
where
R = (X, Y, Z)
and where RT is the radius of the tube. The current through the coil is then
I(t) = Φ� (t)/R0
where R0 is the coil resistance. The force of the magnetic field generated by
this current upon the falling magnet can now be computed easily. To do this,
we first compute the magnetic field Bc (t, X, Y, Z) generated by the coil using
the usual retarded potential formula
.(−x.sin
ˆ (φ) + ŷ.cos(φ))/|R − RT (x̂.cos(φ) + ŷ.sin(φ)) − ẑ(b/2n0 π)φ|)RT dφ
Note that Ac (t, X, Y, Z) is a function of ξ(t) and also of m(t) and m� (t). It follows
that I(t) is a function of ξ(t), ξ � (t), m(t), m� (t), m�� (t). Then we calculate the
interaction energy between the magnetic fields generated by the magnet and by
the coil as
�
� −1
Uint (ξ(t), ξ (t), t) = (2µ) (Bc (t, X, Y, Z), B(t, X, Y, Z))dXdY dZ
and then we get for the force exerted by the coil’s magnetic field upon the falling
magnet as
F (t) = −∂Uint (ξ(t), ξ � (t), t)/∂ξ(t)
Remark: Ideally since the� interaction energy between the two magnetic fields
appears in the Lagrangian (E 2 /c2 − B 2 )d3 R/2µ, we should write down the
EulerLagrange equations using the Lagrangian
in the form
d
∂L/∂ξ � − ∂L/∂ξ = 0
dt
which yields
d
M ξ �� (t) − ∂U/∂ξ � + ∂Uint /∂ξ + M g = 0
dt
Theorem:
√ √
F (ρ, σ) = T r(| ρ. σ|)
Proof: Let |u >∈ P (ρ). Then, we can write
�
|u >= |ui ⊗ wi >
i
Stochastic Processes in Classical and Quantum Physics and Engineering 91
6.3. FIDELITY BETWEEN TWO QUANTUM STATES 99
where �
σ= |vi >< vi |
i
But then �
< u|v >= < ui |vi >
i
Then we have
ρ = XX ∗ , σ = Y Y ∗
√ √ √ √
= T r( ρ σV U ∗ ) = T r( ρ. σW )
√ √ √ √
= T r[( ρσ. ρ)1/2 ] = T r[( σ.ρ. σ)1/2 ]
92 Stochastic Processes in Classical and Quantum Physics and Engineering
100 CHAPTER 6. LECTURES ON STATISTICAL SIGNAL PROCESSING
ρ ∈ Cp×p
so
V ∈ CN p×p
Note that
V ρV ∗ = ((Ea ρEb∗ ))1≤a,b≤N
as an N p × N p block structured matrix, each block being of size p × p. Now
define
|u >= [1, 0, ..., 0]T ∈ CN
Then, � �
ρ 0
|u >< u| ⊗ ρ = ∈ CN p×N p
0 0
Since the columns of V are orthonormal, we can add columns to it to make it a
square N p × N p unitary matrix U ∗ :
U = [V, W ] ∈ CN p×N p , U ∗ U = IN p
Then, � �� �
∗ ρ 0 V∗
U (|u >< u| ⊗ ρ)U = [V, W ]
0 0 W∗
= V ρV ∗
from which it follows that
�
T r1 (U (|u >< u| ⊗ ρ)U ∗ ) = T r1 (V ρV ∗ ) = Ea ρEa∗
a
Stochastic Processes in Classical and Quantum Physics and Engineering 93
6.5. RLS LATTICE ALGORITHM, STATISTICAL PERFORMANCE ANALYSIS101
X = ((Bij ))1≤i,j≤N
and
T r2 (X) = ((T r(Bij )))1≤i,j≤N ∈ CN ×N
where these vectors are all of size N + 1. Form the data matrix of order p at
time N :
XN,p = [z −1 x(n), ..., z −p x(n)] ∈ RN +1×p
Then the forward prediction filter coefficient estimate is
N
L
−1
N x(n − i)x(n − j) ≈ R(j − i)
n=max(i,j)
So we write
where now δRN (j, i) is a random variable that is a quadratic function of the
Gaussian process {x(n)} plus a constant scalar. Equivalently,
where
Rp = ((R(j − i)))1≤i,j≤p
is the statistical correlation matrix of size p and δRN,p is a random matrix.
Likewise,
N
�
N −1 (z −i x(N ))T x(N ) = N −1 x(n − i)x(n) = R(i) + δRN (0, i)
n=i
Thus,
N −1 XTN,p x(n) = r(1 : p) + δrN (0, 1 : p)
where
r(1 : p) = [R(1), ..., R(p)]T , δrN (1 : p) = [δRN (0, 1), ..., δRN (0, p)]T
where fmn (t, µ) = fmn (a(t), µ). This determines the matrix elements of the
principal series. For the discrete series matrix elements, we observe that these
are
φkmn (x) =< πk (x)em |en >
Stochastic Processes
6.7. PROOF OF in Classical
THE and Quantum
CONVERSE PARTPhysics
OF THE andCQ
Engineering 95
CODING THEOREM103
with
π(X)e−k−1 = 0
Note that we can using the method of lowering operators, write
Now,
� �
h(x)φ−k,−k−1−r,−k−1−s (x)dx = h(x)fk+1 (Y ∗s ; x; Y r )dx
G G
�
= h(Y s ; x; Y ∗r )fk+1 (x)dx
G
where
Is (p, W ) = minσ Ds (p ⊗ W |p ⊗ σ)
where
Note that
so that
lims→0 Ds (p ⊗ W |p ⊗ σ) =
�
p(x)T r(W (x)log(W (x)) − T r(Wp .log(σ))
x
where �
Wp = p(x)Wx
x
We define
�
f (t) = T r(tWx1−s + (1 − t) ps (y)Wy1−s )1/(1−s) , t ≥ 0
y
Note that �
Is (p, W ) = (T r( p(y)Wy1−s )1/(1−s) )1−s
y
it follows that
f (t) ≤ f (0)∀t ≥ 0
Thus
f � (0) ≤ 0
This gives
� �
T r[(Wx1−s − ps (y)Wy1−s ).( ps (y)Wy1−s )s/(1−s) ] ≤ 0
y y
Stochastic Processes
6.7. PROOF OF in Classical
THE and Quantum
CONVERSE PARTPhysics
OF THEandCQ
Engineering 97
CODING THEOREM105
or equivalently,
� �
T r(Wx1−s .( ps (y)Wy1−s )s/(1−s) ) ≤ T r( ps (y)Wy1−s )1/(1−s)
y y
σ = A1/(1−s) /T r(A1/(1−s) )
Thus,
minσ Ds (p ⊗ W |p ⊗ σ) =
�
(−1/s)[T r( p(x)Wx1−s )1/(1−s) ]1−s
x
The maximum of this over all p is attained when p = ps and we denote the
corresponding value of σ by σs . Thus,
�
( x ps (x)Wx1−s )1/(1−s)
σs = �
T r( x ps (x)Wx1−s )1/(1−s)
n
� with φ(i) ∈ A . Let Yi , i = 1, 2, ..., N
Now let φ(i), i = 1, 2, ..., N be a code
be detection operators, ie, Yi ≥ 0, i Yi = 1. Let �(φ) be the error probability
corresponding to this code and detection operators. Then
N
�
1 − �(φ) = N −1 T r(Wφ(i) Yi )
i=1
98 Stochastic Processes in Classical and Quantum Physics and Engineering
106 CHAPTER 6. LECTURES ON STATISTICAL SIGNAL PROCESSING
= N −1 T r(S(φ)T )
where
We observe that
� N
�
T r(S(σn )T ) == N −1 T r(σn Yi ) = T r(σ.N −1 . Yi )
i i=1
= N −1 T r(σn ) = 1/N
where σn = σ ⊗n . Then for s ≤ 0 we have by monotonicity of quantum Renyi
entropy, (The relative entropy between the pinched states cannot exceed the
relative entropy between the original states)
1−s s
log(T r(Wφ(i) σns )
n
�
= log(T r(Wφ1l−s s
(i) σs ))
l=1
n
�
= n.n−1 log(T r(Wφ1−s σ s ))
l (i) s
l=1
n
�
≤ n.log(n−1 . T r(Wφ1−s σ s ))
l (i) s
l=1
�
≤ n.log(T r( ps (y)Wy1−s )1/(1−s) ]1−s )
y
�
= n(1 − s).log(T r( ps (x)Wx1−s )1/(1−s) )
x
Thus, �
T r(S(φ)1−s S(σns )s ) ≤ (T r( ps (x)Wx1−s )1/(1−s) )n(1−s)
x
Stochastic Processes in Classical and Quantum Physics and Engineering 99
6.7. PROOF OF THE CONVERSE PART OF THE CQ CODING THEOREM107
and hence
�
N −s (1 − �(φ))1−s ≤ (T r( ps (x)Wx1−s )1/(1−s) )n(1−s)
x
or equivalently,
�
1 − �(φ) ≤ exp((s/(1 − s))log(N ) + n.log(T r( ps (x)Wx1−s )1/(1−s) ))
x
�
= exp((s/(1 − s))log(N ) + n.log(T r( ps (x)Wx1−s )1/(1−s) ))
x
�
= exp(ns((1 − s)log(N )/n + s−1 log(T r( ps (x)Wx1−s )1/(1−s) )) − − − (a)
x
where �
Wp = p(x)Wx
x
Also define
�
Is (X, Y ) = −(1/s(1 − s))log(T r( ps (x)Wx1−s )1/(1−s) ))
x
R > Is (X, Y )
or equivalently,
liminfn �(φn ) = 1
This proves the converse of the Cq coding theorem, namely, that if the rate of
information transmission via any coding scheme is more that the Cq capacity
I(X, Y ), then the asymptotic probability of error in decoding using any sequence
of decoders will always converge to unity.
where N = N (n), φ(i) ∈ An with φ(i), i = 1, 2, ..., N being iid with probability
distribution pn = p⊗n on An . Define the detection operators
N
� N
�
Y (φ(i)) = ( π)−1/2 .πi .( πj )−1/2 , i = 1, 2, ..., N
j=1 j=1
�N
Then 0 ≤ Yi ≤ 1 and i=1 Yi = 1. Write
�
S = πi , T = πj
j=i
Then,
Y (φ(i)) = (S + T )−1/2 S(S + T )−1/2 , 0 ≤ S ≤ 1, T ≥ 0
and hence we can use the inequality
Then, �
1 − Y (φ(i)) =≤ 2 − 2πi + 4 πj
j:j=i
and hence
N
�
�(φ) = N −1 T r(Wφ(i) (1 − Y (φ(i))))
i=1
N
� �
≤ N −1 T r(W (φ(i))(2 − 2πi ) + 4N −1 T r(W (φ(i))πj )
i=1 j
i=
Note that since for i = � j, φ(i) and φ(j) are independent random variables in
An , it follows that W (φ(i)) and πj are also independent operator valued random
variables. Thus,
N
�
E(�(φ)) ≤ (2/N ) E(T r(W (φ(i)){W (φ(i)) ≤ 2N Wpn }))
i=1
�
+(4/N ) T r(E(W (φ(i))).E{W (φ(j)) > 2N Wpn })
i=j,i,j=1,2,...,N
�
=2 pn (xn )T r(W (xn ){W (xn ) ≤ 2N.W (pn )})
xn ∈An
�
+4(N − 1) pn (xn )pn (yn )T r(W (xn ){W (yn ) > 2N W (pn )})
xn ,yn ∈An
�
= (2/N ) pn (xn )T r(W (xn ){W (xn ) ≤ 2N.W (pn )})
xn
�
+4(N − 1) pn (yn )T r(W (pn ){W (yn ) > 2N W (pn )})
yn ∈An
�
≤2 pn (xn )T r(W (xn )1−s (2N W (pn ))s )
xn
�
+4N pn (yn )T r(W (yn )1−s W (pn )s (2N )s−1 )
yn
Then define
where
ρ(g, x) = (dµ(xH)/dµ(gxH))1/2 , g, x ∈ G
with µ a measure on G/H. U is called the representation of G induced by the
representation σ of H. We define
�
� f �= |f (g)|2 dµ(g)
G
There is yet another way to describe the induced representation, ie, in a way
that yields a unitary representation of G that is equivalent to U . The method
is as follows: Let X = G/H and let F1 be the space of all square integrable
Stochastic Processes in Classical and Quantum Physics and Engineering 103
6.9. INDUCED REPRESENTATIONS 111
γ : G × X → U(V )
satisfies
γ(g1 g2 , x) = γ(g1 , g2 x)γ(g2 , x), g1 , g2 ∈ G, x ∈ X
Then define a representation U1 of G in F1 by
Then,
b(g, x)H = s(gx)−1 gx = H
because
gs(x)H = gx, s(gx)H = gx
Now observe that
b(g1 , g2 x)b(g2 , x) = s(g1 g2 x)−1 g1 s(g2 x)s(g2 x)−1 g2 s(x) = s(g1 g2 x)−1 g1 g2 s(x)
= b(g1 g2 , x), g1 , g2 ∈ G, x ∈ X
implying that γ = σ(b) is a (G, X, V ) cocycle, ie,
Then
U (g)T (f1 )(g1 ) = U (g)f (g1 ) = ρ(g, g −1 g1 H)f (g −1 g1 )
= ρ(g, g −1 g1 H)γ(g −1 g1 , H)−1 f1 (g −1 g1 H)
= ρ(g, g −1 g1 H)(γ(g −1 , g1 H)γ(g1 , H))−1 f1 (g −1 g1 H)
On the other hand,
Now,
γ(g1 , H) = γ(gg −1 g1 , H) = γ(g, g −1 g1 H)γ(g −1 g1 , H)
or equivalently,
Thus,
T (U1 (g)f1 )(g1 ) = ρ(g, g −1 g1 H)γ(g −1 g1 , H)−1 f1 (g −1 g1 H)
= ρ(g, g −1 g1 H)T (f1 )(g −1 g1 ) = U (g)T (f1 )(g1 )
In other words,
T U1 (g) = U (g)T
Thus, U1 and U are equivalent unitary representations.
� �
= (µ/4π) n
(1/c n!) (∂tn J(t, r� ))|r − r� |n−1 d3 r�
n≥0
For |r| greater than the source dimension and with θ denoting the angle between
the vectors r and r� , we have the expansion
�
|r − r� |α = (r2 + r 2 − 2r.r� )α/2
= rα − α.rα−2 r.r�
Further,
In the special case when the source is a wire carrying a time varying current
I(t), we can write
J(t, r� )d3 r� = I(t)dr�
and then the above equation gets replaced by
Chapter 7
Chapter 7
Some Aspects of Stochastic
Processes
Some aspects of stochastic
processes
�n[1] In a single server queue, let the arrivalsthtake place at the times Sn =
k=1 Xk , n ≥ 1, let the service time of the n customer be Yn and let his
waiting time in the queue be Wn . Derive the recursive relation
Wn+1 = max(0, Wn + Yn − Xn+1 ), n ≥ 1
where W1 = 0. If the Xn� s are iid and the Yn� s are also iid and independent
of {Xn }, then define Tn = Yn − Xn+1 and prove �nthat Wn+1 has the same
distribution as that of max0≤k≤n Fk where Fn = k=1 Tk , n ≥ 1 and F0 = 0.
When Xn , Yn are respectively exponentially distributed with parameters λ, µ
respectively explain the sequence of steps based on ladder epochs and ladder
heights by which you will calculate the distribution of the equilibrium waiting
time W∞ = limn→∞ Wn in the case when µ > λ. What happens when µ ≤ λ ?
Solution: The (n + 1)th customer arrives at the time Sn+1 . The previous, ie,
the nth customer departs at time Sn + Wn + Yn . If this time of his departure is
smaller than Sn+1 , (ie, Sn + Wn + Yn − Sn+1 = Wn + Yn − Xn+1 < 0), then the
(n + 1)th customer finds the queue empty on arrival and hence his waiting time
Wn+1 = 0. If on the other hand, the time of departure Sn + Wn + Yn of the nth
customer is greater than Sn+1 , the arrival time of the (n + 1)th customer, then
the n + 1th customer has to wait for a time
Wn+1 = (Sn + Wn + Yn ) − Sn+1 = Wn + Yn − Xn+1
before his service commences. Therefore, we have the recursion relation
Wn+1 = max(0, Wn + Yn − Xn+1 ), n ≥ 0
115
107
108
116 Stochastic 7.
CHAPTER Processes
SOMEinASPECTS
Classical and
OFQuantum Physics PROCESSES
STOCHASTIC and Engineering
By iteration, we find
Now the Tk� s are iid. Therefore, Wn+1 has the same distribution as
n
�
max(0, Tn+1−k , r = 1, 2, ..., n)
k=r
r
�
= max(0, Tk , r = 1, 2, ..., n)
k=1
provided that this limiting � r.v. exists, ie, the provided that the infinite series
n
converges w.p.1. Let Fn = k=1 Tk , n ≥ 1, F0 = 0
Define τ1 denote the first time k ≥ 1 at which Fk > F0 = 0. For n ≥ 1, let
τn+1 denote the first time k ≥ 1 at which Fk > Fτn , ie, the n + 1th time k at
which Fk exceeds all of Fj , j = 0, 1, ..., k − 1. Then, it is clear that W∞ has the
same distribution as that of limFτn , provided that τn exists for all n. In the
general case, we have
P (W∞ ≤ x) = limn→∞ P (Fτn ≤ x)
where if for a given ω, τN (ω) exists for some positive integer N = N (ω) but
τN +1 (ω) does not exist, then we set τk (ω) = τN (ω) for all k > N . It is also
clear using stopping time theory, that the r.v’s Un+1 = Fτn+1 − Fτn , n = 1, 2, ...
are iid and nonnegative and we can write
∞
�
W∞ = U n , U0 = 0
n=0
Stochastic Processes in Classical and Quantum Physics and Engineering 109
117
[2] Consider a priority queue with two kinds of customers labeled C1 , C2 and
a single server. Customers of type C1 arrive at the rate λ1 and customers of type
C2 arrive at the rate λ2 . A customer of type C1 is served at the rate µ1 while a
customer of type C2 is served at the rate µ2 . If a customer of type C1 is present
in the queue, then his service takes immediate priority over a customer of type
C2 , otherwise the queue runs on a first come first served basis. Let X1 (t) denote
the number of type C1 customers and X2 (t) the number of type C2 customers
in the queue at time t. Set up the ChapmanKolmogorov differential equations
for
P (t, n1 , n2 ) = P r(X1 (t) = n1 , X2 (t) = n2 ), n1 , n2 = 0, 1, ...
and also set up the equilibrium equations for
Solution: Let θ(n) = 1if n ≥ 0 and θ(n) = 0 if n < 0 and let δ(n, m) = 1 if
n = m else δ(n, m) = 0.
P (t+dt, n1 , n2 ) = P (t, n1 −1, n2 )λ1 dt.θ(n1 −1)+P (t, n1 , n2 −1)λ2 dt.θ(n2 −1)+
∂t P (t, n1 , n2 ) =
The equilibrium equations are obtained by setting the rhs to zero, ie ∂t P (n1 , n2 ) =
0.
and hence �
Snk (t)Snk (s) = min(t, s)
nk
Let ξ(n, k), k ∈ I(n), n ≥ 1, ξ(0, 0) be iid N (0, 1) random variables and define
the random functions
N
� �
BN (t) = ξ(n, k)Snk (t), N ≥ 1
n=0 k∈I(n)
Then since Snk (t) are continuous functions, BN (t) is also continuous for all N .
�
Further, the Snk s for a fixed n are nonoverlapping functions and
Thus, �
maxt |Snk (t)| = 2−(n+1)/2
k∈I(n)
Let
b(n) = max(|ξ(nk)| : k ∈ I(n)}
Then, by the union bound
� ∞ √
n n+1
P (b(n) > n) ≤ 2 .P (|ξ(nk)| > n) = 2 exp(−x2 /2)dx/ 2π
n
≤ K.2n+1 n.exp(−n/ 2)
Stochastic Processes in Classical and Quantum Physics and Engineering 111
119
Thus, �
P (b(n) > n) < ∞
n≥1
which means that by the BorelCantelli lemma, for a.e.ω, there is a finite N (ω)
such that for all n > N (ω), b(n, ω) < n. This means that for a.e.ω,
N
� �
|ξ(n, k, ω)||Snk (t)|
n=1 k∈I(n)
N
�
≤ n.2−(n+1)/2
n=1
N
� �
= δ(n, m)δ(k, r)Snk (t)Smr (s)
n,m=0 k,r∈I(n)
N
� � ∞
� �
= Snk (t)Snk (s) → Snk (t)Snk (s) = min(t, s)
n=0 k∈I(n) n=0 k∈I(n)
Thus,
� t � 1
sphi(s)ds + t φ(s)ds = λφ(t)
0 t
112 Stochastic Processes in Classical and Quantum Physics and Engineering
120 CHAPTER 7. SOME ASPECTS OF STOCHASTIC PROCESSES
Differentiating,
� 1
tφ(t) + φ(s) − tφ(t) = λφ� (t)
t
Again differentiating
−φ(t) = λφ�� (t)
Let λ = 1/ω 2 . Then, the solution is
B(−t.cos(ωt)/ω + sin(ωt)/ω 2 )
� 1
φ(s)ds = A(sin(ω) − sin(ωt))/ω + (B/ω)(cos(ωt) − cos(ω))
t
Substituting all these expressions into the original integral equation gives
B(−t.cos(ωt)/ω + sin(ωt)/ω 2 )
+tA(sin(ω) − sin(ωt))/ω + (Bt/ω)(cos(ωt) − cos(ω))
= (A.cos(ωt) + B.sin(ωt))/ω 2
On making cancellations and comparing coefficients,
Therefore,
A = 0, ω = (n + 1/2)π, n ∈ Z
and for normalization � 1
φ2 (t)dt = 1
0
we get
� 1
2
B sin2 (ωt)dt = 1
0
or
(B 2 /2)(1 − sin(2ω)/2ω) = 1
or √
B= 2
Thus, the KL expansion for Brownian motion is given by
��
B(t) = λn ξ(n)φn (t)
n∈Z
Stochastic Processes in Classical and Quantum Physics and Engineering 113
121
� √
= ((n + 1/2)π)−2 ξ(n). 2sin((n + 1/2)πt)
n∈Z
[c] Reflection principle for Brownian motion with application to the evalua
tion of the distribution of the first hitting time of the process at a given level.
[d] The BorelCantelli lemmas with application to proving almost sure con
vergence of a sequence of random variables.
[e] The FeynmanKac formula for solving Schrodinger’s equation using Brow
nian motion with complex time.
[f] Solving Poisson’s equation �2 u(x) = s(x) in an open connected subset of
R with the Dirichlet boundary conditions using Brownian motion stop times.
n
[4] A bulb in a bulb holder is changed when it gets fused. Let Xn denote
the time for which the nth bulb lasts. If n is even, then Xn has the exponential
density λ.exp(−λ.x)u(x) while if n is odd then Xn has the exponential density
µ.exp(−µ.x)u(x). Calculate (a) the probability distribution of the number of
bulbs N (t) = max(n : Sn ≤ t) changed in [0, t] where Sn = X1 + ... + Xn , n ≥ 1
and S0 = 0. Define
Pictorially illustrate and explain the meaning of the r.v’s δ(t), γ(t). What hap
pens to the distributions of these r.v’s as t → ∞ ?
Solution: Let n ≥ 1.
� �
P (N (t) ≥ n) = P (Sn ≤ t) = P ( Xk + Xk ≤ t)
1≤k≤n,keven 1≤k≤n,nodd
Specifically,
n
� n
�
P (N (t) ≥ 2n) = P ( X2k + X2k−1 ≤ t)
k=1 k=1
= F ∗n ∗ G∗n (t)
where
F (t) = 1 − exp(−λt), G(t) = 1 − exp(−µt)
Note that with f (t) = F � (t), g(t) = G� (t), we have
so
L(f ∗n (t)) = λn /(λ + s)n , L(g ∗n (t)) = µn /(µ + s)n
and hence
say. Then,
� t
P (N (t) ≥ 2n) = ψn (x)dx
0
Likewise,
� t
P (N (t) ≥ 2n − 1) = P (S2n−1 ≤ t) = φn (x)dx
0
where
δ(t) = t − SN (t) is the time taken from the last change of the bulb to the current
time. γ(t) = SN (t)+1 − t is the time taken from the current time upto the next
change of the bulb. Thus, for 0 ≤ x ≤ t,
�
P (δ(t) ≤ x) = P (SN (t) > t − x) = P (Sn > t − x, N (t) = n)
n≥1
� �
= P (t − x < Sn ≤ t < Sn+1 ) = P (t − x < Sn < t, Xn+1 > t − Sn )
n≥1 n≥1
�� t
= P (Xn+1 > t − s)P (Sn ∈ ds)
n≥1 t−x
Likewise,
�
P (γ(t) ≤ x) = P (SN (t)+1 ≤ t + x) = P (Sn+1 ≤ t + x, Sn ≤ t < Sn+1 )
n≥0
� �� t
= P (Sn ≤ t < Sn+1 ≤ t + x) = P (Sn ∈ ds)P (t − s < Xn+1 < t + x)
n≥ 0 n≥0 0
where c(n) are any r.v’s is cyclostationary with period 2π/ω. In fact, this process
is a periodic process. A wide sense cyclostationary process is a process whose
mean and autocorrelation function are periodic with period T for some fixed
T > 0, ie, if
µ(t) = E(X(t)), R(t, s) = E(X(t)X(s))
then
µ(t + T ) = µ(t), R(t + T, s + T ) = R(t, s)∀t, s ∈ R
An example of a WSS process that is not stationary is
where ω1 + ω2 = ω3 and φ1 , φ2 are iid r.v’s uniformly distributed over [0, 2π).
This process has second moments that are invariant under time shifts but its
third moments are not invariant under time shifts. Likewise, an example of a
WScyclostationary process that is not cyclostationary is the process X(t)+Y (t)
with Y (t) having the same structure as X(t) but independent of it with frequen
cies ω4 , ω5 , ω6 (and phases φ4 , φ5 , φ4 + φ5 with φ1 , φ2 , φ4 , φ5 being independent
and uniform over [0, 2π)) such that (ω1 + ω2 − ω3 )/(ω4 + ω5 − ω6 ) is irrational.
This process is also in fact WSS. To see why it is not cyclostationary, we compute
its third moments:
E(X(t)X(t+t1 )X(t+t2 ))
= E(cos(ω3 t+φ1 +φ2 ).cos(ω1 (t+t1 )+φ1 ).cos(ω2 (t+t2 )+φ2 ))
116 Stochastic Processes in Classical and Quantum Physics and Engineering
124 CHAPTER 7. SOME ASPECTS OF STOCHASTIC PROCESSES
= (1/4)(cos((ω1 + ω2 − ω3 )t + ω1 t1 + ω2 t2 )
+cos((ω1 + ω2 − ω3 )t + ω2 t1 + ω1 t2 ))
+(1/4)(cos((ω4 + ω5 − ω6 )t + ω4 t1 + ω5 t2 )
+cos((ω4 + ω5 − ω6 )t + ω5 t1 + ω4 t2 ))
which is clearly not periodic in t since (ω1 + ω2 − ω3 )/(ω4 + ω5 − ω6 ) is irrational.
E⊂ Oα
α∈I
E⊂ O αk
k=1
(b) What is a regular measure on (Rn , B(Rn ))?. Show that every probability
measure on (Rn , B(Rn )) is regular.
Solution: A measure µ on (Rn , B(Rn )) is said to be regular if given any
B ∈ B(Rn ) and � > 0, there exists an open set F and a closed set C in Rn such
that C ⊂ B ⊂ F and µ(F − C) < �. Let us now show that any probability
measure (and hence any finite measure) on (Rn , B(Rn )) is regular. Let P be a
probability measure on the above space. Define C to be the family of all sets
B ∈ B(Rn ) such that ∀� > 0, there exists a closed set C and an open set F such
that C ⊂ B ⊂ F and P (F −C) < �. We prove that C is a σfield and then since it
contains all closed sets (If C is a closed set and Fn = {x ∈ Rn : d(x, C) < 1/n},
then Fn is open, C ⊂ C ⊂ Fn and P (Fn − C) → 0 because Fn ↓ C), it
will follow that C = B(Rn ) which will complete the proof of the result. Let
B ∈ C and let � > 0. Then, there exists a closed C and an open F such that
C ⊂ B ⊂ F and P (F − C) < �. Then F c is closed, C c is open, F c ⊂ B c ⊂ C c
and P (C c − F c ) = P (F − C) < �, thereby proving that B c ∈ C, ie, C is closed
under complementation. Now, let Bn ∈ C, n = 1, 2, ... and let � > 0. Then, for
each n we can choose a closed NCn and an open Fn such that Cn ⊂ Bn ⊂F∞ n with
P (Fn − Cn ) < �/2n .
Then, n=1 Cn is closed for any finite integer N , n=1 Fn
is open and for B = n Bn , we have that
N
Cn ⊂ B ⊂ Fn , N ≥ 1
n=1 n
118126 Stochastic 7.
CHAPTER Processes
SOMEinASPECTS
Classical and
OFQuantum Physics PROCESSES
STOCHASTIC and Engineering
and further
N
N
P( Fn − Cn ) ≤ P (( (Fn − Cn )) Fn )
n n=1 n=1 n>N
N
� � N
� �
≤ P (Fn − Cn ) + P (Fn ) ≤ �/2n + P (Fn )
n=1 n>N n=1 n>N
F= (B(RN ) × RZ+ )
N ≥1
Stochastic Processes in Classical and Quantum Physics and Engineering 119
127
Note that
BN = B(RN ) × RZ+
is a field on RZ+ for every N ≥ 1 and that
BN ⊂ BN +1 , N ≥ 1
˜N ∈
It suffices to prove that Q is countably additive on F , or equivalently, if B
˜ ˜
BN is a sequence such that BN ↓ φ, then Q(BN ) ↓ 0. Equivalently, it suffices
to show that if BN ∈ B(RN ) is a sequence such that
˜ N = BN × R Z + ↓ φ
B
then
PN (BN ) ↓ 0
Suppose that this is false. Then, without loss of generality, we may assume that
there is a δ > 0 such that PN (BN ) ≥ δ∀N ≥ 1. By regularity of probability
measures on RN for N finite, we can find for each N ≥ 1, a compact set
KN ∈ B(RN ) such that KN ⊂ BN and PN (BN − KN ) < δ/2N . Now consider
Then
K̃N ⊂ KN ⊂ RN
and K˜ N being a closed subset of the compact set KN is also compact. K ˜ N is
closed because each Kj , j = 1, 2, ..., N is compact and hence closed. Define
˜ N × RZ + , N ≥ 1
ˆN = K
K
ˆ N is decreasing and that
Then, it is clear that K
˜ N = BN × R Z +
K̂N ⊂ B
Further,
ˆ N ) = Q(B
Q(K ˜N ) − Q(B
˜N − K
ˆN )
Now,
˜N ) = PN (BN ) ≥ δ
Q(B
and
N
�
˜N − K
Q(B ˆ N ) = PN (BN − (Kk × RN −k ))
k=1
N
= PN ( (BN − (Kk × RN −k ))
k=1
N
≤ PN ( (Bk − Kk ) × RN −k )
k=1
120 Stochastic Processes in Classical and Quantum Physics and Engineering
128 CHAPTER 7. SOME ASPECTS OF STOCHASTIC PROCESSES
N
� N
�
≤ Pk (Bk − Kk ) = δ/2k ≤ δ/2
k=1 k=1
Thus,
ˆ N ) ≥ δ/2
Q(K
and hence K ˆ N is nonempty for each N . Hence, so is K
˜ N for each N . Now
ˆ ˜ ˆ
KN ⊂ BN ↓ φ. Hence KN ↓ φ and we shall obtain a contradiction to this as
˜ N is nonempty, we may choose
follows: Since K
˜ N , N = 1, 2, ...
(xN 1 , ..., xN N ) ∈ K
Then,
xN 1 ∈ K1 , (xN 1 , xN 2 ) ∈ K2 , ..., (xN 1 , ..., xN N ) ∈ KN
Since K1 is compact, xN 1 , N ≥ 1 has a convergent subsequence say {xN1 ,1 }.
Now, (xN1 ,1 , xN1 ,2 ) ∈ K2 and since K2 is compact, this has a convergent subse
quence say (xN2 ,1 , xN2 ,2 ). Then, {xN2 ,1 } is a subsequence of {xN1 ,1 } and hence
also converges. Continuing this way, we finally obtain a convergent sequence
(yn,1 , ..., yn,N ), n = 1, 2, ... in KN such that (yn,1 , ...yn,j ) ∈ Kj , j = 1, 2, ..., n
and of course (yn,1 , ..., yn,j ), n ≥ 1 converges. Let
Hence
(y1 , y2 , ...yN ) ∈ K̃N , N ≥ 1
and hence,
ˆN , N ≥ 1
(y1 , y2 , ...) ∈ K
which means that �
(y1 , y2 , ...) ∈ ˆN
K
N ≥1
which contradicts the fact that the rhs is an empty set. This proves the Kol
mogorov consistency theorem.
Stochastic Processes in Classical and Quantum Physics and Engineering 121
7.1. PROBLEM SUGGESTED BY PROF.DHANANJAY 129
where Bc (t, r) is the magnetic field produced by the coil. The coil current is
where RL is the coil resistance. Then, the magnetic field produced by the coil
is
1 2n0 π
Bc (t, r) = (µ/4π)I(t) [(−ˆ
x.sin(φ)+ˆ
y.cos(φ))×(r−R0 (cos(φ)ˆ
x
0
+sin(φ)ŷ)−(a/2πn0 )φẑ]
that Bm (t, r) is also a function of r0 (t). Hence, Φ(t) is also a function of r0 (t)
and hence I(t) is also a function of r0 (t) and r�0 (t). Thus, Bc (t, r) is a function
of r0 (t), r�0 (t), m(t), m� (t), ie, we can express it as
dm(t)/dt = γ.m(t) × Bc (t, r0 (t)|r0 (t), r0� (t), m(t), m� (t)) − − − (2)
Next, the force exerted on the magnet by the magnetic field produced by the
coil is given by
F(t) = �r0 (t) (m(t).Bc ((t, r0 (t)|r0 (t), r0� (t), m(t), m� (t))
which means that the rectilinear equation of motion of the magnet must be
given by
M r0�� (t) = F(t) − M gẑ
However to be more precise, the term V = −m(t).Bc (t, r0 (t)|r0 (t), r0� (t), m(t), m� (t))
represents a potential energy of interaction between the magnet and the reac
tion field produced by the coil. So its recilinear equation of motion should be
given by
d/dt∂L/∂r0� (t) − ∂L/∂r0 (t) = 0
where L, the Lagrangian is given by
M r0�� (t) + d/dt(�r�0 (t) m(t).Bc (t, r0 (t)|r0 (t), r0� (t), m(t), m� (t)))
−�r0 (t) (m(t).Bc (t, r0 (t)|r0 (t), r0� (t), m(t), m� (t))) + mgẑ = 0 − − − (3)
(2) and (3) constitute the complete set of six coupled ordinary differential equa
tions for the six components of m(t), r0 (t).
Pu i = P
let D1 , ..., DM be disjoint sets in B n such that νuk (Dk ) > 1 − �, k = 1, 2, ..., n,
where νu (v) = Πnk=1 νu(k) (v(k)) for u = (u(1), ..., u(n)), v = (v(1), ..., v(n)). Let
vx ∈ T (N (x|u), νx , δ)∀x ∈ A
Now, �
P (x)νx (y) = Q(y),
x
� �
P (x)νx (y)2 ≥ ( P (x)νx (y))2 = Q(y)2
x x
and therefore,
√ �
|N (y|v) − nQ(y)| ≤ δ an Q(y)(1 − Q(y)
proving the claim. For v ∈√Ek , we have v ∈ T (n, νuk , δ) and hence by the result
just proved, v ∈ T (n, Q, δ a).
and since �
|N (y|v) − nQ(y)| ≤ δ nQ(y)(1 − Q(y)), y ∈ B
we have
� �
nQ(y) − δ nQ(y)(1 − Q(y)) ≤ N (y|v) ≤ nQ(y) + δ nQ(y)(1 − Q(y)), y ∈ B
and therefore,
√ √
Πy Q(y)nQ(y)+δ nQ(y)(1−Q(y))
≤ Qn (v) ≤ Πy Q(y)nQ(y)−δ nQ(y)(1−Q(y))
where ��
c = −δ Q(y)(1 − Q(y))log2 (Q(y))
y
It follows that
M
�
√ √
µ(Ek ) = µ( Ek ) ≤ µ(T (n, Q, δ a)) ≤ 2nH(Q)+c an
k=1 k
and hence
νu (T (n, νu , δ)) ≥ 1 − ab/δ 2
for any u ∈ An . In particular,
and hence, � √
νu (v) ≤ 2− x N (x|u)H(νx )−c n
Stochastic Processes in Classical and Quantum Physics and Engineering 125
7.2. CONVERSE OF THE SHANNON CODING THEOREM 133
√
= 2−nI(P,ν)+c n
where �
I(P, ν) = P (x)H(νx ), P = Pu
x
Thus, √
(1 − ab/δ 2 − �)2nI(P,ν)−c n
≤ µ(Ek ), k = 1, 2, ..., M
so that
√ � √
M (1 − ab/δ 2 − �)2nI(P,ν)−c n
≤ µ(Ek ) ≤ 2nH(Q)+c n
or equivalently,
√
log(M )/n ≤ H(Q) − I(P, ν) + O(1/ n)
or equivalently √ √
(g µα g νβ −gFαβ ),ν = J µ −g
Writing
gµν (x) = ηµν + hµν (x)
where η is the Minkowski metric and h is a small perturbation, we have approx
imately
g µν = ηµν − hµν , hµν = ηµα ηνβ hαβ
and √
−g = 1 − h/2, h = ηµν hµν
We have
Fµν = Aν,µ − Aµ,ν
and writing
Aµ = A(0) (1)
µ + Aµ
(1)
where Aµ(0) is the zeroth order solution, ie, without gravity and Aµ the first
order correction to the solution, ie, it is a linear functional of hµν , we obtain
using first order perturbation theory
(0) (1)
Fµν = Fµν + Fµν
135
127
128 Stochastic Processes in Classical and Quantum Physics and Engineering
136CHAPTER 8. PROBLEMS IN ELECTROMAGNETICS AND GRAVITATION
where
(k)
Fµν = A(k) (k)
ν,µ − Aµ,ν , k = 0, 1
√
g µα g νβ −g =
(ηµα − hµα )(ηνβ − hνβ )(1 − h/2)
= ηµα ηνβ − fµναβ
where
fµναβ = ηµα ηνβ h/2 + ηµα hνβ
+ηνβ hµα
Then √
g µα g νβ −gFαβ =
(0)
ηµα ηνβ Fαβ
(1) (0)
+ηµα ηνβ Fαβ − fµναβ Fαβ
= F 0µν + F 1µν
where
(0)
F 0µν = ηµα ηνβ Fαβ ,
(1) (0)
F 1µν = ηµα ηνβ Fαβ − fµναβ Fαβ
Also √
J µ −g = J 0µ + J 1µ , J 0µ = J µ , J 1µ = −J µ h/2
The gauge condition √
(Aµ −g),µ = 0
becomes √
0 = (g µν −gAν ),µ =
[(ηµν − hµν )(1 − h/2)Aν ],µ
= [(ηµν − kµν )(A(0) (1)
ν + Aν )],µ
where
kµν = hµν − hηµν /2
and hence equating coefficients of zeroth and first orders of smallness on both
sides gives
ηµν A(0)
ν,µ = 0,
Someaspects
Some more More Aspects
of of
Quantum information
quantum Information Theory
theory
so that < ea |eb >= δ(a, b), < fa |fb >= δ(a, b). The relative entropy between
these two states is
D(ρ|σ) = T r(ρ.(log(ρ) − log(σ)))
� �
= p(a)log(p(a)) − p(a)log(qb )| < ea |fb > |2
a a,b
�
= (p(a)| < ea |fb > |2 (log(p(a)| < ea |fb > |2 ) − log(qb | < ea |fb > |2 ))
a,b
�
= (P (a, b)log(P (a, b)) − log(Q(a, b))) = D(P |Q)
a,b
P (a, b) = p(a)| < ea |fb > |2 , Q(a, b) = q(b)| < ea |fb > |2
137
129
130138CHAPTER
Stochastic
9. SOME Processes
MOREin Classical and
ASPECTS Quantum Physics
OF QUANTUM and Engineering
INFORMATION THEORY
� �
= p(a)(1− < ea |T |ea >) − q(a)| < fa |eb > |2 < eb |T |fa >
a a,b
2 ∗
Suppose T is a projection, ie, T = T = T . Then,
� �
T r(ρ(1 − T )) = p(a) < ea |1 − T )|ea >= p(a) < ea |(1 − T )2 |ea >
a a
�
= p(a)| < ea |1 − T |fb > |2
a,b
Likewise, �
T r(σT ) = q(b) < ea |T |fb > |2
a,b
Hence,
�
2P (�) = (p(a)| < ea |1 − T |fb > |2 + q(b) < ea |T |fb > |2 )
a,b
�
≥ min(p(a), q(b))(| < ea |1 − T |fb > |2 + q(b) < ea |T |fb > |2 )
a,b
�
≥ (1/2)min(p(a), q(b))| < ea |fb > |2
a,b
because if u, v are positive real numbers, then as t varies over [0, 1], tu + (1 − t)v
attains is minimum value of v when t = 0 if u ≥ v and u when t = 1 if u < v.
In other words,
mint∈[0,1] (tu + (1 − t)v) = min(u, v)
In other words we have proved that
�
2P (�) ≥ (1/2)min0≤t(a,b)≤1,a,b=1,2,...,n [P (a, b)(1 − t(a, b)) + Q(a, b)t(a, b)]
a,b
The quantity on the rhs is the minimum error probability for the classical hy
pothesis testing problem between the two bivariate probability distributions
P, Q.
Stochastic Processes in Classical and Quantum Physics and Engineering 131
9.2. OPERATOR CONVEX FUNCTIONS AND OPERATOR MONOTONE FUNCTIONS1
f (K ∗ XK) ≤ K ∗ f (X)K
we have
KL = M K, LK ∗ = K ∗ M
Define the matrix � �
K −M
U=
L K∗
and observe that since L∗ = L, M ∗ = M and
K ∗ K + L2 = I, −K ∗ M + LK ∗ = 0, −M K + KL = 0, M 2 + KK ∗ = I
and
f (KBK ∗ + M AM ) ≤ Kf (B)K ∗ + M f (A)M
132 Stochastic Processes in Classical and Quantum Physics and Engineering
140CHAPTER 9. SOME MORE ASPECTS OF QUANTUM INFORMATION THEORY
we have
p
� p
�
f( Kj∗ Aj Kj ) ≤ Kj∗ f (Aj )Kj
j=1 j=1
In fact, defining ⎛ ⎞
K1 0 ... 0
⎜ K2 0 ... 0 ⎟⎟
K=⎜
⎝ .. .. ... .. ⎠
Kp 0 ... 0
we observe that � �p �
Kj∗ Kj 0
K ∗K = j=1 ≤I
0 0
ie K is a contraction and hence by the above result
f (K ∗ T K) ≤ K ∗ f (T )K
Taking
T = diag[A1 , A2 , ..., Ap ]
we get � �p �
Kj∗ Aj Kj 0
K ∗T K = j=1
0 0
and hence, � �p �
f( j=1 Kj∗ Aj Kj ) 0
0 f (0)
� �p �
Kj∗ f (Aj )Kj 0
j=1
≤
0 0
which results in the desired inequality. Note that from the above discussion, we
also have � �
f( Kj Bj Kj∗ ) ≤ Kj f (Bj )Kj∗
j j
Stochastic Processes in Classical and Quantum Physics and Engineering 133
9.3. A SELECTION OF MATRIX INEQUALITIES 141
A−1 (AB)A = BA
B −1 (BA)B = AB
Thus, (A + cn I)B has the same eigenvalues as those of B(A + cn I) for each n.
Taking the limit n → ∞, the result follows since the eigenvalues of (A + cI)B
are continuous functions of c.
[2] Let A be any square diagonable matrix. Let λ1 , ..., λn be its eigenvalues.
We have for any unit vector u,
Specifically,
wk+1 = r(k + 1, k)ek + r(k + 1, k + 1)ek+1 , ..., wn = r(n, k)ek + ... + r(n, n)en
Then,
< fk , Aek >= λk
Thus,
|λk | ≤� fk � . � Aek �
Let A⊗a k denote the kfold tensor product of A with itself acting on the
k fold antisymmetric tensor product of Cn . Then, the eigenvalues of A⊗a n
are λi1 ...λik with n ≥ i1 > i2 > ... > ik ≥ 1 and likewise its singular values
are si1 ...sik , n ≥ i1 > i2 > ... > ik ≥ 1 where s1 ≥ s2 ≥ ... ≥ sn ≥ 0 are
the singular values of A. Now if T is any operator whose eigenvalue having
maximum magnitude is µ1 and whose maximum singular value is s1 , then we
have
|µ1 | = spr(T ) = limn � T n �1/n ≤� T �= s1
Applying this result to T = A⊗a k gives us
� AB �≤� BA �
In fact since AB is normal, its spectral radius coincides with its spectral norm
and further, since AB and BA have the same eigenvalues, the spectral radii of
AB and BA are the same and finally since the spectral radius of any matrix
is always smaller than its spectral norm (� T n �1/n ≤� T �), the result follows
immediately:
� AB �= spr(AB) = spr(BA) ≤� BA �
First we show that this inequality holds for s = 1/2, then we show that if it
holds for s = µ, ν, then it also holds for s = (µ + ν)/2 and that will complete
the proof.
1/2 1/2 1/2 1/2
| < y|S1 S2 + T1 T2 |x > |2
1/2 1/2 1/2 1/2
≤ (| < S1 y, S2 x > | + | < T1 y, T2 x > |)2
1/2 1/2 1/2 1/2
≤ [� S1 y � . � S2 x � + � T1 y � . � T2 x �]2
1/2 1/2
≤ [� S1 y �2 + � T1 y �2 ]
1/2 1/2
×[� S2 x �2 + � T2 x �2 ]
=< y, (S1 + T1 )y >< x, (S2 + T2 )x >≤< y, R1 y >< x, R2 x >
−1/4 1/4 −1/4 1/4
Then, replacing y by R1 R2 x and x by R2 R1 x in this inequality gives
us (recall that R1 and R2 are assumed to commute)
−1/4 1/4 1/2 1/2 1/2 1/2 −1/4 1/4
< x|R1 R2 (S1 S2 + T1 T2 )R2 R1 |x >
1/2 1/2
≤< x, R1 R2 x >
Now define the matrices
−1/4 1/4 1/2 1/2 1/2 1/2 −1/4 1/4
B = R1 R2 (S1 S2 + T1 T 2 , A = R 2 R1
This implies
−1/2 −1/2
| < R1 y, CR2 x > | ≤� y � . � x �
136 Stochastic Processes in Classical and Quantum Physics and Engineering
144CHAPTER 9. SOME MORE ASPECTS OF QUANTUM INFORMATION THEORY
and by normality of
−1/4 −1/4 −1/4 −1/4
R1 R2 CR1 R2 =
1/4 −1/4 −1/2 −1/4 −1/4
(R1 R2 )(R1 CR1 R2 ) = PQ
where
1/4 −1/4 −1/2 −1/4 −1/4
P = (R1 R2 ), Q = (R1 CR1 R2 )
we get
� P Q �≤� QP �=
−1/2 −1/2
� R1 CR2 �≤ 1
which implies since P Q is positive that
PQ ≤ I
or equivalently
−1/4 −1/4 −1/4 −1/4
(R1 R2 CR1 R2 )≤I
from which we deduce that (Note that R1 and R2 commute)
1/2 1/2
C ≤ R1 R2
and
R11−ν R2ν ≥ S11−ν S2ν + T11−ν T2ν ,
Then, replacing R1 by R11−µ R2µ , R2 by R11−ν R2ν , S1 by S11−µ S2µ , S2 by S11−ν S2ν ,
T1 by T11−µ T2µ and T2 by T11−ν T2ν and noting that these replacements satisfy all
the required conditions of the theorem, gives us
1−(µ+ν)/2 (µ+ν)/2 1−(µ+ν)/2 (µ+ν)/2 1−(µ+ν)/2 (µ+ν)/2
R1 R2 ≥ S1 S2 + T1 T2 ,
ie the result also holds for s = (µ + ν)/2. Since it holds for s = 0, 1/2, 1 we
deduce by induction that it holds for all dyadic rationals in [0, 1], ie, for all s of
the form k/2n , k = 0, 1, ..., 2n , n = 0, 1, .... Now every real number in [0, 1] is a
limit of dyadic rationals and hence by continuity of the inequality, the theorem
has been proved.
Stochastic
9.3. A Processes
SELECTIONin Classical and Quantum
OF MATRIX Physics and Engineering
INEQUALITIES 137
145
An application example:
Remark: Let T be a Hermitian matrix of size n × n. Let M be an n − k + 1
dimensional subspace and let λ1 ≥ ... ≥ λn be the eigenvalues of T arranged in
decreasing order. Then,
and thus
infdimN =n−k+1 supx∈N < x, T x >= λk
Likewise, if M is a k dimensional subspace, then
and hence
supdimN =k infx∈N < x, T x >= λk
Thus,
λk (AB) = λk (BA) ≤ λk (C), k = 1, 2, ..., n
where
1/2 1/2
C = R1 R2
Then,
λk (A−B)↓ ≤ supx∈M < x, (A−B)x >≤ supx∈M < x, Ax > −infx∈M < x, Bx >
≤ λk (A)↓ − λk (B)↑
This is a stronger result than the previous one.
138 Stochastic Processes in Classical and Quantum Physics and Engineering
146CHAPTER 9. SOME MORE ASPECTS OF QUANTUM INFORMATION THEORY
≤� A⊗a k � . � B ⊗a k �
Now, in the same way, ≤� A⊗a k � equals Πkj=1 sj (A)↓ and likewise for B. Thus
we obtain the fundamental inequality
s(AB) ≤ s(A).s(B)
Now, �
T r(|A|1/p ) = sj (A)1/p
j
and likewise for B. Therefore, we have proved the Matrix Holder inequality
for any two matrices A, B such that AB exists and is a square matrix. In
fact, the method of our proof implies immediately that if ψ(.) is any unitarily
invariant matrix norm, then
149
141
142 Stochastic Processes in Classical and Quantum Physics and Engineering
150CHAPTER 10. LECTURE PLAN FOR STATISTICAL SIGNAL PROCESSING
models, simultaneous time and order recursive prediction of a time series using
the RLSLattice filter. (four lectures)
[8] The noncausal Wiener filter, causal Wiener filtering using Wiener’s spec
tral factorization method and using Kolmogorov’s innovations method, Kol
mogorov’s formula for the mean square infinite order prediction error. (four
lectures)
[9] An introduction to stochastic nonlinear filtering: The KushnerKallianpur
filter, the Extended Kalman filter approximation in continuous and discrete
time, the KalmanBucy filter for linear state variable stochastic models. (four
lectures)
[10] Power spectrum estimation using the periodogram and the windowed
periodogram, mean and variance of the periodogram estimate, high resolution
power spectrum estimation using eigensubspace methods like Pisarenko har
monic decomposition, MUSIC and ESPRIT algorithms. Estimation of the power
spectrum using time series models. (four lectures)
[11] Some new topics:
[a] Quantum detection and estimation theory, Quantum Shannon coding
theory. (four lectures)
[b] Quantum filtering theory (four lectures)
Total: Forty eight lectures.
Then, given X0 = x
MN (f ) = (πf )N (1)
where π is the transition probability operator kernel:
�
π(g)(x) = π(x, dy)g(y)
and �
πf (g)(x) = exp(f (x))π(x, dy)g(y)
ie
πf (x, dy) = exp(f (x))π(x, dy)
Stochastic Processes in Classical and Quantum Physics and Engineering 143
151
where λm (f ) is the maximum eigenvalue of the matrix exp(Df )π. Now let
�
I(µ) = supu>0 µ(x)log((πu)(x))/u(x))
x
where L
Q(θ, θm ) = ln(p(X, Z|θ)).p(Z|X, θm )
z
where L
144 Stochastic Processes in Classical and Quantum Physics and Engineering
Q(θ, θm ) = ln(p(X, Z|θ)).p(Z|X, θm )
z
where
γn,k (X, θm ) = E(z(n, k)|X, θm )
L L
= z(n, k)p(Z|X, θm ) = z(n, k)p(z(n, k)|X, θm )
Z z(n,k)
Derive a formula for p(z(n, k)|X, θm ). Derive the optimal equations for θm+1
by maximizing Q(θ, θm ).
P = E1 ∪ E2 ∪ ... ∪ EK
LK L
E= � Xn − µ(k) �2
10.1. ENDSEMESTER EXAM, DURATION:FOUR
k=1 n∈E
HOURS, PATTERN RECOGNITION
k
r(n, k) = 1, n ∈ Ek , r(n, k) = 0, n ∈
/ Ek
and expressing �
E= r(n, k) � Xn − µ(k) �2
n,k
Relate this clustering problem to the problem of calculating the µ(k)� s and the
π(k)� s in the Gaussian mixture model. Specifically, compare the equations for
calculating r(n, k), µ(k) in this clustering problem to the equations for calcu
lating the same in the GMM. Explain how the GMM model is a ”smoothened
version” of the clustering model by remarking that the event Xn ∈ Ek is to
be interpreted in the GMM context by saying that Xn has the N (x|µ(k), Σk )
�N
distribution and that n=1 r(n, k)/N = card(Ek )/N corresponds to the prob
ability π(k) of the k th component in the GMM occurring. Derive the explicit
value that r(n, k) corresponds to in the GMM by showing that the optimal π(k)
in the GMM satisfies
N
� π(k)N (Xn |µk , Σk )
π(k) = �K
n=1 m=1 π(m)N (Xn |µ(m), Σm )
�N �
Compare this with the quantity π(k) = n=1 (r(n, k)/N ), n,k r(n, k) = N in
the clustering problem.
[3] Let {Z(n) : n = 1, 2, ..., N } be a Markov chain in discrete state space
value that r(n, k) corresponds to in the GMM by showing that the optimal π(k)
in the GMM satisfies
N
�and Quantum
Stochastic Processes inπ(k)
Classical
= Physics
π(k)N (X n |µk ,and
Σk )Engineering 145
�K
n=1 m=1 π(m)N (Xn |µ(m), Σm )
�N �
Compare this with the quantity π(k) = n=1 (r(n, k)/N ), n,k r(n, k) = N in
the clustering problem.
[3] Let {Z(n) : n = 1, 2, ..., N } be a Markov chain in discrete state space
and let π(Z(n), Z(n + 1)|θ) be its one step transition probability distribution.
Let π0 (Z(0)) denote its initial distribution. Let X = (X(n) : n = 0, 1, ..., N ) be
the HMM emission random variables with the conditional probability density
p(X(n)|Z(n), θ) defined.
[a] Show that the EM algorithm for estimating θ recursively amounts to
maximizing Q(θ, θ0 ) where
Q(θ, θ0 ) =
N
�
E(ln(p(X(n)|Z(n), θ))|X, θ0 )
n=0
N
� −1
+ E(ln(π(Z(n), Z(n + 1)|θ0 ))|X, θ0 )
n=0
[b] Show that for evaluating the conditional expectation in [a], we require the
conditional probabilities p(Z(n)|X, θ0 ) and p(Z(n), Z(n+1)|X, θ0 ). Explain how
you would evaluate these conditional probabilities recursively by first proving
the relations
p(X|Z(n), θ0 ) = p(X(N ), X(N −1), ..., X(n)|Z(n), θ0 ).p(X(n−1),
154CHAPTER 10. LECTURE PLAN FOR STATISTICAL SIGNAL PROCESSING
X(n−2), ..., X(0)|Z(n), θ0 )
and
p(X|Z(n + 1), Z(n), θ0 ) =
p(X(N ), ..., X(n + 1)|Z(n + 1, θ0 ).p(X(n − 1), ..., X(0)|Z(n), θ0 )
4 [a] How would you approximate (based on matching only first and second
order statistics) an N dimensional random vector X by a p << N dimensional
random vector S using the linear model
ˆ + V, X
X ≈ WS + µ + V = X ˆ = WS + µ
where S, V are zero mean mutually uncorrelated random vectors with covari
ances ΣS , ΣV respectively and W is an N × p matrix to be determined. To
solve this problem, you must assume that E(X) is well approximated by µ and
Cov(X) is well approximated by WΣS WT + ΣV .
[b] Now consider the maximum likelihood version of the above PCA in which
the data is modeled by an iid sequence:
Xn = WSn + µ + Vn , n = 1, 2, ..., N
where the S�n s are iid N (0, ΣS ) random vectors, the Vn� s are iid N (0, Σv ) ran
dom vectors with the two sequences being mutually independent. Calculate
the equations that determine the optimal choice of W, µ that would maximize
p(Xn , n = 1, 2, ..., N |W, µ). Also by assuming the S�n s to be latent random
4 [a] How would you approximate (based on matching only first and second
order statistics) an N dimensional random vector X by a p << N dimensional
random vector S using the linear model
146 Stochastic Processes in Classical and Quantum Physics and Engineering
X ≈ WS + µ + V = X̂ + V, X̂ = WS + µ
where S, V are zero mean mutually uncorrelated random vectors with covari
ances ΣS , ΣV respectively and W is an N × p matrix to be determined. To
solve this problem, you must assume that E(X) is well approximated by µ and
Cov(X) is well approximated by WΣS WT + ΣV .
[b] Now consider the maximum likelihood version of the above PCA in which
the data is modeled by an iid sequence:
Xn = WSn + µ + Vn , n = 1, 2, ..., N
where the S�n s are iid N (0, ΣS ) random vectors, the Vn� s are iid N (0, Σv ) ran
dom vectors with the two sequences being mutually independent. Calculate
the equations that determine the optimal choice of W, µ that would maximize
p(Xn , n = 1, 2, ..., N |W, µ). Also by assuming the S�n s to be latent random
vectors, derive the EM algorithm recursive equations for estimating W, µ from
the Xn , n = 1, 2, ..., N .
Solution:
p({Xn }N N
n=1 , {Sn }n=1 |W, µ)
= p({Xn }N N
n=1 |{Sn }, W, µ)p({Sn }n=1 )
= ΠN
n=1 [fV (Xn − WSn − µ)fS (Sn )]
Q(θ, θ0 )
where
θ = (W, µ), θ0 = (W0 , µ0 )
and
N
�
N
Q(θ, θ0 ) = E[log(fV (Xn − WSn − µ))|{Xm }m=1 , W0 , µ0 )]
n=1
N
�
N
10.1. ENDSEMESTER log(fS (S
+ EXAM, n ))|{Xm }m=1 , W0 ,HOURS,
DURATION:FOUR µ0 ) PATTERN RECOGNITION
n=1
N
�
= E[log(fV (Xn − WSn − µ))|Xn , W0 , µ0 )]
n=1
N
�
+ log(fS (Sn ))|Xn , W0 , µ0 )
n=1
Note that only the first sum in this expression must be maximized since the
second one does not depend on W, µ. However, if ΣS , Σv also have to be
estimated then the parameter vector would be
θ = (W, µ, ΣS , Σv )
and then the second term would also count. Note further, that
p({Sn }N |{Xn }N , W, µ) =
Note that only the first sum in this expression must be maximized since the
second one does not depend on W, µ. However, if ΣS , Σv also have to be
estimated then the parameter vector would be
Stochastic Processes in Classical and Quantum Physics and Engineering 147
θ = (W, µ, ΣS , Σv )
and then the second term would also count. Note further, that
p({Sn }N N
n=1 |{Xn }n=1 , W, µ) =
[ΠN N
n=1 fV (Xn − WSn − µ)fS (Sn )]/Πn=1 fX (Xn |W, µ)
[d] Consider now the following data matrix version of PCA. X is a given
N × M real matrix. Show that it has a singular value decomposition (SVD)
X = UΣV
and show that for any α > 0 however small, β(N ) if η = η(N ) is obtained as
above for the given α, then β(N ) converges to zero at the rate of D(p0 |p1 ), ie,
and show that for any α > 0 however small, β(N ) if η = η(N ) is obtained as
above for the given α, then β(N ) converges to zero at the rate of D(p0 |p1 ), ie,
where �
D(p0 |p1 ) = p0 (X).log(p0 (X)/p1 (X)).dX
[6] This problem deals with nonparametric density estimation. Let p1 (X), p0 (X)
be two different probability densities of a random vector X ∈ Rn . To estimate
p1 at a given point X, we take samples from this distribution and draw a sphere
of minimum radius R1 with centre at X, for which the first sample point falls
within this sphere. Let V (R1 ) denote the volume of such a sphere. Then, esti
mate p1 (X) as 1/V (R1 ). Likewise do so for estimating p0 (X). Let the minimum
radius for this be R0 . Justify using the law of large numbers that such a non
parametric density estimate is reliable. Show that given a sample point X, in
order to test whether X came from p1 or from p0 , we use the ML criterion,
namely decide p1 if p̂1 (X) ≥ p̂0 (X) and decide p0 otherwise, where p̂ denotes
the estimate of p obtained using the above scheme. Show that this MLE method
is the same as the minimum distance method, in the sense that if the first sam
10.1. ENDSEMESTER
ple point that we decide EXAM,
p1 if R1DURATION:FOUR HOURS,
(X) < R0 (X) and decide p0 PATTERN RECOGNITION1
otherwise where
R1 (X) is the distance of the first point taken from the p1 sample from X and
likewise R0 (X) is the distance of the first point taken from the p0 sample from
X.
[7] Let a group G act on a set M and let f0 , f1 : M → R be two image fields
defined on the set M such that
where
f˜1 (g) = f1 (gx0 ), f˜0 (g) = f0 (gx0 )
Let π be a unitary representation of G in a the vector space Cp . Define the
Fourier transform of a function F on G evaluated at π by
�
F(F )(π) = F (g)π(g)dg
G
where dg is the left invariant Haar measure on G. Show that the above image
model on G gives on taking Fourier transforms
where dg is the left invariant Haar measure on G. Show that the above image
model on G gives on taking Fourier transforms
Hence show that the above model for the image Fourier transforms is equivalent
to
F(f˜1 )(π)PH = π(g0 )F(f˜0 )(π)PH + F(w)(π)PH
From this model, explain how by varying the representation π, we can estimate
the transformation g0 by minimizing
�
E(g0 )10. ˜1 )(πk )PH − πk (g0 )F(f˜0 )(πk )PH �2
W (k) � F(f
= LECTURE
158CHAPTER PLAN FOR STATISTICAL SIGNAL PROCESSING
k
Estimating the EEG parameters from the speech parameters using the EKF.
The EEG model is
XE (t + 1) = A(θ)XE (t) + WE (t + 1) ∈ RN
δX = x(k)Lk , dπ(δX) = x(k)dπ(Lk )
m=1 m=1
Estimating the EEG parameters from the speech parameters using the EKF.
The EEG model is
XE (t + 1) = A(θ)XE (t) + WE (t + 1) ∈ RN
where
p
�
A(θ) = A0 + θ(k)Ak
k=1
We assume that the speech and EEG parameters are correlated, ie, there exists
an affine relationship connecting the two of the form
rs = Cθ + d
The regression matrices C, d are assumed to be fixed, ie, they do not vary
from patient to patient. These matrices can be estimated using the following
10.1. ENDSEMESTER
procedure: Choose a bunchEXAM,
of KDURATION:FOUR
patients numbered HOURS,
1, 2, ..., KPATTERN RECOGNITION
having various
kinds of brain diseases. For each patient, collect his EEG and speech data and
apply the EKF separately to these data to using the above models and noisy
measurement models of the form
and then the optimal equations for the regression matrices obtained by setting
the gradient of E(C, d) to zero are given by
K
�
(rsk − Cθk − d)θkT = 0,
k=1
E(C, d) = � r̂sk − Cθ̂k − d �
k=1
and then the optimal equations for the regression matrices obtained by setting
the gradient of E(C, d) to zero are given by
K
�
(rsk − Cθk − d)θkT = 0,
k=1
K
�
(rsk − Cθk − d) = 0
k=1
Cµθ + d = µs
where
K
�
Rθ = K −1 . θk .θkT ∈ Rp×p
k=1
K
�
µθ = K −1 θk ∈ Rp×1
k=1
K
�
Rsθ = K −1 rsk θkT ∈∈ R(m+1)×(m+1) ,
k=1
K
�
−1
µs = K rsk ∈ R(m+1)×1
k=1
and
ξ(t) = [ξ1 (t), ..., ξm (t)]T
The speech model then becomes in state variable form
for the speech parameters in terms of the EEG parameters using our regres
sion scheme: First transform the speech model into a state variable model by
introducing state variables:
152 Stochastic Processes in Classical and Quantum Physics and Engineering
ξ1 (t) = XS (t − 1), ..., ξm (t) = XS (t − m)
and
ξ(t) = [ξ1 (t), ..., ξm (t)]T
The speech model then becomes in state variable form
where ⎛ ⎞
rs1 r2 rs3 ... rsm
⎜ 1 0 0 ... 0 ⎟
B(rs) = ⎜
⎝ ..
⎟
.. .. ... .. ⎠
0 0 0.. 1 0
We can write
m
�
B(rs) = rsk Bk
k=1
Now
rsk = (Cθ)k + dk , k = 1, 2, ..., m + 1
Writing
m
�
F1 (ξ, θ) = rsk Bk ξ + u.rsm+1
k=1
m
�
= ((Cθ)k + dk )Bk ξ + u(Cθ)m+1 + dm+1 )
k=1
we find that
�m
∂F1 /∂ξ = [ ((Cθ)k + dk )Bk ]
10.2. QUANTUM STEIN’S THEOREM k=1 161
m
� m
�
∂F1 /∂θ = [ Ck1 Bk ξ+Cm+1,1 u, Ck2 Bk ξ
k=1 k=1
m
�
+Cm+1,2 u, ..., Ckp Bk ξ+Cm+1,p u]
k=1
�m
Stochastic Processes in Classical and Quantum
+CPhysicsu,and
..., Engineering
C B ξ+C 153
m+1,2 kp k m+1,p u]
k=1
To prove this, we first use the monotonicity of the Renyi entropy to deduce that
for any detection operator Tn ,
≤ T r(ρ⊗ns) σ ⊗n(1−s) )
for any s ≤ 0. Therefore, if
T r(σ ⊗n Tn ) → 0
then
T r(σ ⊗n (1 − Tn )) → 1
and we get from the above on taking logarithms,
or equivalently writing
we get
α ≥ s−1 T r(ρs σ 1−s )∀s ≤ 0
Taking lims ↑ 0 in this expression gives us
162CHAPTER 10. LECTURE PLAN FOR STATISTICAL SIGNAL PROCESSING
α ≥ −D(σ|ρ) = −T r(σ(log(σ) − log(ρ)))
This proves the first part. For the second part, we choose Tn in accordance with
the optimal NeymanPearson test. Specifically, choose
Tn = {exp(nR)ρ⊗n ≥ σ ⊗n }
Assume that
p
�
A(z) = 1 + a[k]z −k = Πrm=1 (1 − zm z −1 )Πr+l
m=r+1 (1 − zm z
−1
)(1 − z̄m z −1 )
k=1
where p = r + 2l and z1 , ..., zr+l are all within the unit circle with z1 , ..., zr being
real and zr+1 , ..., zr+l being complex with nonzero imaginary parts. Assume
that the measured process is
where v[.] is white, independent of w[.] and with autocorrelation σv2 δ[n]. Assume
that
2
σw + σv2 A(z)A(z −1 ) = B(z)B(z −1 )
with
p
B(z) = KΠm=1 (1 − pm z −1 )
where p1 , ..., pp are complex numbers within the unit circle. Find the value of K
2
in terms of σw , σv2 , a[1], ..., a[p]. Determine the transfer function of the optimal
noncausal and causal Wiener filters for estimating the process x[.] based on y[.]
and cast the causal Wiener filter in time recursive form. Finally, determine the
mean square estimation errors for both the filters.
Given an AR(2)process
with w[.] and v[.] being white and mutually uncorrelated processes with vari
2
ances of σw and σv2 respectively. Cast this time series model in the above state
variable form with A being a 2×2 matrix, X[n] a 2×1 state vector, b a 2×1 vec
tor and C a 1 × 2 matrix. Write down explicitly the Kalman filter equations for
this model and prove that in the steady state, ie, as n → ∞, the Kalman filter
converges to the optimal causal Wiener filter. For proving this, you may assume
an appropriate factorization of σv2 A(z)A(z −1 ) + σw
2
into a minimum phase and
a nonminimum phase part where A(z) = 1 + a[1]z −1 + a[2]z −2 is assumed to
be minimum phase. Note that the steady state Kalman filter is obtained by
setting the state estimation error conditional covariance matrices for prediction
and filtering, namely P[n + 1|n], P[n|n] as well as the Kalman gain K[n] equal
to constant matrices and thus express the Kalman filter in steady state form as
ˆ
X̂[n + 1|n + 1] = DX[n|n] + f z[n + 1]
where D, f are functions of the steady state values of P[n|n] and P[n + 1|n]
which satisfy the steady state Riccati equation.
Which one has a larger trace ? Explain why this must be so.
[4] Let X1 , ..., XN be iid N (µ, R). Evaluate the Fisher information matrix
elements
∂2
−E[ ln(p(X1 , ..., XN |µ, R)]
∂µa ∂µb
∂2
−E[ ln(p(X1 , ..., XN |µ, R)]
∂µa ∂Rbc
∂2
−E[ ln(p(X1 , ..., XN |µ, R)]
∂Rab ∂Rcd
Taking the special case when the X�k s are 2dimensional random vectors, evalu
ˆ ab ), a, b = 1, 2
ate the CramerRao lower bound for V ar(µ̂a ), a = 1, 2 and V ar(R
ˆ
where µ̂ and R are respectively unbiased estimates of µ and R.
156 Stochastic Processes in Classical and Quantum Physics and Engineering
10.3. QUESTION PAPER, SHORT TEST, STATISTICAL SIGNAL PROCESSING, M.TE
[5] Let ρ(θ) denote a mixed quantum state in CN dependent upon a classical
parameter vector θ = (θ1 , ...θp )T . Let X, Y be two observables with eigende
compositions
N
� N
�
X= |ea > x(a) < ea |, Y = |fa > y(a) < fa |, < ea |eb >=< fa |fb >= δab
a=1 a=1
First measure X and note its outcome x(a). Then allowing for state collapse,
allow the state of the system to evolve from this collapsed state ρc to the new
state
m
� m
�
T (ρc ) = Ek ρc Ek∗ , Ek∗ Ek = I
k=1 k=1
Then measure Y in this evolved state and note the outcome y(b). Calculate the
joint probability distribution P (x(a), y(b)|θ). Explain by linearizing about some
θ0 , how you will calculate the maximum likelihood estimate of θ as a function
of the measurements x(a), y(b). Also explain how you will calculate the CRLB
for θ. Finally, explain how you will select the observables X, Y to be measured
in such a way that the CRLB is minimum when θ is a scalar parameter.
while for calculating order recursive updates, we require to use the projection
operator update formula in the form
Pspan(,ξ) = PW + PP⊥
Wξ
and
� �
c bT
b A
in terms of A−1 . Explain these statements in the light of the recurisive least
squares lattice algorithm for simultaneous time and order recursive updates of
the prediction filter parameters.
Stochastic Processes in Classical and Quantum Physics and Engineering 157
166CHAPTER 10. LECTURE PLAN FOR STATISTICAL SIGNAL PROCESSING
where Xk (t) acts in the atom Hilbert space. Note that writing
that
�
W (t) = exp(itH0 )V (t).exp(−itH0 ) = (Yk (t) ⊗ ak (t) + Yk (t)∗ ⊗ ak (t)∗ )
k
show that upto O(δ 2 ) terms, the total system ⊗ field evolution operator U (t)
can be expressed as
U (t) = U0 (t).S(t)
where
U0 (t) = exp(−itH0 ) = exp(−itHA ).exp(−itHF )
is the unperturbed evolution operator and
� t �
2
S(t) = 1 − iδ W (s)ds − δ W (s1 )W (s2 )ds1 ds2
0 0<s2 <s1 <t
158 Stochastic Processes in Classical and Quantum Physics and Engineering
10.4. QUESTION PAPER ON STATISTICAL SIGNAL PROCESSING, LONG TEST167
ρA (T ) = T r2 ρ(T )
Use the fact that ρ(0) = ρA (0) ⊗ |φ(u) >< φ(u)| and that
Remark: More generally, we can expand any general initial state of the
system (atom) and bath (field) as
�
ρ(0) = FA (u, ū) ⊗ |φ(u) >< φ(u)|dudū
�
= FA (u, ū)exp(−|u|2 ) ⊗ |e(u) >< e(u)|du.dū
In this case also, we can calculate assuming that ρ(0) is the initial state, final
state at time T using the formula
by evaluating this using the fact that if Y1 , Y2 are system operators, then
�
(Y1 ⊗ ak )( F (u, ū) ⊗ |φ(u) >< φ(u)|dudū)(Y2 ⊗ am )
�
= Y1 F (u, ū)Y2 exp(−|u|2 ) ⊗ ak |e(u) >< e(u)|am dudū − − − (1)
Stochastic Processes in Classical and Quantum Physics and Engineering 159
168CHAPTER 10. LECTURE PLAN FOR STATISTICAL SIGNAL PROCESSING
Now
ak |e(u) >= u(k)|e(u) >
a∗k |e(u) >= ∂|e(u) > /∂u(k)
so (1) evaluates to after using integration by parts,
�
Y1 F1 (u, ū)Y2 u(k)exp(−|u|2 ) ⊗ (∂/∂ ū(m))(|e(u) >< e(u)|)du.dū
�
= (−∂/∂ ū(m))(Y1 F (u, ū)Y2 ).exp(−|u|2 u(k))) ⊗ |e(u) >< e(u)|dudū
�
= (−∂/∂ ū(m) + u(m))(Y1 F (u, ū)Y2 u(k))|φ(u) >< φ(u)|du.dū
Likewise,
�
(Y1 ⊗ a∗k )( F (u, ū) ⊗ |φ(u) >< φ(u)|dudū)(Y2 ⊗ am )
�
= Y1 F (u, ū)Y2 ⊗ a∗k |φ(u) >< φ(u)|am dudū
�
= Y1 F (u, ū)Y2 exp(−|u|2 ) ⊗ (∂/∂u(k)).(∂/∂ ū(m))(|e(u) >< e(u)|)du.dū
�
= (∂ 2 /∂u(k)∂ū(m))(Y1 F (u, ū)Y2 .exp(−|u|2 )) ⊗ |e(u) >< e(u)|du.dū
�
= (exp(|u|2 )(∂ 2 /∂u(k)∂ū(m))(Y1 F (u, ū)Y2 .exp(−|u|2 )))⊗|φ(u) >< φ(u)|du.dū
�
= (∂/∂u(k) + ū(k))(∂/∂ ū(m) − u(m))(Y1 F (u, ū)Y2 ) ⊗ |φ(u) < φ(u)|dudū
and �
(Y1 ⊗ ak )( F (u, ū) ⊗ |φ(u) >< φ(u)|dudū)(Y2 ⊗ a∗m )
We leave it as an exercise to the reader to evaluate these terms and hence derive
upto the second order Dyson series approximation for the evolution operator
the approximate final state of the atom and field and then by partial tracing
over the field variables, to derive the approximate final atomic state under this
interaction of atom and field (ie, system and bath).
[2] If the state of the system is given by ρ = ρ0 + δ.ρ1 , then evaluate the
VonNeumann entropy of ρ as a power series in δ.
Hint:
160 Stochastic Processes in Classical and Quantum Physics and Engineering
10.4. QUESTION PAPER ON STATISTICAL SIGNAL PROCESSING, LONG TEST169
where the a[k]� s are chosen to minimize Ee[n]2 . The optimal normal equations
are
E(e[n]x[n − m]) = 0, m = 1, 2, ..., p
and these yield the YuleWalker equations
p
�
R[m − k]a[k] = −R[m], m = 1, 2, ..., p
k=1
a = −R−1
p rp
where
Rp = ((R[m − k]))1≤m,k≤p , rp = ((R[m]))pm=1
The minimum mean square prediction error is then
p
�
σ 2 = E(e[n]2 = E(e[n]x[n]) = R[0] + a[k]R[k] = R[0] + aT rp
k=1
If the order of the AR model p is large, then e[n] is nearly a white process, in
fact in the limit p → ∞, e[n] becomes white, ie, E(e[n]x[n − m]) becomes zero
for all m ≥ 1 and hence so does E(e[n]e[n − m]). The ideal known statistics AR
spectral estimate is given by
σ2
SAR,p (ω) = S(ω) = ,
|A(ω)|2
A(ω) = 1 + aT e(ω),
e(ω) = ((exp(−jωm)))pm=1
Since in practice, we do not have the exact correlations available with us, we
estimate the AR parameters using the least squares method, ie, by minimizing
N
�
(x[n] + aT xn )2
n=1
Stochastic Processes
170CHAPTER 10.in LECTURE
Classical and Quantum
PLAN FORPhysics and Engineering
STATISTICAL 161
SIGNAL PROCESSING
where
xn = ((x(n − m)))pm=1
The finite data estimate of Rp is
N
�
ˆ p = N −1
R xn nT
n=1
and that of rp is
N
�
−1
r̂p = N x[n]xn
n=1
We write R for Rp and R̂ = R+δR for R̂p . Likewise, we write r for rp and r +δr
for r̂p for notational convenience. Note that R̂ and r̂ are unbiased estimates of R
and r respectively and hence δR and δr have zero mean. Now the least squares
estimate â = a + δa of a is given by
σ̂ 2 = σ 2 + δσ 2 = R̂[0] + âT r̂
and hence
δσ 2 = δR[0] + aT δr + δaT r + δaT δr
= δR[0] + aT δr + (−R−1 δr − R−1 δRa + R−1 δR.R−1 δr + R−1 δR.R−1 δr)T δr
+(−R−1 δr − R−1 δRa)T δr
again upto quadratic orders in the data correlation estimate errors. In terms
of the fourth moments of the process x[n], we can now evaluate the bias and
mean square fluctuations of â and σ̂ 2 w.r.t their known statistics values a, σ 2
respectively. First we observe that the bias of â is
E(δai ) =
162 Stochastic Processes in Classical and Quantum Physics and Engineering
10.4. QUESTION PAPER ON STATISTICAL SIGNAL PROCESSING, LONG TEST171
N
�
= N −2 (E(x[n − k]x[n − j]x[l]x[l − m]) − R[j − k]R[m])
n,l=1
upto fourth degree terms in the data. Note that here we have not yet made any
assumption that x[n] is a Gaussian process, although we are assuming that the
process is stationary at least upto fourth order. Writing
we get
E(δRjk .δrm ) =
N
�
N −2 C(k − j, l − n + k, l − n + k − m)
n,l=1
VN = [vp+1 , ..., vN ]
so that
V = [VS |VN ]
is an N × N unitary matrix. The signal correlation matrix model is
R = EDE ∗ + σ 2 I
Stochastic Processes in Classical and Quantum Physics and Engineering 163
172CHAPTER 10. LECTURE PLAN FOR STATISTICAL SIGNAL PROCESSING
is the steering vector along the frequency ω. Note that since N > p, E has full
column rank p. The signal eigenvalues of R are
λi = µi + σ 2 , i = 1, 2, ..., p
λi = σ 2 , i = p + 1, ..., N
Thus,
Rvi = λi vi , i = 1, 2, ..., N
The known statistics exact MUSIC spectral estimator is
which is infinite when and only when ω ∈ {ω1 , ..., ωp }. Alternately, the zeros of
the function
Q(ω) = 1/P (ω) = 1 − N −1 � VS∗ e(ω) �2
are precisely {ω1 , ..., ωp } because, the VN∗ e(ω) = 0 iff e(ω) belongs to the signal
subspace span{e(ωi ) : 1 ≤ i ≤ p} iff {e(ω), e(ωi ), i = 1, 2, ..., p} is a linearly
dependent set iff ω ∈ {ω1 , ..., ωp }, the last statement being a consequence of the
fact that N ≥ p + 1 and the VanDerMonde determinant theorem. Now, owing
to finite data effects, vi gets perturbed to vi + δvi for each i = 1, 2, ..., p so that
correspondingly, VS gets perturbed to VˆS = VS + δVS . Then Q gets perturbed
to (upto first orders of smallness, ie, upto O(δR)
so that
δQ(ω) = (−2/N )Re(e(ω)∗ δVS VS∗ e(ω))
Let ω be one of the zeros of Q(ω), ie, one of the frequencies of the signal. Then,
owing to finite data effects, it gets perturbed to ω + δω where
Q� (ω)δω + δQ(ω) = 0
ie,
δω = −δQ(ω)/Q� (ω)
and
Q� (ω) = (−2/N )Re(e(ω)∗ PS e� (ω)) = (−2/N )Re < e(ω)|PS |e� (ω) >
16410.4. QUESTION
Stochastic
PAPERProcesses in Classical andSIGNAL
ON STATISTICAL QuantumPROCESSING,
Physics and Engineering
LONG TEST173
where
PS = VS VS∗
is the orthogonal projection onto the signal subspace. Note that by first order
perturbation theory
and
δλi =< vi |δR|vi >
So �
δVS = column( |vj >< vj |δR|vi > /(λi − λj ) : i = 1, 2, ..., p]
j=i
and hence
δQ(ω) = (−2/N )Re(< e(ω)|δVS VS∗ |e(ω)) >)
where as before PS = VS VS∗ is the orthogonal projection onto the signal subspace
and
S = [Q1 , Q2 , ..., Qp ]
R = EDE ∗ + σ 2 I, R1 = EDΦ∗ E ∗ + σ 2 Z
R̂S = RS + δRS = R̂ − σ̂ 2 I = R − σ 2 I + δR − δσ 2
Thus,
δRS = δR − δσ 2 I
Likewise, the estimate of RS1 = EDΦ∗ E ∗ is
so that
δRS1 = δR1 − δσ 2 Z
The rank reducing numbers of the matrix pencil
are clearly the diagonal entries of Φ. Since R(E) = R(VS ), it follows that these
rank reducing numbers are the zeroes of the polynomial equation
or equivalently of
det(M − γVS∗ RS1 VS ) = 0
where
M = VS∗ RS VS = diag[µ1 , ..., µp ]
with
µk = λk − σ 2 , kj = 1, 2, ..., p
being the nonzero eigenvalues of RS . When finite data effects are taken into
consideration, these become the rank reducing numbers of
or equivalently, upto first order perturbation theory, the rank reducing number
γ gets perturbed to γ̂ = γ + δγ, where
where
M − γX + δM − γδX − δγX
or equivalently,
< η|δM − γδX�ξ >
δγ =
< η|X|ξ >
which is the desired formula.
naturally generates a family of pdf’s. Further, based on the assumption that the
speech pdf and the EEG pdf for any patient are correlated in the same way (a
particular kind of brain disease causes a certain distortion in the EEG signal and
a correlated distortion in the speech process with the correlation being patient
independent), we include a potential term in Schrodinger’s equation involving
speech pdf and hence while tracking a given EEG pdf owing to the adaptation
algorithm of the QNN, the QNN also ensures that the output pdf coming from
Schrodinger’s equation will also maintain a certain degree of correlation with
the speech pdf. The basic idea behind the quantum neural network is to mod
ulate the potential by a ”weight process” with the weight increasing whenever
the desired pdf exceeds the pdf generated by Schrodinger’s equation using the
modulus of the wave function squared. A hieuristic proof of why the pdf gener
ated by the Schrodinger equation should increase when the weight process and
hence th modulated potential increases is provided at the end of the paper.
First consider the following model for estimating the pdf p(t, x) of one signal
via Schrodinger’s wave equation which naturally generates a continuous family
of pdf’s via the modulus square of the wave function:
The weight process W (t, x) is generated in such a way that |ψ(t, x)|2 tracks the
given pdf p(t, x). This adaptation scheme is
where α, β are positive constants. This latter equation guarantees that if W (t, x)
is very large and positive (negative), then the term −βW (t, x) is very large and
negative (positive) causing W to decrease (increase). In other words, −β.W
is a threshold term that prevents the weight process from getting too large in
magnitude which could cause the breakdown of the quantum neural network.
We are assuming that V (x) is a nonnegative potential. Thus, increase in W
will cause an increase in W V which as we shall see by an argument given below,
will cause a corresponding increase in |ψ|2 and likewise a decrease in W will
cause a corresponding decrease in |ψ|2 . Now we add another potential term
p0 (t, x)V0 (x) to the above Schrodinger dynamics where p0 is the pdf of the
input signal. This term is added because we have the understanding that the
output pdf p is in some way correlated with the input pdf. Then the dynamics
becomes
ih∂t ψ(t, x) = (−h2 /2)∂x2 ψ(t, x) + W (t, x)V (x)ψ(t, x) + p0 (t, x)V0 (x)ψ(t, x)
fluctuate to W (t, x) + δW (t, x). The linearized dynamics for these fluctuations
are
δψ " = (δA" + iA" δS/h + iAδS " /h + iS " δA/h − AS " δS/h2 )exp(iS/h)
δψ "" = (δA"" +iA"" δS/h+2iA" δS " /h+iAδS "" /h+2iS " δA" /h+iS "" δA/h−AS " δS " /h2
d d
−A" S " δS/h2 − AS "" δS/h2 − AS " δS " /h2 − S 2 δA/h2 − iAS 2 δS/h3 ).exp(iS/h)
Thus we obtain the following differential equation for the wave function fluctu
ations caused by random fluctuations in the EEG and speech pdf’s:
= (−h2 /2)δA"" − ihA"" δS/2 − ihA" δS " − ihA.δS "" /2 − ihS " δA"
d d
−ihS "" δA/2+AS " δS " /2+A" S " δS/2+AS "" δS/2+AS " δS " /2+S 2 δA/2+iAS 2 δS/2h
+V0 (x)p0 (t, x)(δA + iAδS/h) + V0 (x)Aδp0 (t, x)
+V (x)W (t, x)(δA + iAδS/h) + V (x)AδW )
On equating real and imaginary parts in this equation, we get two pde’s for the
two functions A and S. However, it is easier to derive these equations by first
deriving the unperturbed pde’s for A, S and then forming the perturbations.
The unperturbed equations are (See Appendix A.1)
Appendix
A.1
Proof that increase in potential causes increase in probability density deter
mined as modulus square of the wave function.
Consider Schrodinger’s equation in the form
We write
ψ(t, x) = A(t, x).exp(iS(t, x)/h)
where A, S are real functions with A nonnegative. Thus, |ψ|2 = A2 . Substitut
ing this expression into Schrodinger’s equation and equating real and imaginary
parts gives
h∂t A = −h∂x A.∂x S − (1/2)A∂x2 S)
−A∂t S = −((h2 /2)∂x2 A − (1/2)A(∂x S)2 ) + V A
We write
S(t, x) = −Et + S0 (x)
so that E is identified with the total energy of the quantum particle. Then, the
second equation gives on neglecting O(h2 ) terms,
�
(∂x S0 ) = ± 2(E − V (x))
and then substituting this into the first equation with ∂t A = 0 gives
or
ln(S0� ) = −2h.ln(A) + C
so
A = K1 (S0� )−1/2 = K2 (E − V (x))−1/4
which shows that as long as V (x) < E, A(x) and hence A(x)2 = |ψ(x)|2 will
increase with increasing V (x) confirming the fact that by causing the driving
potential to increase when pE − |ψ|2 is large positive, we can ensure that |ψ|2
will increase and viceversa thereby confirming the a validity of our adaptive
algorithm for tracking the EEG pdf.
A.2 Calculating the empirical distribution of a stationary stochastic process.
Let x(t) be a stationary stochastic process. The empirical density estimate
of N samples of this process spaced τk , k = 1, 2, ..., n − 1 seconds apart from the
first sample, ie, an estimate of the joint density of x(t), x(t+τk ), k = 1, 2, ..., N −1
is given bylog(
� T
−1
p̂T (x1 , ..., xN ; τ1 , ..., τN −1 ) = T ΠN
k=1 δ(x(t + τk ) − xk )dt
0
170 Stochastic Processes in Classical and Quantum Physics and Engineering
10.5. QUANTUM NEURAL NETWORKS FOR ESTIMATING EEG PDF FROM SPEECH
� T
= E[exp(T −1 f (x(t), x(t + τ1 ), ..., x(t + τN −1 ))dt)]
0
= MT (f |τ1 , ..., τN −1 )
say. Then, the rate functional for the family of random fields p̂T (., τ1 , ...τN −1 ), T →
∞ is given by
�
I(p) = supf ( f (x1 , ..., xN )p(x1 , ..., xN )dx1 ...dxN − Λ(f ))
where
Λ(f ) = limT →∞ T −1 log(MT (T.f ))
assuming that this limit exists. It follows that as T → ∞, the asymptotic
probability of p̂T taking values in a set E is given by
or equivalently,
Appendix A.3
We can also use the electroweak theory or more specially, the DiracYang
Mills nonAbelian gauge theory for matter and gauge fields to track a given pdf.
Such an algorithm enables us to make use of elementary particles in a particle
accelerator to simulate probability densities for learning about the statistics of
signals.
Let Aaµ (x)τa be a nonAbelian gauge field assuming values in a Lie algebra
g which is a Lie subalgebra of u(N ). Here, τa , a = 1, 2, ..., N are Hermitian
generators of g. Denote their structure constants by {C(abc)}, ie,
[τa , τb ] = iC(abc)τc
�µ = ∂µ + ieAaµ τa
Stochastic Processes in Classical and Quantum Physics and Engineering 171
180CHAPTER 10. LECTURE PLAN FOR STATISTICAL SIGNAL PROCESSING
and the corresponding YangMills antisymmetric gauge field tensor, also called
the curvature of the gauge covariant derivative is given by
ieFµν = [�mu , �ν ] =
a
= ie(Aν,µ − Aaµ,ν ) − eC(abc)Abµ Acν )τa
so that
a
Fµν = Fµν τa
where
a a
Fµν = Aν,µ − Aaµ,ν ) − eC(abc)Abµ Acν
The action functional for the matter field ψ(x) having 4N components and the
gauge potentials Aaµ (x) is then given by (S.Weinberg, The quantum theory of
fields, vol.II) �
S[A, ψ] = Ld4 x,
where
ψ¯ = ψ ∗ γ 0
The equations of motion obtained by setting the variation of S w.r.t the matter
wave function ψ and the gauge potentials A are given by
Dν F µν = J µ
where Dµ is the gauge covariant derivative in the adjoint representation, ie,
[∂ν + ieAν , F µν ] = J µ
The problem with this formalism is that a photon Lagrangian term (−1/4)(Bν,µ −
Bµ,ν )(B ν,µ − B µ,ν ) also has to be added to the total Lagrangian which results
in the Maxwell field equation
¯ µψ
F,νphµν = −eψγ
where
ph
Fµν = Bν,µ − Bµ,ν
is the electromagnetic field. In order not to violate these Maxwell equations, we
must add another external charge source term to the rhs so that the Maxwell
equation become
¯ µ ψ + ρ(x)δ µ
F,νphµν = −eψγ 0
so that
B0 = W (p − |ψ|2 )V
The change in the Lagrangian produced by this source term is −ρ(x)B0 (x).
This control charge term can be adjusted for appropriate adaptation. Thus in
effect our Lagrangian acquires two new terms, the first is
¯ 0 B0 ψ = eψ ∗ ψ.B0
eψγ
(�2 B0 )(W γ 0 ψ)
The resulting equations of motion for the wave function get modified to
[1] [a] Prove that the quantum relative entropy is convex, ie, if (ρi , σi ), i = 1, 2
are pairs of mixed states in Cd , then
for p1 , p2 ≥ 0, p1 + p2 = 1.
hint: First prove using the following steps that if K is a quantum operation,
ie, TPCP map, then for two states ρ, σ, we have
Then define � �
p1 ρ1 0
ρ= ,
0 p2 ρ2
� �
p1 σ1 0
σ= ,
0 p2 σ 2
and K to be the operation given by
where
E1 = [I|0], E2 = [0|I] ∈ Cd×2d
Note that
E1∗ E1 + E2∗ E2 = I2d
To prove (a), make use of the result on the asymptotic error rate in hypothesis
testing on tensor product states: Given that the first kind of error, ie, probability
of detecting ρ⊗n given that the true state is σ ⊗n goes to zero, the minimum
(NeymanPearson test) probability of error of the second kind goes to zero at the
rate of −D(ρ|σ) (this result is known as quantum Stein’s lemma). Now consider
any sequence of tests Tn , ie 0 ≤ Tn ≤ I on (Cd )⊗n . Then consider using this
test for testing between the pair of states K ⊗n (ρ⊗n ) and K ⊗n (σ ⊗n ). Observe
that the optimum performance for this sequential hypothesis testing problem is
the same as that of the dual problem of testing ρ⊗n versus σ ⊗n using the dual
test K ∗⊗n (Tn ). This is because
D(ρ|σ) ≥ D(K(ρ)|K(σ))
[b] Prove using [a] that the VonNeumann entropy H(ρ) = −T r(ρ.log(ρ)) is
a concave function, ie if ρ, σ are two states, then
and
σ = I2d /2d
where ρk ∈ C d×d
, k = 1, 2. Then choose
and
K(X) = E1 XE1∗ + E2 XE2∗ ∈ Cd×d , X ∈ C2d×2d
Note that
E1∗ E1 + E2∗ E2 = Id
Show that
K(σ) = K(I2d /2d) = Id /d
Then we get using [a],
D(K(ρ)|K(σ)) ≤ D(ρ|sigma)
where
H(p) = −p1 log(p1 ) − p2 log(p2 )
Show that
H(p) ≤ log(2)
and hence deduce the result.
[2] [a] Let X[n], n ≥ 0 be a M vector valued AR process, ie, it satisfies the
first order vector difference equation
R[0] = E(X[0]X[0]T )
and then evaluate δA upto quadratic orders in δR(j, i). Finally, use the fact
that since X[n], n ≥ 0 is a Gaussian process,
[c] Calculate the Fisher information matrix for the parameters {Aij } and
hence the corresponding CramerRao lower bound.
[3] Let X[n] be a stationary M vector valued zero mean Gaussian process
with ACF R(τ ) = E(X[n]X[n−τ ]T ) ∈ RM ×M . Define its power spectral density
matrix by
�∞
S(ω) = R(τ )exp(−jωτ ) ∈ CM ×M
τ =−∞
Prove that S(ω) is a positive definite matrix. Define its windowed periodogram
estimate by
SˆN (ω) = X
ˆ N (ω)(X
ˆ N (ω))∗
17610.6. STATISTICAL
Stochastic SIGNAL
ProcessesPROCESSING,
in Classical and M.TECH,
Quantum Physics
LONGand Engineering
TESTMAX MARKS:501
where
N
� −1
ˆ N (ω) = N −1/2
X W(n)X(n)exp(−jωn)
n=0
and discuss the limits of these as N → ∞. Express these means and covariances
in terms of
∞
�
Ŵ (ω) = W (τ )exp(−jωτ )
τ =−∞
[4] This problem deals with causal Wiener filtering of vector valued sta
tionary processes. Let X(n), S(n), n ∈ Z be two jointly WSS processes with
correlations
Let �
SXX (z) = RXX (τ )z −τ
τ
Prove that
RXX (−τ ) = RXX (τ )T
and hence
SXX (z −1 ) = SXX (z)T
Assume that this power spectral density matrix can be factorized as
with �
� l[n] �< ∞,
n≥0
�
� g[n] �< ∞
n≥0
Then suppose �
H(z) = h[n]z −n
n≥0
Stochastic Processes in Classical and Quantum Physics and Engineering 177
186CHAPTER 10. LECTURE PLAN FOR STATISTICAL SIGNAL PROCESSING
is the optimum causal matrix Wiener filter that takes X[.] as input and outputs
ˆ of S[.] in the sense that
an estimate S[.]
�
Ŝ[n] = h[k]X[n − k]
k≥0
is a minimum, then prove that h[.] satisfies the matrix WienerHopf equation
�
h[k]RXX [m − k] = RSX [m], m ≥ 0
k≥0
Prove that this is a valid solution for a stable Causal matrix Wiener filter, it is
sufficient that �
� RSX [n] �< ∞
n≥0
ie, the map A → A−1 is operator convex on the space of positive matrices.
Solution: Proving (1) is equivalent to proving that
where
X = A−1/2 BA−1/2
It is enough to prove (2) for any positive matrix X. By the spectral theorem
for positive matrices, to prove (2), it is enough to prove
for all positive real numbers x. Proving (3) is equivalent to proving that
(t + (1 − t)x)(tx + 1 − t) ≥ x
or equivalently, adding and subtracting 2t(1 − t)x on both sides to complete the
square,
−2t(1 − t)x + t(1 − t)x2 + t(1 − t) ≥ 0
or equivalently,
1 + x2 − 2x ≥ 0
which is true.
Problem: For p ∈ (0, 1), show that x → −xp , x → xp−1 and x → xp+1 are
operator convex on the space of positive matrices.
Solution: Consider the integral
� ∞
I(a, x) = ua−1 du/(u + x)
0
It is clear that this integral is convergent when x ≥ 0 and 0 < a < 1. Now
� ∞
−1
I(a, x) = x ua−1 (1 + u/x)−1 du
0
Stochastic Processes in Classical and Quantum Physics and Engineering 179
188CHAPTER 10. LECTURE PLAN FOR STATISTICAL SIGNAL PROCESSING
� ∞
= xa−1 y a−1 (1 + y)−1 dy
0
on making the change of variables u = xy. Again make the change of variables
Then
� 1 � 1
I(a, x) = xa−1 z a−1 (1−z)−a+1 (1−z)(1−z)−2 dz = xa−1 z a−1 (1−z)−a dz
0 0
and since for u > 0, as we just saw, x → (u + x)−1 is operator convex, the proof
that f (x) = xa−1 is operator convex when a ∈ (0, 1). Now, Now,
Our aim is to show that for s ∈ (0, 1) and a TPCP map K, we have for two
states A, B,
Ds (K(A)|K(B)) ≥ Ds (A|B)
180 Stochastic Processes in Classical and Quantum Physics and Engineering
10.7. MONOTONOCITY OF QUANTUM RELATIVE RENYI ENTROPY189
Ds (K(A)|K(B)) ≤ Ds (A|B)
Let s ∈ (−1, 0). Then the above formula shows that x1−s is operator convex.
Hence, if A, B are any two density matrices and K a TPCP map, then
K(A)1−s ≤ K(A1−s )
whence
B s/2 K(A)1−s B s/2 ≤ B s/2 K(A1−s )B s/2
for any positive B and taking taking trace gives
T r(K(A)1−s B s ) ≤ T r(K(A1−s )B s )
K(B)s ≤ K(B s )
and so
A1/2 K(B)s A1/2 ≤ A1/2 K(B s )A1/2
for any positive A and replacing A by K(A1−s ) and then taking trace gives
and hence if I denotes the identity matrix of the same size as A, B, then
Note that if {ek : k = 1, 2, ..., n} is an onb for Cn , the Hilbert space on which
A, B act, then
� �
I= |ek >< ek |, V ec(I) = |ek ⊗ ek >= |ξ >
k k
�
V ec(I).V ec(I)∗ = |ξ >< ξ| = Eij ⊗ Eij
ij
where
Eij = |ei >< ej |
Thus, for s ∈ (0, 1),
�
T r(K(A)1−s K(B)s ) ≥ T r(Kij A1−s Kji B s )
i,j
where
Kij = K(Eij )
Likewise, if s ∈ (0, 1) so that s − 1 ∈ (−1, 0), 2 − s ∈ (1, 2), then both the maps
A → As−1 and A → A2−s are operator convex and hence for any TPCP map
K , K(A)s−1 ≤ K(A)s−1 , K(A)2−s ≤ K(A2−s ), from which we immediately
deduce that
or equivalently,
Remark: Lieb’s inequality states that for s ∈ (0, 1), the map
K(A) = (A + U AU ∗ )/2
which proves the monotonicity of the Renyi entropy when t ∈ (0, 1) under
pinching operations K of the above kind. Now, this inequality will also hold for
any pinching K since any pinching can be expressed as a composition of such
elementary pinchings of the above kind. We now also wish to show the reverse
inequality when t ∈ (−1, 0), ie show that if t ∈ (0, 1), then
My commentary
My commentary on the on
TheGita
Ashtavakra Ashtavakra Gita
193
183
184 Stochastic Processes in Classical and Quantum Physics and Engineering
194 CHAPTER 11. MY COMMENTARY ON THE ASHTAVAKRA GITA
soul is indestructible and imperishable as even the Bhagavad Gita states. The
Bhagavad Gita however distinguishes between two kinds of souls, the jivatma
and the paramatma, the former belonging to the living entities in our universe
and the latter to the supreme personality of Godhead. The Bhagavad Gita
tells us that within each individual entity, both the jivatma and the paramatma
reside and when the individual becomes arrogant, he believes that his jivatma is
superior to the paramatma and this then results in self conflict owing to which
the individual suffers. By serving the paramatma with devotion as outlined in
the chapter on Bhakti yoga, the jivatma is able to befriend the paramatma and
then the individual lives in peace and tranquility. To explain selfknowledge
more clearlh without bringing in the notion of a God, Ashtavakra tells Raja
Janaka that there are two persons who are doing exactly the same kind of work
every day both in their offices as well as in their family home. However, one
of them is very happy and tranquil while the other is depressed and is always
suffering. What is the reason for this ? The reason, Ashtavakra says is that the
second person believes that his self is doing these works and consequently the
reactions generated by his works affect him and make him either very happy
or very sad. He is confused and does not know how to escape from this knot
of physical experiences. The first person, on the other hand, knows very well
that he is compelled to act in the way he is acting owing to effects of his past
life’s karma’s and that in fact his soul is pure and has nothing at all to do with
the results and effects of his actions. He therefore neither rejoices nor does he
get depressed. The physical experiences that he goes through do not leave any
imprints on his self because he knows that the physical experiences are illusory
and that his soul which pure and without any blemish fills the entire universe
and is the truth. Ashtavarka further clarifies his point by saying that the person
in ignorance will practice meditation and try to cultivate detachment from the
objects of this world thinking that this will give him peace. Momentarily such
a person will get joy but when he comes of the meditation, immediately all
the desires of this illusory physical world will come back to him causing even
more misery. He will try to run to a secluded place like a forest and meditate
to forget about his experiences, but again when his meditation is over, misery
will come back to him. However, once he realizes that he is actually not his
body but his soul and that the soul is free of any physical quality and that
it is the immeasurable truth, then he will neither try forcefully to cultivate
detachment nor will he try forcefully to acquire possessions. Both the extremes
of attachment and detachment will not affect him. In short, he attains peace
even living in this world immediately at this moment when true knowledge
comes to him. According to Ashtavkara, the seeker of attachement as well as
the seeker of detachment are both unhappy because they have not realized that
the mind must not be forced to perform an activity. Knowledge is already
present within the individual, he just has to remain in his worldly position as
it is, performing his duties with the full knowledge that this compulsion is due
to his past karmas and then simultaneously be conscious of the fact that his
soul is the perfect Brahman and it pervades the entire universe. At one point,
Ashtavakra tells Janaka that let even Hari (Vishnu) or Hara (Shiva) be your
Stochastic Processes in Classical and Quantum Physics and Engineering 185
195
guru, even then you cannot be happy unless you have realized at this moment
the falsity of the physical world around you and the reality of your soul as
that which pervades the entire universe. Such a person who has understood
this, will according to Ashtavakra, be unmoved if either if death in some form
approaches him or if temptations of various kinds appear before him. He will
remain neutral to both of these extremes. Such a realized soul has eliminated all
the thought processes that occur within his mind. When true knowledge dawns
upon a person, his mind will become empty and like a dry leaf in a strong wind,
he will appear to be blown from one place to other doing different kinds of
karma but always with the full understanding that these karmas have nothing
at all to do with his soul which is ever at bliss, imperturbable, imperishable
and omnisicient. In the quantum theory of matter and fields, we learn about
how the observer perturbs a system in order to measure it and gain knowledge
about the system. When the system is thus perturbed, one measures not the
true system but only the perturbed system. This is the celebrated Heisenberg
uncertainty principle. Likewise, in Ashtavakra’s philosophy, if we attempt to
gain true knowledge about the universe and its living entities from our sensory
experience, we are in effect perturbing the universe and hence we would gain
only perturbed knowledge, not true knowledge. True knowledge comes only
with the instant realization that we are not these bodies, we are all permanent
souls. Even from a scientific standpoint, as we grow older, all our body cells
eventually get replaced by new ones, then what it is that in our new body after
ten years that we are able to retain our identity ? What scientific theory of
the DNA or RNA can explain this fact ? What is the difference between the
chemical composition of our body just before it dies and just after it dies ?
Absolutely no difference. Then what is it that disinguishes a dead body from
a living body ? That something which distinguishes the two must be the soul
which is not composed of any chemical. It must be an energy, the life force
which our modern science is incapable of explaining.
Chapter 12 Chapter 12
LDP-UKF
LDPUKF Basedcontroller
based Controller
State equations:
x(n + 1) = f (x(n), u(n)) + w(n + 1)
Measurement equations:
w, v are independent white Gaussian processes with mean zero and covariances
Q, R respectively. xd (n) is the desired process to be tracked and it satisfies
Where
K �
� �
−1
ΣXY = K [ P (n + 1|n)ξ(k)(h(x̂(n+1|n)+ P (n + 1|n)η(k))−ẑ(n+1|n))T ],
k=1
K
� �
ẑ(n + 1|n) = K −1 h(x̂(n + 1|n) + P (n + 1|n)η(k), u(n + 1))
k=1
K
� �
ΣY Y = K −1 . (h(x̂(n + 1|n) + P (n + 1|n)η(k))(, , )T
k=1
197
187
188 Stochastic Processes in Classical and Quantum Physics and Engineering
198 CHAPTER 12. LDPUKF BASED CONTROLLER
K
� �
P (n + 1|n) = K −1 [(f (x̂(n|n) + P (n|n)ξ(k), u(n)) − x̂(n + 1|n))(, , )T ] + Q
k=1
where
ΣXX = P (n + 1|n)
Tracking error feedback controller: Let
f (n) = xd (n) − x̂(n|n), f (n + 1|n) = xd (n + 1) − x̂(n + 1|n)
Controlled dynamics:
x(n + 1) = f (x(n), u(n)) + w(n + 1) + Kc f (n)
is minimized where
e(n) = x(n) − x̂(n), e(n + 1|n) = x(n + 1) − x̂(n + 1|n), x̂(n) = x̂(n|n)
� �
= f � (x̂(n), u(n))f (n) − f � (x̂(n), u(n)) P (n|n)K −1 ξ(k) − − − (2)
k
Finally,
f (n + 1) = xd (n + 1) − x̂(n + 1|n + 1) =
f (n + 1|n) − K(n)(z(n + 1) − ẑ(n + 1|n))
= f (n+1|n)−K(n)(h(x(n+1), u(n+1))+
� �
v(n+1)−K −1 h(x̂(n+1|n)+ P (n + 1|n)η(k), u(n+1))
k
will be only of the second order of smallness. Doing so, we can express (4) and
(5) in the form after equilibrium has been obtained as
It is a zero mean vector valued white Gaussian process and its covariance matrix
is block diagonal:
E(W (n)W (m)T ) = RW (n)δ(n − m)
where
RW (n) = E(W (n)W (n)T ) = diag[Q, R, P [n|n)/K, P (n + 1|n)/K]
Let P denote the limiting equilibrium value of P [n|n] as n → ∞. The limiting
equilibrium value of P [n + 1|n] is then given approximately by
P+ = f � (x̂, u)P f � (x̂, u)T + Q
where x̂ may be replaced by the limiting value of xd , assuming it to be asymp
totically d.c and f � (x, u) is the Jacobian matrix of f (x, u) w.r.t x. We write
f � (xd , u) = F and then
P+ = F P F T + Q
If we desire to use a more accurate value of P+ , then we must replace it by
can be expressed as
= Rχ [n, m|Kc ]
say. The LDP rate function of the process χ[.] over the time interval [0, N ] is
then given by
N
�
I[χ[n] : 0 ≤ n ≤ N ] = (1/2) χ[n]T Qχ [n, m|Kc ]χ[m]
n,m=0
where
Qχ = ((Rχ [n, m|Kc ]))−1
0≤n,m≤N
�N
According to the LDP for Gaussian processes, the probability that √ n=0 �
χ[n] �2 will exceed a threshold δ after we scale χ by a small number �| is given
approximately by
N
�
exp(−�−1 min(I(χ) : � χ[n] �2 > δ))
n=0
Chapter 1
An introduction to quantum electrodynamics
[1] The Maxwell and Dirac equations.
[2] Quantization of the electromagnetic field using Bosonic creation and an
nihilation operators.
[3] Second Quantization of the Dirac field using Fermionic creation and an
nihilation operators.
[4] The interaction term between the Dirac field and the Maxwell field.
[5] Propagators for the photons and electronpositron fields.
[7] The Dyson series and its application to calculation of amplitudes for
absorption, emission and scattering processes between electrons and positrons.
[8] Representing the terms in the Dyson series for interactions between pho
tons, electrons and positrons using Feynman diagrams.
[9] The role of propagators in Feynman diagrammatic calculations.
[10] Quantum electrodynamics using the Feynman path integral method.
[11] Calculating the propagator of fields using Feynman path integrals.
[12] Electrons self energy, vacuum polarization, Compton scattering and
anomalous magnetic moment of the electron.
Chapter 2
The design of quantum gates using quantum field theory
[1] Design of quantum gates using first quantized quantum mechanics by
controlled modulation of potentials.
[2] Design of quantum gates using harmonic oscillators with charges per
turbed by and electric field.
203
193
194 Stochastic Processes in Classical and Quantum Physics and Engineering
204CHAPTER 13. ABSTRACTS FOR A BOOK ON QUANTUM SIGNAL PROCESSING
Chapter 3
The quantum Boltzmann equation with applications to the design of quan
tum noisy gates, ie, TPCP channels.
[1] The classical Boltzmann equation for a plasma of charged particles inter
acting with an external electromagnetic field.
[2] Calculating the conductivity and permeability of a plasma interacting
with an electromagnetic field using first order perturbation theory.
[3] The quantum Boltzmann equation for the one particle density opera
tor for a system of N identically charged particles interacting with a classical
electromagnetic field. The effect of quantum charge and current densities on
the dynamics of the Maxwell field and the effect of the Maxwell field on the
dynamics of the one particle density operator.
[4] The quantum nonlinear channel generated by the evolving the one particle
density operator under an external field in the quantum Boltzmann equation.
Chapter 4
The HudsonParthasarathy quantum stochastic calculus with application to
quantum gate and quantum TPCP channel design.
[1] Boson and Fermion Fock spaces.
[2] Creation, annihilation and conservation operator fields in Boson and
Fermion Fock spaces.
[3] Creation, annihilation and conservation processes in Boson Fock space.
[4] The HudsonParthasarathy quantum Ito formulae.
[5] HudsonParthasarathySchrodinger equation for describing the evolution
of a quantum system in the presence of bath noise.
[6] The GKSL (Lindlbad) master equation for open quantum systems.
[7] Design of the Lindlbad operators for an open quantum system for gener
ation of a given TPCP map, ie, design of noisy quantum channels.
[8] Schrodinger’s equation with Lindlbad noisy operators in the presence of
an external electromagnetic field. Design of the electromagnetic field so that
the evolution matches a given TPCP map in the least squares sense.
Stochastic Processes in Classical and Quantum Physics and Engineering 195
205
Chapter 5
NonAbelian matter and gauge fields applied to the design of quantum gates.
[1] Lie algebra of a Lie group.
[2] Gauge covariant derivatives.
[3] The nonAbelian matter and gauge field Lagrangian that is invariant
under local gauge transformations.
[4] Application of nonAbelian matter and gauge field theory to the design
of large sized quantum gates in the first quantized formalism, gauge fields are
assumed to be classical and the control fields and current are also assumed to
be classical.
Chapter 6
The performance of quantum gates in the presence of quantum noise.
where
Hk = −(�k − ieA(t, rk ))2 /2m + eΦ(t, rk )
= H0k + Vk
where
+Lindbladnoise
Here, we have neglected the cubic terms, ie, the terms involving ρ123 . After
making the appropriate cancellations, we get
+Lindbladnoise
We can express this approximate equation as
iρ1,t (t) = [H1 , ρ1 (t)] + (N − 1)T r2 [V12 , exp(−it.ad(H1 + H2 ))(ρ1 (0) ⊗ ρ1 (0))]
which means that the above quantum Boltzmann equation as it evolves the one
particle state, determines a nonlinear trace preserving mapping. This suggests
that the quantum Boltzmann equation can be used to design nonlinear TPCP
maps by controlling the electromagnetic potentials A, Φ as well as the Lindblad
terms.
Now let ρ1 (t, r, r� ) denote the position space representation of the one particle
density operator. Then the charge density corresponding to this operator is
given by eρ1 (t, r, r) and the current density is
This is based on the fact that if ψ(t, r) denotes a pure state in the posi
tion representation, ie, a wave function, then the mixed state is an average
of ψ(t, r)ψ(t, r' )∗ over ensembles of such pure states, ie,
J(t, r) = (ie/2m)(ψ(t, r)∗ (V − ieA(t, r))ψ(t, r) − ψ(t, r)(V + ieA(t, r))ψ(t, r)∗ )
DA(t, r) = µ0 J(t, r) = µ0 ((e/m)A(t, r)ρ1 (t, r, r)−(e/m)Vr! Im(ρ1 (t, r' , r)|r! =r ))
where
D = (c−2 ∂t2 − V2 )
is the wave operator. The complete system of the quantum Boltzmann or rather
quantum Vlasov (quantum for the density operator and classical for the em field)
is therefore summarized below:
V12 = V (|r1 − r2 |)
ρ1 (t) = exp(−itad(H1 ))(ρ1 (0))+
f t
(N −1) exp(−i(t−s)ad(H1 ))(T r2 [V12 , exp(−is.ad(H1 +H2 ))(ρ1 (0)⊗ρ1 (0))])ds
0
f t
+(N −1) exp(−i(t−s)ad(H1 ))(T r2 [V12 , exp(−isad(H1 +H2 +V12 ))(g12 (0))])ds
0
+Lindbladnoiseterms.
DA(t, r) = µ0 ((e/m)A(t, r)ρ1 (t, r, r) − (e/m)Vr! Im(ρ1 (t, r' , r)|r! =r ))
DΦ(t, r) = E−1
0 (eρ1 (t, r, r))
198 Stochastic Processes in Classical and Quantum Physics and Engineering
208CHAPTER 13. ABSTRACTS FOR A BOOK ON QUANTUM SIGNAL PROCESSING
The unperturbed field ψ0 (t, r), ie, when Aµ = 0 can be expanded as a superposi
tion of annihilation and creation operator fields in momentum space of electrons
and positrons. This is the case when we second quantize the motion of a sin
gle electronpositron pair. When however we second quantize the motion of N
identical electronpositron pairs and then form the marginal density operator of
a single electronpositron pair, we obtain the quantum Boltzmann equation for
the one particle density operator field ρ1 (t, r� , r� ) which in the absence of mu
tual interactions V (r1 − r2 ), is simply ψ0 (t, r)ψ0 (t, r� )∗ which is expressible as
a quadratic combination of Fermionic creation and annihilation operator fields
in momentum space. The single particle Dirac Hamiltonian is
with u(P, σ) and v(−P, σ) being eigenvectors for √ HD0 (P ) = (α, P ) + βm with
eigenvalues ±E(P ) respectively where E(P ) = P 2 + m2 . Thus, we can write
after discretization in the momentum domain,
�
ρ100 (t, r, r� ) = [a(k)a(m)f1 (t, r, r� |k, m) + a(k)a(m)∗ f2 (t, r, r� )] + H.c
k,m
∗
where a(k), a(k) are respectively Fermionic annihilation and creation operators
satisfying the canonical anticommutation relations
[a(k), a(m)∗ ]+ = δ(k, m), [a(k), a(m)]+ = 0 = [a(k)∗ , a(m)∗ ]+
After interaction with the quantum em field, the density operator ρ10 (t, r, r� )
satisfies the equation
where
Hint (t) = e(α, A(t, r)) − eΦ(t, r)
where A(t, r), Φ(t, r) are now both Bosonic quantum operator fields expressible
upto zeroth order approximations as linear combinations of Bosonic annihila
tion and creation operators c(k), c(k)∗ , k = 1, 2, ... that satisfy the canonical
commutation relations:
where
i∂t ρ101 (t) = [HD0 , ρ101 (t)] + [Hint (t), ρ100 (t)]
This has the solution
� t
ρ101 (t) = exp(−i(t − s)ad(HD0 )([Hint (s), ρ100 (s)])ds
0
It is clear that this expression for ρ101 (t) is a trilinear combination of the pho
tonic creation and annihilation operators and the electronpositron creation and
annihilation operators such that it is linear in the former and quadratic in the
latter. Thus, we can write approximately
�
ρ101 (t, r, r� ) = [c(k)a(l)a(m)g1 (t, r, r� |klm)+c(k)∗ a(l)a(m)g2 (t, r, r� |klm)+
klm
where
g12 (t, r1 , r2 |r1� , r2� )
Note that in perturbation theory,
ρ1 = ρ10 + ρ11
Let ρ(t) be the state of an evolving quantum system according to the dynamics
where V (t) is the control potential and θ(ρ) is the Lindblad noise term. Our aim
is to design this control potential V (t) so that the probability density ρ(t, r, r)
in the position representation tracks a given probability density function p(t, r)
and the quantum current (e/m)�r� Im(ρ(t, r, r� )|r� =r ) tracks a given current
density J(t, r). By analogy with pure state dynamics and fluid dynamical sys
tems, the corresponding problem is formulated as follows: Consider the wave
function ψ(t, r) that satisfies Schrodinger’s equation for a charged particle in an
electromagnetic field:
i∂t ψ(t, r) = (−(� + ieA(t, r))2 /2 − eΦ(t, r) + W (t, r)V (t, r))ψ(t, r)
The aim is to control the neural weight W (t, r), so that �pq (t, r) = |ψ(t, r)|2
tracks the fluid density ρ(t, r) that is normalized so that ρ(t, r)d3 r = 1 and
further that the probability current density
tracks a given current density J(t, r) of the fluid. Note that we are assuming
that p, J satisfy the equation of continuity:
∂t ρ + div(J) = 0
211
201
202 Stochastic Processes in Classical and Quantum Physics and Engineering
212CHAPTER 14. QUANTUM NEURAL NETWORKS AND LEARNING USING MIXED
This is an effective QNN algorithm for simulating the motion of a fluid because
the corresponding quantum mechanical probability and current density pairs
(pq , Jq ) automatically by virtue of Schrodinger’s equation satisfy the equation
of continuity and simultaneously guarantee that pq is a probability density, ie,
it is nonnegative and integrates over space to unity. Thus rather than using a
QNN for tracking only the density of a fluid or a probability density, we can in
addition use it to track also the fluid current density by adjusting the control
weights W (t, r). Note that for open quantum systems, ie, in the presence of
Lindblad noise terms, the equation of continuity for the QNN does not hold.
This means that when bath noise interacts with our QNN, we would not get an
accurate simulation of the fluid mass and current density. However, if there are
sources of mass generation in the fluid so that the equation of continuity reads
then we can use the Lindblad operator coefficient terms as additional control
parameters to simulate ρ, J for such a system with mass generation g > 0 or
mass destruction g < 0.
|J(t, r) − Jq (t, r)|2 = |Im(ψ(t, r)�ψ(t, r)∗ ) + A(t, r)|ψ(t, r)|2 − J(t, r)|2
is the spacetime instantaneous norm error square between the true fluid current
density and that predicted by the QNN. Its gradient w.r.t A(t, r) is
so
∂t Φ(t, r) = −µδ|p(t, r) − |ψ(t, r)|2 |2 /δΦ(t, r) =
2µ[ψ̄(t, r)δψ(t, r)/δΦ(t, r) + ψ(t, r)δψ̄(t, r)/δΦ(t, r)][p(t, r) − |ψ(t, r)|2 ]
Now using the Dyson series expansion,
� �
ψ(t, r) = U0 (t)ψ(0, r)+ (−i)n U0 (t−t1 )V (t1 )U0 (t1 −t2 )....U0 (tn−1 −tn )ψ(0, r)dt1 ...dtn−1
n≥1 0<tn <...<t1 <t
Stochastic Processes in Classical and Quantum Physics and Engineering 203
213
giving
Note that here, we can more generally consider the error energy to be a weighted
linear combination of the error energies between the fluid mass density and its
QNN estimate and that between the fluid current density and its QNN estimate:
E = w1 |p(t, r)−|ψ(t, r)|2 |2 +w2 |J(t, r)−Im(ψ(t, r)�ψ(t, r)∗ )+A(t, r)|ψ(t, r)|2 |2
while calculating the updates for the vector and scalar potentials using this
¯ r)
energy function, we must take into account the dependence of ψ(t, r) and ψ(t,
on both A(t, r) and Φ(t, r). For this we write the Hamiltonian as
where
H0 = −�2 /2, V1 (t) = Φ(t, r)+(1/2)(divA(t, r)+2(A(t, r), �)), V2 (t) = A(t, r)2 /2
= (−i/2)�ψ(t, r)
and taking conjugate
so we get
δ|ψ(t, r)|2 /δA(t, r) =
¯ r)�ψ(t, r) − ψ(t, r)�ψ̄(t, r))
(−i/2)(ψ(t,
¯ r)�ψ(t, r))
= Im(ψ(t,
from which we easily deduce the quantum Ito formula of Hudson and Parthasarathy.
We can also introduce a conservation process Λ(t)
�
Λ(t) = a∗n am < en |χ[0,t] |em >
n,m
These equations yield the other quantum Ito formulas involving the products
of the creation and annihilation operator differentials with the conservation
operator differentials. Now suppose that we assume that our wave function on
the tensor product of the system and bath space has the form
where |φ(u) > is the bath coherent state and |ψs (t) > is the system state.
Then substituting this into the HudsonParthasarathy noisy Schrodinger equa
tion gives us using
dA(t)|φ(u) >= dt.u(t)|φ(u) >
dA(t)∗ |φ(u) >= (dB(t) − u(t)dt)|φ(u) >
where
B(t) = A(t) + A(t)∗
is a classical Brownian motion process and since the bath is in the coherent
state, it is should be regarded as a Brownian motion with drift. Thus, the HPS
equation assumes the form for such a special class of pure states on system and
bath space assumes the form of a stochastic Schrodinger equatiion
i∂t |ψs (t) >= (−(iH + LL∗ /2)dt + u(t)dtL − (dB(t) − u(t)dt)L∗ )|ψs (t) >
L = LM + LD + Lint =
Thus we identify
¯ µ ∂µ − m)ψ,
LM = (−1/4)Fµν F µν , LD = ψ(iγ
¯ µ ψ.Aµ
Lint = eψγ
To proceed further, we write down the Schrodinger equation corresponding to
this second quantized system as
where H(t) is the second quantized Hamiltonian of the system expressed in terms
of the creation and annihilation operator fields of the photons and Fermions. To
206
216CHAPTERStochastic ProcessesNEURAL
14. QUANTUM in Classical and QuantumAND
NETWORKS Physics and Engineering
LEARNING USING MIXED
∂α ∂ α Aµ = eψ ∗ αµ ψ,
∂ µ Aµ = 0
in deriving the former. In the absence of interactions, we can express the solu
tions as �
ψ0 (x) = (a(k)fk (x) + a(k)∗ gk (x)),
k
�
µ
A (x) = c(k)φµk (x) + c(k)∗ φ¯µk (x)
k
where c(k), c(k) satisfy the Bosonic CCR while a(k), a(k)∗ satisfy the Fermionic
∗
CAR. Writing
we get
a.Zeroth order perturbation equations.
∂α ∂ α Aµ1 = eψ0∗ αµ ψ0
n−1
�
∂α ∂ α Aµn = e ψk∗ αµ ψn−1−k
k=0
Chapter 15 Chapter 15
The contribution of this term upto second order in the em potential to the
scattering amplitude between two states |p1 , σ1 , p2 , σ2 > and |p�1 , σ1� , p�2 , σ2� > in
which there are two electrons with the specified four momenta and zcomponent
of the spins or equivalently two positrons or one electron and one positron is
given by
Let u(p, σ) denote the free particle Dirac wave function in the momentumspin
domain. Thus it satisfies
(γ µ pµ − m)u(p, σ) = 0, σ = 1, 2
219
209
210 Stochastic Processes in Classical and Quantum Physics and Engineering
220CHAPTER 15. QUANTUM GATE DESIGN, MAGNET MOTION,QUANTUM FILTE
Taking the adjoint of this equation after premultiplying by γ 0 and making use
of the selfadjointness of the matrices γ 0 γ µ gives us
ū(p, σ)(γ µ pµ − m) = 0
where
ū(p, σ) = u(p, σ)∗ γ 0
Now observe that for one particle momentumspin states of an electron,
and therefore,
ψ(x)|p, σ >e = exp(−ip.x)u(p, σ)|0 >
and for a one particle positron state
or equivalently,
Thus one of the terms in A2 (p�1 , σ1� , p�2 , σ2� |p1 , σ1 , p2 , σ2 ) for two electron momen
tum state transitions is given by
�
−e2 ¯ 1 )⊗ψ(x
< p�1 , σ1� , p�2 , σ2� |(ψ(x ¯ 2 ))γ µ ⊗γ ν )(ψ(x1 )⊗ψ(x2 ))|p1 , σ1 , p2 , σ2 >
Dµν (x1 −x2 )d4 x1 d4 x2
�
= Dµν (x1 −x2 )(ū(p�1 , σ1� )⊗ū(p�2 , σ2� ))(γ µ ⊗γ ν )(u(p1 , σ1 )⊗u(p2 , σ2 ))exp(−i(p1 −p�1 ).x1 −i(p2 −p�2 ).x2 )d4 x1 d4 x2
ˆ µν (p1 − p� )δ 4 (p1 + p2 − p� − p� ).
=D 1 1 2
[(ū(p�1 , σ1� )γ µ u(p1 , σ1 )).(ū(p2� , σ2� )γµ u(p2 , σ2 ))/(p1� − p1 )2 ]δ 4 (p1 + p2 − p1 − p�2 )
Likewise another term having a different structure also comes from the same
expression for A2 and that is
[(ū(p�2 , σ2� )γ µ u(p1 , σ1 )).(ū(p1� , σ1� )γµ u(p2 , σ2 ))/(p2� − p1 )2 ]δ 4 (p1 + p2 − p1 − p�2 )
The total amplitude is a superposition of these two amplitudes. Here, Dµν (x)
is the photon propagator and Dˆ µν (p) is its four dimensional spacetime Fourier
transform and is given by ηµν /p2 where p2 = pµ pµ . This propagator is defined
by
Dµν (x1 − x2 ) =< 0|T (Aµ (x1 )Aν (x2 ))|0 >
Stochastic Processes in Classical and Quantum Physics and Engineering 211
15.2. ON THE MOTION OF A MAGNET THROUGH A PIPE WITH A COIL UNDER GR
Now we consider a situation when apart from the quantum em field, there is
also an external classical control em field described by the four potential Acµ .
Then in the nth order term in the Dyson series for the scattering of M electrons,
there will be a term of the form
�
< p�k , σk� , k
= 1, 2, ..., M |T (Hint (x1 )...Hint (xn ))|pk , σk , k = 1, 2, ..., M > d4 x1 ...d4 xn
where
¯
Hint (x) = eψ(x)γ µ
ψ(x)(Aµ (x) + Acµ (x))
Here, Aµ is the four potential of the quantum electromagnetic field while Acµ (x)
is the four potential of the classical cnumber control electromagnetic four po
tential. Expanding these, it is clear that this term will be a sum of terms of the
form
�
¯ k )γ µk ψ(xk ))|pk , σk , k = 1, 2, ..., M >
< p�k , σk� , k = 1, 2, ..., M T (Πnk=1 ψ(x
. < 0|T (Aµ1 (x1 )...Aµr (xr ))|0 > Πnm=r+1 Acµm (xm )d4 x1 ...d4 xn
Problem formulation
Assume that the coil is wound around a tube with n0 turns per unit length
extending from a height of h to h + b. The tube is cylindrical and extends from
z = 0 to z = d. Taking relativity into account via the retarded potentials, if
m(t) denotes the magnetic moment of the magnet, then the vector potential
generated by it is approximately obtained from the retarded potential formula
as
A(t, r) = µm(t − r/c) × r/4πr3 − − − (1)
where we assume that the magnet is located near the origin. This approximates
to
A(t, r) = µm(t) × r/4πr3 − µm� (t) × r/4πcr2 − − − (2)
Now Assume that the magnet falls through the central axis with its axis always
being oriented along the −ẑ direction. Then, after time t, let its z coordinate
be ξ(t). We have the equation of motion
where F (t) is the z component of the force exerted by the magnetic field gener
ated by the current induced in the coil upon the magnet. Now, the flux of the
magnetic field generated by the magnet through the coil is given by
�
Φ(t) = n0 Bz (t, X, Y, Z)dXdY dZ − − − (4)
2 ,h≤z≤h+b
X2 +Y 2 ≤RT
where
Bz (t, X, Y, Z) = ẑ.B(t, X, Y, Z) − − − (5a)
B(t, X, Y, Z) = curlA(t, R − ξ(t)ẑ) − − − (5b)
where
R = (X, Y, Z)
and where RT is the radius of the tube. The current through the coil is then
where R0 is the coil resistance. The force of the magnetic field generated by
this current upon the falling magnet can now be computed easily. To do this,
we first compute the magnetic field Bc (t, X, Y, Z) generated by the coil using
the usual retarded potential formula
.(−ˆ
x.sin(φ)+ˆ
y.cos(φ))/|R−RT (ˆ
x.cos(φ)+ˆ
y.sin(φ))−ẑ(b/2n0 π)φ|)RT dφ−−−(7a)
Note that Ac (t, X, Y, Z) is a function of ξ(t) and also of m(t) and m� (t). It follows
that I(t) is a function of ξ(t), ξ � (t), m(t), m� (t), m�� (t). Then we calculate the
interaction energy between the magnetic fields generated by the magnet and by
the coil as
�
� −1
Uint (ξ(t), ξ (t), t) = (2µ) (Bc (t, X, Y, Z), B(t, X, Y, Z))dXdY dZ − − − (8)
and then we get for the force exerted by the coil’s magnetic field upon the falling
magnet as
F (t) = −∂Uint (ξ(t), ξ � (t), t)/∂ξ(t) − − − (9)
Remark: Ideally since the� interaction energy between the two magnetic fields
appears in the Lagrangian (E 2 /c2 − B 2 )d3 R/2µ, we should write down the
EulerLagrange equations using the Lagrangian
in the form
d
∂L/∂ξ � − ∂L/∂ξ = 0 − − − (11)
dt
which yields
d
M ξ �� (t) − ∂U/∂ξ � + ∂Uint /∂ξ + M g = 0 − −(12)
dt
3. Numerical simulations
Define the force function as in (9):
Neglecting the term involving the gradient of Uint w.r.t ξ � (this is justified
because the dominant reaction force of the coil field on the magnet is due to the
spatial gradient of the dipole interaction energy), we can discretize (2) with a
time step of $δ to get
Fig.1 shows plot of ξ[n] obtained using a ”for loop”. The current through the
coil as a function of time has also been plotted in fig.2 using the formula
�
d
I(t) = R0−1 (n0 Bz (t, X, Y, Z)dXdY dZ)
dt 2 ,h≤z ≤h+b
X2 +Y 2 ≤RT
= I(t|ξ(t), ξ � (t))
where
with
A(X, Y, Z) = µ0 .m × R/4πR3 , R = (X, Y, Z)
Here relativistic corrections have been neglected. The formula for the interaction
energy between the magnet and the field generated by the coil discussed in the
previous section can be simplified after making appropriate approximations to
−m.Bc = mBcz where Bcz , the zcomponent of the reaction magnetic field
produced by the coil is evaluated at the site of the magnet, ie, at (0, 0, ξ(t)).
Thus, the formula for the force of the field on the coil is approximated as
The coil magnetic field is evaluated using the approximate nonrelativistic Biot
Savart formula
� 2n0 π
Bc (t, X, Y, Z|ξ, ξ � ) = (µ0 I(t|ξ, ξ � )/4π)) RT dφ(−ˆ
x.sin(φ)+ˆ
y.cos(φ))×
0
In this expression, ξ(t) is the position vector of the centre of the magnet at time
t and R� is the position on the magnet electron loop relative to its centre. Since
the magnet current is a constant in the case of a permanent magnet, the above
simplifies to
�
A(t, R) = (µI0 /4π) dR� /|R − ξ(t) − R� |
magnet
in the special case m is a constant. It is also clear from the above formulas that
for given ξ(t), ξ � (t), the coil current is a linear function of m. This is because
Stochastic Processes in Classical and Quantum Physics and Engineering 215
15.2. ON THE MOTION OF A MAGNET THROUGH A PIPE WITH A COIL UNDER GR
where
Ik = I(tk ), Pk = P (ξ(tk ), (ξ(tk+1 ) − ξ(tk ))/(tk+1 − tk ))
The estimates of m obtained in this way for different measurements are shown
in Table(.).
Remark: m is the z component of the magnetic moment which equals in mag
nitude the total magnetic moment since the magnet is assumed to be oriented
along the z direction. The flux through the coil can be expressed as
�
Φ(t) = m.(µn0 /4π) curl(ẑ×R−ξ(t)ẑ)/|R−ξ(t)ẑ |3 ).ẑdXdY dZ
2 ,h≤Z≤h+b
X 2 +Y 2 ≤RT
P (ξ(t), ξ � (t)) =
�
d
[(µn0 /4πR0 ) curl(ẑ × R/|R − ξ(t)ẑ|3 ).ẑdXdY dZ
dt 2 ,h≤Z ≤h+b
X 2 +Y 2 ≤RT
where the functions fk (x), gk (x) are eigen functions of the Dirac equation with
boundary conditions determined by the vanishing of the Dirac wave function
on the boundary of the magnet. These are four component wave functions. In
other words, they satisfy
for all x within the bounding walls of the magnet and they vanish on the bound
ary, or rather the charge density corresponding to these functions as well as the
normal component of the spatial current density vanishes on the boundary. The
magnetic dipole moment of the magnet after second quantization is given by
�
m(t) = ψ(x)∗ (eJ/2m)ψ(x)d3 x
M
where
J = L + gσ/2
is the total orbital plus spin angular momentum of an electron taking the g
factor into account, normally we put g = 1. It is a conserved vector operator
for the Dirac Hamiltonian. The a(k), a(k)∗ are the annihilation and creation
operators of the electrons and positrons. They satisfy the CAR
giving
� t
δξ(t) = (t − s)F (s, ξ0 (s), ξ0� (s), a, a∗ )ds
0
[γ(k), γ(l)∗ ]+ = 0
and further they anticommute with the a(m), a(m)∗ . To check the consistency
of these operations, we have
in agreement with
γ(m)γ(k) = −γ(k)γ(m)
Further,
a(k)a(m)∗ |ψ(γ) >= (δ(k, m) − a(m)∗ a(k))|ψ(γ) >=
(δ(k, m) − a(m)∗ γ(k))|ψ(γ) >= (δ(k, m) + γ(k)a(m)∗ )|ψ(γ) >
while on the other hand,
and for this to equal γ|ψ(γ) >, we require that c2 = c5 = c6 = 0 and c3 = −c4 .
In other words,
|ψ(γ) >= (c1 γ + c3 )|0 > −c3 γ|1 >
We then get
a∗ |ψ(γ) >= (−c1 γ + c3 )|1 >
Note that we require that
γγ ∗ + γ ∗ γ = 1
We have
∂/∂γ|ψ(γ) >= c1 |0 > −c3 |1 >
∗
which cannot equal a |ψ(γ) >. So to get agreement with this equation also, we
must suppose that
d2 = d4 = 0, d3 = −c2 , d1 = −c4
This gives
Thus,
c 1 = c3 = 0
Stochastic Processes inFILTERING
15.3. QUANTUM Classical andTHEORY
Quantum APPLIED
Physics andTO QUANTUM FIELD 219
Engineering THEORY FO
and we get
|ψ(γ) >= (c2 γ ∗ + c4 )|0 > +(−c4 γ − c2 γ ∗ γ)|1 >
As a check, we compute
Now suppose that we have a term for example as a(k)∗ a(m)a(n)∗ a(r) and
we wish to evaluate its expectation value in the coherent state |ψ(γ) >. It will
be given by
< ψ(γ)|a(k)∗ a(m)a(n)∗ a(r)|ψ(γ) >=
−γ(r) < ψ(γ)|a(k)∗ a(m)a(n)∗ |ψ(γ) >
= −γ(r)γ(k)∗ < ψ(γ)|a(m)a(n)∗ |ψ(γ) >=
= −γ(r)γ(k)∗ < ψ(γ)|δ(m, n) − a(n)∗ a(m)|ψ(γ) >
= −δ(m, n)γ(r)γ(k) + γ(r)γ(k)∗ γ(n)∗ γ(m)
The problem is how to interpret this in terms of real numbers ?
= Le + Lkg + Lint
where Le , the electronic Lagrangian, is given by
¯ µ ∂µ − me )ψ,
Le = ψ(iγ
and Lint , the interaction Lagrangian between the KG field and the electron field
is given by
¯
Lint = −eψψ.φ
Note that we could also include interaction terms of the form given by the real
and imaginary parts of
ψ̄γ µ1 ...γ µk ψ.∂µ1 ...∂µk φ
The Schrodinger evolution equation for such a system of quantum fields is ob
tained by introducing Fermionic and Bosonic creation and annihilation operators
to describe the noninteracting fields in the presence of zero potential V , then
applying perturbation theory to approximately solve for the interacting fields
in the presence of the potential V , then use this approximate solution to write
down an expression for the total second quantized field Hamiltonian in terms of
the creation and annihilation operators.
The electronpositron field interacting with a KG field and the photon field:
¯ µ (i∂µ + e1 Aµ ) + e2 φ − m1 )ψ
L = (−1/4)Fµν F µν + ψ(γ
+(1/2)∂µ φ.∂ µ φ − m2 φ2 /2
The interaction Hamiltonian derived from this expression is
�
¯
Hint (t) = e1 ψ(x)γ µ
ψ(x)Aµ (x)d3 x
�
+e2 ¯
ψ(x)ψ(x)φ(x)d 3
x
One of the terms for the scattering amplitude of two electrons in the Dyson
series formulation is
�
2 � � ¯ 1 )γ µ ψ(x1 )ψ(x
e1 < p1 , p2 | T (ψ(x ¯ 2 )γ ν ψ(x2 ))DA (x1 − x2 )d4 x1 d4 x2 |p1 , p2 >
µν
Another term is
�
¯ 1 )ψ(x1 )ψ(x
e22 < p�1 , p�2 | T (ψ(x ¯ 2 )ψ(x2 ))Dφ (x1 − x2 )d4 x1 d4 x2 |p1 , p2 >
A
p2 > Dµν (x1 −x2 )Dφ (x3 −x4 )dx1 dx2 dx3 dx4
Stochastic Processes in Classical and Quantum Physics and Engineering 221
15.4. LECTURE ON THE ROLE OF STATISTICAL SIGNAL PROCESSING IN GENERAL
A
.Dµν (x1 − x2 )Dφ (x3 − x4 )dx1 dx2 dx3 dx4
One of the terms in the scattering amplitude of two electrons in the presence
of an external electromagnetic potential is
A(p�1 , p�2| p1 , p2 )
�
=< p�1 , p�2 | ¯ 1 )γ µ T (ψ(x1 )ψ(x
Acµ (x1 )T (ψ(x ¯ 2 ))γ ν ψ(x2 )ψ(x
¯ 3 ))γ ρ ψ(x3 ))Dνρ (x2 −x3 )dx1 dx2 dx3 |p1 , p2 >
�
= ¯ 1 )γ µ Se (x1 −x2 )γ ν ψ(x2 )ψ(x
Acµ (x1 ) < p�1 , p2� |T (ψ(x ¯ 3 )γ ρ ψ(x3 )|p1 , p2 > Dνρ (x2 −x3 )dx1 dx2 dx3
which evaluates to
�
Acµ (x1 )exp(i(p�1 .x1 +p�2 .x3 −p1 .x2 −p2 .x3 ))ū(p�1 )γ µ Se (x1 −x2 )γ ν u(p1 ).ū(p2� )γ ρ u(p2 )Dνρ (x2 −x3 )dx1 dx2 dx3
x1 − x2 = x, x2 − x3 = y
and by expressing Acµ (x1 ) in terms of its momentum domain Fourier integral,
this integral can be expressed as
(ρ + p)v ν v:ν
µ
− p,µ − f µ + v µ (p,ν v ν + fν v ν ) = 0
v,t (t, r) = F1 (t, r, v(t, r), �v(t, r), ρ(t, r), �ρ(t, r)) + w(t, r)
where w is the noise field. Now to quantize this system of equations, there are
two available routes. The first is to introduce an auxiliary four vector field η(t, r)
which acts as a Lagrange multiplier and write down the Lagrangian density for
the pair of fields ξ, η as
L(ξ, η, ξ,t , �ξ, �η) = η(t, r)T (ξ,t (t, r) − F (t, r, ξ(t, r), �ξ(t, r)))
δξ L = 0
which result in
Now we can express the Hamiltonian density of this system of two fields as
follows. The Hamiltonian density of this system of fields is first computed: The
canonical momenta are
Pη = ∂L/∂η,t = 0
Pξ = ∂L/∂ξ,t = η
H = ηξ,t − L
It is impossible to bring this to Hamiltonian form owing to the strange nature
of of the constraints involved. Indeed, to bring this to Hamiltonian form, ξ,t
must be expressed in terms of the canonical momenta, ξ, η and their spatial
gradients which is not possible. Therefore, we add a regularizing term to give
the Lagrangian density the following form:
Pξ = ∂L/∂ξ,t = η + �.ξ,t ,
Pη = ∂L/∂η,t = µη,t
Thus,
H = Pξ .ξ,t + Pη .η,t − L =
E(I(P, f ) − I(P ∪ Q, f ))2 ≤ T.maxk suptk <s<tk+1 E(f (s) − f (tk ))2
and that this quantity converges to zero as |P | → 0. Deduce using the inequality
ρ(t + τ ) − ρ(t) = −i[H0 , ρ(t)]τ − (τ 2 /2) < [V (t), [V (t), ρ(t)]] >
= −i[H0 , ρ(t)]τ − (τ 2 /2)(< V (t)2 ρ(t) + ρ(t)V (t)2 − 2V (t)ρ(t)V (t) >)
Now, write �
V (t) = ξ(k)Lk
k
where ξ(k) are iid r.v’s with zero mean and variance 1/τ in accordance with the
assumption that V (t) is operator valued white noise. L�k s are assumed to be
selfadjoint operators in the Hilbert space. Then show that the above equation
gives
�
ρ� (t) = −i[H0 , ρ(t)] − (1/2) (L2k ρ(t) + ρ(t)L2k − 2Lk ρ(t)Lk )
k
which is a special case of the master equation for open quantum systems.
Problem: let
H = P 2 /2m + U (Q)
in L2 (R) where P = −id/dQ and put
Lk = gk (Q, P ) = αk Q + βk P
Now,
[H, Q] = −iP/m, [H, P ] = iU ' (Q)
Make the induction hypothesis
n
2 n
2
n k n
ad(H) (Q) = Fnk (Q)P , ad(H) (P ) = Gnk (Q)P k
k=0 k=0
exp(tA).exp(tB) = exp(Z(t))
Then,
exp(Z)g(adZ)(Z ' ) = Aexp(Z) + exp(Z)B
where
g(z) = (1 − exp(−z))/z
Thus,
g(adZ)(Z ' ) = exp(−adZ)(A) + B
or equivalently
Z ' = f1 (adZ)(A) + f2 (adZ)(B)
where
f1 (z) = g(z)−1 , f2 (z) = exp(−z)g(z)−1
226 Stochastic Processes in Classical and Quantum Physics and Engineering
236CHAPTER 15. QUANTUM GATE DESIGN, MAGNET MOTION,QUANTUM FILTER
In particular for X = Q,
Q� = P/m − (1/2)([Q, L∗ ]L + L∗ [L, Q])
and for X = P ,
P � = −U � (Q) − (1/2)([P, L∗ ]L + L∗ [L, P ])
Now to get an equation that describes the motion of a particle in a potential
U (Q) with linear velocity damping, we assume that
L = aQ + bP, L∗ = āQ + ¯bP
Then,
¯ + ¯bP ] = ib¯
[L, Q] = [aQ + bP, Q] = −ib, [Q, L∗ ] = [Q, aQ
¯
[Q, L∗ ]L + L∗ [L, Q] = ib(aQ ¯ + ¯bP )(−ib) = 2.Im(āb)Q
+ bP ) + (aQ
¯ + ¯bP )(ia) = 2.Im(āb)P
[P, L∗ ]L + L∗ [L, P ] = (−iā)(aQ + bP ) + (aQ
Thus the HeisenbergLindblad equations read
Q� = P/m − Im(āb)Q,
P � = −U � (Q) − Im(āb)P
Eliminating P between these two equations gives us
Q�� = (−U � (Q) − m.Im(āb)(Q� + Im(āb)Q))/m − Im(āb)Q�
or
mQ�� = −(U � (Q) + (Im(āb))2 )Q − 2Im(āb)Q�
which is the velocity damped equation in a potential U (Q) + γ 2 Q2 /8 and with
velocity damping coefficient γ = 2Im(āb). The harmonic correction to the po
tential U (Q) is actually a quantum correction as can be verified by not assuming
Planck’s constant h = 1 as we have done here. (Here h stands for Planck’s con
stant divided by 2π). Now, the quantum fluctuationdissipation theorem can
also be expressed as
L∗ L + exp(−βadH)(L∗ L) − 2Lexp(−βadH)(L∗ ) = 0
For high temperatures or equivalently small β, this reads
[L∗ , L] − β([H, L∗ L] − L[H, L∗ ]) = 0
where O(β 2 ) terms have been neglected. Now for L = aQ + bP , we get
¯ + ¯bP, aQ + bP ] = iab
[L∗ , L] = [aQ ¯ − iab¯ = 2Im(āb)
[H, L∗ L] = [H, L∗ ]L + L∗ [H, L]
so
[H, L∗ L] − L[H, L∗ ] = [[H, L∗ ], L] + L∗ [H, L]
228 Stochastic Processes in Classical and Quantum Physics and Engineering
238CHAPTER 15. QUANTUM GATE DESIGN, MAGNET MOTION,QUANTUM FILTE
so that
� �
W (t, Q, P )dQ = ρ̂(t, P1 , P2 )δ(P1 − P2 )δ((P1 + P2 )/2 − P )dP1 dP2
= ρ̂(t, P, P )
is the marginal density of P (t). This suggest strongly that the complex valued
functionW (t, Q, P ) should be regarded as the quantum analogue of the joint
probability density of (Q(t), P (t)). Our aim is to derive the pde satisfied by W
Stochastic Processes in Classical and Quantum Physics and Engineering 229
15.7. WIGNERDISTRIBUTIONS IN QUANTUM MECHANICS 239
when ρ(t) satisfies the quantum FokkerPlanck equation. To this end, observe
that
� �
[P 2 , ρ]ψ(Q) = (− ρ,11 (Q, Q� )ψ(Q� )dQ� + ρ(Q, Q� )ψ �� (Q� )dQ�
�
= (−ρ,11 + ρ,22 )(Q, Q� )ψ(Q� )dQ�
Also
[U, ρ](Q, Q� ) = (U (Q) − U (Q� ))ρ(Q, Q� )
¯ + ¯bP )(aQ + bP )ρ(Q, Q� )
L∗ Lρ(Q, Q� ) = (aQ
= |a|2 Q2 ρ(Q, Q� ) − |b|2 ρ,11 (Q, Q� ) − iab
¯ Qρ,1 (Q, Q� )
¯
−iab(ρ(Q, Q� ) + Qρ,1 (Q, Q� ))
How to transform from W (Q, P ) to ρ(Q, Q� )?
�
W (Q, P ) = ρ(Q + q/2, Q − q/2)exp(−iP q)dq
gives �
W (Q, P )exp(iP q)dP/(2π) = ρ(Q + q/2, Q − q/2)
or equivalently,
�
W ((Q + Q� )/2, P )exp(iP (Q − Q� ))dP/(2π) = ρ(Q, Q� )
So
�
ρ,1 (Q, Q� ) = ((1/2)W,1 ((Q+Q� )/2, P )+iP W ((Q+Q� )/2, P ))exp(iP (Q−Q� ))dP
In the classical case, ρ(Q, Q� ) is diagonal and so such a term contributes only
when Q� is in the vicinity of Q. For example, if we have
then,
(U (Q� ) − U (Q))ρ(Q, Q� ) = ρ2 (Q)(U (Q� ) − U (Q))δ � (Q� − Q)
or equivalently, �
(U (Q� ) − U (Q))ρ(Q, Q� )f (Q� )dQ� =
just as in the classical case. In the general quantum case, we can write
� �
ρ(Q, Q� )f (Q� )dQ� = n
ρn (Q)∂Q f (Q)
n≥0
Now, this is equivalent to saying that the kernel of ρ in the position represen
tation is given by �
ρ(Q, Q� ) = ρn (Q)δ (n) (Q − Q� )
n
and hence
�
(U (Q� )−U (Q))ρ(Q, Q� )f (Q� )dQ�
� �
= ρn (Q) δ (n) (Q−Q� )(U (Q� )−U (Q))f (Q� )dQ�
n
�
� �
= ρn (Q)(−1)n ∂Q
n
� ((U (Q ) − U (Q))f (Q ))|Q� =Q
n
� � �
n n
= ρn (Q)(−1) U (k) (Q)f (n−k) (Q)
k
n≥0,1≤k≤n
Stochastic Processes in Classical and Quantum Physics and Engineering 231
15.7. WIGNERDISTRIBUTIONS IN QUANTUM MECHANICS 241
Thus,
(U (Q� ) − U (Q))ρ(Q, Q� ) =
� � �
k n
ρn (Q)(−1) U (k) (Q)δ (n−k) (Q� − Q)
k
n≥0,1≤k≤n
−U (Q)∂P f (Q, P )
and thus
(U (Q� ) − U (Q))ρ(Q, Q� ) =
� �
−2 exp(iP (Q−Q� )) (i/2)2k+1 ((2k+1)!)−1 (U (2k+1) ((Q+Q� )/2)∂P2k+1 W ((Q+Q� )/2, P ))dP
k≥0
and likewise,
�
ρ,22 (Q, Q� ) = ((1/4)W,11 ((Q+Q� )/2, P )−P 2 W ((Q+Q� )/2, P )−iP W,1 ((Q+Q� )/2, P ))exp(iP (Q−Q� ))dP
232 Stochastic
242CHAPTER ProcessesGATE
15. QUANTUM in Classical and Quantum
DESIGN, MAGNET Physics and Engineering FILTE
MOTION,QUANTUM
Thus, the noiseless quantum FokkerPlanck equation (or equivalently, the Liouville
SchrodingerVonNeumannequation)
is equivalent to
i∂t (W (t, (Q + Q� )/2, P )) =
(−iP/m)W,1 (t, (Q + Q� )/2, P )
�
+2 (i/2)2k+1 ((2k + 1)!)−1 (U (2k+1) ((Q + Q� )/2)∂P2k+1 W (t, (Q + Q� )/2, P ))
k≥0
or equivalently,
i∂t W (t, Q, P ) =
(−iP/m)W,1 (t, Q, P ) + U � (Q)∂P W (t, Q, P )+
�
+2 (i/2)2k+1 ((2k + 1)!)−1 (U (2k+1) (Q)∂P2k+1 W (t, Q, P ))
k≥0
This can be expressed in the form of a classical Liouville equation with quantum
correction terms:
2πhC = 1
Then, �
ρ(Q + q/2, Q − q/2) = W (Q, P )exp(iP q/h)dP
Further,
the equation
or equivalently,
which clearly shows that the quantum correction to the Liouville equation is a
power series in h2 beginning with the first power of h2 . Note that the quantum
corrections involve ∂P2k+1 W (t, Q, P ), k = 1, 2, ... but a diffusion term of the form
∂P2 W is absent. It will be present only when we include the Lindblad terms for
it is these terms that describe the effect of a noisy bath on our quantum system.
More generally, if
�
θ(ρ) = (L∗k Lk ρ + ρL∗k Lk − 2Lk ρLk )
k
where
Lk = gk (Q)
are real functions, then
where �
F (Q, Q� ) = (gk (Q) − gk (Q� ))2
k
We define
G(Q, q) = F (Q + q/2, Q − q/2)
or equivalently,
F (Q, Q� ) = G((Q + Q� )/2, Q − Q� )
Then,
θ(ρ)(Q, Q� ) = G((Q + Q� )/2, (Q − Q� )ρ(Q, Q� ) =
� �
Gn ((Q + Q� )/2)(Q − Q� )n W ((Q + Q� )/2, P )exp(iP (Q − Q� ))dP
n≥0
� �
= Gn ((Q + Q� )/2)in ∂Pn W ((Q + Q� )/2, P )exp(iP (Q − Q� ))dP
n≥0
ie,
Gn (Q) = n!−1 ∂qn G(Q, q)|q=0
Stochastic Processes in Classical and Quantum Physics and Engineering 235
15.7. WIGNERDISTRIBUTIONS IN QUANTUM MECHANICS 245
where
L = g0 (Q) + g1 (Q)P
We have
L∗ = ḡ0 (Q) + P ḡ1 (Q)
L∗ Lψ(Q) = ḡ0 +P ḡ1 )(g0 +g1 P )ψ(Q)
= |g0 (Q)|2 ψ(Q)+P |g1 (Q)|2 P ψ(Q)+ḡ0 g1 P ψ(Q)+P ḡ1 g0 ψ(Q)
Now, defining
|g1 (Q)|2 = f1 (Q), |g0 (Q)|2 = f0 (Q),
we get
P f1 (Q)P ψ(Q) = −∂Q f1 (Q)ψ � (Q) = −f1� (Q)ψ � (Q) − f1 (Q)ψ �� (Q)
Assume that after time t, the centre of the element is located at the position
R(t). It should be noted that if n(t) denotes the unit vector along the length of
the element at time t, then we can write
m(t) = K1 n(t), p(t) = K2 n(t)
where K1 , K2 are constants. Thus, we can write
m(t) = K1 S(t)n(0), p(t) = K2 S(t)n(0)
The angular velocity tensor Ω(t) in the lab frame can be expressed using
S � (t) = Ω(t)S(t)
or equivalently,
Ω(t) = S � (t)S(t)−1 = S � (t)S(t)T
The angular velocity tensor in the rest frame of the element is
where e3 is the unit vector along the z direction. While writing the Euler
Lagrange equations of motion from this Lagrangian, we must take into ac
count the constraint S(t)T S(t) = I by adding a Lagrange multiplier term
−T r(Λ(t)(S(t)T S(t) − I)) to the above Lagrangian.
as
A(t, r) = (µ/4π)[m(t − r/c) × r/r3 + m� (t − r/c) × r/cr2 ]
Remark: The interaction energy between a system of charged particles in
motion and a magnetic field is derived from the Hamiltonian term
�
(pi − ei A(t, ri ))2 /2mi
i
Now if the system of charged particles spans a small region of space around the
point r and in this region, the magnetic field does not vary too rapidly, then we
can approximate the above interaction energy term by
� �
U =− ei (vi , A(t, r + ri� )) = − ei (vi , (ri� , �)A(t, r))
i i
For bounded motion, the time average of the rhs is zero and we get denoting
time averages by < . >,
< (vi , (ri� , �)A) >= − < (ri� , (vi , �)A) >
Hence,
< (vi , (ri� , �)A) >= (1/2(< (vi , (ri� , �)A) − (ri� , (vi , �)A) >)
provided that A does not vary too rapidly with time. This formula can be
expressed as
U = −m.B
where �
m = (1/2) ei ri� × vi
i
and hence,
N
�
ΛN (N f ) = logE[exp( f (X(n)))] =
n=1
249
239
240 Stochastic Processes in Classical and Quantum Physics and Engineering
250CHAPTER 16. LARGE DEVIATIONS FOR MARKOV CHAINS, QUANTUM MAST
limN →∞ N −1 ΛN (N f ) = log(λ(f ))
We claim that
�
I(q) = J(q) = supu>0 q(x)log(u(x)/πu(x))
x
where �
πu(x) = π(x, y)u(y)
y
I(q) ≤ J(q)
f (x) = log(u(x)/πu(x))
Then, clearly
πf u(x) = u(x)
and hence for every n ≥ 1,
πfn u = u
and hence
log(λ(f )) = limn n−1 log(µT πfn u) = 0
ie
λ(f ) = 1
But then, both sides of (a) equal zero and the proof is complete.
and
ν
gµν = 0, µ =
We substitute this into the EinsteinHilbert Lagrangian density
√
Lg = g µν −g(Γµνα
Γβαβ − Γµβ
α
Γβνα )
and retain terms only upto O(δ 2 ). Likewise, we compute the Lagrangian density
of the electromagnetic field
Lem = (−1/4)Fµν F µν
and retain terms only upto O(δ), ie, terms linear in the Vµ . The resulting
Lagrangian of the gravitational field interacting with the external em field will
have the form
A0 = g 0µ Aµ = g 00 A0 = (1 − δ.V0 )A0
Keeping this model in mind, it can be abstracted into the Lagrangian of a set of
KleinGordon field interacting with an external cnumber field. The canonical
momenta are
Pµ = ∂L/∂Vµ,0 = 2δ 2 C1 (µν0β)Vν,β
= 2δ 2 C1 (µν00)Vν,0 + 2δ 2 C1 (µν0r)Vν,r
which can be easily solved to express Vµ,0 as a linear combination of Pν and
Vν,r . In this way our Hamiltonian density has the form
H = P 2 /2 + ω 2 Q2 /2 + β(P Q + QP ) + γE(t)Q
H = P 2 /2 + ω 2 Q2 /2 + E(t)(aP + bQ)
If E(t) is taken as quantum noise coming from a bath, then we can derive the
following HudsonParthasarathy noisy Schrodinger equation for the evolution
operator U (t):
where
L = aQ + bP
where
L = aP + bQ, H = P 2 /2 + U (Q)
The coherent state in which the Belavkin filter is designed to operate is
Then,
dYo (t) = dYi (t) − jt (c̄L + cL∗ )dt
where
jt (X) = U (t)∗ XU (t)
Note that
Let
dπt (X) = Ft (X)dt + Gt (X)dYo (t)
with
Ft (X), Gt (X) ∈ ηo (t)
Let
dC(t) = f (t)C(t)dYo (t), C(0) = 1
The orthogonality principle gives
and hence by quantum Ito’s formula and the arbitrariness of the complex valued
function f (t), we have
E[(jt (X) − πt (X))dYo (t)|ηo (t)] + E[(djt (X) − dπt (X))dYo (t)|ηo (t)] = 0
Note that
where
θ0 (X) = i[H, X] − (1/2)(LL∗ X + XLL∗ − 2LXL∗ ),
θ1 (X) = −LX + XL = [X, L], θ2 (X) = L∗ X − XL∗ = [L∗ , X]
Then the above conditions give
Then,
dπt (X) = (πt (θ0 (X)) + πt (θ1 (X))u(t) + πt (θ2 (X))ū(t))dt
+Gt (X)(dYo (t) + πt (c̄L + cL∗ − cu(t) − c̄ū(t))dt)
Stochastic Processes in Classical and Quantum Physics and Engineering 245
16.3. BELAVKIN FILTER FOR THE WIGNER DISTIRBUTION FUNCTION255
Consider the simplified case when u = 0, ie, filtering is carried out in the
vacuum coherent state. Then, the above Belavkin filter simplifies to
dπt (X) = πt (θ0 (X))dt+(πt (XL∗ +LX)−πt (L+L∗ )πt (X))(dYo (t)−πt (L+L∗ )dt)
Equivalently in the conditional density domain, the dual of the above equation
gives the Belavkin filter for the conditional density
where
θ0∗ (ρ) = −i[H, ρ] − (1/2)(LL∗ ρ + ρLL∗ − 2L∗ ρL)
Now let us take
L = aQ + bP, a, b ∈ C
and
H = P 2 /2 + U (Q)
and translate this Belavkin equation to the Wignerdistribution domain and
then compare the resulting differential equation for W with the KushnerKallianpur
filter for the classical conditional density. We write
�
ρ(t, Q, Q� ) = W (t, (Q + Q� )/2, P )exp(iP (Q − Q� ))dP
Note that the KushnerKallianpur filter for the corresponding classical proba
bilistic problem
where
α = 2Re(a), β = 2Re(b)
where
L∗ p = −P ∂Q p + U � (Q)∂P p + γ.∂P (P f ) + (σ 2 /2)∂P2 p
246 Stochastic Processes in Classical and Quantum Physics and Engineering
256CHAPTER 16. LARGE DEVIATIONS FOR MARKOV CHAINS, QUANTUM MASTE
Let us see how this KushnerKallianpur equation translates in the quantum case
using the Wigner distribution W in place of p. As noted earlier, the equation
is equivalent to
2
+(σ12 /2)∂Q W (t, Q, P ) + (σ22 /2)∂P2 W (t, Q, P )
+d1 ∂Q ∂P W (t, Q, P ) + (d3 Q + d4 P + d5 )∂Q W (t, Q, P )
Remark: Suppose b = 0 and a is replaced by a/h. Then, taking into account
the fact that in general units, the factor exp(iP (Q − Q� )) must be replaced by
exp(iP (Q − Q� ))/h) in the expression for ρ(t, Q, Q� ) in terms of W (t, Q, P ), we
get the Lindblad correction as
This equation clearly displays the classical terms and the quantum correction
terms.
259
249
250 Stochastic Processes in Classical and Quantum Physics and Engineering
260CHAPTER 17. STRING AND FIELD THEORY WITH LARGE DEVIATIONS,STOCH
In the case when the field Bµν,ρ is a constant, this gives corrections to the string
field that are quadratic functions of the {aµ (n)}:
Xµ = Xµ(0) + Xµ(1)
where �
Xµ(0) = xµ + pµ τ − i aµ (n)exp(in(τ − σ))/n
n=0
�
−i bµ (n).exp(in(τ + σ))/n
n=0
and
(1) (0)ν
ηαβ Xµ,αβ = −�αβ Bµν,ρ (X)X,α
(0)ρ
X,β
= �αβ Bµν,ρ (aρ (n)bν (m)uα vβ exp(i(n(τ − σ) + m(τ + σ)))
+bρ (n)aν (m)vα uβ exp(i(n(τ + σ) + m(τ − σ))))
where
(uα )α=0,1 = (1, −1), (vα )α=0,1 = (1, 1)
The rate function of the Poisson process is computed as follows. Let λ(�) be the
mean arrival rate of the Poisson process. Then
� T
M� (f ) = E[exp(� f (t)dN (t))] =
0
� T
exp((λ/�) (exp(�f (t)) − 1)dt)
0
Thus the GartnerEllis logarithmic moment generating function of the family of
processes �.N (.) over the time interval [0, T ] is given by
� T
Λ(f ) = �.log(M� (f /�)) = λ (exp(f (t)) − 1)dt
0
f T f T
= supx (ξ " (t)x − λ(exp(x) − 1))dt = η(ξ " (t))dt
0 0
where
η(y) = supx (xy − λ(exp(x) − 1))
By the contraction principle, the rate function of the process {X(t) : 0 ≤ t ≤ T }
is given by
f T
JT (X) = η(X "" (t) + γX " (t) + U " (X(t)))dt
0
assuming σ = 1.
Application to control: let the potential U (x|θ) have some control parameters
θ. We choose θ so that
inflX −Xd l>δ JT (X)
is as large as possible. This would guarantee that the probability of deviation
of X from desired noiseless trajectory Xd obtained with θ = θ0 by an amount
more than δ is as small as possible. To proceed further, we write
η(X "" + γX " + U " (X|θ)) = η(e"" + γe" + U " (X|θ) − U " (Xd |θ0 ))
≈ η(e"" + γe" + U "" (Xd |θ0 )e + U1 (Xd |θ0 )T δθ + δθT U2 (Xd |θ0 )δθ)
where
U1 (Xd |θ0 ) = ∂θ U "" (Xd |θ0 ),
U2 (Xd |θ0 ) = (1/2)∂θ ∂θT U "" (Xd |θ0 )
In particular, consider the string field equations in the presence of a Poisson
source:
∂02 X µ − ∂12 X µ = fkµ (τ, σ)dN k (τ )/dτ
Here
σ 0 = τ, σ 1 = σ
By writing down the Fourier series expansion
�
X µ (τ, σ) = aµ (n)exp(in(τ − σ)) + δX µ (τ, σ)
n
where δX µ is the Poisson contribution, evaluate the rate function of the field
δX µ and hence calculate the approximate probability that the quantum average
of the string field in a coherent state will deviate from its unforced value by an
amount more than a threshold over a given time duration with the spatial region
extending over the entire string.
252 Stochastic Processes in Classical and Quantum Physics and Engineering
262CHAPTER 17. STRING AND FIELD THEORY WITH LARGE DEVIATIONS,STOCH
and hence
n−1
�
λ(n) = (I − 2µR)n λ(0) + 2µ (I − 2µR)n−k−1 2µr
k=0
where
h0 = R−1 r
is the optimal Wiener solution. It follows that if all the eigenvalues of I − 2µR
are smaller than unity in magnitude, then
limn→∞ λ(n) = h0
However, we shall show that Cov(h(n)) does not converge to zero. It converges
to a nonzero matrix. We write
or equivalently,
δh(n+1) = ((I−2µX(n)X(n)T )
−(I−2µR))λ(n)+(I−2µX(n)X(n)T )δh(n)+2µ(d(n)X(n)−r)
= −2µ(X(n)X(n)T − R)λ(n) + (I − 2µX(n)X(n)T )δh(n) + 2µ(d(n)X(n) − r)
and hence
E(δh(n + 1) ⊗ δh(n + 1)) =
Stochastic Processes in Classical and Quantum Physics and Engineering 253
17.4. STRING INTERACTING WITH SCALAR FIELD, GAUGE FIELD AND A GRAVIT
E(Xi (n)Xj (n)Xk (n)Xm (n)) = Rij Rkm + Rik Rjm + Rim Rjk
limn→∞ E(δh(n)δh(n)T )
Note that in the general nonGaussian iid case, in case that E(δh(n) ⊗ δh(n))
converges, say to V , then the above equation implies that
� �
2 µ ν
+ Φ(X)d σ + �αβ X,α X,β Bµν (X )d2 σ
In flat spacetime, gµν = ηµν and the approximate string equations simplify to
ρ ν
ηαβ Xµ,αβ = −Φ,µ (X) + �αβ Bµν,ρ (X)X,α X,β
Where
D(0) (σ) = ln|σ|
Now we raise the question about what field equations should be satisfied by Bµν
and Φ so that the string action has conformal invariance. It For solving this
problem, we must replace the above action by
�
S[X, Φ, g] = (1/2)η αβ exp(φ(σ))gµν (X)X,αµ ν 2
X,β d σ
� �
2 µ ν
+ exp(φ(σ))Φ(X)d σ + �αβ exp(φ(σ))X,α X,β Bµν (X)d2 σ
since this corresponds to using the string sheet metric exp(φ(σ))ηαβ and cor
respondingly replacing the string sheet area element d2 σ by the invariant area
element exp(φ(σ))d2 σ. For small φ, conformal invariance then requires the
quantum average of the above action to be independent of φ. Taking into ac
count the larger singularity involved in the product of four string field terms at
the same point on the string world sheet as compared with the product of two
string field terms, this means that the quantity
Another way to approach this problem is to start with the string equations
of motion:
�Xµ = −Φ,µ (X) + �αβ Bµν,ρ (X)X,α ρ ν
X,β
and express it in terms of perturbation theory by writing
X µ = xµ + δX µ = xµ + Y µ
256
266CHAPTER Stochastic
17. STRING Processes in Classical
AND FIELD THEORYand Quantum Physics and
WITH LARGE Engineering
DEVIATIONS,STOCHA
as
�Y µ = −Φ,µν (x)Y ν + �αβ (Bµν,ρ (x) + Bµν,ργ (x)Y γ )Y,α
ρ ν
Y ,β
giving
where x + Z is the string field in the absence of any interactions. With this
expansion in mind, we then require that
E = {i : pi > eb }
where �
ψ(s) = log p1−s
i
i
and where
b = b(s, R) = (ψ(s) − sR)/(1 − s)
and it is clear that
�
lims→0 ψ(s)/s = H(p) = − pi log(pi )
i
it follows that bn (s, R) → −∞ and hence exp(bn (s, R)) → 0 and further,
Now define
L
�
P (p, L) = p↓i
i=1
provided that s is chosen sufficiently close to zero and that R > H(p). In other
words, we have proved that if R > H(p), then
P (p(n) , exp(nR)) → 1
Motion of a string field in the presence of an external gauge field with poten
tials that are linear in the spacetime coordinates. Evaluation of the low energy
field action in terms of the statistics of the string field.
Writing �
Bµν,ρ (X) = An (µνρ|µ1 , ..., µn )X µ1 X µ2 ...X µn
we find that for the expansion
�
X µ = xµ − i a(n)µ exp(inua σa )/n − ib(n)µ exp(inv a σa )/n
n
where
(ua ) = (1, −1), (v a ) = (1, 1)
we find that
�αβ Bµν,ρ (X)∂β X ν )∂α X ρ =
�
�αβ (−i)n1 +...+nr +m1 +...+ms Crs (µνρ|µ1 , ...,
r,s,n1 ,...,nr ,m1 ,...,ms ,l1 ,l2 r
µr , ν1 , ..., νs )[Πk=1 (aµk (nk )/nk )Πk=1
s
(bνk (mk )/mk )]
(aν (l1 )bρ (l2 )exp(i(l1 u+l2 v).σ)−bν (l1 )aρ (l2 ))
exp(i(l1 v+l2 u).σ))uT �v.exp(i((n1 +...+nr )u+(m1 +...+ms )v).σ)
258 Stochastic Processes in Classical and Quantum Physics and Engineering
268CHAPTER 17. STRING AND FIELD THEORY WITH LARGE DEVIATIONS,STOC
where the coefficients Crs (.) are determined in terms of An (µνρ|µ1 , ...µn ). In
particular if Bµνρ are constants, then this gauge term contributes only quadratic
terms in the creation and annihilation operators and hence can be expressed as
�
Bµνρ = Bµν,ρ (uT �v)(aν (l1 )bρ (l2 )exp(i(l1 u+l2 v).σ)−bν (l1 )aρ (l2 ))exp(i(l1 v+l2 u).σ)
l1 ,l2
In short, in this special case, we can express the gauge field interaction corrected
string field upto first order perturbation terms as
� �
X µ (σ) = xµ + aµ (n)fn (σ) + Fνρ
µ
aν (n)aρ (m)gnm (σ)
n n,m
= xµ + δX µ (σ)
where
[aµ (n), aν (m)] = η µν δ(n + m)
Now consider a point field φ(x). Assume that its Lagrangian density is given
by L(φ(x), φ,µ (x)). The average action of this string field when string theoretic
corrections are taken into account in a coherent state |φ(u) > of the string is
given by
�
< φ(u)| L(φ(x + δX(σ)), φ,µ (x + δX(σ)))dσd4 x|φ(u) >
We can evaluate this action upto quadratic terms in the string perturbation by
using the Taylor approximations
φ(x + δX(σ)) = φ(x) + φ,µ (x)δX µ (σ) + (1/2)φ,µν (x)δX µ (σ)δX ν (σ)
φ,µ (x)(x + δX(σ)) = φ,µ (x) + φ,µν (x)δX µ (σ) + (1/2)φ,µν (x)δX µ (σ)δX ν (σ)
and then us the following identities:
0,
For n + m =
For n > 0
< φ(u)|aµ (n)aν (−n)|φ(u) >= η µν + uµ (n)uν (−n)
and so on.
so that the master equation obtained as the equation for ρs (t) where
is given by
[2] Given a master equation of the above type, how would you adapt the
parameters θk (t), k = 1, 2, ..., d where
d
� d
�
H = H0 + θk (t)Vk , L = L0 + θk (t)Lk
k=1 k=1
where T0 , Tk , Tkm are linear operators on the space of mixed states in the un
derlying Hilbert space. It follows then that
fields appearing in the original Lagrangian. Theh, we write down the equa
tions of motion corresponding to this stochastically perturbed Lagrangian. The
equations of motion will be stochastic partial differential equations which are
nonsupersymmetric because of the stochastic perturbations. Now we calculate
the change in the modified action functional under an infinitesimal supersym
metry transformation and require that this change be bounded by a certain
small number in the sense of expected value of modulus square calculated by
assuming that the Bosonic and Fermionic fields are solutions to the unperturbed
supersymmetric equations of motion. This condition will impose a restriction
on the amplitude of the cnumber stochastic fields. Keeping this restriction,
we then try to calculate using the theory of large deviations, the probability
that the solution of the stochasically perturbed field equations which are non
supersymmetric will deviate from the supersymmetric unperturbed solutions by
an amount more than a certain threshold. We then repeat this calculation by
adding error feedback control terms to the perturbed dynamical equations and
choose the the control coefficients so that the resulting deviation probability is
minimized. This in effect, amounts to attempting to restore supersymmetry by
dynamic feedback control when the supersymmetry has been broken.