Professional Documents
Culture Documents
(Marchenko) Demonstrational Optics I
(Marchenko) Demonstrational Optics I
Demonstrational Optics
Part 1: Wave and
Geometrical Optics
Oleg Marchenko
St. Petersburg State University
St. Petersburg, Russia
Sergei Kazantsev
Paris Observatory
Paris, France
and
Laurentius Windholz
Technical University of Graz
Graz. Austria
Marchenko, Oleg.
Demonstrational optics / Oleg Marchenko, Sergei Kazantsev, Laurentius Windholz.
p. cm.
Includes bibliographical references and index.
Contents: pt. 1. Wave and geometrical optics.
QC355.3.M372004
535--dc22
2003061896
Preface ix
1. HISTORICAL INTRODUCTION 1
1 Wave nature of light 1
2 Electromagnetic theory 6
3 Optical Spectroscopy 7
4 Quantum Optics 8
5 Atom model and relativity 10
6 Coherence and precision optical instrumentation 13
7 Light correlation and statistics 15
2. PROPERTIES OF LIGHT WAVES 17
1 Laws of electromagnetism 17
1.1 Maxwell's equations 17
1.2 Electric dipole radiation 20
2 Properties of electromagnetic waves 23
2.1 Wave equation 23
2.2 Plane waves 27
2.3 Plane monochromatic wave 28
2.3.1 Frequency, wavelength, light velocity 28
2.3.2 Transversality of electromagnetic waves 30
2.4 Spherical waves 33
2.5 Polarization states 35
3 Quasi-monochromatic waves 37
3.1 Envelope and carrier frequency 37
3.2 Spectrum of quasi-monochromatic oscillations 38
4 Energy and momentum of light 42
v
vi
Index
289
Preface
ix
Chapter 1
HISTORICAL INTRODUCTION
1
O. Marchenko et al., Demonstrational Optics
© Kluwer Academic/Plenum Publishers, New York 2003
2 DEMONSTRATIONAL OPTICS
Sun , as well as Sun rays breaking through clouds, led to the idea that
light propagates from a source as a beam along a straight line trajectory.
For a long time , light was thought to be a stream of particles, emitted
from all visible bodies, penetrating into the eye. This mod el held un-
til the beginning of the 18th cent ury, and even LNEWTON (1643-1727)
considered light to have corpuscular nature. This model of light is in
agreement with the laws of geometrical optics.
But observations showed, that light propagation is a more complex
process. Thorough examination of the boundary between shadow and
light showed, that light also penetrates into the region of the geometrical
shadow, violating slightly the idea of linear light propagation. Obser-
vation of the light and dark bands near the boundary of a geometrical
shadow, and the similar effect of alternating sequences of bright and dark
bands, when light passes through a narrow slit in an opaque screen, lead
to ideas that light may be deflected slightly by geometrical objects.
In order to explain this diffraction phenomenon the Dutch scien-
tist CH.HUYGENS (1629-1695) introduced the following concept of light
propagation [1]. Light can be represented as a kind of oscillation, or
light wave, while propagating. Every point of the light wave originating
from a source may act as the origin of secondary waves propagating in
all directions. The resultant wave front at a particular moment may be
regarded as a superposition of all the secondary waves. This principle,
known as the HUYGENS principle, allows the determination of the posi-
tion of the current wave front , if the position of the wave front is known
for an earlier moment. A geometrical scheme, illustrating this principle,
is shown in Fig.!.!. Based on HUYGENS' geometrical representation the
phenomenon of light diffraction is easy to understand when considerat-
ing secondary sources located outside the obstacle. Light waves arising
from these sources and propagating in all directions are the reason for
the bright bands within the region of the geometrical shadow.
Within the framework of HUYGENS' principle the phenomena of light
refraction and reflection have been explained as well. HUYGENS pro-
vided an explanation for the birefringence phenomenon, discovered by
E .BARTOLINUS (1625-1698) in 1665. It was supposed, that within the
birefringent crystal there exists a second extraordinary wave apart from
the normal ordinary one. Its velocity is dependent on the propagation
direction within the crystal. The shape of the wave front of the extra-
ordinary wave is an ellipse, and not a circle as for the ordinary wave.
Studying the birefringence in Iceland Spar, HUYGENS discovered the po-
larization of the ordinary and the extraordinary beams. An explanation
of this phenomenon was offered much later.
Historical Introduct ion 3
Figure 1.2. Interference of light waves from sunlight S in the double-slit scheme of
T .Young.
estimated the wavelength of the light wave and found it to be very small ,
on the order of magnitude of >. rv 500 nm.
At the beginning of the nineteenth century, a significant contribution
to the field of optics was made by A.FRESNEL (1788-1827). The main
idea of FRESNEL's work on the quantitative explanation of the diffrac-
tion phenomenon was the unification of HUYGENS' principle together
with the concept of secondary wave interference [3]. This new principle,
called later the HUYGENS-FRESNEL principle, allowed the description
of experimentally well-known diffraction phenomenona: The diffraction
from the edge of an opaque screen and the diffraction from a small hole
in the screen . Later on G .KIRCHHOFF (1832-1887) int roduced a rigorous
mathematical interpretation of the HUYGENS-FRESNEL principle. Even
today the KIRCHHOFF integrands are used for approximate computa-
tions of diffraction patterns.
In 1816 FRESNEL conducted several important optical experiments.
He observed the interference of waves reflected from two mirrors posi-
tioned at an angle close to 1800 • He also noted an interference pattern
created by two refracted light beams intersecting two glass prisms. Sig-
nificant improvements were achieved in the determination of the na-
ture of light by studies of interference phenomena by FRESNEL and
D.F.ARAGO (1786-1853) (Figs.1.4, 1.5). They also examined the in-
terference of polarized light beams . The experimental fact that no in-
terference pattern can be observed when superimposing two light waves
polarized in two mutually orthogonal planes was in contradiction to the
dominant concept of a longitudinally oscillating light wave. Based on
this crucial result, YOUNG proposed that the oscillations of light waves
are strictly transversal.
The idea of transversal oscillations in light waves enabled FRESNEL
to work out a new approach to the phenomena of polarization and the
propagation of light inside matter. According to FRESNEL light emit-
ted by a radiating body is not polarized. Every molecule of a body
emits a linearly polarized wavelet, but, due to the chaotic motion of the
molecules, the resultant light wave emerging from a radiating body after
superposition of the elementary wavelets results in a wave with oscilla-
tions that chaotically vary their direction. Such light is called natural
light.
Based on these considerations, FRESNEL obtained important results
concerning light wave propagation within anisotropic crystals. Waves
having mutually orthogonal oscillation directions can propagate in every
crystal, but they will propagate with different velocities if the crystal is
birefringent. Studies of the optical activity of quartz became the basis of
FRESNEL'S hypotheses on how the propagation velocity of the circularly
polarized waves depends on the type of polarization. Waves with left and
right circular polarizations have different velocities when propagating
along the optical axis of a crystal. FRESNEL showed experimentally,
that an incoming linearly polarized wave is split into two waves with left
-:0--------
:~
--_. --;,----- Screen
~
Figure 1.5. Light rays, falling on a rather small disk, are deflected from straight lines
due to diffraction on the disk, causing Poisson's spot.
6 DEMONSTRATIONAL OPTICS
2. Electromagnetic theory
The concept of the electromagnetic field, which was completed by
G.K.MAXWELL (1831-1879) was proved to be totally consistent with the
theoretical assumptions and notations of FRESNEL. The concept of the
force field was introduced by M.FARADAY (1791-1867) into the science
of electricity and magnetism in the first half of the nineteenth century
[4] . Initially the lines of force of electric and magnetic fields were treated
as an aid for describing electric and magnetic phenomena in a clear and
pictorial way. But during the following studies of electromagnetic fields,
especially after the discovery of electromagnetic induction by FARADAY,
this concept was filled with a new physical meaning and enabled the
formulation of the laws of electromagnetism in a concise and clear form.
The strict mathematical representation of the concept of the electro-
magnetic field was accomplished by MAXWELL [5]. He expressed the
phenomenological electromagnetic laws in terms of consistent mathe-
matical equations. MAXWELL'S hypotheses of the displacement current
connected the alternating electric and magnetic fields existing within
a limited space region, even when no electric conductivity exists in the
material bodies transporting the electromagnetic field. This fact became
the principle point for the physical introduction of free electromagnetic
waves. MAXWELL came to the conclusion, that light is an electromag-
netic wave with a frequency within a particular range. The velocity
of the electromagnetic waves propagating in a medium should depend
on the mediums electric permittivity and magnetic permeability (c, J.L) .
The measurements of these constants were performed independently as
electrical quantities, and from these figures the velocity of light could
be determined to a value consistent with optical measurements. At this
historical moment electromagnetism and optics led to the same conclu-
sions, namely that (i) a light wave is a transversal wave and (ii) that the
Historical Introduction 7
3. Optical Spectroscopy
At the middle of the eighteenth century it was discovered, that the
radiation of different flames are not only comprised of a continuous-
intensity spectrum. Some parts of the spectrum were observed to be
more intense than the background spectrum, and showed the well-ex-
pressed character of discrete spectral lines. Systematic studies of optical
effects within absorption spectra were started after the discovery of the
dark lines in the solar spectrum by J .F RAUNHOFER (1787-1826), named
Fraunhofer lines in his honor. The chemical spectral analysis was based
on the idea that all the gaseous chemical elements possess a specific line
spectrum. Since then studies of line spectra have become a part of op-
tics, and large efforts were made to create and to improve the required
optical devices and instruments like prism and grating spectrographs.
During the second half of the nineteenth century, atomic spectroscopy
has become a widely developed experimental science, gathering phe-
nomenological data of principle significance. Nevertheless the classical
electrodynamics of MAXWELL did not explain the emission of these dis-
crete line spectra. An explanation of the spectral peculiarities of optical
radiation was not given until the beginning of the twenteeth century,
when the fundamentals of quantum theory were put forward.
The requirements of experimental atomic spectroscopy stimulated the
creation of more sophisticated spectral devices and new generations of
optical instruments. The first diffraction gratings were used by FRAUN-
HOFER to study the spectrum of the Sun at the beginning of the nine-
8 DEMONSTRATIONAL OPTICS
a b
Figure 1.6. (a) Two plane parallel plates, forming a so-called Fabry-Perot interfer-
ometer , give rise to a set of reflections of an incident ray between their inner surfaces.
The interference of these rays is the reason for appearence of distinct and sharp in-
terference fringes. (b) A fragment of a typical interference pattern after a Fabry -
Perot interferometer.
4. Quantum Optics
Quantum theory was founded mainly on the pioneering results of
M.PLANCK (1858-1947), who resolved the black body problem [6] . Using
a statistical approach to the thermodynamical problem of the equilib-
rium between radiation and matter, he introduced discrete portions of
light energy (quanta) hu (1900).
Historical Introduction 9
radiation substance light quanta substance
0---
0- hv
hv
a b
minimal portion of energy (or the energy quantum) and the frequency v
of a monochromatic wave within the continuous spectrum of radiation.
For a number of years the idea of the energy quantization was doubted
and PLANCK 'S method for the computation of the entropy was regarded
as a special mathematical trick. A new interpretation of the idea of
quanta was provided in the work of EINSTEIN (1879-1955)[7] who ex-
plained in 1905 the light-electric effect (or photoeffect ; the emission of
electrons from a metal surface by interaction with ultraviolet light), as-
suming that such a discrete energy portion lu/ is handled to one electron.
His treatment of radiation based on a statistical approach introduced the
concept of the light quanta. These light-quanta much later were named
as photons. In a way this was a step back from the wave concept to
a particle concept of light and did lead later to a dualistic picture of
light where wave and corpuscular properties are necessary to describe
its nature.
While in PLANCK'S theory the entropy of matter was treated statisti-
cally, EINSTEIN considered the statistical nature of the entropy of radia-
tion itself. Such an approach to electromagnetic radiation led EINSTEIN
to the hypothesis of the quanta of the electromagnetic field. Using the
energy conservation law in the process of the absorption of a photon by
an electron, EINSTEIN has explained a number of photochemical effects,
including the phenomenological features of the photoeffect. Therefore,
the field of quantum optics is built upon the quantum nature of in-
teraction of electromagnetic fields with matter and originates from the
explanation of the peculiarities of black body radiation and the empirical
laws of the photoeffect (Fig.1.7).
/
/----(t---', hv
,,
/
8 ~
/
o o
I
I
I
I I
\
\ ,
I
\
,, /
,
/
/
a b
Figure 1.8. The planetary model of an atom by Rutherford and Bohr (a) : according
with Bohr 's principles, but in contrast to Maxwell 's electrodynamics, every electron
moves along a stationary orbit . An energy quantum hv is emitted during the transi-
tion of the electron from a higher orbit to a lower one .
netic theory, since an electron moving along a closed orbit should emit
electromagnetic waves. The loss of energy needed for radiation should
lead to the electron falling into the positively charged nucleus. In other
words, such a system of charged particles was treated to be unstable.
In 1913 N.BOHR (1885-1962) made an at tempt to explain th e stability
of the hydrogen atom [8] which turned out to be very fruitful. BOHR
accepted the planetary model of atom by RUTHERFORD , but added the
assumption that the electron can run around the nucleus in stable orbits
when it has certain amounts of angular momentum. Being in one of
these orbits the electron should not emit radiation and therefore not
lose energy. The electron emits radiation only when it is undergoing a
transition between two such orbits (Fig.1.8). The energy of the emitted
quantum should be equal to the difference of energies En of these orbits
(n is an integer, numerating the stable electron orbits) : E n+ 1 - En = lu/
(again in contradiction to electromagnetic theory, where the frequency
of the emitted wave should be determined by the rotation frequency
of the orbiting electron) . When absorbing energy a transition of the
electron from a lower orbit to a higher one takes place. Using this
semi-classical model BOHR obtained a formula for the BALMER spectral
series of atomic hydrogen , which was in excellent agreement with the
spectroscopic observations.
In several respects BOHR'S theory caused discomfort. For example,
this model could not explain the intensity relations between the spectral
lines of the BALMER series, and it could not be used for atoms heavier
than hydrogen. Also the extended model of A.SOMMERFELD (1868-
1951) failed in these respects. Nevertheless, the first works of BOHR
12 DEMONSTRATIONAL OPTICS
Let us return to the question of a transport medium for light , or, more
general , electromagnetic waves. To prove the existence of an absolute
aether within which the Earth is moving, MICHELSON constructed a
special type of interferometer and performed his famous experiment in
1887 (together with E.MoRLEY (1838-1923)) in which he showed that
the light velocity does not depend on the direction relative to the ve-
locity of Earth and that it is impossible to detect the "aether wind"
(Fig.1.9). This experiment - together with the transformation equa-
tions of LORENTZ - was the basis on which EINSTEIN could later build
his theory of special relativity. This theory removed completely the con-
cept of aether and the electromagnetic field has received the status of
an independent physical entity.
Let us emphasize once more that all the progress in quantum theory
and relativity theory is based on the development of very precise and so-
phisticated optical instruments, like spectrographs, interferometers and
large telescopes (to test relati vity theory in space) .
Historical Introduction 13
stationary aether
c
K K
-v -v
a b
Figure 1.g. With the assumption of a "stationary aether", the reference K, vibrations
of the aether, caused by the light source S in the unifomly moving laboratory K',
would propagate with different velocities cr and C2 . Therefore the velocity of light
propagation should be dependent on the orientation of velocity v relative to the aether
(a) . According to the special theory of relativity the aether doest not exist : the light
velocity c neither depends on the orientation of the unifomly moving laboratory frame
with respect to another "stationary" reference K nor on a choice between uniformly
moving references .
distance between the beam axes (the interferometer base) the contrast of
the interference pattern decreases (Fig. 1.11). A complete disappearance
of the interference takes place at a certain base length. The angular
dimension of the star can be determined from the base length and the
known light wavelength. MICHELSON introduced a quantitative charac-
teristic of the interference contrast (called the visibility function), which
allows analysis of the form of the radiating surface. Similar visibility
functions were obtained by MICHELSON when studying the effect of the
shape of the emission spectral lines on the interference pattern contrast
in the MICHELSON interferometer. In particular he obtained, that the
spectral profile of the red cadmium line has a Gaussian shape.
Later on, the studies of MICHELSON were the background for theoreti-
cal models of the partial coherence theory. At that time the development
of this theory was provided by the works of P.H.VAN ZITTERT [12] .
a b c
PRINCIPLE PROPERTIES
OF LIGHT WAVES
1. Laws of electromagnetism
1.1 Maxwell's equations
The transversal wave nature of light was confirmed by studies of elec-
tric and magnetic phenomena carried out during the late in nineteenth
cent ury. Furthermore, light was found to be an electromagnetic wave.
These studies led to laws which established the link between the sources
of electric and magnetic fields and these fields itself. J .K .MAXWELL was
able to find his famous set of equations for the properties of the electric
field strength E and the magnetic induction B. Two of these equations
may be written (for propagation of the electromagnetic field in vacuum)
in the following form:
f
s
E dS = ~q
co
(2.1)
fs B dS = 0 (2.2)
17
O. Marchenko et al., Demonstrational Optics
© Kluwer Academic/Plenum Publishers, New York 2003
18 DEMONSTRATIONAL OPTICS
a b
Figure 2.1. The electric field flow through an element dS is the scalar product E ·dS,
where S = oS. 0 is the un it vector of the element dS (a) . The total electric field
flow through the closed surface S surrounding the positive charge +q is propotional
to th e quantity of q (b) .
c
2
f
L
B dl = c~ J
S
j dS
c= 1/JcO/-Lo
where f.LO = 471" . 10- 7 N·A-2.
The flux of vector j on the right hand side of the last equation is
calculated through the open surface S, determined by the closed contour
L. The orientation of the elementary vector dl indicates the direction to
pro ceed along the contour L , as shown in Fig.2.2.
Like moving charges and currents , an alternating electri c field also
causes a magnetic field. This physical result may be generalized by the
PROPERTIES of LIGHT WAVES 19
Be?
a b
Figure 2.2. The magnetic induction B caused by current I (a) illustrates the law of
Biot-Savart-Laplace. The line of the vector B around the closed contour L is caused
by the current distribution j, where the magnitude of B is propotional to the flux of
t he vector j through the area surrounded by contour L (b) .
c
2
f B dl = ~ JE(t) dS + e10 J j dS (2.3)
L S S
In accordance with the electromagnetic induction law of M .FARADAY,
a variation of the magnetic field flux through a certain surface creates
a curling electric field. The following MAXWELL equation is another
mathematical expression of FARADAY 'S law:
f
L
E dl =- ~ J
S
B(t) dS . (2.4)
J
current (see Fig .2.3)
eo~ E dS (2.5)
S
Integral equations (2.1-2.4) are called MAXWELL'S equations for electric
and magnetic fields in vacuum.
Figure 2.3. The magnetic field is Figure 2.4. Illustration of the electro-
caused by changes in the electric field magnetic induction: The electric field
E(t) and the current distribution j around the closed contour L is caused
by changes in the magnetic induction
flow through the contour L.
frame is at rest and (ii) when the reference is in uniform motion. This
statement follows from the special theory of relativity: the laws of elec-
tromagnetism must be similarly represented in a system of reference at
rest and one moving with uniform velocity. In both cases, one has mov-
ing, but not accelerated charges. In the special case of a single charge,
this property leads to the conclusion that a charge at rest or in uniform
motion does not emit electromagnetic radiation. It should be noted that
this statement is completely correct in case of a charge or a system of
charges in vacuum, whereas uniform motion of charges inside a medium
may cause electromagnetic radiation. Such radiation caused by charged
particles moving inside a medium does not contradict MAXWELL 'S elec-
trodynamics. The peculiarities of this kind of radiation will be discussed
in Chapter 4.
The existence of electromagnetic radiation, or electromagnetic waves,
is one of the fundamental consequences of MAXWELL'S electrodynamics.
An electromagnetic wave is considered to be free oscillations of elec-
tric and magnetic fields propagating through space. Let us treat now
the particular case of electromagnetic waves caused by a charge moving
accelerated in vacuum.
We know the electric field of an electric point charge q is radially
symmetric. The electric field strength decreases as the square of the
distance r between the charge position and the observation point:
y t
t
I
~- - - - - -
:--f - - - - - ;
. i
"
,-
" r = c (t-to)
i
,, ,,
I ,
. lq
AB .' .. I ..----...
Figure 2.5. Electric field of a moving Figure 2.6. The tangential compo-
positive charge. nent of the electric field line
v = ar (2.7)
x~v(t-to)
E _ _ 1_!L __1_ q
I - 411"co r 2 - 411"co c2 (t - to)2
Substituting the expressions for Ell and T = via into formula (2.9), we
find the following equation for the tangential component of the field El..
divE = ~p (2.12)
eo
divB = 0 (2.13)
2
c rot
B
= -oE 1.
ot +-J
eo
(2.14)
rotE = -oB
ot- (2.15)
where p is the electric charge density in space and j is the current density
in space.
The elementary flow of an electric or magnetic vector from an in-
finitesimal volume is given by the scalar magnitude of the divergence
(Fig.2.7,a,b). If electric charges are distributed with the density p(x, y, z)
24 DEMONSTRATIONAL OPTICS
s -
//
---
div E
<,
"
B
( -k i
I ~I
--- -
\ /
-, P (X,y,Z) //
.......
b
a
Figure 2.7. Divergence of electric and Figure 2.8. The integration over a
magnetic field lines. A spatial charge finite volume V enclosed by surface
distribution p(x, y, z), existing inside S results in an electric vector flow
a closed surface S, results in a diver- through S . This flow may be consid-
gence of field E from every infinites- ered as caused by the superposition of
imal volume element (a}; closed mag- elementary contributions of divE in-
netic lines result in zero divergence (b) . side the volume V .
•I aE/at it aE/at
I I
I
I "
I
• rot B
~dS I
I I I I
X I I I Sil I u IL
I I I I I
," I
Y I I
I
I
I
I
I
,I
I I
a b
Figure 2.9. The infinitesimal curl element of the magnetic induction B represented
by the vector rotB is caused by an alternating electric field 8E/Ot. The vector rotB
is normal to the element dB (a) . The superposition of all vectors rotB of elementary
curls results in the alternating flow of the electric field throught a finite contour L
(b).
following form:
divE = 0 (2.16)
divB = 0 (2.17)
8E
2rotB
c = at (2.18)
rotE = -8B
-
8t
(2.19)
(\7 . E) = divE = 0 .
The second term of the right-hand side of (2.21), expressed in terms of
the (\7 . \7) operator, takes the form
2 8 2E 8 2E 8 2E
-(\7. \7)E = -\7 E =- 8x 2 - 8y2 - 8z 2
(2.23)
(2.24)
PROPERTIES of LIGHT WAVES 27
( ~&t - c~)
8z
(~+
&t
c~)
8z
/= 0 '
and introduce new variables ~, TJ:
~=t-z/c TJ=t+z/c
so that
Then
and
/ = h(~) + h(TJ) = h (t - ~) + h (t + ~)
For example, let us discuss the case of h = 0, so that / = h(t - z/c).
In this case the field component f varies with time in a plane z = const ;
and for a fixed time t the field component varies depending with z.
The field component / = h(t - z/c) will have the same magnitude if
coordinate z and time t satisfy the condition t - z/c = const, i.e,
z = const + ct .
This equation implies, that if the field has a certain magnitude at time
t = 0 and point z , the field will have the same magnitude at time t over
the whole (x , y)-plane at a distance ct from point z. This plane, always
normal to the direction of propagation, is called the phase plane (see
Fig.2.1O). So, we can say that all components of the electromagnetic
field of the plane wave will have the same magnitude over the entire
phase plane . This plane propagates along the z-axis with velocity of
light c.
In other words, h(t-z/c) is a plane wave, traveling in the positive z-
direction. It is obvious, that h(t+z/c) is another plane wave, traveling
in the negative z-direction with the same velocity c.
28 DEMONSTRATIONAL OPTICS
c
x
az=ctd
Phase plane
Figure 2.10. T wo positions of t he phase plane separated by a time interval tlt for a
plan e wave propagating in th e z- directio n,
where the phase of the cosine function contains a variable term (27f IT)( t-
zl c) and a constant term - the initial phase sp, Two constant vectors
Eo and B o determine the amplitude of the oscillations.
It is customary to use t he frequency v = liT and th e circular fre-
quency w = 27fv = 27f IT. The const ant factor Tc becomes a length
A = T c = cf» that is called the wavelength of the wave. Apart from the
wavelength, the constant k = 27f I A, or k = wi c may also be used. This
magnitude is called the propagation number. Taking into account th e
definitions of frequency , wavelength and propagation number, the phas e
PROPERTIES of LIGHT WAVES 29
Rotating Light
toothed wheel source
I ~
I:_~====;.:
Mirror
-- 7 km - .
y •
Telescope
Figure 2.11. Schematic setup of FIZEAU 'S experiment to measure the velocity of
light .
x IJ
Red 760 - 630 nm (4 - 4.8)·W 4 Hz
Orange 630 - 600 nm (4.0 - 5) .10 14 Hz
Yellow 600 - 570 nm (5 - 5.3).10 14 Hz
Green 570 - 500 nm (5.3 - 6) .10 14 Hz
Light blue 500 - 450 nm (6 - 6.7).10 14 Hz
Blue 450 - 430 nm (6.7 - 7) .10 14 Hz
Violet 430 - 400 nm (7 - 7.5).10 14 Hz
2 8E 8B
c rotB = tit ' rotE = -tit (2.27)
From the first equation one can write the z-coordinate of vector rotB
as follows:
8By 8Bx
(rotB )z = 8x - By = 0 ,
wt - k R+cp .
Objective
~
Condenser lens
Hg lamp
:= ~-~ --~
Pinhole
__- _
-
- -~
- ------ ... :
V
Figure J2.14. Plane wave production using a bright light source . A pinhole illumi-
nated by the light of a H9 lamp through a condenser lens acts as a point source .
The lens is used to form a light beam with a plane wave front . A beam with nearly
constant diameter is produced .
to the beam and displaced along the axis of the beam. If the image or the
white spot size on the screen does not change while moving the screen ,
we say that the light beam is parallel and represents a plane wave. But
upon closer inspection of the spot on the screen produced by the mercury
lamp light , one would notice that the white central spot is surrounded
by colored rings. Such an effect implies that different monochromatic
components are present in the beam , which produce colored neighboring
rings of t he image. So one can assume that the white light from t he
mercury lamp consists of a set of monochromatic components.
~\
.> \
~~ \
~ \
Condenser lens
Hg lamp
Pinhole
Figure 2.15. A light wave with a spherical wave front from a bright source. Here a
pinhole plays the role of the source .
Figure 2.16. Spatial distribution of the electromagnetic field for a plane monochro-
matic wave propagating in the positive direct ion of the z-axis at a fixed moment t .
The wave is linearly polarized in the direction of the x-axis.
y y
E
Figu re 2.18. Oscillations of the electric field of a wave composed by two monochro-
matic waves of equal amplitude and wit h a small difference between their frequenci es.
The slowly varying function A(t ), shown by the dashed line, is the envelope of th e
resul ting oscillat ion .
the projection of vector E and the z axis. For this reason t he linear
polarization is sometimes called plane polarization.
In the case of circular polarization of a plane monochromat ic wave, t he
elect ric field vector can be characterized by two orthogonal projections
on t he (x, y)-plane, both having the same ampli tude of oscillation , but a
phase difference of tt /2. Th e to p of th e vector E rotates with a const ant
velocity on a circle in the project ion plane. Its trace in space is on a
screwed line. An example is shown in Fig.2.17,b.
3. Quasi-monochromatic waves
In reality, light emitted by a particular light source may be represented
as a mixture of monochromatic waves having different frequen cies, but
it can hardly be treated as a monochromatic wave. Nevertheless, such
light can be regarded as consisting of different spectral components,
each distribut ed within a narrow rang e of wavelengths. Such a type of
light radiation is often called quasi-monochromatic. In practice, we can
generate quasi-monochromatic light by means of different types of optical
filters and other devices which provide a frequenc y selecti on within a
narrow band of wavelengths.
PROPERTIES of LIGHT WAVES 37
a b
2+1/1]
2Eo cos [ 271' 1/2 -2 1/1t ] cos [1/
271' 2 t
When the difference between 1/2 and 1/1 is small, the resulting oscillations
look like a monochromatic wave with a slowly varying amplitude:
E(t) = 2Eo cos [271'~l/tl ' cos [271'I/ot] = A(t) cos [271'I/ot]
where A(t) = 2Eo cos [271'~l/t] , and I/o = (1/2 + 1/1)/2 is the mean fre-
quency or carrierfrequency. The frequency difference between each orig-
inal wave and the carrier is given by ~I/ = 1(1/2 -l/d/21 . The amplitude
A(t) , a slow function of time, is the envelope of the high frequency func-
tion E(t). The resulting oscillations of the electric field E(t) are shown
in Fig.2.18.
For the common case of a quasi-monochromatic oscillation A(t) is
a time dependent amplitude and «p(t) is a phase, varying slowly in
time with respect to the period T of the fast oscillations of the field
at carrier frequency I/o (T = 1/1/0). The oscillations of such a quasi-
monochromatic wave at any point may be represented by the expression
This function has its maximum at the carrier frequency Vo (Fig .2.19,b).
In principle the carrier frequency Vo and the functions describing the
amplitude A(t) and the phase ~(t) can have an arbitrary form. But in
all cases, a mathematical procedure exists , which restricts the arbitrari-
ness by expanding the dependence E(t) in terms of the monochromatic
oscillations. E(t) can be represented by a FOURIER integral of a complex
exponential function of the frequency, calculated from -00 to 00:
J
00
where £(v) is a complex function called the spectrum of E(t) [43] . The
spectrum £(v) results from the FOURIER spectrum of the function E(t):
J
00
where £(v) exp( -i27Tvt) is the contribution to E(t) by the spectral com-
ponent at frequency t/ ,
Mathematically, the left-hand part of eq. (2.36) is the FOURIER int e-
gral of the function £(v). But if v is a frequency, what about negative
PROPERTIES of LIGHT WA VES 39
IE(v)1 IE(v)1
~_1_11
-v -v
2 I
o o
a b
Figure 2.20. The total spectrum of oscillations caused by the superposition of two
monochromatic oscillations of frequencies Ill, 112 (112 - III « (Ill + 112)/2) (a) . The
spectrum of positive frequencies associated with cosine expansion of the original func-
tion of time (b) . Spectral components of the spectrum used in the cosine expansion
are twice as high as that of the full spectrum .
Here the first value £(_)(v) is defined for v ~ 0 and the second £(+)(v)
is defined for v 2: O. Now we shall reduce the sum (2.39) to a function
of the positive frequency v only.
If E(r)(t) is a real function, the sum in (2.39) will be real under the
condition
(2.40)
where both values £(+) (v) and £(-) ( -v) are defined for positive t/, Now,
defining £(+)(v) = 1£(v)1 exp(icp) and using (2.40) for £(_)(v) in (2.39),
we obtain £(_)(v) = 1£(v)1 exp(-icp).
40 DEMONSTRATIONAL OPTICS
J
00
J J
o 00
where £(_)(v) is defined for v:::; 0, and £(+)(v) for v 2 O. Then, for the
given lvi, we always find two values £(_)(v) and £(+)(v), which satisfy
condition (2.40) that the original function E(r)(t) is real. Summation
of each pair of similar values results in expression (2.41), so that a su-
perposition of monochromatic waves given by the integral (2.36) may be
reduced to a summation of the positive frequencies only. Therefore, for
a physical field E(r) we may write:
J
00
J
00
It is obvious that
E(r)(t) = Re{E(t)} ,
and
E(t)E*(t) = A2(t) (2.45)
The complex signal provides simpler mathematical transformations than
the physical field.
coE 2 + coc2 B 2
w=----- (2.46)
2
where w is the quantity of the electromagnetic energy contained within
a unit of volume.
42 DEMONSTRATIONAL OPTICS
~f
Figure 2.21 Assuming
a plane monochromatic
s ~. z
wave, the energy density
inside the volume La is
constant.
bodyof photodetector
ITI 0 photoelectron 0
00 --0 0
1\fV\fv~ 0 0
0 0
0
coupled electrons
I. ~t .1
Figure 2.22. The release of a photoelectron takes a finite time interval l:1t, which is
much longer than the period T of the oscillations of the incident light wave. Each
coupled electron in the matter of the photodetector could receive the energy portion
hu, where h is Plank's constant and II is the light frequency. Photoelectrons released
from the photodetector's body by incident light give rise to the photocurrent of the
detector.
J
Dot
S dt = ccoEo
2 J
Dot
2
cos (271"vt) dt
E5 (
= ccoTtlt Sin(471"Vtlt))
1 + 471"vtlt
o 0
According to our assumption tlt > T = 1/v, or tltv > 1, the value of
the phase in the second term inside the parenthesis is much larger than
271". Hence,
Isin(471"vtlt)I j(471"vtlt) « 1
44 DEMONSTRATIONAL OPTICS
--
A(l)
pholodeleclor
° 0 °0 'J 1(1)-41)
""
o
o 0 o 0
00
o 0
0 00
o
0
~~.
-
0 0 0 00 I
E(l) 0° 0° o 0
o
o 0
pholocurrenl
1= c£oEE*/2 (2.50)
Therefore, the intensity is the square of the envelope of the rapid oscil-
lations at the optical carrier frequency vo.
Let us discuss a photodetection process caused by a light wave, where
oscillations of the electric vector occur according to expression (2.34)
PROPERTIES of LIGHT WAVES 45
(Fig.2.23). The light wave penetrating into the material of the pho-
todetector gives rise to an absorption of light energy by the electrons of
photosensitive atoms. A number of these electrons is no longer coupled
to the atoms and give rise to a photocurrent.
As we have discussed, the creation of a single photoelectron takes the
finite time interval !:i.t. The same duration may be taken as a time inter-
val which provides a smoothing of the photocurrent, consisting of a huge
amount of photoelectrons. It means that variations of the photocurrent
in time i(t) follow the time dependency of the light wave intensity. In
turn, according to the expressions (2.34) and (2.51) we may write
i(t) A 2(t ) coS2[7r(Vl - V2)t]
roJ roJ
It is clear, this time dependence oscillates slower than the electric field
of the incident wave, which oscillates at the carrier frequency Vo = (VI +
v2)/2. No electronic device is able to resolve the carrier frequency itself.
Example 1.
It is well known that the mean flux energy density of the Sun radiation
near the Earth's surface is equal to (8) = 1.4 kW/m 2 . Let us estimate
the amplitude of the electric field strength of a light wave arriving the
Earth surface, assuming a quasi-monochromatic radiation.
The mean flux energy density (8) is given by (8) = !ceoE6. Substi-
tution of numerical quantities leads to Eo ~ 1 kV/m.
Example 2.
Let two waves of identical frequency propagate along the same direc-
tion. If these waves have mutually orthogonal linear polarization states
the flux energy density will be equal to the sum of the flux energy den-
sities 8 1 + 82 for every wave.
Actually, the flux energy density may be represented in terms of
the scalar product of electric field strengths 8 = !ceo(E. E). The
electric field vector E of the resulting wave is the sum of two mutu-
ally orthogonal vectors Ex and E y : E = Ex + E y , therefore one gets:
(E· E) = (Ex + Ey)·(E x + Ey) = (Ex'Ex) + (Ey·E y) for the scalar prod-
uct (E · E). Hence, the flux energy density for the resulting field should
be equal to the sum of the individual flux energy densities.
Example 3.
Let us find the spectral distribution for a wave train given for a short
time interval as a cosine function Eo cos(27rvot) . This wave train exists
for a limited time interval 7, so that t satisfies the following inequalities:
-7/2 < t < 7/2. At times t < -7/2 and t > 7/2 the field strength is
zero (Eo = 0).
46 DEMONSTRATIONAL OPTICS
-f
E jr/2
-r/2
[exp(i271"vot ) + exp( - i271"vot)] exp(i271"vt)dt =
K'
I
I
I
I
I
I
1 i
A B c
Figure 2.24- A source S emits N complete oscillations in the moving reference frame
K' over the distance AG. The same number of complete oscillations is emitted in the
rest reference frame K . The length of this wave train in the rest reference frame is
BG .
BC AC-AB v
== =1-- (2.52)
AC AC c
According to the main principle of the special theory of relativity the
wave train runs with the velocity c in both references, hence the dura-
tions of the wave train are different, and we express their ratio (2.52)
as
At _ BC -1 V
At' - AC - --;;
If the frequencies of the emitted light are denoted by Vo and VI in K'
and K references respectively, where Vo = N I At' and VI = N I At, then
a relation between Vo and VI will occur:
VI = vo/(l- vic)
With the assumptions v « c and v > 0, the previous expression results
in a positive difference between the frequencies Av = VI - Vo in the form:
v
Av = VI - Vo = Vo-c (2.53)
He-Ne
laser
/ J:.
Photo Telescopic Retro-rellecting
Dr-
detector sytem prism
Diaphraqrn
Mirror
:--==t=
eam splitter -
v
Figure 2.25. A setup for observing the Doppler effeet by means of a moving retrore-
fleeting prism.
4.3 Monochromaticity
The model of a plane monochromatic wave is the first approximation
of a real light wave. Nevertheless, all the features of light waves, such
as frequency, wavelength, polarization state and the shape of the wave
front, may be understood through this model. In turn, the concept of a
quasi-monochromatic wave is closer to the radiation from real sources ,
since it deals with wave trains of finite durations. One may treat such
wave trains being composed of monochromatic waves, the frequencies of
which are distributed within a finite interval.
Another reason why the quasi-monochromatic approximation is often
used is that optics, as any other physical science, must be based on
50 DEMONSTRATIONAL OPTICS
(2.56)
The simplest optical device of this kind is the human eye. The ability
of the human eye to distinguish colors is based on the difference of the
eye's sensitivity to different wavelengths of visible light. Colored glasses
and optical color filters for a camera are also examples of such simple
devices.
We should note that the function f(!:i.v) was discussed based on the
argument !:i.v, rather than on the frequency v (Fig.2.27). Since we deal
with quasi-monochromatic waves, it is more convenient to represent the
spectral dependency of the intensity in terms of the frequency difference
to emphasize the fact that the time-dependency of an envelope of the
original oscillations of an electromagnetic field has to be a slow function
of time.
In practice such spectral dependencies are often used, for example,
in optical spectroscopy when studying shapes of spectral lines. A lot
of methods for selecting very narrow regions of the visible spectrum are
available. The frequency value corresponding to the maximum intensity
in the spectral distribution is usually recognized as the carrier frequency.
For a given value of Vo the dependency I(!:i.v) can be investigated by
measuring the line shape. There are a large number of spectral lines,
which can be described by special functions known as the Lorentzian
and Gaussian curves. These functions are also even with respect to the
central frequency, so the concept of the carrier frequency may be used
in these cases as well.
52 DEMONSTRATIONAL OPTICS
1(L\v)
o L\v
Figure 2.27. Illustrating the principle of optical filtration . The dependency of the
light intensity I(~/I) of a thermal source (the curve presented by the thin line) and
the spectral characteristic of an optical filter, presented by the thick line. The total
intensity selected by the optical filter is proportional to the area below its transparency
function . /10 specifies the carrier frequency of a quasi-monochromatic wave selected by
the filter . The spectral region, which may be associated with such a wave, is denoted
by ~/lo .
Another method for selecting a required spectral region with well know
spectral characteristic is the use of so called interference filters . An
optical interference filter is characterized by a spectral curve of a well
known shape. This symmetric curve has a sharp maximum at a certain
frequency that may be taken as the carrier frequency. We should note
that applying the concept of quasi-monochromatic waves is more correct
than the monochromatic concept, even when the spectral distribution
does not have a symmetrical shape.
=k; .-~ - - - --
fA,
'II
\ X/~---'"
:-'
a b
Figure 2.28. The states of two beams of natural light are identical until a certain
polarizing device is inserted into the beam . In t he first state, the original beam it is
assumed to be composed of two mutually orthogonal linearl y polar ized beams of the
same intensity (a) . In the second state the same original beam is assumed to be a
composition of two beams of circularly polarized light : one is left and another is right
cicularly polarized (b) .
of wave trains where each train is emitted randomly with respect to the
others. Therefore no certain time interval and no polarization state can
be assigned to natural light.
Nevertheless , it is possible to introduce some order or correlation into
the chaotic radiation of a thermal source by using some passive methods.
For example, if an optical filter is inserted into a beam of natural light
t hen most of the wave trains passing t hrough the optical filter will have
approximately the same duration, which can be roughly estimated by
(see (2.55)):
T rv l/llv
where tlv is the effective spectral region of the optical filter. The symbol
for the duration T is used to point out the fact that we are dealing with
a huge amount of practically identical wave trains, instead of a single
train considered in the previous example. In other words, this symbol
is usually used to specify a mean or average magnitude, since, in fact ,
T plays the role of the mean value of the wave train duration obtained
after inserting the optical filter.
It is typical for natural light that the observer is designing the polar-
ization state. A beam of natural light does not have a certain polariza-
tion state due to the chaotic nature of the radiating processes. Neverthe-
less, we can prepare a light beam with a desired polarization, using an
appropriate polarization device. Here we can assume that the original
beam of natural light consists of two beams with mutually orthogonal
polarization. Every beam has the same magnitude of the intensity, so
that the total intensity is equal to their sum. In 'other words, for linear
polarization state we assume two beams of intensity of 10/2, polarized in
two orthogonal directions, propagating along the same direction, where
54 DEMONSTRATIONAL OPTICS
p = -qx , (2.58)
where -q is the electron charge and the radius vector x is drawn from the
origin of the reference frame to the negative charge. The electric dipole
moment of the displaced charge is subjected to harmonic oscillations as
PROPERTIES of LIGHT WAVES 55
x r
Figure 2.29. Stimulation of electric dipole radiation by the electric field E of a light
wave. The dipole moment p of the electron -q is along the direction of E. The
radiation intensity is dependent on the azimuthal angle (J and on the distance r .
a function of time t' occuring with light frequency w. Let us find the
second derivative of the dipole moment vector with respect to t', using
(2.57) and (2.58):
2
d p(t' ) = _q~x(t') = -qa(t') = qx w2 cos(wt') (2.59)
dt,2 dt,2 0
where a(t') is the electron acceleration and Ixol is the amplitude of elec-
tron's oscillations.
Substituting a(t') from eq.(2.59) to (2.10), we obtain the following
relationship for the radiation field of the harmonic oscillator:
Let us find an expression for the radiation intensity of the harmonic os-
cillator as a flux of the mean radiation energy density. Following (2.50),
the density of the energy flux is expressed by the quantity
S = CEoE2
Substituting expression (2.60) for the electric dipole field, we find
J
00
z
B
Figure 2.31. The Lorentz force initiated by a linearly polarized plane monochromatic
wave. The velocity v of the negative elementary charge -q is anti parallel to the
electric vector E of the wave; the Lorentz force is in the same direction as the unit
vector n of wave propagation.
interacts with the electrical charges of the medium and gives rise to a
force applied to the elementary free electron charge -q:
F = -qE-q[v x B] (2.63)
where v is charge velocity. The second term is the LORENTZ force that
may be transformed by expressing the magnetic field B in terms of
the electric field strength E. We introduce a unit vector n along the
direction of propagation of the light wave (Fig.2.31). Then we may
write B = [n x EJle and the expression for the LORENTZ force F L takes
the form
FL= -q"!.. x [n x E] = - ~ n(v. E) (2.64)
e e
In transforming the triple vector product we have assumed that vector
v is anti-parallel to vector E of the electric field, causing motion of the
elementary charges. The product of the force -qE and the velocity v
is equal to the power N delivered by the electromagnetic wave to the
charge. Hence, it is possible to write
(2.65)
= = I
a b
Figure 2.32. Light pressure force caused by reflection by a mirror (a), and through
scattering by a small sphere (b) .
N
Fl p = K,n-
c
(2.67)
/
Laser beam \
Lens Water /
Lens Small scattenng Trace of laser
sphere beam
a b
Figure 2.33. A glass cell to demonstrate the light pressure effect (a); a small sphere
moving when the laser beam is on (b).
where V is the volume of the sphere , which for the above parameters
may be estimated as V = 7rd3/6 ~ 500.x 3 . Now, for X = 550 nm one gets
a ~ 4 .10 4 m/s 2 • If the acceleration of gravity 9 ~ 10 mis, so a ~ 4 ·10 3g.
In the demonstrational experiment that we shall consider now, very
small spheres of transparent plastic material, suspended in water, which
is used to overcome the difficulties associated with the heating of the
spheres (by absorbing laser light to a small extent) and thermal forces.
The preferable mean radius of these spheres is about if = 50 nm and
the radiation of an argon ion laser in the green range of the spectrum
(.x = 514 nm with a power of N ~ 1 W) is used. The experimental
setup for observation of the light pressure effect is shown in Fig.2.33,a.
The laser beam passing along the horizontal direction falls on a short-
60 DEMONSTRATIONAL OPTICS
focus lens (f = 0.5 cm) mounted on the vertical side of a glass cell.
The trajectory of the light beam can be seen due to light scattering in
the water . As there are only a small number of the particles at the
scattering centers , the water will appear dark. However, if a scattering
particle passes close to the focus of the lens, a bright green asterisk is
observed which moves along the horizontal direction.(Fig.2.33,b) The
speed and the brightness decrease when it moves away from the focus.
Of course a friction force works against the accelleration due to the light
force, so the particles are moving slowly in the water.
The kinetics of these suspended particles is quite complicated since
there are many contributing physical effects. For example , the linearly
polarized laser beam induces an electric dipole on a sphere . For a con-
dition if :::::; O.lA, the angular distribution of the emitted intensity of
this dipole will be symmetric with respect to the axis of the dipole.
(Fig.2.32,b). Such symmetrical angular distribution also implies that
the net momentum of the scattering light is equal to zero, hence the
change of light momentum caused by scattering by a particle is equal to
the light momentum falling on the particle per unit time . Therefore the
factor K, in (2.67) is equal to unity.
Owing to the small difference between the optical density of the
spheres and the water , the quantity of pressure force is sufficiently small .
Apart from light pressure, the moving particle undergoes a flow resis-
t ance by water, which is described by STOKES ' law. Therefore the mo-
tion of the particle is slow enough for observation. Here we shall not
give the calculations of these forces, but it should noted that a thorough
examination shows that the two forces - the light pressure and the
STOKES force - give a sufficient explanation of the observed velocity of
the particles without including forces induced by thermal gradients. A
particle experiencing light pressure is shown in Fig.2.33,b .
Eph = hu .
PPh = lik ,
F = -bope . (2.69)
L=±wjw (2.70)
SUMMARY
k z
Figure 2.35.
PROBLEMS
2.4 It is known that our eyes are sensitive to a very weak flux of visible
light under conditions of full darkness near the peak of the spectral
64 DEMONSTRATIONAL OPTICS
V .~· S
Figur e 2.36.
sensitivity of the eye, which lies in the blue-green region of the spectrum
(.A = 550 nm). Normal observers require only approximately 50 quanta
to see a light flash, provided that such weak flashes from a light source
follow periodically in intervals of TO :::::: 5 s needed to reach threshold
visibility. It should be taken into account that only about 20% of the
quanta incident upon the sensitive cells of the eye will be absorbed, on
other words the effective quantum efficiency q of an eye is about q = 0.2.
In the experimental setup shown in Fig.2.36 a weak flux from a source
S passes through the lens L , the optical filter F and then the parallel
beam of blue-green rays falls on a slit of dimensions a = 2 mm times
b = 5 mm. A rotating disc, placed behind the slit , having an orifice of
th e same size, opens periodically the hole, and periodical flushes may be
observed by the eye. The distance R between the centers of the wheel
and the second slit is equal to R = 10 em. Estimate the intensity of
light necessary to permit threshold visibility when penetrating into the
eye.
n ~
_·_·_·-6~:'~~·_·_·~ -. ~.
Figure 2.37.
Figure 2.38.
t ~F:=
1 =~=====J Ph 011
D
Figure 2.39.
A = 480 nm, its power N = 10 kW, and the effective cross-section of the
light beam S = 5 mm 2 .
1-4-----C------.l
Figure 2.40.
SOLUTIONS
the volume 1rid?- /4, thus the energy density may be estimated to be
w ~ E/(1rld?- /4) ~ 2.5.10 5 J/m 3 .
a a aTo
T > - = - = --
- v OR 21rR
For the given interval T and slit's area ab, light energy U = I abr passes
through the opened slit:
a2bTo
U = Iobr = I 21rR
Because only 20% of quanta are absorbed by the sensitive cells, the
threshold amount of energy 50hv is approximately equal to the quantity
qU:
a2bTo
50hv=qI--
21rR
Finally, for the intensity we can write the following:
1= iu/ 501rR
qa2bTo
Substitution of the numerical values gives the following estimation for
the intensity: 1= 10-10 W/m 2 . As we have seen in Example 1, the the
mean flux energy density of the Sun radiation near the Earth's surface
is equal to 1.4 . 103 W/m 2 . This means that our eyes are sensitive to
light intensity over an enormous range of intensities (about 1013 ) .
I c I
Figure 2.41.
Due to total reflection by disk 's surface , the change in t he light mo-
mentum per unit time is equal to -2~P. In turn, the change in the
momentum of the disk per unit time is equal to 2~P . Hence, the light
pressure force acting upon the disk is:
F = 2~P = 2nCl7/ik .
Since this force is directed normal to the disk surface we can introduce
a unit vector I directed along k then write the following expression for
the light force:
F = 2l7cn/ikl .
We substitute tu»/ c for /ik in the expression for light force:
F = 217nfiw I .
The magnitude nfiw is th e energy density of the light beam that , with
w = ntu», gives the formula:
F = 217wl .
Now, since the power of the beam N = Swc, then substituting the last
expression in the formula obtained above gives:
PROPERTIES of LIGHT WAVES 69
+L
-L
Figure 2.42.
2.6. Since the cross section of the laser beam is less than that of the
mica plate the total energy flux passes through the plate. The polariza-
tion state of the light pulse becomes circular as it propagates through
the plate.Let the direction of circular polarization of the pulse become
that shown in Fig.2.42 and the total angular momentum of the pulse be
L. After passing of the light pulse through the mica plate, the plate has
got the angular momentum -L.
In order to calculate the magnitude of L we represent the light pulse
in terms of a flux of light quanta, every quantum carries the angular
momentum h. It is our task to calculate the total number of quanta Q
contained within the light pulse. We use the expression for the power
of the light beam that was obtained in the previous problem 2.5: N =
Srduac, where the power N represents the light energy of the beam,
passing through the fixed area S per unit time. Hence, the total light
energy of a light pulse of duration T is equal to the following magnitude:
NT, whereas the total amount of light quanta is Q = NT/IUv. Therefore,
the net angular momentum of the transmitted light pulse is lNT /w, and
the mica plate receives the following angular momentum:
L= _INT
W
_ 1T fL
n=-=-
q qc
The intensity I behind the diaphragm D, represented in terms of flux of
light quanta, is
1= 1tiJ..J
q1rr2
Substitution of the numerical magnitudes gives the intensity I :::::: 8.5 .
1O-7 Jj (m2s) and the mean number of quanta is only n:::::: 8 .10- 4 •
Chapter 3
LIGHT POLARIZATION
71
O. Marchenko et al., Demonstrational Optics
© Kluwer Academic/Plenum Publishers, New York 2003
72 DEMONSTRATIONAL OPTICS
are possible, but some optical experiments showed that light waves have
a transversal character.
MAXWELL 'S electromagnetic theory provided an adequate explana-
tion of the transversal nature of light waves. Despite discrepancies with
the mechanic model of aether, physicists began to treat light as electro-
magnetic waves in the visible region of the spectrum. Later, based on
the principles of EINSTEIN'S theory of relativity, which had been per-
fectly corroborated by a number of basic optical experiments such as
MICHELSON'S experiments and observation of star aberration, the prob-
lem of the aether was finally solved. The fundamental ability of a varying
electromagnetic field to reproduce itself, which produces a running elec-
tromagnetic wave, propagating even in vacuum, allowed a dismissal of
the concept of aether as a special light transport medium.
Adoption of the electromagnetic nature of light initiated new studies
concerning interaction between light and matter. Practically all optical
phenomena known at the beginning of the XIX century, had been ex-
plained by LORENTZ 's electron theory, based on the electron structure
of matter. Interaction of a light wave with matter was explained as an
effect of the electromagnetic field on the bounded atomic electrons. All
approaches of the classical theory of optical phenomena were founded
on the concept of a harmonic oscillator affected by the field of a mono-
chromatic alternating electromagnetic field.
Basic types of light wave polarization, ways of polarizing light from
a natural light beam and a number of polarization phenomena arising
when the light waves interacts with dielectric bodies will be discussed in
this chapter.
E
y
where Eox = al exp( -i!pI ) and Eoy = a2 exp ( -i!p2 ) are the complex
amplitudes; al and a2 are the real amplitudes; !PI and !P2 are the initial
phases.
The ratio of the complex components Eox, EOy is given by the following
expression:
EOy = a2 exp( -ic5) (3.2)
Eox ci
where 15 = !P2- !PI is the phase difference between the orthogonal os-
cillations. Depending on the value of 15, different polarization states
may be realized. For example, electric field oscillations at point z = 0,
represented in terms of real functions, may by written as follows:
/00 Cj",Cj 00 /
/) = 0 0< /) < ItI2 Ii = 7tI:! 7tl2 < /) < It /) = -It -It < 0 < -It/2 0 = -na -7tl2 < 0 < 0 0=0
Figure 3.2. The polarization states depend on the phas e difference 6 for al = a2 .
L x
Figure 3.3. Different cases of the superposition of two waves of orthogonal linear
polarizations, depending on the wave amplitudes (8 0). =
Light Polarization 75
where the superscript " R" specifies right circular polarization . Due to
th e superposition of th ese four waves a linear polarization st ate will
occur. Resulting oscillations of such a superposition have the following
two orthogonal components Ex(O, t ), Ey(O , t ) :
L 'P/2 = 0
x
Figure 3.5. Oscillations of a linearl y pol arized wave cau sed by superpositio n of two
circular orthogonal polariza tion stat es of equa l amplit udes a nd th e ph ase differen ce
<p .
76 DEMONSTRATIONAL OPTICS
Figure 3.6. For an elliptically polari zed wave, the ellipse may be described by two
pro jections of the electric vect or a l , a 2 at a given magnitude of the phase difference 8.
Another repr esent ation of the ellipse may be perfor med in t erm s of its major semiax is
a and minor semiaxis b and t he an gle Xi tan X = =r- bja.
as follows from (3.4) and (3.5). The fact th at th e resulting oscillation rep-
resents a linear polariz ation state follows from t he link between Ex(O, t)
and E y(O, t) :
(3.7)
which is valid for any inst ant in t ime and for any point of observatio n.
Examples of resulting oscillat ions are shown in Fig.3.5 (in case of <p = 7f,
when cos(<p / 2) = 0, t hen Ex = 0).
determined by cj and a2 :
(3.8)
Figure 3.7. Spherical waves emitted by a circularly moving charge. These waves
have different polarization states, depending on the direction of observation.
rotator's center, so that the vector S lies in the plane x , Z, then the am-
plitude of E y will not depend on B, whereas for Ex one gets Ex rv cos B.
Hence, the superimposed wave will have an elliptical polarization.
In the upper semi-sphere one will observe left elliptical polarization
due to the given motion direction of the charge. This polarization state
is reduced to circular polarization at the pole point P where both am-
plitudes of the waves Ex, E y have the same value. At the opposite pole
of the sphere right circular polarization will be observed. For equatorial
points the amplitude of one of the waves emitted by the linear oscilla-
tors is always equal to zero which leads to linear 'polarization within the
plane x, y.
P,
He - Ne laser
Analyzer Polarizer Analyzer
Figure 3.8. The given direction of the Figure 3.9. Linearly polarized light
polarization plane of a He-Ne laser Pi caused by the polarizer Pi is tested by
may be used to determine the principle the analyzer P2 . The intensity of the
dire ction of an analyzer P2 . transmitted light is dependent on the
prin ciple directions Pi an d P2.
Figure 3.10 . Natural light passed Figure 3.11. When natural light
through an ideal polarizer and an ana- passes through a real (imperfect)
lyzer , the latter rotated by an angle a polarizer, one can find two orthogonal
with respect to the polarizer. polarized components of intensities
III and Is: in the outgoing beam . l s:
is much smaller, depending on the
quality of the polarizer.
polarized light can not exceed one half of the intensity of the natural light
beam incident on the polarizer. In real cases the intensity of the linearly
polarized beam is less that one half of the incident beam intensity due
to various light energy losses due to the polarizer. Moreover, a small
component of the light beam polarized orthogonally to the principle
direction might also exist in the outgoing light beam (Fig .3.11). So,
in a light beam of natural light passing through a real polarizer two
orthogonal components will always exist . One says the light beam is
characterized by a non-zero value of its degree of polarization P, and
this beam has a partial polarization, in contrast to totally polarized
monochromatic light.
The degree of polarization P mentioned above results from measure-
ments of the transformation of natural light by a polarizer. It is deter-
mined in terms of light intensities polarized in two orthogonal directions
t, and III :
P = III - h (3.12)
III+h
where III corresponds to the principle direction of the polarizer. For an
ideal polarizer this quantity is equal to unity but for real devices it is
always less than one.
II~a) = 0.95 Ilr) cos2 30° + 0.95 if) sin 230°::::::! 0.34 10
a
Ii ) = 0.01 It) sin 2 30° + 0.01 i:t> cos2 30° : : : ! 0.0012 10
82 DEMONSTRATIONAL OPTICS
= a 1 + a2
2 2
80
2 2
81 = a 1 - a2 (3.13)
82 = 2aIa2 cos {j ,
83 = 2a1a2 sin {j ,
where 8 = <PI - <P2 is the phase difference between two orthogonal com-
ponents of the electric vector having amplitudes a1 and a2. It can be seen
that three independent magnitudes a1, a2 and 8 are needed to express
the four STOKES parameters. Only three among them are independent.
It is easy to verify the following equality:
(3.14)
phase difference ¢ between the two orthogonal electric field vectors. For
example, a quarter-wave plate is one frequently used retardation device ,
the operating principle of which will be examined in Chapter 5.
Another representation of the polarization state based on a vector
and matrix formalism was introduced by R .C.JONES. In this method
an electric amplitude vector E is written as:
where Eox , Eoy are the real amplitudes and CPx , cpy the phases of the x-
and y-components of the electric field vector. With such notation, for
example, right-handed polarization may be represented by the vector
E = [ Eox exp(icpx) ]
R Eox exp( icpx - in/2)
where EOy = Eox , and cpy = CPx - n /2. It is convenient to normalize this
vector by dividing each element by th e value of the length of the vector
ER . The amplitudes then have t he value Eox = Eoy = ER/V2, hence
after factoring this electric vector will have the form:
where the factor ER is omitted and exp( -in /2) = -i. In a similar way
one can write an expression for left-handed polarization:
Figure 3.12 A pa r-
tially polar ized quasi -
monochromatic beam
passes throu gh a re-
z tardat ion plate , which
ca uses a retardat ion 1J for
orthogonal components of
th e elect ric vect or, and
then through a polarizer.
The pr inciple direction of
Retardation Polarizer the polar izer P makes an
plate
an gle (J wit h t he z--axis.
Similar expressions may be found for all pairs of ort hogonal polariz ation
states.
where Ax, <I>x and A y, <I>y are slowly varying functions of t ime with
respect to t he carrier frequency v (see.(2.35)) . With the phase difference
Light Polarization 85
80 = (A;) + (A~ ,) , .
81 = (A;) - (A~)
82 = 2 (AxA y cos 8) (3.16)
83 = 2 (AxA y sin 8)
where 8 is a slowly varying function of time, too . Here we assume
that the interval for the measurement (for the averaging procedure) is
much longer than the effective time for any change of the functions A;
and A~, so that the mean magnitudes (A;) , (A~,), (AxA y cos 8) and
(AxA y sin 8) can be treated as constants.
Let us consider how the STOKES parameters may be determined by
means of measurements of intensity. In order to simplify the calcula-
tions of intensities we introduce complex functions associated with the
components of the electric vector (see (2.44)):
the intensity 1(0, ¢) at several values of the angle 0 and the retardation
¢. For example, the following set 1(0,0), I(rr/2,0), I(rr/4,0), I(3rr/4,0),
I(rr/4,rr/2), I(3rr/4,rr/2) results in the following STOKES parameters:
(3.23)
+
'A
direction of the polarizer is horizon tal, that is parallel to the line of sight .
The scattered light brightness is maximal for vertical orientation of the
polarizer.
Rot ating an analyzer in front of th e eye of th e observer shows that the
scattered light is linearly polarized (Fig.3.13), even when natural light
passes through th e tube (polarizer removed) . Maximal scattered inten-
sity also corresponds to a vertical orientation of the anal yzer , whereas
its minimum corr esponds to the horizontal orient ation.
An explanation of these observations is easily possible using the basic
prop erties of scattered dipole radiation which are described by eq. (2.62)
(see Chapter 2, 2.4). Let Ix and I y (Ix = I y) be the intensities of two
polarization states of the incoming light along th e x- and y-axes of a
Cartesian system, as shown in Fig.3.14. The cha rges of th e micropar-
tides begin to oscillat e in both vertical and horizontal directions, but
only th e vertical oscillations (in th e direction of the y-axis) lead to the
emission of linearly polarized light in the direction of the observer at
point A. For the int ensity of light scattered wit hin the (x, z)-plane the
dependency I(B) rv (Iy + Ix sin 2 B) is valid, where B is th e angle between
t he x- axis and the direction of observat ion (Fig.3.14). In particular ,
B = 0 corresponds to zero and B = 7r / 2 to maximal scattered dipole
radiat ion. Since th e radi ation ente ring th e eye of the observer at point
A is polarized in th e y-direction, a polarizer in front of th e eye causes
also a dependence of the intensity on rotation of th e polarizer angle. At
point A th e stray light is verti cally polarized. At point B, observing at
an angle B, horizontally polarized light is also observed .
4·
Figure 3.14. The polarization of the scat tered light is dependent on th e angle of
obs ervation. At point A, the stray light is vertically polarized. At point B, obser ving
at angl e 8, horizontally polarized light is also observed.
Light Polarization 89
Figure 3.15. Generation of a linearly polarized beam by the reflection of natural light
from the flat boundary of a transparent dielectric medium (nl > n2). If the angle
of incidence is equal to the angle of polarization Bp , the reflected beam is completely
linearly polarized. The electric vector of the reflected beam is directed perpendicularly
to the plane of incidence.
__ LE t " - - - l - + - - f ".
Figure 3.16. Principle setup for generation and detection of linearly polarized super
high frequency radio waves (wavelength 3 cm).
Detecting horn
Emitting horn
Figure 3.17. The emitting and detecting horns of electromagnetic waves with 3 ern
wavelength . If the detecting horn is rotated on the axis of the propagation direction,
the output signal of the detector is changing.
SUMMARY
The polarization state of a wave can be measured for a certain po-
sition, but may change for different points of observation, e.g. from
elliptical to linear polarization. Only in the case of a plane wave is the
polarization state always the same.
The superposition principle provides a simple solution for most typical
problems by representing a certain polarization in terms of two orthogo-
nal polarization states. Such orthogonal states are given by two mutually
orthogonal linear polarizations, or by a pair of left and right circularly
polarized waves.
92 DEMONSTRATIONAL OPTICS
.
A J'
\ . :
a \."
Figure 3.18.
Dipole radiation is the most common model for describing the radi-
ation of elementary oscillators within a medium under the influence of
a light wave. It is characterized by a simple angular dependency of the
emitted intensity and of the direction of the linear polarization.
PROBLEMS
3.1. Deduce the STOKES parameters and J ONES vectors for the fol-
lowing polarization states:
x - and y-linear polarization;
linear polarization at 45° and at -45°;
right-handed and left-handed circular polarization.
F R P
Figure 3.19.
tor:
Using the STOKES parameters 80, 81, 82, 83, one may introduce an ef-
fective phase difference between these components. Deduce formulae for
such an effective phase difference and discuss the results.
3.6. Derive the formula (3.12) using the definition of the degree of
polarization (3.22) as given by the STOKES parameters.
The thi ckness of two wedges at t he point where the light ray passes
may be varied to obtain a desired ret ardation ¢ between two orthog onal
compone nts of th e electric vector. In order to select a certain thickness
a narrow slit is mounted in front of the compensator. Discuss a way to
measure t he phase difference 8, provided that the polarizer is fixed and
¢ is varied by t he t hickness of t he compensator plates. The t ra nsmit ted
int ensity is measured by a photodetector.
SOLUTIONS
3.1. To simplify the not ation we assume that two possible ort hogonal
components of the elect ric vector of a polarized monochromatic wave
have unit amplitudes. Then for x-linear polarization these orthogo-
nal components at a point of observation are given by the expressions:
Ex(t) = cos(27rvt), Ey(t) = 0. These expressions result in JONES vectors
and th e STOKES parameters:
80 = 1, 81 = 1, 82 = 83 = °
In a similar way, in the case of y-linear polarization Ex(t ) = 0, Ey(t) =
cos(27rvt ):
E= [~ ] 80 = 1, 81 = -1 , 82 = 83 = °.
For linear polarizat ion at 450 the ort hogonal components are given by
Ex(t) = cos(27rvt) , Ey(t) = cos(27rvt), hence the appropri at es J ONES
vector and STOKES par ameters take t he form:
E=~[~] 80 = 2, 81 = 0, 82 = 2, 83 = °
For linear polarization at -45 0 the orthogonal components are given
by Ex(t) = cos(27rvt), Ey(t ) = - cos(27rvt), t hen the appropriate Jones
vect or and the STOKES parameters take the form:
E ~ !1 ]
= [ 80 = 2, 81 = 0, 82 = -2, 83 = °.
The JONES vectors associat ed with right- and left-handed circular polar-
ization have been considered ab ove (see 3.2). For th e right-handed po-
larization , real functions associated wit h t he orthogonal component s are
Ex(t) = cos(27rvt), Ey(t ) = cos(27rvt -7r j2), so that 8 = cf>x - cf>y = 7rj2
80 = 2, 81 = 0, 82 = 0, 83 =2
Light Polarization 95
In the case of left-handed polarization 83 = -2:
80 = 2, 81 = a, 82 = a, 83 = -2
thus
3.3. We use the results which have been obtained in the previous
problem and rewrite the STOKES parameters for several polarization
states in a vector form:
Ph.D.
/
/
/
a /
/
/
/
/
/
/
/
/
/
/
Figure 3.20.
dealing with the vectors of real components, then the scalar pr odu ct for
the first pair is given by
S182 = 1. 1- 1. 1+0.0+0 .0 = 0
The scalar products of the other pairs can be calculat ed in t he same way.
It can be seen that all th e pairs connecte d to two ort hogonal polarization
states permit a scalar product zero.
where the magnitudes Sl, S2,S3 are associat ed only with the second
beam. Since prop erties of a polarization state follow from measurements
of int ensity, we also assume that the sensitive area of a photodetector
used for the measurements is norm al to the propagation of th e second
beam , which is shown by a dashed line in Fig.3 .20. Hence th e t ot al in-
tensity of the combin ation is measur ed as It cos a + h . Fin ally, for th e
degree of polarizat ion we have:
P, . - V
/ Sl2+ s2
2 + s3
2 _ P 12
T
m tx - It cos a + 12 - It cos a + h
Light Polarization 97
3.6. The formula (3.12) was obtained under the assumption that two
orthogonal components of the electric vector Ex, E y are completely inde-
pendent, therefore the phases cfI x(t) ,.cfI y(t) are independent , too. Hence,
98 DEMONSTRATIONAL OPTICS
in thi s case, the phase difference 0 = <P x - <P y varies uniformly within
the inte rval -7f , n , and th e mean values (AxA y cos 0) and (AxA y sin 0)
are both equal to zero. With these condit ions the STOK ES par ameters
corresponding to such a polarizat ion state are as follows:
So = Ix + I y Sl = Ix - I y o , o .
According to th e definition of the degree of polarization (3.22) we get
t he expression:
p = J sr + s~ + s5 Sl Ix - I y
So = So = Ix + I y
with respect to the major semi-axis a of the the ellipse. It is the semi-
axis which is known due to a position of the optical axis of the plate.
This linear polarization may be detected by the polarizer. When the
intensity transmitted through the polarizer is zero, its principle direction
is perpendicular to Ex. By rotating the polarizer, the intensity '" E~
can be expressed in terns of the intensities '" E~ and", El
PROPAGATION OF
LIGHT WAVES IN MEDIA
It is well known from daily experience, that a light beam falling onto
the surface of a transparent medium partially passes into the substance
and is partially reflected on the surface. In this chapter we shall consider
the laws governing these phenomena, restricting ourselves to the case of
absolutely transparent media, wherein the absorption and scattering of
light are absent.
This restriction to absolutely transparent media is a simplification,
usually applied in optics. Real media always contain a certain amount
of distributed microparticles. Being centers for scattering, these particles
provide the visibility of light trajectories, known as light rays. It is of
interest to note that even such a clear medium as the Earth's atmosphere
with very few dust impurities possesses scattering properties, which take
place on the air molecules itself. Due to the large path length in the
atmosphere these scattering processes are extremely effective. Blue light
is scattered more effectively then red, yellow or green light. This is the
reason for the blue color of the sky. Sun rays traveling into the Earth's
atmosphere also undergo scattering by random air concentrations (or
fluctuations of the air density) arising due to chaotic molecular motion.
Additionally, the presence of dust and water vapors in the lower layers
of the atmosphere yields a strong forward scattering of light rays when
the sun is near to the horizon and where the thickness of air is large.
Under these conditions the sun appears to be red.
But for absolutely transparent media under laboratory conditions the
effect of molecular scattering is negligible, so the molecules of such a
medium may therefore be considered as immovable. Moreover we assume
that no free electrons or ions exist , and the medium is an ideal dielectric,
wherein the positive and negative charges are joined together and the free
101
O. Marchenko et al., Demonstrational Optics
© Kluwer Academic/Plenum Publishers, New York 2003
102 DEMONSTRATIONAL OPTICS
rot E =
aB
-- (4.5)
at
rotH
aD
= at (4.6)
The wave equations appropriate to thi s syst em of MAXWELL 'S equations
are similar t o (2.22) and (2.23):
a 2E a2 E a2 E a2E
ax 2 + 8y2 + az2 = J.LC at2 and
a 2H a 2H a 2H a 2H
ax 2 + ay 2 + a z2 = J.LC at 2
Light Waves in Media 103
S=ExH
where S is the unit vector in the direction of propogation; the factor co] k
is equal to the magnitude of the phase velocity of light in the material:
V=-=--=-
w
k
1
.jJi£
A
T
(4.8)
(4.9)
n=J;o~~ (4.10)
is the refractive index of the optical material, where /-l ~ /-l0, that is, we
assume the magnetic permeability /-l of the dielectrics to be practically
104 DEMONSTRATIONAL OPTICS
--,
d
...<. "::
/'
---
<,
/'
1.._-
<,
(
I
--- -....
b
Figure 4.1. A small rectangle abed surr ounds an element AS , enclosing two med ia
having £1 and £2 .
f
L
E dl = - ~ I
S
BdS ,
we t ake the first int egral around t he rect angle abed and the second one
over its area 6.s. If the height of th e rect angle h = ad = be is smaller
t han th e lengths ab and ed, then the largest contribution to the first
integr al will be given by the tangenti al components of the field, rather
Light Waves in Media 105
f
abed
E dl ~ (t 1 . E 1 + t2 . E2)ab ,
where tl and t2 are the unit tangents along the sides of the rectangle:
where n12 is the unit normal vector to the boundary pointing from the
first medium into the second one; s is the unit normal vector of the plane
of the rectangle (see Fig.4.1). Integration over the area of the rectangle
gives, for a rather small fj.S, the following:
-~
dt
J BdS ~ - (~B)
dt
·sfj.S = - (~B)
dt
·s(ab)(bc)
118
tl . E 1 + t2 . E 2 = - (:t B ) ·s(bc)
E t 2 = En (4.12)
in other words, the tangential component of the electric vector does not
change across the boundary. An analysis of equation (4.6), which is
similar to (4.5), results tina the boundary condition for the tangential
components of the magnetic vector H,
Ht2 = H n (4.13)
;'
-._-
;'
f DdS =0 .
or
En2c2 = cIEnl (4.14)
Here we use the notations E n 2 , E n 1 for the normal components of the
electric vector. It is seen that equation (4.4) is similar to (4.3) , therefore
for the normal components of the vector B one can derive the condition:
Light Waves in Media 107
Water
surface
(4.15)
8=(3+"( ,
where 8 is the angle between the two active sides of the prism , called the
refractive angle. So, relations (4.17) and (4.18) provide the solution of
refraction on the prism. In any application the case of minimal horizontal
inclination of the incident ray is of interest. If this inclination is specified
by an angle cp, the angle between the incident ray and the outgoing ray,
the minimal value of ip will be observed at f3 = "(, or when the refracted
ray passes inside the prism parallel to its basic side. In this case
is valid.
The refractive index of the prism material can be determined using a
spectrometer (Fig. 4.5). Light from a source passes through a narrow
vertical slit and is then collimated by an objective into a practically
parallel beam. This beam is refracted by the prism and focused by
the second objective into the output plane. The colored pictures of the
entrance slit are observed by means of an eye-piece while slowly turning
the prism to find the position which gives the minimal inclination cpo
By means of a spectral lamp (e.g. a Hg-Cd discharge lamp) emitting
several well-known wavelengths, the refractive index can be determined
in its dependence on A. For each wavelength the minimal deviation is
observed for a different angle.
In the opposite case, when the refractive properties of the prism are
known, one can examine the spectrum from a light source. For example,
the line spectrum of a Hg-Cd discharge lamp consisting of a set of bright
lines, is shown in Fig.4.6,a. Contrarily, the continous spectrum of a
xenon lamp (presented in Fig.4.6,b) is close to the visible region of the
Light Waves in Media 109
A A
a b
Figure 4.4. Refraction by a triangular prism for a general case of incidence (a) . The
case of minimal inclination for a certain wavelenth is fulfilled for rays which pass
parallel to the prism's basic side (b) .
Vertical
slit Objective
(Collimator)
Spectral
lamp
Eye
piece
Figure 4.5. A spectrometer used for the determination of the angle of minimal decli-
nation of rays . For each spectral line of the discharge lamp the prism has to be turned.
From these angles , the refraction index of the prism material can be determined.
(4.20)
110 DEMONSTRATIONAL OPTICS
(A ~ 630 nm).
Figure 4.6. The spectrum of a spectral lamp , e.g. containing Hg and Cd, shows some
discrete wavelengths (or spectral lines) (a) . The continous spectrum of a xenon lamp
is close to the solar spectrum (b) .
Light Waves in Media 111
Reflected ray
Figure 4.7. Total reflection of a laser beam, passing through two nearly-touching
glass prisms . The horizontal incident beam is divided into two beams - one is the
horizontal transmitted beam, the other is the vertical reflected beam. Their intensity
ratio is determined by the distance between the touching surfaces. The ext ernal forces
needed to provide a distance small enough are indicated by th e arro ws.
z
Evanescent
field
A Detecting horns ~
Paraffin prisms ~ B
Emitting hom
Figure 4.9. The evidence for the surface wave demonstrated with radio-waves.
.~';i)':_>c- -
/ /" /
Figure 4.12 A small ball
lens used to increase the
_l _ numerical aperture of an
optical fiber.
Light Waves in Media 115
2.2.4 Rainbow
Refraction and reflection of sun rays by water drops contained in the
atmosphere is responsible for the well-known natural phenomenon of the
rainbow. We can see a rainbow as a colored arc in the sky after rain when
a cloud moving away from us is illuminated by the Sun . The description
of this phenomenon is based on the reflection and refraction of solar rays
in spherical water drops . In the simplest case there are one reflection
and two refractions for a ray, falling on the inner surface of a drop as
shown in Fig.4.13. One can easily find a relation between the directions
116 DEMONSTRATIONAL OPTICS
of the incident and the reflected rays , assuming a spherical shape for the
water drop.
For example, let a parallel beam SA fallon a drop at the point A and
let a = 60° be the angle of incidence. For a given refractive index for red
rays in water, nred = 1.331, we find, for the angle of refraction (3, sin (3 =
sin a / nand (3 = 40°33'. After total reflection on the inner surface of the
drop this ray will return to the surface, refracting again at point C. The
angle of refraction is also equal to a = 60° as shown in figure Fig.4 .13.
Through geometrical considerations, now one can find the angle between
a ray reaching an observer and a horizontal line to be n = 4(3 - 2a. For
th e red ray in our example this angle is equal to n ::= 42°. Due to
dispersion a blue ray will leave the drop at a smaller angle, as follows
from the value of the refractive index for blue rays nblue = 1.345. In this
case n is approximately 40°, therefore the observer will see the blue arc
of the rainbow below the red arc , i.e. as the inner arc. It is clear , the
angle of 60° assumed above is rather arbitrary, and a pencil of incident
rays with different directions and different incident angles should be
considered, instead of only one ray falling on the drop at the particular
angle a. The problem has to be treated with scattering theory, and is
not so simple . However, the estimation given above gives approximately
correct values of the visible angular dimension of a rainbow.
Boundary
of medium
Figure 4.14. Reflection and refraction at the boundary of two media. Amplitudes
of the incident, reflected and refracted waves polarized in the plane of incidence are
marked by II, and those polarized normal to this plane by .L.
Let Ey) , Ey) and Ef) be the complex amplitudes of incident, re-
flected, and refracted waves polarized normally to the plane of incidence,
t
respectively, and let EI~i) , EI~r) and Efl ) be the complex amplitudes of
incident, reflected and refracted waves polarized in the plane of inci-
dence , respectively (see FigA.14) . The fact that these three waves are
located in one plane, according to SNELL'S law and the law of reflection
(4.15),( 4.16), enable one to find relations between their amplitudes.
The boundary conditions for the electrical components of the waves
are also used for this purpose. Each component of the electric vector of
the incident, reflected and refracted wave may be presented in a similar
form:
E = Eoexp[-i(27l"lIt - kr)]
For the origin, where x = y = z = 0, the variable parts for all the waves
are reduced to the factor exp( - i 27l"lIt ). This factor, being common to
all three waves, may be omitted, hence one can apply the boundary
conditions in the forms (4.12), (4.13) to the complex amplitudes. Since
the tangential components of the vectors E and H are continuous across
th e boundary we get:
E(i)
y
+ E(r)
y
= E(t)
y
118 DEMONSTRATIONAL OPTICS
H(i)
x
+ H(r)
x
= H(t)
x
H(i)
y
+ H(r)
y
= H(t)
y
Using the relation (4.11): Ho=~(n x Eo) and the angles ()i, ()r and
()twe represent the projections of the elect ric and magnetic vectors in
terms of E.l and Ell. In this way we obtain from the first pair of relations
E(i)
.l
+ E.l(r) = E(t)
.l
and ~l cos ()·(E(i) _ E(r)) = ~e2E(t) cos () .
t . l . l . l t
J.L J.L
for the components of the electric vector polarized normal to the plane of
incidence . The second relation allows the introduction of the refractive
indices nl = Jel/eo and n2 = Je2/eo (see 4.10). Thus this pair of
relations may be presented in the form:
E(i) +E(r)
.l.l.l
= E(t) and nl cos ()·(E(i)
t .l
- E(r))
.l
= n2E(t) cos ().
.l t
. (4• 21)
These relations, connected with the magnetic vector, permit the follow-
ing relationships for the parallel polarized components:
The relations (4.23) are called the FRESNEL formulae , which directly
provide the simple phenomenological consequences connected with the
reflection and the refraction of polarized light. We shall introduce re-
lationships between the intensity of the incident light beam and the
intensities of the refracted and the reflected beams before considering
applications of the FRESNEL formulae.
The real ratio of EY)/ Er) can be used to derive a ratio between the
incident intensity and the reflected one in the following form:
(4.24)
The quantity R1.. defines the portion of the intensity corresponding to the
reflected wave. It is called the power reflectivity for the wave polarized
normally to the plane of incidence. If we calculate the ratio Ef)/ Er)
from (4.23) and call the square of the absolute value T1.. = IEf)/ Er) 12 ,
we obtain
T 1.. = sin 2()t sin 2()i
(4.25)
sin 2 ( ()i + ()t)
This quantity represents the portion of the intensity corresponding to
the transmitted light , provided the polarization is normal to the plane
of incidence. The quantity T 1.. is called transparency. Assuming that ab-
sorption is absent and that a wave polarized in the plane of the incidence
is also absent, we get
I j--" E(~
II
Figure 4.15 Components
O)
E11---11 of the electric vector of an
incident and reflected wave
n,
E II(i ) an d E(r)
II ' bot h In
i t he
plane of incidence, are in
opposite directions.
This fact results from the general concept, that the intensity of two waves
linearly polarized in two mutually orthogonal directions is represented
by the sum of the intensities of both components.
Taking into account that EI~r) is oriented in the opposite direction with
respect to EI~i) (see Fig.4.15), we conclude that the phase undergoes a
Light Waves in Media 121
change of 71" in this case. For the same case, n2 - nl > 0, the ratio
Et) I E~) = - E~r) I EI~i) is real and negative (see (4.30)) and again a
phase change of 71" takes place. So the oscillations of the electric field
vector in the wave reflected by an optically thicker medium in the case
of normal incidence undergo a phase change of 71" with respect to the
incident wave. Contrarily, the vibrations of the refracted wave are in
phase with the incident wave.
The distinction between the amount of reflectivity of the parallel and
the perpendicular components disappears in the case of normal inci-
dence, and from the relations (4.29) and (4.30) we find:
The value of R when the wave falls normally on the boundary between
glass and air (n2 ~ 1.5 and nl ~ 1) can be estimated as:
R = (1.5 - 1) 2 ~ 0.04 .
1.5 + 1
This result means, that only 4% of the incident intensity is reflected.
The FRESNEL formulae allow the establishment of phase changes be-
tween incident , refracted and reflected wave. All ratios between par-
allel and normal components of these waves, which can be formed by
means the FRESNEL formulae (4.23), are real. This means that the phase
change between appropriate pairs of the components may be invariant
or changes by 71" . It can be seen from (4.23) that E~t) and E~) have the
same signs as EI~i) and E~), therefore the phase of the transmitted wave
is equal to that of the incident wave.
In the case of the reflected wave, the ratios between the orthogonal
components depend on the relative refractive index or on the relationship
between the incident and refracted angles. With reflection by a denser
medium, when n2 > nl and ()i > ()t, the signs of E~) and Et) are
different, which leads to a phase change of 71". Similar considerations
predict a phase change of 71" for EI~r) and E~) .
Boundary
of medium
Figure 4.16. Refraction and reflection at the BREWSTER angle (nl < n2). An ind uced
dipole located on the dielectric boundary may be represented by projections of the
dipole moment PI! and Pl., where the PI! lies within the incident plane and Pl. is
normal to this plane. Oscillations of PI! do not cause radiation in the direction of the
reflected wave, so only Pl. gives rise to th e reflected linear polarized wave.
the angles:
cos Ot n2
= (4.32)
cos Oi nl
This relationship together with the law of reflection (4.15, 4.16) is valid,
when
(4.33)
This implies that the reflected and the refracted rays form an angle of
90° (Fig. 4.16). Now, using (4.32) and (4.33) we find the condition for the
angle of incidence, which is called the BREWSTER'S angle or polarizing
angle, in the form:
(4.34)
As we have seen, the existence of this effect can easily be deduced using
the model of harmonic oscillation of surface charges and the radiation
charact eristics of a dipole (see Chapter 3, sec.3.2).
Light Waves in Media 123
711 =
T.!.
(ni + n~)2
2n2nl
(4.35)
Taking the values of the refractivity indices of the glass as n2 :::::: 1.45 and
of air as nl :::::: 1.0 we estimate this ratio as 711fT.!. :::::: 1.145. This ratio
124 DEMONSTRATIONAL OPTICS
~
I
I
a b
E, E,
Figure 4.18. A stack of thin plane-parallel plates. The electric vector component Ell
of the incident light beam lies in the plane normal to the flat surfaces of the plates.
Th e transmitted light beam is polarized to a high degree in the plane normal to the
surfaces of the plates.
4. Dispersion
As we already pointed out the propagation properties of electromag-
netic radiation (and of light waves) within an ideal uniform dielectric
can be described in terms of the dielectric constant e and the refrac-
tive index: n 2 = cleo. The simplest model of an ideal uniform dielectric
medium is an ensemble of identical classical oscillators, consisting of elec-
trons which can move around their equilibrium positions. If the initial
displacement of the electron is ~x , it experiences (for small dislocations
~x ) a linearly increasing force with displacement: F = -k~x and will
then oscillate with its resonant frequency wo = y'klrne.
The dependence on frequency of the elementary oscillators in the elec-
tromagnetic field of the light wave is called the dispersion of the medium,
i.e. the dependency of the dielectric permittivity and the refractive in-
dex on the frequency of the wave. It is well known from mechanics and
from the theory of electricity that the properties of an oscillator depend
on the ratio between the frequency of an external force and the resonant
frequency of the oscillator. Generally two different limit cases can be
distinguished. The first is when the frequency of the external periodical
force is much smaller or much larger than the resonant frequency wo , and
Laser mirror
Discharge lube
Brewsterwindow
Figure 4.19. Generation of laser light of high degree of linear polarization state.
Inside the laser resonator, the light passes many times through the BREWSTER window
of the discharge tube. This provides the linear polarization of the laser beam .
126 DEMONSTRATIONAL OPTICS
the second case when the frequency of the external force is close to woo
In optics the second case is commonly associated with light absorption
within a certain frequency band being characteristic for the medium.
The optical equivalent of the first case is the region of dispersion far
from the band of absorption.
d 2x dx 2 e
-dt 2 + 2')'-
dt
+ wox = - -~e'
E (4.38)
which tells us that x(t) has the same temporal behavior as E(t) . Multi-
plying both parts of (4.41) by -e we find the expression for p(t)
e2/~
p(t) = -x(t)e = W 2 - 2
w -
e2 "
t')'w
E(t)
o
Hence, the atomic polarizability a in (4.36) is given by
e2/~e
a = --;;----';:;----::---
w6 -w 2 - 2i')'w
Taking into account equation (4.37) for the relative dielectric permittiv-
ity we get:
e2N 1
E = 1 + -- -."..-----,,,---- (4.42)
2
~co w6 - w - 2i')'w
128 DEMONSTRATIONAL OPTICS
e2 N 1
fi2 =E= 1+---~---- (4.43)
meo w5 - w2 - 2i'yw
We formally represent the complex refractive index in (4.43) by intro-
ducing its real and imaginary parts:
fi = n + ir:
so that
fi2 = (n + iK,)2 = n 2 - K,2 + 2inK (4.44)
Introducing the constant
(4.45)
Example
For diluted gases under normal conditions (at 00 C and 1060 mbar)
the frequency wp ~ 3 . 1014 radian per second, whereas for the visible
range of spectrum w ~ 3.6.10 15 radian per second (green line). Referring
to the fact that hard ultraviolet is strongly absorbed by air it is possible
to estimate Wo ~ 1 . 1016 radian per second. (This fact is a fundamental
condition for the existence of biological life of the Earth!)
Figure 4-20. The function -x/(l + x 2 ) corresponds to the dispersion curve new) at
x = llwh, and the function 1/(1 + x 2 ) of the same argument is associated with the
imaginary part of n.
The second curve K(W) shows the absorption profile, which in this
particular case is given by a LORENTZIAN curve described by the depen-
dency
(4.52)
Horizontal Objective yl
slit
X'
--++-~H-#--:>I't-- 'L.I- ' _ ._ ._ . . _ . _ . - . _. _. - . -=--
. -'Ar - _ . ~
Observer
Vertical y
slit
Figure 4.21. A setup for observing the disperion of glass in Newton 's scheme of
crossed prisms.
1 1 w2
----".---=-2 ~ -+-
w6 -w w6 w6
and the expression for the refractive index, represented in terms of the
wavelength, takes the form
(4.53)
where
w2 41T
2C2
w2
A = 1 +...E. and B = p
w6 w6
Relationship (4.53) is called CAUCHY 'S formula. This approximation
provides an accurate description only for low density media, for instance,
gases, within which the refractive index is close to one.
slit . If the refractive edge of the prism is vertical and the base of the
prism is located as shown in Fig.4.21, then the spectrum in the (x ,y)-
plane will be horizontal; its" blue" side is associated with larger x than
the "red" side. In (XI ,yl) we have to rotate the coordinate system due
to the real image formed by the second objective. When now inserting
a second glass prism after the horizontal slit, the spectrum within th e
(XI, yl)-plane will bend (Fig.4.22).
The observer behind the (XI, yl)-plane may interpret the bending of
the spectrum as caused by the dependency n(>.), because the displace-
ment of the spectrum's colors is connected to wavelengths.
It is obvious, that the shape of. the spectrum, presented in Fig.4.22 ,
has to depend on the positions of the bases of the crossed prisms , which
may be illustrated by the following exampl e. Let point A lie on the
optical axis of a lens (Fig.4.23) and let the lens produce a real image PI
of the plane P. Without the prism the image AI would be located on
the optical axis too.
Now, let a point source of monochromatic light be set at the point
A, and let us insert a prism. We find the location of the real image of
this source on plane PI caused by refraction in the prism. Let a light
-- x'
Figure 4.22. The spectrum of a xenon lamp formed by two crossed prisms. The
shape of the spectrum gives the dependency n(A) in the Cartesian system x', y' . The
value of A decreases in the positive direction of the x'-axis.
V~rtical / R y
slit Sodium U Objective
vapor Heat source
Figure 4.24. A setup for observation of the absorption and dispersion behaviour
around the D- lines of sodium, using a sodium vapor prism .
ray, leaving from A along the optical axis, be refracted by the prism in
the direction CD . The extension of the short line CD intersects plane
P at point B, which is a virtual image of point A formed by the prism .
In turn, a straight line drawn from B through the center of the lens
will intersect PI at a point BI . Therefore BI is the real image of the
monochromatic source located at A.
If a source of quasi-monochromatic light is set at point A, a spectrum
of the source will appear within PI . In the case of a point source , the
spectrum is a colored straight line, which runs from B" to BI, provided
that B" and BI are associated with the red and violet limiting wave-
lengths of visible light provided by this source . Now it is easy to take
into account the effect of the second prism crossed with the first one.
D · lines
•
y'
sharp absorbing spectral lines (called the D-lines of sodium at )'1 = 589.0
nm and A2 = 589.6 nm). Metal vapors are commonly used in atomic
physics experiments since they provide strong absorption lines in the
visible region of the spectrum.
The setup for observation of the dependency of the refractive index
on frequency near the absorption lines of sodium is shown in Fig. 4.24.
A xenon lamp , a bright source of light with a continuous spectrum, is
used . With the help of a condenser lens it illuminats a narrow vertical
slit. By the first objective, an image of this slit is produced in the plane
of a narrow horizontal slit mounted at a considerable distance from the
vertical one. A second objective provides an image of the horizontal slit
on the screen xt, yl. If a glass prism is placed after the vertical slit, a
spectrum of the lamp is formed on the screen. An evacuated tube of
suitable size ('" 70 em length, '" 5 em in diameter) acts like a prism
with a horizontal base when it contains a column of sodium vapor with
vertically varying density. For generating this vapor, a small amount of
metallic sodium is placed in the middle of the tube before its evacuation.
By heating the bottom of the tube and keeping the top of the tube at a
low temperature, a vertical column of sodium vapor of varying density
appears.
The inhomogeneous heating causes the vapor density to be higher at
the bottom of the tube and smaller near the top of the tube. This column
of sodium vapor refracts the light rays like a prism with a horizontal edge,
therefore a curved spectrum just like that in NEWTON's experiment can
be observed. The spectrum around the D-lines (which are not resolved)
is shown in Fig.4.25. In the vicinity of the D-lines all light is absorbed by
the metal vapor (dark band), and the refractive index increases and then
jumps to another value on the other side of the absorption region. The
colored spectral band is strongly bent, but its slope always has the same
sign (normal dispersion). This spectral curve represents the behavior
of the refractive index with respect to the variation of the wavelength.
When using a spectral device with more dispersion (e.g. a set of several
prisms instead of the one shown in Fig. 4.23), a small region of normal
dispersion would then appear between the resolved D-lines.
Figure 4.26. The spatial distribution of the electric vector in a wave composed by
two monocromatic waves with .a small difference in frequencies. With propagation in
a medium , the envelope A(t) propagates with the group velocity u .
denoted by the dotted line. It is clear that the time - spatial dependency
of the envelope is given by the cosine function :
One can find two neighboring zero points of the envelope from the rela-
tion
c5kfj.z = ±7f/2 , (4.58)
at t = canst, or from
(4.59)
at z = const . If t = const , .6.z is the spatial separation of points having
the same phase , whereas for z = canst, fj.t is their temporal separation.
In both cases (4.57) and (4.58) the net wave, being limited by the two
neighboring zeros, is usually called a wave group, or wave train. The
velocity of the wave group , which is denoted by u , is the velocity of the
envelope , which is called the group velocity. When fixing any point of
the envelope, the group velocity can be found from the condition
c5wt - c5kz = 0 ,
where
z c5w
u= - = - (4.60)
t 15k
In the general case, a wave group may be treated in terms of the su-
perposition of elementary waves within a narrow region of wavenumbers
k around k o associated with the carrier frequency Wo of the wave group.
Let us consider a wave group E(t, z), propagating along the z-axis.
Mathematically such a wave group is represented by summing up the
elementary monochromatic waves:
where the function g(k) describes the amplitude distribution of the ele-
mentary monochromatic waves.
We fix a certain propagation number ko and assume that the concept
of group velocity is valid, because the dependency of w on k is very weak
within a narrow region of k. This means that the expansion of w(k) near
ko,
_ dw(k) d2w(k) 2
w(k) - w(ko) + ~(k - ko) + 2dk 2 (k - ko) + ... ,
may be restricted only to its linear dependency on (k - ko) :
dw(k)
w(k) ~ w(ko) + ~(k - ko) .
For w(k) we substitute the latter expression into the exponential factor
of the wave and expand k to k = k - ko + ko . We then get
Thus the expression for the wave group takes the form:
Figure 4-27. The dispersion curve v(>.) associated with waves propagating on the
surface of water . The function v(>.) has its minimum at Vo = 23 cmls (under normal
conditions) and >'0 = 1.7 em. The region of wavelenths between >'0 and>. = 0
corresponds to anomalous dispersion, the region between >'0 and >. -+ 00 to normal
dispersion.
density p. This part of the surface energy of the unit mass element is
given by the expression kCJ/ p. Therefore, the larger the deformation is,
the higher the surface energy. The second part of this energy is formed
by gravity. The unit mass element of the water surface possesses a higher
energy depending on the gravitational acceleration 9 and the wavelength
>.. Hence, this part of the surface energy is directly propotional to 9 and
A. Therefore the relationship between the phase velocity v and the prop-
agation number k = 21T/ A is given by the expression
v2 = fL + k~ (4.63)
k P
The function V(A) is shown in Fig. 4.27. Under normal conditions the
minimal velocity is v ~ 23 cui]« when Ao = 1.7 em. If >. < 1.7 em, the
so-called capillary waves are observed . The phase velocity of this type
of wave increases extensively with a decrease in the wavelength. In the
other case, when>. » 1.7 em, gravity is the dominant factor for the
appearance of surface waves, which are called gravity waves. One can
observe gravity waves with a wavelength up to >. "" 3/4 km, a period of
"" 23 s and a phase velocity of "" 120 km per hour.
Now we will obtain a relationship between the phase velocity v and
the group velocity u. From the relationship between the propagation
number and the angular frequency, k- v ="W, we find
kbv + v8k = 8w
140 DEMONSTRATIONAL OPTICS
v2 :::::: ft
k
and the RAYLEIGH formula takes the form u = (1/2)v.
4.3.4 Simulation of the motion of wave trains
In practice it is difficult to realize convenient conditions for observ-
ing the motion of wave trains on the surface of water. Nevertheless,
this phenomenon is unique in showing both normal and anomalous dis-
persion. By using the constants g, p and a in (4.63) we may create a
computational experiment describing the propagation of wave trains on
the surface of water.
Light Waves in Media 141
Figure 4.28. Images produced by the computing technique described in the text
illustrates the motion of wave groups on the surface of water. The initial position
of every wave group is marked by the point A on the z- axis, which corresponds to
th e centers of the wave groups at t = O. Points B,a,n correspond to the centers of
th e wave groups after the time interval At = 8.5 s. The wave group positioned at
point a moved without dispersion; the wave group arriving at B moved under normal
dispersion; and the group arriving point D moved under anomalous dispersion. The
white strips indicate maxima of the amplitudes within every wave group.
a(z, t) rv ~
120
cos [(2
)..: + a m4~060) (tvo - z + 50)] (4.66)
v
'-....2.- 1 .
-- - - -~
I I I Figure 4.29 The region of
I I I
I I I wavelengths between A1
I I I
I I I and A2 is considered to
I I I
I I I form a wave group, propa-
I I I
I I gating without dispersion.
142 DEMONSTRATIONAL OPTICS
Vo -- --~ I I I
J I I Figure 4-90 The region of
I I I
I I I wavelengths between A1
I J I
J I I and A2 for forming awave
I I J
I I J group , which propagates
I I : under conditions of normal
dispersion .
v
2
Vo -- - -~
I I I
I I I Figure 4-91 The region of
I I I
I I I wavelengths between A1
I
I
I
I
I
I and A2 for forming a wave
I
I
I
I
I
I
group, which propagates
I I I under conditions of anom-
alous dispersion.
where Q: = 0.25, 400 is the net number of points z taken for the calcu-
lation and 50 specifies the initial position of the wave group at t = O.
Summing up the monochromatic components is similar to moving from
point 1 to point 2 in Fig.4.29. We convert the distribution a( z , 0) into
a set of values, each providing a level of a gray scale, and create a pic-
t ure showing the spatial distribution of the amplitude. The distribution
a(z , t) at t = 0 is shown in Fig.4.28, where the point A specifies the cen-
ter of the wave group. In a similar way, the distribution a(z, t) for the
time t = 8.5 s is calculated. This distribution is also shown in Fig.4.28,
and the center of the wave group is marked by C. If the wave group
propagates between points A and C with the velocity u = Vo = 23 cui]«
the distance AC = 23 x 8.5 = 195.5 em.
To illustrate the case of normal dispersion we take into account the
dependency of the phase velocity on the wavelength and choose a new
region for variation of the wavelength (Fig.4.30), where the wavelengths
are now distributed around A2 = 1.8 em. The spatial distribution of the
amplitude is given by the expression
120
a(z, t) '" . ; cos
[(2A: - Q:
60) {t(Vl +
m~O f3 * m) - z
]
+ 50}
nisms since the properties of this radiation did not correspond to any
known optical process of luminescence in liquid or solid bodies and did
not even correspond to the radiation of decelerated charged particles.
One of the most thoroughly studied subjects at the time of this discov-
ery was the phenomenon of luminescence, which was understood as light
emission by the microparticles of material substances, atoms or mole-
cules. Experimental and theoretical investigations, as well as known
laws of classical and quantum mechanics, allowed the classification of
different types of luminescence in terms of the characteristic radiation
time by an atom or a molecule. This effective radiation time depends
on the type of particles and on the excitation conditions. All the known
experimental methods of measurements for the radiation time lead to
the conclusion that VAVILOV-CHERENKOV radiation was of a different
nature. This conclusion was supported by the experimental fact that
th e intensity of VAVILOV-CHERENKOV radiation was weakly dependent
on the kind of liquid. All experiments showed weak blue emission of
VAVILOV-CHERENKOV radiation, which had almost the same intensity
for all liquids , and was dependent only on the incident , - ray intensity.
As a possible mechanism of this phenomenon, VAVILOV proposed the
so-called deceleration radiation. Under the influence of , - rays, fast
free electrons are set free from atomic particles and move through the
liquid. As a result of the electrostatic interaction with the surrounding
at oms or molecules, these fast electrons are quickly decelerated. As it is
known, accelerated or decelerated motion of a charged particle creates
radiat ion. Nevertheless, further studies of the intensity and the spectrum
of the VAVILOV-CHERENKOV radiation showed that the relative intensity
of the deceleration radiation in the optical part of the spectrum is about
100 times lower than the experimentally recorded intensity of this new
typ e of radiation.
Later, a rigorous theory of the VAVILOV-CHERENKOV phenomenon
was created by LFRANCK and LTAMM [28]. According to this theory
t he cause of VAVILOV-CHERENKOV radiation is an electron moving in
th e medium with a very high velocity exceeding the phase velocity of
light in this medium. Such fast electrons leave the atoms of the liq-
uid after absorption of hard "t: radiation in the VAVILOV-CHERENKOV
experiments.
Let us discuss the mechanism of VAVILOV-Chrerenkov radiation at
a qualitative level. Suppose that a charged particle is moving in an
optically transparent medium with the velocity v q exceeding the phase
velocity of light in this medium:
c
vq > v =-
n
Light Waves in Media 145
Figure 4.32. The polarization effect on the particles of a medium caused by a negative
charge moving with relativistic velocity v. The polarized particles located along the
trajectory of the charge become coherent sources of secondary spherical waves. The
superposition of the secondary waves is the reason why coherent Vavilov-Cherenkov
radiation is emitted by the medium.
This simple result is obtained under the supposition, that the moving
charge creates a secondary wave at every point along its trajectory and
that the resulting field arises by superposition of these secondary waves
while accounting for their phases.
A similar mechanism of sound wave creation takes place when a body
is moving in a medium with a velocity exceeding the velocity of sound.
The well-known effect of shock wave generation, when an airplane ex-
ceeds the sound barrier, has a similar physical nature (secondary wave
creation by the flying airplane) to the generation of an "opt ical shock
wave" by the relativistic charged particle flying through a medium. In
the acoustic case the shock wave front is located at the surface of the
MACH cone and propagates at the angle () with respect to the airplane
trajectory. The opening angle of the MACH cone () may be determined
from the expression analogously to (4.67):
V
cos() = - s
VI
where VI is the velocity of the airplane after exceeding the sound barrier,
V s is the velocity of sound in the air, (VI> Vs).
CHERENKOV radiation is used by a number of devices known as CHE-
RENKOV counters for the registration of fast charged particles. In order
Light Waves in Media 147
Photo
-t--- multipliers
Opaque housing
Figure 4-34. Scheme of a Cherenkov counter for detecting cosmic particles . Two
identical photomultiplies, both having a photocathode diameter of 16 em, are arranged
facing each other vertically. A light flash in the fluorescing glass is detected by the
first photomultipier and the Vavilov-Cherenkov radiation in the organic glass by the
second one.
+q -q
---o-.... ......::::;j~-"-""O
-v
Figure 4.35 A positive
charge moving towards
a flat infinite metallic
boundary will give rise to
transient radiation when
penetrating the metal.
+q
Figure 4,36. A set of dielectric flat plates may cause an appreciable amount of tran-
sient radiation.
particle is also directed towards the plane. The sign of the image of the
particle is opposite to the sign of the real particle.
Therefore , the problem of the motion of a charged particle towards the
boundary is reduced to the equivalent problem of an electric dipole with
a variable dipole moment (FigA.35). Once the charged particle crosses
the metal boundary, both the image charge and the dipole disappear.
It is this disappearance of the dipole moment that creates the transient
radiation. Apart from the transient radiation an ordinary deceleration
radiation exists as well. As the transient radiation is characterized by
several distinguishing features, it is possible to separate it from the back-
ground of the deceleration radiation.
Transient radiation arising from the crossing of a transparent dielectric
boundary is rather weak. For this reason a set of immediately adjacent
transparent plates is used (FigA.36). Transient radiation becomes more
concentrated in a specific direction when the energy of the charged parti-
150 DEMONSTRATIONAL OPTICS
e- Ve
11?A~ B
----'+6'..,1 :, 12 ,------=i1
'--_...J
e+ I
6
Figure 4.37. A moving electron e" and its image e+ caused by the flat surface of a
diffraction grid. Electron and image compose an electric dipole of the length h . The
length of the dipole increases to b if the electron moves along a depth of the grid .
This change of the electric dipo le is a reason for production of visible light waves.
A
T-Tl = mT=m-
C
While the electron is moving to the next groove of the grating the
next pair of wave perturbations arise in the same way. The amplifica-
tion condition for these waves in the direction AS is the same. Hence,
5
c
AI--_...J
1---- d
Figure 4.39. Two steps located at the points A and B. Both points emit spherical
waves. A superposition of these waves will give rise to a maximum of radiation of A
at the angle 0, if rnA = d(c/v e - cos 0), where V e is the electron velocity and m is an
even number.
152 DEMONSTRATIONAL OPTICS
while moving over the grooves of the grating the electron excites an in-
creasingly amplified wave, propagating at angle () with respect to the
surface of the grating.
To estimate the wavelength of the radiation we use the parameters of
the PARSELL and SMITH experiment: an electron energy of 300 keY and
a grating with a groove distance of d = 1.67 um. Using the relativistic
formula for the kinetic energy of a moving electron:
£ = £0 _ £0
J1-~
we determine the ratio velc for an electron with the energy 300 keY.
The total energy of an electron at rest is £0 = 511 keY. Substituting
this value into the last formula we find vel c ~ 0.8. Referring to formula
(4.68) we see that radiation in the visible spectral range is possible at
small angles () for m = 1. For example, for () = 20° and cos () ~ 0.94,
cju; - cos () is close to 0.3. So, for a grating of period 1.67 um, the
wavelength of the generated light is estimated to be A ~ 0.5jlm. This
radiation corresponds to the green part of the visible spectrum.
Summing up these phenomena, we note that an occurrence of radia-
tion is possible even for uniform motion of charged particles, if it takes
place either inside a material medium or when the particle is crossing the
boundary between two media with different refraction indices . The qual-
itative interpretation of these phenomena lead to the fact that uniform
motion of a charged particle creates secondary waves inside a medium.
These secondary waves will superimpose according to the equi-phase
condition, because they are generated by the uniform linear motion of a
single charge . It is important to not e, that the emergence of radiation is
strict ly connected to the motion of one charged particle perturbing the
medium in the same way over its trajectory. In cases where an ensemble
of charged particles is moving, with a certain velocity spread, such as an
electron beam, these effects disappear.
SUMMARY
PROBLEMS
4.3. A light wave falls on the flat surface of a dielectric material with
nearly grazing incidence (Fig.4.41). Prove that the oscillations of the
154 DEMONSTRATIONAL OPTICS
electric vector of the reflected wave will have a phase shift of 7r with
respect to the incident wave.
Figure 4.40.
I
I
~ 7; )
I
I
Figure 4.41.
Figure 4.42.
Light Waves in Media 155
Figure 4.43.
4.6. A muon of energy E.p. = 0.5 GeV propagates within dense barium
crown glass. Due to dispersion, CHERENKOV radiation is grouped within
a narrow range of conical surfaces. The deflection of rays within the
visible range of wavelengths may be taken into account by a difference
in the angles () of the CHERENKOV radiation. Estimate the difference
fj.(} occurring between blue (AI = 486 nm) and red rays (Ared = 659
nm) provided that the refractive indices are nl = 1.615 and n2 = 1.605,
respectively.
",
Figure 4.44.
SOLUTIONS
4.1. We write the expression for the complex amplitude of the electric
vector in the incident wave as follows:
E, = Eo exp[i(wt - kz + <p)]
The glass plate introduces the phase difference t::.<p = kd(n - 1) thus
the electric field occurring after the plate, may be presented by the
expression:
It can be seen that the first term of the transmitted wave is the same as
the incident one, whereas the second term may be treated as an action
of the atomic oscillators of the glass plate. In other words, the field
Eokd(n - 1) exp[i(wt - kz + sp - 7r/2)] emitted by the glass plate has
a very small amplitude compared with that of the incident wave and a
phase shift of 7r/2.
Figure 4.45.
With pi = tiki and pt = tik t , we get the relations for the transformations
of propagation number k and wavelength A:
l'Oi l'Ot
Ai sm = At sm
Since the frequency of the wave v is unchanged while the photons prop-
agate through the surface of the media, after multiplying both sides of
the last expression by civ, we can write SNELL'S law:
Figure 4.46.
term n2 cos ()j tends to zero at ()j -+ 7r /2, whereas the term nl cos ()t
remains a finite value. The result for E~r) becomes E~r) -+ -E~j). In
a similar way EY) -+ - Er) at ()j -+ 7r /2. Hence, for grazing incidence
both components of the reflected wave obtain a change in phase of 7r
with respect to the appropriate components of the incident wave.
t) =cos(()j-()t) t) =cos(()j-()t}tanaj
E(t) E(j )
tanat=
Ell Ell
It can be seen that for the refracted, transmitted wave, the rotation of
the electric vector occurs towards the plane of incidence with respect
to the position of the electric vector in the incident wave, because the
factor cos(()j - ()t} leads to a decrease of the value of tan at. In turn, in
the case ofthe reflected wave the term cos(()j - ()t)/ cos(()j + ()t} increases
with tan ar , which means the absolute value of the angle a r increases
with respect to the angle aj of th e incident wave. The rotation of the
polarization direction occurs away from the plane of incidence.
4.5. For the given refractive indices at two wavelengths one can esti-
mate the constant B in the Cauchy formula . From n l = A + B/>..i and
Light Waves in Media 159
1 1) -1 >.21 >.2
B = (n2 -nl) ( >.2 - >.2 = (n2 -nl) >.2 ~2
2 1 1- 2
L.l.
. 0 =cl:1n
"0 SIn --
n 2vJ.l
or using the well known representation sin 0 by cos 0, the expression for
1:10 takes the form
1:10 = I:1n 1
n J(nvJ.l/c)2 - 1
The relativistic formula for energy e= £01 )1- v~/c2 , where £0 = 100
MeV is the muon rest energy, permits the following ratio for (vJ.l/c)2 :
(vJ.l/c)2 = 1- (£0/£)2 = 0.8. One may take the mean value of the refrac-
tive index as n = (nl + n2)/2 = 1.61 for n. Substitution of the numeri-
cal values in the expression for 1:10 gives 1:1() = (I:1n/nh/1.61 · 0.8 - 1 ~
6.10- 3 rad.
4.7. Let the proton run a short distance I:1l between two adjacent
points of its trajectory (Fig.4.47). Radiation emitted by the proton at
every point will group between two conical surfaces enclosing the angles
01 for red rays and ()2 for blue ones. Hence, in order to find the light
power we must calculate the light energy within a volume 1:1 V bounded
by the two ring-shaped areas and by the short line I:1l , which is shown
in Fig.4.47. We assume that these rings are rather small and have the
area (J. We represent (J by the radius R and the angular width 1:10 =
()2 - ()1, provided that 1:10 is rather small:
Al
Figure 4.47.
since TJ is the number of photons emitted per unit length. The .mean
light energy contained within 6.V is given by
Since light moves the distance 6.1 during the time interval T = 6.1/v,
where v is the mean phase velocity of light, the power may by expressed
as
N = E/T = 2rrvR26.()sinBTJh"iJ .
By definition, the solid angle 6.0 containing the radiation is equal to
6.0 = a / R 2 and the power N/),o within the solid angle 6.0 is equal to:
where n is the mean refractive index of the glass. Now we use the result
obtained in the previous problem: boO sinO = cbonl(n2~) and substitute
b.OsinO by cbonl(n2~) . Then we get for N/:;.n :
_ 21rc2 b.nryhlJ
N /:;.0. - -3 V.
n p
OPTICAL ANISOTROPY
163
O. Marchenko et al., Demonstrational Optics
© Kluwer Academic/Plenum Publishers, New York 2003
164 DEMONSTRATIONAL OPTICS
o o
a b
Figure 5.1. The double image of letter D observed throught a calcite crystal (a) ; if
the crystal is rotating the double image rotates too (b) . The dotted line 00' specifies
a spatial diagonal of the crystal.
crystal doubles. From this doubling of the image the name birefringence
was derived .
Fig.5.1 shows an image of the letter D observed through the faces of
the crystal. Let us draw a line 00' through two apexes of the crystal
and through geometrically similar points of the images (88') , as shown
in Fig .5.2,a,b. When the crystal rotates, the double image will rotate
in space in such a way, that the lines 00' and 88' stay parallel. This
simple observation indicates the complicated nature of light refraction
in the calcite crystal.
For observing double refrac tion in a crystal in another way, a narrow
parallel beam is directed normally onto a natural face of the crystal.
Two beams exit from the opposite side, parallel to the incident beam
o -, 0,
..... s ,
,s
-. ,
,
s' ., s'
,
,
0'
0'
a b
Figure 5.2. When rotating the crystal the double image turnes in a way that a line
ss' drawn through two similar points of the images remains parallel to the spatial
diagonal 00' , as shown for the cases (a) and (b).
OPTICAL ANISOTROPY 165
-
Figure 5.4. Testing the polarization
Figure 5.3. Spatial splitting of a nat- states of the ordinary and extraordi-
ural light beam into an ordinary and nary rays . Double arrows indi cate two
an extraordinary ray. orthogonal linear polarization orienta-
tions.
(Fig.5 .3). The ordinary beam passes along the direction of the incident
beam, and the second, extraordinary beam is shifted with respect to the
direction of the original beam. In other words the angle of refraction of
the extraordinary beam is not equal to zero, even in the case of normal
incidence . When rotating the crystal around the axis of the incident
beam the image of the extraordinary beam on a screen behind the crystal
moves in a circular path around the ordinary beam.
For natural, non-polarized incident light the two images of the out -
going beams have the same intensity. It is easy to verify that the output
beams are polarized in two mutually orthogonal directions. If we put an
analyzer in the outgoing beam, its rotation results in a periodic decrease
in the ordinary beam intensity and an increase in the intensity of the
extraordinary beam (and vise versa). The maximum intensity for the
ordinary beam corresponds to complete extinction of the extraordinary
beam (and vise versa) . The arrows in Fig.5.4 . indicate the orientation
of the polarization for both beams.
Now let a broad beam of light fall on the crystal to produce over-
lapping spots on a screen as shown in Fig.5.5. Rotation of the crystal
around the incident beam does not change the position of the ordinary
beam , whereas the image of the extraordinary beam will rotate. Insert-
ing an analyzer in the outgoing beams provides successive extinction and
brightening of images, but for the region where the images overlap the
screen brightness remains invariable (Fig.5 .5,b) .
This procedure confirms that the total intensity of both beams is
constant and equal to the intensity of the incoming beam 10. This beam
is decomposed into two beams of the intensities 10/2, both polarized
in mutually orthogonal planes . If the intensity of one beam after the
166 DEMONSTRATIONAL OPTICS
Ordinary
ray
Natural
light Extraordinary
Ordinary extraord inary
beam beam
a b
Figure 5.5. With normal incidence on a natural face of calcite, the ordinary ray
propagates along the direction of the incident ray, whereas the extraordinary ray is
deflected from this path (a) . Due to the deflection of the extraordinary ray a shift
between the centers of the outgoing rays exists. If the beam diameter is large , the
outgoing beams partly overlap .
analyzer varies as (10/2) cos2 a, then the intensity of the second beam
has to vary according to the expression (10/2) sin 2 a, where a is the
angle between the principle direction of the analyzer and the direction
of polarization of the first beam.
Now we will discuss the mutual spatial arrangement of the natural
faces of calcite in more detail. All its natural faces are rhomboids with
*1I
*
/
,/
/
/
a b
Figure 5.6. The spatial arrangement of the natural faces of calcite. The spatial
diagonal 00' is the optical axis of the cryst al (a) . A beam of natural light has still
no certain polarization state after passing throught the crystal along the optical axis
(b).
OPTICAL ANISOTROPY 167
Symmetry axis
o
o o
f4-- - 2.5.x. - -.-j
a b
Figure 5.7. The negative molecular ion C0 3 is composed of oxygen atoms 0 , at the
corner of an equilateral triangle, and of a carbon atom at the center of the triangle.
The ion C03 has a 3rd order symmetry axis normal to the plane of the equilateral
triangle.
the apex angles of 78° and 102° as shown in Fig.5.6,a. The spatial
diagonal DO' connects two opposite obtuse spatial angles. Cutting the
crystal in a way that the cut surfaces are orthogonal to DO' will result
in a cross section has the shape shown in Fig.5.6,b.
Let us place a crystal cut in such a manner into a narrow beam of
natural light . If this beam passes along the direction 00' (perpendicular
to the cutting faces), there will be no birefringent effect. This special
direction is called the optical axis of the crystal.
E
~->---- k
B
a b
Figure 5.8. Polarization of oxygen atoms by the electric vector of a light wave with
linear polarization. When the light wave propagates perpendicular to the symmetry
axis (a) the resulting dipole moment of the ion is smaller than the dipole moment
indu ced by waves propagating parallel to the symmetry axis (b).
Symmetry axis
Upper layer of Ca of 3rd order
extraordinary rays. The second case with the wave propagation along
the symmetry axis corresponds to the propagation of ordinary rays.
We should note that the contribution of the Ca atoms to the polariza-
tion is small compared with the contribution of the oxygen atoms. Our
considerations are still valid when Ca atoms and CO;; ions compose a
calcite crystal. Figure 5.9 shows three layers of the lattice. The upper
layer consists of calcium atoms. Every trio of such atoms compose an
equilateral triangle.
The second layer is composed of negative ions of CO;;. The next layer
also consists of calcium atoms, which are placed in another position than
in the upper layer, but each three atoms are also located in the corners of
an equilateral triangle. When we speak about the hexagonal system of
such a crystalline lattice we assume that there is a symmetry axis of the
third order oriented normally to the plane of each equilateral triangle
through its center. The lattice structure stays the same after rotation
by an angle of 1r /3 around the symmetry axis.
We see that the net polarization of atoms due to the action of a
linearly polarized light wave is higher when the electric vector is directed
perpendicular to the symmetry axis, i.e, when the wave travels along
this axis. Contrarily, the net polarization is smaller when the electric
vector is directed along the symmetry axis and the wave travels across
to this axis. This is the case of a wave travelling perpendicular to the
axis with a linear polarization in the plane parallel to the axis. The
170 DEMONSTR ATIONAL OPTICS
symmetry axis is therefore the main optical axis of the calcite crystal. A
smaller polarization is associated with a smaller value of the refractive
index (n e = 1.486) and a higher value of light velocity, i.e , with an
ext raordinary ray. A higher value of polarization connected with a larger
value of th e refractive index (no = 1.658) corresponds to ordinary wave
propagation.
Let us select a C0 3 ion in the crystal (Fig.5.1O,a). Its symmetry axis
DO' is parallel to the optical axis of the crystal. Let a light beam pass
the crystal along the directio n 88' oriented with an arbitrary angle with
respect to DO'. The oscillations of the electric vector of the ordinary
beam occur perpendicular with respect to the plane defined by DO' and
88' . This means that the electric vector E in the ordinary light wave
is paralle l to the molecular plane of C03. Since t he direction 88' was
selected arb itrarily, the fact that vector E is parallel to t he molecular
plane holds for every angle of incidence of the light beam at the crystal
face. Hence t he degree of polarization of the molecular ion C0 3 , t he
refractive index and the phase velocity for an ordinary beam does not
depend on the propagation direction 88' .
For an extraordinary beam the polari zation plane of E always coin-
cides with the plane containing the 88'- and OO'-directions (Fig .5.10,b).
Therefore, the polarization of the C03 ions depends critically on the an-
s' s'
s
~I I
s
~I I
i
I
0 0 1
a b
Figure 5.10. For a given direction of propagation SS' of a natural light beam , os-
cillations of the electric vector E of the ordinary beam always take place in a plane
parallel to that of the molecular ion [a}; oscillations of the electric vector E of the
extraordinary beam always take place in a plane normal to that of the molecular ion
(b) . In the latter case the net induced electric dipole moment p of the ion depends on
the angle between the direction of propagation and the optical axis of t he ion, 00' .
OPTICAL ANISOTROPY 171
gle between 551 and 00 1 and, hence, the refractive index and the phase
velocity of the extraordinary wave are also dependent on the propagation
direction.
Conclusion. For the example of calcite we analyzed the main fea-
tures of the birefringence phenomenon. Spatial anisotropy of the crys-
talline lattice of calcite is the main reason for its optical anisotropy. In
general, the anisotropy leads to two waves which propagate within the
crystal along two different directions. Only two directions within the
crystal exists for which ordinary and extraordinary ray will travel with-
out spatial separation. The first case is propagation along the optical
axis. Here the definition of ordinary and extraordinary rays loses its
sense, both rays propagate with the velocity of the ordinary ray. The
second case is the propagation perpendicular to the optical axis. Here
both rays propagate in the same direction, but with maximally different
velocities given by clno (ordinary ray) and cine. (extraordinary ray).
rotH
aD
= 7ft rotE = -aB
- (5.1)
at
The material equations (4.1) and (4.2) written for isotropic media in
Chapter 4, have the same forms here, except that the quantity cleo =
n 2 is dependent on the direction of propagation within an anisotropic
medium. For this reason the direction of propagation of the wave energy
does not always coincide with that of the wave front. As we have seen
in Chapter 4, the direction of energy propagation is determined by the
POYNTING vector S :
S=ExH
We define two unit vectors now: one is t, specifying the direction of
energy propagation, and the other, as before, is s, normal to the wave
front. The vectors t and s differ in space.
According to (4.7) and (4.11) we can write down the relationships
w
Ho = kS x Do and (5.2)
172 DEMONSTRATIONAL OPTICS
D x
s
z
Figure 5.11. In a Cartesian system x,y ,z the vectors E,D,t ,S and s are located in
the Oxz-plane, the magnetic vector H is perpendicular to this plane.
Now we substitute the magnitude from (5.3) for the magnetic vector in
(5.2) :
1
- 2- sx (Eo x s) = Do
V/L
The product s x (Eo x s) is then transformed as follows:
where Eo.L is the component of the electri c vector normal to the direction
of propagation, as specified by the unit vector s. Therefore, the two
vectors Do and Eo are not parallel, since, as it follows from the two last
relations,
1 n2
-2-Eo.L
V /L
= -2-EQ.i
C /L
= Do
We see that the vectors E and D are both at right angles to the magnetic
vector H (Fig.5.H). The latter forms a right angle with s as well as
with vector D . Thus Hand D are transversal to the direction of the
propagation s . The electric vector E is also at right angle to H, so
that E , D and s are coplanar. However E includes an angle a with the
direction of propagation, but E is at a right angle to the direction of
energy propagation denoted by the unit vector t. This means that light
energy propagates at the angle a to the normal of wave front s with
a velocity V r , which is not equal to the phase velocity v = cfn . It is
OPTICAL ANISOTROPY 173
s
Figure 5.12. Two systems of wavelets propagating away from the face of a negative
uniaxial crystal into the bulk of the crystal. The vector D lies on the wave front
formed by elliptical wavelets, whereas the vector E is outside the wave front . In
cont rast , both vectors D and E are on the wave front formed by spherical wavelets.
\ 0 Natural light
0'
Extraordinary Ordinary Direction of
ray ray optical axis
--
optical axis __~ optical axis __~
~ .
a b
Figure 5.14. In the case of a positive crystal two wavelets emitted by point 0 prop-
agate within the crystal with different phase velocities, so that the elliptical wavelet
is enclosed by the spherical one (a) . For a negative crystal the spherical wavelet has
less velocity than that of the elliptical one, therefore the spherical wavelet is enclosed
by the elliptical one (b) .
Canada
balsam
~/ n=1 .53
90·
Extraordinary ray
Figure 5.16. If two Nicol prisms are set into a natural light beam, one before the
other, the first prism will play the role of the polarizer, and the second that of the
analyzer. One may cross the prisms by rotating the analyzer around the optical axis
of the beam .
176 DEMONSTRATIONAL OPTICS
The extraordinary beam, which can pass the prism, turns out to be
completely polarized. Two NICOL prisms inserted into a beam of natural
light are generally used as crossed polarizers (Fig .5.16). The passing
beam is dislocated by a certain amount from the incident beam.
The WOLLASTON prism allows the creation of two orthogonally po-
larized beams as shown in Fig.5.17. A WOLLASTON prism is made of
two rectangular prisms glued along the hypotenuse in such a way that
its optical axes 01 O~ and 0202 are orthogonal. Ordinary and extra-
ordinary beams propagate in the same direction in the first prism . An
ext raordinary beam leaving the first prism propagates as an ordinary
one in the second prism, and vice versa .
The refractive index for the first beam in the second prism is higher ,
so it is deflected toward the upper face (see Fig.5.17), while the second
beam now enters a medium with a lower refractive index and is refracted
toward the opposite side. A light beam entering perpendicular to the
outer surface of the first prism splits into two perpendicularly polarized
beams deviating symmetrically from the original beam direction.
Ez
Dzi====t"J====t-t:===t--
I I
d I
I
I
I
z
z
a b
Figure 5.18. A set of plates modeling an uniaxial crystal. Two cases of propagation
of a linear polarized wave are possible: one is the propagation along the "optical axis"
of the plates and the other is the propagation transversal to this axis . In both cases
the electric vector E is parallel to the faces of the plates (a) . Dependencies of the
projections of the electric displacement vectors D'; and D z on z-coordinate are shown
in (b).
Let s be the dielectric constant of each plate. If the plates are in vacuum,
the dielectric constant of the medium in which they are immersed is equal
to eo.
o
~:h ~ ~
x a d z
~B
E
a b
Figure 5.1 g. A linearly polarized wave which propagates perpendicularly with re-
spect to the optical axis. The electric vector E is normal to the faces of the plates
(a) . Dependencies of the projections E., D. and E. on z-coordinate are shown in
(b).
(5.5)
y
0'
2 n 2d
n.L = e.L/eo = n2d + (n2 _ l)a (5.6)
(5.7)
where k = 2rr/ >. is the wave number in vacuum . As we have seen in
Chapter 3 the superposition of two waves polarized in two mutually
orthogonal directions will lead , in general , to an elliptic polarization of
the resulting wave.
It can be seen from (5.7) that in the case of a negative crystal, where
n e - no < 0, the phase velocity of the ordinary ray is smaller than
that of the extraordinary one. Hence, the phase of the ordinary ray is
retarded by 8 with respect to that of the extraordinary ray. Plates of
180 DEMONSTRATIONAL OPTICS
E(t ) = Ex(t)
V2 cos(wt) + yEoT
+ Ey(t) = xEoT V2 cos(wt + 0) . (5.8)
In particular, if
2n n
8 = >:(no - ne)h = 2nm + 2"
Detecting hom
45° "" II
O.
Emitting horn Wooden block
the end of the electric vector E(t) will undergo a circular rotation, be-
cause the relation (5.8) can be reduced to the following form:
J2 J2 .
E(t) = EoxT cos(wt) - EOYT sm(wt)
Symmetry plane
x/ /'/: / - - - - -_ 7
Cleavage plane / - 7..:r-
/
/ r>,L 'r-----~--
/
h / /
A / / ,' ./ //
/ t...... /
/ - ,- -- -=- - //
// / / / 0/
/ / , / / //
/ / / Optical axis: " / / /
/
--r---------h-/ /
/ /
/ '/
----------_/
/ I /
Figure 5.22. An elementary cell of mica has a symmetry plane , which contains the
optical axis of the crystal. The direction of the optical axis is deflected from the
cleavage planes of the crystal by a very small amount, which means that a light wave,
falling normal onto the mica crystal, propagates nearly normally to the optical axis .
~~
Mica plate
x
~'~ " , }J4 -plate
X
"
-
, I
, }J4-p late
LJ'
a b
Analyzer
Ana lyze r
O,"~, ~c ~ ~
,
.. ljj
ax"
W~
Opt ical axis .
~
.... ",- "
c d
Figure 5.24. Test of the polarization state of a light wave by means of a polarizer and
,\/4-plate. There are two ways to transform a linearly polarized beam into a circularly
polarized one by means of a '\/4-plate (a, b). A natural light beam passing through
the '\/4-plate does not change its polarization property which may be examined by
t he anal yzer (c); a circularly polarized beam can be transformed by means of the
,\/4-plate into a linearly polarized beam , shown by the analyzer (d) .
x
a
Figure 5.25. Three domains of a liquid crystal, each grouped around certain di-
rections "1, "2, "1 (a). Prolonged molecules of the liquid crystal grouped around
micro-groves existing on the surface of a glass pla t e (b).
Drop of
liquid crysta l Analyzer
Polar izer
/
Mercury
lam p
Diaph ragm
Heat Drop of
preventing water
filler
Figure 5.26. Setup for the demonstration of the optical anisotropy of a small drop
of liquid crystal. The drop of liquid crystal can lose its anisotropic property under
heating. To avoid this, a transparent heat blocking filter is inserted into the beam
before the glass sample with the drops , if a powerful light source is applied.
between two crossed polarizers and placed into a light .beam as shown
in Fig.5.26. IT the analyzer is absent, the images of two drops will be
identical. In the presence of an analyzer crossed with the polarizer, the
image of the water drop vanishes, whereas the image of the liquid crystal
becomes a color pattern, provided the direction of the grooves is 45° with
respect to the principal direction of the polarizer (Fig.5.26). The thin
film of the liquid crystal behaves similar to a stratified structure, or a
uniaxial anisotropic crystal. In general, this film creates elliptically po-
larized light from linearly polarized incident light. The phase difference
is determined mainly by the local thickness of the film. The outgoing
light rays show the same color after passing the analyzer for areas of the
film having the same thickness.
y
Analyzer Ordinaryrays
-,
~
----=--=-- - - - ._. -J _.- - _.
Extraordinary rays
x Liquid Screen
Horizontal crystal
slit
Figure 5.27. A setup for the demonstration of refraction of ordinary and exraordinary
rays by a liquid crystal prism . The distance between the prism and the screen is 4 m;
the objective, projecting the image of the vertical slit onto the screen , has the focal
length 50 em. The width of the slit is about 0.3 mm .
blue - - -
green _ _
yellow ---.
a b
Figure 5.28. Two spectra formed by the liquid crystal prism; one is caused by ex-
traordinary rays (a) , the second by ord inary rays (b) . Deflection of rays of both
spectra .increases from the bottom to top of the pictures. The distance Ay between
two identical spectral lines is approximately equal to 6 mm .
OPTICAL ANISOTROPY 187
rays are deflected by larger angles than the extraordinary rays, and for
this reason , two spatially shifted spectra are observed (Fig.5.28).
It should be noted that the quasi-crystalline state of a liquid crystal is
realized only within a particular temperature range . When heating the
liquid crystal film used in this demonstration by a flow of hot air (about
45°C in our case), the quasi-crystalline state breaks down and the liquid
will show the optical properties of an ordinary isotropic liquid .
5. Artificial birefringence
When a transparent isotropic material is subjected to an external
electric or magnetic field or to mechanical stress, it may become optically
anisotropic.
Optical axis
Filter
Screen
Polarizer Analyzer
Transparent
plastic sample
Figure 5.30 . Setup for the demonstration of the photo-elastic effect. A transparent
plasti c sample is placed between two crossed polarizers, one with a horizontal direction
of polarization (x-axis), the other with a vertical dire ction of polarization (y-axis) .
To prov ide a distinct image of the sample, an optical filter has to be inserted into the
beam .
to th e optical axis of the plastic sample and the analyzer is crossed with
the polarizer a dark band will be observed on the screen (Fig.5.31).
This dark band within the image corresponds to a mechanically isotropic
local medium or to a non-stressed region of the sample. Local regions of
the sample which correspond to the parts of the image situated above
this dark band are affected by pressing forces, whereas regions below
thi s band are mechanically stretched.
A L---.....,...-.~
Figure 5.31. The inverse real image of the sample placed between two crossed polar-
izers and stressed in the vert ical direction. The regions AB and B e above the dark
bands are associated with compressed domains within the sample, whereas t he region
be low these bands is associated with mechani cally stetched domains.
OPTICAL ANISOTROPY 189
Kerr ce
Filter
Lamp
A>. -"- I h A n
-v-
I U
'CY
V V
E
U
Polarizer Analyzer Screen
Figure 5.32. Setup for observation of the Kerr effect. A parallel light beam passes
trough the Kerr cell placed between crossed polarizers. The principle direction of the
polarizer is at an angle 45° with the flat plates of the capacitor. Due to the despersion
properties of nitrobenzene, a distinct dark field may be obtained with the use of an
optical filter .
(5.10)
+
E E
a b c
Crossedpolarizers
Electric
spark Kerr cell
y-~
M1~
Figure 5.94. A setup for the estimation of the relaxation time of nitrobenzene mole-
cules in an electric field.
OPTICAL ANISOTROPY 191
ilt = llc ,
were e is the velocity of light. Now, if L is the geometrical path needed
for light propagation from the spark to the capacitor via M I , M2, M3
and M 4 , the time T corresponding to L is given by
T = Lie .
The geometrical condition of the demonstrational experiment
L» 1
may be easily fulfilled. This condition implies that one can assume the
disorien tation process of the nitrobenzene molecules to start approxi-
mately at the moment of spark ignition, i.e, at the moment to. While
light is passing from the spark over the mirrors, the disorientation of the
molecules can be terminated, if L is large enough. If TO is the character-
istic time constant of the disorientation process , the inequality
T > TO
is valid in this case. By decreasing the geometrical light path L be-
tween th e mirrors, shifting the mirrors MI and M2 towards M3 , M 4 , the
condition
T~TO
can be provided. When this condition has been achieved the light beam
will pass through the KERR cell. Such a clear demonstrational experi-
ment is feasible, because the geometrical length L , corresponding to the
typical orientation time constant of TO ~ 10- 9 S, is equal to 30 em.
Crossed polarizers
KDP • crystal
He-Ne laser
Mirror
Cavity
o
Oscilloscope H.F. - Generator (600MHz)
Figure 5.35. Setup for the estimation of the velocity of light . The laser beam is
modulated with help of a KDP crystal.
For a K DP crystal cut along its optical axis and fixed between two
plates of an electric capacitor, the phase difference, arising between the
ordinary and the extraordinary rays, when an electric field is applied,
can be expressed as
C=KU , (5.11)
where U is the voltage applied to the capacitor plates and K is a constant
(d. (5.10)). This phenomenon is known as the POCKELS effect.
A crystal placed between the plates of a capacitor is called a POCKELS
cell. There are two apertures at the centers of the plates to allow light
to pass along the optical axis. Usually the cell is placed between two
crossed polarizers. The POCKELS cell is frequently used as a fast optical
shutter operating under the action of a high-frequency voltage.
We now consider a specific application of this optical shutter for the
creation of a train of short light pulses (the intensity of the light is mod-
ulated with a very high frequency). The arrangement for the demonstra-
tion of this effect is shown in Fig.5.35. The principle of the fast optical
shutter is based on the use of a high-frequency radio-wave cavity which
has the same function as the electric capacitor of the POCKELS cell. This
cavity is made of two coaxial cylinders as shown in Fig.5.36.
The high-frequency electromagnetic energy from a radio frequency
generator excites electromagnetic field vibrations within the cavity by
means of a loop fixed near the side wall of the cavity (Fig.5.36).
The distribution of the electric field is characteristic for standing
waves. The node of the electric field is on the right side, as shown
in the figure, and the antinode of the field is close to the left of the cav-
OPTICAL ANISOTROPY 193
ity, where a K DP crystal, cut along its optical axis, is fixed in a central
hole. The frequency of the field oscillations is 600 MHz.
Light from a He-Ne -laser (>. = 632.8 nm) passes a semi-transparent
plate, a polarizer, the K DP crystal and a second polarizer (crossed with
the first one) and then reaches a mirror, which can be moved along a
slide. The reflected light passes back through all the components and is
reflected to a photomultiplier by the semitransparent plate. The crossed
polarizers form the 'principle part of the optical shutter together with
the cavity and the crystal. We can obtain the relation for the intensity
of the light passing through the POCKELS cell, providing first that the
electric vector of the incident beam is at an angle 45° with the principle
direction of the first polarizer, and second that the absolute value of
phase difference in (5.11) is not larger than unity :
181« 1
This inequality is safely valid, because the value 8 = 7r / 4 is attained
when U ~ 8000 V is applied to the KDP crystal. Under the conditions
of the experiment IUlmax « 8000V.
Let AA' and BB' be the principle directions of the first and second
polarizer, respectively, as shown in Fig.5.37.
If U = a the anisotropy is absent and the oscillations of the electric
vectors for ordinary and extraordinary rays are in the directions x and y ,
respectively, and they have the same amplitude Eo. In the experiment,
the field of the He-Ne laser is polarized along the direction AA' of the
first polarizer. Then the field vector is projected onto the directions x
Electricfield distribution
KDP - crystal
1 cm
Loop
Togenerator
Figure 5.36. The arrangement of the resonance cavity for production of a train of
short light pulses based on the Pockels effect .
194 DEMONSTRATIONAL OPTICS
y y
A Eo Y
\ A B
B
,7~+--- - +-~:>
\ -, / /
\ / -, / /
-,
\
4~~" -,
, -, //
//
/ /
/ -, //
I -, X
,/I~S
1
, -,
-,
\ /
/ / , -,
\ X 8' I /
/
/
-, AI
X B
AI
a b
Figure 5.37. Illustration of t he opera t ional pr inciple of the opti cal shut te r. A K DP
cryst al is placed between two crosse d pol arizers with the principle directions AA'
and BB' . Oscillation s of th e elect ric vector of the light of a He-Ne - laser occur
along dire ct ion AA' (a). With a voltage applied to the crys tal, a small deflection of
th e elect ric vector from direction BB' by an angle {j occ urs, leading to light beam
transmission through th e shutter (b) .
and y . The superposit ion of th ese oscillations is linearly polar ized and
direct ed along AAI , so that the projection of the resulting vect or on BBI
is equal to zero and the light is blocked. With increasing volt age U a
phase difference 8 is introduced, and the projection of the ampli tude of
the resulting field in t he direct ion BB' is given by
E
.../2.sm8 ~ Eo-.../28
= Eo-8 8
Hence, the intensity of light passing through the crossed polarizers is
given by the expression
(5.12)
we find
I(U) = Io".2UJ cos2(21r ft) (5.13)
In our case f = 600 MHz. We find that the maxima of the intensity due
to (5.13) is Io",2UJ , when
and
T = 1/(2f)
we get
c = 2(L2 - L 1) = 4f(L2 - LI) (5.14)
T
Measurements show that £2- L1 = 0.125 m. So for the velocity of light
we find c ~ 4 . 6 . 0.125 . 106 = 3.0· 108 ta]».
6. Optical activity
6.1 Natural quartz
Natural quartz is an anisotropic uniaxial crystal. In order to demon-
strate its optical properties, let a beam from a source of white light pass
through a polarizer and an optical filter, and let it fall normally on a
thin parallel plate of quartz cut across its optical axis (Fig.5.38).
When rotating an analyzer placed after the plate, we find that the
direction of linear polarization of the transmitted beam is at a certain
angle relative to the principle direction of the polarizer, but the light still
remains linearly polarized. This implies that the quartz plate gives rise
196 DEMONSTRATIONAL OPTICS
Polarizer Analyzer
Filter
Lamp
Quartz sample
Figure 5.38. Setup for observation of the optical activity of quartz. A parallel beam
of natural light formed by an objective passes through the polarizer and then through
the quartz sample along its optical axis . The polarization direction of the transmitted
light beam is deflected with respect to its original direction which may be tested by
the analyzer. An optical filter is inserted into the beam because the rotation caused
by the quartz is very sensitive to the wavelength.
Figure 5.39. The natural modifications of quartz (one is right- • and another is left-
rotating quartz) cause a rotation of the polarization plane in two opposite directions.
In both cases the light beams propagate along the optical axes of the quartz samples
010~ and 0202, Both samples have identical shapes and sizes.
OPTICAL ANISOTROPY 197
Color A (nm) ¢
red 600 63°67'
yellow 570 68°13'
green 540 78°6'
violet 420 130°
d
¢= K.' A2
Quartz Analyser
prisms
He-Nelaser
Figure 5.40. Setup for repeating the Fresnel experiment with two quartz prisms, one
is cut from left- , the other from right-rotating quartz.
Figure 5.41. The operating principle of the telescopic system. A short line p within
the plane of the vertical slit is projected by the objective as a real inverse image p'
near the focal plane of the objective. This image is then transformed by the eye-piece
of the telescope into an enlarged real image p" on the screen. The objective is close
to the slit, whereas the distance between the eye-piece and the screen is about 9 m.
L
electric vector E . This sep-
aration is caused by two
right- and one left-rotating
quartz prisms. The first
boundary between adja-
cent prisms provides an ini-
Right quartz Right quartz tial spatial separation for
these circularly polarized
beams, whereas the second
boundary enlarges the sep-
aration.
Mirror plane y
.-.-.-....--
A~
UW
.- . . '
.- - _.-
a
--- - - - - - - a'
a b
Figure 5.43. A spiral and its mirror image. The point A of the first coil of the spiral
is reflected across the mirror plane to the point A'. Moving a short line a from point
A over the spiral causes th e line a to rotate clockwise, whereas its image a' rotates
ant iclockwise (a) . An electrical current i induced by the electric field E of the incident
wave has two orthogonal components i", and i y (the third i z component is not shown )
(b) .
E, E
J. ...
Detecting hom
Emitting hom
Spirals
Figure 5.44. Demonstration of the rotation of the polarization plane of a radio wave
by wire spirals. The distance between the emitting and detecting horns is about 30
cm . A 4 cm wide paper box contains the wire spirals and is placed between these
horns. The rotation of the detecting horn around its axis allows verification of the
polarization state of the transmitted wave.
202 DEMONSTRATIONAL OPTICS
Polarizer Analyzer
Glass samp le
Lamp
Filler
Magnet
Figure 5.45. The setup for observing the Faraday effect with a glass sample. The
sample is placed between the pole shoes of an electromagnet along the lines of the
magnetic field H . Due to dispersion, the angle of rotation depends on wavelength; an
optical filter is therefore inserted into the beam .
Figure 5.46. Two magnetic domains, - one is marked as the bla ck region , and the
ot her as the bright one - have magnetic moments m directed in opposi t e direction s.
Both vecto rs m are normal to the faces of th e thin film of ferromagnet ic material.
T wo possible directions of linear polarization E 1 and E2 exist in the outgoing beam,
both inclined with resp ect to the electric vector E of the incident wave. The bright
region is associ at ed with E 1 and the da rk one with E2 .
Eyepiece
of microscope
Objective of
Non-polarized microscope
light beam
, - , -, - , - , -~-
Eye
Film of FeDy03
Polarizer Analyzer
Figure 5.47. Setup for observing magnetic domains in a thin film of Fe D y03 by
applying linearly polarized light.
o 0.3 mm
a b
Figure 5.48. The patterns observed by means of a microscope with 70-fold magni-
fication . The pattern of the face of the ferromagnetic film in absence of an external
magnetic field (a) . When a weak external magnetic field is applied, the light and
dark regions of the original move, which may result in two regions, corresponding
to opposite directions of magnetic moment. One is large and dark, and the other is
bright and narrow (b) .
netic fields along the external magnetic field lines increases whereas the
number of the domains having an opposite direction decreases . The vis-
ible effect of such an orientational motion in the external magnetic field
will be an expansion of the boundaries of the local dark areas at the
screen, as shown in Fig.5.48,b .
SUMMARY
As we have seen, optical anisotropy is brought into existence by the
fact that waves of different polarization states propagate inside an opti-
cally anisotropic medium with different phase velocities.This anisotropy
was most thouroughly discussed for the crystalline lattice of calcite,
which possesses a symmetry axis of the 3rd order.
In general, the simplest model of a uniaxial crystal is a layer structure
with a symmetry axis. Such a layer structure model is sufficient for a
qualitative explanation of the phenomena of optical anisotropy of liquid
crystals and of artificial anisotropy of solid and liquid dielectrics.
Microstructures of bodies capable of rotation of the polarization plane
are more complicated. As a rule the positions of atoms and molecules
of such a substance resemble a screw line or a spiral. A number of
optically isotropic media show polarization plane rotation when inserted
into a magnetic field. This phenomenon is known as the FARADAY effect.
PROBLEMS
206 DEMONSTRATIONAL OPTICS
0'
o
-----_ .. _- ---------
Figure 5.49.
Figure 5.50.
5.1 A beam of natural light falls on a plane plate of calcite at the angle
()i= 45°. The plate is cut in such a way that its optical axis is normal
to the plane of incidence (Fig.5.49) . Calculate the angular separation of
the refracted rays, if the indexes for ordinary and extraordinary rays are
no = 1.658 and n e = 1.486, respectively.
Figure 5.51.
mate the difference b.n between the indices for left and right circularly
polarized waves for the two wavelengths mentioned above.
5.5. In the experiment with the liquid crystal prism described above,
two spectra are observed on the screen at a distance of b = 2 m from
the objective (Fig.5.52). The colored spectra formed by ordinary and
extraordinary rays are shifted in the vertical direction by a value of b.y =
0.5 em.
The prism angle is 7°. Estimate the difference b.n oe of the indexes for
ordinary and extraordinary rays in the liquid crystal.
a b
Figure 5.53.
I (U)
I I
E
U
a b
Figure 5.54.
the images shown in Fig.5.53. The first image (Fig.5.53,a) occurs if the
principle plane of the analyzer is parallel to that of light waves passing
through regions associated with a certain direction of magnetic moments .
When rotating the analyzer further , the contrast of the image de-
creases to zero, and then a new distinct image appears at a certain
position of the analyzer. This image is the inverse of the first one
(Fig.5.53,b). Estimate the difference D.n of the indices, corresponding
to left and right circularly polarized waves, at A = 600 nm for a film
thickness of d = 0.1 mm. The analyzer has to be rotated by an angle
D.r.p = 40° to come from one image to the other.
5.7. A linearly polarized laser beam falls through a POCKELS cell as
shown in Fig.5.54.
The direction of polarization of the laser beam is normal to the prin-
ciple direction P of the polarizer. It is well known that the intensity of
OPTICAL ANISOTROPY 209
M
Q
p
Figure 5.55.
the transmitted light beam will vary with the voltage U as follows:
I(U) = IoXU2
provided that U is rather small and I(U) = 0 at U = O. X is a pro-
portional constant. Nevertheless, in practice a linear dependency of the
intensity on the applied voltage is often needed . A simple way to pro-
vide the required linear dependency of the intensity variation is to place
a phase plate in front of the crystal (e.g. a quarter-wave plate). Explain
the action of the quarter-wave plate.
, p
Figure 5.56.
both edges) pass through the compensator and then through an ana-
lyzer. Derive a formula for the phase difference between extraordinary
and ordinary rays for a given wavelength >. and find the angle of the
principle direction of the analyzer and the positions of the dark and
bright bands of the field after the analyzer.
SOLUTIONS
5.1 Since the incident ray is normal to the optical axis, the angles of
refraction Bot, Bet for the ordinary and extraordinary rays are calculated
as follows:
nosin Bot = sin Bi
Substitution of the numerical magnitudes gives Bot ~ 25°B and Bet ~ 280 ,
hence the angular separation between these rays is 8 = 30 •
5.2. Taking the z--axis perpendicular to the optical axis of the quarter-
wave plate of mica, the orthogonal components of the electric vector Eo
of the incident beam may be represented as follows:
OPTICAL ANISOTROPY 211
Figure 5.57.
The ordinary and extraordinary rays are retarded by the mica plate:
2~ 2~
'Px = Tdno, 'Py = Tdne
Since mica is a negative crystal (ne- no < 0), the phase difference 6 =
'Py - 'Px is negative and (per definition for the quarter-wave plate) equal
to - ~ /2. Hence, the components of the electric vector after passing the
quarter-wave plate take the forms:
e; = ~ Eo sin(wt + 'P) ,
Ex = ~Eosin(wt+'P) and
This means that the outgoing beam has a linear polarization along the
same direction as the incident one.
If the quartz plate is replaced by a quarter-wave plate of mica with
its optical axis parallel to that of the first plate, the phase difference
provided by the second plate will be equal to -at /2. In this case the
resulting direction of linear polarization is perpendicular to that of the
incident beam (Fig.5.57) .
212 DEMONSTRATIONAL OPTICS
p
/
/
/
L /
Figure 5.58.
5.3. Let a linear polarized wave fall on the quartz plate at the point
a as shown in Fig.5.58.
The electric vector of the wave P may be represented by two com-
ponents Land R associated with the electric vectors of left and right
circularly polarized waves propagating within the quartz crystal. These
waves get the phase increments <h and <PR while propagating within the
crystal from A to point B (on the exit face of the crystal). If d = AB is
the thickness of the plate, the phase increments may be represented by
the following expressions:
2~ 2~
<PL = nLd-r <PR = nRd-r
where tit. and nR are the refractive indices for left and right circularly
polarized waves of wavelength >.. Due to the difference An = nR - »i:
the resulting electric vector P' composed of two components L' and R'
rotates at some angle <p just before leaving the plate. In the case pre-
sented in Fig.5.58 the component L' gets a smaller phase increment than
R', thus right rotation takes place. Since <p = <PR - <PL, the magnitude
of An can be estimated using
x
An = <P21rd
Substitution of the numerical values for sp, >. and d results in the follow-
ing: Anred = 2.7.10- 5 for red rays and Angreen = 2.9 .10- 5 for green
rays.
5.4. The linearly polarized beam falls normally on the face of the first
prism as shown in Fig.5.59, hence the angle of incidence of the beam on
the second, refracting face is equal to ().
Due to the difference An = nR - nL between the refractive indexes
nR and nL two refracted waves exist outside the prism: one is the right-
OPTICAL ANISOTROPY 213
Figure 5.59.
hand circularly polarized wave refracted with the angle D:R and the other
is the left-hand circularly polarized wave refracted with the angle D:L.
We can write the following expressions for the angles of refraction:
where nR > ni: The refractive face of the second prism is parallel to
that of the first prism , thus the angles of incidence of these circularly
polarized waves on the second prism are equal to D:R and D:L. Since
n.i. > nR is valid for the second prism, the refractive angles f3R and f3L
may be expressed in the following form:
uJ: -_ - f
sin- [nR
-- - -nL] -_ tan e [nR
- - -nL] -_ tan ()nh - ni
cos () »i. nR nt. nR nRnL
We assume that both indices tit. and n« differ from a mean magni tude of
refractive index no to a very small extent ±6.n = ±(nR-nL). Therefore,
by introducing 6.n and no, the expression for 8 may be transformed:
J:
u = tan
()nh - ni ~ tan
O(nL + nR)6.n
2 ~
026.n
tan - -
nRnL no no
214 DEMONSTRATIONAL OPTICS
Figure 5.60.
5.6. This problem is similar to that which has been solved in 5.3. and
we can use the result obtained above:
oX
~n = <fJ 21C'd '
L
,
Figure 5.61 .
I(U)
612
,0 P
I
6tpl
611
----- I
I
I
E I
I I
U
6U 6U
a b
Figure 5.62.
the two angles shown in Fig.5.61 have the same absolute value, provided
that the initial position of the polarization plane of the incident wave is
that shown by the vertical dashed lines. Hence, for the positive angle ip,
t1tp = 2tp is valid. Therefore, the value of t1n may be estimated from
the expression
where the term 82 is omitted due to 8 « l:i.cp, which also means l:i.U «
Uconst ' It is seen that under these conditions, variations of the intensity
occur practically linearly with that of the alternating voltage . It is clear,
that, for increased ti.cp, a certain variation of U provides a larger varia-
tion of I(U) (see Fig.5.62,b). Note again that with the phase plate for
U = 0, the transmitted intensity is not zero!
5.8. If the test plate has a retardation of nearly A/4 and the principle
direction of the polarizer makes the angle 45° with one of the axes of
the quarter-wave plate, then the transmitted light falling on the mirror
M will be nearly circularly polarized. For example, let this polarization
be right -handed. The light reflected by the mirror has a left circular
polarization state and will be transformed into a linearly polarized one
after passing the quarter wave plate again but with the direction of
polarization normal to the principle direction of the polarizer. Therefore
the observer will see a dark field. If the quarter wave plate is rotated,
there will be four positions per complete rotation where the field will
be dark. If the test plate is slightly thicker than A/4 but still placed at
45° from the principle direction of the polarizer, the light beam will be
slightly elliptically polarized after transmission through the retardation
plate, with the major axis of the ellipse normal to the principle direction
of the polarizer. The field will not become completely dark when turning
the imperfect quarter-wave plate.
OPTICAL ANISOTROPY 217
y y
p
b
a1 x
x
A
C ~h~
h2
c
a .. X
5.9. Let the x-axis be normal to the edges of the quartz wedges
(Fig.5.58,a), and let first a light beam of small diameter (e.g. a laser
beam) pass the arrangement. The amplitudes of the electric vector are
given by
If 0 = 7rm (m = 0, ±1, ... ) linearly polarized light will result, for which
the directions of polarization follows the relation
GEOMETRICAL OPTICS
219
O. Marchenko et al., Demonstrational Optics
© Kluwer Academic/Plenum Publishers, New York 2003
220 DEMONSTRATIONAL OPTICS
light beam. In this case the focus of the beam is located at infinity and
the wave front is a plane surface.
A plane mirror is an example of a simple optical device, in which
an image can be constructed with the help of homocentric beams. The
light beams SSI and SS2 originating from a point source S located at a
distance h from the surface of the mirror (Fig.6.2) will propagate along
the linear trajectories SlS~ and S2S;
s S'
2
a b
Figure 6.3. An astigmatic pencil of rays. A small element of a curved surface ABGD
is limited by circular arcs AB, GD and AD, BG. Cross sections GGE , AH Band
DFG, having the same radius of curvature, are mutually parallel and have centers
of curvature, located on the short line bb'. Another type of arc presented by parallel
cross sections AED, HOF and BGG is specified by a shorter radius of curvature.
The centers of curvature lie on the short line aa'.
with such properties is called a normal astigmatic beam. Lines aa' and bb'
are called the focal lines and the distance between them, R max - Rmin , is
the astigmatic difference. It is evident that a pencil of astigmatic beams
does not have a single focus point. However, as the astigmatic difference
is reduced, the focal lines will become closer to each other and their
lengths will become smaller . At the limit as r2 - rl --+ 0 the focal lines
will be reduced to a point as the astigmatic beam is transformed into a
homocentric one.
The origin of astigmatism is not linked strictly to light beams origi-
nating from a curved surface. In particular, this phenomenon takes place
under observation of objects through a thick layer of a transparent ma-
terial such as glass or water when the layer is not perpendicular to the
ray SS' (Fig.6.4). Let us demonstrate this optical effect by an example.
A point source S is located under a water layer of thickness h as shown
in Fig 6.4. The homocentric light beam emitted into the surrounding
water from source S is transformed into an astigmatic beam outside the
water due to refraction. Two outgoing light rays SM1 and SM2 , af-
ter refraction propagating outside the water along directions MIPland
M2P2 , will reach an observer positioned at a certain angle with respect
to the water surface. When rotating the picture out of the plane by a
small angle around SO, the rays will still intersect SO at points SI and
S2 , but C 1 moves perpendicular to the plane of the drawing. Therefore,
the ray pencil outside the water has two focal lines, a radial one S1S2 and
a meridian one formed by all points C1 . Now by variation of the angle
of incidence, point Cl will move along the curve SS' between point S
and point S'. This line is called the caustic line. In turn, when rotating
the picture plane around SO , the caustic line SS' will draw a surface ,
which is called the caustic surface.
"I n,
F
---L
-----------o
y ~
x f-----<~,---- f/---l. Xl
1------ X,----I+-''''---- X2---~
Figure 6.5. Refraction on a spherical surface . (a) The case of refraction by an op-
tically dense medium (n r > n,) . A light ray 8M is refracted by an optically dense
medium and intersects the main optical axis x at point 8' . (b) Definition of the
focal points F and F' of a spherical refracting surface (b). F and F' are located on
opposite sides of the refracting surface . Focal length IOF'1 is longer than IFOI. (c)
Mutual positions of distances z , Xl, x', X2 and coordinates of focii I, !' with respect
to the apex of the refracting surface are given, as needed for application of Newton's
formula .
where -<p is the angle between the main optical axis and the orthogonal
to the surface, eM. According to (6.2) the following relation between
the angles u, - <p and -u' is valid: nl(u - <p) = -n2(<p - u') , or
(6.3)
Let h be the length of the perpendicular from the point M to the main
optical axis, and Xl and X2 be the coordinates of points Sand S' , re-
spectively. Under the paraxial approximation for the small angle -<p
(please note ip < b) the expression -<p ::::: h/r is valid. In the same way,
for the small angles u one gets u ::::: -h/XI (u> 0 , Xl < 0), and for the
small angle -u' one gets -u' ::::: h/X2 (u' < 0 , X2 > 0). Substituting the
angles obtained by the expressions above into (6.3), one gets
n ni nr - ni
-r - - =
X2 Xl r
or alternatively
n21 1 n21 -1
- -- = (6.4)
X2 Xl r
after introducing the value of the relative refractive index nr/nil desig-
nated as n21. In this way we find (using the paraxial approximation)
that the coordinate X2 of the image position of the point source S' can
be expressed in terms of the quantities Xl, rand n21 . The value X2 de-
pends neither on h nor on the angles of incidence and refraction for light
beams with respect to the refracting surface. As a conclusion, it follows
directly that all paraxial beams emitted by a point source and confined
within the homocentric pencil cross the main optical axis at the same
point S' . In other words, a spherical refracting surface under the parax-
ial approximation transforms a homocentric incident light beam into an
outgoing homocentric beam . The quantity on the right-hand side of the
equat ion (6.4) is named the optical force:
p = n21 -1 (6.5)
r
(6.6)
226 DEMONSTRATIONAL OPTICS
1= _2- = r (6.7)
P n21-1
The following relations are valid for the principal focal lengths:
(6.8)
Substituting the expressions for the main optical lengths (6.6) and (6.7)
into (6.4) we find a relation between the focal lengths of the refracting
surface and the coordinates of the source and its image on the main
optical axis:
L+£=1
Xl X2
(6.9)
Xl = X + I; X2 = X' + I'
Substitution of the above relations into the formula (6.7) leads to:
s~---
_ . __._._._;~._._._ . .- ._.
o~ :;-'.
"j '--~r
a b
c d
Figure 6.6. Refraction of a parallel ray by spherical surfaces with positive or negative
optical force. The ray is refracted by the surface with positive optical force (a.b) .
When nr/ni > 1, the vector n makes a sharp angle with propagation direction x
(a). When ni/nr < 1, the vector n does not make a sharp angle with direction x
(b). Refraction of a parallel ray by a spherical surface with negative optical force
(c,d) . When nr/ni < 1, the vector n makes a sharp angle with direction x (c). When
n r/ni > 1, the vector n does not make a sharp angle with direction x (d) .
than the focal length outside this dense medium. For example, the focal
length of a glass spherical surface (n = 1.5) surrounded by air (n ~ 1)
is r . n2I!(n21 - 1) ~ 3 · r , where r is the radius of the surface and the
outer focal length is r- (n21 - 1) ~ 2· r. The ratio of the first value to
the second one is about the relative refractive index of the glass.
In the case of a concave surface, beams originally propagating along
the main optical axis will be refracted in such a way that they will never
intersect this axis. (Fig .6.6,c,d). One can state that a concave surface of
an optically dense medium always possess a negative optical force. The
unit vector n directed along the internal normal to the surface, as well
as the center of curvature C, will always be located outside the optically
denser medium. If n is directed along the propagation of the refracted
ray (Fig.6.6,c) the value of r should be treated as positive, in the other
case, where n is directed in opposite direction, the value of r should
be considered negative (Fig.6.6,d). The optical force will be negative in
both cases . In the case shown in Fig.6.6,c (r > 0) , the difference n r - ni
is negative, the relat ive refractive index of the optically dense medium
n21 = tuf n ; > 1 and the optical force P = (1 - n21)/r will be negative.
When the refracted ray propagates outside the optically dense medium
(Fig.6.6,d) the value r should be treated as a negative, whereas the
difference n r - ni is positive, which again gives a negative optical force
P = (n 21 -1)/r, where n21 = nr/ni' It should be noted that in the case
of a concave optically dense medium, the focal length belonging to this
medium will always be shorter than the focal length outside the medium.
For a glass spherical surface (n = 1.5) surrounded by air (n ~ 1), the
focal length is r- n2I!(n21 -1) ~ 2· r, where r is the radius of the surface,
and the other focal length (in air) is r . (n21 - 1) ~ 3· r.
Conclusions. The formula (6.5) obtained for the particular case of
a positive optical force is also valid for cases where the optical force is
negati ve or for cases of a concave spherical surface of the optically denser
medium. Formula (6.5) implies that for a given direction of the incident
beam and a given position of the apex, one can obtain the optical force
of the refracting surface and compute t he coordinates of the focii and
of the conjugated points. The sign of the optical force is dependent
on th e shape of the refrac ting surface (convex or concave) and on the
relati ve refractive index, which is specific to this surface. The signs
of the coordinates of the focii and the conjugated points follow from
expressions (6.6-6.7),(6.9-6.10)
c d
Figure 6.7. Real and imaginary images caused by a spherical refracting surface. A
surface of positive optical force gives rise to a real image of the object SP placed
behind the first focus of the surface (a) , and an imaginary image if the object is
placed between this focus and the apex (b) . Imaginary images produced by a surface
of negative optical force (c,d) . The object SP is located inside the refracting surface
between the second focus and the apex (c) , or outside the surface behind the second
focus (d).
minus sign indicates that the short lines SP and S' P' have mutually
opposite directions. Please note that this is true only using the paraxial
approximation. Using either the ratio Fa I SF = I Ix or NEWTON'S
formula we get the following expression for the linear magnification :
I x'
{3 = -; = - I" (6.11)
o x
Figure 6.8. Reflection by a spherical mirror. The positive direction of the z--axis
makes a sharp angle with the direction of light propagation. The center of the cur-
vature of the reflecting spherical surface C is always located in the region of negative
z . So the values Xl , X2 and / are all negative with respect to the apex.
will make a sharp angle u' with the optical axis x and intersect this
axis at point P. Let -i and i be small positive angles of incidence and
reflection , and Xl and X2 be the coordinates of the source S and its image
P , respectively. From the reflection law (4.15) we get
-i = i' . (6.12)
For the angle of refraction i' the following relation is valid:
i' = -cp +u'
where cp is a positive angle between the radius CM and the optical axis.
In turn, for the positive angle -i we can get:
-i = -cp+u .
Using these relations and the expression (6.12) we find:
u+u' = 2cp . (6.13)
The paraxial approximation allows the small angles in the last ex-
pression to be represented by the distance h , by the coordinates Xl , x2
and by the radius of the curvature r (the latter should be taken as a
negative, since the coordinate of the center C is negative):
h h h
'P;:::::: - - , u~--, u'~--
r Xl X2
Figure 6.9. A centered optical system . 00' is the main optical axis of the system .
F l is the first focus of the first spherical surface and F~ is the second focus of the n-th
spherical surface of the system. The coordinates of focal points and that of positions
Xl of the source S and that of images X2n are measured from their respective apexes.
The coordinates of the principal focii Fs,F; and the principal planes
H,H' of a system (Fig .6.11) allow the construction of the image gener-
ated by this centered optical system. Coordinates of the object, of its
image and of the principal focal lengths are taken respectively from the
first and the second principal plane .
Geometrical Optics 235
The position of the point source P in Fig.6.1l is determined by the
length M P = Xl. The light beam PM parallel to the optical axis
will pass the second principal plane H' at point M' and passes then
through the second principal focus F~ after refraction by the system.
Another beam originating from point P passes through the first principal
focus F s , intersects the first principal plane at point N, is refracted and
continous parallel to the optical axis (beam N N'). The intersection
point of the beams M' F~ and N' N gives the location of the image p '
of the source. From the triangles N P M and N FsH we get
HFs NH
MP= NM
Because
MP = -Xl and HFs = -Is ,
where Is is the first principal focal length of the system, we get
Is NH
-=-- (6.15)
Xl NM
In a similar manner, from the triangles M' p' N' and M' H' F~ we obtain
I~ M 'H'
(6.16)
X2 = M'N'
Is + I~ = M' H' + N H = 1
Xl X2 MN
Finally, we obtain
Is + I~ = 1 (6.18)
Xl x2
This expression is similar to expression (6.9). In other words, it means
that when the principal focal lengths and the source position are known ,
it is possible to find the location of the image using the paraxial approx-
imat ion. If the positions of the principal planes are known, it is possible
to use the method illustrated by Fig.6.1l for a construction of the image
without analyzing the beam behavior inside the system in detail.
236 DEMONSTRATIONAL OPTICS
4. Lenses
4.1 General relations
A lens is a centered system having two spherical surfaces (as shown
in Fig .6.12). Usually a lens is made of transparent material, for instance
glass. In most cases this material is optically denser than the medium
outside the surfaces. The direction of the main optical axis is specified
by the direction of light propagation. Let us introduce two Cartesian
systems, one (x,y) positioned with its origin at the apex of the first
surface , the other (x, if) with its origin at the apex of the second refract-
ing surface. These two spherical surfaces, each having a positive optical
force, will cause refractions of an incident ray as shown in Fig.6 .12,a.
This ray, propagating parallel to the main optical axis at a distance hI ,
is refracted by the first surface and crosses the optical axis at the sec-
ond focal point of the first surface (at the coordinate ff) . It then leaves
the second surface at a distance ba and intersects the optical axis at
the second focus of the centered system. The coordinate f~s is taken
from the apex of the second surface as a coordinate of the system (x, if).
Extending the incident ray parallel to the optical axis from the point
of incidence to a point of intersection with the refracted ray, one can
find the position of the second principal plane H'. The coordinate XH',
which specifies the position of second principal plane , has to be measured
from the apex of the second refracting surface. For given parameters f~
, f 2 and the distance d between the apexes of the refracting surfaces ,
one can find the coordinates f~s and X H' .
With the distance d between the origins of our coordinate systems,
we can link the coordinates x and x :
x=x+d (6.19)
The light ray intersects the main optical axis at point f~ of the (x, y)
system and, at the same time, at point Xl of the (x,if) system:
(6.20)
The point x~ , the conjugate of point Xl, has the coordinate f~s (the
second focus of the optical system) . Hence, formula (6.18) becomes
h/(f~ - d) + f~/f~s =1
when using Xl from (6.20) and x'l = f~s ' From the last relation, for f~s
we get
I.os, =
f'
2d
d - f~
+h - f~
(6.21)
Geometrical Optics 237
-----~- - - - ---
a
x
fI
1
b
x
Figure 6.12. The principal focal lengths and planes of a positive lens. Cartesian
systems (x , y) and (x,Y) are located at the first and second apex of two refracting
surfaces , respectively. An incident ray parallel to the optical axis intersects the optical
axis at the second focus I~ •. The intersection of the outgoing ray with the extension
of the incoming ray defines the second principal plane H ' (a). Another parallel ray
falling on the second surface intersects the optical axis at point 10. after refraction
on the first surface, and its extension defines the principal plane H (b) .
d+h (6.23)
fos = h d + 12 - f~
(6.24)
It should be noted that the coordinates of the first focus fos and of
the second focus f~s are measured from the apexes. It would be useful to
have the coordinates of the focii relative to the positions of the principal
planes : fs = fos - XH, f~ = f~s - XH'· Substituting the magnitudes
fos , f~s ' XH and XH' in the last expressions by (6,21 - 6.24) gives
hh fU2
fs = d+ 12 - f~ and f s' = - d+ 12 - f~
(6.25)
These expressions for the coordinates of the principal focii and the prin-
cipal planes for the general case of an optical system formed by two
centered refracting spherical surfaces are the basis for computations of
simple optical devices. Please note that all equations given in sections
6.3 and 6.4 are valid for general cases. It is not necessary that the re-
fractive indexes of the media have the same values outside the centered
system on the left and on the right side.
1
P=,=
fs
Geometrical Optics 239
According to (6.7) and (6.8), we introduce the optical forces of the first
and the second refracting surfaces:
(6.26)
Substituting these relations into the expression for the optical force of
the lens, we find the following formula:
(6.27)
A lens is called positive if its optical force P > 0 and negative in the
opposite case, P < O.
In a similar way, using (6.25) and the relations (6.26, 6.27), one can
find the relation
is = - ~ = -t; (6.28)
for the first focal length is of the lens. Hence, the focal lengths of a thick
lens are expressed by a single optical force. It should be noted that this
result follows from eq. (6.25), where the values is and i; are measured
from their respective principal planes H and H'.
Let us determine the coord inates of the principal planes of the lens in
terms of the optical forces. Following (6.24), (6.26) and (6.27), we get
H H'
r2 r1
-o- ._ .~ -_ ._ .~ ._ ._ . · ~ - _· _ · ~· _· _ · _ · - o -·
f1 f2 f• f~ f~ f~
Figure 6.13. A thick convex-convex lens. Positions of principal points are calculated
for a relative refractive index n = 1.5, rl/r2 = 3/2 and d/rl = 4/7 (a) . Construction
of an image using the principal planes and the focal lengths of the lens (b) . The dashed
lines indicate ray paths inside the lens obtained by the geometrical construction using
the principal planes and focii. The solid lines inside the lens body show the real light
paths.
I
H,H '
Figure 6.14. Glass ball lens. The given geometrical construction is valid for n = 1.5.
The coordinates of the focii from their respective apexes are both r /2, where r is lens
rad ius. Both principal planes are located at the center of the lens.
construct the geometrical paths of the refracted rays inside the lens as
shown in Figs.6.13, 6.14, 6.15.
H H'
_ . _ . ~ ._. _ ._ . _ ._ . _._ . _ . -o
._ .- ' - ' - , - , - , 0 · _ · -<)- - · -
f' f'
1 2
Figure 6.15. A thick negative glass lens composed of two concave surfaces . The
geometrical construction of ray paths were obtained for n = 1.5, Tl/T2 = 3/2, d/rl =
1/2 (a) . Construction of an image by means of the known positions of the principal
planes and focii (b). Dashed lines show the ray paths from the inner surfaces to the
principal planes , solid lines show the real light paths inside the lens.
'
J
f
08 - = - /' = h!I-hif = hiU~
= /=" 8 - if
08 8
(6.31)
In the case of thin lenses, for the optical force P = 1/ i~ of the lens one
gets
1 1
P = -- + - = PI + P2 (6.32)
!I i~
from the definitions of the optical forces of the refracting surfaces (6.27
). Therefore, for the focal length of a thin lens, the following relation is
242 DEMONSTRATIONAL OPTICS
valid:
~ = -!, = (n - 1) (~ - ~) (6.33)
fs fs rl r2
In this case one can regard a thin lens to be characterized by a single
magnitude f, the focal length of the lens. f follows from the radii of
the refracting surfaces and the value of the relative refractive index (eq.
6.31). As in the case of a thick lens there are different kinds of thin lenses,
each being a specific composition of two refracting surfaces. For example,
a thin lens with two convex surfaces will possess a positive optical force.
The focal lengths of the lens are shorter than the shortest focal length
of the refracting surfaces. A thin lens composed of two concave surfaces
has a negative optical force, and the focal lengths will follow the same
rule mentioned above. In the particular case of a thin lens with one
plane surface, the focal length will be determined by the optical force
of the spherical surface. A thin lens composed of two spherical surfaces ,
where one is a concave surface and the other is a convex surface (called
meniscus), can have a positive or negative net optical force, depending
on the ratio of the radii of the surfaces .
For correcting chromatic errors (see section 6.5) it is necessary to com-
bine two thin lenses, having different signs of optical force and consisting
of materials of different relative dispersion to form a lens with a certain
resulting optical force. Even such combinations, known as achromatic
lenses, can be treated in good approximation as one thin lens.
As the thickness of a thin lens may be neglected, one can consider
the ray passing though the center of the lens as not being refracted.
This property of thin lenses is often used for the construction of an
image created by a thin lens. It is seen in Fig.6.16,a,b that the images
are obtained by means of two rays , one passing parallel to the optical
axis, and other one propagating through the center of the lens without
refraction. The intersection of these rays gives the position of the image
point above or below the optical axis.
1------- x, --------'~-----
s
r,
t - - - - -- x, -----j
H.H'
Figure 6.16. Image construction for thin lenses having a positive (a) and a negative
optical force (b) . In both cases rays passing through the center of the lens are not
refracted .
a small angle Q with the optical axis. The first ray, falling on the lens
at point M , will int ersect the optical axis at the second focus F' . The
second ray, passing the lens center, is not refracted and forms the angle
a' = IFSI / IFOI with the optical axis. The third ray hits the lens at
point M' and will (after refraction) intersect the second focal plane at
point pl. All outgoing rays are parallel and intersect the second focal
plane (drawn through the second focus normal to the main optical axis)
with the same angle.
In contrast, in most applications using the paraxial approximation one
has the problem of obtaining an image caused by a system of parallel
rays. Following the rays ofFig.6.17 backwards, one sees that all incoming
parallel rays are refracted to point S in the first focal plane. The size of
the image of a far distanced object, such as the Sun , can be determined
if the angles of the incoming rays are known.
244 DEMONSTRATIONAL OPTICS
Figure 6.17. Image construction for rays coming from a point S on the first focal
plane of a thin positive lens. After refraction these rays are parallel (M ' P'IIM F'IIOF')
and form the angle Ct.' = IFSI / IFOI with the optical axis.
(6.34)
P2 P1
XH = d X r = _d (6.35)
P H P
For given focal lengths of the lenses fl, f{ and 12, f~ , the focal lengths
of the optical system can be calculated by means of formulae (6.21) and
(6.23) or by (6.25).
An optical system consisting of two thin lenses is used forf several
simple optical devices such as an eye-piece and different telescopic sys-
tems. An eye-piece is commonly used as an element of a telescope
or microscope to collect light from an extended primary image formed
by the telescope or microscope objective . The advantages of the two-
component eye-piece over a single lens is mainly the fact that it can
provide a higher correction of the imperfections of the lenses forming
the eye-piece (so called aberrations, which will be considered in section
6.5).
As examples let us regard the RAMSDEN eye-piece and the Huv-
GENIAN eye-piece , both built with two plane-convex positive thin lenses.
In the RAMSDEN eye-piece system shown in Fig.6.18,a, the first and
the second lenses have the focal length f and the distance d between
Geometrical Optics 245
the lenses is d = 2f /3. For the optical powers of the lenses we get
PI = P2 = 1/f. Using f{ = -II = f , f 2 = - h = f and d = 2f/3 ,
calculations by means of eqs. (6.21) and (6.23) give
-fos = f~s = f/4 .
In turn, the coordinates of the principal planes obtained for the given
parameters follow from (6.22) and (6.24):
-Xw=XH=f/2.
The positions of the focii and the principal planes with respect to the
components of the RAMSDEN eye-piece are shown in Fig.6.18,a. Let an
object be placed in the first focal plane of the eye-piece. One can regard
the top point of the object as a point source emitting a pair of rays .
One ray propagates parallel to the main optical axis, and the other ray
propagates at a small angle with respect to the first one. It is easy to
see an analogy between this point source and our considerations in the
previous section. Two parallel rays are formed by the optical system
after refraction of the incident rays.
According to the construction rules given in section 6.3, the ray paral-
lel to t he optical axis has to reach the first principal plane H (t he dashed
line, joining the inner surface of the first lens and H), t hen the second
one H' and then it has to intersect the main optical axis at the second
focus of the system f~. The inner part of the ray trace from H' to the
surface of the second lens is shown by a dashed line; the continuation of
this dashed line outside the lens is shown by a solid line. In a similar
way one can find the path of the ray, falling on the first lens at a small
angle, between the lenses. This ray must leave the lens system parallel
to the ray treated before. The real traces of the rays between the lenses
can be found by connecting the entry and exit points of the rays on both
single lenses.
The function of the RAMSDEN eye-piece is similar to that of a single
positive lens. The net optical force of this eye-piece, calculated by the
formula (6.34), is P = 4/(3f) > O. Hence, an equivalent single lens would
have the focal length 3f/4.
In the case of the HUYGENIAN eye-piece the first lens, being positive
and having the focal length 3f, is at a distance d = 2f from the second
positive lens with the focal length f. The positions of the principal planes
are given by
XH = d P2
p = 3f and X H' = -d p
PI
= -f ·
As a result, the principal focii F, F ' are located symmetrically from the
second lens at equal distances ±f/2 (Fig.6 .18,b). The total optical force
246 DEMONSTRATIONAL OPTICS
H' H
---- - -- --
H' H
f~
Figure 6.18. RAMSDEN (a) and HUYGENIA N (b) eye-pieces. The focal length of the
first lens /l , the distance between the lenses d and the focal length of the second lens
h fulfill the ratio /l : d : h = 3 : 2 : 3 for the RAMSDE N eye-piece, and 3 : 2 : 1 for
the HUYGENIAN eye-piece. The object line located in the first focal plane is a primary
image generated by the telescope (or microscope) objective.
Diaphragm
Figure 6.19. Setup for observation of the longitudinal spherical aberration caused
by a lens. A mercury lamp , emitting a set of bright spectral lines, is used . In order
to observe distinct focii of quasi-monocrornatic light , an optical filter is inserted into
the beam.
248 DEMONSTRATIONAL OPTICS
8f = /' - /
is called the longitudinal spherical aberration (Fig .6.20). The reason for
this behavior is that the lens has two spherical surfaces. Light is focused
into one point only for rays close to the optical axis - thorough Chapter
6 we have used the paraxial approximation up to this point. There are
surfaces which do not show longitudinal aberration. These surfaces have
aspherical shape (called eikonal, see later), but, for technical reasons,
most of the lenses used in applications still have spherical shape.
Longitudinal spherical aberration causes blurring of the image. Ele-
mentary beams falling on the lens at different distances from the optical
axis will be focused within the short length 8f. The cross section of the
Geometrical Optics 249
refracted beams by any plane aa' (Fig.6.21) will have a circular shape. A
cross section with minimal diameter will exist at a certain point between
the focii F' and F". In the case of a lens with a positive optical force
(converging lens) the value of the longitudinal spherical aberration is
negative: 8/ < O. For a lens with a negative optical force this quantity is
positive: 8/ > O. In the latter case the boundary beams are more weakly
refracted than the paraxial ones. This peculiarity is used to design op-
tical systems corrected for the longitudinal spherical aberration.
h1r
1.0
0.75
1_
0.5
0.25
6 r -1
"Figure 6.22. Illustration of the operating principle of an astigmatic lens . The max-
imal value of Of appears at hlr ~ 0.6, where h is the distance from the optical axis
and r is the lens radius. The dashed line shows the variation in Of for a single positive
lens.
ray, passing through the center of the lens from point source P, and the
intersection of peripheral rays.
In addition to the above described distortions of the pencils of beams
refracted by spherical surfaces, an ordinary astigmatism also takes place.
Astigmatism in optical systems may exist even for narrow pencils of
beams if they are propagating at a considerable inclination to the optical
axis. Astigmatic beams, arising as a result of refraction, are character-
ized by two focus lines, which have properties regarded at the beginning
of this chapter. Astigmatism of an optical system may be corrected by
special selection of the curvature radii and the optical forces of the re-
fracting surfaces. Optical systems corrected for astigmatism are called
anastigmatic.
I- --I
\-l -r
4.1 1 tJ B
a b c
Figure 6.24. Pincushion (c) and barrel (b) distortion, shown for the example of a
quadratic grid (a) . With pincushion distortion images of points outside the optical
axis are further from the axis of the beam (c), and with barrel distortion they are
closer to the axis of the beam (b). The linear amount of distortion varies with +h2 (c)
and with _h2 (b), the square of the distance h from the axis, thus nonradial straight
lines are imaged as curved lines.
Geometrical Optics 251
White light Red rays
I
---
. _ - _ . _ . _. _ . - . _ . _ . _ . _ . _ . _- -~ ~ ~~
-:;:;. - .- .-
~-,.;.,,=,
.
- -..
Longitudinal
chromatic
aberration
Figure 6.25. Axial chromatic aberration as the longitudinal variation of the focal
position of red rays FR with respect to blue rays FB. The longitudinal chromatic
aberration is the distance FBFR.
Image
White light
lateral
chromatic
Aperture aberration
Figure 6.27. Lateral chromatic aberration, or the vertical distance from the off-axis
image of a point obtained by red rays compared to that with blue rays . The aperture
is illuminated by white light .
The radius of the curvature of the refracting surfaces of the lens, and
hence its focal length, may be varied slightly by muscular contraction.
Such an adjustment of the focal length of the eye, called accommodation,
provides a sharp image of objects at an infinite distance as well as only
a few centimeters from the eye. Usually the light sensitive layer of the
eye (called the retina) is assumed to be at a distance of 17 mm from
the inner refracting surface of the lens, whereas the focal length for an
unaccommodated eye is about 15 mm. For a normally sighted eye having
such parameters, an object at a distance of about 25 em from the eye is
254 DEMONSTRATIONAL OPTICS
imaged on the retina. This distance is called the distance of best vision
(denoted by 10 in Fig.6.29)
p'
pI
1
Figure 6.30. A magnifier. An object P{' r", placed from the eye at the distance of
best vision 10, is observed at an angle u. A gain in magnification is provided by a
thin lens positioned close to the eye. The object PIP is now located near the first
focal plane and can be observed by the eye at the angle u /. The lens gives rise to an
imaginary, erect image P{PI at the distance of best vision.
6.2 Magnifier
The simplest magnifier consists of one positive lens (Fig.6.30). This
lens L is located in front of the eye so that the observed object PiP is
placed in the vicinity of its focus plane F, closer to the lens. In this case,
an imaginary and magnified image P~ P' appears, located at the distance
of best vision lo, so that the eye may observe it without accommodation.
Considering the lens to be thin, the beams Pi 0 and P~ 0 will pass the
lens without refraction. Therefore, the observation angle u' of the object
is the angle between the beams P~ 0 and P' O. Without the magnifying
lens, the object would be located at Lo and would be observed under the
angle u. The smaller the focal length of the lens is (the closer the object
is located to the lens), the higher the angle u' will be.
For determining the magnification of the magnifier f3, the ratio
-,- --
PiP / PiP
has to be estimated. Assuming that the object is located practically
in the focal plane of the lens, the following approximate value may be
proposed for the magnification:
f3 = p{P' = ~ ~ 25 (6.36)
PiP f f
where f is the focus length of the magnifier, expressed in em.
256 DEMONSTRATIONAL OPTICS
Figure 6.91. Formation of the object image by a system of parallel rays passing
through a telescope. Rays from opposite sides of the object (points A and B) fallon
t he telescope objective at the angle o . Practically parallel rays from point A, inclined
0/2 to the optical axis, form point A', positioned at the first focal plane of the eye-
piece. This point gives rise to parallel rays leaving the eye-piece at an angle f3 /2
> a/2 .
-E)
A
a
o
Figure 6.92. A remote object of ball shape is observed by the eye at point 0 under
an angle 0 « 1.
{3 = _o:foo (6.37)
t;
An observer , placing an eye accommodated to infinity behind the eye-
piece, can observe the enlarged image of the object. The larger the ratio
fob/ fep is, the larger the observation angle of the object image in the
focal plane of the objective will be. The construction of a telescopic
system allows a shift of the eye-piece along the optical axis, allowing
th e production of a real image of the object, for example, in the plane
of a photographic plate. In another case, where the eye-piece is shifted
closer towards the objective, th e eye-piece produces an imaginary image
at distance lo. In such a case, the action of the eye-piece is similar to
the magnifier treated before, and the image can be observed with the
eye accommodated to lo.
The image of the object is reversed when observed through such a
telescope. This does not matter when observing the stars of the sky, but
is troublesome when observing terrestrial objects. Therefore, terrestrial
telescopes are practically always used with a combination of prisms or
lenses which turn the image upright.
The objective of the GALLILEO telescopic system consists of a long
focus achromatic lens and a short focus negative lens acting as the eye-
piece (Fig.6.33). The negative lens is mounted in such a way, that its
first focus coincides with the position of the focal plane of the objec-
tive. Advantages provided by an eye-piece with negative focal length
are a shorter total length of the telescopic system and an upright im-
age. A disadvantage is the more difficult correction of imaging errors .
258 DEMONSTRATIONAL OPTICS
Figure 6.33. The GALLILEO telescope. Parallel rays from a remote object falling on
the objective at small angle to the optical axis 00' form a primary image A' B' on
its focal plane . The latter is fitted with the focal plane of a thin negative lens. Point
A' gives rise to two parallel rays leaving the lens.
Such devices are mainly used for small magnifications and lower quality
requirements, e.g. for opera glasses.
reflecting telescopes, different second mirrors are often used for varying
the net focal length of the system of both mirrors.
A principle advantage of the reflecting telescope is the absence of
spherical (in case of paraboloidal shape) and chromatic aberrations in
the mirrors. The primary mirror of a reflecting telescope can be made
larger than the glass lenses of the objective of a refracting telescope ,
since optical inhomogeneities in a mirror block are of no significance .
As the magnification is given by the ratio of the focal lengths of the
objective mirror and the eye-piece lens and has nothing to do with the
mirror diameter, why are astronomers interested in using large-diameter
telescopes? The reason is that the resolving power increases with diam-
eter (see Vol II, Diffraction) and that the point brightness increases with
the square of the diameter of the primary collecting element . In such
a way, a large scale telescope allows the detection of weaker stars with
higher angular resolution.
6.4 Microscope
A microscope is designed for the observation of tiny obje cts when
a significant magnification is needed, that is substantially more than
an ordinary magnifier may achieve. The principle construction of the
microscope comprise a very short-focus objective and an ocular in the
role of the magnifier (Fig .6.35).
An object subjected to observations by a microscope is located close
to the focal plane of the objective, which produces a real magnified image
260 DEMONSTRATIONAL OPTICS
p"
--::~::::>::::::---- Eye
sLiJ~~~~~n:
pi
I-• - - - - - - 10 - - - - - - -
Figure 6.35. Microscope. A short line SP located near the focus FOb of the objective
is imaged on a plane close to the focus F ep of an eye-piece. The latter forms an
enlarged erect image. Point p' of the primary image is imaged to point P" so that
the final image is at the distance of best vision 10 from the eye.
Figure 6.36. Effect of lowering the diameter of a pencil of rays . The image of an
extented object is formed by the lens (a) . Point A located in the plane P is imaged
to point A' on the plane pI, whereas point B, being out of the plane P , is imaged to
point B ' outside of plane r'. The pencil of rays forms a circle in planee: Action of
a diaphragm D (b). A reduction in the diameter of off-axis rays by the diaphragm
D leads to a decrease of the size of the circle in the plane P' .
Entrance
Aperture
slop
I
pupil
Field
stop
- _._
-~
Field stop
a
Field stop
- -
I Exit pupil
b
Figure 6.39. Field lens in eye-pieces. Geometrical paths of rays through a telescope
with the RAMSDEN eye-piece (a) . The vertical arrow indicates one half of the image
of a remote object. Light rays through the top point of the arrow pass through the
aperture of the field lens, then they form two outgoing parallel rays . Geometrical
paths of rays through a telescope with the HUYGENIAN eye-piece (b). The vertical
arrow , showing one half of the image of a remote object (at its position without the
eye-piece), is located at first focal plane of the eye-piece. Incident rays which pass
through the field lens aperture in a way, that their extensions would intersect at the
top point of the primary image, form outgoing parallel rays .
n = nCr)
In the frame of geometrical optics, we understand a medium to be inho-
mogeneous if noticeable spatial variations of the refractive index n occur
over distances greatly exceeding the light wavelength. Even when treat-
ing the shortest space interval as mathematically infinitesimal, the length
of this interval is much larger than the wave length (and we assumed
,\ = 0). Nevertheless, this interval should be shorter than the character-
istic length of noticeable spatial variations of the refractive index. Then,
Geometrical Optics 265
over very short spatial intervals, we can assume n(r) :::::: const, and within
this characteristic length the behavior of a light wave is similar to that
inside an uniform medium.
Let us regard an infinitely small path element dl between two points
of the light beam trajectory with coordinates ro and r (Fig.6.4D):
I dr I = I r - ro I = dl
The unit vector dr / dl will be the tangent to the trajectory at the point
roo For this infinitesimal part of the beam trajectory, the light wave can
be regarded as a plane wave with the wave vector k oriented tangentially
to the trajectory, fulfilling
where s is the beam vector and ko = 27T/ "\0, where ..\0 is the wavelength
in vacuum. Using the beam vector, it is possible to represent the electric
field of this plane wave at point ro in the following form:
dr = idx+jdy+kdz ,
where i, j , k are the unit vectors associated with the Cartesian system
(x , y , z). The same is valid for the scalar function
as as as
dS(r) = ax dx+ aydy+ azdz ,
where
as. as. as as
gradS = -l+-J+
ax ay
-k=-
az ar
is the gradient of S at point r. The expression (6.41) shows that dS
is represented by a scalar product of two vectors: gradS and dr. Now
consider P and Q to be two points on a surface S(r) = C , where C is
a const ant. These points are chosen so that Q is at distance dr from P.
Moving from P to Q, the change dS(r) in S(r) = C will be still zero,
since C is a constant. Hence,
dS(r) = ds-qrad S = 0 ,
For a given dS, the distance Idrl is a minimum when dr is chosen parallel
to grad S , or parallel to the normal vector of the first surface, drawn at
P. In another words , for a given distance Idrl, the change in the scalar
function S is maximized by choosing dr parallel to grad S. Thus grad S
Geometrical Optics 267
grad S(r)
a b
Figure 6.41. Geometrical representation of the gradient of scalar function S(r). The
vector r is drawn to point P on the surface S(r) = C with fixed Cj the vector dr
drawn from P to Q can take an arbitrary position on the surface , so that dS(r) = OJ
grad S(r) is parallel to the unit normal vector n drawn at point P (a). Two surfaces
S(r) = C1 and S(r) = C2 separated by an infinitesimal distance IP Rlj the vector dr
drawn from P to Q can be chosen arbitrarily while Q belongs to the surface S(r) = C2j
the minimum of Idrl is obtained for dr II gradS(r). The vector gradS(r) is normal
to S(r) = C 1 at point P (b).
becomes the vector pointing in the direction of the fastest space rate of
change of the function S.
When applying the differential forms considered above to the problem
of light propagation within an inhomogeneous medium, it is assumed
that variations in the refractive index occur smoothly. Particularly, it
means that any small element of the surface S(ro) = C around a fixed
point ro must be regarded as a plane. Using the given ko, according
to the definition of the eikonal (6.40), the product koS(r) represents a
phase within an element of a plane wave front; then the change in phase
over the interval dr is represented by
kodS(r) = kodr·gradS
Now, one can regard the phase of the light wave to be changing over the
same infinitesimal interval dr, similar to that in case of a plane wave in
(6.39):
t::.cp = konsdr
The last expressions enables one to get the following:
kodr ·grad S = konsdr
Hence, for the gradient of eikonal, the final equation can be written as
ns = grad S (6.43)
268 DEMONSTRATIONAL OPTICS
The eikonal equation represents the fact that the light beam vector s is
directed along the shortest path Idrl joining two points of wave surfaces
S = C1 and S = C2. In other words, the light trajectory specified by the
vector s, penetrating surface S = C1 at point rl, is directed along the
normal to the surface S = C 1 . Therefore, the vector grad S(r) and the
vector s both have the same direction.
When considering light beam vectors at two adjacent points, one can
now take into account changes in the direction of these vectors caused by
variations of the refractive index n = n(r). For an infinitesimal element
dl between two points r1 and r2, let the vectors 81 and 82 make a small
angle a (Fig.6.42). As we assume that the refractive index changes
smoothly, it is enough to take into account the first order derivative
dnfdl. From the eikonal equation we get
d(ns) d as a dS (6.44)
"dl = dl Or = or df
Now,
dS as or
dl or Ol
Since ar/Ol = s and as/or = ns, the derivative dS/dl can be expressed
as dS/dl = n. Substituting this result into (6.44) we get
d(ns) an
~ = or = grad n . (6.45)
As lsi = 1 it is evident, that the following relation is valid for the small
angle a:
a = Idsl = Idsl (6.46)
lsi
The non-zero value of Idsl indicates thatthe light path between rl
and r2 has to be characterized by a curvature radius R, originating from
Geometrical Optics 269
grad n
--=----..
Figure 6.43. TI ~ relationship 1/R = (ngradn)/n. The vertical vector gradn makes
a sharp angle with the light trajectory; the latter is represented by the arc of a circle
with the curvature radius R and center C. Two adjacent beam vectors Sl and Sl
drawn from the ends of an infinitesimal light path dl make a small angle 0 . The inner
unit normal vector n of the light trajectory is parallel to the vector ds .
Idsl = dljR
As the normal unit vector n at the point ro has the same direction as
ds, the last equation can be written in the following vector form:
ds n
= (6.47)
dl R
Differentiation of the left-hand side of expression (6.46) gives
d(ns) dn ds
----;{l = diS + n dl (6.48)
Heigth
"0
S~dS
S4'--~ ds
•orad "
~ =n gradn (6.49)
R n
The left-hand side of the last equation is never negative ; therefore, vec-
tors nand gradn form a sharp angle. In turn, according to (6.49), the
unit vector n is always directed along a positive increment of the beam
vector ds. Therefore , this positive increment ds always makes a sharp
angle with the vector grad n, and the latter always points towards the
direction of increasing refractive index. Hence, the light beams prop-
agating in the optically inhomogeneous medium are bent towards the
region of increasing refractive index.
For example, let an inhomogeneous medium be characterized by a
vertically decreasing refractive index (Fig.6.44). For a given no < nl,
there is a set of horizontal lines, each drawn over a fixed increment
~n < 0, so that the gradient of the refractive index points downwards
in the picture. At first let the beam vector 81 make a sharp angle with the
gradient vector. An infinitesimal increment ds of 81 gives a new vector
82. This vector, making a sharp angle with gradn, is directed towards
the region of increasing refractive index and results to the following
vector 83, and so on. Hence, a light trajectory composed by a sequence
of vectors 81, 82, 83, 84, will be bent towards the region with a larger
refractive index.
glycerine (nl = 1.5) and carefully covered by water (no = 1.3). The
relatively sharp boundary between water and glycerine soon disappears
because of diffusion and a liquid with a continous variation of the re-
fractive index appears: at the bottom of the cell n ~ nl and at the top
n ~ no. The refractive index gradient in this optically inhomogeneous
liquid is a vertical vector directed downwards. While passing through
this liquid, a thin horizontal pencil of beams bends in the direction of
the larger refractive index, as seen in the Fig.6.45,b.
! Optical
density
! grad n
Figure 6.45. A medium with spatially varying optical density (a); bending of a laser
beam propagating through an optically inhomogeneous liquid (b) .
7.2.1 Mirages
The natural phenomenon of a mirage is explained by the bending of
light beams in inhomogeneously heated air (Fig.6.46,a). The layers of
the hotter and less dense air are located near the surface of the hot
desert sand . The temperature drops with altitude, whereas the density
increases. The refractive index of air, proportional to the local density,
varies smoothly from a certain minimal value at the sand surface up to
larger values within higher layers of air. In this case, the refractive index
gradient is directed vertically upwards ; therefore, a horizontal light beam
is bent upwards from the heated surface (Fig.6.46,a,b).
Light beams originating from a bright region of the sky near the hori-
zon pass to an observer along a curved trajectory with its lowest part
passing in the vicinity of the surface of the earth. Therefore, an apparent
image of this region of the sky is located close to the earth's surface, at
the (dashed) extension of the line of sight. The illusion of a shining water
surface arises. The same illusion occurs for highway asphalt heated by
272 DEMONSTRATIONAL OPTICS
grad n
Light ray
_II __
Heated surface
a b
Figure 6.46. Mirage caused by a heated surface (a) ; mutual positions of the light
path and the gradient of refractive index (b) .
~ Ray of sight
_ Igrad n
- - - - - - - - -- - - . - Li~
1
Cold water surface
a b
Figure 6.47. Mirage caused by a cold surface (a); mutual positions of the light path
and the gradient of refractive index (b)
~~
AtrnOSPh ere
Figure 6.48 Astronomical
Earth
refraction.
J J~~ = J
B B B
As the integration path was arbitrarily selected, the value of the integral
on the left hand side of the last equation is equal to the eikonal difference
of the points A and B, and does not depend on the integration path.
Introducing an infinitesimal element of the optical path ndl let us
compare an integral over the optical path for an arbitrary curved tra-
jectory
J
B
ndl
A
with the integral (6.50). The scalar product nsdr may be expressed in
terms of the infinitesimal element of the integration path in the following
manner:
nsdr = ndl cos(;,di-) ,
where (S4r) is the angle between the vectors s and dr . It is evident,
that
nsdr = ndl cos(;,di-) ~ ndl .
Therefore, the optical path along an arbitrary curve is always larger
than the optical path corresponding to the light beam trajectory:
J J
B B
nsdr:5 ndl
A A
274 DEMONSTRATIONAL OPTICS
a b
Figure 6.4g. FERMAT 'S principle . Curved lines with arrows show light trajectories
(a) . Two optical paths between fixed points A and B (b) . The optical path along the
shortest path ACB is associated with a light trajectory.
Equality between these optical paths is attained only in the case when
the integration path coincides with the light beam path. Let us compare,
for example, the curve ACB associated with the optical path of light ,
and another arbitrary path ADB, which does not coincide with the
first one (Fig.6.49,b) . It is clear that the net path-length over ADB is
longer than that over ACB. Therefore, light beam propagation always
occurs along the shortest optical path. An element of the optical path
is connect ed to the time interval dt via the velocity of light in a medium
v:
dl = vdt .
Therefore, light propagation between two points takes place in the min-
imal amount of time .
SUMMARY
Geometrical optics uses the concept of light rays as mathematical lines
along which light propagates. Nothing is presupposed about the nature
of light with the exception that no deviations from light paths due to
effects like diffraction takes place. Light pencils are considered here as
a set of independently propagating rays obeying the well known laws
of straight line propagation within a homogeneous medium, as well as,
refraction and reflection laws at the boundary between two media.
Basic laws of geometrical optics may be derived from the wave concept
of light for the limiting case .A -- 0, when the wave nature of light
becomes non-essential. For most practical problems , the approximation
in terms of the geometrical optics gives excellent results. This fact is
due to the very small wavelength of light compared to the dimensions
of the optical elements usually applied (mirrors, lenses, diaphragms and
so on) .
Geometrical Optics 275
PROBLEMS
6.1. In order to get a rough estimate of the dispersive properties of
glass used for the production of lenses, the quantity
1:::.= nF-nC
iio -1
is generally applied, where nc, no and nF are the refractive indexes
for the FRAUNHOFER C, D and F lines (>" = 659 nm, >.. = 589 nm and
>.. = 486 nm). Derive a formula for the change of in the focal length of
a thin lens caused by a change in the refractive index on of the glass.
The quantity on should be expressed by 1:::..
6.2 Derive the formula for the focal length of two thin lenses separated
by d. (Fig.6.50). The focal lengths of the lenses are fa and fb, respec-
tively. Consider the case of two positive (a) and positive and negative
_._.--- _._.-p-'~
lenses (b).
;4 d ..;
-L_' ~i .=:1._
Figure 6.50.
6.3. Two thin lenses of the focal lengths !l and 12, respectively, are
used for designing a simple achromatic system to roughly reduce the
longitudinal chromatic aberration. Derive a formula for the distance d
between these lenses, which provides conditions of achromaticity of the
system for wavelengths in the region between blue and red FRAUNHOFER
lines (see. Problem 6.1). Discuss the particular case both lenses are made
of an identical material.
from flint glass. Both lenses are in contact. For the given focal length
of the achromatic doublet, f = 50 em, calculate the focal lengths of
both lenses, using the following values of the refractive indices of crown
n Ce) and flint n(f) glasses: : n~) = 1.530, n~) = 1.527, n~) = 1.525,
n~) = 1.575, n~) = 1.565, n~) = 1.555 (the letters F,D,C refers to the
corresponding FRAUNHOFER lines mentioned in Problem 6.1).
t ~
_ .- ._._._ .-F,F' ~ \~
o-.- ._._._. _._._._._._ .- d-. -
-l+-----f,J!
Figure 6.51.
Figure 6.52.
Figure 6.53.
-~. rl
. .
Figure 6.54.
--~"
-~- - _
--=:----------------
r2
Figure 6.55.
6.9 The divergent meniscus shown in Fig.6.55 has two spherical re-
fracting surfaces; r2 = rI/4 and the index of the glass is n = 1.5. The
separation between the surfaces is d = r2/2. Calculate all magnitudes
as in problem 6.8, and construct an image of an object placed on the
second focal plane of the lens.
SOLUTIONS
6.1 From the expression for the focal length f of a thin lens (6.33),
one can obtain
1 = (..!..+..!..) ,
fen - 1) rl r2
where rl, r2 are the radii of the spherical surfaces of the lens. One can
see that the quantity fen - 1) does not dependent on the wavelength;
therefore , the variation o(J{n - 1)) = 0, or (n - l)of + fon = 0 may be
written as follows:
of on
f + (n-l) = 0 .
For estimating the longitudinal chromatic aberration of the thin lens,
the quantity on/en -1) may, roughly, be replaced by the quantity t1, so
that the following approximation for the distance of is valid:
of=ft1 ,
where 8f is the distance between the images obtained by parallel red
and blue rays.
6.2 Focal lengths of thin lenses are usually measured from the apexes
and , for this reason, we use the expressions (6.21) and (6.23) for the
Geometrical Optics 279
coordinates of focii f~s and fos of the system consisting of two thin
lenses:
d- fb , d- fa
fos = -fa d - fb - fa fos = fb f f
d - [b - [b
or
fos = fafb + fad
fb + fa - d fb + fa - d
/,' = fafb fbd
as fb + fa - d fb + fa - d
Further, XH = fad/Ub + fa - d) and XH, = -fbd/Ub + fa - d) are the
coordinates of the principal planes Hand H' measured from the apexes
of the first and second lenses, respectively (see (6.22,6.24)). In turn,
the coordinates of the focii measured from the points XH and Xw are
fs = fos - XH and f~ = f~s - XiI. Using such a representation of the
coordinates of the optical system, the focii fs and f~ may be expressed
in the form:
1 lId 1 lId
------+- -=-+---
[» - fb fb fafb ' f~ fb fb fafb
Figure 6.56.
Since both representations are equivalent, we will use last form, as-
suming that the appropriate coordinates of the principal planes H, H'
are known. The focal length f may either be positive, if fa + fb - d > 0,
or negative, if fa + fb - d < O. In a particular case where d = fa + fb,
280 DEMONSTRATIONAL OPTICS
a parallel beam passing through the system will still stand parallel, as
shown in Fig.6 .56. In the case of a combination of a positive lens ( fa)
and a negative one (- fb), several special cases may also appear. For
example, if d = 0 and fa = - fb' the action of both lenses cancels out
and no refraction occurs. The same is true for fa - d = - fb .
6.3 Let us represent the focal length f of the system of two thin lenses
in the following form (see. Problem 6.2):
1 fI+h-d 1 1 d
7= fIh = fI + 12 - fIh .
The change of due to changes of refractive indices will be equal to zero
if of/ f = O. The variation of the right-hand side of the expression for
1/f gives the following:
of = ofI + 012 _ dofI _ doh _ 0
f fI 12 If 12 fdi - ,
or
of = ofI + 012 _ .s: (OfI + 012) = 0
f fI 12 fIh fI 12
By introducing the quantities of the longitudinal chromatic aberration
for both lenses, 6. 1 and 6.2, respectively, we can substitute 6fdh by
6. 1 and 612/12 by 6. 2, which gives
d = 126.1 + fI6.2
6.1 + ~2
for the appropriate distance d. In the particular case where both lenses
are made of the same material with ~1 = 6.2, the separation d = (fI +
12)/2 is roughly independent of wavelengths, at least between the blue
and red Fraunhofer lines.
Here .6. 1 belongs to the positive crown lens and .6.2 to the negative flint
lens. For the focal lengths, one then can get
6.5 Let the laser beam fall on the first mirror M 1 at point A, as shown
in Fig.6.57. Since the incident beam is parallel to the optical axis of the
system, the reflected beam will intersect this axis at the focal point F . In
turn, the beam propagating from point F to the second mirror M2 will
pass parallel to the optical axis after reflection at a distance hI from this
axis. Because the focal length of a spherical mirror is equal to r /2, the
ratio h/h 1 is equal to that of the radii: h/h1 = rdr2' or hI = h(r2/rl);
hence, hI < h. As a result, the beam, passing between the mirrors, will
approach the optical axis.
~-----------------"">iC
Figure 6.57.
After the second pair of reflections, the distance between the passing
beam and the optical axis will be h 2 = h(r2/rl)2, after the third reflec-
tion h 3 = h(r2/rl)3 is observed, and so on. The beam can leave the
system after n pairs of reflections if the following inequality is true:
d > 2h n = 2h(r2/r;)n .
Since the approximation of paraxial rays is valid, one may regard all
angles between the traveling beams and the optical axis as being small, so
that the net path length l between two adjacent reflections , for example
the path ABC, is roughly equal to l = 2(rd2 + r2/2) = rl + r2. Hence,
if the beam leaves the system after n pairs of reflections , the net path
282 DEMONSTRATIONAL OPTICS
between the mirrors will be equal to in = n(rl + r2), and the delay may
be est imated by the following expression:
6.6 We use the solution of Problem 5.4. where a relation between the
angular separation 8 and the difference b.n was obtained (for the case
where the refractive angles of both prisms are 300 ) :
6.7. With the fixed position of the pinhole 8, so that its center lies
a little above the optical axis, the optical system is adjusted by a dis-
placement of the lens system along the optical axis to provide a distinct
image 8' of the pinhole at a position immediately adjacent to the pin-
hole. The pinhole and its image lie on the same plane if and only if this
plane is the focal plane of the optical system. That means, if the rays
fall parallel on the mirror M , they are then reflected back , as shown in
Fig.6.58. If this is the case, the distance from the apex of the first lens
to t he plane of the pinhole and image is equal to the magnitude los.
Geometrical Optics 283
M
f05 - - - - -
Figure 6.58.
H H'
--- -- Ft
Fs 81 82
Figure 6.59.
For the coordinates !l and fi, connected with the first surface which
has a positive optical force, and measured from the apex aI, we can
therefore write
!l=-~ and
n-1
The second surface of radius r2 has a negative optical force; hence, the
appropriate coordinates measured from the apex a2 are expressed as
follows:
f2 = -~
n -1
II _
J2 - -
nr2
n -1
In order to calculate the coordinates of the focii of the meniscus we find
the expressions for the focal lengths of the optical system, f~s and fos ,
284 DEMONSTRATIONAL OPTICS
J. - _ nrl. r2 + (n - l)d
os - n - 1 (n - l)d + r2 - nrl
Both focal lengths are measured from the apexes of the refracting sur-
faces. In a similar way, the coordinates of the principal planes are cal-
culated by means of the expressions (6.22, 6.24):
d
Xw = -nr2 and
(n - l)d + r2 - nrl
d
XH = -rl-;----:--,-------
(n - l)d + r2 - nrl
We substitute r2 = 4rl for r2 and d = rI/2 for d in all expressions and
obtain
6.9. We may use the results of the previous problem, considering the
convergent meniscus, but we have to take into account that rl is now
larger than r2 , opposite to the previous case.
-- - _. _ . _ . _ . _ . -0 - -_ . -
F.'
s Fs
H HI
Figure 6.60.
Taking into account this fact , we rewrite the magnitudes obtained for
the convergent meniscus in the following form:
References
[2] T .Young, Phil. Trans Roy. Soc. (London) 1802, v.12, p. 387
[6] M.Planck , Vert. d. Deutsch Phys. ber .2, 1900,202, 237; Ann d.Phys. (4)
[7] A.Einstein , Ann d. Phys . (4), 17, 1905, 132; 29, 1906, 199.
[9] A.A.Michelson, E.W.Morley, Amer. J . Sci. (3), 34, 833, 1887; Phil. Mag.12, 449,
1887
286 DEMONSTRATIONAL OPTICS
[17] E .Wolf., Proc . Roy. Soc. A, 230, 1955; Nuovo Cimento 12, 1954
[22] M.Born , E.Wolf, Principles of Optics , 2nd edition , Pergamon Press NY 1964
[25] Pohl, R.W. Einfuhrung in die Optik. Vierte und furfte verbesserte anflage. Berlin ,
Springer - Verlag, 1943 (Einfuhrung in die Physik ).
[26] Cherenkov P.A. Soviet Academy of Sciences Doklady 1934, v.2, p.451;
[27] Vavilov S.N. Soviet Academy of Sciences Doklady 1934, v.2, p. 457.
[28] Tamm I.E ., Franck I.M. Soviet Academy of Sciences Doklady 1937, v.14, p. 107.
[30] Bracewell R.H. The Fourier Tansformation and its Applications. 2-nd ed.- N.Y.:
Mc.Graw-Hili Book Company 1979.
[34] Elias P., Gray D.S., Robinson D.Z., Journ, Opt . Soc. Amer. 42, 127, (1952)
[35] Elias R., Journ. Opt. Soc. Amer. 42, 229, (1953)
REFERENCES 287
[36] R. Hanbury Brown, R.C.Jennison, M.K. Das Gupta. Nature 170, 1061 (1952)
[41] E .Kreyszig. Advanced Engineering Mathematics. 482 - 488. N.Y. Jonh Wiley &
Sons, Inc, 1993.
289
290 DEMONSTRATIONAL OPTICS