Professional Documents
Culture Documents
Physics and Technology For Engineers... 2023
Physics and Technology For Engineers... 2023
Physics and Technology For Engineers... 2023
R. Prasad
© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature
Switzerland AG 2023
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether
the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse
of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and
transmission or information storage and retrieval, electronic adaptation, computer software, or by similar
or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication
does not imply, even in the absence of a specific statement, that such names are exempt from the relevant
protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this book
are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or
the editors give a warranty, expressed or implied, with respect to the material contained herein or for any
errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional
claims in published maps and institutional affiliations.
This Springer imprint is published by the registered company Springer Nature Switzerland AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Dedicated to my wife
Sushma Mathur (12 Nov, 1950–24 May,
2023)
The idea of writing this manuscript originated from my interaction with B.Tech.
students who often complaint of not having a book that covers both the fundamentals
of physics and modern technologies. The need of such a book was also felt by the
group of eminent faculty with whom I was involved in setting papers for competitive
examinations of various technical boards/institutes. It was realised that the available
books/materials on these topics are incomplete, lopsided or too detailed in some
aspects but lacking in others. Moreover, in most of the available books, modern topics
like sustainability and sustainable energy sources are not even touched. With the view
of providing a balanced description of physics of engineering materials and modern
technologies and with the aim of making readers aware of their moral responsibility
towards sustainable development, the present text is developed in textbook format.
The book contains around 220 illustrative figures and some 35 tables.
The book is divided into ten chapters. Chapter 1 starts with the classification
of engineering materials and their important properties. In order to link specific
material properties and their dependence on atomic structure of constituent atoms/
molecules, details of atomic structure and of atomic/ionic/molecular bonding are
provided in this chapter. Chapter 2 discusses electrical behaviour of solids including
superconductors and associated physics. Chapter 3 details the origin and behaviour
of magnetic materials and types of magnetism along with the fabrication of materials
with desired magnetic properties. Important topics of modern physics, like discovery
and properties of X-rays, dual nature of matter and instances of the failure of classical
physics based on Newtonian Mechanics and Maxwell’s theory of Electromagnetic
radiations, and their successful resolution in the frame work of quantum approach
are discussed in Chap. 4. Basics of quantum mechanics, particularly of Schrodinger
approach is provided in Chap. 5. Some simple one-dimensional problems of potential
wells and of barrier transmission are discussed in this chapter. Since most micro-
and macrosystems consist assemblies of large number of identical particles, their
behaviour is generally predicted by the laws of statistics. Since microsystems obey
quantum mechanics and have discrete energy levels, the appropriate statistics that
may be applied to these systems is quantum statistics. Laws of quantum statistics,
macro- and microstates, a prior equal probability of all microstates associated with a
vii
viii Preface
given macrostate, etc. are discussed in Chap. 6. Technique of optical fiber communi-
cation is discussed in Chap. 7, while details of laser technology and its applications
are detailed in Chap. 8. Chapter 9 gives detailed description of nanomaterials, their
advantages, reasons behind their special properties, fabrication techniques for nanos-
tructures, membranes, sheets, tubes, etc. Chapter 10 is special as it discusses details
of the concept of sustainability, as applied to different fields like the social, economic,
environmental and sustainable energy sources. Methods conducive to a sustainable
development that may be adopted by individuals, by socio-economic groups, cluster
of groups, etc. are discussed in this chapter. It is expected that after going through
this chapter, a reader will become aware of his/her social responsibility towards
sustainable development and engineers in particular will participate in generating
sustainable energy sources so that the benefits of natural resources are left largely
undiminished for the future generations.
Chapters of the book have the following special features:
1. The objective of the chapter is spelled out at the very beginning.
2. Sufficient number of self-assessment questions (SAQ), probing the understanding
of the reader, is uniformly distributed over the text of each chapter. A serious
reader is expected to satisfy himself/herself by answering these questions before
proceeding further.
3. Solved examples are included, wherever required, to illustrate the technique of
problem solving.
4. Problems with answers are provided at the end of each chapter.
5. Large number of short answer questions are included at the end of each chapter.
6. Since most of competitive examinations are based on multiple choice questions,
sufficient number of multiple choice questions (MCQ) with answers is provided
at the end of each chapter. A special feature of these MCQ is that in some cases
more than one alternative may be correct, and therefore all correct alternatives
must be marked for complete answer of the question.
7. Some long answer questions are also provided at the end of each chapter.
8. Each topic of the text is started from the very basics and is developed to the
desired level; therefore, no other book or material is required for reading this
text.
It is expected that the book will prove useful for readers.
I shall very much appreciate receiving feedback from readers on the following
e-mail address:
Rpm166@rediffmail.com
Present book, like my earlier publications, is the result of encouragement and support
extended by my students, members of my research group and colleagues. I would
like to express my gratitude to all my students, members of my research group and, in
particular, Prof. B. P. Singh (Former Chairman, Department of Physics) who showed
great enthusiasm for the book project. While he provided some initial material,
his involvement in developing the text was limited due to other academic commit-
ments. Nevertheless, I would like to record my sincere appreciation to Prof. Singh’s
unwavering support, encouragement and the valuable discussions we frequently had.
Support from Prof. Manoj K. Sharma, Prof. Sunita Gupta, Prof. M. M. Musthafa (Ex-
chairman Department of Physics, Calicut University), Dr. Pushpendra P. Singh, Dr.
D. P. Singh, Dr. Abhishek Yadav, Dr. Unnati, Dr. Mohd. Shuaib and Dr. M. Shariq
Asnain is thankfully acknowledged.
I wish to thank the Aligarh Muslim University, Aligarh, India, and my colleagues
at the Physics Department and at the Department of Applied Physics, Z. H. College
of Engineering and Technology, AMU, Aligarh, with whom I passed more than forty
years of my active life.
Last but not the least, I wish to thank all the members of my family, in particular
my wife Sushma who in spite of being seriously ill, extended all possible support
and encouragement for the completion of the project.
ix
Contents
xi
xii Contents
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 533
Chapter 1
Engineering Materials, Atomic Structure
and Bounding
Objective
Classification of condensed matter and its correlation with atomic structure and
chemical bonding are important from the view point of engineers. These topics are
discussed in this chapter in sufficient details. It is expected that after going through
the chapter, the reader will be able to identify the special properties of different
engineering materials and will also be able to correlate these special characteristics
of different materials with their atomic structure, electron configuration and atomic/
molecular bonding. This will go a long way in selecting a proper material for specific
engineering requirements as well as in fabricating materials with desired properties.
It is known that matter, on the basis of their physical state, may be classified as solids,
liquids, gases and plasma. The first three states are quite well known; however, the
fourth state, plasma, is rather peculiar. At very high temperature atoms of the matter
get ionised forming plasma that contains ionised atoms and electrons in a state of
rapid motion. The flame of a burning candle is a typical example of plasma.
In material science solids are defined as the matter having the property of crys-
tallinity. Crystallinity means molecules, atoms or ions of the matter spaced at regular,
repeating distances and angles from each other in three dimensions. In condensed
matter atoms or molecules or ions almost touch each other, like that in solids; liquids
also show properties of condense matter; however, most of the time liquids do not
have crystallinity: they are amorphous. Some liquids called liquid crystal, as excep-
tion, do exhibit some regularity of structure over comparatively large distances, but
they do not possess this regularity in three dimensions. In supercritical states of
matter, at very high temperature and pressure, matter is essentially in gaseous state
but relative separation between constituent atoms, etc. is of the order of that in solids
and hence fall in the category of condensed matter. Gases are characterised by atoms/
molecules separated from each other by large distances.
There are some interesting consequences of the above-mentioned classification:
glass and several types of plastics like polyvinylchloride (PVC), for example, are
defined as rigid, supercooled liquids. Most materials of engineering interest are either
solids or rigid supercooled liquids.
Material science also classifies matter in four broad classes: metals, ceramics,
polymers and composites. Metals are characterised by their lustre, good conductors
of heat and electricity and to some extent by their property of ductility. A ceramic
is a material that is neither metal nor organic. It may be crystalline or glassy (rigid
supercooled liquid) or both. Ceramic pottery is quite well known but clay, bricks,
tiles, glass, concrete and cement are some other examples of ceramics. Ceramics,
depending on their composition may be semiconducting, superconducting, ferroelec-
tric or insulator; hence they are finding ever-increasing applications in solid state elec-
tronics, fiber optics, artificial joints, space shuttle tiles, micropositioners, chemical
sensors, body armours, self-lubricating bearings, etc. Polymers are mostly organic
substances made of long chains of molecules. Skin, hair and wood are examples of
polymers.
Another class of materials is called ‘composites’ that are combinations of two
or more of the above-mentioned metals, ceramics and polymers. Composites are
materials designed for specific goals to achieve a combination of properties not
found in any single material. Then there are advanced materials that are finding
applications in highly sophisticated technical fields like, electronics, space tech-
nology, computers, etc. Advanced materials include semiconductors, nanoengineered
materials and biomaterials, etc.
1.1.1 Metals
All materials are made up of atoms, either of the same or of different elements. Atoms
of different elements are characterised by their Atomic Number Z and Atomic Mass
Number A. Russian scientist Dmitri Mendeleev developed a table, called periodic
table, where elements were arranged in order of increasing atomic number from left
to right and from top to bottom.
Elements in periodic table are arranged in groups and rows such that elements
falling in a group exhibit similar chemical behaviour. Based on the observed simi-
larity in their chemical properties, elements have been grouped together as (i) alkali
metals, (ii) alkaline earth metals, (iii) transition metals, (iv) other metals, (v) halo-
gens, (vi) noble gases, (vii) rare earth and lanthanoid elements, (viii) non-metals and
(ix) actinoid elements. These different groups of elements are shown with different
colours in Fig. 1.1 that shows periodic table.
Materials of metal group are composed of one or more metallic elements, like
gold, titanium, nickel, copper, iron and aluminium and often also contain very few
atoms of non-metals like carbon, oxygen and nitrogen. These non-metallic atoms
1.1 Classification of Condensed Matter 3
Materials can undergo two types of fractures under tensile stress: brittle frac-
ture and ductile fracture. In brittle fracture the fractured ends have irregular
shape. The two types of fractures are shown in Fig. 1.2. Metals under tensile
stress undergo ductile fracture, as they have the property to withstand plastic
deformation. Metals have the ability to absorb more energy prior to fracture.
1.1 Classification of Condensed Matter 5
Fig. 1.3 Unit cells of a hexagonal close packed (hcp), b face-centred cubic (FCC) and c body-
centred cubic (bcc) crystal structures
Sliding of atomic layers is possible only in metals because in metals and their
alloys atoms are bound through metallic bonding. In metallic bonding all valence
electrons of each atom get detached from its parent individual atom forming an
electron cloud around all positively charged metallic ions. Since electrons are
no more attached or associated with a particular atom, these electrons are called
delocalised electrons or free electrons. Since electrons in this electron cloud
are not associated with a particular positive atomic ion, the binding of indi-
vidual atomic ion with electron cloud is quite weak. Therefore, on application
of stress, layers of atomic positive ions may move relative to the layers below
or above it, without damaging crystal structure. With the change in the shape
of the metal specimen, the delocalised electron cloud also assumes a new shape
1.1 Classification of Condensed Matter 7
and orientation such that the overall structure, i.e. the crystalline structure of
specimen does not break.
Bonding between atoms of a material depends essentially on two factors; the size
of atoms and their relative separation. There are several types of bonding that are
found in different materials. We shall study more details about bonding in the
next section; however, for present it may suffice that in metals and their alloys
a special type of bonding called metallic bonding is found. To understand
the nature of metallic bonding let us start with two atoms of the same metal
with atomic number Z. When these two atoms are far apart, each atom has the
positive nucleus that is surrounded by a number of electrons such that the total
positive charge of the nucleus is counterbalanced by the total negative charge
of electrons; therefore, there must be Z electrons in each of the two atoms.
These atomic electrons are distributed around the nucleus at different distances
in different discrete energy states. The group of electrons farthest from the
nucleus is termed as valence electrons. Since they are farthest from the nucleus,
valence electrons experience a very weak force of attraction by the positively
charged nucleus, in scientific language one says that valence electrons in an
atom are least bound. If N is the number of valence electrons, then (Z − N)
electrons will be tightly bound with the nucleus and one defines the core of the
atom as the nucleus plus (Z − N) tightly bound electrons. Obviously, the net
charge on atomic core will be (+ Ne), where e is the unit of charge. Atomic
core, therefore, behaves as a positive ion as shown in Fig. 1.5a. For example,
the number of valence electrons in aluminium atom is 3, hence each positive
core or ion of Aluminium has a charge + 3e (3 × 1.6 × 10–19 C).
Fig. 1.5 a Structure of the core and valence electrons in an isolated atom b arrangement of positive
ions (cores) and delocalised electrons in a metallic specimen
8 1 Engineering Materials, Atomic Structure and Bounding
Next let us consider that another similar atom is brought near to the first, so close
that the two atoms start feeling the presence of the coulomb fields of each other.
Since the cores of two atoms are tightly bound, they will not be much affected
by the presence of the other atom, but loosely bound valence electrons of both
atoms will feel almost equal force of attraction by the core of its parent atom as
well as the core of the other atom. Thus valence electrons will now get associated
to both atomic cores. This results in delocalisation of valence electrons, which
means that valence electrons (numbering 2N) are now not confined to the field of
one atom but are relatively free to move from the field of one atom to the field of
the other atom. As a result a bond is formed between the cores of the two atoms
making the system a diatomic molecule. Valence electrons now move around
the diatomic molecule in fixed orbitals. If a third atom is now brought very near
to the diatomic molecule, a tri-atomic molecule is formed having 3N number of
delocalised electrons circulating around three atomic cores in specified orbitals.
In this way a piece of a metal may be considered as a multiatom molecule with
delocalised electron cloud around it, see Fig. 1.5b.
Delocalised electrons may freely move from one core to the other and then to the
other, free to move within the electron cloud but they cannot leave the electron
cloud. It is because if any electron tries to leave the multiatomic molecule
electron cloud, the molecule develops a net positive charge and pulls the electron
back. Often one uses the term ‘free electrons’ for delocalised electrons as
they are not attached firmly to any atom of the specimen and are free to move
from the electric field of one atom to the electric field of any other atom of
the multiatom molecule, yet they are bound to the molecule. The delocalised
electrons are very large in number, but they move in some well-defined orbitals.
Each electron has a fixed discrete value of energy. Not more than two electrons
can have same energy as per Pauli’s exclusion principle. The energy differences
are very small and, therefore, electron energy levels are closed spaced as shown
in Fig. 1.6. As shown in this figure, energy levels up to some energy are filled with
electrons, but there are large number of empty levels. When delocalised electrons
absorb energy from some external source, for example if light is made to fall
on metallic surface delocalised electrons may absorb incident light photons and
may shift to their next higher excited state. Similarly, if the metal specimen is
heated delocalised electrons absorb energy and shift to higher excited states.
Availability of large number of empty levels for electrons plays important role
in metals.
The crystal structure of metals consists of regular arrangement of atomic cores
(or positive ions left after losing valence electrons) in two-dimensional arrays,
called crystal lattice, stacked one over the other in three dimensions. A rough
and enlarged version of metallic crystal structure is shown in Fig. 1.7.
Two lattices marked A and B are shown in the figure with large number of
delocalised electrons moving all around constituting electron cloud. Though
1.1 Classification of Condensed Matter 9
the lattices A and B are shown quite well apart in the figure, but in actual case
successive lattices almost touch each other. In metallic crystals positive ions in
a lattice are very strongly bound with each other; therefore, it requires large
amount of energy to break lattice structure. The strong binding of positive ion
cores in metallic crystals is provided by the cloud of delocalised electrons which
serves as a glue. However, the binding is much weak between two adjacent
lattices. That is why application of small stress, capable of overcoming bonding
between lattices, may slide lattices one over the other, without damaging lattices.
Sliding of lattices with respect to each other results in giving a new shape to the
specimen. Since cloud of delocalised electrons is firmly bound with lattices, it
readjusts its orientation and other parameter to give stability to the new shape.
It may thus be observed that properties of ductility and malleability in metals
originate from metallic bonding that is characterised by delocalised electron
cloud, large number of empty electronic states and very strong binding between
positive ions in crystal lattices.
(c) Different types of metal strengths
10 1 Engineering Materials, Atomic Structure and Bounding
V ∝ I or V = R I (1.3)
In Eq. (1.3), the proportionality constant R is called the resistance of the given
piece of specimen and is a measure of the opposition that the specimen has
offered to the flow of current through it. A large value of R is an indication that
the specimen has offered large opposition to the flow of current, i.e. it is not a
good conductor of electricity. Resistance R is measured in units of Ohm (Ω). The
magnitude of R depends on (i) the material and on two physical parameters, (ii)
area of cross section A of the specimen and its (iii) length L. One may therefore
write,
L L
R∝ or R = ρ (1.4)
A A
Figure 1.8 shows the bar graph for the range of values of conductivity for
different materials. The conductivity of metals and alloys has high value but
varies in a narrow range. Conductivity has largest range for composites, as
expected. Conductivity value at room temperature (≈ 20 ◦ C) for some important
metals and alloys is given in Table 1.3. It may be observed in this table that
silver that has the maximum value of conductivity is the best conductor of
electricity and, therefore, it is frequently used in making electrical connections
in sophisticated electronic circuits.
Large number of delocalised or free electrons in metals is responsible for their
high electrical and thermal conductivities. Electron, being negatively charged,
experiences a force F = −eE when subjected to an electric field of strength
E. The negative sign in this expression tells that the force F is in a direction
opposite to the direction of electric field E.
Figure 1.9 shows a rectangular metallic rod of length ‘d’ which is connected
to a battery of voltage V with a switch ‘sw’. When the switch is made on, face
F 1 of the rod is connected to the positive terminal of the battery and face F 2
to the negative. A current of magnitude I flows through the metallic rod such
that V = R I . As already mentioned, R is a measure of the opposition that the
metallic rod offers to the flow of current. In the following we shall discuss the
mechanism of current flow and the origin of resistance R.
The interior of the rod has a crystalline structure consisting of positively charged
core of metallic atoms (indicated by + symbol) arranged in a regular fashion
in three dimensions. These sheets of positive ions are called crystal lattices and
they are held at their positions by the balance of attractive forces of delocalised
electron cloud and repulsive forces between nearby positive ions. Almost free
(delocalised) electrons, in motion, each with its inherent velocity (indicated by
brown coloured arrows) surround the crystal lattices. With the application of
voltage V across the two opposite faces of the rod electric field E of magnitude
E = Vd , directed from face F 1 to face F 2 , gets established between the two faces
F 1 and F 2 inside the rod. The electric field applies a force F = |eE| directed
from F 2 to F 1 , on each of the delocalised electron and imparts an additional
velocity, say vad directed opposite to the field direction, to each electron. This
additional velocity component on each electron is represented by a small black
arrow in Fig. 1.9. Under the action of two velocities, each delocalised electron
moves in the direction of the resultant velocity. However, the additional velocity
component tries to make every electron in the road to rush towards face F 1 . At
first it appears that all electrons in the metal rod will reach face F 1 in no time and
will accumulate there. But that is not true. These free electrons mostly moving
towards face F 1 collide with crystal lattice and their direction of motion and
the magnitude of velocity both get changed at such collisions. Free electron–
lattice collisions are very frequent, as a result though there is a net flow of
negative charge in the rod from face F 2 towards face F 1 but on average number
of free electrons per unit volume of the rod remains almost constant; there is no
accumulation of electrons in any part of the rod. The net flow of negative charge
(from F 2 to F 1 ) establishes the current I, and conventional current is assumed to
flow from F 1 to F 2 . One may ask a question why collisions between electrons are
not taken into account. The answer is that collisions between electrons are quite
unlikely because of their negligibly small size and very small time that electrons
take in crossing each other. Such electron collisions with crystal lattice are the
major cause for randomisation of electron velocities. Larger the frequency of
14 1 Engineering Materials, Atomic Structure and Bounding
electron–lattice collisions, lesser will be the amount of net charge flow towards
face F 1 , resulting in lower current. It may, therefore, be realised that in metals the
opposition R to the flow of current originates from electron–lattice collisions.
Since inherent speed of electrons increases with temperature, the electron–lattice
collision frequency strongly depends on the temperature of the metallic rod; at
higher temperature the resistance R of the same specimen rod will be larger.
This is confirmed from the experimental observed fact that the same specimen
shows a larger value of resistance R or of resistivity ρ and a lower value for
conductivity σ at higher temperatures. It is, therefore, required to specify the
temperature while mentioning the resistance, resistivity or conductivity of a
specimen.
Large number of electrons colliding with lattice impart energy and momentum to
it that sets the crystal lattice in vibratory motion. This vibratory lattice motion
is quantised, i.e. the lattice in vibratory motion either absorb or emit energy
in packets. Energy packets corresponding to the vibratory lattice motion are
called ‘phonon’. There should be no confusion between phonons and photons;
photon is the energy quanta of electromagnetic field, while phonon is the quanta
of lattice vibration.
Heat is a form of energy, and it may transmit from one place to another by
three distinct methods; (i) radiation (ii) convection and (iii) conduction. Heat
transfer through radiation does not require any medium; it is directly transferred
as energy quanta, photons, from the source to the receiver. Sun light reaches
Earth via radiation crossing a vast region of vacuum. However, both convection
and conduction require some material medium to transfer heat. In convection,
medium particles take heat energy and move away to transport heat energy.
This happens in boiling of water when water molecules absorb heat from the
hot source at the bottom of the container and move out to transfer heat energy to
the top layer. In case of conduction on the other hand, medium particles do not
move, instead they transfer heat energy to the next particle, and then to the next
and so on. Though heat transfer from a hot body by radiations cannot be avoided,
but in solid materials heat transfer essentially takes place via conduction.
Thermal conductivity of a material, generally denoted by κ, tells about the
ability of the material to let heat energy pass through it via the process of
conduction. A large value of κ for a given material means that the material is
a good conductor of heat. Metals and their alloys are good conductors both of
heat and electricity.
Let us consider a rectangular sheet of a material of thickness ∆x (m) and area
of cross section A (m2 ) as shown in Fig. 1.10. Let the front face of the sheet
be at temperature (T + ∆T ) (Kelvin K) and the back face at temperature T
(K). Front surface being at a higher temperature will conduct some heat energy
∆Q (Joule J) in time ∆t towards the back surface. Experimentally it has been
found that heat transfer through conduction ∆Q from front face to the back
face is proportional to the area of cross section A of the surface, time ∆t, the
1.1 Classification of Condensed Matter 15
The left side of Eq. (1.5) gives the amount of heat energy lost by the front
surface per unit time per unit area, and, therefore there is a negative sign, the
negative sign simply shows that energy is lost by the front surface. The units
of thermal conductivity κ may be given as Joule per second per unit area per
unit temperature gradient or watt per meter per Kelvin, i.e. (W/m K). In metals
heat conduction also takes place through the delocalised electrons which are
relatively free to move and transport heat. Since the number of free electrons
in metals is large, the conductivity of metals and alloys is also large. Values of
thermal conductivity for some metals and other materials are shown in Table
1.4.
(f) High melting point of metals
Metals in general have high melting and boiling points. When a solid melts,
its crystalline structure gets destroyed and it changes its phase from solid to
liquid. On further heating the liquid, at some temperature called boiling point,
changes into gaseous phase. A high melting point means that the bonding
between constituent atoms is very strong. In case of metals crystalline structure
is protected by very strong bonding of multiatomic molecules, i.e. by the large
16 1 Engineering Materials, Atomic Structure and Bounding
SAQ: Crystal lattice consists of positively charged ions but they are very tightly
bound. Which forces provide this strong binding?
SAQ: Current in metals is constituted by the flow of electrons; what will happen to
these electrons when a current carrying wire is cut, will electrons go out of
the wire?
SAQ: How can one explain the reduction of conductivity in metals that have small
grains?
SAQ: Why do delocalised electrons in metals remain inside the specimen when
they are not attached to any specific atom?
1.1.2 Ceramics
are very hard, extremely brittle and susceptible to fracture. Figure 1.11 shows the
spread of tensile strengths for different materials. In spite of being very hard, ceramics
may be optically transparent, translucent or opaque.
Type of bonding between constituent atoms is often used to classify a material; for
example, metals are characterised by metallic bonding where delocalised electrons
of constituent atoms provide the ‘glue’ for strong binding. Similarly, polymers have
strong covalent bonds in long molecular chains, while relatively weak van der Waals
bond binds one long molecular chain with the other. On the other hand, ionic bonds
are found in non-metals. In ceramics all types of bonds exist. As an example, Al2 O3 ,
MgO, SiO2 , etc. have ionic bonds, while SiC, BiC, BN, Si3 N, Si2 N2 O, etc. show
covalent bonding. It is interesting to observe how relative content of different types
of bonds changes the melting point of ceramics (see Table 1.7).
Solids generally have crystalline structure with grains, i.e. they are poly crys-
talline; microscopically there are several crystals having different orientations, fused
together. Fine-grained pure alumina and glass are polycrystalline ceramics. Ruby,
diamond, etc. on the other hand are large single crystal ceramics.
Ceramics have many different chemical compositions:
There are simple oxides that have high melting point, like ThO2 (melting point
Tm = 3300 °C), MgO (Tm: 2825 °C), UO2 (Tm: 2810 °C), etc. Ferrites that are
complex oxides like Fe3 O4 , SrF12 O19 ; Titanates: BaTiO3 (Tm: 1625 °C); SrTiO3
(Tm: 2080 °C); Nitrides: Si3 N4 (Tm: 1900–2600 °C); TaN (Tm: 3080 °C); Brides:
HfB2 (Tm: 3350 °C); ZrB2 (Tm: 3245 °C); Silicides: Hf5 Si3 (Tm: 2600 °C); WSi2
(Tm: 2160 °C); Halides: NaCl (Tm: 800 °C); Intermetallides: HfRe2 (Tm: 3160 °C),
Nb3 Sn; Metal ceramics: WC-TiC-Co; Polymer ceramic: Synthetic resins.
Material scientists occasionally divide ceramics into Traditional and Advanced.
Bricks, pottery, glass, porcelain, etc. are time tested daily use general purpose
ceramics. Out of these pottery is generally made from traditional clay while bricks,
tiles, etc. are heavy clay products. However, coarse-grained refractors fired bricks;
silica bricks find special use in making high-temperature oven, etc. Cement and
concrete are other traditional ceramics used in building construction.
Advanced ceramics are those that have been specially engineered, mostly since the
early twentieth century, for highly technical and specific applications. For example,
aluminium nitride (AlN) and beryllium oxide (BeO) ceramics have been devel-
oped to serve as heat-sink for electronic elements. These ceramics are also used as
substrate in electronic packaging. Similarly, SiO2 and polymer-ceramic compounds
are used as thermal protection shields. High-resistivity conductors-ceramics silicon
carbide (SiC), zirconium oxide (ZrO2 ), molybdenum silicate (MoSi2 ), etc. are used
for making heating elements and electrodes.
Ceramics having magnetic and superconducting properties have also been devel-
oped. Complex ferrite and oxides of heavy metals like, (Ba, Sr) Fe12 O19 ; Y Co5 Sm2
(Co, Fe, Cu, Zr)17 and Nd2 Fe14 B, etc. are used for making hard magnets.
Magnet-doped silicon dioxide (SiO2 ) and chromium-doped complex Be3 Al2
(Si3 )6 have been used to make artificial gem stones, former as topaz and the later as
emerald. Artificial diamonds are made from ceramic ZiSiO4 .
Advanced ceramics are finding extensive use in nuclear technology. UO2 , UC and
PuO2 are used as nuclear fuels. Ceramics BeO, BeC2 and ZrO2 have been used to
moderate fast neutrons to thermal energies in nuclear reactors. Similarly, ceramics
B4 C, HfO2 and Sm2 O5 are used for neutron shielding.
Some ceramics are biocompatible that means they are not harmful or toxic for
living tissues. Such ceramics are used for making artificial joints, prostheses, cardiac
valves and other implants.
One may conclude by saying that ceramics are versatile materials that have
applications in almost all walks of human life.
1.1.3 Polymers
Polymers are long-chain molecules of very high, running into hundreds and
thousands, molecular weight. It is for this reason that they are also called
‘macromolecules’. In old literature, when polymer science was not so developed,
term ‘resins’ was used for polymers. As has been mentioned, polymers are substances
made up of recurring structural units, each of which is regarded as derived from a
specific compound. These building blocks or units are called monomer. The physical
and the chemical properties of a polymer depend very strongly on the number of
20 1 Engineering Materials, Atomic Structure and Bounding
monomer units in the chain. As an example take a simple case of normal alkane
hydrocarbon series;
The bond structures of first three members of the series are shown in Fig. 1.12a,
the monomer of the series is given in figure (b), while the series formula is shown
in part (c) of the figure. In series formula ‘n’ gives the number of monomers in that
particular member. It is interesting to note that the physical and chemical properties
of different members of the series change with the number ‘n’ of monomers in the
chain. First four members of the series are gases. The fifth member ‘n-Pentane’ is a
low viscosity fluid with boiling point of ≈ 36 °C. With the increase of the number
of monomers in the chain, the viscosity and boiling points of the member increase.
Some characteristics of series members and the value of ‘n’ for them are tabulated
in Table 1.8.
The boiling point of successive members of the alkane chain also increases with
the number of monomers, but the rate of increase slows down such that the boiling
point for the massive members of the series saturates at about 145 °C. Long-chain
alkanes having 103 to 3 × 103 carbon atoms are known as polyethylene. A big
difference between wax and polyethylene lies in their mechanical behaviour. While
polyethylene is a tough plastic, wax is a brittle crystalline solid.
classification based on (i) molecular forces, (ii) heat treatment, (iii) source, (iv)
structure and (v) mode of polymerisation, etc.
(i) Classification based on molecular forces Two types of bonds are frequent in
polymers; the hydrogen bond and (b) van der Waals bond. These bonds bind
chains and monomers in a chain with each other. Accordingly, polymers may
be classified as:
(a) Elastomers Rubber—like solids fall in this category. In these polymers
chains are coupled with each other by the weakest intermolecular force
which permits the polymer to be stretched. However, there are some cross-
links between the chains that bring back the polymer to its original shape
when deforming force is withdrawn. Examples are neoprene, buna-N, etc.
(b) Fibers Fibers are those polymers which may be drawn into long filaments
with lengths at least 100 times of their radii. This happens because of
the strong bonds between chains, usually hydrogen bonds. As a result
of strong intermolecular force, these materials are closely packed and
have crystalline structure. Examples are polyesters (terylene), polyamides
(nylon), etc.
(c) Resins They are liquid polymers that are used as adhesives, sealants, etc.
Examples are epoxy adhesives and polysulphide sealants.
(ii) Classification based on heat treatment A polymer that may be given different
shapes to make tough and hard utility articles by heating and/or by applying
pressure is called plastic. Plastics may be further classified as: (d) thermoplastic
(e) thermosetting plastic.
(d) Thermoplastic polymers Some polymers become soft on heating and
can be given any desired shape. However, on cooling they again become
hard and tough. The process of heating, reshaping and becoming tough
and hard on cooling can be repeated several times. The intermolecular
forces in these plastics are stronger than that in elastomers and weaker
than those in fibers. Sealing wax, nylon, PVC, etc. are some examples.
(e) Thermosetting polymers Those plastic polymers that undergo some
chemical changes on heating and become infusible mass which cannot be
given any shape are called thermosetting plastic polymers. In such poly-
mers, heating creates large number of new cross-linking bonds that convert
it into an infusible mass. Bakelite is an example of such thermosetting
polymer.
(iii) Classification based on source Based on the source of the polymer there may
be three classes:
(f) Natural Polymers Polymers that are found in nature, in plants and animals
are called natural polymers. Proteins, cellulose, barks, starch, and rubber,
etc. are some examples.
1.1 Classification of Condensed Matter 23
1.1.4 Composites
Modern technologies often require materials with very special properties which are
not available in metals, metal alloys, ceramics and polymers. For example, aerospace
scientists are always in lookout of materials which have very low density, very strong,
highly resistant to abrasion and impact yet quite stiff. This amounts to asking for
two apparently opposite characteristics in the same material, because strong and stiff
materials are generally dense. Further, increasing the strength or stiffness, in general,
decreases the impact strength. An answer to such problems comes from composites;
these are materials produced by the combination of two or more of the three materials,
namely metals, ceramics and polymers.
A composite may be defined as a combination of two or more materials (often
called phases) at a microscopic scale and have chemically distinct phases that results
in better properties than those of the individual components used alone. Though
heterogeneous at a microscopic scale, a composite is statistically homogeneous at
macroscopic scale. In general, out of different phases in a composite, one partic-
ular material has volume wise larger concentration than others. This component
with largest concentration (or bulk material) is called the ‘matrix’. The other mate-
rial which is in relatively smaller amount is termed as the ‘reinforcement’. Rein-
forcements are primarily added to increase the mechanical strength, toughness and
stiffness of the material.
1.1 Classification of Condensed Matter 25
The manmade composites may be divided into three categories, (a) polymer
matrix composites (PMC), (b) metal matrix composites (MMC) and (c) ceramic
matrix composites (CMC).
(a) Polymer matrix composites Some polymers, in particular epoxies and
polyesters, have a notable property that they may be easily moulded into desired
complex shapes. But their drawback is that they do not possess high mechanical
strength as metals. On the other hand materials like glass, boron and aramid have
extremely high tensile and compressive strengths which are, however, not readily
apparent in their solid forms. This happens because when stressed, randomly
distributed surface ‘flaws’ (abnormalities due to impurity, etc.) in these mate-
rials make the solid crack or break much below its theoretical ‘breaking point’
stress. To overcome this problem, fibers of these materials (boron, glass and
aramid) are drawn. The advantage in fibers is that though the random distri-
bution of faults will still be same but only few fibers will be affected by these
faults and a very large number of fibers will have no fault in them and will
break at their theoretical break point. Therefore, a bundle of fibers will reflect
more accurately the optimum performance of the material. It may, however, be
realised that a bundle of fibers will show its tensile strength only in the direction
of its length, just like in a rope. When these fibers are mixed as reinforcement
with a polymer matrix, like that of polyester, the resulting composite shows
exceptional mechanical strength comparable or even more than that of metals.
When a stress is applied to the composite, the matrix material, polyester in this
case, spreads the stress to fibers. Further the bulk matrix protects fibers from
atmospheric wear and tear, abrasion and impact. High strengths and stiffness,
ease of moulding into complex shapes makes the composite superior to metals in
many ways. The overall strength of the composite depends on following factors:
(i) Properties of the matrix polymer
(ii) Properties of the reinforcing fiber
(iii) The ratio of the fiber to the polymer, called fiber volume fraction (FVF)
(iv) Geometry and orientation of fibers in the matrix.
It is obvious that the mechanical strength of the composite will increase with
the increase of FVF, but there are limits to which this ratio may be increased.
Firstly, it is essential that all fibers must be fully rapped with polyester matrix
from all sides so that they are not exposed. Further, the manufacturing process
that involves mixing of reinforcement with matrix often produces faults and
air-inclusion, which may become cause of breakdown. In case of ordinary appli-
cations, like boat-building industry FVF of 30–40% is quite enough. However,
in more sophisticate applications like aero-industry FVF of around 70% have
been obtained by advanced manufacturing methods.
The orientation of the fiber in the composite is also important because the
maximum tensile strength of the fiber is along its length and the tensile strength
in direction normal to the length is negligible. The composite is, therefore,
anisotropic which is in contrast to metals and alloys which are largely isotropic.
26 1 Engineering Materials, Atomic Structure and Bounding
It is, therefore, very important when considering the use of the composite at the
design stage to know the magnitude and direction of the load in the finished
structure. If properly taken into account, the property of anisotropy of compos-
ites may be used to advantage, as composite material may be used only where
there are locked stresses.
There are four main types of direct loads that a composite may have to bear in
a structure. They are;
(i) Tension Load Fig. 1.16a shows the situation when tensile force is applied
to a composite. Response of the composite to tensile load very much
depends on the tensile strength of fiber reinforcement mixed with the
polymer, since it is much higher than that of the matrix material.
(ii) Compressive Load Application of compressive load to a composite is
shown in Fig. 1.16b. In this case the adhesive and stiffness properties of
the matrix polymer are very important as they have to maintain the fiber
straight and to prevent them from buckling.
(iii) Shear Load As shown in Fig. 1.16c a shear load attempts to slide adjacent
layers of the reinforcement fiber over each other. In this case also the
properties of the matrix polymer plays a crucial role, it should not only
have good mechanical strength but should also have very good adhesive
force with fiber so that it remains firmly attached with it.
(iv) Flexure Load As shown in Fig. 1.16d, flexure load is a combination of
tensile, compression and shear loads. Therefore, both the matrix and the
reinforcement fiber must possess good adhesive and mechanical strengths.
(b) Metal matrix composites Metal matrix composites have found usage in our
lives from olden times. Metals like cast iron with graphite, steel with high
carbide contents are all examples of metal matrix composites. Artefacts made
of metal matrix composites as swords, body armours, chains, etc. are all found
in excavation of old habitation sites.
There are many ways to classify metal matrix composites. One very often used
classification is based on the nature of reinforcement component; particles, layer,
fiber. Fiber composites may further be classified as, continuous fiber composite
and whisker composite materials. The continuous fiber metal composites may
either be monofilament or multifilament types.
The reinforcement material in metal matrix composites may have different
objectives. The reinforcement by light metals opens up the possibility of the
application of these light metal reinforced metallic composites in areas where
weight reduction is the first requirement. Frequently used light metals as rein-
forcement are Al2 O3 and SiC. The development objectives of light metal
reinforced composites are;
(i) Increase in yield strength and tensile strength at room temperature and
at higher temperatures, maintaining the minimum toughness or ductility
(ii) Increase fatigue strength particularly at higher temperatures
(iii) Increase in Young’s modulus
(iv) Reduction of thermal elongation
(v) Improvement in corrosion and thermal shock resistances
(vi) Low density
(vii) Mechanical compatibility with the matrix metal (thermal expansion
coefficient that matches with matrix metal)
(viii) Chemical compatibility
(ix) Good process ability
(x) Economic efficiency.
Some of the above-mentioned objectives may be achieved by using non-
metallic inorganic reinforcements. Ceramic particles, fibers and carbon fibers are
frequently used as reinforcement materials in metal matrix composites (MMC).
(c) Ceramic matrix composites (CMC) Ceramic matrix composites mostly
consist of ceramic fibers embedded in a ceramic matrix forming a ceramic
fiber reinforced ceramic composite (CFRC). Carbon and carbon fibers that are
also considered ceramic materials, along with fibers of other ceramic materials,
have been used as reinforcement elements. Typical reinforcing fiber materials
are; Carbon C, Silicon carbide SiC, Alumina Al2 O3 , Mullite or Alumina Silica
Al2 O3 –SiO2 .
Normally a ceramic matrix gets fractured by a tensile stress that produces an
elongation of about 0.05% in the length. However, if normal ceramic matrix is
reinforced by ceramic fibers, the fracture or cracks produced by excessive tensile
stress get covered up by the extension of fibers. An essential requirement for
complete recovery of the fracture site is that the matrix ceramic should also be
able to slid and fill the fracture gap. This requires that the adhesive force between
matrix and fiber is not very strong. A strong bond between the matrix and the
fiber will require a very high elongation capability of the fiber bridging the
fracture gap and would result in a brittle fracture. The adhesive force between
the matrix ceramic and the fiber is reduced by coating the fiber with a thin
28 1 Engineering Materials, Atomic Structure and Bounding
layer of pyrolytic carbon or boron nitride. These coatings weaken the bound
at the fiber–matrix interface. Figure 1.17 shows how a ceramic matrix fiber
reinforced composite repairs fracture or crack sites caused by tensile stress by
the elongation of fiber and sliding of the matrix ceramic.
Ceramic matrix reinforced by fibers composites may display both; the high
insulating or high conductivity properties. As a matter of fact the thermal and
electrical properties of ceramic matrix composites strongly depend on the prop-
erties of its constituents, namely fibers, matrix, pores in matrix, etc. Further,
fibers bring in anisotropy in behaviour of composites. Oxide ceramic matrix
composites are very good insulators. Because of their high porosity their thermal
insulation is much better.
The use of carbon fibers increases the electrical conductivity, provided the fibers
remain in contact with each other and with the voltage source. Silicon carbide
(which is a ceramic and a semiconductor) matrix is a good thermal conductor.
Electrical conductivity of SiC matrix decreases with the rise of temperature as
it is semiconductor.
Some important properties of ceramic matrix composites are;
(1) High thermal shock and creep resistance
(2) High temperature resistance
(3) Excellent resistance to corrosion, wear and aggressive chemicals
(4) High tensile and compressive strengths, thus no sudden failure as compared
to conventional ceramics.
Applications CMC have a wide range of applications, some of which are given
below;
• High-performance breaking systems
1.2 Atomic Structure 29
• Heat exchangers
• Bullet proof armour
• Turbine blades
• Heating elements
• Gas-fired burner parts
• Hot pressed dies
• Stator vanes
• Thrust control flaps for jet engines
• Refractory components
• Filters for hot liquids
• Heat shield systems for space vehicles
• Rocket propulsion components
• Turbo jet engine components.
SAQ: Fibers used as reinforcement in ceramic matrix composites are coated or
painted with some material. What is the need of this coating or painting?
SAQ: What is ‘meant by temperature shock’? Why CMC used in break lining should
have high resistance for temperature shock?
SAQ: What are ‘light metal MMC’? Which light metals are often used?
SAQ: Which polymers are frequently used as matrix material in PMC and why?
Though from engineer’s point of view it is only important to know different properties
of materials so that an appropriate specimen may be selected for the required use, but
if it is required to modify some property or to develop a new material having desired
properties, it is essential to know how and why different materials have different
properties. Key to this lies in the atomic and the molecular bonding of different
materials, i.e. how atoms and molecules are held together in different materials. Some
details of atomic structure along with different types of primary and secondary bonds
are discussed in the following.
All materials are made up of molecules, each molecule in turn, is made up of atoms.
Each atom when looked from a distance appears electrically neutral. However, on
a closer look, each atom has at its centre a nucleus with positive charge Ze. Here,
‘e’ stands for a unit of charge e = 1.6 × 10–19 C. Nucleus contains certain number
N of neutrons, each neutron being neutral, and Z number of proton each with + 1e
charge. Total number of nucleons (neutron and protons together are called nucleons)
in a nucleus is denoted by A, called atomic mass number and A = (N + Z). Number of
30 1 Engineering Materials, Atomic Structure and Bounding
protons Z in a nucleus decides the amount of positive charge on the nucleus and (Z) is
called the atomic number of the nucleus/atom. The nucleus of the atom is surrounded
by a spherical or nearly spherical distribution of negatively charged cloud made of
electrons, each electron denoted by the symbol ‘e’, has − 1e unit of negative charge.
Symbol e is used to denote both the unit of charge as well as an electron, but this
does not create any confusion as the context of its use immediately tells whether it is
used for denoting electron or for charge. The total number of electrons in this cloud
is Z, so that the total negative charge surrounding the nucleus of atomic number Z
is – Ze.
When looked from a distance (much larger than the size of the atom), both the total
positive charge in the nucleus (+ Ze) and the total negative charge (− Ze) contained
in electron cloud appear as if they are held at a point at the centre of the atom (centre
of the nucleus). Since total negative charge is equal in magnitude to the total positive
charge, the net charge at atom’s centre becomes zero. Thus atom, looked from a
distance, appears electrically neutral.
additional quantum number does not appear in the solution of Schrodinger equation
but is added to account for the two possible spin orientations of the electron. Electron
has an inherent spin of value 21 ℏ, here ℏ is quantum mechanical unit of measuring
spins. Magnetic spin quantum number (or simply spin quantum number) m s in case
of electron can have only two possible values; + 21 ℏ or − 21 ℏ. Although in principal
it is not possible to understand any quantum mechanical processes in classical terms,
however, for the sake of understanding, the two inherent spin motions of electron
may be associated with clockwise and anticlockwise directions of spin.
Microscopic systems or entities that follow quantum mechanics also obey a law
or principle called Pauli’s exclusion principle, according to which two electrons
in a given system (or atom) cannot have the same values for all the four quantum
numbers.
[ ∗ The region] of three dimensional space around the nucleus where the func-
tion ψn,l,m l
ψn,l,m l has maximum value is called the atomic orbital or simply
orbital. Orbital is a region of space around the nucleus of the atom where probability
of finding an electron with specified quantum numbers is a maximum. Classically,
orbital may be associated with classic orbit or Bohr orbit of the electron. But with the
difference that classic electron orbit is a well-defined sharp circular/elliptical path in
which electron travels around nucleus, while orbital is a volume of space around the
nucleus where the probability of finding electron with given set of quantum numbers
is maximum. Since there may be many different combinations of electron quantum
numbers, there are several orbitals for an atom.
Let us understand the physical significance of these quantum numbers.
(a) Principal quantum number ‘n’ Principal quantum number ‘n’ defines the
energy level of the electron or principle shell in atom. In quantum mechanics
particles can have only discrete values of energy. Principal quantum number ‘n’
can have only positive non-zero integer values, i.e. n may have values: 1, 2, 3,
4 and so on. Principal quantum number ‘n’ also determines the mean distance
of the electron from the centre of the atom that is from the nucleus. An energy
level with principle quantum number ‘n’ may accommodate a maximum of 2n2
electrons. Thus,
Energy level for which n = 1, may have at the most 2 electrons.
Energy level for which n = 2, may have at the most 8 electrons.
…
…
Energy level for which n = 5, may have at the most 50 electrons.
All electrons in a level of given principal quantum number ‘n’ have very nearly
same energy, but their other quantum numbers (l, m l , and m s ) are different.
As a matter of fact the maximum number of electrons 2n2 in energy level ‘n’ is
nothing but the number of different valid combinations of the remaining three
quantum numbers (l, m l , and m s ).
32 1 Engineering Materials, Atomic Structure and Bounding
(d) The Magnetic spin quantum number m s As already mentioned, m s does not
arise from Schrodinger equation, it is included to specify direction of the inherent
spin of the electron. In classical term, if the electron is spinning in clockwise
direction than ms = + 1/2 and if it is spinning in anticlockwise direction then
ms = − 1/2. These assignments of + 1/2 and − 1/2 are totally arbitrary.
Let us now consider a typical case, suppose there is an electron in prin-
cipal orbital or shell defined by n = 2. We shall now workout what possible
combinations of quantum numbers this electron may have.
Possible values of azimuthal quantum number l that this electron may have are:
l = 0 and l = 1.
Since there are two possible values of l, there will be two sets of values for
magnetic quantum number m l .
The set corresponding to l = 0 will have only one value m l = 0.
The set corresponding to l = 1, m l may have three values: m l = − 1, 0 and
+ 1.
Now corresponding to each set of values of n, l and m l , spin quantum number
ms may have two values: + 1/2 and − 1/2.
Table 1.9 lists the sets of quantum numbers n, l, m l and m s for principle orbital
of n = 2, such that at least one of these quantum numbers is different. If
orbital − 2 has a single electron then it may have one of the eight different
sets of quantum numbers. Each set defines a sub-orbital or sub-shell within
principle shell-2. Table 1.9 tells that second principle shell (n = 2) has eight
sub-shells. Since quantum numbers associated with each sub-shell are different,
a maximum of eight electrons may be accommodated in second principle shell
(Pauli’s exclusion principle).
Table 1.9 Possible sets of different quantum numbers in orbital (shell) of principal quantum number
n=2
Serial Principal quantum Azimuthal quantum Magnetic quantum Spin
number number n number l number ml quantum
number ms
1 2 0 (s) 0 + 1/2
2 2 0 (s) 0 − 1/2
3 2 1 (p) −1 + 1/2
4 2 1 (p) −1 − 1/2
5 2 1 (p) 0 + 1/2
6 2 1 (p) 0 − 1/2
7 2 1 (p) +1 + 1/2
8 2 1 (p) +1 − 1/2
34 1 Engineering Materials, Atomic Structure and Bounding
It is easy to show that the first principal orbital (shell) has only two sub-shells
with set of quantum numbers (n = 1, l = 0, m l = 0, m s = +1/2) and (n =
1, l = 0, m l = 0, m s = −1/2). It is left as an exercise to show that the third
principal shell (n = 3) will have 18 sub-shells and that fourth principal shell 32
sub-shells.
It follows from above that a maximum of two electrons can be accommodated
in I-principal shell, a maximum of 8 electrons in II-principal shell, a maximum
of 18 electrons in III-principle shell, a maximum of 32 electrons in IV-principal
shell and so on.
The principle shells or orbitals are also called electron energy levels and sub-
shells as electron energy states. In n = 1 energy level there are two possible
energy states. Similarly in n = 3, energy level there will be 18 energy states.
Orbital is a three-dimensional space round the nucleus where there is large probability
of finding an electron. There may be several orbitals like, 1s (n = 1, l = 0), 2s (n
= 2, l = 0), 1p (n = 1, l = 1), 3d (n = 3, l = 2) etc.. All these orbitals have
different shapes. Radial probability distribution of electron in hydrogen atom in
1s orbital is shown in Fig. 1.18a. As may be observed in this figure, probability
of finding the electron sharply increase with radial distance, reaches a maximum at
around 0.1 nm from the nucleus and then starts dropping sharply, touching a very low
value at around 0.2 nm and then becomes almost zero with in a small distance. The
probability distribution for 1s orbital is symmetrical in all directions and, therefore, it
appears spherical in 3-D space; the surface boundary diagram of 1s orbital is shown
in Fig. 1.18b where it may be observed that almost 95% chance of finding the electron
is in a spherical volume lying from 0.08 to 0.17 nm from the nucleus. In Fig. 1.18b
the darkness of the colour shade indicates the probability, darker the colour higher
the probability.
Radial probability distribution of electron for 2s orbital (n = 2, l = 1) is shown
in Fig. 1.19a. Note that in this case probability increases from r = 0 and attains a
small maximum value at around r = 0.05 nm and then falls sharply to zero at about r
= 0.1 nm. After touching zero value probability again raises and attains a maximum
value at around r = 0.28 nm and then falls off sharply. In contrast to the case of 1s,
radial probability distribution for 2s orbital shows two maximums, one smaller and
the other larger. In the region between these two maximums probability of finding
electron is zero. This region with probability zero is called the node. Like 1s orbital,
radial probability distribution for 2s is also same in every direction. Boundary surface
diagram of 2s orbital is given in Fig. 1.19b.
Radial probability distribution for 2p orbital, shown in Fig. 1.20, is not symmet-
rical; it has different shapes along the X-, Y- and Z-directions. Though probability
distribution has two bob structure in each direction, the orientation of these bobs
1.2 Atomic Structure 35
Fig. 1.18 a Electron probability distribution as a function of distance from the nucleus for orbital
1s. b Boundary surface diagram for 1s orbital
Fig. 1.19 a Radial probability distribution for 2s orbital. b Boundary surface diagram for 2s orbital
Fig. 1.20 Three different orientations of electron probability distributions for 2p orbital
While introducing quantum numbers associated with electron, it was stated that
principal quantum number ‘n’ essentially defines the energy of the electron. Elec-
trons with principal quantum number n = 1 mean electrons that are nearest to the
nucleus; most tightly bound to the nucleus, having largest negative binding energy
and minimum absolute energy. Electrons with n = 2 are not as close to the nucleus
as n = 1 electrons, have negative binding energy but less than that of n = 1 elec-
trons; have absolute energy more than that of electrons of n = 1 orbital. It may thus
be observed that actual (negative) binding energy of electrons decreases, absolute
energy and distance from the nucleus increases as the value of principle quantum
number ‘n’ increases. One may, therefore, draw an energy level diagram for elec-
trons on a vertical scale as shown in Fig. 1.21. It may, however, be said that this
energy diagram is based on quantum mechanical solution of Schrodinger equation
for hydrogen atom.
The fact that a given principal orbital or major shell contains sub-orbitals or sub-
shells brings out the fact that an electron in different sub-shells of a given major shell
will have different energies. It means that the energy of an electron is decided not
only by the principle quantum number ‘n’, but it also depends on the value of the
azimuthal quantum number ‘l’. Actually, an electron put in different sub-shells of a
given major shell will have slightly different energies in different sub-shells.
Though exact solution of Schrodinger equation is possible only for hydrogen atom,
electron energy level scheme obtained for hydrogen atom may be extended, with
some modifications, to obtain the electron configuration for atoms of other elements.
By electronic configuration one means how electrons are distributed in different
orbitals in an atom. From the analysis of hydrogen atom, it is known that principal
orbital with n = 1 may accommodate a maximum of two electrons in sub-state 1s;
principal orbital n = 2, a maximum of 8 electrons (two electrons in sub-state 2s and
6 electrons in sub-state 2p) and so on. The problem in case of atoms other than that
of hydrogen is to find out the sequence of sub-orbitals with increasing energy.
With the help of the rules discussed above it is possible to write the electron config-
uration for atom of any element. There are three ways of representing electron
configurations.
(a) Orbital notation method In this method the orbitals that have electrons are
written in order of increasing energy and the number of electrons in each
orbital are given as a superscript to the orbital. For example, nitrogen atom has
seven electrons and its electronic configuration may be written as: 1s2 2s2 2p3 .
Figure 1.24 explains the meanings of each character of the notation.
(b) Orbital diagram method In this method orbitals having electrons are repre-
sented by boxes and are written in the order of increasing energies. Electrons in
each orbital are represented by arrows, direction of arrows indicating the direc-
tion of electron spins. For example, the electron configuration of some elements
atoms are given in the last column of Table 1.10.
(c) Short-hand form In this method the last completely filled orbital or shell is
represented in terms of a noble gas. For example, the electron configuration of
lithium in this notation may be written as [He] 2s1 . Electron configurations for
some elements in different notations are given in Table 1.10.
Shell or orbital of highest energy (largest value of n) that has some electrons is
called the valence shell or valence orbital, and the electrons it contains are called
valence electrons. For example the valence shell for Calcium is 4s2 with two valence
electrons; the valence shell for Argon is 3s2 3p6 orbital with (2 + 6 =) 8 valence
electrons; Aluminium has 3s2 3p1 shell as valence shell and it has (2 + 1 =) 3 valence
electrons. It is important to remember that all electrons in different sub-shells of the
highest ‘n’ value shell (that has some electrons) are counted as valence electrons. The
number of valence electrons in case of noble gases is eight. Therefore, it is concluded
that eight electrons in valence shell of any atom make it very stable and chemically
inert.
Valence shell and valence electrons are important because most of the chemical
and some physical properties of the atom are decided by the valence shell and valence
electrons. It is valence electrons that take part in chemical reactions and decides the
type of bonding with other atoms to make molecules.
1.2 Atomic Structure 41
similarly, the valence electron configuration for element 48Cd is: 5s2 4d10 and for
the element 43Tc the outer electron configuration is: 5s2 4d5 .
The electron configuration rules stated above holds good in most cases but there
are four outstanding exceptions where these rules fail to give the correct electron
configuration. These four cases are of:
(a) Chromium Cr; According to the rules the electron configuration of Cr should
be [Ar] 4s2 3d4 but actually it is [Ar] 4s1 3d5
(b) Similarly, for copper Cu, according to rules the electronic configuration should
be[Ar] 4s2 3d9 but its actual configuration is: [Ar] 4s1 3d10
(c) Silver (Ag) according to rules should have electron configuration of [Kr] 5s2
4d9 but actual electron configuration for silver is: [Kr] 5s1 4d10
(d) Also in case of Gold (Au) according to rules the electronic configuration should
be [Xe] 6s2 5d9 but the actual configuration is: [Xe] 6s1 5d10
As may be observed in all the above cases, an enhanced stability is acquired by
half or fully filled sub-shells.
Only valence electrons take part in chemical reactions and in forming molecules,
etc. The inner electrons are generally well protected and mostly do not take part
in combination processes. An American chemist, G. N. Lewis introduced a simple
notation to represent valence electrons in an atom. These notations are called Lewis
symbols. Lewis symbols for elements of second period of periodic table may be
given as (Fig. 1.26).
The dots around the chemical symbol of an element give the number of valence
electrons in the atom of the element. The valency of the element is equal to the
number of dots around it or 8 minus the number of dots around.
Electron configurations discussed above apply to the ground states of atoms.
However, when an atom is excited by giving some energy by an external source, say
by heating, etc., few electrons from its valence orbital shift to the next higher orbital.
Therefore, the electron configuration of an excited atom is different from the ground
state configuration. Similarly, electron configuration of an atomic ion is different
from that of the parent atom.
SAQ: What is the difference between a classical electron orbit and quantum
mechanical orbital?
SAQ: Does the energy of all electrons in a given principal orbital exactly same?
SAQ: Which quantum number includes Pauli’s exclusion principle in quantum
description of electron’s motion in an atom?
SAQ: Write electron configurations of a singly ionised Sodium ion and a doubly
ionised lithium ion.
When two atoms are brought near to each other two types of Coulomb forces come
into play; the repulsive forces between the positively charge nuclei of the two atoms
and between their electron clouds and attractive forces between electron cloud of
one atom and the nucleus of the other atom. The magnitude of both types of forces
increases with the decrease of relative separation r. Figure 1.27a shows the variation
of the attractive, repulsive and net forces between two atoms as a function of their
mutual separation r. The intra-atomic separation r 0 corresponds to the mutual separa-
tion where attractive and repulsive forces cancel each other and the two atoms are in
a state of equilibrium. It is well known that any force F may be converted into poten-
tial energy V (or force F may be derived from potential V ) using the mathematical
operation given by expression;
( )
∂F ∂F ∂F
V = −gradF = − + +
∂x ∂y ∂z
Net potential energy between the two atoms as a function of intra-atomic sepa-
ration r, obtained by using the above expression, is shown in Fig. 1.27b. It may
be observed in this figure that net attractive force gives rise to the negative poten-
tial energy V 0 which is responsible for the binding of the two atoms. The negative
potential energy has its maximum (negative) value at a separation r0 , the equilibrium
distance. Two atoms develop a bond only when they are at a relative separation of r 0 ,
at larger separation they do not bind with each other as shown in the figure. The nega-
tive binding energy decreases on both sides of r 0 , and, therefore, the two atoms are
held fixed at a separation of r 0 . Further, larger the magnitude of − V 0 , more tightly
the two atoms are bound with each other. Atoms bound with each other makes a
molecule. As a matter of fact the magnitude of V 0 decides many properties, physical
and chemical, of the pair of two atoms or the molecule. For example, a larger value
of V 0 corresponds to a higher melting point.
1.3.1 Electronegativity
Chemical reactivity of an atom is decided by the valence electrons in the outer most
orbital of the atom. If the valence shell is completely filled, like that of inert gases,
44 1 Engineering Materials, Atomic Structure and Bounding
the atom has almost no chemical reactivity. However, in case the valence shell is only
partially filled, the atom shows chemical reactivity. Chemical reactivity of atoms may
be measured in terms of parameters called electronegativity or electropositivity.
Figure 1.28 shows the periodic table of elements. The electronegativity of elements
in periodic table starts from almost the middle of the periodic table and increases
towards right. Electronegativity decreases towards left, and atoms with lower value
of electronegativity are said to have electropositivity. In periodic table highly elec-
tronegative halogens and highly electropositive alkali metals are separated by the
noble gases. Atoms of elements of group-1 and group-2 of the periodic table (enclosed
in box A) have partially filled valence shell; they have only one or two electrons in
their valence shell. These atoms having almost empty valence shell, with lower elec-
tronegativity, have the tendency to give up their electrons to other atoms of higher
electronegativity when they come in contact with them. Since they have the tendency
to give their electrons and by doing so they acquire net positive charge, these atoms
are said to have electropositivity. For example, if we consider element potassium
(K) its electron configuration is: 1s2 2s2 2p6 3s2 3p6 4s1 or [Ar]4s1 . The valence shell
of potassium 4s1 contains only on electron. Potassium has the tendency to give up a
electron and becomes positive ion;
On the other hand atoms of elements on right side of the periodic table like,
chlorine (Cl) has many electrons in its valence shell but is not completely filled.
Electron configuration of chlorine is: [Ne] 3s2 3p5 . The valence shell of chlorine
(3s2 3p5 ) has seven electrons, one electron short of the maximum number that (sp)-
orbital can accommodate. Chlorine is highly electronegative; it has the tendency of
taking an electron and becoming a negative ion (anion);
Elements contained in box A in Fig. 1.28 are typical of metallic character (elec-
tropositive), and those contained in box B have characteristics which are intermediate
between metals and non-metals, possessing different degrees of electronegativity.
46 1 Engineering Materials, Atomic Structure and Bounding
Though several attempts were made to explain the bond formation between atoms
on the basis of electron structure, it were Kossel and Lewis, who independently gave
a satisfactory explanation in 1916. They studied the electron structure of noble gases
and found that all of them have eight electrons in their valence shell. Based on this
observation, Kossel and Lewis developed a theory for combination between atoms
called ‘electronic theory of chemical bonding’. According to this theory atoms
can combine either by transfer of valence electron from one atom to the other or
by shearing of valence electrons in order to have an octet (eight) of electrons in
their valence shells. This is called octet rule. The octet rule though useful but is not
universal. There are some limitations of the octet rule. Octet rule essentially applies
to atoms of the second group of periodic table.
It is clear from above that characteristics of valence electrons and the net force of
attraction between two atoms create attractive bonds between atoms. Depending on
their strength and other characteristics bonds between atoms may be divided into
two classes: (a) primary bonds between atoms and (b) secondary bonds between
atoms and molecules.
Primary bonds may further be divided into three types: (i) ionic bond, (ii) covalent
bond and (iii) metallic bond while secondary bonds are into two types: (i) van der
Waals bond and (ii) hydrogen bond.
(A) Primary atomic bonds Primary atomic bonds are characterised by large inter-
atomic forces. Primary bonds involve valence electrons of interacting atoms
and arise from the tendency of atoms to acquire stable electron structure of
completed valence shell. They may be nondirectional or localised (directional)
and may be produced by electron transfer, electron shearing or delocalisation
of electrons
(i) Ionic or electrovalent bond These bonds are formed when two atoms
of very different values of electronegativity combine together to form a
molecule. Ionic bonds are formed typically between highly electropositive
(metallic) and electronegative (non-metallic) elements. For example when
an atom of sodium with very low value of electronegativity (or high value
of electropositivity) combines with an atom of chlorine which has very
high value of electronegativity, an ionic bond gets established between
the two atoms.
1.3 Bonds Between Atoms and Ions 47
As shown in Fig. 1.29, sodium (Na) has only one electron in its valence
shell 3s. Since sodium is electropositive it has the tendency of giving
away this electron, chlorine on the other hand is highly electronegative,
has 7 electrons in valence shell 3s2 3p5 and has the tendency of acquiring
electrons. As a result, when an atom of sodium comes sufficiently close
to the chlorine atom to be with in its Coulomb field, it gives its only
valence electron to the chlorine atom. With this transfer of electron sodium
atom becomes a positive ion and on receiving an extra electron from
sodium, chlorine atom becomes a negative ion. A bond gets established
between positive sodium ion and negative chlorine ion due to coulomb
attraction between them. Thus ionic bonds are formed by the transfer of
valence electron(s) from the lower electron negativity atom to the higher
electronegative atom.
Only valence electrons take part in bonding while the inner shell electrons
and the nucleus of the atom do not take part in chemical bonding, being
well shielded by valence electrons and large force of attraction between
the nucleus and inner shell electrons. Therefore, in pictorial representa-
tions of bonds, the nucleus and inner shell electrons are often represented
by the core. Core of the atom has a positive charge equal in magnitude
of the number of electrons in the valence shell. Figure 1.29 explains the
ionic bond formation in case of NaCl. As shown in this figure, on forming
the bond the size of Na+ ion shrinks (as compared to the size of Na atom)
while the size of negative Cl− ion also shrinks but not so much as that
of sodium ion. The reason for this reduction of size in case of sodium
ion is the fact that on losing the only valence electron the valence shell
disappears. The size of sodium ion then reduces to the size of its core. The
size of negative chlorine ion also decreases because with the increase of
48 1 Engineering Materials, Atomic Structure and Bounding
Fig. 1.30 a Ionic bonds are generally non directional. b Structure of an ionic solid
negative charge in the valence shell, attractive force by its core increases
which results in a valence orbital of reduced size.
Ionic bonds do not have any preferred direction; it is because of the
fact that both the positive ion and the negative ion attract each other by
forces of equal magnitudes, as shown in Fig. 1.30a. Ionic solids have large
lattice energies ranging from 600 to 3000 kJ/mol and have high melting
temperatures. Melting point for NaCl, for example, is 801 °C. The binding
energy for NaCl is ≈ −7.42 × 10−19 J = 4.63 eV.
Ionic solids have a regular arrangement of alternate positive and negative
ions in three dimensions, as shown in Fig. 1.30b. Ionic solids are mostly
ceramics; they are bad conductors of heat and electricity and often brittle.
(ii) Covalent bonds
The term covalent bond was coined by the American Chemist Irving Lang-
muir in 1919. Covalent bonds are formed when two atoms of either same
or nearly same electronegativity join together. This typically happens
in non-metals. In case of covalent bonding, transfer of electrons from one
atom to the other does not take place. Instead, the two interacting atoms
shear electrons to complete the octet (eight electrons each) or duplet in
their valence shells.
Large number of elements in periodic table have either s-shell or the
combination of sp orbitals as their valence shell. The maximum number
of electrons that may be accommodated in s-shell is 2, and for sp shell
8 (2 + 6), therefore, atoms try to complete electron duplet, if s-shell is
valence shell or octet if sp shell is valence shell.
1.3 Bonds Between Atoms and Ions 49
Fig. 1.31 a Covalent bonding in Cl2 molecule. b Lewis dot structure for Cl2 molecule
Fig. 1.32 a Pictorial representation of four covalent bonds in methane (Ch4 ) molecule. b Lewis
dot structure for methane molecule
Fig. 1.33 a Lewis dot structure of C2 H4 molecule with double covalent bonds in carbon atoms.
b Lewis dot structure for N2 molecule with triple covalent bonds in nitrogen atoms
1.3 Bonds Between Atoms and Ions 51
Fig. 1.34 a Covalent radii of the two atoms are r a and r b , while R the separation between the two
nuclei is the bond length. b Bond angle in H2 O molecule
(b) Bond angle It is defined as the angle between the orbitals containing
bounding electron pair around the central atom in a molecule or
complex ion. Bond angle is expressed in degrees, and it gives some
idea about the distribution of orbitals around the central molecule,
which means the shape of the molecule. For example, in case of
water molecule the bond angle is 104.50, as shown in Fig. 1.34b.
(c) Band order In Lewis description of covalent bond, bond order is the
number of covalent bond in the molecule, for example in H2 bond
order is 1, in O2 bond order is 2 and in N2 the bond order is 3.
(d) Polarity of bond When covalent bond is formed in two similar atoms
like H2 , O2 , Cl2 , etc., the shared electrons are equally attracted by
the two atoms and, therefore, the electron pair is situated exactly
between the two identical atoms or nuclei. Such a bond is called
a non-polar covalent bond. However, when two dissimilar atoms
are coupled through a covalent bond, like HF molecule, the sheared
pair of electrons shifts more towards the fluorine because of the
larger force of attraction by it than that by hydrogen nucleus. The
bond in this case is a polar covalent bond. Shifting of the paired
electrons from the centre gives rise to the formation of an electric
dipole. The molecule that has polar covalent bond behaves as a tiny
electric dipole, often represented by the symbol μ. Dipole strength
of such atoms is measured in a unit called Debye denoted by D.
Further 1 D = 3.33564 × 10–30 C m. Dipole moment is a vector
quantity, and the direction of the vector is indicated by direction
52 1 Engineering Materials, Atomic Structure and Bounding
of shift of the shared electron pair from the central position, i.e.
in case of HF molecule from H atom towards F atom. Because of
the associated dipole moment polar covalent bonds are said to have
specific directions.
(e) Bonding energy The bonding energies of covalent bonds may be
very different; it may be very high, for example in case of diamond
which is the hardest material having a melting point of > 3550 °C.
On the other hand bonding energy may be very low as in the case of
bismuth which has a low melting point of around 270 °C.
No material has 100% ionic bonds or 100% covalent bonds; materials
that have predominantly ionic bonds also have a small percentage of
covalent bonds and vice versa.
(iii) Metallic bond
Metallic bonds are found in metals and their alloys. Such bonding occurs
when atoms of low electronegativity join together. Since low electroneg-
ativity atoms have the tendency to lose their valence electrons; the inter-
acting atoms lose all their valence electrons which form a cloud of delo-
calised electrons. No electron of the cloud is essentially attached with
any particular atom rather all electrons are attached with positive cores
of all atoms. Metallic bonding may be looked as an extreme case of
covalent bonding; in covalent bonding nearby atoms shear their valence
electrons but in metallic bonding all atoms shear their valence electrons.
Electrons that are not bound to any particular atom are called delocalised
electrons. Cloud of delocalised electrons works as glue to bind positively
charged cores of atoms, which in absence of delocalised electron cloud
will repel each other and break the crystalline structure of the metal. Alter-
nately, a metallic crystal that has N number of atoms may be looked as
an N-atomic molecule and the cloud of delocalised electrons as electrons
moving in large number of different molecular orbitals. Since the number
of molecular orbitals of a molecule having N atoms will be very large,
criss-crossing each other, the delocalised electrons in these molecular
orbitals appears as an electron cloud.
Looking from the point of quantum mechanics, each electron of the
electron cloud has a discrete set of closely spaced energy levels. Since
the number of delocalised electrons in the cloud is very large, the total
number of electron energy levels becomes very large, almost a continuum
of levels. Only the low lying energy levels of the continuum are filled
with electrons, but large number of electron levels is empty. If energy is
supplied to these delocalised electrons by some external source, say by
shining light on the metallic crystal, electrons absorb the incident light
photons of all frequencies and go to their respective higher energy levels
which were empty. Since the mean life of these excited states is very short
(< 10−9 s), the excited electrons revert back to their lower energy states
1.3 Bonds Between Atoms and Ions 53
Fig. 1.35 a An electric dipole of dipole moment μ. b Attraction between dipole molecules/atoms
give rise to secondary bonding
atoms have completely filled valence shell (s2 p6 ) and therefore, cannot
form primary bonds. However, when two atoms of a noble gas come close
to each other, they induce electric dipoles in each and these dipoles align
to form a fluctuating dipole secondary bond, as shown in Fig. 1.36a.
(ii) Permanent dipole bond However, there are molecules that are permanent
dipoles, like that of NaCl, which have ionic bonding between Na+ and Cl−
ions. In such molecules close packing and alignment results in formation
of permanent dipole secondary bonds (Fig. 1.36b).
(iii) Hydrogen (secondary) bond Hydrogen bond is also a secondary dipole
bond but it is much stronger compared to other secondary bonds.
Hydrogen bond energy may be as large as 50 kJ/mol. This bond is found
Fig. 1.36 a Fluctuating dipole secondary bond b permanent dipole secondary bond
1.3 Bonds Between Atoms and Ions 55
SA 1.1 Two metals A and B respectively, have face centred (fcc) and body centred
(bcc) crystal structures. Which of the two metals will be more ductile and
why? (II).
SA 1.2 What are delocalised electrons? Discuss their role in determining the
resistivity of a metal.
SA 1.3 Why the resistivity of a metal increases with temperature?
SA 1.4 Why does a piece of metal become more malleable on heating and tough,
brittle on cooling and hammering?
1.3 Bonds Between Atoms and Ions 57
SA 1.5 How one may define a Ceramic? Are both Diamond and Graphite Ceramic,
if yes, why?
SA 1.6 How can one differentiate between Metals, Ceramics and Polymers on
the basis of bonding in them? Which property of Ceramics is very much
affected by the relative strength of bond types in it?
SA 1.7 What are thermosetting and thermoplastic polymers? Give one example
of each.
SA 1.8 Discuss the process of crack/fracture repair in ceramic fiber reinforced
CMCs.
SA 1.9 What are light metal reinforced (MMC) and where are these used?
SA 1.10 Which Composite you will use for fabricating light but strong car body.
Give reasons for your answer.
SA 1.11 State and explain the rules that govern the distribution of electrons in an
atom.
SA1.12 Magnesium nucleus has 12 protons, write the electronic configuration of
magnesium atom in three notations.
SA 1.13 What will be the multiplicity of f-orbital if electron spin is neglected.
SA 1.14 What is the special feature of elements in a group of periodic table and
how does it effects the chemical behaviour of elements, explain with an
example.
SA 1.15 Electron configuration of 21 Sc is [Ar] 3d1 4s2 . What will be the configura-
tion of 22 Ti?
SA 1.16 How electro negativities of interacting atoms decide the type of bond
between them?
SA 1.17 How does the melting point of a material related to the strength of the bond
in its molecules?
SA 1.18 The HF molecule has a permanent dipole moment or not? Explain your
answer.
SA 1.19 What are delocalised electrons? How do they play role in creating atomic
bonding?
SA 1.20 What type of bonds do you expect between long molecular chains in
polymers?
(a) [He]2s2 2p2 (b) [He]2s2 2p3 (c) 1s2 2s2 2p2 (d) 1s2 2s2 2p1
ANS: (a), (c)
1.3 Bonds Between Atoms and Ions 59
LA 1.1 Summarise important properties of metals that differentiate them from other
materials. Explain how delocalised electrons are formed in metals and the
role they play in deciding thermal and electrical conductivities of metals.
LA 1.2 What are composites and how are they classified? Discuss important
properties and applications of fiber reinforced Ceramic matrix composites.
60 1 Engineering Materials, Atomic Structure and Bounding
LA 1.3 How can one define a ceramic? Name two ceramics that have electrical prop-
erties opposite to each other and two that have similar electrical properties.
Bring out differences between metals, polymers, ceramics and composites.
LA 1.4 What are polymers? What types of bond are usually found in polymers?
What is meant by the functionality of a monomer and how does it affect the
structure of polymer?
LA 1.5 Discuss quantum mechanical model of electron configuration in atoms and
explain physical significance of different quantum numbers giving their
possible values. What are the rules for filling electrons in different orbitals?
Explain by giving some example.
LA 1.6 What is meant by electronegativity? How does electronegativity decide
nature of bonds between two atoms? Describe various types of primary
bonds giving examples for each type.
LA 1.7 What are secondary bonds and how do they differ from primary bonds?
Give details of fluctuating and permanent dipole secondary bonds bringing
out the points of difference between them. What kind of secondary bonds
are found in water molecules? What are their special characteristics?
Chapter 2
Electrical Behaviour of Condensed
Matter
Objective
Electrical behaviour of solids will be discussed in this chapter. Basis of classifying
solids as insulator, semiconductor, conductor and superconductors will be discussed
in details. After the study of this chapter it is expected that the reader will be able to
understand the behaviour of different crystalline solids when they are subjected to
electric field.
2.1 Introduction
A material may possess several intensive properties that do not depend on the amount
of the material. These quantitative properties are often used as a metric by which the
advantages of one material over the other can be compared for material selection for
a specific purpose. The first and the most important electrical property of a material
is its ‘resistivity’, (or specific resistance) generally denoted by Greek letter ‘ρ’ (rho).
Resistivity is defined as the resistance offered by a unit cube (a block of 1 m × 1 m
× 1 m) of the material between its opposite faces (see Fig. 2.1). The MKS unit of
resistivity is ‘ohm-metre’ written as ‘Ω-m’ in short. The resistance R of a block of a
material of length L and uniform area of cross section A may be written as
L(m)
R(Ω) = ρ(Ω-m) 2
A m
Or
A m2
ρ(Ω-m) = R(Ω) (2.1)
L(m)
Fig. 2.1 a Resistance between two opposite faces of a 1 m × 1 m × 1 m cube is equal in magnitude
to the resistivity of the material. b Resistance of a bar of length L and uniform area of cross section
A is given by R = ρ LA
Both resistivity ρ and resistance R are the measure of the opposition that an
electric current faces while passing through the given specimen, however, ρ is an
intensive property of the material (it does not depend on the amount of the material)
while R is an extensive property that depends both on the size and shape of the
material. Resistivity ρ has a fixed value for a given material at a given temperature;
however, for same material it has different values at different temperatures. For most
substances the temperature dependence of resistivity is given as
ρT = ρ0 (1 + K T ) (2.2)
Table 2.1 Resistivity, conductivity and temperature coefficient for some materials
Element/material Resistivity (Ω-m) at Conductivity (S/m) at Temperature
20 °C 20 °C coefficient K (K)−1
Gold 2.44 × 10–8 4.10 × 107 3.40 × 10–3
Silver 1.59 × 10–8 6.30 × 107 3.80 × 10–3
Copper 1.68 × 10–8 5.96 × 107 4.00 × 10–3
Iron 9.70 × 10–8 1.00 × 107 5.01 × 10–3
Platinum 1.06 × 10–7 9.43 × 106 3.90 × 10–3
Gallium 1.40 × 10–7 7.10 × 106 4.00 × 10–3
Carbon (amorphous) 5.0 × 10–4 to 8.0 × 1.25 × 103 to 2.0 × − 0.5 × 10–3
10–4 103
Carbon (graphite) 2.5 × 10–6 –5.0 × 2.0 × 105 –3.0 × 105
Parallel to basal plane 10–6 3.30 × 102
Perpendicular to basal 3.0 × 10–3
plane
Gallium arsenide 1.0 × 10–3 –1.0 × 108 1.0 × 10–8 –1.0 × 103
(GaAs)
Germanium 4.60 × 10–1 2.17 − 48.0 × 10–3
Silicon 6.41 × 102 1.56 × 10–3 − 75.0 × 10–3
Diamond 1.00 × 1012 1.00 × 10–13
Teflon 1.0 × 1023 to 1.0 × 1.0 × 10–25 to 1.0 ×
1025 10–23
Elements/materials listed in Table 2.1 may be divided into three main classes;
(a) Have very high or large value of resistivity of the order of ≈ 1025 –1012 (Ω-m).
Such materials are called insulators. Insulators offer very high opposition to
the flow of current. An ideal insulator will have infinite value of resistivity and
will not allow current to pass through it.
(b) Materials that have resistivity value in the range of 102 –10–2 (Ω-m) with nega-
tive value of coefficient K. These materials are called semiconductors and are
extensively used for fabricating electronic devices.
(c) Materials for which the resistivity has small (but non-zero) value ≈ 10–5 –10–10
(Ω-m) are called conductors, and these materials offer small opposition to the
flow of current. It may also be observed that coefficient of temperature K for
conductors has a positive value, meaning thereby that the resistance of a given
piece of the specimen conductor will increase on increasing the temperature.
(d) A fourth category of materials that is not included in the table is called supercon-
ductors. Superconductors show zero resistivity under some special conditions
of temperatures, etc. Having zero resistivity or zero resistance, superconductors
are very important materials as no energy is consumed/wasted in passing current
through them.
64 2 Electrical Behaviour of Condensed Matter
In the backdrop of atomic theory of matter, according to which all matter is made
of atoms, the question arises as what is the reason for this difference in electrical
behaviour of different materials? One way of explaining the difference between insu-
lators, semiconductors, conductors and superconductors is in terms of the electron
band theory of condensed matter.
Most solids are crystalline; they have a regular arrangement of atoms in a pattern
which is repeated in three dimensions. Electric current in solids may flow only by a
net movement of charge carriers that is of electrons, under an electric field. Therefore,
for the flow of current in case of solids it is essential that there are relatively free
electrons that may move when an electric field is applied across it. In contrast, electric
current in liquids may also flow due to the motion of ions under applied electric field,
as it happens in case of electrolytes, etc. In order to understand the physics of current
flow and resistivity in solids, it is required to know: (a) which electrons in the solid
and (b) under what conditions these electrons may become carrier of current. For that
one has to understand the electron configuration in different atoms and how electron
configurations of crystals play a role in current flow.
According to the quantum mechanical model of the atom, electrons of an isolated
atom are distributed in discrete energy levels (or orbitals). Electron energy levels
are discrete but closely spaced. Let us take the example of element Aluminium
each atom of which has a total of 13 electrons. Electron configuration of Al-atom
is 1s2 2s2 2p6 3s2 3p1 , and energy distribution of electrons in different energy levels
for an isolated atom is shown in Fig. 2.2a. Electrons in highest energy shell (3s2 3p1
shell in this case) are called valence electrons. Valence electrons are least tightly
bound with the nucleus of the atom and take part in chemical reactions, etc. If two
atoms of Aluminium come very close to each other and form a diatomic molecule,
the electron energy levels of the di –atomic molecule will be like the one shown in
Fig. 2.2b. There will be two levels close to each other corresponding to the two atoms.
If three atoms come close enough to form a tri-atomic molecule, each level will split
into three closely spaced levels, and so on. In a very small crystal of Aluminium
there is very large number of atoms (≈ 1025 atoms) packed very close to each other,
and therefore, electron energy levels group up in bands of very closely spaced levels
separated by band gaps, as shown in Fig. 2.2c. The energy band that contains valence
electrons is called the valence band and the band next to it in energy the conduction
band. The band energy gap between the valence and the conduction band is called
forbidden energy gap. Energy band gaps are regions of energy where no energy
level of the material exists, and no electron of the atoms of material may have that
energy.
It is obvious that in solids only valence electrons may take part in current flow,
since other inner electrons are tightly bound with the nucleus. On application of an
electric field to the crystal, valence electrons in the crystal are subjected to a force and
2.2 Electron Energy Band Theory 65
Fig. 2.2 a Electron configuration of an isolated atom of Aluminium. b Electron energy levels for
a diatomic molecule of Aluminium. c Electron energy bands in an Aluminium crystal
try to gain energy. Valence electrons will be able to absorb energy only if there are
vacant levels available at higher energies. As such valence electrons will be able to
gain energy and move only if (i) there are vacant levels in the valence band and/or (ii)
when valence band is completely filled then the forbidden energy gap is non-existent
or small enough so that valence electrons may shift to the conduction band, which
is completely empty. Thus current flow in crystalline solids is decided by the nature
of the valence band and the forbidden energy gap. It may, however, be noted that a
partially empty valence band may allow the flow of current but only up to an extent
because of the limitation of available unoccupied levels in the valence band. On the
other hand, if the forbidden energy gap is small or does not exist, then considerable
number of electrons from valence band may take part in current flow as there will be
large number of unoccupied levels available to the electrons in conduction band.
SAQ: No electron level of the parent crystalline solid may exist in forbidden energy
gap, however, if some other atoms are imbedded /mixed in the crystal struc-
ture, electron level corresponding to the other atoms may exist in forbidden
energy gap or not?
So far we made only a qualitative discussion of electron energy bands in a crystal.
Though exact quantum mechanical calculations for many body systems are impos-
sible, however, approximate calculations show that energy gap between two consec-
utive bands depends strongly on the relative separation of atoms. Figure 2.3 shows
the variation of band gap with relative separation of atoms. It is clear from the figure,
66 2 Electrical Behaviour of Condensed Matter
depending on the packing of atom in the crystal the two consecutive bands may be
far apart (as at d1 ), may be separated by a small band gap (d2 ) or may overlap as at
distance d3 .
In the light of the above, three different situations may arise (i) forbidden
energy gap is quite large, and valence shell is completely filled; (ii) valence shell
is completely or partially filled but forbidden energy gap is small and (iii) forbidden
energy gap between the valence and conduction bands is either very small or does not
exist. Corresponding to these conditions materials may be divided into insulators,
semiconductors and conductors.
SAQ: What characteristics of atoms decide the size of forbidden energy gap?
SAQ: Valence and conduction bands in a crystalline solid overlap, the solid will be
a (insulator/conductor/semiconductor)? Choose the correct alternative.
2.3 Insulators
A crystalline solid in which valence shell is completely filled and forbidden energy
gap is quite large becomes an insulator. It is because in such materials valence
electrons do not find vacant levels to move when an electric field is applied to the
specimen. Since only the valence and the conduction bands play role in deciding the
electrical nature of materials, it is customary to show only these bands in pictorial
representation of band structures of solids. The band structure of a typical insulator
is shown in Fig. 2.4a, where the valence band is completely filled with the maximum
number of electrons it can hold and the conduction band is empty, however, the
forbidden energy gap E g between the valence and conduction bands is quite large,
larger than 5 eV. If the band gap is large, electrons do not acquire sufficient energy
from the applied electric field to overcome the band gap and shift to conduction band
2.3 Insulators 67
Fig. 2.4 a Energy band diagram for an insulator b a specimen of length ‘d’ subjected to a voltage
V develops an electric field E = V /d. c Table showing forbidden energy gap for some materials at
specified temperature
electric wires and cables. Ceramic insulators are generally used for high voltage
applications.
SAQ: What is the physical significance of specific dielectric strength of a material?
2.4 Semiconductors
Those materials for which forbidden energy gap lies in the range of 0.2–3.0 eV and
electron density at Fermi level of around 1020 m−3 are usually classified as common
type semiconductors. These limits are not very rigid some synthetic materials that
have almost 5 eV forbidden energy gap or even larger also behave as semiconductors.
In principle four factors decide the electrical nature of materials, they are (i) the
magnitude of the forbidden energy gap E g , (ii) the magnitudes of the crystal wave
number or crystal momentum vector (k) at the bottom of the conduction band and
the top of the valence band, (iii) number of available electron energy states around
Fermi energy and (iv) the mobility of charge carriers. It may, however, be mentioned
that all these properties are inter-related and are not totally independent.
Two elements in their very pure form (> 99.999… %) are natural semiconductors;
they are Germanium (Ge) and Silicon (Si). These two naturally occurring semicon-
ductors in their purest form, purity better than 1 part in billion, are called intrinsic
semiconductors. Semiconductors in which some impurity atoms are deliberately
mixed are termed as extrinsic or more frequently doped semiconductors.
It is interesting to observe the difference in electrical properties of elements that
are members of the 14th group of periodic table: 6 C, 14 Si, 32 Ge, 50 Sn and 82 Pb. These
elements have many common properties, for example, all of them have four electrons
in their valence shell, and all have diamond like crystal structure but their electrical
properties are quite different; carbon in diamond form is one of the best insulator,
Silicon is a semiconductor, Germanium is both a semiconductor and half metal, Tin
2.4 Semiconductors 69
(Sn) is a metalloid and lead (Pb) a metal. These differences in electrical behaviour
originate from the difference in the size of their atoms and relative separation between
successive atoms in their crystals. Relative atomic separation in crystalline structure
and the number of valence electrons decides the magnitude of the forbidden energy
gap. In case of diamond the forbidden energy gap is as large as 7 eV, and so it is insu-
lator, while for Silicon and Germanium the forbidden energy gaps are, respectively,
1.02 eV and 0.66 eV, and therefore, the two elements Si and Ge are semiconductors.
Tin has E g of the order of 0.01 eV, and so it is half metal. In the case of lead (Pb) the
valence and conduction bands overlap, and so it is a metal.
SAQ: Which two parameters of atoms of a crystalline solid decide the magnitude
of the forbidden energy gap?
(i) Purification of natural Silicon
The starting material used for the fabrication of semiconductor devices is
monocrystalline Silicon or Germanium. However, most importantly to tech-
nology, Silicon is the principle platform for semiconductor devices. Silicon is
one of the most abundant elements in the crust of earth. The process to transform
raw Silicon into a use-able single crystal substrate for modern semiconductor
processes begins by mining for relatively pure SiO2 . The relatively pure Silicon
dioxide is reduced with carbon in an electric furnace at temperatures ranging
from 1500 to 2000 °C. The reduction process yields metallurgical grade (MG)
Silicon of purity around 97%. However, this Silicon must be further purified to
bring down impurities below the parts-per-billion level. Though several different
methods may be used for further purification of MG-grade Silicon, however,
the two frequently used methods are discussed here.
(a) The trichorosliane method For further purification MG-Silicon is treated
with HCl to form trichlorosilane (TCS) in a fluidized-bed reactor at 300 °C
according to the following chemical reaction
steadily concentrated in the molten portions, which were swept away. The
technique has undergone several improvements since then.
The present day zone refining technique purifies solids by passing a number
of molten zones through the solid in one direction. Each zone carries a
fraction of the impurities to the end of the solid charge, thereby purifying
the remainder of the solid. The basic lay out of the process is demonstrated
in Fig. 2.5a where the impure crystal of the element (Si or Ge) in the form of
a rod is taken and a heating element that may be moved along the length of
the rod from one end to the other is placed, say, at the extreme left position
indicated by 1. When current is passed through the heating element the
part of the semiconductor rod immediately below the heater melts while
the remaining section of the rod remains in solid state. The heater is then
moved slowly towards right to position 2 and then to 3 and so on. As result
of the motion of heater, successive sections of the semiconductor rod go
to molten state then turn back to solid crystalline states one after the other.
Figure 2.5b shows a typical part of the road, where initially the heater was
above section AB, and the section ABBA of the rod was in molten state.
With time the heater has moved to the location above section BC of the
rod, and now the section of the rod BCCB is slowly turning to molten state.
In the mean time section ABBA, which was in molten state earlier, starts
solidifying, and process of solidification or the process of re-crystalline
starts from the face AA and slowly spreads towards the face BB. Two
important points to note are as follows: (i) the mobility of impurity atoms/
ions is more in molten phase of the semiconductor compared to its solid
crystalline phase, and (ii) the melting point of pure crystalline substance
is always higher than the impure substance. As a result impurities diffuse
from the part of the rod that is undergoing solidification to the part still in
molten phase and pure material crystallises at a higher temperature than
the molten material with impurities. Migration of impurities is shown by
red arrows in Fig. 2.5b. In this way, impurities accumulate at the right side
edge BB of section ABBA. When section BCCB of the rod changes to
Fig. 2.5 a Zone refining technique for improving purity of semiconductors. b Impurities shift from
solid phase to the molten phase
2.4 Semiconductors 71
in these roller groves. Both rollers are connected to the same motor and
may rotate at high speed in same direction. Steel wires are painted with
liquid abrasive paint and work as sharp cutting blades. When a monocrystal
is pressed through the wire blades, wafers of Silicon are cut through the
crystal.
A rough sketch of multiwire slicing machine is shown in Fig. 2.7. The
rough drawing is just to understand the principle of working, an actual
multiwire saw is much complicated which has several cylindrical derives
to guide the motion of wire blades and wire spools to maintain a continuous
supply of new wire.
Silicon and Germanium wafers are used for fabricating semiconductor
devices.
(ii) Fermi energy and Fermi level
Electron being a particle having spin 1/2 ℏ is a Fermion and obeys quantum
mechanical statistics called Fermi–Dirac statistics. According to this statistics
at absolute zero temperature (0 K) electrons start filling lowest energy states of
the system (like an atom) obeying exclusion principle, filling in to higher states
after exhausting all lower energy states. The resulting structure of electrons
is termed as ‘Electron Sea’ or ‘Fermi Sea’. The surface of this sea is called
Fermi surface or level. This means at absolute zero temperature no electron can
have energy larger than the Fermi energy Ef of the Fermi level. According to
2.4 Semiconductors 73
1
p(E) = (2.5)
1+e ( E−E f )/kT
Here, E f is the Fermi energy for the system and k is Boltzmann constant. Equa-
tion (2.5) gives the theoretical probability of finding an electron with energy E
at temperature T (K). In an actual system there will be an electron with energy
E only if an electron level with energy E actually exists. Often it so happens
that the given system does not have an allowed energy level at some energy,
for example, in case of crystalline solids no energy levels for electrons of their
atoms exist in forbidden energy gap, in that case Eq. (2.5) will still give some
probability of finding an electron for a level in forbidden energy gap.
When one calculates probability of finding an electron with energy equal to
Fermi energy, i.e. if E = E f then Eq. (2.5) gives
1 1 1
p Ef = = = (2.6)
1+e ( E f −E f )/kT 1 + 1 2
Expression (2.6) tells that probability of finding an electron with Fermi energy
E f is 0.5, and that this probability does not depend on temperature. So if
temperature is 0 K or 100 K, the value of p(E f ) will remain 0.5.
Energy band picture for an intrinsic semiconductor at absolute zero and at some
higher temperature T > 0 K is shown in Fig. 2.8.
At T = 0 K, valence band contains all valence electrons, and the conduction
band is empty, which has no electrons. Therefore, the probability of finding an
electron at the bottom of the conduction band is 0, while that of finding an elec-
tron at the top of the valence band is 1. The probability of finding an electron
74 2 Electrical Behaviour of Condensed Matter
Fig. 2.8 Band structure of a semiconductor at a Zero Kelvin (0 K) b at temperature T (K) > 0 (K)
with probability p(E f ) = 0.5 will be at a point midway between the bottom
of the conduction band and the top of the valence band, i.e., in the middle of
the forbidden energy gap. Therefore Fermi energy level for an intrinsic semi-
conductor lies in the middle of the forbidden energy gap. Further, since p(E f )
does not depend on temperature, the Fermi level for intrinsic semiconductor will
remain in the middle of the forbidden energy gap at all temperatures. Figure 2.8a
shows the conduction band, valence band and Fermi level for intrinsic semi-
conductor at absolute zero temperature. As may be observed in this figure, all
valence electrons of the intrinsic material at absolute zero temperature are in
valence band, and the conduction band is totally empty.
When temperature of an intrinsic semiconductor is raised above 0 K, electrons
in valence band absorb energy from the surrounding environment and if the
energy gained by a valence electron becomes equal or larger than the forbidden
energy gap E g , the electron jumps to the conduction band leaving an electron
vacancy in the valence band. This vacancy of electron which behaves as a
positive charge is called ‘hole’. Hole is just a fictitious entity but it is very useful
in understanding the physics of semiconductors. The concept of hole originates
from the quantum mechanical treatment of current flow in semiconductors.
It so happens that Schrödinger’s equation when applied to a semiconductor,
under some approximations, separates out into two independent components
one describing the motion of electrons and the other the motion of a positively
charged particle. The absence of electron in the valence band is thus assumed
to be the positive particle, hole, the motion of which is described by the second
component of Schrodinger’s equation. For all practical purposes hole is treated
as a positively charged particle with charge + 1 e and a mass very nearly (slightly
more) equal to the mass of an electron me .
2.4 Semiconductors 75
Electrons that have shifted to the conduction band at T > 0 K behave as delo-
calised electrons or free electrons that are not attached to any particular atom
of the semiconductor crystal. Conduction band electrons may be compared to
the delocalised electron cloud in case of metals which is associated with the
crystal lattice but not to any individual atom. When an intrinsic semiconductor
specimen at T > 0 K is subjected to an electric field by applying a voltage
across it, a current consisting of conduction band electrons and valence band
holes may flow through the specimen. Thus at absolute zero a pure or intrinsic
semiconductor behaves as an insulator while at a temperature T > 0 K, the same
specimen behaves as a conductor. Further in a pure semiconductor specimen at
any temperature T > 0 K the number of electrons in conduction band is always
equal to the number of holes in valence band, also the number of electrons in
conduction band (and holes in valence band) increases with the increase in the
temperature T as more valence electrons may jump to the conduction band at
higher temperature.
Fermi level assumes added importance in case of semiconductors, it may be
treated as a reference of energy; energy of electrons in conduction band increases
as one moves upwards from the Fermi level, while the energy of holes increases
as one goes downwards from the Fermi level. That means that an electron at
the top of the conduction band is most energetic while a hole at the bottom of
valence band has largest energy. Further, the probability of finding an electron
say X units of energy above the Fermi level is same as the probability of finding
a hole same X units below the Fermi level.
Figure 2.9 shows the conduction and valence bands for intrinsic Germanium
and intrinsic Silicon crystals at absolute zero and at temperature T > 0 K.
Figure 2.9 is self-explanatory, telling that both intrinsic semiconductors have
empty conduction bands at T = 0 K, and hence behave as insulator. However,
the two points of interest are (i) at T > 0 K, the number of free electrons
in conduction band in Ge is larger than the number of free electrons in Si,
because of the smaller value of its forbidden energy gap. (ii) The energy of free
electrons in conduction band increases vertically upward from the Fermi level,
while energy of holes in the valence band increases vertically downwards from
the Fermi level. Therefore, an electron at the top of the coduction band has
highest energy amongs free electrons, while a hole at the bottom of the valence
band is the one with highest energy amongst holes. Further in a specimen of
an intrinsic material the number of holes is always equal to the number of free
electrons in the conduction band. At any temperature T > 0 K, in an intrinsic
material, new holes and free electrons keep generating on one hand, and on the
other hand they also get annihiliated when a free electron falls back in valence
band and recombines with a hole. The process of electron + hole generation
and annihilation goes on simultaneously in such a way that the average number
of holes and free celectrons remain constant over a period of time.
SAQ: What is Fermi level and what is its physical significance?
76 2 Electrical Behaviour of Condensed Matter
SAQ: Do holes may move in intra atomic space like free electrons? Justify
your answer.
In the previous section we studied the electron band theory and its application in
distingushing different types of crystalline materials according to their electrical
properties. An other equivalent way of describing electrical properties of crystalline
solids is by using the covalent bond picture of these materials. Atoms of Silicon and
Germanium in their intrinsic crystals are held together by covalent bonds. Both these
atoms (Ge and Si) have four valence electrons in their valence shells. Electronic
configuration and arrangnment of electrons in a 14 Si atom are shown in Fig. 2.10.
Since only the valence electron of an atom take part in bonding and inner electrons
plus nucleus does not play any role, it is convinient to represent an atom by a core
(that has nucleus and inner electrons) with positive charge equal to the number of
valence electrons, (+ 4e) in case of Silicon, and four valence electrons. As a matter of
fact any atom with four valence electrons (like Si, Ge, Sn, Pb) may be represented by
a core of + 4e charge and four electrons. An atom for example of Aluminium which
has three valence electrons may be represented by a positively charged core having
2.4 Semiconductors 77
charge + 3e and three valence electrons and a penta valent atom (like phosphorus)
by a core of + 5e charge and five electrons.
In covalent bonding atoms share their electrons; in case of Si and Ge four neigh-
bouring atoms share their one electron each forming the crystal lattice. Figure 2.11
shows the covalent bonding in Silicon or Germanium intrinsic crystal at absolute 0 K
temperature.
As may be seen in the figure, at T = 0 K all covalent bonds are intact and all valence
electrons of each atom are shared by neighbouring four atoms. Since electrons are
held in covalent bonds because of the force of attraction of nearby positively charged
Fig. 2.11 Pictorial representation of covalent bonding in intrinsic Silicon or Germanium crystal at
absolute zero temperature
78 2 Electrical Behaviour of Condensed Matter
cores, they cannot move even when an electric field of moderate strength is applied
to the crystal, hence the crystal behaves as an insulator at T = 0 K.
Covalent bonds are characterised by bond energy; the energy by which electrons
are held within the bond. The covalent bond energy of Silicon is 1.08 eV and that of
Germanium 0.66 eV. It means that if an electron in the covalent bond of Silicon crystal
somehow gets energy either equal to 1.08 eV or large; it may break the covalent bond
and will become a delocalized or free electron which may hop from one atom to the
other in the intra atomic space of the crystal. It may be observed that the covalent
bond energy is simply equal to the forbidden energy gap of band theory. Electrons
may get energy in several ways, like by heating or by putting them in an electric field
etc.
If an intrinsic Silicon or Germanium crystal is heated to say some temperature
T > 0 K, its electrons in covalent bonds will acquire temperature T > 0 K and will
get thermal energy ≈ kT, where k is Boltzmann constant (k = 8.6 × 10–5 eV per
Kelvin). The most likely form of this thermal energy is kinetic energy associated
with vibrations, electrons which were stationary in covalent bonds at T = 0 K, starts
vibrating when the temperature of the specimen is increased. When thermal energy
of the electron increases beyond the bond energy (1.08 eV for Si and 0.66 eV for Ge),
it may break the bond and come out of the bond becoming a delocalised electron,
leaving a hole in the covalent bond. The structure of an intrinsic semiconductor
crystal (Si or Ge) at T > 0 K is shown in Fig. 2.12
The process of bond breaking resulting in creation of electron–hole pairs and
the opposite process of recombination of electron–hole pairs to remake some of
the broken bonds simultaneously keep going in the crystal at temperature T > 0 K.
Ultimately, equilibrium is reached when the rate of new electron–hole pair creation
becomes equal to the rate of recombination of electron–hole pairs.
At equilibrium, rate of reaction given by Eq. (2.7) becomes equal to the rate of
process represented by Eq. (2.8). As a result at any temperature T > 0 K; the average
number of free electrons and holes per unit volume of the semiconductor becomes
constant (Fig. 2.12).
Further, at equilibrium the average number per unit volume (called the number
density or carrier concentration) of free electrons and holes in the crystal is equal.
The average number density of electrons and holes in an intrinsic semiconductor
is equal and constant at a fixed temperature; however the average number density
increases with the rise of temperature. Further, it is a common practice to call free
or delocalised electrons simply as electrons and instead of saying average number
density, simply to say number density. Though obvious, but one must remember that
free electrons (these are the electrons which in electron band theory shifts to the
conduction band on acquiring of energy) are free to move within crystal while holes
2.4 Semiconductors 79
are always bound within the covalent bond and can move from one covalent bond to
the next.
If the (average) number densities of (delocalised) electrons and holes in an intrinsic
semiconductor at temperature T are, respectively, denoted by n ie and n ih , then
n ie = n ih (2.9)
The value of free electron number density n ie for Silicon at 300 K (nearly room
temperature) is of the order of ≈ 1.08 × 1010 cm−3 .
When a specimen of intrinsic semiconductor at T > 0 K is subjected to an electric
field by applying a voltage across its two opposite faces, the free electrons in the
specimen moves within the enter atomic space of the crystal in a direction opposite
to the electric field, as shown in Fig. 2.13. Holes that are bound in covalent bonds
also shift from one covalent bond to the next in the direction of the imposed electric
field (see Fig. 2.13). Resultant current I in the circuit is the sum of the electron and
hole currents. Therefore, both the electrons and holes participate in current flow in a
semiconductor.
SAQ: Consider Fig. 2.13 and draw three successive steps showing the motion of a
hole under applied voltage V.
Addition of a very small amount (of the order of 1 part in 107 parts) of impurities in
an intrinsic semiconductor crystal may drastically change its electrical conductivity,
80 2 Electrical Behaviour of Condensed Matter
Fig. 2.13 Motion of free electrons and holes under electric field
optical and structural properties. The crystal with controlled impurity added to it is
called a doped or extrinsic semiconductor crystal. The process using which small
amount of impurities is added in a controlled way is called doping.
The reason why deliberately added impurities are mixed only in small amount is
(i) to avoid any breakdown in crystal structure and (ii) addition of very small amount
of impurity is sufficient to change the conductivity of the intrinsic semiconductor to
the desired value. Addition of excessive impurity may turn a semiconductor into a
conductor. Further, only two types of impurities are added; impurity atoms with either
2.4 Semiconductors 81
Fig. 2.15 a Process of ion implantation. b Dopant profile in the ion implanted semiconductor wafer.
Highest dopant concentration at range equivalent depth
Here, dC
dx
is the concentration gradient of dopant atoms in direction x and
the negative sign signifies that motion of diffusing atoms is in the direction
of higher to lower concentration. The constant of proportionality D is called
diffusion coefficient. With diffusion of dopant from gaseous phase container
into semiconductor wafer, the concentration difference on the two sides will
decrease and assuming D to be constant, and flux of diffusing atoms will also
decrease. Ultimately, diffusion of dopant atoms will stop when concentration
of atoms in container and in wafer will become equal. The depth of diffusion
in wafer essentially depends on the temperature of dopant atoms, higher the
temperature larger the diffusion depth. Depth profile of diffused atoms in the
wafer at a given temperature is shown in Fig. 2.17b.
According to the law of conservation of matter, the change of dopant concen-
tration with time must be equivalent to the local decrease of the diffusion flux,
in the absence of a source or sink, therefore,
∂C ∂F ∂ ∂C ∂C ∂ 2C
=− = D or =D 2 (2.11)
∂t ∂x ∂x ∂x ∂t ∂x
Boron is the most common trivalent impurity dopant in Silicon; whereas arsenic
and phosphorous are used extensively as pentavalent impurities. These three
elements are highly soluble in Silicon with solubilises exceeding 5 × 1020
atoms per cc in the diffusion temperature range between 800 and 1200 °C.
These dopants can be introduced via several means, including solid sources
(BN for B, As2 O3 for As and P2 O5 for P), liquid sources (BBr for B, AsCl for
As and POCl3 for P) and gaseous sources as B2 H6 , AsH3 and PH3, respectively,
for boron, arsenic and phosphorous.
SAQ: Two exactly identical wafers of Silicon are doped using diffusion tech-
nology. The temperatures of diffusing gas are different for the two
wafers. What will be the difference in the profile of doped impurity in
two cases?
(iii) Doping at monocrystal growth stage
Impurities in controlled amount may also be introduced in a semiconductor
monocrystal while it is being grown from poly crystalline melt. If calcu-
lated amount of either tri or pentavalent dopants is mixed with the molten
polycrystalline a part of these impurities enter the monocrystal.
is of the order of 107 free electrons per cm3 . Apart from free electrons given by
impurity atoms, there are also some electron and holes that are created by the
breaking of covalent bonds. But the number of electrons given by donor impurity
per unit volume is much larger than the number of free electrons produced by
covalent bond breaking. Thus, at T > 0 K, an n-type semiconductor has (a) large
number of free electrons due to the ionisation of donor atoms + (b) large number
of positive ions of donor atoms that are fixed in crystal lattice and cannot move
+ (c) few electrons due to bond breaking and + (d) few holes due to covalent
bond breaking. Both free electrons and holes take part in current flow if an
n-type semiconductor is subjected to electric field by applying voltage across
it. Since the number of free electrons in n-type semiconductor is much larger
than the number of holes, free electrons are called majority carriers and holes
the minority carriers.
Electron band diagram of an n-type semiconductor is shown in Fig. 2.19ii where
the band picture for an intrinsic semiconductor is also given for comparison.
Three major differences between the two diagrams may be observed in Fig. 2.19;
(a) the conduction band of n-type semiconductor has large number of free elec-
trons, most of which are due to the ionisation of donor impurity atoms. (b) There
is a donor level, an energy state for impurity atoms, just below the bottom of
conduction band in forbidden energy gap. Energy difference between the bottom
of conduction band and donor level is equal to the binding energy of the fifth
electron of donor atom which could not be accommodated in covalent bonds. It
sometimes appear confusing that how there could be any level or energy state
in forbidden energy gap? At this point it is important to realise that forbidden
energy gap region is restricted only for energy states or levels of parent intrinsic
atoms, atoms of Si or Ge cannot have energy levels in forbidden energy gap;
however, other impurity atoms may have their energy states in the region of
forbidden energy. Further, Fig. 2.19 refers to a temperature T > 0 K, the temper-
ature at which most of the impurity donor atoms have ionised and thus one
2.4 Semiconductors 87
electron from each donor atom has jumped to the conduction band leaving a
positive donor ion at donor level. Positive donor ions in Fig. 2.19 are repre-
sented by the + sign. (c) An other very crucial observation from the figure is
that Fermi level of n-type material has shifted up words, towards the conduction
band, from the middle of forbidden energy gap. This has happened because of
the large number of electrons in conduction band as compared to the holes in
valence band. In intrinsic material at any temperature T > 0 K, the number of
electrons in conduction band is exactly equal to number of holes in valence
band as both are generated by the process thermal braking of covalent bonds.
An n-type semiconductor contains large number of free electrons as majority
carriers, few holes from covalent breaking, as minority carriers (see Fig. 2.20)
and large number of immobile positive ions of donor atoms fixed in crystal
lattice.
Fig. 2.22 Energy band picture at T > 0 K for a intrinsic semiconductor. b p-type semiconductor
that are minority carriers; (ii) the Fermi level is below the middle of forbidden
energy gap towards the valence band; (c) there is an acceptor level just above
the top of valence band holding negative ions of impurity acceptor atoms; (d)
there are large number of holes in valence band most of which are created by
the ionisation of trivalent impurity acceptor atoms (Fig. 2.22).
As in case of n-type semiconductor, the semiconductor is over all neutral,
similarly, p-type material is also over all neutral.
As shown in Fig. 2.23, a p-type semiconductor has holes as majority carriers,
electrons produced by bond breaking as minority carriers and negative ions of
acceptor impurity atoms fixed in crystal lattice as immobile charges.
The majority charge carriers, electrons in case of n-type semiconductor and
holes in p-type semiconductor are released by the impurity atoms only when
these atoms get ionised. Those impurity atoms that get ionised at room tempera-
ture and release majority carriers are called shallow impurities. The ionisation
energy for shallow impurities is of the order of ≈ kT where k is Boltzmann
constant and T ≈ 300 K.
When an intrinsic material is doped with small amount of impurity, the impurity
atoms in crystal lattice are far apart and do not interact with each other. Such doped
semiconductors are termed as non-degenerate semiconductors. The impurity level
(donor level in case of n-type and acceptor level in p-type) in non-degenerate semi-
conductor is discrete and sharp. However, if impurity concentration is relatively high
and impurity atoms in crystal lattice are near to each other, they interact and the impu-
rity level does not remain a single discrete level but it becomes a band of many energy
levels and sometimes this impurity band may overlap with the nearby (conduction
band in case of n-type and valence band in case of p-type semiconductor) band of
the semiconductor. Such semiconductors are called degenerate semiconductors.
Most of the doped semiconductors that are frequently used are non-degenerate
type.
SAQ: Draw a rough sketch of energy band picture for a degenerate p-type material.
The free electrons in the conduction band of a semiconductor are not really free; their
motion is constrained by the periodic potential of ions in crystal lattice. The effect of
periodic potential is included by assigning an effective mass to electron, denoted by
2.4 Semiconductors 91
m ∗n . Similarly, the motion of holes in covalent bonds is also restricted, and therefore,
holes are also assigned an effective mass m ∗h . The energy momentum relation for a
conduction band electron may be written as
pc2
E=
2m ∗n
Here pc is crystal momentum along a given crystal direction defined by the Miller
index. In some semiconductor materials, like Silicon (Si) and gallium arsenide
(GaAs), the maximum of valence band energy and the minimum of the conduction
band energy lie along the direction defined by pc = 0. As a result in such semiconduc-
tors a transition across forbidden energy gap requires just the absorption or emission
of energy. Those semiconductors for which the maximum of valence band energy
and the minimum of conduction band energy lie in the same crystal direction or have
same value of pc are called direct semiconductors. The time response of direct semi-
conductors to photon absorption/emission (Si and GaAs) is fast, and therefore they
are frequently used in optoelectronic devices. On the other hand, those semiconduc-
tors in which the maximum of valence band energy and the minimum of conduction
band energy have different values of crystal momentum pc are called indirect semi-
conductors. Transition across forbidden energy gap in indirect semiconductors is
slow because of the different values crystal momentum.
SAQ: What may happen when a semiconductor absorbs a photon?
SAQ: The photon absorption and emission processes in direct semiconductors are
fast; why?
Both intrinsic and semiconductors doped with shallow impurities have free electrons
and holes at room temperature. Paul Drude, a German physicist, argued that free
electrons in semiconductors and also in conductors (metals), under some assumptions
may be treated as molecules of an ideal gas. He argued that kinetic theory of gases
which in principle is applicable to an ideal gas may also be applied to free electrons.
The law of equipartition of energy, which follows from kinetic theory, says that
1/2 kT of energy is associated with each degree of freedom, a molecule of an ideal
gas that has three degrees of freedom, at temperature T > 0 K possesses 3/2 kT
of kinetic energy. As such a free electron in semiconductor at room temperature
≈ 300 K will have roughly 6.21 × 10–21 J as kinetic energy. If Vthe is the speed of
electron at room temperature and 9.1 × 10–31 kg the mass of electron, then
1
× 9.1 × 10−31 × Vthe
2
= 6.21 × 10−21 (2.12)
2
Solution of the above equation gives V the , the thermal velocity of a free electron
at room temperature, to be of the order of 1.17 × 105 m/s. This means that at
room temperature free electrons in a semiconductor are moving with high speed of
the order of 105 m/s. If no external potential is applied to a semiconductor and if
there is no charge gradient within the semiconductor, then free electrons in it will
be moving in random directions with velocities of 105 m/s. Fast moving electrons
frequently collide with vibrating crystal lattice (crystal lattice also vibrates because
of temperature), and at each collision its direction of motion gets changed. Hence
in absence of any external voltage and any static charge gradient, the net effect of
2.4 Semiconductors 93
e.τ e.τ
v nD = a.τ = − ∗
ε = −μe ε, where μe = ∗ (2.13)
mn mn
Drift velocity v nD gets superimposed on each electron along with the thermal
velocity Vthe under the influence of the electric field. It is easy to show that
j D = j De + j Dh = eε[n e μe + n h μh ] (2.16)
dn e dn h
jeDiff = e.Vthe .λ. and jhDiff = −e.Vthe .λ. (2.18)
dx dx
Putting Vthe .λ ≡ Dn for electron and Vthe .λ ≡ D p for holes, Eq. (2.18) reduces
to
2.4 Semiconductors 95
dn e
jeDiff = e.Dn . (2.19)
dx
And
dn h
jhDiff = −e.D p . (2.20a)
dx
kT kT
Dn = μe and D p = μ p (2.20b)
e e
There are two types of charge carriers, such as free electrons in conduction band
and holes in valence band in a semiconductor. The concentration or number of
charge carriers per unit volume of the semiconductor which is also called the carrier
number density depends on the concentration of dopant impurity and temperature.
Dopant concentration essentially decides the concentration of majority carriers while
temperature, that determines the rate of covalent bond breaking, controls the minority
carrier concentration.
Both conduction band and the valence band have large number of discrete energy
states where carriers may reside. These energy states, though discrete, yet they are
so closely packed in energy that one cannot talk of any individual state, instead
one talks of the state or level density, i.e. the number of states per unit volume
within energy E and (E + dE). The level density of allowed energy states for free
electrons in conduction band may be denoted by N(E) and for holes in valence band
by P(E). Carrier concentration for electrons and holes in a semiconductor may be
theoretically calculated using the tools of quantum statistics. Let us first calculate the
number density (concentration) of electrons in conduction band of a semiconductor.
Let us denote by n e (E) the density of electrons (number density or number of
electron per unit volume) in conduction band at energy E and (E + dE). This number
density of electrons may be written as the product of the density of energy state at
energy E and (E + dE) in conduction band, and the probability F(E) that the energy
range E and (E + dE) is occupied by electrons. Therefore,
∫Etop ∫Etop
ne = n e (E)dE = N (E)F(E)dE (2.21b)
EC EC
Here m ∗n and h are, respectively, the effective mass of the electron and Planck’s
constant. The probability F(E) that electron occupies the state of energy E is given
by the Fermi–Dirac distribution function of quantum statistics as
1
F(E) = (2.21d)
( E−E f )
1+e
kT
In Eq. (2.21d) E f stands for Fermi energy and k for Boltzmann constant while T
is temperature in Kelvin. It is easy to verify that;
⎧
⎪ E(F) = 1 for E < E f All states with energy less than
⎪
⎨
Fermi energy are filled with electrons
At T = 0 K
⎪
⎪ E(F) = 0 for E < E All states with energy larger than
⎩ f
Fermi energy are empty
means the probability is 0.5 that the state with energy E f is filled with electron.
Figure 2.26i shows the variation of the function N(E) (density of states with energy
E and E + dE) with energy E. This graph basically represents Eq. (2.21c). It may
be observed in this figure that N(E) increases as the square root of energy E.
Variation of occupation probability function F(E) with energy E is shown in
Fig. 2.26ii. It may be observed that function F(E) behaves differently, for two situ-
ations T = 0 K and T > 0 K. As shown in the figure, when temperature T = 0 K,
F(E) = 1 for E < E f and F(E) = 0 for E > E f , (red curve); there is a sharp cut
in the probability at Fermi energy. For T > 0 K; there is not a sharp cut in F(E), it
has a value 1 up to some energy, then starts decreasing, becomes 0.5 at E = E f and
decreases almost exponentially after that.
98 2 Electrical Behaviour of Condensed Matter
Fig. 2.26 Graphical representation of N(E), F(E) and electron concentration ne at T = 0 K and T
>0K
Figure 2.26iii, iv show the number per unit volume of electrons of all energies ne
at T = 0 K and at T > 0 K, respectively. As shown in these figures, the value of ne is
given by the shaded area enclosed between the energy axis (X-axis) and curves for
functions N(E) and F(E).
E−E f
∫Etop ∫∞ ∗ 3/2
2m n −
E−E f
ne = n e (E)dE = 4π E 1/2 e kT
dE (2.21f)
h2
EC EC
The upper limit of integration in above expression is changed from E top , (energy
at the top of conduction band) to ∞ as F(E) approaches to zero exponentially for
large energies. Equation (2.21f) on integration gives
2.4 Semiconductors 99
3/2
2m ∗n kT −
E C −E f
−
E C −E f
ne = 2 e kT
= NC e kT
(2.21g)
h2
where
3/2
2m ∗n kT
NC = 2 (2.21h)
h2
np = 2 e kT
= Nv e kT
(2.21i)
h2
Here,
3/2
2m ∗p kT
NV = 2 (2.21j)
h2
Or
2/3
3 h 2 2/3 0.121h 2 2/3
Ef = √ ∗
ne = ne (2.21k)
16 2π m m∗
American physicist Edwin Herbert Hall in 1879 observed that when an electric
current is passed through a conductor that is placed in a magnetic field, a potential
proportional to the current and the magnetic field develops across the conductor in
a direction perpendicular to both the current and the magnetic field. This effect is
called the Hall Effect and the developed potential difference as Hall voltage. From
the measurements he made, Hall for the first time was able to determine the charge
of the current carriers. Even today Hall Effect is used to steady the charge transport
characteristics of metals and semiconductors.
100 2 Electrical Behaviour of Condensed Matter
Layout of experiment for the study of Hall Effect is shown in Fig. 2.27. A slab of
the conductor/semiconductor of length L in X direction, width w in Y-direction and
thickness t in Z-direction is taken, and a voltage source V is connected to the two
opposite faces so that an electric field ε (= V /L) in direction X is produced within
the slab. Electric field ε establishes a current I x in positive X-direction through the
slab. Current I x may be constituted by the flow of charges of only one polarity (in
case of metallic block by electrons) or it may be produced by charges of opposite
polarities (in case of semiconductor both electron and holes). However for simplicity
we assume that the current I X is due to charge carriers of only one polarity. Let q be
the charge of the carrier. The electric field ε exerts a force qε on each charge carrier
in positive X-direction and imparts an additional average velocity vX , called the drift
velocity, to each charge carrier. If n represents the concentration of charge carriers,
then the current density jx may be written as
jx = nqvx and
If a magnetic field Bz is now applied in Z-direction, the charge carriers that consti-
tute current I x will experience a force in Y-direction. The direction of force is given by
Fleming’s left hand rule (see inset in the figure) and the magnitude by the expression
FY = q B Z vx
Force F Y will deflect charge carriers towards the top of the slab resulting in accumu-
lation of charge carriers on the inside of the top surface of the slab. Accumulation of
charges on the inner top face generates an electric field E Y in negative Y-direction.
Electric field E Y will repel charge carriers and will oppose further accumulation of
charges. Thus charge carriers will experience two opposite forces, one in positive
Y-direction F Y due to magnetic field and the other in negative Y-direction due to
electric field E Y generated by the accumulation of charges. Ultimately a state of
equilibrium will reach when two opposite forces will become equal, and no further
deflection of charge carriers will take place. In the state of equilibrium,
nq Bz vx = nq E Y or B Z vx = E y (2.22b)
∫0 ∫w
VH = E Y dy = − E Y dy = −E Y w
w 0
Substituting the value of E Y from Eq. (2.22b) and of vx from Eq. (2.22a) in the
above expression one gets
1 B Z .Ix B Z .Ix
VH = − = −R H (2.22c)
nq t t
When two ends of the same intrinsic wafer or intrinsic semiconductor monocrystal
are doped, one with n-type impurity of pentavalent atoms and the other by p-type
trivalent impurity, a p–n junction is formed at the boundary of the two sides. It may
102 2 Electrical Behaviour of Condensed Matter
be emphasised that if a p-doped crystal and another n-doped crystal are put together
touching each other, p–n junction will not be formed. For p–n junction to form it is
essential that same crystal or wafer be doped on one side by p-type impurity and on
the other side by n-type impurity then only a p–n junction is formed at the boundary
of the p- and n-type materials within the given wafer or crystal.
As is shown in Fig. 2.28, the Fermi level for isolated n-type material is shifted
upwards from the middle of the forbidden gap and it is shifted downwards for the
isolated p-type material. However, when p- and n-type materials are developed on
the same crystal, the Fermi level cannot be different on two sides because of the
continuity of crystal structure. As a result the band structure of n-side is pulled down
with respect to the band structure of the p-side to equalise the Fermi levels of the
two sides, as shown in Fig. 2.29.
The band structures of the p- and n-sides in a single crystal are shown separated
from each other in Fig. 2.29 just to indicate how the band structure of n-side is pulled
down with respect to the p-side to equalise the Fermi level on two sides. However in
reality the two band structures touch each other at the physical boundary of the p-
and n-sides.
Since the energy of free electrons in conduction band is measured up wards
from the Fermi level, electrons in conduction band on p-side are more energetic as
compared to the electrons in the conduction band on n-side. Similarly, holes on n-side
have more energy than holes on p-side.
Consider the instant when p–n junction got established on doping the two sides.
Initially both the n-side and the p-side were electrically neutral, however, at the
establishment of junction, concentration of electrons on n-side is larger than the
concentration of electrons on p-side and similarly, the concentration of holes on p-
side is larger than of holes on the n-side. Because of the concentration difference,
some electrons diffuse from n-side to p-side, and some holes diffuse from p-side to
n-side. As a result of diffusion of electrons from n-side, the n-type semiconductor
develops a positive charge; the amount of positive charge developed on n-side is
proportional to the number of electrons lost by it due to diffusion. The positive
Fig. 2.28 Band structures of isolated intrinsic, n-type and p-type semiconductors
2.4 Semiconductors 103
charge acquired by n-side try to pull back negatively charged electrons and tries to
stop further diffusion of electrons. Thus two opposite forces; force of diffusion that
tries to transfer electrons from n-side to p-side and the force of attraction between
electrons and positively charged p-side got balanced after the diffusion of some
electrons from n-side to the p-side. This is called the state of equilibrium, in state of
equilibrium that occurs after the diffusion of some electrons from n-side to p-side,
there is no further diffusion of electrons from n-side to p-side.
As already mentioned, initially some holes, which are majority carrier on p-side,
diffuse to n-side, making p-side negatively charged. The amount of negative charge
developed on p-side is proportional to the number of holes that have diffused to
n-side. Again, at the state of equilibrium, that occurs after some holes have already
diffused to n-side, there is no further diffusion of holes.
The state of equilibrium is reached within a fraction of a second as soon as the
p–n junction is formed. After the system attains equilibrium, there is no further
diffusion of electrons from n-side and of holes from the p-side. Further, after the
establishment of equilibrium, the p-side develops a negative potential and n-side a
positive potential. The potential difference between the n-side and the p-side is called
internal potential barrier and is denoted by V B (see Fig. 2.30).
(i) Depletion layer Diffusion of electrons from the n-side leaves a sheath of uncov-
ered positive immobile donor ions on the n-side of the junction and diffusion
of holes a layer of immobile uncovered negative acceptor impurity ions on the
p-side of the junction. Thus around the junction there is a layer of positive
uncovered ion on the n-side and a layer of uncovered negative ions on the p-
side, this region which contains uncovered ions is called depletion layer. As is
104 2 Electrical Behaviour of Condensed Matter
Fig. 2.30 p–n junction diode with bulk p- and bulk n-sides along with depletion layer. Internal
potential barrier V B is also shown in the figure
obvious, no mobile charge carrier, electron or hole, may stay in this depletion
region, as it will be swept by the positive or negative uncovered ions. Since no
mobile charge can stay in depletion region, i.e. it is depleted of mobile charges,
hence the name depletion layer.
Depletion layer has some special properties: (i) no free mobile charge may
stay in this region, (ii) since it has no charge carriers it is like an insulator or
has very high resistance, (iii) there are equal amounts of positive and negative
charges at the two ends of the depletion layer which in itself behaves like an
insulator; therefore, depletion layer works like a parallel plate capacitor. The
capacitance of depletion layer may be changed by applying potential drop across
p–n junction, and thus it provides a capacitor whose capacitance may be varied
by varying voltage across the junction. p–n junction is also called junction diode.
It is because of the fact that p–n junction behaves like an electron tube diode.
Since the total uncovered negative charge on p-side of the depletion layer must
be equal to the total uncovered positive charge on the n-side;
n p .x p = n e X e
SAQ: Which part of a p–n junction has maximum resistance and why?
SAQ: There are positive ions of donor impurity atoms on the bulk n-side and
negative ions of acceptor impurity on the bulk p-side but these ions are
covered with respective charge carriers. Why do ions become uncovered
in the depletion layer of a p–n junction?
SAQ: Can you estimate the thickness of depletion layer for normal doping.
(ii) Biasing of p–n junction diode
Biasing of a device means providing required voltages to different terminals
of the device. Figure 2.32a shows the symbol used for a p–n junction diode in
electronic circuits. A junction diode has two terminals; a terminal connected to
the p-side and the other terminal connected to the n-side. A source of voltage, a
battery, may be connected between these two terminals in two different ways.
When diode terminal attached to the p-side is connected to the positive terminal
of the battery and the n-side to the negative terminal, the arrangement is called
forward bias. However, if the p-side is connected to the negative terminal of
the battery and the n-side to the positive, the arrangement is reverse bias.
(a) Forward bias
Figure 1.32b shows the forward bias arrangement. It may be recalled that
in an unbiased p–n junction at equilibrium diffusion of charge carriers
does not take place because of the internal potential barrier V B , which
restricts any transfer of charges from one side to the other. In forward
bias arrangement the battery potential V opposes or reduces the internal
potential barrier V B . Reduction of internal potential barrier results in two
events: (i) reduction in the width of the depletion layer because on p-
side holes get repelled by the external battery potential + V and covers
some of the uncovered acceptor ions in depletion layer and similarly, on
n-side electrons get pushed into depletion region by the negative external
battery potential and cover some uncovered positive donor ions. (ii) As
a result of the reduction of internal potential barrier, some of majority
106 2 Electrical Behaviour of Condensed Matter
Fig. 2.32 a Symbol for p–n junction diode used in electronic circuit b forward bias junction
c reverse biased p–n junction
carrier holes from p-side and some majority carrier electrons from the
n-side start moving to the other side. This movement of majority charge
carriers constitutes a forward current I f through the circuit as shown in
Fig. 2.33.
Initially, the forward current I f increases slowly till the depletion layer
vanishes completely at battery potential V d when forward current suddenly
rises almost exponentially. Potential V d is called knee potential (or on
potential) and in a way equal to the internal potential barrier V B . For Silicon
p–n junction diode the knee potential has a value of 0.7 V and for Ge based
p-n diode it is 0.3 V. On further increasing the forward bias voltage beyond
V d , the voltage across the junction does not increase but forward current of
larger value flows through the forward biased circuit. A p–n junction diode
in forward bias above on-voltage V d behaves as a battery of 0.7 V in case
of Silicon-based diode and a battery of 0.3 V in case of Germanium-based
diode. When forward bias voltage is increased beyond V d , the depletion
layer disappears and large number of majority carriers diffuse from both
sides to the opposite side. Therefore, forward current if is essentially due
to the diffusion of majority carriers and since the concentration of majority
carriers is quite high (≈ 107 charge carrier cm−3 ) forward current I f of few
milli amperes flows through the circuit.
(b) Reverse bias
Circuit diagram for reverse bias arrangement is shown in Fig. 2.32c. In
reverse bias arrangement the external battery potential add up with the
internal potential barrier V B . This results in the increase of the width of
depletion layer. With enhanced barrier at junction (V + V B ) the majority
carrier on the two sides do not cross the depletion layer of enhanced width.
However, minority carriers, electron on the p-side and holes on n-side are
pushed by the total barrier potential (V B + V ) across the depletion layer
constituting reverse current I r . This flow of minority carriers from one side
to the other is not due to diffusion instead it is due to the large potential
difference across the depletion layer. Minority carrier current in reverse bias
is drift current and is only of the order of few microamps. When reverse
bias voltage is increased beyond the breakdown voltage V b (see Fig. 2.33),
suddenly a large reverse current starts flowing in the circuit. This large
current flows because of the breakdown of the crystal structure. Because
of the large electric field inside the semiconductor crystal (established by
large reverse bias voltage V b ), the atoms in crystal structure break down
releasing large number of electrons.
Graphs showing the variation of forward and reverse currents as a function
of applied voltage are called p–n junction diode characteristics and are
shown in Fig. 2.33 both for the forward and the reverse bias arrangements.
SAQ: Forward current across a p–n junction is generally in mA while the
reverse current is in μA. What is the reason for this difference in
the magnitudes of the two currents?
SAQ: It is known that reverse saturation current I r is very sensitive to
the ambient temperature; for every 100 C rise of temperature it
gets doubled. However the forward current I f is not so sensitive to
temperature. Can you assign a reason for this difference?
Semiconductor materials are the backbone of electronic industry. These mate-
rials are used in fabricating solid state electronic devices that are extensively
used in modern analogue and digital electronics. p–n junctions developed in
108 2 Electrical Behaviour of Condensed Matter
special conditions of doping at more than one place in a monocrystal give rise
to bipolar junction transistors and field effect transistors.
Some important formulae that are applicable in case of semiconductors are given
here without their derivations, which are beyond the scope of the present text. These
formulae may be used to solve numerical problems.
(1) If ne and np, respectively, denote the concentrations of free electrons and holes in
doped semiconductor at temperature T and ni the concentration of free electrons
or holes in the intrinsic semiconductor at same temperature T, then,
n e .n p = n i2 (2.23)
ne ∼
= N D and n p ∼
= NA (2.24)
And
23
−
E F −E V 2π m ∗p kT
n p = NV e kT
; NV ≡ 2 (2.26)
h2
Here, E C is the energy at the bottom of the conduction band, E V the energy at the
top of valence band, k Boltzmann constant, m ∗n , m ∗p , respectively, the effective
masses of electron and hole and T temperature in Kelvin.
(4) Positioning of Fermi level
Fermi energy at T ≈ 0 K is given by Eq. (2.21k) as
2/3
3 h 2 2/3 0.121h 2 2/3
Ef = √ ∗
ne = ne
16 2π m m∗
2.4 Semiconductors 109
Here, E C and E V are, respectively, the energies at the bottom of conduction band
and the top of valence band. m ∗p and m ∗n are, respectively, the effective mass of hole
and electron. Second term on right that depends on temperature is negligible at room
temperature and varies slowly with temperature, therefore, it is often neglected.
Hence Fermi level for intrinsic semiconductor is taken at the middle of the forbidden
energy gap at all temperatures.
n-type
However, for n-type semiconductor, it may be shown that Fermi level E F is
given by
n-type NC
EF = E C − kT ln (2.28)
ND
n-type NV
EF = E V + kT ln (2.29)
NA
Solution: It is given that after doping the Fermi level shifts towards the valence band.
It means that the doping is done with acceptor impurity and that the material has
become p-type after doping.
The band structures of the semiconductor before doping (intrinsic material) and
after doping (p-type material) are shown in Fig. 2.34. As indicated in the figure after
doping (E F − E V ) = 0.5 eV and (E C − E F ) = 0.9 eV.
To calculate concentration of majority carrier holes we use Eq. (2.26) given below;
23
−
E F −E V 2π m ∗p kT
n p = NV e kT
; NV ≡ 2
h2
23 23
2π m ∗p kT 2π × 0.5 × 9.1 × 10−31 × 1.38 × 10−23 × 300
NV ≡ 2 =2 2
h2 6.63 × 10−34
= 8.836 × 1024 .
Next we calculate
E F −E V 0.5×1.6×10−19
− −
n p = NV e kT
= 8.836 × 1024 e 1.38×10−23 ×300 .
Or
n p = 8.836 × 1024 × e−19.32 = 8.836 × 1024 × 4.068 × 10−9 = 3.59 × 1016 m−3 .
Next we calculate the concentration of minority carrier electrons using Eq. (2.25)
given below,
23
−
E C −E F
2π m ∗n kT
n e = NC e kT
; NC ≡ 2 .
h2
Now,
23 3/2
2π m ∗n kT 2π × 0.06 × 9.1 × 10−31 × 1.38 × 10−23 × 300
NC ≡ 2 =2 2
h2 6.63 × 10−34
= 2.70 × 1021 .
Or
E C −E F
−19
− − 1.38×10
0.9×1.6×10
n e = NC e kT
= 2.70 × 1021 × e −23 ×300
= 2.70 × 1021 × e−34.78
Or
Therefore,
(i) majority carrier concentration n p = 3.59 × 1016 m−3
(ii) Minority carrier concentration n e = 2.12 × 106 m−3 and
(iii) Charge carrier concentration n i = n ip = n ie = 2.76 × 1011 m−3 .
Solved Example SE2.5 Given that number density of free electrons in gold at very
low temperature ≈ 0 K is 6.0 × 1022 cm−3 , calculate the Fermi energy for gold. Take
the effective mass of electron to be equal to its mass 9.1 × 10–31 kg.
Solution: In the given problem number density of electrons is given in CGS units
while the electron mass is in MKS units. Let us convert electron number density also
in MKS units; given quantities are;
n e = 6.0 × 1022 cm−3 = 6.0 × 1028 m−3 , the effective electron mass m ∗e =
9.1 × 10−31 kg.
We use the formula given by Eq. (2.21k) for Fermi energy E F at T = 0 K.
2 2/3
E F = 0.121×h
m∗
n e which on substituting the values gives,
e
2
0.121 × 6.60 × 10−34 2/3
EF = −31
6.0 × 1028 J
9.1 × 10
2.5 Conductors 113
8.87 × 10−19
= 8.87 × 10−19 J = eV = 5.548 eV.
1.6 × 10−19
2.5 Conductors
Conductors are solids that are characterised by metallic bonding, having either over-
lapping conduction and valence bands or with negligible forbidden energy gap.
Metals and their alloys are mostly conductors. Their specific resistivity lies in the
range of (1–100) × 10–8 for metals and (1–100) × 10–6 Ω m for most of the alloys.
Overlapping of conduction and valence bands is the outcome of the high degree of
overlap in outer electron orbital’s of individual atoms in some crystals. As a result the
conduction and valence bands become so broad that they overlap. In such materials
the valence electrons are far away from the corresponding nucleus of the atom and
are very loosely bound with its parent nucleus. Also in their crystalline structure the
relative separation of atoms is large so that the forbidden energy gap is either zero
or very small.
Figure 2.35 shows the band structure of a conductor (a) at 0 K and (b) at T > 0 K.
At absolute zero all valence electrons are bound and are not available for conduction
of current. However, with the rise of ambient temperature more valence electrons
become delocalised and at room temperature in most of conductors all valence elec-
trons become delocalised or free and are available for conduction of current. It is
reasonable to assume that at room temperature all valence electrons of all atoms in the
given specimen of conductor are delocalised and are available as free charge carriers.
Obviously, current may flow only when some voltage is applied to the conductor that
establishes an electric field. In absence of any electric field a piece of conductor at
room temperature has large number of free electrons that move in random directions
with thermal velocity which is of the order of 105 m/s. The number density of free
electrons in Silver at temperature 300 K is of the order of 5.8 × 1028 m−3 . These
randomly moving electrons undergo frequent collisions with crystal lattice and are
also trapped at sites of unionised impurity atoms. At each collision the velocity and
direction of motion of electron get changed. When randomly moving free electrons
are subjected to an electric field by applying an external voltage, a drift velocity gets
superimposed on each electron in a direction opposite to the direction of the elec-
tric field. This results in the flow of current through the conductor. Opposition to the
smooth flow of current is generated by frequent lattice-electron collisions. Larger the
rate of collision more will be the opposition to the flow of current. Therefore, resis-
tance or resistivity of conductors is essentially the result of electron–lattice collisions.
With the rise of temperature, the electron density in a conductor does not increase
because all valence electrons are already delocalised at room temperature, however,
the electron–lattice collision rate increases with temperature and hence the resistivity
of conductors increases with temperature. That is why the temperature coefficient of
conductors has a positive value. On the other hand, in a semiconductor, all valence
114 2 Electrical Behaviour of Condensed Matter
Fig. 2.35 Band structure of a conductor a at absolute zero temperature and b at a temperature
higher than absolute zero
electrons are not free at room temperature, and the free electron density rapidly rises
with temperature due to covalent bond breaking. Though electron–lattice collisions
in semiconductors also rise with temperature, but the rate of increase of free charge
carriers (electrons and holes) with temperature is much larger than the increase of
collisions; therefore, the resistivity of semiconductors decreases with temperature.
That is why semiconductors have negative value of temperature coefficient.
electron. As such one can build two separate state energy diagrams one for spin
up electrons and the other for spin down electrons with their own valence and
conduction bands.
In some crystals where atoms are bound by metallic bonding, it so happen that
valence band for electrons of one specific spin orientation is partially filled
while there is a forbidden energy gap for electrons of other spin orientation.
As a result when external voltage is applied, electrons with that spin orienta-
tion for which there are vacant states in valence band contribute to the flow of
current. Electrons with opposite spin orientation do not contribute to current
flow because of forbidden energy gap. Since only about half of the total elec-
trons contribute to current flow, the material is termed as half metal. Examples
are chromium oxide and lanthanum-strontium-magnetite that are half metals
and are also ferromagnetic. Though all half metals are ferromagnetic but all
ferromagnetic materials are not half metals. Energy band structure of a typical
half metal is shown in Fig. 2.37.
Solved Example SE2.6 Density of trivalent Aluminium metal is 2.7 g cm−3 , and its
molecular mass is 27 g/mol; assuming that at room temperature all valence electrons
are non-localised (or free), calculate the number density of free electrons in the metal.
Solution: It is known that a gramme mole of an element contains 6.022 × 1023 atoms
(Avogadro’s number) of the element. Therefore,
Also, the density D of Al is given as D = 2.78 g cm−3 . But density is equal to mass/
volume.
116 2 Electrical Behaviour of Condensed Matter
Fig. 2.37 Energy band structures for a half metal a for spin down electrons b for spin up electrons
M 27
V = = cm3 = 9.71 cm3 (2.31)
D 2.78
It follows from Eqs. (2.30) and (2.31) that 27 g of Aluminium has 6.022 × 1023 atoms
that occupy a volume of 9.71 cm3 .
6.022×1023
Therefore, the number of atoms in 1 cm3 = 9.71
= 0.62 × 1023 .
Since the valency of Aluminium is 3, and all valence electrons are free at room temper-
ature, therefore each atom will contribute three free electrons at room temperature.
Hence the number density of free electrons in Aluminium at room temperature n per
cc is
n = 3 × 0.62 × 1023 per cm3 = 1.86 × 1023 per cm3 = 1.86 × 1029 per m3 .
2.6 Superconductor
material in which the resistivity of the material becomes zero is called the supercon-
ducting state and the property as superconductivity. The characteristic temperature
below which resistivity becomes zero is called the critical or transition tempera-
ture and is denoted by TC . It is obvious that no energy loss occurs when current is
established through a superconductor, no matter for how long current flows through
it.
2.6.1 Background
Research group of Dutch physicist Heike Kamerlingh Onnes, in 1911 found that the
resistivity of a mercury column becomes zero when the temperature of the spec-
imen was reduced below 4.15 K (see Fig. 2.38). Complete disappearance of electric
resistance in some other metals and solids below a certain characteristic very low
temperature was observed in some other materials also.
Onnes and his students were studying the electrical behaviour of wires of different
materials and found that the resistance of a mercury wire took a precipitous drop
when temperature reached to about 4.15 K. The drop in resistance was enormous,
the resistance of the wire dropped at least by a factor of one thousand, so much
so that exact measurement of the resistance became impossible (see Fig. 2.38). In
order to further investigate the phenomenon, Onnes’ group setup a current through
the mercury wire in the form of a ring by connecting at two points of the wire a
voltage source for an instant and then removing the voltage source. To their surprise,
they observed that current kept flowing through the mercury wire ring without any
reduction in its magnitude, so long as the temperature of the wire was kept below
4.15 K. The observed perpetual flow of current was only possible if flow of current
does not encounter any opposition or resistance. As is known opposition to the flow
of current in normal situation arises essentially from electron–lattice collisions and
electron trapping at impurity sites. Disappearance of resistance in case of mercury
wire at temperature below 4.15 K means that electron–lattice collisions have either
suddenly cease to happen below the critical temperature, the temperature below
which mercury wire exhibits superconductivity or at least lattice vibrations are not
opposing the flow of current. Transition or critical temperature for superconductivity
transition for some metals is listed in Table 2.3.
Many well-known scientists including Nobel Lauriat John Bardeen tried to explain
and give a theoretical background for superconductivity but they did not succeed.
The reason why no theoretical explanation of the process could be given at that time
was that the process of superconductivity is a typical quantum phenomenon, and
quantum physics was not in place till 1920 or so.
(i) Meissner effect In the meantime experimental studies on superconductivity
continued and in 1933, two scientists Walter Meissner and Robert Ochsen-
feld discovered another very interesting property of superconductivity; they
found that any material in superconducting state repels the lines of external
magnetic field (Bex ) so long as the applied magnetic field is below the crit-
ical value denoted by BC ex . It means that for external magnetic fields Bex <
Bc ex a superconductor behaves as a perfect diamagnetic material. If a magnet
is brought near to a superconducting material, the superconductor does not
allow magnetic lines of force to penetrate through it, rather it repels them. The
effect is called Meissner effect.
Meissner effect is a typical example that also shows that superconductors are
not just perfect conductors. A perfect conductor may be defined as a conductor
which has a pure crystalline structure without any impurity or missing atom
sites and has small value of resistivity. However, a superconductor is different
from a perfect conductor as it behaves differently than a perfect conductor
when a magnetic field is first applied and then switched off.
Figure 2.39 shows a perfect or ideal conductor and a superconductor, initially
the temperature of both the specimen is above critical temperature T C and both
of them allows the passage of magnetic lines through them. With magnetic
field (Bex < BC ex ) on, if the temperature of both specimens is reduced below
critical temperature T C , the ideal conductor will allow magnetic lines to pass
through it, as they were before the reduction of temperature. It is because
magnetic properties, like susceptibility, of a perfect conductor do not change
2.6 Superconductor 119
Fig. 2.39 Behaviour of perfect or ideal conductor and superconductor with respect to magnetization
and demagnetization
120 2 Electrical Behaviour of Condensed Matter
magnetic flux induces surface current at the outer skin of the ideal conductor,
which in turn establishes a magnetic field in the interior volume of the perfect
conductor specimen. It may, however, be mentioned that the induced surface
currents will be short lived as the resistivity of the conductor will dissipate
energy, and currents will die out.
In case of the superconducting specimen, no magnetic flux is lined with the
specimen volume (as there is no magnetic field inside superconducting volume)
hence at the instant when Bex is switched off no change in magnetic flux will
take place. As such no induced currents will be generated. The interior and
exterior of the superconducting volume will contain no magnetic fields after
the external magnetic field is switched off.
SAQ: How can one explain the total absence of magnetic field in the interior
of a superconducting volume when some external magnetic field is
applied to the superconductor?
SAQ: When an external magnetic field is switched off from a normal
conductor, the conductor retains magnetic field in its interior and
around. How one can explain this retention of magnetic field?
(ii) Magnetic field trapped in a superconducting ring
Figure 2.40i shows a ring shaped superconductor specimen placed in an
external magnetic field Bex at temperature T > T c . Science temperature is
above T C , the specimen ring behaves as a normal material and magnetic lines
penetrate through the opening of the ring. The magnetic flux ϕ linked with the
opening of the ring is given by
It is known that not only metals but some other materials below their transition
temperature Tc become superconductor. Further, if an external magnetic field
Bex is applied across a superconductor specimen, when it is below its transition
temperature, magnetic field Bin inside the superconductor stays zero. This is,
however, true only when the magnitude of the externally applied magnetic field
is below a certain value Bc ex . If the magnitude of externally applied magnetic
field Bex is increased beyond the critical value Bc ex then the superconductor
may respond in two different ways, depending on its type. In the case of
type-I superconductor on increasing the strength of external magnetic field
Bex beyond Bc ex , the superconductivity of the specimen just vanish, though its
temperature is still below Tc . It behaves as an ordinary conducting material
and magnetic field penetrates in the interior of the specimen. This is shown in
Fig. 2.41a where a dotted vertical line at Bc ex divides the figure in two parts;
where the specimen remains a superconductor and the part where superconduc-
tivity is totally lost in spite of its temperature being below critical temperature
T c . Figure 2.41a shows the typical behaviour of a type-I superconductor. Most
metals show type-I superconductivity.
In case of type-II superconductors, there are two values of external critical
magnetic fields Bc1 ex and Bc2 ex such that between these two magnetic field
values the specimen remains partially superconducting as shown in Fig. 2.41b.
For external magnetic fields greater than Bc2 ex , the type-II specimen also
becomes non-superconductor, though its temperature is still below its critical
temperature.
122 2 Electrical Behaviour of Condensed Matter
little jerk or push to an ion of the lattice may make the whole lattice to undergo
vibratory motion. This vibratory lattice motion is quantized, and the quanta of
vibratory motion of the lattice are called phonon.
Figure 2.44 shows two electrons numbered 1 and 2 moving in opposite direc-
tions. Negatively charged electrons attract positive ions of the lattice towards
them, distorting the crystal lattice and setting it in vibratory motion. The density
of positive charge in regions around the lattice distortion increases beyond its
normal value and becomes centres of attraction between electrons and the
distortion. Nearby electrons feel force of attraction by the region of increased
charge density but this force of attraction is over powered by the force of
mutual repulsion between nearby electrons. However, electrons far away get
bound with each other as the force of mutual repulsion between distant elec-
trons is very small and force of attraction due to charge distortion over rides the
repulsion. In this way Cooper electron pairs are formed. This is the reason
why the distance between the two electrons of Cooper pair may range from 50
to 100 nm or more. This is in comparison with the lattice separation; distance
between two neighbouring ions of the lattice, of 0.1–0.4 nm. It is important to
note that only those electrons that are far apart may form Cooper pairs.
Figure 2.44 shows two electrons numbered 1 and 2 moving in opposite direc-
tions. Negatively charged electrons attract positive ions of the lattice towards
them, distorting the crystal lattice and setting it in vibratory motion. The density
of positive charge in regions around the lattice distortion increases beyond its
normal value and becomes centres of attraction between electrons and the
Fig. 2.44 Moving electrons produce distortion in ion lattice and set it in vibratory motion. Distortion
in lattice increases density of positive charge in small regions which attract electrons that are far
away and create Cooper pairs
128 2 Electrical Behaviour of Condensed Matter
John Barden, Leon Cooper and J. Robert Schrieffer, in 1957 developed a quantum
mechanical microscopic theory for superconductivity, which in short is termed as
BCS theory. The theory explains the resistance less flow of current by paired electrons
in some materials below critical temperature T C . The theory is based on the concept
of Cooper pairs which are formed in superconducting specimen below the critical
temperature. The salient features of the theory may be summarised as follows:
• Phonon-(free) electron interactions in some materials, below critical temperature,
give rise to the formation of Cooper pairs.
• The binding energy of a cooper pair is very small; of the order of few milli-electron
volts (≈ 10−3 eV). Therefore, to keep Cooper pairs intact, the superconducting
specimen must be kept below the critical temperature T C .
• Electrons 100 nm or more far apart from each other join to form Cooper pair. It
is because, the force of repulsion between distant electrons is small and may be
overcome by the force of attraction between the phonon and electrons. In case of
2.6 Superconductor 129
nearby electrons the force of repulsion is high and dominates over attraction by
phonon.
• BCS theory requires that the linear momentum of the Cooper pair must be zero;
therefore, the two electrons forming a Cooper pair must be moving in opposite
direction.
• The bound state of two electrons as Cooper pair is lower in energy than the energy
state of unbound free electron, the state corresponding to Cooper pair lies below
the Fermi level.
• Formation of Cooper pair is a transient phenomenon. Suppose at a given instant
a Cooper pair has two electrons marked-1 and 2. It is possible that at the next
instant electron-1 of the Cooper pair may change its partner and another electron
out of the large number of free electron may join with electron-1 to form the
Cooper pair, at the next instant electron marked-1 may be replaced by some other
electron in the Cooper pair and so on. In this way, almost all free electrons get
coupled with each other in forming Cooper pairs. This results in inter-linking or
coupling of all free electrons in the volume of the superconducting specimen and
they move coherently.
• Superconductivity is a quantum mechanical phenomenon; each free electron in
quantum mechanics is represented by a wavefunction that extends over the volume
of the specimen. Wavefunctions of all free electrons therefore overlap generating
a resultant wavefunction representing all free electrons together. The resultant
wavefunction gives rise to the coherent behaviour of all free electrons.
• A Cooper pair has two electrons each with spin 1/2 ℏ and, therefore the spin of
a Cooper pair may either be 0 or 1 ℏ. Particles with integer spin (0, 1 ℏ, 2ℏ…)
are called Bosons and obey Bose–Einstein quantum statistics. Unlike electrons
which follow Fermi- Dirac statistics and not more than two electrons with their
spins in opposite directions can stay in an energy state, large number of Bosons
may stay in a given energy state. As such all Cooper pairs in the superconducting
specimen stay in the lowest energy state, the ground state. Simultaneous stay of
all Cooper pairs in the ground state is referred as Condensation.
• In energy band diagram of a superconductor, the ground band of Cooper pairs is
separated by a forbidden energy gap Eg from the energy band for free electrons, as
shown in Fig. 2.46. The forbidden energy E g is related to the transition or critical
temperature T c by the relation.
E g = 3.53kTC , here k stands for Boltzmann constant.
Energy band diagram for a metal is also shown in Fig. 2.46 for comparison.
• In the superconductive state the current flow is constituted by the motion of
coherent Cooper pairs. In ordinary metal resistance essentially develops out of
the inelastic collisions between the lattice and free electrons, however, in case of
superconductors, where current is constituted by the coherent motion of Cooper
pairs, inelastic scattering between lattice and coherent Cooper pairs is not possible.
It is because for inelastic collisions the Cooper pairs must change into free elec-
trons for which energy equivalent to forbidden energy is required. Below critical
130 2 Electrical Behaviour of Condensed Matter
SAQ: Stable levitation is more easily achieved with type-II superconductors. Why?
SAQ: In a bench-top levitation demonstration the permanent magnet is first pushed
towards the superconductor, why?
SAQ: What is meant by the transient nature of Cooper pair formation?
SAQ: Why Cooper pair-lattice collisions do not take place in the current flow
through a superconductor?
Problems
(c) Maximum current that may pass through a conductor before electric
breakdown
(d) Maximum voltage that may be applied to a conductor before electric
break down
ANS: (a)
MC2.3 If ni , np and ne , respectively, denote the charge carrier concentration
in intrinsic material, concentrations of holes in p-type material and the
concentration of electrons in n-type material, then
(a) n i > n p (b) n i > n e (c) n p > n i (d) n p n e = n i2
ANS: (c), (d)
MC2.4 Resistance of a piece of a semiconductor decreases with the increase of
temperature because;
(a) The mean free path of electron–lattice collisions increases with the rise
in temperature
(b) Rate of lattice-electron collisions decreases with the rise of temperature
(c) With the increase of temperature, the rate of increase of carrier
concentration overtakes the rate of rise of lattice–electron collisions
(d) With the increase of temperature both the rate of increase in carrier
concentration and rate of increase of lattice-electron collisions are nearly
equal
ANS: (c)
MC2.5 When dopant concentration in a given piece of a semiconductor is
increased, the resistance of the piece.
(a) Increases
(b) Decreases
(c) Remains unaltered
(d) May increase or decrease depending the material
ANS: (b)
MC2.6 At room temperature, the thermal velocity of free electrons in a semicon-
ductor is of the order of
(a) 10−5 m
s
(b) 100 m
s
(c) 105 m
s
(d) 1015 m
s
Ans: (c)
MC2.7 Zone refining technique is based on the principle that;
(a) Mobility of impurity atoms is less in molten state, and the melting
point of pure Silicon is lower than the impure Silicon
2.6 Superconductor 133
(b) Mobility of impurity atoms is more in molten state, and the melting
point of pure Silicon is lower than the impure Silicon
(c) Mobility of impurity atoms is less in molten state, and the melting
point of pure Silicon is higher than the impure Silicon
(d) Mobility of impurity atoms is more in molten state, and the melting
point of pure Silicon is higher than the impure Silicon
ANS: (d)
MC2.8 Depth profile of implanted impurity diopant ions is
(a) Uniform (b) Shows a single peak at the range of implanted ion (c)
Uniformly decreases (d) Uniformly increases
ANS: (b)
MC2.9 Given that d p and d e , respectively, represent the width of depletion layer
on the p-side, and on the n-side while N p and N e, respectively, the
concentration of dopants on the n- and p-sides and if d p = 1.5 d n , then;
(a) N p = N e (b) N p = 3 N e (c) N p = 1.5 N e (d) N e = 1.5 Np
ANS: (d)
MC2.10 Reverse current in a pn junction gets doubled for every 100 C rise of
temperature because;
(a) Reverse current is constituted by minority carriers that are produced
by bond breaking which increases with temperature
(b) Reverse current is constituted by majority carriers that are produced
by bond breaking which increases with temperature
(c) Reverse current is constituted by minority carriers that are produced
by bond breaking which decreases with temperature
(d) Reverse current is constituted by majority carriers that are produced
by bond breaking which decreases with temperature
ANS: (a)
MC2.11 Which of the following always has zero magnitude in a superconducting
material?
(a) Electric field E (b) Potential difference V (c) Current I (d) Magnetic
susceptibility χ
ANS: (a), (b)
MC2.12 In an experiment a ring shaped specimen of superconducting material
is placed in a magnetic field of strength B at temperature T > T C . The
temperature T is then reduced and maintained below T C . The magnetic
field is switched off. Which of the following statement(s) will correctly
describe the result of the experiment?
134 2 Electrical Behaviour of Condensed Matter
(a) Magnetic field will remain trapped outside the ring opening but it will
be zero inside the ring opening
(b) Magnetic field will remain trapped inside the ring opening and will be
zero outside the ring opening
(c) Magnetic field will be zero both inside and the outside the ring opening
(d) Magnetic field will remain trapped both inside and outside the ring
opening
ANS: (b)
MC2.13 A current I x in positive X-direction passes through a slab of n-type semi-
conductor. If a magnetic field BZ in positive Z-direction is applied across
the semiconductor slab, the deflection force due to magnetic field on free
electrons constituting the current will be in the direction of
(a) X-axis (b) Y-axis (c) Z-axis (d) 45° between X- and Z-axis
ANS: (b)
MC2.14 Units for Hall coefficient are
m3 C m3 A.s
(a) C
(b) m3
(c) A.s
(d) m3
LA2.1 Discuss the electron energy band theory of crystals and hence explain the
classification of solids according to their electrical properties.
LA2.2 What are the characteristics of Insulators? Why do they have very large
value for resistivity? Draw sketches for the energy band pictures of an insu-
lator and a conductor. Explain the phenomena of breakdown in insulators
and define the dielectric strength.
LA2.3 Draw labelled diagrams for the band structures of a p-type and an n-type
semiconductor. What are acceptor and donor levels and how do these levels
affect the Fermi level?
LA2.4 With the help of a labelled energy band diagram discuss the formation of a
pn junction. What is meant by thermal equilibrium and how it is achieved in
case of a pn junction? Discuss the formation of depletion layer at junction
boundary and list some of its properties.
LA2.5 What is a pn junction diode? Discuss the current flow through a junc-
tion diode under forward and reverse bias and draw its current voltage
characteristics.
LA2.6 With necessary details describe ion implantation method of doping of
semiconductor wafer.
LA2.7 What is Meissner effect? How can it be explained? Discuss stable levitation
and essential conditions to achieve it.
2.6 Superconductor 135
LA2.8 What are Cooper pairs and their condensation? Outline BCS theory of
superconductivity and explain why lattice-Cooper pair collisions do not
take place in superconductors.
LA2.9 Give distinguishing features of type-I and type-II superconductors. Wires
of type-II superconductors are often used for making strong electromag-
nets, explain.
LA2.10 What is isotope effect and why it was so significant in developing a
theory for superconductivity? Explain the process of Cooper pair formation
via electron–phonon interaction. With the help of energy band diagrams
explain the difference between a conductor and a superconductor.
LA2.11 What is Hall Effect? Describe with the help of a diagram the setup for
measuring Hall voltage and derive an expression for Hall voltage in terms
of Hall coefficient.
Chapter 3
Magnetic Materials
Objective
Origin of magnetism in matter, types of magnetisms, their properties, applications,
etc. will be discussed in this chapter. It is expected that after reading this chapter the
reader will be able to understand how magnetic properties develop in materials and
how materials may be classified in terms of their magnetic behaviour. He will also
learn how materials with desired magnetic properties may be developed.
3.1 Introduction
Magnetic materials include a variety of materials that are used in diverse applica-
tions. It is interesting to note that magnetic materials are utilised in generation and
distribution of electricity and in most cases, they are also used in the appliances that
use that electricity. Magnetic materials are used for the storage of data on audio and
video tapes and computer disks. Magnetic materials also have applications in the
field of medicine, they are used in body scanners as well as a range of applications
where they are attached to or implanted into the human body. Non-polluting electrical
vehicles have very efficient motors that utilise advanced magnetic materials.
The fact that Earth behaves like a bar magnet was known to Indian saints and seers
many centuries ago, and they developed and devised codes called ‘Vastushastra’ for
build temples and buildings based on Earth’s magnetic meridian.
In the modern era, however, in 1600 William Gilbert published the first systematic
experiments on magnetism in the pamphlet ‘De Magnet’. By the end of the eighteenth
century, scientists have noticed many electrical and magnetic phenomenon but they
all believed that these two branches of science, the electricity and the magnetism
are quite independent/separate from each other and that perhaps there is no direct
relationship between the two. Lightning, the electric phenomena, was well known to
ancient people but first magnetic material that showed the power of attracting small
iron pieces was the loadstone, mineral magnetite (Fe3 O4 ), found in form of small
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 137
R. Prasad, Physics and Technology for Engineers,
https://doi.org/10.1007/978-3-031-32084-2_3
138 3 Magnetic Materials
pieces of rock, perhaps in Greece. Then in July 1820, Danish natural philosopher
Hans Christian Orsted published a pamphlet that showed clearly that electric current
and magnetism are very closely related to each other. It is said that Orsted, born in
August 1777 after completing his Ph.D. in philosophy in 1801, travelled through
Europe, as was customary at that time, and met many scientists/philosophers of
Germany, France, etc. One person he met was Johann Ritter, a scientist who believed
that electricity and magnetism are related. Orsted might have been influenced by
him.
Orsted returned back to Copenhagen in 1803 and tried for a faculty position at
University there, but initially did not succeed. However, he started private lecture
courses, charging admission fee, which attracted many young people. Later he got
a position in Copenhagen University, perhaps on account of his popularity through
private lectures.
Though most of the scientists at that time thought electricity and magnetism to
be two totally independent attributes of matter, but there were some indications, for
example it was observed that a magnetic compass if struck by lightning reverses its
poles; that pointed to some connection between the two. From his writings/lectures
it appears that Orsted believed that electricity and magnetic behaviour are two inde-
pendent properties of all matter and that these properties might interfere with each
other.
During one of his lecture demonstration on 21 April 1820 while setting up his
apparatus, Orsted noticed that whenever he switched on current in his circuit, the
north pole of the magnetic needle placed nearby deflected a bit. He also noticed that
the direction of deflection of the North Pole of the compass needle changed when the
direction of current in his circuit was reversed and that the effect of current on the
movement of magnetic needle may be shielded by interposing an insulator/dielectric
material between the electric circuit and the magnetic needle. Orested published his
findings which were mainly qualitative, in a pamphlet, circulated to other scientists,
on 21 July 1820; but the effect was clear: An electric current generates a magnetic
field.
French scientist Andre-Marie Ampere opined that the production of magnetic
field by electric current is the fundamental feature of magnetism. In fact, he went to
the extent that all magnetism, permanent magnets including, is the result of currents
in them. Later, some 160 years ago, in 1864, James Clerk Maxwell carried out the
first profound unification of Nature’s two forces, the electric force and the magnetic
force in form of his famous Maxwell’s equations.
The present understanding is that stationary electric charges produce an elec-
tric field around them; charges moving with uniform speed (current elements) give
rise to both the electric and the magnetic fields while accelerated charges radiate
electromagnetic fields.
3.2 Electric Current and Magnetic Field 139
μ0 I
B= (3.1)
2πr
Here constant μ0 is the permeability of the free space, is a basic constant of nature
related to the velocity of light c, having the value μ0 = 4π × 10−7 T m/A. Since the
current carrying conductor is very long, the magnetic field strength B at the point
of observation O depends only on the perpendicular (shortest) distance r from the
conductor.
Biot–Savart argued that each little segment dl of the current produces a magnetic
field at the point of observation and that the total magnetic field due to the complete
current carrying conductor (of any shape) is given by,
∫ −
→ ∫ −
→
μ I dl X r̂ μ0 I dl × →r
B→ = 0 = (3.2)
4π r 2 4π |r|3
−
→
In Eq. (3.2) dl is the vector element of length in the direction of the current flow
and r̂ is the unit vector in the direction of vector distance r from dl to the point
of observation O. The line integration is to be carried on the length of the current
carrying conductor.
Figure 3.1a shows the direction of magnetic field lines due to an infinite straight
conductor carrying current I, Fig. 3.1b depicts the right-hand rule to specify the
direction of magnetic field lines, Fig. 3.1c gives the magnitude of magnetic field
strength at point O situated at a perpendicular distance r from the straight infinite
conductor carrying current I and Fig. 3.1d shows the strength of magnetic field due
to a current element dl at distance r from it as given by Biot–Savart law.
A bar magnet suspended in Earth’s magnetic field orients itself in North–South
direction. The North seeking end of the bar magnet is called the North end and the
geographic South seeking end as the South end. When lines of magnetic field of a bar
magnet are drawn using a compass needle, they appear to originate from a point on
the North end and appear to terminate at a point on the South end. Since all magnetic
140 3 Magnetic Materials
Fig. 3.1 a Lines of magnetic field of an infinite conductor carrying current. b Right-hand rule giving
the direction of magnetic field. c Magnitude of magnetic field strength at a point O at distance r from
an infinite conductor carrying current I. d strength of magnetic field due to current element dl at
point O
field lines appear to originate from one point on the North end, this particular point
is called the North Pole of the magnet, and similarly point on the south end where
all magnetic field lines appear to terminate, the South Pole. Number of magnetic
field lines per unit area around a point is a measure of the intensity of magnetic
field at that point. It is obvious that the magnitude of magnetic field intensity at
poles is a maximum. Figure 3.2a shows the magnetic field lines of a bar magnet
drawn using a compass needle. Though when looked from outside it appears as if
the lines of magnetic field of a bar magnet originate from North pole and terminate
at South pole, but the fact is that magnetic field lines are continuous within the bar
magnet as shown in Fig. 3.2b. Parallel magnetic field lines at some place indicate a
uniform magnetic field in that region. North poles as well as the south poles of two
bar magnets repel each other while the opposite poles attract. One special property
of a magnet is that North and South poles occur in pairs, for example if one break a
bar magnet into pieces, each piece develops North and South poles. It is to say that
free North or South magnetic poles (mono poles) never occur in nature. Magnetic
poles to some extant may be compared with electric charges, magnetic north pole
like positive electric charge and magnetic south pole like negative electric charge.
3.3 Magnetic Dipole Moment 141
Fig. 3.3 a Electric field lines (blue) and equipotential lines (red) due to an electric dipole b magnetic
field lines due to a current loop with axis along Z-axis; the yellow line shows the segment of the
current loop c current loop and associated magnetic field lines
enclosed by the current loop. The direction of the magnetic moment of the loop may
be easily determined using the right-hand rule.
When an object having a magnetic dipole moment μ → is placed in an external
−−→
magnetic field B ex t , the object or the magnetic moment associated with the object
experiences a torque that tries to align the dipole moment of the object in the direction
of the external magnetic field. The magnitude and direction of the torque τ are given
−−→ −−→
by the vector equation τ→ = μ×→ B ex t as shown in Fig. 3.4c. In case when B ex t has the
magnitude of 1, and the angle between magnetic moment μ and external magnetic
field Bext is 90°, then τ = μ. One may, therefore, define the magnetic moment of an
object as the maximum torque experienced by the object in unit magnetic field.
3.4 Magnetic Moment of a Charged Particle Moving in a Circular Orbit 143
Fig. 3.4 a Magnetic dipole moment of a small bar magnet. b Magnetic dipole moment of a circular
current loop. c Torque experienced by a magnetic dipole moment in an external magnetic field.
d Potential energy of a magnetic dipole in an external magnetic field
The potential energy U of the interaction between the magnetic moment and the
−−→
external magnetic field is given as U = −μ.→ B ex t . The negative sign in expression for
U is included to indicate that the potential energy of interaction will be a minimum
when both the magnetic moment and the external magnetic fields are parallel and
will be a maximum when they point in opposite directions. It is arbitrarily chosen
to assign a value zero to the potential energy U when magnetic moment points in a
direction perpendicular to the external magnetic field.
Figure 3.5 shows a particle of mass m, charge + q moving with linear velocity v
in a circular path of radius r. We will calculate the magnetic dipole moment µ and
angular momentum L of the particle under classical approach.
Let us first calculate the magnetic (dipole) moment μ → of the particle. A particle
having an electric charge + q, moving in a circular closed path with linear velocity
v (angular velocity ω = v/r ) crosses a fixed point on its circular path after each
144 3 Magnetic Materials
time interval t = 2πr/v. The frequency of crossing the fixed point f = 1t = v/2πr
and the current I constituted by the circulating charge + q and defined as the rate of
flowing of charge, may be written as,
+qv
I = +q f = (3.3)
2πr
Current I flows in the same direction as the direction of motion of the electric
charge + q, in case the charge q is negative, the current I will flow in the direction
opposite to the direction of motion of the charge. The circular area A enclosed by
current I is A = π r 2 and, therefore, the magnetic (dipole) moment of the current loop
is given by,
−
→ +qv ( 2 ) +qvr
μ| = IA = πr = (3.4)
2πr 2
Next, let us calculate the angular momentum L of the charged particle of mass m
which is moving in the circular path of radius r with linear velocity v. By definition,
the angular momentum is given as;
| |
| |
L→ = r→ × p→ = r→ × (m . v→) or | L→ | = mr v sin θ (3.5)
In Eq. (3.5) θ is the angle between the linear velocity v and the radius vector r,
which in the present case of circular motion is always 90° and hence
→ = mrv
| L| (3.6)
The direction of L is given by the vector cross-product rule and in the present
case angular momentum of the particle points in the upward direction, normal to the
−
→
plane containing r and v. It may be observed that if q is positive, both −
→
μ and L
points are in the same direction but if q is negative, −
→
μ will point downwards while
angular momentum L will point upwards.
3.4 Magnetic Moment of a Charged Particle Moving in a Circular Orbit 145
Having obtained expressions for the magnetic moment of the moving charge q
and its angular momentum we calculate the ratio,
| | +qvr
|μ |
|→|=γ = 2 = q (3.7)
| L→ | mr v 2m
So far we considered the motion of a charged particle in classical limits and derived
expressions for magnetic dipole moment and angular momentum. However, it is
known that the behaviour of microscopic particles like neutron, proton and elec-
tron, etc., which are constituents of atom, is better understood in terms of quantum
mechanical treatment. It is interesting to note that all classical expressions derived
here holds good in quantum mechanical treatment also. However, there are two points
of difference between the classical and the quantum mechanical treatments: (i) In
quantum treatment it is not important wether the current flows in a circular path or
not, current must flow in a closed loop enclosing some area A (which may not neces-
sarily be a circular area) and that value of area may be used in above expressions. (ii)
In case of classical treatment L may have different continuous values from zero to
mv, depending on the value of angle θ, but in quantum physics L may assume only
discrete values. In switching from classical to quantum treatment, the expression for
L becomes,
√
L = l(l + 1)ℏ (3.9)
−
→l q →
μ = L (3.10)
2m
and,
√
L= l(l + 1)ℏ (3.11)
Electrons are constituents of all atoms and molecules. It is known that two kinds of
motions, namely orbital motion around the atomic nucleus and inherent spin motion,
are associated with electrons. Both these motions obey laws of quantum mechanics
and have corresponding magnetic moments associated with them. Let us first consider
the orbital motion of an electron.
(i) Orbital motion of electron
The magnetic moment μorb e of electron due to its orbital motion may be obtained
from Eq. (3.10) by replacing (i) q by (− e) where minus sign indicates that the
charge of electron is negative, e = 1.6 × 10−19 C is the charge of the electron
and (ii) m by m e , the mass of electron. With these substitutions, we get
−−→ e →
μorb
e =− L (3.12)
2m e
Equation (3.12) tells that in case of electron, the direction of magnetic moment
μorb
e is opposite to the direction of angular momentum L, because of the negative
charge on electron.
The simplest way of measuring the magnetic (dipole) moment of a particle is
to put the particle in some external field in a given direction and measure the
interaction energy U, from which the magnitude of magnetic moment may be
calculated. Suppose the external magnetic field is applied along the z-axis. Then
the z-component of the magnetic moment will come into play. It is reasonable
to assume that Eq. (3.12) will also hold for the z-components of μle and L,
therefore,
( ) e
μorb
e z
=− Lz (3.13)
2m e
3.5 Magnetic (Dipole) Moment of Electron 147
L z = ml ℏ (3.14)
The negative sign on the RHS of Eq. (3.15) simply indicates that the direction
of μorb
e is opposite to L z .
[ ]
eℏ
Quantity 2m e
is defined as 1 Bohr Magneton represented either by Bm or
μB . Magnetic moment of atomic particle like electron is measured in units of
Bohr magneton. Magnitude of 1 Bm or 1μB depends on the system of units. In
SI system of units 1 Bm = 2meℏ
e
while in Gaussian CGS system 1 Bm = 2meℏe c
M L 3 T −1 Q −1 ; N m( A/m)−1 ; N M T −1 etc.
As already mentioned the negative sign in Eq. (3.16) may be dropped, taking in
account the fact that the magnetic moment of electron due to its orbital motion
is opposite in direction( to the
) angular momentum. It follows from Eqs. (3.15)
and (3.16) that when μorb e z
is measured in units of Bohr magneton (Bm ) and
proper sign of μorb
J is taken into account, then the ratio
( orb )
μe z (in unit Bm ) μorb
= 1 or J
=1 (3.17)
ml J
μ(in Bm )
=g (3.18)
J
Equation (3.18) implies that if μ is the value of the magnetic moment of some
particle measured in units of Bohr magneton (Bm ) and J is the corresponding
quantum number associated with motion of the particle, then the ratio of the
two may be represented by a constant g. The constant g is called the g-factor of
the particle and depends on the nature of the particle as well as on the type and
state of motion of the particle. For example, the value of g-factor for electron
for its orbital motion georbit may be given as;
μorb
J
= georbit = 1 (3.19)
J
3.5 Magnetic (Dipole) Moment of Electron 149
Thus, the g-factor (a dimension less constant) of electron for its orbital motion
georbit = 1.
Difference between Gyromagnetic ratio γ and the g-factor: Both represent
the ratio of magnetic moment to angular momentum, but units for measuring
the magnetic moment and the angular momentum are different. The ratio of
magnetic moment to angular momentum quantum number when magnetic
moment is measured in units of Bohr magneton, gives the g-factor.
(ii) Spin motion of electron
Motion of electron around the nucleus of the atom is termed as its orbital motion.
Apart from that an electron also possesses an inherent motion called the spin
motion. Spin or spin motion is purely a quantum mechanical concept that cannot
be explained in terms of classical physics. It is reasonable to assume that the
inherent spin motion of electron follows the same laws as are followed by its
orbital motion. There is an angular momentum quantum number L associated
with the orbital motion similarly there is a quantum number S associated with
spin motion. One can write down an expression for spin motion corresponding
to Eq. (3.18) of orbital motion as given below;
spin
μs (in units of Bm )
= g spin (3.20)
s
Or
μspin
s (in units of Bohr magneton (Bm )) = gespin s (3.21)
spin
Precise measurement of spin magnetic moment μs of electron gives the value
spin
of ge for electron as 2.002318. Therefore, in case of electron;
spin
The fact that ge ≈ 2, while georbit = 1 implies that spin motion of electron is
twice as effective in producing magnetic field as the orbital motion.
There are many electrons in an average atom and the total magnetic dipole
moment due to all electrons in the atom may be calculated in two different
ways: (a) In this method, called the j − j coupling method, the first step is to
find the total angular momentum ji of the ith electron by quantum mechanically
adding its orbital and spin angular momentums, ji = li + si . In the second step
the total angular momentum J of all the electrons is determined by quantum√
mechanical adding of total angular momentums ji ’s, of electrons; J = ji .
Once total angular momentum J is known one may calculate the magnetic
moment of the atom due to all its electrons using the expression
150 3 Magnetic Materials
( j− j) ( j− j )
μatom = gatom J (3.22)
( j− j)
Here gatom is the g-factor for j − j coupling. It may be mentioned that quantum
mechanical addition of angular momentums is quite different from the classical
vector addition. (b) In second method, called (l − s) coupling, the total orbital
angular momentum L of all electrons in the atom is determined √ by quantum
mechanical adding of their orbital angular momentums, L = li and simi-
larly total spin S of the atom is√
obtained by quantum mechanical adding of spins
of individual electrons, S = si . The resultant angular momentum J of the
atom is then obtained by quantum mechanical addition of L and S; J = L + S.
Expression μ(L+S) (L+S)
atom = gatom J may be used to obtain the magnetic moment of
(L+S) ( j− j )
the atom. g-factors; gatom and gatom may have different values for the same
atom.
(iii) Magnetic moments of nuclear particles
Nucleus of every atom contains protons and neutrons which also perform orbital
and spin motions. Nucleons (both neutron and proton), therefore, like electrons,
have magnetic moments. Magnetic moment of nucleons is measured in a unit
called nuclear magneton, denoted by Nm . 1 Nuclear Magneton (Nm ) = 2m eℏ
p
,
where m p is the mass of proton. One nuclear magneton is about 1836 times
smaller than one Bohr magneton because a proto (1.6 × 10–27 kg) is roughly
1836 times heavier than an electron (9.1 × 10–31 kg). As a result, the contri-
bution of magnetic moments of nucleons is neglected while discussing the
magnetic moment of the atom.
As a concluding remark it may be said that magnetic fields originate from
electric currents.
A common man may categorise solid matter in two classes: either magnetic or non-
magnetic. Magnetic materials are those which attract some other materials like iron
filing, lodestone, pins, etc., whereas non-magnetic solids, like wood, common salt,
chalk, etc. do not show any effect of magnetic field. However, the fact is that the
so-called non-magnetic materials are also affected by magnetic fields but the effect
is so weak that very sensitive detecting instruments and high magnetic fields are
required to demonstrate the effect.
Magnetic behaviour of matter originates essentially from the spin and orbital
motions of electrons in atoms/molecules or ions. Fortunately, all electrons in the
atom or ions, etc. do not contribute to magnetic properties. According to quantum
mechanical laws, an even number of electrons in a given energy level of the atom/ion,
etc. orient their orbital and spin motions in such a way that their magnetic moments
cancel out, resulting in no net magnetic field. Therefore, atoms, ions and molecules
the outer valence shell of which have even number of electrons do not show magnetic
3.6 Magnetic Behaviour of Solids 151
properties. However, if there are odd number of electrons in the outer energy levels of
an atom, ion or molecule, it has a magnetic dipole moment (and hence show magnetic
properties) that may be attributed to the unpaired electrons.
Large number of atoms, like that of bismuth, mercury, silver, copper, inert gases,
etc. have closed shell electron structure with even number of electrons, and hence,
do not show any magnetic properties of their own. However, when put in an external
magnetic field, these atoms develop an induced magnetism opposite to the applied
magnetic field. Such materials are called diamagnetic materials. Copper atom in
ground state has electron configuration: 1s2 2s2 2p6 3s2 3p6 4s1 3d10 ; and is diamag-
netic. Copper ion Cu1+ on loosing 4s1 electron also becomes a diamagnetic ion. Some
other examples are;
Copper (Cu) ground state: [Ar] 4s1 3d10 ; Copper ion (Cu1+ ):[Ar] 3d10 ; Lead (Pb)
ground state: [Xe]6s2 4f14 5d10 6p2 ; Silver (Ag) ground state: [Kr] 4p6 5s1 4d10 ;
Mercury (Hg) ground state: [Xe] 4f14 5d10 6s2
Atoms/ions with odd number of electrons in outer shell show magnetic properties
that may be attributed to the orbital and spin motions of the unpaired electron; such
atoms behave like small dipoles. Since atomic dipoles, in general, are randomly
oriented, the bulk material may show inherent magnetism but quite weak in strength.
These materials are classified as paramagnetic materials. In some special cases, tiny
atomic (or ionic or molecular) dipoles may align themselves in different patterns
giving rise to ferromagnetism, ferrimagnetism and antiferromagnetism.
In order to study different types of magnetism in details we define the
following important magnetic parameters that are frequently used to specify types
of magnetism.
SAQ: It is customary to neglect magnetic effects produced by the motion of protons
present in the nucleus of the atom, while calculating magnetic dipole moment
of the atom. Give justification.
SAQ: Neutron possesses a magnetic dipole moment due to its spin motion which
has negative value. What inference do you draw about the charge distribution
within a neutron?
F = B i dl sin θ , (3.23)
152 3 Magnetic Materials
Here θ is the angle between the direction of current and the force.
In case of unit current (i = 1), force perpendicular to the direction of current per
unit length (dl = 1) is equal to the magnetic induction B at point P. Direction of
B will be perpendicular to the direction of current flow. The SI units for magnetic
induction B is tesla, represented by letter T. One tesla 1 T = one weber per square
metre that corresponds to 104 gauss. 1 T may also be represented as one kilogram
per second squared per ampere (kg s−2 A−1 ).
Having defined B at point P in terms of the force F, one may now define the
strength of magnetic field H at point P as,
B
H= (3.24)
μ0
of the surrounding medium, then the magnitude of magnetic induction at point P (in
presence of the new medium) is given as’
B = μ0 (H + M) = μH (3.25)
Here, μ is the permeability of the medium (iron or brass, etc.). The ratio μμ0 =
μr is called the relative permeability of the medium is just a number/fraction
without units. Permeability and relative permeability of some materials are given in
Table 3.1.
In an isotropic medium B and H are parallel and the permeability is a scalar
quantity. However in an anisotropic medium B and H may not necessarily be parallel
and the permeability is a tensor.
The induced dipole moment per unit volume M, also called the intensity of
magnetisation or simply magnetisation is the effect of the magnetising field H on the
medium. Thus H is the ‘cause’ and M is the ‘result’. It is obvious that the ‘result’ M
will be proportional to its ‘cause’ H, i.e.
M ∝ H or M = χ H (3.26)
B = μH = μ0 (H + M) = μ0 (H + χ H) = μ0 H(1 + χ )
Or
μH = μ0 H(1 + χ )
That gives;
154 3 Magnetic Materials
μ r = (1 + χ ) (3.27)
Magnetic behaviour of all materials, depending on the sign and magnitude of their
magnetic susceptibilities, may be classified into five categories: (a) diamagnetic, (b)
paramagnetic, (c) ferromagnetic, (d) antiferromagnetic and (e) ferrimagnetic. Out of
some 90 stable elements of the periodic table around 38 are diamagnetic, 48 para-
magnetic, 03 ferromagnetic and only 01 antiferromagnetic at room temperature. No
element in natural form is found to exhibit ferrimagnetisms; only some compounds
such as mixed oxides show this type of magnetic behaviour.
Materials with bulk susceptibility χ having small negative value in the range of
−10−6 to − 10−5 are classified as diamagnetic. Negative value of susceptibility
means that in an applied magnetic field diamagnetic materials acquire a magnetisation
which points opposite to the magnetising field H. Atoms of diamagnetic materials
do not show any magnetism in absence of any external magnetic field (H); that
essentially means that the outer shells of these atoms have even number of electrons
that chancel out their magnetic moments (due to their spin and orbital motions) in
pairs. However, when some external magnetic field H is applied to a diamagnetic
atom, it alters the speeds of rotation of the two electrons of a pair in such a way
that a net magnetic moment is generated that is opposite in direction to the applied
field H. The magnitude of bulk susceptibility for most of the diamagnetic materials
does not change with temperature neither it depends on the strength of the external
magnetising field H. Since application of an external magnetic field H effects the
spin and orbital motions of electrons of any atom, diamagnetic effect is produced in
all types of atoms irrespective of their class of magnetism; atoms of paramagnetic,
ferromagnetic, antiferromagnetic and ferrimagnetic materials also show diamagnetic
behaviour, but the magnitude of diamagnetic affect is much too small in these atoms
as compared to other more dominant effects. As such it may not be wrong to say that
diamagnetism is universal and more fundamental than any other type of magnetism.
3.7 Classification of Magnetic Materials 155
eB
ωL = (3.28)
2m e
Here e and me are respectively the charge and mass of the electron.
The angular momentum L p associated with precessional motion is given as;
⟨ ⟩
L p = m e ωL r 2 (3.29)
⟨ ⟩
where r 2 is the mean square distance from an axis through the nucleus parallel to
B.
Larmor precession motion of electron will associate an additional magnetic dipole
moment μep with each electron of the atom, given by;
e
μep = − Lp (3.30)
2m e
If each atom has Z electrons and number of atoms per unit volume (number density
of atoms) in the material is N, then the total additional dipole moment per unit volume
or magnetisation M that will develop in the material due to Larmor precession will
be given as;
( )
e e ( ⟨ ⟩)
M = Z N μep = −N Z Lp = −N Z m e ωL r 2 (3.31)
2m e 2m e
Substituting the value of ω L from Eq. (3.28) in Eq. (3.31) one gets,
( )
e eB (⟨ 2 ⟩)
M = −N Z (m e ) r (3.32)
2m e 2m e
156 3 Magnetic Materials
Fig. 3.7 a Electron undergoing Larmor precession. b H–M graph for a diamagnetic material
M e2 (⟨ 2 ⟩)
= χ = −μ0 N Z r (3.33)
H 4m e
(⟨ ⟩)
The value of r 2 may be calculated with reference to Fig. 3.7a in terms of the
mean square radius ρ of the orbit as,
ρ2 = x 2 + y2 + z2
But
ρ2
x 2 = y2 = z2 =
3
and
2 2
r 2 = x 2 + y2 = ρ
3
(⟨ ⟩) ⟨ ⟩
Therefore, r 2 = 23 ρ 2 substituting this in Eq. (3.33) gives,
( )
M e2 2 ⟨ 2 ⟩ e2 ⟨ 2 ⟩
= χ = −μ0 N Z ρ = −μ0 N Z ρ (3.34)
H 4m e 3 6m e
Langevin assumed that each atom/ion of the material has a permanent or inherent
dipole moment μ and that number density of atoms/ions in the specimen is N per
unit volume. Further, it is assumed that there is no interaction between these inherent
dipoles. On application of an external magnetic field H, each inherent magnetic
dipole experiences a torque τ which tries to align the magnetic dipole in the direc-
tion of external magnetic field. However, on account of the thermal motion (due
to temperature T of the specimen), all atomic dipoles could not align completely
along the direction of H. Different atomic dipoles will align to different degrees with
respect to the field H. Let us assume that a typical inherent dipole align itself at an
angle θ with respect to the direction of the external magnetic field H (see Fig. 3.8).
The potential energy E θ of this magnetic dipole in magnetic field H is given as,
Or
[ ( μH cos θ ) ]
{π
0 μH cos θ e kβ T sin θ dθ
⟨m⟩ = ( ) (3.38)
{π μH cos θ
0 e kβ T
sin θ dθ
μH
y= and x = cos θ ; then dx = − sin θ dθ
kβ T
Or
[ y ]
−y y
μ ey + e y − ey 2 + e−y [⎧ y ⎫ ]
y2 e + e−y 1
⟨m⟩ = [ y ] =μ −
e −y
− ey e y − e−y y
y
Or
( )
1
⟨m⟩ = μ coth y − = μL(y) (3.39)
y
M Nμ
M = N μ and susceptibility χ = = (3.41)
H H
y2 Y3
ey = 1 + y + + + ···
2! 3!
and
y2 Y3
e−y = 1 − y + − + ···
2! 3!
Since successive terms of the two series appearing in the numerator and the
denominator of the above expression decrease very fast, it is enough to retain
first two terms of the series to obtain the approximate value of function L(y).
Or
[ ] ⎡( 2
) ⎤
1 + y2
2
1 1 + y2 1
L(y) ∼
= y3
− =⎣ ( )− ⎦
y + 3.2.1 y y
2
y 1 + y6
162 3 Magnetic Materials
[( )( )( )−1 ]
1 y2 y2 1
= 1+ 1+ −
y 2 6 y
Or
[( )( )(( )) ]
1 y2 y2 1
L(y) ∼
= 1+ 1− −
y 2 6 y
( )[ 2 2 4 ]
1 y y y
= 1+ − − −1
y 2 6 12
N μy μH
M = N μL(y) = = Nμ
3 3kβ T
M N μ2 C
χ= = = (3.42)
H 3kβ T T
N μ2
where C = 3kβ
is called Curie constant.
Expression (3.42) is called Curie law and tells that for the condition μH ≪ kβ T
the magnetic susceptibility of paramagnetic materials is inversely proportional to the
temperature.
Variation of the ratio of magnetisation M to the saturation magnetisation M s with
the value of the parameter y (= μH/k β T ) is shown in Fig. 3.9. It may be noted that
for low temperature and high magnetising field, the ratio M/M s increases with H as
the tangent to the curve for low H and high temperature.
As already mentioned, the above derivation of susceptibility for paramagnetic
materials is applicable only to those atoms/ions or molecules where there is no inter-
action between nearby inherent magnetic moments of the material. However, this
assumption does not hold, particularly, for paramagnetic metals which contain large
number of free electrons in conduction band. Application of an external magnetic
field affects both the magnetic moments due to the orbital and spin motions in such
cases. Since spin of an electron is totally a quantum mechanical concept, Pauli model
of paramagnetism is a quantum mechanical model that is beyond the scope of this
discussion. In Pauli model conduction electrons are considered essentially free and
3.7 Classification of Magnetic Materials 163
under the applied external magnetic field H an imbalance between electrons with
opposite spin is setup that leads to a low value of magnetisation in the direction of
field H. This imbalance may be understood in terms of Fig. 3.10. Figure 3.10a shows
the Fermi–Dirac distribution of electrons with opposite spins before the application
of magnetising field H. As may be observed in this figure the two halves of the distri-
bution for opposite spins are equal. However, on application of external magnetic
field H, the energy of component with spin parallel to H decreases by the amount
μb .H , while those with spin opposite to H increases by the same amount. Here
μb represents the inherent magnetic dipole moment of the atom/ion. As a result of
pulling down (in energy) of parallel spin distribution, some electrons from antipar-
allel side in the neighbourhood of Fermi level, fall into parallel spin side, increasing
the number of electrons with parallel spin over the number of electrons with antipar-
allel spins. This slight imbalance in the number of electrons on the two sides gives
rise to a small value of magnetisation in the direction of the applied field.
The susceptibility, though essentially independent of the temperature but in case
of paramagnetic metals, may have some temperature dependence because of the
change in electronic band structure due to field H.
Some metals like Al, Mg, Ti, V, etc.; some diatomic gases like O2 , and NO,
ions of transition metals and rear earth metals and their salts along with rear earth
oxides show paramagnetism. Many minerals and other materials found in nature are
paramagnetic, for example pyrrhotite (Fe3 S8 ), ilmenite (FeTiO3 ), siderite (FeCO3 ),
quartz (SiO2 ), etc. show paramagnetic behaviour.
SAQ: Use expression (3.36) to calculate the number of dipoles that will have zero
energy in the applied field H.
164 3 Magnetic Materials
Fig. 3.10 Fermi–Dirac distribution of electrons with spin up and down a before the application of
magnetic field H, b after the application of magnetic field H
H i = nw M s (3.43)
H e f f = H + H i = H + nw M s (3.44)
166 3 Magnetic Materials
Fig. 3.11 Magnetic domains in a ferromagnetic material (a) in absence of any external field (b) in
presence of external magnetic field H. c In absence of any magnetising field area/volume occupied
by domains in four basic directions is same. d On applying the magnetising field H, the area/volume
of the domain having magnetisation parallel to the applied field H increases
Let N be the number of atoms per unit volume, J the total angular momentum
quantum number of each atom, then the possible components of magnetic moment
is,
M j gμB
P E = −M j gμ B .H (3.45)
3.7 Classification of Magnetic Materials 167
Now from statistical mechanics it follows that the total magnetic moment per unit
volume or magnetisation M along H is given by
(M )
√+J j gμB
kβ T
N −J M j gμB e
M= (M ) (3.46)
√+J j gμB
kβ T
−J e
M = N g J μB B J (x) (3.47)
where
( ) ( x )
2J + 1 2J + 1 1
B J (x) = coth x− coth (3.48)
2J 2J 2J 2J
and
g J μB Heff g J μB (H + n w M)
x= = . (3.49)
kβ T kβ T
g J μB (n w Ms )
x= (3.50)
kβ T
Or
kβ T x
Ms (T ) = (3.51)
g J μB n w
Ms (0) = N g J μ B (3.52)
Ms (T ) kβ T x
= ( )2 (3.53)
Ms (0) N n w g J μβ
168 3 Magnetic Materials
Also, when one divides Eq. (3.47) by Eq. (3.52), one gets,
Ms (T )
= B J (x) (3.54)
Ms (0)
BJ (x) from Eq. (3.54), the ratio depends on the total momentum J associated with
the atom and hence rate of fall for the ratio is J dependent as shown in Fig. 3.13.
In case when T > T C , when there is no spontaneous magnetisation, the material
behaves as a paramagnetic substance and a small external field may be required to
align some of the atomic dipoles to produce some magnetisation. The external field
must be small to avoid the state of saturation. Now from Eq. (3.49)
g J μB Heff g J μB (H + n w M)
x= = ≪ 1; since T is large and H is small
kβ T kβ T
M = N g J μB B J (x)
( J +1 )
But for small x ≪ 1, B j (x) ∼
= 3J
x; putting this value in the expression above,
one gets;
( ) ( )
J +1 J + 1 g J μB (H + n w M)
M = N g J μB B J (x) ∼
= N g J μB x = N g J μB
3J 3J kβ T
Or
N g 2 μ2B ( J + 1)
M= [H + n w M] (3.55)
3kβ T
Or
Defining
N g 2 μ2B (J + 1)n w
Tc ≡ (3.57)
3kβ
Or
C
χ= (3.58)
T − Tc
( )
Here C = nTCw is a constant for a give material and is called Curie constant.
Equation (3.58) defines Curie–Weiss law and gives the magnetic susceptibility
of a ferromagnetic material above Curie temperature for low external fields. The
variation of magnetic susceptibility of materials at temperatures higher than Curie
temperature, where a ferromagnetic material behaves as a paramagnetic substance, is
well explained by Curie–Weiss law. The Curie temperature Tc for different material
may be determined experimentally by measuring susceptibility at different temper-
atures. Once T C is known other parameters of the material may also be determined;
for example in case of ferromagnetic element Gd, the experimentally determined
value of T C is 292 K; J = S = 7/2, g = 2, N = 3.0 × 1028 m−3 . This data gives
M s (0) = Ng μB J = 1.95 M A m−1 and Bi = μ0 H i = 144 T.
Figure 3.14 shows the temperature dependence of susceptibility for a paramag-
netic material and for a ferromagnetic material. A ferromagnetic material undergoes
phase transition at Curie temperature T C . This results in singularities in the behaviour
of physical properties like susceptibility, magnetisation, specific heat, etc.
(i) Exchange interactions
It can be shown that interactions between atomic magnetic dipoles cannot
generate a magnetic field strong enough to align all atoms of a domain in a
particular direction, i.e. dipole interactions are not strong enough to generate
the internal magnetic field H i ≈ 100T which is required to align atoms in
a domain. Weiss in 1907, while proposing his theory simply assumed that a
sufficiently strong internal field H i responsible for ferromagnetism is present
in each domain, without giving any explanation for its generation. The riddle
was solved in 1928 when Heisenberg introduced the concept of exchange inter-
actions. Exchange interactions originate from electrostatic Coulomb repulsion
between electrons of the neighbouring atoms and Pauli’s exclusion principle.
3.7 Classification of Magnetic Materials 171
H = −2 j S1 S2
Here S1 and S2 are the spins of neighbouring atoms, and j is the exchange
integral (do not confuse this J with total spin J). J > 0 indicates a ferromagnetic
interaction favouring parallel spin alignment (↑↑) while J < 0 indicates an
antiferromagnetic interaction favouring antiparallel spin alignment (↑↓).
SAQ: What happens to the domain walls when the magnitude of the external
field H is changed?
(ii) Spin wave
The lowest energy state of a ferromagnetic system occurs when all spins (spins
of all atoms) are parallel to each other along the direction of magnetisation.
However, when one of the spins tilts or get disturbed, it begins to precess
around the direction of magnetisation. The disturbance so produced propa-
gates as a wave through the system because of exchange interaction between
neighbouring atoms as shown in Fig. 3.15a.
172 3 Magnetic Materials
Spin waves are analogues to lattice waves created by the oscillation of atoms
about their equilibrium position. In spin waves spins precess around equilib-
rium magnetisation and precession of atoms are correlated through exchange
interactions. Spin waves may be quantised, like quantisation of lattice waves
with quanta called phonon. The quantised spin wave is called magnon.
Elements like iron (Fe), cobalt (Co), nickel (Ni), gadolinium (Gd) and dyspro-
sium (Dy) are ferromagnetic at room temperature. The Curie temperature for
Fe, Co, Ni and Gd is respectively, 770 °C, 1131 °C, 358 °C and 565 °C; however,
EuO has Curie temperature of 70 K (343 °C) and EuS even lower.
(iii) Saturation magnetisation Msat
Three parameters, namely the Curie temperature, saturation magnetisation and
magnetocrystalline anisotropy, are called intrinsic properties of a magnetic
material as they do not depend on the microstructure, i.e. on the grain size and
grain orientation in the crystal.
Saturation magnetisation (M sat ) tells about the maximum magnetic field that
may be produced by a material. Msat depends on three factors: (a) strength
of each atomic/ionic magnetic dipole (m), (b) packing density of atomic/ionic
dipoles and (c) the degree of alignment of dipoles at a given temperature.
Factor (a) depends on the nature of the atom and its electronic configuration
while factor (b) is determined by crystal structure and the presence of any non-
magnetic elements within the structure. Factor (c), the degree of alignment of
atomic/ionic dipoles depends on temperature; higher the temperature less will
be alignment, and on magnetic anisotropy of the crystal.
(iv) Magnetic anisotropy
In crystalline magnetic materials it is often observed that there is one particular
crystallographic direction magnetisation along which is easier as compared to
the other directions. For example, in case of the hexagonal crystal structure of
cobalt (Co) it is easy to magnetise in the crystal direction (001) as compared to
any other direction. It is hard to align the magnetic dipoles along the directions
⟨1010⟩ which lie in the plane normal to the crystal axis 001). See Fig. 3.16b. This
anisotropy is created by the coupling of electron orbitals to the crystal lattice. In
3.7 Classification of Magnetic Materials 173
the easy direction of magnetisation this coupling is such that electron orbitals
are in their lowest energy state.
Magnetic anisotropy of a crystal is measured in terms of the anisotropy field
Ha defined as the magnetic field needed to saturate the magnetisation in a hard
direction.
(v) Magnetic hysteresis in ferromagnetic substances
A ferromagnetic material in absence of any external magnetic field behaves
as a diamagnetic material because of random orientation of magnetisations of
different domains. However, application of a small external magnetic field H a
large magnetic induction B or magnetisation M in the direction of the applied
field H is produced. When a ferromagnetic material is magnetised in a particular
direction, it does not revert back to the state of zero magnetisation when the
external magnetising field H is withdrawn. In order to bring a magnetised
ferromagnetic specimen back to the state of zero magnetism, a magnetic field
opposite in direction to the field H has to be applied.
Figure 3.17 shows the behaviour of a ferromagnetic specimen below its Curie
temperature, subjected to an alternating magnetic field. The strength of the
magnetising field H is plotted on the X-axis while the magnitude of the magnetic
flux density B or the strength of magnetisation M (B = μ(H + M)) is shown
on the Y-axis. The starting point is the origin O when H = 0 and B = 0. As
H is increased in a given direction the flux density B in the direction of H
also increases reaching the point a (H 0 , B0 or M 0 ). Any further increase of H
beyond H 0 does not increase B or M above B0 (or M 0 ). This is called the state
of saturation. In the state of saturation, magnetisations of all domains in the
174 3 Magnetic Materials
low value of residual magnetisation and coercive field are called soft magnetic
materials. Soft magnetic materials may be easily magnetised and demagnetised
without loss of much energy. Soft materials have high value of relative perme-
ability (μr ) and low value of coercive field. Silicon steel is a very good example
of soft magnetic materials and is often used as core material in solenoids, trans-
formers, and relays that operate on alternating current generating alternating
magnetic fields.
Common soft magnetic materials are iron, iron–silicon alloys (with 1–5%
silicon) and nickel–iron alloys, also called permeability alloys, with preferred
nickel contents of 42–79%. Addition of molybdenum gives extra electrical
resistivity and addition of copper results in higher initial permeability. Soft
magnetic ceramics, also called ceramic magnets, have been originally made
from iron oxide (Fe2 O3 ) with one or more divalent oxides like that of ZnO,
MgO or NO. The mixture of these oxides is first calcined and grinded to powder,
pressed to the desired shape and sintered. Vectolite is typical light weight and
very high resistivity (like that of an insulator) magnet made by moulding ferric
and ferrous oxides and cobalt oxide. Magnadur (BaO.Fe2 O3 ), made from
BaCO3 (barium carbonate) and ferric oxide is also a soft magnet material.
Table 3.2 lists the properties of some important soft magnetic materials.
Magnets made from hard magnetic materials have strong resistance against
demagnetisation (large coercive field) and large area of hysteresis loop. Details
of some important hard magnetic materials are tabulated in Table 3.3.
176 3 Magnetic Materials
Fifteen elements of the periodic table, namely O, Cr, Mn, Fe, Co, Ni, Nd, Sm,
Eu, Gd, Tb, Dy, Ho, Er and Tm in their solid states show some sort of magnetic
order. As expected, atoms of all these elements have unpaired electrons and associ-
ated magnetic dipole moments essentially from the spins of the unpaired electrons.
Heisenberg has shown that the magnetic order in these fifteen elements originates
from the exchange interactions between the electron clouds of atoms/ions/molecules
subject to Pauli’s exclusion principle (i.e. from electrostatic interactions) and not
from the mutual interaction between the magnetic dipoles of atoms or from the spin–
orbit interactions in atoms, etc. The electrostatic exchange interactions may align
the spin magnetic dipole moments of individual atom/ion/molecule either parallel
or antiparallel to each other. For example, in case of H2 molecule the energy of
the parallel (spin) alignment of two atoms (↑↑), called triplet state, and antiparallel
alignment (↑↓), called singlet state, have different value that depends on the rela-
tive separation of the two atoms as shown in Fig. 3.18. Therefore, the singlet or the
triplet alignment in the ground state of a martial depends on their spins and relative
separation of atoms/ions, etc. in the crystalline structure.
3.7 Classification of Magnetic Materials 177
⇀ ⇀ 1 3
S1 . S2 = for triplet; and − for singlet,
4 4
And the interaction energy U is given as
( )⇀ ⇀ 1( )
U = − E singlet − E triplet S1 . S2 + E singlet + 3E triplet
4
Or
⇀ ⇀
U = −J S1 . S2 + Constant (3.59)
Here E singlet and E triplet respectively, represents the energies of the singlet and triplet
states. Factor J in Eq. (3.59), called ‘exchange coupling constant’ may have positive
(J > 0) or negative (J < 0) values. J > 0 refers to the case when spin will orient in the
same direction (triplet case) while J < 0 refers to the antiparallel alignment of spins
(singlet case). In ferromagnetic materials spins and magnetic dipoles of adjacent
atoms/ions in a given domain align parallel to each other, and it refers to the case
when J > 0. In antiferromagnetic and ferrimagnetic materials, J < 0 and spinful atoms/
ions of such materials align their spins/magnetic moment in opposite directions in
a domain. It may therefore be said that both ferromagnetic and antiferromagnetic/
ferrimagnetic materials have domains, which are created because of the electrostatic
178 3 Magnetic Materials
exchange interactions but in ferromagnetic materials the spin or the magnetic dipole
moments of neighbouring atoms/ions are aligned parallel to each other while in
antiferromagnetic and ferrimagnetic materials the dipole moments of adjacent atoms/
ions are aligned opposite to each other in each domain.
Antiferromagnetism was predicted by French scientist Louis Neel in 1936. It
is experimentally difficult to detect antiferromagnetic material because these mate-
rials above a certain temperature (called Neel temperature) behave like a paramag-
netic material above Curie temperature; and show no magnetism. The final direct
confirmation of Neel’s theory was done by Harry Shull using neutron diffraction in
1938.
Figure 3.19 shows the spin (and magnetic moment) alignments of different types of
magnetic materials in their solid states. Figure 3.19a shows that atoms/ions/molecules
of the material have no net spin and magnetic moment, since they have no unpaired
electrons and, therefore, in absence of an external magnetising field they show no
magnetism. However, when an external magnetising field H is applied, the material
develops weak magnetisation opposite to the direction of the applied field H. They
are diamagnetic. Susceptibility of diamagnetic substance has a negative small value
and generally independent of the temperature. Figure 3.19b shows a paramagnetic
material; atoms/ions/molecules of a paramagnetic material have unpaired electrons
that give rise to an inherent magnetic dipole moment to each of them. Individual
dipoles do not interact with each other. At room temperature (T > 0 K) and in
absence of magnetising field H, the atomic/ionic/molecular dipoles are randomly
distributed (to minimise the magnetic energy of the system) and a paramagnetic
material does not show any magnetisation. However, application of magnetising
field H results in partial alignment of dipoles in the direction of field H generating
a magnetisation in the material. The magnitude of magnetisation increases with the
increase of H reaching a saturation value when all dipoles in the specimen get aligned
to field H. Susceptibility of paramagnetic materials is positive but small; variations
of susceptibility χ and 1/χ of paramagnetic materials with temperature are shown
in Fig. 3.20.
Atoms/ions/molecules of ferromagnetic, antiferromagnetic and ferrimagnetic
materials have unpaired electrons and, therefore, each atom/ion/molecule possesses
a dipole moment (like that of paramagnetic material), but these dipoles interact
with each other (unlike paramagnetism). Mutual interaction between dipoles arises
from electrostatic exchange interaction. The exchange interaction does two things:
(i) it creates domains in the specimen and (ii) spins or magnetic moments of all
atoms in a domain are either aligned parallel or anti-parallel. Materials for which
spins are aligned parallel are called ferromagnetic and those where spins are aligned
anti-parallel are either antiferromagnetic or ferrimagnetic.
In ferromagnetic materials all atoms/ions, etc. have their magnetic moments
aligned parallel to each other in a given domain that gives each domain a finite value
of magnetisation. In absence of magnetising field H and below Curie temperature T c ,
the orientations and magnitudes of magnetisations of different domains are random,
such that the net magnetisation of the material is zero. When external magnetising
field H is switched on, the domain walls in the sample move so as to increase the size
3.7 Classification of Magnetic Materials 179
3.7.4.1 Ferrimagnetisms
Fig. 3.21 a Unit magnetic cell of antiferromagnetic MnO compound. b Temperature dependence
of the susceptibility of antiferromagnetic material below Neel temperature
Using different divalent ions and mixtures of different divalent ions it is possible to
make ferrites with desired magnetic properties. Ferrites made by mixing two or more
divalent ions are called mixed ferrite, an example is (Mn, Mg)Fe2 O4 .
Hexagonal ferrites have the general formula AB12 O19 , where A is a divalent
metal, like barium (Ba), lead (Pb) or strontium (Sr) and B is a trivalent metal like
aluminium (Al), gallium (Ga) or iron (Fe). Hexagonal ferrites have an inverse spinel
like crystal structure with hexagonal symmetry. A common example of hexagonal
ferrite is BaFe12 O19 .
A class of ferrites is called garnets that have a complicated structure which may
be represented as M3 Fe5 O12 . M in this formula stands for some rear earth ion like
yttrium (Y), gadolinium (Gd), samarium (Sm) or europium (Eu). Yttrium iron garnet
(Y3 Fe5 O12 ) is one of the frequently used garnets often denoted as YIG.
Saturation magnetisation of ferrites is not as large as that of ferromagnetic mate-
rials but their biggest advantage is that some ferrites are ceramics and excellent
electric insulators. An insulator magnetic core of ceramic ferrite in high-frequency
transformers eliminates eddy current losses.
SAQ: What are ferrites? And why they are very important? How a ferrite of desired
magnetic properties may be synthesised?
Permanent magnets are required in all walks of life, be it big particle accelerators
used in research or tiny computer memories or fridge-magnetic stickers. Permanent
magnets are characterised in terms of the maximum energy product, i.e. the area
3.8 Permanent Magnetic Materials 183
of the largest rectangle starting from the origin that may be drawn in the second
quadrant of the B–H curve, as shown in Fig. 3.23. (BH)max tells about the magnetic
energy stored in the material per unit volume and is treated as the magnetic figure-
of-merit of the material. Maximum energy product is often measured in units of
kilo-Joule per cubic metre, (kJ m−3 ) in SI system or MGOe (Mega-gauss-Oersted)
in electromagnetic system. Further, 1 MG Oe = 7.958 kJ m−3 . Research in the field of
magnetic materials has led to almost an exponential rise in the magnitude of (BH)max
in the twentieth century starting from 20 kJ m−3 in 1900 to around 450 kJ m−3 in
2000. Increase in the (BH)max value resulted in considerable reduction in the size
of permanent magnets; for example a NdFeB magnet of 102 cc volume will contain
roughly the same magnetic energy as a brass bond lodestone of 105 cc volume; a
reduction of almost 103 in the size of the magnet.
Oldest permanent magnetic material is Lodestone, naturally occurring iron oxide
Fe2 O3 , though loadstone magnets produce low fields but they offer high resistance
to demagnetisation. Magnetic carbon steels, developed in eighteenth century, are
generally alloyed with chromium or tungsten to restrict domain wall movement and
increase coercive field. Magnets made from carbon steel have large saturation field,
order of magnitude larger than loadstone magnets, but have lower value of coercive
field. Synthetic magnets made from alloys of aluminium, cobalt and nickel, called
Alnico magnets, were first developed around 1930 and show considerably larger
values for magnetic hardness as compared to carbon steel. They also have high Curie
temperature of the order of 900 °C. Alnico is cast in a foundry. Magnets of desired
pattern may be made by using sand moulds and pouring molten magnetic material in
the mould. Alnico magnets may also be made by sintering process to form small and
accurate magnets. Alnico magnets have high operating temperature, good corrosion
resistance and long-term magnetic stability. However, their drawbacks are the high
cost on account of Cobalt and low resistance to demagnetisation. Pushing two Alnico
magnets in repulsion may demagnetise both of them.
Ferrite (Fe3 O4 ) is manufactured using powder sintering technology and exact size
tooling into range of industry standard sized disks, rings or blocks of 150 mm ×
100 mm × 25 mm size. These blocks can then be sliced into smaller magnets.
Ferrites are used extensively in loudspeakers and other security systems. The biggest
advantage of ferrite magnets is their very low cost, high resistance to corrosion and
good magnetic stability. Low level of magnetism is their only weakness.
F = q(v X B) = qv B sin θ
Substituting the value of q = 5.0 × 10–6 C; v = 1.5 × 104 m/s; B = 0.25 T and F
= 1.7 × 10–2 N, in the above expression, one gets
Or
1.7 × 10−2
sin θ = = 0.906
5 × 10−6 × 1.5 × 104 × 0.25
Solved Example SE(3.3) Ferrite Fe3 O4 forms a cubic crystal with unit cell edge
length (a) of 0.840 × 10–9 m. Each unit cell of the material contains 8 Fe2+ ions with
each ion having a magnetic moment of 4 μB , and 16 Fe3+ ions that are non-magnetic.
Calculate the saturation magnetisation Bs per unit cell in units of A/m.
Solution: Total magnetic moment of a unit cell due to 8 Fe2+ ions M cell = 8 × 4 μB
= 32 μB .
( )3
Volume of the unit cell Vcell = a 3 = 0.84 × 10−9 = 0.59 × 10−27 m3 .
32 μB 32×(9.27×10−24 A m2 )
Saturation magnetisation per cell B = Mcell =
s Vcell
=
0.59×10−27 m3 0.59×10−27 m3
(on substitution of μB = 9.27 × 10−24 A m2 ).
Or Bs = 5.02 × 105 A/m.
Problems
P3.1 Two identical squares of sides 6 cm, made from conducting wire are placed
side by side on a horizontal table as shown in the figure (P3.1) and a current
186 3 Magnetic Materials
of 45.0 mA is made to flow through the squares. Determine the nature and
magnitude of the force between the two squares.
SA3.1 Write a note on diamagnetism giving Langvine’s theory for it. Why
diamagnetism is called the fundamental magnetism?
SA3.2 Give the major points of similarities and differences between ferromag-
netic and ferrimagnetic materials. What causes domains in these magnetic
materials?
SA3-3 What are the required magnetic properties that a material used for making
permanent magnets must have?. List some important materials used for
making permanent magnets.
SA3.4 Write the expression relating magnetic dipole moment of a charged particle
with its angular momentum in classical limits and discuss how the expres-
sion may be modified for quantum limits. Define 1 Bohr magneton and
give its value in SI units.
SA3.5 Explain why in absence of any external magnetic field H and at a temper-
ature T (a little above 0 K), diamagnetic, paramagnetic and ferromagnetic
materials all show no magnetism. Also write the order of magnitude of bulk
susceptibilities for these materials.
SA3.6 What is exchange interaction and what is its origin? Discuss the role played
by exchange interaction in case of ferro, antiferro and ferric magnetic
materials.
3.8 Permanent Magnetic Materials 187
SA3.7 What are ferrites? Explain how magnetic material with desired value of
saturation magnetisation may be fabricated using mixtures of ferrites.
SA3.8 Draw a typical B–H curve for a ferromagnetic material indicating important
characteristics of the curve. What does the area of the B–H curve represents?
Define (BH)max and discuss the significance of this parameter.
SA3.9 Explain how the application of an external magnetic field H in case of metals
that have free electrons, causes an imbalance in the number of electrons
with opposite spins that leads to a lower value of magnetisation in the
direction of field H.
SA3.10 Explain what is magnetic anisotropy and what causes it.
SA3.11 What are the distinguishing features of magnetically hard and soft mate-
rials? Briefly outline applications of the two types of these materials. What
are ceramic soft magnetic materials and where are they used.
SA3.12 What is meant by singlet and triplet states of spin alignment? Discuss their
role in magnetism.
SA3.13 Draw rough sketches for the variation of bulk susceptibility χ and (1/χ )
with temperature for different types of magnetic materials and define the
Curie and the Neel temperatures.
SA3.14 Write a note on ferrimagnetism giving some examples.
Objective
After reading this chapter the reader is expected to: (i) understand the method of
producing X-rays, their properties and applications, (ii) be able to grasp the concepts
of the dual nature of matter, energy and matter waves, (iii) appreciate the pitfalls of
classical theories of physics in explaining some important experimental observations
and how the quantum approach is able to explain them.
4.1 Introduction
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 191
R. Prasad, Physics and Technology for Engineers,
https://doi.org/10.1007/978-3-031-32084-2_4
192 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
discussed in the later part of the chapter. Quantum mechanical explanations of these
anomalies are also included, towards the end of the chapter.
The discovery of X-rays, that was accidental, was announced by Wilhelm Conrad
Roentgen, in December 1895. W. C. Roentgen, Physics professor at Wurzburg
University, Bavaria, Germany, was studying the properties of cathode rays that are
emitted when an electric discharge is made to pass through the two electrodes of a
cathode tube filled with some gas at low pressure. Roentgen was specifically inter-
ested in whether cathode rays could pass through glass body of the cathode-ray
tube. His cathode-ray tube was covered by thick black paper on all sides, but he was
surprised to see that an incandescent green light nevertheless escaped and projected
onto a fluorescent screen placed nearby. Roentgen found that these mysterious rays
could penetrate through most substances and cast their shadows on the screen. Since
the exact nature and properties of these rays were not known at that time Roentgen
called them X-rays, the term ‘X’ usually used to describe the unknown quantity.
Roentgen also found that the X-ray was capable of passing through human’s tissues
leaving the shadow of bones. Almost immediately X-ray’s uses as a diagnostic tool
to detect bone fractures become a common medical practice.
The X-rays are generally produced in a specially designed vacuum tube invented by
William Crookes which is often called the discharge tube or a cathode-ray tube. A
typical sketch of an X-ray tube is given in Fig. 4.1.
As shown in the figure, the X-ray tube consists of an evacuated glass tube which
is fitted with two electrodes. The electrode which is kept at a negative potential is
called cathode and the other electrode kept at a higher positive potential, the anode.
A potential difference of the order of few tens of kV is maintained between the
two electrodes. Often a heater element is also attached to the cathode which may
be connected to a low-voltage source. When a current is passed through the heating
element, the temperature of cathode increases and thermionic emission of electrons
from cathode takes place. The emitted thermo-electrons get repelled by the nega-
tive potential at cathode and are attracted by the positive anode. Thus electrons get
accelerated and impinge with high speed on to the anode when they get decelerated.
Since accelerated/decelerated charged particles (electrons in this case) emit electro-
magnetic radiations, decelerated electrons in the X-ray tube emit electromagnetic
radiations in the form of X-rays. The X-rays generated due to deceleration of elec-
trons are termed as continuous, soft or Bremsstrahlung X-rays. Bremsstrahlung
X-rays consist of X-ray radiations of all energies starting from a minimum energy
4.2 Discovery, Production and Properties of X-rays 193
Fig. 4.1 Schematic diagram of an X-ray tube. X-rays are produced when energetic electrons
impinge on the anode material
E min to a maximum energy E max , hence the name continuous X-rays. X-rays are also
produced when high-energy electrons hit the atoms of the anode material and shift the
atomic electrons to their higher energy states, thus exciting the anode atoms. Since
atoms cannot remain in excited states for long, excited atoms of the anode revert
back to their ground states emitting X-rays. X-rays produced by the de-excitation
of excited atoms are called characteristic X-rays and their energy depends on the
atoms of anode material. Characteristic X-rays have X-rays of some very definite
wavelengths that depend on the nature of the target atom and are found superimposed
as sharp lines on the continuous background of Bremsstrahlung X-rays.
Since their inception, the X-ray tubes have evolved and undergone several
changes/improvements. A modern cold cathode X-ray tube is shown in Fig. 4.2.
As shown in the figure, the modern X-ray tube contains an anticathode or target
opposite the concave-shaped cathode, further the cathode is not heated and electrons
are not emitted by cathode. The X-ray tube is filled by some inert gas like argon at
low pressure. A spark plug is used to ignite the inert gas which on ionisation produces
electrons. Electrons produced by the inert gas are accelerated between the concave
cathode and the target or the anticathode. Any desired material may be attached to
the anticathode in order to produce characteristic X-rays of that material. Concave
cathode focuses the electron beam to a point on the target which helps in generating
focused X-rays that produce sharp images of dens materials like bones, etc. These
X-ray tubes do not have any heater at its cathode; therefore, they are referred as cold
cathode tubes.
Soon after the discovery of X-rays, efforts were made to study the properties
of these rays. Experimental observations revel that X-rays have a high penetrating
power, travel in straight line and cannot be deflected by electric field or the magnetic
field. When the X-rays are incident on the photographic plates, they are found to
194 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
blacken the film. When X-rays pass through a gaseous medium, they ionise it and
can also cause photoelectric emission similar to when light is incident on metallic
cathode. The X-rays have broadly been categorised into two categories as continuous
X-rays and discrete/characteristic X-rays.
Fig. 4.3 Deceleration of incident electron by the coulomb field of the electron cloud of the target
atom produce Bremsstrahlung X-ray
radiations. Bremsstrahlung, therefore, stands for the radiations that are emitted by
the stopping of electrons.
The process of X-ray emission as a result of deceleration of electrons is called
Bremsstrahlung. Figure 4.3 shows how an incident electron gets retarded in the
Coulomb field of the electron cloud of the target atom, and the difference of kinetic
energy ΔE at points A and B is converted into an X-ray.
A figure depicting the emission of X-ray as a result of change of velocity of
electron due to scattering by the positively charged nucleus of the target atom is
shown in Fig. 4.4.
Fig. 4.4 Deceleration and deflection of an energetic electron by a positively charged nucleus,
resulting into emission of X-ray photon
196 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
hc c E hν
E x = hν = ; ν= ; p= = (4.1)
λ λ c c
Symbols h, c and p in above expressions stand respectively for Planck’s constant,
speed of light and the linear momentum of X-ray photon.
Intensity distributions of continuous X-rays as the function of wavelength and as
the function of photon energy E x or frequency γ are shown in Fig. 4.5. It may be
noted in Fig. 4.5a that X-ray distribution curve has a cut-off wavelength, denoted
by λmin while on the higher wavelength side the curve extends almost up to infinity.
Minimum wavelength λmin corresponds to the X-ray of the highest energy, similarly
X-rays of longer wavelengths have smaller energies extending to almost zero energy.
In Fig. 4.5b, the X-ray of highest energy has the largest frequency γ max and on the
lower frequency side the distribution curve extends almost to the origin. An important
property of continuous X-rays is that the cut-off point (λmin or γ max ) depends only
on the potential difference V between the anode and the cathode of the X-ray tube
and it does not depend on the target material.
The minimum wavelength λmin and maximum energy or maximum frequency
νmax of emitted X-ray corresponds to an incident electron losing all of its energy in
a single collision and radiating it away in the form of a single X-ray photon. If we
assume that total kinetic energy (K.E.) of the electron is converted into energy (hν)
of the X-ray photon, then
Fig. 4.5 Intensity distribution of continuous X-rays as a function of a X-ray wavelength, b X-ray
energy or frequency
4.2 Discovery, Production and Properties of X-rays 197
Or
c
ΔE = hνmax = h (4.2)
λmin
hc
hνmax = (4.3)
λmin
The λmin may be called as the cut-off wavelength, which will mainly depend on
the value of accelerating voltage V applied across the anode and cathode. Thus,
hc
hνmax = = eV (4.4)
λmin
And
hc
λmin = (4.5)
eV
When one substitutes the values of h, c and e in Expression (4.5), one gets the
simplified relation between λmin (in units of Å = 10−10 m) and the voltage V between
the anode and cathode of the X-ray tube as,
12.398 × 103
λmin in units of Å = (4.6)
V (volts)
The total X-ray energy emitted per second (intensity) I depends on the atomic
number Z of the target atom, the electric current i passing through the two electrodes
of the X-ray tube and the potential difference V between the electrodes. Intensity I
may be written as:
Here A is a constant and m is also a constant that has the value of ≈ 2 for most
of the metals.
The dumbbell-shaped spectrum of continuous X-rays has a broad peak which
corresponds to the wavelength of the X-rays of maximum emission, the X-rays
which have the highest number in intensity distribution. With the increase of the
voltage V between the electrodes of the X-ray tube, the peak of maximum emission
and the cut-off wavelength λmin both shifts towards the shorter wavelength and the
total area of the intensity curve also increases as shown in Fig. 4.6. This happens
because of the increase in the kinetic energy of electrons with voltage V, resulting
in larger number of electron interactions in which more energy is lost in the form of
X-rays, and because current of the X-ray tube i also increases with V.
198 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
Fig. 4.6 Variation of the continuous X-ray spectrum with the voltage between electrodes
Fig. 4.7 Representation of X-ray emission during the transition of an electron from L- to K-shell
may revert back to a state of lower excitation by the transfer of an electron from the
next higher shell, the L-shell. The difference in the energy of the ionised atom with
electron vacancy in K-shell and when electron vacancy is in L-shell, is emitted in the
form of an X-ray. This X-ray may be denoted as X L→K and is called the K α X-ray.
Similarly, the vacancy in the K-shell may be filled by an electron from the M-shell
(instead of from L-shell) producing a X M→L -ray which is designated as K β X-ray.
In this way X-rays of different energies may be produced by the de-excitation of an
ionised target atom. In case the high-energy electron in the X-ray tube creates an
electron vacancy in L-shell of the target atom (instead of the K-shell), then in that
case electrons from M, N and other higher shells may fill the electron vacancy in
L-shell producing L α , L β , L γ . . . X-rays. Figure 4.8 shows the electron energy level
diagram of any general target atom and the transitions corresponding to various X-
rays. In the energy level diagram it may be noted that the energy difference between
successive energy levels decreases as one goes higher in energy. The maximum
energy difference occurs between the K- and L-shells, and therefore, the energy of
the K α X-ray is equal to (E L − E K ), where E L and E K are respectively the energy of
the L-shell and of the K-shell. In an actual case, there are large number of high-energy
electrons in the X-ray tube that hit the target almost simultaneously and ionise many
target atoms producing vacancies in different shells of different atoms, as a result
many characteristic X-rays like K α , K β , L α …, etc. are emitted by target atoms.
Since atoms of different materials have different energy level diagrams, the ener-
gies of characteristic X-rays are different for different materials. The characteristic
X-ray spectrum may therefore be treated as a finger print of the atom and is often
used to identify different atoms. The characteristic X-ray spectrum is always found
to be superimposed as sharp peaks on the background of continuous X-ray spectrum
as shown in Fig. 4.9. Since the width of characteristic X-ray peaks is very small as
200 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
compared to the broad peak of continuous spectra, characteristic peaks are called
characteristic lines.
Most X-ray tubes operate at voltages of the order of 50 kV or less. Therefore,
the maximum energy of continuous X-rays may be at the most 50 keV or a little
less, when an electron accelerated to 50 keV energy loses all its energy in a single
event producing an X-ray of 50 keV. As such the continuous X-ray spectra from
such an X-ray tube will terminate at 50 keV energy. The maximum energy X-ray
in characteristic spectra will correspond to K α line of the target atom. For medium-
weight target elements the energy of K α line lies in the range of 50–80 keV energy.
Evidently, in such cases, the characteristic X-ray lines are found to be superimposed
on the high-energy tail of the continuous spectra, as shown in Fig. 4.9.
As already mentioned, the characteristics (wavelength of maximum emission,
intensity and cut-off or minimum wavelength) of the continuous X-ray spectrum
depends essentially on the magnitude of the voltage between the anode and the
cathode of the X-ray tube; however, the wavelengths or frequencies of characteristic
X-ray lines depend only on the atoms of the target material and do not change with
the voltage between the two electrodes of the X-ray tube.
SAQ: What is the basic origin of X-rays and in what respect their origin is different
from that of gamma rays?
SAQ: Which characteristic X-ray of a given atom will have highest energy and
why?
With the discovery of large number of elements, attempts were made to arrange
elements in some order. First such attempt was made by John Dalton in 1803 when he
arranged elements according to the increasing atomic weight. Later, it was observed
that groups of elements exhibit similar chemical properties suggesting the presence of
recurring patterns of chemical behaviour. Around 1870 Dimitri Mendeleev developed
what is called the periodic table of elements, where elements were largely placed
according to their atomic weights and numbered consecutively. In this periodic table
no physical meaning or significance was attached to the sequence number of the
element. However, some anomalies were found in this arrangement of elements in
the table; for example, the atomic weight of Cobalt was higher (58.93) than that of
Nickel (58.69), but its chemical properties suggested that it should be placed before
Nickel in the periodic table. Similarly, the atomic weight of argon was larger than
that of potassium, and the atomic weight of tellurium was greater than that of iodine,
but the chemical properties of both these elements suggested that they in spite of
their heavier weights should precede the corresponding lower weight partner. These
anomalies indicated that atomic weight is not the correct criteria for numbering
elements in the periodic table. Soon it was realised that numbering of elements in
periodic table should be done according to the number of electrons in the atom of the
element which is equal to the number of units of positive charge on the nucleus of the
atom and is denoted by Z, the atomic number. A final and definitive resolution of this
anomaly was achieved by H. G. Moseley, an English physicist, who in 1913 published
a research paper based on his analysis of characteristic X-ray spectra from several
elements showing that the frequencies of characteristic X-ray lines are proportional
to the squares of whole numbers that are equal to the atomic number plus a constant.
Moseley used Bohr’s atomic model for the analysis of experimentally observed
characteristic X-ray spectra from many elements. In order to appreciate Moseley’s
analysis it is required to re-visit Bohr’s model of the atom.
In Bohr’s model of the atom, it is assumed that the electron moves in a circular
orbit around the positively charged point nucleus, balancing the centrifugal force
by the attractive Coulomb force between the oppositely charged electron and the
nucleus. The breakthrough in Bohr’s model was the quantization of electrons’ angular
momentum. Using this model Bohr derived the following formula for the wave-
length of electromagnetic radiations emitted during transitions of electron between
quantized electron energy states in hydrogen-like atom.
202 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
1 1 1
= R 2 − 2 Z2 (4.8)
λ nf ni
m e · e4
R∞ = (4.9)
4π cℏ3
and
Mm e ·e4
R= (4.10)
M + m e 4π cℏ3
Or
3Rc 2
νk = Z
4
And
/
√ 3Rc
νk = Z (4.11)
4
4.2 Discovery, Production and Properties of X-rays 203
With this background, we now discuss Moseley’s work and his observations.
A systematic examination of the characteristic X-ray radiations was carried out by
Moseley for large number of elements from Aluminium to gold. He recorded the X-
ray spectra for these elements on a photographic
√ plate. The analysis of the spectra was
done, and the square root of the frequency ( ν) of the particular characteristic X-ray
radiation versus the ordinal of the element’s position in the periodic table, which we
for the present denote by number N, was plotted. Figure 4.10 shows a representative
graph where square root of the frequency of K α lines of different elements is plotted
against the serial number N of the element in periodic table. It was observed that
with the increase in the position of the element in the periodic table, the root of the
frequency of the emitted radiation increases monotonically and the graph may be
fitted with a straight line. Similar graphs for K β lines and for other lines of K and
the L-series were plotted, and it was observed that for each characteristic X-ray the
data points fall on a straight line. These straight lines may be characterised in terms
of their slope which may be denoted by ‘a’ and their intercept ‘b’ on the X-axis.
Moseley observed that the intercept ‘b’ for all members of a series is same, while
each straight line has a different value of slope ‘a’ as shown in Fig. 4.10. √
On the basis of his findings Moseley reached to the conclusion that the ν for
different series may be written as:
Fig. 4.10 Plot between the square root of the frequencies of K and L lines and the serial number
N of the element in the periodic table
204 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
√
νkα = akα (N − bk )
√
νkβ = akβ (N − bk )
(4.12)
.....................
.....................
√
ν Lα = a Lα (N − b L )
√
ν Lβ = a Lβ (N − b L ) (4.13)
.....................
√
ν Mα = a Mα (N − b M ) (4.14)
The constants a and b in above equations are respectively called the proportion-
ality constant and the screening constant. Moseley also obtained the values of the
proportionality and screening constants for different series from his experimental
plots and found that they may be given in terms of the Rydberg constant R, as given
below,
/ /
3Rc 8Rc
akα = and bk = 1; akβ = and bk = 1,
4 9
/ /
5Rc 3Rc
a Lα = and bk = 7.4; a Lβ = and bk = 7.4.
36 16
Substituting the above values, the root of frequency of K α -series transitions may
be written as:
/
√ 3Rc
νk = (N − 1) (4.16)
4
Moseley assumed that there must be some physical attribute of atoms of the
periodic table that increases in a regular fashion by some fixed amount, from one
element to the next one. He postulated that this can be the charge (Ze) of the nucleus of
the atom which is screened by the negatively charged electrons remaining in orbitals
after the creation of vacancy in a shell.
According to Mosley, the ordinal or serial number N of the element’s position
in the periodic table is equal to the number Z , related to the positive charge (Z e)
carried by the nuclei of the element. The number Z is referred to as the atomic
number of the element and is exactly equal to the number of protons in the nucleus
of the atoms of the materials emitting X-rays. It may be mentioned here that before the
investigations carried out by Mosley, the arrangement of elements in the periodic table
was in the ascending order of their atomic weights and on the basis of their chemical
properties. The Mosley’s results could provide a direct method of determining the
atomic number of the elements and helped in removing the discrepancies in the
periodic table arrangements. As already mentioned, initially the positions of the
transition metals Cobalt (Z = 27) and Nickel (Z = 28) were determined on the basis
of the ascending order of their atomic weights as Ni = 58.71 and Co = 58.93 were
changed. In the same way some empty positions for the still undiscovered elements
were filled. For example, new elements Hafnium (Z = 72), Technetium (Z = 43)
and Rhenium (Z = 75) were discovered as there were missing gaps at these values
of atomic numbers.
It may be noted that the difference in the magnitudes of proportionally constant
‘a’ for different members of a given series, for example, between aKα , aKβ , aKU ,
etc., is very small, and therefore, curves for different members of a series shown in
Fig. 4.10 often merge together, if the resolution of the graph is not good.
It might be of interest to know that Henry Gwyn Jeffreys Moseley was born in
Weymouth, Dorset, England, on November 23, 1887. After is education at Trinity
College, Oxford, he was appointed lecturer in Physics at Rutherford Laboratory,
University of Manchester in 1910. His initial research work was on radioactivity, but
later he carried out detailed studies on X-ray spectra and in 1913–14 published his
famous Moseley law, which paved way to uniquely determine the atomic number of
elements and their positioning in the periodic table. In 1914 he was drafted in army
during the First World War and was shot in the head by a Turkish sniper at the battle
of Suvla Bay. He died at the young age of 27 years.
SAQ: What is the physical significance of Mosley’s law?
SAQ: Calculate the magnitude of proportionality constant aMβ .
different points of the incident wave front interfere with each other giving rise to
an interference pattern. The interference pattern produced by the diffraction of elec-
tromagnetic waves is characteristic of the obstacle, that is the interference pattern
may be treated as the finger print of the diffracting obstacle. The resolution of the
diffraction–interference pattern depends on the wavelength of the electromagnetic
radiations; radiations of shorter wavelength have better resolution and may resolve
structural details of objects/obstacles comparable in dimensions with the wavelength
of the radiation. X-rays are electromagnetic radiations with wavelengths in the range
of nanometer (10−9 m) and are therefore capable of deciphering structural details
of crystals where atoms or group of atoms are arranged at regular and repetitive
distance in nanometer dimensions. X-ray diffraction (XRD) is now a well-establish
and routine procedure to determine the lattice parameters, arrangement of individual
atoms in a single crystal and the phase analysis in case of polycrystalline materials
and compounds.
It may be recalled that crystals are made up of unit cells which are the simplest
repeating structure of a crystalline solid. Arrangement of unit cells results in a crystal
lattice, which is a specific three-dimensional arrangement of unit cells. Incident X-
rays on diffraction from atoms of the crystal lattice produce a distinct interference
pattern characteristic of the atomic arrangement in the crystal lattice (Fig. 4.11).
To calculate the path difference between rays (1 + 3) and rays (2 + 4), we drop
perpendiculars AC and AD from point A on ray-2 and ray-4, respectively. As is clear
208 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
from the figure, ray-2 travels a distance CB more than ray-1 and ray-4 travels the
distance BD more than ray-3. Therefore, the total path difference (Δ-path) between
the incident and scattered rays is:
The father and son W. H. Bragg and Lawrence Bragg for the first time derived
the above condition of constructive interference between electromagnetic radiations
scattered by consecutive members of a family of crystal planes in 1913; the condition
is called Bragg’s law.
It is easy to verify in Fig. 4.12 that the diffracted rays (2 and 4) are rotated from
their original direction (incident rays 1 and 2) by angle 2θ.
SAQ: The phenomenon of diffraction of X-rays indicates their particle nature or
wave nature?
X-rays are extensively used in medical world for detecting fractures in bones,
detecting breaks/tearing of ligaments, sterilising of medical instruments, cloths,
bandages, etc. High-energy X-rays are now used for exploring underground structures
4.2 Discovery, Production and Properties of X-rays 209
Fig. 4.13 Simplified sketch of a possible configuration of X-ray source, powder sample holder and
X-ray detector for PXRD scanning. In this configuration the sample remains stationary while both
the X-ray source and the detector move by the same angle
210 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
X-ray peaks at different angles of rotation in Fig. 4.14 originate from the X-rays
diffracted by different families of crystallographic planes in the crystal. The intensity
of the peak is proportional to the density of planes of a given family. As the crystal
is rotated, the angle of incidence θ changes for different families of crystal planes
and that family of planes for which Bragg’s law 2d sin θ = nλ gets satisfied diffracts
X-rays producing constructive interference and a maximum in intensity.
in some direction, loses some energy and is itself scattered with reduced energy in
the complimentary direction. All these facts might have led de Broglie to postulate
matter waves.
According to de Broglie, the wavelength associated with a material particle is
given by, λ = h/ p, where h is Planck’s constant, while p = mv is the linear
momentum of the particle of mass m moving with velocity v. The validity of the
de Broglie relation can best be tested by the results of the experiment. It may be
remarked that the above equation is satisfied by a photon as well because the linear
momentum of a photon p = hν/c. Thus, hp = νc = λ.
De Broglie’s postulate essentially says that a particle of mass m, moving with
velocity v, has an associated wavelength λ = h/mv. In order to appreciate de Broglie
wavelength λ, it is required to understand what is meant by the term wavelength.
Wavelength is essentially the uncertainty in the position of a moving particle. The
location of a moving particle at any instant in its path of motion is uncertain by the
amount of its wavelength. It is obvious that if the physical dimension (the size) of
the moving object is smaller than the associated de Broglie wavelength, then only it
will be possible to carry out experiments to see the effect of wavelength. In case the
size of moving object is larger than the associated wavelength, it may not be possible
to experimentally detect the wavelength associated with the particle.
It can be seen from the expression for de Broglie wavelength that a particle of large
mass will exhibit a smaller wavelength and vice versa. This is the reason, why in our
daily life the wave-like character of even a cricket ball thrown with 80 miles/h is not
observed. In general de Broglie wavelength of only microscopic atomic and nuclear
particles moving with high velocities may be measured experimentally. According
to Einstein’s theory of relativity, the mass of a moving body depends on its speed of
motion. For particles moving with high velocities relativistic express for mass given
below must be used.
m0
m=/ (4.20)
v2
1− c2
h
λ= √ . (4.21)
2m E
h h 6.64 × 10−34 J s
λ= = = = 4.21 × 10−34 m
p mv (0.10 kg) × 30 ms
As can be seen from the value of the wavelength obtained above that it is so
small that it is not even measurable with present day instruments. This shows the
reason why the wave nature of macroscopic objects is not observable in our daily
life. On the other hand, for microscopic particles the wave-like nature is significant
and observable.
As an example, let us consider that an electron of m and charge e is accelerated
across a potential of V volts, then the electron gains a kinetic energy K .E. = eV .
p2
√
Since K .E. = 2m , therefore, p = 2meV .
The de Broglie wavelength λ = √2meV h
.
If in the above expression we substitute the numerical values of Planck’s constant
h, mass of electron and charge of electron, then the above expression reduces to λ =
1.227
√
V
nm, where V is the accelerating potential in volts. For a value of V = 150 V,
the value of wavelength comes out to be λ = √ 1.227
150
nm = 14.247
1.227
nm = 0.100 nm. This
value is comparable to the order of spacing between the atomic planes in crystals. It
indicates that the particle nature of electrons could be verified by crystal diffraction
experiments similar to the X-ray diffraction. The experimental verification of the de
Broglie hypothesis has been described in detail in the next section. Louise Victor de
Broglie was awarded the 1929 Nobel Prize in Physics for his discovery of the wave
nature of electrons.
Clinton Davisson and Lester Germer were involved in the study of the surface prop-
erties of Nickel samples using a beam of low-energy electrons at Bell laboratories,
USA, since 1923. However, the remark by Walter M. Elsasser (scientist at Gottingen,
Germany) that electron scattering by crystalline solids may be used to test the wave
nature of electrons, led Davisson and Germer in 1927 to repeat their experiment,
now with the view to look for the wave nature of electrons.
Experiment carried out by Davisson and Germer provides direct verification of
De Broglie hypothesis of the wave nature of moving bodies and demonstrated that
moving electron has an associated wave. The typical experimental setup used by
Davisson and Germer is given in Fig. 4.15. The thermionic emission of electrons
from a hot tungsten filament was used to provide a beam of electrons. These electrons
were accelerated by applying suitable potential difference with the help of a battery
as shown in Fig. 4.15.
The electron beam was collimated by allowing them to pass through a cylindrical
arrangement with a fine slit. The electron beam is incident on the Nickel crystal
having ordered arrangement of atoms, at normal incidence. Nickel atoms diffracted/
4.3 Dual Nature of Matter 213
scattered the incident electrons. In the experimental setup Nickel target crystal had an
arrangement of rotation at different angles in a plane. The intensity of the electrons
scattered by the crystal in a given direction was measured with the help of movable
detector. The whole experimental arrangement was placed in a highly evacuated
chamber. Several experiments were carried out and intensity of scattered beam at
different angles was recorded for different accelerating voltages. The observed curves
are plotted in polar coordinates in Fig. 4.16. Surprisingly, instead of a continuous
variation of scattered electron intensity with angle distinct maxima and minima were
observed whose position depended on the electron energy. It was observed that there
is a pronounced maximum that appear at ϑ = 50◦ angle of scattering (with respect to
the direction of the incident beam), when the accelerating potential is 54 V. Further,
increasing the accelerating potential indicated that the bump like maximum decreases
and becomes almost insignificant as the accelerating potential reaches to 68 V.
Figure 4.17(i) shows the variation of the intensity of 54 eV energy electrons with
angle of scattering/diffraction by the Ni-crystal. As may be observed in this figure
a prominent maximum in the observed intensity occurs at angle 50°. Figure 4.17(ii)
shows the direction of the incident beam of electrons which is normal to the top face
of the Ni-crystal. However, the incident beam makes an angle of 65° with family of
dominant crystal planes that are responsible for the diffraction of incident electrons.
Further, from X-ray diffraction experiments it was known that the separation between
successive crystal planes d is 0.91 Å.
The reason for the prominent bump like state at ϑ = 50◦ at accelerating potential
of 54 V may be understood as due to the diffraction of electron waves by the crystal
planes in the target Ni-sample. Figure 4.17(i) shows the electron diffraction pattern
for 54 eV electrons as a function of diffraction angle.
214 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
Fig. 4.16 Polar plots of the diffraction pattern of electrons by Nickel crystal for different
accelerating voltages
Fig. 4.17 (i) Diffraction pattern of 54 eV electrons by Ni-crystal. (ii) Crystal plane family and
diffraction geometry
One may also calculate the wavelength of the waves that have produced the
observed maximum using Bragg’s law. While applying Bragg’s law one must
remember that the angle of scattering that the incident electron beam makes with
the crystal plane is 65° and that the separation between planes is 0.91 Å,
It may be observed that there is very good agreement between the de Broglie
wavelength λde Brg (= associated with 54 eV electrons = 1.68 Å) and the wavelength
λBragg (= 1.66 Å) obtained using Bragg’s law. It clearly proves that a beam of electrons
behaves both as a beam of material particles as well as a beam of waves of wavelength
given by de Broglie formula.
Davisson and Germer experiments do provide a direct verification of de Broglie
hypothesis of the wave nature of moving bodies. Soon after the publication of the
results from Davisson–Germer experiment many more and detailed experiments
were performed all of which confirmed the existence of matter waves. The electron
diffraction was studied by G. P. Thomson using X-ray powder diffraction method,
where an accelerated fine beam of electrons was made to hit normally on to the thin
metallic foil. The other side of the foil was exposed to a photographic plate. As the
electron beam passes through the foil, electrons of the incident beam get diffracted
by the grating like crystal structure in the foil forming a diffraction pattern on the
photographic plate. The diffraction pattern of bright co-centric circular rings around
a central spot got reviled when the photographic plated was developed. In order
to test that the diffraction pattern formed on the photographic plate is due to the
diffraction of electrons of the incident beam, a magnetic field was applied between
the source of electrons and the metallic foil. As expected, the diffraction pattern got
disturbed by the magnetic field confirming that the diffraction pattern is truly due
to the diffraction of de Broglie waves associated with accelerated electrons. Since
then many instruments particularly electron microscopes that use de Broglie waves
associated with accelerated electrons have been constructed and are in wide use.
Since the resolving power of a microscope depends on the wavelength of the waves
used in the instrument, de Broglie waves of very small wavelength may be produced
using high-energy electrons. De Broglie waves of very small wavelengths associated
with high-energy electrons are used in electron microscopes.
It might interest you to know that the postulate of matter waves proposed by de
Broglie was a part of his Ph.D. thesis.
Confirmation of waves associated with particles through several experiments
involving diffraction and interference of particle waves put de Broglie’s theory on
firm footing. However, two big questions regarding the matter waves also spring
up in the background of dual nature of matter. One big question was: what is the
velocity of the matter waves? And the second question was: what (material/field/
function) makes de Broglie waves? For example, in case of waves in a pond or sea,
it is water that moves up and down making the wave, in a wave on a string, different
segments of the string moves to generate wave and in electromagnetic waves (light)
it is the intensity of electric and magnetic fields that varies with time and generate the
216 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
electromagnetic wave; so the question is: time variation of which quantity constitutes
matter waves?
(a) Velocity of de Broglie wave
Let us address the first question: what is the speed of de Broglie (or matter) waves.
It may be recalled that in order to explain phenomenon like photoelectric effect and
Compton scattering it was necessary to assume that electromagnetic (EM) waves
(light and gamma rays) have an associated particle, called photon. The energy E phot
of photon, the momentum pphot of photon and the wavelength/frequency νphot of
electromagnetic wave are related according to the following relations:
h E phot hνphot c
E phot = hνphot ; λphot = ; pphot = = ; νphot = ,
pphot c c λphot (4.22)
and νphot λphot = c
Here, ‘c’ is the velocity of light in vacuum and ‘h’ is Planck’s constant.
De Broglie argued that if waves have a particle associated with them, then on the
basis of symmetry, moving material objects must also have associated waves, which
he called matter waves. De Broglie assumed that relations corresponding to the set
of relations given by Eq. (4.22) may also be written for matter waves.
Energy of the particle that carry matter waves E matt = hνmatt (4.23)
mc2
νmatt = (4.25)
h
Also the wavelength λmatt of matter waves may be given as
h h
λmatt = =⎛ ⎞ (4.26)
mv
⎝ / 1
⎠m 0 v
2
1− vc2
In analogy to the expression for the velocity of EM wave c = λphot νphot , one may
write the velocity of matter waves as
h mc2 c2
Vmatt = λmatt · νmatt = · = (4.27)
mv h v
4.3 Dual Nature of Matter 217
In Eq. (4.27), Vmatt represents the velocity of the matter wave, while ν is the
velocity of the particle of rest mass m0 . Now there is a contradiction. According to
the theory of relativity, no material particle can move with the velocity of light (c),
and therefore, the velocity v of the particle must be small than
the velocity of light,
c2
i.e. ν < c, and hence the velocity of the matter wave Vmatt = v may exceed the
velocity of light.
This puts a big question mark; what does this mean? The answer is that we have
to re-visit the dynamics of wave propagation.
In general, two velocities may be associated with a wave; they are: (i) the phase
velocity and (ii) the group velocity.
(i) Phase velocity
To understand the concept of phase velocity, let us consider a string or a wire held
fixed at two points in the x-direction. Suppose the wire is plugged in the y-direction
so that it vibrates in y-direction. The displacement ‘y’ of any point on wire at time
‘t’ may be given by,
x
y = A cos 2π ν t − (4.28)
Vphase
Here ν is the frequency and Vphase the wave speed, i.e. the speed with which the
wave travels down the wire. As a matter of fact Vphase is the speed with which the
displacement y travels along the wire in x-direction. Pick up any particular displace-
ment (in vertical direction) y1 at point X 1 of the wire at instant ‘t 1 ’. At the next instant
t 2 , the same displacement y1 will move to another location x 2 of the wire, and next at
instant t 3 , displacement y1 will reach at point x 3 on the wire. In this way the location
of displacement y1 is moving along the length of the wire in the x-direction. No part
of wire is moving in x-direction. Wire is moving (vibrating) in Y-direction. It may
thus be observed that no material particle moves along the x-direction, while the
location of a given displacement moves along the x-direction with speed Vphase , as
shown in Fig. 4.18. This speed, with which the phase of the wave travels, is called the
phase velocity of the wave. Since no material particle travels with the phase velocity,
the phase velocity may have a value equal or even greater than the velocity of light.
2
V matt = cv given by Eq. (4.27) represents the phase velocity of the matter
wave. Further, in matter waves no matter/particles are vibrating in y or in any
other direction.
(b) Group velocity
Equation (4.28) represents a progressive wave,i.e. a wave which is moving in x-
direction with velocity V phase . A negative sign in t − Vphase tells that the disturbance
x
is moving in +ve X-direction, while a positive sign in t + Vphase x
means that the
wave is moving in negative X-direction. It is often more useful to write Eq. (4.28) in
a slightly different form as given below,
218 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
x
y = A cos 2π ν t −
Vphase
Or
2π νx
y = A cos 2π νt − (4.29)
Vphase
where
4.3 Dual Nature of Matter 219
2π mc2 2π m 0 c2
ω = 2π ν = = √ (4.32)
h h 1 − v2 /c2
And
2π 2π mν 2π m 0 v
k= = = √ (4.33)
λ h h 1 − v2 /c2
ω 2π ν
= 2π = νλ = the velocity of the wave (4.34)
k λ
The two waves will interfere and the resultant wave may be represented as:
Since Δω and Δk are much small compared to 2ω and 2k, respectively, one may
take
2ω + Δω ≈ 2ω and 2k + Δk ≈ 2k.
The magnitude of the group velocity may be obtained using Eqs. (4.32) and (4.33).
From Eq. (4.32)
2π m 0 c2
ω= √
h 1 − v2 /c2
Therefore,
dω 2π mv
= 3/2 (4.39)
dv 2
h 1 − vc2
2π m 0 v
k= √
h 1 − v2 /c2
dk 2π m
= 3/2 (4.40)
dv 2
h 1 − vc2
It follows from Eqs. (4.39) and (4.40) that the group velocity
dω dω dk 2π mv 2π m
Vgroup = = / = 3/2
/ 3/2 = v (4.41)
dk dv dv 2 2
h 1 − vc2 h 1 − vc2
Thus, the group velocity (velocity with which the wave packet or the envelope
of the modulated wave travels) comes out to be equal to the velocity of the particle
associated with the matter waves, while the phase velocity of matter waves ωk = cv .
2
Rutherford in 1911 carried out some ingenuous and revealing experiments in which
thin gold foils were bombarded by energetic alpha particles. Scattering of incident
alpha particles by scattering angles as large as 180° established that there is a body
at the centre of each atom where total positive charge and more than 99% mass of
the atom are contained. The central body was named ‘Nucleus of the atom’, term
nucleus being borrowed from biology. Earlier, J. J. Thomson has already discovered
electron in 1897 and experiments with cathode-ray tube have proved that electron
is an essential constituent of all atoms. Soon after the discovery of atomic nucleus,
several theories for the structure of nuclear atom were proposed. The most convincing
model for atomic structure was the planetary model where it is assumed that electrons
in an atom revolve round the nucleus in circular orbits at different distances from the
4.4 Some Examples of the Failures of Classical Approach and Success … 223
nucleus, like planets revolve round the sun in solar system. The planetary model of the
atom was readily accepted because of its simplicity and compelling similarities with
the planetary system. For example, Coulomb force of attraction between the electron
and the nucleus may well be compared to the gravitational force of attraction between
the sun and the planet and that both these forces have almost similar dependence on
distance. Planetary model of the atom was also attractive as philosophically it was in
confirmation to the adage that big solar system is just an upscale of atomic system. At
the first sight planetary model of the atom appeared to be more stable than the solar
system, because in case of the solar system planets move in outer space where there is
matter of very low density, planets therefore experience a force of drag which reduces
the orbit of the planet and in long run every planet is expected to fall down into the
sun. No such drag was expected in case of the atomic electrons as the planetary
model assumes perfect vacuum around the nucleus where electrons revolve.
The stability of the planetary model of the atom which was based on classical
laws of physics (Newton’s laws of motion and Coulomb’s law) was questioned by
the classical theory of electromagnetism put forward by Maxwell (1862) in the form
of four equations. Maxwell’s theory says that a charge at rest has an electric field
around it which is strongly coupled to the charge; a charge moving with uniform
speed carry both the electric and the magnetic fields strongly attached to the charge
in uniform motion. However, if the charge is accelerated, a part of the electric and
magnetic fields which were strongly attached to the charge get detached and move
out in space with the velocity of light. Thus an accelerated electric charge according
to the classical theory of electromagnetism will radiate electromagnetic field and
loses energy. Although no force of drag is faced by the electrons in the atom, but
because electrons are assumed to have been moving in circular orbits, their direction
of motion changing at each instant, they are in accelerated motion and must radiate
energy. If so, the planetary atom which is based on classical physics is unstable from
the same classical approach. It is estimated that all electrons of an average atom will
spiral back into the nucleus within 10−8 s.
Further, the dying atom should emit electromagnetic (EM) waves of all frequencies
as electrons in different orbits will lose energy at different rates. In summery it may
be said that planetary atom which was the only possible model of the atom is (i)
unstable from the view point of classical physics and that (ii) while dyeing atom
must emit EM radiations of all frequencies.
Experimental facts, however, contradict both the above-mentioned predictions
of the classical approach. Atoms are in general stable, and when they emit EM
radiations, the radiations are not of all frequencies, and atoms on de-excitation emit
discrete EM radiations of fixed energies. As a matter of fact that the emission spectra
of atoms of each element of the periodic table is characteristic of the atom or the
element, it is like the signature or the thumb impression of the atom/element.
Several attempts, within the classical framework, have been made to explain
discrete atomic spectra by making different assumptions about the motion of electrons
in the atom, but none has been successful.
Figure 4.20 shows a representative line spectrum of sodium atom. As may be
observed in this figure, the spectrum has several lines in the ultraviolet region which
224 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
lie in the invisible part of the EM spectrum; however, the two prominent lines called
D1 and D2 lines of yellow colour (wave lengths 589.0 and 589.6 nm) are characteristic
of sodium. It is the light from these lines which is used in sodium vapour lamps. There
are few other lines in the sodium spectrum that lie in the region of infrared and are
not shown in the figure.
Sodium lamp Sodium vapour lamps are frequently used for street lighting and can
be easily identified by their signature yellow light. Excited sodium atoms in visible
region emit yellow light of two wavelengths 589.0 and 589.6 nm. Sodium lamp has
the advantage that they are very efficient; almost 80% of the electrical energy given
to the lamp is converted into visible yellow light. Further, the lumen output of the
lamp does not drop with age and the light of yellow colour emitted by the lamp is
the colour to which human eye is most sensitive.
Problems associated with the classical planetary model of atom were satisfactorily
addressed by the quantum mechanical model of Schrodinger which is discussed in
Chap. 5 of the book. In quantum mechanical model it is shown that electrons in an
atom are placed in different energy states defined by principal quantum number n,
orbital quantum number l and magnetic quantum number m.
The discovery of photoelectric effect is a story in itself. It is difficult to give the credit
of discovering photoelectric effect to one person alone. First signatures of photoelec-
tric effect appeared in an experiment carried out by German scientist Heinrich Rudolf
Hertz in 1887 to identify electromagnetic waves. Electromagnetic waves, predicted
by Maxwell in 1865, were a hot topic at that time. Hertz in an experiment tried to
generate EM waves by producing a spark between two electrodes which were kept
at a small distance from each other and were maintained at high potential difference.
He observed that production of discharge becomes easier when the cathode was illu-
minated with ultraviolet light. He concluded that ultraviolet light falling on metallic
cathode emits some radiations from the cathode that ionises the gas between the elec-
trodes making it easier to conduct the spark. The next year, in 1888 another German
scientist Wilhelm Hallwachs repeated Hertz experiment with a simple geometry. He
4.4 Some Examples of the Failures of Classical Approach and Success … 225
took a clean circular plate of Zinc and mounted it on an insulating stand. The Zinc
plate was attached by a conducting wire to a gold leaf electroscope. The electro-
scope was then given negative charger. In normal conditions the electroscope lost
its negative charge very slowly. However, when the Zinc plate was illuminated with
ultraviolet light, the electroscope lost its charge very fast. On the other hand if the
electroscope was charged positively, there was no quick leakage of positive charge
even when the Zinc plate was illuminated with ultraviolet light.
The picture remained unclear till 1899 when Thomson conclusively proved
that ultraviolet light falling on metallic cathode/Zinc plate causes electrons to be
emitted from the target. The process of emission of electrons from metallic surfaces
when illuminated with light is called photoelectric effect and the electrons the
photoelectrons.
Philipp Eduard Anton Lenard, who earlier worked as assistant to Hertz, studied
in details the properties of electrons emitted from metallic bodies when illuminated
by ultraviolet and other lights.
A typical experimental arrangement to study the photoelectric effect is shown in
Fig. 4.21. Here, a photosensitive plate C is placed opposite to the metallic plate A
inside an evacuated glass tube. A potentiometer setup is used to apply desired value
of potential difference across plates C and A. The evacuated glass tube has a quartz
window through which EM radiations of desired frequency and intensity from source
S may be made to fall on the photosensitive plate C. Quartz crystal is transparent
to most parts of the EM spectrum, and therefore, quartz windows are used to allow
electromagnetic radiations of wide wavelength range to enter the tube without any
substantial absorption, and ordinary glass on the other hand absorbs most of the
ultraviolet part of spectrum. Often, plate C is also called the cathode and plate A as
anode as they are generally kept, respectively, at negative and positive potentials.
The working of the experimental setup may be understood as follows. When light
from source S, of a given frequency and intensity, is made to fall on the cathode plate
C, photoelectrons with some kinetic energy (or speed) are emitted from the plate.
Now if anode plate A is given a positive potential +V with respect to the cathode
C, emitted electrons are attracted by plate A and are collected on it. Photoelectrons
picked up by the anode plate A flow through the external circuit constituting the
photoelectric current. The current, which may be in the range of few milli- to few
microamperes depending on the frequency and the intensity of the incident EM
radiations, may be recorded by the ammeter in the circuit. On the other hand if the
polarity of the applied potential is reversed, i.e. plate C is given a positive potential
with respect to plate A, photoelectrons emitted from plate C will be repelled by
the potential at plate A, as a result the photoelectric current in the circuit will get
reduced. Magnitude of potential V and its polarity may be easily controlled by the
potentiometer and commutator combination shown in the figure. Under the reverse
voltage condition when plate C is at positive potential and plate A is at negative
potential, the magnitude of the current in external circuit will get reduced because of
the repulsion of photoelectrons. Experiments using electromagnetic (light) radiations
of different frequencies and intensities were carried out in which current through the
external circuit was recorded for different magnitudes and polarities of the potential
difference V between plates C and A. The main observations of these experiments
were that the magnitude of the measured photoelectric current depends on the (i)
material of surface emitting electrons, (ii) intensity of the radiations incident on plate
C, (iii) potential difference V between the plates, only when plate A is at a negative
potential with respect to C. These observations are point wise further elaborated in
the following.
(a) Dependence of photoelectric current on frequency of incident radiation
In experiments where monochromatic EM radiations of different intensities and
frequencies ν1 , ν2 , ν3 , . . ., etc. were made to illuminate the cathode C and photo-
electric currents in the external circuit were recorded, it was found that
(i) When plate A was at a positive potential +V with respect to plate C, the photo-
electric current changed only with the intensity of the incident radiations and
remained constant when EM radiations of different frequencies but of same
intensities were incident on plate C, for all values of positive potential +V.
Moreover, the photoelectric current was recorded immediately without any
time lag at the instant the radiations hit the cathode.
(ii) When EM radiations of same intensity but of different frequencies were incident
on the cathode, as mentioned earlier, it was found that photoelectric current
remained constant for all values of positive potential (+V ); however, the current
becomes zero when the frequency of the incident radiation was reduced to
some value ν 0 or below this value. The maximum frequency ν 0 at which no
photoelectric current passes through the circuit is called the threshold or cut-
off frequency. This indicates that no photoelectrons are emitted when EM
radiations of cut-off frequency ν 0 or of frequency lower than this are made to
4.4 Some Examples of the Failures of Classical Approach and Success … 227
shine the cathode, no matter what is the intensity or for how long the radiations
are made to hit the cathode C. The magnitude of the threshold frequency ν 0
has been found to be different for different metallic cathode surfaces.
(iii) In the case of reverse voltage setting, when plate A was kept at negative potential
(−V ) with respect to plate C, it was observed that the photoelectric current for
incident radiations of all intensities and frequencies decreased sharply with
the increase in the magnitude of the negative potential of plate A, becoming
zero for a the negative potential (−V s ). Negative potential (−V s ) where the
photoelectric current (for all intensities and frequencies of incident radiations)
becomes zero is called retarding potential or cut-off potential and has been
found to have different values for different metallic surfaces used as cathode
plate C. Further the cut-off potential does not depend on the intensity of the
incident radiations.
Let us now try to understand, within the framework of classical physics, the process
of photoelectric effect. Classically, EM radiations like all other waves carry energy
and are a mode of energy transfer. When EM radiations fall on a metallic plate, they
deposit energy in the plate at a certain rate, rate being proportional to the intensity
and the frequency of the radiations. Amount of energy deposited in the cathode plate
will be proportional to the time for which the plate is exposed to radiations and also
to the intensity and the frequency of the radiations. The cathode plate contains atoms
of the metal which have electrons that are bound to the bulk material of the plate
with some binding energy, say w. This w is called the work function of the metal
and is equal to the amount of energy required to take an electron out of the metal
surface. Obviously, the value of w depends on the metal used for cathode. According
to classical physics, emission of photoelectrons from the cathode plate will happen
only if the energy deposited by the incident radiations is at least equal or more than
the work function w of the material. If classical picture of photoelectric effect is
true, then there should be some time lag between the irradiation and recording of
photoelectric current, particularly for very low-intensity and low-frequency incident
EM radiations. Further, incident radiations of any frequency must be able to eject
photoelectrons, and low-frequency radiations, which deposit energy at a lower rate,
should be able to deposit the required energy w in a longer time of irradiation.
Therefore, radiations of all frequencies must be able to produce photoelectrons and
photoelectric current; the classical approach could not explain why radiations of
frequency less than the threshold frequency could not produce photoelectric current.
Photoelectric current is constituted by the number of photoelectrons collected
per unit time at anode A. On the other hand, the rate at which photoelectrons are
emitted from cathode plate will depend on the rate at which energy is deposited in
cathode plate by incident radiations. If the intensity of incident radiations is high,
more photoelectrons will be emitted per unit time, and hence there will be large
photoelectric current (as observed experimentally). However, according to classical
picture, the amount of energy deposited in cathode will increase with time, and
therefore, the photoelectric current should not remain constant but should increase
with time. This contradicts the experimental observations.
228 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
to C, current decreases (for both intensities) and ultimately becomes zero at the same
value of reverse (or retarding) potential −V s for all values of intensities of incident
radiations.
The above experimental observations may be explained by assuming that the
incident EM radiations of frequency ν (> ν0 ) falling on the surface of plate C emit
photoelectrons of all kinetic energies from zero up to a maximum value E max , where
E max depends on the frequency of the incident radiations. Further, the number of
photoelectrons emitted per unit time is proportional to the intensity of the incident
radiations. In case when plate A is at a positive potential +V with respect to C, all
photoelectrons (of all energies from 0 to E max ) emitted per unit time are collected by
plate A, irrespective of the magnitude of potential V, and the photocurrent remains
constant for all positive values of V. However in case of the reverse potential when
plate A is at a negative potential −V n with respect to plate C, photoelectrons emitted
from plate C get repelled by plate A; at small negative values of V n only some low-
energy photoelectrons are not able to reach A reducing the magnitude of photocurrent,
but when negative potential increases, more energetic photoelectrons are also not able
to reach plate A. Ultimately when V n attains the value −V s , even the most energetic
electrons of energy E max are also cut off and photocurrent becomes zero. For V n =
−V s , one may write:
1
E max = 2
m e vmax = eVs . (4.42)
2
Here m e , e and vmax are respectively the mass, charge and maximum velocity of
emitted photoelectrons.
The above experimental observation that photoelectrons emitted by a certain EM
radiation of frequency ν will have a maximum kinetic energy is not supported by clas-
sical approach. According to classical approach, EM radiations will keep depositing
energy in the target material, and therefore, energy available to photoelectrons in
excess to their work function should also go on increasing with the length of time
the EM wave is kept shining on the plate C.
(d) Dependence of photoelectric current on the frequency of incident light and
on the stopping potential
Figure 4.24 shows the results of an experiment in which EM waves of three different
frequencies ν1 , ν2 and ν3 with (v3 > ν2 > ν1 > v0 the threshold frequency) having
same intensities were incident on plate C, one at a time, and the photoelectric current
for both positive and negative voltage settings between plates C and A was recorded.
Since the magnitude of the photocurrent for any positive voltage on plate A
depends only on the intensity of the incident EM radiations, the straight-line curves
for the three frequencies overlap on each other and are shown by a single straight
line marked as saturation current. However, in case of retarding potential when
plate A is at negative potential with respect to plate C, curves for EM radiations of
three different frequencies cut the X-axis at different cut-off (or retarding) potentials
−V s1 , −V s2 and −V s3 . Since |V s3 | > |V s2 | > |V s1 |, the maximum kinetic energy E max
v3
absorbed and emitted not continuously but in small energy packets which he called
energy quanta. Einstein in 1905 proposed the quantum theory for photoelectric effect,
borrowing the idea of Planck that energy is emitted or absorbed in energy packets
or energy quanta. Einstein assumed that EM radiations are made up of tiny energy
packets, called photons which move with the velocity of light in vacuum. The energy
ν
E pho of a photon of light of frequency ν is given by,
ν
E pho = hν (4.43)
and momentum need to be conserved between these entities. If it is assumed that the
target atom is at rest, the linear momentum pumped by the photon must be shared
between the photoelectron and the residual atom. As such, the residual atom must
recoil in a particular direction to conserve the input linear momentum. Some energy,
say E reco , is consumed in this recoil. It follows from the conservation of energy that:
E elec = E pho − w − E reco (4.44)
Since E reco may have different values for different photoelectrons, the energy
of emitted photoelectrons may differ from each other by the amount oi the recoil
energy, which is very small. Another reason for the difference in the kinetic energies
of photoelectrons is the depth of the atom (which has lost the photoelectron) from the
front surface. When photoelectron is generated deep inside the metallic target, it may
lose some of its kinetic energy in reaching the surface. Thus, differences in the value of
recoil energies and in the energy loss while coming out of the metallic surface produce
distribution in the kinetic energy of emitted photoelectrons. Maximum kinetic energy
is possessed by the photoelectron which is produced just at the front surface of the
target and for which recoil energy is a minimum.
In Fig. 4.26 it is shown that the photoelectric effect is taking place with an electron
of the K-shell. It is because the probability of photoelectric effect with electrons of
inner shells, like the K- or the L-shells of the atom, is a maximum. The reason for
this is the fact that in photoelectric effect conservation of linear momentum demands
that the residual atom (left after the emission of photoelectron) must recoil. This may
happen only when the emitted photoelectron was tightly bound with the atom and
may easily transfer the excess linear momentum to the residual atom. Since K- and
L-shell electrons are most tightly bound with the atom, photoelectric effect is more
likely to take place with these electrons.
Einstein’s quantum mechanical model of photoelectric effect may explain simul-
taneous emission of photoelectrons, without any time lag, with the irradiation of
the target metal surface by EM radiations, if the energy of the incident photon is
more than the work function w of the target metal. A rough estimate of the threshold
frequency ν 0 may be made from the work function w of the target metal as:
w
v0 ≈ (4.45a)
h
Since different metallic surfaces have different values of the work function, the
cut-off or threshold frequency has different value for different materials.
The intensity of incident EM radiations, according to the quantum approach, is
proportional to the number density (number per unit volume) of photons in the beam.
The number of photoelectrons emitted per unit time is also proportional to the number
density of photons in the incident bam, which in turn is proportional to the intensity
of the beam. Therefore, the photoelectric current which is constituted by the emitted
photoelectrons is also proportional to the incident beam intensity as shown in graph
of Fig. 4.23.
It may be remarked that the quantum approach to photoelectric effect given
by Einstein has been able to explain all experimentally observed facts about
photoelectric effect and could address the anomalies posed by the classical approach.
In general one talks about the ionisation energy of an atom, which is the energy
required to take out an electron from the atom when it is in gaseous state. Situation
changes when one considers atoms in a solid, particularly in case of metallic solids.
The structure of a metallic solid may be described in terms of a positive ion lattice
surrounded by a cloud of de-localised electrons. Since in metals, an electron of the
cloud is not bound with an individual atom, the concept of ionisation energy is not
applicable. Instead the concept of work function is used; work function (w) may be
defined as the energy required in taking out one electron (of the electron cloud) to the
surface of the bulk material. In general for metals the work function (w) is smaller
than the ionisation energy (E ioniz ) of the corresponding atom. For example, in case
of copper w = 3.76 eV and E ionz = 7.52 eV and for Silver w = 4.34 eV and E ionz =
8.68 eV.
The residual atom left after the emission of photoelectron is still excited and has an
electron vacancy in one of the inner most shells, like in K- or L-shells. Electrons
from higher shell, like M, N, …, etc., may fill the vacancy of the inner shell. This
transfer of electron from the higher shell to the lower shell is accompanied with the
emission of characteristic lines of the emission spectrum of the atom. For example,
if photoelectron is ejected from K-shell, then an electron say from the M-shell may
move to K-shell and quench the vacancy there, emitting K β X-rays of the atom.
Sometimes it may happen that electron from the M-shell goes to K-shell, but no
4.5 Blackbody Radiations and Their Energy Distribution 235
X-ray is emitted, instate excess energy (that might have gone out in the form of K β
X-ray) is given directly to some outer shell electron (which is loosely bound) and
that electron goes out of the atom. Electrons emitted in this way by the direct transfer
of excess energy are called Auger electrons and the process Auger effect.
Dependence of photoelectric effect on atomic number and energy of photon
The probability of photoelectric effect depends on the energy of incident radiation
and also on the atomic number Z of the target atom and may be represented by the
empirical relation;
Z 4.5
pPhotoelectric ∝ (4.45b)
hν 7/2
It follows from the above expression that the probability of photoelectric effect
is more for atoms of higher atomic number Z (heavier materials) and decreases with
the increasing energy of the incident photon.
SAQ: Why the photoelectric effect is said to be a bound state phenomenon?
Since in this section we will be dealing with thermal radiations, it will be appropriate
to define thermal radiations. It is a common observation that metallic objects when
heated emit electromagnetic radiations in the visible region and that the colour of
the emitted radiations changes with the temperature of the body. In general, not only
metals but all bodies emit EM radiations when they are at a temperature above the
absolute zero. The emitted EM radiations contain waves of many frequencies (or wave
lengths) the distribution of which depends both on the temperature and the material
of the body. The electromagnetic radiations emitted on account of the temperature
of any object are called thermal radiations. Thermal radiations emitted by a perfect
blackbody are termed as blackbody radiations. Thermal radiations emitted by an
object because of its temperature are quite different from the emission line spectra
of atoms or band spectra emitted by excited molecules.
The concept of blackbody and blackbody radiations in thermodynamics has orig-
inated from Kirchhoff’s law of thermal emission, given by German scientist Gustav
Robert Kirchhoff in 1862. There are several ways to express Kirchhoff’s law of
thermal emission. The original law which was in German may be translated in simple
English as: ‘if there is a region of space surrounded on all sides by perfectly insu-
lating boundaries so that no part of thermal radiations may leak through them and
if each part of the boundary is at the same constant temperature T, then the space
surrounded by the boundary is filled with thermal radiations which are characteristic
of temperature T alone’. He named these radiations as blackbody radiations at
temperature T and called the space surrounded by the boundary as the blackbody.
The characteristics of blackbody radiations at temperature T will be same as that of
236 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
Or
νmax
= Constant = 5.879 × 1010 Hz/K (4.46b)
T
It may appear surprising to note that the second form of the law given by
Eq. (4.46b) cannot be obtained by substituting νmax = λmax c
in Eq. (4.46a). The
reason is that the wavelength of maximum emission λmax is not a single wavelength,
but wavelengths lying in the range λmax and (λmax + dλmax ) are all wavelengths of
maximum emission. Similarly, frequency of maximum emission νmax is also not a
single frequency, but all frequencies in the range νmax and (νmax + dνmax ) are all
frequencies of maximum emission. Now, dλmax /= dνmax that means that the wave-
length and the frequency do not change at the same rate; hence, the two constants
are different.
Wien also proposed a law that may give the energy distribution in blackbody
spectrum. Unlike the displacement law, Wien derived his distribution law using the
laws of thermodynamics and Maxwell Boltzmann distribution law for the speed
of gas molecules. Essentially Wien used the concept of adiabatic compression of
blackbody radiations contained in an enclosure to reach a higher temperature. In
his derivation Wien made many approximations and his original derivation is quite
involved and lengthy. Also, in the light of the quantum theory of radiations it is not
of much relevance now. Skipping the derivation, Wien’s distribution law may be
given as,
C −D/λT
E λ dλ = e dλ. (4.47)
λ5
Here, E λ denotes the energy density contained in spectral range λ and (λ + dλ) of
blackbody spectra. C and D are two constants their values for a given temperature T
may be obtained by fitting the experimental spectrum at temperature T as such these
238 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
constants are temperature dependent. Though Expression (4.47) was derived using
the laws of classical thermodynamics and Maxwell Boltzmann distribution, but the
values of constants C and D need to be determined from experimental data, and the
expression is, therefore, semi-empirical.
The total emissive power E of the blackbody at temperature T, which may be
defined as the total energy radiated per unit time, may be obtained by integrating
Eq. (4.47) between the limits λ = 0 to λ = ∞, i.e.
∫∞ ∫∞
C −D/λT
E= E λ dλ = e dλ = σ λ4 (4.48)
λ5
0 0
Wien’s distribution law may explain the shape of observed blackbody spectrum
at a given temperature, only qualitatively. The term λC5 e−D/λT of the distribution
formula may be considered to have two parts (a) λC5 and (b) e−D/λT . For small values
of λ exponential part (b) becomes large and over rides the effect of part (a); as
a result for short wavelengths, the energy density rises almost exponentially. On
longer wavelength side the exponential part (b) becomes very small and the fall in
the energy goes almost as λ−5 .
However several attempts to reproduce quantitatively the experimental energy
distribution curve of blackbody radiations at a given temperature T for the whole
range of wavelengths using Wien’s distribution formula failed; Wien’s distribution
law reproduced the lower wavelength part of the experimental energy distribution
but failed to reproduce the longer wavelength part of experimental distribution curve.
Moreover, assuming a nonzero value for the wavelength λ, if temperature T is set to
the value of infinity (∞) in Wien’s distribution
∫∞ formula, it is observed that the total
energy emitted by the blackbody E = 0 E λ dλ at an infinite temperature remains
finite. This is physically unjustified.
In short, Wien’s distribution formula failed as it has two drawbacks: (i) could
not reproduce the longer wavelength part of the experimental blackbody radiation
distribution and (ii) it predicts a finite value of energy being radiated by a blackbody
4.5 Blackbody Radiations and Their Energy Distribution 239
Strutt John William, Third Baron Rayleigh, a British mathematician better known
as Lord Rayleigh and Sir James Jeans in 1905 proposed another energy distribution
law known as Rayleigh–Jeans law to describe the energy distribution of blackbody
radiations. They argued that a blackbody cavity in thermal equilibrium at temper-
ature T (K) may be considered as if it is a cubical enclosure filled with standing
EM waves of different frequencies. They assumed that the walls of the blackbody
cavity contain some hypothetical oscillators that emit and absorb EM radiations
of different frequencies; the waves emitted by a particular oscillator interfere with
waves impinging on the oscillator producing standing waves. Putting the condition
that standing waves must have nodes at container walls, calculated the number of
modes of vibrations per unit volume dN in frequency range ν to (ν + dν) as:
8π ν 2
dN = dν. (4.49)
c3
In their derivation Rayleigh and Jeans assumed that the frequency ν of oscillators
varies continuously. Next they calculated the average energy per mode of vibration
E ave using the law of equipartition of energy of thermodynamics. The law says that
each degree of freedom has energy 21 kB T (kB being Boltzmann constant), and since
there could be two degrees of freedom: one corresponding to the kinetic energy and
the other of potential energy, the average energy for each vibrating oscillator becomes
E ave = kB T (4.50)
240 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
As such the average energy density of each oscillator of frequency in the range ν
to (ν + dν) may be written as:
8π ν 2 8π ν 2 kB T
E(ν) = E ave = (4.51)
c3 c3
Writing expression (4.51) in terms of wavelength, Rayleigh and Jeans obtained
the following formula for energy density of blackbody radiations as:
8π kB T
E λ dλ = dλ (4.52)
λ4
In Eq. (4.49) kB is Boltzmann constant and T the absolute temperature. It is worth
noting that Rayleigh–Jeans law in comparison to Wien’s distribution law does not
involve any new/unknown constants.
Karl Ernest Ludwig Marx Planck better known as Marx Planck in 1900 put forward
the quantum theory for the energy density distribution of blackbody radiations. Like
Rayleigh–Jean, he also assumed that the blackbody cavity is filled with electromag-
netic waves which are continuously emitted and absorbed by some sort of oscillators
giving rise to the formation of standing waves. Using the condition that these standing
4.6 Quantum Theory of Blackbody Radiations 241
electromagnetic waves must have nodes at the boundaries of the cubical enclosure,
like Rayleigh and Jeans, Planck also obtained the same expression for the average
energy density of each oscillator of frequency in the range ν to (ν + dν) which may
be written as:
8π ν 2
E(ν) = E ave (4.53)
c3
At this stage Planck made a drastic assumption that oscillator cannot have contin-
uously variable energies; he said that oscillators may have only energies in integer
multiples of the quantity hν, where h is Planck’s constant. This assumption means
that there may be oscillators of energies, hν, 2hν, 3hν . . . nhν, where n is a positive
integer. Oscillators with energies 21 hν or 34 hν etc. were not possible. Next he calcu-
lated the probability p(n) of the mode with energy E n = nhν in thermal equilibrium,
using Boltzmann distribution law,
− kEnT
e B
p(n) = (4.54)
∑∞ − kEnT
n=0 e
B
n=∞
exp(−E n )/kT = 1 + x + x 2 + x 3 + · · ·
n=0
1
(1/(1 − x)) = (4.56)
(1 − exp(−E/kT ))
Also,
n=∞
n E exp(−n E/kT ) = E x + 2x 2 + 3x 3 + 4x 4 + · · · (4.57)
n=0
d
xE 1 + x + x2 + x3 + · · ·
dx
242 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
E
d 1 Ex x exp kT
xE = = E 2 (4.58)
dx 1−x (1 − x)2 1 − exp kT
E exp(−E/kT ) E
E ave = = (4.59)
{1 − exp(−E/kT )} {exp(E/kT ) − 1}
hν
E ave = hν . (4.60)
e kB T − 1
8π ν 2 8π hv3 1
E(ν) = E ave = hν (4.61)
c3 c3 e kB T − 1
This is the Planck distribution function which reproduces the energy density
distribution of blackbody spectrum for all frequency or wavelength regions.
At the time Planck proposed his radical hypothesis, many scientists could not
believe mainly because Planck could not explain why the energies should be quan-
tized. Initially, his hypothesis explained only the experimental data on blackbody
radiation. It was mentioned that if quantization was observed for a large number of
different phenomena, then quantization would become a law. It was also remarked
that one needs to develop a theory that might explain that law. As things worked out,
Planck’s hypothesis was the starting point from which the modern physics grew and
developed.
Planck’s theory of blackbody radiations assumes that electromagnetic radiations
in the blackbody enclosure may have only discrete energies and the oscillators could
only lose or gain energy in the form of packets, referred to as quanta, of size hν, for
a given oscillator of frequency ν. The energy quanta of electromagnetic radiations
are called photons.
Electromagnetic radiations are produced when charged bodies are either accelerated
or decelerated (for example, the emission of continuous X-rays) and also when
electrons shift from one shell of the atom to another shell (example, characteristic
line spectra). In the case of the atomic line spectra, the energy of emitted photons
4.7 Compton Scattering of Gamma Rays 243
where relativistic variation of mass may become significant, therefore, one must use
m e c2 for the rest mass /
energy of electron and the energy of electron after collision
e
E kin may be given as m 2e c4 + pe2 c2 ; here m e is the rest mass of the electron, pe
the final linear momentum of electron and c the velocity of light.
Energy conservation
/
hνi + m e c2 = hνf + m 2e c4 + pe2 c2 (4.62)
hνi hνf
= pe cos φ + cos θ (4.63)
c c
Conservation of the Y-component of linear momentum
hνf
0 = pe sin φ + sin θ (4.64)
c
Equations (4.62), (4.63) and (4.64) may be solved to get Compton equation.
h
λf − λi = Δλ = (1 − cos θ ) (4.65)
mec
Compton equation (4.65) tells that Compton shift in the wavelength Δλ can have a
minimum value of zero, when incident photon passes on along the incident direction
without getting scattered by electron, and the magnitude of wavelength shift increases
with the angle of scattering θ, attaining a maximum value (2h/me c) for backscattering
(θ = 180°) of photon.
In his original experiments, Compton bombarded carbon target with high-energy
X-ray photons and recorded the scattered photon of lower energy. Compton could
explain the experimental data assuming the particle nature of photon and inelastic
4.7 Compton Scattering of Gamma Rays 245
scattering of incident photon by stationary electron. It was the time when the particle
aspect of photon suggested by photoelectric effect was still being debated, Compton’s
analysis of his experiments gave a clear and independent evidence of particle-like
behaviour of electromagnetic radiations.
The quantity mhe c is called the Compton wavelength of electron. In general the
Compton wavelength of a particle of rest mass m0 is given as mh0 c . Physical signif-
icance of Compton wavelength may be derived from the de Broglie wavelength
associated with a particle, according to which a particle of rest mass m0 moving with
velocity v has an associated de Broglie wavelength λde-bro = mh0 v . Since the velocity
of a moving particle cannot exceed the velocity of light c, the minimum value of
the de Broglie wavelength may occur when the velocity of the particle is taken as c.
Putting v = c, gives the de Broglie wavelength as mh0 c , which is the Compton wave-
length of the particle. Since wavelength associated with a particle is a measure of the
uncertainty in the position of the particle, a particle cannot be confined in a space
smaller or equal to its Compton wavelength. For example, Compton wavelength of
6.626×10−34 J s −12
electron = mhe c = 9.1×10 −31 kg×3.0×108 m = 2.427 × 10 m is around 2.4 × 10−12 m,
and therefore, an electron cannot be confined in a space equal or shorter than this.
Since the size (radius) of an average nucleus is of the order of 10−14 m, two order of
magnitudes is smaller than the Compton wavelength of electron: electron cannot be
confined within the nucleus and cannot be a constituent of nucleus.
h
λf − λi = Δλ = (1 − cos θ )
Mato c
M ato in the above expression is the mass of the atom as a whole, and Mhato c is
Compton wavelength of the atom. Since mass of the atom is very large, its Compton
wavelength is very small; hence, change in wavelength is undetectable. Further, the
target atom remains intact; no electron is ejected by the incident photon.
246 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
Photons, depending on their energy and the atomic number of the target atom, may
interact with the atom, with the bound and the free electrons of the atom and with
the nucleus of the target atom.
Low-energy photons mostly interact with the bound electrons (K-shell or L-shell
electrons) producing photoelectric effect. With the increase of energy the probability
of photon interaction with loosely bound electrons of the atom increases, resulting
in Compton scattering. In case the energy of the photon is larger than 1.02 MeV,
it may annihilate producing an electron and positron pair: the process called pair
production. The minimum energy of photon required to produce an electron positron
pair is 1.02 MeV which is the sum of the rest mass energies of the electron and positron
pair (0.51 + 0.51 = 1.02). Pair production takes place in the field of a nucleus,
when a high-energy photon (E phot > 1.02 MeV) passes through the nuclear field.
Nuclear field facilitates the recoil of the nucleus that is required for the conservation
of momentum in pair production process. High-energy photons may also excite or
disintegrate atomic nucleus. Very high-energy photons, with energies > 150 MeV,
may create mesons.
Figure 4.30 shows the variation with photon energy of the probability for photo-
electric effect, Compton scattering and pair production in lead (Pb). Kink (towards
the top) in the curve for photoelectric effect, called k-edge, shows that probability
for photoelectric effect suddenly increases for photon energy corresponding to the
binding energy of the K-shell electrons. Similar but less pronounced edges (not
shown in the figure) also appear for L and M-shells.
SAQ: What may be the order of magnitude for Compton wavelength of a neutron?
For the explanation and recording of the Compton effect, Compton was awarded a
share of the Nobel Prize in physics in the year 1927. Not only the Compton effect
represented the particle nature of light, but it is also important from the application
point of view. The Compton scattering is of importance in material science where
it is being used to get information regarding wavefunction of electrons in matter. It
is also of importance in the field of radiobiology and radiation therapy. Compton
scattering also has applications in X-ray astronomy and in getting signature of black
hole.
SAQ: In nature there are no free electrons, then why the Compton scattering is said
to take place with free electrons?
Specific heat is defined as the amount of heat required to change the temperature of
unit mass of a substance by unit degree temperature. If ‘m’ kg of a substance is given
a heat energy of amount ΔQ which rises the temperature of the substance by Δθ,
then the specific heat of the substance is
1 ΔQ ΔQ
Specific heat = and the heat capacity =
m Δθ Δθ
Molar or atomic specific heat It is defined as the quantity of heat energy required
to raise the temperature of 1 kg mol or 1 kg atom of any substance by unit degree. It
is obvious that molecular or atomic specific heats are the products of the molecular
weight or the atomic weight with the specific heat of the substance. Molecular or
atomic specific heat of solids is generally denoted by C v . [In case of gases there
may be two types of molecular/atomic specific heats: at constant volume C v and at
constant pressure C p .] In the following discussion the term atomic specific heat will
be used which will also mean molar specific heat in case the solid is a compound
and not an element.
French chemist Pierre Louis Dulong and French physicist Alexis Therese Petit in
1819 on the basis of their observation of atomic specific heat for large number of
solids gave an empirical law which states that ‘gram-atomic heat capacity (atomic
specific heat) of an element is a constant: that is, it is same for all solid elements, about
6 cal per g atom per °C and it is independent of temperature’. More than 60 elements
248 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
in solid state are found to have their atomic specific heat in the range of 5.38–6.93 cal/
g atom/°C with an average of 6.15. However, the gram-atomic specific heat for some
light elements like Silicon (gram-atomic specific heat 4.95) and diamond with gram-
atomic specific heat of 1.46 cal/g atom/°C does not follow Dulong–Petit law. Further,
it is found that the gram-atomic specific heat of solids depends on temperature and
approaches zero at absolute zero of temperature.
The MKS unit for atomic specific heat is J/kg atom/K and 1 J/kg atom/K = 4.2 ×
103 cal/kg atom/K. Therefore, the average value of atomic specific heat of 6.16 cal/
g atom/°C observed by Dulong–Petit is equal to 25.67 × 103 J/kg atom/K.
Dulong–Petit law may be derived assuming that the classical law of equipartition
of energy of thermodynamics holds good. At absolute zero solids have a crystalline
structure in which atoms or molecules of the solid are at rest being held at their
place by mutual interaction. When energy in the form of heat is supplied to the
solid, the atoms or the molecules start vibrating around their mean position. If the
temperature is not very high, the vibratory motion has six degrees of freedom: three of
translatory motion (associated with kinetic energy) and three of vibrational motion
(associated with potential energy). Now, according to the law of equipartition of
energy, 21 kB T of energy is associated with each degree of freedom, and hence, the
total energy u associated with each atom (or molecule) at temperature T (K) is
u = 6 × 21 kB T = 3kB T (J). If AV denotes the Avogadro number, that is the number
of atoms/molecules in one kilo atom (or mole) of the solid, then the energy possessed
by 1 kilo atom of the substance is
U = AV u = 3kB AV T (4.66)
U = 3RT (4.67)
dU
CV = = 3R (4.68)
dT
4.9 Quantum Approach to Atomic Specific Heat of Solids 249
Equation (4.68) says that the atomic or molecular specific heat for all solids has
a fixed value of 3R (≈ 6.8 cal/g atom/°C) and is independent of temperature. This
confirms Dulong–Petit law.
Initially Einstein in 1905 used the concept that a solid contains quantum harmonic
oscillators all having same energy to derive atomic specific heat of solids. Einstein’s
formulation correctly predicted the temperature dependence of atomic specific heat,
but quantitative agreement with experimental values was poor. Later, in 1912 Peter
Debye modified the concept of atomic oscillators all of same energy and included
oscillators with different values of quantized energies. Debye’s theory correctly
predicted both the temperature dependence and the magnitudes of atomic specific
heats of solids. Essentials of both Einstein and Debye theories are discussed in the
following.
hν
∈ = hν (4.69)
e kB T − 1
Above expression for average energy of an oscillator has been derived earlier (see
Eq. 4.60).
Since each atom has three degrees of freedom, energy associated with each atom
u becomes
3hν
u = hν (4.70)
e kB T − 1
3AV hν
U = AV u = hν (4.71)
e kB T − 1
4.9 Quantum Approach to Atomic Specific Heat of Solids 251
= 3R hν 2 (4.72)
kB T
e kB T − 1
The quantity hν kB
has the dimensions of temperature and is called Einstein’s
temperature which is denoted by θE . Einstein’s temperature θE has a different value
for each solid.
Equation
(4.72) may be written in terms of Einstein’s temperature by
substituting kB = θE to get,
hν
2 θE
θE e T
Cv = 3R θ 2 (4.73)
T
eT −1
E
θE
θ
Substituting e T ≈ 1 and e T − 1 ≈ θTE in Eq. (4.73), one gets,
E
252 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
2 θE 2
θE e T
θE 1
Cv = 3R θ 2 = 3R θE 2 = 3R
T T
eT −1
E
T
It may be observed that for temperatures T much higher than Einstein temperature
θE , atomic specific heat of solids approaches the Dulong–Petit value 3R.
(b) For the case of low temperatures when θTE ≫ 1
θE θE θE θE
e T −1 ≈e T as e ≫ 1 if
T ≫1
T
Or
2
θE 1
Cv = 3R
θE
(4.74)
T
e T
2
In Eq. (4.74) on the right-hand side, there are two factors θTE and 1θE .
e T
θE 2
Factor T increases with the decrease of temperature T; however, the other
factor 1θE decreases exponentially with the decrease of temperature T. Rate of
T
e
decrease with temperature of the second factor is much faster as compared to the
rate of increase (with the decrease of temperature) of the first factor. As a result
the second factor becomes zero for very low temperatures earlier than the first factor
becomes infinite, hence, the atomic specific heat of solids approaches zero at absolute
temperature T approaches zero.
Though Einstein’s theory for specific heat of solids predicts that the atomic specific
heat for all solids should approach Dulong–Petit value of 3R at high temperatures
and it should approach zero at 0 K temperature, but it could not reproduce the
4.9 Quantum Approach to Atomic Specific Heat of Solids 253
experimental values of specific heats for most of the solids. This theory suffers from
the following drawbacks:
(i) Does not reproduce the experimental values of specific heats for most solids.
(ii) Einstein temperature ϑE and Einstein frequency νE have no physical justifica-
tion; they could not be associated with any property of the solid, like its elastic
constants or melting point, etc.
Einstein in his theory for specific heat of solids assumed that on receiving heat energy,
each atom of the solid vibrates with the same frequency which is quantized. Debye,
on the other hand, assumed that on heating, the solid as a whole, i.e. the crystal
lattices in the solid, undergoes collective vibrations. Lattice vibrations are assumed
to be quantized. The quanta that represent lattice vibration are called PHONON.
Phonon is the counter part of photon which is the quanta of EM waves. In case of the
blackbody radiations it was assumed that the blackbody cavity is filled with photons
of different quantized frequencies; similarly, Debye assumed that on heating a solid
it gets filled with phonons of different frequencies that have quantized energies.
Often, it is said that a solid at some temperature above absolute zero is filled with
a phonon gas. In case of blackbody radiations it was assumed that photons of all
frequencies are present in the blackbody cavity. However, in case of vibrations in
a solids phonons of all frequencies are not present, and phonon frequency is bound
by the medium of its propagation which is the atomic lattice of the solid. So there
is an upper limit on the frequency of phonon in the solid that depends on the elastic
constants and the crystal structure of the material. Debye also assumed that phonon
waves, that are elastic waves, travel with some finite speed in the solid medium like
sound waves. On account of their interference, standing phonon waves are formed
in the solid. There may be three types of standing waves in the solid, longitudinal
waves of velocity C L and two types of transverse waves with two different states of
polarisations with speed C T .
Before proceeding further, let us point out the basic difference in Classical theory,
Einstein’s theory and Debye theory of specific heat of solids.
(i) Both the classical (Dulong–Petit) and Einstein’s models treat each atom of the
solid independently.
(ii) Debye model is more realistic, since it considers collective vibrations of many
atoms. As a matter of fact if one atom of the solid vibrates, the neighbouring
atoms are also set in vibratory motion.
(iii) Einstein’s model assumes that all atoms vibrate with the same frequency. Debye
model, on the other hand, considers the collective vibrations of different groups
of atoms. A group with large number of atoms can vibrate with lower frequency
while the group with fewer atoms may vibrate with higher frequency. Therefore,
the solid will contain vibrating groups of atoms with different frequencies, and
254 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
the frequencies are being quantized. Vibrating groups of atoms fill the solid
medium with elastic waves (sound waves) which is being confined within the
boundaries of the solid get reflected at boundaries and form standing waves. The
wavelength of an elastic wave depends on the number of atoms in the vibrating
group. The minimum wavelength λmin will correspond to the vibration of only
few atoms and will be related to the lattice constant of the crystalline structure.
It is obvious that the group of atoms that vibrates with minimum wavelength
will have largest value of the frequency of vibrations νmax . Further, if υ is the
speed of the elastic wave in the medium, then υ = λmin νmax . Here νmax is
the frequency of the wave that has the minimum wavelength. It follows from
here that for a given material there will be standing waves of several quantised
frequencies up to the maximum frequency νmax . Also, the νmax will depend
both on the crystal structure (lattice constants) and the speed of the elastic
waves in the medium. It is known that that elastic waves may be of two types:
longitudinal that may travel with some speed say C 1 and transverse that may
travel with a different speed, say C 2 in the medium. Further, transverse waves
may have two different states of polarisations; therefore, there may be three
different types of elastic waves of each frequency ν.
(iv) In Einstein model the energy E of the system is given by
In Einstein model the number of modes is taken equal to the number of atom in
a kilomole (= Avogadro number AV ).
Modes essentially mean the number of standing waves in the volume of the solid.
Since the number of standing waves is quite large, one calculates the number of
standing waves g(ν)dν in a small frequency interval ν and (ν + dε) and integrates it
from zero to νmax to get the total numbers of modes.
It can be shown that the number of modes g(ν)dν for frequency range ν and
(ν + dν) for kg atom of the solid is given as:
9AV ν 2
g(ν)dν = (4.76)
νmax
3
Further, there are large number of phonons with different energies, and the average
energy Eave associated with each mode of vibration may be calculated using quantum
mechanical Maxwell Boltzmann statistics and is given as,
4.9 Quantum Approach to Atomic Specific Heat of Solids 255
hν
Eave = hν
(4.77)
e kB T
−1
The total energy of the system may be obtained by integrating the above expression
over frequency ν
⎛ ⎞
∫νmax ∫νmax 2 ∫νmax
⎝ hν hν 9A ν 9A hν 3
⎠
V V
E= E ν dν = dν = 3 hν dν
e kB T
− 1 νmax
3 νmax e kB T
− 1
0 0 0
(4.79)
⎡ ⎤
⎢ 9A ∫ νmax ⎥
d⎢ ⎞ dν ⎥
3
V
⎣ νmax
3 0
⎛ hν
hν ⎦ hν −1
⎝e B −1⎠
k T
∫ νmax ∂ e kB T −1
But CV = dE
dT
= dT
= 9AV
νmax
3 0 hν 3 ∂T
dν
hν
−1
∫ νmax ∂ e kB T −1 ∫ νmax hν
hνe kB T
Or CV = 9AV
νmax
3 0 hν 3 ∂T
= 9AV
νmax
3 0 hν 3 hν 2 dν.
kB T e kB T −1
2
Or
⎡ ⎤
∫νmax hν
⎢ ν e
2
4 kB T
9AV h ⎥
CV = 3 ⎣ hν 2 ⎦dν (4.80)
νmax kB T 2
0 e kB T − 1
hνmax hν h hνmax
= θD (Debye temperature) and x = ; dx = dν, xmax =
kB T kB T kB T kB T
Therefore,
∫xmax kB T 4 x
9AV h 2 x e kB T
CV = 3 h
dx
νmax kB T 2 (ex − 1)2 h
0
256 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
∫xmax ∫xmax
9AV T 3 kB4 x 4 ex 9AV kB T 3 x 4 ex
= 3 dx = 3 dx
νmax h 3 (ex − 1)2 hνmax (ex − 1)2
0 kB 0
Or
3 ∫xmax
T x 4 ex
CV = 9AV kB dx (4.81)
θD (ex − 1)2
0
Temperature dependence of the atomic (or molar) specific heat of solids in case
of Debye theory may be discussed through expression (4.81). The value of C V for
high temperatures T >> θ D and for the case of low temperatures T << θ D is discussed
here.
(a) At high temperatures T: x = hν
kB T
is small
2 3
Therefore ex = 1 + x + x2! + x3! + · · · ≈ 1 + x and (ex − 1) ≈ x and (ex ≈ 1).
Substituting (ex − 1) = x in Eq. (4.81), one gets:
3 ∫xmax 3 ∫xmax 4
T x 4 ex T x ·1
CV = 9AV kB dx ≈ 9AV kB dx
θD (e − 1)
x 2 θD (x)2
0 0
3 ∫xmax 3 3 xmax
T ! 2" T x
CV ≈ 9AV kB x dx = 9AV kB
θD θD 3 0
0
3 3
T θD
= 3AV kB
θD T
Or
Equation (4.82) shows that at high temperatures the molar or atomic specific heat
for all solids approaches the Dulong–Petit value.
∫ x 4 ex
(b) At low temperature, when θTD > 1; 0 max (exx −1) 2 dx ≈ 15 π .
4 2
And
3
T 4 2
CV ≈ 9AV kB π (4.83)
θD 15
4.9 Quantum Approach to Atomic Specific Heat of Solids 257
It follows from Eq. (4.83) that at low temperatures, specific atomic or molar heat
decreases as the third power of the absolute temperature and approaches zero at
absolute zero.
It may be remarked that Debye theory correctly predicts the experimentally
observed behaviour of the atomic or molar specific heat of solids.
One big drawback of Einstein’s theory was that Einstein temperature θE was not
related to any property of the solid. Debye temperature θ D , on the other hand, depends
on νmax which in turn is related to the speed of the elastic wave in the solid medium.
Therefore, Debye temperature depends on elastic constants of the solid. Since the
minimum wavelength λmin or maximum frequency νmax may be correlated with the
lattice parameters of the crystal structure, the lattice constant of the crystal structure
may be derived from Debye temperature. For example the size of the vibrating units
sets a limit on the minimum wavelength since shorter wavelengths do not lead to
new modes. The smallest unit of a crystalline solid is the unit cell. Thus, the unit cell
puts constrain on the minimum wavelength of the vibration as:
a = λmin (4.84)
Solved Examples
2d sin ϑ = nλ
a 3.2 × 10−10
d=√ =√ m
h2 + k2 + l 2 12 + 12 + 12
3.2 × 10−10
d= √ m
3
Solution:
(i) First, we need to calculate the energy of incident radiation as: E = hc
λ
.
h = 6.626 × 10−34 J s
Since we wish to convert the energy in eV, we use the relation 1 eV = 1.60 ×
10−19 J
4.9 Quantum Approach to Atomic Specific Heat of Solids 259
4.97 × 10−19
E= eV
1.6 × 10−19
E = 3.10625 eV
Φ = 3.10625 − 1.2 eV
Φ = 1.90625 eV
(ii) The longest wavelength of emitted photons which may cause emission of elec-
trons from the given metallic surface will be if the kinetic energy of emitted
electron becomes almost zero. In that case one gets:
i.e. Φ = hcλ
gives
Therefore, energy of incident radiation = 1.90625 eV = hc λ
.
Substituting the values of Plancks constant and speed of light, one gets the
wavelength of incident radiation to be nearly equal to 6.93 × 10−9 m.
SE4.4 Calculate the value of Compton wavelength for an electron and compare it
with the size of an average nucleus.
Solution: Compton wavelength = h
m0 c
,
where h = Plancks constant = 6.6 × 10−34 J s
Substituting these values, one gets the Compton wavelength equal to 242 ×
10−14 m.
On the other hand, the size of nucleus is typically ≈ 10−14 m.
SE4.5 Calculate the change in wavelength of 511 keV photons undergoing Compton
scattering at an angle of 60°.
Solution: Energy of photon (E) = 511 keV = hc/λ.
Wavelength of photon may be given by, λ = hc
E
6.6 × 10−34 J s 3.0 × 108 ms−1
λ=
511 keV
260 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
6.6 × 10−34 J s 3.0 × 108 m s−1
λ=
511 × 106 eV 1.6 × 10−19 J
6.6 × 10−34 J 3.0 × 108 m
λ=
817.6 × 10−13 J
19.8 × 10−13 m
λ=
(817.6)
λ = 0.024217 × 10−13 m
λ = 4.4217 × 10−15 m
6.6×10−34 J s
(i) Change in wavelength Δλ = − cos 60◦ )
(9.1×10−31 kg)·(3.0×108 m/s) (1
Δλ = 120.87 × 10−11 m
SE4.6 What is the wavelength of the scattered photon in question SE4.5 above?
λ, = 120.87 × 10−11 m + λ
λ, = 0.012087 × 10−15 + 4.4217 × 10−15 m
λ, = 4.4338 × 10−15 m
SE4.7 Calculate the amount of energy needed to heat the metallic ball of 450 g
from 30 to 80 °C. Given, the specific heat of metallic ball is 0.129 J/g°C.
Solution: Given,
m = 450 g.
c = 0.129 J/g°C.
4.9 Quantum Approach to Atomic Specific Heat of Solids 261
Q = 2904.5 J
Thus, it requires 2904.5 J of energy to heat the metallic ball from 30 to 80 °C.
1. Show that the low-frequency limit of Planck’s Law reduces to the Rayleigh–
Jeans Law.
2. Obtain an expression for the energy of recoiling electron in case of Compton
scattering.
3. What is the maximum energy transferred to an electron in the Compton
scattering?
4. In case of photoelectric effect does the residual atom also recoil? If yes, what
is its effect on the energy of emitted photoelectron?
5. Plot a graph for the intensity of electrons emitted as a function of scattering
angle and explain its various features.
6. What is the angular distribution of electrons emitted in the photoelectric effect?
7. Show that the average forward momentum of electrons emitted in photoelectric
effect is larger than the momentum of incident photon.
8. For higher energy photons the emitted photoelectrons are mostly forward
peaked. Why?
9. State Moseley’s law and discuss its importance.
10. Which experiment conclusively proved wave nature of particles? Very briefly
describe the experiment.
11. Waves carry some disturbance or variation of some parameter with time. Which
parameter varies with time in case of matter waves?
12. What is the principle of working of an electron microscope? Why it has high
magnifying power and resolution?
13. What is the difference between phase and group velocities? Which of them
represents the actual motion of an associated particle?
14. What is (are) main point(s) of difference between Debye and Einstein theories
for atomic specific heat of solids?
15. What is the physical significance of Debye temperature?
16. There are generally two velocities associated with matter waves; what are these
velocities called and what do they represent?
262 4 X-rays, Dual Nature of Matter, Failure of Classical Physics and Success …
17. Give at least one reason why electron cannot be a constituent of nucleus.
m 0 c2
(d) hν
ANS: (c)
LA4.1 Explain, why wave nature of light could not explain the phenomenon of
photoelectric effect.
LA4.2 Differentiate between characteristic and continuous X-rays. Explain on
what factors the intensity of Bremsstrahlung radiations depend.
LA4.3 Explain how one can differentiate between electrons emitted as a result of
Auger and beta decay process.
LA4.4 Calculate the frequency (in Hz) of X-ray emitted, when an atom de-excites
from a level of energy 662 to 300 eV.
LA4.5 Calculate the de Broglie wavelength of electron of energy 500 keV.
LA4.6 Show that the Rayleigh–Jeans law and the Wein’s law are the special cases
of Einstein’s law for blackbody radiation formula.
LA4.7 Calculate the energy of scattered photon undergoing Compton scattering
at 60°.
LA4.8 A body at 47 °C radiates photons. If the Wein’s constant is 4.898×10−3 mK,
what will be the peak of the wavelength radiated?
LA4.9 Show that at very large wavelengths λ, the Planck’s formula for spectral
radiations,
8π hc 1
E(λ) = hc
λ5 exp λkT −1
8π
E(λ) = kT.
λ4
LA4.10 What is meant by the dual nature of matter? With necessary details describe
an experiment that conclusively proved the wave nature associated with a
particle.
LA4.11 What are X-rays, how are they produced? What is Moseley’s law? Give
properties of X-rays and clearly distinguish between continuous and
characteristic X-rays.
LA4.12 What are gamma rays? In what respect they are different from X-
rays? Under what conditions an electromagnetic radiation will produce
photoelectric effect and Compton scattering.
LA4.13 Discuss the phase and group velocities, how do they differ from each other?
4.9 Quantum Approach to Atomic Specific Heat of Solids 265
Objective
An introduction to Schrodinger picture of quantum mechanics is presented in this
chapter. Postulates of quantum mechanics, definitions of operators, operator algebra,
hermitian operators, eigen values and normalised eigen states, etc., are explained in
simple language and with some examples. Application of quantum mechanics has
been explained taking some examples of one-dimensional potentials. It is expected
that after reading this chapter a reader will be able to apply quantum mechanics to
simple cases.
5.1 Introduction
Physics attempts to describe the objects, systems and events of inanimate world and
their time evolution using mathematical language of functions and equations. Further,
the inanimate world is considered to have two components: the matter and the radi-
ations. Matter was assumed to have been made up of particles, while radiations were
considered to have wave nature. Till early twentieth century, behaviour and time
evolution of processes associated with matter were treated using Newton’s laws of
motion and those of radiations using Maxwell’s equations. Newtonian mechanics and
Maxwell equations are the two pillars of classical physics. Classical physics, to a
large extent, successfully explained most of the phenomena of inanimate world partic-
ularly at macroscopic level. However, problems become apparent with the advent of
experimental tools that opened the microscopic world for investigation. It was soon
realised that matter-radiation approach of classical physics is not adequate to explain
phenomenon at microscopic level. The classical theory failed to explain discrete
energy levels of atomic states, why atom is spherical and the wave particle duality,
energy distribution in blackbody radiations, photoelectric effect, Compton scattering,
specific heat of solids, etc., as discussed in Chap. 4. Failure of classical physics,
particularly in explaining physics at the microscopic level, accentuated the necessity
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 267
R. Prasad, Physics and Technology for Engineers,
https://doi.org/10.1007/978-3-031-32084-2_5
268 5 Introduction to Quantum Mechanics
In classical mechanics one defines the initial state of a particle or a system in terms
of its position and linear momentum at some initial time. These two parameters
completely define the state of the particle/system in classical mechanics. Next one
uses Newtonian mechanics to study the time evolution of the particle/system, i.e.
how different observable parameters, like the speed, linear momentum, energy, etc.,
of the particle/system change with time. Similarly, in quantum mechanics the first
5.2 Postulates of Quantum Mechanics 269
thing to do is to define the state of a system at some time say t. The first postulate of
quantum mechanics tells how to define the state of a system.
Postulate 1 The state of a quantum system at a given instant of time, t, is completely
defined by a function, ψ(→r , t), of position →r = {x, y, z} and time t.
Function ψ(→r , t) is called the wavefunction or the state function of the system
and is a complex-valued function that contains information about the position of the
system at time t.
It follows from Eq. (5.1) that the probability P of finding the system in volume V
will be given by
˚ ˚ (⇀ )
P(r, t) = p(r, dV , t)dV = ψ r , t |2 dV (5.2)
If the integral on right-hand side of Eq. (5.2) is taken over the whole universe
that is over all the available space, naturally the probability of finding the system
somewhere in the universe will be 1. Therefore,
˚
+∞
(⇀ )
P(some where in the universe) = ψ r , t |2 dV = 1 (5.3)
−∞
Note: Though a volume integral is represented by three integration signs as has been
done so far but now on volume integrals will also be denoted by a single integration
sign to save space.
270 5 Introduction to Quantum Mechanics
Any function of r and t cannot be a valid wavefunction that represents the quantum
system at time t. A valid wavefunction must have the following properties.
(⇀ )
(i) The function must be single valued. A valid ψ r , t must have only one
value for the given values of r and t.
(ii) It must be continuous in the entire region defined by its independent variables.
(iii) Wavefunction must be finite everywhere, and the function and its derivatives
should vanish at infinity.
˝ (⇀ ) 2
(iv) It must be square integrable. Since ψ r , t | dV gives the probability of
finding the system at point defined by r at time t, and probability may have
some value between zero and 1, the square of the function must be integrable.
A function of (r, t) which does not satisfy even one of the above conditions cannot
be a valid quantum wavefunction of a quantum system.
There are always some measurable quantities that are associated with any system
and either all or some of these measurable quantities may change with time as the
system evolves. Let us take the example of a classical system of a particle of mass M
moving with velocity V which changes with time. In this case the linear momentum,
the angular momentum and the velocity are all measurable quantities that change with
time; therefore, they are dynamical measurable variables associated with the system.
In case of a quantum mechanical system the observables are treated as described by
the second postulate of quantum mechanics given below.
Postulate 2 An observable, A, is represented by a linear and hermitian operator
written as Â. Any operator, say, Â, is a mathematical instruction which when
applied to a mathematical object like the wavefunction ψ(x 1 , x 2 , x 3 . . .) gives
another similar mathematical object of same nature φ(x 1 , x 2 , x 3 . . .). It may be
noted that the object φ depends on same variables as the initial object ψ ; i.e.
both φ and ψ are functions of the same space. In mathematical language it may
be expressed as
Equation (5.5) defines the eigenvalue K of operator M . In words one may say, if an
Δ
K, the constant K is called the eigenvalue of operator M for the given wavefunction.
We shall discuss operators in more details in the following sections.
SAQ: What are the distinguishing characters of three versions of quantum
mechanics put forward, respectively, by Schrodinger, Heisenberg and Dirac?
Time evolution, i.e. how the system changes with time, of a classical system is
governed by Newton’s laws of motion and Maxwell’s equations of electromagnetic
waves. The time evolution of a quantum mechanical system may be described by a
partial differential equation called Schrodinger equation.
Postulate 3 Time evolution of the wavefunction ψ(→r , t) of a quantum mechan-
ical system is governed by the following partial differential equation, called
Schrodinger time-dependent equation.
∂ψ(→r , t) ℏ2 −
→2 (→ )
iℏ =− ∇ ψ(→r , t) + V −
r , t ψ(→r , t) (5.6)
∂t 2m
−r , t) the
In Eq. (5.6) m is the mass of the quantum mechanical system, V (→
potential energy function and,
−
→2 ∂2 ∂2 ∂2
∇ = + + , the Laplace operator.
∂ x2 ∂ y2 ∂ z2
∂ψ(x, t) ℏ2 ∂ 2 ψ(x, t)
iℏ =− + V (x)ψ(x, t) (5.7)
∂t 2m ∂ x 2
Solution of Schrodinger equation provides the wavefunction of the quantum
mechanical state.
272 5 Introduction to Quantum Mechanics
(i) Schrodinger equation contains terms with only the first power of ψ; hence, it is
linear in ψ. Further, if a given function ψ is a solution of Schrodinger equation
for a given system, then βψ, where β is constant, will also be a solution of
the Schrodinger equation. This property is referred as homogeneity of the
equation. Schrodinger equation is, therefore, both linear and homogeneous.
(ii) Since Schrodinger equation is first-order differential equation in time, the value
of wavefunction ψ if known at some initial time t 0 , it may be determined at
some later time t uniquely.
(iii) Let us consider a collection of N number of particles in a small volume dτ
of space at some point r→ in space which we take as our quantum mechanical
system. Further, let us assume that each of these N particles is represented by
the same wavefunction ψ(→ r , t). The average number of particles at time t in
volume dτ , which is the probability of finding the collection of particles in
volume dτ , is given by
r , t)∗ ψ(→
⟨Nt ⟩ = N ψ(→ r , t)dτ (5.8)
Equation (5.8) may be used to calculate the average number of particles at different
points in the space at the same instant t by changing the value of position vector r→.
Thus, one may generate a probability distribution function of the system of particles
in space at time t. Same technique may be used to generate the probability distribution
function at some time t , and so on. In this way, Schrodinger equation may be used
to follow the time evolution of the quantum mechanical system of particles.
The significance of the time-dependent Schrodinger equation lies in the fact
that it allows the determination of the time evolution of wavefunction ψ which in
turn provides the time evolution of the probability density distribution function.
Changes in probability density distribution function tell about the changes that
have taken place in the system as it evolves with time.
(iv) If ψ1 (→
r , t), ψ2 (→
r , t) and ψ3 (→
r , t) are three solutions of the Schrodinger equa-
tion for a given system, then ψ(→ r , t) = a1 ψ1 (→
r , t) + a2 ψ2 (→
r , t) + a3 ψ3 (→
r , t),
where a1 , a2 and a3 are arbitrary complex constants, will also be a solu-
tion of the Schrodinger equation. This property of the Schrodinger equation
is called the property of quantum mechanical superposition. In general, the
quantum mechanical principle of superposition may be stated as: if there are
more than one wavefunctions that are solutions of the Schrodinger equation for
a given system, then the linear combination of these wavefunctions will also be
a solution of the Schrodinger equation.
ψ(→
r , t) = φ(→
r ) f (t) (5.9)
∂φ(→
r ) f (t) ℏ2 −
→2
iℏ =− ∇ φ(→
r ) f (t) + V (→
r )φ(→
r ) f (t)
∂t 2m
or
d f (t) ℏ2 −
→
φ(→
r )iℏ =− f (t) ∇ 2 φ(→
r ) + V (→
r )φ(→
r ) f (t) (5.10)
dt 2m
1 d f (t) ℏ2 1 − →2
iℏ =− ∇ φ(→
r ) + V (→
r) (5.11)
f (t) dt 2m φ(→
r)
It may be marked that the terms appearing on the RHS of Eq. (5.11) are all
functions only of r and term on the LHS is a function only of time t. It is obvious
that a time-dependent term cannot be equal to the sum of two time-independent
terms, as is seen in Eq. (5.11). This is possible only if the time-dependent and the
time-independent terms are equal to some constant, say E, i.e.
1 d f (t)
iℏ =E (5.12)
f (t) dt
and
ℏ2 1 − →2
− ∇ φ(→
r ) + V (→
r) = E (5.13)
2m φ(→
r)
d f (t) i
= − E f (t)
dt ℏ
This equation may be easily integrated to give
274 5 Introduction to Quantum Mechanics
f (t) = e(− ℏ Et )
i
(5.14)
ℏ2 −
→2
− ∇ φ(→
r ) + V (→ r ) = Eφ(→
r )φ(→ r) (5.15)
2m
Equation (5.15) is called Schrodinger’s time-independent equation which may
be solved to obtain function φ(→
r ) provided the potential V (→
r ) is known.
Schrodinger’s time-independent equation may also be written as
Ĥ φ(→
r ) = Eφ(→
r) (5.16)
where
ℏ2 −
→2
Ĥ = − ∇ + V (→
r) (5.17)
2m
r )e(− ℏ Et )
i
ψ(→
r , t) = φ(→
r ) f (t) = φ(→ (5.18)
The probability density p for such case (where potential does not depend on time)
may be calculated using the expression
| | || | |
| |
|
|
p = |ψ(→ r )e(+ ℏ Et ) φ(→
r , t)| = |φ(→
r , t)∗ ψ(→ r )e(− ℏ Et ) | = |φ(→
i i
r )2 | (5.19)
Equation (5.19) tells that for a time-independent Schrodinger equation the prob-
ability density at a given point in space does not depend on time; it remains same at
all times.
The states of a quantum system for which the probability density does not depend
on time are called stationary states.
SAQ: Discuss the conditions when Schrodinger’s time-independent equation may
be used.
Operators are instructions of mathematical nature that are required to be carried out
on the object (wavefunction) over which they are made to operate. Some important
classes of operators are discussed in the following.
5.6 About Operators 275
The unity operator operating on an object leaves the object unchanged; for example,
r , t) = φ(→
Îφ(→ r , t).
If, for{ the given scalars α and} β and functions ψ(→r , t) and φ(→ r , t),
A αψ(→ r , t) + β̀φ(→
r , t) = α Aψ(→r , t) + β Aφ(→r , t), then operator A is called a
linear operator.
Let there be a hermitian operator Â, we define another operator † such that
{+∞ [ ] {+∞( )∗
∗
φ (→ r )d x =
r ) Âψ(→ 3
† φ(→
r ) ψ(→
r )d3 x (5.20)
−∞ −∞
Then operator † is called the hermitian adjoint or hermitian conjugate of operator
Â.
Suppose the operator  may be represented by a matrix M of appropriate dimen-
sions, then the hermitian conjugate operator will be given by the matrix M † which
may be obtained by first transposing the matrix M and then taking the complex
conjugate, i.e.
[ ]∗
M † = (M)T (5.21)
Operators that are their own hermitian conjugate are called hermitian operators;
that is to say that if † = Â, then  is a hermitian operator.
276 5 Introduction to Quantum Mechanics
One important property of hermitian operators is that their eigen values are
real. Since dynamic observables of quantum mechanical systems must also be real,
therefore, operators that represent dynamic variables must be hermitian.
Iˆ
Û Û † = Û † Û = Iˆ or Û † = = Û −1 (inverse operator) (5.24)
Û
Equation (5.24) tells that the inverse of a unitary operator is equal to its hermitian
conjugate.
SAQ: What is meant by a linear operator?
Without any derivation or proof we list here some properties of hermitian operators.
(i) The eigenvalues of a hermitian operator are real.
5.6 About Operators 277
(c) Product of two operators: An operator Ĉ may be defined as the product of two
operators  and B̂ when Ĉψ(x) =  B̂ψ(x) = φ(x).
Also,
As is evident, operator B̂ operates on ψ(x) to give a new function ϑ(x) such that
when  operates on ϑ(x) it gives the function φ(x).
(d) Division of two operators: Dividing an operator  by another operator B̂ is
equivalent to multiplying operator  by operator B̂ −1 , the inverse operator of
B̂, provided the inverse operator of B̂ exists.
(e) Power of an operator: Sometimes one finds expression like B̂ n ψ(x) where an
operator is raised to some power n. It simply means that the operator should
operate on the function successively n-times one after the other;
(n−1)
. .. .( )
B̂ n ψ(x) = B̂ B̂ B̂ . . . . . . B̂ B̂ψ(x)
278 5 Introduction to Quantum Mechanics
(f) Commutator operator: In general the products of the two operators  B̂ and
B̂ Â are not equal, i.e. Â B̂ /= B̂ Â. An operator Ĉ defined by the expression
(5.28) and written as [ Â, B̂] is called the commutator of  and B̂.
[ ]
Ĉ = Â, B̂ = Â B̂ − B̂ Â (Commutator) (5.28)
When anticommutator operator [ Â, B̂]+ = 0, the operators  and B̂ are said to
anticommute with each other.
When a system defined by a wavefunction, say, ψ(x) is operated by the product
of two operators  and B̂ and the final result of the operation does not depend on the
order of operations, the operators  and B̂ are said to commute with each other. Just
for example, if operator  means the instruction of writing your name and operator B̂
means drawing of an apple, then the order of operators is not important, the end result
of writing name and drawing the figure of an apple may be achieved by first writing
the name and then drawing the figure of apple, or first drawing the figure of apple and
then writing the name. In this example operator  and operator B̂ commute. However,
if operator  is the command to colour the apple and operator B̂ is the command to
draw the figure of an apple, then the product  B̂ will be meaningless, while product
B̂ Â will mean to draw the figure of an apple and to colour it, a meaningful command.
In case operators  and B̂ represent some dynamic variables of the system (like
the position and linear momentum, etc.) and if they commute, it means that both
dynamic variables of the system may be measured simultaneously with full accuracy
in the given state of the system. On the other hand if they do not commute, then both
the variables cannot be measured with complete accuracy in the given state of the
system. If two operators commute, then they can have the same set of eigenfunctions.
(g) Some properties of commutator: It may be proved that commutator obeys
following rules:
[ ] [ ]
(i) Â, B̂ = − B̂, Â
[ ]† [ † † ]
(ii) Â, B̂ = Â , B̂
[ ( )] [ ] [ ]
(iii) Â, B̂ + Ĉ = Â, B̂ + Â, Ĉ
[ ( )] [ ] [ ]
(iv) Â, B̂ Ĉ = B̂ Â, Ĉ + Â, B̂ Ĉ.
5.6 About Operators 279
Table 5.1 gives the list of some important and frequently used quantum mechanical
operators corresponding to classical observables.
SAQ: Why hermitian operators are used to describe observables?
Solved Examples
SE5.1 Show that function ϕ(y, t) = Aye−β y e−iωt where A, β and ω are arbitrary
2
{+∞ {+∞( )( )
∗
A∗ ye−βy e+iωt Aye−β y e−iωt dydt
2 2
I = ϕ (y, t)ϕ(y, t)dydt =
−∞ −∞
{+∞
∗
y 2 e−2β y dy
2
I =A A (5.30)
−∞
Angular momentum r→ × p→ L→
( ) ( )
∂
L x = −iℏ y ∂z − z ∂∂y L x = ypz − zp y
( )
L y = −iℏ z ∂∂x − x ∂∂z L y = (zpx − x pz )
( ) ( )
L z = −iℏ x ∂∂y − y ∂∂x L z = x p y − ypx
p2
Total energy: 2m + V (→
r)
ℏ →22
H = − 2m ∇ + V̂ (→
r)
Parity operator Mirror reflection
−→
P = −r (−x, −y, −z)
P→ = −iℏ∇;→ px = −iℏ ∂
∂x ; Linear momentum P
py = −iℏ ∂∂y ; pz = ∂
−iℏ ∂z
ℏ →2 2 p2
Kinetic energy T = − 2m ∇ 2m
280 5 Introduction to Quantum Mechanics
To evaluate integral I given by Eq. (5.30), we make use of the Gaussian integral
IGauu given by
{+∞ /
−αx 2 π
IGauu (α) = e dx = (5.31)
α
−∞
{+∞ /
dIGauu (α) d −αx 2 d π
= e dx =
dα dα dα α
−∞
or
{+∞ /
dIGauu (α) 2 −αx 2 π
= x e dx =
dα 4α 3
−∞
∫ +∞ √ π
x 2 e−αx dx =
2
or . Putting x = y and α = 2β in this equation one gets
∫−∞
+∞
/4α3 /
2 −2βy 2 π π
−∞ y e dy = 4(2β)3 = 32β 3 , and putting this back in Eq. (5.30) one get,
{+∞ /
∗ π
y 2 e−2β y dy = A∗ A
2
I =A A (5.32)
32β 3
−∞
Equation (5.32) shows that the function ϕ(y, t) is square integrable and hence
the given function fulfils all the conditions that a valid wavefunction of a quantum
mechanical system must fulfil. Therefore, it may be the state function of a quantum
system.
[ ]
SE5.2 Calculate the value of the commutator Â, B̂ when operators are defined
by the relations; Âϕ(x) = xϕ(x) and B̂ϕ(x) = −iℏ dϕ(x)
dx
, here ϕ(x) is a
valid wavefunction.
[ ] { }
Solution: The commutator Â, B̂ ϑ(x) = Â B̂ϑ(x) − B̂ Âϑ(x) = Â −iℏ dϕ(x)
dx
−
B̂{xϕ(x)}.
Or
[ ] ⎧ ⎫
dϕ(x) dxϕ(x)
Â, B̂ ϑ(x) = −iℏ Â − −iℏ
dx dx
( )
dϕ(x) dxϕ(x) dϕ(x) dxϕ(x)
= −iℏx + iℏ = −iℏ x −
dx dx dx dx
5.6 About Operators 281
( )
dϕ(x) dϑ(x)
= −iℏ x −x − ϑ(x) = iℏϑ(x) (5.33)
dx dx
[ ]
Equation (5.33) tells that the commutator Â, B̂ = iℏ.
SE5.3 How one defines the hermitian conjugate of a hermitian operator? An oper-
ator given by  = − d dϕ(y)
2
2
y
operates on a well-behaved state function ϕ(y).
Obtain an expression for the operator † and show it is same as Â.
{+∞ [ ] {+∞( )∗
∗
φ (→ r )d x =
r ) Âψ(→ 3
† φ(→
r ) ψ(→
r )d3 x
−∞ −∞
In the present case, the operator acts on a wavefunction that is the function of
only y, the above expression may be written as
{+∞ [ ] {+∞( )∗
∗
φ (→y ) Âψ(→y )dy = † φ(→y ) ψ(→y )dy (5.35)
−∞ −∞
{+∞ [ ] {+∞ [ ]
∗ ∗ d2
ILHS = φ (→y ) Âψ(→y )dy = φ (→y ) − 2 ψ(→y )dy (5.36)
d y
−∞ −∞
RHS of Eq. (5.36) may be integrated using the method of integration of two
multiples
{+∞
d dφ ∗ (y) dψ(y)
ILHS = −φ (→y ) ψ(→y )dy|+∞
∗
−∞ + (5.37)
dy dy dy
−∞
282 5 Introduction to Quantum Mechanics
First term in Eq. (5.37) vanishes as both functions φ(y) and ψ(y) are valid state
functions and therefore must vanish at y = ±∞, similarly the derivatives of these
functions must vanish∫at y = ∗±∞.
+∞
Therefore, ILHS = −∞ dφdy(y) dψ(y)
dy
let us integrate this in parts to get
⎧ ⎫ {+∞ 2 ∗ {+∞
dφ ∗ (y) +∞ d φ (y) d2 ∗
ILHS = ψ(y) |−∞ − ψ(y) = − φ (y)ψ(y)
dy dy 2 dy 2
−∞ −∞
∫ +∞ d2 ∗ ∫ +∞ ( d2 φ(y) )∗
or ILHS = −∞ − dy 2 φ (y)ψ(y) = −∞ − dy 2
ψ(y).
Substituting the value of ILHS back in Eq. (5.35), one gets
{+∞ ( 2 )∗ {+∞( )∗
d φ(y)
− ψ(y) = Â †
φ(→
y ) ψ(→y )dy
dy 2
−∞ −∞
( )∗ ( 2 )∗
φ(y)
This give, † φ(→y ) = − d dy 2 .
2 2
And † = − dy
d
2 ; but − dy 2 = Â.
d
Solution: Parity operator P changes the space coordinates r→ (x, y, z) of the state
−
→
function on which it operates to −r (−x, −y, −z). In order to prove that operator P
is hermitian we have to prove that the adjoint P † is equal to operator P. The adjoint
of the operator is defined by the integral equation
{+∞ [ ] {+∞( )∗
∗
φ (→ r )d x =
r ) Âψ(→ 3
† φ(→
r ) ψ(→
r )d3 x
−∞ −∞
{+∞ {+∞
∗
[ ] ( † )∗
φ (→ r )d x =
r ) Pψ(→ 3
P φ(→
r ) ψ(→
r )d3 x (5.38)
−∞ −∞
The RHS of the above equation may be denoted by I RHS and may be written as
5.7 Measurement of a Dynamical Variable in Quantum Mechanics 283
{+∞ {+∞
∗
[ ] [ ]
IRHS = φ (→ r )d x =
r ) Pψ(→ 3
φ ∗ (→
r ) ψ(−→
r )d3 x
−∞ −∞
{+∞
[ ]
= −φ ∗ (−→
r ) ψ(→
r )d3 x
−∞
∫ +∞ [ ] ∫ +∞ [ ]
IRHS = −∞ −φ ∗ (−→ r )d3 x = −∞ {Pφ(r )}∗ ψ(→
r ) ψ(→ r )d3 x putting this back
in Eq. (5.38) one gets
{+∞ {+∞
[ ∗
] ( )∗
{Pφ(r )} ψ(→
r )d x = 3
P † φ(→
r ) ψ(→
r )d3 x
−∞ −∞
Comparing the two sides of the above equation gives P = P † which means that
parity operator is hermitian.
Note: If Pψ(r ) = ψ(r ), the wavefunction is said to have even or positive parity, and
if Pψ(r ) = −ψ(r ), the wavefunction is said to have odd or negative parity.
In case Pψ(r ) /= ±ψ(r ), the wavefunction is said to have helicity.
r , t) = a1 Âϑ1 (→
Âϑ1 (→ r , t); r , t) = a2 Âϑ2 (→
Âϑ2 (→ r , t); r , t) = a3 Âϑ3 (→
Âϑ3 (→ r , t); . . .
r , t) = an Âϑn (→
Âϑn (→ r , t); . . . . . . . . .
(5.39)
284 5 Introduction to Quantum Mechanics
r , t) = a1 Âϑ1 (→
Âϑ1 (→ r , t); r , t) = a1 Âϑ2 (→
Âϑ2 (→ r , t); r , t) = a1 Âϑ3 (→
Âϑ3 (→ r , t)
(5.40)
r , t) = ai ϑi (→
Âψ(→ r , t) where i may have any value from 1 to 5.
This means that if same measurement is repeated several times, then according to
quantum mechanics, one of the five eigenvalues will appear as the measured value
each time. Quantum mechanics does not tell which eigenvalue value will appear in a
particular measurement. However, quantum mechanics does predict the probability
with which a particular eigenvalue will appear in repeated measurements. Now there
may be three cases: (a) when the eigenvalue spectrum of the operator  is discrete and
non-degenerate, (b) when the eigenvalue spectrum is discrete but degenerate and (c)
when the eigenvalue spectrum in continuous and not discrete. Method of calculating
probability for a particular eigenvalue in each of the above case is discussed in the
following.
(a) In the case when the eigenvalue spectrum of the operator  is (discrete
) and
non-degenerate, then quantum mechanics gives the probability P a j that the
measured value will be a particular aj as
|( )|2
( ) | ϑj, ψ |
P aj = (5.41)
(ψ, ψ)
In Eq. (5.41), ϑ j (→
r , t) is the eigenfunction of operator  corresponding to
eigenvalue aj , i.e.
r , t) = a j ϑ j (→
Âϑ j (→ r , t) (5.41a)
And,
5.7 Measurement of a Dynamical Variable in Quantum Mechanics 285
{+∞
( )
ϑj, ψ = r , t)∗ (ψ(→
ϑ j (→ r , t))d3 x (5.42)
−∞
{+∞
(ψ, ψ) = |(ψ(→
r , t))|2 d3 x (5.43)
−∞
∑m ||∫ +∞ k∗ 3 |
|2
( ) |
k=1 −∞ ϑ j (→
r , t)ψ(→ r , t)d x |
P aj = ∫ +∞ (5.45)
−∞
|(ψ(→ r , t))| d x
2 3
(b) If the operator  does not have discrete set of eigen values but the eigen-
value spectrum is continuous, then the probability that the measured eigen
value will lie between a, and (a, + da, ) is given by
( ) |ψ(a)|2
dP a , = ∫ +∞ da (5.46)
−∞
|ψ(a , )|2 da ,
SAQ: Under what assumption(s) the wavefunction for a system may be separated
into two independent parts one depending on space coordinates and the other
on time coordinates.
286 5 Introduction to Quantum Mechanics
Let us once again consider a quantum mechanical system in a state defined by ψ(→ r , t).
Further, let there be a hermitian operator  that corresponds to a dynamic variable
‘a’. We assume that operator  has only five discrete non-degenerate eigenvalues
designated as; a1 , a2 , a3 , a4 and a5 with corresponding eigen states φi (→ r , t); i =
1, 2, 3, 4, 5. Now according to the postulate 4 of quantum mechanics, whenever any
attempt will be made to measure the variable ‘a’, the process of measurement will
switch the system from initial state ψ(→ r , t) to one of the eigen states φi (→r , t). The
process of switching will be random; in first attempt the system may be switched
to eigen state φ3 (→
r , t) to give the value of variable ‘a’ as a3 , in the next attempt of
measurement the system may be switched to eigen state φ2 (→ r , t) to give the value as
a2 and so on. Therefore, according to quantum mechanics, the result of measurement
will be one of the eigen values; it may be a1 or a5 , or a3 , or a2 or a4 , only one of the
eigen values and nothing else.
We now consider the actual measurement of the dynamic variable ‘a’ in a labo-
ratory experiment. Suppose the laboratory measurement is repeated three times in
identical conditions and experimental values x 1 , x 2 , x 3 have been recorded. The
normal practice is to take the mean value of these three experimental values as the
final experimental value; so the experimental value of variable ‘a’ is given as
Now the question is: To which eigen value of operator  the experimental value
aexp may be compared? Quantum mechanics says that the experimental value of a
variable should be compared with the expectation value of the variable ‘a’ and not
with any individual eigen value of the operator. The expectation value of a variable
is defined as under.
Expectation value of a variable ‘a’ in state ψ is denoted by
∫ +∞ [ ]
−∞ ψ ∗ (→
r , t) Âψ(→
r , t) d3 x
⟨a⟩ = ∫ +∞ (5.48)
−∞ ψ ∗ (→
r , t)ψ(→
r , t)d3 x
{+∞ [ ]
⟨a⟩ = ψ ∗ (→
r , t) Âψ(→
r , t) d3 x (5.49)
−∞
5.7 Measurement of a Dynamical Variable in Quantum Mechanics 287
SAQ: Is it possible to measure the value of a dynamical variable when the system
is not in one of the Eigen states of the hermitian operator corresponding to
the variable?
Solved Examples
SE5.5 Calculate the expectation value of variable x 2 for a system in quantum state
ψ(x) = 4e−k(x−b) . Here k and b are real constants.
2
or
(
∫ +∞ )
2 −2k(x−b)2
⟨ 2⟩ −∞ x e dx
x = ∫ +∞ ( 2)
(5.51)
−2k(x−b) dx
−∞ e
In Eq. (5.51) both the enumerator and the denominator are definite integrals of
standard form with values given as
∫ +∞ ( )
2 −2k(x−b)2 √ π
⟨ 2⟩ −∞ x e dx 3 1
x = ∫ +∞ ( ) = √32k
π
=
−2k(x−b)2 dx 4k
−∞ e 2k
Solution: As given in the problem, the initial state of the system is a superposition
state which is the linear combination of two eigen states ϕ1 and ϕ2 of the Hamiltonian
H of the system. Further, according to the fourth postulate of quantum mechanics,
whenever total energy of the system will be measured, the process of measurement
will drive the system to one of the two possible eigen states ϕ1 or ϕ2 .
Suppose, the system is driven to eigen state ϕ1 .
Then using the expression
∫ +∞
Equation (5.53) may further reduced using the identity −∞ χm χn = 1, i f m =
n, other wise = 0. |∫ |
| +∞ ∗ ( 1 ) ( ∗ 1 )|2
| −∞ ϕ1 2 ϕ1 + ϕ1 3 ϕ2 | dx
P(a1 = K ) = ∫ +∞ ( 1 )∗ ( 1 ) ( 1 )∗ ( 1 )
−∞ 2 ϕ1 ϕ + 3 ϕ2 3 ϕ2 dx
2 1
Or |∫ | .
| +∞ ∗ ( 1 )|2
ϕ ϕ
| −∞ 1 2 1 | dx
= ∫ +∞ ( 1 )∗ ( 1 ) ( 1 )∗ ( 1 )
−∞ 2 ϕ1 ϕ + 3 ϕ2 3 ϕ2 dx
2 1
Since it is given that eigenfunctions ϕn are normalised,
{+∞ {+∞
| ∗ |2 | ∗ |2
|ϕ ϕ1 | dx = |ϕ ϕ1 | dx = 1
1 1
−∞ −∞
∫ |
1 +∞ | ∗ |2
|
4 −∞
ϕ1 ϕ1 dx 1
P(a1 = K ) = ∫ +∞ | | ∫ | | = 4
= 9/13
1 |ϕ ∗ ϕ1 |2 dx + 1 +∞ |ϕ ∗ ϕ2 |2 dx 1
+ 1
4 −∞ 1 9 −∞ 2 4 9
It may be observed that quantum mechanically, the total energy of the system may
assume a value of either 1 K or 8 K. However, the expectation value of energy that
may be compared with experimentally measured energy is 3.15 K.
ℏ2 ∂ 2 x
x )e(− ℏ Et )
i
− φ(→
x ) + V (→ x ) = Eφ(→
x )φ(→ x ); ψ(→
x · t) = φ(→
2m ∂ x 2
E of the particle may have any value, from E = V min onwards in a continuous way.
However, in quantum mechanics, the particle cannot have all continuous values, the
particle can stay only with some discrete values of energy, say, E 0 , E 1 , E 2 , E 3 , etc.
Particle cannot have energy between E 0 and E 1 ; E 1 and E 2 or E 2 and E 3 and so
on. The energy spectrum of the particle in a potential well is shown in Fig. 5.2. The
lowest energy state with energy E 0 is called the ground state and the state next higher
in energy E 1 the first excited state and so on. The energy spectrum can be divided
into two distinct groups: (i) energy states below V 1 (the lower height of the potential
well) and (ii) states with energy E > V 1 .
Energy states with energies below V 1 are called bound states. It is because if
looked classically, the particle with energy V min < E < V 1 will remain confined
within the region of space defined by the potential well, it will be constrained to
move up to one end of the potential and will then return back at the classical turning
point to travel to the other turning point on the other end. Hence these states are
called bound states. Energy states above energy V 1 are called scattering states as
once the particle is in one of the scattering states it is not confined to the potential
well and may scatter away.
Important properties of bound state energy states and corresponding wavefunc-
tions are
1. The bound state energy levels in the case of a one-dimensional potential are
discrete and non-degenerate.
2. The ground state wavefunction φ 0 (x) has no node, which means that in the
space +∞ > x > −∞ the wavefunction does not become zero. Wavefunction
φ1 (x) for the first excited state has one node (becomes zero at one value of x),
wavefunction φ2 (x) for the second excited state has two nodes, φ 3 (x) three
nodes and so on.
SAQ: What will be the difference in the description of bound states looked from
classical physics?
A particle is said to be a free particle if it has energy E but does not face any potential
V (x).
Space part of one-dimensional time-independent Schrodinger equation describing
a free particle of mass m and energy E may be written as
ℏ2 ∂ 2 x
− φ(→
x ) = Eφ(→
x) (5.54)
2m ∂ x 2
or
∂2x 2m E
φ(→
x ) + 2 φ(→
x) = 0 (5.55)
∂x 2 ℏ
Substituting k 2 = 2m E
ℏ2
; E > 0, above equation reduces to
∂2x
φ(→
x ) + k 2 φ(→
x) = 0 (5.56)
∂x2
Equation (5.56) has two solutions; φ1 = eikx and φ2 = e−ikx that satisfy the
equation. The complete wavefunction ψ(x, t) that contains the time dependence
e(− ℏ Et ) for φ1 and φ2 may be written as
i
and
292 5 Introduction to Quantum Mechanics
Since ψ1 (x, t) and ψ2 (x, t) are both solutions of Schrodinger equation, it follows
from the principle of quantum mechanical superposition, that their linear combination
ψ(x, t), given below, will also be a solution of Schrodinger equation of the system.
First term, A1 e ℏ ( px−Et) , in Eq. (5.60) represents a plane wave travelling in the
i
x-direction. It may be observed that there are no boundary conditions and hence no
restrictions on the values of E, which means that the energy of a free particle can
have any value, i.e. energy may have continuous values.
From the derivation of Eq. (5.60) it appears that according to quantum mechanical
treatment, a free particle may be represented as two plane waves moving in positive
x-direction and the other in negative x-direction with well-defined energies (E =
ℏ2 k 2
2m
) and well-defined linear momentum +ℏk and −ℏk, respectively. However, this
is not correct. It is because the first point that must be considered is whether the wave-
function ψ(x, t) given by Eq. (5.60) is a quantum mechanically valid wavefunction
or not. Problems with this wavefunction are
(i) The probability density of finding the particle at some point x for either of the
∫ +∞ φ1 or φ2 is given by
two solutions
P(x) = −∞ φi∗ φi dx is equal to ||Ai |2 (i = 1, 2) which does not depend either
on distance x or on time t. This implies that the total probability of finding
the particle somewhere in the space will tend to become infinite; which is
physically impossible.
(ii) The second problem is that [ the wavefunctions ψ(x, t) could not be normalised
∫ +∞ ∫ +∞ ∫ +∞ ]
because −∞ ψ ∗ ψdx = |A1 |2 −∞ dx + |A2 |2 −∞ dx → ∞, which tends
to infinity.
(iii) Lastly, the velocity with which the particle is moving v is given as v = mp = ℏk
m
while the velocity of the right or left moving plane wave is denoted by vwav =
ℏk
2m
.
It may be observed that according to this description the particle is moving
with twice the velocity of the plane wave which represents it.
5.8 Some One-Dimensional Problems 293
{+∞
1
ψ(x, t) = √ Ak (k)ei(kωx−ωt) dk (5.61)
2
−∞
The amplitude
{+∞
Ak (k) = ψ(x, 0)e−ikωx dx (5.62)
−∞
Here,
E
ω= (5.63)
ℏ
Since an infinite (or sufficiently large) number of plane waves that superimpose on
each other are coherent, they interfere producing a pattern of interference maximums
with largest amplitude at x = 0 as shown in Fig. 5.3. The intensity of successive
maximums decreases with the increase of distance x on both sides of the origin.
The envelope of the interference maximum makes the wave packet that is localised
around x = 0.
Fig. 5.3 Snapshot of the wave packet formed by the interference of large number of plane waves.
The wave packet is localised at x = 0 and moves in +x-direction with the group velocity V group
294 5 Introduction to Quantum Mechanics
Since the exponential factor eikx oscillates rapidly, the wavefunction ψ(x, t) given
by Eq. (5.61) undergoes destructive interference and vanishes at x = ∓∞. In a
similar way, the amplitude function Ak (k) defined by Eq. (5.62) is localised at k =
0 in k-space and goes to zero for large values of k.
The size of the wave packet in space is specified by the half-width ∓Δx where the
intensity of maximum [|ψ(x = ∓Δx)|2 ] drops to √1e of its maximum value at origin
[|ψ(x = 0)|2 ]. Similarly, the size of the wave packet corresponding to the amplitude
function Ak (k) in k-space is defined by the half-width ∓Δk, such that the magnitude
of the function [|Ak (k = ∓Δx)|2 ] at k = ∓Δk is √1e of [|Ak (x = ∓Δx)|2 ]. That is
[|ψ(x=∓Δx)|2 ] = [|Ak (k=∓Δx)|2 ] = √1 .
[|ψ(x=0)|2 ] [|Ak (k=0)|2 ] e
It can be shown that both ψ(x) and Ak (k) are normalised to unity.
One may ask the physical interpretation of the wave packet. The physical inter-
pretation of the wave packet may be given as follows; |ψ(x, t)|2 is the probability
density that the particle is found at point x at time t, while |ψ(x, t)|2 dx may be
interpreted as the probability of finding the particle in the interval x and (x + dx).
Similarly, |Ak (k)|2 gives the probability density of measuring the wave vector value
as k or momentum value p = k/ℏ of the particle. Also, |Ak (k)|2 dk may be defined as
the probability of measuring the value of wave vector between k and (k + dk).
Identifying a free particle by a wave packet removes all difficulties associated with
plane wave representation of the free particle. Wave packet and the wavefunctions
are normalised; wave packet description provides probability density which never
approaches infinity, momentum and location of the particle are not known exactly at
a given time. Problem regarding the speed of the particle and the plane wave is no
more, particle which is represented by the wave packet moves with the group velocity
while individual waves travel with the phase velocity. Hence there is no confusion
about the velocity of the particle and the velocity of the wave.
SAQ: In quantum physics a free particle is represented by wave packet. What is the
mechanism by which the wave packet is formed?
A one-dimensional infinite potential well of small width ‘a’ is shown in Fig. 5.4.
The potential well is asymmetric because it is not symmetrical about the vertical axis
through x = 0. The potential V (x) is zero between x = 0 and x = a; and for all other
values of x it is infinite. In mathematical language the potential well may be defined
as
⎧
0, for 0 < x < a
V (x) = (5.64)
∞, for x ≤ 0, x ≥ a
We wish to study the energy states for a particle of mass m and energy E placed
inside the potential well. Let us first look the problem from the point of view of
classical physics. Since the particle is confined from both sides by infinite potential
walls, the particle will not be able to cross the potential walls at x = 0 and at x = ∞,
will turn back
√
when it hits the potential walls and will move with constant speed v
= p/m = ∓ 2m m
E
from one end to the other end of the well. From the viewpoint of
quantum physics, since the particle is confined in a small space it corresponds to the
problem of bound states.
The time-independent Schrodinger equation for this case (as in the case of a free
particle) may be written as
ℏ2 ∂ 2 x
− φ(→
x ) = Eφ(→
x)
2m ∂ x 2
or
∂2x 2m E
φ(→
x ) + k 2 φ(→
x ) = 0, where k 2 = 2 ; E > 0 (5.65)
∂x 2 ℏ
Function ϑ(x) will also be a solution of Eq. (5.65). It is now required to determine
the value of arbitrary constants A and B. To find the values of these constants, we
review the boundary conditions; at x = 0 and x = a, where the potential is infinite,
function ϑ(x) must vanish.
296 5 Introduction to Quantum Mechanics
Therefore, when
Also, when
One may expend exponential terms in sine and cosine forms to get
or
or
In Eq. (5.68), arbitrary constant A is not zero; therefore, sin ka = 0; this is possible
only when
The value n = 0 is not taken because in that case the function ϑ(x) will become
zero everywhere. √
On substituting the value of k = 2mℏ
E
in Eq. (5.69) one gets
√
2m E π 2 ℏ2
a = nπ ; or E = n 2 ; n = 1, 2, 3 . . . (5.70)
ℏ 2ma 2
Equation (5.70) tells that a particle confined in an infinite potential well may have
many discrete values for its energy E corresponding to different values ( 2 of2 )integer n.
π ℏ
The lowest energy state, called the ground state, has energy E 0 = 12 2ma 2 ; the first
( 2 2)
π ℏ
excited state has the energy E 1 = (2)2 2ma 2 and so on. It may be observed that the
energy separation between successive energy states is not uniform; it increases with
the increase of the value of n. Following conclusions may be drawn from Eq. (5.70):
(i) Particle in a box can move within the infinite potential well only with fixed
π 2 ℏ2
discrete value of energies given by, E (n−1) = n 2 2ma 2 , where n can have integer
(ii) There is infinite number of discrete energy levels or states that a particle in an
infinite potential well may have. Separation between two consecutive energy
states increases with the value of the positive nonzero integer n.
(iii) Energy E (n−1) is inversely proportional to the square of the width of the potential
well; i.e. E (n−1) ∝ a12 .
It may be noted that energy E (n−1) is the kinetic energy of the particle, since the
potential energy in the region between two infinite potential walls is zero. Figure 5.5
shows the spectrum of allowed energy states of a particle in an infinite potential well.
As shown in this figure, the minimum energy that a particle may have inside the box
is E 0 and the higher energy states are not equidistant, rather the separation increases
with the energy of the state. The lowest energy state E 0 is called the ground state and
the next higher energy state E 1 as the first excited state, E 2 the second excited state
and so on.
Since the energy eigenvalue E (n−1) depends on the value of the positive integer n,
there will be n eigenfunctions corresponding to each energy value. The space part of
eigenfunctions, using Eq. (5.68), may be written as
( nπ )
ϑ(n−1) (x) = Dn sin x (5.71)
a
298 5 Introduction to Quantum Mechanics
where Dn is a constant, the value of which may be obtained using the condition of
normalisation given below
{+∞ {+a
∗ ∗
|Dn | 2
ϑ(n−1) ϑ(n−1) dx = |Dn | 2
ϑ(n−1) ϑ(n−1) dx = 1
−∞ 0
or
∫a
We now make the use of the standard result 0 sin2 (θ ) = a
2
− sin42θ |a which gives
{a } /
2
|Dn |2
= 1 or Dn = (5.72)
2 a
Therefore, the normalised space part of the eigenfunction for (n − 1) energy state
may be written as
/
2 ( nπ )
ϑ(n−1) (x) = sin x (5.73)
a a
Eigenfunctions, ϑ0 (x), ϑ1 (x) and ϑ2 (x) for the ground state, first excited state
and the second excited states within the width ‘a’ of the potential well are shown
in Fig. 5.6. It may be observed that ϑ0 (x) does not have zero value for any value of
x, it does not have any node within the width of the potential well, ϑ1 (x) has one
node as it gets a zero value for x = 0.5a. ϑ3 (x), and the normalised space part of the
eigenfunction for the second excited state has two nodes, indicated by arrows in the
figure. In general it may be said that the eigenfunction for the (n − 1)th excited state
will contain n-nodes.
The total wavefunction including the time-dependant part may be written as
or
/
2 ( nπ ) −i
( ) ( )
n2 π 2 ℏ n2 π 2 ℏ
−i t t
ψ(n−1) (x, t) = ϑn (x)e 2ma 2 = sin x e 2ma 2 (5.74)
a a
Since Schrodinger equation is linear, the most general stationary state solu-
tion for one-dimensional infinite well is given as the linear combination of several
wavefunctions of type (5.74) with different multiplicative constants Cn
5.8 Some One-Dimensional Problems 299
Fig. 5.6 Space part of eigenfunctions for ground, first and second excited states for a particle in
infinite potential
/
2 ( nπ ) −i
∞ ( )
gen
∑ n2 π 2 ℏ
t
ψ(n−1) (x, t) = Cn sin x e 2ma 2 (5.75)
n=1
a a
Results obtained for one-dimensional case may be extended to the two and the three-
dimensional infinite potential wells of width ‘a’ in each direction. In the case of 1D
box the energy levels of the particle are specified by Eq. (5.70) as given below
π 2 ℏ2
E (n−1) = n 2
2ma 2
In a two-dimensional case, the corresponding expression for energy may be written
as ( )( π 2 ℏ2 )
E (n−1) = n 2x + n 2y 2ma 2 where nx and ny may have values 1, 2, 3, …
So the ground state energy of the particle in a two-dimensional well of width ‘a’
will be
( ) ( 2 2)
( ) π 2 ℏ2 π ℏ
E 0,0 = 12 + 12 2
= 2
2ma 2ma 2
300 5 Introduction to Quantum Mechanics
( )( π 2 ℏ2 ) ( 2 2)
π ℏ
And the first excited state energy as E 0,1 = 12 + 22 2ma 2 = 5 2ma 2 and so
on.
In the three-dimensional case the energy may be written as(
( )( π 2 ℏ2 ) π 2 ℏ2
)
Ground state energy E (0,0,0) = 12 + 12 + 12 2ma 2 = 3 2 .
( 2 ) ( 2 2 )2ma ( 2 2 )
π ℏ π ℏ
Energy of first excited state E (0,0,1) = 1 + 12 + 22 2ma 2 = 6 2ma 2 .
In the last section we studied the problem of a particle in an infinite potential well.
In this section we will study the quantum mechanical behaviour of a particle of mass
m and kinetic energy E which is projected on a one-dimensional potential barrier of
height V 0 and of width ‘a’. A potential barrier is a space of width ‘a’ where there is
a potential of constant magnitude V 0 all over the space, and this space with potential
V 0 is surrounded from all sides by the free space where there is no potential, as
shown in Fig. 5.6. The potential barrier may be defined as
⎧
⎨ = 0 when X < 0
V = = V0 when 0 ≤ X ≤ a
⎩
= 0 when X > a
Fig. 5.7 Particle of mass m and energy E impinging on a one-dimensional potential barrier of
height V 0 and width a
ℏ2 d2 φ(x)
− + V (x)φ(x) = Eφ(x) where − ∞ < x < +∞ (5.76)
2m dx 2
Here, φ(x) represents the wavefunction of the particle.
Since the interesting case of barrier tunnelling occurs when the energy of the
particle is less than the barrier height, we assume that V 0 > E. Further, the total
one-dimensional space may be divided in to three regions: Region-I from x = −∞
to x = 0 where V (x) = 0; Region-II from x = 0 to x = a where V (x) = V 0
and Region-III from x = a to x = +∞ where V (x) = 0. If ϑ1 (x), ϑ2 (x) and ϑ3 (x),
respectively, denote the space part of the particle wavefunctions in Region-I, Region-
II and Region-III, then Schrodinger equations for the three regions may be written
as
Region-I
ℏ2 d2 ϑ1 (x) d2 ϑ1 (x) 2m E
− = Eϑ1 (x) ⇒ + kI2 ϑ1 (x) = 0, where kI2 =
2m dx 2 dx 2 ℏ2
(5.77)
Region-II
ℏ2 d2 ϑ2 (x) d2 ϑ2 (x)
− + V0 ϑ2 (x) = Eϑ2 (x) ⇒ = kII2 ϑ2 (x);
2m dx 2 dx 2
302 5 Introduction to Quantum Mechanics
2m(V0 − E)
where kII2 = (5.78)
ℏ2
Region-III
ℏ2 d2 ϑ3 (x) d2 ϑ3 (x) 2m E
− = Eϑ3 (x) ⇒ + kI2 ϑ3 (x) = 0, where kI2 =
2m dx 2 dx 2 ℏ2
(5.79)
It may be noted that Region-I and Region-III have no potential and we are consid-
ering a particle
/ of mass M and energy E; therefore, the value of the wave number
kI = kIII = 2Mℏ2
E
.
We now attempt to write wavefunctions for the three regions. Let us first consider
Regions-I and III where the potential is zero and regions are free.
Region-I
Let us examine Eq. (5.77). Wavefunction ϑ1 (x) = Ae+ikI x + Be−ikI x , where A and
B are arbitrary complex constants, satisfies Eq. (5.77), which may be verified by
differentiating the wavefunction ϑ1 (x) twice with respect to x. As may be observed,
Ae+ikI x represents a plane wave moving in positive x-direction and Be−ikI x represents
a plane wave moving in negative x-direction. It may be noted that wave represented
by Aeikx = A(cos kx + i sin kx) or by Be−ikx = B(cos kx − i sin kx) has oscillatory
nature. What happens is that the incident wave when hits the potential barrier at x
= a, a part of the wave gets reflected back. The reflected wave moving in negative
x-direction is represented by ϑIref = Be−ikI x . Hence, one may write
Region-III
Similarly, ϑ3 (x) = Fe+ikI x + Ge−kI x satisfies Eq. (5.79) for Region-III. However, it
may be realised that there must not be any reflected wave in Region-III as the potential
free space extends up to +∞. This means that constant G in above expression must
be zero. Hence the wavefunction in Region-III may be written as
Region-II
Wavefunction ϑ2 (x) must satisfy Schrodinger Eq. (5.78)
5.8 Some One-Dimensional Problems 303
d2 ϑ2 (x) 2m(V0 − E)
= kII2 ϑ2 (x); where kII2 =
dx 2 ℏ2
Since it is assumed that V 0 > E, kII2 is a real positive quantity and kII will also be
real quantity. The general solution to the Schrodinger equation in Region-II is given
by
It is worth noting that the wavefunction in Region-II shows two waves, one De−kII x
that is not oscillatory but decays exponentially with x and the other CekII x that
increases exponentially with x. The wavefunction in Region-III may thus be written
as
Having derived expressions for the space part of wavefunctions in three regions,
we now proceed to find the value of unknown constants, A, B, C, D and F. To
obtain the values of these constants, one uses the boundary conditions that the three
wavefunctions must obey.
(i) Condition of continuity demands that wavefunctions on the two sides of the
potential boundary should match each other, that means
Putting
Putting
(ii) Condition of smooth joining demands that the first derivatives of wavefunc-
tions on the two sides of the potential boundary evaluated at the boundary must
be equal.
And
dϑ2 (x) dϑ3 (x) ikI ikI a
= |x=a that gives CekII a − De−kII a = Fe (5.88)
dx dx kII
It follows from Eqs. (5.86) and (5.88), once by adding and by subtracting, that
( )
F ikI a ikI −kII a
C= e 1+ e (5.89)
2 kII
and
( )
F ikI a ikI kII a
D= e 1− e (5.90)
2 kII
= e −
A 2 kI 2
⎧ ⎫
F ikI a ikII
= e cosh(kII a) − sinh(kII a) (5.91)
A kI
and
⎧ ⎫
B F ikI a ikI
1− = e cosh(kII a) + sinh(kII a) (5.92)
A A kII
and
F ikI a 2
e ={ ( ) } (5.93a)
A 2 cosh(kII a) + i kI −
kII kI
sinh(kII a)
kII
F ikI a
We substitute the value of A
e from Eq. (5.93a) in Eq. (5.94) to get
( )
B
kII
kI
{sinh(kII a)}
− kI
kII
= −i { ( ) } (5.95)
A 2 cosh(kII a) + i kkIII − kkIII sinh(kII a)
F 2e−ikI a
={ ( ) } (5.96)
A 2 cosh(kII a) + i kkIII − kI
sinh(kII a)
kII
|F|2 |B|2
From Eqs. (5.95) and (5.96) one may calculate |A|2
and |A|2
as given below
( )∗ ( )
|F|2 F∗ F F F
= ∗ =
|A| 2 A A A A
⎛ ⎞∗
−ikI a
2e
= ⎝{ ( ) }⎠
2 cosh(kII a) + i kI − kII sinh(kII a)
kII kI
⎛ ⎞
−ikI a
⎝{ 2e
( ) }⎠
2 cosh(kII a) + i kkIII − kkIII sinh(kII a)
⎛ ⎞
|F|2 ⎜ 4 ⎟
=⎝ ( 2 2 )2 ⎠ (5.97)
|A|2 k −k
4 cosh2 (kII a) + kIIII K II sinh2 (kII a)
and
( 2 2 )2
kII −kI
|B|2 kII K I
sinh2 (kII a)
= ( ( 2 2 )2 ) (5.98)
|A|2 kII −kI
4 cosh (kII a) + kII K I
2
sinh (kII a)
2
If v denotes the velocity of the incident particle, then the flux incident on the potential
barrier is given by
Transmission coefficient T , which is the probability that the incident particle will
cross through the potential barrier and appear on the other side of it, is defined as the
ratio of the transmitted flux to the incident flu and may be given by
Equation (5.101) tells that transmission coefficient has a finite nonzero value indi-
cating that a quantum particle has a finite probability of going across a potential barrier
of height larger than particle’s energy, a phenomena which is not possible in classical
physics. The process of going across the potential barrier of height V 0 by a quantum
particle of energy E, where V 0 > E, is called quantum mechanical tunnelling. Many
phenomena in physics, like alpha radioactive decay, transfer of charge in digital elec-
tronics, etc., can be explained only on the basis of quantum tunnelling. Tunnelling
is a quantum mechanical effect without and classical counterpart.
Equation (5.101) for the transmission coefficient may be rewritten using the
identity cosh2 (kII a) = 1 + sinh2 (kII a) as
1
T =⎧ ( 2 2 )2 ⎫ (5.102)
k +k
1 + 41 kIIII K II sinh2 (kII a)
We now substitute the values of kI and kII in above equation from Eqs. (5.78) and
(5.79) as
kI2 = 2m
ℏ2
E
and kII2 = 2m(Vℏo2−E) to get
1
T ={ ( √ )} (5.103)
V02
1+ 1
4 E(V0 −E)
sinh 2 a
ℏ
2m(V0 − E)
In the case when the barrier height V 0 is much larger than energy E of the particle,
i.e.
/
(a √ ) a √2mV E
0
2m(V0 − E) = 1− ≫1 (5.104)
ℏ ℏ V0
(a √ ) 1 a√2m(V0 −E )
sinh 2m(V0 − E) ≈ e ℏ (5.105)
ℏ 2
( √ )
Substituting this value for sinh ℏa 2m(V0 − E) in Eq. (5.103) one gets
1 1
T ={ ( √ )} = ⎧ √ ⎫
V02 2m (V0 −E )
1+ 1
4 E(V0 −E)
sinh2 ℏa 2m(V0 − E) 1+ 1 V02 1
e
2a
ℏ
4 E(V0 −E) 4
or
( ) ( √ )
16E E − 2a 2mℏ(V0 −E )
T = 1− e (5.106)
V0 V0
It may be observed that Eq. (5.106) gives the magnitude of the transmission
coefficient in the low energy limit. It may further be shown that in case when E and
V 0 are comparable, i.e. E ∼ V0 , transmission coefficient becomes
1
T =( ) (5.107)
ma 2 V0
1+ 2ℏ2
As mentioned earlier, when the incident particle beam hits the potential barrier at x
= 0, a part of the incident beam gets reflected. The reflection coefficient denoted as
R and defined as the ratio of the reflected flux to the incident flux may be written as
⎛ ( )2 ⎞
kII2 +kI2
|B| 2 sinh 2
(k a)
⎜ II
kII K I ⎟
R= =⎝ ( 2 2 )2 ⎠ (5.108)
|A|2
kII −kI
4 cosh (kII a) + kII K I
2
sinh (kII a)
2
1
R= ( ) (5.109)
2ℏ2
1+ ma 2 V0
Tunnelling and reflection are quantum phenomena based on the wave nature of
matter, when a wave hits the boundary separating the two media a part of the wave is
reflected and a part is transmitted. In case the width of the second medium is small,
the transmitted wave propagates to the other end of the second medium and may be
transmitted across the second boundary.
308 5 Introduction to Quantum Mechanics
It may be observed that the part of the wave transmitted in the potential barrier does
not have oscillatory nature, rather it decays exponentially. Matched and smoothly
joined wavefunctions at the boundaries of the potential barrier are shown in Fig. 5.8.
SAQ: It has been shown that a free particle may be represented by a wave packet
and not by a plane wave. However in this example of quantum tunnelling the
wavefunction for the free particle is taken as a plane wave. Can you give a
reason why is it justified?
We have seen how quantum mechanics predicts non-uniform discrete energy levels
for a particle confined in a limited space, in contrast to the continuous energy states
predicted by classical physics. Similarly, quantum mechanical tunnelling has no
parallel in classical physics. Uncertainty principle, put forward by Heisenberg in
1927, is another characteristic of quantum mechanics. It tells that there is fuzziness
in nature, particularly at microscopic level. In order to appreciate the uncertainty
principle it is required to define pairs of dynamic observables called canonical
conjugates. Typical canonical conjugates are position of a particle r→ and its linear
momentum − →p ; their three components; position coordinate x and x-component of
linear momentum px ; position coordinate y and y-component of linear momentum
p y and position coordinate z and z-component of linear momentum pz form pairs
of conjugate variables. Another pair of canonical conjugates is the energy E of the
particle and time t at which the energy is measured. According to this principle, each
observable of the canonical pair cannot be measured simultaneously with complete
accuracy. According to Heisenberg uncertainty principle, it is not possible to nail
down with absolute accuracy the speed of a microscopic particle and its location
simultaneously, more one tries to accurately determine the speed less he knows
about the position of the particle. If Δx and Δpx are, respectively, the uncertainties
in the measurement of the x-coordinate of a particle and its linear momentum px in
x-direction at some instant of time, then according to the uncertainty principle
Δx · Δpx ≥ ℏ/2
5.10 Correspondence Principle and Ehrenfest’s Theorem 309
Similarly,
Here, ΔE and Δt are the uncertainties in the measurement of time t and energy E
of the particle. A common misconception about the principle of uncertainty is that it
arises because of the limitations of measuring instruments; however, the fact is that
in spite of the availability of most accurate and precise measuring instruments, the
law puts limits on the minimum uncertainties in measured values. The argument put
forward in support of the uncertainty principle is that the very process of measurement
alters the conditions of the system introducing uncertainties. Though Heisenberg
derived the principle, however, the derivation is beyond the scope of our present
discussion.
Uncertainty principle has several applications in quantum physics; for example the
excited states of a system have a certain mean life, say, Δτ , which is the uncertainty
in time, and therefore, the uncertainty ΔE in the energy of the excited state, from
uncertainty principle may be given by
ℏ
ΔE ≈ (5.111)
2Δt
where ΔE is called the width of the excited state. It may be noted that in most
cases the ground state of the system is very stable which means that Δt is large, and
therefore, the energy width of the ground state is small, i.e. ground states are sharp.
SAQ: What may be the probable reason for uncertainty in quantum mechanical
measurements?
Correspondence principle essentially demands that any new theory, under suitable
approximations/extensions must merge or dissolve into older theories. Particularly,
in the case of quantum mechanics which works for microscopic systems and is
characterised by quantum numbers, in the limit of large quantum numbers it should
give same results as are predicted by classical mechanics.
Bohr was the first scientist who gave his well-known thesis on Complementarity
and its Copenhagen Interpretation. In fact Bohr put three different aspects of his corre-
spondence principle: first is the frequency interpretation, according to which corre-
spondence principle is a statistical asymptotic agreement between one component
in the Fourier decomposition of the classical frequency and the quantum frequency
in the limit of large quantum numbers. Secondly, there is intensity interpretation
according to which it is a statistical agreement in the limit of large quantum numbers
between the quantum intensity and the classical intensity. Quantum intensity may
be understood in terms of the probability of a quantum transition while classical
310 5 Introduction to Quantum Mechanics
intensity as the square of the amplitude of one component of classical motion. The
third interpretation deals with selection rules, according to which the correspondence
principle is the statement that each allowed quantum transition between stationary
states corresponds to one harmonic component of the classical motion.
Ehrenfest’s theorem, in a way, also establishes a bridge between the quantum
mechanics and the classical physics. The theorem states that ‘The average values of
observables in quantum mechanics obey the classical mechanics’. As an example,
it is possible to write the equation of motion for the expectation values of the position
momentum operators, given below, exactly in the same way as the equation of motion
in classical physics.
( ( )) / \
d2 x̂ ∂ V (x)
= − (5.112)
dx 2 ∂x
Equation (5.112) could be derived using tools of quantum mechanics, but it looks
like Newton’s equation of motion where force and hence acceleration may be derived
from the gradient of the potential.
SAQ: What is the logic behind Correspondence principle?
Solved Examples
SE5.7 Calculate the energy difference between the 2nd excited state and the ground
state of a 1D, 2D and 3D infinite potential wells of same width a in each
direction.
π ℏ2 2
Solution: Let us denote 2ma 2 = ε0 .
The energy difference between the second excited state and the ground state
and is zero for all other values of x, calculate the value of constant B.
{+∞
ψ ∗ ψdx = 1
−∞
∫ +∞ ∗ ∫b ∗ ∫ b[ ( 2 )]
2 2
But −∞ ψ ψdx = 0 ψ ψdx = 0 Bx b − 2x dx =
∫ [ {
2 b 2 4
}]
B 0 x b − 4b x + 4x dx.
2 2 4
∫b[ { }] ∫ b [{ }]
Or B 2 [ 0 x 2 b4 − 4b2 x 2 +] 4x 4 dx = B 2 0 b4 x 2 − 4b2 x 4 + 4x 6 dx = 1.
3 2 5 7
Or B 2 b4 x3 − 4b5x + 4x7 |b|0 = 1.
{ 7 7 7
} ( 7) /
Or B 2 b3 − 4b5 + 4b7 = 1 Or B 2 11b 105
= 1. So, B = 105
11b7
.
SE5.9 In a particular molecule the position of a certain ion of mass 6.0 × 10–26 kg
can be determined to an accuracy of 1 µm. Calculate the accuracy with which
the speed of the ion may be determined.
SE5.10 Calculate the probability that an electron of mass 9.1 × 10−31 kg and of
energy 1.0 eV may tunnel through a potential barrier of height 1.5 eV and
width 0.5 nm.
( √ )
(
−19 ) − 2×0.5×10−9 (
2×9.1×10−31 0.5×1.6×10−19 )
16 × 1.6 × 10 1 1.055×10−34
T = 1− e
1.5 × 1.6 × 10−19 1.5
[√ ] [ −9 ×3.816×10−25
]
8 −1×10−9 14.56×10−50 − 1×101.055×10
or T = 2.25
e = 3.56e
1.055×10−34
= 3.56 × e−3.61 =
−34
3.56 × 0.027.
Hence transmission probability T = 0.096 = 9.6 × 10−2 .
312 5 Introduction to Quantum Mechanics
Problems
P5.1 Calculate the transmission coefficient for an electron of mass 9.1 × 10–31 kg
and kinetic energy 1.3 eV for a potential barrier of height 1.5 eV and width
0.5 nm. The value of rationalised Planck constant ℏ is 1.055 × 10–34 J s.
ANS: 0.187
d2 x
P5.2 Calculate the eigenvalue of function sin(2x) for operator  = dx2
.
ANS: −4
P5.3 A particle of mass m is confined in a one-dimensional infinite potential well
of width a. The particle at instant t is in state χ (x) characterised by χ (x) =
B{θ1 (x) + θ2 (x)}, where θ1 (x) and θ2 (x) are, respectively, the normalised
wavefunctions for the ground and third excited states of the system and B
the normalisation constant. Determine the value of B and the average value
Δ
of operators x̂ and px .
⟨ ⟩ ⟨ ⟩
Δ
ANS: B = √12 , x̂ = 2a , px = 0
P5.4 The mean life of an excited nuclear state is 4 ns, what will be the order of its
energy width?
ANS: 8.2 × 10–8 eV
P5.5 A metallic ball of 2.0 g is constrained to roll inside a glass tube of 2.0 cm
length, which is closed at both the ends. If this ball is considered similar to
a particle moving in one-dimensional infinite square well, what is the value
of quantum number n if the ball is initially given an energy of 2.0 × 10−3 J?
ANS: n = 1.77 × 1029
P5.6 Calcuate the de Broglie wavelength of a proton moving with a speed of
3 × 105 m/s. Given that the mass of a proton is 1.672 × 10–27 kg.
ANS: 1.33 × 10–12 m
d
P5.7 Show that cos 2x is not an eigenfunction of the operator dx
but is an
2
eigenfunction of operator dxd 2 . Obtain the eigen value.
ANS: −4
P5.8 Two observables B and C have corresponding operators B̂ and Ĉ. Both
operators have a common set of eigenfunctions. Show that the two operatiors
commute.
P5.9 Show that the eigen values of a hermitian operator are real.
5.10 Correspondence Principle and Ehrenfest’s Theorem 313
SA5.1 Name two examples where the classical physics failed. State some distin-
guishing features of Schrodinger, Heisenberg and Dirac picture of quantum
mechanics.
SA5.2 How the state of a system at a given instant is specified in quantum
mechanics? What physical significance may be associated with the wave-
function of a system?
SA5.3 Take an example of a suitable wavefunction to explain the essential
characteristics of a valid wavefunction.
SA5.4 Give the statements of the postulates of quantum mechanics.
SA5.5 What is an operator? Define a hermitian operator.
SA5.6 What is the utility of Schrodinger equation? Give properties of this equation.
SA5.7 Write Schrodinger’s time-independent equation and discuss under what
conditions the equation may be used.
SA5.8 Define at least four algebraic operations of operators. When do two
operators commute?
SA5.9 Two hermitian operators  and B̂ commute, what information about the
eigen values of these operators is conveyed to you?
SA5.10 A system at instant t is defined by the wavefunction ψ(r, t). An operator
 operating on the system gives three discrete and non-degenerate eigen
values a1 , a2 and a3 for a given dynamic variable that corresponds, respec-
tively, to the eigen states φ1 (r, t), φ2 (r, t) and φ3 (r, t) of the operator. Is it
possible to write ψ(r, t) in terms of φ1 (r, t), φ2 (r, t), and φ3 (r, t)? If yes,
write the expression.
SA5.11 Differentiate between the Eigen value and the expectation value of an oper-
ator. Explain the difference in the outcome of a quantum and a classical
measurement of a dynamic variable.
SA5.12 Distinguish between bound and scattering states of a potential and explain
characteristics of bound states and their wavefunctions.
SA5.13 Give reasons why a plane wave cannot represent a free particle.
SA5.14 A particle is incident on a potential barrier of height greater than the
kinetic energy of the particle, without derivation, write expressions for the
particle wavefunctions in different regions of space and give the boundary
conditions that these wavefunctions must obey.
SA5.15 Write a note on uncertainty principle.
SA5.16 Discuss the principle of correspondences and the logic behind it.
MC5.1 Operators that are their own hermitian conjugate are called
314 5 Introduction to Quantum Mechanics
(a) null operator, (b) linear operator, (c) hermitian operator (d) delta
function
ANS: (c)
MC5.2 If for an operator (− F̂) = − F̂ † , then F̂ is
(a) hermitian, (b) anti-hermitian, (c) null operator and (d) unitary operator
ANS: (b)
MC5.3 The probability for obtaining the eigen value (aj ) for a system in quantum
state ψ(r, t) and eigenfunctions ϑ j (r, t) is given by
| +∞ |2
|{ |
( ) | |
P a j = || ϑ j (→ r , t))d3 x ||
r , t)∗ (ψ(→
| |
−∞
Only when
(a) ψ(r, t) is normalised, (b) ϑ j (r, t) are normalised, (c) eigen states are
non-degenerate and (d) operator is hermitian
ANS: (a), (b), (c) and (d)
MC5.4 A particle in quantum mechanics is represented by a wave packet. The
size of the wave packet in one-dimensional space is defined by the width
∓Δx, where the intensity of the packet changes to
1
(a) e2 -times of its central value, (b) e times of central value, (c) e
times of
central value and (d) √1e times of central value
ANS: (d)
MC5.5 The ratio of the ground state energies in a 2D and 3D infinite potential
wells of equal sides is
(a) 3/5 (b) 2/3 (c) 3/2 (d) 5/3
ANS: (b)
MC5.6 Transmission coefficient for a particle of mass m and energy E through a
potential barrier of height V 0 (V 0 > E) and width ‘a’ is given as
( √ )
( ) − 2a 2m (V0 −E )
ℏ
(a) ( 1
a2 V
) (b) ( 1
2
) (c) 16E
V0
1− E
V0
e (d)
1+ 2ℏ20 1+ a2ℏ
2V
[ ( √ ) ]−1
0
( ) −
2a 2m (V0 −E )
ℏ
16E
V0
1− E
V0
e
ANS: (c)
MC5.7 A particular excited state of a system has a width of 1.0 × 10−7 eV. The
mean life of the state will be of the order of
(a) 3.3 × 10−9 s (b) 3.3 × 10−7 s (c) 3.3 × 10−5 s (d) 3.3 × 10−3 s
5.10 Correspondence Principle and Ehrenfest’s Theorem 315
ANS: (a)
MC5.8 In a classical experiment the speed v of electron was measured three times
to get experimental values as 100 m/s, 110 m/s and 120 m/s. The expec-
tation value of speed ⟨v⟩ obtained using quantum mechanical tools was
found to be ⟨v⟩ = 115 m/s. The expectation value 112 m/s should be
compared with the experimental value
(a) 100 m/s, (b) 110 m/s, (c) 120 m/s and (d) none of the experimental
values
ANS: (b)
MC5.9 If T and R, respectively, denotes the transmission and reflection coeffi-
cients for a potential barrier of height v0 and breadth a for an incident
particle of mass m and energy E ∼ V0 , the ratio R/T is given by
ma 2V0 ℏ2 2ℏ2 ma 2 V0
(a) 2V0 ℏ2
, (b) ma
, (c) ma 2 V0
and (d) 2ℏ2
ANS: (d)
MC5.10 The energy of the first excited state of a particle confined in a 1D infinite
potential well is 12 eV. The fifth excited of the system will be at energy
(a) 27 eV, (b) 48 eV, (c) 70 eV and (d) 108 eV
ANS: (d)
LA5.1 Give reasons why the state function for a free particle cannot be represented
as a plane wave. Explain how a wave packet representation may remove all
problems associated with plane wave representation. How the wave packet
is formed and what are measures for the size of the wave packet in k-space
and spatial space?
LA5.2 Derive expression for the energy levels of a one-dimensional infinite poten-
tial well and show that the eigenfunction for different energy levels shows
nodes depending on the degree of excitation.
LA5.3 A particular experiment of measuring the value of a dynamic variable was
carried out both quantum mechanically and classically. What differences do
you expect in the results obtained in two sets of experiments. What is the
expectation value in quantum experiment and with which value of classical
result you will compare it?
LA5.4 Define a potential barrier and discuss the quantum mechanical tunnelling of
a 1D potential barrier by a particle. Obtain expressions for the wavefunctions
in different region of space around the barrier and apply boundary conditions
to obtain transmission coefficient.
LA5.5 Define (a) the sum, (b) the product of two operators, (c) multiplication of an
operator by a complex number and (d) the commutation of two operators.
Also show that that anticommutator of two hermitian operators is hermitian.
316 5 Introduction to Quantum Mechanics
LA5.6 Define
[ ]the commutator of two operators and obtain the value of commutator
Δ
Objective
Systems having large number of identical particles obey laws of statistics. These
laws assume specific manifestation in case of quantum systems. The distribution
of particles in different quantum energy levels and in different energy states of the
given level and transition from one microstate to the other, etc., are all governed
by quantum distribution laws. Quantum laws of statistics will be introduced in this
chapter. It is expected that a reader will be able to apply laws of quantum statistics
to real cases after going through this chapter.
6.1 Introduction
velocities or energies, etc. in a given range, say, velocities between v and (v + Δv)
or energies between some value ∈ and (∈ + Δ∈) and how this number changes with
time. Conversely, one will like to know as to how a given amount of energy E will
get distributed into different groups of particles in the system. This information is
contained in what is called the distribution function, so there may be several types
of distribution functions like the velocity distribution function or energy distribu-
tion function, etc. for a system. Statistical mechanics or Quantum statistics is
the tool to obtain these distribution functions. Quantum statistics uses the theory
of probability, like the classical statistics, but assumes discrete, i.e. non-continuous
values for physical variables like velocity, energy, momentum, etc. Quantum statis-
tics further assumes that an assembly of identical particles or entities may follow
different kinds of statistics, like Fermi–Dirac, Bose–Einstein or Maxwell–Boltz-
mann statistics. These statistics differ from each other as to how the entities of the
system may be distributed into various energy levels and energy states in a level.
In statistical mechanics, the science of bulk matter is an incomplete and evolving
science. New ideas and concepts permit a fresh approach to old problems. With new
concepts one looks for features ignored in past and expect exiting results. Important
new concepts are: deterministic chaos, fractals, self-organised criticality (SOC),
turbulence and intermittency. These words represent large fields of study, all using
quantum statistics, which have changed how we view nature. Disordered systems,
percolation theory and fractals find applications not only in physics and engineering
but also in economics and other social sciences.
Thus having obtained the required distribution function from the quantum statis-
tics, one analyses the distribution function to obtain the value of a parameter called
the ‘partition function’. Partition function, which depends on the type of the statis-
tics obeyed by the constituent particles of the system, is the most important parameter
from the point of view of quantum statistics. Considerable efforts are put in obtaining
an appropriate partition function for a given system. Partition function, which is like
the heart of quantum thermodynamics, may be used to obtain physical observables
of the system, i.e. the quantities like temperature, pressure, volume, specific heat
capacities, entropy, etc. that may be measured experimentally. In this chapter we will
see how one can use statistical mechanics (or quantum statistics) to get distribution
functions and also how these distribution functions can be further analysed using
the tools of quantum thermodynamics to yield the all important partition functions
which in turn provide values of the required system observables.
Statistical mechanics may be applied to solve problems related to real systems that
contain large number of identical entities or particles. The formalism of statistical
physics may be developed both for the classical systems andfor quantum systems.
6.3 Energy Levels, Energy States, Degeneracy and Occupation Number 319
The resulting energy distribution and calculating the values of physical observables
is simpler in the classical case. However, the formulation of the method is more
transparent in the quantum mechanical formalism. In addition, the absolute value
of the entropy without any undermined constant and the behaviour of the entropy
when absolute temperature approaches zero, may be obtained only in the quantum
mechanical treatment. In the following sections we will see how quantum statistics
may be applied to an assembly of non-interacting (or free) particles.
In Chap. 5 it was shown that a particle confined in an infinite potential of width ‘a’
has discrete set of non-uniform energy levels with energies,
π 2 2
∈ j = n 2j (6.1)
2ma 2
If V denotes the volume of the space in which the particle is confined, then V = a 3
and Eq. (6.1) may be written as
2 (2π )2 − 2
∈ j = n 2j V 3 (6.2)
8m
h
In Eq. (6.2), is rationalised Planck’s constant = 2π = 1.05457 × 10−34 J s.
The integer n j is made up of three independent integers, n x , n y and n z , called the
quantum numbers, such that
n 2j = n 2x + n 2y + n 2Z (6.3)
Each of these n x , n y and n z can have non-zero integer values like 1, 2, 3, …, etc.
So far terms ‘energy level’ and ‘energy state’ have been used interchangeably,
as in most cases energy levels were non-degenerate; however, now onwards energy
level and energy state will have definite meaning as specified here.
The value of n2j defines an energy level of the system. Each energy level may
have one or more energy states. A set of the different values of quantum numbers
n x , n y and n z subject to the condition given by Eq. (6.3) defines the number of
states of the given energy level.
Since the minimum value that n x , n y and n z may have is 1, the minimum value of
n 2j = 12 + 12 + 12 = 3 and, therefore, the lowest energy level has the energy, ∈1 ,
given as,
2 (2π )2 − 2
∈1 = 3 V 3 (6.4)
8m
320 6 Quantum Statistics
It may be observed that only one set of n x , n y and n z can give the value 3 to n 2j .
In the language of quantum mechanics it is said that the level ∈1 has only one energy
state. A level that has only one energy state is called a non-degenerate level. The
degeneracy of a level is denoted by g j and is equal to the number of energy states in
the level. The degeneracy g1 of level at energy ∈1 is 1, i.e. g1 = 1.
The next level will be one in which one of the quantum numbers n x , n y or n z has
the value 2 and the other two have values 1. This gives rise to three different sets of
quantum numbers, giving the same value of n 2j = 22 + 12 + 12 = 6. These three sets
are
(n x = 1, n y = 1, n z = 2); (n x = 1, n y = 2, n z = 1) and (n x = 2, n y = 1, n z = 1)
All the three different energy states mentioned above have the same energy ∈2 =
6 (2π )2 − 23
2
8m
V . The level with energy ∈2 has three states and the degeneracy of this
level g2 = 3.
Let us consider the level with energy ∈ = 14 (2π ) 2 2
V − 3 . The six different sets of
2
8m
n x , n y and n z shown in Table 6.1 give the same value of n 2j = 14 and hence the same
energy.
This level, therefore, has sixfold degeneracy or g = 6 for this level. In general the
energy levels are non-equidistant and have different folds of degeneracy.
In particular it may be observed that the three-dimensional energy expression
(6.2) is an equation of sphere of radius R,
2 (2π )2 − 2
∈ j = n 2j V 3
8m
or
8m∈ j 2
n 2j = n 2x + n 2y + n 2z = 2 V 3 = R2
(2π )2
The sphere of radius R is shown in Fig. 6.1. It may be observed in this figure that
the positive non-zero integer values of nx , ny and nz lie in 1/8 quadrant of the sphere.
It means that for large values of energies the density of non-zero positive nx , ny and
nz points essentially fill the volume of the 1/8 quadrant.
6.3 Energy Levels, Energy States, Degeneracy and Occupation Number 321
One may now treat R or energy ∈ as continuous variable and may obtain the
number of lattice points consistent with energy ≤ ∈ j , which is essentially the volume
of the 1/8 of the sphere. The number of energy states
3/2
1 4 1 2 3/2 1 8m∈ j 2
G(∈) = πR = π R
3
= π 2 V 3 (6.5)
8 3 6 6 (2π )2
In order to have a feel of the magnitude of the degeneracy G(∈, Δ∈), one may
calculate the degeneracy using Eq. (6.6) for molecules/atom moving in a room of size
10 m × 10 m × 10 m at temperature 300 K, taking m ≈ 10−25 kg and Δ∈ j = 0.01∈ j ,
it comes out to be of the order of 1030 .
A quantum mechanical system of N identical non-interacting particles confined
in a given volume of space has many energy levels with each level having a certain
fold of degeneracy. Depending on the properties of the particles, each energy level
accommodates a certain number of particles. If the jth level contains N j particles,
then N j is called the occupation number of the level. The occupation numbers and
the energy of different levels satisfy the following conditions:
N j = N and ∈j Nj = E (6.7)
322 6 Quantum Statistics
Fig. 6.2 Schematic representation of energy levels, energy states, occupation no. and degeneracy
of a hypothetical system
Here, E is the total energy of all particles and N their total number.
Figure 6.2 is a schematic representation of the energy levels, energy states, folds
of degeneracy and the occupation numbers of an imaginary assembly. The energy
levels in the figure are represented by horizontal lines, their energies are written on
the left-side vertical scale, and energy states in each level are shown by pink brackets;
the number of energy states in a given level gives the degeneracy of the level. Particles
in different energy states are shown by round dots. Total number of particles in a
level gives the occupation number of the level. It may be observed in the figure that
in some levels there are empty energy states that contain no particle.
SAQ: What creates degeneracy in an energy level?
are in constant motion; therefore, it is not possible to put a mark on one particular
molecule and identify it at all times. Hence, molecules of a gas are indistinguishable
because they are non-localised. On the other hand, in a crystalline solid, atoms or
molecules are identical, but they may be differentiated or distinguished from one
another on the basis of their location in the crystalline lattice. As such, the atoms or
molecules in a crystalline solid are distinguishable as they are localised. In general
non-localised particles are indistinguishable, while localised entities are distinguish-
able because of their fixed location. Atoms/molecules of a paramagnetic salt if put
in an external magnetic field align either parallel or antiparallel to the applied field
and, therefore, may be distinguished from each other through their orientation in
the external field. Similarly, nucleons (neutrons and protons) in a nucleus may be
distinguished from each other on the basis of the orientation of their spins and so
they are distinguishable. The fact that particles of an assembly are indistinguishable
or distinguishable, as we will see, plays an important role in quantum statistics.
6.3.2 Macrostate
Fig. 6.3 Particle distribution in four microstates with different sets of occupation numbers
6.3.3 Microstates
As already mentioned, the energy and occupation numbers of different energy levels
as well as the degeneracy (or the number of states associated with a given energy
level), etc. are all decided by the nature of particles in a given assembly. Further, the
number of Macrostates and their configurations for a given assembly are characterised
only by the occupation numbers of different energy levels, and Macrostates do not
depend on the degeneracy of energy levels.
In order to understand the concept of microstates, let us consider the Macrostate
(a) of Fig. 6.3 that has all the five particles in level-3 and is characterised as (N 1 = 0,
N 2 = 0, N 3 = 5 and N 4 = 0). Let us assume that the degeneracy of level-3 is three;
i.e. g3 = 3. Now the five particles in level-3 may be accommodated in three energy
states, shown by coloured brackets in Fig. 6.4, in different ways. If it is assumed
that there is no restriction on the number of particles that may be accommodated
in a state, then six different configurations of particles in level-3 may be shown by
six figures in the lower part of the figure. Each of these configuration is called a
microstate of the Macrostate (N 1 = 0, N 2 = 0, N 3 = 5, N 4 = 0) shown at the top.
It may be noted that the six microstates shown in the figure are not the only
microstates associated with the given Macrostate, there may be many more, and the
actual number will depend on the rules that govern the distribution of particles in
different energy states. A microstate of the system is defined not only by the number
of particles in a level but also by the number of particles in each energy state of
6.3 Energy Levels, Energy States, Degeneracy and Occupation Number 325
each level. Since shifting of a single particle from one energy state of a given level
to another energy state of the same level results in a new microstate, the possible
number of microstates associated with a given Macrostate is very large.
In conclusion it may be said that an assembly of identical non-interacting particles
may have several Macrostates and that each Macrostate may have a very large number
of microstates associated with it.
Ω= Wk
k
328 6 Quantum Statistics
As shown in Fig. 6.4, the microscopic structure of the system changes almost
continuously with time as the system moves through different Macrostates. Since
Macrostates live for short times, it is not possible to measure the physically important
system properties in a particular Macrostate. Actually whenever some measurement
is done the system passes through a large number of Macrostates during the process
of measurement. Therefore, the measured physical quantity is the average value over
a large number of Macrostates of the system. A change in the Macrostate means
change in the occupation numbers of the levels. The measured value of the physical
quantity, therefore, depends on the average values of occupation numbers of different
energy levels of the system. As such values of occupation numbers for different levels
averaged over a large number of Macrostates are of paramount importance from the
point of experimental determination of system properties.
Quantum statistics provides a method to compute the average occupation number
of some level, say level j, of the Macrostate k, which is denoted by N jk . It is quite
obvious that the value of N jk will depend on the probability Wk of the Macrostate k.
The value of Wk depends on the nature of the particles of the system. Quantum
mechanics classifies particles according to the statistical distribution law that is
followed by a group of large number of identical particles. There are three different
statistical distribution laws, namely Bose–Einstein, Fermi–Dirac and Maxwell–
Boltzmann distribution law, one of which is followed by a group of large number
of identical particles. Each distribution law gives a different value of Wk . We first
calculate the value of Wk for systems that follow these three distribution laws.
In Eq. (6.8) curly brackets represent states and lower case letters the particles.
The sequence shown in Eq. (6.8) is made up of gj numbers (representing states)
and N j particles that mean a total number of elements in the sequence are (gj +
N j ). Any sequence of these (gj + N j ) elements of the type indicated in Eq. (6.8)
gives a way of distribution of particles in different states of the level. But there is
one condition on the valid sequence that represents particle distribution is that the
sequence must start with a number representing the state. A sequence of the type
{ab(2)}{(1) f gl} . . . g j lm is not valid as it starts with a letter and not numbers.
If the sequence starts with one out of the g j numbers, the number of remaining
elements becomes g j + N j − 1 . The number of different ways in which these
remaining
elements
g j + N j − 1 may be arranged is factorial gj + Nj − 1 ,
i.e. g j + N j − 1 !. To compute the total numberof valid sequences
it is required
to multiply thenumber of different
ways in which g j + N j − 1 elements can be
arranged (i.e. g j + N j − 1 !) by g j , any one of which may be the first element
of the valid sequence. Thus the total number of different ways in which N j particles
may be distributed in g j states is,
N total = g j N j + g j − 1 ! (6.9)
thus observe that in N total some; otherwise identical sequences have been counted as
different sequences. This has happened because we assigned distinguishable letters
a, b, c, etc. to the undistinguishable particles. It is, therefore, required to correct N total
for this over counting. The number of particles is N j , and the number of different
ways in which these particles can be arranged is N j!. Hence, correction for this over
counting may be applied by dividing N total by N j !.
Similarly, sequences like
{(1)ab}{(2)}{(3)cde} . . . g j lm and
{(2)}{(3)cde} . . . {(1)ab} . . . g j lm have also been counted as different sequences
in N total . But actually these are not two different sequences. It may be noted that
states are distinguishable, which means that number 1, 2, 3, … are different but
at which location in the sequence a particular number occurs is not important. As
shown in the two sequences above the state {(1)ab} appears in the first location of
the sequence or it appears at any other location does not matter so long the number
of particles in the state remains same. As such, the two sequences shown above
refer to the same distribution. There are g j numbers, and the possible ways in which
they may be arranged are g j !. Correction for this over counting may be applied by
dividing N total by g j !
Finally, the corrected number of different ways in which N j indistinguishable
particles may be distributed in g j states or the number of different distributions for
the jth level ω j is,
N total gj gj + Nj − 1 ! gj + Nj − 1 !
ωj = total
Ncorrected = = = (6.10)
(g j !) N j ! (g j !) N j ! gj − 1 ! Nj!
Before proceeding further let us check the correctness of Eq. (6.10). For simplicity
we assume that there are only three particles (N j = 3) and only 3 states (g j =
3); then according to Eq. (6.10), the number of different ways in which these 3
indistinguishable particles may be distributed in 3 states is,
5!
ω(3, 3) = = 10
(2!)(3!)
These ten different ways of particle distribution are shown in Fig. 6.7, where dots
represent particles.
Application of formula given by Eq. (6.10) for calculating the number of ways
of particle distribution to the case of a non-degenerate level needs a mention. For
a non-degenerate level g j = 1 and, therefore, in the denominator of expression
(g j +N j −1)! (1+N j −1)!
{(g j −1)!}( N j !) one gets {0!}( N j !) = 0! . Now for a non-degenerate level there is only
1
one way of distributing indistinguishable particles, that is all particles are in the same
state. Hence, in order to match formula given by Eq. (6.10) with the experimental
fact we should use the convention that 0! = 1. This convention will also make the
formula valid for the state which is empty and has no particle. For an empty state N j
= 0 and wempty = (g −1j !(0!)
0+g −1)!
= 0!1 = 1.
( j )
6.5 The Bose–Einstein Statistical Distribution 331
Fig. 6.7 Ten different ways of distributing three particles in three different states
Equation (6.10) gives the number of possible ways in which particles may be
distributed in any level. Suppose in one of the levels particles are distributed in one
of the ways given by Eq. (6.10). Then in each of the remaining levels particles may
be distributed according to any one of the distribution out of those specified by Eq.
(6.10). Therefore, the total number of possible distributions or the statistical (ther-
modynamic) probability of any Macrostate in Bose–Einstein distribution is given
by,
gj + Nj − 1 !
W Bose-Ein
= (6.11)
j
gj − 1 ! Nj!
The symbol j f ( j ) means the forming of products of each term of function f ( j)
for each value of j.
Suppose there is a system of three energy levels, such that in level-1 there are 2
particles and 2 states, in level-2: 3 particles and 3 states and in level-3: 1 particle
and 3 states. In other words the occupation numbers and the folds of degeneracy of
level-1, level-2 and level-3 are, respectively, N 1 = 2, g1 = 2; N 2 = 3, g2 = 3; and
N 3 = 1, g3 = 3. Once the occupation numbers of the levels have been fixed it means
that the Macrostate is fixed. The thermodynamic probability of this Macrostate for
332 6 Quantum Statistics
The distribution shown above has no particle in first state, a particle in state-2, a
particle in state-3, no particle in state-4… and one particle in the last state g j .
The problem of particle distribution may be looked in the following way:
Suppose initially all the g j states in level j are empty. We now take one particle
(out of the total N j particles) and put it in one of the states. This first particle may be
put in any of the gj states. That means that for the first particle there are gj different
ways of filling the states. If n i denotes the number of ways in which the ith particle
can be filled, then n1 = g j .After putting the first particle in any state, the number
of
empty states left is g j − 1 . Second particle may now be put in one of the g j − 1
states. It means that the number of different
ways
n 2 in which second particle can
be filled in remaining states is n 2 = g j − 1 . Continuing the same argument, the
number of ways in which the third, the fourth and so on up to nth particle filling will
6.6 The Fermi–Dirac Statistical Distribution 333
be, respectively, n 3 = g j − 2 , n 4 = g j − 3 , . . . n n = g j −
n + 1 . Thenumber
of ways in which the last N j th particle can be filled is n N j = g j − N j + 1 .
The total number of ways of distributing particles in all states n total = n 1 × n 2 ×
n3 × . . . n N j .
Or
n total = g j × g j − 1 × g j − 2 × . . . g j − N j + 1
gj!
n total = (6.12)
gj − Nj !
n total gj!
ωj = = (6.13)
Nj! Nj! gj − Nj !
Once again, to test the correctness of Eq. (6.13) we calculate the number of
different ways in which three particles can be distributed in three energy states of
level j, when particles follow Fermi–Dirac statistics. In this case, g j = 3 and N j = 3.
Substituting these values in Eq. (6.13) one gets,
3!
ωj = = 1.
3!0!
It may be noticed that if particles obey Bose–Einstein statistics then three particles
may be distributed in ten different ways in three energy levels. On the other hand
they may have only one way of distribution if they obey Fermi–Dirac statistics.
Now for any one of the ω j arrangement of particles in a given level, there are
ω j ways of distribution of particles in any other level. Therefore, the number of
ways in which fixed number of particles in each level may be distributed in different
energy states of each level, that is the thermodynamic probability of a Macrostate in
Fermi–Dirac statistics is,
334 6 Quantum Statistics
gj!
W Fermi-Dirac = (6.14)
j
Nj! gj − Nj !
SAQ: What are the main points of difference between Bose–Einstein and Fermi–
Dirac statistics?
N! N!
N level = = (6.15)
[(N1 !)(N2 !)(N3 !) . . .] j Nj!
Suppose there are two independent systems A and B with entropies S A and S B . Let
the thermodynamic probabilities of the two systems be Ω A and Ω B . It is known that
entropies are additive; therefore, the total entropy S total of the two systems put together
is
Stotal = S A + S B (6.18)
Ωtotal = Ω A .Ω B (6.19)
S = f (Ω) (6.20)
where, f represents some function. Our task is to explore the nature of function f .
It follows from Eqs. 6.18 and 6.19 that
f (Ω A ) + f (Ω B ) = f (Ω A Ω B ) (6.21)
d f (Ω A ) d f (Ω A Ω B )
+0=
dΩ A dΩ A
or
d f (Ω A ) d f (Ω A Ω B ) d(Ω A Ω B ) d f (Ω A Ω B )
= = ΩB
dΩ A d(Ω A Ω B ) dΩ A d(Ω A Ω B )
or
d f (Ω A ) d f (Ω A Ω B )
= ΩB (6.22)
dΩ A d(Ω A Ω B )
d f (Ω B ) d f (Ω A Ω B )
= ΩA (6.23)
dΩ B d(Ω A Ω B )
When Eqs. (6.22) and (6.23) are multiplied, respectively, by Ω A and Ω B one gets,
d f (Ω A ) d f (Ω B )
ΩA = ΩB (6.24)
dΩ A dΩ B
The two sides of Eq. (6.24) contain functions of two independent variables, and
hence this equation will be true only when the two sides are equal to some constant.
Let this constant be denoted by k B . So that
338 6 Quantum Statistics
d f (Ω A ) d f (Ω B ) d f (Ω)
ΩA = ΩB = ··· = Ω = kB
dΩ A dΩ B dΩ
or
dΩ
d f (Ω) = k B
Ω
or
f (Ω) = k B ln Ω
or
S(Ω) = k B ln Ω (6.25)
The numerical value of constant k B , called Boltzmann constant, has been deter-
mined by actually matching the value of entropy with ln Ω and has been found to
be,
R(Gas constant)
kB = = 1.38062 × 10−23 J K−1 (6.26)
A(avogadro’s number)
SAQ: What is entropy? What property of the system is measured by the entropy of
the system?
It has already been discussed that the average occupation numbers play an important
role so far as the system observables are concerned. Since even very small physical
systems at room temperature contain very large number of particles and available
energy levels, it is almost impossible to calculate average occupation numbers by
counting levels and calculating possible ways of particle distributions. The average
occupation numbers are, therefore, determined theoretically, using the distribution
laws of quantum statistics, and expressions for entropy change are borrowed from
classical thermodynamics. The expression for average occupation number per energy
state W g
is called the distribution function. It is possible to derive distribution
functions for different statistical distributions, but that is beyond the scope of the
present discussion. Significance of distribution functions lies in the fact that they
may be directly related to the observables, like temperature, pressure, volume, etc.
of s system in equilibrium.
Distribution functions for some important distributions are given here.
Distribution function for Bose–Einstein distribution
6.9 The Distribution Function 339
NJ 1
= μ−∈ j (6.27)
gj −
e T kB − 1
NJ 1
= (6.28)
gj (∈ j −μ)
1 + e kB T
NJ 1 (μ−∈ j )
= ∈ −μ = e k B T (6.29)
gj (j )
e kB T
Equation (6.29) tells that the average number of particles per state in every level
exponentially decreases with the energy ∈ j of the level, for a system of indistin-
guishable particles obeying classical statistics. Further, the decrease of the number
of particles per state is faster at low temperature. The same result will be found in
case of the systems that follow Maxwell–Boltzmann distribution, though particles
are distinguishable in this distribution.
Distribution function for Maxwell–Boltzmann distribution
Or
Nm μ−∈m
= Ne T kB
gm
Above equation holds for any level of the system, and generalising it one may
write
μ−∈
NJ j
= Ne T kB
(6.30)
gj
Solved Examples
the number of Macrostates of the system if the total energy of all particles
is 10ε.
Solution Macrostates of an assembly are defined by the different sets of possible
occupation numbers subject to the conditions: i Ni = N (no. of particles) =
5 and i Ni ∈i = (Total energyE) = 10∈
These conditions restrict the number of Macrostate to 5 as shown below.
SE6.2 Assuming that level-4, level-3, level-2 and level-1 in the above problem
SE6.1 have, respectively, 4, 5, 2, and 1-folds of degeneracy, and the particles
obey Fermi–Dirac statistics calculate the number of microstates associated
with each of the five Macrostates. In which Macrostate the system will stay
for the longest time?
Solution It is given that g4 = 4, g3 = 5, g2 = 2 and g1 = 1. Also, it is given
that the particles obey Fermi–Dirac statistics. The number of microstates in case of
Fermi–Dirac distribution is given by the relation,
g !
W Fermi-Dirac = j { N !}(gj −N ! where symbols have their usual meaning.
j j j)
Let us calculate the microstates of Macrostate (N 1 = 1, N 2 = 1, N 3 = 0 and N 4
= 3)
1! 2! 4!
W Fermi-Dirac = × × =8
{1!}(0)! {1!}(1!) {3!}(4 − 3)!
g Nj j
W Maxwell-Boltzmann
= N!
j
Nj!
Problems
P6.1 A system of identical non-interacting particles has 5 particles with total energy
5E, distributed over three energy level of energies 0, E and 2E. Show that the
system may have three Macrostates and designate these Macrostates in terms
of their occupation numbers. Assuming that each level is threefold degen-
erate, calculate the number of microstates associated with the Macrostate
with occupation numbers [N (E=0) = 2, N(E=E) = 1, N (E=2E) = 2} when parti-
cles obey (A) Bose–Einstein statistics, (B) Fermi–Dirac statistics and (C)
Maxwell–Boltzmann statistics.
ANS: (A) 108 (B) 27 (C) 7290
342 6 Quantum Statistics
SA6.14 What does the distribution function for a statistical distribution tells? What
is its physical significance?
MC6.1 Sum of average occupation numbers over all energy levels of a system is
equal to
(a) Total number of levels (b) total number of particles (c) average value
of the degeneracy of levels (d) average value of quantum thermodynamic
probability
ANS: (b)
MC6.2 The entropy of an assemble and its quantum thermodynamic probability
have functional relationship that is
(a) Linear (b) binomial (c) exponential (d) logarithmic
ANS: (d)
MC6.3 Experimental values of system observables of an assembly depend
strongly on
(a) Occupation number of the level with highest energy (b) occupation
number of the level with least energy (c) average occupation numbers of
all levels (d) temperature of the assembly
ANS: (c)
MC6.4 A system has four particles which are distributed in three energy levels of
energies, 0, E and 2E. If the total energy of the system is 5E, the number
of Macrostates of the system is
(a) 4 (b) 3 (c) 2 (d) 1
ANS: (c)
MC6.5 In question (MC6.4), if each level has threefold degeneracy and the parti-
cles obey Maxwell–Boltzmann statistics then the maximum number of
microstates associated with a Macrostate will be
(a) 100 (b) 238 (c) 324 (d) 972
ANS: (d)
MC6.6 Which of the following expressions represent the quantum probability of
an assembly of particles obeying Maxwell–Boltzmann distribution law?
Symbols have their usual meaning.
6.9 The Distribution Function 345
ANS: (c)
MC6.7 A typical Macrostate of a system of four identical particles has two parti-
cles each in two levels of twofold degeneracy each. The quantum thermo-
dynamic probability of the Macrostate is 1 when particles follow statistical
distribution law A and 96 when B. Statistical distribution laws A and B
are respectively
(a) Bose–Einstein, Maxwell–Boltzmann (b) Fermi–Dirac, Maxwell–
Boltzmann (c) Maxwell– Boltzmann, Bose–Einstein (d) Fermi–Dirac,
Bose–Einstein
ANS: (b)
MC6.8 Which of the following expression represent Fermi–Dirac distribution
function?
(g j +N j −1)! gj! g Nj j
(a) j {(g j −1)!}( N j !) (b) j { N j !}(g j −N j )! (c)N ! Nr
j N j ! (d) gr =
1
μ−∈r
−
e T kB
−1
ANS: (b)
MC6.9 Four identical particles equally distributed in two twofold degenerate
levels have 96 microstates. Nature of the particles and the statistics they
follow are
(a) Distinguishable, Bose–Einstein statistics (b) indistinguishable, Fermi–
Dirac statistics (c) distinguishable, Maxwell–Boltzmann statistics (d)
indistinguishable, classical statistics
ANS: (c)
MC6.10 What happens to a level of energy ∈ of a system when the volume of the
system is doubled?
ANS: (d)
Objective
Basics of optical fiber communication for terrestrial transfer of information are
discussed in this chapter. It is expected that after reading this chapter the reader
will be able to understand why optical communication is better and faster than both
the wireless and metallic cable transmissions. He will also appreciate the technique
of optical fiber transmission and its requirements.
7.1 Introduction
The most common method of transferring information from one place to another
place is either using wireless transmission or transmission using metal core cables.
In both these methods, though information in analogue form as well in digital signal
form may be transferred but digital mode is preferred on account of its transmis-
sion reliability and less affect from surrounding environmental conditions. In digital
transmission, the required information is first converted into digital electronic pulses,
mostly voltage pulses, which are then transmitted through metal core cables for terres-
trial transmission or are made to modulate a high-frequency carrier wave for wireless
transmission.
In 1870, John Tyndall demonstrated that short bursts of light pulses may also be
used in place of digital electronic pulses to transmit information. He further showed
that these light pulses may be sent through cables made of very fine glass fiber to
long distances without much loss. However, actual use of optical fiber communica-
tion started only in 1927. At present optical fiber communication is fast replacing
conventional metal core cable transmission for terrestrial communication. Optical
fiber communication at present is also being used for medical purposes, like in
endoscopes, for computer networking and in civil and military avionics.
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 347
R. Prasad, Physics and Technology for Engineers,
https://doi.org/10.1007/978-3-031-32084-2_7
348 7 Optical Fiber Communication
For terrestrial transmission, optical fiber communication using light signals may be
compared with communication done using digitalised electronic signal via copper
core coaxial cables. Some of the important advantages of optical fiber communication
are listed below.
1. Increased information carrying capacity (large bandwidth): Optical fiber
provides high signal bandwidth resulting in significantly greater information
carrying capacity. In a way bandwidth may be compared with the diameter of
a pipe; if some fluid is transported through a pipe, pipe with bigger diameter
will carry more fluid. Moreover, if fluid is filled in a container, the container will
get filled faster with the pipe of larger diameter. Similarly, an optical fiber has
the capacity of holding and down (or up) loading of significantly larger amount
of data in comparison with a metal core coaxial cable. Optical fibers may be
classified as single mode (SM) and multimode (MM) types. Typical bandwidth
for (MM) type fiber is between 200 and 600 MHz-km, and it is more than 10 GHz
for (SM) fiber. The bandwidth for metal core cables ranges from 10 to 25 MHz-km
only.
2. Immunity from electromagnetic interference: In metal core cable transmis-
sion signal in the form of electric/electronic voltage pulse is used. Electronic
pulses in nearby cables interfere with each other (referred as cross-talking).
Electronic pulses also get affected by stray electromagnetic fields, particularly
high-frequency radio waves. No such interference of stray electromagnetic fields
is felt by the light pulses travelling through optical fibers. Therefore, transmission
is secure and faithful.
3. Low loss of pulse intensity: Designing of optical fiber is such that a light pulse
propagated through the fiber does not lose intensity or power, and this is achieved
by guiding the pulse through the fiber using total internal reflection. On the other
hand, electrical pulses inherently radiate energy and lose intensity as they travel
through metal core cable. No significant loss in light pulse intensity allows the
use of optical fiber cables of much higher lengths before a repeater station (an
intermediate station where pulses are re-strengthen) is installed. In comparison,
more repeater stations after shorter cable lengths are required for electrical signal
transmission using metal core cables. As a result of recent technological advances
in fabricating optical fiber, light can be guided through 1 km of fiber with an
intensity loss of as low as 0.16 dB (≈ 3.6%).
4. Reduction in size, weight and cost: Since optical fiber is made of silica glass,
optical fiber cables are light weight, small in diameter and cost substantially less
than a coaxial copper core cable. Normally, the diameter of an optical fiber is 1/
8 of the diameter of coaxial copper cable.
5. Enhanced security: Since leakage of light pulse from fiber cable is totally absent
and since light pulse propagated through fiber does not radiate, it is almost
impossible to detect the presence of underground fiber cable by scanning at
overground. Hence these fiber cables are very secure. In comparison, electronic
7.3 Basics of Optical Fiber Communication 349
pulse travelling through coaxial cable radiates electromagnetic energy that may
be detected by overground instruments, and their location may be detected rather
easily making them unsecure.
6. No grounding, no spark hazard: Since electric voltages are neither transmitted
nor develop during light pulse transmission through fiber cable, problems associ-
ated with sparking and with proper grounding do not arise in optical fiber trans-
mission. Optical fiber transmission is also immune to the potential difference
between the transmitting and receiving stations.
SAQ: In spite of so many advantages of optical fiber communication, there is one
big limitation. Can you point out the limitation?
Optical fibers are mostly made from fused silica glass, as thin as human hair, and
designed to propagate light waves along their length using the principle of total
internal reflection. Light wave entering from one end of the fiber undergoes successive
total internal reflections from side walls and travels down the length of the fiber along
a zigzag path. At each total internal reflection almost all energy of the light wave is
reflected back into the fiber; however, only a negligibly small fraction of light pulse
intensity or power may escape from the side wall. As such the incident light wave
propagates through the fiber almost undiminished in power up to long fiber lengths.
Construction details of a typical optical fiber are shown in Fig. 7.1. A typical fiber
has a central core, generally of fused silica glass, of diameter ≈ 50 µm which is
surrounded by cladding. The refractive index ‘n’ of core material is always greater
than the refractive index n1 of cladding material. The overall diameter of cladding is
of the order of 125–200 µm. Total internal reflection of the light wave incident on
the fiber takes place at the boundary of core of higher refractive index and cladding
of lower refractive index. Silicon coating is provided over cladding, which improves
the quality of light transmission through the core. A tight-fitting buffer jacket,
generally made of plastic, surrounds Silicon coating. The main purpose of buffer
jacket is to protect the core and other coverings from absorbing moisture. Moisture
(–OH molecule) has very high absorption coefficient for light waves, and therefore,
attempt is made to totally eliminate the presence of moisture from the fiber assembly.
Buffer jacket is surrounded by a strength member, which provides mechanical
strength to the fiber cable. The whole assembly is covered by an outer jacket made
of polyurethane. This arrangement of different layers of shielding ensures that the
fiber cable is not damaged during pulling, stretching, bending, rolling, etc.
SAQ: What is the purpose of buffer jacket and strength member in a fiber cable?
350 7 Optical Fiber Communication
Optical materials that are used in the fabrication of optical fiber must have following
properties:
(a) Material of the core and cladding must be transparent for the light wave lengths
used for transmitting optical signals in form of light pulses.
(b) Materials used for fabricating core and cladding should be such that their refrac-
tive indices may be manipulated by adding some impurities. As a matter of fact
the refractive index of core material should be slightly higher than that of the
cladding, which may be done by adding some impurities in core fiber material
or cladding material.
(c) Materials used for making both the core and the cladding must be such that they
may be drawn in long, very thin and flexible fibers.
Both glass (silica or silicate) and transparent plastic are good candidates for core
and cladding materials; glass has the edge of transmitting optical signals without
much attenuation to long fiber lengths as compared to plastic, while plastic has the
advantage of more mechanical strength as compared to glass. Therefore, depending
on the requirement, optical fiber cables of both glass and plastics are being used for
transmitting information in form of light signals.
In most glass fibers, silica with refractive index 1.458 (for light of wavelength
850 nm) is used as the basic material. Core material of higher refractive index may
be produced from this basic silica by adding small amounts of impurities like P2 O5 ,
GeO2 or B2 O3 , while silica itself may be used as cladding.
A new class of glasses, called ‘halide glasses’, has been recently developed which
has halide group (fluorine, chlorine, bromine, etc.) anions imbedded in them. These
glasses are found to transmit optical signals of infrared light with much smaller
loss over long distances. These glasses are used for core material, while cladding
material of lower refractive index is fabricated by replacing zirconium fluoride (ZrF4 )
by Hafnium fluoride (HaF4 ) in halide glasses.
Optical fiber cables with glass core and cladding made of plastic (Silicon resins)
have also been used for long-term applications. A popular cladding made of plastic
7.3 Basics of Optical Fiber Communication 351
polymer is ethylene propylene with refractive index 1.338 (for IR wavelength 850 nm)
which is often used to cover the plastic core.
Plastic core and plastic cladding fiber cables are also in use, though they generally
have higher loss of signal strength over long transmission lengths. Methyl methacry-
lates (n = 1.49) cladding and polystyrenes core (n = 1.6) fiber cables are used for
their durability and rigidness.
Even in the best optical fiber light is gradually attenuated as it travels through the fiber.
The linear attenuation coefficient or attenuation value is expressed in units of dB/km,
i.e. decibel per kilometre. Attenuation coefficient depends on the wavelength of light
used, and out of several causes of attenuation one important cause is the absorption
of optical signal light in water molecules (OH), that are invariably present in fiber
cables. Water molecules are imbedded in fused silica glass, being absorbed at the
time of manufacturing.
Linear attenuation as a function of wavelength of light is shown in Fig. 7.2. Three
natural dips in attenuation coefficient occur for wavelengths of 850 nm, 1300 nm
and 1550 nm; all the three wavelengths are in infrared region (IR) of light spectrum.
Light pulses of these three wavelengths are, therefore, often used for optical fiber
transmission. A big advantage of optical fiber is that three different data sets, each
coded in one these three wavelengths may be simultaneously transmitted by the
same fiber without any interference from each other. Simultaneous transmission of
multiple data sets by the same carrier is called multiplexing.
Other causes of signal attenuation will be discussed later in this section.
SAQ: What happens to the energy contained in a light photon when it is absorbed
by some molecule, like (OH) molecule?
Transmission of light wave through optical fiber without substantial loss of power has
been made possible by the principle of total internal reflection. Light waves may travel
through vacuum with characteristic high speed of 299,792,458 m/s ≈ 3 × 108 m/s,
denoted by ‘c’. However, in different mediums light travel with different speeds,
all smaller than ‘c’ the speed of light in vacuum. For example, the speed of light
in diamond is only 1.23881181 × 108 m/s and in pure water only 2.25407883 ×
108 m/s. The ratio (speed of light in vacuum/speed of light in medium) is called the
refractive index of the medium and is often denoted by either μ or n. Refractive index
of diamond is, therefore, 2.42 and of water 1.33. A medium that has a larger value
of refractive index as compared to another medium is called denser medium, and
352 7 Optical Fiber Communication
light travels slower in a denser medium. The phenomenon of total internal reflection
occurs when light travels from a denser medium to a less dense (or rarer) medium.
Figure 7.3 shows four different situations of the passage of a light ray from denser
to a rarer medium. As shown in Fig. 7.2a, a ray of light in denser medium when
incident at the boundary of denser to rarer medium refracts in to the rarer medium.
The refracted ray shifts away from the normal; i.e. the angle of refraction in rarer
medium is larger than angle of incidence in denser medium. Further the angle of
refraction in rarer medium increases with the increase of the angle of incidence in
denser medium (Fig. 7.3b). On increasing the angle of incidence in denser medium
a situation is reached such that for angle of incidence of θC (in denser medium)
the refracted ray travels along the boundary separating the two media, as shown in
Fig. 7.3c. The angle of incidence θC for which the refracted ray travels along the
boundary is called the critical angle. On further increasing the angle of incidence
to values larger than the critical angle θC (Fig. 7.3d), the incident light ray does not
travel in the rarer medium, and instead it is reflected back in the denser medium. The
boundary separating the two media works as a mirror for incident angles larger than
critical angle θC (in denser medium). This phenomenon of reflection of light back
into the denser medium at the boundary of the two media (for incidence angle > θC )
is called total internal reflection. From Snell’s law it follows that
sin θc n1 n1 n
or θc = sin−1
1
= or sin θc =
sin 90 n n n
In ideal situation of total internal reflection no part of the energy contained in
the incident light ray is either transmitted to the rarer medium or is absorbed at the
7.3 Basics of Optical Fiber Communication 353
boundary of the two media. Thus in total internal reflection a light ray is reflected
back in denser medium without any loss of its power or intensity.
Optical fiber is like a cylindrical waveguide made of low loss material, such that
the light incident on the core is guided through it (the core) by successive total
internal reflections from the core-cladding boundary. Only those incident rays that
hit core-cladding boundary with angles of incidence greater than critical angle θC
suffer total internal reflection and are guided further into the core without any loss
of intensity; other rays that hit the boundary with smaller angles of incidence may
refract into the cladding material and are lost.
Depending on the values of refractive indices of the core and the cladding, optical
fibers may be classified into three types.
(a) Single-mode step-index fiber
Figure 7.4 shows a single-mode step-index optical fiber. Let us understand what is
meant by the mode. In simple words, mode means the different paths of transmission
of the light signal through the core of the fiber. In a single-mode fiber, the diameter
of the core is very small so that all incident light rays after undergoing total internal
reflections from the core-cladding boundary travel (almost) along the axis of the core.
As such the transmission path for light rays through the core is only one, along the axis
of the cylindrical core. Hence such fibers are called single-mode fibers. Step index
means that the refractive index n of the core material is same over all sections of the
core, and the refractive index n1 of the cladding is smaller than n but remains constant
all over the cladding material. There is a step change in the value of the refractive
354 7 Optical Fiber Communication
index at the core-cladding boundary, hence the name step-index fiber. Typical values
of the ratio of core diameter (2a) and the cladding diameter (2b) (both 2a and 2b in
micrometre µm) are: 8/125 and 50/125 for single-mode step-index fibers. If n and
n1 , respectively, denote the refractive indices of the core and the cladding, then the
fractional refractive index change Δ may be given as,
(n − n 1 )
Δ= (7.1)
n
Obviously, Δ is very small. Usually, the value of core refractive index n ranges
from 1.44 to 1.46, depending on the wavelength of the light, and the value of Δ lies
typically between 0.001 and 0.02.
A multimode step-index fiber is shown in Fig. 7.5. Typical diameter ratio (2a/2b) for
multimode fibers is 85/125 and 100/140. Since the diameter of the core is sufficiently
large, light pulse incident at the input end of the core may travel through several
different paths, along the axis, at many possible different angles such that after total
internal reflection at core-cladding boundary they all are guided through the core
and travel along different paths. Each of these paths represents a different mode
of transmission. In a step-index fiber, the refractive index n of the core and n1 of
cladding has fixed values. Refractive index of core is uniform throughout the core,
and similarly, the refractive index of cladding is uniform throughout the cladding;
there is a step change in the value of refractive index at the core-cladding boundary.
The problem with multimode step-index fiber is that light pulse or signal trans-
mitted through different modes (paths) travels different distances in the core. Since
the refractive index within the core has same value (n) in all parts of the core, light
signals travelling through different modes take different times to reach the terminal
point of the core. As a result, at the terminal point several images of the incident
light signal will be formed, each displaced with respect to the other by a very small
time interval. Thus the time definition of the incident signal will not remain sharp at
the terminal end. In an ideal situation images of the incident light signal travelling
through different modes along the fiber core should reach the terminal point at the
7.3 Basics of Optical Fiber Communication 355
same instant, completely overlap each other to produce a sharp and intense image.
In a multimode step-index fiber final signal image is blurred.
Gradual decrease in the refractive index of core material from axis towards the core
periphery helps in reducing or eliminating time dispersion between signals travelling
through different modes. To understand the working of a graded-index multimode
fiber it may be recalled that light travels faster in medium of lower refractive index
as compared to the medium of higher refractive index. Incident light signals take
peripheral paths and have longer path lengths travel faster (because of the lower
value of refractive index), as compared to signals that travel via axial modes (shorter
paths lying near the core axis) through the medium of higher refractive index. In
nutshell, signals that travel through longer paths pass through medium of lower
index and therefore, travel faster, while signals that go through shorter paths travel in
medium of higher index and, therefore, move with lower speed. The net result is that
all signals, travelling through different modes in the core, reach the terminal point at
the same instant.
Light rays that are guided in the core suffer phase change at each total internal
reflection at core-cladding boundary. Some of the rays that are guided through the core
are in opposite phase and interfere destructively, annihilating each other, while those
rays that were in phase strengthen each other on undergoing constructive interference.
Hence, though several modes may be allowed but only some of them really take place.
A light ray is guided by total internal reflection within the fiber core if its angle
of
incidence at the core-cladding boundary is greater than critical angle ∠θc = sin−1 nn1
and remains so as the ray undergoes total internal reflection from core-cladding
boundary again and again proceeding ahead.
Rays in planes passing through the core (or fiber) axis are called meridional rays.
One such plane that passes through the core axis is shown in Fig. 7.7, and typical path
of a guided meridional ray is shown by ABCD. It may be noted that a meridional
plane always intersects the cylindrical core-cladding boundary at 90°. Therefore,
meridional rays intersect the fiber axis and are reflected in the same plane without any
change in their angle of incidence θ . These rays are guided if their angle i with the core
axis is smaller than the compliment of the critical angle θC [= π2 − θC = cos−1 nn1 ].
Since θC is usually small, meridional rays are approximately paraxial.
Any general ray is identified by its plane of incidence, a plane parallel to the
fiber axis and passing (or containing) the ray and the angle θ that the ray makes in
incidence plane. The incident ray after total internal reflection (called skewed ray) is
confined in a plane that is normal to the core-cladding boundary and is specified by
7.3 Basics of Optical Fiber Communication 357
the distance R by which the normal plane is offset from the core axis and the angle
ϕ that the normal plane makes with the incident plane, as shown in Fig. 7.8.
Guided paths in fiber core for meridional and skewed rays are shown in Fig. 7.9.
As may be seen in this figure, the meridional rays move in a plane, while the skewed
rays follow a helical path confined between two cylindrical shells.
SAQ: Explain why meridional rays are nearly parallel to the fiber axis?
light rays impinging on the core-cladding boundary at an angle greater than the
critical angle undergo total internal reflection. In the following we will attempt to
find the value of the maximum angle of incidence for a ray of light that enters the core
from air (refractive index 1) and undergoes total internal reflection at core-cladding
interface. Acceptance angle is defined as the maximum angle of incidence at the
interface of air and core media for which the light ray enters the core and travels
along the boundary of the core and cladding.
Figure 7.10 shows a light ray incident on fiber core (refractive index n) from air
(refractive index 1). It may be noted in the figure that axis of cylindrical core acts
as normal to the air–core interface and that angle of incidence ∠i and the angle of
refraction ∠r are are related with each other by Snell’s law,
sin i n
= =n (7.2)
sin r 1
Condition for total internal reflection at point B in the figure is,
n1
sin ϕ ≥ (7.3)
n
Fig. 7.10 Passage of a light ray incident on the fiber core from air
1/2 n 1
1 − sin2 r ≥
n
or
n 2
1
sin2 r ≤ 1 −
n
or
n 2 1/2
1
sin r ≤ 1 − (7.4)
n
The refracted ray will travel along the boundary of core-cladding interface. The
acceptance angle is, therefore, given by,
360 7 Optical Fiber Communication
1/2
∠i acc = sin−1 n 2 − n 21 (7.7)
Rays incident from air to the fiber core making (with core axis) angles smaller
than acceptance angle ∠i acc will undergo total internal reflection at the core-
cladding boundary and will be guided in the core. A ray that is incident on the
air–core interface making an angle (with core axis) larger than the acceptance angle
will not undergo total internal reflection at the core-cladding boundary and will
refract into cladding. This is shown in Fig. 7.11, where ray AOBC incident at air–
core interface with incidence angle iacc after refraction at core-cladding boundary
travels along the boundary. Ray DOHK that makes angle of incidence larger than
acceptance angle (iacc ) is transmitted to the cladding after suffering refractopn at
core-cladding interface. Ray EOFG that is incident at air–core interface with angle
of incidence smaller than the acceptance angle undergoes total internal reflection at
core-cladding interface and is guided through thre fiber core.
As shown in Fig. 7.12 all rays incident at the air–core interface within the cone of
half angle i acc will fall at core-cladding boundary with angle of incidence larger than
the critical angle and will suffer total internal reflection and will be guided. On the
other hand, rays with angle of incidence at air–core interface larger than the angle of
acceptance i acc will suffer refraction at the core-cladding interface and will be lost
in cladding. Larger the opening of acceptance cone, more light rays may be guided,
which means that the light-gathering power of the fiber will be large.
Solved Example SE7.1 Assuming that the core of a fiber is made of glass of refrac-
tive index 1.46 and is surrounded by a cladding of refractive index 1.40, calculate the
critical angle θc , numerical aperture NA, angle of acceptance and fractional refractive
index change Δ. What will happen to the critical angle, NA and angle of acceptance
if cladding is replaced by air of refractive index 1?
Solution In the first part of the question it is given that: n = 1.46 and n1 = 1.40.
Therefore ∠θc = sin−1 nn1 = sin−1 1.40 1.46
= sin−1 0.959 = 81.70◦ .
And θc = (90 − 81.70) = 8.30◦ .
1/2
Further, numerical apperture NA = n 2 − n 21 = 0.414.
And acceptence angle ∠i acc = sin−1 NA = sin−1 0.414 = 27.17◦
n − n1 1.46 − 1.40
Δ= = = 0.041
n 1.46
In second part of the problem it is asked to calculate the same quantities when n1
= 1.
−1 n 1 −1 1.00
∠θc = sin = sin = sin−1 0.685 = 48.03◦
n 1.46
1/2 /
Also, numerical apperture NA = n 2 − n 21 = (1.46)2 − 1 = 1.0
And acceptence angle ∠i acc = sin−1 NA = sin−1 1 = 90◦ .
It may be observed that by replacing the cladding of index 1.40 by air of index 1.0,
i.e. by increasing the difference (n − n1 ) the angle of acceptance increases which in
turn will increase the light-gathering power of the fiber. It may, however, be shown
that with the increase of acceptance angle, the number of modes also increases
rapidly which is not good. It is for this reason that the difference between refractive
indices of core and cladding is kept small so that the number of modes does not
increase to a very large value.
Light rays are electromagnetic waves with cross electric and magnetic fiels, and the
propogation of guided wave in the core medium o f a fiber may be studied following
the variation in electric and magnetic field vectors subject to Maxwell’s equations
and boundary conditions imposed by core-cladding indices. There are some special
solutions of these equations, called modes, each of which has a distinct propogation
constant, a characteristic field distribution in the transfer plane and two independent
polarisation states. In the analysis of guided electromagnetic waves a parameter
called fiber parameter or V parameter evolves which may be given as,
a a
V = 2π (NA) = 2π n(2Δ)1/2 (7.9)
λ λ
Here a is the radius of the core, λ the wavelenght of light, n the refractive index
of core material and Δ = n−nn
1
. V is an important fiber parameter that governs the
number of modes and their propogation constants.
The maximum number of modes N m supported by a step-index fiber is given by;
1 2
Nm = V (7.10)
2
However, for V < 2.405, the step-index fiber can support only one mode and is
called a single-mode step-index fiber (SMF). The wavelenght of light corresponding
to V = 2.405 is called the cut-off wavelength of the fiber and is often denoted by λc .
2πa
λc = n(2Δ)1/2 (7.11)
2.405
In case of graded-index fiber which may support large number of modes and has
large value of V, the maximum number of modes is given by;
1 2
Nm = V (7.12)
4
7.3 Basics of Optical Fiber Communication 363
There are two important parameters that are associated with any optical signal; they
are (i) power of the signal, often denoted by P, and (ii) spectral spread, i.e. the signal
has a specific wavelength λ with a spectral spread of Δλ. As the optical signal travel
sthrough the fiber its power diminishes, which is referred as signal attenuation. Since
different wavelength components of the optical signal travel with different speeds in
the fiber, the temporal spread of the signal becomes broadened, and this is referred as
dispersion. Attenuation and dispersion limit the performance of optical fiber medium
as a data transmission channel. Attenuation limits the magnitude of the optical power
transmitted, while dispersion limits the rate of data transmission.
(A) Attenuation
Experimentally it has been observed that the power of an optical signal transmitted
through optical fiber diminishes exponentially with distance on account of absorp-
tion in fiber medium and scattering. Attenuation is generally measured in terms of
the power transmission ratio and distance travelled by the signal. The attenuation
coefficient α in units of dB/km is given as;
1 Pinc
α= 10 log10 (7.13)
L PL
Here, L is the distance (generally in km) travelled by the optical signal in fiber, Pinc
the incident power of the signal at L = 0 and PL the power of the signal after trav-
elling a distance L in the fiber. Quantity PPincL is called the power transmission ratio.
The graphical relation between attenuation coefficient α and the power transmission
ratio is shown in Fig. 7.13. It may be observed in Fig. 7.13 that power transmission
ratio becomes 0.5 (signal power becomes half of its original value) when attenua-
tion coefficient has the value 3 dB. Similarly, attenuation of 10 dB corresponds to
power transmission ratio of 0.1 and 20 dB of transmission ratio of 0.01. It may be
observed that while attenuation coefficients are additive, power transmission ratios
are multiplicative.
Signal attenuation in optical fibers may be divided into two components: (i) due
to intrinsic causes that are due to the intrinsic properties of the fiber material and (ii)
extrinsic that are due to impurities, etc. not associated with the fiber material.
(i) Intrinsic causes of attenuation:
Two major causes of signal attenuation while travelling through the fiber are:
(a) Absorbtion in fiber core material and scattering. In optical fibers made of fused
silica glass core, absoption of optical signal strongly depends on the signal
wavelength. Fused silica glass has two strong absoption bands: one in ultraviolet
(UV) region (λ between 0.6 and 0.9 µm) and the other in infrared (IR) region
364 7 Optical Fiber Communication
(λ between 1.6 and 1.9 µm). Between these two wavelength limits there is a
natural window for wavelengths in the region of nea- infrared region (λ between
1.0 and 1.6 µm) where there is no inherent absoption by fused silica glass.
Absorption of a given wavelength of light occurs when atoms and/or
molecules of the medium (SiO2 , fused silica) have either vibrational, rotational,
molecular or electronic excited states that exactly match with the energy of
the light wavelength. A light signal or photon of wavelength λ has an energy
ε = hc λ
and if the atom/molecule of the medium has some excited states that
differ exactly by ε amount of energy, the photon gets absorbed shifting the
system to the higher state of excitation. The near-infrared absorption band in
fused silica glass is due to vibrational bands, while the ultraviolet absorptrion
band arises because of the molecular and electronic excitations.
(b) Scattering of optical signal wavelengths by core medium is another major cause
of signal attenuation. The random localised variations of the molecular posi-
tions in silica glass of fiber core create random inhomogenities of the refrective
index and act as minute scattering centres. These inhomogenity centres scatter
optical signal light, in a way similar to the scattering of sun rays by dust parti-
cles in atmosphere. This type of scattering is called Rayleigh scattering and is
characterised by 1/λ4 law, which means that scattering is inversely proportional
to the fourth power of wavelength. Lower wavelengths are scattered more as
compared to the longer wavelengths. The wavelength window for which there
is no inherent attenuation of optical signal in fused silica glass is bounded by
Rayleigh scattering on the shorter wavelength side and by infrared absorption
on the long wavelength side.
(ii) Extrinsic causes of attenuation
Impurities, mostly of metallic ions and dissolved water vapours (OH ion) in silica
glass, cause attenuation of optical signals. Modern-day technology of making fused
silica glass has made it possible to remove almost 100% of all metallic ion impurities,
7.3 Basics of Optical Fiber Communication 365
however. Water vapours dissolved in glass still cause absoption of optical signals. It
is for this reason that optical signals of some specific wavelengths, for which (OH)
ion absorptrion and Rayleigh scattering are minimum and intrinsic absorptions are
also small, are used. Bends and other physical distortions in fiber cables also cause
scattering/attenuation of signals. Further, attenuation of optical signal is least in step-
index single-mode fibers and is considerabley higher in multimode graded-index
cables.
(B) Dispersion
c dn
v= , where N = n − λ (7.14)
N dλ
There are four main causes of signal dispersion in fibers. They are: (i) mate-
rial dispersion, (ii) modal dispersion, (iii) waveguide dispersion and (iv) nonlinear
dispersion.
(i) Matertial dispersion
Glass is a dispersive medium which essentially means that its refractive index is a
function of wavelength. Since an optical pulse is a wave packet, different components
waves of the wave packet travel with different group velocities in glass. It can be
shown that the temporal width of an optical pulse of spectral width σλ (in nm, 10−9 m)
after travelling a distance L becomes σr which is given by;
where
λ d2 n
Dλ = − (7.16)
c dλ2
Dλ is called material dispersion coefficient. Generally, L is measure in km (kilo-
metre, 103 m), σλ in nm (nenometre, 10−9 m) and σr in ps (picco second = 10−12 s);
therefore, the units of dispersion coefficient are ps/km-nm. Wavelength depen-
dance of material dispersion coefficient for fused silica glass as a function of signal
wavelength is shown in Fig. 7.15.
As may be observed in this figure, coefficient has a negative value for wavelengths
below 1.3 µm. Wave packets of long wavelength travel faster than wave packets
of shorter
ps wavelength.
At λ = 1.3 µm D(1.3) = 0; forλ = 1.55 µm D(1.55) =
ps
+17 km−nm ; and for λ = 0.87 µm D(0.87) = −80 km−nm .
(ii) Modal dispersion
As the name suggests, modal dispersion takes place in multimodal transmission
cables. The reason for this dispersion is the fact that group velocities of signal through
different modes are different. A single optical impuse of light (optical signal) entering
a M-mode fiber at z = 0 spreads into M signals, each travelling through the fiber
with a different group velocity. As a result, signals from different modes reach the
receiver end L (km) distance away after different time delays (see Fig. 7.14). If one
denotes by τk the time delay of the signal through the kth mode, then τk = vLk , where
vk is the group velocity of the signal through kth mode. If vmin and vmax represent
respectively the minimum and the maximum values of the group velocities, then
(L/vmin ) − (L/vmax ) will be the time spread of the optical pulse at receiver end.
The rms value of spread over all pulse widths at receiver end may be given by
σr = 21 [(L/vmin )−(L/vmax )]. σr is called the response time of the fiber. It can be
shown that in a step-index multimode fiber vmin = vmax (1 − Δ), hence,
1 L 1 1
στ = [(L/vmin )−(L/vmax )] = −
2 2 vmax (1 − Δ) vmax
L
= (1 − Δ)−1 − 1 (7.17)
2vmax
L
στ = [Δ] (7.18)
2vmax
Optical signal has electric and magnetic fields associated with it. Field distribution
in fiber depends on the ratio (a/λ), where a is the radius and λ the wavelength of the
light signal. Because of this dependance, group velocities of different modes differ
from each other, even when material dispersion is neglected. Dispersion produced as
a result of difference in modal group velocies on account of radius/wavelength ratio
is called waveguide dispersion. In analogy with Eq. (7.15), the waveguide response
may be written as,
Here, V is the V parameter of the fiber and β the propogation constant of the incident
signal.
(iv) Nonlinear dispersion
Another type of dispersion occurs when the intensity of the input light signal is suffi-
ciently high. It occurs because for high-intensity light signal the refractive indices
become intensity dependent. High-intensity parts of the signal pulse undergo phase
shifts that are different from the low-intensity part. As a result the frequencies of
the signal get shifted by different amounts, which result in the modification of
group velocities and change in pulse shape. Under some special conditions, material
dispersion may counter balance nonlinear dispersion, so that the optical pulse travels
through the fiber without altering its temporal profile. The wave in such a condition
(when there is no change in its temporal profile) is called a soliton.
Figure 7.16a–c show dispersions produced by different types of fibers to a sharp
input optical pulse. It may be observed in these figures that the output pulse is shifted
in time with respect to the input pulse, and in most cases its shape has also changed
as a result of dispersion.
In any information transmission the first step is to convert the given information
into digital data in form of electrical signals. For optical fiber communication the
informastion which is already available in digital electric/electronic form needs to
be converted into the form of optical (light) pulses. Essentially there are four main
components of a fiber network: (i) optical transmitter, (ii) optical fiber cable, (iii)
optical connectors and (iv) optical receiver. Figure 7.17 shows the simplistic layout
of a optical fiber network.
after crossing a thin p+ layer and produces electron hole pairs which are swept;
holes towards the p-side which is biased by negative potential and electrons
towards the n-side biased by positive potential. An electric current signal is
thus generated whenever a light signal shines the PIN diode.
Avalanche photodiode operates in reverse bias mode, with high reverse bias
voltage that may be up to 100 V. Optical (light) signal falling on depletion
region of a PAD produces charge carriers, holes and electrons which get accel-
erated under the high reverse bias and produce secondary charge carriers by
avalanche breakdown of depletion medium. As a result of secondary carrier
production high current flows through the photodiode generating a current pulse
corresponding to each light signal.
στ = |Dλ |σλ L .
Problem
Fig. 7.21 Time response of an impulse optical pulse through a 100 km single-mode fiber
SA7.1 List the advantages and the drawback of optical fiber communication.
SA7.2 List important causes of signal attenuation in fiber communication.
SA7.3 What is dispesion of signal and how does it effect optical fiber communica-
tion?
SA7.4 Give a brief construction details of an optical fiber and explain the purpose
of each component.
SA7.5 What is the difference between a graded-index and a step-index multimode
fiber? Which is better and why?
SA7.6 What are the essential components of an optical fiber link? Give brief
description of each component.
SA7.7 Define acceptance angle, numerical aperture (NA) and V parameter of a fiber
and give physical significance of each.
ps µs pm km−nm
(a) km−nm
(b) km−nm
(c) km−nm
(d) ps
ANS: (a)
MC7.2 Acceptance angle θacc is given by,
(a) sin−1 (NA) (b) sin−1 (2Δ) (c) sin−1 n(2Δ)1/2 (d) sin−1 n(2Δ)2
ANS: (a), (c)
MC7.3 Maridional rays are contained in a plane that
(a) Is at a fixed distance from fiber axis (b) passes through fiber axie (c) is
normal to fiber axis (d) makes an angle equal to the critical angle with fiber
axis
ANS: (b)
MC7.4 Bandwidth of a communication channel is a measure of the
(a) Time taken to transmit 100 kb data over 1 km (b) transmission capacity
of the channel (c) amount of data that may be up or down loaded per unit
time (d) change in temporal priofile of the transmitted signal
ANS: (b), (c)
MC7.5 In optical fiber transmission dispersion is a measure of the change in signal’s
(a) Power profile (b) frequency profile (c) shape profile (d) temporal profile
ANS: (d)
MC7.6 LASER source-based optical transmitter has
(a) Smallest spectral width (b) large bandwidth (c) least dispersion (d) least
attenuation
ANS: (a), (b)
MC7.7 The power of an optical signal becomes one-tenth of its original value
after travelling a distance of 100 km in an optical fiber. The attenuation
coefficient for the link in dB/km is
(a) 0.001 (b) 0.01 (c) 0.1 (d) 1.0
ANS: (c)
MC7.8 The V parameter for a 25 µm radius step-index fiber is 10. The maximum
number of modes supported by the fiber is
(a) 10 (b) 20 (c) 40 (d) 50
LA7.1 Describe in detail the components and their use in an optical fiber link.
Discuss the advantages of optical fiber communication and its applications.
LA7.2 Give details of the guided propogation of an optical pulse through a fiber
explaining the importance of acceptance angle, numerical apperture and V
parameter.
376 7 Optical Fiber Communication
LA7.3 Discuss in detail the attenuation of optical signal while guided through a
fiber. What is dispersion and how does it adversly affect signal propogation?
LA7.4 Discuss different types of optical fibers and their merits. What is meant by
‘bandwidth’ and how is it related to the modes of fiber propogation?
Chapter 8
Laser Technology and Its Applications
Objective
Physics behind the working of LASER sources, their classification and some of their
important applications will be discussed in this chapter. After reading this chapter the
reader will be able to appreciate and understand special properties of LASER sources,
the processes of population inversion, induced emission and light amplification using
induced emission of radiations. He will also be able to follow why laser sources are
used in some special applications.
8.1 Introduction
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 377
R. Prasad, Physics and Technology for Engineers,
https://doi.org/10.1007/978-3-031-32084-2_8
378 8 Laser Technology and Its Applications
Broadly speaking, complete EM spectrum may be divided into two parts; visible
light and invisible light. Different parts of EM spectrum may be identified by their
specific names; like Gamma rays, X-rays, microwaves, radio waves, etc. As is shown
in the figure, different components of EM spectrum have different frequency or
wavelength ranges, but all EM waves travel with the same velocity c (= 3 × 108 m/
s) in vacuum. The frequency ν and the wavelength λ of an EM wave in vacuum
are related with the expression ν = c/λ. Since radiations transport energy from one
place to another, EM radiations carry with them energy. Quantum mechanics (QM),
the mechanics appropriate for microscopic systems, tells that energy carried by EM
radiations is in the form of small energy packets. Energy packets or energy quanta of
EM radiations are called Photon. The energy contents of a photon of an EM wave
of frequency ν are E = hν; here h is a constant called Planck’s constant with value
4.1357 × 10–15 eV s or 8.62 × 10–34 J s in SI units. As may be seen, Plank’s constant
is very small and, therefore, energy content of photon is very small. Order of energy
(in eV) contained in different components of EM waves is given at the top in Fig. 8.1.
Each component of EM radiations is of great use for humanity; X-rays are used to
take photographs of bones, etc., gamma rays for treating cancer patients, microwaves
for making ovens for cooking, ultraviolet light for sterilising surgical instruments and
killing bacteria, visible light for observing objects, infrared for heating and so on. In
order to use EM radiations, it is required to develop sources for different components
of EM radiations. Interaction of EM radiations with matter plays important role in
developing light sources.
8.3 Interaction of Electromagnetic Radiation with Matter 379
Interaction of EM radiations with matter may be divided into two steps that compete
with each other (a) absorption and (b) spontaneous emission.
(a) Absorption
Materials are made up of atoms and molecules. When all atoms present in a material
are identical, the material is called an element. In the other case, when material
is made of different types of atoms, it may be called a salt, alloy, composite, etc.
depending on the nature of binding between atoms and molecules.
Each individual atom of a material, according the quantum theory, may exist in
one of the several quantised discrete energy states. Out of these energy states, state
with lowest energy is called the ground state and other states of higher energies,
the excited states. Energy states are not equidistant in energy from each other, the
energy separation between successive states decreases with the increase of energy.
In absence of any external radiations (at absolute zero temperature) all atoms of
the given material stay in their ground or lowest energy state which is most stable.
However, if the temperature of the material specimen is raised to some higher value T
K, some atoms absorb thermal energy from surroundings and shift to excited states.
At any given absolute temperature T Kelvin, when system is in thermal equilibrium,
number N E of atoms in an excited state of energy E is given by;
NE = N0 e(− kT )
E
(8.1)
Here, N 0 and k are respectively the number of atoms in ground state and Boltzmann
constant. Since the factor e(− kT ) is always less than one, N E < N (<E) . It means that
E
SAQ: What is meant by thermal equilibrium? How can one identify if a system is
in thermal equilibrium?
Let us now consider any two successive excited states of the atom with energies
E 1 and E 2 having atom populations N 1 and N 2 respectively. It may be noted that E 2
> E 1 but N 2 < N 1 . Now suppose a beam of EM radiations of frequency ν is made to
fall on the specimen. This beam of EM radiation will be a bunch of photons each of
energy E p = hν. If E p matches with the energy difference [E 2 − E 1 ] between two
energy states of the atom, i.e. if
hν = E 2 − E 1 (8.2)
then some of the incident photons may be absorbed by atoms in state E 1 and will
shift to the next higher state of energy E 2 . Absorption of one photon by one atom in
lower energy state E n will reduce the number of incident photons by one, decrease
the population of lower energy state by 1 and increase the population of atoms in the
higher energy state by 1. Schematic diagram showing absorption of one photon by
an atom in the lower energy state E 1 and shifting to higher energy state E 2 is shown
in Fig. 8.3. The process in which atoms in a lower energy state are made to absorb
photons externally incident on them is either called absorption or more specifically
induced absorption.
The number of absorption events N abs in time Δt is proportional to the number
of atoms N 1 in state E 1 , number N of photons per unit volume in EM beam and time
Δt. Number of photons per unit volume of the incident beam may be expressed as
energy density Q = Nhν of incident photon beam. Therefore,
Nabs ∝ N1 .Q.Δt
or
Nabs
R Abs = = B12 N1 .Q (8.3b)
Δt
8.3 Interaction of Electromagnetic Radiation with Matter 381
Absorption of photons by atoms lifts them from a lower energy state to a state of
higher energy. The process is called photoexcitation. However, atoms of a material
may be excited by several other means; for example the atoms of the filament of a
bulb get excited when an electric current is passed through the filament. The atoms
of a solid body, like a metal ball, go to excited states when it is heated. Therefore,
photoabsorption is one of the many ways of atomic excitation.
Atom excited to a state of higher energy cannot live in the excited state for long. In
normal case the mean life time τ of atomic excited states is quite short of the order of
10−8 –10−9 s. Mean life is a statistical parameter. Number of atoms in an excited state
decreases exponentially with time. According to the exponential law of spontaneous
decay, if N (t=0) is the number of excited atoms at initial time t = 0, then number of
excited atoms N t at a later time t is given as;
Nt = N(t=0) e− τ
t
(8.4)
Figure 8.4 shows how, say 100 atoms in an excited state decreases exponentially
with time. If τ is the mean life for this decay, then nearly 63 (63%) of original
population of atoms in excited state will lose their extra energy and got de-excited.
After one mean life only about 37 (37%) of original population of excited atoms will
survive. About 98% of excited atoms will revert back to lower energy state after a
time of the order of 5τ. Reverting back from an excited state to the lower energy
state of an excited atom is called de-excitation. Mean life of an excited atomic state
is a parameter that cannot be controlled by any physical or chemical process; it
is a property of the excited state which cannot be altered. Natural de-excitation is a
spontaneous process. An important observation that follows from the spontaneous de-
excitation curve is that atoms in a given excited state do not de-excite simultaneously
at the same instant; 98% of excited atoms de-excite in a time span of about five mean
lives.
Excited atoms may shake-off their extra energy and de-excite in several ways,
including the spontaneous emission of a photons. Emission of photons from the
excited atoms, without any external stimulation, is termed as spontaneous photon
de-excitation. In a way it is just the opposite of absorption. In normal course both
Fig. 8.5 a Populations of the lower and the excited states, b, c show the spontaneous photon
emission, d shows that de-excitation photons are generally emitted at different times and in different
directions
8.4 Einstein Prediction of Stimulated Emission 383
The origins of stimulate emission, the back bone of laser physics, may be traced
back to an idea from Albert Einstein in the formative years of quantum theory. In
1917 Einstein published a paper entitled ‘Zur Quantentheorie des Strahlung’ which
in English translates as ‘On the Quantum theory of Radiation’. In this paper Einstein
predicted stimulated emission of radiations in order to satisfactorily reproduce the
quantum mechanical expression of radiation energy density Q. Earlier, way back in
1900 an expression Q was derived by Planck.
Einstein argued that in case of an atomic system having two energy states E 2 and
E 1 (E 2 > E 1 ) with number of atoms N 2 and N 1 (N 2 < N 1 ) respectively, if bombarded
by a beam of N photons per unit volume, each of energy hν = (E 2 − E 1 ), the system
384 8 Laser Technology and Its Applications
will be in thermal equilibrium only when the decay rate (by spontaneous emission
of photons) of state E 2 to state E 1 and the photon absorption rate of state E 1 to state
E 2 are equal. These transition rates are given respectively by Eqs. (8.5a) and (8.3b).
On equating the two rates one gets,
A21 N2 = B12 N1 .Q
or
A21 N2
Q= (8.5b)
B12 N1
E2 E1
However, from QM N2 = N0 e− kT and N1 = N0 e− kT .
− ( 2kT 1 )
E −E
Therefore, N2
N1
= e substituting this value of N 2 /N 1 in Eq. (8.5b) one
gets,
A21 − ( E2 −E1 )
Q= e kT
(8.5c)
B12
Earlier, another German scientist Max Planck carried out detailed study of black-
body radiations and in 1900 gave the following expression for the energy density of
radiations Q,
8π hν 3 1
Q= 3
hν (8.5d)
c e kT − 1
Einstein found that the two expressions for energy density of radiations, given
by Eqs. (8.5c) and (8.5d), do not agree. He therefore suggested that there may be
another factor that has been missed while driving Eq. (8.5c).
Einstein proposed that apart from spontaneous decay there may be a stimulated
or induced decay of atoms in excited state E 2 that might also contribute to the decay
process. He further assumed that the decay rate RDsti due to stimulated emission is
proportional to the number of atoms N 2 in state E 2 and also the energy density Q.
Thus,
Here B21 is the constant of proportionality but different from A21 . Another important
point to note is that Einstein assumed that stimulated decay rate is not only propor-
tional to N 2 , the number of atoms in state E 2 but also to the energy density Q (=
N hν).
With the addition of another mode of decay the total decay rate of level E 2 is now
the sum of decay rates due to spontaneous emission and decay rate due to stimulated
emission. In thermal equilibrium the sum of the spontaneous and stimulated decay
rates must be equal to the absorption rate;
8.4 Einstein Prediction of Stimulated Emission 385
sp
RDsti + RD = R Abs
or
A21 N2
B21 N2 Q + A21 N2 = B12 Q N1 or Q =
(B12 N1 − B21 N2 )
or
A21
Q=
B12 NN21 − B21
( E2 −E1 )
Substituting the value of N1
N2
= 1
( E2 −E1 ) = e kT in above expression for Q,
−
e kT
one gets;
⎛ ⎞
A21 A21 ⎝ 1 ⎠ = A21 1
Q= ( E2 −E1 )
= ( E2 −E1 ) B12 (hν)
B12 e kT − B21 B21 B12
e kT − 1 B21
B21
e kT −1
B21
(8.5f)
8π hν 3 1
If one now compares Eq. (8.5f) with the expression of Q = c3 hν given
e kT −1
by Planck Eq. (8.5d), one gets
A21 8π hν 3 B12
= 3
and = 1 (or B12 = B21 ) (8.5g)
B21 c B21
Constants B12 , B21 and A21 are called Einstein’s coefficients and relations specified
by Eq. (8.5g) Einstein’s relations.
The ratio R of rates for spontaneous emission to stimulation emission may be
given as,
A21 N2 hν hν
R= = e kT − 1 ≈ e kT (8.5h)
B21 N2 Q
Equation (8.5h) tells that the ratio of spontaneous to simulated decay increases
exponentially with energy of photon. For example if one calculates ratio R for the
light of frequency 4.7 × 1014 emitted by ruby laser at room temperature (300 K) using
the values of h = 8.63 × 10−34 J s and Boltzmann constant k = 1.28 × 10−23 J/K; one
gets R = 3.7 × 1032 , it means that when there is thermal equilibrium spontaneous
decay will be 1032 times more probable than stimulated decay. Obviously, stimulated
decay in thermal equilibrium will be completely masked by spontaneous decay. In
order to enhance stimulated decay one has to go to a situation where the system is
386 8 Laser Technology and Its Applications
not in thermal equilibrium. Further details to achieve such situation will be discussed
in next sections.
SAQ: In what respect the spontaneous de-excitation differs from stimulated de-
excitation?
Willis Lamb and R. C. Retherford were the first in 1947 to experimentally observe
stimulated emission. In 1950 Alfred Kastler gave the idea creating a non-equilibrium
state by population inversion for which he was awarded Nobel Prize of Physics in
1968.
Let us consider two states of an atom with excitation energies E 1 and E 2 (E 2 > E 1 ).
Further, let there be N 2 excited atoms in state E 2 and almost no atom in state E 1 . It is
also assumed that conservation laws allow photon de-excitation of atoms in state E 2
to E 1 . In normal situation atoms from state E 2 will follow spontaneous exponential
decay and will de-excite to state E 1 emitting photons of energy (E 2 − E 1 ) in random
directions with random phases and states of polarisation.
However, a special phenomenon, called induced de-excitation or induced/
stimulated photon emission may occur if a photon of energy hν = (E 2 − E 1 ) is
made to hit excited atoms of state E 2 . The incident photon will immediately make
one excited atom of state E 2 to de-excite by emitting a photon of energy hν = (E 2 −
E 1 ). This photo-de-excitation of excited atom by another photon of same energy is
called induced or stimulated photon emission. In stimulated photon emission, the
incident photon simply induces an excited atom to emit a photon identical to the inci-
dent photon, without undergoing any change in itself. Induced de-excitation does not
depend on the mean life of the excited state E2 , and it occurs immediately as the
incident photon interacts with an excited atom in state E 2 . Further, the phase, state
of polarisation and direction of motion of the incident photon and the photon
produced by stimulation emission are same.
With stimulated photon emission by the incident photon, there are now two
photons of same energy, same phase, same state of polarisation and moving in
the same direction. The two photons each of energy (E 2 − E 1 ) now induce two
more excited atoms to de-excite by emitting two more photons. Thus a single inci-
dent photon produces a chain reaction in which the numbers of induced photons
multiply rapidly. As a result almost all excited atoms in state E 2 undergo stimu-
lated de-excitation at the same instant by emitting large number of photons that are
all moving in the same direction, have same energies, have same phases, and same
states of polarisation. It is important to note that stimulated emission of photons does
not depend on the mean life of the excited state, and it occurs almost instantaneously.
Frame (a) in Fig. 8.6 shows two energy states of a system with energies E 2
and E 1 (E 2 > E 1 ) with one excited atom in state E 2 and an empty state E 1, before
8.5 Stimulated (or Induced) Emission of Photons 387
Fig. 8.6 Stimulated emission of photons a when there is only one excited atom, b when there is
large number of atoms in excited state
388 8 Laser Technology and Its Applications
Here, B21 is the probability of stimulated transition, N2 the number of excited atoms
in state of energy E 2 and Q the energy density of incident photon beam.
While discussing the laser action above, it was assumed that the population of excited
atoms in state of higher energy E 2 is large while the lower energy state E 1 is empty.
This assumption that there were large number of excited atoms in state E 2 and either
no or very few atoms in lower energy state E 1 is physically not justified. Quantum
statistics tells that in thermal equilibrium, when a system is in steady state, the number
of excited atoms in a state of higher energy is always less than their number in a state
of lower energy as dictated by Eq. (8.1). The situation when number of excited atoms
is more in a state of higher energy than their number in a state of lower energy is
technically referred as population inversion. It is obvious that population inversion
is normally not possible, and if a system has population inversion then it is not in
thermal equilibrium. However, as will be seen, in some special cases, population
inversion can be achieved using some technical tricks.
Let us now discuss why population inversion is a prerequisite for viable laser
action. Let us assume that there are more atoms in the state E 1 of lower energy
and only few excited atoms in state E 2 of higher energy, i.e. there is no population
inversion. In this situation, if some stimulated photons are released by laser action
from the de-exciting atoms in state E 2 , they (stimulated photons) may be absorbed
immediately by atoms in state E 1 . Since the number of atoms in state E 1 is much larger
than that in state E 2 , the probability of photon absorption will be large; as a result
either negligible or no coherent photons will be available for making a light source,
which means that there will be no gain of photons in the system even after stimulated
emission. It is, therefore, necessary to have population inversion for having large
photon gain factor so as to have a viable laser technology based source.
Some important characteristics of simulated emission (laser action) are;
• It requires population inversion between two chosen energy states of the system.
• It may be initiated by a photon of energy hν equal to the energy difference between
the two states.
• Photons emitted in stimulated emission are identical in all respect to the incident
photon, they are coherent, having same energy (or frequency), same phase, same
state of polarisation and same direction of motion.
• The process of stimulated emission may be controlled from outside.
• In process of stimulation emission multiplication of photons takes place.
8.5 Stimulated (or Induced) Emission of Photons 389
There are some essential requirements for practical application of laser technique,
they are;
I. There should be two states of an atomic system, one of higher energy E2
and the other with lower energy E1 such that photon transition from state
E2 to state E1 is allowed. Quantum states of a system have several other good
quantum numbers, like spin angular momentum, parity, magnetic moment, etc.,
apart from its energy. Transition between two states of a system is governed
by some conservation laws. It is possible that photon transition between some
energy states is prohibited on account of some conservation law. However, for
lasing action to take place, photon transition from higher lasing state E 2 to the
lower lasing state E 1 must be allowed.
II. The state with higher energy E2 must have a mean life of at least 10−4 s
or more. As has already been discussed, population inversion is essential for
practical application of laser action. It means that the population of atoms in
state E 2 must be increased to a high value before triggering stimulated emission.
Normally, the highest population of atoms is in ground state, and it is required
to lift atoms from the ground state to the state of energy E 2 either directly or
via some other intermediate step. The process/technique of lifting in energy
atoms from a state of lower energy to the state of higher energy for simulated
emission is called pumping. Pumping takes time to create population inversion
to the desired level. If state E 2 is very short lived, with life time of the order of
10−8 s, atoms pumped to state E 2 will decay out by spontaneous emission before
sufficient degree of population inversion is achieved. Hence, it is required that
the mean life of state E 2 should be of the order of 10−4 s or larger. Nature has
provided atomic systems with excited states that have longer live times. Those
excited states that have live times larger than 10−8 s are called metastable states.
These metastable states are generally chosen as the upper state for lasing action.
III. Pumping is the process of supplying energy to atoms in a state of lower
energy so as to lift them to the state of higher energy to create and maintain
population inversion. Pumping essentially is a method of putting additional
energy in the active medium to produce and sustain population inversion.
Energy, to atoms in lower energy state (mostly the ground state), may be given
in many different forms, like via absorption of photons by atoms, via some
chemical reaction or through inelastic collisions of atoms, etc. Some details of
these pumping options will be discussed in the following.
390 8 Laser Technology and Its Applications
8.5.3 Pumping
French Physicist Alfred Kastler in 1950 proposed a method to alter the relative popu-
lation of excited levels by optical irradiation of atoms in ground state. He visualised
a two-step process where in atoms in some level A are hit with a beam of photons of
an appropriate energy, which they absorb (called induced absorption) to move to the
excited state B. Some of the excited atoms in state B may revert back to level A and
some others to another excited state C (may be a metastable state). If de-excitation
rate of level C to level A is lower than the feeding rate from state B, population of
state C will increase with time at the cost of the population of A, see Fig. 8.7.
Optical pumping is often used in solid state lasers.
In case of optical pumping energy to atoms in lower energy state (generally the ground
state) is supplied by shining a beam of light (photons) on them. In case of pumping
by electric discharge, the required energy to atoms in ground state is supplied by
establishing a large potential difference across the system, which are mostly gases.
Under high electric field gas atoms/molecules get ionised emitting electrons. These
electrons get accelerated under the high electric field and collide with other atoms/
molecules to excite them to higher energies. Some of the excited atoms/molecules
may feed an intermediate state (exactly as in case of optical pumping) the population
of which may increase with time at the cost of the ground state population. This
method of pumping is used in gas lasers like argon laser.
(c) Inelastic atom–atom collision
between excited atom A* with ground state atoms B, transfer excitation and a part
of kinetic energy to atoms B and excites it to a higher excitation state B*. Atoms in
excited state B* may partly de-excite to the ground state B or to some intermediate
state C. If the decay rate to the intermediate state C is larger than the decay rate to
the ground state, the population of intermediate state C will increase with time and
may become larger than the population of ground state B. He–Neon (He–Ne) laser
uses this mechanism of pumping.
Source of energy that shifts ground state atoms/molecules to higher excited state in
case of optical pumping is photons of characteristic energy, and in cases of electric
discharge and inelastic atom–atom collisions the electrostatic field. Desired atomic/
molecular excited states in some cases may also be achieved by heating the active
laser medium. Heat energy may work as the energy source for pumping. Except that
heat energy becomes the source of excitation, the population inversion in this case
also is achieved the same way as it is in optical pumping.
(e) Chemical pumping
Some chemical reactions leave the product molecule or atom in an excited state,
while the ground state of the system (atom/molecule) is unstable or dissociative.
The energy released in chemical reaction is converted into the excitation energy.
Such chemical reactions automatically produce population inversion and are used
for making laser sources. Following chemical reactions are often used in laser sources
that may deliver power up to hundreds of watts.
Reaction Laser
H + Br2 → HBr∗ + Br HBr
F + H2 → HF∗ + H HF
F + D2 → DF∗ + D DF
CS + O → CO∗ + S CO
This technique is used in fabricating solid state semiconductor diode lasers. These
diode lasers are compact with an active medium in the solid phase, which directly
converts electrical energy into laser radiations. The laser energy output of diode lasers
may be gainfully employed to further pump other solid state lasers.
Atomic and molecular systems have several excited states with at least one or two
of these as metastable states. In cases where the first excited state of the system
392 8 Laser Technology and Its Applications
is a metastable state, lasing action may be achieved between the ground and the
metastable state, under suitable conditions.
Layout of a three-level lasing scheme is shown in Fig. 8.8. Atoms/molecules from
the ground state are pumped using the appropriate pumping method to the energy
level E 2 . Level E 2 undergoes spontaneous de-excitation both to the metastable state
E 1 with decay constant λ2 and to the ground state with decay constant λ1 . In case the
decay constant (number of decays per unit time) λ2 is much larger than λ1 , (λ2 >> λ1 )
and the mean life of the metastable state E 1 is sufficiently large, the population of state
E 1 will increase with pumping time at the cost of the population of the ground state.
Eventually, after some time of continuous pumping, population inversion between
the ground and the first excited state E 1 will be achieved. A photon of energy (E 1 −
E g ) may now trigger lasing action between the two states. In this scheme the first
excited state E 1 is the upper lasing level while the ground state the lower lasing level.
In three-level lasing scheme (Fig. 8.8) the ground state is populated directly from
energy level E 2 through spontaneous decay (decay constant λ1 ) and also from level
E 1 via lasing. As such continuous pumping at sufficiently rapid rate is essential for
maintaining population inversion. This drawback may be overcome in four-level
lasing as shown in Fig. 8.9.
In four-level lasing, atoms/molecules from the ground state are pumped to an
excited state E 3 which decays partially to the ground state E g with decay constant
λ1 and mostly to another excited state E 2 with large decay constant λ2 . State E 2
is metastable. There is another energy state E 1 which is fed by state E 2 , however,
as E 2 is metastable state spontaneous decay of E 2 to state E 1 is very weak. State
E 1 de-excites to ground state with decay constant λ3 . With appropriate pumping,
population of metastable state may be increased with respect to the population of
state E 1 and population inversion may be easily achieved and maintained since state
E 1 spontaneously decays to ground state. The metastable state E 2 and the state E 1
are respectively the upper and the lower lasing levels.
states is allowed. Let the system be kept in some medium. Suppose that a beam of
monochromatic photons of frequency ν = (E2 −E h
1)
and intensity I0 is projected in
the medium. In normal situation when there is no population inversion, photons from
the incident beam will be absorbed by the atoms in the lower energy state as the beam
travels through the medium. As a result the intensity of the beam will exponentially
decrease with distance x travelled in the medium, i.e. Ix = I0 e−αx , the coefficient
of absorption α will be proportional to the difference (N 1 − N 2 ). Now suppose that
optical pumping is done and population inversion occurs resulting in laser action and
emission of laser photons of frequency ν. Under the changed circumstances N 2 is
now greater than N 1 and since α is proportion to (N 1 − N 2 ), it will become −α and
the intensity of the incident photon beam will be given by; Ix = I0 e+kx , where k (=
−α) is a constant that will be proportional to (N 2 − N 1 ). It means that the intensity
of the incident beam will increase as it will travel the active media. Here k is called
the gain coefficient of the medium. It can be shown that the gain coefficient k of the
medium is given as,
In resonator cavity laser beam undergoes multiple oscillations in the cavity media
between the pair of mirrors to obtain large gain before leaving the cavity through
the partially polished mirror. Laser oscillations can only sustain in the active media
of the cavity if it attains at least unit gain after a round trip from mirror to mirror
and overcome the various losses in the cavity. Laser beam propagating through the
cavity medium undergoes scattering, etc. with medium atoms/molecules and lose
its intensity. If γ denotes the coefficient of beam intensity loss in the medium, the
overall gain coefficient of the beam will become (k − γ ).
Let us consider a laser beam of initial intensity I 0 that starts from mirror M 1
towards mirror M 2 as shown in Fig. 8.10. As the beam hits mirror M 2 its intensity
becomes I 1 given by I1 = I0 e(k−γ )L . The beam suffers reflection at mirror M 2 and if it
is assumed that the refractivity of the two mirrors is respectively R1 and R2 , intensity
of the beam after reflection at M 2 will become I2 = R2 I1 = R2 I0 e(k−γ )L . While
travelling from M 2 to M 1 through the active medium the beam intensity will change
and will have the magnitude I3 given by I3 = I2 e(k−γ )L = R2 I0 e(k−γ )L e(k−γ )L =
R2 I0 e2(k−γ )L at the instant when the beam hits mirror M 1 . On suffering another
reflection at M 1 the beam intensity will change to I4 = R1 I3 = R1 R2 I0 e2(k−γ )L .
Therefore, the net gain G in the intensity of laser beam in one round trip between
two mirrors of the cavity is given by G = II04 = R1 R2 e2(k−γ )L . The gain G of the
laser beam must either be 1 or larger than 1 for sustained oscillations of the beam in
the cavity. Therefore, the threshold condition for sustained oscillations of laser beam
in the cavity is given by
8.5 Stimulated (or Induced) Emission of Photons 395
G = R1 R2 e2(k−γ )L = 1
The layout of a plane parallel resonator also called Fabry–Perot optical cavity, that
uses one fully reflecting and one partially reflecting plain mirrors around the active
medium is shown in Fig. 8.13. Two mirrors are held normal to the optical axis of
the medium. Spontaneous de-excitation photons trigger laser bunches that move in
random directions as shown in Fig. 8.13a. Laser bunches moving at some angle with
the optic axis are allowed to be lost but bunches moving along the optical axis stays
in the medium and undergo multiple reflections from the two mirrors. Photons of
bunches moving along the optical axis trigger stimulated emission from excited atoms
in the higher lasing level, thus generating additional laser bunches all moving along
the axis (see Fig. 8.13b). As a result of multiple reflections, the medium is filled with
large number of laser bunches all in phase but moving in opposite directions along
the optical axis. This happens only if the gain per trip of photon bunches between the
two mirrors is either equal or larger than the threshold gain. Laser bunches moving
in opposite directions produce stationary or standing waves. For standing wave, it
is required that the optical path length travelled by a wave between consecutive
396 8 Laser Technology and Its Applications
2L = mλm (8.7a)
the line width that do not satisfy sustain oscillation condition, die out. Final inten-
sity pattern of the laser beam emerging from the cavity through partially polished
mirror M 2 is shown in Fig. 8.12a. These frequencies; ν1 , ν2 , ν3 etc constitute axial
or longitudinal modes.
The difference Δν in the frequencies of any two successive modes is given as c/μ2L
from Eq. (8.8) as shown in Fig. 8.12b.
Fig. 8.13 a Fabry–Perot resonator showing bunches of laser photons moving in different directions.
Photon bunches are produced by lasing action initiated with spontaneous de-excitation photons
moving in random directions. b Laser bunches of photons moving along the optic axis of the active
material undergo multiple reflections at two mirrors and stimulate further lasing action adding
more coherent photons. Coherent photon waves strengthen each other by constructive interference
producing an intense laser beam that emerges from the partially reflecting mirror. c Laser bunches
moving along the optical axis but in opposite directions produce stationary waves of multiple
longitudinal modes
398 8 Laser Technology and Its Applications
Fig. 8.14 Typical shapes of beam spot corresponding to different TEM configurations
It follows from Eq. (8.7a) that the number ‘m’ of possible axial modes for a given
laser beam may be calculated as, m = 2L λ
, where λ is the wavelength of maximum
emission. For example, in case of Ruby laser λ = 694.3 nm = 694.3 × 10−9 m and
2L
hence the number of possible axial modes in this case may be as large as 694.3×10 −9 . If
−2
the length L of cavity is say, 25 cm, then number of axial modes may be 2×25×10
694.3×10−9
=
72×10 . Further, Δν = 2L the frequency difference between two successive modes
4 c/μ
3×108
for L = 25 cm and μ = 1, comes out to be 2×25×10 −2 = 8.0 × 10 Hz.
8
The line width Δw for ruby laser is typically around 0.3 GHz (= 0.3 × 109 Hz or
0.53 nm). It may be observed here that for the case of ruby laser with cavity length
of 25 cm, Δν = 6 × 108 Hz is more than Δw = 0.3 × 109 Hz . This means that
only one axial mode will be sustained by the cavity (Fig. 8.13).
Laser light has three special properties that light emitted by an ordinary incan-
descent electric bulb or any such source do not have. These properties are: (i)
monochromaticity, (ii) coherence and (iii) directionality.
(i) Monochromaticity
Laser light photons are emitted when atoms/molecules from some excited state of
energy E 2 are stimulated to decay to a state of lower energy E 1 (see Fig. 8.9).
Since, the upper and lower lasing states are quantum mechanical states, they have
discrete and definite energies, in the present case E 2 and E 1 respectively. Therefore,
all the emitted laser photons have the same energy E p = (E 2 − E 1 ). This property
that all emitted laser photons have same energy (or wavelength or frequency) is
called monochromaticity. In contrast, light photons emitted by other sources like
incandescent lamps have a spectrum of wavelengths (energies), from infrared to
ultraviolet.
A detailed study, however, shows that though the energy or wavelength of all laser
photons is very nearly same, but there is some small spread in the wavelength (or
400 8 Laser Technology and Its Applications
frequency) of laser photons also. There are three main reasons for this small spread
in wavelength of laser photons.
(a) Natural line width
Though it is said that quantum states E 2 and E 1 have discrete energies, but these
energies have some inherent uncertainties. These uncertainties in energy of excited
state arise from another fundamental law of quantum mechanics called the uncer-
tainty principle. According to the uncertainty principle, the uncertainty ΔE in energy
E of a quantum mechanical state and its mean life τ are related by the relation;
Here, h is Planck’s constant. It follows from Eq. (8.9) that ΔE will be zero only when
mean life τ is infinite. Since the mean lives of atomic and molecular excited states
are not infinite, there is always some uncertainties ΔE 2 and ΔE 1 associated with
the energies of the upper and lower lasing levels. These uncertainties are referred as
natural line widths of energy levels. Uncertainties in energies of the upper and the
lower lasing levels produce spread in the energy E p of laser photons;
E p = (E 2 ∓ ΔE 2 ) − (E 1 ∓ ΔE 1 ) = (E 2 − E 1 ) ∓ ΔE pNat (8.10)
Spread ΔE pNat in laser photon energies is often called natural spread or natural
width of energy level.
(b) Doppler broadening
It is a common experience that the frequency of the whistle sound emitted by a
moving train appears changed when the train approaches the observer and when it
moves away from the observer. This change in frequency is called Doppler effect. A
similar change in frequency (or wavelength or energy) of laser photons occurs when
they are emitted by moving atoms/molecules in upper lasing state E 2 . According to
the kinetic theory of matter, any object at a temperature T > 0 K, has some kinetic
energy, as such atoms/molecules in upper lasing level E 2 are in state of random
motion and the frequency of the laser photon they will emit will be different if the
emitting atom/molecule is moving in the direction of the emitted photon or opposite
Dop
to it. Therefore, another uncertainty ΔE p in the energy of laser photons, called
Doppler uncertainty, is introduced by the random thermal motion of emitting atom/
molecule. Of course Doppler uncertainty will depend on the temperature of the laser
source.
(c) Recoil broadening
A photon of frequency ν carries a linear momentum P = hν c
. Let us assume that
an atom/molecule in upper level E 2 is stationary and emits a photon of frequency ν.
Since emitted photon has carried a linear momentum P, the emitting atom/molecule
must recoil back to conserve the linear momentum. The recoil of the atom will give
8.6 Special Characteristics of Laser Light 401
atom some velocity, say v and energy E atom = 1/2M atom v 2 . The question is who will
supply this energy E atom to the recoiling atom? The answer is the emitted photon. As
a matter of fact the photon will not be emitted with frequency ν, it will be emitted
with some lower frequency ν ' so that the energy difference h(ν − ν ' ) may take care
of atomic recoil. Actually, at some temperature T > 0 K, atoms/molecules in state
E 2 are not stationary but in random motion, different atoms/molecules will recoil
by different amounts and this will introduce additional spread ΔE pRec in the emitted
laser photons.
In some special cases, two or three groups of monoenergetic laser light photons, each
with its own energy uncertainty, may be emitted from the same system.
Let us consider an atomic/ion/molecular system that has energy level structure
as shown in Fig. 8.15a. Atoms/ions of the system may be pumped from the ground
state E g by some suitable pumping mechanism to the excited state of energy E 5 ,
which spontaneously decays to the metastable state E 4 . Since E 4 is a metastable state
of comparatively long mean life, spontaneous decay of state E 5 accumulates large
402 8 Laser Technology and Its Applications
hc hc hc
λ1 = ; λ2 = ; λ3 = (8.11)
(E 4 − E 3 ) (E 4 − E 2 ) (E 4 − E 1 )
The intensity of each laser group will depend on its branching ratio, and each
laser light will have its own wavelength uncertainty, ∓Δλ1 , ∓Δλ2 and ∓ Δλ3 ,
respectively. It is possible to select any one of the three laser lights by manipulating
or adding suitable devices in optical resonator. This way of selecting a particular
laser light of a given wavelength out of several groups (three in the present example)
is technically called tuning to the particular frequency or wavelength.
Figure 8.15b shows another energy level scheme of some atomic/ionic or molec-
ular system, which has got two metastable states E 4 and E 3 which may be fed by
the spontaneous decay of the higher energy state E 5 . The two metastable states
may decay, respectively, to states E 2 and E 1 . Appropriate pumping mechanism may
continuously populate state E 5 which in turn feeds metastable states to produce the
state of population inversion between E 4 and E 2 and E 3 and E 1 . Photons produced by
the spontaneous de-excitation of metastable states may initiate laser action resulting
in two groups of lasers with wave lengths λ1 ∓ Δλ1 and λ2 ∓ Δλ2 .
(ii) Coherence
In lasing action almost all excited atoms/molecules in the upper lasing level E 2
undergo stimulated de-excitation at the same instant releasing large number of
photons. Each of these photons is an electromagnetic wave; the special property
of these electromagnetic waves is that they are all in phase and are moving in same
direction. In-phase means that the troughs and crests of all these electromagnetic
waves occur at the same instant, i.e.; the crests and troughs of different electromag-
netic waves fall on each other. The large number of in-phase electromagnetic waves
moving in the active medium in same direction interfere constructively giving rise
to a single light wave of very large amplitude. Since the intensity of a light wave is
proportional to the square of the amplitude, the constructive interference of photon
waves results in a very high intensity light beam.
(iii) Directionality
A laser source emits light only in one direction, in contrast to other light sources,
like electric bulb, light-emitting diode or tube light which emit light in all directions.
Resonance or optical cavity associated with a laser source is responsible for the
property of directionality of laser light. For example, in case of the optical cavity
that has two plain mirrors, one fully and one partially reflecting, held parallel to each
other; the direction of the laser light is defined by the direction of the normal to the
two mirrors, from full to partially reflecting mirror.
Generally the intensity profile of a laser beam has Gaussian distribution; intensity
is maximum, say I 0 at the centre of the beam and it falls off rapidly towards the edges
as shown in Fig. 8.16. The points where the intensity drops to 1/e of its central value
are called outer edge points. Distance 2w between the outer edge points is referred as
the minimum beam spot size. Directionality of laser beam is often defined in terms
of the full angle divergence ϕ, given as;
1.27λ
ϕ= (8.12)
2w
Here λ is the wavelength of the laser light
In a practical laser source light comes out from an aperture of diameter ‘d’ (which
is different from 2w) and spreads out because of diffraction. Assuming laser light to
be a plane wavefront, it may be shown that the light propagates as a parallel beam
2
for a distance of dλ , called the Rayleigh’s range, and there after it spreads because
of diffraction. The angular spread of the laser beam due to diffraction Δθ is given
by,
λ
Δθ = (≈ 10−5 or 10−6 rad) (8.13)
d
In case of laser the angular spread Δθ is generally less than 0.01 mrad; meaning
there by that a laser beam spreads less than 0.01 mm for every 1 m of its propagation
distance. In contrast, the angular spread for light from an ordinary source may be
1 m in travel path of 1 m or more.
When light diverges from a point source (see Fig. 8.17) and r 1 and r 2 are the radii
of the beam at distances d 1 and d 2 , respectively, the angle of beam divergence in
radians is given as,
(r2 − r1 )
ϕ= (8.13a)
(d2 − d1 )
If ϕ is the angle of beam divergence of a laser beam then the area of beam spread
Aspr at a distance D from the source is given by,
(iv) Irradiance
The irradiance (power per unit area) of a typical laser source is far greater than other
sources of light. It is largely due to the directionality and compactness of the laser
beam. One may take the example of an ordinary bulb of 10 W power, which spreads
its light uniformly in all directions, the irradiance I bulb at a distance of 1 m from the
bulb will Ibulb = 4π.1 2 m2 = 0.8 W/m . On the other hand, a typical laser light of
10 W 2
(v) Focusability
An ordinary source of light must have an appreciable transverse extent so as to
produce a significant amount of light energy. The images of these non-laser sources
formed by lenses and mirrors have finite size which may be determined from the
laws of geometrical optics. The amount of light (energy) at the image position is
determined by the amount of light from the source intercepted by the lens or mirror.
As a result, in case of ordinary light sources large amount of light energy cannot be
focused at a small spot. In case of laser source, on the other hand, the transverse extent
of a laser source is very small which allows the lens or mirror to intercept almost all of
the power of the laser beam and focus it at a very small spot at image point. Moreover,
8.7 Classification of Laser Sources 405
because of the high directionality of laser beam, laser beam behaves as a bundle of
parallel light rays coming from a point source at infinity. As a consequence almost
total laser beam power gets focused at a very small spot at image position. Basically,
the size of the image spot is mostly governed by diffraction and lens aberrations and
can be as small as the wavelength of laser light. Property of focusability makes it
possible to deposit large amount of laser beam energy in a very small area to drill fine
holes in thick and hard sheets of matter. Because of the special property of focusing
in spots of the size of light wavelength, laser beam is used in ocular surgery where
target irradiance of ≈ 109 to 1012 W/cm2 is required.
Above-mentioned special characteristics, monochromaticity, coherence and
directionality, of laser light makes it extremely useful but also more hazardous.
One should never look directly on a laser beam because the highly collimated beam
may focus to nearly a microscopic dot on the retina of the eye, causing almost instant
damage to the retina. Coherence of laser light is important for observing interfer-
ence effects that has important applications in precision measurement of distances.
The branch of optics, interferometry, uses superimposition of coherent light to make
extremely fine measurements of very small distances (of the order of the wavelength
of light), surface irregularities or changes of refractive index. High intensity of laser
light, an outcome of coherence, deposits large amount of (heat) energy in a very
small area hit by the beam. The output power of a laser source may vary from a
few thousandths of a watt (in case of laser pointers) to several thousand watts. Laser
beams may be used as a cutting tool for thick metallic sheets or for welding of
metallic components. Laser light is also finding applications in medical field where
laser sources are used in equipment used for endoscopy and other surgical operations
including treating cancer cells.
SAQ: Which factor(s) determine the intensity of image formed by lens or mirror?
Why images formed using laser sources are very bright?
In the following we will discuss laser sources according to the classification based
on the physical state of the gain medium or active medium.
As the name suggests, laser sources in which the active medium is a solid material
are included in this class. Solid state lasers may be further divided into two types: (i)
doped insulator rod type and (ii) semiconductor diode laser.
Most of solid state laser sources use a rod of some crystalline insulating material
called the host which is doped with some ions, called the active ion. The first laser
ever made was a solid state laser which used a synthetic ruby (aluminium oxide
crystal) as host and chromium as an active species. Later, other solid state lasers
using sapphire as host and titanium atom as active species, etc. were also developed.
Generally, solid state lasers are named by prefixing the chemical symbol of the
active specie followed by the name of the host crystal, for example, Cr.ruby laser or
Ti.sapphire laser, Nd.YAG (Neodymium. Yttrium Aluminium Garnet, Y3 Al5 O12 )
laser, etc.
Commonly used host materials are, Y3 Al5 O12 (yttrium aluminium garnet, YAG),
Al2 O3 (sapphire), YVO4 (Yttrium orthovanadate),Y3 Sc2 Al3 O12 , Al2 BeO4 (alexan-
drite), Mg2 SiO4 (forsterite), etc. Silicate and phosphate glasses are also used as host
materials.
Light-emitting atoms, called active species, are embedded in the host rod. Active
ions may be of (a) transition metals like, Fe2 + , CO2 + , V2 + , Cr3 + , Ti3 + , Ni2 + , etc.;
(b) of rare earths like, Nd3 + , Pr3 + , Sm2 + , Pm3 + , Eu2,3 + , Dy3 + , Tb3 + , Ho3 + , Yb3 + ,
etc. and actinides such as U3 + .
The layout of most of the solid state laser sources is similar. Host crystal with
doped active ions is taken in the form of a rod, the dimensions of which may vary
from few cm to few tens of cm in length and a fraction of cm to few cm in diameter.
In most cases optical pumping is used to achieve population inversion. Since atomic/
molecular excited states in solids have band structure, optical pumping with a light
source that has broad spectrum covering the absorption band of laser material is most
efficient way of pumping. Both pulsed and continuous light sources may be used for
optical pumping, however, in case of continuous light source unabsorbed light energy
increases the temperature of the laser tube. Therefore, in such cases cooling of laser
cell is required. On the other hand pulsed light sources that produce flashes of light
of few micro- to millisecond duration with a gap of about the same duration are often
used to reduce heating problem. The active ion implanted host rod and the source of
pumping light/flash lamp(s) are generally kept in an elliptical glass tube, the inside
of which is polished for concentration of the light emitted by the flash lamp on to
the laser rod by reflection from polished surfaces. Several different arrangements of
8.7 Classification of Laser Sources 407
mirrors of different curvatures, including parabolic, etc. are used to focus flash light
on the host material rod for maximum pumping speed (see Fig. 8.18). Laser sources
that use flash light are inherently pulsed laser sources as pumping and population
inversion occurs only for the duration of the flash light. The choice of flash lamp
depends on the energy of the active ion state(s) that are required to be populated
by pumping and, therefore, on the level scheme of the active ion. Halogen lamps
and high-pressure mercury discharge lamps are generally used as continuous light
sources for pumping, while low-pressure quartz or glass-sealed xenon or krypton
lamps are used as pulsed light sources.
Figure 8.19a shows a typical circuit that may be used to produce flashes of light.
As shown in the figure, energy storing capacitor is charged through a resistance R
to the maximum value V (few kV) of power supply voltage. A coil of few turns
called triggering coil, wrapped round the flash tube may be energised periodically
by a RF power supply. Triggering coil when energised produces ionisation of some
gas molecules in flash tube, thus reducing the electric resistance of the tube. Once
electric resistance of flash tube is reduced, the high voltage across the energy storing
capacitor produces an electric discharge in the flash tube emitting light. The duration
of the discharge is governed by the time constant of the LCR1 circuit, where R1 is
the equivalent resistance of the flash tube. The off-time of the flash lamp is governed
by the charging time of the capacitor to the power supply voltage through resistance
Fig. 8.18 Different geometrical arrangements of flash lamps and host laser rod in solid state lasers
408 8 Laser Technology and Its Applications
R. Sometimes the flash tube is made in the form of a helix, and the lasing-doped
insulator rod is placed inside the helix for efficient pumping (see Fig. 8.19b). The
purpose of surrounding the light source and the lasing rod with mirror system and
focusing the light on rod is, maximise the flux of emitted light on the rod so that large
number of ions from lower energy state(s) may be pumped to higher energy state(s)
for population inversion.
Overall efficiency is an important parameter of a laser system. Overall efficiency is
defined as the ratio of the total power required to pump the laser to the optical output
power of the system. Typical efficiencies range from fractions of percent to 25%
or more. Many important laser sources have efficiencies as small as 0.05%, in such
cases the difference of between the input power and the output power is converted
into heat. Most laser sources, therefore, have a cooling system associated with them.
Laser systems with solid gain medium are typically cooled by surrounding the gain
medium in a cooling jacket; water or oil flows through the jacket to remove heat.
Low power sources may be cooled by forced air cooling, and very low power systems
may not need any cooling at all. Further details of Cr.ruby and Nd.YAG laser sources
are given in the following.
(a) Cr.ruby laser source
First ever successful laser source was made by Theodore Maiman on 16 May 1960
at Hughes Research Laboratories in California using a chromium doped synthetic
ruby rod. Present-day ruby laser has undergone several improvements since its initial
stage.
A rod of synthetic ruby crystal is made by doping a small amount (0.05% by
weight) of chromium oxide (Cr2 O3 ) in aluminium oxide (Al2 O3 ) so that some of
aluminium ions Al3 + are replaced by chromium ions Cr3 + that gives the crystal
8.7 Classification of Laser Sources 409
pinkish red colour. Aluminium ions act only as host, and the actual laser light is
emitted by laser action taking place in the excited states of chromium ions. The
length of the ruby rod in laser sources may vary from 2 to 40 cm, and diameter from
0.5 to 2 cm, depending on the power output of the source. The two ends of the ruby
rod are either made flat and polished or two plane mirrors, one totally reflecting and
the other partially reflecting are put at the two ends, mirrors being held parallel to
each other and normal to the axis of the ruby rod, making the optical cavity. Often
a spring-controlled by a microscrew is attached to the fully reflecting mirror, so that
fine rotation of the microscrew may tilt in small steps the orientation of the mirror
for fine tuning of the laser wavelength.
Figure 8.20 shows the layout of a ruby laser. Optical pumping is employed in this
laser source using a xenon flash lamp, which may ether be placed near the ruby rod
or may be wound round it as a helical-shaped glass tube filled with xenon. Flashes
of light are produced when the flash lamp gas is excited by an RF oscillator powered
by a power supply. Xenon flash lamp generally emits bursts of blue-green light
(450–600 nm) of durations of few milliseconds. Parabolic mirrors are kept around
the ruby rod to focus flash light on the rod for optical pumping. Since flash light
used for optical pumping is produced in short duration pulses, the laser output of
the ruby source is not continuous but it is pulsed. Laser light is produced in pulses
of short durations of milliseconds one after the other. The energy band structure of
Cr3 + ion is shown in Fig. 8.21, where it may be seen that optical pumping using
blue-green light of xenon flash lamp may excite chromium ions from the ground
state E g to excited state bands E 3 and E 4 . Both these energy bands have very short
mean lives, respectively, ≈ 10−8 and 10−9 s and de-excite rapidly through radiation
less transition to the metastable state E 2 . The mean life of metastable band is around
10−3 s, which is much larger than the mean lives of states E 3 and E 4 . Continuous
pumping of chromium ions from the ground state to states E 3 and E 4 and the rapid
decay of these states to the metastable state E 2 , establishes population inversion
between state E 2 and the ground state E g and also between the metastable state E 2
and the excited state E 1 . Since state E 2 undergoes spontaneous photon decay mostly
to the ground state E g and only partially to state E 1 , photons emitted in spontaneous
decay initiate laser action between corresponding levels. Two groups of laser light
with wavelengths 694.3 nm and 692.7 nm are emitted (in form of bursts of laser
lights of millisecond duration) from the ruby source. The intensity of 694.3 nm laser
light is much higher than that of 692.7 nm laser. The difference in wavelengths of the
two laser groups is very small and the intensity of higher wavelength laser is much
larger; therefore, often ruby laser is treated as a single wavelength laser source.
Ruby laser is a low power and generally a pulsed laser source of red light. It
often requires a cooling system also. As such it is bulky. Initially it was used in laser
printers, etc. but now with the availability of semiconductor diode lasers which are
very small and handy, ruby laser is not in much use.
(b) Nd.YAG laser source
Neodymium YAG consists of Yttrium Aluminium Garnet (Y3 Al5 O12 ) in which some
of the Y3 + ions are substituted by Nd3 + ions. Doping level of the YAG rod is around
410 8 Laser Technology and Its Applications
(0.72%) that corresponds about 1.4 × 1026 Nd atoms per metre3 . In a typical Nd.YAG
source the cylindrical YAG rod length is around 10 cm and diameter around 12 mm.
General layout of the ND.YAG laser source is similar to the Cr.ruby source, except
that in this source optical pumping is done by a Krypton flash lamp, Krypton light
contains the bands of 700 nm and 800 nm which cover the absorption bands of Nd
ion.
Neodymium is a rear earth element and electron configuration of the ion is as
given below,
Nd3 + = 1s 2 2s 2 2 p 6 3s 2 3 p 6 3d 10 4s 2 4 p 6 4d 10 4 f 3 5s 2 5 p 6
8.7 Classification of Laser Sources 411
Nd ion may be doped in other host materials also, like in yttrium lithium fluo-
ride (YLF) which emit laser light of 1047 and 1053 nm wavelengths; yttrium
orthovanadate (YVO4) that produces laser light of 1064 nm and glass.
Nd.YAG laser is one of the most used lasers; it is used in ophthalmology to
correct vision disorders, in ablation of malignant and benign lesion in different body
parts, and in cosmetic surgery, etc. It also has wide applications in industry, live
engraving, etching, laser cold peening, cutting and drilling holes in metal sheets,
in nonconventional rapid prototyping process called laser engineered net shaping
(LENS), etc.
(ii) Solid state semiconductor diode laser source
Semiconductor diode lasers, also called quantum well laser, are not much different
from a light-emitting diode (LED). The two important differences between the LED
and diode laser are: (i) diode lasers use some direct semiconductor, like GaAs and
have heavily doped p- and n-sides, while in LED the doping levels are not so high.
(ii) Laser diode operates at high forward current, larger than a threshold value, while
LED operates at lower forward currents. Emission of monoenergetic photons both
in LED and the laser diode is through the process of electron–hole recombination;
however, the process of recombination is spontaneous in LED while it is simulated
emission in case of laser diode.
Laser diodes are fabricated using direct semiconductor material like, GaAs and
doping the n- and p-sides heavily so that they are represented as n+ and p+ . There can
be two types of laser diodes: (a) homojunction, where the material of the n-p junction
is same and (b) heterojunction, where two different semiconductor materials are
used to fabricate the pn junction. However, the working of the two types of laser
diodes is similar. Let us first study the working of a homojunction laser diode. A
homojunction laser diode is made by heavily doping the two sides of a single crystal
of a direct semiconductor, like GaAs, with p- and n-type impurities with a very thin
(≈ 1 µm) depletion layer.
Figure 8.23a shows the conduction and valence bands of a heavily doped pn
junction when no external bias is applied to the junction. In case of a pn junction where
both sides are heavily doped, Fermi level passes through the conduction band on n-
side and through the valence band on the p-side, thus developing a potential barrier
across the junction. As a result of the potential barrier majority carrier electrons from
the n-side and holes from the p-side could not cross from one side to the other. Forward
bias voltage, when applied across the junction, opposes the inbuilt potential barrier
across the depletion layer and pushes majority charge carriers across the junction (see
Fig. 8.23b). Electrons and holes crossing the thin depletion layer recombine emitting
monochromatic light in case of ordinary LED. In case of LED recombination process
is spontaneous and light photons are emitted in different directions with random
phases. However, on increasing forward current to a higher value, larger than the
threshold current, large number of majority carriers from both, the n- and the
p-sides get accumulated in a small region around the physical junction, creating
population inversion. A photon of spontaneous recombination may work as the seed
photon to initiate laser action. Later, laser photons initiate subsequent stimulated
8.7 Classification of Laser Sources 413
photon emissions, providing a gain factor to the lasing medium, which in this case
is the depletion layer. Almost two-dimensional depletion layer also serves as optical
cavity to develop sustained laser oscillations and amplify the laser yield. However,
no external mirrors to increase the path length for sustained laser oscillations are
required in this case. Out of the six sides, four sides of the depletion layer (the
optical cavity) are exposed to the air of refractive index 1, while the refractive index
of GaAS for laser light is around 3.8. Hence the reflectivity R of the depletion layer–
air interface is R = (3.6−1)
2
(3.6+1)2
= 0.3 which is reasonable for laser light to get reflected
back and forth between the two edges of the depletion layer cavity. Out of the four
air-exposed faces of the depletion layer, two opposite faces are cleaved and their
parallelism is ensured. The other two faces are left rough. Laser oscillations build
up between the two parallel cleaved faces and are taken out from one of them as
indicated in Fig. 8.24.
When forward current is low, gain of resonance optical cavity gets compensated
by the losses and, therefore, the overall medium gain remains below the critical
gain value. Increasing forward current above the value called threshold current, the
medium gain becomes larger than the critical gain giving rise to the build-up of laser
oscillations in the cavity.
Fig. 8.23 Conduction and valence bands of a heavily doped, a unbiased pn junction, b forward
bias pn junction
414 8 Laser Technology and Its Applications
Dye lasers are typical example of liquid state lasers. An organic dye is used
as the active medium in a dye laser. Frequently used organic dyes for liquid
lasers are Rodamine 6G, (Xanthene), Anthracenes, Oxazines, Coumarin, DCM (4-
dicyanomethylene-2-methyl-6-pdimethaylaminostyryl-4H pyran), etc. These dyes
are dissolved in appropriate solvent, and the solution acts as the gain medium. The
dye solution is optically pumped and the dye molecule absorbs a band of wave-
lengths of average energy E, from the incident light of the flash lamp. The excited
dye molecules revert back to the ground state, emitting a band, called fluorescence
band, of average energy E 1 which is less than the energy E of absorption band. The
difference of energy (E − E 1 ) is generally dissipated in non-radiative collisions. Dye
molecules are generally long-chain big molecules, and only a part of the molecule
takes part in excitation and de-excitation.
Energy band structure of liquid dyes is classified in terms of singlet S and triplet T
bands. S-bands that correspond to angular momentum state = 0, and triplet bands
corresponding to angular momentum state = 1; have stacks of bands of increasing
energy, separated from each other with certain energy gaps, as shown in Fig. 8.26.
Each singlet and triplet band has a cluster of closely spaced vibrational and rotational
levels.
A general property of s-bands is that higher levels in a given s-band decay to the
lowest energy level of the band by non-radiative de-excitation in a very short time of
the order of 10−11 s. For example if levels of band s1 are excited by optical pumping
(by shining light of appropriate frequency), all levels from b to B will get excited but
within a time span of 10−11 s all higher levels up to B will revert back to the lowest
416 8 Laser Technology and Its Applications
level b. It is also observed that the lowest level of each s-band is relatively longer
lived as compared to the higher levels. In case of band s1 of the figure, the mean life
of the lowest level ‘b’ is around 10−5 s, an order of 106 longer than the mean lives of
higher levels of the bands. In a way, the lowest level of a s-band may be considered
like a metastable state (though it is not a metastable state since its life time is still
very small of the order of 10−5 s).
In normal case, when the dye solution is irradiated with a source of light of
appropriate energy, molecules in ground state of band s0 absorb energy from the
incident light and shift to various levels of band s1 . Excited molecules in levels above
‘b’ quickly lose extra energy by collisions, etc. and de-excite by non-radiative mode
to reach level ‘b’ in 10−11 s. Once most of the excited molecules get accumulated in
level ‘b’, they decay by emitting a bunch of photons to level A, at the head of band s0
in around 10−5 s. This emission of photon bunch from b to A is called fluorescence.
Fluorescence photons are incoherent, and their mean frequency is less than the mean
frequency of the photons absorbed from the incident light. That means that if the
system is irradiated, say by blue light then fluorescence light may be of red colour.
In fluorescence emission time delay between absorption and re-emission of light is
very small, generally less than a few microseconds, however, if the time delay is
measurable, say of the order of few seconds or larger, the process of light emission
is termed as phosphorescence.
Continuous optical pumping by appropriate flash lamp increases the population of
excited molecules on level ‘b’ at the cost of ground state level ‘a’ leading to population
inversion. A fluorescence photon may work as seed to initiate laser emission in
the early stages and the triggering is taken over by laser photons later on. In dye
laser, laser photons are made to circulate in a resonance cavity made by putting two
parallel mirrors, one fully reflecting and other partially reflecting, at the two ends
of the tube containing the solution of the dye. Dye solution works as gain/active
medium. If the medium gain is larger than the critical gain, laser oscillations build up
in the cavity and strong laser beam emerges from the partially reflecting mirror. Dye
8.7 Classification of Laser Sources 417
laser source consists of a quarts or glass tube (≈ 10 cm, ≈ 0.5 cm diameter) with
dye solution. The solution tube is converted into a resonance cavity by putting two
parallel mirrors at the two ends of the tube. The cavity tube is placed at one focus
of an elliptical reflecting envelop at the other focus of which a flash lamp is placed.
Elliptical envelop concentrates the light emitted by the flash lamp on the cavity tube.
Flash light is operated by the usual electronic circuits that generate flashes of light of
few milliseconds each. Laser light beam emerges from the partially polished mirror.
The frequency (or wavelength) spread of dye laser is considerably larger than solid
state lasers. This results in the production of large number of longitudinal modes
in cavity. Any of these modes may be selected by frequency selection components
associated with the cavity. Dye lasers are, therefore, tuneable to different frequencies.
This is the big advantage of a dye laser.
Dye lasers are always used in pulsed mode. The reason for this is the energy gap
(E 1 ) between the head of T1 and the bottom of T2 bands. In most dyes, this energy
difference is of the same order as the energy of photons emitted in laser beam. If there
are sizable numbers of excited molecules in state T1 , then large number of emitted
laser photons may be absorbed by molecules in state T2 and output laser flux may
die out. As may be observed in Fig. 8.26, T1 band is fed by the decay of level ‘b’
with a mean life of around 5 s, i.e. the feeding transition is very slow. It means that
if laser is produced in pulses of short durations, so that population of level ‘b’ is not
allowed to grow for long time, there will never be enough molecules in state T1 and
the chance of absorption of laser photons will be eliminated. This is the reason why
dye lasers are always used in pulsed mode.
The output wavelength of dye lasers generally varies from 390 to 1000 nm and
output power may be between milli-Watt to 1 W. Typical beam diameter may be
0.5 mm with beam divergence from 0.8 to 2 mrad.
The main advantage of dye laser is that laser beams of any wavelength ranging
from infrared, visible, and up to near ultraviolet may be produced using a dye laser,
that is why they are called tuneable lasers. Dye lasers have high efficiency ≈ 25%,
small divergence, small beam spot and high power. However, they are always pulsed
source.
Gas lasers have some gas as the active medium. Gas lasers, depending on what specie
produces lasing action, may be divided into three types: (i) atomic, (ii) molecular
and (iii) ionic.
(i) Atomic gas laser
Helium–neon laser is the best example of atomic gas laser. It was the first continuous
wave (CW) laser built by Ali Javan, Bennett and Horriot at Bell Telephone Lab in
1961. In He–Ne laser source a mixture of He and Ne gases in the ratio 10:1 is taken
418 8 Laser Technology and Its Applications
ten of cm to 1 m. Two electrodes, one at each end of the discharge tube, are used to
produce gas discharge in the tube. Since CO2 laser source may deliver large power,
it also consumes lots of electrical energy, producing considerable amount of heat.
Both N2 and CO2 gases in discharge tube take part in lasing action, while helium
simply helps in heat dissipation from the interior of the discharge tube to its walls
and to some extent in de-excitation of lower vibrational levels (020 and 010) of CO2
molecules. Closed discharge tube is wrapped with coils through which chilled water
is circulated to remove heat from the tube walls. In gas flow-type source, mixture
of N2 and CO2 gases is made to flow through the discharge tube at a constant rate,
outgoing gases takeaway heat produced in the discharge tube and hence no helium
is required in the gas flow type source.
The gas mixture in discharge tube also serves as the gain medium, and the
discharge tube is converted into a resonance cavity by putting two mirrors, one
partially polished, parallel to each other and normal to the tube axis. Since glass
absorbs laser light emitted by CO2 molecule, reflecting mirrors and other equipment
used inside the discharge tube are made from materials like Ge, GaAs, ZnS, etc.
which are transparent to laser radiations.
Working of the CO2 laser source may be easily followed by looking at Fig 8.28b.
Once discharge is produce in the tube by applying large voltage between the two elec-
trodes, there is large number of free electrons accelerated under the high electric field
of applied voltage. Energy of electrons in discharge tube is much larger than 0.3 eV,
which is the energy of the first vibrational excitation level of N2 molecule. Collisions
between energetic electrons and N2 molecules excite large number of molecules to
the first excited state. The first vibrational excited state of N2 is metastable; there-
fore, excited N2 molecules live long enough to collide with ground state molecules of
CO2 and transfer their excitation energy to the CO2 molecules. CO2 molecules also
have vibrational excited state (003) (asymmetric vibration mode at energy 0.3 eV)
very near to the in energy to the first vibrational state of N2 . Thus collisions of N2
molecules with accelerated electrons excite them to their first vibrational level and
collisions between excited N2 molecules with ground state molecules of CO2 , excite
CO2 molecules to their vibrational level 003. There are two vibrational levels 200
and 020 of CO2 at 0.2 eV energy, which are almost empty. Continuous pumping (by
N2 molecules) of CO2 molecules from ground state to 003 level establishes popu-
lation inversion between levels 003 and 200 and between levels 003 and 020. Since
molecules in level 200 lose their energy by collision with molecules in level 020, and
level 020 readily depletes through diffusion and in closed tube source in collision
with Helium molecules, the state of population inversion is maintained. CO2 laser
source produces two laser lights of wavelengths 9600 nm (9.6 µm) and 10,600 nm
(10.6 µm).
The main advantages of CO2 laser are simple in construction, both CW and pulsed
outputs, very high power output, high efficiency.
CO2 laser is extensively used in treating health problems related to gynaecology,
genitourinary, dental, orthopaedic, hepatic and cardiovascular surgery. It is consid-
ered to be the mainstay of laser neurosurgery. They are used for cutting, dissection
and coagulation of wide range of tissues.
8.7 Classification of Laser Sources 421
An argon ion laser source is made by filling natural argon gas in a tube of some ceramic
material like beryllium oxide, that is opaque to laser radiations and can withstand
high temperatures. The tube is fitted with two hollow electrodes to produce plasma
discharge. The density of Ar+ ions in the plasma is high, and to further increase
it, a solenoid magnetic coil is wrapped around the plasma tube which confines the
plasma and fast-moving electrons in the central part along the axis of the tube. A
typical source may have plasma tube up to 1 m long that may generate laser with
output power of 2–5 W consuming several tens of kilowatt input power. The efficiency
of the source is very poor, and chilled water cooling is required to dissipate the heat
developed in the source. The plasma tube is fitted with Brewster windows at the
two ends, as ordinary glass absorbs the plasma light and also high temperature is
produced around the tube. Two parallel mirrors, one partially transparent are put
after windows normal to the plasma tube axis, as shown in Fig. 8.29. A prism is
inserted between Brewster window and the totally reflecting mirror to tune laser rays
of different wavelengths.
The level scheme of excited states of Ar ion and pumping of excited-ion levels by
collision of high-energy electrons with argon ions is shown in Fig. 8.30. Continuous
pumping of excited states by electron collisions establishes population inversion
between several excited states, resulting in the emission of groups of lasers. Some
prominent laser groups are shown in Fig. 8.30. The source may produce laser beams
of as many as 35 different frequencies; however, the most prominent laser beam has
wavelength of 514.5 nm.
Fig. 8.29 Layout of an argon ion laser source. Magnetic solenoid and cooling system are not shown
in the figure
422 8 Laser Technology and Its Applications
There are some molecules, like ArF, KrF, XeCl, etc. that are stable in their first excited
states but dissociate in their ground states. Such molecules are called excimers.
Population inversion in such molecules is easy to obtain as in natural way the number
of molecules in ground state are negligible as compared to those in the first excited
state.
A typical excimer laser source may be made by filling an inert gas like argon and
some halide like F2 in a discharge tube fitted with electrodes. High voltage applied
across the two electrodes set in discharge in the tube producing Ar+ positive and
F− negative ions which combine forming ArF* molecule in its first excited state.
Excited ArF* molecules revert to the dissociative ground state initiating laser action.
The efficiency of excimer lasers is of the order of 20% and they emit laser radiations
in the wavelength range of 120–500 nm with peak power of around 200 W.
exceeds resonator losses. Mostly the above condition is fulfilled for several longi-
tudinal (or axial) modes of closely spaced wavelengths/frequencies which produce
stationary oscillations in the resonator cavity. The laser output in such cases consists
of laser radiations of closely spaced wavelengths/frequencies with decreasing ampli-
tudes (see Fig. 8.12c). Let us assume that in a typical case there are N axial modes
that produce sustained oscillations in a resonator of length L. Now all the N-modes,
in general, will have different amplitudes An , angular frequencies ωn and phases δn ,
where n varies from 0 to N. Since all the three parameters are functions of time,
therefore, in general modes will be incoherent. The total output (amplitude) of such
a laser will be a linear combination of different modes and will be given by
∑
N
A(t) = (A)n ei(ωn t+δn ) (8.14)
0
If there is nothing that fixes the three parameters, amplitudes, frequency and
relative phases of the N different modes, then the output amplitude A(t) will vary in
an uncontrolled way. However, if the different modes are forced to maintain equal
frequency spacings with a fixed phase relationship to each other, the output with
time will vary in a well-defined manner. The laser is then said to be mode-locked
or phased-locked. The form of the output will depend on which axial modes are
oscillating and what phase relationship is maintained. It is possible to obtain: an
FM modulated output, a continuous pulse train, a spacially scanning laser beam or
a ‘machine gun’ output where pulses of laser light appear periodically at different
spacial positions on the laser output mirror. Mode locking is often used to produce
short-duration, high-intensity bursts of laser radiations at a given repetition rate.
There are several ways of doing mode locking; however, details of that are beyond
the scope of the present discussion.
8.7.6 Q-Switching
It is difficult to give an exhaustive list all laser applications; however, some important
laser applications are discussed here.
0. As such when the laser beam etches a bump a zero is stored on the master disc.
The flat unburned (or unetched) plastic surface represents binary number 1. The flat
unetched area representing 1 is called land. Thus desired digital data is stored on the
master disc in the form of pits and lands. Once the master disc is made it is used to
stamp millions of plastic duplicates—the CD’s sold in the market. After stamping
each disc is coated with a reflecting aluminium layer and covered with a protective
polycarbonate layer.
Laser light is used to read the data from a CD. A thin laser beam scans the CD
surface and gets reflected from the pits and lands covered with reflecting aluminium.
The intensity of the back reflected laser light is different for reflection from a pit and
a land. A photodiode records these variations in the intensity of back reflected laser
light and the data on intensity variation is converted into 0 and 1.
Military uses of lasers include target designation and ranging, defensive counter
measures, directed energy weapons and communication, etc.
A low power laser pointer is used to indicate a target for a precision-guided
munition, generally lunched from an aircraft. The guided munition adjusts its flight-
path to home on the laser light reflected from the target, enabling a great precision
in aiming. The target designator laser beam is pulsed at a typical rate that is sensed
by the munition to avoid any confusion with other laser beams present in the area.
Powerful beams of laser light (more than 100-kW power) have been used to
destroy approaching ballistic missiles. Laser beams have also been used to destroy
land mines and unexploded explosives scattered in the war zone. Laser beams have
been used for counterdefensive measures; low-power infrared countermeasures use
lasers to confuse the heat seeker anti-aircraft missiles while high power boost-phase
interceptors use lasers to find, track and destroy ballistic missiles. Some weapons
simply use a laser to disorient enemy army personals. Lasers have also been used
as a tool to enhance the targeting of other weapon systems, for example, a machine
gun or a rifle barrel is fitted with a laser torch that emit a fine beam of visible laser
light parallel to the barrel that may travel up to few kilometres. The laser light spot
428 8 Laser Technology and Its Applications
may be aligned with the target for better aim, taking into account wind direction and
speed and trajectory of the fired bullet/shell.
(d) Industrial and commercial applications
Depending on the power of the laser source, industrial applications of laser may be
divided into two types; material processing and micromaterial processing. In material
processing laser sources of more than 1 kW power are used. Laser sources in the
range of 100–300 W are primarily used for pumping, plastic welding and soldering
applications. In applications like, metallic sheet cutting, brazing, metal welding etc.
laser sources with power larger than 300 W are employed. Multiple kilowatt lasers are
used for hardening, deep penetrating welding, cladding etc. Lasers are also used for
micromaterial processing, like fabricating screens for smart phones, table computers,
and LED TVs.
Lasers are used for optical communication over optical fiber or in free space, in
guidance systems like laser gyroscopes, barcode readers, laser engraving of printing
plates, writing subtitles on motion picture films, in consumer and industrial imaging
instruments, etc.
(e) Medical applications
Lasers are now extensively used for cosmetic surgery, like removing of tattoos, scars,
stretch marks, sunspots, wrinkles, birth marks and unwanted hairs. Ruby (694 nm),
Nd.YAG (1064 nm), alexandrite (755 nm) and pulsed diode array (810 nm) lasers are
used for dermatology applications. Soft tissue surgery is done mostly using CO2 and
Er.YAG lasers. Laser scalpels are now used for general surgery and in gynaecological,
urology and laparoscopic surgeries. Laser knifes are used for no-touch surgery of
brain and spine tumours. In dentistry, lasers are used for caries removal, in endodontic
and periodontal procedures etc.
Solved Examples
SE8.1 A resonant cavity in some laser source is made by putting two parallel
mirrors 1 m apart. Calculate the frequency separation Δν between different
axial modes. Will frequency separation be different for lasers of different
frequencies?
Solution Frequency difference Δν between successive axial modes is given as,
c/μ
Δν =
2L
where c is the velocity of light, μ the refractive index of active medium and L is the
length of the cavity. Since the value of μ is not given in the problem, we take it 1,
therefore,
3 × 108 m/s
Δν = = 1.5 × 108 cycles per second = 150 MHz
2 × 1m
8.8 Some Applications of Lasers 429
Frequency separation does not depend on the frequency of the laser; it depends
only on the length of the resonant cavity.
SE8.2 A typical laser source emits laser light of 700.0 nm wavelength with spread
of 0.50 nm. Calculate the possible number of axial modes in a cavity of 5 cm
length filled with gain medium of 1.5 refractive index.
Solution Let us first calculate the frequency uncertainty of the laser light that has
wavelength uncertainty of 0.50 nm. The frequency spread δν corresponding to
wavelength spread δλ = 0.5 nm is given by,
c 3 × 108
δν = δλ = 0.5 × 10−9 = 309 × 109 Hz.
λ2 700 × 10−9
2
c/μ 3 × 108 ms
Δν = = = 4 × 109 Hz
L 1.5 × 5 × 10−2 m
δν 309×109
Number of axial modes is = Δν
= 4×109
= 77.5 modes ≈ 77 modes.
SE8.3 A laser beam of 50 mW power having a circular spot of radius 1.0 mm is
allowed to fall on a totally reflecting mirror for 10 h. Calculate the intensity
of the laser light, the linear momentum imparted by the beam to the mirror,
average force exerted by the beam and the total electrical power consumed
by the source if the efficiency of the source is 10%.
Solution The power of the beam W = 50 mW = 50 × 10−3 J/s. This power is
2
contained in a circular beam spot of area A = πr 2 = π × 1 × 10−3 . Intensity I
of the beam is;
W 50 × 10−3
I = = 2
J s−1 m−2 = 15.91 × 103 J s−1 m−2
A π × 1 × 10−3
E T = 50 × 10−3 × 10 × 60 × 60 = 572 J
ΔL 381.3 × 10−8
Fav = = = 10.59 × 10−11 N
10 × 60 × 60 10 × 60 × 60
Line width Δλ = λ Δν = (
700×10 )
−9 2
× 1 × 109 = 18.3 × 10−13 m.
2
c 3×108
Problems
P8.1 A laser beam of wavelength 700 nm and power 1 mW was focused on an spot
of area 40 × 10−14 m2 . What is the intensity of the focused beam?
ANS 2.5 × 109 W/m2
P8.2 In case of a ruby laser the laser beam diameters at 2 m and 4 m from the source
were 2 mm and 3 mm. What is the angle of divergence of the beam?
ANS 2.5 × 10−4 rad
8.8 Some Applications of Lasers 431
P8.3 A laser beam of wavelength 800 nm and aperture 0.5 cm is sent to an object
4 × 108 m away. Calculate angular spread and area spread of the beam at the
distant object.
ANS 16 × 10−5 rad; 409.6 × 1010 m2
P8.4 The optical cavity of a laser source has the active medium of refractive index
1.75 and length 5 cm. The central wavelength of the laser resonating in the
cavity is 700 nm with spread 0.5 nm. How many axial modes will be resonating
in the cavity?
ANS 179 axial modes
cμ c/μ cμ c/μ
(a) 2L
(b) 2L
(c) L
(d) L
ANS: (b)
MC8.2 The ratio of Einstein’s coefficients A21 /B21 is given by
8π ν 3 8π hν 3 hν 3 8π hν 3
(a) hc
(b) c
(c) 8π c
(d) c3
ANS: (d)
MC8.3 A laser beam of photons of energy 1 J hits a mirror for 30 s. Assuming
that the mirror reflects back all the photons of the beam, the momentum
transferred by the beam to the mirror is
(a) 0.1 × 10−7 kg m (b) 1 × 10−7 kg m (c) 2 × 10−7 kg m (d) 20 ×
10−3 g cm
ANS: (c) and (d)
MC8.4 A laser beam of transfer mode TM00 has beam radius of 0.01 cm at
1 m from the source and of 0.2 cm at 2 m distance from the source, the
divergence of the laser beam is;
(a) 0.5 × 10−3 rad (b) 0.5 × 10−3 degree (c) 1.0 × 10−3 rad (d) 1.0 ×
10−3 degree
ANS: (c)
MC8.5 A laser beam of wavelength 600 nm has a wavelength spread of 0.5 nm.
The frequency of the laser ν and frequency spread Δυ are respectively;
(a) 5 × 1014 Hz; 41 × 1010 Hz (b) 5 × 1010 Hz; 41 × 1014 Hz (c) 41 ×
1014 Hz; 5 × 1010 Hz (d) 50 × 1014 Hz; 4.1 × 1010 Hz
ANS: (a)
MC8.6 A laser light of frequency 5 × 1014 Hz and frequency spread 42 × 1010 Hz
is confined in a resonant cavity of 5 cm length. The number of axial modes
in the cavity is;
(a) 40 (b) 140 (c) 200 (d) 240
ANS: (b)
MC8.7 An atomic system has two excited states at energies E 1 and E 2 (E 2 > E 1 ).
If RT 1 and RT 2 denote the ratio of the number of atoms in state E 1 and E 2
at temperatures T 1 K and T 2 K respectively, then the ration R = RT 1 /RT 2
for T 2 = 2T 1 is;
( E2 −E1 ) ( E2 −E1 ) 2( E 2 −E 1 ) ( E1 −E2 )
(a) e 2kT 1 (b) e kT 1 (c) e kT 1 (d) e 2kT 1
ANS: (a)
MC8.8 Tick the correct alternative(s);
The number of stimulated photons is proportional to;
8.8 Some Applications of Lasers 433
(a) Number of atoms in upper lasing level (b) number of atoms in lower
lasing level (c) number of triggering photons (d) number of atoms in the
ground state
ANS: (a) and (c)
MC8.9 In a closed tube argon ion laser the purpose of putting helium gas in the
plasma tube is to;
(a) Transfer excitation energy to argon ions (b) dissipate heat from the
central region of the plasma tube to its walls (c) facilitate plasma formation
(d) facilitate de-excitation of atoms in lower vibrational excited states of
argon ion
ANS: (a) and (d)
MC8.10 Which of the following laser source does not require any sort of pumping
(a) Nd.YAG (b) Ar ion (c) HBr (d) GaAs diode
ANS: (c)
LA8.1 What is meant by population inversion and stimulated emission? Discuss the
working of CO2 laser source and mention some of its important applications.
LA8.2 Discuss the construction and working of a semiconductor laser source. Why
it operates as a laser source only when forward current is more than the
threshold current? What is a heterojunction diode laser and why it is more
efficient?
LA8.3 What are the special characteristics of laser light? Discuss three of them in
details. Define angle of divergence of a laser source.
LA8.4 Describe Einstein’s theory of stimulation emission and calculate the ratio
of spontaneous to stimulated emission for a system in thermal equilibrium.
What are Einstein’s coefficients?
LA8.5 With the help of a neat diagram explain the working of a Nd.YAG laser. What
is the purpose of resonant cavity? Derive an expression of the threshold gain
for a parallel mirror resonator cavity.
LA8.6 Write an essay on lasers and their applications.
Chapter 9
Nanomaterials
Objective
Properties, behaviour and production of nanomaterials, along with their important
applications, will be discussed in this chapter. It is expected that a reader after going
through this chapter will be able to understand why nanomaterials are assuming added
importance and how nanomaterials with desired characteristics may be synthesised
in laboratory.
9.1 Introduction
Nanomaterials are magical materials of immense potential, though their potential has
been realised only recently. Nanomaterials belong to the world of the small and the
smallest parts, of micro- and nanotechnologies. The field of nanomaterials extends
to all branches of science; electronics, mechanics, optics, chemistry and biology, etc.
Nanomaterials are the product of nanotechnology which is the science, engineering
and technology at the nanoscale. Nanoscale ranges from 1 to 100 nanometres (1
× 10−9 –100 × 10−9 m) as shown in Fig. 9.1. One nanometre is one-billionth of
metre, a very small distance, just to have a feel of nanoscale, it may be said that our
finger nails grow about one nanometre per second, a sheet of paper is about 100,000
nanometres thick, and there are 25,400,000 nm in an inch.
The nanoworld may be considered to be intermediary between the atom and the
bulk solid. The concept of nanoworld emerged from the convergence of a mix of
scientific and technological domains which once were separate. Moreover, concept
of nano is becoming fashionable since it combines what is already known with new
concepts and gives the idea of modern technologies.
In principle nanoworld may be reached through two seemingly opposite routes,
namely the Top-down and the Bottom-up approaches. In top-down approach the
starting point is the aggregate matter (solids, liquids, gases and plasma), i.e. the
traditional large-scale world with macroscopic properties which is then analysed
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 435
R. Prasad, Physics and Technology for Engineers,
https://doi.org/10.1007/978-3-031-32084-2_9
436 9 Nanomaterials
Non-availability of appropriate tools and technology was the single answer to all
these questions at that time. However, at present new tools for atomic-scale char-
acterisation, new capabilities for single atom/molecule manipulation, computational
access to large systems of atoms, and convergence of almost all scientific disciplines
at nanoscale, etc. has made it possible to reach nanoworld via bottom-up route.
Various types of nanostructures which possess at least one dimension in the nanor-
ange are included in nanomaterials. Some typical examples are: Quantum dots, zero-
dimensional monostructures such as metallic, semiconductor and ceramic nanopar-
ticles of diameter 1–10 nm; one-dimensional nanostructures nano wires (diameter
of 1–100 nm), nano tubes (diameter 1–100 nm) and nano rods (diameter 1–100
nm); two-dimensional nanostructures such as thin nano films (area of several nm2 to
µm2 and thickness 1–1000 nm). Besides these individual nanostructures, ensembles
of these make high dimension arrays, assemblies and supper lattices.
SAQ: What qualifies a structure to be a nanostructure?
SAQ: What is the minimum number of atoms of average size that will make a
nanostructure?
At the nanoscale, the physical, chemical, mechanical and biological properties of
materials significantly differ in fundamental and valuable ways from the properties of
individual atoms and molecules or the bulk matte. Nanotechnology may be defined
as the research and development at the atomic, molecular and supramolecular levels,
in the length scale of approximately 1–100 nm range, through the control and manip-
ulation of matter at molecular level to design, fabricate and use materials, devices
and systems with fundamentally new properties and functions because of their small
structure.
Nanomaterials exhibit some special properties that are not found in normal macro or
even micro-materials. Most of these special features of nanomaterials arise because
of (a) large surface to volume ratio, (b) large fraction of surface atoms, (c) high
surface energy, (d) spatial confinement and (e) reduced imperfections. Some of
these properties are discussed below.
(i) Large Surface Area
Surface-to-volume ratio, i.e. the surface area for a given volume is much larger in case
of nanomaterials as compared to the similar volume of ordinary largescale or even
microscale materials. Importance of available surface area may be easily understood
by the example of chocolate. If one puts a block of chocolate of size 3.2 × 2.3 ×
438 9 Nanomaterials
1.3 cm (of surface area approximately 26 cm2 ) in his mouth; only 26 cm2 area will
come in contact with taste buds of the mouth. However if the block of chocolate is
broken into two equal pieces and the two pieces are put in mouth roughly 31 cm2
area will be available for the taste buds to enjoy chocolate. Division of chocolate
block in smaller pieces increases the available area so much so that if it is broken
into 1 nm cubes, the available surface area will be around 500,000,000 cm2 , about 10
football fields. In electric batteries current producing chemical reaction takes place
at the surface of electrodes. Therefore, electrodes made of nanomaterials expose
more surface area and thus have high current capacity. Large number of chemical
and physical processes takes place at the exposed surface, for example, adsorption of
gases, catalyst actions, etc. proceed at the exposed surface. Nanomaterials, therefore,
play important role in such processes, metallic nanomaterials are used as very active
catalysts. Chemical sensors made from nanoparticles and nano wires enhance the
sensitivity and sensor selectivity.
SAQ: Surface-to-mass ratio of nanostructures will be large or small?
Fig. 9.4 Typical absorption spectra of quantum dots and nano tubes of different dimensions
Etop Etop
ne = n e (E)dE = N (E)F(E)dE (9.2)
E bc E bc
Theoretical value for N(E), density of state at energy E and (E + dE), (assuming
free electron to be a particle in box), is given by
442 9 Nanomaterials
3/2
2m e
N (E) = 4π E 1/2 (9.3)
h2
And F(E), the probability that the state at energy E is occupied by electron is
given by quantum statistics as;
1
F(E) = ( E−Ef )
(9.4)
1+e kβ T
reduced sufficiently, then the continuous density of electronic states is broken into
discrete energy levels. The spacing ΔE between energy levels depends on the Fermi
energy E f of the metal and the number of electrons N in the metal specimen as given
by
4E f
ΔE = (9.5)
3N
Fermi energy E f for most metals is of the order of 5 eV. Discrete electronic levels
in gold nanoparticles have been observed experimentally in far-infrared absorption
measurements confirming expression (9.5).
Solids may be classified as conductors, semiconductors and insulators according
to the value of their resistivity, ρ, (resistance between the opposite faces of a unit cube
of the material). Resistivity arises essentially because of the scattering of delocalised
electrons by crystal lattice while moving under the applied electric field. According
to Drude theory, at room temperature there is some number of delocalised electrons in
conductors which are moving with thermal speeds (∼ 105 m/s) in random directions.
When a voltage V is applied across a conductor of length L, an electric field ∈ (= V /
L) is established in the specimen which forces the delocalised electrons in it to move
opposite to the direction of the field ∈. The electric field ∈ exerts a force on each
free electron which imparts an additional component of velocity called drift velocity
velocity V d to each free electron. Electrons moving under the influence of the electric
field undergo frequent collisions with the crystal lattice which randomise the motion
of drifting electrons. As such scattering by crystal lattice may be treated like a friction
force which counterbalances the force due to the electric field. This results in Ohm’s
law V = RI. Scattering events have a mean free path λ, the average distance travelled
by an electron between two successive scattering events, which is of the order of nm
in many materials at room temperature. If the size of a nanostructure is of the same
order of magnitude as the mean free path λ, then Ohm’s law may not be obeyed. In
such a situation the electron transport process is totally quantised and current–voltage
relationship may be quite different than Ohm’s law.
(v) Thermal behaviour of nanostructures
Many properties of nanostructures like, mechanical, electronic and optical behaviour
have been well studied in recent past. However, thermal behaviour of matter at
nanoscale could not be investigated in details till now. There are several prob-
lems in this regard. Firstly, the definition of temperature at nanoscale is itself quite
ambiguous. In non-metallic materials heat energy is generally transported through
photons and or phonons, both of which may have widely different frequencies and
mean-free-paths (mfp). Heat-carrying photons often have wavelengths and mfp in the
range of nanometres at room temperature. Thus nanostructure size and heat photon
parameters are of the same order of magnitudes. A temperature is defined only when
the system is in equilibrium. In bulk material it is possible to define local tempera-
tures in different small regions which may be treated as if they are in steady state/
equilibrium, and thus it becomes possible to study the process of heat transport based
444 9 Nanomaterials
shifts the metallic tip in desired direction so as to maintain the same gap and
tunnelling current. The amount of shifts in the position of the tip to keep constant
tunnelling current (produced by the feedback loop) are recorded in a computer
and are used to produce a 3-D surface image by the computer software. This
method of using (STM) is called constant current method. In case the specimen
surface is rather smooth, the feedback loop may be removed and the tunnelling
current reading itself may be converted into a 3-D surface image or surface
topology.
The biggest drawback of STM is that it can record the surface topology only
of those surfaces which are good conductor of electricity. Tunnelling current is
quite small and in STM it flows through the surface under study. In case the
surface offers high resistance it may not be possible to record small changes in
tunnelling current accurately.
(b) Atomic Force Microscope (AFM) The STM may decipher the surface topology
only of those surfaces that are good conductor of electricity. Atomic force
microscopy was developed to overcome this drawback of STM. AFM has the
advantage that it may image the topography of any type of surface, including
ceramic, polymer, composite, glass and of biological samples. AFM was
invented by Binning, Quate and Gerber in 1985.
In AFM a very sharp tip is attached at one end of a metallic strip, the other
end of which is clamped in a holder. Metallic strip, clamped at one end and
holding a fine tip at the other free end, works like a cantilever. The strip holder
may be moved in X-Y plane with the help of synchronised motors, and the X, Y
coordinates of the tip may be recorded with precision. The pointed tip is kept
very close to the surface under study (like that of STM); however, no current
is made to pass between the tip and the surface in case of AFM. In this case
atomic force between the tip and the surface either pull the tip towards surface
or push the tip away from the surface. Thus, the tip which is suspended by a
metallic cantilever, moves in the Z-direction, towards the surface or away from
it when (the tip is) scanned over the surface. Atomic force between the tip and
the surface depends on the distance of separation between them. If there is a
depression in the surface at some point, the tip will experience a smaller atomic
force and will move up while at a point of bulge the separation between tip
and surface will become less, atomic force will increase and the tip will move
downwards. Thus displacement of the sharp tip in Z-direction may be converted
into the topology of the surface.
Schematic layout of atomic force microscope is shown in Fig. 9.9. Tip
displacement in Z-direction is recorded using a laser beam which is made to
hit the top of the sharp tip. The laser beam reflected from the top of the sharp
tip is recorded in a position sensitive laser detector. In their original instrument,
Binning et al. used diamond for sharp tip and gold foil as cantilever strip.
Atomic force microscope relies on force between the tip and the sample
surface. Atomic force and distance (of the tip from the surface) curve is shown
in Fig. 9.10. Atomic force is not measured directly but is calculated by measuring
the deflection of the tip knowing the stiffness of the cantilever strip using the
448 9 Nanomaterials
C A
Attractive
Range of
atomic
force
0.10 nm 100 nm
Tip distance from the surface
Hook’s law; F = − kZ . Here k is the stiffness constant of the strip and Z the
displacement of the tip.
The range of atomic force extends from about 0.10 to 100 nm. A typical curve
showing the variation of atomic force with the tip-surface distance is shown in
Fig. 9.10. When the tip of the AFM is more than 100 nm away from the surface
(on the right side of point A in the figure) it does not experience any atomic force.
On moving the tip towards left of point A, the tip experiences an attractive atomic
force that pulls the tip down and the maximum attractive force is experienced
at the distance corresponding to point B. The exact distance corresponding
to point B is different for different surfaces, depending on the nature of the
surface, i.e. the atoms/molecules of the surfaces. On further reducing the relative
separation between the tip and the surface beyond point B, the atomic attractive
force diminishes, reducing the downward displacement of the tip. Ultimately,
at separation corresponding to point C atomic force experienced by the tip
becomes zero again and the tip assumes its undeflected position. When tip-
surface distance is reduced beyond point C, which means that the tip is hard
pressed into the surface it experiences a force of repulsion and the tip is pushed
back. In this way, sensing the deflection of the tip, the detailed structure/topology
9.3 Technology Used for the Study of Nanostructures 449
(d) Optical Tweezers (OT) Optical tweezers (OT) are instruments based on intense
laser beams that may hold or trap particle from the size of an atom up to strand of
DNA and living cell. In most cases the desired particle is trapped at a particular
point in a plane called the focal plane. Optical tweezers was invented by Arthur
Ashkin for trapping essentially biological molecules, cell, etc. for which he
received the Nobel Prize in 2018.
Using only light (laser beam) OT is able to hold or influence the motion
of objects (nano to microsize including biological cells, etc.) in a non-contact
way. This makes OT especially useful for holding and studying nanostructures
which are difficult to manipulate using conventional means such as mechanical
tweezers or micropipettes.
Most optical tweezers use a highly focused and intense laser beam, usually
of the wavelength in the range of 0.5–1.0 µm (visible to near infrared). The
laser beam is often focused using a microscope, the objective lens of which
focuses the beam at a small spot in the focal plane where the sample having
nanostructure to be studied is kept. The OT holds the nano or the other desired
microstructure (present in the sample) at the focal spot of the laser beam. The
laser beams used in OT have a Gaussian intensity profile, i.e. the intensity of
the beam is very high in the central region and decreases towards the periphery.
In order to understand the operation of the OT, let us consider a very small
(nano or microsized) particle of some dielectric medium which is lying at some
location in the laser beam. This dielectric particle is often called a bead. The
bead in the path of the laser beam may experience three different forces due
to three interactions it may have with the beam. These three interactions may
be understood in terms of the photon nature of laser beam. The laser beam
of frequency ν may be considered as a collection of photons, each of energy
E = hν. Now a photon of energy E carries a linear momentum = Ec = hν C
, where
c is the velocity of light. The laser beam photons may be scattered (reflected) by
the surface of the bead giving rise to what is called the Scattering force. The
scattering force pushes the bead towards the focal spot in the focal plane. Another
force, called the Gradient force, comes into play because of the intensity profile
of the laser beam. Since the central part of the laser beam carry larger number
of photons per unit volume as compared to the number density of photons at
periphery, a net gradient force is applied to the bead which pushes the bead
towards the centre of the beam. When scattering and gradient forces are larger
than other forces acting on the bead, like the force of gravity and the force of
Brownian motion, the bead is steered towards the focal spot and is held there. A
third type of force called Absorption force may come into play because of the
absorption of the laser radiations by the bead. Absorption force typically behaves
like the scattering force; however, too much absorption of laser radiations may
increase the temperature of the bead (or the specimen) which may be harmful
in case of biological samples.
A typical layout of an optical tweezers is shown in Fig. 9.12 where a laser
source produces an intense beam which is first expanded and then steered using
a mirror and a halfwave plate (to compensate for phase change) to a microscope
9.3 Technology Used for the Study of Nanostructures 451
which tightly focuses the beam to the focal spot. The sample containing nano/
microstructure is placed in the focal plane of the microscope objective. An
eyepiece and camera are provided with the set-up to see and record the event.
In order to understand the origin of scattering force it may be mentioned that
each incident laser beam photon scattered from the surface of the bead imparts
a linear momentum or apply a force (called the recoil force) to the bead. For an
isotropic scatter, the resulting forces cancel in all directions except the incident
direction. Thus, as a result of scattering (reflection) of incident laser photons
from all sides of the bead surface in all possible directions, a net force pushing
the bead in the direction of beam propagation is generated which is termed as
the Scattering force. Figure 9.13 shows a bead placed in a laser beam, the
surface of the bead is bombarded with laser beam photons from all directions.
Let us pick a typical photon which is scattered at point A of the surface. Since
after scattering the momentum of the scattered photon has changed, it exerts a
force say, F 1 on the bead to recoil back. Now force F 1 may be resolved into two
perpendicular components F 1 H and F 1 V . Similarly, other photons on scattering
will also apply forces F 2 , F 3 ,…F N etc., on the bead each of which may be
decomposed into two components F 2 H , F 2 V ; F 3 H , F 3 V …, F N H , F N V … etc. It
can be shown that in case∑ the scattering is isotropic, the sum of F N H components
will add up to zero, i.e. FNH = 0. The y-components of all forces add up to
give a non-zero
∑ V value of force pointing in the incident direction; i.e. Scattering
Force = FN.
Gradient force is the result of Gaussian intensity profile of the laser beam.
The intensity of cylindrical laser beam is a maximum at the axis and decreases
as one move away from the centre. Now laser beam being an electromagnetic
wave carry both electric and magnetic fields. The intensity of the electric field
produced by laser beam photons is a maximum at the beam axis and decreases
towards the periphery. Further, the electric field associated with photons induces
fluctuating electric dipole in the dielectric material of the bead. The fluctuating
dipole in the bead interacts with inhomogeneous electric field of incident beam
and experience a force towards the stronger region of the electric field. Since
the force is proportional to the electric field gradient, it is a maximum at the
452 9 Nanomaterials
outer rim of the beam and is a minimum at the central region, at the beam axis.
Thus, the gradient field works as a spring which always tries to keep dielectric
bead at the centre of the focal spot. It is interesting to note that the magnitudes
of both Scattering force and the Gradient force are of the order of 10−12 N
(pico-Newton) which are enough to hold the nano or microstructures fixed at
the focal point.
In case the bead is transparent to the laser beam, the action of the gradient
force may be explained through Fig. 9.14. Let the bead be away from the focal
spot and placed such that a part of it lies in the stronger and other part in the
weaker sections of the beam intensity. The laser rays from the stronger central
part are shown with broad red arrow (as they will be large in number per unit
volume of the beam) while corresponding rays from the weaker part of the beam
by narrower purple arrow. The refracted rays are also indicated in the figure.
Both refracted rays will exert recoil forces F 1 and F 2 on the bead as indicated
in Fig. 9.14. Forces F 1 and F 2 can be resolved into X- and Y-components. The
vertical components F 1 V and F 2 V will add up to give the Scattering force that
steers the bead in the incident direction. The horizontal components F 1 H and
F 2 H will oppose each other; since force F 1 >> F 2 (because of the intensity
profile of laser beam), the net horizontal force on the bead will be (F 1 H − F 2 H )
that is the gradient force which tries to bring the bead at the axis of laser beam.
As such under the joint action of the scattering and the gradient forces the bead
is held at the focal spot.
SAQ: What will happen to the gradient force if the laser beam intensity is least at
the axis and increases with distance from the axis?
Depending on the physical phase of the reactants, the bottom-up technique may be
further divided into two groups: (i) the gas-phase methods and (ii) the liquid-phase
methods.
(i) Gas phase Methods
(a) Chemical Vapour Deposition (CVD) Chemical vapour deposition (CVD)
is a versatile process in which gas-phase molecules are decomposed to reac-
tive species leading to the film or the particle growth. This method may be
used to deposit a wide range of conducting, semiconducting and insulating
materials. Recently the method has been used for controlled production of
nanomaterials in porous hosts. The two basic advantages of (CVD) tech-
nique are: (i) the ability to controllably create films of widely varying stoi-
chiometry that is films containing constituent materials in different ratio
and (ii) to uniformly deposit thin films of materials, even onto nonuni-
form shapes. The layout of the experimental setup generally used in (CVD)
method is shown in Fig. 9.15, where the 3D-substrate sample on which thin
454 9 Nanomaterials
laser ablation, (iii) nanolithography and etching, (iv) sputtering and (v) electric
explosion of wire.
(i) Mechanical Milling (MM) Milling is the process of reducing relatively coarse
materials to desired fineness and is a potential method of producing nanos-
tructures through top-down route. Most of the milling machines employ ball-
milling method for turning bigger lumps of precursor material into nanosize.
A ball media milling machine has a fixed double-wall stainless steel cylinder
fitted with water cooling arrangement. The cylinder is filled with big pieces
of precursor material and large number of stainless steel balls which work as
milling medium. A rotor shaft passes through the centre of the cylinder and may
be rotated by an external motor. Rotating impellers attached to the shaft impart
rotator motion to the large number of metal balls. The shaft may be rotated
by different speeds, depending on the type of the final product. Metallic balls
moving with high speed collide with each other and with precursor material
breaking down big pieces of the material into small pieces. Different type of
forces, like elastic forces, plastic forces, shear forces and chemical forces are
applied to precursor particles by the colliding balls as shown in Fig. 9.19. The
speed of rotation, size of balls temperature of the tank, etc. decide the nature
and size of nano- and microstructures found in powdered precursor.
(ii) Laser Ablation Laser ablation is a method of producing nanostructures with
very high purity. The method may be classified both: a top-down technique or as
a bottom-up. It may produce different types of nanostructures including semi-
conductor nano quantum dots, nano wires, nanotubes and core shell nanopar-
ticles. This method utilises laser as the source of energy for ablating solid
precursor material. An intense and focused laser beam, generally pulsed, is
458 9 Nanomaterials
Fig. 9.19 a Section of ball media milling machine, b generation of elastic and plastic forces by
grinding of balls, c generation of shear force by rotating balls
made to hit the solid precursor material at a point, large amount of energy
from the laser beam is absorbed by a small surface area of the target material
which evaporates. The term ‘ablation’ refers to the removal of surface atoms
and involves not only a single photon process of chemical bond breaking but
also multiple-photon process of thermal evaporation. The ablation process may
be carried out in vacuum, in an inert gas or in a liquid. The characteristics of the
nanostructures produced by this method depend on the purity of the precursor
material, the repetition rate and energy of the laser beam, and the environ-
ment (vacuum, inert gas or liquid) under which the ablation has taken place.
Figure 9.20 shows the basic layout of laser ablation technique and the various
steps of the process.
(iii) Nanolithography and Etching Fundamental idea of fabricating nanostruc-
tures using top-down technique is taken from the techniques used in making
miniature solid state electronic devices. The general method used for making
miniature electronic devices is called Lithography. Nanolithography drives
its name from the Greek words nano (small or dwarf), lithos (rock) and
grapho (to write) that literally means small writing on rocks. The technique
is essentially meant to write on some surface features with dimensions of the
order of nanometres. The writing of nanosize structures may be achieved by
depositing, etching or removing material from some sample. Lithography may
9.4 Techniques of Producing Nanostructures 459
etching, dry etching using plasma, purely physical using ion beam milling, or
reactive ion etching which is the combination of the two.
(iv) Sputtering Sputtering is essentially a technique of creating nano- and micro-
size particles of a given material and to get them deposited on some surface to
make a very fine nano- or microfilm of the material. Thin films made up of nano
(or micro)-size particles have large number of applications; they are used in
microelectronic industry, solar panels, as oxidation protection films, as antire-
flecting coating on cars, jewellery, mirrors, etc. Further, the process of sput-
tering is also used for identifying materials, for etching, for space weathering,
etc.
A typical sputtering setup is shown in Fig. 9.22. The process of sputtering
is generally carried out in a chamber which is filled by some inert gas like
argon at low pressure. The chamber is provided with two electrodes, the anode
and the cathode. Cathode which is kept at a negative potential with respect to
the anode is also called the target. Nano/microsize particles of target/cathode
9.4 Techniques of Producing Nanostructures 461
material are emitted and are deposited in the form of a uniform layer at the
desired substrate attached to the anode.
In order to start the process of sputtering a high voltage is initially applied
between the anode and the cathode which ionises the argon gas molecules
producing plasma between the electrodes. Plasma that contains positive ions of
argon and electrons glow brightly and is sometimes also called glow discharge.
Positive argon ions in plasma get accelerated towards the cathode because
of its negative potential. Accelerated argon ions hit the target (cathode) and
deposit their energy which initiates the emission of negatively charged nano-
or microsize particles (atoms/molecules of target material) from the cathode.
These negatively charged atoms/molecules of the cathode (target) material
move towards the anode which is at positive potential. On reaching anode,
negatively charged nano- or microparticles get neutralised by giving their extra
electron to the substrate and form a thin uniform film on it.
In most cases an experimental setup like the one shown in Fig. 9.23 is used,
where sudden discharge of stored electric energy through the wire is made to
make it explode. The set-up may be in open air or may be in an enclosure which
may be filled with a desired gas at a desired pressure.
SAQ: Which method of making nanostructures is most suited for (a) making ultra-
pure nanostructures (b) making powdered nanostructures?
(i) Discovery Bonding of carbon atom with itself and with other atoms has
remained a fertile field of research for decades. Three allotropic forms of carbon,
the graphene, the graphite and the diamond were well known for considerable
period of time. However, discovery of C60 , the fullerene by Harry Kroto and
Richard Smally et al. in 1985 in gas-phase carbon clusters obtained in evap-
oration of graphite by intense laser beam in helium atmosphere, gave a new
impetus to the study of other allotropic forms of carbon. Since the yield of
fullerene in laser-induced evaporation of graphite was very small, searches were
conducted to find other means of synthesising fullerene in macroscopic amount.
Later, Wolfgang Kratschmer, Donald Huffman and their co-workers detected
the dominance of fullerene structures in the soot deposited on the walls of the
helium-filled (low pressure) arc chamber when a discharge was passed through
two graphite electrodes. With this simple method it was possible to produce
fullerene in large amounts. With the discovery of fullerene a new allotropic
form of carbon got established. Amorphous and the four allotropic structures
of carbon are shown in Fig. 9.24.
Carbon nanotubes are one of the most important by-products of research on
carbon allotropy. The credit of discovering carbon nanotubes goes to Sumio
9.4 Techniques of Producing Nanostructures 463
Iijima of Japan, who was working as electron microscopist at the NEC labora-
tories of Japan. Iijima got impressed by the technique of producing fullerene in
substantial amount adopted by Kratschmer and Huffman and undertook a project
of detailed study of the soot produced in this method using transmission electron
microscope (TEM). He was motivated to carry out detailed studies as some ten
years earlier he studied soot produced in similar arc discharge between carbon
electrodes and has observed several structures of carbon architecture including
curved, closed nanoparticles and tube like structures. In the initial stages study
of soot taken from the walls of the arc chamber appeared almost completely
amorphous, indicating the absence of any long-range structure. Finding no long-
range microstructures in soot from arc chamber walls, Iijima shifted to the soot
that got collected at the cathode of the discharge chamber in the form of rather
hard rod like lump. Detailed study of this soot from the arc cathode showed
very little amorphous mass but a variety of long-range structures, most striking
of which were long hallow fibers, finer and more perfect than any seen previ-
ously. Iijima announced his discovery of carbon nanotubes in October 1991 in
a meeting at Richmond, Virginia, USA, where he showed beautiful pictures of
these tubes. Amplified photo of nanotubes is shown in Fig. 9.25. A month later
he published a paper in Journal NATURE on his observance of nanotubes in
cathode soot which again attracted scientists to carry out further detailed inves-
tigations of cathode soot which was previously discarded as waste. A coloured
photo of single- and multiple-wall carbon nanotubes is shown in Fig. 9.26.
Method of synthesising carbon nanotubes as adopted by Iijima was not very
efficient and was not able to give substantial yield of nanotubes. Lack of avail-
ability of nanotubes in sufficient amount hampered further research on them.
However, in July 1992 Thomas Ebbesen and Pulickel, two scientists working
464 9 Nanomaterials
at the same NEC lab of Japan where Iijima worked, described a method for
synthesising carbon nano tubes in large quantities, of the order of few gram.
They showed that the yield of nanotubes in cathode soot dramatically increases
when the pressure of the helium gas in the arc chamber is increased.
Discovery of nanotubes on one hand opened a totally new and fertile branch
of research that has great application potential, while on the other hand it posed a
big question: why nanotubes (and fullerene) were not discovered earlier when all
facilities and instrumentation which was used in discovering nanotubes were
available for almost last twenty years or so? The answer in case of carbon
nanotubes may be that such tubes were observed earlier also but not much impor-
tance was given to such studies. For example, an author claimed that he observed
thread like carbon structures as a product of chemical reaction between CO and
Fe2 O3 at 450 °C temperature. Methods of producing tube like carbon struc-
tures by some chemical reaction like the one above are called catalytic methods
and were known for long. However, the thread like carbon structures from
catalytic reactions were found to be rough, imperfect as compared to the carbon
9.4 Techniques of Producing Nanostructures 465
nanotubes obtained through fullerene path and therefore do not leant them-
selves to potential applications. It is also reported that Roger Bacon, National
Carbon Company, Cleveland, Ohio, in 1960 synthesised highly perfect graphite
whiskers using the technique which was very similar to the arc discharge tech-
nique of Iijima. It may however, be mentioned that whiskers are very different
from nanotubes, primarily, they are much larger, of the order of 5 µm in diameter
and few centimetres in length, as compared to nanotubes that have diameters in
the range of 2.5–30 nm and lengths from few tens of nm to few micrometer.
In his earlier experiments and investigations on thin carbon films, prepared
using arc evaporation in vacuum (not in low-pressure helium atmosphere) Iijima
(1970–1980) reported the presence of some tube like structures on carbon films,
which were treated as some sort of contamination. In summary it may be said
that carbon nanotubes were observed in some earlier than 1991 but were not
paid much attention.
(ii) Characteristics of Carbon Nanotubes Transmission electron microscope
images of the cathode soot at low magnification show a number of tubes
entangled with each other, accompanied with other material including carbon
nanoparticles, hollow fullerene based structures and some disordered carbon.
At higher resolution, one may see the inside structure of carbon tubes; generally
there are several cylindrical nanotubes of reducing diameter imbedded within
each other. The nanotube length is typically of the order of several µm and
their diameters range from 2.5 to 30 nm. The ends of some of the concentric
cylinder like multi wall nanotube structure are closed by sections of fullerene
molecule. Carbon nanotubes are often referred as molecular carbon fiber and
consist of tiny cylinders of graphene closed at each end with caps which contain
six pentagonal rings. A carbon nanotube may be formed by cutting a fullerene
molecule into two halves and placing these two halves of fullerene as caps at
the two ends of a graphene cylinder.
Depending on the orientation of end caps and graphene cylinder, theoretically
there may be three different structures of carbon nanotubes; known as arm chair,
zig-zag and chiral structures as shown in Fig. 9.27.
A substantial volume of research on nanotubes is directed towards the study
of their electronic properties. Experimental investigation of electronic proper-
ties of nanotubes preceded by theoretical calculations, several research groups
calculated the electron band structure of nanotubes using what is called the
tight-binding model. These calculations indicated that the band structure of
nanotubes is a function of their diameter and structure. Experimental investiga-
tion of electronic properties of nanotubes was very difficult initially, but by 1996
it become possible to experimentally confirm theoretical findings on electronic
properties of nanotubes.
There has been considerable interest in the conductivity of carbon nanotubes
(CNT). It has been found that electrical conductivity of CNT depends on several
factors like its structure, weather it is single walled or multi walled, types and
number of twists, diameter, etc. CNT may be perfectly conducting like metals or
they may be semiconducting depending on their structure, chirality (degree of
466 9 Nanomaterials
twist) and diameter. It has been found that armchair structure of CNT is a better
conductor as compared to other structures of same diameter. A bundle of several
single wall nanotubes makes a nanorope. The resistivity of single walled nano
tube rope is found to be of the order of 10−4 Ω-cm at room temperature, while
it was found that these ropes may sustain current densities of more than 107 A/
cm2 , may be as large as 1013 A/cm2 . Single-walled carbon nanotubes (SWNT)
contain defects which in some cases make the SWNT to behave as a transistor, a
rectifying diode, etc. It is also reported that SWNT transmit electronic signals at
high speed and therefore may be used as an interconnector of different electronic
devices.
Study of mechanical properties of nanotubes was also challenging but
studies carried out using atomic force microscopy and transmission electron
microscopy have indicated that just like their electronic properties, the mechan-
ical properties of nanotubes are very different than the mechanical properties
of graphite, graphene or diamond structures of carbon.
In a graphene sheet a carbon atom is connected to other three neighbouring
carbon atoms with very strong chemical bond, that is why the modulus of
elasticity of graphene is one of the largest of any known material. A single-wall
carbon nanotube is built up of a graphene cylinder and, therefore, SWNT is
expected to be ultimate high-strength fibers. Applying pressure at the tip of a
SWNT may cause it to bend without damaging the tip. This property has made
SWNT as an ideal probe tip for scanning microscopy.
Research has indicated that carbon nanotubes are perhaps the best conductors
of heat. On account of their unique electrical, mechanical and thermal properties,
carbon nanotubes are finding numerous applications in host of devices.
Electron gun, a system that generates an intense electron beam, is an essential
part of most of the display systems. In conventional electron guns, electrons are
produced through thermionic emission and are then focused using an electron
accelerating voltage network. In case of nanotubes, electron emission may take
9.4 Techniques of Producing Nanostructures 467
place by field emission. Since the tip of the nanotube is very sharp, a small
voltage difference between the tip and an electrode may generate large electric
field that may be sufficient for field emission of electrons as well as for their
acceleration. In this way carbon nanotube-based electron gun systems may
become a part of all display systems.
There is great demand in industry of plastic through which current may pass
(conducting plastic). Conductive plastic is made by adding some conducting
material to the plastic matrix. Carbon nanotubes (CNTs) have proven to be an
excellent additive to impart electrical conductivity to plastics. CNTs have a
very high Aspect Ratio, which means that a lower loading or concentration of
CNT is required compared to other conductive additives to achieve the same
conductivity. A lower amount of additive preserves the toughness of the polymer
resin, particularly as low temperatures.
Atom of no other element of the periodic table bonds to itself in an extended
network with the strength of carbon–carbon bond. Special properties of carbon
bonding and molecular perfection of SWNT makes them material with special
electrical, mechanical and thermal properties. The pi-electron donated by each
carbon atom is delocalised and may move around the entire structure. This makes
carbon molecule to be the first molecule that has metallic-type conductivity.
High thermal conductivity of carbon molecule may be attributed to the carbon–
carbon bond that may vibrate at high frequency, giving the molecule an intrinsic
thermal conductivity that is higher than any other material.
gram and also have very good electric conductivity; therefore, CNT is
finding great application in battery industry.
Capacitor is an element that may store electric energy. Recent
researches have indicated that CNT have very high reversible capacity
and are, therefore, extensively used in lithium-ion batteries.
Carbon nanotubes are finding applications in fuel cells. Fuel cell uses
the chemical energy of hydrogen or some other fuel to cleanly and effi-
ciently produce electricity. The only products of a fuel cell are electricity,
water and heat. They may use a wide range of fuels and feed stocks and are
capable of providing power for systems as large as a utility power station
or as small as laptop. A fuel cell works like a battery but they do not but
do not rundown or require recharging. They work and produce electricity
so long as the fuel (hydrogen in case of hydrogen fuel cell) is supplied. In
a typical hydrogen fuel cell, two electrodes, anode and cathode, are sand-
wiched around an electrolyte, the fuel hydrogen is fed to one electrode
and the air to the other. A catalyst at anode (to which hydrogen is suppled)
converts hydrogen molecules into electrons and protons. The electrons
flow through an external circuit constituting current while protons migrate
through the electrolyte to the cathode and combine with oxygen to make
H2 O. Since catalytic action at anode depends on the exposed area of the
anode, nanotube anodes which are good conductors of electricity make
fuel cells more efficient.
On account of their high mechanical strength and toughness-to-
weight property, carbon nanotubes are finding applications in composite
components in fuel cells used in transport applications.
(c) Electron emitter On account of their very sharp tips, field emission of
electrons from carbon nanotubes occurs at relatively very low applied
potential. This makes CNT as an important component of electron guns
required in all display systems. CNT is being used in flat-panel displays,
instead of a single electron gun as in traditional cathode ray tube, there is
a separate electron gun of nanotube for each pixel in such flat displays.
Another important characteristic of nanotubes, their capability of
sustaining with high stability very high current densities as large as 1010
A/cm2 or more, makes them a very suitable material for lightning arrester
material.
(d) Material properties Extraordinary thermal conductivity, high capacity of
sustaining large current densities, unparalleled mechanical strength and
extraordinary surface-to-volume (or mass) ratio, and very high aspect ratio
make carbon nanotubes a wonder material. CNT is directly used in many
electrical and electronic applications, for example as heat sinks in high-
density microelectronics where elements readily acquire high tempera-
tures. Nano tubes are also used as addends for changing electrical and
mechanical properties of composites.
9.4 Techniques of Producing Nanostructures 469
(e) Filters Many industries are engaged and have developed water and air
filters based on CNT. It is reported that filters based on nanotubes not only
stop particles as small as viruses but also kill bacteria’s and viruses.
(f) Biomedical applications In view of the excellent chemical stability, rich
polyatomic structure, and high surface area CNTs either absorb or conju-
gate with a wide variety of medically important molecules, like molecules
of drugs, proteins, antibodies, strains of DNA, enzymes, etc., and may
carry them near to the targeted cell. Drug may either be loaded or attached
at the surface of nanostructure which may deliver the drug to the desired
target cell either via the endocytosis pathway or via the insertion and
diffusion pathway.
Intense research activities are in progress to use CNTs for treatment
of cancer, in delivering chemotherapist drugs selectively to cancerous
cell without damaging the healthy cells of the body. A water-soluble
conjugate of single-wall carbon nanotubes with Paclitaxel (PTX) has been
found to be highly effective in suppressing tumour growth compared to
conventional drugs in case of breast cancer.
Another application of CNTs is their use in antimicrobial delivery.
Functionalised CNTs may be used in vaccination procedures. It is also
suggested that CNTs might themselves have antimicrobial activity through
oxidation of the intracellular antioxidants.
SWCNTs have strong optical absorption of ultraviolet to near-infrared
radiations. The absorbed radiation energy is almost entirely converted in
heat. This heat may be used to carry out photothermal therapy and imaging.
Materials used in dentistry when modified with MWCNT showed better
results on account of enhanced fatigue resistance, flexural strength and
resilience.
(g) Others As mentioned, these are only some of the important applications
of CNTs, many other applications including their use in fabrics for making
them stain resistant, dirt and water repellent, more durable and in devel-
oping scratch-resistant paints for coating on vehicle bodies, etc. are all
developing fast. Huge applications of nanotechnology are finding their
place in armament and space research.
SAQ: In your opinion which characteristic of CNTs is most important and which
application of CNTs is most beneficial for humanity?
Short Answer Questions
SA9.1 Explain the top-downand bottom-up approaches for the synthesis of nanos-
tructures and discuss in details one method of Bottom-up approach for the
synthesis of nanostructures.
SA9.2 How one may define nanomaterials and the nanotechnology? List the
important characteristics of nanomaterials and discuss one of them in some
details.
470 9 Nanomaterials
SA9.3 Give a brief account of the discovery of carbon nanotubes (CNTs). List the
types and characteristics of CNTs.
SA9.4 Which types of microscopes are generally used to study various features of
nanostructures? Explain the working of atomic force microscope and point
out where it is used.
SA9.5 What is an optical tweezers? Discuss the origin of scattering and gradient
forces in optical tweezers.
SA9.6 Explain the sol-gel method of fabricating nanostructures?
SA9.7 What is meant by self assembly? Give with suitable examples, different
types of self assemblies.
SA9.8 Give a list of important methods of synthesising nanostructures using top-
down methodology.
SA9.9 With the help of a suitable figure explain plasma arch method of fabricating
nanostructures.
SA9.10 Give a list of the important characteristics of CNTs. Why tips made up of
carbon nanotube is used in electron scanning tunnelling microscopes?
SA9.11 In few lines explain each of the following (a) Aspect ratio (b) Quantum dot
(c) laser ablation.
SA9.12 Write a note on biomedical applications of carbon nanotubes
MC9.4 Figure (MC9.4) below shows the electron state density for
Figure (MC9.4)
(a) Quantum well (b) Quantum surface (c) Quantum dot (d) normal bulk
material
ANS: (c)
MC9.5 Thermal conductivity of carbon nanotubes is
(a) Small and isotropic (b) large but highly anisotropic (c) high and
isotropic (d) small but highly anisotropic
ANS: (b)
MC9.6 If X, Y and Z respectively represent the magnetic moment per atom
of a quantum dot, quantum wire and 2D surface of some magnetic
nanomaterial, then
(a) X > Y > Z (b) X < Y < Z (c) X > Y < Z (d) X < Y > Z
ANS: (a)
MC9.7 Which of the following instruments may be used to study the surface
topology of nanostructures?
(a) Scanning tunnelling microscope (b) Atomic force microscope (c)
optical tweezers (d) Transmission electron microscope
ANS: (a), (b)
MC9.8 Which of the following instruments may be used to study the internal
structure of nanostructures?
(a) Scanning tunnelling microscope (b) Atomic force microscope (c)
optical tweezers (d) Transmission electron microscope
ANS: (d)
MC9.9 In optical tweezers which force pushes the microstructure towards the axis
of the laser beam?
(a) Scattering force (b) Surface tension force (c) Gradient force (d) Atomic
force
ANS: (c)
MC9.10 Which Bottom-up method produces carbon nanostructures in macro
amount?
472 9 Nanomaterials
(a) Chemical vapour deposition (b) Plasma arc method (c) Sputtering (d)
Wire explosion
ANS: (b)
MC9.11 Spontaneous molecular arrangement of the disordered entities of
molecules into ordered nanostructures is
(a) Self-assembly (b) sputtering (c) Laser ablation (d) Sol-gel method
ANS: (a)
MC9.12 Electrodes made from carbon nanotubes have the following properties
(a) Large surface area (b) good electric conductivity (c) low melting point
(d) mechanical strength
ANS: (a), (b), (d)
MC9.13 Carbon nanotubes may sustain current densities of the order of
(a) 1050 A/cm2 (b) 1040 A/cm2 (c) 1010 A/cm2 (d) 10−50 A/cm2
ANS: (c)
MC9.14 Single wall carbon nanotubes strongly absorb electromagnetic radiations
in the range
(a) Violet to green (b) Green to yellow (c) ultraviolet to near infrared (d)
microwave range
ANS: (c)
Long Answer Questions
LA9.1 What are nanomaterials, in what respect they are different from their bulk
materials and why these materials are gaining so much importance?
LA9.2 What are top-down and bottom-up methodologies of fabricating nanomate-
rials? Give a list of important methods adopted to synthesise nanomaterials
using bottom-up approach and discuss pone method in details.
LA9.3 Explain with necessary details the nanolithography technique of making
nanostructures. This technique of nanosynthesis falls under under top-down
or bottom-up approaches.
LA9.4 Give a detailed account of the discovery of carbon nanotubes and their
special properties.
LA9.5 Discuss in details at least three uses of carbon nanotubes.
Chapter 10
Sustainability and Sustainable Energy
Options
Objective
The concept of sustainability and its social, economical and environmental aspects
are presented in this chapter. Emission of greenhouse gases from various sectors,
threat of global warming, causes and its impact on sustainability of life, along with
sustainable energy sources, are discussed in detail in this chapter. It is hoped that after
reading this chapter the reader will become more aware of his/her responsibilities
towards developing a sustainable society and sustainable environment.
10.1 Introduction
only in a certain specific part of the world is bound to lead to a disaster of unimagin-
able magnitude. Division of world into developed, developing, underdeveloped and
non-developed has already created a situation where sustainability of any kind of life
on any part of the globe is seriously threatened; life is at razors edge in every part of
the world, including the so-called developed countries.
social equality requires that other groups, those which are in some advanta-
geous position, should help in removing or lowering the barriers so that the
disadvantage groups may have more control on their lives.
(iv) Diversity Different groups in a society add to its diversity which needs to be
protected. Instead of forcing groups to live exactly in the same way, expec-
tations and needs of all different groups must be ascertained, and infrastruc-
tures for the support and the growth of each group be developed with mutual
cooperation.
(v) Governance With diverse societies and groups it becomes necessary to frame
rules and regulations that promote quality, equality, diversity and social cohe-
sion in the lives of the people and to have an authority (government) to imple-
ment these rules. A democratically elected government which judiciously
implements rules, probes ways to generate revenue from existing resources
without over draining them, takes steps to enhance resources and spend
collected revenue in providing infrastructures for development of all groups
is essential for sustainability of societies.
World is facing large number of issues associated with social sustainability, for
example, the issue of resolving racial biases, economic disparities, difference in reli-
gious beliefs, non-uniform distribution of resources, poverty, etc. Large number of
government sponsored and an equal number of non-government organisations are
making efforts to resolve problems associated with social sustainability. Programmes
like Corporate Social Responsibility, which stress on the responsibility of the corpo-
rate sector towards societies along with profit making, are helping in developing
infrastructure for social sustainability.
SAQ: What is meant by social sustainability?
Economic sustainability is achieved with low rate of inflation, stable currency and
high level of employment with corruption-free governments. Corrupt system where
economic resources are plundered leads to anarchy the like of which was seen recently
in Sheri Lanka. Earlier, in 2009, foreclosure crisis in USA leads to financial collapse
and many fold shrinking of economies not in USA but over whole of the world. The
economic unsustainability got kicked by lenders encouraging buyers without steady
income to borrow money at unfixed rates of interest. Soon mortgage holders got
behind their payments; lenders then foreclosed, all of them simultaneously. This put
a large number of houses on the market, to be sold at lower price. The cascade effect
resulted in shrinking of larger economy; people lost their jobs, their life time saving,
their business, etc.
Economic sustainability is largely determined by government policies and plan-
ning. Neglect of infrastructure development, large-scale borrowings at unfavourable
terms, unplanned depletion of national resources and corruption all leads to the
476 10 Sustainability and Sustainable Energy Options
collapse of the economy. Pakistan and several Asian and African countries including
Bangladesh and Sri Lanka took huge loans from China on unfavourable terms,
allowed their local and traditional industry to die and mismanaged their resources.
The results are known to all.
Circular economy based on recycling and reuse is expected to contribute sustain-
ability in a big way. Circular economy is a system that seeks to obtain the maximum
value from already extracted resources. This involves keeping materials and prod-
ucts in use as long as possible instead of replacing those using new resources. After
they have been utilised, the same materials and products are recovered and regen-
erated as vital resources. They are then reintroduced into the supply chain. Circular
economy contributes to sustainability in many different ways; for example it reduces
and even eliminates waste. Waste is one of the main threats to a sustainable world;
the world generates around 2.01 billion tonnes of municipal waste every year, about
33% of which, by the most conservative estimates, is not managed in eco-friendly
manner. Unmanaged component of waste ends up in landfills, water bodies and
into atmosphere. Circular economy may reduce or even eliminate the unmanaged
component of waste. The three R (3R) strategy of waste management, reduce, reuse
and recycle, may turn waste into an asset. Circular economy encourages the use of
renewable sources of energy. Though all renewable energy sources may not be in the
category of sustainable energy sources, but these sources definitely reduce pressure
and dependence on fossil- or coal-based sources. Circular economy lays stress on
sustainable consumption of resources. Circular economy advocates both for respon-
sible production and sustainable consumption. This also helps in the preservation
and protection of the environment. Circular economy creates jobs in many different
and new fields thus providing opportunities of better life to a significant section of
the population. Overall circular economy contributes to economic growth which is
essential for sustainability of a nation and the world.
One essential point for any kind of economy which is mostly forgotten is to reduce
consumption. Advertisement industry, which according to Statista collected globally
revenue to the tune of US$ 6410.22 billion in the year 2021 and has shown a growth
of around 11%, is the biggest instrument pleading for increased consumption and
thus working against the goal of sustainable world.
SAQ: What is meant by circular economy?
or the environment, walking through forests, going to lakes, beaches are examples
of this. With the unprecedented growth of population and unrestricted exploitation
of natural resources, the environment has suffered extensive damage. As such it is
the moral responsibility of the present world population to prevent further damage
to the environment, to ensure our future generations have healthy places to live,
and minimize further damage to the earth’s bio-diverse ecosystem. According to UN
recommended environmental program, environmental sustainability involves making
life choices that ensure an equal, if not better, way of life to future generations.
For the sake of simplicity, the environment may be divided into three parts: the
atmosphere, the land mass and the water bodies.
10.4.1 Atmosphere
Planet Earth is enveloped from all sides by a gaseous blanket called atmosphere.
Atmosphere, which is retained by the gravitational force of earth, protects earth by
generating pressure that allows liquid water to exist at the surface of earth, shields
earth from ultraviolet rays of the sun and, most importantly, keeps earth warm by
holding a part of heat energy provided by the sun. A common name for the mixture
of gases in the atmosphere is ‘air’. Air contains by molar fraction (i.e. number of
molecules) about 78% nitrogen, 21% oxygen, 1.0% argon, 0.04% carbon dioxide
(CO2 ), 1% water vapours, 0.00017% methane and still smaller quantities of other
gases like nitrous oxide (N2 O), etc.
(i) Greenhouse Effect
It is a common experience that the interior of a car left in sunshine with all its
windows closed becomes very hot. It happens because sunlight (that includes ultra-
violet, visible and infrared radiations) passing through the windows of the car deposits
energy in the form of heat on every object inside the car: steering wheel, seats, dash
board, etc. As a result temperature of all objects inside the car rises. Now, there is a law
of physics which says that any object at a temperature above absolute zero emits elec-
tromagnetic radiations of very long wavelengths called thermal radiations. Thermal
radiations are invisible. Interior of the car is thus filled with thermal radiations. These
thermal radiations remain trapped within the inside of the car as glass windows do
not allow thermal radiations to go out through them. In short what happens is that
all components of sunlight, i.e. ultraviolet, visible and infrared radiations, etc., keep
going into the car through windows but eventually they all are converted into thermal
radiations which remain trapped inside, being not able to come out as window glasses
do not allow them to come out. In this way heat energy gets trapped within car’s inte-
rior. This is called greenhouse effect as the same principle of trapping solar energy
in the form of heat is used in farming where greenhouses are made using transparent
plastic sheets which allow sunlight to enter the greenhouse but does not allow the heat
energy to leave through plastic sheets. Greenhouses are used to cultivate vegetables
478 10 Sustainability and Sustainable Energy Options
and other plants that need higher temperatures for their growth, particularly in cold
places.
In case of earth, the greenhouse effect is the way in which heat is trapped close
to earth’s surface by the envelope of some gases called greenhouse gasses. These
heat trapping gases (which work like the glass windows of a car) can be thought
of as a blanket wrapped around earth keeping the earth at a higher temperature.
Greenhouse gases include carbon dioxide (CO2 ), water vapours (water in gas form
H2 O), methane (CH4 ), nitrous oxide (N2 O, NO), ozone (O3 ) and chlorofluorocarbons
(CFCs). Except CFCs all other gases are natural. Warming effect of carbon dioxide
gas is basically responsible for maintaining the average temperature on earth to a
comfortable 15 °C. Remove CO2 from the atmosphere, and the greenhouse effect
on earth will collapse, plunging earth to a temperature of the order of − 20 °C.
Natural greenhouse effect on earth allows all forms of life, including human beings, to
flourish. During the last century or so, overpopulation, overindustrialisation, overuse
of fossil fuels, excessive burning of coal, etc. have resulted in overproduction of
carbon dioxide gas in particular and other greenhouse gases in the atmosphere. The
level of CO2 in atmosphere is consistently rising for decades, trapping extra amount
of thermal energy raising the average temperature of earth.
(ii) Sources of Greenhouse Gases
Sources of greenhouse gases may be divided into two class: natural and manmade.
Natural sources of CO2 are: (i) out gassing from the ocean, (ii) decomposing vegeta-
tion and other biomass, (iii) venting volcanoes, (iv) naturally occurring wildfires and
(v) belches from ruminant animals. There are also natural sinks of carbon dioxide,
which absorbs or remove it from the atmosphere; they are (i) photosynthesis by plants
on land and in the ocean, (ii) direct absorption by the ocean and (iii) creation of soil
and peat.
Use of fossil fuel for generating energy, heating, in running vehicles, etc. is the
primary source of CO2, methane and nitrous oxide gas emission by human activi-
ties. Other human activities like deforestation and clearing of land for agriculture,
running of heavy industries, transport sector, etc. are some of the human activities
that generate carbon dioxide. Some industries like cement industry generate huge
amount of CO2 .
Figure 10.1 shows the contributions of two main manmade sources of CO2 : fossil
fuel burning plus cement industry and deforestation. The depleting strengths of two
types of natural sinks, land-based and ocean-based, are also shown in the figure. It
may be seen from this figure that there is an addition of nearly 2 × 109 tonnes of
CO2 in the atmosphere per year, adding to the greenhouse effect.
If one goes back by about 800,000 years, back to the period of ice age, the natural
emission sources and natural depletion sources (sinks) of carbon dioxide kept an
immaculate balance keeping the carbon dioxide level in atmosphere between 200
and 275 ppm (parts per million). Lower values corresponding to ice age and higher
values to warm periods called interglacial periods. However, from approximately
400,000 year back the CO2 level in atmosphere started rising very slowly, reaching
the previous highest concentration of 300 ppm around 300,000 year back. Today the
10.4 Environmental Sustainability 479
CO2 level in atmosphere is around 410 ppm, and this enormous increase in the CO2
level has taken place in a very short time in geological scale. If excessive CO2 has not
been released in the atmosphere by human activities, it might have taken 1000 years
to reach this level. Figure 10.2 shows the variation in the concentration of CO2 since
ice age till now.
The percentage of methane in atmosphere is very low, yet it is the next important
greenhouse gas after CO2 . The main source of methane is the wetland where methane
(marsh gas) is produced by the anaerobic decay of vegetation. The molar concen-
tration of methane in the atmosphere is rising continuously. The 14.7 ppb (parts per
billion) increase in CH4 concentration observed in 2020 was the largest of the past
four decades. Since 1750 its relative concentration has increased twice as fast as
that of CO2 . Like CO2 , methane also has natural and manmade sources. Wetlands,
termites, cattle, sheep, other ruminants and oceans are the natural sources of methane
Fig. 10.2 Carbon dioxide concentration since ice age till now
480 10 Sustainability and Sustainable Energy Options
emission. Hydroxyl group (OH) is a natural sink for methane. Methane reacts with
hydroxyl radical (OH) forming water and carbon dioxide. Agricultural activities,
burning of biomass and waste management are the prime causes of methane emis-
sion through human activities. Methane is also emitted during the production and
transport of coal, by the decay of organic waste in municipal solid waste landfills.
Figure 10.3 shows the relative strengths of the natural and the manmade sources
of methane emission. Strength of natural sink of methane is also shown in the figure.
Though CH4 molecule stays in the atmosphere for a short time being readily oxidised
by (OH) radicals in presence of ultraviolet light, but excessive emission of the gas
by human activities has disturbed the balance between the emission from manmade
sources and its depletion from natural sink, enhancing the greenhouse effect.
Natural source for nitrous oxide (N2 O) includes oxidation of ammonia in the
atmosphere and from nitrogen in soils. Natural sink for N2 O is photolysis to nitrogen
(N2 ) and oxygen (O). Nitrous oxide (N2 O) also reacts with oxygen to make NO which
may enter into a stratospheric ozone-depleting reaction cycle. Manmade source for
the release of N2 O in the atmosphere is the extensive use of fertilisers in agriculture.
Some nitrous oxide is also released by the burning of biomass and by the decay of
livestock manure.
Figure 10.4 shows the relative contributions of natural and manmade sources
of N2 O emission. It may be observed that both sources have nearly the same
contributions, which is nearly compensated by natural sink of the gas.
SAQ: What is greenhouse effect? What causes it?
One may ask the question: what is so special in greenhouse gases that they trap heat?
The answer is simple. According to the quantum mechanical picture, molecules
10.4 Environmental Sustainability 481
of greenhouse gases have discrete energy states that may be excited by thermal
radiations. So whenever a molecule of greenhouse gas is hit by a photon of thermal
radiations it is absorbed and the molecule goes to the excited state. These excited
states are short lived, and therefore, excited molecule reverts back to the ground
state by re-emitting the thermal photon. In this way molecules of the greenhouse
gases absorb thermal radiations and re-emit them, and this goes on and on. Thermal
radiation photons remain in interactions with molecules of greenhouse gases, being
absorbed and re-emitted without being lost. Other gases do have discrete excited
states, but the energies of these states do not match with the energy of thermal
photons; hence molecules of these other gases do not absorb thermal photons.
The efficiency of trapping heat is different for four main greenhouse gases. If the
heat trapping efficiency of CO2 is taken as 1, then the efficiency of methane is around
20, of nitrous oxide 310 and for hydrofluorocarbons around 1000.
Apart of the natural presence of greenhouse gases in the environment, human
activities add a huge amount as given below.
(a) Carbon Dioxide Gas CO2 : Fossil fuel use is the primary source of greenhouse
gas emission, and deforestation, land clearing for agriculture, degradation of
soils, etc. are other sources.
(b) Methane CH4 : Agricultural activities, biomass burning, energy use and waste
management are main sources of methane emission.
(c) Nitrous Oxide N2 O: Use of fertilisers is the primary source; fossil fuel
combustion also releases nitrous oxide.
(d) Fluorinated Gases F-Gases: Includes hydrofluorocarbons (HFCs), per-
fluorocarbons (PFCs) and sulphur hexafluoride; main source of emission is
refrigeration and allied industries.
482 10 Sustainability and Sustainable Energy Options
The net effect of the rapid increase of greenhouse gasses in atmosphere has resulted
in the rise by about 1 °C the average temperature of the earth from the pre-
industrialisation period. This rise in average temperature has, in turn, affected the
climate in different parts of the world. Climate of a place is determined by the patterns
of temperature, wind, atmospheric pressure, humidity and rain over a long period of
time. Different regions in the world, depending on the patterns of above-mentioned
parameters, have different climates like, dry, tropical, cold and moderate. Seasons
and the time when they come and go at a place are determined by the climate of the
place. The type of plants and animal lives that may survive at a place is essentially
determined by the seasons of the area. There is always a delicate balance between
484 10 Sustainability and Sustainable Energy Options
the species of plants and animals and the intricate ecosystem of region, which may
be seriously damaged by a little variation in average temperature of earth.
The rise in average temperature of earth (by only 1 °C) is called global warming
and has already affected weather, rain and temperature patterns in parts of the world.
One of the most visible signatures of global warming is ‘hotter days’. Rising sea
level and increased ocean temperatures are melting ice caps and glaciers resulting in
the rise of sea level. Several countries surrounded by the sea and parts of countries
near seashores are under threat of being submerged in the rising ocean. Most of
the extra heat and CO2 in the atmosphere due to enhanced greenhouse effect have
been absorbed by the ocean making it more acidic and warmer. Extreme weather
events like cloud bursts, wild fires, cyclones, draughts and floods have become more
frequent and intense. It is estimated that one out of six species is at risk of extinction
because of climate change due to global warming. It is because the climate change
is so rapid that some species could not adapt themselves with the rapidly changing
environment. Global warming and associated changes in rainfall and temperature
patterns have made it difficult for farmers to graze their livestock and to grow enough
food. Prolong periods of draughts: less amount of rainfall has already created scarcity
of water in some parts of the world.
The main cause of global warming is the accumulation of greenhouse gases in the
atmosphere. In order to maintain the sustainability of atmosphere it is essential (i)
firstly, to reduce the emission of greenhouse gases and (ii) then to remove the excess
of accumulated gases from the atmosphere. Since each individual human being,
animals, manufacturing of goods, transportation, etc. contribute to the emission of
greenhouse gases, it is necessary to change our way of life. The contribution of living
bodies and of different entities/events to the greenhouse gases is often measured in
terms of its carbon footprints.
SAQ: There are many gases like hydrogen, nitrogen, etc. in our atmosphere, but
only CO2 is treated as the gas responsible for global warming, why?
10.5 Global Warming 485
Earth’s average temperature has already risen by approximately 1.1 °C above pre-
industrial level and the report drafted by more than 200 scientists from over sixty
countries predicts that the world will reach or even exceed 1.5 °C of warming within
the next 20 years, in spite of the global emission reduction efforts. The report predicts
the following sever effects that may happen due to global warming:
(i) Sever Heat Waves The report predicts that almost 14% of world population
will experience sever heat waves at least once in five years of time.
(ii) Floods and Droughts Many regions worldwide will experience excessive
floods and draughts that will result in food shortages.
(iii) Rise in Sea Level Hot weather spells, and higher average temperature will
melt glaciers, resulting in the rise of sea level submerging coastal cities and
threatening small island nations. Top four countries the population of which
will be adversely affected by sea level rise are: China (5.5 million people),
Vietnam (23.4 million people), Japan (12.8 million people) and India (12.6
million people).
(iv) Arctic Ice Thaws At least once in a century, the Arctic will experience summer
without any sea ice, a phenomenon that has not happened at least in the last
two thousand years.
(v) Changes in Oceans The report suggests that almost 90% of all coral reefs will
be wiped out, oceans will become more acidic, and marine life and particularly
fisheries will be severely affected.
(vi) Loss of Species Many life forms will not be able to adapt to the rapidly
changing climate and will parish.
Over the last few decades, governments have taken collective steps to reduce the
emission of greenhouse gases to slow down the rise of average temperature of earth.
Though not intended to discuss the climate change issue, the Montreal Protocol,
1987, was a historical accord on environmental issues and laid the foundation for
further discussion on the subject. The accord, ratified by almost all countries of the
world, required them to stop producing substances that damage the ozone layer, like
chlorofluorocarbons, etc. The accord was so successful that almost 99% of ozone-
depleting substances was eliminated. Later in 2016, an amendment to the Montreal
Protocol, called Kigali Amendment, asked parties to reduce the production of the
greenhouse gases and substances like hydrofluorocarbons that add to greenhouse gas
emission.
Major UN initiative on climate change took place in 1992 as UN Framework
Convention on Climate Change (UNFCCC), ratified by 197 countries including
USA, adopted the landmark accord of establishing an annual forum, named Confer-
ence of the Parties or COP, for discussion between representatives of different
countries at international level on climate change and reduction of greenhouse gases
in the environment. These meetings of COP resulted in the development and imple-
mentation of two agreements called the Kyoto Protocol and the Paris Agreement.
Kyoto Protocol that was adopted in 1997 and came into force in 2005 was signed by
all member countries including USA (signed in 1998). However, USA never ratified
it and later even withdrew its signatures. Kyoto Protocol was the first legally binding
agreement which required that developed countries to reduce their greenhouse gas
emission by an average of 5% below 1990 levels and establish a system to monitor
countries’ progress. The important element of the protocol was that developing coun-
tries, including China and India, the two major greenhouse-producing countries, were
not asked to take any action.
Paris Agreement, signed in 2015, is the most significant agreement on restricting
the emission of greenhouse gases. The agreement is legally binding as well as it is
self binding; it requires all countries to set emission reduction pledges. Governments
were asked to set targets, called Nationally Determined Contributions (NDCs),
with the goals of preventing the global average temperature from rising 2 °C (3.6 °F)
above pre-industrial levels and pursuing efforts to keep it below 1.5 °C (2.7 °F).
It also aims to reach global net-zero emission in the second half of the century.
Global net-zero emission means that amount of greenhouse gases emitted equals
to the amount of these gases being removed from the atmosphere. Global net-zero
is also called climate neutral or carbon neutral. The agreement also provides for
assessing the progress, called stocktaking, every five years. An important component
of the agreement is the support which the developed countries should provide to the
developing countries in the form of capability building, which essentially means that
the developed countries must invest in efforts by developing countries for reduction of
greenhouse gas emission. The USA, who initially signed the agreement, withdrew
during the president ship of Donald Trump; however, President Joe Biden again
488 10 Sustainability and Sustainable Energy Options
joined and signed the agreement within one month of his election. Countries like
Libya, Yemen and Eritrea have not yet signed the treaty.
In a related move United Nations Environmental Programme (UNEP) identified
six sectors with the potential to reduce greenhouse gas emission enough to keep the
global temperature rise below 1.5 °C by 2030. Agriculture and food sector can
cut greenhouse gas emission by 6.7 Gt (Gaga tonne) per year. Buildings, cities and
construction sector, which is expected to add about 12.6 Gt of greenhouse gases by
2030, may reduce this by proper planning by almost 60%. The energy sector can
cut greenhouse gases by 12.5 Gt annually. Transport sector is responsible for about
one-quarter of all greenhouse gases. Sectors emission is expected to double by 2050.
Actions are required at every level, government, private and public. Industry sector
may cut greenhouse gas emission by 7.3 Gt yearly by using passive and renewable
energy-based systems for cooling and heating. Forest and land use, the greenhouse
gas emission may be reduced by 5.3 Gt annually if further deforestation is stopped
and restoration of degraded woodlands is taken up. These actions would also improve
air quality, increase water supplies to cities, enhance food security and boost rural
economy.
International Energy Agency (IEA) with the help of international computer and
modelling experts derived the following four scenarios for global warming.
1. (Stated Policies Scenario, SPS) Scenario based on the policies stated by different
governments in 2021. This will lead to temperature still rising when it hits 2.6 °C
above pre-industrial levels in 2100.
2. (Announced Pledges Scenario, APS) In this modelling it is assumes that
commitments made by different governments will be met in full and on time.
And if so, the average temperature will rise to the value of 2.1 °C by 2100 and
will continue to increase.
3. (Sustainable Development Scenario, SDS) This model is based on the following
assumptions: (i) all countries have successfully implemented their commitments
on time. (ii) Developed countries reach zero CO2 gas emission by 2050, China by
2060 and all other countries latest by 2070. In such case the average temperature
will peak at 1.7 °C by 2050 and could decline to 1.5 °C by 2100. It estimates that
the energy mix in 2100 will be 58% renewable, 8% nuclear and the remaining
34% by other sources.
4. (Net-Zero Emission by 2050 Scenario, NZE) If so, the temperature will peak
at 1.7 °C by 2050 and will decline to 1.4 °C by 2100. Energy mix will be 50%
renewable, particularly about 70% from wind and solar PV, about 20% from
other renewable sources and most of the remaining from nuclear sources. The
other half will be biomass, gas and oil with carbon capture and storage. Use of
10.8 Sustainability of Land Mass 489
coal falls by 90%, oil by 75% and gas by 55%. Emission from transport sector
will fall by 90%.
The term ‘land mass’ generally includes rocks, soils, minerals, vegetation and animal
habitats. The condition or land health invokes the concept of ecosystem—the inter-
actions and connections between the living and non-living components of the envi-
ronment. In degraded land, where ecosystem has changed, the altered ecosystems
continue to function but have a reduced capacity to supply the goods and services
sought by livestock. In addition to providing physical needs of increasing population,
land also has spiritual and cultural values for the local population.
Land mass is central to addressing sustainability issue, including biodiversity,
climate change, food security, poverty alleviation and energy. The United Nations
defines sustainable land management (SLM) as ‘the use of land resources
including soils, water, animals and plants, for the production of goods to meet the
changing human needs, while simultaneously ensuring the long term produc-
tivity potential of these resources and the maintenance of their environmental
functions’.
The productivity and sustainability of land mass is determined by the interaction
among three components: land resources (soil, water and biodiversity), climate and
human activities. In face of climate change it is important to select the right land
use for the given socio-economic and biophysical conditions so as to minimise the
land degradation, rehabilitating the degraded land ensuring the sustainable use of
land resources and maximising the resilience. Human activities of sustainable land
use and management in principle decide the resilience or sustainability and degra-
dation of land resources. Sustainable land management (SLM) involves established
practices of soil and water conservation, integrated landscape and natural resource
managements. The four pillars of sustainable land management are:
(i) Institutional support for targeted policies, inbuilt incentive mechanism for the
adoption of SLM and income generation at the local level.
(ii) Integrated use of natural resources at farms and at the ecosystem scale.
(iii) Involvement and participation of land owners, technical experts and policy-
makers with multistakeholders at multilevel.
(iv) Holistic land-user-driven and participatory approaches.
Food and Agriculture Organization (FAO) of the United Nations has the mandate
to support its member countries in developing norms, standards and policies; provide
technical advice; and help in capacity development to implement SLM.
Land degradation has many definitions, but all share the idea that detrimental
changes to the condition of land have occurred because of the many ways land has
been developed and used. Frequently changes in the condition of the land are linked
to a reduction in the productive capacity and its economic value. Important causes
490 10 Sustainability and Sustainable Energy Options
Water bodies consisting of oceans, rivers, lakes and pounds are very important for
the survival of life on planet Earth. Unfortunately, all these resources of water have
got degraded by their excessive exploitation and natural calamities. It is essential to
slow down their further degradation and repair as far as possible the damage they
suffered.
Oceans provide and regulate our rainwater, drinking water, weather, climate,
coastlines, substantial amount of sea food and even the air we breathe. Unfortu-
nately, there is continuous deterioration, particularly of coastal waters by pollution.
Moreover, sea water is becoming more acidic having adversary effect on biodi-
versity and ecosystems. Oceans are home to nearly a million known species and
contain vast potential for scientific research and discoveries. Over 3 billion people
depend on marine and coastal diversity for their livelihood. Oceans absorb about
23% of annual CO2 generated by human activities. It also absorbs about 90% of
excess heat produced in the climate system. Excess heat in oceans is causing heat
waves for marine life, destroying coral reefs, threatening the marine life and severely
affecting the ecosystem. According to estimates about 5–12 million metric tonnes
of plastic enters the oceans. This results in a loss of around $13 billons per year in
cleaning operations and fisheries industries. Heavy tourism hubs lie in coastal areas,
and about 80% of total tourism occurs in coastal areas. Coastal tourism industry
grows at a fantastic rate of US$130 billion per annum. Coastal areas support huge
labour force, almost one third of the total labour force. It is, therefore, very essential
that oceans, seas and coastal areas are properly managed. Unplanned/uncontrolled
coastal tourism may destroy natural resources of the area unless measures are taken
now to regulate it.
Oceans are very intimately related to human health on the planet. Diversity of
species found in ocean is a great boon to pharmaceuticals industry. Bacteria’s found
10.9 Sustainability of Water Bodies 491
deep inside the ocean are very important for developing test kits; for example, these
bacteria were used to develop rapid testing kits for COVID-19 cases.
In order to develop a sustainable ocean regime, efforts at two levels are required.
Steps at international level require international cooperation to protect habitats that
are in danger by establishing comprehensive, effective and equitable systems so
as to conserve biodiversity and ensure a sustainable future for marine life. On the
local level, single-use plastic should be strictly banned, sea food consumption be
regulated, a list of certified sea food items be made, and public be encouraged to
buy only certified items. Fishing, particularly of the seed-carrying fish, be controlled
so that the fish population does not deplete. Keeping beaches clean and healthy will
automatically improve the health of the sea/ocean.
Along with direct rainfall (green water) rivers, wetlands, lakes and aquifers (blue
water) are primary sources of water for human consumption including irrigation.
Rivers and associated wetlands also provide many ecosystem services along with
religious and cultural values. On account of increasing population and increasing
needs of river water for irrigation, rivers are under pressure, and often there is not
enough water and of adequate quality to fulfil the need of the region. This pressure
resulted in the decline of river conditions in many parts of the world.
River sustainability is concerned with resource sufficiency, resilience to water
related risks, access to water supply and other services, the productive use of
water and equitable distribution between different users and generations.
Sustainability of a river or river basin is determined by weather; the river system
can support the long-term ecological and socio-economic functions of the river basin.
Any programme on sustainability of river system must focus on the unique flow
requirements for the river and then create operating plans for dams that achieve
environmental flows, quantity and quality of water flow that must occur downstream
and upstream of dams in order to revive and sustain critical ecological functions
and habitat for species. Since the terrine, the fauna and flora and other parameters
are different for different river systems, in general it is not possible to give one
prescription for the sustainability of all rivers; each river system must be considered
separately, and adequate strategy for arresting further degradation and improving
the degraded system must be evolved. Close collaboration of government agencies,
scientists and other stakeholders is essential along with sustained and long-term
financial support. Many countries have already taken up programmes for improving
river basins, construction of dams, monitoring natural water flow, timing and quantity,
etc. in order to implement river sustainability measures.
492 10 Sustainability and Sustainable Energy Options
and energy-intensive; studies made in 2018 indicated the cost of direct capture
around $100–$240 per metric tonne. It is required to carry out further research
in order to reduce the cost.
(ii) Forests and Farms Trees are particularly excellent at storing carbon dioxide
through photosynthesis. Forests have very high potential of removing CO2
from the atmosphere. Forest cover gets reduced essentially for two reasons:
providing additional land for agriculture and for human habitat. Increasing the
yield of existing farms and using barren lands for human dwellings may reduce
pressure on forests. Planting trees around agriculture farms may enhance CO2
capture.
(iii) Bioenergy with Carbon Capture and Storage (BECCS) This is another way
of using photosynthesis for carbon capture. BECCS is based on utilising the
biomass as a source of energy in industrial, power or transport sectors and
cap the resulting emission of greenhouse gases to reach the environment. The
captured carbon/CO2 may be stored underground or in long-lived products
like concrete.
(iv) Mineralisation of Carbon Some minerals in nature are found to react with
CO2 gas turning carbon of the gas in solid. This process, called carbon miner-
alisation, happens in nature at a very slow rate, and it takes hundred and thou-
sands of years. Scientists are trying to accelerate this slow natural process
by finding suitable catalysts, exposing carbon dioxide containing atmospheric
air to mineral/rock pieces/slag, etc. littered by mining. It has been found that
atmospheric air when injected in some special types of rock structures turns
its carbon into solid carbonates. Since chemical reactions that turn gaseous
carbon into solid take place at the surface of the minerals, nanoparticles of the
minerals that have large surface area are also being investigated for achieving
faster carbon mineralisation.
(v) Capture by Ocean A large part of globe is covered with oceans where different
types of lives thrive. Some of them, like costal plants and weeds, absorb CO2
via photosynthesis. Increasing costal plant density at appropriate places may
enhance carbon capture by a large amount. Similarly, adding some chemicals
to increase storage of dissolved bicarbonates may help in carbon capture.
There are many other ocean-based options which need further research to
make oceans a carbon store without compromising the marine life.
Each carbon capture technique has its own drawbacks and advantages; is suited
to a specific region; and has its own economic viability and the extent to which it can
be used. A mix of all techniques and further research to tailor a particular method to
suit the area are required to achieve the optimum carbon capture.
SAQ: How ocean helps in removing carbon dioxide from the atmosphere?
494 10 Sustainability and Sustainable Energy Options
There are several places in the world which have zero greenhouse gas emission but
are facing extreme conditions because of the global warming and climate change.
One such place is the area of Ladakh, a high plateau in north India. Climate change
and rise in average global temperature around 1.8 °C above the pre-industrialisation
have dried the natural resource of water in this area. The average annual rainfall in
this region is only around 12 cm, and the flat land is called the desert of north. The
water lifelines of this area were the winter snow and glaciers, which have now almost
disappeared or receded. Some forty years back there use to be enough winter snow
which persists till spring and provided enough water for cultivation of crops. Water
fountains fed by glaciers provided water during summer. However, due to global
warming, winter snow does not stay till spring, and receding glaciers do not provide
sufficient water for summer (Fig. 10.9).
Some youths of the region took the challenge head on. They observed that during
nights and under shade the ice remain frozen in Ladakh, at the height of summer
even at lower altitudes. They thought that water frozen during winters and protected
from sun may be used for irrigation in spring and summer. It was not possible to erect
big shades for the large volumes of ice; however, it was realised that tall mounds of
ice may themselves protect their interior from sun. Simple science that they studied
at their high school level told them that conical shape of the ice mound is best for
shielding its own interior. The idea of conical shape for the ice mound also has some
connection with stupa, conical mounds of stones where religious relics are kept under
the stupa in Buddhist culture. The first ice stupa was built in November 2013. Channel
to bring down water of melting glaciers from higher slopes or rivers was brought
down and was then sent up vertically in a metallic pipe that had a nozzle at the top.
The nozzle was opened during nights when the air was below freezing point. The
fine spray of water coming out of the nozzle froze as it fell. Slowly a mound of ice,
conical in shape, rose around the pipe. The first ice stupa was only 20 feet high, but
since then ice stupas or artificial glaciers as high as 100 feet have been made. It is
believed that if ice stupa of optimum height is made at a favourable place, it may last
round the year and accumulation of ice for few years may convert it into a manmade
glacier.
In line with the definition of sustainability, sustainable energy is the energy that
powers the needs of the present generation maintaining the ability to meet those of the
generations to come. Sustainable energy is replenished constantly via natural means.
In other words sustainable energy will never run out. It is however important to realise
that sustainable energy does not necessarily mean that it is 100% environmentally
safe; it does not pollute environment at all. Sustainable energy definitely has a smaller
carbon footprint as compared to sources like coal or fossil fuels. Sustainable energy
source, which include many renewable energy sources, is in a way the global ticket
to a cleaner and less polluted Earth. Sustainable energy sources are also called the
alternative energy sources.
Basically, energy is the capacity to perform work, and in science it may be measured
in several units like erg, joule, electron volt (eV), kilowatt hour (kWh), etc. The SI
unit of energy is joule and 1 J = 107 erg = 2.78 × 10−7 kWh = 6.24 × 1018 eV. Some
really big units for energy, like terajoule (TJ)[1TJ = 1012 J], terawatt hour (TWh) {1
TWh = 1012 W h = 109 kWh} and million tonnes of oil equivalent per year (Mtoe),
are also used. 1Mtoe = 11.63 TWh = 11.63 × 109 kWh. Big energy units are often
used in expressing national or international energy production/consumption.
Let us take the example of crude oil, it is a form of energy, but it is processed to
make it suitable for consumption by end user. The supply chain between production
and final consumption involves many conversion activities and much trade and trans-
port among countries. In energy statistics primary energy refers to the first stage
where energy enters the supply chain before any further conversion or transformation
process.
496 10 Sustainability and Sustainable Energy Options
Before discussing sustainable energy options, let us have a look on present-day global
energy production and utilisation. According to the International Energy Agency
(IEA) data for year 2018 the total primary energy (PE) production of the world
was 14,420 million tonnes of oil equivalent per year (Mtoe). Data for some other
countries for the same year along with breakup of the contributions from different
energy sources is given in Table 10.1.
As may be observed in Table 10.1, most of the primary energy (PE) till 2018 has
been produced by fossil fuels and coal.
Figure 10.10 shows how the contribution of different sources of primary energy
increased with time and the relative contributions of different source in 2020. As is
evident from this chart almost 83% of total PE in the world is still derived from the
three most polluting fossil fuels: oil, coal and natural gas.
Average per head consumption of energy in some countries and regions is shown
in Table 10.2. As may be seen in this table, energy consumption per head in developed
countries is much higher than that of people in developing countries.
Last two frames of Fig. 10.11 show the history of global energy production: the
landmark events that helped in energy generation to meet the growing demand of
energy ignited by the industrial revolution of 1750.
SAQ: Which source of primary energy is most polluting?
Coming back to the issue of sustainable energy options, sustainable energy
sources must be such that they never exhaust; i.e. the nature must keep sustain-
able energy sources alive. It is obvious that such sustainable energy sources must
be derived from those natural objects and events that are expected to continue undi-
minished for generations to come. Heavenly bodies like sun, moon and earth (all
expected to live for another 3 billion years or more) and events like rotation of earth,
gravitational pull of earth and moon on oceans, etc. are expected to continue for many
future generations of mankind. Energy sources like sunshine, wind and ocean waves,
Fig. 10.10 Percentage contribution of different primary energy sources in total global energy
production in 2018
geothermal energy, hydroelectric energy and biomass energy, energy from hydrogen;
all come under the category of sustainable energy sources as they are inexhaustible
and are derived from natural objects or processes that are likely to continue forever.
Other renewable energy sources are wood, sun-derived biodiesel and grain-derived
ethanol.
In science it is said that ‘Energy can neither be created nor it could be destroyed, it
may, however, change from one form to the other form’. Large amount of energy is
scattered in the universe in different forms; in the form of heat in sunshine, heat in
the form of geothermal rise of temperature as one goes underground below earth’s
surface, in form of kinetic energy of wind, sea waves and of river water, in the
form of potential energy of nuclear field and so on. Coal, crude oil, natural gas and
hydrogen gas are also sources of energy which is released when they burn in presence
of oxygen. However, energies in all their natural forms are not directly useable; for
example it is possible to utilise solar energy to some extent in cooking food or for
drying clothes, but most of the other energies in their natural form cannot be used
498 10 Sustainability and Sustainable Energy Options
Almost all countries of the world are planning to replace most of their polluting
energy-generating systems based on fossil fuel by alternate renewable, cleaner energy
sources. Financial aspect of this planning requires a basis of comparison of different
renewable sources in consistent units. One such unit is LCOE. Another measure
may be the energy density on a Joule per cubic metre or Joule per kilogram basis.
Energy density when measured in Joule per cubic metre basis is called volumetric
energy density and in units of Joule per kilogram, the gravimetric energy density.
Energy density of a fuel or a source of energy is in a way a measure of its efficiency.
500 10 Sustainability and Sustainable Energy Options
Knowledge of energy density of a fuel is very important; for example, let us consider
the energy density of a battery; higher the energy density of a battery, longer the
battery can emit charge in relation to its size; high density batteries are useful when
much room is not available for batteries to keep, but lot of energy output is required.
Volumetric energy densities of some sources are given in Table 10.3.
Gravimetric energy density of some fuels is given in Table 10.4.
SAQ: Do you see some relation between the LCOE and the energy density of a
primary energy source?
Fig. 10.13 Per head CO2 emission in six most polluting countries
(iii) Fuel cellIt is a device that converts chemical energy released in a reaction
directly into electrical energy, similar to that of an electrical battery or cell.
However, the conversion efficiency of fuel cell is much higher (> 60%) than
that of a combustion engine used in automobiles. Further, unlike ordinary cell,
fuel cell continues to deliver electrical energy at a constant rate so long as
the fuel supply is available. In contrast an ordinary battery supplies electrical
energy at a rate that decreases with time. Fuel cell has no moving parts and is
therefore noiseless. Most of the fuel cells do not produce any greenhouse gas
and other polluting substance and are therefore very clean source of energy. In
principle many different types of fuels may be used to run a fuel cell. Fuel cells
have great potential as they may supply electrical power to a system as small
as a laptop or to a system as big as an automobile, lift fork, air-conditioning
systems at airports etc.
The physical size of a single hydrogen fuel cell is extremely small, of the order of
3 × 3 × 1 mm. Each hydrogen fuel cell (HFC) typically delivers between 0.5 and
0.8 V in its output, and therefore many HFCs are connected in series to get desired
output. The series and parallel combinations of HFCs make stacks which are used to
run devices. HFCs use hydrogen and oxygen (either pure or as air) gases as fuel; the
504 10 Sustainability and Sustainable Energy Options
chemical reaction in the cell produces electrical energy, some heat and pure water as
the by-product.
Fuel cell technology is fast evolving, and new designs and structures of fuel cells
appear almost on daily basis. Basic structure of a hydrogen fuel cell is shown in
Fig. 10.14. A hydrogen fuel cell has two electrodes, an anode and a cathode. Each
electrode is coated with a layer of catalyst which is followed by a layer of polymer
membrane. An electrolyte is sandwiched between the two membranes of the two
electrodes. Fuel cells are classified primarily by the kind of electrolyte used in it.
This classification determines the type of the electro-chemical reaction that takes
place in the cell, the type of catalyst required, the appropriate temperature at which
the cell operates and other factors.
H+ ions migrate towards the cathode because of the difference in their concen-
trations at anode and cathode, i.e. by diffusion. PEM-type HFCs used in auto-
mobiles have polymer membranes of thickness of the order of 20 µm. A layer of
catalyst is added on both sides of the membrane: the anode layer on anode side
and the cathode layer on cathode side of the membrane. Conventional catalyst
layer is made by dispersing nanosized particles of platinum on a high surface
area carbon support and then mixing an ion-conducting polymer. PEM fuel cells
operate at temperatures around 80 °C or 176 °F.
(b) Alkaline Fuel Cells (AFCs) These fuel cells use a solution of potassium
hydroxide (KOH) in water as the electrolyte for the diffusion of protons (H+ )
from anode to cathode. The advantage of AFC over PEM is that catalysts made
of non-precious metals that are relatively much cheap, as compared to platinum,
may be used in these cells. AFCs were used in American space missions for
providing electric power and water on-board. One big challenge to this tech-
nology is the severe poising of the cell action by the presence of a very little
amount of CO2 in the system. It is, therefore, very important to check that both
the hydrogen and the oxygen or air used in the cell are totally free from any
traces of carbon dioxide.
(c) Phosphoric Acid Fuel Cells (PAFCs) It is one of the most matured types of
the fuel cell and is often referred as the first generation of modern fuel cells.
PAFC uses liquid phosphoric acid as electrolyte. The acid is contained in a
Teflon-bonded Silicon carbide matrix. Porous carbon electrodes and platinum-
based catalysts are employed in these fuel cells. These cells are more tolerant
to impurities and may have efficiency of the order of 80%. However, their
gravimetric energy density is low as a result stack of these cells is large and
heavy. PAFCs are, therefore, used either in big automobiles like trucks/buses or
are used in powering stationary systems like buildings, etc.
(d) Molten Carbonate Fuel Cells (MCFCs) These are temperature fuel cells that
use an electrolyte made of molten carbonate salt mixture that is suspended in a
porous and chemically inert ceramic-lithium-Aluminium oxide matrix. MCFCs
506 10 Sustainability and Sustainable Energy Options
operate at temperatures of the order of 600–700 °C range and use catalysts made
of non-precious metals. They have high efficiency, if the waste heat produced
in the electro-chemical reaction is captured; the efficiency of MCFCs reaches
approximately 85%.
Another big advantage of this fuel cell is that natural gas or biogas (instead
of hydrogen) may be directly used in the cell or generation of electrical energy.
This becomes possible because of the high temperature of operation; at such
high temperature natural or other gases undergo internal reforming generating
hydrogen with in the fuel cell.
Durability of MCFCs is substantially compromised because of the use of
molten carbonate which is corrosive and damages the components of the cell.
(e) Direct Methanol Fuel Cells (DMFCs) DMFCs are powered by methanol
(instead of hydrogen) without it being converted into hydrogen. Methanol is
liquid, has higher energy density and is easy to transport using the existing
systems of transport. DMFCs are used to run small equipment like laptops, etc.
(f) Solid Oxide Fuel Cells (SOFCs) They are fuel cells that operate at very high
temperatures of the order of 1000 °C, eliminating the use of precious metal cata-
lysts and facilitating the use of any hydrogen-rich gas (natural gas, biogas, coal
gas, etc.) as fuel. SOFC uses a hard, non-porous ceramic compound as the elec-
trolyte. They are the most sulphur-resistant fuel cells and are not damaged by the
presence of carbon monoxide. High-temperature operation, however, increases
the start-up time and requires good thermal shielding to protect people and retain
heat. High operating temperature also puts stringent conditions on the materials
used in the cell and adversely affects the durability of the cell components.
Efforts are on to develop lower temperature version of solid electrolyte fuel
cells.
(g) Reversible Fuel Cells Normal hydrogen fuel cell uses hydrogen and oxygen to
produce electricity, heat and water. The reversible fuel cells also have the capa-
bility of operating in the reverse order; that is if they are given water and elec-
tricity they electrolyse water and produce hydrogen and oxygen. This reverse
operation capability makes these cells very useful, since at times when there
is excess production of electric power by some renewable energy source (like
solar cell at peak hours or by wind mill at times of high-speed winds, etc.), the
excess electricity may be supplied to the reversible fuel cell which will generate
hydrogen that may be later used for forward operation.
Considerable amount of energy is released when (i) some big or heavy atomic nucleus
undergoes fission; that is it splits into two nearly equal nuclei, and also (ii) when two
light nuclei undergo fusion; that is the two light nuclei fuse together to make a bigger
nucleus. The energy released in both the fission and the fusion is totally clean; no
greenhouse gas is released in the atmosphere. Though energy released per fusion is
10.14 Nuclear Energy 507
larger and no radioactive waste is produced in fusion, it has not been possible so
far to extract fusion energy in a controlled way on a commercial scale. In contrast,
systems called nuclear reactors have been developed to trap in a controlled manner
the energy released in fission of heavy nuclei; fission energy is with us for almost
60 years or so, with very few accidents. Fission in some heavy stable or (almost
stable) nuclei may be initiated by neutrons and other nuclear particles. Neutrons of
very low kinetic energies ≈ 0.025 eV may enter a heavy (positively charged) nucleus
with ease, as it has no electric charge, and may initiate nuclear fission of the nucleus.
On the other hand, some other heavy nuclei undergo fission when they are hit by
high-energy (≈ 1 meV or more) neutrons. Low-energy neutrons having energies of
the order of the energy of atoms/molecules of gases present in the atmosphere at room
temperature and pressure (≈ 0.25 eV) are called thermal neutrons and are present
in the atmosphere in thermal equilibrium, colliding with the atoms of atmospheric
gases. Those heavy elements the nuclei of which may undergo fission by thermal
neutrons are called fissile atoms or materials. 235 U, 233 U (bred by 232 Th by neutron
capture), 239 Pu and 241 Pu (bred from 240 Pu by direct neutron capture) are fissile
nuclei which may undergo fission when hit by thermal neutrons. Out of the above
four fissile nuclei, 235 U is found in nature in very small percentage; natural uranium
ore contains only 0.7% of 235 U isotope. Other fissile nuclei may be produced using
specific nuclear reactions.
Uranium occurs naturally in low concentration in soil, rocks and in sea water
and is commercially extracted from uranium bearing minerals like uranite. Natural
uranium contains two isotopes: 235 U 0.711% and 238 U 910.284%. Since 235 U isotope
undergoes fission by thermal neutrons, natural uranium must be enriched in 235 U
isotope to use it as the fuel for thermal neutron reactor. Most of thermal neutron
reactors use uranium that has 3–5% enrichment of 235 U isotope.
235
U nucleus when hit by thermal neutrons may undergo fission in ≈ 87% cases,
and in the remaining 13% cases, it may capture the neutron emitting a gamma ray.
In case of fission, the fission products may be different, as shown in the following
set of equations:
It may thus be observed that as a result of the fission of 235 U by thermal neutrons
two heavier nuclei like Ba and Kr or Te and Zr, etc., called fission fragments (FF),
two to three neutrons and considerable amount of energy are released. Almost 95%
of the energy that is released in fission is in the form of the kinetic energy of fission
fragments. When these highly excited fission fragments moving with high speed
collide with the container walls, they release their kinetic energy in the form of heat,
as such most of the (about 200 MeV = 3.2 × 10−11 J) kinetic energy is ultimately
converted into heat. Two to three (average 2.5) fast neutrons of average energy ≈
508 10 Sustainability and Sustainable Energy Options
2 meV are also produced in the fission of each 235 U nucleus. Fission fragments are
highly excited neutron-rich nuclei, and they de-excite by emitting neutrons, but these
neutrons are emitted after a time delay, sometime after the emission of fast neutrons
that are produced at the instant of fission. Thus, two types of neutrons, prompt
neutrons produced at the instant of fission and delayed neutrons emitted by fission
fragments, are generated as a result of fission. Prompt neutrons make it possible to
establish fission chain reaction, while delayed neutrons play a key role in controlling
the fission reaction rate in the reactor.
Nuclear fuel used in thermal neutron reactors is uranium with 3–5% enrichment of
235
U isotope which means that in about 100 atoms of the fuel there will be about 95–
97 atoms of 338 U isotopes and 5–3 atoms of 235 U isotope. In the space where the fuel is
placed in the reactor there are always some stray thermal neutrons. These background
thermal neutrons may interact with the atoms of both the 235 U and 238 U isotopes.
Interaction of thermal neutrons with 235 U atoms results in their fission producing
energy, on an average 2.5 fast neutrons, some delayed neutrons and radioactive
waste. Interaction of neutrons with 238 U atoms of the fuel produces weapon-grade
fissile material 239 Pu and long-life radioactive waste.
Figure 10.16 shows the results of neutron interactions with fuel atoms.
In case the fuel interacts with only stray thermal neutrons, those present in the
surroundings, very few fission events per unit time will take place on account of the
very low percentage of 235 U atoms and very low density of stray thermal neutrons.
However, if fast neutrons produced in initial fission events are made to lose their
Fig. 10.16 a Thermal neutron fission of 235 U nucleus, b decay of fast-moving fission fragments
into radioactive waste, c absorption of neutron by 238U resulting in the production of weapon-grade
fissile 239Pu and other long-life radioactive waste
10.14 Nuclear Energy 509
energy and are converted into thermal neutrons then the rate of fission events may
be increased such that a self-sustained chain of fission reactions may be maintained.
SAQ: Which fission product is crucial in sustaining of fission chain reaction?
Fast neutrons may be rapidly converted into thermal neutrons if they are made to
collide with atoms of those materials which have nearly same mass. It is because in a
collision with another particle of same mass, the neutron will lose maximum energy
in each collision. Materials that may rapidly and without absorption reduce the
energy of fast neutrons are called moderators. Heavy water (D2 O), normal water
(H2 O) and graphite (C) are the materials that are frequently used as moderators.
Figure 10.17 shows how a self-sustained fission chain reaction may be established
in a fuel which is distributed with moderators around it. An important consideration
for chain reaction is the economy of neutrons; the fuel-moderator assembly should
be such that no neutrons may leak it. In order to derive energy from a nuclear reactor
at a constant power level (nearly same number of fission event per unit time) it is
essential that number of fission events per unit time must remain nearly constant.
This brings in the criticality criteria in consideration. Let us assume that at some
instant ‘t’ A-number of fission events have taken place. These A fission events will,
on an average, produce (2.5A) fast neutrons. These fast neutrons will collide with
moderator and may produce say, N 1 (N 1 < 2.5A) thermal neutrons. These N 1 thermal
neutrons are the first-generation thermal neutrons at time ‘t’. N 1 thermal neutrons
will initiate fission in many more fuel nuclei again producing fast and then thermal
neutrons. Let the number of thermal neutrons produced at this stage is say, N 2 . N 2
is the second-generation thermal neutrons. The ratio η
nuclei in the fuel elements by the stray neutrons. Since fission of each 235 U nucleus
produces two to three fast neutrons which are readily thermalised by moderator water,
the number of thermal neutrons increases almost exponentially initiating fission of
many more 235 U nuclei and setting a self-sustained fission chain reaction. The power
level (fission rate) of the reactor increases with time, and once it reaches the desired
level, arrestor or control rods are inserted in the reactor core by remote operation.
Control rods absorb thermal neutrons leading to the decrease in the fission rate. When
fission rate starts declining below the desired level, control rods are withdrawn and
are re-inserted when fission rate increases above the desired level. Remote-operated
insertion and withdrawal of control rods maintains the power output of the reactor
to the desired level.
In case of any emergency, the reactor may be switched off by inserting the control
roads to the full extent and removing moderator, water in the present case, from the
system. However, both these operations take time of the order of few ten of seconds
to a few minutes because control roads are bulky and removing water also takes time.
Large switch-off time is a drawback of the reactor.
plant was poorly managed. Long switch-off time also played a negative role, core-
meltdown occurred during the switch-off, and big explosion took place. Another
reason for the leakage of radioactive gases and solids, etc., was the absence of
any concrete containing walls around the reactor. A big nuclear mishap happened
in 2011 at Fukushima nuclear plant, but it occurred because of the tsunami wave
of unprecedented height. Direct loss of human life occurred only in Chernobyl
mishap.
3. Fuel economy is a prime consideration in choosing a source of energy. Energy
density of nuclear fuel is among the highest as is indicated in Table 10.5;
4. Land required for a power plant, particularly in countries having high density
of population, also becomes an important consideration. Land area required for
nuclear plant is comparable to the area required by a fossil fuel power plant of
same capacity. A nuclear power plant of 1000 megawatt capacity may typically
require a land area of about 2 km2 , while a wind farm of same capacity requires
almost 360 times this area and solar photovoltaic plants of same capacity about
75 times the area.
5. Nuclear reactors are working in the world for almost 70 years; the nuclear
technology is time tested and reliable.
Two main drawbacks of a fission nuclear power plant are: (a) production of large
amount of highly active and long-life radioactive waste and weapon-grade fissile
material, proper and safe disposal of which is a big issue and (b) possibility of
radioactive fallout if control and or cooling systems fail. It is however, important
to note that all radioactive nuclei of long half-lives that are generated during the
nuclear fission in a reactor may be made to undergo fission and produce energy if
hit by neutrons of appropriate energy. Also, long-life radioactive nuclei on hitting
by fast neutrons may break into stable or short-life radionuclides. In thermal neutron
reactors, the criticality factor is kept around 1; therefore, there are no extra neutrons
to burn the waste.
To overcome the problems associated with conventional thermal neutron reactors
a new concept is taking shape in terms of the accelerator-based energy amplifiers.
A proton beam of about 1 mA and 800 MeV from an accelerator is made to hit a
10.14 Nuclear Energy 513
heavy target like lead. Interaction of proton beam with lead produces high flux of
spallation neutrons that have an energy distribution. The spallation neutrons energy
distribution is further modified from the scattering by lead block such that there
develop zones around the interaction point which have neutrons of different energies.
If heavy nuclei are charged in zones of appropriate neutron energy, they may undergo
fission releasing energy. The system will not only burn out its own radioactive waste,
but radioactive waste from other conventional reactors may also be charged in the
system to produce energy. Weapon-grade fissile material accumulated in the world
may also be used as a fuel in this system to generate energy. It may be noted that
the neutron economy of the system is not governed by the fast neutrons emitted
in fission, as is the case with conventional fission reactors; instead it depends on
the current and energy of incident proton beam. The system remains sub-critical in
absence of proton beam and becomes supercritical when proton beam is switched
on. As such the fission process going on when proton beam is on may be switched
off instantly if proton beam is stopped. This avoids the possibilities of accidents that
may happen in conventional reactors because of large switch-off time. It is true that
initially a high current high-energy proton accelerator will need considerable amount
of energy to run it, but once fission energy is available it will be much larger than the
energy consumed by the accelerator, and a part of the fission output energy may be
used to operate the accelerator. The basic concept of the energy amplifier is shown
in Fig. 10.19.
Figure 10.20 shows an artist’s view of an accelerator-driven energy amplifier.
Though nuclear energy is not a renewable source of energy but the present reserves
of fissile and fissionable sources in the world are enough to last for almost a century.
Moreover, many countries are developing techniques to extract uranium from water;
India for example claims to have already developed methods to extract uranium from
water with almost 90% efficiency. The safety track record of nuclear energy is quite
impressive; less than 50 people have directly lost their lives during the history of
nuclear energy, maximum of 14 people in Fukushima, Japan, in 1911 accident. In
comparison much more lives have been lost in accidents of coal-run energy industry
and hydropower industry in the world. Nuclear energy is one of the cleanest energies
and requires comparatively very small amount of fuel and space; technology is time
tested and may supply uninterrupted, clean power to industry on a long-time basis.
Expected future developments, like accelerator-based energy amplifiers, will make
fission nuclear energy a very clean, safe and sustainable option.
SAQ: How accelerator-driven amplifiers will make fission energy safer?
Many renewable energy sources may be installed only in specific areas or terrain;
for example wind energy may be trapped only in those regions where winds of some
minimum velocity below at all times. Same is true for geothermal energy, energy
from ocean and hydroelectric energy. This is partially true for solar energy also;
energy from sunshine may be trapped and converted into electric energy only where
it is going waste, like on rooftops, deserts, etc. Sunlight falling on agricultural land,
in forest areas, etc. cannot be obstructed as it is already being used in such areas by
crops and plants/trees.
10.15 Terrain Dependent Renewable Energy Sources 515
It is simply the power derived from the earth’s internal heat. Geothermal energy is
contained in the rocks and fluids beneath the crust of the earth. At some locations
it may be available at shallow grounds, while at other places it might be few kilo-
metre beneath the surface. Geothermal energy finds its way to the earth’s surface in
three ways: hot springs, geysers and fumaroles (where volcanic gases are released),
volcanoes. Most of the active geothermal reserves are found along major tectonic
plate boundaries, the most active geothermal area in the world called the ring of fire,
encircles the pacific ocean. When magma reaches near the surface of the earth it heats
groundwater trapped in porous rocks or water trapped in fractured rock surfaces and
faults making geothermal water reservoirs.
In USA most of the geothermal power plants are located in western states and
Hawaii where geothermal resources, mostly geysers, are near to the ground level.
California State of USA is producing electricity from geothermal resources since
1960. The first geothermal electric power station was built in Larderello, Italy, in
1904. Geothermal power plants may be of three types: dry steam, flash and binary.
In dry steam plants steam is taken out of the faults or fractures on the ground and is
directly used to run electric turbines. Flash plants pull out hot and high-pressure water
into cooler and low-pressure water producing steam which is used to run turbines.
In binary plants hot water pulled from geothermal reservoir is brought in contact
with some fluid that has low boiling point. The low BP fluid gets vaporised, and
these vapours are used to derive turbines. Many countries including New Zealand,
Iceland, Philippines, Indonesia, Mexico, Sweden and Turkey are producing electric
power using geothermal energy sources.
Geothermal energy is a relatively clean source, energy can be extracted without
burning fossil fuels, and a geothermal power plant emits almost one sixth amount
of CO2 compared to a clean natural gas fired plant of same capacity. As a matter
of fact binary geothermal plants do not release any carbon dioxide. In many cases
geothermal energy may be directly used for heating of homes, removing snow and
in many similar other applications. Geothermal sources supply energy continuously
are not weather or time dependents which is not the case with wind or solar energy.
Figure 10.21 shows a geothermal electric power station.
Many countries have considerable geothermal potential; however electricity
production from geothermal sources is not economically viable at present.
The main concern of geothermal energy is the hydrogen sulphide (H2 S) gas which
is invariably released from geothermal sources. At many geothermal sources a toxic
fluid is also emitted disposal of which creates problems.
516 10 Sustainability and Sustainable Energy Options
Kinetic energy of naturally blowing wind may be trapped using a turbine that converts
it into electricity. Special turbines used in wind-run power generation may be clas-
sified into two types: the horizontal-axis wind turbine (HAWT) and the vertical-
axis wind turbine (VAWT). Blade design of the two types of turbines is shown in
Fig. 10.23.
A single wind turbine may provide electric power to a single house, but a cluster
of wind turbines, called a wind farm, may provide electric power to a city. One big
problem with wind power is that it depends on the availability in a given area of
wind with a minimum speed of 12–14 km/h and up to the speed of 50–60 km/h when
electricity generation is a maximum. Further, at wind speeds of around 90 km/h the
turbines must be switched off to avoid damage. The upper and lower limits of wind
speeds essentially depend on the type and size of the turbine blades. Since the output
power level of a wind turbine depends on the speed of the turbine rotor which varies
with the speed of the wind, most of the wind turbines are attached to storing batteries
to stabilise the inconsistent energy surges to be useful. Storing batteries attached to
a wind turbine also provide power at a constant level when the wind speed at wind
farm area is either less than the minimum or is above the maximum speed limit
recommended for the system.
Wind energy is renewable, free from any greenhouse emission and is also cheap.
Wind power share of worldwide electricity usage in 2014 was around 3.1% which
rose to 4.8% by the end of 2018. Portugal and Spain both produce around 20% of
their total energy through wind. India is perhaps the only developing country where
wind energy share is around 10% of total energy consumption, followed by USA
with 10.2% and China 6.1%.
Average cost of wind energy in USA varies between 1 and 2 cent per kWH. On the
other hand in India it is around INR 2.77 (3.5 US cent), in Italy around 7.5 US cent,
and Europe average is around 6.2 US cent per kWH. The global weighted average
of electricity of new onshore wind farms in 2019 was USD 0.053/kWh.
The biggest limitation of wind power is that it requires large open area where
natural wind with speeds above the minimum requirement is available for consider-
ably long periods of time. Another fact is that generally areas where wind power may
be harnessed are far away from cities and places where energy is used. Transportation
of energy adds to the cost of wind energy. Large numbers of wind turbines in wind
farm adversely affect the birds and other wild life in the area.
SAQ: A wind field power station is to be connected to an AC domestic grid. Name
the electronic circuits/blocks that may be required for this.
Energy coming in the form of light and heat from sun is called solar energy. Solar
energy may be directly used for heating and cooking in solar heaters, or it may be
converted into electricity. Solar energy is also used to create renewable fuels like
hydrogen. By the end of 2020, around 3% of global energy demand (≈ 700 GW)
was met by solar energy. Further, solar energy sector is the fast-growing sector of
renewable energy. One reason for this rapid growth in solar electric energy use is the
sharp drop of the global levelised cost (LCOE) of solar electricity which dropped by
a factor of 85% during the period from 2010 to 2011.
Solar energy may be used through two different technologies: solar thermal and
solar photovoltaic (PV).
In solar thermal technology, radiations from the sun are converted into heat energy
which may be directly used for heating, or it may be converted to steam by concen-
trating incident solar radiations. The steam may then be used to run electric turbines
for generating electricity.
The small-scale thermal technology is used to heat a volume of space, like
rooms and houses and for providing hot water for homes and swimming pools, etc.
The concentrated solar thermal technology is used to produce electrical energy by
concentrating solar radiations falling on a large area. The average power density of
sun radiations on the surface of earth is approximately 1.4 kW per m2 and is rather
small. Solar power may be concentrated at a small area or volume by accumulating
solar powers falling at different areas using reflecting mirrors. The small area or
volume where solar energy is concentrated by reflections from mirrors is called the
receiver. Receiver is kept in thermal contact with a heat reservoir or thermal energy
storage system. Heat energy may be drawn from the heat storage system as and when
required.
520 10 Sustainability and Sustainable Energy Options
As already mentioned the energy density of solar radiations is rather small, there-
fore, it is required to collect solar radiations falling on different areas of the surface
to concentrate large amount of solar energy. This is called solar energy concentra-
tion. In older version of solar energy concentrators, called parabolic concentrator
(see Fig. 10.24), a large parabolic shaped reflecting surface was used. Solar radia-
tions falling on different parts of the parabolic surface got reflected by the surface
to the focus of the parabola and thus large amount of solar energy got deposited at
the focus of the parabola where some energy absorber like water (called receiver)
etc. may be kept. In parabolic concentrators temperatures as high as 400 °C have
been produced at the receiver of the device. Since it is difficult to make very large
parabolic reflecting surface, parabolic concentrators are limited in their applications
and have now been replaced by another kind of concentrator called Tower Concen-
trator. A tower concentrator uses large number of mirrors inclined in such a way
that reflected solar radiations from all mirrors get collected in a small volume at a
tower which works as the receiver. Working principle of a tower concentrator is also
shown in Fig. 10.24. Since there is no limit on the number of mirrors that may be
used to concentrate solar radiations, the total solar energy absorbed by the receiver
may be very large, temperatures of the order of 565 °C at the tower concentrator
receiver has been observed and solar radiations falling on areas of the order of 1–2
Km2 have been concentrated at the receiver. The receiver may be attached to a water
line that may work as the heat store converting liquid into vapours and running the
electric turbine. It is also possible to attach receiver with molten salt storage system
allowing the system to operate for periods of low or no solar energy. Solar power
tower at Crescent Dunes solar energy project concentrates solar energy using 10,000
mirrored heliostats spread on an area of about 1.21 km2 . Parabolic concentrators,
used earlier collect solar energy from a relatively small area.
Concentrated solar power (CSP) had a global capacity of around 7000 MW,
the maximum share of about 33% being produced in Spain. Other CSP generating
countries include USA, North Africa, India and China.
SAQ: What is the advantage of tower concentrator over the parabolic one?
Photovoltaic devices convert solar energy into electrical energy. A single PV device,
called a cell, is small, usually less than the thickness of four human hairs, and may
typically produce about 1 or 2 watts of electric power. PV cells are delicate, and
in order to withstand the outdoors for many years, they are sandwiched between
protective transparent materials like in a combination of glass and plastic. To boost
the power output of PV cells they are connected together in chains to form larger
units called panels or modules. Individual panels may be directly used, or they may
be connected to form arrays. Two other important components of a PV power system
are: (i) the mechanical mounting on which arrays are placed to face the sunlight. (ii)
The electrical out of PV array is in the form of DC power, and it must be converted
into AC power before it is connected to the power grid. Alternator circuit converts
DC power into AC power.
Figure 10.25 shows the structure of a PV cell. As shown in the figure a PV cell
is fabricated by developing a very thin layer of N-type material followed by a thick
P-type layer on the same semiconductor crystal. The semiconductor crystal behaves
as an n-p junction diode, with a depletion layer sandwiched between the N- and the
P-sides. It may be recalled (see Chap. 3) that the depletion layer of an np junction
behaves as a charged parallel plate capacitor with N-layer behaving as positively
charged plate and the P-layer as negatively charged plate of the capacitor. Further,
the depletion region is free of any free charge carriers: electrons and holes. It may
also be recalled that there is an internal potential difference V ib between the N-
and the P-sides, N-side being at a higher potential than the P-side. The N-layer is
made thin so that sunlight photons falling on it pass through it. Some conducting
strips are also developed on the N-layer, which work as electrode for taking electric
connection. The P-layer is sufficiently thick so that sunlight photons passing through
the thin N-layer are all stopped in the depletion region and may not leave the cell. An
electrode, for electric connections, is developed on the thick P-layer. The top side
of the N-layer which is exposed to the sun is painted by an antireflection paint so
that most of the sunlight photons falling on the cell are transmitted to the N-layer
without much reflection. When PV cell is exposed to sunlight, N-side facing the
sun, sunlight photons reach the depletion layer passing through the N-layer. Sunlight
photons ionise the atoms of depletion layer creating free charge carrier: electron and
hole. Electrons are attracted by the N-layer which is at positive potential (+V ib ) and
behave as a positively charged plate, while positively charged holes move towards
the negatively charged P-layer at lower potential. The flow of electrons and holes
(created by sunlight photons) constitutes a current through the external load in the
522 10 Sustainability and Sustainable Energy Options
circuit. In this way the solar energy which is contained in sunlight in the form of
photons gets converted in to electrical energy by the PV cell.
Most of PV cells are Silicon cells, which have different conversion efficiencies and
costs ranging from amorphous Silicon cells to polycrystalline and monocrystalline
Silicon types. The efficiency of an ordinary PV cell ranges from 10 to 20%, because of
several factors including the energy of sunlight photons, their absorption in depletion
layer, cleaning of panel surface, etc. Solar PV cell arrays may power a small laptop
to a house. Solar farms of say of 20,000 panels spread across an area of 30 acres
may generate around 3–4.5 megawatts of power, sufficient to power 1200 homes.
Governments in many countries are encouraging house owners to have solar energy
systems for their household use. Household solar systems are very reliable, have low
maintenance cost and have no negative environment impact; if in peak hours the solar
system provides excess electrical energy, and it may be given to be sold to the grid
for use by others. Solar panels are generally put on rooftops, an area that may not
be used for any other purpose. Since most solar cells are made from Silicon, which
is quite in abundance in soil, there is no constrain on the fabrication of PV devices
from the point of the availability of raw material.
A major concern is the high initial cost of fabricating PV cells, panels and arrays.
Fabrication of solar energy systems and solar energy industry adds greenhouse gases,
though use of solar energy in itself is non-polluting and free of greenhouse gas
emission. In view of low conversion efficiency of solar devices coupled with the low
energy density of solar radiations, it is required to spread solar panels or other solar
energy pick-up systems over wide areas to make a viable alternate energy system
based on solar energy. Globally, solar PV electrical generation is expected to grow
by 18% or more in the coming years.
SAQ: Why PV technology for harnessing solar energy is becoming popular?
10.18 Energy from Ocean 523
Ocean energy may be classified in two broad types: mechanical energy that is asso-
ciated with tides and waves in oceans and thermal energy that is stored by the sun in
ocean in the form of temperature difference between the water at the ocean surface and
deep inside. There is another form of ocean energy the potential energy which is asso-
ciated with ocean currents and is generally not taken into account while discussing
extraction of ocean energy as a source of alternate green energy.
does not produce food for birds of the area and they migrate. Level of salinity of
the area is also affected. Rotating blades of turbines also endanger marine fife. A
barrage-type power station built at Rance River estuary in Brittany, France, in 1966
to harness tidal energy is still working. It uses the energy contained in currents of the
river as well as the tidal energy of English Channel. The barrage has increased the
silt level and has adversely affected the plant life of the area. A fish of the area called
Plaice has totally extinct, while another variety called cuttlefish is now thriving in
the area.
Tidal lagoon is a body of ocean water that is partially enclosed by artificial or
natural boundary. Tidal lagoon works in the same way as a barrage basin. Ocean water
enters lagoon during high tide and drains out during the low tide. Electrical turbines
generate electricity during both cycles, and therefore, lagoon-based generators may
supply continuous electrical energy.
World’s five biggest power stations based on tidal power are: (i) Sihwa Lake
Tidal Power Station, South Korea, capacity 254 MW; (ii) Rance Tidal Power Station,
France, capacity 240 MW; (iii) MeyGen Power Station, UK, capacity 6.0 MW; (iv)
Jiangxia Tidal Power Station, China, capacity 3.2 MW; and (v) Eastern Scheldt
Barrier Tidal Power Station, Netherlands, capacity 1.25 MW.
Ocean energy is renewable and clean and qualifies to be a sustainable source of
energy; however, the market for tidal electrical energy is not yet grown, and investors
are not sure about its profitability.
Thermal energy in the form of temperature gradient with depth in ocean water is
deposited by the sun. Since the ocean water temperature gradient is in general small
and varies with the location of the place, the first task is to find a suitable site for
installing a system that may harness ocean thermal energy. Ocean thermal energy
conversion (OTEC) plants can only be installed at seashores where the top ocean
surface remains hot all through the year and the water at depth remains sufficiently
cold. Hawaii and some Caribbean islands are most suited for OTEC. However,
these areas are also very rich in solar and wind energies, and therefore the ocean
thermal energy has to compete with these other sustainable energy sources to become
economically viable.
Figure 10.26 shows an artist’s view of an OTEC system. Ocean surface water
at temperature 20–25 °C may be used to vaporise another low boiling point fluid,
called working fluid, and pressurised working fluid vapours may run an electric
generator. Low-pressure and low-temperature vapours discharged from the turbine
(as it converts thermal energy of vapours into electrical energy) may be condensed
using cold water (3–7 °C) pumped from the depth of the ocean and may be returned
to the boiling pot forming working fluid closed loop. Sea water itself may be used as
the working fluid and may be turned into vapours by supplying additional heat. The
advantage of using sea water as working fluid is that the water left as discharge from
10.19 Portable Sources of Sustainable Energy 525
the electric turbine is pure salt-free water and may be used for human consumption
or may be further cooled using the low-temperature ocean water pumped from the
depth, to supply as refrigerated cold water.
The global potential of OTEC is much larger than other ocean energy options,
and it may become a major source of continuous, clean, renewable energy that may
sustain base load and therefore many countries including Japan, USA, European
countries, India, China, etc. are working on projects involving OTEC.
SAQ: What are the three different technologies used to harness tidal energy?
Intermittent renewable sources, like the wind electric turbines or solar PV cells, etc.,
which generate electric power only at some particular time or generate excess power
at some peak hour, need a system where excess electrical energy generated may be
stored for use at a later time. Chargeable lithium-ion batteries and supercapacitors are
devices that may be employed to store electrical energy for use at some other place
and time. Both of them are portable and can be carried from one place to another;
not only that, these devices may be refilled with electrical energy once the energy
already stored is consumed.
526 10 Sustainability and Sustainable Energy Options
to be discharged are all very critical. For long life and safety, it is required
that battery should not be charged beyond 80% of the maximum value and,
similarly, should not be allowed to discharge below 20% value. These controls
in a Li-ion battery are achieved by the use of IC-based electronic circuits.
Figure 10.27 shows the diagram of a Li-ion cell. The cell may be divided into
five components: Aluminium current collector cathode, lithium metal oxide, porous
separator that allows the migration only of lithium ions and does not allow migration
of electrons through it, lithium-loaded graphite and a copper anode. The cell is filled
with a polymer gel/organic carbonate, like ethylene carbonate which serves as the
electrolyte for the migration of lithium ions. When a charged Li-ion cell is connected
to the load, Li-ions from the anode side migrate through the separator to the Cathode
side inside the cell, while electrons travel through the external load constituting the
load current. Reverse happens during the charging phase, and Li-ions migrate from
cathode side to the anode side via separator, and electrons from the charging current
source reach anode to complete the circuit. As already mentioned, the electrolyte
simply facilitates the migration of lithium ions from one side to the other across the
separator.
Super- or ultracapacitors are used for energy storage undergoing frequent charge and
discharge cycles at high current and short duration.
Supercapacitors are used in automobiles, buses, trains, cranes, elevators and other
airport equipment where regenerative breaking, short-term energy storage or burst-
mode power delivery is required. In contrast, Li-ion batteries are used where long-
term energy storage is required. The layout of a typical Helmholtz double-layer
supercapacitor that develops two layers of positive ions and electrons at the interface
of electrolyte with anode and cathode is shown in Fig. 10.28. As shown in the figure,
a perforated separator layer that allows the migration only of positive ions divides
the capacitor into two sections. Each Helmholtz layer may be treated as a parallel
place capacitor with large quantity of charge on plates and extremely small distance
between the plates. Therefore the capacitance of each layer is quite large in the range
of Farad. The energy storage mechanism in a supercapacitor is bulk separation and
movement of charges.
Supercapacitors have two electrodes: a separator and an electrolyte. Outer sides
of electrodes are covered with Aluminium foils that work as current collectors.
Electrodes are porous generally made of different types of carbon such as carbon
cloth, activated carbon or carbon nanotubes or of mixed metal oxides or conducting
polymers. Recently graphene because of its high chemical stability, high electrical
conductivity and large surface area has also been used for making electrodes. The
separator allows the migration of ions through it but restricts the migration of elec-
trons. The electrolyte is mostly liquid through which ions may easily flow between
electrodes. Energy density of a supercapacitor is quite high, comparable to a Li-ion
cell.
SAQ: Compare a supercapacitor with a lithium-ion cell. The working voltage of a
1F supercapacitor is 3 V; what maximum charge it may hold?
SAQ: What are some special features of Li-ion batteries? List two drawbacks of
Li-ion batteries.
Short Answer Questions
SA10.1 What is meant by sustainability? Discuss how an individual may
contribute to the sustainability of earth.
SA10.2 Discuss the concept of sustainable economy.
SA10.3 What is greenhouse effect and what is responsible for it in earth’s
atmosphere?
SA10.4 What are the five dimensions of social sustainability?
SA10.5 What are the main requirements for economical sustainability? Comment
on circular economy.
SA10.6 What is greenhouse effect? What causes it?
SA10.7 What is the big achievement of Paris Agreement? What is ‘Nationally
Determined Contributions (NDCs) in this context?
SA10.8 What may be the possible ways of removing CO2 from the atmosphere?
SA10.9 Why CO2 is a greenhouse gas and why is it most fatal from the point of
global warming?
SA10.10 What may be the possible ways of removing CO2 from the atmosphere?
SA10.11 Define the following terms and discuss their significance; Primary energy,
LCOE, Gravimetric energy density.
SA10.12 What is a fuel cell? How does it differ from a battery?
SA10.13 What are the main components of a fuel cell? Explain the working of a
(PEM) cell.
SA10.14 How does an (AFC) cell differ from a (PEM) cell? List one advantage and
one drawback of (AFC) cell over the (PEM) cell.
SA10.15 What fuel is used in a (PAFC) cell? Why this cell is often used in big
structures like trucks, buildings, etc.?
SA10.16 Compare advantages and disadvantages of hydroelectric power.
SA10.17 Compare wind, solar and geothermal electric sources of energy
SA10.18 Name different techniques used in trapping ocean energy, explaining one
of them in some details.
SA10.19 With the help of a diagram give the construction details of a lithium-ion
cell and list its advantages.
530 10 Sustainability and Sustainable Energy Options
MC10.9 Which of the following energy sources may provide electricity and cold
pure water as a by-product?
(a) Hydrogen fuel cell (b) ocean thermal (c) nuclear (d) geothermal
ANS: (b), (d)
MC10.10 Which type of economy is best suited for sustainable growth?
(a) Circular economy (b) free market economy (c) command economy
(d) mixed economy
ANS: (a)
Long Answer Questions
LA10.1 Discuss the concept of sustainability and its components. Give a detailed
account of economical sustainability.
LA10.2 What is global warming? Discuss its causes. Mention about important steps
taken by the UNO in arresting temperature rise.
LA10.3 What should be the essential properties of a sustainable energy source?
Give a list of energy sources that in your opinion are best for your country,
and justify your choice.
LA10.4 Differentiate between a battery and a fuel cell. List different types of fuel
cells, and explain their relative merits and de-merits.
LA10.5 Write a detailed note on solar energy as a viable and sustainable source.
LA10.6 Give detailed account of the construction and working of a lithium-ion cell.
What are the main advantages of this cell?
LA10.7 Write an essay on sustainable energy options.
Index
Diffusion coefficient, 84, 95 Greenhouse effect, 477, 478, 480, 482, 484,
Diffusion currents, 94 529
Direct semiconductors, 91, 412 Greenhouse gasses, 478–488, 492–494,
Distribution function, 97, 240, 242, 272, 501, 503, 516, 517, 522, 529, 530
318, 338, 339, 342, 344, 346 Group velocity, 217, 219–221, 293, 294,
Donor, 81, 85–87, 90, 103–105, 108, 109, 365, 367
134 Gyromagnetic ratio, 145, 149
Doped semiconductors, 68, 81, 90, 108, 110
Drift velocity, 93, 100, 113, 443
H
Half metal, 68, 69, 114, 115
E Hall effect, 99, 100, 131, 135
Economic sustainability, 475, 476 Hermitian operators, 267, 268, 270, 275,
Eigenvalue, 271, 276, 277, 283–287, 297, 276, 281–283, 286, 312–315
312 High dimension arrays, 437
Einstein’s specific heat equation, 251 Hole, 74, 75, 78, 80, 81, 86–89, 91, 92,
Electric dipole, 51, 53, 54, 141, 142, 451 94–96, 100, 102–105, 107–111, 114,
Electronegativity, 44–46, 52, 55, 56, 59, 60 132, 236, 372, 405, 412, 414, 441,
Electron mobility, 93 442, 521
Energy carrier, 502 Homogeneity, 272
Ensemble, 317, 437
Environment, 71, 74, 181, 373, 458, 461,
I
473, 474, 476, 477, 481, 484–487,
Impact strength, 10, 24
489, 493, 495, 501, 522, 523, 526
Indirect semiconductors, 90, 91
Environmental sustainability, 476, 477
Induced decay, 384
Exchange interactions, 165, 168, 170–172,
Insulators, 2, 28, 61, 63, 64, 66–68, 75, 78,
176–178, 186, 188, 189
85, 104, 131, 134, 138, 175, 182,
Expectation value, 286, 287, 289, 310, 313,
263, 406, 408, 440, 443
315
Intermetallides, 19
Internal potential barrier, 103–107
Intrinsic semiconductors, 68, 73–75,
F
78–81, 85–90, 101, 108–110
Fermi Sea, 72
Ferrites, 18, 19, 181–184, 187, 189
Fick’s second law, 84 L
Fluorescence, 415, 416, 438, 449 Langevin function, 160, 161
Food deserts, 474 Leveilized cost of electricity, 499
Forbidden energy gap, 64–69, 73–75, 78, Lewis symbols, 42
86, 89, 91, 109, 110, 113–115, 129, Lithography, 459
130, 440 Localised Surface Plasmon Resonance,
Forward bias, 105, 107, 412, 413 440, 470
Fuel cell, 468, 503–506, 529–531 Lustre, 2, 3, 10
Full angle divergence, 403
Functionality, 21, 60
M
Macromolecules, 19
G Macrostate, 323–328, 331, 333, 336,
Gamma rays, 210, 216, 242, 243, 264, 378 340–346
Garnets, 181, 182 Magnetic spin quantum number, 30, 31, 33
Gelation, 455 Majority carriers, 86–90, 96, 103, 106, 107,
Giant pulse formation, 423 112, 133, 412
Gradient force, 450–453, 470, 472 Malleability, 3, 5, 9
Gravimetric energy density, 499, 500, 502, Matrix, 24–28, 58, 59, 184, 268, 275, 276,
505, 526, 529, 530 467, 505
Index 535
Meissner effect, 118, 123, 134 Population inversion, 377, 386, 388–394,
Metallic bonding, 6, 7, 9, 18, 52, 113, 115 402, 406–409, 411, 412, 414, 416,
Metalloids, 69, 114 418–422, 431, 433
Metastable states, 389–392, 401, 402, 409, Primary carbon footprint, 485
416, 418, 431 Primary energy, 495, 496, 501, 529
Microstates, 323–327, 334, 336, 340–342, Pumping, 389–392, 394, 395, 401, 402,
344–346 406–411, 415, 416, 420, 421, 424,
Minority carriers, 86–89, 95, 107, 110, 112, 428, 433
133
Mode-locked, 423
Monocrystalline, 69, 71, 522 Q
Quantum confinement, 441, 471
Quantum dots, 437–440, 442, 458, 470, 471
N Quantum mechanical superposition, 272,
Nano films, 437 292
Nanofluids, 444 Quantum mechanical tunnelling, 306, 308,
Nanoscale, 435, 437 315
Nanotechnology, 435, 437, 467, 469, 470
Nano tubes, 437, 439, 464, 466, 469
R
Nano wires, 437, 438, 444, 458
Radiationless de-excitation, 383
Nanoworld, 435, 437
Rayleigh Jean’s formula, 240
Nationally Determined Contributions
Residual magnetization, 174, 175
(NDCs), 487, 529
Resistivity, 11, 14, 19, 56, 61–64, 95, 96,
Non-degenerate level, 320, 330
113, 114, 116–118, 120, 126, 131,
Non-degenerate semiconductors, 90, 108
134, 175, 443, 466
Numerical aperture, 360, 361, 373
Retarding potential, 227, 230
Reverse bias, 105, 107, 134, 372
Right hand rule, 139, 140, 142
O
Occupation number, 319, 321–324, 328,
331, 334, 338, 341–344 S
Optical resonator, 393, 402 Scattering force, 450–452, 472
Schrodinger’s time-independent equation,
274, 289, 313
P Schrodinger wave equation, 30, 268
Pair production, 246 Screening constant, 204
Partition function, 318, 339 Secondary carbon footprint, 485
Pauli’s exclusion principle, 8, 31, 33, 170, Self-assembly, 436, 455, 456, 470, 472
176, 188 Semiconductors, 2, 17, 28, 61, 63, 64, 66,
Permeability, 139, 152, 153, 175, 189 68–70, 72, 74, 75, 78, 79, 81–97,
Phase velocity, 217, 218, 220, 221, 294 99–101, 107–111, 113, 114, 124,
Phonon, 14, 116, 126–129, 135, 172, 253, 130–132, 134, 391, 405, 406, 409,
255, 443 412, 414, 433, 436, 437, 440–443,
Phosphorescence, 416 458, 521
Photo avalanch diode, 372 Shallow impurities, 90, 92
Photon, 8, 11, 14, 52, 91, 110, 130, 191, Shear strength, 10
195, 196, 211, 216, 232–235, Singlet state, 176, 177, 187, 415
242–246, 253, 258–261, 263, 264, Social sustainability, 474, 475, 529
364, 378, 380–383, 385–402, 409, Specific dielectric strength, 67
411–414, 416–418, 423, 429, 431, Stationary states, 274, 310
432, 438, 443, 444, 450, 451, 458, Superconductors, 61, 63, 64, 117–126, 129,
481, 521, 522 131, 135, 157
Planck distribution function, 242 Supermagnetism, 445
536 Index