Professional Documents
Culture Documents
Fundamentals of Semiconductor Physics and Devices RolfNorman
Fundamentals of Semiconductor Physics and Devices RolfNorman
Rolf Enderlein
ihmboldt-bhiuemity BerEin
Universi?yof Sao Pauh
NommJ.M. Horing
stewwas Institute of Technology
Hoboken, NJ
World Scientific
Singapore New Jersey London Hong Kong
Puhli.dzed hy
World Scientific Publishing Co. Re. Ltd.
For photocopying of material in this volume, please pay a copying fee through the Copyright
Clearance Center, Inc., 222 Rosewood Drive. Danvers, MA 01923, U S A . In this casc pcmission to
photocopy i s not required from the publisher.
ISBN 981-02-2387-0
Printed in Singapore.
Dedication
This book is dedicated to the memory of Adele and Werner Enderlein (par
ents of R.E.), and to t h e memory of Joseph and Esther Morgenstern (hfor
ganstein)(grandparents of N.J. M. 11.).
Vii
Preface
People come to technical books with a vast array of daerent needs and
requirements, arising from their differing educational backgrounds, professional orientations and career objectives. This is particularly evident in the
field of semiconductors, which stands at the juncture of physics, chemistry,
electronic engineering, material science and mathematics. No longer just
an academic discipline. this field is at the heart of an ongoing revolution in
communications, computation and electronic device applications, that innovate many fields and change modern life in myriad ways, large and small. Its
profound impact and further potential command interest and attention from
all corners of the earth. and from a wide variety of students and researchers.
The clear need to address a broadly diversified and variously motivated
readership has weighed heavily in the authors considerations. It poses a
pedagogical problem faced by many teachers of intermediate level courses
on semiconductor physics. Generally speaking, every student has previously
studied about half of the course materiaL The difficulty lies in the fact that
each students exposure is likely to have been to a &fleerent half, depending on which lower level courses and teachers they have had, and where the
emphasis lay. To accommodate readers with varied backgrounds we start
from first principles and provide fully detaiIed explanations and proofs, assuming only that the reader is familiar with the Schrodinger equation. This
intensively tutorial treatment of the electronic properties of semiconductors includes recent fundamental developments and is carried through to the
physical principles of device operation, to meet the needs of readers interested in engineering aspects of semiconductors, as well as those interested in
basic physics. Clarity of explanation and breadth of exposure relating to the
electronic properties of semiconductors, from first principles to modern devices, are our principal objectives in this fraddy pedagogical book. We offer
full mathematical derivations to strengthen understanding and discuss the
physical significance of results. avoiding reliance on hand waving arguments
alone.
To support the readers introduction to the physics of semiconductors, we
provide a thorough grounding in the basic principles of solid state physics,
assuming no prior knowledge of the field on the part of the reader. An ele
mentary discussion of the crystal structure, chemical nature and macroscopic
properties characterizing semiconductors is given in Chapter 1. Moreover,
we also include an extensive appendix to guide the reader through group
theory and its applications in connection with the symmetry properties of
semiconductors, which are of major importance. Beside spatially homogeneous bulk semiconductors, we undertake a full exposition of inhomogeneous
semiconductor junctions and heterostructures because of their crucial role
iu
Preface
The book has emerged from lectures which the authors presented for physics
students at the Humboldt-University of Berlin. Germany, and the State University of Sao Paulo, Brazil, and for physics and engineering physics students
at the Stevens Institute of TeFhnology in Hoboken, New Jersey, C.S.A. Part
of the book has similarities with the german book "Grundlagen der Halbleiterphysik" ("Fundamentals of Semiconductor Physics") which was written by
one of us (R. E.) together with A. Schenk. We are thankful to Dr. Schenk
(now at ETH Zurich) for allowing us to use part of his work in the present
volume. In writing this book we have had excellent suppoIt from many
of our colleagues at our own and other Universities. We are particularly
thankful to Prof. Dr. J. Auth (Humboldt-Lniversity Berlin), Prof. Dr. F.
Bechstedt (Friedrich-Schiller University Jena), Prof. Dr. W. A. Harrison
(Stanford University), Prof. Dr. M. Scheffler (Fritz-Haber Institut, Berlin),
Prof. Dr. J. R. b i t e , Prof. Dr. A. Fazzio, and Prof. Dr. J. L. Alves
(State University Sao Paulo), as well as to Prof. Dr. H. L. Cui, Prof. Dr.
G. Rothberg, Mr. G. Lichtner (Stevens Institute of Technology), and Prof.
Dr. G . Gumbs (Hunter College, CWNY, New York), who read parts of the
manuscript and contributed helpful suggestions and critical remarks. The
technical assistance of Mrs. Hannelore Enderlein is gratefully acknowledged.
RoIf Enderlein
Sao Paulo
Hoboken, N J
October 1996
Xi
Contents
1 Characterization of sernicond uct ors
Inlrnduclion . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.2 Atomic structure of ideal crystals . . . . . . . . . . . . . . . .
1.2.1 Cryst.al latlices . . . . . . . . . . . . . . . . . . . . . .
1.2.2 Point groups of equivalent directions arid crystal classes
1.2.3 Space groups and crystal structures . . . . . . . . . .
1.2.4 Cubic semiconductor structures . . . . . . . . . . . . .
1.2.5 Hexagoiial semiconductor st.ructures . . . . . . . . . .
1.3 Chemical nature of semiconductors. Material classes . . . . .
1.3.1 Group IV elemental semiconductors . . . . . . . . . .
1.3.2 111-V semiconductors . . . . . . . . . . . . . . . . . . .
1.3.3 11-VI semiconductors . . . . . . . . . . . . . . . . . . .
1.3.4 Group \'I elemental semiconductors . . . . . . . . . .
1.3.5 IV-VI semiconductors . . . . . . . . . . . . . . . . . .
1.3.6 Other compound semiconductors . . . . . . . . . . . .
1.4 hlacroscopic properties and their microscopic implications . .
1.4.1 Electrical conductivity . . . . . . . . . . . . . . . . . .
1.4.2 Depenclenre of conductivity on the semiconductor state
1.4.3 Optical absorption spectrum and the band modcl of
srmicoiiductors . . . . . . . . . . . . . . . . . . . . . .
1.4.4 Electrical conductivity in the band model . . . . . . .
1.4.5 The Hall effect and the existence of positively charged
freely mobile carriers . . . . . . . . . . . . . . . . . . .
1.4.6 Seinicondiictors far from thermodynamic equilibrium .
1.1
1
1
5
12
14
16
22
28
29
30
31
31
32
32
33
34
35
38
43
45
49
51
51
54
54
57
66
82
82
85
89
94
xii
Contents
2.4
2.5
2.6
2.7
2.8
Contents
...
Xlll
Contents
xiv
4.4.4
4.4.5
4.4.6
5
xv
Contents
7.1.2
7.1.3
7.1.4
7.1.5
7.4.3 MOSFET . . . . . . . . . . . . . . . . . . . . . . . . .
620
Appendices
A Group theory for applications in semiconductor physics
623
A.1 Definitions and concepts . . . . . . . . . . . . . . . . . . . . .
623
A . l .1 Group definition . . . . . . . . . . . . . . . . . . . . .
623
A.1.2 Concepts . . . . . . . . . . . . . . . . . . . . . . . . .
624
A.2 Rigid displacements . . . . . . . . . . . . . . . . . . . . . . .
627
A.2.1 Definition . . . . . . . . . . . . . . . . . . . . . . . . .
621
A.2.2 Translations . . . . . . . . . . . . . . . . . . . . . . . . 628
A.2.3 Orthogonal transformations . . . . . . . . . . . . . . . 629
A.2.4 Geometrical interpretation . . . . . . . . . . . . . . . . 631
-4.2.5 Screw rotations and glide re3ections . . . . . . . . . . 632
A.3 Translation. point and space groups . . . . . . . . . . . . . . 635
A.3.1 Lattice translation groups . . . . . . . . . . . . . . . . 635
-4.3.2 Point groups . . . . . . . . . . . . . . . . . . . . . . .
636
A.3.3 Space groups . . . . . . . . . . . . . . . . . . . . . . .
654
A.4 Representations of groups . . . . . . . . . . . . . . . . . . . .
655
A.4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . .
655
A.4.2 Irreducible representations . . . . . . . . . . . . . . . . 661
4.4.3 Products of representations . . . . . . . . . . . . . . . 667
A . 5 Representations of the full rotation group . . . . . . . . . . . 673
4.5.1 Vector representation of the rotation group and generators of infinitesimal rotations . . . . . . . . . . . . 674
A.5.2 Representations for dimensions other than three . . . 676
A.6 Spinor representations . . . . . . . . . . . . . . . . . . . . . .
682
A.6.1 Space-dependent spinors . . . . . . . . . . . . . . . . . 682
Cootents
mi
737
'741
Bibliography
747
Index
757
FUNDAMENTALS OF
SEMICONDUCTOR PHYSICS AND DEVICES
Chapter 1
Characterization of
semiconductors
1.1
Introduction
1.1. Introduction
1.2
means that,, for a given atom, there are remote atoms possessing the same
short-range order complexes as the original atom, rand that the positions of
the remote atoms are related to the position of the origina1 atom by sirnple transformations. 'l'hp crystal is considered t o be inhitely large in this
context. Atoms having identical short-range order complexes are termed
equivalent. Equivalent atoms are necessarily of the same chemical species,
but chemically identical atoms need not necessarily be equivalent.
1.2.1
Crystal lattices
Definition
The smallest possible unit cell is called a primitive unit cell In the extreme
this cell can contain only 1atom, but as a rule, it has several atoms. If
there is only one atom per cell, then the short-range order merges into the
long-range order.
If the unit cell is taken to be a primitive one, the vectors Al, A2, &
are some minimal vectors a l , az,w. The parallelepiped spanned by these
vectors is a primitive unit cell, Each translation by a vector R of the form
case
with integer coefficients r l , r2, r 3 transforms the crystal into itself. One refers
to this property as the trunslation~lsymmetry of the crystal. The point set
defined by the vectors R forms a spatial lattice called the crystal lattice. The
vectors al,a2, are termed primitive lattice vectors, The volume 00of a
primitive unit cell may be written as the triple scalar product of al,ag,a,
Ro = al . [ag x
a].
While the lattice of a crystal and the volume Qo of its primitive unit cell
are well defined, this is not the case for its primitive lattice vectors al,a2,a3
and also it is not true for the form of its primitive unit cell. Any set of
linear combinations of the primitive vectors al,a2,a3 which yields a triple
scalar product equal to the volume Qo is again a set of primitive lattice
vectors, and the parallelepiped spanned by them forms a primitive unit cell.
The corners of the parallelepipeds do not necessarily have to lie on lattice
points. Each parallelepiped, shifted arbitrarily in space, is again a primitive
unit cell. The 'parallelepiped form is also not imperative, as there are also
other forms possible. An especially compact primitive unit cell is the socalled Wzgner3eit.z cell. The center of this cell lies on a lattice point and its
surface is formed by the perpendicular bisector planes which divide in half
the line segments joining the center to adjacent lattice points.
Translations which transform a crystal into itself, by definition, do the
same for the lattice of the crystal. Here, the translations are through lattice
vectors R, called lattice tramtations The set of all lattice translations forms
a group. This term describes a mathematical set of elements among which
a 'multiplication' is defined that results in products which are also elements
of the set. Further properties of a set forming a group are listed in A p
pendix A. In particular, there must be an identity element, and the inverse
of an element must also be an element of the set. In the case of translations
the 'multiplication' is the consecutive application of two of these transformations. Since two consecutive lattice translations constitute yet another
lattice translation, and also the requirements of Appendix A are satisfied,
the set of all lattice translations of a crystal forms, in fact, a group, called
the translation symmetrg, group or simply the tramlation group. Groups of
symmetry elements play a central role in the description of the atomic structure and other microscopic properties of crystals. Appendix A provides a
thorough discussion of groups as needed in this book.
their positions. In a t,ranslation, all points are shifted by the same vector,
with no points fixed. Rotations transform right-hand4 Cartesian coordinate systems into right-handed systems, but the application of reflections
and inversion to right-handed system results in left-handed ones.
It turns out that there are orthogonal transformations which transform
lattices into themselves. They are called point symmetry operations of lattices. The set of all point symmetry operations of a lattice forms a group,
as does the set of all lattice translations. The multiglicslion of two of these
operations is again understood to represent their consecutive application.
Groups of point symmetry operations are termed point gvoups. In Appendix
A we describe thwn in detail. Not all of the various point groups listed
in Appendix A are allowed as symmetry groups of crystal lailices, but only
parlicular ones which are called holohedral point groups. We will derive them
now by demonstrating that lhey must have three special properties.
First, all uf these point groups must contain the inversion transformation
with r e q e t t to the lattice point R = 0. This may be seen as fullows:
Inversion with respect to 0 transforms a lattice point R into -EL Considered
joiutly with R, the point -R is a lattice point having -q, -r2, -7-3 as integer
coefficients. Therefore, inversion with respect to 0 transforms an arbitrary
lattice into itself. It follows immediately that inversion with respect l o any
other lattice point will do h e same.
Second, it turns out that rotation symmetry axes Ohrough lattice points
can only be 2-, 3-, 4- and 6-fold while ri, 7- and more-fold axes are not
compatible with the translation symmetry of the lattice. One may prove
this as follows: Let C, be a rotation about such an axis t,hrough an angle
2?r/n. We consider a lattice plane perpendicular to this axis and denote a
primitive lattice vector of the corresponding planar lattice by f (see Figure
1.1). Rotating it through 2a/a, it becomes C,f, and a rotation by - 2 ? r / R
transforms it into Cqlf. If, as w e suppose, Cmbelongs to the point group of
the lattice, them s n d o a C;'. Thus both C,f and C i l f are vectors of the
plane lattice. The same holds for the sum Cnf i Cg'f ol t,he two vectors.
Moreover, C,,f
CGLf represents a vector parallel to f. This means that
C,f
CF'f must be an integer multiple of f. Since the largest possible
length of Cnf Cg'f can only he 2 / f J ,one has Cnf f C'L'f = p,F with
pn - -2. -LO, 1 or 2. This is indicative t.hat, the relation
+
+
p , = 2COS
(?)
must hold for p,. For p n = -2, equation (1.4) yields n 2. For pn -= -1
one has n = 3, for pn - 0 it follows that n = 4 and for p , = 1 l h e solulion
is n = 6 . For ppI- 2 equation (1.4) has only the trivial solution n = I . This
completes the proof concerning rotation symmetv opexations. For so-caIled
gvasi-crystals. which do not exhibit an exact translation symmetry, rotations
1
Within a given crystal system, several different types of lattices may exist.
Their common property is that they all have the same point symmetry, but
they may differ otherwise. Figure 1.2 visualizes them by means of their unit
cells. These differences give rise to diflerenf lattice types. The simplest lattices ef a given point symmetry are represented at the far left of each row
in Figure 1.2. They axe called primitive lattices. Even these simple lattices
are not unambiguously &ermined by their point symmetry. If one, for instance, multiplies all lattice vectors by the same real number, i.e. stretches
or compresses the lattices evenly on all sides, the point symmetry remains
unchanged. In less symmetrical crystal system one may even change certain length relationships or angles between primitive lattice vectors without
disturbing the point symmetry. In the tetragonal system the height of the
10
rectangular parallelepiped in relation to its basis may be changed arbitrarily. Generally speaking, the primitive lattices are determined only up to
continuous point transformations preserving their point symmetries.
Moreover, starting from a primitive lattice one can produce other sets
of regularly ordered points by adding new points to each primitive unit cell
in equivalent positions. As before, equivalent refers to positions which are
either identical or that differ by a lattice vector. If one places the new points
in symmetrical positions, i.e. those which are transformed into themselves
or equivalent points by the point symmetry operations, snch as the centers
of the primitive unit cells, one obtains a new point set having the same
point symmetry as the original lattice. The new point set, in general, no
longer forms a lattice, but may be thought of as the union of several lattices
placed within each other. In some special cases, however, the result may
still be a primitive lattice. Whether this happens or not is a question which
must be explored separately in each case. If the answer is positive, one has
to examine whether the lattice is only another realization of the original
primitive lattice, i.e. whether or not it can be brought back to the original
one by a continuous and symmetry preserving transformation. It turns out
that both cases may occur. If the lattices cannot be transformed into each
other by such a transformation, then this implies that there are two different
types of lattices with the same point symmetry. One calls them different
Bmvais types or Bravais latttices. According to this definition two Bravais
lattices are of the same type if they may be transformed into each other by
a continuous and point symmetry preserving transformation, otherwise they
are Bravais lattices of different types.
As an example, we consider the different Bravais lattices in the case of the
cubic crystal system. If one adds the body centers of the primitive unit-cellcubes as additional points to a primitive cubic lattice, the resulting point set
has the cubic point symmetry and it forms again a lattice. The same holds if
the added points are the centers of the faces of the primitive unit-cell-cubes
instead of the body centers. Neither the spacecentered nor the face-centered
cubic lattices can be transformed back to a primitive cubic lattice by means
of a continuous and symmetry preserving transformation, nor can the two
centered lattices be transformed into each other by such a transformation.
Therefore, they represent cubic lattices of two new Bravais types. If one adds
both face and body centers to the primitive cubic lattice, then the new point
set forms again a primitive cubic lattice, however, with a lattice constant
equal to half of that of the original lattice. It may be transformed back to
the original primitive cubic lattice by means of a continuous and symmetry
preserving transformation, thus it does not represent a cubic lattice of a
new Bravais type. Altogether one finds three different cubic Bravais lattices,
the primitive (p), the body-centered (bc), and the face-centered (fc) ones.
Analogous considerations have to be made for the other 6 primitive Iattices.
11
Chic F
Cuhic 1
Cubic P
Tetragonst I
pj$,
.# .\
Monocfinic
Monoclinic 1
Trigonal
Triclinic
12
In this w q ,one finds that, in totul, 14 diEerriit Bravais lattices are possible
in the 7 crystal systems. The assignment ol Iht.sr lattices to the crystal
systems is indicated in Table 1.1. The Bravais lattices themsehps are shown
in Figure 1.2.
1.2.2
The lattice of a crystal serves as a conceptional basis for the illustration of its
translation symmetry. Lattice points do not necessarily have to be occupied
by atoms, which may be bum1 at general points of the primitive unit cells.
Generally, a primitive unit cell contains several atoms which may be either
chemically identical or different. We denote them here by an atom index 1
which t,akes the values 1 - 1 , 2 , . . . , L , where L is the total number of atoms
in a primitive unit cell. For L 2 2 the set of all L atoms is called the basis
uf the crystal. In the case I. = 1 one says t.hat the crystal has no basis.
Ihe position of the 1-th at,om, relative to the corner R of a primitive unit
cell, is drscribed by a vrcbor
The position Ri of this atom relative t o the
origin is then given by
Ri=Rti.
(1.5)
Without loss of generality one may always set onr of the vectors il, e.g. i l .
equal to zero. If the primitive unit cell canlains only 1 atom, it may be
placed in o m o f the roriiers of the cell. A crystal without basis may thiis be
describd as a laltice whosp points are dl occupied by ahoms. For L 2 2,
the crystal may be generated in such a way that one multiples its crystal
lattice L-fold, then shifts the resulting sublattices relative to t,he first by,
respcctivcly, the vectors 6 ,
. . . ,?I,, and finally occupies the points of the
shift,ed lattices, respectively, with atoms of t,he species 2,. . . , L.
With only one atom per primitive unit cell! any point symmetry operation of the lattice will ncccssatily transform the whole crystal into itsdf. For
cryst,alswith basis, however, this is not true in general. For this reason it is
meaningful to consider, besides the point symmetry operations of the crystal lattice, also orthogonal transformations which map physically equivalent
direchns of Ihe crystal into each other, without necessarily bringing the
crystal back onto itself. We explain the meaning of physical equivalence by
using the example of the relation between the vectors of the electric current
density j and the electric field strength E in a crystal. Generally, j is a
non-lincar fimrtioii of E and, because of crystal snisotropy, the direct.ions
of the two vectors may be different. If E and j are transformed from their
original directions reht.ive to the crystal into new ones, without changing
their relative orientation, the relation betwccn the new j and E, in general,
will differ from the relation before rotat,ion. Analogous statements hold for
reflections and rotation-reflections. If tbcre are orthogonal transformations
13
Crystal
system
tnclinic
Holohedral
Braiais
Crystal
Number of space
group
lattk
class
groups
r
c
1
C,
monoclinic
! I
1
'2h
rhombic
trigonal
tetragonal
hexagonal
cubic
I
)
p
bc
fc
14
which leave the relation between the two vectors unchanged - which ultimately must be verified experimentally - the original and the transformed
directions are called physically equivalent. The group of orthogonal transformations which map physically equivalent directions of a crystal into each
other, is called the point group of equivalent directions. It defines the crystal
class. Each crystal class corresponds to a particular point group of equivalent
directions. In crystals without basis the point group of equivalent directions
coincides with the holohedral point group. For crystals with a basis this is
no longer true in general, and the point groups of equivalent directions are
generally subgroups of the holohedral groups. There are as many different
point groups of equivalent directions, or crystal classes, as there are different
subgroups of the 7 holohedral groups. Using Appendix A one can easily
show that their number amounts to 32 (see Table 1.1).
Not all crystal classes can be realized in all crystal systems - the point
group which defines the crystal class has to be a subgroup of the holohedral
group of the crystal lattice. Each crystal class is, however, found in at
least one crystal system, several in more than one. In assigning the crystal
classes to the different crystal systems one proceeds as follows: A class which
exists in several systems is attributed to the one with the lowest common
symmetry. In this way one obtains the assignment between crystal systems
and crystal classes shown in Table 1.1.
1.2.3
15
+.
16
(zincblende st ruct ure) , F m3m (rocksalt st ruct ure) , P63m c (wurt zite st ructure), and P3121 or P3221 (selenium structure). Each of these crystal structures is more or less closely tied to a particular material group. We consider
this connection in more detail in the next section, which deals with the
chemical nature of semiconductors.
The five crystal structures above belong to just two different crystal systems - t h e diamond, zincblende and rocksalt structures belong to the cubic
system and the wurtzite and selenium structures belong to the hexagonal
system. We start with the three cubic structures. Table 1.2 summarizes
their properties
1.2.4
Crystals having diamond, zincblende and rocksalt structures not only belong
to the same crystal system, but also have the same Bravais lattice, namely
the face centered cubic, abbreviated as fcc. The fcc lattice is commonly
described in a Cartesian coordinate system whose unit vectors e,, ey,e, are
taken in the directions of the cubic crystal axes. The lattice constant a is
the distance between the lattice points of a primitive cubic reference lattice,
which is obtained from the fcc lattices by omitting the face centers. The
primitive lattice vectors al,a2,* of the fcc lattice may be chosen in the
form
The lattice constants for a series of semiconductors with fcc lattices are listed
in Table 1.3. We stress that the cubic lattice constant is neither the distance
between two lattice points (it is u / f i for the fcc lattice), nor the distance
between two atoms in a crystal with this structure (which is &/4).
For
the volume Ro of the primitive unit cell one obtains the value u3/4 from
equation (1.3). In the case of silicon this yields Ro = 4.00 x
cm3. A
silicon crystal of volume 1 cm3, therefore, contains 2.5 x
primitive unit
cells.
The lattice does not determine the positions of the atoms. This is done
by the basis of the crystal, in which respect the three cubic semiconductor
structures differ.
17
Table 1.2: Characteristicz of the five most comnion structure types of seniiconduc
tors. 0 < C < h, 0 < n < $. More details are given in the text.
Diamond
Zincblende
Itocksalt
Seleniuni
Wiirtzite
F#m
P3121
P3221
fy
,3;
4 -
43m
Gmm
32
Bravais
cubic
cubic
cubic
hexagonal
hexagonal
lattice
fc
fc
fc
Primitive
lattice
vectors
Basis
18
1:
Diamond structure
C
Si
Ge
5.43
5.65
Zincblende structure
a-Sn
6.46
AlP
RlAs
AlSb
GaN
Gal
Cans
CaSb
Id
4.37
5.47
5.66
6.14
4.54
5.45
5.65
6.13
5.87
InAs
InSb
ZnS
ZnSe
ZnTc
CdTe
HgSe
IlgTe
6.05
6.47
5.43
5.66
6.08
6.42
6.08
6.37
Rocksalt structure
PbS
PbSe
PbTe
SnTc
CdO
MgS
MgSe
5.93
6.12
6.45
6.3
4.70
5.20
5.46
Wurtzite structure
1:
ZnS
CdS
CdSe
MgTe
AlN
GaN
InN
3.82
4.14
4.31
4.53
3.11
3.19
3.54
1.64
1.62
1.63
1.62
1.60
1.62
1.61
cla
Selenium structure
Se
Te
4.36
4.47
1.14
1.33
cl a
19
Figure 1.3; Spatial model of a crystal having diamond (a) and zincblende structure
(b). In the diamond structure all atoms arc of the same chemical clement, in the
zincblendc structure the atoms are of two different chemical elements.
0 1A
02 8
3A
0 4B
(1111
6
Cmll
a IA
r, 2 0
3A
40
* 5A
60
(J
cfm
20
Figuw 1.5: Stereograms of the paint groups Oh (left) and 7'6 (right) of, respectively,
the diamond and zinchlmde structiires.
rotation, and the planrs through opposite cube dges are mirror planes. Both
structures are theiefore tramformd into themselves by the point group T d ,
the symmetry group of a tetrahcdron. It follows that their point groups of
equivalent directions are at least T d . For the zincblende structure there are
In the raw of diamond structure, thew i s yet another symmptry operation, namely that which transforms the two chemically equivalent atoms
of the basis into cach other. It may be described as reflection in a plane
perpendicular to a cubic axis, say e,, which cuts the connecting line bel w r n two atoms at its center, followrd hy a translation in that plane by
(e/4)(ex ey). One, therefore, has a glidereflection as additional symmetry
dement. The space group of diamond structure becomw non-symmorphir
in this way The point group of directions of this structure may be obtained
from the point group T d of the zincblende structure by adding the reflections
in planes perpmdicular to the thrw cubic axes. This yields the full cubic
group Oh, as one can easily determine by means of the stereograms of the
Iwo groups rn Figure 1.5 (for a n introduction to the stereograms of point
groups, see Appendix A). The group o h can be generated from t h e tetrahedral group in yet anothcr way, namely by adding the inversion operation.
As an element of the space group, inversion must be relative to the site of an
atom and be followed by a tradation through the vector T2 or -?zr depaiding on whether m c considers a 1- or 2-atom. For future reference we bear
in mind that for the diamond and zincblende structures, inversion always
involves an exchange of the two sublattims.
21
tal structures, the so-called coordmatatm Consider atom 1 of thc Lesis of the
unit cell at R = 0. The basis atom 2 of this cell is one of its nearebt neighbors. If all rotation-reflections are executed with respect to the axis through
atom I, then the basis atom 2 of the primitive unit cell at R = 0 gives
rise t o three more basis atoms of type 2. They are all at the same distance
from atom 1 as thP 2-atom first considered, but naturally they lie in three
other primitive unit cells. This means that each 1-atom has, altogether,
four nearest neighboring 2 atoms. Since it would also have been possible
to start from a 2-atom in this consideration, the mme also holds with respect to a %atom, and, of course, independently of whether t h 2-atom is
of the same chemical nature as the 1-atom (diamond structure) or is not
(zincblende structure). The four nearest neighbor atoms lie at the corners of
a tetrahedron, whose center is occupied by the atom itself (see Figure 1.3).
Their relative positions are given by the vectors (a/4)(1, 1, l),(e/4)(T, 1,l),
(a/4) (1,7,l), (a/4)(1,1,i). Thr distanrc betwrrn nearest neighbor atoms
is &a/4. For silicon this is 2+35 A . The second nearest npighbors belong
to the same sublattice. Therefore, for both structures, they arc atoms of
thP same chemical species. They are located at thc nearest-neighbor lattice
points of the fcc lattice. relative to the central atom, thus their positions are
( u / 2 } ( * e 9 f ef), ( a / 2 ) ( f e y f e z ) ,(a/2)(*e,
f ey). This means that
thew are 12 second nearest neighbors in each slructurr.
Rocksalt structure
The basis of this crystal structure consists, like that of zincblende structure,
of two atoms of diffprent chemical nature which arc displaced relative to each
other in the direction of the space diagonal of the reference cube. In contrast
to zincblende structure, the displawmcnl i s not, however, a quarter but half
of the length o the space diagonal of the reference cube, whence
22
(1111
1A
1A
01B
4 2A
0 28
028
0
3A
48
5A
6B
Figure 1.7: Projection of a crystal having rocksalt structure in the (100) plane
(left) and the (111) plane (right). The vertical sequences of atomic layers is shown
on the right hand side of the projections. In the (LOO) case the stacking repeata
itself after 4 Iayem, and in the (111) case after 6 layers.
Accordingly, a crystal having rocksalt structure can be described as a composite of two fcc lattices, displaced with respect to ~ a c hother by the vector
?'z, and with their lattice points occupied by chemically different atoms.
This structure is illustrated in Figures 1.6 and 1.7. As one can see horn
these drawings, the crystal has the full symmetry of the primitive reference
cube. The point group of equivalent directions is therefore the full cubic
group o h (see Figure 1.5). The space group is symmorphic and is denoted
by FmSm.
In Ihe direction of a cubic axis, the atoms are separated by a distance of
a / 2 , with 1- and 2-atoms alternating. Since shorter distances do not occur,
these atoms are ncarest neighbors. Each atom, therefore, has 6 nearest
neighbors. The second nearest neighbors are the nearest neighboring atoms
of the same fcc sublatticp. Their number amounts to 12, as in the zincblende
structure. The relative positions are also the same as in this structure.
1.2.5
The primitive lattice vectors of the hexagonal lattice may be written in the
form
h
ai y- ae,,
a h
a2 - --ex
a
+ -&e,",
2
ce,.
Here ek, I$, e," are unit vectors of a cubic coordinate system whose z-axis
points in the direction of the c-axis of the hexagonal lattice and whose z-axis
is identifird arbitrarily with one of the three symmetric lattice directions in
the plane perpendicular to the c-axis. The two primitive lattice vectors a1
23
and a2 in this plane have equal lengths and form an angle of 2x13 with
each other. The volume 5-20 of a primitive unit cell is &a2c/2. The lattice
constants a and c of some semiconductors with a hexagonal lattice are listed
in Table 1.3.
Wurtzite structure
The basis of crystals with wurtzite structure consists of 4 atoms. We denote
them by M I , X I , Mz and X 2 . Each pair of these atoms, namely M I , M2,
on the one hand, and XI, X 2 ) on the other hand, are chemically equivalent,
while the two pairs are of different elements M and X. If atom M I is sited
at a lattice point, then the position vectors of the three others are
fx1= cc e," ,
(1.10)
Here, ( is a parameter which may take any value between 0 and 1/2. A
wurtzite type crystal can be understood as a composite superposition of
four interpenetrating hexagonal lattices, with the lattice points of two of
them occupied by atoms of chemical species M, and two of them by atoms
of species X. The two M-lattices are displaced relative to each other by
F M ~ ,likewise as the two X-lattices. The displacement of the XI-lattice
with respect to the MI-lattice is Fx1. In Figures 1.8 and 1.9 these relations
are illustrated in, respectively, spatial and planar displays. The M-atoms
are shown in black and the X-atoms in white. From these figures one can
easily determine the symmetry of the wurtzite structure. The c-axis through
the center of one of the empty hexagons in Figure 1.9 forms a 6-fold screw
axis 63: a 2n/6-rotation about this axis must be connected with a parallel
displacement by c/2 in order to transform the crystal into itself. This also
mean8 that this axis represents a %fold proper axis of rotation. The axis
contains six inequivalent mirror planes, among them three proper reflection
planes and three glide-reflection planes joined with translations by c/2 in
the direction of the c-axis. The corresponding point group of directions is
c 6 v (see Figure 1.10). The space group of the wurtzite structure is nonsymmorphic and is denoted by P6amc.
Consider the atom M I in the unit cell at R - 0. Its neighbor in
the same unit cell is atom X 1 at the site ?XI. Another neighbor of M I is
Since
the atom Xz at the site -ce,h I Fxz of the unit cell at R - -c$.
the c-axis through atom M i is a %fold symmetry axis (sep Figure t.9), one
obtains from Xz two additional X-atoms at the same distance but in other
unit cells. Altogether, there are four neighboring X-atoms. In order that
all four be at the same distance from M I , and, correspondingly, all four be
24
0 1 A
028
3A
0
4B
c6~of
the wurtzite
c = , 1t 9 ( ;1)
(1.11)
25
<
<
(z) fi=
=
1.633.
(1.12)
Table 1.2 shows that all wurtzite type semiconductors listed there also satisfy
this condition quite well. This indicates that the ordering of the nearest
neighbors in the wurtzite structure is almost the same as in the zincblende
structure. If a wurtzite crystal exactly obeys conditions (1.11) and (1.12),
one says that it has an ideal w u r t z i t e structure. The distance between two
nearest neighbor atoms in the zincblende structure is &all4 with a the
cubic lattice constant. The corresponding distance in the wurtzite structure
is Cc = (3/8)@
a. For the two distances to be equal, the equation
a= h a
(1.13)
must hold. This result means that in a crystal having wurtzite structure
which fuEUs conditions (1.11) and (1.12), the nearest neighbor atoms are
positioned at exactly the same sites as in a crystal having zincblende structure with lattice constant 4 a. If one multiplies the lattice constants a,
of thc wurtzitc type crystals in Table 1.2 by 4,then one in fact obtains
values which fit well witahthe cubic lat,t,iccconstants of the zincblendc structure crystals of this table. The volume &a2c/2 of a primitive unit cell oi
the ideal wurtzitc structure is d 3 / 2 , which is twice that of thc zincblende
reference structure given by 8 1 4 .
Of course, this does not at all mean that the wurtzite structurc may
be traced back to lhe zincblende slruclure, Indeed, the actual positions of
the second nearest neighbors differ in the two crystal structures, although
they are all at the same distances a = a/& This may be seen most easily
from the projections of the two crystals on the plane perpendicular to the
c-axis (see Figure 1.4) or to the space diagonals e, -t e$ - t e, (see Figure
1.9). In the wurtzite structure the vertical stacking repeats itself after two
double layers of nearest neighbor atoms, but in the zincblende structure only
after three. The wurtzik structure can be set up, therefore, by stacking one
upon another two different double layers A and B, the zincblende structiire
by stacking three double layers A, B,C (see Figure 1.11). The uppermost
double layer A of the zincbbnde structure in Figure 1.llb coincides with the
26
* A
- A
-B - B
-A - C
. B
-A
- B
-C
:
27
~3 = -2craez
2c
+3 e,h,
(1.14)
where a is a parameter ranging between 0 and 112. The atoms are ordered
on parallel spirals, in which successive atomic positions are rotated with
respect to each other by an angle of 120' (see Figure 1.12). The space
group can be determined from the projection of the crystal on the plane
perpendicular to the c-axis (see Figure 1.13). It contains one %fold screw
axis parallel to the c-axis which goes through a center of one of the empty
triangles in Figure 1.13. A rotation by 120 or 240' must be followed by
a displacement by c/3 or 2c/3 in the direction of the c-axis, to transform
the crystal into itself. Beside these screw operations, rotations are allowed
with respect to the three inequivalent 2-fold axes perpendicular to the caxis. These rotation axes pass through the %fold screw axis and through
one of the three surrounding atoms. The symmetry operations mentioned
correspond to the space group P312. Beside this, the selenium structure
may also have symmetry corresponding to the space group P322, where the
spiral differing from P312 has not a right-handed thread, but a left-handed
one. The point group of directions is, in both cases, D3 (see Figure 1.14).
Concluding this discussion of the characterization of the most common
semiconductor crystal structures, it should be emphasized that these struc-
28
(00011
tures apply to the ideal case of infinite, structurally perfect, and chemically
absolutely pure materials. Real crystals deviate from this ideal case in varying degrees. There are structural defects such as stacking faults, step- or
screw-dislocations, vacancies, or atoms on interstitial crystal sites. Such defects will be considered in detail in Chapter 3, in the context of the electronic
structure of yprturbrrl semiconductors. In the next section we will discuss
the chemical composition of idea1 semiconductor crystals.
1.3
According to their chemical nature, most semiconductors are inorganic materials. Examples of organic semiconductors indude anthracene, naphthaline
and polyacetylcne. In thiv book, .we restrict our considerations to inorganic
semiconductors, because these are much bettcr understood and haw much
greater twhnolagical iniporttznce than semiconductors of the organic type.
Nevertheless, organic semiconductors are also of interest for science and tcchnology.
The best way to get an overview of the different &sues of inorganic
semiconducting materials is to examine thP periodic table of elements. In
'l'able 1.4. a part of the periodic table is shown with which many elemental
and compound semiconductors are associated. W e start with the clernenld
semiconductors of group IV.
Mg
A1
Si
Zn
Ga
Ge
As
Se
Cd
In
Sn
Sb
're
Hg
T1
Pb
Bi
29
Table 1.4: Part of the periodic table t o which Inany element.al and compound
semiconductors are associated.
1.3.1
Semiconductors of this group are, starting at the top of column IV? carbon
(C) in the form as diamond, silicon (Si), germanium (Ge), and tin (Sn) in
its gray modification known as a-Sn. All cited semiconductors of this group
crystallize into the structure of diamond. They differ from each other in
regard to their position Let,weezl metals and insulators. Diamond behaves
much like an insulator! and tin, on the contrary, much like a metal. Retween
them one has the two typical semicondnctors silicon and germanium. Both
mat.&& play an essential role in micro- and optoelectronics. It is weUknown that silicon is the material of preference in microelectronics and,
with this, one of the most important materials in dl modern technology.
Beside the pure elemental semiconductors Si a i d Ge, alloys of bath materials also have semiconducting properties. The notation for such alloys is
(Si,Ge), or Sil-,Ge, to indicate the rpspcctive mole fractions 1 - z and z
of the two alloy components. According to their structure, the alloys may
be identified as mixed crystals. These are crystals with the same geomcbric order of atom sites as in the case of the alloy components here that
of diamond in both CBSW. The regular crystal sites are occupied, however.
randamly by Si or Ge atoms. T'he probability of finding a particular el*
ment on a given site is determined hy the mole frclctiou of this element. in
t,he alloy. In the case of Sil-,Ce,
the probability of finding Si is 1 - 2.
and of finding Ge is x. The lattice constants of inixcd cryst.als, in many
~
30
cases, follow from the lattice constants of the constituents by means of Enear interpolation. This result is known as Vegards rule. The cubic lattice
constant a(si,ce)of a Sil-,Ge, alloy, for example, is given by the relation
~ ( s i=
, ~(1)- z)asi Z U G ~ ,where asi and a& are the cubic lattice constants of Si and Ge, respectively. A mixed crystal strongly deviates from
an ideal crystal, but the semiconducting property of the two pure crystals
is often retained in the mixed crystal. This is important, and by no means
obvious. It was recognized at the beginning of the sixties in (Si, Ge) alloys.
Later it was also shown to be true in many other material combinations.
The discovery of semiconducting alloys significantly expanded the family of
semiconductor materials. It became possible to tailor materials with desired
properties, just by varying the alloy elements and mole fractions.
The common feature of the elements of the main group IV of the periodic table is that there are four electrons in the outer shell of their electron
clouds, the so-called valence shell. The primitive unit cell, which here has
two atoms, therefore contains 8 electrons. Later we will prove that semiconducting properties are related to the crystal structure and the number of
valence electrons per primitive unit cell. Using this result, one may conclude
that compounds of elements from main groups III and V, and from main
groups 11 and VI of the periodic table should also be semiconductors. provided they have zincblende structure, since the crystals of these compounds
also have 8 electrons per primitive unit cell. Actually the GI-V compounds
(with a few exceptions) do crystallize into the zincblende structure, In the
case of ILVI-compounds the wurtzite structure can also occur, besides the
zincblende structure. The two structures are, however, very similar, as has
been pointed out above, so that semiconducting behavior may be expected
also in their case.
1.3.2
111-V semiconductors
31
is a promising material for blue LEDs and laser diodes. The quaternary alloy
system (Ga, In)(As, P ) is used in making laserx and pholodiodes for optical
fiber communications at 1.55 p m wavelengths, which provides maximum
transmission for SiOa-based fibers. GaAs and, lately, also (Gtr,Al)As are
employed for transistors with extremely short time delay. BN and GaN can
be used for electronic devices to be operated at high temperatures.
1.3.4
32
1.3.6
33
34
Table 1.5: Electrical conductivity 0 and charge carrier concentrations for metals,
serniconduct.ors,and insulators.
Material class
Metal
Semiconductor
lo3..
< 10-10
1021 . . . 1o1O
n jcm-3
1.4.1
Insulator
Electrical conductivity
j=uE
(1.15)
(1.16)
The mobility p is defined as the ratio of the magnitude of the average velocit,y of an electron to the strength of the electric field driving it (for a
derivation see Chapter 5). Formula (1.16) for the Conductivity o shows that
its variations may, in principle, be caused either by different values of the
mobility p and/or by different electron concentrations n.
The mobility is actually determined by the scattering processes which
electrons undergo due to the perturbations of the crystal lattice. One ex-
35
(1.17)
1.4.2
(1.18)
Here, rrg is a factor which depends only weakly on temperature, k is the
Boltzmann constant and Eg is an energy of about 0.7 eV in the case of germanium and 1.1el in the case of silicon. Formula (1.18) also describes the
temperature dependence of the conductivity for other pure semiconductor
materials fairly well at sufficiently high temperatures, save that the values
for cro and E, must be changed. In Table 1.6 the &.-values are listed for
some import ant semiconductors.
There is a second way of varying the conductivity of a semiconductor
over a wide range, namely to intentionally pollute it with atoms of certain
chemical elements. One calls this process doping wzth impup.ity atoms. In
this context, the doped semiconductor materials are called extrinsic semiconductors. As an example we choose silicon, doped with arsenic a t o m
36
Epzianov, 1979.)
Table 1.6: E,-values for several semiconductor materials at 300 K (in eV).(Aftar
Lundoldt-Bumstein, 1982.)
C:
Si
Ge
,O-SiC
BN
AlN
AIP
AlAs
AlSb
5.5
1.11
0.66
2.2
6.2
6.28
2.45
3.14
1.63
Material
GaN
GaP
GaAs
(Mh
InN
IriP
Iiub
IrlSb
ZnS
E,
3.30
2.27
1.43
0.71
1.95
1.26
0.36
0.18
3.56
Material
ZnSe
CdS
CdTe
HgTe
PbS
PbSe
PbTe
Se
Te
E,
2.67
2.50
1.43
0.00
0.37 0.26
0.30
1.8
0.33
Material
E,
37
The origin of the difference in temperature characteristics of the conductivity of pure and d o p d semiconductors remains to be addressed. In
this regard, it will be helpful to discuss yet another rnacrosropic property of
semiconductors, namely optical absorption.
38
100 -'
'
10-2~
1 2
1.4.3
'
'
m m
102040
hw/eV
39
Valence bond
O0
X---
HL
0
1
f [El--)
Figure 1.20:
E,P.
Fermi distribution
function f ( E ) as a function of energy
at different temperat#ur&.The energy
unit is E,, and EF has been set at
40
L
X-
L
X
Here E F is the so-calld Fwmz mevyy. Like temperature, the Fermi energy is an intensive thermodynamic state variable, namely the free enthalpy
per particle or the chemical potential. A more detailed treatment of the
Fermi distribution function will he given in Chapter 4. The probability of
orcupation of a particular energy value E: depends decisively on the relative
position of E with respect to the Fermi energy. If E < E F holds, and in
addition J E- E F J>> kT,then f ( R ) essentially has the value 1. If, on the
contrary, E > E F and again IE - EFI >> k T , then f ( E ) is approximately
zero. The shape of f(E)is shown schematically in Figure 1.20. The width
of the e n e r a region where the transition horn 1 to 0 takes place is cf the
order of magnitude kT. The lower the temperature, the more abrupt this
transition becomes. At T = 0 K thc transition is steplike. Qccesionally, one
says that f(E)has the form of an iceblock at T
0 K , which melts at
higher temperatures. 111 Figure 1.20 we have also implied that Pi, lies below
E F and E, above E F , in order for the valence bend to be almost completely
occupied and the conduction band to be almost empty. This means that
41
L
X--L
the Fermi energy must be located within the energy gap between Ev and
E,, sufficiently far away from the two band edges (measured in units of kT).
In this. we have described an essential microscopic property of pure semiconductors. Their Fermi level lies in the energy gap between the valence
and conduction bands. The electrons are distributed over the two bands as
shown in Figure 1.21. This knowledge provides a complete explanation of
the absorption spectrum in Figure 1.18. The explanation is illustrated in
Figure 1.22 and needs no further comment.
The differences in the properties of semiconductors and metals may be
traced back, in essence, to the different positions of the Fermi levels in these
two different types of materials. In metals the Fermi level lies within the
conduction band. Insulators do not differ from semiconductors qualitatively
with respect to their Fermi level positions, i.e. the Fermi levels are found
in the energy gap in both cases, but the gap of insulators is typically larger
than that of semiconductors. If Eg > 3.5 e V , as a rule, one has an insulator.
42
(1.20)
with n o as a factor of dimension ~ r n -depending
~
weakly on temperature.
As we know, the electrons of intrinsic semiconductors are statistically dip
tributed over the valence and conduction bands in accordance with the Fermi
distribution function f(E). At T 0 K the valence band is fully occupied,
and the conduction band is completely empty. The electrons in the fully occupied valence band cannot contribute to carrier transport. The reason for
this is again the Pauli principlc, according t o which an electron state may be
occupied by only one electron. Since all valence states are occupied, no stste
change is possible in the valence band by redistribut,ing the electrons. This
means that the T = 0 K state will remain unchanged in 8x1 electric Eeld, and
consequently no current will flow. However, a current will arise from the relatively few clectrons which, according to thc Fermi distribution function: are
populating the conduction band at ftnite temperature. Their concentration
can be easily calculated. For a pure semiconductor Ec - EF >> 6T holds.
Thus, in the Fermi distribution function j ( E ) of ecpation (l.l!J),out3 m a y
neglect the 1 of the denominator for emrgies E in the conduction band,
i.e. energies with E > EC Iu so doing, one obtains, approximately, the
Boltzmann distribution function
7
(1.21)
The concentration of energy levels in the conduction band having energies
between E and E dE is p c ( E ) d E , where pc(E) is the so-called density
of states of the conduction band, which describes the number of states per
unit energy and unit volume. In Chapter 2 we will calculate this quantity
explicitly, here its mere existence suffices. In terms of p , ( E ) one may write
the electron concentration n of the conduction band in the form
(1.22)
43
E F = Ev
1
+ -E
2
'
(1.25)
i.e. the Ferini level lies exactly in the middle between the valence and condiistion bands. Strictly speaking, this is only true for T = 0 K , as we will
see later. The corrections at higher temperatures are, however, quite smaI1,
and they can bc ignored here. Another inaccuracy of the above consideration that. we have ignored is that, at, finite temperatures, the valence band
population also changes. so that no longer are all states of this band ocmpied. Under these circumstances t.he electrons of the valence band also make
a contribution to the current. The temperature dependence of this contribution is roughly the same as that of the conduction band. The electric
charge transport due to the electrons of a not-completely-occupied valence
band will be considered more fully later. It is connetted with a remarkable
observation of the Hall effect, which we will discuss below.
However, we will first clarify how one may understand the strong change
of conductivity with impurity concentration in extrinsic semiconductors. For
this purpose we again m e the energy band model of Figure 1.19 for an ides1
semiconductor. If arsenic impurity atoms are present in a silicon crystal,
then this model has to be altered. Wc will later prove (see Chapter 3)
that each of the arsenic impurity atoms g i v ~
rise to an energy level in the
forbidden gap, just below the conduct.ian hand edge, and that the electrons
in these levels are localized at the site of the impurity atom. For this reason,
we have marked these levels in Figure 1.23 by short line segments.
At temperature T = 0 K each of these levels is occupied by one dectron, namely the fifth valence elcctron of an arsenic atom replacing a silicon
atom having only four valence electrons (see Figure 1.23). If temperature
i s increased, these electrons are excited into the conduction band. In this
way, bound electrons which formerly could not participate in carrier transport, become freely mobile electrons (see Figure 1.24) which can contribute
to transport. If the concentration N D of arsenic atoms is not too large: and
the temperature 7' not too law, then practically all arsenic atoms are ionized
and the concentration n of free electrons is equal to the that of the arsenic
atoms, i.e. one has R = N D . This corresponds to the approximate proportionality between cr and N o observed in Figures 1.16 and 1.17. If the carrier
concentration has Ihe value NLI>which at sufficiently high temperatures T i s
actually the caw, then the conductivity becomes independent of T. The only
remaining source of a temperature dependence for u is the T-dependence of
the mobility p. It is this relatively weak temperature effect which shows up
44
Figure 1.23: Ordering of the allowed energy values of electrons in a silicon crys-
In
a,
P
W
G Eg
a
W
2
I
0
X--c
T-0
T.0
L
X-
X-
Figure 1.24; Distribution of the electrons of a silicon crystal doped with arsenic
over the allowed energy values. 011the left for T 0 K , OIL the right for T > 0 K .
7
45
*)U
1Phosphorus
doped Si
Current 4
Current
Boron
doped Si
u
Currenta)
bl
Current-
Phosphorus
doped Si
H ~ Icurrent
I
~ a i current
i
HOII current
fie'd
Hall current
Cl
d)
Figurc 1.25: nlrlstration of the Hall effect in two silicon smrples, one doped with
phosphorus, and the other with boron.
1.4.5
46
of Figure 1.25 no current can flow in this direction, and therefore there will
be a voltage UH such that the current caused by it will just compensate
the Hall current. Experimentally one finds that the Hall voltage UH has
different polarities in the two samples. This means that the corresponding
Hall currents flow in opposite directions. If we assume that the magnetic
field is directed normal to the plane of the figure and that it points into this
plane, then the Hall current flows upwards in the boron-doped sample, and
downward in the phosphorus-doped one.
The Hall effect can also be measured in metals. In this case the direction
of the Hall current is the same as in the silicon sample doped with phospho-
How can the dlfemnt behauiior of thesiicon sample dupd with born
be understood?
The explanation is quite clear if one assumes that, in contrast to metals
and the phosphorus-doped silicon sample, the current in the boron-doped
sample is not carried by negative charge carriers but by positive ones. This
is illustrated in Figure 1.25d. A charge q , which moves with velocity v in a
magnetic field B,experiences the Lorentz force
F = 4- [ v x S ] .
(1.26)
Since negative charges move to the left, and positive ones to the right, the
Lorentz force F, which depends directly on the sign of the charge, has, for
both charge signs the same direction, namely upwards. For the boron-doped
sample this means a Hall current directed upwards, but in the phosphorusdoped sample the Hall current is downwards, since in this case the charge
carriers are negative. This is just the observed behavior, which means that
the assumption of positive freely mobile charge carriers is successful in explaining the unusual sign of the Hall current in boron-doped silicon. In this
way the experimental observations of the Hall effect reveal a remarkable general property of semicondiictors: For doping with certain atoms, the current
is not carried by negative charges as one would expect considering the negative charge of electrons, but by positive ones. In other words, in addition to
the electrons as negative freely mobile charge carriers, one also has positive
ones in semiconductors under certain conditions.
This surprising observation may be understood as ~OUQWS.One may
demonstrate - as we will do later explicitly - that boron atoms whi& substitute silicon atoms in tf silicon crystal, give rise to energy levels in the
forbidden zone just above the valence band edge. This is illustrated in Figure 1.26. Electrons in such energy levels are localized at the sites of the
boron atoms. At very low temperatures these states are not occupied, and
the boron atonis are electrically neutral. At finite temperatures electrons
from the valence band are excited into the boron levels (Figure 1.27), leaving behind unoccupied states, or occupation hole3 in the valence band. We
Figure 1.26: Ordering of the allowed energy values of electrons in a silicon crystal doped with boron (schematically).
Each boron atom corresponds to 1 locali d energy level.
47
immediately recognize that these occupation holes of the valence band behave jubt like freely mobile positive charge carriers in an applied electric
field. This is illustrated in Figure 1.28. For simplicity only one hole is assumed to be in an otherwisr rompletely occupied valence band, placed at the
upper band edge. The population of the bend edge is shown in Figure 1.28
at different points of time. At the beginning, the hole is at the outermost
left position. The adjacent electrons experience a force which tends to move
them to tho left. Rut only the electron neighboring the hole on its right
hand side can follow this force, since all other electrons are blocked because
the states to their left are already occupied. The electron immediately on
the right hand side of the hole, moves into the hole and leaves behind another hole to the right of the first one. The hole has thus moved one site
further to the right in this way. 111 the next time interval this process is
repeated, and the hole again moves one site further to the right etc. Thus,
in an electric field, the hole moves like a freely mobile electron, but in the
opposite dirwtion, as if carrying a positive charge. In summary, holes in the
valence band behave like freely mobile positive charge carriers. This qualitative introduction of the concept of holes will later be elaborated by more
quantitative considerations.
As we have seen above, the sign of the Hall voltage tells one whether
the free carriers of an extrinsic semiconductor are negatively or positively
charged, Le. whether they are electrons or holes. In the first case one speaks
of a n-typr semzcondurtov (the n stands for negative), in the second of a
p-type semarondector ( p for positive). The antranszc semiconductors intro-
48
1
X-
X-
duced earlier haw neither electrons nor holes from impurity atoms. Their
mobile electrons arr generated by thermal excitation of bound electrons from
the valence band to the rondiiction band. Since each excited electron leaves
behind a hole in the valence bmd. one also has mobile holes in intrinsic semiconductors. Their number is equal to that of the mobile electrons. Holes
are also present in n-type semiconductors, but, in very small numbers compared to elwtrons. Analogously, a few electrons occur in p t y p e materials
with mob1 of the carriers being holes. One calls the many mobile carriers
of extrinsic semiconductors m a j o n l y cariwrs and the few mobile carriers
mzrrortty earners.
The Hall effect can also be used for purposes other than the determination of whethrr an extrinsic semiconductor is n- or p-type. The absolute
F=-[j
X B ] .
(1.27)
RC
This is the same force caused by the electric field E ~ I= F/(-e). The
voltage corresponding t,o EH is by dehition the Hall voltage Ufi. With b
as the width of the sample normal to j and 3,the expression below for U x
follows:
(1.28)
where
Rff = -
(1.29)
PrtC
1.4.6
All properties considered hitherto were for semiconductors in a state of thermodynamic equilibrium or in close proximity. However. semiconductors may
easily be driven into states far from equilibrium. Here, 'far' means that characteristic macroscopic propexties of the semiconductor deviate strongly from
those in equilibrium.
To be more specific about these qualitative statements, we examine the
example of photo-conduction. Consider a sample of an intrinsic semironductor, shielded against unwanted influence of light. If, in this 'dark state' the
conductivity is measured. one obtains a relatively small value. in accordance
with the relatively low carrier concentration of an intrinsic semiconductor in
thermodynamic equilibrium. However, if one irradiates the sample with light
which is absorbed by the semiconductor (see Figure 1.22). the conductivity
will rise more or less strongly, depending on the intensity of the light and
the magnitude of the absorption coefficient. This is shown schematically in
Figure 1.29. assuming realistic conditions. When the exposure of the sample
t o light ceases, its conductivity decreases to the original low dark value. Evidently, electrons in the conduction band and holes in the valence band were
created by irradiation with light to such an extent that their equilibrium values were exceeded by orders of magnitude. A simple estimate for Figure 1.29
50
1M - -~
1~-7 -
10-4
10-3
lo-7
10-1
ma
I h o t t om*
r]'
--+
shows that t.he carrier concentration was increased from about 10'' cm4
(equilibrium value) up to
c ~ n . -by
~ light of intensity 10 Wcm-'. The
electron-hole pairs created by the radiation decay, however, after a short
t,ime. 'lhis follows from the fact that continuous irradiation leads to tt stalionary conductivity value, and lienre a constant carricr concentration, and,
on the other hand, from the observat.ionthat the conducbjvity decays down
to the dark value after switching off the light source. The 1at.ter observat.ion means that, thermodynamic equilibrium i s reestablished by so-called
wxnnbinaiion, p7nuesses.
Rwidr irradiation with light, extreme nun-equilibrium states in semiconductors can also be created in other ways, for example, by putting an n-type
semiconductor in contact with a p t y p e semiconductor or with a metal, or
by applying voltage to a. semiconductor which previously had been isolated
by a thin insulating layer from one of the electrodes. The ability to c r e
at,? extreme non-equilibrium states in semiconductors i s extensively used in
electsonic devices. Almost all applicat.ionsof semiconductors in such devices
rest on this uniquc pQssibility-.
Non-equilibrium processes in semiconductors and the most important
scmiconductor devices based o n them, such as electric rect.%ers, bipolar and
unipolar transistors, tunnel diodes, photodetectmu: solar cells, as well as
luminescence and laser diodes, will be dealt with in the second part of this
book, i.e. in Chapters 5 , 6 and 7. In the first part of the book, the basic
concept.s, discussed above in a heuristic way, will be developed from first
principles. This applies to the stationary electron states of an ideal semiconductor (Chapt,er 2), their niodifications by impurity atoms and other
deviations from the ideal crystal, as well as by external fields (Chapter 3)
and t h e statistical distribution of charge carriers over aiergy levels in thermodynamic equilibrium (Chapter 4).
51
Chapter 2
2.1
52
crystsls
Table 2.1: Characterization of atomic cores and valence electrons of main group
elements from which semiconducting mat,erialsmay be formed.
Atom
Core
Nucleus
B
C
N
2922p2
2s22p3
13+
323p
132222p6
14+
323p2
15-
Ga
3s23p3
3 1+
32AS
222p
18
7f
A1
Ge
Core electrons
5+
6+
St
Valenw electrons
424p
ls22s22p633s23p63d10 3s24p2
3 3+
4Ap3
ble 2.2). Because the electrons of these deep shells are localized so close to
tlirir rrspertive nuclei. they feel potential changes produced by siirrouuding atoms as being almost uniform. Strictly speaking, this means that the
wavefunctions of these electrons are essentially unaltered. Their eneqy levels shift, however, specifically by the change of the constant potential value
arross their localization region. The term d i d d a t P shtjts is iised for t h e s ~
shifts of the inner electron levels. By measuring these shifts m e can obtain
information about the chemical nature and geometric striicturc of the environnient of an atom in B solid. In B way, the inner electrotla threby serve
as probes.
53
Table 2.2: Energy levels -E, and -4 of the valence electrons and - E , of the shallouwt core electrons for some chemical elements which may occur in semiconducting
materials. All energies are given in el/. (Afcer Herm.an. and SkiIlman, 1963.)
A1
Si
Zn
Ga
14.1
4.9
6.5
8.3
10.3
3.1
4.9
23.0 29.1
10.1
13.6
17.1
20.8
8.4
11.4
291
405
537
87.5
116
147.4
182
20.7
31.7
7.9
9.5
3.4
4.7
5.9
7.2
8.6
3.5
5.8
Atom
-tp
6.6
9.00 11.5
-t,
12.5
17.5
-ec
195
-cP
6.4
the inner shcll drrctruns jointly as subsystems of the crystal, whose intrrnal
structure i s of no further interest sirire it does nut change. The structure is,
so to speak, frozen. In this sense t.hc subsystcms composed of atomic iuiclei
and inner electrons are elementary building blocks of the crystal. One rekm
t,o them as atom.ic ~ 0 : ~ s .
Since the crystal aIso contains valence electrons as independent particles,
we arrivr at a yic:i.ure which is fundamental for the further analysis - the
picture of a cryst,al as system composed of at,omiccores and valence electrons.
Somctirnes one refers to this concept, as the frozen-core a p p ~ ~ ~ i i m In
ati~~~
Table 2.1 the division into core8 and valence electrons is indicated for some
elements from which semiconductor crystals arc made.
The frozen nature of the electron states of the cores of a crystal! and
their lack of rcsponse to external influences, generally prevails, but as always,
there arc exceptions. In heavy metals such like zinc, t.he inner &shells are,
energetically, rather close t,o the oubar valence shells (see Tables 2.1 and 2.2).
In this case the d-electrons significantly participate in chemical bonding and
can no longer be included in t.he core, which is iinchangeahle by definit,iou,
Moreowr, the inner shell eledrons of crystals can be excited by means of
electromagnetic radiation in thc far UV and X-ray region. This can also
occur by means of an electron beam In solid state nuclear reactions, ~ W R
the states of the nuclei of the crystal atoms change.
54
2.2
2.2.1
Schrodinger equation for the interacting core and valence electron system
55
/;
+
Figure 2.1: 13escription of the positiom of the atomic cores ( 0 ) and valence elm
trons {a) {left part) as well as the interactions between these particles (right part).
ciently large, they will encompass all types of motions of an infinite crystal
with desired accuracy. The concept of the periodicity region makes it possible to pass from the original infinite space problem of motion to a finite one
without thereby losing the translational symmetry of the infinite crystal.
We use Xj t o denote the center-of-mass coordinates of the j-th atomic
core, and xi for the position of the i-th electron, which is further assumed
to be point-like {see Figure 2.1)- The j-th core mass will be denoted by
Mj. Of course, there are only as many diferent values of M j as there are
chemically different types of atoms in the crystal, so most of the Mj-values
are identical. In the case of electrons we can omit the index i from their
masses since they have the common mass m. The momentum of the j-th
core is called P3,and that of the i-th electron pi, such that
We are interested in the motion of the interacting atomic cores and valence
electrons of the infinite crystal, which can only be adequately treated by
means of quantum mechanics. The state of the system is described by a
wavefunction @, which depends-on the coordinates xi of all electrons and
Xj of all atomic cores, as well as on the time t. Since we assume periodicity
of the motion with respect to a periodicity region, it suffices to consider @
as a function of the coordinates xi of the N-electrons and the coordinates
Xj of the J cores of only one periodicity region. The state of the particles
in the remaining periodicity regions can then be described by means of a
periodic continuation of this function, i.e. by means of the relation
56
@(XI I
Ga,,
x2 1 ,%:C
with CY 1 , 2 , 3 .
We will now wt up the Hamiltonian 7-l of the system of the N-electrons
and J atomic cores of a periodicity region. We use Tc and reto denote the
kinetic energies of the atomic cores and of the electrons, respectively, and
, . . . ,XN, X i , X2,. . . , XJ) to be the potential energy of
we define V ( X ~xz,
the system. The Harniltonian is the sum of the kinetic and potential energy
operators,
~
+ + V.
H =- Tc Te
(2.3)
(2) The interaction of the electrons with the atomic cores due to their mutually attractive Coulomb forces, and also due to (repulsive) forces of quantum
mechanical origin, which become effective if the valence electron wavefunctions overlap the inner electron shells of the atomic cores. The electron-core
interaction potential energy V,, depends on the locations of both the electrons and cores. With respect to the electrons, it is evidently additive, i.e.
. XJ)=
where Vc(xi, X I ,X2, . . . , XJ) is the potential energy of the i-th electron in
the field of all cores.
(3) The mutual interaction of cores, which at sufficiently large distances is
again of Coulomb type. If the distances become small, repulsive forces of
quantum mechanical origin also occur. The core-core interaction potential
energy will be denoted by V,. It depends only on the locations of the atomic
cores, i.e.
57
VCr(X1, XL
vcc
XJ).
(2.7)
Summing the three potential parts (2.5), (2.6) and (2.7), one gets the total
potential
v, + v,, + v,,
(2.8)
which determines the dynamical problem of the crystal uniquely. This problem i s described by the time-dependent Schrodinger equation
i3
iFL-e = H6,
at
whose solution may be determined in terms of the eigeiivalue problem for
the Hamilt,onittn 7f, which is given by the time-independenl Schrodinger
equation
%IJ! = EQ.
(2.10)
The normalization condition for the wavefunction with refereiicc to a pcriodicity region is
(@I*)
d3X1.
. . d 3 x J p q X 1 ,x2,. . . , XN,X I ,
xZ, . . . ,
xj,t ) l 2 = 1.
(2.11)
Attempts to solve this eigenvalue problem exactly are hopeless from the very
beginning, because it involves a macroscopic system, i.e. a system with about
10 electrons and a similar number of atomic cores, the motions of which
are mutually coupled in a rather complex way. One must therefore resort to
approximations. Such approximations must first provide the means to reduce
the gigantic number of electrons, and secondly, allow for a proper decoupling
of the electron and core motions. The second simplification is achieved by
the so-called adiabatic approximation, and the first by the one-particle
approximation. These two approximations will be elaborated below. We
begin with the adiabatic approximation, and in the course of the discussion
it will also become clear how the somewhat unexpected designation of the
latter arises.
2.2.2
The adiabatic approximation (also known as the Born-Oppenheimer approximation) is based on the fact that the mass of the atomic cores is many tens of
thousands of times larger than that of the electrons - in Si, e.g., 52 thousand
times, and in mercury 368 thousand times. In addition, it takes advantage of
the fact that in a crystal the kinetic energy of an atomic core is, on average,
58
smaller than that of a valence electron. 'l'his can be seen in the following
way: If the cores and valence electrons were fwe particles, i.e. if they did not
interact, then the average kinetic e n p r o of a core would be approximately
(3/2) h7'. That of a valencp dzctron would be about (3/6) Eli where E F
is defined as the Fwmi energy of an electron gas of the same density. The
daerence between the average kinetic energies of the two types of particles
arises from the fact that the electrons obey Fermi statistics, whereas atomic
cores obey Boltzmann statistics. For typical concentrations of valence electrons in a crystal of about 10" C W L - ~ the Frrmi e n e r a EF i~of the order
of m~gnit~ude
of e V , while kT reaches only about 0.1 eV below the melt
mg point of the crystal. The average kinetic energy of a core is therefore
generally smdlcr than that of an electron. This remains true when the interactions between the electrons and coresi which were omitted above, are
taken into account. Writing (M,/2)
< X: > and (m/2) < x: > as the
avrragP kinetic enmgies of a core and an electron in a crystal, we thus have
Mj
-2< X j >
1 7 1 . '
< -<xq>.
2
(2.12)
x = (XI,xz,. , ,
XN),
x = (XI,xz,.
I
XJ).
(2.14)
To take advantage of the slow motion of the cores we write the solution
q{x,X) of the Schrodinger equation (2.10) for the total crystal. in the form
of a product
q x ,X ) = $(x, X )
'
@(XI.
(2.15)
59
(2.17)
are assumed. An analogous statement holds for the periodicity condition
(2.2) of @(x,X). To assure overall periodicity with respect to a periodicity
and 4(X)separately. An Ansatz of the
region we assume it for +(x,X)
form (2.15) is always possible since it does not assert separability of the
variables x and X,but merely splits off a factor +(X)from the wavefunction
@(x,X)which depends only on one variable, X, while retaining the full
dependenre on both Coordinates in thr second factor +(x,X). The Ansatz
(2.15) becomes non-trivial if we proceed as follows: Firstly, we assume that
$(xiX)is the solution of a Schrodinger equation for electrons,
1%
+ v,,,c]
$(x,X)= U(X)@(X,
X),
(2.18)
and U(X)is in the nature of an electron energy eigenvaliie. Secondly, we demand that the split-off factor, 4(X), of the total wavefunction (2.15) satisfy
a Schrodinger equation in which the coordinates of electrons do not appear.
It turns out that such an equation cannot be derived rigorously, but only in
a special approximation - the adiabatic approximation which was mentioned
above. Yet without any approximation we have
[Te+ Tc + K , ,
(2.20)
The set {$4) of the eigenfunctions of the Schriidinger equation (2.20) forms
an orthonormalized basis set in the Hilbert space of the crystal. Therefore,
relation (2.20) is satisfied if it holds for the Fourier type coefficients relative
to all basis functions Qd of this set, i.e., if the identity
($4lTe
(2.21)
60
The first two terms on the right-hand side of this q u a t i o n turn out to be
small compared to thr kinetic cuergy term of elwtrons in equation c2.20).
One has the order of magnitude relations
(2.23)
(2.24)
Here .n/E is a typird value of the core masses M,. The two equations (2.23)
and (2.24) are of fundamental importance in crystal dynamics, because they
are ultimately responsible for the drroupling of rloctron dynamics from the
dynamics of the cores. Therefore we present the proof of thwc equations
in Appendix B. Here we proceed a n the assumption that these relations are
proven.
The terms of rplativeorders of magnitude (rn/M)1/2 2 lov2 and ( m / h f )w
l W 4 , will be nPglatrd henceforth. With this the operator T, for the rambined kinetic energies of all cores satisfies the approximate relation
Tc[.l$(x,X)9(X)l E=r ${x, X j W G q .
(2.25)
This means that T, effectively does not act on V(X, X).In view of this relation we reconsider the SrhriidingFr equation (2.20) for the crystal, replaring
the terms which still depend on x by means of the electron Schrodinger equation (2.18) in terms involving the electron energy eigenvalue U ( X ) . Finally,
forming the scalar product with VI, we obtain the relation
(Tc
+ VC,(X)+ U ( X ) \ d ( X )= Ern(X).
(2.26)
This rcpresents the SchrGdinger equation for the atomic cores in which the
coordinates x of thc electrons do not appear. The state of the elcctron
system, however, enters this equation, namely via its energy U(X)which
plays the role of a potential (referred to as adiabatze potential).
In summary, we h a w reached the following description of the total crystal, viewed as an interact,ing system of atomic cores and electrqns:
The subsystemof electrons is describcd by R separate wavefunction qi(x, X),
which obeys a Schrijdinger equahion in which the coordinates of the cores
enter only as parameters in the potential, but do not occur as differential
operators in t,hr kinetic energy. In this way, the motion 01 the electrons is
treat,& as if the cores were at. red. Core motion, which does, of course,
occur despite its neglect with respect to electron motion, is described by
the wavefunction &(XIand the Schrodinger equation (2.26). T h e potential
of this q u a t i o n contains, besides the core-core interaction energy, a second
contribution IJ(X). This originsles in the interaction of the electrons with
61
the cores, for without such an interaction the eigenvalue U in the electron
Schrodinger equation (2.18) would be a constant independent of X, which
could be omitted. The potential contribution U(X) caused by the electroncore interaction does not depend, however, on the electron coordinates. It
is an average value over all their positions X. The weight with which the
various positions x enter this average over the probability I + ( x ,X)1 d 3 N ~
of finding the electron system in a volume element d 3 N ~at the position x,
since equation (2.18) implies that
u(X) = (+(X)ITe
+ vee,ec(X)l+(X)).
(2.27)
One can alternatively express this as follows: The electrons move so fast
that they are no longer seen by the cores as point-like particles, but as
smeared out over all space. Equation (2.26) for 4(X) thus contains the same
assumption as equation (2.18) for + ( x ,X),namely that the cores move much
slower than the electrons. In so far as this feature is seen from the point
of view of the electrons, equation (2.18) follows, whereas from the point of
view of the cores one obtains equation (2.26). Since the relation between the
velocities of the cores and the electrons, according to (2.13), is determined
by the inverse ratio of their masses, it is clear why this ratio must be small
for the two Schrodinger equations (2.18) and (2.26) to hold jointly in an
approximate sense.
It remains yet to clarlfy what effects are neglected because of the above
approximation and why this approximation is called adiabatic. In quantum mechanics, one understands adiabatic temporal changes of potentials in
the sense that the changes proceed so slowly that no quantum mechanical
transitions will occur between the discrete quantum states of the potential,
which themselves evolve slowly from the initial onset of time variation. The
state of the system thus conforms continuously to the evolving new potential
values as a function of time, without any transitions to other states. That
exactly this situation is described by equation (2.18) and (2.26), may be
seen from (2.24). If one considers the previously neglected term in (2.24) of
relative order of magnitude ( m / M ) l I 2 ,then the total Hamiltonian 7-t of the
crystal has non-vanishing off-diagonal elements
(2.28)
and quantum mechanical transitions between the different eigenstates $14
and $+ of the crystal are recognized to occur. These transitions are caused
by the kinetic energy of the cores exclusively. If terms of the order of magnitude ( m / M ) I 1 2 are omitted, then the quantum transitions due to core
motion are also neglected. This is equivalent to the assumption that the
core motion be adiabatically slow, in the quantum mechanical understand-
62
ing of this term. The term adiabatic thus refers to the essential character
of the approximation in neglecting (rn/M)/. This approximation is useful, of course, only as long as transitions between different eigenstates $$
play no important dynamical role. This is actually the case in regard to
many crystal properties and phenomena. There are, however, also effects
for which this does not hold, notably electric current transport. The fact
that the electric conductivity of an absolutely pure crystal does not become
excessively large is due in large part to the scattering of carriers from the
oscillations of the atomic cores, i.e. to non-adiabatic quantum transitions
between different electron and core states. Also, in the recombination of
electron-hole pairs mentioned in Chapter 1, these transitions play a decisive
role, with the lattice of atomic cores absorbing the energy which is released
during recombination. Formally, one may understand non-adiabatic transitions as the result of an interaction between the electrons and the motion of
the atomic cores. Since such core motion, as we will see below, represents a
superposition of lattice oscillations, also known as phonon excitations, this
interaction is called the electTon-phonon interactaon
We have yet to explore how the two Schrodinger equations (2.18) and
(2.26) for the electrons and cores can be actually solved. The problem is
that both equations, are, at the outset, not completely determined - the one
for the cores contains the adiabatic potential U(X),which can be known
only after the equation for the electrons has been solved; and the electron
equation can be fully d e h e d , however, only if the positions X of cores in the
potential Vee,..-,-(x, X) are known. The direct way to overcome this difficulty
would be the following: One assumes a particular spatial ordering Xof the
cores and uses it to determine for them the potential Ve,ec(x,X).The latter is then used to solve the electron Schrodinger equation (2.18) (we will
not discuss here how this is accomplished, as it will be the subject of the
next subsection, 2.2.3). From the solution of the Schrodinger equation (2.18)
one obtains the value of the adiabatic potential U at the position X of the
cores. The same procedure is then applied to all other possible positions X,
whereby the adiabatic potential U(X) and the Schrodinger equation (2.26)
for the cores are completely determined. This equation can then be used to
calculate the core wavefunction t#(X). It follows that the dynamical problem
for the crystal as a whole is solved, since one would know its eigenfunctions
@(x, X)= y(x, X)q(X).
In reality, however, this procedure is unsuccessful.
One cannot solve the electron Schrodinger equation for all possible core positions. Therefore, a simplified procedure is necessary. It contains additional
approximations, but has the advantage of being feasible in practical terms.
In this approach, one ignores the motion of the atomic cores completely and
assumes that they are resting in certain equilibrium positions Xq. In reality, they execute oscillations around these positions with amplitudes that
become smaller as the temperature of the crystal decreases. However, due to
63
xo
- X"
I.ke,ec(X,Xn)
v,v,(xy
-0 ?
no
xn+1
Figure 2.2: Iterative calculation of the equilibrium positions of the atomic cores.
the quantum mechanical phenomenon of zero-point oscillations, such motion
remains finite even at absolute zero temperature. The equilibrium positions
Xq are unknown at the outset. One can determine them by demanding that
the total energy
(2.29)
--vxvo(x)~x,xeq
= 0.
(2.30)
Bearing this in mind, we may employ the iteration process below for the
solution of the two coupled adiabatic equations (see Figure 2.2): In this pro-
64
cess, one assumes ccrtain trial equilibrium positions Xo, enters them in the
This
electron Schrodinger equation, and determines the eigerivalue U(Xo).
solution is then used to determine the potential Vo(Xo) and the liellmaiiF e y m m forces. Thanks to the Fe'egrman theorem, taken jointly with appropriate analytical transformations, one can determine these forces without
numerically calculating Lhe potentid in the environment of X'. After the
first iteration cycle, t.he Hcllman-Feynman forces will, in g~neral.not yet
vanish. signifying that thc cmes arc still not at equilibrium posibians. By
nieans of the non-vanishing f o r c e one det.ermines new trial positions XI.
'The new positions are then substituted again onto the electron Schrudinger
equation (z.18)! to calculate a new eigenvalue U(X1),and the latter determines the corresponding Heban-Feynrnan forces. This procedure is to be
repeated until the forces become zero. The corresponding core positions are
then the equilibrium positions XeQ.In this way one reaches a very important result, the determination of the atomic structure of the crystal. Such
structure calculations are successfully carried out currently fw many solid
state systems, including a series of semiconductor cryst.als. With regard to
semiconductors, it can he shown, for instance, that under normal conditions,
Si haas the diamond structure, and that its Lattice constant a iR 5.49 A.
In so far
VO(X) =
vo(xeQ)i-' ( X
2
- X e ~ ) V ~ v X r ~ ' o ( X-~x)e(4x1.
(2.31)
65
where. for simplicity, Vo(Xeq) = 0 has been assumed. Equation (2.32) describes a system of coupled harmonic oscillators. The restoring forces are
determined by the second derivatives of the potential Vo(X). Using the
eigenvectors and eigenvalues of the matrix of restoring forces (actually, of the
so-called dynamical matrix which also includes the kinetic energy term), one
can easily transform to a system of uncoupled harmonic oscillators. Their
motions are called normal mode osciliataom or lattace oscillations,and their
excitation quanta are phonom.
Phonons are a good example of the introduction of a concept which is
of fundamental importance for the dynamics of many body systems, including the many-electron system of a crystal which will engage us in the next
subsection. The concept we have in mind here is identified by the terms
elementary excztatzons or quast-particles (both terms are commonly used).
This concept is based on the possibility of decomposing the motion of a
system of mutually interacting particles - in our case of the atomic cores
of a crystal - into non-interacting components of motion - phonons in ow
case. The phonons or, more generally, the elementary excitations are. so
to speak, the elements of n o t z o n of the system, while the atomic cores or,
more generally, the actual particles. form the ~ ~ T U C ~ U Telements
-Q~
of the system. ,413elementary excitation involves coordinated motion of all structural
elements of the system. Conversely, the motion of an individual structural
element is a superposition of all elementary excitations - the motion of the
atomic cores, for example, is a superposition of all normal mode oscillations
or phonons.
Besides the oneelectron and one-hole excitations, the phonons are the
most important elementary excitations, or quasi-particles, of a crystal. In
this book we will deal mainly with electronzc elementary excitations, and
will include phonons only if it is otherwise impossible to properly describe
electron dynamics. Relevant phonon information will simply be cited without detailed justification, since a thorough development of the theory of
phonons is beyond the scope of this book. O w choice of subject matter here
is conditioned by the fact that electrons and holes are much more important
for understanding the properties of semiconductors as they are used in electronic devices, than are phonons. Readers who are particularly interested in
phonons are referred to other books (see, e.g., Born and Huang, 1968; Bilz
and Kress, 1979; Bonch-Bruevich and Kalashnikov. 1982).
We return now to the Schrodinger equation (2.18) for electrons. In the
sense of Figure 2.2, we approximate the core positions X by their equilibrium
values Xeq. As far as the latter are concerned, we take the point of view
that they are known from experimental structure investigations, e.g., by
means of X-ray diffraction. For common semiconductor crystals this is in fact
true in all cases. Taking this approach, the potential b=,- in the electron
Schriidinger equation (2.18) is well-defined from the very beginning. To
66
crystals
2.2.3
(z -34)
The wavefunction ~+4(XI, xp, . . . ,XN) must be periodic and normalized with
respect to a periodicity region. The Schrodinger equation (2.34) is impossible
to solve directly since it describes an interacting system of electrons having
a tremendously large number of particles of order loz2. The goal of this
subsection is to provide an approximate description which allows one to
reduce the number of particles down to minimum number 1. This will be
done by developing a oneparticle Schrijdinger equation whose solutions are
rdated to those of the true many electron Schrodinger equation in a welldefined and sdiciently simple way.
~
Hartree approximation
j = 1,Z, 3.
(2.35)
(2.36)
67
oneparticle state pv[x), i.e. particle 1 with state pull particle 2 with state
puL,etc., up to partick N which is associated with the state. , ,oy
Alternatively, WP may say that we occupy state pw with particle 1, state p
, with
particle 2, etc. Due to the Pauli exclusion principle, each state can host only
1 particle, ignoring spin (which we do at this stage). Thus a given state
p1, p2,. . . ,pm may occur among the papdated ones lpy, 'py, . , ,pVNnot
more than once. Most of the states will not even orcur o n r ~ i.a,
,
not at all
(bear in mind that there is an i n h i t e number of them). These states remain
unoccupied. The set of quantum numbers, (q,
vz, . .. , VN),termed configzsratzon, definea the state of the N-electron system uniquely if we understand
that the h s t number in this set refers to the state of particle 1, the second
to the state of particle 2 etc.. Henceforth, we abbreviate the configuration
( V l , V a l . . . ,w )by (.I.
Thirdly. we assign to each configuration (IJ)
of the N-electron system a
wavefunction $(y)(xl,x2,. XN) which is given by the bllowing product of
oneparticle wavefunctions:
I
=~LpvJx3)-
(2.37)
Disregarding the miitual interaction of the electrons for the moment, the
product (2.37) forms an eigenstate of the N-electron system if the oneparticle wavefunctions pV,(xJ)are energy eigenstates of the individual o n e
electron subsystem Hamiltonians. This suggests t h e question whether a
similar result might be possible for interacting electrons, i.e. whether it will
be possible to choose the py,(xj) in such a way that the product statc +ivi
obeys the Schriidinger equation
H${V)(Xl*
. 1
XN)
(2.38)
for the fully interacting N-electron system - if not rigorously. then at least
in some reasonable approximation.
To address this question one may use the variational principle of quantum
mechanics. In this procedureI the oneparticle wavefunctions pV,(xj) are
determined such that the expectation value of the N-electron Hamiltonian
H becomes a minimum for N-eIecctran states of the product form (2.37).
Here we take a slightly different approach, and start from the Schrodinger
equation (2.38). This procedure has the advantage that one ran ai. once
determine whether there is a suitable approximation in which d ~ { may
~ }
be written in the product form (2.37), and also barn the nature of that
approximation.
Considering an M-electron system, we label a particular electron i, and
this index can take all values between 1 and N. The Hilbert space of the
68
+ 1,... ,N is spanned
(2.40)
Due to the orthogonality of the qv. t h e right-hand side of this equation
differs from zero only if the pvalues coincide with the v-values, i.e.. if
= q ,.., pz-l = ~ ~ - pcl,+l
1 ,
= v,+l. .... p~ = v w holds. However, the
left-hand side of equation (2.40) differs from zero if p J f v3 for one or several I # t . Thus equation (2.38) cannot hold rigorously, which means that
the eigenfunctions ${,,I of H cannot be written exactly as a product {2.37)
of oneparticle wavefunctions. This is only possible under the condition that
the non-diagonal elements of the Hamiltonian operator on the left-hand side
of (2.40) may be neglected. It is this approximation which makes possible
the reduction of the N-particle wavefunctions to products of oneparticle
wavefunctions. It is called a one-partzcle approozamatton. Strictly speaking, it is the simplest variant of a oneparticle approximation, the so-called
Hartwe appronmatzoa. .4 more accurate oneparticle approximation, called
the Hartree-Fock apprommatzoa, will be discussed below.
Within the framework of the Hartree approximation the equation system
(2.40) involves only the diagonal terms with p 3 = vj for each 1 # 2 , and
correspondingly takes the form
69
(2.42)
On the right-hand side of this equation only the first three terms depend on
the electron coordinates x1 while the last two are constants in this regard. If
one substitutes the expression (2.42) into equation (2.41),then the last two
terms can be grouped together with U{'(.}to h m the new eigenvalur
(2.45)
we rewrit,e (2.41) as
The final relation (2.46) has the form of a Schrodinger equation for the 7-th
particle where V'{(")(xp)is the potential energy of this particle and EV,is
its energy eigenvalue. Beside the potential energy ITc(&) due t o the atomic
It is caused by the
cores. V'Iv}(x,) also contains the contribution \#'}(x,).
mutual interaction of electrons, and is commonly called the Hartree p o t e n t d
In explicit form, ITH%{.I (x,)reads
(2.47)
70
Here the integration runs over a periodicity region. The Hartree putential
V~(X~)
describes the potential energy of the i-tb particle in the Coulomb
potential produced by the charge distribution --e Ck+%
i(oV,(x)l2of the remaining particles. The factor of the electron-electron interaction potential
(2.5) does not occur in expressions (2.45) and (2.47) for the Hartree potential since each electron pair contribntes only once. Obviously. the Hartree
potential and the corresponding energy cigenvalues depend OB the configuration { v } of the N-electron system, and also on the index i of the particle
which was removed.
The one-particle Schrdinger equation (2.46) derived above for the i-th
elpctron, holds for each other electron as well. only with a somewhat different
potential. This difference will now be removed, together with the dependence
of the H a r t r e potential on the configuration {v} of the N-electron system.
We argue as follows: If the number A; of electrons is macroscopically large,
as in the case of the electron system of a crystal, and if we consider only
oneparticle states which are spatially spread out more or less evenly aver
the entire crystal, there is no signifkant difference if we extend the sum over
k in equation (2.47) for the potential V;,[(x,)t o include k - i . Then the
2-dependence of the potential no longer exists. The emor thereby incurred is
of relative order of magnitude l/N. If one considers, on the othcr hand, only
states ( v } of the &--electron system which are similar to each other, one may
also neglect the {v)-dependence of the potential and replace V{}(x) by the
value for a representative configuration {vo). The question is, does such a
representative codguration exist in the case of a semiconductor, and if so.
what is it. The answer to the former question is under normal conditions,
yes. For a representative configuration in the abovementioned sense, we
have the state of the N-electron system with lowest total energy, the socalled ground state. In this state all one-particle states p with energies
Ev below a special energy value (the Fermi energy) are occupied, and the
states with energies above are empty. Under normal conditions the states
of the N-electron system which occur in semiconductors, and also in other
solids, deviate very little from the ground state, Non-normal conditions
are associated with large deviations, e.g., such as semiconductors which are
displaced to a highly excited state by intense laser irradiation. Excluding
such extreme cases, the Hartree potential Vrj}(x) for the configuration {v)
is almost the same as that for the ground state configuration {v}, and
correspondingly we have
(2.48)
71
V(X,) = V&)
t VH(Xt).
(2.50)
(2.51)
The Hamiltonian of this equation is the same for all particles and no longer
depends on the configuration of the N-particle system. Equation (2.51) is
therefore the oneparticle Schrodinger equation par excellence, devoid of any
reference to a particular particle or configuration of the N-electron system.
We may therefore omit the index i in equation (2.51). TJsing the oneparticle
Hamiltonian
ff
P'
2, I v(x),
(2.52)
H'Pdx) = Evcpv(x).
(2.53)
72
solution has a well defined and sufficiently simple connection with the solution of the N-electron problem.
The idea that the (py(x)are energy eigenstates and the E , are energies
of single electrons, underlying the above consideration, needs to be made
more precise. Because of the electron-electron interaction, the motion of a
particular electron is always tied to the motion of all others, and the energy
of an electron is also, in part, energy of interaction. The latter statement
manifests itself clearly in relation (2.43) between the one-particle energies E,
and the total energy U{.} of the N-electron system, which we will explore in
more detail. First of all, it can be further simplified. Using the one-particle
Schrodinger equation, one can re-express the terms on the right-hand side
of equation (2.43) by one-particle eigenvalues, leading to
The energy of the N-particle system is therefore not just the sum of all oneparticle energies. It is necessary to subtract the Coulomb interaction energy
of the particles. Therein is reflected the fact that the E, contain a certain
portion of interaction energy with other electrons. This is doubly counted in
E , of one-particle energies, once in summing over the particles
the sum
themselves, and once in summing over their interaction partners, which is
done in E, automatically. To correct this, one must subtract the Coulomb
interaction energy.
This shows that the (p,(x) may be interpreted as states of single electrons only in a generalized sense. In reality the (py(x)describe stationary
states of the motion of the N-electron system in which all electrons are involved. These states of motion are not mutually coupled, as in the case of
normal oscillations of a system of interacting atomic cores. Using the terminology introduced in that context one may consider the states q , ( x ) as
states of quasi-particles or elementary excitations of the N-electron system.
The E, are the corresponding quasi-particle or excitation energies. There
is, however, a qualitative difference between these elementary excitations
of the electron system and the normal oscillations of a crystal. This may
be made clear as follows. If one adds to the N-electron system (which we
will assume to be in the ground state) one more electron, i.e. if one passes
over to a ( N 1)-electron system, then the one-particle Hamiltonian (2.52)
does not change within the framework of the approximations made above.
The oneparticle wavefunctions pV of the N-electron system therefore also
approximately describe the elementary excitations of the system of ( N 1)
electrons. This means that an eigenstate of the ( N 1)-electron system
may be realized by keeping the previously available N electrons in their one-
xi
73
particle states and adding the ( N 1)-th electron in one of the oneparticle
states pu* of the N-electron system which were previously not occupied.
Thus, by adding an electron, the energy of the system rises approximately
by Ey*.This means that the eigenvalue Ev* of the one-particle Ilamiltonian
may be understood as the energy of an electron added to the system. This
statement is called Koopman h e o r e m . From it, one can learn more about the
kind of elementary excitations of the N-electron system that are described
by the p,. These are states in which, as always, all electrons of the system
are involved, but not all in the same way. Only one of the electrons is moving in such states, while the others play a passive role; they determine the
potential in which this movement occurs. One therefore refers to these states
as one-particle excitations of the N-electron system, and to their energies as
one-particle excitations energies or, in short, one-particle energies.
In addition to the one-particle excitations considered above there may yet
be others. This can be confirmed by taking a (N -1)-electron system instead
of the ( N 1)-electron system. The missing clectron corresponds to a hole
in a previously occupied oneparticle state ( p y ~ . The excitation energy of the
hole is -EvO, which did not occur among the oneparticle excitations considered above. It therefore represents an additional one-particle excitation. If
an electron is removed from state vy and simultaneously an electron is added
in state
then this corresponds to the excitation of the N-electron system
from state ($, Y:, . , . , v k ) into state (v;, v:, ..., I&). The energy difference
with respect to the ground state amounts to EV;- F 0 . It corresponds
5
to the excitation energy of a11 electron-hole-pair with the electron in state
pu; and the hole in state p 0 . If one excites a second electron from state
v1
'p o into state p,,;, the energy difference with respect to the ground state is
u!2
(Eq - Ey:) (By;- E e ) , etc. The excitation energies of the N-electron
system can thus be written as a linear superposition of oneparticle energics.
This is valid only within the one-particle approximation. In a strict seiisc
one also has many-particle excitations, which will be considered in more detail below. As far as the one-particle excitations are concerned, there are no
others than the ones considered above, at least as long as one ignores spin
and the magnetic interaction between electrons.
vT,
74
d?
n
PU
75
i),
i),
i)}
(~~(4)~
(2.55)
In general, however, the orbital and spin motions are coupled. This is mainly
due to the fact that, on the one hand, the spin motion is accompanied by a
magnetic moment of the electron, and that, on the other hand, the orbital
motion gives rise to a magnetic field which couples that magnetic moment.
In quantum theory it i s shown that this interaction, which is called spin-ovhit
intemction, can be represented by the following additional term H,, in the
oneelectron Hamiltouian:
Tz
H,, - -[VV(X)
4m c
(2.56)
x p] . (7.
i),
u y - ( i0
0')
1
3
(2.57)
Taking account of spin and the spin-orbit interaction, the one-particle Schrodingw equation (2.55) in Hartree approximation t,akes the form
76
(2.58)
(2.59)
Employing such SlaPer determinants as N-particle wavefunctions, as opp o s d to simple producls, a one-particle Schriidinger equation far pv(x) may
be derivd in the same way as before, but the potential in this equation ia
77
somewhat different than that in the Hartree equation. It contains an additional contribution, the so-called exchange potential Vay(x),and the total
potential reads
+ VH(4 + W X ) .
V(X) = K(x)
(2.60)
In the case of negligibly small spin-orbit interaction, the orbital state may
be characterized by a separate quantum number v t , and the spin state by a
separate cpiantum number m i . T h c spinor components cp~,(x,,
8%) are of thr
form (2.55) in such circumstances. The sum VH(X) Vx(x) of the Hartree
and cxchange potentials can then be written in a relatively simple form. It
can be shown thilt theii action on the coordinate dependent factor of the
oncparticle wavefunction cpl,(x)takes the form
(2.61)
The Erst term on the right-hand side of this cquation is t,heHartree potential.
The factor of 2 results from summing over the t,wo spin states associated with
each wavefunction cpvk(x).The second term corrcsponds to the exchange potential. Formally, it differs from the first term by exchanging the states at
the two positions x and x. The factor of $ reflects the fact that, firstly,
the exchange potential acts only between electrons of the same spin, and,
secondly, that for the ground state with total spin 0, half of the electrons are
in spin-down states, and half are in spin-up states. In this way the magnitude of the exchange potential is influenced by the existence of electron spin,
although its value is the same for spin-up and spin-down states. Equation
(2.61) also shows that, unlike the Hartree potential, the exchange potential
is non-local. The effect of the exchange potential on the wavefunction p,(x)
is represented by an integral operator. In actual calculations one often uses
a local approximation for Vx(x). The exchange potential proofs to be attractive, which is to be expected: the anti-symmetric form (2.59) of the total
wavefunction ${A} means that the probability of finding two electrons with
the same spin at the same position is zero so that one has an exchange hole
around each electron. This lowering of electron density in the vicinity of an
electron results in an attractive potential in addition to the repulsive Hartree
potential since the total Hartree wavefunction (2.37) does not account for
the exchange hole.
The improved oneparticle Schrodinger equation with the potential of
(2.60) and (2.61) is called the Hartree-Fock equation, and the oneparticle
approximation, which underlies it, is called Hartvee-Fock approximation
Thereby, the Hartree and exchange potentials are understood as those for the
78
79
which will be needed in Chapter 3. In doing so, we will concentrate on oneparticle excitations, i.e. individual electrons and holes moving in the force
field of all other electrons as well as in the force field of the atomic cores.
Correlation effects on one-particle excitations. Density functional theory.
where cpvi(x) denote the one-particle states which, in the ground state of
the N-electron system, are populated by electrons z = 1,2, ..., N. According
to the variational principle of quantum mechanics, the wavefunctions cpv,(x)
adjust so that, while keeping their norms (cpudlqv,)
constant, the total envrgy Eo[n(x)]is minimized. This requires the vanishing of the variational
derivative of the functional E o [ n ( z ) ]
E,(cp,Ip,)
with respect to p:*(x),
where the factors E , are variational parameters, therefore
-xi
(2.63)
80
exchange and correlation energies which are usually summed in the exchangecorrelation energy E x c [ n ( x ) ] .Thus
+ Exc[n(x)l.
(2.64)
) ] easily obtained as
The functionals E c [ n ( x ) ]and E ~ [ n ( x are
(2.65)
E ~ [ n ( x=
)]
//
R R
d3x'd3x
n(x') . n ( x )
Ix' - XI '
(2.66)
The functional E x c [ n ( x ) ]is less obvious. It is usually taken in a local approximation called the local density approximation (LDA). The LDA starts
with the homogeneous electron system without any external potential. In
this case the density n ( x ) is a constant n in space, and E x c ( n ) reduces to an
ordinary function of n. Dividing E x c ( n ) by the total number nR of electrons
yields the exchange-correlation energy E X C ( ~of) the free electron gas, per
electron. The total exchangecorrelation energy Exc(n) of a weakly inhomogeneous electron gas of density n ( x ) should then be given approximately
by the expression
(2.67)
Finally, the kinetic energy fuIictiona1 E:k.tn[n(x)]has to be specified. By
definition, we have
(2.68)
Although this expression does not look like a functional of n ( x ) il is indeed possible l o transform it into such a form because all other terms in the
total energy functional Eo[n(x)]of (2.64) are functionals of n ( x ) , and the
Hohenberg-Kohn theorem enforces this for E & t ( x ) ] . For the waluation of
the variational equation (2.63) we do not, however, need the explicit functional form of EkznIn(x)l;expression (2.68) suffices. Its variational derivative
with respect to p;,(x) iu given by
(2.69)
81
(2.71)
(2.72)
where
(2.73)
denotes the ~xcliangPcnrrelationpotential. The latter can be determined
if the exchange-correlatiou energy E x c ( n )of the homogeneous electron gas
is known as fniictio~iof n. This dependence can hr obtained by calcrilating E>yc(n)numerically for tfifkrent values of n and then lilting the data
to appropriate explicit functions. TJsing this procedure. various Pxchang+
correlation potentials have been proposed, for exaniylr
Lkc(x)- -
(:)
1/3
e2n1j3[x)[I
+ 0.7734 z In
with
v(x) = VC(X) I Vrr(X)
+ i<Tc:(x)
(2.76)
82
2.3
2.3.1
+ R.
(2.77)
states in
crystal
83
The set of all lattice translation operators forms the translation group, which
we already encountered in Chapter 1. Since the lattice translation operators
are symmetry dements of the crystal, the translation gioup is B subgroup of
the space group which, by definition: contains all symmetry elements of the
crystal. As in the case of translations, we also assign operators to rotations,
reflections and rotation-reflmtions of the point group of directions. These
operators also art on the spatial vectors x, and W P denote them by the
symbol a and call them point symmetry operataorw. For the symrnorphic
space groups considered here, each element y may be thought of as a product
0 . t or
~ t ~ r of
r a translcttion t~ and a point syrnrnetrg operation a. Since an
arbitrary position vector x may be represented in a basis spanned by the
three primitive lattice vectors al.a 2 and a3, it suffices to s p e d y the &ect
of a on these. We define (see Appendix A}
(2.78)
with ov as real coefficimts. By means of this relation each opeiator a is
uniquely associakd with a corresponding matrix aV. If the thee primitive
lattice vectors a, are orthogonal and of the same length, the uaJform orthogonal niatriw$, i.e. their inverse matrices are the same as their transposed
ones. However, this holds only for the primitive cubic lattice, and is not true
fur all other 13 Binvais btticrs. Thus, the utJ arc not iu general orthogoual
matrices. The effect of a on an arbitrary position vector x may easily be
d e t e r m i n d by ~ n e a n sof i t s decomposition
(2.79)
x -czja3
t
xj
(2.80)
j
(2.81)
from which it follows that the transformation a , which was ariginaIly defined
as a transformation of the basis vectors ai with fixed coordinates zz,may also
be understood as a counter-tra~sformation
of thc coordinates with fixed basis
vectors. Indeecl, lh coordinate counter-transformation takes place with the
84
(2.82)
If the crystal
tf$(x) = S ( t R ' X ) ,
(2.83)
a S ( x )= S ( a - l x ) .
(2.84)
It is striking that the operators t~ and a act on S(x) in such a way that
x ifireplaced by tR1x or m - l x . but not by k ~ or
x c k x . as one might hme
expectd. The rhosen deftnition strms from the rrcognition that the transformed property is that of the transfarmrcl crystal at the original position
x. This is the Sam?, howewr. as the proprrty o f t h e original crystal at
the inverse transformd position. Formally the definitions (2.83) and (2.84)
giiarantee the correct multiplication order of two no11 commuting operators
under the fiinction symbol, because ( q c q ) - ' - rtl-1 a -1
2 . Applying these
definitions to the potential V ( x ) and simultaneously requiring crystal symmetry, w r have
v(xj.
(2.85)
(2.86)
(2.89)
(2.W)
85
(2.91)
(2.92)
]u,
H ] = 0.
(2.93)
Since the elements g of a symmorphic space group may be written as products of t~ and C Y , one also has
2.3.2
Bloch theorem
(2.95)
86
crystah
(2.100)
Consider now the eigcnfunction g i p p If the symmetry operation g ranges
over the whole space g o u p , the totality of vectors g p spans
~
a subspace of
the Hilbert space w l i o s ~dimension d i s in general larger than 1. This is to
say that the eignvalue E, for symmetry reasons, is d-fold degenerate. It
can be shown that, apart from spmial cases, d equals the number of different elements m of the point group, and then the d basis functions of the
subspace may be chosen in the form
with Q ranging over the entire
point group. This result will not. howevpr, be usrd in what follows and the
basis funrtions will be written in the general form p ~ lp, ~ 2 ,. .. , IpE& They
span that subspace of the Hilbert space which contains the eigenfunctions
of the Hamiltonian with the eigenvalue E . According to i t s construction
this space is invariant under the operation of an arbitrary element g of
the space group. In terms of the concept of rmducible represenfatema of
groups (an introduction is provided in Appendix A), we may alternatively
express this observation by saying that the eigenfunctions of H for a particular eigenvalue E give rise to a &dimensional irreducible representation of
the space group. The same statement also follows for each subgroup of this
group, particularly for the subgroup of all translations. The space spanned
by r p ~ lP, E T , . . . , ( p ~ d :thus also provides B representation of the translation
group. However, this representation is, in general, no longer irreducible.
~
be transloorrued
That means that the original basis p ~ pl ,~ 2 .,. . ,( P E may
(d-1) x I
into a new basis (p~,(pk,'plh;,.,pE
in such a way that the representation matrices of all translation operators t~ written in the new basis are
constructed from lower dimensional matIices, odered along the diagonal. It
is of special importance that the translation group is a group of Abehan type:
The resiilt of two translations t~~ and t~~ does not depend on the sequence
in which they are executed. Formally this is expressed by the equation
(2,101)
As is demonstrated in Appendix A, all irreducible representations of Abelian
groups are 1 dimensional. This means that the lower dimensional matrices along the diagonals of the tH-representation matrices are 1-dimensional.
(d-1) x I
Each of the basis functions &, ,p;c . . . , ' p E
therefore forms a represrtntation space of the translation group by itself. Thus, su functionstRpE,
with t~ an arbitrary element of the translation group, are linearly dependent
on p~ and dl functions t ~ &are linearly dependent on & etc. 'I'his means
that thp t p p itself
~
may be written in the form
tRcPE(x) = c(R)rFE(x)
(2.102)
87
Bloch theorem:
The eigenfunctions p~ of the Hamiltonian H of a crystal can be chosen
such that they are simultaneously eigenfunctions of the lattice translation
operators of the crystal
The particular energy eigenfunctions whose existence is stated by this theorem are termed Bloch functions. The proof of the Bloch theorem sketched
above relies in an essential way on features of group representations. Although it forms the most appropriate proof, one may, however, also proceed
without the tools of group theory, using a mathematical theorem which eos u e s that two commuting Hermitian or unitary operators have a common
set of eigenfunctions.
The Bloch theorem is of such great importance that several remarks are
appropriate. In the first place, it has to be emphasized that this theorem is
an immediate consequence of the commutivity of translations. For the symmetry operations a of the point group, for example, which in general do not
commute, it does not hold, which means that in general the eigenfunctions
of H cannot be chosen to be simultaneous eigenfunctions of the operators u
of the point group. Secondly, the theorem does not say that every conceivable eigenfunction of W is also necessarily an eigenfunction of t R . This holds
only for specially chosen c p ~ .The Bloch theorem insures that such a choice
is always possible. Thirdly, this theorem also does not imply that only one
eigenfunction exists for particular eigenvalues E and c(R) of, respectively, H
and t R . In reality there are alwtLys several. The pair of eigenvalues El c ( R )
is therefore not sufficient to uniquely characterize the eigenstates of H and
of the group of operators t R . Quantities which provide such unique characterization have yet to be identified. It, turns out that there are three real
numbers k l , E z , k3 which allow one to distinguish the different irreducible
representations of the translation group of the crystal.
To understand this we must examine these representations in greater
detail. They are defined by the eigenvalue equation (2.102) of the translation
operators. T o start, we will show that the eigenvalues c k l k 2 k 3 ( R ) of this
equation may be written in the form
(2.103)
P k 1 k 2 k 3 ( R )=
(-2r)(hri
+ ~ Y+ zh ' 3 )
(2.104)
Here, the E l , kz, k3 are the above mentioned real numbers which determine
the representations uniquely. The factor L 2 ~is1 introduced to simplify expressions which later will arise. To prove equation (2.103) we first show
88
(2.105)
(the subscript indices k l , k2, Ic3 on c(R) and P(R) will be suppressed temporarily). The proof is based on the normalization of p~ according to equation (2.97), which leads to
(tR(PE t R P E ) = ( P E (PE).
(2.106)
(tRLPE tR(PE)
-I
c(R)
l2
(PaE I P E ) .
(2.107)
Considering (2.106) and (2.107) together, (2.103) is verified at once. Secondly, we show that
c(R1+ R2) = c(R1) . c(R2)
(2.108)
Comparison of the last two relations immediately shows that (2.108) is also
true. Employing (2.103) for c(R), equation (2.108) now yields
exp[iP(Rl+ Rdl = exp[iP(R~)I
. ~xP[WWI,
(2.112)
whence
(2.113)
(2.114)
where u ~ k ~ k ~ k denotes
~ ( x ) a lattice-periodic function, such that, for any
lattice vector R,
UEklkzk3(X
- R) = UEk1kzk3(X)'
89
(2.115)
The factor l/& with 0 = G300is introduced in (2.114) in order that the
normalization integral of UEklkzk3 (x) with respect to a primitive unit cell be
1. The proof of (2.114) may be carried out by verifying that functions of the
form (2.114) obey equation (2.102). This results in
The real numbers kl,k2, k3 characterizing the various Bloch functions may
he understood as components of a vector. However, as we will soon see, this
is not a vector in coordinate space, but one in a space which is reciprocal to
coordinate space.
2.3.3
The starting point for understanding the nature of the components k l , k2, k3
is their transformation law under point symmetry transformations a in coordinate space. This will now be explored.
Transformation properties of kl,122, k3
= e x ~ l i P k , k ~ k - , ( ~ ~ ~ ~ R ) l ~ ~ ' ~ ~ k ~ k(2.117)
~ k ~ ( ~ ) .
~ ( ~ ) an eigenfunction of
The latter relation means that a p ~ k ~ k ~ isk indeed
~R)].
the translation operator t R with eigenvalue e ~ p [ i P k ~ k ~ k ~ ( a -Evaluating
,f3klkZ~3(a-1R)
explicitly using equations (2.104) and (2.82), we have
90
(2.118)
j
(2.119)
must hold. One may therefore write
(2.121)
3
'
bj = 2 ~ 5 i j
(2.122)
These equations state that the vectors of the T & P T Q C ~ basis me normal to,
respectively, one of the three planes spanned by pairs of direct basis vectors.
91
2x
27r
00
00
x a11
2x
Qo
, b3 :
-[a1
x az] .
(2.123)
Here Ro = a1 . [a2x a31 is the volume of a primitive unit cell of the crystal
lattice.
If one s u bj et s the direct basis to a rotation or reflection, this induces
the same rotation or reflection of the rwiprocal basis, as follows immediately
from the defining equations (2.122) or (2.123). Because of these relations the
'reciprocal' basis (bl, bz, b3) is rigidy joined with the direct basis (al:a2,ag)
and will transform when the latter is tranaformd However, since the reciprocal basis is different from the direct one (only if (al,az,a3) are orthogonal
unit vectors are the two basis sets the same), the matrix LY which describes
i is different from that which transforms the
the rotation or reflection of the a
bj. Using equation (2.123) one may easily show that the bj are transformed
with the transposed inverse of the transposed matrix u p which transforms
the q ,that is, with the inverse matrix a;'. In other words, the inverse
matrix describes the same rotation OF reflection of the reciprocal basis as the
transposed matrix docs fur the direct basis.
Consider now an arbitrary vector k of reciprocal vector space, which can
be written as
(2.124)
i
cxk.
(2.125)
92
Bklkpl~3(R)
-k. R.
(2.126)
Using this expression the eigenvalue of t,hP treiistntion operator (2.103) becomes
R)
UEk(X)
(2.129)
4= " E k W
(2.130)
for arbitrary vectors r. This mcans that the Bloch factor ugk(x) is a cons t w t , and t,he eigenfunctious of the HamiItonian are of the form
(2.131)
This is just the well-known result that the stationary states of a reeelectron
may be taken as plane waves of a given wavwwtor k. If the potentid is not
completely constant but. as happens in a crystal, it remains constant only
under translations through lattice vectors, the meaning of k as wavevector
is largely prwervd. It is thrrefore called a qaasa-wavevector: Of course, the
dimension of the quasi-wsvevector is also that of a reciprocal length. The
Bloch functions y ~ k ( x )of (2.128) may be understood as travelling waves.
modulated spdiallJ by thr latticeperiodic Bloch factor U E ~ ( X ) .That the
stationary oneelectron states may be chosen in this form, is the content of
B l o c h ~thwrpm. This theorem d o e riot say, howevrr, that these states are
necessarily modulated tmvellzng plane waves, just as t h e stationary states
93
states in a crystd
of a free electron need not be travelling plane waves. One ran also h a m
standing plane waves, sphrriral waves d c .
Discretization of k-space
'The periodicity condition (2.96) for the eigenstates of the Hamiltonian has
largely been ignored so far. Fortunately, it can be satisfied very simply. The
Ebch fuucfions V E ~ ( X )obey the relation
k,,- G
with 4
BS
lj, j
= 1!2 , 3 ,
(2.133)
The k-vectors defined by (2.133) form a finely meshed net in k-space (see
Figure 2.4). The only permissible k-vectors must be points of this net since
the Bloch functions are periodic with respect to the Periodicity region. The
k-space thus has a discrete structure; its set of points is countable. The
distance betwren different k-vectors is, however, so small because of the
large size of C that k can pract,ically be treat4 as it continuously varying
quantity despite its discrete character. This approximationwill be used later
extensively.
Reciprocal lattice
Just t t h in direct spaw, one may also consider a point lattice in reciprocal
space. In this context, the so-called mciprocal lattice i s especially useful. It
is dehed by taking the reciprocal basis vectors bl,bz,b3 of (2.123) aa its
94
K =CKjbj
(2.135)
-(P
Izm
UE&)
= EUEk(X).
(2.136)
95
however, just from the space and time symmetry of V(x). We will discuss
this now, beginning with point symmetry, whose implications are relatively
easy to examine.
Point symmetry
We consider the Schrodinger equation (2.95), on the one hand, for the quasiwavevector crk, where TY i s taken to be an arbitrary element of the point
group of the crystal:
(2.137)
and, on the other hand, for the quasi-wavevector k, but with an application
of the transformation (Y as follows
HWEdX)
E(kbPEk(X).
(2.138)
The relations (2.120) and (2.124), derived in the preceding subsection, mean
that U p E k is an eigenfunction of H for quasi-wavevector ak. If there is
only orbe eigenfunction for a given wavevector, which we initially take to be
the case, a c p ~ k ( x must,
)
be identical with ( P E ~ ~ ( x
because
),
( P E ~ ~ ( is
x )by
definition the eigenfunction for a k . Thus,
(2.140)
The assumption that there is only one eigenfunction of the Hamiltonian for
the quasi-wavevector k holds for almost all k. There is an exception for
those special k values for which ak does not differ from k or from a vector
k K equivalent to k for all a. One refers to such vectors as symmetrical kvectors. They will be considered in section 2.5, but we exclude symmetrical
k-vectors at this point. For non-symmetrical k-vectors, the fact that there
is only one eigenfunction of H , follows from the onedimensionality of the
irreducible representations of the translation group (see Appendix A).
96
time reversed Bloch function of wavevector k has wavevector -k. Since both
Bloch functions have the same eigenvalue, one obtains
E ( k ) - E(-k).
(2.141)
(2.143)
K .R1
with K , and T % as integers. Therefore we obtain the full spwtrum of eigenvalues of t R if k varies within a particular primitive unit cell of k-space. The
corresponding k-vectors will br denoted by kl. The k-vectors of any other
primitive unit cell differ from these by a reciprocal lattice vector K and thus
do not lead to new eigenvalues of the translation operator. However, the energy cigenvalues g ( k ) do differ, so that E(k1) f h(k1-t- K ) holds in general.
Table 2.3 illustrates these connections.
This asymmetry with respect to the k-dependence of rk(R) and E(k) is
the reason for the introduction of two different representation schemes for
the energy eigenvalue function E(k). In the representation scheme employed
thus far, k varies over the entire reciprocal space, and to each vector k we
associate only one k-dependent energy function E(k). With respect to the
eigenvalues of the translation operator, this description i s highly redundant,
since one encounters the same eigenvaliie an infinite number of times. This
disadvantage is eliminated if the selection of the region over which k varies
is not fitted to the energy E ( k ) but rather to the ck(R) eigenvalues of the
translation operator, i.e., if k is allowed to vary only over a primitive unit cell
rather than over the whole k-space. Then we must accept, however, that an
infinite number of different energy eigenvalues are assigned to each k-vector,
namely all those which follow frotn E ( k ) by means of the prescription
97
Table 2.3: Representation of band structure in the cxtcnded and reduced zonc
schemes.
I
Wavevector
Eigenvalue of
tR
Eigenvalue of H
Represent at ion
I
General
[-ik. R]
E(k + K)
~ X P
k from
Infinite
Space
k = kl from
Unit
Cell
exp [-ikl
. R]
1
Extended
Zone Scheme
'
Reduced
Zone Scheme
98
We know, however, that lattices have fewer point groups than there are
point groups of equivalent crystal directions. The next-lower lattice point
group would not contain the needed point group of directions. The issue
in question thus revolves upon whether the function branches EK(k) are
continuous in the interior of the Wigner-Seitz cell of the reciprocal lattice at
K = 0. We will prove that this is indeed the case for the branch Eo(kl) in
the next section. Moreover, it will be shown that the other branches EK(k1)
of the infinitely multi-valued function E(k) may be redefined in terms of
new branches such that each single-valued branch is also continuous over
the Wigner-Seitz cell at K = 0. One calls these continuous branches energy
bands and the Wigner-Seitz cell of the reciprocal lattice at K = 0 is the f i r s t
Brillouin zone.
Having started from general considerations and obvious conjectures, we
thus arrive at the important conclusion that the spectrum of allowed energy
values of crystal electrons will have the form of energy bands, and that these
energy bands arise by varying the quasi-wavevector k over the first Brillouin
zone. We will discuss the concept of Brillouin zones along with the still
outstanding proof of the continuity of E(k) over these zones more fully in
the next section.
2.4
2m
(2.145)
The perturbing operator H I , which together with Ho forms the total Hamiltonian
(2.146)
H = Ho Hi,
(2.147)
Cpk
99
and eigenvalues E ( k )
+ 6E2(k)+ . . .
(2.149)
with respect to the perturbation potential, here with respect to the periodic
potential V(x). In this discussion, we suppress the energy index E of the
eigenfunctions for the sake of brevity. This leads to no ambiguity since we
work in the extended zone scheme, where the energy is a unique function of
the quasi-wavevectors k. The zero-th order terms obey the equation
Hop; = Eo(k)Cpg.
(2.150)
In order that the perturbation theory be applicable, the zero-th order solutions
must form a complete orthonormal set of functions, such that
and
-yCpp(x)Cp;(x) = 6(x - x)
(2.152)
must hold. The k-summation of (2.152) is extended over all points of the
infinite finely meshed net of Figure 2.4. The two relations (2.152) and (2.153)
are in fact valid for the solutions
(2.153)
of equation (2.150). The validity of the orthonormality relation (2.151) may
be easily verified by direct calculation, and the completeness relation (2.152)
is proved in the theory of Fourier series. The energy eigenvalue Eo(k) of
zero-th order for pg is
0
Ti2
E (k) = -k2
(2.154)
2m
We will assume at the outset that this eigenvalue is not degenerate, more
exactly, that this holds for k-values for which the perturbation operator V
has non-vanishing matrix elements (pi! I V I ~ p g )with any vector k. It is
not possible to exclude degeneracy completely because wavevectors k of the
same length lead to the same energy eigenvalue (2.154).
100
2.4.1
(2.155)
(2.156)
The energy correction 6E1(k) of the first order i s the average value (cpg I
V I pi) of the periodic potential. This value may be set equal to zcro. The
off-diagonal matrix element of the potential involved in (2.1551, (2.156) are
given by
(z.157)
Because of the lattice periodicity of V(x) we can transform the integral over
the periodicity region in (2.157) into a sum o f integrals over the primitive
unit cell Qo(0) at R = 0:
Noling that
(2.159)
(2.160)
we arrive at the result
Using this result, the correction 6 ~ :to the wavefunction takes the form
(2.162)
and the total wavvcfunclion
p k = p!
One recognizes that this has the expected Bloch form (2.128) b r energy
eigenfunctions of the crystal Hamiltonian. Moreover, (2.163) yields the Bloch
factor as
(2.16 4)
The second order energy correction E2(k) may be determined using (2.161)
as
(2.165)
The corrections to the eigenfunctions (2.162) and the enerffi eigmvtrlues
(2.165) will become large if the denommator in these expressions becomes
srnall. Actually. the denominator may even vanish when
E"(k)- Eo(k I K ) - 0.
(2 166)
1
k.K+ -IK
12= 0.
(2.16 7)
2
This equation can be solved geometrically. It is obvious from Figure 2.5
that the tIps of the corresponding k-vectors lie on a plane perpendicular
to the vectors K or -K, with this plane intersecting the vector -K at its
half-length. Such planes are familiar from the theory of X-ray diffraction by
crystal lattices. They aTe called Bmgg reflectam planes.
The non-degenerate perturbation-theoretic results discussed above may
be interpreted quite clearly if one regards the problem of the calculation of
the eigenfunctions in the periodic potential as a scattering problem. In this,
the unperturbed eigenfunctions p: are thought of as incoming waves and
:
to
the perturbed eigenfunctions 9 k as outgoing waves, 6 ~ corresponding
the scattered part of the latter. That the perturbation theory according to
formula (2.162). yields only minor corrections to the unperturbed eigenfunctions is related in this picture to the fact that the incoming and scattered
plane waves in\-olve different wavelengths, since 1 k If jk KI, and correspondingly they are not capable of significant constructive interference.
102
-&
of
Bragg
-K
For incoming waves with k-vectors close to the Bragg reflection plane of the
reciprocal vector -K, the scattered part dpk contains a p a r t i d a r l y large
plane wave component of wavevector k + K. This is due t o the fact that,
in this case, the incoming plane wave and the Ecaltered wave of wavevecK, have almost the same wavelengths. They are thus rapable of
tor k
constructive interference which strongly enhances the amplitude of this particular scattered wave. If k is located exactly on the Bragg reflection plane,
one may understand the scattered wave with wavevector k
K as the plane
wave reflected at this plane in the sense of geometrical optics. This follows
immediately from the construction of the k h g g reflection plancs in Figure
2.5. The scattered wave of wavevector k
K is strengthened by interference, w h c h makes it the refkected wave (bear in mind that in the wave
picture reflection represents an interference phenomenon). This interpretation can be universally applied to the propagtltion of plane waves of any kind
in a system of scattering centers periodically ordered on a lattice, including
X-rays. Actually, the above results for elwtron waves were discoverd iisinE
X-ray radiation.
2.4.2
1
ko.K+ - IK
12=
(2.168)
k=ko+Ak
(2.169)
is then a vector close the Bragg plane involved. The same holds for the
vector k K, as it lies close to the parallel plane in Figure 2.6.
According to quantum mechanical perturbation theory for the case of
degenerate zero-th order states, one proceeds as follows. The perturbed
eigenfunctions cpk are sought as linear combinations of the two (almost)
degenerate eigenfunctions
and (p&
in zero-th order, writing
cpi
(PIFo(x)= cocp:(x)
+C K d + K ( X )
(2.170)
Eo(k) - E
V(K)*
(2.171)
V(K)
Eo(k$ K ) - E
For the set to have a non-trivial solution, its determinant must vanish,
whence
Eo(k) - E
V(K) = 0
V(K)* Eo(k+ K ) - E
(2.172)
1
+ Eo(k + K)] f -\/[Eo(k)
- Eo(k + K)]*+ 4 I V(K) I*.
2
(2.173)
For further evaluation of this expression, we assume that Ak is directed
parallel to K (see Figure 2 . 6 ) . Expanding the square root in powers of
I Ak I and terminating the series with the second order, we find
E*(k)
Eo(ko)f
I V(K) I 4 li2
jpK2
1 f 2m
I V(K) I]
Ak2.
(2.174)
104
Figure 2.7: The degeneracy of the two energy parabolas E(k + Ak) and E(k
K + Ak) at the Bragg reflection plane Ak = 0 (left part of figure) is removed by
the periodic crystal potential. In the formerly continuous energy spectrum a gap
arises at this plane (right part of the figure).
The remarkable feature of this result is that the 2-fold degenerate energy
eigenvalue Eo(ko) = Eo(ko K ) at the Bragg reflection plane, i.e. for
Ak = 0, splits into two different eigenvalues which are shifted by IV(K)I
to higher and lower energies, respectively. Between them there is a region
of width 2 I V ( k ) I where no allowed energy values may exist (see Figure
2.7). This means that, in the formerly continuous energy spectrum of the
free electron. an energy gap appears as a result of the periodic potential.
This occurs at all Bragg reflection planes, provided that the corresponding
Fourier components I'(K) of the periodic potential do not vanish. Except
for the Bragg planes, the function E ( k ) and the energy spectrum remain
continuous. The result derived here by means of perturbation theory is
also valid well beyond the framework of validity of this approximation. The
following statement remains true also in the general case.
In regard to the shape of the function E*(k) close to the Bragg planes,
the relation (2.174) provides the following picture (see Figure 2.7). The two
branches E+(k) and E-(k) approach Ak = 0 with horizontal tangents to
the limiting values Eo(k) IV(K)I and Eo(k)- ll'(K)I, the upper branch
with positive curvature, and the lower with a negative one. The latter observation follonrs from (2.174) since the applicability of perturbation theory
is restricted in validity by 1 V(k) I << h 2 K 2 / 2 m ,which means that the sign
105
2.5
Band structure
2.5.1
Brillouin zones
Definition
Consider, in k-space, centro-symmetric bodies which contain the origin and
are entirely bounded by parts of Bragg reflection planes. These bodies are
arranged and enumerated according to their volume. The smallest will be the
Wigner-Seitz cell at K = 0. The second, next-largest body, has the volume
of two Wigner-Seitz cells, as one may easily demonstrate. It contains the
106
ZONE
9 1 0
Figme 2.8: The first, 10 Rrillouin zones or H sqimre plane 1al.tic:e. (After Brillouin,
1953).
first body jointly with the Bragg planes bonnding i t . If one removes the
first body from the second, then one obtains a hollow body of the volume of
one Wigner-Seitz cell, which is bounded by Bragg planes inside and outside,
but contains no siirh planes in i t s interior. This conhtruction may be carried
further. Removing the v-th body from the (v 1)-th one again obtains a
hollow body having the volume of the primitive unit cell in k-space. It i s
boiindd by Bragg planes, and has no such planes in its interior. These
difference bodies are called Brdloum zones (abbreviated 8 8 s ) . The v-th
body i s called the v-th R%, the Wigner-Seitz cell at K = 0 is, accordingly,
the first B Z . In Figure 2.8 the first 10 Brillouin zones are shown for a
2-dimensional square reciprocal lattice. The first B Z for two important 3dimensional lattices, i.e. the frr lattice and the hexagonal lattice, are shown
in Figure 2.9. We will derive the shape of the first of them below.
Focusing on a point k, of the v-th R Z , a v e t o r K(k,) of the reciprocal
lattice is attached k, such that k,- K(k,) represents a vector kl of the first
BZ,
kl = k,
- K(k,).
(2.175)
107
Figure 2.9: The first Brillouin zones of the two Bravais lattices which apply to many
semiconductors: (left) the face centered cubic lattice of diamond, zincblende and
rocksalt structures; (right) the hexagonal lattice of wurtzite and selenium structures.
We omit the formal proof for this important result here. For the square
lattice it follows immediately from Figures 2.8 and 2.10. Its validity in the
general 3-dimensional case is clearly plausible. If, instead of k,, another
point kh of the v-th B Z is chosen, another reciprocal lattice vector K(kh)
may be necessary in order that kh - K ( k l ) shall be a vector of the first
B Z . The set of reciprocal lattice vectors K(k,), K(kL), . . . which arises if
k, varies over the whole v-th B Z , is just the set of K-vectors which defines
the internal boundary planes of the v-th B Z . The correspondence of the
points of the v-th B Z to points of the first B Z given by equation (2.175)
is unambiguous in both directions - each k, corresponds exactly to one kl,
and each kl exactly to one k,. One terms this correspondence folding of
the v-th B Z into the first. The term displacing would probably provide a
better description. During such folding or displacing, the external or internal
boundary planes of v-th B Z may be translated to the interior of the first
B Z . There, they border on other boundary planes of the v-th B Z . This
means that original kl-vectors which lie immediately to the right and to the
left of such planes will involve different reciprocal lattice vectors.
The folding operation (2.175) provides the mechanism for changing from
the extended description of the energy function E ( k ) over all infinite k-space
to the reduced description within the first B Z . The transfer of the function
values is, defined above by relation (2.144). It states that the value E(k) at
a point k = k, of the v-th B Z is assigned to that particular point of the
first B Z which arises from k, by folding the v-th B Z into the first. By this
assignment a multi-valued function E,(kl) is defined within the first BZ.
The unique function E(k) over the whole k-space is mapped into the multivalued function E,(kl) within the first B Z . Formally, this is expressed by
108
Figure 2.10: Folding of the second through ninth Rrillouiri zones into the first R Z
for the plane square lattice of Figure 2.9. (After Brillouin, 1953)).
the equation
Ev(ki)- E ( k i
+ K(k,))
E(kv).
(2.176)
We maintain that the branches E,(kl) of the multi-valued function are continuous within their entire range of ddinition, i.e. within the fust B Z . For
points kl which correspond to points k, of the interior of the v t h B Z ,
the validity of this assertion is obvious. For those psrticiilar kl-planes srising from a bo~iirlaryplane of the v-th B Z , continuity is not immediately
evident, for in penetrating such a kl-plane the original vector k,, of the Yth R Z jumps by the negative of the rcciprocal latticc vector. which defines
this boundary plane. However, this jump does not affect the energy eigenvalue E(k,) in the extended zone scheme as may be seen by means of the
degenerate perturbation tbwry in swtion 2.4. Thus the functions E,(k) are
also continuous on the particular planes in question.
Each of the function branches F;,(kl) defined above encompasses some
finite interval of values on the energy scale. The term energy hnnd for the
set of E,(kl)-values is thus obvious; I / is the so-called band andm Between
the various function branches, or energy bands, energy regions may exist in
which no energy eigenvalues occur. These regions are called forbidden zone8
or energy gaps. l'he whole s r t of functions R,(kl) is referred to collectively
as thr band structure.
In Figure 2.11 we illustrate the band structurc using the model of a
1-dimensional lattice. The solid curves correspond l o a rornplelely free electron, and the dashed ones to an elwtzon in a weak periodic potential. One
recognizes that energy gaps orcur not only on the boundaries of the first BZ
109
Figure 2.11: Rcduction of the energy parabola of a free electron into the first B Z
of a 1-dimensional lattice. The changes caused by a weak, periodic lattice potential
are indicated by dashed lines.
but also at its center at k = 0. This is due to the fact that, during folding,
surface points of the first B Z are displaced to the B Z center, and the energy
gaps from those points also appear at the center.
The folding procedure into the first B Z must also be carried out for the
eigenfunctions of the Harniltonian. Let p v k l denote the Bloch function of
energy E,(kl) and of reduced wavevector kl. In the extended scheme the
same wavefunction reads q q k , ) k v ( x ) . Thus,
110
f f ~ v / c ( x=
)
(2.178)
&(k)pvk(x).
The first RZ is the fully symmetric primitive unit cell of the reciprocal
lattice. The relation between the reciprocal and the direct lattices was discussed in section 2.3, where it was seen that each of the 14 direct Bravais
lattices corresponds to a particular reciprocal lattice of the same symmetry,
although not necessarily of the same Rravais type. Accordingly, there are
14 different reciprocal Bravais lattices, which ah0 means 14 different first
B Z s . Two of them, the two that are most important for semiconductors,
are shown in Figure 2.9.
Empty-lattice band s t r u c t u r e
In Figure 2.11 the energy E ( k ) of a free electron moving in 1 dimension
has been represented over the first B Z of R I-dimensional lattice. Similar
representations are possible in 3 dimensions. Below we demonstrate this for
the face centered cubic lattice which applies to crystals of the diamond and
zincblende structures, According to Table 1.2, the primitive vectors of this
lattice may be chosen in the form
a
a
a
a1 = -(O, 1,I ) , a2 = - ( l , O , I), a 3 = -(I, I, 0)
(2.179)
.
2
2
2
The corresponding basis vectors of the reciprocal lattice follow from equation
(2.123), with the result
27r
27r 2n - bl = -(I, T, I), b2 = -(l, 1,T), b3 = -(I,
a
1,l).
(2.180)
(1) 8 lattice vectors having the length of the primitive lattice vectors given
above, which means the smallest possible length overall. These vectors read:
2n
-(i,i, T),
a
27r 2s
-(1, 1,1), -(I,
27r 2x - 2n
-(I,
1,T), -(I, 1,I), -(T,T,T),
a
a
a
2n
27r
(2.181)
27r
-4%
a
0, O ) ,
27r
-(0,2,0),
a
2n
-(O,
a
0,Z).
(2.182)
111
Figure 2.12: Symmetry points and irreducible parts of the first BZ of the fcc
lattice (left) and the hexagonal lattice (right).
The first B Z of the fcc lattice obtained by means of these reciprocal lattice
vectors is shown in Figure 2.9. Perpendicular to the three cubic axes it is
bounded by squares, and normal to the four space diagonals by hexagons. It
is common to denote the symmetry points of the first BZ by capital letters.
Greek letters are assigned to symmetry points in the interior of the first BZ,
and latin letters to symmetry points on its surface (see Figure 2.12). The
center of the B Z , for example, is I?, the point at which a cubic axis cuts the
B Z surface is X , and the connecting line between I? and X is A.
We will now reduce the energy parabola E(k) = (fi2/2m)k2 of a free
electron along the A-line, i.e. for points
r = 2R
-(o,
o,o)
a
2R
2?r
o , ~ ) ; x = -(o,
a
A = -(o,
a
0, 1)
(2.183)
E,(k)
-h2
(k
2m
+K ( ~ Y ) ) ~ .
(2.184)
Using the above listed 14 reciprocal lattice vectors K(k,) in this relation,
the second and third BZ's are folded into the first. For convenience we
~ an energy unit, and set
introduce Eo = ( T ~ ' / 2 m ) ( 2 7 r / a )as
E,(k)
71
E u . ~v(k).
(2.185)
With lattice constant a of 5 A, a value of 5.9 eV follows for Eo. For ty(k)
we have
112
(2.186)
where the K,, Kvy,K,, are the components of the vectors K, in units of
2nla. In Table 2.4 we show the reduced energy functions ty(<) for the 14
vectors K, together with those for 0. In some cases the same value of the
function E,(C) is obtained for 4 different K,s, meaning that the corresponding energy bands are 4-fold degenerate. In Figure 2.13 the content of Table
2.4 is illustrated graphically, along with the band structure parallel to the
A-line between 0 and L not given in Table 2.4. Again, one recognizes that
there are k-points associated with the same energy value several times.
The band structure shown in Figure 2.13 is that of an empty fcc lattice,
i.e. a fcc lattice whose lattice points are not occupied by atoms, since the
latter would create a non-zero periodic potential V(x)if they were present.
For a vanishing potential V(x),no definite lattice exists because of its arbitrary translation symmetry. Nevertheless, a definite lattice, the fcc lattice,
and a definite lattice constant, were chosen in the above procedure. This
was an arbitrary choice, in the sense that we could have also chosen any
other of the 14 Bravais lattices, and any other lattice constants. However, if
we want the empty lattice band structure to resemble the band structure of
a really existing crystal, then we cannot take any empty lattice band structure but must chose the one for the lattice applying to the crystal under
consideration. Later we will verify that the band structures of real crystals and the pertinent empty lattice band structures have in fact common
features. In Figure 2.11 such features are, for example, the appearance of
the lowest energy eigenvalue at the I?-point and the development of several
non-degenerate energy bands from this level along the A-line, one of them
crossing another band at the X-point. It is also consistent that the energetically high-lying bands display a relatively high degeneracy at the I?-point,
although for real fcc crystals the degeneracy at maximum can be only 3-fold,
and not 8-fold as for the empty fcc lattice at the energy E = 3&, and not
&fold as for the fcc lattice at E = 4Eo. A degeneracy higher than %fold
turns out be accidental, i.e. not caused by symmetry, but by the particular
values of the potential, which is identically zero in this case. The width of
the lowest energy band in Figure 2.11 is about Eo, i.e. 6 eV. Also, this
result is not too far from the actual widths in fcc crystals, as we will verify
below.
As with the reduction along A- and A-lines, one can plot the band structure along other symmetry lines in the interior and on the boundary of
the first B Z . A complete representation of the functions E,(k) over the
points of 3-dimensional k-space would require a Cdimensional space. In the
3-dimensional space available to us it is only possible to represent bands
E,(k) over planes in k-space. Such representations are, however, uncom-
113
cy(c)
(KV&V,K,)
+ (6 + I)*
4 I cz
Notation
Degree of Degeneracy
c2
F3
mon. -4s a rulc, one plots the energy against lines in k-space,as in Figure
2.13.
The question of what valws the energy band functions E,(k) take outside the first B Z does not arise or is meaningless, since the E,(k) were
"L
A
Wavevector
114
just defined by transferring the function values from other B Z s to the first
one. For this reason, the E,(k) already encompass the entire spectrum of
eigenvalues as k varies over the first B Z . If one wishes to also consider the
functions E,(k) outside of the first B Z , one has to define them there. The
most natural manner to do this is the periodic continuation.
Periodic continuation
In this context, the values of E,(k) in Wigner-Seitz cells having centers at
K # 0 are defined by the relation
(2.187)
where k is a point of the first B Z . By periodic continuation, one can of
course extend each function originally defined only within the first B Z to the
entire k-space. The question is what analytic properties does this function
have? If one considers an arbitrary function, discontinuities will occur on the
boundaries of the Wigner-Seitz cells, i.e. the periodically continued function
will not be continuous in the entire k-space. In order to have continuity,
E,(k) must satisfy certain conditions on the boundary of the first B Z . These
conditions follow from the fact that the boundary of the first B Z consists
of pairs of equivalent parallel planes (see Figure 2.6). We denote a point on
one plane of such a pair by ko, and kb denotes the equivalent point on the
other plane of the pair. If ko belongs to the first BZ,as we will assume, then
kb cannot also belong to it, because a primitive unit cell contains only nonequivalent points. The value of E,(kb) is defined by the periodic continuation
and, as such, it is equal to E,(ko). In order for the periodically continued
func%iorrt o be continuous, thix value must coincide with the limiting value
of E,(k) if one approaches the point kb coming from the interior of the first
B Z . Thus, a necessary condition for continuity of the periodically continued
function is that the original function, defined only over the first B Z , obey
the boundary condition
lim E,(k)
k- k&
E,(ko),
(2.188)
where k is in the interior of the first B8. This condition is also siifFicient,
since the continuity of the periodically continued function E,(k) implies
that equation (2.188) holds. Therefore, the boundary condition (2.188) for
the function E,(k) defined over the first R E , and the continuity of the
periodically continued function E,(k) are just different descriptions of the
same property. Tising continuity of the function &(k) defined over all kspace, this property may be expressed in a more convenient way. This is the
reason why one continues the e n e r a bend fiinctions R,(k), which actually
115
have meaning only within the first BZ,periodically over the entire k-space.
One could also forego this, and use the condition (2.188) directly.
We will now prove that the energy bands E,(k) actually obey the boundary condition (2.188), and thus, are continuous everywhere as periodically
continued functions. To this end, we use the Schrodinger equation (2.178)
for the eigenvalues E,(k). The corresponding Bloch functions p,k(x) may be
expanded in plane waves with wavevectors k
K , where K is an arbitrary
reciprocal lattice vector, as follows:
(2.189)
Using this result, the Schrodinger equation (2.178) takes the form
(2.190)
The E,(k) are the eigenvalues of the matrix of coefficients of this system of
equations and their k-dependence i s determined by that of the matrix. The
latter is manifpstly continiioiis. That it also exhibits the periodicity of the
reciprocal lattice, one may verify as follows. Consider the coefficient matrix
of (2.190) at the point k K with K an arbitrary reciprocal lattice vector.
If one replaces the column and row indices Kand K , respcctively, by K+K
and K K, using the relation (K KIVJK
K) - (KIVIK), one
obtains the matrix at the original k-vertor. This implies that the eigenvalues
Ev(k t K) and E,(k) are identical, as stated.
The above proof employs only the lattice translation symmetry of the
potential V(X). The continuity of the periodically rontinued function E,(k)
in the entire reciprocal space is therefore an immediate consequence of the
periodicity of V(x) in the direct lattice. The continuity of the periodically
repeated energy band functions E,(k) is often very useful. It ensures, on the
one hand, that the E,(k) may be npproximctted fairly closely by a Fourier
series with relatively few terms. On the other hand, mathematical theorems
dealing with the types and numbers of extrema of periodic analytic functions
may also be applied. This i s particularly important for the theory of the
optical proprrties of semiconductors.
We return now to a question which was previously explored in section
2.3, namely the implications of point symmetry for band structure. This was
discussed above in the extended zone scheme, and symmetric k-points were
excluded. NOW we USP the reduced zone scheme description and concentrate
on symmetric k-vectors.
116
2.5.2
The first B Z , as the Wiper-Seitx cell of the reciprocal lattice, also cxhibits
the point synmelry of this lattice which, for its part, is the same as the point
bymmetry of the corresponding direct lattice. The latter statement follows
directly from the definition (2.123) of the reciprocal basis bl, bp, b3. The
first B Z of the fcc lattice has, therefore, the point symmetry of a cube, and
its point group is Oh(m3m). The energy bands likewise have a particular
point symmetry, namely that of the equivalent directions of the crystal. This
is expressed by quation (2.140), which underlies the extended zone scheme.
It may, however, be transferred immediately to the reduced scheme, so that
we may also write
&(k) = E v ( a k ) ,
(2.19 1)
117
degunrrucy
Y~UT
OJ
e n e r g y bands.
crcplk(x)
(2.192)
118
is quite remarkable. It holds not only for the energy bands of crystals,
but for the eigenvalues of any Hsmiltonisn in quantum mechanics. It is a
good example how a relatively abstract mathematical theory - the theory of
groups has immediate physical consequences.
~
119
2.5.3
120
(2.193)
(2.194)
With the requirement that E,(k) be a regular function at k,one may
approximate it in the neighborhood of k, by a Taylor expansion to second
121
order
(2.196)
IJsing this tensor, the expansion (2.195) of E,(k) may be written in the form
(2.197)
Thc tensor components cnter this expression in a manner similar to the
way the reciprocal mass rn-l enters the energy dispersion relation of a free
electron. Actually one can obtain the expression (2.154) for the energy of a
free electron from (2.197) by Erst taking the energy zero at E u ( k c ) which is
unimportant, and then by substituting
(2.198)
The components of the invrrsr tensor M , of M; accordingly have the dimension of a mass. One therefore calls M u the effectzve mas8 tensor. The
term effective refers to the fact that the electrons of a crystal are not free
elcctrons but arc subject to the influencc of thc periodic potential. T h e
presence of this potential mandates that the wavevector of the free electron
be repIared by the quasi-wavevwtot k, and that the quadratic dppendenre
of the energy on k become a more complicated dependence. Only in the
vicinity of critical points does the quadratic drprndenre persist, but with
coefficients M;$,
which differ from those of the flee electton. They are
eflectztw coefficients which involvc the effrct of the periodical potmtial.
In contrast to the free electron case, where the coeficients form a multiple
rn-l of the unit tensor 6,p, i.e. a scalar quantity, for the case of an electron
in a crystal they represent u tensor ML$ with non-vanishing off-diagonal
elements and general diagonal elements. An analogous statement holds for
the inverse, as the cffective mass tensor M,. Thus, in general, the effective
mass is a tensorial quantily. Therein the crystal potential manifests itself,
generally generating different mass values in different spatial directions. In
special cases one ran approximate the effertive mass by a scalar quantity
rn;. Then, the difference between the mass of the free electron and that of
the crystal electron reduces to the different magnitudes of the two masses.
z 22
crystals
The mathematical description of effective mass can be simpl%d by exploitMv:, with rcsppct to an exchange of its
indices a and 3, i.e. by using the relation
ing the symmetry of the tensor
(2.199)
This propat,y follows directly from the defining equation (2.196) for M&.
As is well-known from linear algebra, such a real symmetric tensor cau be
brought l o diagonal form by a coordinate transformation, in our case of the
being
components of the k-vector, to the principal axis system. With
the principal diagonal elements of M l l in this system, one has
ms'
The m;;' are real. Ifn general, they can take both positive and negative
values. The same applies to thc inverse quantities, the effective masses rnk
themselves. Taking k, as componenls of k in the principal axis system, we
have
(2.201)
If all three effective masses are positive, the energy function E w ( k ) has a
minimum at the critical point kc,and if all three are negative it is a maximum. If the three effective masses do not all have the same sign, then E , ( k )
has a saddle point.
lies on a 4Consider the particular case in which the critical point
fold symmetry axis of the first 3 2 . and that this is simultaneously one of
the principal axes. Then, for symmetry reasons, the two principal tensor
elements normal to this axis must be equal. With LY = 3 for the symmetrical
principal axis we thus have
(2.202)
mC1 = m t 2
The corresponding element for the symmetrical principal axis m:3 E m:,,in
general differs from mZI. If k,, however, lies on a symmetry center of the
first B Z , then all three elements are in fact identical, i.e.
(2 203)
For the frequentIy observed case of a symmetry point at the center of the
first BZ!i.e. with k, = 0, one gets the simple dispersion law
E,(k) = E v( 0) + -k2.
2m;
F12
(2.204)
123
This differs from that of a free electron only in that the effective mass rn;
appears instead of the free mass and that the energy at k = 0 assumes a
value E,(O) which depends on the band index and, therefore, cannot be set
equal to zero for all bands by changing the zero of energy.
In the following subsection we will again deal with band structure in its
general form E,(k). We will introduce the so called density of states that
encompasses essential information on band structure in just one function of
energy. Often, a knowledge of the density of states is enough to calculate
important macroscopic properties of semiconductor crystals.
2.5.4
Density of states
Definition
We first consider an arbitrary infinite oneelectron system. Its Hamiltonian
will be denoted by H , its eigenstates (disregarding spin) by li), and its energy
eigenvalues by Ei, i = 1 , 2 , .. . , 00. Periodicity of the eigenstates is assumed
with respect to a periodicity region of volume 51. The quantity
p ( E ) = - C 6 ( E - Ei).
(2.205)
O i
is called the density of states (DOS) of the system. We will show that this
designation is justified, i.e., that p ( E ) does represent the number of o n e
particle states of energy E per unit energy and volume. To this end we
integrate equation (2.205) over a small energy interval E < E' < E A E ,
obtaining
P(E)AE
=
FL
E+AE
dE'6(E' - E i ) .
(2.206)
JEE+AE
dE'6(E' - Ei) =
(2.207)
Thus, each state i whose energy lies between E and E A E contributes '1'
to the sum on i, with no contributions from all other states. The sum in
(2.206) equals, therefore, the number A z ( E )of states having energy in this
interval. Considering spin, one has
2
~ ( E ) A=E- A z ( E ) ,
51
(2.208)
124
p ( E ) = --Im
(2.209)
Tr
7T
where trace sytxho1 Tr denotes the sum over all diagonal elements with
respect to any basis set in Hihert space. If this set is identified with thr
eigmstatcs 12) of ff, expression (2.209) transforms immediately into (2.205).
We now consider the DOS p ( E ) defined by (2.205) for an ideal crystal.
The cigenstates li) are Rloch states Ivk), in this case, and the DOS p ( F )
therefore reads
p(E)-
2
C 6 ( E-
(2.210)
uk
The k-sum is over all points of the finely meshed net of Figure 2.4 which lie
within the first B Z . Because of the fineness of this network, the result of the
summation is almost identical to that of an integration. The substitution of
the s m by an integral is done in accordance with the prescription
c...=
-1
R
8T3
d3k ...
(2.211)
(2.212)
which is independent of the volume R of the system. The density of states
restricted to the u-th energy band, p u ( E ) ,is defined by the expression
(2.213)
and it differs from zero only for energy values for which the argument of the
6-function may brrome zero, i.r. for energies in the v-th band. Summing
over d l bands one again obtains the entire density of states
(2.214)
125
(2.215)
with df as element of the constant energy surface E,(k) -= E . The inteErand
l/IVkF,,(k)l is, apart from a factor TL, the reciprocal absolute value of the
group velocity of an electron in the Bloch state Ivkq). The smaller this
velocity is at a particular k-point on the constant energy surface or, in other
words, the longer an electron stays at it, the larger is its contribution to the
density of states. If (VkE,(k)l - 0, i.e. at a critical point, the contribution
is, formally, infinitely large. From this it follows that for energy values E
whose iso-energy surfaces contain critic81 points, singularities of the density
of states as a function of energy are to be expected. They are called van
Hove singularitiee.
(2.2 17)
126
1 [S(z - ZO)
E - EA - x 2 - y2 - z 2 ) = 2zo
+6(z+
ZO)]
(2.2
with
Zo
= ,/E - E , , ~- x 2 - y2 f o r E - E&
> 0.
(2.2
It follows that
where
B(E - Euo) =
1 f o r E > Euo
0 for
E < Euo
(2.221)
(2.222)
by the so-called density-of-states-mass m b . This may be seen immediately
if one reviews the calculation for the isotropic case. The anisotropic effective mass involves a change only upon substitution of the variables for the
components of the k-vector - the factors for the 3 components are different
and lead to the modification indicated in equation (2.222).
According to expression (2.220) the density of states of a parabolic band
with m*, > 0 exhibits square root-like behavior at the lower band edge, and
continues following a square root law up to infinitely large energies. The
latter property is a consequence of our untenable assumption of parabolic
dispersion up to the upper band edge and the replacement of this edge by
the point at infinity. In reality the band edge lies at finite energies, and the
density of states again falls off to 0 there. It has, qualitatively, the shape
shown by the dashed curve of Figure 2.14. The sudden square root-like
increase of the DOS at the band edge reflects the fact that the function
p(E) has singularities at these energies - the first derivative with respect to
E is 0 if one approaches the edge from the forbidden zone, and it is +CXJ if
one approaches it from the interior of the band. This is one of the already
mentioned van Hove singularities of the DOS. These singularities occur not
only at energies where a particular band E,(k) has a minimum, as in the
127
5-
-c
=
<
4-
3-
vl
2-
1-
0
-1
The square root enerw dependence of the density of states in the case of a
parabolic dispersion law may also be demonstrated in the following, more
vivid way (also see Figure 2.15). For simplicity we set E,o = 0. and omit
factors which do not depend on k or E. The n m b r r A Z of b m d states
with energy between E and E .f A E is the number of different k-points of
the finely meshed net which yield energy values in t,his interval. These kvalues lie in a thin spherical shell in k-space [see Figure 2.15) which, because
E o(. k2, has radius k~ 6: &.and thickness Ak which, since Ah' CK k&k,
is given by
Ak
0;
-AE.
fi
(2.223)
(2.224)
Since the density of the finely meshed net of k-points of the first BZ i s the
same everywhere, the number of k-points in the spherical shell is proportional to its volume AV. Hence AZ cy v%AE and
p(E)
o(.
4%.
(22 2 5)
128
(a)
(2.226)
2.5.5
-AE.
fi
(2.227)
Spin
Thus far, spin has been omitted from our discussion of the general properties
of stationary oneelectron states of crystals. It turns out. however, that spin
and spin-orbit interaction play an important role in most semiconductors.
Therefore, the question arises as to how the results derived above for scalar
wavefunctions change when electron states are no longer scalar but spinor
functions and the spin-orbit interaction H , is taken into account.
The starting point to address this question is the one-electron Schrodinger
equation (2.58) with spin, which may be written in the following short form
(2.228)
129
The first point we will consider is that of spatial symmetry in the presence
of spin.
Spatial symmetry
Inspecting the explicit form of the spin-orbit interaction operator ( b e y formula ( 2 . 5 6 ) ) , one readily recognizes that TKso has thc full symrwtry of the
crystal. Thus it colrimutes with all elements 9 of the space group. Since the
one
same holds for the spin free part H of the total Hamiltonian !i+
has
Iff
+ H,.
g] = 0
(2.229)
gave
4).
zti)
Second, we consider point symmetry operations u , whose action in tramforming t,hc components of a spinor is discussed in Appendix -4. The results
of Appendix A are applied here in the form
130
with
13 1
of the
small point groups of symmetry points and lines of the firat B Z of the fix lattice
for crystala with the diamond structure.
+ H,,
132
i)}
i),
(2.234)
To prove this assertion, one executes the time reversal operation on the
spin-orbit interaction operator H,, of equation (2.56) explicitly. The vector
a' then transforms into -a*, and p into -p, so that the net change in Hso is
the replacement of d by a'*. Employing the operator K , on the other hand,
the effect of timc reversal on H , , may be expressed in terms of the similarity
For the two expressions to be identical,
transformed operator KH,,K-'.
K must have the form given above in equation (2.234), apart from a phase
factor which remains undetermined. The wavefunction {cpx(x,
cpx(x,$)}
must be replaced by K{cpx(x,
cpx(x, $)} for the Schrodinger equation
(2.180) to remain valid.
The question, under what conditions time reversal symmetry includes
degeneracy of eigenfunctions not accounted for by spatial symmetry alone,
is treated in Appendix A in full generality. Here we deal only with a specjal
cp~(x,
case. We consider a non-symmetric wavevector k. Let {cpxk(x,
be an eigenfunction with energy Ex(k). If the point group of directions
contains the inversion, then the eigenfunction {cpx-k(x,
cpx-k(x,
corresponds to the same eigenvalue Ex(k) -= Ex(-k). The time reverse of
the Bloch function {cpx-k(x,
cpx-k(x, $)} likewise has energy Ex(k) and
wavevector k. It is linearly independent of {(Px~(x, cpxk(x, $)}, since
i),
i),
i),
i),
i),
i)}
i)}
i),
as one may easily show by evaluating the scalar product (here, one also has to
sum over the spin coordinate s ) . Thus, two linearly independent eigenstates
of the same energy have the wavevector k. Since k was chosen arbitrarily, it
2.5.6
133
There are many methods, quasi-analytic and numerical, for calculating the
band structure of crystals. In the following we will give an overview.
In band structure calculations one is confronted with two problems,
firstly with the formulation of the one-elcctron Schrodingcr equation, i.c.
with the determination of the effective periodic oneelectron potential V ( x )
of the crystal, and secondly with finding the eigenvalues and eigenfunctions
of that equation. The various methods of band structure calculation differ
in the manner in which thcsc two problcms are resolved. Hcre we will deal
mainly with the second problem, i.e. with the solution of the Schrodinger
equation whose potential is known. The first problem, the determination
of thc cffectivc oncclcctron potential V ( x ) , has in principle brcn solved
in section 2.2, where we dealt with the one-electron approximation for the
many-electron system of a crystal. Here, we only address some further dctails
of this problem.
134
The solution procedures can be divided into three groups, firstly! the matrix
methods, secondly, the cell methods, and thirdly, the muffin-tin methods.
Matrix methods
In the application of matrix methods one represents Bloch type eigenfunctions of a given quasi-wavevector k in a particular basis set consisting of a
finite number of functions of the Bloch type. The Hamiltonian of the crystal
is thereby represented by a k-dependent complex Hermitian matrix which
135
has as many columns and rows as the basis set has functions per k-vector.
The calculation of band structure is thus traced back to the determination
of the eigenvalues and eigenvectors of a k-dependent Hamiltonian matrix.
The basis sets employed differ by the number and kinds of functions
they contain. One would like to manage with the fewest possible functions,
to minimize the numerical effort. The price for this is a loss of precision,
because with fewer basis vectors the eigenfunctions are necessarily approximated more crudely than with a larger number - in the extreme, one needs
an i n h i t e number of functions. For a fixed number of basis functions the
level of precision achieved is higher for basis functions which are better adjusted to represent the eigenstates. The following basis functions prove to
be of practical use:
-Plane waves Iki-K) with k being a vector of the first BZ and K a reciprocal
lattice vector. This constitutes a generalization of the nearly free electron
approximation.
Bloch sums of atomic orbitals, also called LCAO's (Lznear Combznatzons
of Atomzc Orbatuls), or of other localized functions. This includes the tight
binding met hod.
(2.236)
holds. The OPW's must have the form
+ K ) Ick)
(2.237)
in order to satisfy the orthogonality condition (2.236). Making the expansion functions orthogonal to the core eigenstates accounts for the fact that
136
The idea underlying the pseudopotential method is to transfer the core state
orthogonality term in equation (2.237) from the OPW's to the one-electron
Hamiltonian H of the crystal. This transfer is done as follows. Consider
the core electrons. which we have hitherto taken jointly with the atomic
nuclei to form the cores, to be independent particles, just like the valence
electrons. This means that the potential created by the core electrons is no
longer included in the core potential Vc, but is added to the eEective o n e
electron potential 1 ' ~ 15-cof the electron-electron interaction. Because of
this reinterpretation of the oneelectron Hamiltonian H , its eigenstates now
also include core states, for which we have
(2.238)
H i c k ) = E,lck).
The expansion of
~$m
with respect to the OPW's
reads
Removal of the core states from the expansion functions mandates changing
from the eigenfunctions $ato other functions v p i that have the same
expansion coefficients, but plane waves as expansion functions:
(2.241)
~$g~
137
(2.242)
H P S= H
+ E ( E - E,)lck)(ckl
(2.243)
The Hamiltonian HPS is a well-defined linear operator, although it is nonlocal and depends on the eigenvalue E itself. It is called a pseudo-Hamiltonian The
are termed pseudo-wavefunctions. Each eigenvalue of HP" is
simultaneously an eigenvalue of H , but the reverse statement does not hold.
While core levels E, are eigenvalues of H , they are not also eigenvalues of
HPS since the pseudo-wavefunctions of the core states vanish.
The pseudo-Hamiltonian HPS may be written in a form which clarifies its
meaning. To begin with one revokes the re-interpretation of the Hamiltonian
H , i.e. considers the core electrons no longer as independent particles which
enter the effective one-electron potential VH Vxc of the electron-electron
interaction, but includes them again into the atomic cores making them
contributors to the core potential V,. Then HPS takes the form
$gk
H P s = -p2+ v
2m
(2.244)
The last two terms in this expression jointly constitute the so-called pseudopotential V,p"
(2.245)
The second term on the right is significant only in the core regions. There,
it is preferentially positive, indicating that it repels valence and conduction
band electrons away from the cores. One can show that it largely compensates the variation of the core potential V, in these regions. Because of this,
the pseudopotential V,p" is relatively smooth throughout the cores. It can
he made even smoother if one exploits a property of V,p" which has not yet
heen discussed. We refer to the non-uniqueness of the repulsive part of V,p"
in equation (2.245). If there the bra-vectors (E - E,)(ckl in (2.245) are r e
placed by completely arbitrary functions (while keeping the ket-vectors Ick)
unchanged), the eigenvalues of the pseudo-Schrodinger equation (2.242) remain the same, only the pseudo-wavefunctions change. This freedom may be
used to make the pseudopotential still smoother, and also fulfill other requirements, such as, for example, norm-conservation of the pseudo-wavefunctions.
The smoothness of the pseudoptential makes it possible to restrict the planewave expansion (2.241) of the pseudo-wavefunction to terms having small
reciprocal lattice vectors K. Consequently, the representation matrix of the
pseudo-Hamiltonian HPS is small and easy to diagonalize. This is the reason
138
These methods are 3-dimensional generalizations of the method of matching conditions usually employed in solving thr Schrodinger equation for the
square well potential and other 1-dimensional potentials. In cell methods?
one first determines linearly independent solutiona of the Schrodinger equation within a primitive unit cell for arbitrary energies. Forming Bloch sums
with them, one constructs the solution for the total crystal. The require
ments that these functions, and their first derivatives, be continuous at the
boundaries of the unit cells determines the energy eigenvalues and eigenfunctiuns. This method suffers from the arbitrariness of t,lie choice of the unit
cells and the difficulty of satiseying the boundary conditions over the whole
surface, i.s. at an infinite number of points. The most natural choice of unit
cell is the Wigner-Seitz cell. A further development, of cell methods lies, in
a sense, in the mufin-tin methods.
Muf&n-tin methods
In the present method one delimits spheres around the atoms, and leaves
some empty space around them. The crystal then looks something like
139
140
2.6
2.6.1
Fundamentals
Atomic Orbitals
The basis functions of the TB approximation are the one-particle eigenfunctions of the valence electrons of the free atoms, more strictly, of the atoms
composing the crystal under consideration. These eigenfunctions are called
141
atomic oibatnls. The spinor character of the orbitals may be taken into account, but we will omit it here brevity. The mobt important property of
the orhit&, which turns out to be decisive for the TB approximation, is
their spatial symmetry. The latter i s determined by the symmetry of the
Hartree o i HartrecFock potentials of the atomic cores. These potentials are
isotropic if all core shells are fully populated by electrons. In the case of
atoms forming diamond and zincblende type semiconductors, this condition
is always satisfirul. Thus we may assume isotropic core potentials, and the
atomic orbitals of given energy eigcnvalues form basis sets of irrediicible representations of the full orthogonal symmetry group. These representations
are characterizd by an angular momentum quantum number 1 which may
assume all non-negative integral values. The irreducible representation of a
given quantum number 1 is (21 1)-fold degenerate. The 21 1 basis functions are distinguished by the magnetic quantum niimbcr 7n which takes all
integei values between - 1 and + l . The energy spectrum of the free atom
is degenerate with respect to m. Foi the hydrogen atom there is also a d c
generacy with respect to 1. This is not due to the spatial symmetry of the
potcntial but to its purr Coulomb form. Here this additional degeneracy
may not be assumed. For each value o l lhe quantum number 1 one has an
infinite set of different energy eigenvalues. These are distinguished by the
mdin quantum number n which may take all intcgcr values horn 1 1 to 00.
In this way the energy levels E,l of an atom depend on thc two yuanturn
numbers r~ and 1 , and the corresponding eigenfunctions dnlm(x)depend on
thrw quantum numbers r ~I ,, m. 'l'ht. eigcnfunctions &,jrn(x) can be written
as products of the radial wavefunctions &(I x I) and spherical harmonics
Km(xl I x I),
(2.246)
Here, it is assumed that the tttorriir core is located iit the coordinate origin
x = 0. A wavefunction with the quantum number I = 0 is called an s-orbital,
one with 1 = 1 a p-orbital, with 1 - 2 a d-orbital etc.
In order to represent the eigenstates of the valence electronb: of a crystal,
one needs, rigorously speaking, nll orbitals of the cores of its atoms since
only the totality of all orbitals forms a complete basis set in Hilbert space.
However, not all of these contribute in an essential manner. 'l'he largest
contributions are to be expccted from orbitals forming the valence shells of
the free atoms. Within the TB approximation one takes only these orbitals
into account, This corresponds to a perturbation-theoretic treatment of the
Hamiltonian matrix with respect to the atomic orbital basis; only matrix
clcmerits between valence orbitals are considered while those involving other
orbitals are neglected. For the elemental semiconductors of the fourth group
of the periodic table the valence shell orbitals are formed by the four ns-
142
The latter are also energy eigenfunctions because of the degeneracy of qbn1l
aiid 0 ~ 1 - 1 with respect to the magnetic quantum number m. For the sake
of uniformity, one sets
143
Figure 2.16: Polar diagrams of the atomic s-and p- orbitals in Cartesian representation.
Two different orbitals & f j f R ( X ) and & ~ R ( x ) with identical values of R and
R as well 8s of j and j , but different values of a and a, are also orthogonal
to each other. For R f R or j # j , i.e. for orbitals at different centers, no such orthogonality exists. Although the two orbitals are localized
in different spatial regions, and the integral over the product of the two, the
so-callcd ovcrlap integral, turns out to be relatively small, it may not be n e
glected because its influence on the energy eigenvalues is of the same order
of magnitude as the matrix elements of the Hamiltonian between orbitals
at different centers. The latter elements are essential because they are r e
sponsible for the bonding between atoms in a crystal and for the splitting of
the atomic energy levels into bands. The non-orthogonality overlap integrals
must therefore also be taken into account. This may be done directly, by
writing down and solving the eigenvalue problem for the crystal Hamiltonian in the non-orthogonal basis set of the atomic orbitals. This procedure
144
is, Iiowcvcr, quite inconvenient becaiise the matrix of overlap integrals has
to be calculated explicitly and diagonalized together with the Hamiltonian
matrix. It is more useful to employ a set of orthoganalizd orbitals by forming suitable h e a r combinations of the q a j ~ ( x )Here,
.
'suitable' means that
the new orbitals should have t,he same spatial symmetries a s thc original
atomic nrhitrals Q U j ~ ( xbecause
),
these symmetries are the essential properties that allow the matrix elements of the Ilamiltonian to be reduced to a few
const.ants. Consequently, the new orbitals must, likewise form basis sets of
irreducible representations of the full orthogonal group, each set heing chara c t w i d by a particular angular morrieriturri quantum miniber 1. That there
are indeed linear combinations of atomic orbitals possessing these properties,
forms the content of Liimdin,'<sthcowm (we, e.g., Slater, Koster, 1954). We
will iise this theorem below. The orbitals m U j ~ ( xwhich
)!
thus far have been
taken as ordinary atomic orbitals, will henceforth be understood as atomic
) now,
orbitals in the sense of Ldwdin's theorem. The orbitals q a j ~ ( xare
of course, no longer given by the expressions [2.246) to (2.250), but their
indices n E nlm will retain the meaning they previously had for the ordinary atomic orbitals. hlthough explicit expressions for the G w d i n orbitals
& j ~ ( x )can in principle be providd, they will not be given here since thcy
are nut, n m d d if one proceeds in thc manner to be discussed below. The
Lowdin orbitals arc, by definition, aIso orthogonal for different centers, such
that
(&!jrR(
I O a j R ) = bo'abj'$R'R.
(2.253)
(2.254)
(2.255)
The sum over R extends over all lattice points of the periodicity region. By
means of (2.254) one may readily verify that
(2.2 56)
145
I #ajk)
6aa6j1j6k1k.
(2.257)
In the TB approximat,ion, the eigenfunctions C p v k ( x ) of the Schrodinger equation are written as linear combinations
cPvlc(x) = x ( J j k I c P v k ) c P a j k ( x )
(2.258)
jQ
of Bloch sums
x(ajk I H
(2.259)
ja
where the matrix elements of the Hamiltonian are given by the expressions
with
(aj0 I H
I ajR) =
d 3 x ~ ! J : (-x? ~ ) H ~ , I ( xR - 5,).
(2.261)
146
I nslRt) = pns,
t = 1 , 2 , .. . 6 .
(2.262)
(2.263)
Here, we have employed the fact that, for reasons of symmetry, all six neighbor atoms give rise to the same value of the integral (2.261). The value of ,BnS
depends on the overlap of the ns-orbitals localized at adjacent atoms. With
increasing distance between the atoms, &, approaches 0. The value of E,,,
in a crude approximation, may be identified with the energy of the ns-level of
the free atom. In terms of E,, and Pns, the matrix element (nslk IH I nslk)
of (2.260) is given by
(nslk I H I n s l k ) = ens
+ &sxeikRt.
(2.264)
t=l
It follows that the energy eigenvalues Ens(k) of the Schrodinger equation are
Ens(k) = ern
(2.265)
147
ks
P3s
P 2s
PlS
-IT
-
Wavevector
l-r
-
(O,O,k,)
but several. This means that the valence electrons in the free atom do not
occupy only one 5 level, but several s-levels ens differing in the value of n. To
justify the application of the results obtained above in the present case, the
matrix elements of H between s-orbitals having different values of n must be
negligibly small. We assume this to be true. In addition, we suppose that ens
is negative for all n, and that the sign of pns alternates, such that for n = 1
it is taken to be negative, for n - 2 positive, for n - 3 negative etc. This
behavior reflects the differing numbers of nodes of thc atomic wavefiinctions
for different values of n. The absolute values of Pns are expected to increase
with growing n , corresponding to the larger values which the ns-orbitals
with larger n have at the nearest neighbor aloms. The separation between
adjacent energy levels cn8 should, however, always be larger than 4Pns.
If the above conditions are met, the energy band dispersion along the line
(0, 0, kz) of the first B Z of the simple cubic lattice under consideration, has
the form shown in Figurr 2.17. Each level ens of the free atom gives rise to
an energy band of the crystal. The width of these bands amounts to 4&.,
thus increasing with increasing n. The bands are separated by forbidden
energy regions. Their widths decrease with growing n. If the distance between nearest neighbor atoms increaees, then the parameters fins of equation
(2.263) decrease because of decreasing ovrrlap of the orbitals, as has been
148
pointed out above. The same holds for the widths of the energy bands, they
become narrower if the distance between neighboring atoms grows. They
approach the discrete levels ern of free atoms as this distance becomes infinitely large, corresponding to ,&, = 0. In the latter case. electron states
with different values of the wavevector component k, have the same energy,
ens. These levels are, therefore, highly degenerate. If the infinitely remote
atoms again approach each other, then the ,&,-values become finite. because
of the onset of overlap of neighboring orbitals. Correspondingly, the degeneracy of electron states with different kz is removed. and the discrete levels
of the free atoms spread into bands. The TB approximation quite naturally
explains, in this way, how discrete energy levels of the free atoms transform
into energy bands of the crystal. The gaps between the bands occur naturally in this approach, since the discrete atomic levels are separated by
energy gaps from the outset. This stands in contrast to the approximation
of nearly free electrons, where the occurrence of energy gaps calls for an
explanation. On the other hand, 'bandwidth', in the form of the infinitely
broad energy continuum of the free electron. is present at the outset in the
latter approach. The gaps induced into the energy continuum were seen to
arise because of the strong perturbation of plane wave states by the periodic lattice array of atomic coTes fox wavevectors on the Bragg reflection
planes. Comparison of the two approximation procedures reveals the difference between the underlying concepts - the TB approximation emphasizes
the atoms and the short-range ordered complexes of the crystal. while the
approximation of nearly free electrons focuses on the crystal as a whole and
the long-range ordering of the atoms. Such comparison also shows that the
short-range and long-range order concepts are equivalent in the sense that
they result in the same characteristic features of the electronic structure of
crystals -both concepts predict the existence of energy bands separated by
gaps. Figure 2.18 depicts the manner in which the bands and gaps arise in
the two approximations.
2.6.2
Semiconductors of the diamond and zincblende type are tetrahedrally coordinate cubic crystals with two atoms per primitive unit cell and four valence
shell orbitals per atom, among them one s-orbital([ = 0) and three porbitals
( I = 1). The application of the TB method to this specid case is of particular importance. It will be developed in the present subsection. Only nearest
neighbor interactions will be taken into account because this introduces considerable simplification and nevertheless gives results of reasonable accuracy.
For the p-orbitah. we choose the Cartesian form, so that the orbital index a
takes the values ns,nz,ny and n z . The Cartesian components I , y r z refer
149
Figure 2.18: Illustrat,ion of the origin of energy bands and gaps in the enerw
spectrum of a crystal.
to the cubic crystal axes. The main quantum number n will be suppressed
below because it may assume only one value here for each of the t,wo atoms,
although not necessarily the same. As a first step we determine the 8 x 8Hamiltonian matrix for diamond type crystals. Later: it will be generalized
to zincblende type structures.
Hamiltonian matrix
We start with matrix elements bet,ween orbitals at equivalent atoms.
Matrix elements between orbitals at equivalent atoms
150
(2.266)
(ujk I H
(2.267)
t=1
The values of j and j' are complementary to each other because the nearest
neighbor atoms lie in the other respective sublattice. For j = 1 one has
j' = 2, and for j = 2 then j' = 1. We will restrict ourselves to the first case,
i.e. j = 1,j' = 2, because the second may determined from the first with
minor changes. The four nearest neighbors of a 1-atom are located in the
primitive unit cells at
R1 = 0,
R2 = -al,
R 3 = -a2,
R 4 = -a3,
(2.268)
(see Figure 2.19). The calculation of the matrix elements (a10 I H I a'2Rt)
between the different Cartesian orbitals and the four different values of Rt is
15 1
24
Figure 2.19: Atom of sublattice 1 and its four nearat neighbors in sublattice 2.
somewhat laborious. It will be carried out in several (five) steps. In the first
step we determine the matrices (a10 I H I a2Rt) between spherical orbitals.
I Im20) = & , , , ( h l O
1 N 1 lm20)
BmmrV~ppn
12 ,
(2.269)
152
neighbor dirwtions. For the second-nearest neighbor directions, the symmetry of H is smaller. Rwauw of that, the matrix elements (alk I H I a2k)
of equation (2.260) are, in general, non-diagonal with respect to m, m if the
R-sum is extended to the swond-warest or more remote atoms. The diagonality holds, however, in an approximate sense, which may be seen as foHows.
The periodic potential V(x) in H represents a sum &,,,, vji,(x - R - i y )
of potential contributions of all the individual atomic cores of the crystal.
The matrix elements of H between orbitals at different renterrs jR and JR
thereby decompose into bums over all centers JIR. The largest contributions will arise from t a m s where the center index jR coincides either with
1R or with JR. If one considers only such twecenter terms and neglects
all threecenter contributions, then the integrand of the matrix elements
(Im10 I H I lm20) has the full axial symmetry, so that these dements
become diagonal with respect to m, m.
The hermiticily of the Hamiltonian and the particular form of the wavefunctions in (2.252) lcsd t o the relation
(lrn10 I H
(2.270)
(0010
(2.271)
(0010
(1010
are independent of each other: and the two elements
(1110 I H
(2.272)
are identicaL These elements are illustrated in Figure 2.20. This figure also
Since the atomic orbitals under consideration are those of bound states, and
since the eigenvalues of the Hamiltonian for bound orbitals are generally
negative, one may expect negative values for k<3n, Vm, and positive ones
for Ifaw, IT&,. Taking account of the strength of the overlap of the orbitals
153
12
v s su
21
vuG
-=
vssu
21
=V p p u
=Vppcr
21,
VPPT
154
in Figure 2.20, one can conclude that the absolute values of these elements
should obey the relations
V,
I.
(2.274)
The matrix elements (lm10 I H I ImflO)of H with respect to spherical orbitals calculated above will be used to derive, in the second step, the matrix
elements of this operator with respect to the Cartesian orbitals. The z-axis
of the Cartesian coordinate system is taken t o be the same as that of the
spherical coordinate system used above. This means that the z - ax is points
in the direction of the connecting line between atom 10 and atom 21. The
corresponding 2- and y-axes lie in the plane normal to the z-axis, apart from
an irrelevant rotation about this axis. The pair-related Cartesian coordinate
system thus defined differs from the formerly introduced crystal-related system which is given by the three cubic crystal axes. The coordinates in the
pair-related Bystem will be denoted by 2 , 5,2. The pair-related ;-orbital coincides with the 8- orbital with respect to the cubic-axes system. In terms
of this notation, the matrix elements of H between Cartesian orbitals in
the pair-related system read (310 I H I 3.20),(3.10 I H I 120),(3.10I H I
520), (3.10 I H I .520),(110 I H I 120), (110 I H I 520) etc. Since the C a r t e
sian orbitals are defined in terms of spherical orbitals by equations (2.249),
(2.250), the matrix elements between Cartesian orbitals can also be related
to those between spherical orbitals. The corresponding relations are given
below. Elements which are complex conjugates due to hermiticity of the
Hamiltonian, such as (310 I H I 220) and (120 1 H I ;lo), are listed only
once. The relations read
(310 I H I 3.20) = VSScr,
(2.275)
155
To this end, the Cartesian orbitals 2,y, z related to the cubic-axes system
must be expressed in terms of the pair-related Cartesian orbitals ?,c,Z.
To determine this relation, we consider a rotation which transforms the
crystal-related axes system into the pair-related one. The transformation is
characterized by Euler angles $, 6 and 'p. Since the orientation of the pairrelated system is defined only up to an arbitrary rotation about the Z-axis,
the Euler angle 'p may, without any loss of generality, be set equal to zero. As
noted in Appendix A, the rotation matrix A of equation (A.31) transforms
the coordinates before rotation into the rotated one. The basis vectors are
transformed by the transposed matrix, which, in the present case of rotation,
is also the inverse matrix. Since the 2 ,y, z and ?,y, Z are understood here
in the sense of basis vectors, we have
(:)
cos @
= (sin@
- cos 6 sin $
sin 0 sin $
c o ~ e c o ~
-sinecos$)
~
sin 0
cos e
(i).
(2.276)
Below, we will see that the direction cosines (PI,q1, 71) of the pair-related
2 -axis with respect to the crystal-related cubic z-axis play an important
role. These are the elements of the third column of the rotation matrix in
equation (2.276), i.e.
p l = sin 0 sin $,
q1 =
- sin 6 cos $,
r1
= cos 0.
(2.277)
156
These relations are valid for matrix elements involving the two nearest neighbor atoms belonging to the same unit cell at R = 0. From these elements
we may determine the elements with the nearest neighbor atoms of different
unit cells. This will be done in the next step.
iv) fourth step.
The vectors pointing from the central 1-atom to the four nearest neighbor
atoms will be denoted by dt, t = 1,2,3,4, such that dt = Rt
- TI.
For the diamond structure, the two sublattices are displaced with respect to
each other by the vector (a/4)(1,1,1). Using equation (2.268) for Rt and
the explicit form of the primitive lattice vectors of the face-centered cubic
lattice, we obtain
dl =
a a - a(l,
1, l ) , d2 = -(1,1,1),d3 = -(l, 1,I), d4 = -(l, 1,l).
4
4
4
4
U
(2.279)
The elements (a10 I H I a2Rt) determined above are used to calculate the
k-dependent matrix elements (alk I H I a2k) between Bloch sums. For
complex conjugate elements, again, only one relation will be given. With
the help of equation (2.267), we obtain
4
(slk I H
(slk 1 H
(2.280)
t=l
4
t=1
157
The seven different t-sums which enter t,he matrix elements (2.280) may be
reduced to just four because of the obvious relations ptpt = r t , p t r t = qt. and
qtrt = p t . The four independent sums are
158
9l(k) = eikdi
+ ,ik.dz +
,ik.d3
.&.dz
+ .ik.ds
,ik.d4,
(2.282)
- .ik.d4
- ,ik.dn:
+ ,ik.d4
159
1 1 1 1 1 1 1;1
type
semiconductors (in
~i
Ge
6s
EP
VsSa
V v
vma
-17.52
-8.97
-4.50
5.91
10.41
-13.55
-6.52
-1.93
2.54
4.47
-14.38
-6.36
-1.79
2.36
4.15
-2.60
-1.04
m
i
-1.38
1.92
1.68
2.20
-0.55
160
Table 2.9: Intra-atomic TB parameters (in eV) to be used in conjunction with the
universal inter-atomic TB parameters of Table 2.8. (Source same as in Table 2.8.)
qltm.
161
tors. They represent atomic s- and p-level energies which deviate somewhat
horn the energy levels given in Table 2.1.
A k-point of prtictilar interest is the 32 center k = 0. Here, the
eigenvalues and eigenfunctions of the Hamilton matrix (2.283) may br o b
tained ~ I closed
I
analytic form. The results me the following 8 eigenvalues
&(a). i = 1,2,. . . , 8 .
1 EzOj = ( I / f i ) ( O ,
(i/&j[o,o,o, i,o.o,o,ij,
(2.2~17)
E s ( 0 ) - &;7(0).
For the pmitinn of the &(O) lcvcl, there are still three pussibilitks (important malerials to whirl1 the three possible cases apply are listed alongside
the cases), namely :
i) &(0]
>
&(O)
Ge,
These relations Inem that the E'c;(O)-levelmoves down with respect to the
other two levels as the size of the atoms increases.
162
It turns out that the ordering of the eight energy bands at k = 0 remains
the same over the entire first BZ. This is important because the positions
of the energy bands relative to each other determine the likelihood of their
population by electrons. As already mentioned at the beginning of this
section, not all of the bands are expected to be populated, just as the s- and
p-levels of the free atom whose orbitals were used as basis functions were not
completely filled. Electron population of the ground state of the crystal, i.e.
at temperature T = 0 , may be obtained as follows:
For a simple band, each k-value corresponds to 2 eigenstates of opposite
spin. For a periodicity region of volume Q = G3Ro, the first B Z contains,
as does every primitive unit cell of reciprocal space, G3 allowed k -values
(see section 2.3). A simple energy band therefore has 2 x G3 states. Four
such bands are necessary to host the (2 x 4) G3 = 8G3 valence electrons
of a periodicity region. In the ground state of the crystal, therefore, the
four lowest bands are populated, and the four highest bands are empty.
This means that in the case of C , Si and Ge, E l ( k ) , E2(k), E3(k), E4(k) are
the populated valence bands, and Es(k),&(k), E7(k), Es(k) are the empty
conduction bands. For a - Sn, El(k) and &(k) form valence bands, together
with two of the three bands E2(k), E3(k), E4(k). The remaining band of the
three is a conduction band. Since it is degenerate at k = 0 with the highest
valence band, the energy gap of a-Sn vanishes.
The above band assignment allows us to determine the symmetry of the
valence and conduction band states at r.
We know that degenerate eigenst ate8 of the crystal Hamiltonian having the
same energy value form a set of basis functions for an irreducible representation of the small point group for the wavevector k. For k = 0 this
group coincide8 with the full point group of cquivalent crystal directions,
whch, here, is oh, The dimensions of the irreducible representations are
the same as the degrres of degeneracy of the corresponding energy levels. Therefore, the eigenfunctions for & ( 0 ) and &(O) each belong to 1dimensional representations, and those of E2(0) = &(o) = E4(0) and
E s ( 0 ) = E s ( 0 ) = E r ( 0 ) each belong to 3-dimensional representations. According to equation (2.286), the deepest valence bend level Ei(0) ha5 the
eigenfunction (l/fi)[Is10)+ I s20)]. In order to determine its transformation properties under the operations of the point group Oh, it is useful
to decompose oh into two parts, firstly, the tetrahedron subgroup I>containing only elements which are not involved with an exchange of the two
sublattices 1 and 2, and secondly, the remainder of oh which is composed of
all elemfxttb of T d multiplied by the inversion. Each of the elements of the
second part of 01, exchanges the two sublattices. For brevity, we will term
163
the latter elements 'exchanging', and the former ones 'non-exchanging'. The
eigenfunction (l/&)[ls 1 0 ) + 1 sZO)] for E l ( 0 ) transforms into itself under
the action of both types of elements, thus it belongs to the unit representasl0)- I s 2 0 ) ] of the Es(O)-level, the
tion rl. For the eigenfunction (l/fi){l
transformation into itself occurs only under the action of non-exchanging elements, while a factor -1 is generated in the case of exchanging elements. It
follows from the character table of the irreducible representations of Oh given
in Appendix A, that this transformation corresponds to the representation
r;.
r&,
164
r,
6,
6, r, -
r;, -
r,
conduction
bands
r;,
6 diamond
structure
valence
bands
zinblende structure
Figure 2.21: Ordering of energy bands at the center 1 of the fint B Z for semiconductors of diamond and zincblende type.
in that at the lower right. Secondly, the relation V$$
-V:$
G -V3w of
equation (2.273) cannot be used in the zincblende case because it rests on
the chemical identity of lhe two atoms of A unit cell. For the matrix element
l 2 , the s-state belongs to a 1 atom, and the p-state to a 2-atom, while for
the s-state belongs to a 2-atom, and the p-state to a 1-atom. Therefore,
thr matrix element, V& is given by an independent constant
rather
than by -V:$,
(the parameters iSwin Table 2.8 are associated with
by
meany of equation (2.284). The other excliangr relations in (2.284) remain
valid because they refer to matrix elements whose orbitals at the two different
atoms belong to the same state. With these two changes, in equation (2.283)
the TB Hamiltonian matrix for zincblende type semiconductors becomes
z;
-espc
csF
165
with ESP= (l/fi)c&,. TTsing this matrix, the band structure and the
eigenstates of zincblende type semiconductors may be calculated. YIPspatial symmetry of the eigcnstatcs a t thr R Z rentpr is similar to thtlf, which
was found for diamond type crystals above, except that the point group Ott
has to he r ~ p l a r dby the tetrahedron group T d . The wprPsPntaliun ri, of
Oh thereby becomes the representation r15 of T d , I
'i is replaced by rl, and
r1.5 remains ri5. In contrast to diamond type crystals, one has practically
only one energetic ordering of the conduction bands here - the rl-band being
the deepest.
2.6.3
Once the bald stnicture is known, the total energy of the valence electrons
ran be calculated. Formula (2.54) of section 2 . 1 indicates how this may be
done for the ground state of the crystal. One hw to sum the energy levels of
all valence electrons! and subsrquently remove the doubly counted electronelectron Coulomb interaction energy from this sum. The total energy of the
vakncr elwtrons of the crystal or, strictly speaking, its deviation from the
total energy of the valence electrons of the free atoms which were brought
together to form the crystal, defines the energy gain due t o chemical bonding.
known as coh.esion energy of the crystal. This definition is reasonable because
the valence elect,rons are the only parts of the atoms whose stat= change
when the crystal is formed.
In order to obtain the total e n e r a in closed analytic form, one needs
explicit mathematical expressions for the valence band energies st. all paints
k of the f i d BZ. However, the TB approximation in the form developed
above produces such expressions only at particular symmetry points. To
find them everywhere, one must introduce further simplifications. A starting
point for this is a formulation of the 'I'B approximation which employs certain
linear combinations of 8- and p orbitals as basis functions, rather than the
atomic orhihals of definite angular momentum quantum numbers I ! which
were used above. These linear combinations are callcd sp'--hgrhr%d orbitals.
In contrast to the atomic orbitals, they are not energy eigenfunctions of the
atoms.
sp3-hybrid orbitals
166
23
23
23
23
167
These orbitals point to an atom of type 1. The following four orbitals are
directed to the 1-atom in the unit cell at R: orbital 1 h12Rl+ R) at atom 21,
I h 2 2 R 2 f R) at atom 22, I h32R3+ R)at atom 23,and I h42&+ R)at atom
24. Hybrid orbitals at the same center are orthonormalized with respect to
each other, as the s- and porbitals from which they are constructed. If the
latter are understood in the sense of Liiwdin, orthogonality also holds for
hybrid orbitals at different centers. For each unit cell there are 8 associated
hybrid orbitals 1 &jR)with t = 1,2,3,4and j = 1,2, as there are 8 atomic
orbitals for the two free atoms of a unit cell. From the hybrid orbitals of a
given sublattice one may form Bloch sums I htjk) by means of the relation
(2.291)
in complete analogy to the Bloch sums q b ~ involving
k
atomic orbitals in equation (2.255). Due t o the definitions (2.289) and (2.290) of the hybrid orbitals
in terms of atomic orbitals, the Bloch sums I h t j k ) may be thought to arise
~ atomic orbitals by means of a unitary transforfrom the Bloch sums # a 3 of
mation. The same holds for the Hamiltonian matrix ( h t j k I H I htrg'k) with
respect to the basis I htj k); this matrix may be understood as arising horn
the above mentioned unitary transformation of the known Hamiltonian matrix ( a j k I H I a'j'k) with respect to the basis daJk. As such, it has the same
eigenvalues and eigenfunctions as the original matrix. The implementation
of the TB method by means of hybrid orbitals instead of atomic orbitals
having a definite angular momentum quantum number is therefore nothing
but the solution of the same eigenvalue problem in another representation.
This statement holds, of course, only as long as equivalent approximations
are made in the two representations. However, the hybrid orbital representation is well s u i t e d for further approximations, beyond those already made
earlier. These approximations facilitate the derivation of simple analytical
168
expressions for the eigenvalues and eigenvectors which may be used to explicitly determine the total energy of the valence electrons of a crystal in
closed analytical form.
Hamiltonian in hybrid-orbital representation
Here, we describe the most important additional approximation available in
hybrid orbital representation, which utilizes the fact that those particular
hybrid orbitals at nearest neighbor atoms which point toward one another
will also overlap one another more strongly than all others. In consequence
of this, the Hamiltonian matrix elements ( h t l R I H I ht2 R
t R) between
these orbitals will be the largest. Approximately, one need only consider
these elements, while all others may be neglected. The former elements do
not depend on t and R, as can be seen from the explicit expressions for the
hybrid orbitals (2.289), (2.290). Their common value, denoted by V2, may
be determined from (2.289), (2.290) and equations (2.271) to (2.273) as
1 ht,jR) -
- ( c S - E ~ ,)
# t'
(2.294)
(htjRI H
I h y R ) = -(~g
4
-t-
kp).
(2.295)
169
In writing down the Bamiltonian matrix (htjk I H I hrjk) in the hybridorbital representation, we arrange the eight basis functions in the sequence
I h l l k ) , I hzlk), I h s W , I h 4 W , I h l W , I h Z W , I h32k), I h 4 W . Then
the matrix ( h t j k I H I httjk) takes the form
(2.296)
Table 2.10: Hybrid matrix elements calculated from the TB parameters in Table
2.7 (in eV).
Ge
-8.27
-1.76
-2.98
-8.37
-2.01
-2.76
170
wovevector
Figure 2.23: Evolution OF the energy bards of semiconductors with the diamond
slructura within the TB approximation The right-hand part shows the band struct,ure of Si
171
turns out that this task may even be carried out analytically if a suitable
additional approximation, the so-called bond orbital approsirnation. is made.
Bond orbital approximation
To introduce this approximation, we review the already solved problem of
diagonalizing the Hamiltonian (2.296). but procwd in a sompwhat different
way, The diagonalization will now bt. carried out in two steps. In the first
step, the eigenvalues and rigenvectors of the matrix (2.296) are calculated
without taking account of the VI-terms, i.e. temporarily setting Vl to zero.
As may he scen from formula (2.297). this leads to the reduction of the
enerffi bands to the two dispersionless levels givw by
O,O),
( l / f i ) [ l ,D,O,O,e;,O,
(2.299)
(htlk 1 b4k) = ( l / J z ) ( O , 0: O , l , O , O , O , e : ) ,
E,
1,0,0,0:
-ez,
1.0,0,0,
0),
-.;I.
172
I kR)= - 11 h t l
1/2
R)+ I h2Rt
+ R)], t - 1 , 2 , 3 , 4 .
(2.301)
Thc exponential factors eF of the eigenvectors (2.299) and (2.300) were compensated by the et-factors of the Bloch sums. In the case of a-states the
hybrid orbitals are subtracted, and the corresponding linear combinations
I a , R ) are given by
The polar diagrams of these functions are shown in Figure 2.22. For reasons
which will be clarified later, one refers to the orbitals I h t R ) as bondiny and
to the orbitals 1 arR)as anhi-bonding orbitals. With the help of bonding and
anti-bonding orbitals the eigenfunctions of the Hamiltonian matrix (2.296)
with b1' -- 0 may be written in the form
1
I btk) -
eik'R I b t R ) , 1 - 1 , 2 , 3 , 4 ,
aR
(2.303)
(2.304)
In the second step of the diagonalization procedure the Vl-terms are included. The Hamilton matrix (2.296) is transformed iuto the basis set of
the previously calculated eigenvectors 1 btk) and 1 atk). This results in the
(8 x 8)-matrix
(2.305)
173
{2.307)
as block elements. The expression for If, follows from that or H & if in the
latter q, is replaced by E,. The structure factors g t p and &) in (2.306) and
(2.307) are defined as follows:
The (4 x 4)-matrix H M couples the various bonding states, and H,, couples
the various anti-bop.ding states. The non-diagonal matrix Hab describes the
interaction between the two types of states. It gives rise to corrections to
the eigenvalues of relative order of magnitude I 11 I /2 I V2 I. Considering
the actual values of V1 and V2. these corrections are rather small. This
suggests treating them as perturbations. The zero-th approximation, i.e. the
complete neglect of the interaction between bonding and anti-bonding states,
is referred to as the b o d ovbital approximation. Within this approximation
the valence and conduction bands follow from separate eigenvalue equations,
the former by diagonalizing the matrix Hbbq the latter by diagonalizing Hw.
To calculate the total energy of the crystal one needs the total energy
of all valence electrons. T h e latter may be calculated by means of formnla
(2.54) which expresses the total energy of an interacting electron system by
means of its one-particle energies. One has
Er$&f= 2
E:(k)
k
-Ecd
(2.309)
(2.310)
To pro>*ethis relation, we write the eigenvalues @(k) in (2.310) as diagonal elements of H M between eigenstates, and note that within the bond
orbital approximation the eigenstates for a given wavevector k are linear
174
combinations of the bonding states I bik) only. In this matter, the linear
combinations are generated by the unitary transformation which diagonalizes the Hermitian matrix H a of (2.306). If one sums the eigenvalues @(k)
over all i , and takes advantage of unitarity, then the diagonal elements of
Hw with respect to the eigenvectors become the diagonal elements of this
matrix with respect to the bonding states, i.e. q,, as is stated in relation
(2.310). One may also prove this relation in another way, using Vieta's theorem, which states that the sum of all zeros of a polynomial of degree n
equals the negative of the coefficient of the (n - 1)- th degree term. In
the case of the characteristic polynomial of a matrix, this coefficient repre
sents the negative of the sum of all diagonal elements, here, therefore, -4q,
confirming the validity of equation (2.310).
(2.311)
To get the total energy of the crystal, the energy of the atomic cores must be
175
added to the energy value of equation (2.311). In doing so, one may again
use the fact that the core states of the crystal do not differ from those of the
free atoms. This means that only the mutual electrostatic interaction of the
cores results in a structuredependent energy contribution, while the internal
core energies sum to a constant Eo. The core-core interaction energy has,
approximately, the same value as the electron-electron interaction energy
E d between the valence electrons on different atoms. This is true because
the valence electron charge of an atom equals its core charge for the crystals
considered here. The corecore interaction energy approximately cancels,
of valence electrons
therefore, against the negative Coulomb energy -E,1
in expression (2.311). Finally, the total energy of the crystal is given by
(2.312)
The d-dependence of this energy is due to the fact that both Eh and Vz
depend on d - the hybrid energies Eh have d-dependence as they are defined
by the diagonal matrix elements of the Hamiltonian H between Lowdin
orbitals which contain overlap integrals between s- and p - orbitals of adjacent
atoms, and V2 because this quantity is the matrix element of H between
hybrid orbitals at nearest neighbor atoms. V2 decreases with decreasing
distance d, corresponding to an attractive force between the atoms. The
hybrid energy q, increases as d decreases, corresponding to a repulsive force.
For large d, the attraction dominates over the repulsion, and for small d, the
repulsion dominates over the attraction. Overall, the total energy E z l b l ( d )
of the crystal varies with d as shown in Figure 2.24 schematically. At the
equilibrium distance do, it takes its absolute minimum value. This means
that the initially free atoms will not remain free but form a diamond type
crystal with nearest neighbor distance do. They experience what is called
covalent chemical bonding.
(2.313)
where Eo, again, accounts for the energy of the atomic cores. The negative
difference of the two energies (2.312) and (2.313) represents the cohesion
eriergy of the crystal. It is given by the expression
176
crystal
atom
Etotol -Eta.ral
distance d (schematically).
Ionic bonding
The ionic contributions to chemical bonding will now be calculated for mat,+
rials having the zincblende structure. A s is well-known, a scries of 111-V and
11-VI compound semiconductors form crystals of this type. For the Ilamiltonian matrix (2.305), the transit,ion from the diamond to the zincblendc
structure means that ch in the upper left (4 x 4)-block has to be replaced
by the hybrid energy
of the 1-atom? and in the lower (4 x 4 )-block by
the hybrid energy e i of the 2-atom. With this replacement, the bonding and
anti-bonding energy levels become
EL
!+,
~8.
where
=c i Thr energy separation betwren the two levels is larger
than that of diamond type crystals. This results in an enlargement of the
energy gap betwcm the valence and conduclion hands. The bonding and
anti-bonding oibilals arp given by the expressions
+ bp.
178
EMad,
the total
(2.320)
with cy as the so-called Madelung constant. The latter depends on crystal structure and can easily be calculated numerically. In Table 2.11, the
n-values are listed for crystal structures which are observed in materials
composed of group IV elements as well as 111-V, 11-VI and I-VII compounds.
The value for the wurtzite structiirp in Table 2.11 corresponds to the ideal
tetrahedral case with an equivalent cubic lattice constant &a (see Chapter 1). The contribution of the Madelung energy to the total cnergy of a
given compound will be larger for larger effwtive charge number Z* of the
compound. This results in a tendency of compounds with larger 8* valiirs
to crystallize in structures with Madelung constants larger than that of the
zincblende structure. Therefore, in passing from the 111-V through the IIVI to the I-VII compounds, one observes a transition from the zincblende
structure through the wurtzite to the rocksalt and cesium chloride structures. The cesium chloride structure follows from the rocksalt structure by
replacing the two facecentered cubic sublattices by two primitive cubic sublattices, shifted in the same way with respect to each other as in the rocksalt
2.7. k . p -method
179
Zincblende
Uurtzite
1.6381
Rocksalt
1.7476
1.6410
2.7
k .p -method
2.7.1
Fundamentals
Luttinger-Kohn functions
The k. p-method rests on a particular property of the BIoch type eigenfunctions pV,+(x)of the crystal Hamiltonian H . As we know these functions
(which will be denoted below by (xluk) instead of q y k ( x ) ) are the product of
an exponential factor exp(ik.x) and the latticeperiodic BIoch factor uvk(x).
If one replaces the wavevector k in uYk(x) by a constant ko, while retaining
k in the exponential factor, then the resulting functions
1x0
[2.321)
are no longer eigenfunctions of H of course, but they do form a complete
orthonarmalizPc1 basis set in Hilbert space, as wcll as the Bloch functions.
whence
~
(vkko(vkko)- 6 v l u 6 k ~ k ,
(2 322)
(2.323)
uk
2.7. k . p -method
181
k .p
- Hamiltonian
(2.326)
which yields
E:(k) = E,(ko)
Ti2
+ -(k
2m
ko)2.
(2.328)
The matriv on the right-hand side of (2.327) allows for an important rewriting. If one defines
Hk.p(k) =
Ho(k)
+ -nah(k
- ko) . p!
(2.329)
with
(PkkolfflPkko) = (PkoIHk.,(k)lPko).
(2.33 1)
The latter relation means that the actual Hamiltonian matrix W in the kdependent Luttinger-Kohn basis Ipkko) equals the representative matrix of
a fictional k-dependent Harniltonian Hk.p(k) in the k-independent partial
Bloch basis lpkoko) = jpko) for the wavevector k = ko. The k-dependence
of the Luttinger-Kohn basis on the left hand-side of equation (2.327) has
been transferred to the new Hamiltonian Hk.p(k) on the right-hand side.
The SchrGdinger equation (2.325),with this new Hamilt onian, reads
182
The components of the eigenvectors Ivk) in (2.332) refer to the LuttingerKohn basis Ipkko), although the operator Hk.p(k)is represented in the Bloch
basis Ipko).
Solution of the Schrodinger equation (2.332) involves the diagonalization
of the matrix (pkolHkE,(k)lp'ko). For k = ko this matrix is automatically
diagonal, by virtue of the fact that Bloch functions Ipko) are eigenfunctions
of the Hamiltonian Hk.p(kO) = Ho. For k # ko, the Ipko) states are no
longer eigenfunctions of Hk.&), so that the matrix (pkolHk,(k)Ip'ko) has
off-diagonal elements with respect to the band indices. Formally. one may
interpret these non-vanishing elements as arising horn an interaction between
different bands. Since this interaction results from the (k - ko) . p-term
in Hk.p(k), one calls it the k . pinteractian. In this, the bands which are
mutually coupled, are not bands in the sense of the eigenvalues of the actual
crystal Hamiltonian H - the latter are uncoupled by definition - they are
fictional bands E:(k) defined by equation (2.328).
As the point k in Hk.,,(k) approaches ko, the k . p-interaction tends to
zero. For k-vectors sufficiently close to ICO, one can treat this interaction
with the help of quantum mechanical perturbation theory. Apart fkom the
square term in (k - ko) already present in E:(k), this entails a power series
expansion of the energy bands E,(k) with respect to (k - ko) about the
point ko. The form of the perturbation theoretical expansion depends on
whet her the unperturbed bands, i.e. the eigenvalues E,(ko), are degenerate
or not. We will first consider the simpler case of non-degenerate bands.
In first, order perturbation theory the eigenvalue E$(k) arising from EE(k)
is given by the relation
(2.333)
and the Bloch function Ivk)' arising from 1.k)'
2.7. k . p method
183
mz'
This relation connects the effective masses with the matrix elements of the
momentum operator between different bands and with the energy separation
of bands at the critical point. The tendency indicated is that the absolute
values of the effective masses become larger for smaller momentum matrix
elements and larger band separations. One expects small effective masses
for large momentum matrix elements and small band separations. As far as
the band separations are concerned (only for them can one make an easy
estimate), we will later find conhmation of this tendency in all concrete
cases. For pairs of bands which are closer to each other than to all other
bands and, therefore, whose mutual interaction is stronger than that with all
other bands, relation (2.338) allows one to also draw a conclusion about the
signs of the effective masses. According to it, the energetically higher of the
two bands should have a large positive effective mass, and the energetically
lower a mass of the same large absolute value but of negative sign. This
184
CFZ.S~~~S
Band degeneracy
Critical points are often symmetry centers or lie on symmetry lines, and
at these symmetry points, degeneracy of the energy bands often occurs. If
this happens, one must carry out second order k.pperturbation theory or
degenerate bands. In quantum mechanics, perturbation theory for degenerate energy levels is c o m o n l y of first order - the matrix of the perturbing
Hamiltonian operator between the degenerate states has to be diagonalized
(we remind the reader of the nearly free electron approximation in section
2.4). This procedure does not apply here because the perturbation matrix at
critical points vanishes in first order. One must therefore choose a variant of
perturbation theory for degenerate energies which works in second order. To
this end, one constructs the matrix of the perturbation operator not between
degenerate unperturbed eigenstates, as is commonly done. but between the
(also degenerate) eigenstates of first order of perturbation theory. By diagonalizing this matrix one obtains the eigenvalues in second order perturbation
theory. These are, in general, no longer degenerate.
An important case in which the k . p-perturbation matrix between the
degenerate unperturbed states vanishes, is the valence band maximum of
semiconductors with diamond structure. This case will now be investigated.
In doing so, w e initially neglect the spin-oIbit interaction. This approximation is valid for semiconductor materials composed of light elements only,
including, for example, Si. For other materials this procedure serves as a
zero-order approximation which can be used to proceed further (as we will
do below).
2.7.2
As we know from section 2.3, the valence band maximum of diamond type
semiconductors is located at the center r' of the first B Z . Therefore, we set
& = 0. The maximum is 3-fold degenerate. We denote the three pertinent
Bloch functions by IvmO), where m can assume the values 2,g,2. According
to section 2.6, these eigenfunctions belong to the irreducible represent ation
l?b5 of the cubic group oh. As indicated in Appendix -4. a basis of this representation is formed by the products yz, zx,z y of the components 2,y, z of
position vector x. Therefore, with regard to their transformation properties
under the action of elements of o h , we may identify IvzO) with yz, IvyO) with
zz, and IvzO) with zy. The vector components 2,y, z of the position vector
itself transform in accordance with the irreducible representation I'15 of Oh.
2.7. k . p -method
185
For the subgroup T d of oh the two representations F15 and r h 5 coincide. For
semiconductors having the zincblende structure, the three degenerate states
( v I O ) , IuyO), IuzO) of the valence band maximum may therefore be associated with I,y, z insofar as their transformation behavior is concerned. In
the case of the diamond structure, 2,y, z are merely a short hand notation.
The vanishing of the matrix (vmOlplum'0) of the momentum operator
between valence band states at I?, anticipated above, may easily be demonstrated using the pertinent criterion for such vanishing given in Appendix A:
The operator p transforms according to the irreducible representation I'15
of oh. The matrix (umOlplvm'0) therefore belongs to the reducible representation
x r 1 5 x rl,, = I?;, x (rh ri2 I'15 r 2 5 ) , wherein the identity
representation does not occur. According to Appendix A this means that
the matrix (umOlplvm'0) must vanish. One can also obtain this result by
means of inversion symmetry alone. We have chosen the somewhat more
troublesome method of proof because it may also be applied in other, less
obvious cases, as we will see immediately below.
+ + +
In order to apply degenerate second order perturbation theory, the solutions of Schrodingers equation (2.332) are needed to first order in the k . pperturbation. For the orthonormalieed Rloch valence band eigenstates (vmk)l
one finds
(2.339)
where we set E,(O) =
for brevity, and the degenerate valence band energy
E J O ) is denoted by Ev. The third term in (2.339) guarantees the normalization. (:onsidering the sum on p , tlir value 11 = 71m does not need to be
specifically excluded because the matrix elements ( p I p I vm') for p = 117n
vanish anyway. Expressions of tlic form (2.339) also hold for the approximate Bloch functions Ipk)' of the remaining bands p with p f urn, but we
omit an explicit presentation of them here. The states Ivmk)' and Ipk)'
with p # t m will now be used as a basis set to represent the IIamiltonian
H . The resulting matrix is 'almost' diagonal, because the basis functions
are 'almost' eigenfunctions. In particular, the submatrix of the three velence
bands is coupled to the remainder of the matrix only by elements of second
order in the k . p-perturbation. These elements give rise to corrections of the
valence band energies which are only of third order and can be neglected.
186
In second order perturbation theory, the valence band energies Ev and Forresponding Rloch states I&,) therefore follow from an rigenvalue equation
which is decoupled from the remaining bands, namely
'(vmklH!vm'k)l l(wn'kl&:y)= E ' , '(vrnklE,).
(2.340)
rn,
The initial occurrence of interaction betwren the valence band and the r e
maining bands is incorporated in the matrix (u>mk I H I wn'k) in first order
perturbation theory.
Harniltonian m a t r i x
The (3 x 3)-Hamiltonian matrix '(vmklHlvm'k)' of equation (2.340) can be
obtained by means of expression (2.339) for the perturbed states Ivrnk)'. A
short calculation yields
where
L?zk,
+ +
+ +
187
matrix elements D z z , Ll;& and D Z . We introduce the abbreviations L D g , hl = D$$, and N = D$ f D g . The elements L, M and
can be
calculated if the Bloch factors are known. In the absence of this informalion,
however, we consider L , M and N to IIPempirical parameters (as indicated
at the outset) and use their connection with the Bloch factors only to identify
some general properti=, such as the fact that they can be chosen real. Since
Q B vanish, the Hamiltmian matrix of the
all remaining matrix elements Dmm,
valence band has the form
Method of invariants
The Hamiltonian matrix (2.344) can also be derived in a somewhat different
way, whirh leads to the goal more quickly, but is formally more dernanding. One uses the fact that the Hamiltonian matrix l(vmklHlum'k)l can
be represented as a linear combination of the 9 matrices of a basis in the
product space 1 vmO)(iirn'O 1 which transforms according to the repreaentation [I& x r:,] of the point group oh. This representation is reducible. By
decomposing it into i t s irmliicible parts, m e obtains a basis which consists
of subbases, each of which belongs to a particular irreducible representation
of Oh. Such a matrix basis can easily be constructed by means of the 3dimrnsionel angular niomenturn matrices Iz,Iv,Iz (considered in Appendix
A ) and their products, since it is known how these matrices transform,
namely according to the pseudovector representation I':s. In the product
spacc k,f-fi of the components of the vwtor k, one proceeds in a similar way.
One determimes a basis from subbases which transform according to the irreducible parts of the representation [I'15 x r15Iy.The Hamiltonian matrix
reprewrits an element in the product space of Ihe two spaces which is invariant under transformations of the point group Oh. Such invariant elements of
the product space can be produced by forming scalar pruducts of subbases
of the two spaces whi& transform according to the same irreducible repre
sentation. As seen in Appendix A, the corresponding scalar products belong
to the- identity representation, that is to say, they are invariant.
To find the most general Hamiltonian matrix compatible with the symmetry oh, one has to determine all invariants of the product space. I. one
then multiplies each by a real scalar factor and sums them all, one obtains
188
the most general invariant of the product space and thus the most general
Hamiltonian matrix compatible with Oh symmetry. This process is called
the method of invariants. It is applicable to arbitrary symmetry groups and
degrees of degeneracy, and it quickly leads to the goal if one considers spin
and spin-orbit interaction. It also allows one to determine the matrices for
perturbing Hamiltonians other than that of the k . p-interaction, such as
the interaction between the angular momentum of Bloch electrons and an
external magnetic field (see section 3.9) or the interaction with mechanical
strain. In this book, we will only use the method of invariants occasionally.
A comprehensive outline of the method with several applications i s given by
Bir and Pikus (1974).
Valence b a n d s t r u c t u r e
The eigenvalues of the matrix (2.344) form three valence bands E,l(k),
E,n(k), E,3(k). For the three symmetric k-directions [loo], [lll]and [110]
the dispersion curves are determined by simple analytical expressions as follows:
E,lp(k) = M k 2 ,
1
E,lp(k) = - [ L
3
+ 2M - N]k2,
E,3(k) = L k 2 ,
1
E,3(k) = - [ L
3
+ 2M + 2 N ] k 2 ,
+ M + N]k2,
(2.346)
(2.347)
E,l(k) = M k 2 ,
E,2(k) = - [ L
(2.345)
E,3(k) = - [ L
2
+M - N]k2.
(2.348)
Along the two directions [loo] and [lll], the valence band, being triply
degenerate at r, splits into two bands, one 2-fold degenerate and one nondegenerate (see Figure 2.25). In the [llO]-direction and also for all more
asymmetric k-vectors, no degeneracy remains. This indicates a %fold splitting of the valence band for such k. All bands are parabolic, but evidently,
in general, not isotropic. One speaks of a warping of energy bands. In the
case of Si, one has L = -5.64, M = -3.60, N = -8.68 in units of ( h 2 / 2 m ) .
Using these values, the two degenerate bands E,l/a(k) of equations (2.345)
2.7.
k . p method
189
Figure 2.25: Valence band dispersion for diamond type semiconductors in the
vicinity of the I'i5-maximum for different k-directions.
or (2.346) have smaller curvatures than the third band E,s(k) in these equations. Thus the first two bands correspond to the heavy holes and the third
band to the light holes of Si. Isotropy exists only if N = 0 and L = M .
Then, there also is no longer any distinction between light and heavy holes.
Conversely, anisotropy grows stronger as the difference between the masses
of the two types of holes becomes larger.
The results discussed above were obtained without consideration o l spinorbit interaction. However, for most of the diamond and zincblende type
semiconductors, the valence band structure is significantly influenced by this
interaction (in the case of Si it is small, hut often not negligible). We now
proceed to consider the effects of spin-orbit interaction.
2.7.3
-r
190
we have
One has to be aware that the (sxlvuk) are eigenfunctions of H , but not
of H
H,.
Correspondingly, the (xlvuko) signify the spin-dependent
Luttinger-Kohn functions of H , but not of fi + f l W .T h e matrix repre
senlation of the Schrtdinger equation with respect to the spin-dependent
Luttinger-Kahn basis reads
x(prkOtH
(2.351)
pW
with
ii = p
1
+ ---[a
4m c=
Tl
+ -k
. a,
m
(2.353)
x VV(X)].
(2.352)
+ H,,
becomes
(2.354)
C ( ~ ( T O I H+ ~HsoIpuO)(pukOIEu)
.~
= E,(pnkOlE,).
(2.355)
PU
Up to this point, we have kept the discussion general. Now we wish t o explore the particular consequences of spin-orbit interaction for the previously
2.7. k p method
191
considered
valence band states. To this end, we need the matrix elements (vrnuO~Hso~wmuO)
of H,, between the spin-dependent Bloch states
Ivm 0 . 0 )namely,
~
F,
( u ~ u O I H , , I ~ ~ U =O ~) ( v v ~ [VV(X)
O I
x p] IumO) x (aldld). (2.356)
4m c
(~.Ol[V(X)XPl*l v y 0 ) = (WYOI [ w x ~ x P l . / ~ =
~ (oV Z) O l PV(XjXp1ytvrO).
where (vxOl[VV(x) x plz I u y O ) = -(uyO I [VV(x) x p],lvzO) holds. Because
the Bloch factors are real? these elements are pure imaginary. We denote
the value of (vyOl[VV(x) x p],jvzO) by (4m2c2/h)(i/3)A, i.e. we set
h
4m c
(2.357)
,(uyO~[VV(x)x plrlvzO) = i - .
Below, we will see that the constant A is the energy splitting of the valence
band at J? due to spin-orbit interaction. Applying equation (2.357) and the
explicit form of the spin matrices given in (2.57),the matrix (avmOlH,I
aw m 0j becomes
0 - i
(vmaOp,,Ivm.uO)
A
3
=-
0 0
0 0 - 4
0 - 1 i
0 - 1
o i
0 - i - i o
0 0
(2.358)
Here the rows end columns are associated with the basis functions in the
sequence Ivz T O j = Iz t), Ivy 7 0 ) = ly tj, . . ., It)z 1 0 ) 3 1s 1). Spinorbit interaction couples orbital states and spin states to each other. At
k = 0 the expressions E:(k) and k ii are both zero. The eigenenergies and
eigenfunctions of the total Hamiltonian Hk.a
H , are therefore also those
192
k - 0 becomes
(vmaO~H,,lvm'a'O)(vm'a'O~E)
= E(wamO(E).
(2.359)
u'm'
Eigenfunctions at
r. Angular momcntum
basis.
1
p
(2.360)
IE )
v1 -
IEv3) =
1
T ( 1 ,i, o,o, O,O), IEv2)
Jz
1
i
z(l, - i , O , 0 , 0 , 2 ) , IEv4) = -(O,
Jz
O,O, 1, - i , O ) ,
(2.361)
--A
2
3
(2.3 62)
i
0, 1, 1, i, o), 1E ) - ----(--I, i, o,o, 0, 1).
w6 - d3
(2.363)
Ev5/6 -
IEv5) -
-(o,
fi
2.7.
k p -method
193
emerge from the representation 213 of the full rotation group if, D Dis~taken
as a representation of the subgroup oh with +1 for inversion. It has also been
shown that the basis functions of this representation are the simultaneous
5 or the eigenvalue j ( J
eigenfunctions of the angular-momentum-squared '
1) (in units R 2 ), and of the z-component J z of J for the eigenvalues m j =
23 , 12 ., - 1 . =L and -$
One therefore denotes the first four eigenfunctions
of (2.364) by
4.
33
I--)
22
31
+
iy t), I--)
4
22
= -1rz
3T
I--)
22
-[&
I.
- iy
T)
t) + la: + iY 1 1 1 7
= -"-2Iz
+2/2
fi
33
i
= 1.22
111%I--)
Jz
-iY
(2.365)
I),
rt
4,
3,
ing to Appendix A , as Di x
T
= Y$
+ I';.
194
by the constant A, which determines the strength of the spin-orbit interaction. Therefore A is called spin-orbit splitting energy. One may interpret A
as the difference of the spin-orbit interaction energy between the states with
3 and those with j =
j = 3
As one should expect, the states with larger
angular momentum lie energetically above the states with smalIer angular
momentum. States with different m j , i.e. with different projections Jz of
total angular momentum on the z-axis, but the same J2,have the same spinorbit interaction energies. Therefore the degeneracy of these states remains.
i.
The above statements refer to valence band states at the center r of the first
B Z , where the k p-interaction vanishes. Off
this interaction is no longer
zero and must be taken into account in addition to the spin-orbit interaction.
We have seen how this can be done approximately in the preceding section,
without consideration of spin. The method used there indicates the following
procedure in the presence of spin and spin-orbit interaction: One determines
the functions Ipok)' which diagonalhe the operator H k q of (2.352) in first
order perturbation theory. In analogy to equation (2.339), one finds for the
valence band states Ivmcrk)' the expression
r,
2.7. k p -nrt.th<>d
195
(vmakIH,,lv7nak)
(vmaOlH,qolvmaO).
Now we use the fact that H,, is diagonal in the angular momentum basis
Ijmj) of (2.365) and (2.366). It is clear that this basis follows from Ivma0) E
Imu) by a unitary transformation
(2.370)
~ j m j ) C U r r m j m j Ima).
ma
can be readily
(2.371)
(hjlff,oljm;)
o $ o o
o o + o
o o o g
0
(2.372)
196
The sum of the two matrices (2.372) and (2.373) is the new Hamiltonian
matrix. It has the same eigenvalues as the original matrix, even though its
form deviates from that of the original. The difference in form is, above all,
that the new matrix is already diagonal at k = 0. Kon-diagonal elements
occur for k # 0. Among them, the elements between basis vectors ] j m 3 )and
[ j ' r n $ ) with m3 # m i , , but
= 3' play a different role than the ones with
.f j'. While the influence of the 2-diagonal elements on the eigenvalues is
independent of the size of the spin-orbit splitting A, it does depend on it for
the j off-diagonal elements. The magnitude of the latter can be estimated
as the larger of the two terms Nlki2 or IL - Mllkl'. If one assumes that
l L f a ~ ( ~ VIL, - AIl}lk12 << A holds, then a perturbation theoretical treatment
is possible. It yields an energy correction of order of magnitude [ M a r { N ,ILM1}]21k14/A. Under the assumptions made, this is small compared to A.
That means that the 3-off-diagonal elements of the transformed Hamiltonian
matrix can be neglected if the k-vectors are sufficiently close to r. We will
assume below that this is the case, although the Luttinger-Kohn model also
covers the general case of a (6 x 6 ) -Hamiltonian matrix. Neglecting ]-offdiagonal elements the Hamltonian decomposes into two blocks, one (4 x 4)
-block conesponding to the basis vectors of the representation I'i,and one
(2 x 2)-block for the basis vectors of the representation
The rsf-matrix
reads
I't.
,)::I
The Hamiltonian matrix of thr rz-valence band can also be derivcd using
the method of iiivarinnts, which was discuss& at an earlier stagc. To do this,
one needs the angular momentum matricrs for spin J =
as well aa their
products with earh other. These matrices arp given in Appendix A. The
4,
2.7. k . p +method
197
(2.376)
2
Using the explicit expressions for Q, T, R , S, this yields
A = -(L
3
+ 2M),
1
-3( L - Ad), C 2
1
-[N2 - ( L - M)')].
3
(2.378)
198
s; II [loo]
5; II
[Ill]
G+l
Figure 2.26: Valence band structure of diamond and zincblende type semiconductors in the Luttinger-Kohn model. The dispersion for the two k-directions is
different (band warping).
h2
-----TI, B
2m
ha
Lni
-,2~,
h2
C2 = ---12(7,2
2m
- 7:).
(23 8 0 )
Values of the Luttinger parameters are listed in Table 2.12. The constants
L , M , N , which were originally used, may be expressed in terms of the Luttinger parameters by the relations:
Both energy band functions E&(k) and Er.,(k) depend on the square of
Ikl, i.e. they are parabolic. This would not have resulted if the interaction
between the rBf-states, and the spin-orbit-split r,f states, had not been
neglected as it in fact was. Concerning the dependencies on the direction
of k, the spin-orbit-split band Er7(k) is isotropic, while the heavy and light
f
hole bands Er8(k)
are not. In their case one again has a warping of energy
bands as discussed above (see Figure 2.26). The constant C measures the
strength of warping. In the case C = 0, the warping vanishes. For C # 0,
the point k = 0 is a singularity of the energy band functions Erf,(k) in that
the second derivatives with respect to the components of k depend on the
direction from which one approaches the point k = 0. The effective mass
tensor, as given by equation (2.196), is not defined in such circumstances.
Instead, one can define an anisotropic effective mass, by differentiating in
equation (2.377) not with respect to the components of k,but with respect
to Ikl. If the warping of energy bands is ignored, for instance, by averaging
199
Table 2.12: k .p-band parameters for selected diamond and zincblende type serniconductors. E i , A, and E , in eV. E p = ( 2 m / h 2 ) P 2is a measure of constant P in
Kane's band model. The values for 71, yz, 7 3 are adjusted to the Luttinger-Kohn
model. Temperature below 70 K . (After Landoldt-Bomstein, 1982.)
Material
71
yz
73
0.94
0.23
0.23
Ei
5.48
C
Si
4.26
0.38
1.56
3.4
0.044
Ep
Ge
13.35
4.25
5.69
0.90
0.29
26.3
cz - S n
19.2
13.2
16.3
-0.4
0.8
39
GaAs
6.85
2.1
2.9
1.52
0.34
25.7
GaSb
11.8
4.03
5.26
0.70
0.8
22.4
InSb
35.08
15.64
16.91
0.18
0.98
21.2
ZnSe
4.8
0.67
1.53
2.67
0.42
HgTe
12.8
10.6
8.8
-0.30
1.08
17.5
over all directions, this yields the ordinary isotropic heavy and light hole
masses but in the sense of an average.
The Luttinger-Kohn model was described above for the case of diamond
type semiconductors. Formally, for materials having the zincblende structure, the model does not apply because the matrix elements of the momentum operator p between the triply degenerate valence band states without
spin. IvrnO), are in general non-zero: These states transform according to
the vector representation r15 of the tetrahedral group T d r and the matrix
elements (vrnOlplvm'0) belong to the product representation I'15 x I'15 x r15.
The latter contains the unity representation, as distinguished from diamond
type semiconductors, where the unity representation does not occur in the
corresponding I'i, x r15 x
product. The reason for this is the absence of
inversion symmetry in the zincblende structure. The non-vanishing matrix
elements (omOlplvrn'0) give rise to terms linear with respect t o k in the
I'15 valence band Hamiltonian, besides the quadratic terms which are are already present in the diamond case (see equation (2.344)).However, as a rule,
the k-linear terms are small, and the Luttinger-Kohn model also applies to
zincblende type materials, provided the other requirements which underlie
this model are satisfied. Table 2.12 therefore also contains Luttinger-Kohn
parameters for semiconductors of the zincblende type.
The most important requirement for the Luttinger-Kohn model to be valid is
200
the validity of the assumption that the k p-interaction of the valence band
with thc deepest conduction band is weak enough to be treated by means
of perturbation theory. This is justified as long as the encrgy separation
EF from the lowest condiiclion band and the I&-valenre band (not to be
confused with the fimdamental cnergy gap) is siifficiently large. One expects
deviations from the Luttinger-Kohn model to become noticeable if E i is
sinall Table 2.12 shows thal the EF-value for InSb, for example, clearly
lies below those for C, Si and Ge. In the case of a-Sn and HgTe, Ef; even
becomes negative. Simultaneously, the spin-orbit splitting energy A becomes
relatively large in some of the zincblende type materials, so that even A > /i$
holds. Describing of l h e valence band of such semiconductors by means
o l the Luttingrr-Kohn model would entail treating the effect of the remote
spin-orbit-split band exactly, while taking iuto account only the energetically
closer conduction band by means of perturbation theory. Such a procedure is
not meaningful and one must seek a different, more appropriate description.
A model which is tailored exactly to such circumstances is the Kana
model, which we will now discuss. In this matler, we asbume that the point
group of equivalent directions is the tetrahedral group rh, and no longer the
cubic group o h as above, thercby encompessing both typeb of semiconduclois, those of zinrldende type EM well as those of diamond type. In the latter
case, inversion symmetry still has to be added. This involves a spwialization
of the results, which may be casily done, should thc need arise.
2.7.4
Kane model
2.7.
k.p -methad
201
(vmaO)Ho(k)jvm'dO)= bmm~6umi-k
ri2 2 ,
2m
h2
(csdlHo(k)lcsdO) = 6,,)-k
2m
(wmOlHo(k)IcsO)= 0.
(2.382)
(2.383)
(2.384)
(2.385)
202
~ c s o ~ p , ~=7~~r r8o~~~ p v =
~ I( cv syo~/ p
) , ~ ~=
~ ti .F,-o- )~
(2.386)
The factor 2 giiarantws that P i s real, if the Bloch factors are real as we
assume. The other factor (m/Ta)was introduced for convenience in the final
dispersion rclations. With the moinentiim matrix elements of (2.3861, thp
Hamiltonian matrix (pu01Hk ,(k)lpuO) takes the form
(2.387)
Ec
,iPk,
iYk,
--iPk,
-iPkg
-iPk,
E C
iPk,
iPk,
iPk,
-iPk,
-iPk,
-iPk,
I]
/s
(um.aOlHsIcsrr0)= 0.
(2.388)
2.7. k . p -method
203
+ H,,
,
Ec
iPk,
iPk,
iPk,
-iPk,
-ig
A
-iPkg
i$
O
0
-iPk,
--
i+
Ec
iPkz
iPk,
iPk,
-A
3
--iPk,
-2z
-iPkY
-+
i+
.A
.A
-iPk,
0
\
0
a
3
2 5
.A
-27
Here, the order of rows and columns is the same as in (2.387). For k = 0
and vanishing Ec, this matrix reduces to the spin-orbit interaction opcrator
WSw If one rearranges the rows and columns of this matrix in such a way
that those relating to the conduction band states 1s 1) and 1s J) occur in the
left upper corner, side by side, then the matrix decomposes into a (2 x 2)block for the conduction bt~ncl,and B (6 x 6)-block for the valence band.
The eigenfunctioiis of the two blocks arc simultaneously also e i g e h c t i o n s
of the total matrix. T h {2 x 2)-co,nduction band block is already diagonal,
i.c. 1s t) and 1s J] are eigenhnctions of the Hamiltonian matrix (2.390) at
k - 0. The (6 x G)-vdencr band block is identical with the matrix of the
spin-orbit interaction operator H,, of (2.350) for the Luttinger-Kohn model.
The eigenfunctions at k - 0 here are therefore also the vectors 1$ms) and
i$rn
1)
The latter basis should be particularly suitable for solution of the eigenvalue problem for the Hamiltonian matrix (2.490) at k # 0. The matrix
(2.390) is, however, so simple, indeed, that one can also obtain the secular
equation directly. We will do t h s . before we h r t h w consider the angular
momentum basis. To diminate the free elmtron part ( h 2 / 2 m ) k 2from the
eigenvalues Elk), we write them in the form
h2
2m
E(k) :
E(k) -I--k2
where E(k) satisfies the following
(2.391)
204
Secular equation
(2.392)
The fact that the two factors in round brackets appear squared, signifies an at
least 2-fold degeneracy of all eigenvalues. The reason for this is. again, time
reversal symmetry jointly with spatial inversion symmetry (we remind the
reader that the term of the Hamiltonian which can break inversion symmetry
in the case of Td-symmetry has been neglected). Accordingly, one has in
general four 2-fold degenerate bands Ei(k).Ei(k),E$(k),Ei(k). It is also
noteworthy that in the secular equation (2.392). k enters only in the form of
k2. This means that all four bands are isotropic, in contrast to the LuttingerKohn model where a warping of the valence bands occurs. In the case of
k = 0, the energy levels of (2.392) are given by
One may draw conclusions from these expressions in regard to the meaning
of the four energy bands E,(k): El(k) is the J?s-conduction band. E z ( k )
2nd Es(k) are the two upper degenerate rs-valence bands at r. and E4(k)
corresponds to the spin-orbit-split rT-valence band. The energy separation
EL of the r6-conduction band and the I'g-valence bend at. r is obtained as
(2.394)
As long as E: is positive, it represents the energy gap E , at I?. The case of
negative E,' is discussed below. For one of the two upper valence bands - the
one which arises from the vanishing of the first factor of the secular equation
(2.392) and which is denoted by i = 2 - the energy El(k) does not depend
on k. For E$(k).a k-dependence follows with finite negative curvature, as
we will soon see. Thus E;(k) corresponds to a band of (infinitely) heavy
holes, and Ei(k) to a band of light holes. If one adds the (Ti2/2m)k2-term,
then the band Ez(k) displays a positive curvature. It is relatively small
because of the large free electron mass, but the positive sign contradicts
what is to be expected for a valence band. This unexpected prediction for
E z ( k ) results from the fact that the interaction of the valence band with all
remaining bands, except with the deepest conduction band, was completely
neglected. In order to treat the heavy hole band correctly, the interaction
with remote bands must also be considered at least by perturbation theory
as in the Luttinger-Kohn model. This will be done below.
205
206
sions. To avoid these below, the 4 x 4 block matrix of the remote band
interaction will be reduced to a special case before proceeding further. We
put L = M -- A,, and N = 0, which means physically that the remote
bands affect heavy and light holes in the same way, and do not disturb
the isotropy of the bands. -Ordering rows and columns in the sequence
1s t), 1s l),I);:,
I;$), . . . , I$$), the transformed 8 x 8 Hamiltonian matrix
with simplified remote band interaction becomes
aP,
-iP-
-fiP-
-i$&P,
U'
-i&
-&
(2.3
E'(k) + -A
i
3
- A,k2
I-i
E'(k) +
This equation only differs from equation (2.392) in that the factors whose
vanishing define the conduction and valence bands contain, respectively, the
additional terms A,k2 and A,k2.
Solution of the secular equation in limiting cases
The zeros of the h s t factor in (2.396) yield, as seen previously, the I's-band
of heavy holes (i = 2). However, the dispersion relation for it now reads
A
3
En(k) = -
ii2
+ A , k 2+ -k2
2m
(2.397)
2.7.
k.p -metbod
207
- [E'i(k)
$1
P2k2 = 0.
(2.398)
Case 1: E: z E , >> A,
EF
>o
We consider energy values Ei(k)in the various bands with energy separations
IEi(k) - &(O)( from the respective band exbrema which are small compared
with EF. For such energies, the conduction band El(k) approximately obeys
the equation
[El(k)- E,]E, - P 2 k 2= 0.
(2.399)
and for the two valence bands E3(k) and E4(k) we have
bi(k) -
$1
bi(k)
21
+ TA
E,
I:
+ [&(k) + -
P2k2 = 0, i = 3 , 4 . (2.400)
208
(2.401)
and from (2.400) we obtain
1 A
E3/4(k) = -2[3
P2
-k
E,
P2
2]*ij[?f-Kk2]+m'
(2.402)
-(2 13)(P2/E,) k2
E3/4(k) =
- (2/3)A
(2.403)
- (1/3)(P2/E, ) k2.
= E, << A, Ef
>0
The energy values Ei(k) now have energy separations IEi(k) - Ei(O)I from
the respective band extrema which are small compared to A. This means
they can be comparable to E i . For the conduction band and the light hole
band one then gets from (2.398), approximately,
= 0, i = 1,3,
(2.404)
from which
follows. In general, the dispersion laws for the electrons and light holes are
again non-parabolic. Only for energy separations from the band extrema
which are small compared to E,, more exactly for P2k2 << (8/3)E,, a k2dependence emerges, namely
\i
209
2.7. k . p -method
b)
7
A
r,
3
4
Figure 2.27: Valence band structure of zincblende type semiconductorsin the Kane
model for limiting cases: (a) E, >> A, ( b ) E g<< A, E: > 0, ( c ) E , << A, E; < 0.
Since the energy region of width Eg above the band extrema is relatively
narrow for the narrow gap semiconductors considered here, one has in these
materials, even at relatively small separations from the band edges, nonparabolicities in the dispersion laws of the electrons and light holes. As far
as these particles are concerned, small energy gaps and non-parabolicities
occur together.
For the spin-orbit-split band, one obtains, without further approximat ions,
2
E4(k)= --A
3
PZ2
.
3A
- -k
(2.407)
The dispersion curves of all four bands in the limiting case considered here
are depicted in Figure 2.27b.
Case 3: IEil << A, E i < 0
According to equation (2.394), negative values of EF mean that the r6-
2 10
conduction band level lies below the rglevel. The relation lEil << A
guarantees that the spin-orbit-split r;.-level is found further below it (see
Figure 2 . 2 7 ~ ) .If. again, only energy values are considered with separations
IE,(k)- E,(O)I from the respective band extrema which are small compared
to A, one obtains dispersion relations having the same form as in the previously considered case EF << A , E , > 0 (see equation (2.406). For electrons
and light holes they yield. under the condition (P2/A)k2 << IEil, the approximate parabolic dispersion laws
Because of the negative sign of E:, band 1 now lies energetically lower and
exhibits a negative effective mass, and band 3 lies higher and has a positive
effective mass. For the spin-orbit-split band E4(k), equation (2.407) holds
unchanged, and for the band E i ( k ) of the heavy holes, relation (2.397) also
remains the same. Thus, the band E 3 ( k ) is, among all four bands, the highest
energetically with the exception of I', where it degenerates with the band of
the heavy holes. Since the 8 valence electrons of a zincblende type semiconductor are only enough to occupy 6 of the 8 bands of the I?;.--. FS-. ??&band
complex - 2 electrons per unit cell are necessary to fill the deepest I's -valence
band, omitted from consideration here - the E3[k) band remains empty. It
becomes the conduction band, which is also in accord with its positive effective mass. At r, its separation from the uppermost valence band, the Ez(k)
band of heavy holes, is zero. This means that the energy gap vanishes in this
case. Negative d u e s of the parameter E i . which signifies the energy gap
when it is positive, cause EF to lose this significance, and the real energy
gap becomes zero. Materials with vanishing energy gap are called zero-gap
semtconductors. Examples are HgTe as a zincblende type semiconductor,
and D - SR as a semiconductor of the diamond type.
In Table 2.13. the effective masses m t are listed for the three limiting
cases considered above. Generally, one has 1m.I I 5 I rn31 < IrnZl. The effective masses of the electrons and light holes are proportional t o / E g l / P 2
throughout. The rule discussed above for degenerate bands is thereby confirmed to be valid also for non-degenerate bands wherein mass decreases as
lEgl decreases and P increases. For IE,l << A the effective masses of the
electrons and light holes are almost identical, for E, >> A the electrons are
lighter than the light holes. For the spin-orbit-split band, the proportionality of the effective masses to IE,l/P2 exists only in the case E, >> A; for
<< A the mass mS becomes independent of Es and proportional to A.
Band
EF>>A,E;>O
rdl)
1.(F,/P2)
(3/2). (E,/P2)
rs(3)
-(3/2) . ( E g / P ' )
-(3/2) . ( E S / f " )
rd4)
2.8
-3
'
(E,/P2)
EL<A,h',r>O
-3
'
(A/Fs,)
211
EL
-(3/2)
<< A, EF < 0
'
(I
8; I / P 2 )
(3/2) . (I -Ei I / p 2 )
-3
'
(A/ I
q I)
In this section we discuss the band structures of some important semiconductors. In all cases, the results presented are based on both theoretical and
experiment a1 investigations.
Experimental data concerning band struct ure of semiconductors arc mainly
obtained by means of optical reflectance spectroscopy. It turns out that characteristic structures of the reflectance spectra, like peaks or shoulders, are
directly related to optical transitions at critical points of the energy difference
between the initial and final bands involved. The frequencies of these structures are experimental measures of the energy separations between initial and
final bands at critical points, To enhance the charactelistic spectral features,
changes of the reflectance spectra are measured due to external perturbations, as, for example mechanical strain, clmtiic and magnetic fields, light,
or heat. By modulating the perturbations periodically in time with frequencies in the kHz range, these spectral changes can be measured very precisely
by means of frequency and phase scnsitive techniques. Exttniple5 of this socalled modulataon spectroscopy are electrorepectance (ER), pzezorejlectanw,
thermarejlectance, and photoreflectance (PR) (for morc on elect roreflwtance
see section 3.7). Details of the band structure at critical points, like effective
masses of free carriers, may also be extracted from transport meaxmemenits.
Again, external perturbations, in particular, magnetic fields, are applied to
induce changes. Magnetotransport phenomena, like rnngnetoreuzutanc.r and
Shubnzkov-dp-Haas effect are examples. In cyclotron rmonance one measures the absorption of microwave radiation by a semiconductor sample in
the presence of a magnetic field to obtain the effective masses of electrons
and holes (section 3.8).
None of the experimental methods is capable of revealing the entire band
strurture of a given semiconductor material at all poiutv of the first BZ. To
212
obtain this. one is nhliged to carry out band structure calculations. Expaimeutal data enter thrse calculatiolis in various ways. Tlus is obvious if empirical methods are cmployed their results have to be fitted to experimental
data, as, for example, to energy separations between bands at critical points
obtained from modulation spectroscopy. Less obvious. but nevertheless existing, is the need of experimental data for ab-initio calculations. Although
these methods are free of fitting parameters, various approximations are involved which call for experimental confirmation or even corrections of the
results. as, for example, in the case of the erroneous fundamental energy gap
in the local density approximation.
Below we represent the results of of uumprical hand structure calculations using one or mother of the methods described in section 2.5. We will
not specify which particular method was applied since that is not of interest
here. Our main concern is with the qualitative features of the energy bands.
We will demonstratp that thew may be understood. at least partially, just
by means of the general results derived in the preceding sections and in Appendix A. This is particularly true for features irivolving the degree of band
degrnrracy at symmetry points of the first B Z , which follow from the i r r e
dudble representations of the space group of the crystal under consideration
(see Appendix A). The band structure models derived by the empty lattice
approach. the k . p method, and the tight binding method in sections 2.4,
2.6 and 2.7, respectively. will also be helpful. We begin our discussion with
silicon.
2.8J
Silicon
2 13
Figure 2.28: Rmid structure of Si. The energy unit i eV. (After Chelikowskp and
Cohen, 1974.j
r.
214
t lo
-w 8
m
cu
w 6
r, 62 5;
r; 5' 5 5 6;
r1
A
Wavevector
A 2-fold degenerate band along A cannot merge with another band at X , but
miist terminate in a doubly degenerate level at X. From this it follows, for
example, that the upper of the two bands arising from the I'k5-level along the
A-line must be 2-fold degenerate, and the lower band must be simple. A look
at Table 2.4 shows that the only 2-dimensional representation of the small
point group of A is A5. At X this representation is compatible either with
X 1 or X 3 (the compatibility relations between irreducible representations
are derived I n Appendix A). Figure 2.28 shows that X 1 is correct in this
case. In a similar way one may conclude that the representation of the lower
level at X ,arising from the I'i,-level, must be X z . A similar analysis for the
splitting of the rls-level along the A-line shows that the lower level is nondegenerate and belongs to the irreducible representation A l , and the upper
level is 2-fold degenerate and belongs to .As. The intersection between the
Ah-band emerging from the r!+vel,
with the As-band emerging from the
l'ls-level, is not due to symmetry, but reflects an accidental degeneracy.
Consider next the two bands arising from the l'$s-level on the A-line.
The upper is the 2-fold degenerate As-baud, and the lower is the simple
A1-band. The simple band does not merge with another simple band at
L , but remains separated from it by a finite gap, in contrast to the band
behavior at X. This is possible because at L there arc also 1-dimensional
representations, L1 and L i , which give rise to the two lowest levels at L .
The splitting of the rls-level along the A-line is quite similar to that of the
l?~5-level;the lower band is non-degenerate and belongs to A l , and the upper
2 15
2 16
axe8 of the effective mass tensors of the various valleys. Each principal axis
represents a Cfold rotation symmetry axis. To p r o r d further. we select,
arbitrarily, the valley centered at (O,O, kJ. The band structure of this valley
is given by the expressions (2.201) and (2.2@2), which are applicable here,
as their conditions of validity are satisfied. Setting the band index v equal
to c, which refers to thp coiiductioii band, the dispersion rdation EJk) of
this band becomes
(2.409)
If t,hc zero point of the energy scale is put at. t h r valence band maximum, as
we do here. then E , is t,he fundamental energy gap.
In the vicinity of the valencc band maximum, the general results of the
k . p-method of section 2.7 are applicable. Without spin-orbit interaction
(see cqiiations (2.345) to (2.348)) one has two valence bands for each of
the two symmetry directions A and A, an upper E,l,z(k) which is 2-fold
degenerate, and a lower &(k) which i s non-degenerate. Along less symmdricnl k-directions E,l/z(k) split 5 into two bands. However, spin-orbit
interaction cannot be neglected; although it has little effect on the overall
band structure of Si, it inftuences the valence band structure considerably
in an energy interval of several kI' below the rnaxiniurn at r, which is the
energy region where most of the holes B r e located. Spin-orbit interaction
makes the uppermost valence band level at r, which has a 6;-fold degeneracy in Si if spin i s considered, split into an uppm &fold degenerat.e I?$-level
The splitting rncrgy amounts to
and a lower 2-fold degenerate I';-lcvel.
44 m e V . Away from I?: the upper ra-level decomposes into the two bands
Efp(k) of heavy and light holes according t o equation (2.377). The lower
r$-levd givw rise to the spin-orbit-splitr;-band. The heavy and light hole
bands are strongly warped and each exhibits %oldspin-degeneracy, which
is due to timc reversal syrrirriet,ryin coinbination wihh spatial inversion (see
Appendix A). If warping i s neglected the two bands Era(k)can he described
by isotropic effective mass# m$. 'I'hey are negative because of the maximum
at .'I Extracting the negative sign, we define the positive effective masses
rnEh and m.G1of, respectively, heavy and light holeb;: setting m:,,= - m ; , and
= - m l . For the heavy and light hole valence bands EF8(k) G E,h(k)
and E&(k) = E,,,(k): we h a w
ntl
(2.410)
Numerical values for the effective hole masses of Si are given in Table 2.14.
A vivid view of the conduction and valence band structure in the vicinity
of the band edges is provided by the corresponding iso-enerm surfaces. These
217
are obtained by k i n g a specific energy and then drawing all k-vectors for
which the bands of equations (2.409) and (2.411) yield this energy value (see
Figure 2.30). For the conduction band, the iso-energy surfaces are ellipsoids
of revolution, pointing in the direction of the symmetry axis. Each of the six
star points is the center of such an ellipsoid. For the valence bands, within
the isotropic approximation, the iso-energy surfaces are concentric spheres
centered at I?. The inner sphere corresponds to the light hole band, and the
outer to the heavy hole band. In reality the valence bands are not isotropic
but have only cubic symmetry. Thus their iso-energy surfaces are warped, as
shown in Figure 2.30. If the conduction band is populated by electrons up to
a given energy, then the k-vectors of these electrons lie within the ellipsoid
of revolution corresponding to this energy. Accordingly, the k-values of the
holes lie within one of the two spheres or the two bodies bounded by the
warped surfaces.
218
Figure 2.31: Band structure of Ge. The energy unit is eV. (After Chelikowsky
and Cohen, 1.974.)
2.8.2
Germanium
219
Milate-
C-Blmd-
Energy
rial
Mimirnurn
Gap
hl
i
_
.
II
I
_
Si
1.1 1
0.044
0.54
0.15
GI2
0.66
0.29
0.3
0.044
GaAS
1.43
0.34
0.45
0.087
GaP
2.27
0.08
0.67
0.17
CdTe
1.43
0.7
0.4
0.1
HPn:
r
r
0.00
0.30
0.3
0.03
0.064
CdY
2.50
0.7
PbTe
0.30
0.24
0.028
0.31
0.022
0.33
0.07
0.1
0.11
0.24
Se
-
Z(?)
1.8
- -
r7f-level at
2.8.3
I.
111-V Semiconductors
For the two 111-V compound semiconductors GaAs and Gap, the band structures are shown in Figure 2.32. Their first B Z s are the same as that of Si
and Ge, since they have the same Bravais lattice. The full point group of
equivalent directions is T d for both materials. Spin-orbit interaction is important for the overall band structure of GaAs, while for GaP it may be
neglected. Thus the band structure of GaAs is described in terms of spinor
representations, and that of GaP in terms of ordinary representations. For
both materials, the valence band maximum lies at
Without spin and
r.
220
Figure 2.32:Band structure of 111-V semiconductors GaAs (top) and GaP (hot
tom). The energy unit is el. (AfLer Chelikousky rand Gohen, 1974.j
22 1
I
i
U,K
-61
rI
XU,K
rI
Figure 2.33: Band structure of IT-VI sernimnductom Cdk (left) and IIgTe (right).
The energy unit is el.. (After Chadi, Walter, Cohen, Patroff and Balkanski, 2972.)
spin-orbit interaction (the case of Gap), it belongs to the irreducible repre
sentation Il5 of the point group Ta which arises from the representation ,;?I
of Oh if the latter is taken as a representation of Td. With spin-orbit interaction (the caw of GtrAs), the r15 vulence band maximum splits into an upper
r8-level, and a lower I7-levd In the upper lg-level, the two 2-fold degenerate heavy and light hole bands mcrgr together as in the case of Ge and Si.
The lower l;.-level gives rise to the 8-fold degenerale spin-orbit-split band.
The %fold degenpracy of the valence hands at X,observed in the case of Si
and Ge, splits in CaAs and Gap. The reason or this is that the t,wn atoms
of the primitive unit cell are no longer identical, which means that the point
symmdry of equivalent directions is rduccd from Of, t o Td. Therefore, one
also has 2-dimensional spinor representations ( X s , X:) instead of only one
4-dimensional ( X , ) in the case of Si and Ge,and also 1-dimensional ordinary
representations (XI X2,X3, Xq) instead of only 2-dimensional ones ( X i ,
X z ! x3>Xq) in the case of Si and Ge (see Appendix A).
The conduction bend minirnnrn of GaAs occurs at the r-point and belongs to the spinor representation r7. Thus, CaAs has a dirwt energy gap.
One of the peculiarities of its condiiction band structure is the relative minimum at the L-point only about 0.4 eV above the absolute minimum atr r.
In the case of Gap, the conduction band minimum occurs at X and belongs
to the representation XI. Since the valence band maximum resides at I,
GaP is an indirect semiconductor.
2.8.4
11-VI semiconductors
222
8
6
L
2
a
-2
lxc,
L
223
I
4
K.U
Figure 2.35: Band structure of PbTe. The energy unit is eV. (After Martinez,
Schliiter and Cohen, 1975.)
the conduction band. Since it is degenerate with the uppermost valence band
at I', the fundamental energy gap is zero. According to the general definitions
of Chapter 1, this means that HgTe is no longer a semiconductor, because
in semiconductors the valence and conduction bands must be separated by
a finite energy gap. It is more fitting to term HgTe a semimetal.
As an example of a 11-VI semiconductor which does not exhibit the
zincblende but rather the wurtzite structure, we chose the hexagonal CdS
(see Figure 2.34). The first B Z is that of the hexagonal Bravais lattice,
which is shown along with its symmetry points in Figure 2.12. CdS has a
direct energy gap at r. Furthermore, the valence band is additionally split
off at r as compared to the cubic materials of diamond and zincblende structure, because of the hexagonal deformation of CdS. Thus, the valence band
is composed of 3 bands, namely r7-band which is separated from the other
two by spin-orbit interaction, and the two bands r6 and I?7 into which the
rg-band decomposes under the hexagonal deformation. The latter effect is
called crystal field splitting (see the right part of Figure 2.34).
224
12
8
4
0
-4
-8
1 - z
Figure 2.36: Band structure of Te (left) and Se (right). The energy unit is eV.
(After Maurhke, 1 g71; Stuff, Maachke and Laude, i973.)
2.8.5
IV-VI semiconductors
The band structures of tellurium and selenium are shown in Figure 2.36.
Both materials have hexagonal Bravais lattices. Therefore, they have the
same first B Z s as the hexagonal CdS. In the case of Te. both the valence
band maximum and the conduction band minimum occur at the H-point of
the first B Z . The material is therefore direct. Se has an indirect gap, the
two edges both lie outside of I-, that of the conduction band is probably at
2 , and that of the valence band is probably at H.
22 5
Chapter 3
Electronic structure of
semiconductor crystals with
perturbations
Real semiconductor crystals are imperfect. They differ from ideal crystals
in their atomic structure as they contain structural defects, and their chemical cornpodmn is not exactly lhal of the correspondiug ideal crystals. It is,
however, not the atomic structure which provides real semiconductor crystals
with special scientific and technological importance. but their peculiar electronic structure that is manifested in pronounced macroscopic effmts such
as. for example, the increase of electrical conductirity by orders of magnitude. With some justification one can even say that the particular scientific
and technological importance of semiconductors rests on the peculiarities of
the electronic structure of amperfeet semiconductor crystals. Nevertheless,
one must also deal with the atomic structure of imperfect semiconductor
crystals since that is, ultimately, the source of the particular nature of their
electronic structure. This will be done in section 3.1. The one-electron
Schrodinger equation of imperfect semiconductor crystals will be derived in
section 3.2. - 4 important
~
theoretical method or its solution, the so-called
effective mass theory, will be developed in section 3.3. This equation allows
one to determine the effects of perturbations whose interaction potentials
with electrons are slowly varying over the atomic length scale. Interaction
potentials of this kind apply to many perturbations of atomic structure, in
particular to the so-called shallow centers which will be treated in section
3.4. There are. however. also other perturbations. like the so-called deep
centers, to be dealt with in section 3.5, in which potentials exhibit considerable change over the atomic length scale. They cannot be treated by means
of effective mass theory, but require other methods. This is also true for
clean semiconductor surfaces considered in section 3.6.
3.1
The reference system for the description of the atomic structure of a real
crystal is the ideal crystal. As discussed in Chapter I, the latter may be
characterized as follows: All regular sites
Ri
rial
+ r2a2 + 7'3a3 + 6
(3.1)
defined by the crystal structure, are occupied by atoms of the 'correct' chemical species, and other sites are empty. In a real crystal, this characterization
holds for the vast majority of regular and irregular sites - providing the justification to speak of a crystal at all, albeit a perturbed one. However, there
is deviation from the ideal occupation at some of the regular and irregular
sites. In this sense. one then refers to the crystal as either perturbed or real.
3.1.1
Classification of p e r t u r b a t i o n s
227
all regular siles are occupied by atoms, some sites remain empty. Structural
perturbations are also called strurtural & f w h or simply dpferts. The third
case is the most general one - regular or irregular sites of the crystal are
partially occupied by rhrrnirnlly wrong atoms, end also the correct atoms
partially occupy wrong sites.
Deviations from an ideal crystal may, on the other hand, be distinguished
according to the macroscopic extension of the perturbations. A perturbation
that is limited to one or a few neighboring regular or irregular crystal sites
is called 0-dimensional or a poznt perturbatzon. If the perturbation extends
over sites located on a line or a planc, it is referred to as a 1-dimensional
or lzne perturbatton and a 2-dimensional or planP perturbatton, rpspert ively,
Combining the two classification schemes, one may refer to structural point
or line perturbations, compositional point perturbations etc. The dimensions in the second classification scheme apply to the microscopic core of
the perturbation. There may be smaller perturbations induced by that core
which extend in three dimensions. Examples are charged impurity atoms,
which, due to their long-range Coulomb forces, change the potential energy
of an electron even over distances large compared to the lattice constant.
Below we will characterize the various perturbations in more detail, starting with point perturbations.
3.1.2
Point perturbations
In this subsection, we describe the most important compositional and structural point perturbations of semiconductor crystals. An illustration is given
in Figure 3.1.
(1) We begin with an impurity atom on a regular crystal site. Since the
impurity atom substitutes an atom of the host crystal (see Figure 3.lb)
it is referred to as a substztutzonal vrnpuraty. Examples of substitutional
impurities are a phosphorus atom in Si on a Si-site, or a sulphur atom in
GaAs on a As-site. To avoid a somewhat cumbersornc description, in the first
case one uses the symhol S I : P , and in the second, the ~ynibolGaAs ; S A ~ .
As a rule, impurity atoms which are chemically similar to atoms of the host
crystal, are incorporated substitutionally. For this reason, many elements
of the mein group of the periodic table, if added l o group-IV elemental
semiconductors as well as binary 111-V and 11-VI-compound semiconductors,
form substitutional impurities. The substitutional incorporation, in most
cases, occurs on that lattice site which corresponds to the chemically most
similar of the two atonis in the binary compound semiconductor. lherefore,
the doping of GaAs with S leads to the above mentioned point perturbation
C a A s : S A (and
~ not GaAs : S G ~ )and
,
the doping of GaAs with Si leads to
G a A s : Sica (and not GaAs : S ~ A ~ ) .
(2) An unoccupied regular crystal site is called a vncuncy, as depicted in
Figure 3. Ic. Tn semiconductors made of binary chemical compoiinds, one
has to distinguish between cation and anion vacancies, as shown Figure
3 . 1 ~ .Vacancies occur in all important semiconductor crystals; the general
symbol is V. The vacancy in Si is denoted by S i : c': the cation vacancy in
C h 4 s by G a A s : T;&, and the anion vacancy by GaAs : VA*.
(3)If the impurity atom does not occupy a repular crystal site but a site
between regdar ones, one has an interstitial ampurity atom (Figure 3.16).
In order for an impurity atom to stay at an interstfitrialsite, it must have
sufficiently low energy there. It is quite clear that this will be satisfied for
interstitial sites which either haw high local symmetry or which lie on a
bond between two at'orns. In thc latter case t,he crystal has bond centered
interstitiah. The high symmetry interstitial sites in t e t r a h d a l seniiconductors may be such with tetrahcdral local symmetry in the neighborhood of
the cation or of the anion (for group IV elemental semiconductors the latter
distinction is void, of course). Moreover, there are high symmetry sites with
hcxagonal symmetry. One refers to these7 respectively, as f e t v u h c h l and
hexagonal interatitiak The incorporation of impurity atoms on interstitial
crystal sites is especially likely when the impurity atom deviates relatively
strongly from the atoms of the host crystal as! for example, in t,he case of
transition metal atoms in semiconductors composrd of elements of the main
groups. The general symbol for an interstitial is I . An interstitid Fe atom
in Si on a tetrahedral site is denoted by Si : I;,.
(4) If a chemically 'correct' atom of the crystal occupics an interstitial site
rather than a regular one, one has a self-interstitial (as shown in Figure
3.le). In order for such a structural point defect to develop in a crystal,
there must h e enough space between the host ntmns, i.e., the crystal should
not be packed too densely. This happens, for example, in the case of tatrahedral semiconductors, particularly in Si and Ge which have purely covalent
b ondinE.
( 5 ) If,a crystal consists of two different chemical elements, then an atom of
the h s t may occupy a regular site of t h e sccond, and vice vcrsa Such point
perturbations are called antisite defects, as i l h s t r a t d in Figure 3 . 1 In
~ the
case of CiaAs, for example, a Ga atom may be located ttt an h - s i t e this is
called B I ~As-antisite defect,, and tla As atom may occupy a Cia-site - this is
called a Ga-antisite defect. The symbols are As& for the As-antisit>edefect,
and G U Afor
~ the Ga-antisite defect.
~
Interstitials, vacancies and antisite defects are structural point perlurbations, or point defects. The compositional point perturbations, i.e. the
229
Ideal crystal
A -Atom
0 B-Atom
Impurity atom
SA
SB
Vacancy (V)
Interstitial ( I 1
Interst. impurlty
A,-Antisite
___
B,-Antisite
e)
Figure 3.1: Illustration of the most important point perturbations in semiconductors using the example of a crystal with two atoms per unit cell of the same chemical element (left-hand side) and different chemical elements (middle and right-hand
side).
Table 3.1: Electron corifiguratiori of main group elements. In the rightrnost column
the respective closed shells are indicated.
231.
Iron m o w
ziSc
=Ti
3d4~3~ 3d248'
23V
3d34s2
z4Cr
3d5&
wMn
3d"4s2
mF'e
3d'4s2
z7cO
Ni
3d74s2
3d64aa
3a23p6
era1 they are the not completely occupied oms. Because of the relation
R 2 I
1 between the main quantum number ?a and the angular mornenturn quantum numbe1 E. the d-shells (I - 2) are possible only for n >_ 3.
the f-shells ( 1 - 3) only for n 2 4, etc. Accordingly, one has the shells
36,&, 4f, 5d, Sf, ;1g, 6 4 Sf, 69, fih ptc. Since among the first 98 elemmts of
Table 3.3: Electron configuration of rare earths and actinides. In the right column
the respective closed shells are indicated.
Actinides
the periodic tablc, however, the 5g-shell already remains unoccupied, only
d- and f-shells are to be considered, namely the d-shells 3 d , 44 5 d , 6 d , and
the f-shells 4f and 5 f . The filling of the 3d-, 4d- and 5d-shells takes place
in the series of transztzon metals (together with the Elling of the 4s-, 5s- and
6s-shells). Among the transition metals, one distinguishrs the iron groixp in
which the 3d- and 3s-shells are being filled, the palladium group in which
the samc happens with the 4d- and 4.s-shells, and the platinum group where
the 5d- and 5s-shells are being Elled. The 4f-shells are being filled in the
rare earth elements, and the 5f-shells in the actinides. In comparison with
the s- and p-orbitals, the d - and f-orbitals have a smaller extension in spacc,
they lie mostly within the s- and p-shells of the same main quantum number
7 ~ .
When there are two vacancies, the mechanism for the formation of bound
pairs can also be easily undertitood the (internal) surfacc of thc crystal i s
reduced if two previously isolated vacancies move together to occupy neigh-
233
boring cryYtal sites. One caIls this associate a ddvacancy. Analogous atatements holds for the association of more than two vacancies, which are called
multrvarunczes.
Frenkel defects
If, in a crystal, an atom moves froin a regular site to an interstitial site, then
it h v e x behind a vacancy which attracts the interstitial. Thus a defect pair
is formed in this process which consists of a self-interstitial and a vacancy.
It is called a Freibel defprt.
There arr important puint perturbation complexes which occur only in
a specific material or matrrial group. We now consider some examples for
si snd G A S .
Point perturbation complexes in Si
Latticc relaxation
a point perturbation differ frorri those in lhr ideal crystal. They are non-xr?rol in general, at. the
ideal crystal sites. Thus the atoms are forced t.o move t.u new equilibrium
sitcs. This is known as lattice relamtion (in Figure 3.1 this effect is omitted). The new rqiiilihriiim sites are initially tinknown. In principle, they
can be determined by means uf atomic structure calculatious for t h e perturbed crystal. These have t o be performed sirnultanmusly with calculations
of the elect,ronic structure, just as is done in self-consistmi. calcidations of
the electronic and atomic st.ructwes of ideal crystals described in Chapter
2, section 2.2. HoweveT, there is an impwtant difference between the two
cases. For ideal crystals, the calculation of at,omic structure may be avoided
since, for t,he latter, complete and reliable experimental data are available.
However, in regard t,o the atomic structure of a crystal in the vicinity of a
point perturbdion, in many cases, hardly more t,hari t.he symmebry is known
from experiment. Thus the self-consktent calculation of the electronic and
at.omir:sbriictures must actually be carried out. if lattice relaxat,ion becomes
important.
Experimental information concerning the symmetry of lattice relaxation
may be derivrd from observation of t,he Jahn-TelEev efleet. This eflect results
in splitting of the degenerate energy levels of a point perturbation due to
a symmetry-lowcring latt.ice relaxation. In the case of a varaiicy: for example, the lattice rehxation can reduce the original tetrahedral symmetry T d
to tetragonal symmetv D2d. Such spontaneous symmetry breaking occurs
when it leads to lowering of the total energy of the crystal. This mag; happen
when, in the unrelaxed state, i.he point perturbation has a c1rgenerat.e level
which is only parhially occupied. First. of all, the degenerate level will split off
due to the symmetry lowering relaxation. According to perturbation theory,
235
this splitting proceeds such that the center of gravity of the levels remains
unchanged. Thus, along with levels shifted up, there are also levels which
are shifted down. If only the latter are predominantly occupied, then the
energy of the electrons localized at the point perturbation decreases. This
energy reduction can compensate the increase in total energy due to the
removal of atoms from their equilibrium sites. If this happens the relaxation
is energetically favorable and will take place spontaneously.
In the case of the Jahn-Teller effect, the displacements of atoms are of
the order of magnitude of one tenth of an Angstrom, i.e. they are relatively
small. Larger displacements, of the order of magnitude of one Angstrom,
are observed at point perturbations for which different atomic structures are
stable, depending on external conditions as, for example, the position of the
Fermi level. This phenomenon is observed at the D X centers in GaAs and
(Ga, A1)As mentioned above (see section 3.5 for more detail).
3.1.3
J!
n!(J - It)!
1,
follows from this relation. The enthalpy of formatiult HJ or B vacancy typi d l y amounts to several c l ' . Assuming H f = 2 5 el/ and T - 1400 K
(roughly the crystalhdation temperature of Si), one gels the value n =
J x 5 x 10-l' from expression (3.5). Because J - s x
cm-' (for Si),
a varancy conrmtration of about 1013 crriL3 follows. Similar concentrations
are ohtnincd for other point deferts. Thew v a h s refer. hy thcir derivation,
to the high temperatures assumed above, at which the c q s t d is grown.
Cooling down to room temperature often does not, however, substantially
change the defect concentrations The defects are frozrii in. From this one
may conclude that the presence of a considerable number of point defects in
237
crystals is essentially unavoidable. The Si singlecrystal bars used in microelectronics its a material base, in fact, contain vacancies and interstitials in
~rn-~.
the above estimated concentration of about
Incorporation of impiirity a t o m s
Chemical point perturbations are not as unavoidable as structural defects.
Of course, the laws of thermodynamics in this case also act in a direction
which leads away fro111 the absolute chemical purity of the ideal crystal.
This tendency, however, can only be effective to the extent that chemically
'wrong' atoms are available in the raw materials and the chamber in which
the crystal is grown. The state of a mure or less homogeneous distribution
of impurity atoms in the whole chamber, i.e. the state in which the growing
crystal also contains impurity atoms, has smaller Gibbs free energy than
that of the chemically absolutely pure crystal with all the impurity atoms
confined to the remainder of the chamber. However, by cleaning the raw
materials and the growth chamber in thermodynamic terms by an extraction of entropy - the number of chemically 'wrong' atoms can be reduced.
Theoretically, no limit exists for the degree of purity achievable, practically
such td limit is set by the cleaning expenses, of course. Therefore, one reduces the concentration of impurity atoms only to a level which is absolutely
necessary for the application iinder consideration.
A boundary for the achievable concent,ration of impurity atoms in a crystal exists, as a rule, in the form of an upper limit. Under thermodynamic
equilibriiim conditions it is given by the solubility of the corresponding elements in the semiconductor crystal. The solubility increases with rising
temperature acrording to a law which is similar to that for the vacancy concentration of equation (3.5), providcd H f is underst,ood as the formation
enthalpy of the impurity. The latter depends on the chemical nature of the
impurity atom and host crystal, and it also differs for incorporations at differciit crystal sites like substitutional or interstitial ones. High solubility values
are achieved, as a rule, if the element of lhe impurity atom is chemically
similar to at least one of the elements of the crystal. Incorporation at substitutional sites is preferred in this case. If the crystal consists of elements of
tho main groups, which i s the case for the majority of known semiconductor
materials, one has relatively high solubility, especially for impurity atoms of
these groups themselves. In the extreme rase, an alloy will be formed with
t,he impurity atoms. The dissolving of A1 in GaAs, for example, results in
a (Gtt, Al)As alloy. In such cases, t,he solubility equals the concentration of
the host crystal atoms, i.e.
crn-'. For P in Si or Ge in G A S , solubility
values of lo2' c n ~ -are
~ reached, which lie just a little below. For transition metal elements, which d i k r st,rongly chemically from the main group
elements, t>hesolubilities in semiconductors made of such elements are sub~
I 014
cm-3.
239
3.1.4
impurity atom
Ga
Au
Cu
24 1
3.2
VWTt(X)
+ Vg;t(X).
= VL?'t(X) f V,"'"x)
(3.7)
With this, thc one-electron Schrodinger equation for the wave function $ l V ( x )
of an electron of the perturbed crystal reads
3.2.1
V,(x).
(3.9)
-+
0 for
Ix I
00.
(3.10)
The point perturbations listed in section 3.1 differ as to how fast this decay
proceeds.
We consider, initially, the case of a substitutional impurity atom whose
core contains almost the same numbers of protons and neutrons as the core
of the host atom. With this rqiiirement, the two atomic cores differ mainly
through their different charges. One refers to them as isocoric impurity
atoms. This case occurs, for example, if a P atom with only one additional
proton and one addit,ional neutron substitutes a host atom of a Si-crystal (as
illustrated in Figure 3.3). We denote the number of (positive) elementary
charges of the core of the impurity atom by Z r , and the number of (positive)
elementary charges of the core of the host atom by Z H . Because of charge
neutrality of the individual atoms, ZI and ZH are simultaneously also the
numbers of valence electrons of these atoms. The potential energy of a
valence electron of an impurity atom of the type described differs, above all,
by the change of Coulomb potential from the potential energy of an electron
of the host atom. If one considers the cores to be point-like and neglects
spatial dispersion of dielectric screening, then one has, approximately,
243
(3.11)
where E is the static dielectric constant of the semiconductor material. Deviations from this perturbation potential are to be expected in close proximity
to the impurity atom, on the one hand, bwansc there the spatial dispersion
of dielwtric screening cannot be neglected, and. on the other hand, because
the core of thp impurity atom diffcrs not only by its charge number from
that of the host atom, but also in other respects. In fact each core has a
iinite spatial extension because of its spatially extended r o w electrons (the
nucleus of the core may be treated as a point charge). In close proximity
to the center, the core electrons give rise to additional forces, heside t h e
elerlrostatic point charge forces already counted in [3.11). These additional
forces are caused by higher moments of the core electron charge distribution,
and also by exchange and correlation effects. If the cores of the host and
impurity atoms differ, whch always happens iI the atoms arc not identical,
the additional core forces will also differ. Both effects, the spatial dispersion
of screening and the differpiirps of additioiiat rorc forces, are jointly termed
central cell correcfzonu.
The perturbation potential (3.11) is distinguished in that. with the exclusion of a close environment of the impurity atom, its variation over a
primitive unit cell is relatively small. which implies that it remains significant over a relatively large distances from the impurity atom compared to the
lattice constant. In this context, it is called a long-range potential. We may
state. therefore, that isocoric impurity atoms are approximately described
by smooth or long-range perturbation potentials.
The potentials which apply for non-isocoric impurity atoms are different.
Consider first the case in which the charge eZf of the impurity atom core
equals the charge eZH of the host atom core. This does not necessarily
mean that the nuclei of the two atoms must have the same numbers of
core electrons, because different numbers of protons of the atomic nuclei
may compensate the difference of core electron charge. The requirement of
equal numbers of core charges implies, however, that the numbers of valence
electrons of the two atoms must coincide - as described by the term asovalent
impuritr atoms. Examples for non-isocoric isovalent impurity atoms are
C atoms in Si- or Ge-crystals, or N atoms substituting P atoms in Gap.
The potential energy of an electron at a non-isocoric. isovalent impurity
atom differs from the potential energy at the host atom not just by the
screened Coulomb potential (3.10). but by the different core electron shells
in the two cases. The perturbation potential b:(x) accordingly contains only
electrostatic contributions of higher moments of the core electron charge
distribution difference, as well as exchange and correlation contributions
due to this difference. Both kinds of contributions decay more rapidly with
- _ _ _ Si_ Si_ _
Si
v;
Si
Si
Si
Si
v, pert -v,
v
Figure 3.3: Illustration of thc origin of long-range (left-hand side) and short-range
(right-haritl side) core perturbation potentials.
increasing distance fiorn the center than does tlic Coulomb potential of a
point charge, they are therefore described as short runqe. These arc also the
potential contributions, whicb in the context of the ibocoric impurity atoms
abovc, gave rise to central cell corrections.
The exact &termination of the pertrirhatiori potential is a problem which,
like the determination of the periodic core potential of an ideal crystal, can
ultimately be solved only by numerical calculations. These show, in fact,
that the pet turbation potentials of isovdlenl impurity atoms differ horn zero
substantially only over a distance of a few lattice constants. Consequently,
we have
v(;(x)
v'(x) 0
(0
for I x
I 5
a f e u lattice constants
Ix1
(3.12)
245
Finally we foiego the requirement that the core charges of the two nonisocoric atoms be the same. Examples of this most general case are Cd
atoms with their 2-fold core charge in a crystal of S i atoms with their 4fold core charge, or Sn atoms with their Cfold core charge on a G&site
in GaAs, thereby substituting a triply positively charged host atom core.
In these cases, the perturbation potential represents a superpobition of the
screened Coulomb potential (3.11), which accounts for the different core
charges, and a short-range potential which takes accoimt of all remaining
differences bctween the cores d5 well as aspwts of the spatial dispersion
of screening beyond those accounted for in equation (3.11). The effects
of the two potential contributions are not independent of each other. The
same bhort-range potential has a greater effect if the Coulomb potential, due
to a larger core charge difference between the impurity and host atoms, is
stronger. This comes about because the stronger Coulomb potential pulls the
electrons closer to the core, where the the short range potential is essential.
Just like the elwtron-core inteiaction potential, the Hartree potential
V,(x) and the exchangecorrelation potential V,,(x) also undergo changes in the prrsence of point perturbationb. Below, we describe these changes
for the Hartrre potential V I y t ( ( x )and the exchange potential V,((x) of
the liartree-Fock approximation. Similar results can be derived for the
exc~iaiig~rorrt~ation
potential vzt(x)of the LDA mettiod.
3.2.2
Hartree potential
The Hartree potential V r t ( x z ) of the r-th electron is, as before (see relation
2.49), given by the expression
(3.13)
However, the oneparticle states $,(x2) here are not those of the ideal crystal,
but those of the pertiirbed crystal and the summation runs over the o n e
particle states u i occupied in the ground state of the pcrturbwl crystal.
From the physical point of view it is quite clear that the prrtiirbed rrystal ~ H two
S
kinds of stationary one-particle states; first, those with energy
eigenvalues which were already allowed for the ideal crystal, i.e. those within
the bands and, secondly, those with F I I C I eigenvalues
~~
within the energy gap
between the valence and conduction bands. The states of the first kind represent pure Bloch states only in zero-order approximation with respect to
the perturbation potential, while, in higher approximations, superpositions
vrtd
(3.14)
with
extd
V g t d ( x i )= e2
/ d3x'
k J
and
(3.16)
Here we assume that the electron i on which the potential acts is located at
the center. This assumption will also be maintained below.
The localized part VhOC(xi)of the Hartree potential depends on the number n of electrons at the center. There are two sources of this dependence.
First, the number of electrons which enter changes with n, Vfi"(xi)becomes
more repulsive if n is large, and less so if n is small. Second, the wave
functions of localized occupied states vLOC over which the sum in (3.16) is
extended, depend on n. They become more localized if electrons are removed
from the center, and less localized if electrons are added. One says that the
wavefunctions relax.
For reasons similar to those causing the localized part VhOC(xi)of the
Hartree potential of the perturbed crystal to depend on the number n of
electrons at the center, the extended part V;&(xi) also differs from that
of the ideal crystal, firstly, because the number N of extended electrons
is decreased by the localized ones and, secondly, because the states of the
extended electrons relax. The first change causes a relative potential correction of order of magnitude 1 / N and thus can be omitted (a comparable
'
247
(3.17)
where, by definition, VA(xi) is given by the expression
Exchange potential
As we know from section 2.2, the exchange potential describes the Coulomb
interaction with the exchange hole which occurs because two electrons of the
same spin cannot reside at the same position. For the localized electrons of
the perturbed crystal, the positional uncertainty is smaller than that of the
extended ones. Accordingly, their exchange interaction will be stronger than
that of the extended electrons. It is therefore again expedient to decompose
the entire exchange potential V,Pt(x,),which an electron a localized at the
center feels, into the two parts V,..td(&) and l r ~ ( x of,
~ )respectively, the
(N - n ) extended and n localized electrons. We replace the extended part
by the exchange potential V x ( x i ) of the ideal crystal and simultaneously
substitute the localized part by an effective exchange potential Vi(xz).With
this replacement, we have
+ Vfi(X2).
v,pt(X,f
= VX(Xi)
(3.19)
(3.20)
where the summation over k inrlirdes only particles whose spin W k equals the
spin LT? of particle i.
Substituting trhe three potential parts (3.91, (3.17) and (3.19) into the
oneelcctron Schrbdinger equation of the perturbed crystal, the various terms
may bc arranged such that the effective oneelectron potential E'(x} of the
ideal crystal (2.60)occurs. The equation reads
(3.22)
The energy correction due to the Hartrcc potential for an electron z in localized stateqVt i s given in first order perturbation theory b - ~the rxpectation
valri~($V, 1 VAw I vv,).[!ring (3.181, this expression may be written as
249
(3.23)
We extract the factor (E-'(x, x')/ I x-x' 1) from the x'-integral by evaluating
it at an average position X in place of x'. The remaining x'-integral yields 1
because of the normalization of the wavefunction @,,(x'). The integral over
X, multiplied by e 2 , will be denoted by Uy,,
i.e. we set
(3.24)
With this, the expectation value of the perturbation of the Hartree potential
becomes
Next we will calculate the corresponding energy correction due to the perturbation of exchange potential V%{x)of (3.19). The latter depends on the spin
of the electron because. contrary to the ideal crystal case where the numbers of electrons with 'spin-up' and 'spin-down' are equal in ground state,
the corresponding numbers n~ and nl of electrons localized at the center can
differ. In terms of the total number n = n~+nl and the total spin projection
M S = (1/2)(nT - "1) of the localized electrons, the two partial numbers ny
and nl may be written in the form
1
1
(3.26)
2 h l s ) , nl = -(n - 211iIS) .
2
2
For the expectation value (&,t 1 1
5 I iu,)
of the perturbation of the exchange
potential, equation (3.20) yields the expression
"1 = - ( n
I ~ I'+v,)
i
( ~ v ,
-(no, - I ) J ,
f o r not 2 1.
(3.27)
(3.28)
is so-called the exchange integrcab The dependence of this integral on the
orbital state v' = Uk has been ignored in expression (3.28).
25 1
longer true if the bonding is covalent or partially covalent. Then some of the
bonding electrons remain at the impurity atom, and the true number of its
elementary charges differs from V. The n valence electrons of the impurity
atom, which remain after the departure of the V electrons into the bonds
with the crystal, and are available for occupation of the localized levels in the
energy gap, are sometimes called active electrons. In the case of an impurity
atom with Vi valence electrons one has n = Vi - V active electrons.
So far, the charge state of the impurity atom prior to its incorporation
into the crystal was taken to be neutral. However, positively or negatively
charged ions can also be introduced into the crystal, and atoms introduced
in a neutral charge state can have electrons removed or added within the
crystal. Similar statements hold for structural defects. If a particular perturbation center X is not neutral for the lack of Q electrons, in the sense
just specified, one says that the charge state of X is Q and writes X ( Q )
(where negative Q mean surplus electrons). The charge state should not be
confused with the oxidation state. In the purely ionic case, the distinction
between the two can be expressed most easily: the charge state counts the
elementary charges of the atom outside the crystal, and the oxidation state
counts its elementary charges inside. Generally, the oxidation state of a center in charge state Q will be X('+Q)+ if the oxidation state of the neutral
center is V . The reason is that, in this case, V Q valence electrons are not
available for occupation of localized states. The number n of electrons which
are available, i.e. the number of active electrons, amounts to Vi - (V Q )
if V,, as before, denotes the number of valence electrons of the neutral impurity atom. The notations for the oxidation state and the charge state are
summed up in the common symbol X('+Q)+(Q+).
A simply ionized sulphur atom in a Si-crystal, for example, is denoted by
S5+(l+).Of the 6 valence electrons of the S atom, 6 - 5 = 1 are available
for occupation of localized energy levels in the gap, instead of 2 electrons in
the case of the neutral center S4+(O+). Transition metal (TM) atoms can
be installed in Si- and other tetrahedral semiconductor crystals both substitutionally as well as interstitially. The oxidation states V and, therefore
also the numbers n of electrons at the impurity atom, are different in the
two cases. For substitutional TM atoms, V equals the number of electrons
which are left at the atom after it is bound to the crystal. Interstitial impurity atoms are only weakly bound, i.e. the number V of electrons of the
T M atom which occupy bonding valence states, is almost zero. The oxidation state is therefore T M o f . Oxidation and charge states coincide in this
case, and the number n of electrons at a TM atom equals the number of its
valence electrons. Interstitial Fe atoms in Si are found in the oxidation state
F e o f , substitutional in the oxidation state Fe4+. In the first case, n = 8
(six electrons in 3d-orbitals and two in 4s-orbitals), and in the second case
the value of n = 4.
With the background analysis set forth above, we are now sufFiciently p r e
pared to address t h e solution of the one-electron Schriidinger equation for the
crystal with a point pcrturbation. Which solution methods can be applied
with succcss depends decisively on whethe1 lhe core yrrt iulmlion potential
is long- or short-range. Of course, t h r lattice translation symmetry, which
drastically simplifirs the bolution of t h ~Schriidinger eqiiation hi the casr
of an ideal crystal is perturbed by both kinds of potentials. but only for
short-rangepotentials in an e s s e a t d mannet-. For long-range potentials the
dec iations from lattice trauslation symmetry are relatively weak. In this
case the Scfirodinger equation, in a certain sense, may be decomposed into
two quations, one for the periodic potential of the ideal crystal, which was
solved previously, and one for the perturbation potential. T he latter is called
the LPffectivemass equation, which we will derive in the following section.
3.3
ure start with the Schrodinger equation (3.8) of the perturbed crystal in
the form (3.21). The most, iiriport.ant assumpt.ion we will make relat.es to
Ihe core part V ~ ( X of
) the pert,urhatiozt potential V A T t ( x )in this equation.
This potential part is supposed to be smooth on the atomic length scale. In
conjunction wit,h t,his wxmption, point pcrtiirhatioiis with shos-t-range core
pert.iirbation pot.entials Vf(x)are ruled out from the very beginning,because
these caiinot be considered to I c smooth. Point perturbations with Coihmbic core perturbation potentials are still allowed, although the Coilonib form
of this potential is not necessary for the derivation of t.he effective mass equation. The other parts of thr t o l d perturbation potential V&.t(xj of eqiiation
(3.21)!i.~.
the pdurbations of the Hartree and exchange potentials l7L(x)
and ViCx), arc automat,ically smooth if V&(x) has this property because
these potential parts are dctcrmined by t.he solutions qVof Ychrodinger equation (3.21) which are smooth if L;(x) is also. The wwelunction dependence
of Vh(x) and Li(x) requires self-consistent solutions of the Schriidinger
equation. In this section we do not intend to solve this quabion explicitly
hut iclther to t,ransform it into another equation, the cffective mass equation? aliith can be solved more easily. Althoiigh the self-consistency demand
is transferred to the effective mass equation. it does not iulerfere with its
derivation. For the latter? Vh(x) and Vk(x) may be treated as smooth external potdialii which, together with V,(x)! add lip l o form a smooth total
perturbation potential i(x).
Far hhe derivation which follows, the POt.ential need not be t.he smooth perturbation potential VieVt(x)of a point
perturbelion, any smo0t.hpotential U ( x ) is allowed. This is important innsmuch as it becomes possible in this way to utilize the effective mass equalion
not only 01 point perturbations with Coulombir core perturbation poten-
253
tials, hilt also for macroscopic perturbations, such 8s that associatd with
an external electric Geld, which are smooth on the atomic length scale by
definition.
The goal of the following consideralions is l o simplify the oneelectron
Schriidingcr eyiiation
(3.29)
for t,he perturbed crystal in stages, so that., ultimately: a mare easily solvable
equation, namely the effective mass equation, results. In this mat,ter, we
will employ t,he fact that, in the vicinity of critical points of a certain band
v , the energy of an electron of an ideal crystal depends quadratically on
quasi-wavevector k, i.e. in the same way that the energy of a free electron
depends on its momentum. However, the free electron mass 7 n is replaced
by the effective mms rnc of the particular band and critical point chosen.
The e&ct.ivc mass includes the effects of interaction of the electron with the
periodic potential of crystal. Therefore, it is generally a tensor, and if il can
be reduced to a scalar maw, the value of the lat.ter generally differs horn the
mass of a free electron. Having in mind the effective mass description of the
band energy versus quasi-wavevector k relat,ion, one may suspect that the
influence of a perturbation potential V(X) on an &&on of the cryisla1 can
be calculated in an appr0ximat.e way as follows: One eliminates the periodic
potential of t h e oiic cltrtron Schrdinge~equation for the perturbed crystal!
while simultaneously replacing the free mass m in the kinetic energ); operator
by the effective mass m:. The resulting Schrodinger equation represents
the one-bend e#ective mass eyuatian in its simplest form It can be solved
much more easily than the original Schrodinger equation? which includes
the periodic crystal potential ill explicit form. For a siibstitutioiial P atom
in Si, for example. t.he effective mass equation is twsentinlly the same a8
the Schrodiuger equation for the hydrogen atom whose solutions are already
known.
The procedure described above needs, of COZITSC, further justificat.ion. To
provide this, quest,ions have to bc addressed which have been left, open so
far, for example, how the wavefunction of the e f l d i v e mass equation reiates
to the wavefunction of the original Schrodinger equation, and the matter of
what conditions must be placed on the perturbation potential V(x) and the
wavefunction 7;j(x)in order for bhe effectivp mass equation to be applicable.
3.3.1
To address these questions and derive the effective mass equation for a single
band, we rewrite the Schrodinger equdion (3.29) in the Bloch representation,
expanding ~ ( xwith
) respect to the complete set of Bloch functions
whence
pvk(x),
(3.30)
with the expansion coefficients given by
(vk I
d3xlpXx)~(x).
(3.31)
Since the (pYk(x) are eigenfunctions of H with eigenvalue E,(k), the Schrodinger
equation (3.29) takes the following form:
[ E y ( k ) 6 v f y f 6 k k ~ (vklUjvk)] (JklV) = E ( v k l $ ) .
(3.32)
vk
To transform this equation into the effective mass equation, the assumption
made at the outset that U ( x ) should be a smooth potential, must be specified
further. This is done by assuming that the change of U ( x ) over a primitive
unit cell is small in comparison with the change of the periodic crystal potential t ( ~over
) such a cell. To formulate the condition for smoothness in
this sense quantitatively, we decompose U(x) in a Fourier series, using the
same notation introduced previously in Chapter 2. We have
(3.33)
with Fourier coefficients
(3.34)
The sum in (3.33) is extended over the entire infinite k-space, meaning over
all Brillouin zones. Smooth functions U ( x ) in the above sense have Fourier
coefficients (klU) which, for large k-vectors, more strictly speaking, for kvectors outside of the first BZ,are small compared to the Fourier coefficients
(kll.) of the periodic crysta1 potential V ( x ) . The latter components were
calculated in section 2.4. According to formula (2.161), they are non-zero
only if k is a reciprocal lattice vector K differing from zero. This means that
the smoothness condition for U ( x ) may be expressed its
for
<< I(KlV)l, K # 0 ,
for
Ikl outside f i r s t BZ
(3.35)
255
<< E g .
(3.36)
(3.37)
with F , ( k ) as a function which has yet to be determined. It turns out
that this requirement has no contradictions if the perturbation potential is
smooth.
The thzrd requirement refers to the function F,(k). It is assumed that
Fvo(k)differs from zero only for small k-vectors in the sense used in equation
(3.35).
Fourthly, it will be assumed that for small k-vectors, which according to
(3.35) have to be considered exclusively, the Bloch factors u W k ( x ) in the
Bloch functions p,k(x) can be approximately replaced by their values at
k=0:
U v o k ( X ) ZS
.mo(x).
(3.38)
In the following subsection 3.3.2 we will see that the two last requirements
can be justified when eigenstates of the perturbed crystal exist having energy
eigenvalues just above and below the edge of band vo, and if only these
eigenstates are considered. If, additionally, the band edge lies at k :
0,
the small energy deviations also correspond to small k-values, and relation
(3.38) holds approximately. The restriction of the location of the band edge
to k = 0 can be omitted, and the derivation procedure can also be applied to
band extrema
other than the center of the first B Z , the only modification
being the replacement, of the R X center k = 0 by the non-central critical
point k,as well BY of k by k - k, in the corresponding equations.
The above four conditions will now be used to simplify the Schrodinger
equation (3.32). Applying relation (3.38) the wavefunction (3.37) may be
written as
Tihe k-sum is the Fourier transform F,(x) of the function F,(k), whence
F,(x)
1
CFvo(k)eik.x.
(3.40)
& k
$44 = ~ , O ( X ) ~ Y l ( X ) .
(3.41)
The function F,(x) is termed the envelope functzon or, in short, the envelope. The envelope function, by definition, i s a bmooth function. Equation
(3.41) means that the truc wavefunction + ( x ) is obtained by enveloping the
rapidly oscillating Bloch lactor uvgo(x) with the smooth cnvelope function
Fuo ( X ) .
make
The two ronditions conccrning the smoothness of U(x) and Fvo(x)
it possible to represent the matrix element (uk(l/(vk)in the Schrodinger
equation (3.32) in a substantially simpler form. We rewrite this element
using thc Fourier representation (3.33) of U(x) and the product form of the
Bloch functions p v k ( x ) , obtaining
C .i(k+k-k)-R
R
= G3
Sklfkl-k,K.
(3.43)
C ( k - k + KIU)Ci$(K),
1
= -
(3.44)
(3.45)
257
has been introduced (0 = G3Ro). The Bloch components of the wavefunction @(x) in the Schrodinger equation (3.32), (vkl$) and (vklg), differ
from zero only for small k and k because of the smoothness of the envelope.
Therefore we need the matrix elements (vklUlvk) only for small k and
k. With regard to the smoothness requirement for the potential U(x), the
terms in expression (3.44) for (vklUlvk) yield no significant contributions
to the Schrodinger equation (3.32) if K # 0, so that
(vklUlvk)
-(k
- klU)C$t(O).
(3.46)
c,,,
kk (0) = .6
t,
(3.47)
(3.48)
With this relation, the decisive step in simplifying the Schrodinger equation
(3.32) for the perturbed crystal has been done - the coupling between the
different energy bands caused by the perturbation potential has been eliminated. The Schrodinger equation now decomposes into separate equations
for the various individual bands, and these equations can in fact be solved
by a one-band Ansatz of the form (3.37). The coupling between different
wavevectors remains in place. This can be processed relatively easily. We
employ all results achieved up to now in the Schrodinger equation (3.32),
obtaining the following relation for F,(k):
C &,,(k)Fuo(k)
(XIk) =
(3.50)
One can easily prove that this transformation is correct by expanding E,(k)
in a power series and by using the identity
U(x)Fm(x),
(3.52)
kk
E,(O)
h2
+ -(-iVx)2,
2mg
(3.54)
and
h2
[ L . d O ) - -a2
2m&
I U ( x ) F,(x) = E F , ( x ) .
(3.55)
With the derivation of this equation, initially conjectured and now verified,
it, is quite clear that the influence of the perturbation potential can be approximakly determined from an equation in which the periodic potential
no longer appears and the effective mass replaces the free electron mass.
Equation (3.55) is therefore the desired effective muss equation The eigenfunction Fvo(x) of this equation plays the role of an envelope function for
the Uloch factor uvoo(x)in equation (3.41) for the true wavefunction $J(x).
Equations (3.53) or (3.55) are also called envelope function equatioas in t,his
context .
The essential requirements involved in the derivation of the effective mass
equation were the smoothness of the perturbation potential, the smoothness
of the wavefunction and its composition of Bloch functions from only one
band, as well as the k-independence of the Bloch factors. These assumptions
are decisive, and are often not fulfilled. Short-range perturbation potentials,
for example, are not smooth. Nevertheless, the effective mass equation (3.55)
is a suitable instrument for the solution of a series of important problems
of solid state physics. Point perturbations with smooth potentials are only
one example of this. Other problems which can be solved with the help
of the effective mass equation include artificial superstructures in a crystal
and external electric fields. These will be treated later in section 3.7 and
3.8, respectively, Also, the Coulomb attraction between electrons and holes
- which results in the formation of excitons as pointed out in Chapter 2 -
259
3.3.2
The oneband effective mass equation in its general form (3.53), derived
in the preceding subsection, completely solves the eigenvalue problem for an
electron in a crystal in the presence of a smooth perturbation potential. The
practical use of this equation depmds, however, on the dispersion law E,(k)
of the band under consideration. In the vicinity of the minima or maxima
of non-degenerate bands of cubic crystals one has a purely parabolic and
isotropic k-dependence, and the effective mass equation is, as we have seen,
no more difficult to solve than an ordinary Schrodinger equation with an
external potential. This picture changes for bands which display degeneracy
in the extreme. As demonstrated in section 2.7 on k p-theory, one then
in general has non-parabolic and non-isotropic dispersion laws. This does
not involve a major difference if the one-band effective mash equation can be
solved in k-space, i.e. in the form of equation (3.49). IIowever, for various
reasons it can be necessary to transform the effective mass equation into coordinate space and to solve it there. This applies if basis functions other than
plane waves are better adjusted to the symmetry of the perturbation potential as, for example, the Coulomb potential of an impurity atom, or if no
perturbation potential is present, but the perturbation is introduced through
boundary conditions, such as in the case of artificial superstructures like superlattices and quantum wells. Because of the non-parabolic and anisotropic
dispersion laws in the case of degenerate bands, the effective mass equation
(3.53) in coordinate space is more complicated, in particular higher order
differentia1 operators occur which make solution of the eigenvalue problem
practically impossible.
A resolution of this situation is offered by k . p-perturbation theory. As
we have seen, the parabolic and anisotropic dispersion laws of degenerate
bands occur in this theory through diagonalization of the matrix of the
Hamiltonian with respect to a basis set of Bloch functions Ivk)', which are
exact up to the first order in the k . p-perturbation (see formula (2.339)).
The elements of this matrix are relatively simple linear and quadratic functions of the components of k. In the case of degenerate bands this suggests, therefore, not to take a onecomponent effective mass equation as
starting point, but a multi-component one which is obtained by writing the
Schrodinger equation for the perturbed crystal in the approximate Bloch
basis 1vk)l.
We will undertake this program now. Regarding the perturbation potential U ( x ) and the envelope function F ( x ) , we pose the same requirements
?clW=
(.kl$)(xtvk),
(3.56)
uk
and the Schrodinger equation (3.29) in this representation takes the form
(vklH
+ Ulvk)
(vkl$) = E(vk($).
(3.57)
uk
(3.59)
26 1
Non-degenerate bands
WTeassume that p(x) can be expressed in terms of the approximate Bloch
v g ? so
that
'(vkl@)= b,,F,(k).
(3.60)
$(x)
CFw(k)(xlvok)l.
(3.61)
where hi&8 are the elements of the reciprocal effective mass tensor according to formula (2.337). Using the Ansatz (3.59) for '(vklp) and expression
(3.631 for l(vk)H]vk)', the Schrodinger equation (3.57) yields
This equation is in agreement with the effective mass equation (3.53) if the
latter has E,,,(--zV) replaced by a parabolic expression in the components of
-iV,refraining, however, from the assumption of isotropy of the band structure. If the effective mass tensor reduces to a scalar quantity ( l / m ~ & ) 6 ~ 0 ,
then (3.65)yields the effective mass equation in the form (3.55). The derivation of the effective mass equation based on k . p perturbation theory thus
produces this equation automatically in parabolic approximation. In the earlier derivation of subsection 3.3.1, an effective mass equation was obtained
that did not yet contain this approximation, which was invoked only later.
The advantage of the derivation of the effective mass equation within the
framework of k * p-perturbation theory is clearly manifested when degenerate bands are considered, which we will address next.
Degenerate bands
We direct our attention to the valence band maximum of semiconductors
with diamond type structure (see section 2.7). Accordingly, we assume that
at k = 0 a degenerate band level E,(O) exists with either symmetry rhs
or r: depending on whether spin is omitted or not. First we consider the
case without spin. The three degenerate ,b'T
Rloch states of energy E w ( 0 )
arc- distinguished, as before, by an integer index m. The Bloch functions
at k = 0 arc therefore written as ) o m O ) . If we are interested in eigcnstates
$(x) of the Schrodinger equation of the perturbed crystal whose energy
eigenvalues are expected to be near the valence band edge, we can represent
$(x) as a superposition of the Bloch functions Ivrnk)' in first order k . p
perturbation theory. Thus, we set
(3.66)
As above, in the case of non-degenerate bands, it approximately follows that
(3.67)
263
+ Ci(X)h,,!
+$
(3.70)
Fm'(X) = E F m ( x ) .
@
~
($
~
D:fm,
l
(3.71)
valence band the Hamiltonian matrix is given by relat,ion (2.344). With t,his, the envelope function equation then reads, explicitly,
Here, derivatives of the type 62/6z2 are denoted by symbols like a;, and
those of type a 2 / d r d y by symbols like a&.
Consideration of spin leads to two changes. Firstly. one has to use spindependent Bloch functions (sxlvmak0)' and, therefore, also spin-dependent
envelope functions Frrw(x,
s ) , and secondly the spin-orbit interaction operator H,, is to be added to the Hamiltonian operator H . Thereby, Hk.p
U
becomes Hk.=, which has the consequence for D that the matrix elements of
p are replaced by those of ii from equation (2.353). If interference terms between the k ' p-interaction and the spin-orbit interaction H , are neglected,
the representation of the latter in the approximate Bloch basis leads to a
*
k-independent tensor Hs,.The envelope function equation then reads
The quantities
d,R , $,
Q = --(L
1
6
s* = i-(& N
v%
1
3
- -(2L
(3.75)
+Mp:,
- ia&,
(3.76)
(3.77)
- 2iNa,B,]
[3.78)
The energy origin in (3.74) was moved to the valence band maximum, which
was previously located at A/3. The envelope equation (3.74) makes it possible to calculate acceptor states and hole bands in superlattices of the diamond and zincblende type semiconductors. One can derive a similar equation
within the Kane model of the rs-valence-band-rs-conduction band complex
of zincblende type semiconductors.
3.4
265
AZ = Z I - XI?,
(3.79)
(3.80)
In general, V,'(x) does not yet represent the entire perturbation potential in
the one-particle Schrodinger equation (3.21) for the perturbed crystal. One
still must add the Hartree potential Vh(x) and the exchange potential Vfr(x)
caused by other electrons localized at the center. For impurity atoms with
only one valence electron more or less than the host atom, these potentials,
as a rule, have the effect that only one electron or hole can be bound at the
center. For this one electron or hole, the Hartree and the exchange parts of
the perturbation potentials vanish, i.e. V,'(X) is itself the entire perturbation
potential, and the one-particle Schrodinger equation has the form (3.22). For
impurity atoms with lAZl > 1, Vb(x) and Vfr(x)do not vanish. The effect
of a non-vanishing VA(x) will be discussed later.
To solve the one-electron Schrodinger equation (3.22) with the potential
V,'(x) of (3.80), we may use the effective mass theory derived in section
3.3. Here, effective mass equations must be written down for those critical
points of energy bands where changes of the spectrum of energy eigenvalues
dae to the perturbation potential are to be expected. In Chapter 1, it was
pointed out that such changes occur at the bottom of the conduction band
and the top of the valence band - for impurity atoms with A 2 > 0 discrete
levels appear in the energy gap just underneath the conduction band edge,
(3.81)
&(k)
Ti2
= -----k2,
2m;
(3.82)
with rn; and ml, as effective masses of electrons and holes, respectively. In
reality, for many semiconductors such as Si and Ge. the conduction band
edge does not lie at k = 0. The valence band edge zs found at that point for
many semiconductors including the ones mentioned; but there is degeneracy
between heavy and light hole bands at k = 0. Nevertheless, we will initially
proceed with the above simplifying assumptions. The relevance and necessary improvements of this idealized model are yet to be discussed separately.
For reasons which will become clear below, the model is often referred to as
the hydrogen model
3.4.1
Hydrogen model
For the hydrogen model we have the two effective mass equations
(3.83)
(3.84)
In the usual form of a Schrodinger equation they read
(3.86)
267
Apart from notation, signs, and constant factors these are the Schrodinger
equations for charged particleu in the potential of a point charge, just like
in the case of the hydrogen atom. From experience in quantum mechanics,
we know that h r an dttrtlclite potential, there is a continuum of positive
eiierpy eigmvalues and, in addition, there ale discrete energy levels which
occur for negative energies. Fur repulbive potentials there are only pusitixe
energy eigenvalues, but no bound statrs at negative energies. From this it
follows that for a positive point perturbation (upper sign in equations (3.851,
(3.86)), discrete e n e r a levels are to b e expected just below the conduction
band edge, and for negative point pertiirhations such levels are exppctpd
just above the valence band edge. Thew dibcrele levels lie in the emrgy gap
between the conduction and valence bands where, as we know, no energy
eigenvalues can appear in an ideal crystal. We consider first the cwe of the
conduction band.
Conduction band
Transferring results from the quantum mechanical treatment of the hydrogen atom to our present situation, we get the discrete energy levels & of
principal quantum niimher 71, n = I , 2 , . . ., as ( E , - E ~=) - ~ * / nor~
(3.87)
with
(3.88)
where Ef - (e'm//2ha) denotes the binding or Rydberg energy of the hydrogen atom. The m~av~fin~rtioii
FTdm(x) for principal quantum number n = 1,
angular mumedurn quantum number I 0 and magnetic quantum number
m - 0 reads
~
(3.89)
with
(3.90)
where a g = ( h 2 / e 2 m )denotes the Bohr radius of the hydrogen atom. The
orders of magnitude of the energy Eg and the length ag can be estimated
easily. With E g FY 13.6 eV,:a = 0.5 A , ( m z / m )= 0.2, c = 11.4 and
=
1, it follows lhat
(3.91)
11.4
(=)
AM
29
A.
(3.92)
Compared with the width of the energy gap which is of order of magnitude
1 el/, the discrete energy level El lies closely below the conduction band
edge (see Figure 3.4). Even closer are the states with n = 2 , 3 , . . .. The designation shallow levels is used in this situation. The wavefunction for energy
level El decays exponentially with distance from the core of the impurity
atom with the characteristic decay length a g . In contrast to band states,
which are evenly spread out over the entire crystal, we have, therefore, localized electron states (see Figure 3.4). The localization radius a g is clearly
larger than the distance between two crystal atoms, which is approximately
2.5 A. This means that the localization of an electron at an attracting
impurity atom is weak if considered on the interatomic length scale of the
crystal. Exactly this is to be expected considering the smoothness requirement made at the outset, which states that only Fourier components of small
wavevectors k should contribute to the wavefunction. We can also examine
the validity of this requirement quantitatively. Omitting a k-independent
factor, the Fourier transform of Fcloo(x)is given by the expression
(4.93)
At the edges of the first B Z one has k FY (../a) and a g k M w ( a g / a ) FY 10.
Between the center of the first B Z and its edge, Fcloo(k) therefore falls off
by about 4 powers of ten. The smoothness requirement is therefore fulfilled
very well. In Figure 3.4 the localization of the wavefunction in k-space is
also indicated.
Recalling the theory of the hydrogen atom, it is known that the effect of
an attractive Coulomb potential is not restricted to the formation of bound
states with energy levels in the previously forbidden negative energy region.
Changes also occur in the continuum of positive energy eigenvalues which are
allowed even without Coulomb potential. They are manifested in the fact
that the eigenstates, which were previously spherical waves, are scattered
by the Coulomb potential. Their amplitude at the positive center becomes
larger, and further away from the center it is smaller. This leads to a change
of the density of states in the continuous part of the energy spectrum (the
definition of the density of states is given in section 2.5). According to
Levinson's theorem, which we will discuss in the next section in somewhat
greater detail, this change takes place such that the total number of all states
in the presence of the perturbating potential, including the bound states at
negative energies, remains the same as the total number of states without
the perturbating potential. For each bound state of negative energy a state
of positive energy is therefore excluded from the energy spectrum.
3.4. S h d a w lev&
269
dZ.0
AZ a 0
r\
L1
O
States in coordina1.e spat:e and in kspace far mibslitutional impurity atoms with either posit.ive or negative differences
A 2 beOween the impuril,y core charge nunthers and the h o d core charge numhers
(upper part of figure). The occupation of the shallow levels by eI&rons is presented
in the lower part of the figure. The levels act as donors of elwtctruns for A 2 > 0 and
ss acceptors of electrons for A 2 < 0.
The abovr-Inmtionfd
Valence band
For a negative point perturbation there appear discrete energy lpvels in the
energy gap closely above the valence band d g r . The energies ol thew levels
are given by the exprcssiorrs
EB
En - z, '
(3.94)
(3.95)
The pcrtinent wavefunctions are localized at the perturbation center. The
wavefunction of the ground state, n = 1 and 1 = m = 0, has the form (3.89)
with
(3.96)
as the effective Bohr radius. In Figure 3.4 (lower part) the energy levels and
localization regions are shown schematically.
Although the population of energy levels by electrons will not be treated
systematically until the next chapter, we wish to deal now with a special
case, namely with the occupation of the shallow impurity levels discussed
above at absolute zero temperaturc, 7 = 0 K . This consideration will lead
us to a better understanding of the nature of these impurity states. First of
all, wc assume the case of a positive point perturbation, i.e., the core charge
number 21 of the impurity atom is taken to be larger than the core charge
number Z H of the host atom. For simplicity, we assume 21= ZH 1. Then
the impurity atom also has one valence electron more than the host atom.
To be specific, we can imagine it as a P atom in a Si-crystal. According to
Levinsons theorem, which mandates that the total number of states must
remain the same both with and without perturbation, one can proceed on the
assumption that nothing will change due to the impurity atom regarding the
number of states in the valence band. Therefore, all the valence electrons of
the host atoms and all those of the impurity atom, except for the additional
electron, can be placed in the valence band. The additional impurity electron
has to reside in the lowest unoccupied energy level. That is the shallow n =
1-level just below the conduction band edge which, according to the above
results, arises from the impurity atom. The resulting occupation is shown
in Figure 3.4. If the temperature is increased this electron can easily be
excited from the shallow level to the conduction band. There, it is no longer
localized but spread out uniformly over the whole crystal. representing a
freely mobile negative charge carrier. The shallow energy level therefore
functions as a donor of a free carrier. One calls it a donor level and the
impurity atom itself as a donor. One can also say that a donor atom (with
AZ = 1) has one surplus valence electron which is bound relatively weakly
and can be transferred easily by thermal excitation to the conduction band.
The impurity atom remains in the single positively charged state, i.e., it is
singly ionized. For A 2 2 2 one has doubly or multiply ionizable donors. We
will discuss this later more exactly.
Next, we consider an impurity atom with Z I = Z H - I, i.e. with one
positive elementary charge in the core and, with this, also one less valence
electron than in the host atom. Due to the substitution of a host atom by
such an impurity atom, the number of states in the valence band decreases
- according to Levinsons theorem by one if only the n = 1-level is counted.
With this, the number of valence band states is still large enough to host all
271
valence electrons, but, also, there remain no unoccupied states in the valence
band. The n = 1-level in the energy gap remains empty at T = 0 X. If the
temperature is increased, an electron from the valence band can easily be
excited into the impurity level. The level accepts an electron, it functions
as an acceptor. In the valence band itself, a hole remains which, as we
know from Chapter 1, behaves like a positive freely mobile charge carrier.
Consequently, we can also say that, under thermal excitation, the acceptor
transfers a hole to the valence band, whereas at T = 0 K the hole was
bound to the acceptor. This picture corresponds to the model of a negatively
charged point perturbation which binds a positive hole and transfers it to
the valence band under excitation. The analogy to the positively charged
donor center which binds an electron and donates it to the conduction band
is obvious. Similar interpretations are available for doubly and multiply
ionizable acceptors.
3.4.2
Donors
The conduction band of Si and other indirect semiconductor differs in three
ways from the simple isotropic hand model. First, the minimum lies outside of the center of the first BZ;second, for symmetry reasons, one minimum carries the implication of several equivalent minima or valleys; and
third, the band structure in the neighborhood of each individual minimum
is anisotropic. While the off-center location of band minima has no direct effect on the donor binding enerm, the many-valley feature and the anisotropy
of the band striiccture rln have such an effect. Due to the existence of several
valleys (dist.inguished by a valley index i ) , the wavefunction 01 the donor
state $(XI
has to be expanded with respect. to the appr0ximat.c Bloch functions (xlclri [k- kill1 for all mutu& degenerate minima ki, rather than
with respect to the approximate Rloch function (xlck, [k- kc])of one
minimum k, only Because of this, the one-component envelope function
rx
h2
(k - kilMClk - ki)
2
6ij6kkl
= EFi(k - ki),
(3.98)
= EFi(X).
(3.99)
If V(x) is the screened Coulomb potential of equation (3.80), as was assumed, the absolute values of the inter-valley Fourier components (kilV1kj)
are negligibly small in comparison with the intra-valley Fourier components,
because of the large wavevectors ki - k j occurring in the former combined
with the (l/lkl)-dependence of (klV) on k. However, as we know from section 3.2, the true perturbation potential V(x) differs from the pure Coulomb
potential, because the screening of a point charge by the semiconductor is
wavevector dependent, and on the other hand, because V(x) contains a
short-range part, which cannot be traced back to the potential of a point
charge in any way. Both modifications can cause the inter-valley matrix
elements (hlVJkj) of V(x) to take larger values. Nevertheless, they still
remain so small that they can be considered by means of perturbation theory. In zero-order approximation they may be omitted completely. Thereby,
one obtains a separate effective mass equation for each valley from (3.99).
In case of k, = ( O , O , k z ) , it reads, in coordinate space
273
Analogous equations hold for the other valleys. This means that, neglecting
inter-valley coupling, all valleys give rise to the same donor level.
Anisotropic effective masties
In contrast to the hydrogen model, the effective mass of the envelope q u a tion (3.100) depends on direction in k-space. In this circumstance, analytical
solutions arc not available, and one mist resort to approximations and numerical calculations. One possible approach is the application of a variational
procedure, in which the eigenfunctions are represented as linear combinations
of niembers of a set of auxiliary functions whose general shapes are adjusted
as well as possible to the real solution, but which still contain free parameters. Using this representation one calculates the energy expectation value
of the Ilamiltonian and varies the parameters until its absolute minimum is
reached. This minimiim value then yields an approximation for the ground
state energy, which is better as the linear combination of auxiliary functions
approximates the actual eigcnfunction. The success of this approach thus
depends in an essential way on the auxiliary functions used. They should,
in particular) correctly reflect the symmetry of the actual eigenfunctions, in
the case considered here the symmetry of the ground state wavefunction.
One can also employ the variational process for the calculation of excited
states. In this case the wavefunction of the first excited state must, in addition, be orthogonal to the ground state wavefunction, that of the second
excited state must be orthogonal to both previously calculated states, etc.
Numerical results obtained by means of this procedure for the energy levels
of the P-donor in Si are reproduced in Figure 3.5. For a simple estimate of
the anisotropy effect, one can replace the reciprocal effective mass rn: of the
hydrogen model by the reciprocal effective mass (l/m:ll + 2 / m ; ) / 3 obtained
by averaging the direction-dependent reciprocal effective masses of (3.100)
over all directions in k-space. With this, one obtains energy levels for the
simply ionizable donor in Si, which are shown in Figure 3.5a. The binding
energy E B is represented as 29 meV in this approximation.
Inter-valley coupling
->T 0'0
2! -0,Ol
CI,
w
I
w
-0,02
1985.)
- 0.03
-0,04
-0.05
performed explicitly.
_ In this
- case
~ one has 6 minima, which lie on the cubic
axes k,, k,, k , and k,, k,, k , close to the respective X-points. From symmetry considerations it follows that fl(k,lV'lk,)II;k(0)12 has the general form
O b
b O
b b
c b
b c
b b
b
b
c b
b c
0 b b
b O b
b b O
c b b
b
b
c
b
b
o
(3.101)
b = fi(k~lv'lky)l~z(0)12,
r
= ~t(kzIv'lkz)l.)IFz(0)12
(3.102)
have been-used.
_ _ Lines and columns in (3.101) are written in the sequence
k,, k,, kzrk,, k,, k,. The matrix (3.101) has the three eigenvalues
A&I = 4b + C,
AEz
= -2b
AEg - -c,
1 - fold
C,
fold
3 - fold
(3.103)
275
with the degrees of degeneracy indicated. Thus, inter-valley coupling partially removes the 6-fold degeneracy among the valleys.
The sphtting of the ground state donor lrvel in Si due to inter-valley
coupling can also be predicted j u s t oti the basis of group theory. One considers the representation of the symmetry group of the envelope e q u e h n
(3.99) for si, i.e.
of _
o h ) in the space of the 6 reciprocal vector compo__
nents k,, kg,k,, k,, k,,, k,. This representation is reducible. To demonstrate
this one may employ the transformation rules of the vector components
x , y, z , a,7j, Z under the action of the clcments of o h (summarized in Table
A.7 of Apprndix A), realizing that these component? transform in the same
way as thc reciprocal vector components considered here. Then one easily
finds that the irreducible parts of this representation arc rl ( A , ) . r12 ( E )
and Iz5(Tz). The notations A , E , Tz are commonly used in the theor,- of
l o c a l i d states, and WP will also employ thcm here. With this. it is clear
that the symmetry of the AEI-state is A l , that of the two AETstatcs is E ,
and that of the three AE3 states is 12.
The size of the splitting depends on the inter-valley nitrtrix elemen1 of
the perturbation yolmtial. If one considers only thc first of thc two abovementioned effects which result in matrix elernentv ol considerable size. i.e.
the wavevector dependence of screening, and assumes that screening has
become fully ineffective at large wavevectors k, - kj,therefore replaring the
screened Coulomb potential V(x) of equation (3.80) by the bare Coulomb
potential -(e2/IxI) in (ktlVlkj), then it follows that
(3.104)
With this, one obtains b as -2.36 mel and c as -1.2 met. This yields
AEl = -10.8 meV. A E z = 3.6 met and AE3 = 1.2 meV.
A 3-fold splitting of the P-donor ground state level in Si has, in fact,
been observed experimentally (Aggrawal and Ramdas, 1965). The simple
theoretical estimate presented above agrees remarkably well with the experimental splittings of 11.6 meV between the A l - and T2-levels, and of
1.3 meV between the E - and T2-levels (not resolved in Figure 3.5). The
results of a numerical treatment of inter-valley coupling shown in Figure 3.5
are even closer to the experimental values. However, the agreement is not
as good for the absolute position of the ground state level. The addition of
A E l , i.e. of 10.8 meV to the binding energy of 29 meV without inter-valley
interaction yields a corrected binding energy of 39.8 meL which is closer to
the experimental value of 45 meV, for Si:P but still clearly below.
Chemical shifts
P
Ge
As
Sb
Bi
Ga
In
13
14
10
13
B
A1
Ga
In
153
11
11
11
12
GaAs
ionizable donor and acceptor atoms in Si, Ge and GaAs. Not only are the
absolute values of these energies striking, but also the fact that they are
not equal for donor or acceptor atoms of different chemicals stands out.
Contrary to the prediction of the above theory, there are dependencies of
binding energies on the chemical nature of the impurity atoms. One refers
to these as chemical shifts.
The absence of chemical shifts in the calculated binding energies is a
consequence of the approximation of the perturbation potential V(x) as a
purely Coulombic potential in the hydrogen model. As we know from section
3.2, the correct perturbation potential of an impurity atom also contains a
short-range part, to which all central cell corrections contribute. Among
these, there are also contributions which depend on the chemical nature of
the donor. The latter are the main reason for the experimentally measured
chemical shifts of the donor binding energies. The effects of central ceE
corrections are particularly large in the case of the 1s-ground state, as one
can recognize from Figure 3.5 for the special case of the P-donor in Si. A
weaker influence, and, accordingly, better agreement between experiment
and the predictions of a theory omitting central cell effects occurs for the
excited states with n = 2 , 3 , . . .. This is to be expected since the excited state
wavefunctions have maxima that do not lie at the perturbation center x = 0
277
(which does happen in the case of the ground state) but further outside.
Electrons in the excited states are therefore less affected by changes of the
potential in the central cell than are electrons in the ground state.
Generally, the occurrence of pronounced chemical shifts means that the
perturbation center is no longer shallow and the effective mass theory is no
longer applicable for its theoretical treatment (see section 3.5 for further
discussion).
Acceptors
Corrections to the simple hydrogen model are also necessary for acceptors.
Of course, the maximum of the valence band lies in the center r of the first
B Z for all diamond and zincblende type semiconductors, as is assumed in
the hydrogen model. However. this maximum is degenerate: depending on
whether spin-orbit interaction is important or not one has, respectively. the
%fold degeneracies of the representations l'b5 (diamond) or
(zincblende)
in the space of scalar functions. or the 4-fold degeneracies of the representations I?$ (diamond) or Ts (zincblende) in the space of twc-component spinor
functions. In the vicinity of r, this degeneracy splits off and one has ihree
or two anisotropic valence bands. In section 2.7 these were approximated by
two isotropic parabolic bands. one for heavy holes and one for light holes.
If one also uses this approximation here, then there are two hydrogen-like
series of acceptor le\-els, one for each sort of holes. Calculating acceptor binding energies by means of expression (3.95) in the case of Si yields 50 met7
for heavy holes which are expected to form the ground state. In the case
of Ge. the result is 17 meV for heavy holes. One can hardy expect these
values to be in agreement with experiment. In fact, they differ appreciably from the measured ground state binding energies shown in Table 3.5.
Evidently. non-parabolicity and anisotropy of the band structure play an
essential role and must be taken into account in realistic calculations. This
may be done by means of the multfband effective mass theory developed in
the preceding section. Below. we will explain the application of this theory
to simply ionizable acceptors in Si. Owing t o the small spin-orbit splitting
energy A of 44 meV and the relatively large acceptor binding energies E B
(68 me\' for the isocoric impurity atom Al). spin-orbit interaction initially
will be neglected. Later this approximation needs to be corrected since E g
is not much larger than A. Without spin and spin-orbit interaction the
wavefunction ~ A ( x ) is a linear combination of the three Bloch functions
(xlumO),m = x. y, z of the representation Ti,, which are enveloped by the
*
components F,(x) of the envelope function vector F (x). Thus
(3.105)
The FA^ (x). FA^ (x),. . . belong to a space which differs from the ordinary
Hilbert syaccr in that it is riot spanned hy oue-comyonent basis funrlious, but
by three-component basis functions. A representation in this space is given
by the product of two represt?nt,atinns, one of which corresponds to the 3D
rqresentation
according to which the components of each of the three= $ *
component, fiinctions FA^, FA^ transform separately, and the other is the
representation according to which the threecomponent functions transform
among each other. One can easily show that the latter representation must
+,.q2(x), . . ., forincd according to relation (3.105),b e
be r A , so that $,.ql(x),
long to the representation r A , as has been supposed. For the representation
*
*
D A of FA^ (x):FA^ (x),. . . it follows that
(3.107)
279
(3.108)
shown in Table A.27 of Appendix -4. Relation (3.108) determines the symmetries of the three-component auxiliary functions under rotations, therefore
the angular dependencies of these functions. In order to also get their raand ri5-functions with
diaI dependencies one expands the rl-, r12-,
respect to angular momentum eigenfunctions with the various quantum numbers 6, rn and multiplies the expansion coefficients of different values of I by
corresponding radial wavefunctions r1exp(-T/q). These are formed in analogy to the eigenfunctions for the Coulomb potential, but with &dependent
localization radii rl. The latter are treated as variational parameters, just
like the Coefficients of the auxiliary functions belonging to different irreducible representations. Applying this procedure to the acceptor ground
state of Si, a binding energy of 31 me&' is obtained (Schechter, 1962). More
recent calculations, taking spin-orbit interaction into account, have resulted
in a value of 44 mel/ (Baldereschi and Lipari. 1977), which is very close to
the experimental value of 45 meV for 3 in Table 3.5 but, of course, does not
account for the pronounced chemical shifts seen in this table.
Calculations of acceptor binding energies have also been performed for
materials like Ge whose spin-orbit splitting energies are large compared to
the acceptor binding energies. Spin has to be taken into account under
such circumstances, i.e. the T'& valence band has to be replaced by the two
rg- and r$-bands. Owing to the large spin-orbit-splitting energy, however,
the spin-orbit-split I'F-band can be omitted. For the expansion of the 4component envelope function of the remaining rg-band one needs auxiliary
functions of
2r15 2r25 symmetry (see Table
x I'i = ri fk
+ +
is taken into account, then the 1s-level is shifted up by just this energy
Us.By exciting one electron into the conduction band, the neutral S-donor
28 1
S(D+) bccarnes a single positively charged S(1+) donor. For the Is-lad
01 h e S ( l 1 ) ioii thr shift in energy by Us do= not occur. Thus this level
i s shifted down by I/, in comparison with the 1s-level of tbc S(0 +-) atom.
Exprrirneniully, one finds t,hat the ionk-ation energy of the neutral S(O+)
donor is 0.31 eV, and the ionization energy of the S(1t) donor j s 0.69 eV
(see Figure 3.6). From thefie values a Hubbard energy I i , of 0.28 eV may
be deduced. IIowever, both ionization energies are suhstantially larger than
the result. 4 x 0.03 el7 = 0.12 eb which follows from the hydrogen model.
Evident.ly, the central cell short-range poteniid cont,ribution is ewedial in
the case of the S-donor in Si. l l i s donor is a deep center rather than a
shallow me.
3.5
Deep levels
283
-2
-4
-6
-2
-4
-G
.3
Acceptors
- - - _ aF:
.4
.-
.5
.6 (eV)
-8
-1OleV)
Atomic Electronegativity
Figure 3.6: Ionization energy of substitutional impurity atoms in varioum tedrahedrally coordinate host semironductors as B function of the strength of the pcrturbation potential, measured in terms of valence shell s-level differences between
impurity and host atoms in the case of donors, and valence shell p-level differences
in the caae of acceptors. Donor energies are reIative to the conduction band edge,
and acceptor energies to the valence band edge. (After Vogl, 1981. Reproduced
from Boer, 1990.
- in the discussion above, the periodic potential of the crystal was omitted
285
for selected deep centers, among them the vacancy in Si. the substitutional
impurity at,oms of the main groups of the periodic table. the group of 3dtransition metals. and the group of rare earths. Also, we discuss the D X center and the EL2-center in GaAs.
3.5.2
The 'Tight 3inding' (TB) method developed in section 2.6 represents one
of the various procedures for calculating band structures of ideal crystals.
Unlike other methods it uses basis functions to represent the Hamiltonian
which are localized on the atomic length scale. Since the perturbation potentials of deep centers are localized on the same scale, the TB method should
be particulady well suited for such centers. Of course, one must chose the
Hamiltonian matrix elements empirically in order t o arrive at useful practical results, and the results also cannot be expected to be very accurate in
a quantitative sense. However, the method should be suited to the derivation of simple models that exhibit the essential physical features of real deep
centers. The simplest among these models is the so-called defect-molecule
model, which we will introduce below. In doing so, many particle effects
and lattice relaxation will be ignored. We will mainly use the model to
demonstrate the existence of deep levels and to explore the symmetry of the
pertinent eigenfunctions.
At the outset we have to clarify which of the various tight binding basis
sets should be used for the representation of the Hamiltonian in the case of
a deep center. En the ideal crystal case considered in section 2.6,the atomic
orbitals
or hybrid orbitals IhtRj) were not used directly, but rather.
we employed wavevector-dependent Bloch sums lukj) or ihtkj) formed from
them. This was advantageous because the translation symmetry of the crystal could be exploited in this way. The latter symmetry no longer exists in
a crystal with a deep center. so that localized basis functions, i.e. atomic
or hybrid orbitals. can also be used without any loss. We select hybrid
orbitals because these produce drastic simplifications of the Hamiltonian
matrix which, like the Bloch sums of hybrid orbitals in the case of an ideal
crystal, allow the eigenvahes of the Hamiltonian to be calculated in closed
analytical form.
We introduce the defect molecule model using the vacancy as an example.
Later, we will apply this model to substitutional impurities from the main
groups of the periodic table. As the host crystal we take, in all cases, an
elemental semiconductor of group IV,like Si.
Vacancy
Figure 3.7 shows part of a Si crystal containing a vacancy. Symmetry consid-
erations make clear that the origin of the vacancy is not important, whether
it originated by removal of a Si atom from sublattice 1 or from sublattice
2. Here, we consider the removal of an atom from sublattice 1, and to be
still more specific, from the primitive unit cell at R = 0. The perturbation potential V(x) is the negative of the potential produced by this atom
in the crystal. Because of the removal of the 1-atom, the hybrid orbitals
lht2Rt),t = 1 , 2 , 3 , 4 , of the four surrounding 2-atoms in the unit cells Rt,
pointing inwards, no longer have a hybrid orbital of a 1-atom to which they
can bind. They are called danglzng hybmds. The three other hybrid orbitals
at a surrounding 2-atom interact with inwardly directed hybrid orbitals at
atoms lying still further away (these atoms are not shown in Figure 3.7).
The hybrid orbitals at a pal ticular siirrounding 2-atom also interact among
themselves, including the one dangling hybrid at this atom. This means that
the dangling hybrids are coupled to the entire crystal through nearest neighbor interactions. If interactions between hybrid orbitals at the same atom
are omitted, i.e. if the matrix element V1 introduced in section 2.6 is set to
zero, which means - cp, then the dangling hybrids are decouplcd from the
remainder of the crystal. They still interact only among themselves. Since
the atoms at which these hybrids are located are second-nearest neighbors,
this interaction is not within the framework of Erst-nearest neighbor interaction which we have been exclusively considering in the treatment of an
ideal crystal in section 2.6, but, rather, it ha6 the sense of second-nearest
neighbor interaction. The latter must be taken into account here in order to
arrive at non-trivial results.
Based on the approximations made above, the crystal with a vacancy
decomposes into two partial systems which do not interact with each other,
first, the partial system of the four interacting dangling hybrid orbitals,
one at each of the four atoms surrounding the vacancy, and second, the
yarlial system of all remaining hybrid orbitals of the crystal with the vacancy,
i.e. the hybrid orbitals of all atoms which are not directly adjacent to the
vacancy, and the three hybrids at each one of the four adjacent atoms which
~
&renot alrpady included in the first ptrrtial system. With some ambiguity
287
and to the
the rest-of-crystal. This designation also encompasses h e term
defect molecule model for the tight binding approximation described above.
For each hybrid of the rest-of-crystal another hybrid exists in the restof-crystal which points to it. The Hamiltonian matrix elements between the
various pairs have the same value, namely the value given above in equation
(2.292) defining matrix element T/2. The energy spertriim of the rest-ofcrystal is therefore identical with t , h d of the infinite idea1 cryslal in bhe
simplest TB approximation? consisting of the bonding level t b = th - IVzl,
and the anti-bonding level E, = E,+ I&I. The splitting of these highly
degenerate levels into bands remains incomplete because of the neglect of
Ihe interaction between hybrids at. the same &om? i.e. the neglect. of r/l.
In calculating the energy spectrum of the first partial system, i.c., the
vacancy-molecule, we need the matrix elements of the pertarbed Hamiltanian H + V of the crystal with vacancy. The diagonal elements (ht2RtlH +
Vlht2Rt) are simply the hybrid energies ~h since the elements (h,t2Rt(V
lht2Rt) of the vacancy potential V between hybrid orbitals localized off the
%mancysit.e are small. One therefore has
second
tts
In order to obtain the non-diitlgounel (second-nettrest neighbor) matrix elements ( h t 2 R t l H f Vlht32FQ) between different dangling hybrid orbitals
t , t f t , one has to recaIl the symmetry of the perturbed Hamiltonian H +V.
Since, by creating a vacancy in sublattice 1) the two sublattices are no longer
eqiiivalent, the symmetry of the crystal is no longer given by thr full cubic
point group Oh, but rather by the tetrahedral group y d . Nevertheless, this
means that all non-diagonal elements are equal for symmetry reasons, such
that
wit,b W as second-nearest neighbor interaction energy. Because of the predominantly negative values of the operator fI V acting on hybrid orbitals
and the predominantly positive values of the hybrid orbital products. W
is expect.ed to be negative. Ihe absolute value of W must be determined
empirically.
The energy levels of the vacancy-molecule are obtained by diagonaliziug
the matrix
IEY)=
- IhaZRz)],
Jz [IIZL~RI)
(3.116)
) - -[ I ~ I ~ R
- IJh2R3)]
)
(3.117)
1
138"")= [lh12R1)- lh42R9)].
45
(3.118)
IE,
VaC
v5
One can easily demonstrate that IEI") belongs to the irreducible representation A1 of the symmetry group Td of the vacancy, and the three functions
IE$uc),
IEY'), IEiWc)to the irreducible representation T2 of this group (see
Table A.6 of Bppendix 4 ) . The eigenfunction of the A1-level resembles an
atomic s-orbital of a Si atom. and the three Tz-eigenfunctions are similar to
the three p-orbitals of such an atom. Evidently, the sp3-hybridization of the
atomic orbitals in a Si crystal is removed at a vacancy. The states are more
atom-like. in consequence of the fact that the crystal potential is no longer
fully effective in the vicinity of a vacancy.
In Figure 3.8. the e n e r e spectrum of the defect molecule model of a
vacancy is shown along with that of the rest-of-crystal. The A1-level lies
below the Tz-level because of the negative value of Vt-. Whether it lies
below or above the bonding level of the crystal depends on whether we
have 31W1 < l\Jzl or 31WI > lF'21. The question of whether it is found
in the valence band or in the energy gap, cannot be answered within the
defect molecule model because. therein, the valence band has shrunk to the
bonding energy level. It is just as difficult to decide whether the Tz-level lia
in the conduction bend or in the energy gap. Experiment and more exact
calculations, which we will discuss later in more detail. show that the -41level lies in the valence band. and the 7'2-level in the energy gap. With this,
289
Figure 3.8: Energy spectrum of the defect molecule model of ~ 1 .Si-vacancy along
with that. of the rest-of-crystal. The distribution of thc 4 electtona of the defect
molecule over the energy levels is also shown.
the T2-level is the actual deep level of the vacancy. Of the four electrons
of lhe neutral vacancy - each of the four dangling bonds yields one two
must he hosted by this lcvcl while the other two occupy the A1-level in the
~
valence band. IJsing the terminology introduced above, we may say that the
oxidation state of the neutral vacancy is V2+.
The defect moleciile model of the vacancy reflects the actual relationships remarkably well. In any case it, provides a qualitatively correct physical picture of the electronic structure of a vacancy in group-IV elemental
semicondiictors. Refinements of this picture will be discussed further b e
low. Here we treat a second example to illustrate the defect molecule model
which emerges from the v.mancy by occupying its empty lattice site with an
impurity atom.
Substitutional impurity a t o m s w i t h sp3-bonding
We consider a substitutional impurity alom in an elemental semiconductor
of ~ o u IV,
p which we can again imagine as a Si rrystal (see Figure 3.9).
Let the siibstituted Si atom be that of sublattice 1 in the primitive unit cell
R = 0. Like Si, the impurity atom should belong to one of the main groups
11,111, IV, V, or VI of the periodic table so that the valence shell is formcd by
s- and p-orbitals, which all lie energetically higher than any other occupied
orbitals (in contrast to rare earth atoms). The perturbation potentials of
these impurity atoms in an elemental semiconductor of group I V possess, as
we know, both a short-range part and also a long-range Coulomb part. The
+ 17lht~jR~~)~
(3.119)
In this, E ; and c i signify, respectively, the hybrid energies of the impurity and host atoms, V2 corresponds to the matrix element of H of equation
(2.292) between hybrid orbitals at nearest-neighbor atoms of the ideal crystal
291
pointing toward each other, and W describes, as in the vacancy case, the second nearest neighbor interaction between differing host atom hybrid orbitals
pointing toward the impurity atom. All elements ( h t j R t l H VlhtJRt,)not
listed above are neglected, among them also the Vl-like elements (htzRt(H
Vlht,i&,) between different hybrid orbitals at the impurity atom, i.e. with
t f t. With these approximations, the Hamiltonian matrix of the defect
molecule may now be written down in explicit form. In doing so the rows
and columns are ordered in the sequence IhliRl), lh2iR2), lh3iR3), lh4iR4),
lh12R1)) lh22R2), lh32R3), I@&),
and the Hamiltonian matrix is
L i 0
1720
e,O
vzo
6 i 0
v20
o a o
v2a o o
; W W W
W E k W W
v20
0 I$
W W E k W
0
(0
v2o
0
0
a o v2
w w w
(3.124)
e;.
This matrix has four distinct eigenvalues, two simple and two triply degenerate. The eigenfunctions of the two simple levels (we distinguish them by
indices b and a ) belong to the 1-dimensional irreducible representation A1 of
the tetrahedral group T d , and the two triply degenerate (again distinguished
by indices b and a ) belong to the 3-dimensional irreducible representation T2
of T d . The corresponding energy levels are, respectively, denoted as
EiT/b
In Figure 3.10 these levels are plotted as functions of the difference ( c i - E);
between the hybrid energies of impurity and host atoms. This difference represents a measure of the strength of the short-range perturbation potential
of the impurity atom. In order to better understand the physical meaning of
the energy levels derived above, it is helpful to consider two limiting cases.
First, we assume the impurity atom to coincide with the host atom, which
means t i = c.; If one also neglects the second nearest-neighbor interaction
energy W , then the energy spectrum of the perturbed crystal of equations
(3.125) and (3.126) must coincide with that of the ideal host crystal in the
and an
simplest TB approximation, i.e. a bonding energy level q = ~ h IVzI,
anti-bonding energy level E, = h + IV21 must emerge. This is in fact the case.
of the perturbed crystal
Thus it is also clear that the two energy levels
correspond to bonding and anti-bonding states of A1-symmetry, and the two
energy levels E;Y$. to bonding and anti-bonding states of Tysymmetry. The
two undisturbed levels h f IV21 in Figure 3.10 encompass the energy gap of
the host crystal.
Second, we consider the limiting case of ( E ; - tk) tending toward $00
or -00 which, for a given host crystal, also means t i -+ +m or E ; -+ -00,
respectively. In the limit
EiT,,
E i + $00
(3.127)
(3.128)
and in the limit
Ei
--oo
we have
EiT =
tk + 3W,
(3.129)
293
(3.130)
+ --O0.
EiT
EiY
EkYl
EiY
3.5.3
The solution of the Schrodinger equation for a crystal with a point perturbation begins with the determination of the effective one-electron potential
VPrt(x).In section 3.2 this task already was addressed in general terms.
Two remarkable differences were found in relation to an ideal crystal: First,
depends on the electron poputhe effective one-electron potential VPert(x)
lation of (localized) one-particle states, and second the atomic structure of
the perturbed crystal, which participates in the determination of the potential Vwrt(x),is in many cases initially unknown and it must be selfconsistently calculated jointly with the electronic structure. Below, we will
eliminate such additional difficulties of electronic structure calculations for
point perturbations by assuming that the population of the center and the
atomic structure of the perturbed crystal are known. Thus the potential
Vwrt(x)
of the center can also be considered to be known. In regard to the
Supercell method
At the boundary of a properly shaped ciuster, one can add an additional
cluster of the same kind. If one does this repeatedly and continues the
whole proress ad infiniturn, one finally arrives at an infinite crystal. The
primitive unit cell of this crystal is now, however, no longer that of the
crystal hosting the point perturbation. but it is the cluster, which in this
context is called a supeveell Thp crystal is referred to as a supercrystal. The
295
(3.131)
Since we are interested in bound states. the eigenvalues E which WP seek from
this equation lie outside of the bands, to be mare precise. in the iundamenlal
energy gap between the highest valence band and the lowest conduction
band. As tl preliminary we note that equation (3.131) can be formally solved
for @ by multiplying both sides by the inverse [E - H1-l of the operator
[ E - HI. The inverse operator [ F - H1-l stands in close relation t o the
Green's function of the unperturbed crystal. The retarded Green's finctaon
O " ( E ) is defined by the equation
GO@)
1
Eti6-H'
(3.132)
0.
(3.133)
Non-trivial solutions me possible only for energies E for which the determinant of the matrix of the operator [ C 0 ( E ) V '- 11, in an appropriate orthonormalized basis set, vanishps, i.r. if
Uet[G"(E)V' 11 = 0
(3.134)
holds. The energy eigenvalues E satisfying this equation are deep levels.
These levels ran thus be determined by calculating the Green's function
G o ( E )of the unperturbed crystal and solving equation (3.134). This p r o w
dure is rcfrrred to as Koster Slater method, and equation (3.131) as KosterSlater equation
The Green's operator @ ( E ) can be cdriilattul if the band structure
E,(k) and the Bloch functions of the ideal crystal are known. The matrix
represenlation of d ' ( E ) in the Bloch basis reads
(3.135)
With this we only need to know the matrix elements (uk(V'1u'k') of the
perturbation potential V' in Bloch representation in order to determine the
deep levels using equation (3.134). However, i t would be more expcdicnt
for the matrix representation of the perturbation potential to use localized
wavefunctions as) for example, atomic orbitals. If one does so, then the
matrix representation of the (heen's opertrtoi is not as simple as in the Bloch
basis. A particular compromise in choosing a basis set for the representation
ol the eigenvalue equation (3.134) is the use of so-called Wunnzer finctzons.
Koster-Slater method in Wannier representation
Wannirr functions are linear combinations of all Bloch functions for a given
band index v of the form
297
(3.136)
Here the summation is over the whole first BZ,G is the number of primitive
unit, rdls in a ppriodicity rcgioa. and t k are certain phase a~glru.If the latter
are chosen properly, the Wunnier h c t i o n s IvRJturn out to be well localized
in the unit cell at the lattice point FL If the eigenvalue equation (3.134) i s
written in terms of this basis, it reads
(3.140)
For the particular lattice point R = 0 it reads
{ 3.141)
Equation 13.141) forms a closed set of equations for the central cell components ( ~ O ~ I Jof) $ only. If the latter are known, all other components (vRI$)
follow at once from equation (3.140). For a non-trivial solution of the homogeneous system (3.141) to exist, its determinant has to vanish separately,
i.e.
1
(vOlG*(E)IvO)
=3
G;(E)
1
E+ir-EE,(k)'
Ic
(3.143)
The dominant bands in equation (3.142) are those which form the gap, i.e.
the uppermost valence band u and the lowest conduction band c. If we
consider only them and neglect all others equation (3.142) becomes
or more explicitly
To solve this equation, the host Green's functions G:(E) and G : ( E ) , as well
the perturbation potential matrix elements, have t o be known. Below we
calculate the host Green's functions for isotropic parabolic bands of finite
widths. In expression (3.143) for G : ( E ) we rqlace the k-sum by an integraL
This yields
G 0J E )
'
-G 3 8x3 , 1
d%i
s
1
~E
= c, v.
(3.146)
Jrm
(3.147)
whrre p v ( E ) is the density of states per unit energy and spin state
p,(E)
d3kd(E - &(k])
h3 1.232
, v
= c,w,
(3.148)
299
which differs from the DOS in equation (2.213) by a factor (0/2). In evaluating p,(E), we approximate the true valence and conduction bands by
isotropic parabolic ones, however, taking their band widths A E , to be the
same as those of the original bands. This corresponds to the use of an effective mass for each band averaged over the whole first B Z . The bandwidths
are introduced by putting the total number of states of the approximate
isotropic parabolic bands equal to G3. Then it follows that
(3.149)
3
pc(E) = ~
G3
(3.150)
where B(E) is the unit step function. We substitute these expressions into
the Greens function G ; ( E ) of equation (3.147). The E-integral is readily
done. The imaginary part ofG;(E) equals (-TI times pv(E). The real parts
ReG:(E) and ReG:(E) are given by
-I.;TIF]
(3.152)
Although complex numbers appear in these expressions, they Eare in fact real.
This would hP obvious if we rcplaced the In-function by an arctan-function.
We avoid this because it is more convenient to handle the many-valued character of the In-function rather than that of the arctarz-function. Altogether,
the host Greens functions of OUT model depend on three parameters. the
energy gap EB, and the two band widths A& and AE,. The latter are
measure3 of the awrage kinetic energies of electrons in the vakncce or conduction bands large bandwidths mean large average kinetic energies (or
small eEective masses).
Later, the G r m s function method in Wannicr representation will be
used to address the question of whether or not main group impurity atoms
in tetrahedral semiconductors form deep levels in the gay.
~
Scattering states
The perturbation potential also gives rise to changes within the energy bands
of the i d e a l rrystal. Of course, the w e r g i a of t h e w bands are still allowed
quantum mechanically in the presence of the perturbation, so that in this
rwpeet there is 110 change. However, change within the energy hands of the
ideal rrysttll IS indured by the perturbation in the form of a modified density
nf allowed energy levels pcr unit mnergy, i.e. in the form of a modifid density
ol statvs. To calculat? this change it is expedient to introduce the Green's
fiirirtion of the perturbed crystal,
1
G ( E ) - I; 4- i6
- [H+ V ' ].
(3.153)
(3.154)
which follows at once from the definition (3.1533 of G'(E). In quantum field
tbwry. this relation is known as the D p o n equatzon Using thk equation
and performing some simple calculations, the DOS expression (2.209) may
be brought into the form
1
d
dE
- I m -In D e t [ G ( E ) ] .
p(E)
(3.155)
M E ) = P(E)- P O ( W
(3.156)
(3 157)
G'(E)V'I.
dE
This relation can be used to ralculatc the total change of the DOS in the
entire enerRy range between -oo and t o o . Integrating A p ( E ) over this
interval, and considering the fact that G ' ( f i ) vanishes for li: -+ f m , yields
lm
M
~ E A ~ (=.E0.)
(3.158)
30 1
This relation signiEes that under the action of the perturbation potential,
the total number of states remains unchanged. This result already was used
in section 3.4 in lhe context of shallow levels, and there it was referred to
as Levznsons theorem. Here, this theorem means that for each new state of
the crystal created by the point perturbation, a state which existed without
perturbation must cease to exist. The deep states in the energy gap occur
therefore at the expense of band states. For each state occurring in the gap,
R stale is lost fiom a band,
dEAp(h:)- -
Lap.
dBAp(E).
(3.159)
Correlation effects
Correlation effects, as discussed in section 2.1, are important for the N electron system of a crystal with oneparticle states localized at a deep
centel. One of thcse e f k t s is based on tlie configuration dependence of
the Hartlee and exchange potentials. Another results from the fact that
Slater determinants, even thosc calrulatcd by means of configuration dependent Hartrw and exchange potentials, are not exact eigenstates of the
N-electron Hamiltonian. The exact eigenstates are linear coinbinations of
different Sl&r determinants, an rffwt which is referred to as cor~fignralzon
znteraetzon (see section 2.1). The two correlation effects, configuration dependenre and configuration interact ion, will be discussed helow for deep
centers. In regard to configuration dependence, we continue the general
discussion of section 3.2 here.
Configuration dependence
For an ideal crystal, the energy eigenvalues E , of the onepartick Schrijdinger equation have direct physical meaning. Apart from their sign, they are
the ionization energies I,, of the corresponding eigenststes Y , i.e. E , -I,.
In this regard, according to section 2.2, the ionization energy Iv is definned
as difference
(3.160)
of the total energy R t o ~ ( { v ) of) the N-electron system in the ionized stair
{v}, and the total energy E t o d ( { v } ]in theground state {v}. Theionized
state (v) differs from the ground state (v) in that a particle, which in the
ground state of the system occupies a oneparticle state of energy E,, is
transferred to a oneparticle state of energy 0 corresponding to the vacuum
level. Like in section 2.1 one says that a31 electron is removed from the
system. This expression has to he used with care, however, for if taken
literally i t misrepresents the charge neutrality of the system. The total
energy of the ground state follows, according to formula (2.541,by summing
all occupied oneparticle energies, followed by subtraction of the electrostatic
interaction energy of the electrons because the latter is doublecounted in
forming the sum of one-particle energies.
The equation E , = - I , is the content of Koopmans theorem, which
was explained in section 2.1. The essential requirement for the validity of
this theorem is the approximate population independence of the Hartree and
exchange potentials or, more generally, of the effective one-particle potential.
This requirement is not satisfied for a crystal with a point perturbation.
The potentials depend on the number n of electrons occupying oneparticle
states localized at the center, and so do the energy eigenvalues E , of the
one-particle Schrodinger equation of the crystal with a point perturbation.
We will denote these oneparticle energies by E:) henceforth to emphasize
this dependence explicitly. Of course, the definition (3.160) of the ionization
energy is also valid in this case, but it is no longer true that I,, represents the
negative oneparticle energy -E$.(
That this cannot hold is immediately
clear if one recognizes that the eigenvalues IT?) and E P - l ) of the oneparticle Schrodinger equation with. respectively, n and n - 1 electrons at
the center differ from each other - the level EP- is deeper than the level
E p ) because the removal of an electron results in the positive core being
less strongly screened, so that the remaining electrons are more strongly
attracted. In this situation, we say that the electrons at the center relax on
the removal of an electron from the center. If one further considers that the
total energies of both systems enter in the definition (3.160) of I,, that of the
relaxed system with (n- 1)electrons at the center, and that of the unrelaxed
303
system with n electrons at the center, there is no way to explain why the
)
be equal to the negative
energy difference E b t d ( { v } v )- E ~ M ( { V } 'should
eigenvalue - E Z( ) of the unrelaxed system and not equal to the negative
eigenvalue -EP-') of the relaxed system. In reality, the ionization energy
E b t a l ( { v } v ) - E ~ M ( { v } ' )is not equal to either one, but lies somewhere in
between the two. It can be shown that it is approximately given by the
negative of the one-particle energy eigenvalue of the N-electron system with
the fictitious number ( n - 1/2) of electrons at the center. This is plausible
because the electron, during its removal, feels, so to speak, the potential with
n localized electrons half of the time, and feels the potential with ( n - 1)
localized electrons there also half of the time.
To carry out an exact calculation of the ionization energy of a center
which, in its ground state, has n localized electrons, the one-particle energy
eigenvalues E P ) are not sufficient. Applying equation (2.54),these oneparticle energies provide the total energy E b ~ ( { v } ' of
) the ground state,
but they cannot be used to calculate the total energy Etotal({v}v) of the
ionized state with n - 1 localized electrons at the center. To obtain the
latter, one also needs the one-particle energies EP-') of the N-electron
system with n - 1 localized electrons. In density functional theory, the two
total energies follow more directly: one determines the eigenfunctions of the
Kohn-Sham equation for the centers with n and n - 1 electrons, then forms
the corresponding ground state densities, and evaluates the total energy
functional (2.64) using these densities.
Ionization, i.e. exciting an electron to the vacuum level, is only one of the
various possible one-particle excitation processes of the N-electron system
of a crystal having a perturbation center. Generally, one may examine an
' ~ an electron in a formerly unoccupied one-particle
excited state { v } ~with
state Y' of energy below the vacuum level, and a hole in a formerly occupied
one-particle state v. The corresponding excitation energy Iuiv is given by
the total energy difference between the excited state { Y } ~ and
' ~ the ground
state {v}',
(3.16 1)
As in the case of ionization, the excitation energy Iulv is not equal to the
one-particle energy difference E,i - E,. Here, we are interested in excitation
processes involving changes of the populations of one-particle states localized at the deep center. There are different types of such processes. Firstly,
in the final state, all electrons may still be localized at the center, but with
one electron having changed its localized one-particle state. Such excitations
are called internal transitions of the center. Secondly, an electron originally
localized at the center may undergo a transition to the bottom of the conduction band. This is referred to as a donor transition Thirdly, an electron
~2~
305
edge instead of the valence band edge (as is done in equation (3.162)), thus
setting A(&/(& - 1))= Eg - A(Q/(& - l)),then it follows from (3.162)
that
(c) if both levels X((Q - l ) / Q ) and X(Q/(Q 1)) are found in the gap, the
center is both a donor and an acceptor. One then calls it amphoterzc;
if neither of the two levels X((Q - 1)/Q) and X(Q/(Q
the center is neither a donor nor an acceptor.
Configuration interaction
If there is degeneracy among the various oneparticle states of an N-electron
system, then there is also degeneracy between the Slater determinants formed
307
+ +
If more than two electrons are localized at the center, and if oneparticle
states of different irreducible representations are involved as, for example, in
the ground state configuration {aSt;} of the neutralvacancy with 4 electrons,
then the corresponding many-electron levels can be obtained in the same
way as above. The only difference is that the symmetric and antisymmetric
products to be decomposed into irreducible parts have more than two factors
and the factors are not necessarily the same.
The splitting of the many-particle levels of deep centers of crystals has
its counterpart in the fine structure of the many-particle energy levels of free
atoms with more than one electron. For such atoms, many-particle states,
formed from one-particle states of given total spin S and orbital angular momentum L , having different total angular momenta J , give rise to slightly
different energy levels. (Recall that the irreducible representations of the full
rotation group, into which the products of the irreducible one-particle representations decompose, are distinguished by J . ) This fine structure splitting
has its largest effect for electron shells which are strongly localized, i.e. for
d- and f-shells (as opposed to s- and p-shells). One may expect, therefore,
that impurity atoms with unoccupied d- and f-shells, i.e. atoms of transition groups of the periodic table, should result in deep levels exhibiting
pronounced fine structure splittings. This is in fact the case, as we will see
below.
Below we discuss the structure of several deep centers which are important
either from the scientific or technological point of view. Knowledge about
these centers is, in every, case the product of combined experimental and
theoretical investigations. Since we have thus far treated only theoretical
methods, we first present a short overview of the experimental methods.
Experimental methods
One can divide the experimental methods for investigation of deep centers
into two groups, on the one hand, methods which measure ground state
properties of the centers, and on the other hand, methods which give experimental data on center properties in thermally or optically excited states.
Among the methods of the first kind are measurements of magnetic
properties, like Electron-Paramagnetic Resonance (EPR);Electron-NuclearDouble Resonance (ENDOR) and magnetic susceptibility. These methods
provide data concerning the total spin S of the centers and, if anisotropy
effects are measured, also spatial symmetries. The chemical identity of the
centers can be determined (in addition to other methods) by means of mass
spectroscopy or of Rutherford Backscattering (RBS). Measurements of the
Extended-State X-ray Absorption Fine Structure (EXSAFS) provide data
about the geometrical ordering of the atoms in the vicinity of a point perturbation.
To investigate the excitation properties of deep centers, optical and electrical methods are available. Ionization energies can be determined by means
of optical absorption spectroscopy and photoconductivity measurements,
mainly in the infrared spectral region. The cross-sections of deep centers
for emission of free charge carriers can be determined by means of time
resolved current or capacitance measurements at pn-junctions or at other
depletion layers. By suddenly applying a reverse bias at such a junction, the
deep levels are lifted relative to the Fermi level The new equilibrium state
of the junction corresponds to fewer electrons in the deep levels than previously. This state does not occur suddenly, however. it adjusts exponentially
through emission of electrons from the deep levels into the conduction band
(we assume deep donor levels here, the case of deep acceptor levels may be
treated analogously). In the conduction band, the electrons are freely mobile
carriers and are immediately sucked up by the positive electrode at the nregion. This results in an exponentially decaying current, which for its part
leads to an increasing positive charging and. thus. an increasing capacitance
of the junction. The decay time of the current and the rise time of the associated capacitance change are determined by the emission probability from
309
the dttep centers. Measuring the capacitance rise time yields experimental
values for the emission probability. Of particular importatice is the so-called
Deep Level Trawaerat Spectroscopy (DLTS), wherein a reverse biased pnjunction is exposed to periodically repeted voltage pulses of forward {deep
level filling) polarity. The recovery time for the capacitance change after a
filling pulse has been switched off, is measured as a fimrtion of temperature.
This fiinction exhibits maxima which, under certain conditions, can be used
to determine the ionization energies of the deep centers. Ionization energy
values from LILTS measurements and other thermal quililrrium technique
are, as a rule, smaller than optically measured values: the lattice has time to
rrlax if ionization proceeds thermally, and this lowers thc enerm of the finel
stat?. Optical ionization occurs instantaneously, hence lattice relaxation is
is not possible.
Perhaps the best understood point perturbation is the vacancy in Si, so
we initiate our discussion of particular deep centers with it.
Vacaricy in Si
Owing to the value of about 0.3 e V for U , it may be expected that three
or four ionization levels codd fit in the Si gap of about 1.1e V . A4ctualIy,the
donor levels 5(+/2+), t(O/+), and the acceptor levels V ( - / 0 ) and V ( 2 /-1 are found there (as above the acceptor levels are counted relative to the
conduction band edge). In Figure 3.12 (part b). calculated positions of these
levels are shown. These are not yet final positions because lattice relaxation
has not yet been considered. The latter leads to changes which are discussed
below (see Figure 3.13). In charge state V(Z+), no electrons are available
of
311
Figure 3.12: Deep levels of avacsncy in Si: a) Calculated ionization levels without
Hubbard corredions and lattice relaxation. b) Levels of a) with Hubbard corrections. c ) Fsxperimetital ionization levels which, in addition, include Jahn-Teller
shifts. The numbers give the level distances from the valence band edge in eV.
(After Watkins, l S S 4 . )
cl v
D2d
Dzd
d) v-o+d
-btc
Figurr 3.13: Defect molecule model of a vacancy in Si. Different charge states of the
vacancy are shown, taking into account the torresponding Jtthn-Teller distortions.
On the right-hand side, the basis functions of the irreducible representations of the
deep k v e l ~are indicated (a, b, c Bnd d bband for the dangling hybrids of the 4
surrounding atom). (After Watktns, l U U 4 . )
relatively different systems as, for inst ance, Si:Hg.
The perturbation polrntial of a main group impurity atom in a tetrahedral semiconductor contains, besides the short-range part, as a rule, also
a long-range Coulomb part. For isocoric impurity atoms, i.e. atoms whose
cores do not deviate too strongly from that of the host atom, the Coulomb
potential is in general the main contribution. As shown in section 3.4, this
potential leads to shallow donor or acceptor levels. These are the levels
principally involved if main group elements are used as doping atoms for
313
?r
Be
hk
A1
Vac.
EC
- 0.43
N
Ec - 0.14
P
-
Zn
Ga
Ge
As
E1, 0.32
+ 0.55
Hg
Ec - 0.31
S
Ec - 0.31
Se
E,
- 0.3
In
Sn
Sb
Te
E, t 0.15
Ec - 0.20
T1
Pb
Bi
Po
Cd
E,
E,
+ 0.25
ER
315
>"
QI
-1
-2
I
11
I I!
Eandedge
A
>
-1
Q)
-2
Cation Site
-3
-30
-20
-10
d)
el
fI
Figure 3.15: Energy levels and their populations for sp3-bonding main group impurity atoms in elemental semiconductors of group IV,within the defect molecule
model. The partial illustrations a) to h) correspond to impurity a t o m of groups I
to VII as follows: a - V, b - \I, c - VII, d - HI, e - 11; f - I.
and the defect molecule model. Of the 9 electrons of a Sirgroup-1 atom
molecule (remember that only N results in a deep level in this case), 8 are
hosted by the four bonding states and the remaining electron occupies the
anti-bonding al-state in the gap. In a 5t:groupVI atom molecule with 10
electrons, 2 occupy the al-level, and in the 3t:groty-VI atom molecule with
11 electrons, in addition 1 electron has to be placed into the anti-bonding
tl-level. This state of the molecule will certainly not be stable. As in the vacancy case, a Jahn-Teller distortion will occur which removes the degeneracy
of the ta-level and allows for a lower total energy by occupying levels shifted
downwards. The defect molecules of group-111. -11 and -I atoms have. respectively, 7, 6 and 5 electrons. Of them 2 electrons are hosted by the bonding
al-level in the valence band. There are, respectively. 5, 4, and 3 electrons
to be placed in the bonding tz-state which forms the deep le\Tel in this case.
One may also say that this level hosts, respectively, 1. 2, and 3 holes. Again,
a Jab-Teller distortion will occur which lowers the total energy. Qualitative
317
differences from this model occui for elements of the first main group which
have no occupied p-states but relatively shallow closed d-shells (Cu, Ag) or
d- and f-shells (Au). Foi these atoms, d-electrons participate in chemical
bonding with the host crystal. We will discuss this problem in more detail
in the context of the transition metal impuiity atoms.
The question of whether deep levels exist or not for a particular substi
tutional impurity atom, ran he treated arralytzrally, employing the Wannier
representalion of the Greens function derived above. As lhis calculation
provides further physical insight into the formation of deep levels, we will
discuss it hclow, agdin taking Si as the host crystal. The Chens function
in Wannier representation has already been determined above (see equations (3.158) and (3.159)). For the evaluation of the Koster-Slatcr equation
(3.152) we need the matrix elemeuts V,,,,V,, V ,of the perturbation potential V in this representation. To calculate them we use the results of the
TR approximation of scction 2.5. In the simplest veision of this upproximation, the ILamiltonian of the ideal crystal is diagonalized by the bonding
and anti-bonding Bloch functions Ibtk) and In&), t = 1 , 2 , 3 , 4 , of equations
(2.303) and (2.304). Thus, in an approximate sense, the bonding and antibonding orbitals IbtR) and latR) are the Wannier functions of the eigenvalue
equation (3.152). We will use thmn to grt explicit expressions for V,,,,,V,,,
and V,. Fiist, we write down the matrix representation of the unperturbed
Hamiltonian H between bonding orbitals. It is given by
(btRIHI4R)
fhGRR,
h
(3.165)
where E; is the hybrid energy of a host atom. For the perturbed Hamiltonian
EZ fV, one has t h e same matrix elements for all R and R with the exception
of R R= 0. The latter elements are
(btOlH iVlbtO) -
S(E;
+ EL),
(3.166)
where EL is the hybrid energy of the impurity atom. Siiicc none of the matrix
elements of (3.165) and (3.166) depend on t , one may identify IbtO) with the
Wannier function of the uppermost valence band. Then, taking the difference
of equations (3.166) and (3.165), an expression for (vO1V(wo) G V,, follows
SS
vlf, - v o , vu
s1( 6 th - Eh).
h
(3.167)
The factor in (3.166) arises because the Wannier function is equally spread
out over the two atomic sites of a unit cell, while the substitution of a n atom
occurs at only one site.
Analogoiisly, matrix elementu of fI and IT I V between anti-bonding
orbitals may be used to obtain an expression for (cO(Vlc0)I_ v,,,and matrix
v,
= v, = v,, = LG.
(3.168)
Solutions of equation (3.169) within the gap do not exist for all possible
values of the 4 parameters entering, i.e. the gap energy Eg, the two band
widths AE,, A&, and the perturbation potential constant Vo. This is evident if one considers the particular case AE,,v >> Eg,which can be realized.
The arguments of the logarithmic functions in G : ( E ) and G:(E) are close
to 1 in thia case, and the logarithms themselves become negligibly small. If
one assumes, in addition, that A & = AEw,then
Re[G:(E) G:fE)] -
(F)--&[,,/=.
fi]
(3.170)
With this exprwsion! the deep level condition (3.169) takes the form
(3.171)
It has solutions Et only in the energy gap 0 < Et < E,, within which Et has
to be restricted anyway because the deep level condition (3.170) is valid only
there. For negative Vo, one necessarily has Ef > Eg/2,and for positive Vo,
Et < E,/2. For Vi -t +m, El approaches the value E,/2, i.e. the deep level
is pinned at the midgap position. This corresponds to the pinning of & to
the vacancy level discussed in subsection 3.5.2. In order for a solution Et of
equation (3.171) to exist at all, lV01 must exceed a minimum value J V O ) ~ ~ ~
given by
(3.172)
This minimum value of (Vo(follows from equation (3.171) by identifying Et
with the lowest possible deep level position in the gap namely Et = 0 in the
case of positive Vo, and by identifying Et with the highest possible value of
Et namely E, in the case of negative VO.Considering equation 13.1721, the
solution of equation (3.171) may be written in the closed form
319
Table 3.7: Perturbation potential matrix elements Vo = (1/2)(~: - $) (in eV) for
substitutional impurity atoms of main groups in Si, calculated by means of HermanSkillman s- and p-orbital energies. The latter are partially reproduced in Table 2.2.
(After Herman and Skillman, 1963.)
B e B
1.57
Mg
2.18
Zn
1.82
Cd
1.91
Hg
1.87
0.08
A1
1.05
Ga
0.88
In
1.14
TI
1.17
C
-1.41
Vac
8.28
Ge
-0.04
Sn
0.35
Pb
0.47
N
0
-3.04 -4.8
P
s
-1.12 -2.31
F
-6.71
c l
-3.56
As
Se
Br
-0.99 -1.97 -2.98
Sb
Te
I
-0.42 -1.22 -2.03
Bi
Po
At
-0.24 -0.96 -1.68
of semiconductor crystals
with perturbations
the rritprion is correct in all caws. pxrept for In and TI, which are close to
the border line of our model but still on thp side where no deep level6 exist.
while experimentally surh 1wrls art- found.
'
32 1
(3.174)
where
'
11
GoAs to/2.4
The corresponding thecomponent wavefunctions It2b) and Itza) of, respectively, energy Eb and E,, are
where we set
1
a2 = - ( I -
s
a)'
a 2 = -1( l + - s) .
2
(3.177)
323
c -band
t2a
t2
4s
3d
v - band
Figure 3.17: Energy level diagram of the defect molecule model for a substitutional
3d-TM impurity atom in a tetrahedral semiconductor. Explanations are given in
the text.
Since the two bonding and anti-bonding eigenvectors Itzb) and Itza) of equation (3.175) are linear combinations of basis functions with tz-symmetry,
they also have t2-symmetry (as already indicated by the notation). It turns
out that the bonding tzb-level lies in the valence band, the non-bonding elevel in the valence band or the gay, and the anti-bonding tk-level is in the
gap above the e-level or in the conduction band. In regard to deep levels of
the 3d-TM atom i s in the energy gap, therefore, the 2-fold degenerate e-level
and the abovelying triply degenerate anti-bonding tp,-level are candidates
(see Figure 3.17). The eigenfunctions of the e-level are linear combinations
of the &orbitals of the TM atom and are therefore strongly localized at the
latter. Considering localization of the t a b and tn,-leveh, the position of the
two orbital energies F d and t h relative to each other is decisive. If Ed lies
deeper than Ch, then a! > ,8 holds, meaning that the bonding tzb-level is
mainly formed from the d-ststes of the TM atom, while the anti-bonding
tza-level, which forms the deep level, arises to a considerable extent from the
tz-components of the four sp3-hybrid orbitalb of the surrounding host crystal atoms. Exactly this picture emerges from the above-mentioned Greens
function ab initio calculations (Zunger, Lindfeldt, 1983). In Figure 3.18 the
calculated oneparticle energy levels are shown for some 3d-TM atoms in
Gap.
The most striking feature of the above description of the electronic structure of substitutional 3d-TM impurities is the existence of deep levels whose
wavefunctions are spread out over the surrounding host atoms. These levels
4a
*\1-
-jFq
-3
-2
are host-like to a certain extent rather than purely TM atom-like. This picture stands in remarkable contrast to the so-called ionic model, which was
long believed to be correct for substitutional TM atoms also. In this model,
the deep level is strongly localized at the TM atom. Today there exists clear
experimental evidence that this is not the case. The ionic model is based
on the so-called ligand field theory. This entails an approach to the deep
level problem for impurity atoms which differs fundamentally from the one
used in this book. Within ligand field theory, it is assumed that an impurity
atom X in the crystal has essentially the same electronic structure as the
free X"+-ion, where V + means the oxidation state of the impurity atom in
the crystal introduced above. The energy levels and wavefunctions of this
ion are weakly disturbed by the crystal field at the impurity site, an effect
referred to as crystal field splitting. While this model applies relatively well
in the case of ionic crystals (where ch lies deeper than c d ) , it evidently fails
in the case of the covalent or partially covalent tetrahedral semiconductor
crystals. The oxidation states of ligand field theory are, however, also r e
produced in the approach taken here. To demonstrat,e this we consider the
example of a Co atom substituting the metal atom in a 111-V compound.
There are 9 electrons in the 3d74s2 valence shell of Co, and 5 in the valence
shell of the group V atom to which the Co atom binds. 8 of these 14 electrons
are hosted by bonding states of the deep center (2 by the bonding al-state,
and 6 by the bonding t 2 state). S i x electrons remain for the population of
the deep impurity states, i.e. the non-bonding e-states and the anti-bonding
t2-states. Since 9 electrons are expected at a neutral Co atom, the oxidation
state of Co in a 111-V compound is Co3+. Substitutional Co in a 11-VI compound has the oxidation state Co2+ since one additional valence electron is
provided by the group VI atom of the host crystal.
325
3.0
I
-552
2.0
LT
W
z
w
1.o
0.0
and donor ionization levels of a 3d-TM impurity atom in a series of I11 V and
11-VT compound semiconductors. The rigid displacement along the energy
axis has an important physical meaning as we will discuss further below.
Here, we will provide a simple explanation for the similarity of the 'ionization-energy-versus-TM atom' curves in different semiconductors (Tersoff,
Harrison, 1986). In this discussion we will use the defect molecule model
of 3d-TM impurity atoms presented above, in addition, however, the dependence of the energy levels f d and Eh on their population will be taken into
account in a self-consistent way. Let n d , n h and n:, n: be the electron numbers in, respectively, the atomic d- and sp3-hybrid orbitals, the former after
charge transfer between the impurity atom and the crystal has taken place,
and the latter before. The dependence of the levels Ed and ~h on population
may, in the linear approximation of equation (3.25), be written as
Ed
= t d0
+ Ud(nd -
r i 0d ) ,
Eh = 1,0
tJh(nh - nh).
0
(3.178)
Here c,: :E denote the energy levels without charge transfer, and [Jd, U h
are, respectively, the Hubbsrd energies of the d- and the h-states of the sp3hybrids. The electron numbers in the bonding and anti-bonding ta-states
will, respectively, be denoted by rib and n,, and the electron number for the
estates by ne. Of the total number of electrons of the defect molecule,
n d = nbu! 2
+ nap2 in,
(3.179)
327
(3.180)
n h = n b P 2 +,,a2
are in h-states. The bonding T2-level Eb lies in the valence band, its three
states are therefore completely occupied and it holds n b = 6. The difference
6 = (Ed - h ) / 2 between the d- and h-states adjusts self-consistently: if Eh
increases, then 6 becomes small and, with this, also ,B and n h are small. As
a consequence of this, n d increases which, according to (3.178), results in an
increase of d and, hence, an increase of 6. Self-consistency is then achieved
if the two equations (3.178) for the level positions t d and Eh are satisfied with
values n d and n h from equations (3.179) and (3.180), respectively, and values
a 2 , P2 from (3.177). For the level difference 6 = ( d - ~ h ) / this
2
condition
yields
"
ci)
,(nu
+uh
1
6
+ 6 ) - -(nu
2
- 6)-A - n:)]
(3.18 1)
where 260 =
denotes the level distance without charge transfer.
The crucial point at this juncture is that [ I d , because of strong localization
of the &electrons and wcak scrcening of their interaction, has very large
values (on the order of magnitude 10 e V ) . Therefore, the factor multiplying
U d in equation (3.181) must practically vanish itself in order to satisfy this
equation, leading to the approximate rclation
(E!
+ 6 ) + s1( n u- 6 ) A-6
i ( ~ a
1 ne - n d w 0,
(3.182)
32 9
(Auj lie energetically relatively shallow below the s- and p-shells. According
to calculations by Fazzio, Caldas and Zunger (1985). the anti-bonding t 2 state lies in the energy gap, just as in the case of transition metals, and as in
the latter case, this state arises from the interaction of the impurity-derived
d-states (more exactly their tg-component) with the tg-components of the
dangling bonds of the surrounding host atoms. Because of the relatively
weak localization of this deep impurity state, its Hubbard energy is small
so that. again. several ionization levels can fit within the gap. For all three
K M atoms, the calculations result in a Nh/l(O/+) donor level, and a NM(-/O)
acceptor level. In the case of Ag and Au, the amphoteric character has also
been conhmed experimentally.
Rare earth atoms
The rare earth (RE) atoms are among the following elements: Ce, Pr, Nd,
Pm, Sm, Eu. Ac. Th. Pa, U. Np, P u , Am, and Cm. The valence shell configuration of the majority of these atoms may be described as 4fn6s2 with the
number n of f-electrons varying from 2 for Ce to 14 for Yb (see Table 3.3).
Exceptions are Gd and Tb which have one 5d valence electron in addition.
The investigation of RE impurities in Si as well as in the 111-V and 11-VI
semiconductors has received new stimulus very recently. The reason for this
is the luminescence of RE impurity atoms in these crystals in the technologically interesting visible and infrared spectral regions. In equilibrium the RE
atoms are installed both substitutionally as well as interstitially. The equilibrium solubility of the RE atoms is, however; rather small. For practical
applications, non-equilibrium incorporation techniques like ion implant ation
must be used, although, generally, only parts of the implanted atoms are
optically or electrically active. For Erbium (Er), impurity concentrations of
about lo1 ~ r n have
- ~ been reported in Si, and of about 10l8 C W I - ~ in Ga4s.
The technologically interesting luminescence discussed above is caused
by internal transitions within the 4f shell of the RE atoms, more strictly
speaking, between the various levels arising from the many-particle levels of
the 4f-shell under the influence of the crystal field. The 4f-orbitals remain,
in fact. almost unchanged by installation of the RE atoms in the crystal. This
may be traced back to the fact that these orbitals are strongly localized and,
moreover, strongly shielded by the 6s-electrons shutting out influences of the
surrounding crystal. Furthermore, for RE elements heavier than Nd, there
is also strong shielding by the completely occupied 5s- and 5p-shells lying
outside the 4f-shells for these elements. While internal electron transitions
within the 4f-shell have been studied already for a long time, transitions to
energy levels outside of the 4f-shell, including donor and acceptor transitions
into conduction and valence band states of the crystal, have been subjected
to more detailed investigation only recently.
'
'
"
6
4
5 0
v
P
-2
g-4
E(3+/4+)
-6
-8
-10
-12
1I
,
Ce Pr Nd Rn Sm Eu Gd Tb Dy Ho Er Tm Yb
Figure 3.21: Donor ionization levels of substitutional rare earth impurity atoms in
different semiconductors. (After Delerue and Lannoo, 1994.)
A rough but qualitatively correct picture of the electronic structure of substitutional RE impurity atoms in tetrahedral semiconductors may again be
obtained by means of a defect molecule model (Delerue and Lannoo, 1991).
In this model, the 4f-orbitals neither interact with the 5d- and 6s-orbitals
of the RE atom nor do they couple with the orbitals of the surrounding host
atoms. The RE atom is represented by its 5d- and 6s-orbitals only. The 4forbitals, nevertheless, are involved in forming deep states because electrons
are transferred to or from them. The host crystal is described by the four
sp3-hybrid orbitals at the four neighboring atoms pointing towards the RE
atom. The model is therefore largely analogous to that of 3d-TM atoms described above: The five 5d-orbitals decompose into two e-orbitals and three
tz-orbitals, and the 6s-orbital becomes a al-orbital under the influence of
the tetrahedrally symmetric crystal field. The four sp3-hybrid orbitals at
the neighboring atoms pointing toward the RE atom split into one a l - and
three t2-orbitals. The interaction between the two al-orbitals results in a
bonding and an anti-bonding al-state lying deep in the valence and conduction bands, respectively. The e-states remain without bonding to the host
crystal, while the host and RE derived t2-states couple to each other forming
bonding and anti-bonding tz-states. The differences between 3d-TM and RE
impurities are essentially of a quantitative nature. The 5d-levels of the RE
atoms are higher in energy and therefore closer to the hybrid levels at the
neighboring atoms than are the 3d-levels in the 3d-TM case. Because of this,
the e-level and the anti-bonding tz-level of the RE atom (which may lie in
the gap in the case of 3d-TM atoms) are lifted into the conduction band.
33 1
The position of the If-shell with respect to the above levels still has to be
determined. In the model by Delerue and Lannoo this is done assuming that
the distance between the 4f-shell and the average 5d-level in the crystal is
the same as that between the 4f-shell and the 5d-shell in the free RE atom.
The ionization levels of the 4J-shell calculated by means of the above
defect molecule model are shown in Figure 3.21 for various RE atoms and
host crystals. Two diffcrcnt oxidation states of the RE atom, RE3+ and
RE2+, have been assumed. One recognizes that, for InP, the oxidation state
RE3+ is stabk in most rasrs, because the RE(3-t /4 t ) ionization levels are
still below the Fermi level, while the RE(2 t/3+) ionization levels are above
it. For CdTe, the oxidation state RE2+ is found to be stable in most rases.
With a few exceptions, these predictions are confirmed by experimental investigations. Oxidation states can also be derived in a more direct way from
the above defect molecule model. In a 111-V compound host crystal, the RE
defect molecule has 2 n 5 electrons, 2 n of them from the HE atom, and
5 of them from the host crystal anion. These electrons orrupy either states
in the 4f-shell, or are in bonds between the RE atom and the crystal. The
bonding al- and t2-states host 2 f 6 = 8 electrons provided that they are
energetically lower than the f-shell, which in fact turns out to be the case.
Thus, 2 n 5 - 8 = JZ - 1 electrons remain for the f-shell. The oxidation
state of a neutral RE atom is RE3+ in such circumstances, because 3 electrons are missing at the RE atom, the one f -electron and the two s-electrons
in bonds with the host crystal. In 11-VI compounds, the oxidation state of a
neutral RE atom should be RE2+ according to this consideration, since the
host crystal anion delivers 6 electrons instead of 5.
A striking feature of the ionization-energy-versus-RE-atom-curves for
different semiconductors in Figure 3.21 is that they can be brought to overlie each other by rigid relative displacements. This is reminiscent of the
ionization energies of the 3d-TM atoms where the same feature is observed.
The reason is similar to that of the 3d-TM case: one the one hand, the
4f -levels are pinned electrostatically to the average 5d-levels by means of
the self-consistent charge exchange between the d- and the f-shell with its
large Hubbard energy, and on the other hand, the 5d-levels are pinned at
the average hybrid energy levels.
+ +
+ +
333
Direction
Figure 3.22: Geometry of the stable and metastable state of the As-antisitc defect
in GaAs. (After Dabrowski and SchrJler, 1992.)
interstitial As atom and Ohe three adjacent, A s atoms, st,rong sy2-bonds can
be formed similar to the bonds in the graphite structure of carbon and that
of g a y As. The non-bonding p-orbital of the A s atom, which lics rclatively high enmgetically, does not increase t,he total energy of thc interstif,ial
s h t e because it remains largely unoccupied, the two remaining electrons are
hosted by the dangling bond of t,hc fourth remotc lying As atom.
DX-centers
A metastability similar to that of the C:aAs:AsG,, ailtisite defect is also observed at ot,hcr point perturbations, among them at t,he so-called L)X-cent,ers
in GaAs and (Ga,Al)As mixed crystals (Lang, Logan, 1977). The microscopic nature of these centers was a puzzle for a long time. It was only
V and
clear that they were related to impurity atoms or the main groups l
VI of the periodic table which normally are incorporaled subst,itiitionally on
cation and anion sitcs, respectively, and form shallow donors. lTndcr certain conditions hydrostatic pressure or strong Ti-type doping in the case of
GaAs, and AlAs mole fractions larger than 22% in the case of (Al, Ga)As deep levels emerge from these shallow donors. Originally this transition was
attributed to the formation of a defect complex involving the donor atom
(a)and ti11 iinknown point, perturballon (X). Now, it is clear t,hat the D X center j u s t represents another state ol 1he shallow donor. In contrast to the
lat,t,er,the donor atom of a DX-ccntcr is not neutral, but singly negatively
charged, and its installation does not occur at a substitutional site but at an
interstitial site. If the special Conditions mentioned above are not fuliilled,
t,hen the shallow donor represents a stable state of the impurity atom, the
DX-state is metastbble. However, as for the EL2-center, thc rclalive total
energy minimum of the DX-state lies only slightly (about 0.2 e V ) above
thc absolute minimum of the shallow donor state. The reason for this energy balance is the sainc as in the E L 2 case, namely the energetically costly
populalion of the anti-bonding A1-level at the shallow donor. In the singly
negatively charged state which the donor takes under high doping, (and ev-
3.6
3.6.1
335
one has 5 x loz2 bulk atoms and only 4 x 1015 surface atoms. Beside the
above mentioned bulk properties, there are other properties or processes,
like crystal growth, oxidation, etching or catalysis, which are determined by
the surface atoms only. In these rases, the model of an infinite solid does not
apply, of course. Moreover, in semiconductors, bulk properties like transport
are often not controlled by all atoms but only by dopant atoms. Then the
number N s of surface atoms per cni i s to be compared with the number
of dopant atoms. For a semiconductor sample of thickness d , area 1 cm2,
and doping concentration N D , the number N o x d of dopant atoms per c7n2
can easily become as small as the number IV, of surface atoms. This means
that the transport properties of such a semiconductor sample will depend
on its surface. In the history of semiconductor physics, this was recognized
at a very early stage. Examples which demonstrate this are the electric
rectification at a semiconductor-metal contact (discovered by F. Braun in
1874), and the unsuccessful attempts to build a field effect transistor in
the late thirties of this century. The failure was caused by electron states
localized at the surface which captured all the electrons induced by the
extcrnal voltage.
lhe surfaces used in the early field effect experiments were far from
perfect. They were made by cutting or cleaving a semiconductor sample in
air. If at all, they were clean and smooth in a macroscopic sense, but not
so microscopically. They exhibited surface roughness on the 100 nm scale,
structural defwts on the 1 nm scale, and impurity atoms at and below the
surface. The surface states responsible for the difficulties of the early field
effect transistor were due to these imperfections. In the present section we
will deal with surfaces which are free of such imperfections. Perfect surfaces
of semiconductor crystals in this sense necessarily represent a particular
lattice plane occupied only by chemically correct atoms at regular sites.
No impurity atoms are allowed above or below the surface. The surface is
reduced to what it means ideally, the termination of the crystal.
Perfwt surfaces in this sense cannot, of course, be realized in practice.
One may only try to approximate them so closely that the existing imperfections do not essentially change the properties of the surface as compared
to the properties which a perfect surface would have, if it really could be
made. One calls such almost perfect surfaces clean. Although this term
refers only to chemical composition, it also implies structural perfection or
atonzc smoothness. There are essentially t h r w ways to manufactwe clean
surfaces. All thrw need ultrahigh vacuum (UHV), i.e. pressures below
Torr:
(i) Treatment of imperfect surfaces by ion bombardment and thermal annealing (generally in several cycles).
(ii) Cleavage under UHV conditions (only surfaces which are cleavagc planes
+ +
h l , Ik2, jL3,
337
n - h l b l + h2bz
+h ~ b 3 .
(3.182)
Hrre, reciprocal lattice vectors are understood without the factor 27~in definition (2.122). If the normal direction n is given, the roefkients ( h l h ~ h 3 )
are only determined up to a common integer factor. We chose this factor
such that h l , h 2 , hs have no common divisor. Then the coeffirients hi,h2, h 3
are the Miller indices of the lattice plane under consideration, however, r e
ferring to the primitive lattice vectors al,a 2 , a 3 as coordinate axes rather
than to the crystallographic axes. The latter are parallel to the piimitive
lattice vectors only in the case of primitive Bravais lattices. For centered
laltires, like the face rentrred cubic one, they have different directions, and
the coefficienls ( h l h ~ h 3differ
)
from the corrirnon Miller indices. Tf necessary,
one can rasily switrh from one representation to the other. The ( h l h 2 h 3 )
are, apart from a co~linionfactor, obtained by multiplying the ( h k l ) with the
matrix which transforms the three nun-primitive crystallographic axis vectors into thc three primitive lattice vectors. Using the above characterization
of lattice planes by their normal direction n, the lattice points
Ro = rloa1-k r20az
+ '7'30ag
(3.183)
of the lattice plane perpendicular to n which contains the zero point, may
be defined by the equation
n & E h17.10
+ h2r20 + h3r30 = 0.
(3.184)
The unique character of this equation lies in the fact that only integer solutions r10,r2O1r3o are allomwl. In mathematics it is called a Dzophantin
equation.
+w
a ~
(3.185)
of the infinite family of parallel plancs are obtained by replacing the right
hand side of quation (3.181) by arbitrary integers 1,
(3.186)
The points of the lattice plane defined by equation (3.184) form a 2D lattice.
The primitive vectors of this lattice will be denoted by fi and f2. This i s
done in such a way that the three vectors f1, f2, n form a right hand coordinate system. The f1, f2 may be expressed in terms of the primitive lattice
Table 3.8: Primitive surface lattice \rectors and stacking vectors of low index surfaces of semiconductor crystals.
Crpst'alStructures
I1
1
WurtBite, Selenium
Ro = S l f i
+ sgh.
(3.187)
The lattice planes Rr defined by equations (3.185 and (3.186) can be written
in the form
339
Figure 3.23; Construrtion of a crystal from its lattice planes. The lattice points of
a given plane are occupicd by identical atoms. Different planes may host different
atoms if the crystal has a basis. In the figure, a crystal with o~ilyone atom per
primitive unit cell is shown.
may be characterized as consisting or parallel lattice planes which are displaced with respect to each other and which are each o c c i i p i d hy atoms
of a particular species (see Figure 3.23). 'I'o be more specific, the lattice
plane Ro is occupied by atoms of species 1, the next plane, displaced by 6
with respect to the first onc, is occupied by atoms of species 2, etc., and the
plane displaced by FJ is occupied by atoms of type J . It may happen that
two OT more atoms of the basis are located at the same lattice plane. In
that case an ntomic layer consists of two or more basis atoms. The lattice
plane Ro 7j completev the constsuction of a crystal slab which, in the
vertical direction, encompasses exactly one yrimitivc bulk unit cell. This
slab is callcd a p r i m i t i v e crystal slab. A lattice plane occupied by atoms is
referred to nb: a t o m i c layer. The second primitive crystal slab begins with
a lattice plane occupied by atoms of species 1 and is displaced with respect
to the zero-th plane of the first layer by the stacking vector f3, followed by
a plane with atoms of species 2 which is displaced by f3 t ?2 etc., the last
plane of the second primitive crystal slab being occupied by atoms of species
J and displaced by f3 T'J, The second primitive crystal slab is followed
by another J planes dlvplaced with respect to the first slab by 2f.j instead
of f3. The crystal can therefore be thought of as consisting of successive
primitive crystal slabs sitiihtcd one above the other, each of which consists
of atomic layers containing difkrmt types of atoms and which are laterally
displaced with respect to onc another. The location of an individual atom
can bc specified by the number 1 of the primitive crystal slab, the numbers j
of the atomic sublattice and the integer coordinates s1, s2 witahinthc lattice
plane. The position R(j, I , s1, s2) of an atom j in crystal slab 1 can therefore
be written as
(3.189)
The complete set of locations of atoms in an infinite 3D crystal can be
obtained by assigning all possible integer values from --oo to +m for 1, s 1 , s 2 ,
and all integer values from 1 to J for j . The normal direction n, upon which
the whole construction of lattice planes is based, is arbitrary in the case of
an infinite 3D crystal. Any choice of n yields the same crystal.
We can immediately employ the above representation of an infinite crystal in describing a crystal with a surface. The normal direction n is then,
however, fixed by the direction perpendicular to the surface. We start with
the description of a crystal having an ideal surface. The exact meaning of
ideal in this context is explained immediately below.
n . R(j, 1 , s 1 , 5 2 ) = I + n . Tj 5 0.
(3.190)
The surface or first atomic layer is obtained if the left hand side of this
relation is taken to be zero. A solution of (3.190) is 1 = 0 and
= 0.
34 1
r =slf1+
s2f2
(3.191)
holds. The vectors f1, f 2 are the primitive lattice vectors of the 2D lattice
of the ideal surface. The primitive unit cell of the crystal with surface has
the form of a prism bounded above by the parallelogram spanned by f1, f 2
and extending to minus infinity in the direction of -f3. For the low-index
surfaces of the common semiconductor crystal structures, the three vectors
f1, f2, f 3 are listed in Table 3.8.
As in the 3D case, 2D lattices may be divided into crystal systems and
Bravais lattices according to their symmetry with respect to rotations and
reflections. We now consider the possible plane crystal systems and plane
Bravais lattices. Their point symmetry elements are necessarily rotations
about axes which are perpendicular to the 2D lattice plane, and reflections
at lines within the 2D lattice plane. Thus, the point groups are either C ,
( n ) or C,, (nm, nmm). Since the rotation through 180' is always a symmetry element of a plane lattice, only even n are allowed. The highest value
of n may be readily obtained from the derivation of the possible rotation
symmetry axes of 3D crystals in Chapter 1. There, n 5 6 was found. Thus,
the possible multiplicities of rotation symmetry axis of plane lattices are
n = 2,4, and 6.
A lattice which only contains a 2-fold symmetry axis is either a completely general oblique lattice or a rectangular lattice. The point groups of
these lattices are, respectively, C 2 (2) and C2, (2mm). Lattices with a 4-fold
symmetry axis also possess 4 reflection lines which are rotated through 45'
with respect to each other. The point group of such a lattice is therefore C4v
(4mm). Similarly, lattices with a 6-fold axis have 6 reflection lines which
meet at an angle of 30'. In this case the point group is c 6 v (6mm). There
are thus 4 different plane crystal systems - the oblique with holohedral point
group 2, the rectangular with holohedral point group 2mm, the quadratic
with holohedral point group 4mm and the hexagonal with holohedral point
group 6mm (see Figure 3.24).
The possible plane Bravais lattices are obtained as follows. First, one
takes the four primitive lattices with unit cells which are parallelograms
having either no particular symmetry, or that of a rectangle, a square or an
equilateral hexagon. Then one adds additional points to each of the unit cells
of these lattices in such a way that their point symmetries are not lowered.
Only one new lattice is obtained in this way, namely the 'body' centered
343
oblique
rectangular
quadratic
hexagonal
Figure 3.24: The 5 plane Bravais lattices of the 4 plane crystal systems.
rectangular lattice. This lattice cannot be transformed by continuous and
symmetry preserving transformations in the primitive rectangular lattice,
nor in any other primitive lattice. So it forms an additional Bravais lattice.
In all other cases, the addition of points either leads back to the primitive
lattice or results in no lattice at all (i.e. it creates a crystal with a basis). We
conclude that five plane Bravais lattices can be realized within the framework
of the 4 plane crystal systems: only the primitive lattice in the oblique case,
the primitive and the centered in the rectangular case, and again only the
primitive in the quadratic and hexagonal cases (see Figure 3.24).
With the last statements we have completed the symmetry classification
of the plane lattices of crystal surfaces. The lattice types of the ideal low
We wish to establish, on the one hand, point groups which transfer equivalent
directions of a crystal with surface into one another and, on the other hand,
space groups which transform the crystal with a surface into itself. In the
latter case this implies that not only does the 2D lattice transform into
itself, but so do equivalent atoms occupying the primitive unit cells. These
atoms are located at positions R(j,I, sir92) given by equation (3.189). To
express the 2D nature of the translation symmetry of a crystal with a surface
~)
explicitly, we denote the atomic positions by Rjl(s1, s2) R(j,I , ~ 1 , s and
write
Table 3.9: Structural properties of ideal low index surfaces of the five common
semiconductor crystal structures. Column 6 and 7 give, respectively, the number
of atomic layers of an irreducible crystal slab and the number of basis atoms per
layer.
Bravais Lattice
Point Group
Space Group
Irr. Slab
Basis
hexagonal
3m
p3ml
square
2mm
p2mm
p-rectangular
2mm
p2mg
hexagonal
3m
p3m1
square
2mm
p2mm
p-rectangular
plml
hexagonal
3m
p3ml
square
4mm
P h m
prectangular
2mm
p2mm
hexagonal
3m
p3ml
prectangular
plml
prectangular
P W
hexagonal
Pl
(ioio)
p-rectangular
Pl
(1120)
p-rectangular
P21
(ioio)
where
represents the basis of the crystal with surface. This basis contains an infinite number of vectors corresponding to the infinite number of atoms in a
primitive unit cell. The point and space group elements of a crystal with
surface must leave the surface invariant, i.e. only 2D symmetry groups need
to be taken into consideration. Here, as in the case of translation symmetry,
we therefore also have the situation that, although a crystal bounded by
a surface extends in three dimensions, its point and space groups are only
2-dimensional.
The point groups of directions of crystals with plane surfaces are necessarily subgroups of the holohedral point groups of the plane lattices, i.e.
subgroups of the point groups 2,2mm, 4mm and 6mm. There are exactly
345
a
2
@
..
2 mm
0c3
1
.4 m m
(Ii
@
..
a
6mm
@
3m
(I>
3
Figure 3.25: The 10 point groups of equivalent directions of crystals with plane
surface.
10 such groups. Their stereograms are shown in Figure 3.25. This implies
the existence of 10 different crystal classes which are associated with corresponding crystal systems as indicated in Figure 3.25.
The possible plane space groups can be found as follows (see Figure 3.26).
To start with, it is evident that each of the 10 point groups of equivalent
directions combined with the corresponding associated lattice gives rise to a
space group. The space groups p l , p 2 1 1 , p l m l , p 2 m m , p 4 , p 4 m m , p 3 , p 3 m l , p 6
and p 6 m m originate in this manner. Since the point groups of the rectangular crystal system are each associated with two Bravais lattices, primitive
and centered, we find two further space groups, c l m l and c 2 m m . In the case
of the point group 3 m , there are two different possibilities of positioning the
reflection lines relative to the hexagonal lattice vectors, either through the
vertices of the equilateral hexagon of the Wigner-Seitz cell, as assumed in
the case of p 3 m 1 , or such that ,hey bisect its edges. In the latter case one
crystal
svstem
oblique
2
rect angular
Bravais
lattice
ooint
soace
LLZ
p-rect a n g ul o r
2mn
I I
p'g'
Ip- I
c-rectangular
347
2mn
c2mn
p-square
p4
4mn
p4mn
I
I
,-hexagonal
p3m1
Figure 3.26: Continuation: The 17 space groups of crystals with plane surface.
primitive rectangular crystal system are p l g l (from p l r n l ) and p2rng,p2gg
(from p2mrn). The centered rectangular and the oblique crystal systems do
not give rise to additional space groups. In the case of crystals with plane
surface, there are therefore a total of 17 different space groups. Four or them
involve glide reflections. i.e. they are non-symmorphic.
The 2D point and space groups of the various surfaces of a given crystal
may be derived from the 3D point and space groups of the infinite crystal under consideration. They are, in fact, the subgroups of the 3D groups
which contain only those symmetry elements which transform lattice planes
In the preceding section the atomic structure of crystal surfaces was considered under the assumption that the atoms of the crystal bounded by a
surface occupy the same positions Rjl(s1,s2) as they did in the infinite 3D
crystal, if the former is generated from the latter by removing one half of
it. As already noted, this assumption is actually not valid. The atoms of
the surface layer experience different forces than those acting in the bulk
of the crystal, and are thus subjected to displacements from their original
sites in the crystal bulk. Since the forces acting on the atoms of the second
layer are in part determined by the positions of the atoms in the first, these
forces are also subjected to changes accompanied by displacements in the
second layer and so on for each successive layer. All one can assume is that
the displacements decrease from one layer t o the next and vanish altogether
at a depth that is relatively far from the surface. Here, we present a more
detailed description of the surfaceinduced displacements of atoms and discuss the resulting altered symmetries as compared to those of ideal crystal
surfaces.
Translation symmetry of relaxed and reconstructed surfaces
349
(1111
coiii
(1101
(111)
(100)
(110)
0 1 A 0 1 A 0 1 A
O2B 0 2 6 016
0 3A
0 3A
2A
Figure 3.27: Projections of a diamond or zincblende type crystal onto its low index
surfaces.
sz). Then
by Rj~(s1,
(3.195)
The displacements 6Rjl may be divided into two classes with regard to their
effect on translation symmetry. If the latter is not affected, the displacements
are termed r-e-eluzation. In this case equivalent atoms in different primitive
unit cells are displaced in the same way, i.e.
Figure 3.28: Surface relaxation (a) and surface reconstruction (b) extending up to
the second atomic layer. A 2 x 2 reconstruction is shown in part (b).
for all
51, s2.
(3.196)
Only the vectors 61 of the basis are altered, the lattice vectors remain unchanged in this case (see Figure 3.28a). If, on the other hand, the translation
symmetry is altered, the displacements are termed reconstruction. In this
case equivalent atoms in different unit cells are not all displaced in the same
manner, i.e. bRjl(s1,s2) depends on sl,s2. Both the basis and the lattice
are changed. We first consider the changed lattice translation symmetry of
reconstructed surfaces (see Figure 3.28b).
In describing reconstruction it is useful to divide the crystal with a surface
into two slabs parallel to the surface, an upper slab containing the atomic
layers with displaced atoms, and a lower slab encompassing all other layers,
i.e. layers with non-displaced atoms. The upper slab is sometimes called
a selvedge, here we use the terms surface slab and bulk slab (or simply
bulk) for the upper and lower slabs, respectively. By Ts and Tb we denote,
respectively, the plane translation symmetry groups of the surface and bulk
slabs. The translations which transform the crystal with surface into itself
must belong to both groups of translations, Ts as well as Tb. The translation
group T of the whole crystal with surface is thus the intersection
T = Ts fl Tb
(3.197)
of Ts and Tb. Alternatively, one can say that T is the largest common
subgroup of both groups Ts and Tb. There are two possibilities, either T
only consists of the identity translation, which means that the lattices defined
by Ts and Tb are non-commensurate and the crystal with surface does not
possess any lattice translation symmetry, or T contains more elements than
just the identity, in which case one says that the two lattices derived from
Ts and Tb are commensurate. The lattice associated with the T is called the
coincidence lattice. If, in particular, Tsis a subgroup of Tb then T is equal
35 1
to T,, i.e. the coincidence lattice is identical to the lattice T, of the surface
slab. If T, is not a subgroup of Tb then T cannot be equal to T, and is
necessarily a proper subgroup of T,, i.e. it is smaller than T,. One can thus
distinguish between the following three cases with regard to the translation
symmetry of crystals having a reconstructed surface:
(i) no translation symmetry,
(ii) translation symmetry exists but is smaller than that of the surface slab,
(iii) translation symmetry exists and is the same as that of the surface slab.
+ 912f2
f; = Q2lfl + 922f2.
f: = Qllfl
(3.198)
(3.199)
f: = nfi,
f& = mf2 ,
(3.200)
with n and m being integers. The primitive unit cell of the surface slab contains n x m primitive unit cells of the bulk slab. This is said to constitute an
n x m reconstruction An n x m reconstructed crystal surface of a particular
material C parallel to a lattice plane with Miller indices (hlcl) (or ( h k i l ) in
the case of hexagonal symmetry) is characterized by the symbolic notation
C(hlc1) n x m.
(3.201)
(2) The off-diagonal elements of Q are not equal zero, i.e. fi is not parallel
to fi and/or fi is not parallel to f2. The angles L(f!, fi) and L(f&fz), which
are in general not equal to each other, are assumed to be equal in the case
under consideration. This means that the two vectors f1, f2 (with tails joined
at the same point) can be transformed into the two corresponding vectors
= L(f4, f2) = a about
an axis which is perpendicular to the surface, with a subsequent rescaling
of f1, f2 by the factors lfiI/fll and ~ f ~ ~respectively.
/ ~ f ~ ~ A, (hlcl) surface
of a particular material C reconstructed in this way is characterized by the
symbolic notation
lfil x
C(hkl) lfil
141 - a.
-
(3.202)
If21
353
A further item is worth mentioning, concerning the surface reconstruction itself. It follows from the point symmetry of the crystal with ideal
surface itself. If the latter belongs to the square crystal system, i.e. if it
has a square lattice and one of the two point symmetry groups 4 m m or 4,
the directions of the two primitive lattice vectors are symmetrically equivalent. A particular reconstruction which increases the surface unit cell in
the direction of fl by a factor n and in the direction of f 2 by a factor m , is
equivalent to another reconstruction which does the same for, respectively,
the symmetrically equivalent vectors f2 and fi (Figure 3.30). An analogous
statement holds for an ideal surface of the hexagonal crystal system, having
a hexagonal lattice and one of the point groups 6mm, 6 , 3 m or 3. In this
case, three symmetrically equivalent direction exist (Figure 3.30). If there is
no physical reason which makes one of the different symmetrically equivalent
reconstructions more likely than another, they will take place simultaneously
in different regions of the surface. The result is the formation of domains
of otherwise identical, but differently oriented, reconstructed unit cells. Due
to the domain structure, the overall translation symmetry of the surface is
destroyed. Structural imperfections of a more local nature occur where the
boundaries of such domains meet.
The point and space symmetries of relaxed or reconstructed surfaces are generally of lower degree than those of the corresponding ideal surfaces. They
do not only depend on the point and space symmetries of the irreducible
crystal slab as in the case of an ideal surface, but also on the point and
space symmetries of the relaxed or reconstructed surface slab. If the surface
slab consists of more layers than the irreducible crystal slab, then its point
and space groups can be taken to determine the point and space groups of
the whole crystal with relaxed or reconstructed surface. If the surface slab
contains fewer layers than the irreducible crystal slab it is expedient to add
further atomic layers (which then do not contain displaced atoms) to make
up the difference. The point and space group elements of the thus extended
surface slab which are simultaneously point and space group elements of
the irreducible crystal slab form, respectively, the point and space groups of
the whole crystal with relaxed or reconstructed surface. Similarly like this
was done for the translation symmetry group above, one may argue that the
point and space groups of the relaxed or reconstructed surface slab should be
subgroups of the point and space groups of the corresponding ideal surface.
If this is the case, the point and space groups of the relaxed or reconstructed
surface slab are, respectively, the point and space groups of whole crystal
with relaxed or reconstructed surface.
3.6.3
The electrons and cores of a crystal with a surface undergo the same interactions as the electrons and cores of an infinite bulk crystal, the only difference
being that the cores and electrons of the removed semi-infinite crystal above
the surface are missing, and the positions of the cores just below the surface
355
differ from those in the infinite bulk. The two basic approximations of the
theory of the interacting electron-core system of an infinite crystal, namely
the adiabatic approximation and the one-electron approximation, are not
influenced by these modifications, so they may also be used in the presence
of a surface.
In the electronic structure calculations of infinite bulk semiconductor
crystals, the core positions commonly are taken as input data. This is possible because these positions are crystal sites of high symmetry which are
well-known from X-ray diffraction experiments. For crystals with a surface,
this can no longer be assumed. The positions of atoms in the surface layers
of relaxed or reconstructed surfaces are crystal sites of lower symmetry. In
many cases, they are not known or only incompletely known from experimental investigations, for these investigations are more difficult and less precise
than X-ray diffraction in the case of bulk crystals (a short review of these
methods will be given further below). X-ray diffraction cannot be applied
to surfaces because it lacks surface sensitivity. In such circumstances, the
positions of atomic cores at surfaces are to be treated as output, rather than
input, data of the electronic structure calculations. The way that this can be
accomplished was discussed in section 2.2 in general terms, and in section
3.5 with respect to point defects. It involves, first, the calculation of the
total energy of the electron-core system for a variety of different sets of core
positions and, second, the minimization of the total energy with respect to
these sets. The minimum set gives the core positions which really apply. The
most critical part of the total energy is the energy of the electron system.
In order to obtain it, the one-electron energies of the crystal with surface
have to be calculated for assumed core positions. Formally, this involves the
same task as in the case of bulk crystals, namely, the calculation of stationary one-electron states for given core positions. Below, we will demonstrate
how this problem can be solved in the case of crystals with surfaces. The
remaining parts of the procedure for determining the atomic and electronic
structures of surfaces, the calculation of the entire total energy including the
core-core interaction energy, and the minimization of the total energy with
respect to the core positions, will not be treated here because it is mainly a
numerical task.
The assumption of a priori known core positions is valid if ideal surfaces
are considered. In this case they are the infinite bulk positions. The electronic structures of ideal surfaces are important as reference data for the
electronic structures of relaxed and reconstructed surfaces. Thus we will
also deal with them.
(3.203)
as in the case of an infinite bulk crystal (see equation 2.53). The oneelectron
potential V(x) may be written as the sum
+ +
-W
(3.205)
where q(x) is the potential of a core of species j located at R = 0. The
core potential Vc(x)and, hence, also Ve(x)and V ( x ) have the translation
symmetry of the 2D surface lattice, so that
v(x)= V(x + r) .
(3.206)
As in the 3D bulk case, this symmetry can be used to derive the Bloch
theorem.
Bloch theorem
This theorem states that the energy eigenfunctions $E(x) of the Schrodinger
equation (3.203) may be chosen simultaneously as eigenfunctions of the surface lattice translation operators t,.. This allows one to write these functions
in the form of Bloch functions @Q(x) with a 2D quasi-wavevector
4 = 91g1 + 9282,
(3.207)
where q1,92 are arbitrary real numbers, and gl and gz are the primitive
lattice vectors of the reciprocal surface lattice, defined by the relations
35 7
fi . g k = 27T6&, i, k = 1,2,
(3.208)
[fl x
f2],
g2 = N-lfl x [f2 x
fl]
(3.209)
Q,Eq(x)= -e2qx
fi
UEq(X),
(3.210)
(3.211)
with p1,pz as integers. This means that the q-vectors must belong to a
finely-meshed lattice similar to that of Figure 2.4. For the macroscopically
large values of G which we assume, q is practically continuous, although the
number of different q-values within a given region of q-space is finite.
q . g + -1 g2 = 0,
(3.212)
Consider an infinite bulk crystal, and cleave it into two semi-infinite crystals
with a surface parallel to a particular lattice plane. The spectra of energy
359
t'
tY
Figure 3.31: Surface B Z s of the 5 plane lattices: (a) oblique, (b) p-rectangular,
(c) c-rectangular, (d) square, (e) hexagonal. Symmetry lines and points are also
shown, and their notations are introduced.
eigenvalues of the two semi-infinite crystals will contain all energy levels
which were already eigenvalues of the infinite bulk crystal before cleaving.
This implies that each crystal with surface possesses an energy eigenvalue
spectrum which partially is made up from energy eigenvalues of the infinite
bulk crystals from which it is derived. The eigenfunctions of the crystal with
surface corresponding to these eigenvalues, if examined at positions outside
of the crystal, will decay exponentially with increasing distance from the
surface, while inside they will practically be the same as those of the infinite
crystal, i.e. they will exhibit undamped oscillations throughout the whole
semi-infinite crystal, as do the eigenfunctions of the infinite bulk crystal (see
Figure 3.32). Eigenstates of a crystal with surface exhibiting such properties
are called bulk states. The corresponding energy eigenvalues form bulk state
surface energy bands Ep(q). These bands can be obtained by projecting the
bulk bands E?"(k) of the infinite crystal onto the first surface B Z . Here,
'projecting' means that one assigns to a particular point q of the first surface
B Z all bulk band energies EF"(k) corresponding to k-vectors of the first
.-0
-w
C
3
surface state
9-
x-
Figure 3.32: The three different types of electron energy eigenstates of a crystal
with surface below the vacuum level (schematically).
bulk B Z having the same projection q on the surface B Z plane, but with
various components kl perpendicular to it. Formally, this can be expressed
by the relations
The bulk state surface band index p in (3.213) replaces the bulk quantum
numbers v k l . As kl varies continuously, @ also does. This means that a
particular band of the infinite bulk crystal gives rise to a continuum of surface
bands. In this way, the infinitely large number of atoms in a primitive unit
cell of the crystal with surface manifests itself.
We illustrate the projection of bulk bands onto the surface B Z using the
(100) surface of a diamond type crystal as an example. The band structure
is taken from the empty lattice model introduced in section 2.4. As a first
step, the bulk B Z is to be projected onto the plane of the first surface B Z
(see Figure 3.32). In doing so, one notes that part of the projected bulk B Z
lies in the second surface B Z . This has to be folded back to the first surface
B Z , together with its energy values. In this way we obtain the projections
of the lowest three empty lattice bands shown in Figure 3.33. In addition
to q-vectors for which all energy values are allowed, there are also q-vectors
occurring for which certain energy values are forbidden. This peculiarity is
found in other, more realistic cases as well, it represents a general feature
of the projected bulk band structure. In the forbidden energy regions states
may occur which are localized at the surface.
361
Surface states
Such states are to be expected for the same reason that localized states are
observed in the case of point or 0-dimensional perturbations. In contrast to
the latter, surfaces constitute 2-dimensional perturbations. The localization
occurs, therefore, only in 3 - 2 = 1 dimension, namely that perpendicular to
the surface. If the energy lies in the forbidden region of the projected bulk
band structure, the decay of the eigenfunction towards the bulk proceeds
exponentially (see Figure 3.32). The states are then called bound surface
states and the corresponding energy bands E,(q) are bound surface bands.
Besides these, one has surface resonances and antiresonances. The latter
occur at energies in the allowed region of the projected bulk band structure
and give rise to resonant or antiresonant surface bands Ep(q). They manifest themselves in an increase (resonance) or decrease (antiresonance) of the
density of states. The eigenfunctions at these energies are also localized at
the surface, but decay less rapidly towards the bulk (according to a power
law) than the exponentially decaying bound surface states. Antiresonances
are necessary in order to satisfy Levinsons theorem, which holds for surfaces
as well as for point perturbations. If bound surface states exist, the antiresonances must compensate the increase of the total DOS in the previously
forbidden energy region.
-u1l
-2
5n
E?
a,
C
pig1
r, -A;-?;--Z;r, -A\-x:-z;--M,--z,-
363
~r~r+-~2---~-~;-X;--~;-M2-Z;-
x,-A,-
~--A~;-&-z;-M,-zF-
x2
x1
Figure 3.35: Symmetry and degeneracy of the surface energy bands for the 5 space
groups of the p-rectangular Bravais lattice. [After Terzibaschian and Enderlein,
1986.)
(3.214)
The number of different Bloch sums is infinitely large even if a finite number
of orbitals is used per atom, because the integer 1 counting the primitive
crystal slabs, runs from 0 to -co. Consequently the Hamiltonian is given
by an infinite matrix in the atomic orbital representation &jl,.(x). Some
specific approximation is necessary to transform this to a finite matrix. One
possibility is to consider a slab of the crystal, i.e. to cut off the semi-infinite
crystal at a particular lattice plane parallel to the surface and discard the
remainder. One may say that, in the direction perpendicular to the surface,
the true semi-infinite crystal is replaced by a cluster (see Figure 3.36). The
latter has two plane surfaces, one of them being real (that of the semi-infinite
crystal), and the other one (obtained by cutting the semi-infinite crystal) not.
This differs from the cluster method in the case of point defects discussed in
section 3.5 where the whole cluster surface is artificial.
The slab method may be combined with any band structure calculational
method for infinite bulk crystals. Its combination with the tight binding
method will be illustrated below by means of an example.
We consider the ideal (111) surface of a diamond type crystal which
according to subsection 3.6.2, has a hexagonal Bravais lattice. Instead of
the one s- and three p-orbitals per atom we use the four hybrid-orbitals, i.e.
we replace a in the Bloch sum (3.214) by ht, t = 1 , 2 , 3 , 4 . Only nearest
neighbor interactions are taken into account. An illustration of this model
is given in Figure 3.37. To get the matrix elements of the Hamiltonian
H between Bloch sums, we first need these elements between the localized
hybrid orbitals +htjlT Ihtjlr). The diagonal elements are
(3.215)
365
Figure 3.36: Illustration of the slab method for surface band structure calculations.
Figure 3 . 3 7 Defect molecule model of the ideal (111) surface of a crystal with
diamond structure.
and the non-diagonal elements between different hybrids at the same atom
j = j are
(htllrlH(htJ1lr)= VI ,
# t .
(3.216)
The two types of matrix elements in (3.215) and (3.216) are equal for atoms
at the surface and in the bulk. This is not true for elements between hybrids
located at different atoms. Let j = 1,1= 0, designate the surface atom layer
and consider the elements (ht10rJH]ht20rt)between the hybrid ht located
at the surface atom 10r and the hybrid lht20x-t) at its nearest neighbor 20rt
below the surface, pointing toward (htlOr). Since no nearest neighbor hybrid
exists for the hybrid )hllOr) pointing out of the surface, all nearest neighbor
elements involving IhllOr) vanish. The other three elements with t = 2 , 3 , 4
are equal to the parameter V2 introduced in equation (2.292),
(htlOq(Hlht,lOq)= v1 ,
(htIOqlHlht20q) = etV2,
# t
t = 2,3,4 ,
(3.219)
(3.220)
d2 = -(lTT),
2
d3 =
a - -
-(lll),
2
d4 =
a(TT1).
2
(3.221)
The hybrids lht201-i) at the nearest neighbor atoms pointing td the surface
atom lor, are coupled to the other hybrids at these atoms, and these hybrids
interact with hybrids at more remote atoms. Thus an infinite matrix would
occur if we would not restrict consideration to a slab, as we in fact do.
Here, we will go one step further and neglect all couplings between the
hybrids at the nearest neighbor atoms. Then the 7 Bloch sums IhtlOq),
t = 1,2,3,4, and (ht20q),t = 2 , 3 , 4 , are completely decoupled from the rest
of the Bloch sums (see Figure 3.67). In this way we arrive at a simplified
model of the crystal with surface which effectively reduces it to the two first
atomic layers and treats the atoms of the second layer only in an approximate
way. This model represents an analog of the defect molecule model in the
case of a point perturbation. If the basis functions are arranged in the
order IhllOq), IhzlOq), h l o q ) , h l o q ) , lh220q), lh320q), lh420q), then the
Hamiltonian matrix is
(3.222)
The eigenvalues of this matrix are plotted in Figure 3.38 for different symmetry lines of the first surface B Z of the hexagonal lattice of Figure 3.31.
367
W avevec tor
Figure 3.38: Band structure of the ideal (111) surface of a diamond crystal within
the defect molecule model. The TB parameters are V1 = 2.13 e V , V2 = 6.98 e V .
The hybrid energy Eh is used as the energy origin.
The three lowest and three highest bands are bulk state surface bands. They
arise from the three hybrids of the surface atom, which bind this atom back
to its nearest neighbors in the second layer. More strictly, they correspond
to the 3 back bonding and antibonding states. The band in the gap is due
to the dangling hybrid of the surface atom. It forms a bound surface band.
If we were to consider more than 2 atomic layers, the region where the bulk
bands in Figure 3.38 occur would be covered by more bulk state surface
bands, while the bound surface band in the gap would hardly change.
Supercell method
The slab considered in the preceding subsection may be repeated periodically in the direction perpendicular to the surface, simultaneously inserting
several layers of vacancies between two neighboring slabs (see Figure 3.39).
This structure may be considered to be an infinite repetition of the original crystal with surface, each repetition being approximated by a finite slab
of several atomic layers embedded between vacancy layers. This arrangement represents a 3 D supercrystal composed of 1D supercells (bear in mind
that in the case of point defects in section 3.5, we similarly considered a
3D supercrystal composed of 3 D supercells). The band structure of the supercrystal is approximately the same as that of the original crystal with
surface, provided the vacancy slabs are thick enough to suppress coupling
between neighboring crystal slabs, and the latter slabs are thick enough to
simulate a semi-infinite crystal. The bands Ep(q) of the crystal with surface
are obtained from the bulk bands EF(k) of the supercrystal by plotting the
latter on the surface B Z . In this, k l may be chosen arbitrarily because
any dispersion of EF(k) = E F ( q , k l ) with respect to k l would indicate a
coupling between the slabs, which has been excluded. The band structure
of the supercrystal may be obtained by means of any 3 D band structure
calculation method without modification. This makes the supercell method
particularly appealing. Combined with the pseudopotential method, as well
as the local density functional or quasi-particle approximations, it represents
the most important calculational method for electronic and atomic structure
determinations of surfaces.
Defect model and Greens function method
A crystal with surface may also be viewed as a 3 D crystal with a 2D perturbation. The significance of this characterization is the following: Consider
first an ideal infinite 3 D crystal, then remove some number of neighboring
atomic layers parallel to the surface under consideration or, equivalently,
369
Figure 3.40: Illustration of the defect method for calculating surface energy band
structure.
create the same number of vacancy layers (see Figure 3.40). What remains
are two identical semi-infinite crystals which are only displaced with respect
to each other. They do not interact provided the number of vacancy layers
is large enough, which we will assume. The electronic structures of the two
semi-infinite crystals coincide, and are identical with that of the considered
crystal with surface. As in the case of a single vacancy, the states bound at
the defect may be obtained by means of the Greens function G o ( E )of the
unperturbed 3D crystal, more strictly, by the vanishing of the determinant
of [Go(E)V - 11 (see equation (3.134)). Here, the perturbation potential
V(x) represents the difference of the potential energy of an electron in the
perturbed and unperturbed crystals. It is the negative of the sum of the
potentials of the removed atoms or the sum of all vacancy potentials. It has
the 2D lattice symmetry of the surface so that
~ ( x=) V(X
+ r).
(3.223)
If only nearest neighbor interactions are taken into account, a single layer
of vacancies is enough to decouple the two semi-infinite crystals. For the
analysis of the bound state condition (3.74), one has to use a particular basis
set, just as in the case of a point perturbation. Wannier functions are again
a possible choice, but here they involve localization only in one direction,
namely that perpendicular to the surface. We denote these functions by
IvqRl) where RI is the component of a 3D lattice vector R perpendicular
to the surface, such that
(3.224)
The matrix representation of the Greens function Go(E) with respect to
this basis set is
For the perturbation potential V1(x) we take that of a single layer of vacancies located at RI = 0. Then the matrix representation of V1(x) is
with
(3.228)
According to equation (3.134), the determinant of the matrix (3.227) must
vanish for energy eigenvalues E in the gap of the ideal crystal, i.e.
Det [G:(E, q)VL,,(q) - 6,,,]
= 0.
(3.229)
If one compares this equation with the corresponding relation (3.94) for deep
centers, it will be noted that the q-dependence which occurs here was absent
there. This dependence in the case of surfaces causes the eigenvalues in the
gap to form deep bands rather than deep levels (which was the case for
point perturbat ions).
Transfer matrix method
Finally, still another method for surface band structure calculations should
be mentioned. It is based on the transfer matrix concept of quantum mechanics. The transfer matrix M ( E ) is formed from solutions of the Schrodinger
equation upon one unit cell of the bulk crystal for particular boundary conditions in the direction perpendicular to the surface. The energy E is arbitrary
first of all. Transferring the wave function from the surface unit cell to the
371
n - th unit cell below the surface can be done by applying the n - t h power
W ( E ) of M ( E ) . Bound surface states are obtained for such energy values
E for which M n ( E ) decays exponentially with n. The practical use of this
3.6.4
In this subsection we will deal with the atomic and electronic structures of
some important semiconductor surfaces, including the various reconstruction
states of the Si (111) surface, the Si (100) surface, the (110) surface of GaAs
and other 111-V compound semiconductors as well as the (111) and (100)
surfaces of GaAs. It is advisable to treat the atomic structure together with
the electronic structure because of the close relation between the two kinds
of structures, as was pointed out earlier. We begin with a short introduction
to the experimental methods of surface structure analysis, again referring to
both the atomic and electronic aspects.
Experimental methods for surface s t r u c t u r e analysis
Experimental methods for determining the atomic structure of bulk crystals
are all based on the interaction of waves with the atomic cores and valence
electrons of the crystal. If the wavelength is of the order of the distance
between the atoms, i.e. of the order of magnitude of 1 A, the crystal constitutes a 3D diffraction lattice and diffraction maxima will occur in prescribed
directions in space. The crystal structure may be determined from the positions and intensities of these maxima. The various experimental methods
differ primarily in the nature of the waves employed. X-rays are by far the
most important for determining the structure of bulk crystals. A wavelength
in the region of 1 corresponds to a photon energy in the range of 10 k e V .
Electron and neutron waves are also diffracted by bulk crystals. Electron
energies in the region of 100 eV and neutron energies of 0.1 e V are required
for wavelengths in the d region. Since they are neutral particles, X-ray photons and neutrons interact only relatively weakly with the crystal. They can
pass through crystals of macroscopic thickness and be backscattered from
them from macroscopic depths within them. X-rays and neutrons thus yield
information on all atomic layers of a crystal, including those at the surface.
Since the number of surface layers is extremely small in comparison to the
total number of layers, the diffraction patterns are dominated by the bulk
of the crystals.
The interaction of electrons with the atomic cores and the valence electrons of a crystal is significantly stronger than that of photons and neutrons.
Electrons having energy less than 100 k e V can not pass through a crystal of
lkzl .
(3,231)
A solution of the two equations (3.230), (3.231) always exists for given vectors ki and g (this is in remarkable contrast to coherent scattering of electrons from 3D bulk crystal, which can only occur if k, lies on a Bragg reflection plane). The solution of equations (3.230) and (3.231) can be readily
carried out using the construction shown in Figure 3.41. The points at which
the vertical lines passing through the reciprocal lattice points g intersect the
Figure 3.41:
LEED maxima.
373
Construction of
sphere Jk,]= lkzl, determine the directions in which diffraction maxima occur. There is exactly one maximum for each reciprocal lattice point g. The
reciprocal surface lattice can thus be read immediately from the distribution
of the diffraction maxima on the registration screen. The direct surface lattice is the reciprocal of the reciprocal surface lattice. Some typical LEED
images are shown in Figure 3.42. The bright points correspond to the reciprocal lattice of the ideal surface, and the less bright points to the finer
reciprocal lattice of the reconstructed surface. In this way it is relatively
easy to determine the surface lattice by means of LEED. To obtain the actual positions of atoms is more diacult. One needs additional experimental
and theoretical information about the intensity of the diffraction maxima as
a function of the energy of the incident electrons (dynamical LEED).
Besides LEED, there are other methods for surface structure analysis
which, although they are not a substitute for LEED, can supplement it.
These methods include diffraction of energetic electrons in the region of some
10 k e V , known as Re5ection High Energy Electron Diffraction ( M E E D ) ,
diffraction of X-rays incident almost parallel to the surface, and diffraction of slow Helium atoms (of M I00 meV). Scattering of energetic ions
( M 1 M e V ) is used in techniques like Rutherford backscattering (RBS) and
ion channeling. Imaging procedures of significance are transmission electron microscopy (TEM), scanning tunneling microscopy (STM) and atomic
force microscopy (AFM). The latter two methods have become particularly
important.
Like for atomic structure determinations, a variety of methods exist to
study the electronic structure of surfaces, in particular the bound surface
states in the energy gap of the bulk crystal. The most powerful and universal method is photoemission spectroscopy (PES). This method relies on
the external photoeffect in which an electron is emitted from the crystal by
x6
Figure 3.42: LEED pictures of six differently prepared GaAs (100) surfaces.(After
Drathen, Ranke and Jacobi, 1978.)
11101
375
hi01
Figure 3.43: Geometry of the ideal Si (111) surface (left) and of the Si (111) 2 x 1
surface according to the buckling model (right).
pied surface states one may use inverse PES in which electrons captured
by such states emit photons. Beside photoemission, a variety of other techniques exists which can provide data on surface states. In principle, any
experimental technique which probes the electronic structure of bulk crystals can be employed for surface electronic structure investigations, provided
it can be made surface-sensitive. This applies to optical reflectivity, electrical transport, photoconductivity, and capacity measurements, as well as
electron energy loss spectroscopy (EELS). Experimental techniques like field
effect measurements fulfill this requirement from the very beginning. Controlling the energy of tunneling electrons in scanning tunneling microscopy,
surface states can be resolved spatially and energetically (scanning tunneling
spectroscopy).
Experimental techniques which primarily measure the electronic structure, can also provide data on the atomic structure. The solid state shifts
of core levels (see section 2.1) are an example. These shifts differ for atoms
in the bulk and at the surface because of the altered atomic structure at
the surface. The difference (typically some tenths of an e v ) can be measured by means of PES and UPS. On the other hand, they can be calculated
on the basis of a particular surface structure model. By comparing theory
and experiment one can evaluate the feasibility of various models of surface
structure.
The calculation of the total energy is a purely theoretical test of the
validity of a particular surface structure model, and it may be used to determine the parameters which can be varied in such a model. If the model has
optimized parameters and results in a lower total energy than other models
it may be given preference over them.
I '
face.
t i
*!:!
-5 - *
..
..
..
,-I I
!ii
::I
.....
.*....
I...
ll
iii
r
r
2D wave vector
Si surfaces
(111) surface
The geometry of the ideal (111) surface of diamond type crystals is illustrated in Figure 3.43 (left). The surface lattice is hexagonal, and the two
primitive lattice vectors are fi, f2 of Table 3.8. There is one surface atom per
primitive unit cell, and one dangling bond per surface atom. The (111) surface is the cleavage plane of diamond type crystals. By cleaving a Si crystal
in UHV at room temperature, one obtains a 2 x 1 reconstructed (111) surface. After annealing it at 500 C , a 7 x 7 reconstruction state evolves, which
remains stable at room temperature. The occurrence of a 2 x 1 reconstruction immediately after cleavage is to be expected if one examines the band
structure of the ideal (111) surface (see Figure 3.44) and considers, in particular, the electron occupancy of the bound surface band in the fundamental
gap. This band arises from dangling sp3-hybrids of surface atoms, and can
host 2 electrons per surface unit cell. Since 3 of the 4 valence electrons of
a surface atom are in bonds with second-layer atoms, only 1 electron per
surface unit cell is left for the bound surface band. Thus this band remains
only half-filled. The ideal Si (111) surface is metallic.
377
r?
'P
r-
[I101
[1?01
Side view
a)
b)
Figure 3.45: a-bonded chain model of the Si (111) 2 x 1 surface (After Pandey,
1982). Part (a) shows the unreconstructed surface in top and side views. The top
view in the second row has been rotated with respect to the top view in the first
row in order t o allow for the side view below. Part (b) shows the same views of the
surface as in part (a), but after reconstruction has taken place.
379
raised above. This implies the creation of an electric dipole which is too
costly in energy to actually take place. Using the terminology of section
2.2 we may say that correlation effects of electron electron interaction, more
strictly speaking, the configuration dependence of oneelectron states, prevents the buckled Si (111) surface to be lower in energy than the ideal one.
The reconstruction model which actually applies to the Si (111) surface
is the so-called s-bonded chain model, illustrated in Figure 3.45. In this
model, second layer atoms in rows parallel to fi f2, i.e. along the [lo3
direction in Figure 3.45a (including atom number 2) are raised into the first
layer as shown in Figure 3.45b, breaking their bonds with atoms in the third
layer (for example, the 2-5 bond). The dangling bond of the new surface
atom (say atom 2) is used to establish bonds with atoms of the first layer
(the 2-1 bond in this case). These can only be s-bonds (indicated by double
lines in Figure 3.45b) because the dangling bonds are perpendicular to the
surface. In this way s-bonded chains occur along the [ l O q direction (Pandey,
1982). The dangling bonds left at the third layer atoms (for example, atom
5) are saturated by hybrids of the first layer atoms which have been lowered
down to the second layer (for example, atom 3). The surface is in fact 2 x 1
reconstructed. This may be seen by taking the primitive lattice vectors of
the ideal surface to be f 1 + f 2 and -f2. Doubling -f2 yields the rectangular
lattice indicated in Figure 3.45b by dashed lines. Its primitive lattice vectors
are f1 + f 2 and - 2 f 2 + (fi + f 2 ) = f 1 - f 2 so that the short side of the rectangle
is parallel to the chain direction [lOT], and the long side perpendicular to it
(parallel to [121]).A peculiarity of the n-bonded chain model is that it has
a different bonding topology in comparison with the ideal (111) surface and
also with respect to the buckling model. While the latter exhibit rings with
6 mutually bonded atoms (see Figure 3.45a), the former shows alternating
rings with 5 and 7 bonded atoms (Figure 3.45b). This is due to the fact that
bonds existing at the ideal and buckled surfaces are broken and new bonds
are established in the s-bonded chain model.
The total energy of this model is clearly below that of the ideal surface
(about 0.5 eV per surface atom). Thus it represents a good candidate for
the reconstruction of the (111) Si surface. Further evidence is provided by
ARUPS and optical measurements. Figure 3.46 shows the wavevector dispersion of the two bound surface bands as obtained from ARUPS measurements together with the calculated dispersion of these bands. The agreement
is quite satisfying. The strong dispersion of the bound surface band on the
r-X-line and the weak dispersion on the X-M-line of the rectangular surface B Z is easily understandable: the long r-X-side of the rectangular unit
cell in q-space corresponds to the short side of the rectangular unit cell in
coordinate space, which is also the direction of the s-bonded chains. One
expects strong dispersion along the chains and weak for the perpendicular
direction, exactly what is seen in Figure 3.46.
t2
-2>
W
3 1
0
Q,
Yl
g o
-1
nf
20 wave vector
93
0.A
0,s
Energy (ev)
$5
38 1
The (100) surface is the surface of choice for electronic device applications
of Si. The geometry of the ideal (100) surface is shown in Figure 3.51. The
383
a1
Top view
I--
(100)
0'- -0
It-
-?
0
U
10113
Side view
a)
b)
Figure 3.51: Geometry of the ideal Si (100) surface (a), and of the asymmetric
dimer model of this surface.
asymmetric dimer model.
Although the 2 x 1 reconstruction is the most common superstructure of
the Si (100) surface, other reconstructions are also observed, for example,
2 x 2, c - 2 x 2, c - 4 x 2. Most of these structures may be traced back to
asymmetric dimers as building blocks.
The (110) surface represents the cleavage plane of zincblende type crystals.
The best understood (110) surface of all zincblende type 111-V semiconductors is the (110) surface of GaAs. Its surface geometry is shown in Figure
Top view
385
Ga o
(110)
As
I
?
I
I
I
5
0
Y
I
I
I
I
I
I
I
I
[?I0 1
Side view
0
c7
U
[?I01
a)
b)
Figure 3.52: Geometry of the ideal GaAs (110) surface (a), and of the same surface
after relaxation (b).
3.52. The surface lattice is p-rectangular. The primitive lattice vectors are
given in Table 3.8. There are two surface atoms in a primitive unit cell, one
Ga and one As atom. Two bonds of each surface atom lie within the surface,
one is directed back and one is dangling. The two dangling hybrids per unit
cell have different energies since they belong to either a Ga- or an As atom.
Thus one expects two bound surface bands, one Ga-like and one As-like.
These are in fact seen in the band structure of the ideal (110) GaAs surface
depicted in Figure 3.53. The lower As-like bound surface band is completely
occupied, and the higher Ga-like band is completely empty. Thus the ideal
surface is semiconducting. Nevertheless, it does not yet represent the stable
state, as the gap between the two bound surface states is too small. It can
be enlarged by moving the As atom above the surface, rendering its dangling
hybrid more s-like and lowering its energy, and moving the Ga atom below
Ga As
(111)
(1001
(1101
-10
r r x '
r r
2 D wave vector
Figure 3.53: Band structure of the ideal GaAs (110) surface. (After Talwar and
Ting, 1992.)
the surface, rendering its dangling hybrid plike and raising its energy (see
Figure 3.52). These displacements do not change the translation symmetry
of the surface because the two atoms belong to the same unit cell. One
therefore has a relaxation instead of a reconstruction of the (110) surface.
This is consistent with LEED measurements, which do not show spots other
than those due to the ideal prectangular lattice. The experimental value
for the rotation angle w of a Ga-As bond with respect to the [ l i O ] direction
is close to 30, in good agreement with structure calculations. Experiment
and theory also agree in regard to an essential feature of the band structure
of the relaxed (110) surface. As indicated in Figure 3.54, relaxation moves
the bound surface bands completely out of the fundamental gap, the As-like
band merges into the valence band, and the Ga-like band merges into the
conduction band. This implies that perturbations of the surface such as,
for example, coverage by an insulator, can easily create surface states in the
gap. Such states are in fact present at GaAs/insulator interfaces. They pin
the Fermi level and preclude the possibility of making GaAs-based field effect transistors in the same way as the Si-based MISFET (see Chapter 7 for
further discussion).
Relaxations similar to that of the (110) surface of GaAs are also observed
at the (110) surfaces of other 111-Vcompounds. For all materials except Gap,
relaxation moves the bound surface states out of the gap (see Figure 3.54).
387
(110)cleavage face
GaP GaAs GaSb
-3-
-0
5
In P
In As
In Sb
-4-
9 -5-
3 -6x
e
E -7-
Figure 3.54: Energy level diagrams of bound surface states and bulk states of various 111-Vcompound semiconductors below the vacuum level. Solid lines represent
experimental results, and dashed regions represent dangling bond surface bands.
F'ramed undashed regions indicate bulk bands. (After Bertoni, Bisi, Calandra, and
Manghi, 1978.)
Other surfaces
Besides the (110) cleavage surface, investigations have mainly been focused
on the low index (111)and (100) surfaces of GaAs. (100) is the preferred surface orientation of GaAs wafers used in device fabrication (partially because
the (011) plane perpendicular to this surface represents the cleavage plane
of GaAs). The geometrical structures of the ideal surfaces are the same as
those of the corresponding Si surfaces shown, respectively, in Figures 3.43
and 3.51. The (111)and (100) surfaces of GaAs differ from the (110) surface
of GaAs mainly in regard to the fact that two different surface terminations
are possible in their case, one by Ga atoms and another by As atoms. One
says that these surfaces are polar, in contrast to the (110) surface, which is
said to be non-polar. In forming a polar surface, an electric dipole is created
between a Ga-layer and an As-layer which is costly in energy. This explains
why for GaAs and other 111-V compound semiconductors, the cleavage plane
is neither ( l l l ) , like in the case of group-IV materials, nor (loo), but the
non-polar (110) plane. In the latter case the 1:l ratio of Ga- and As atoms
is strictly k e d by chemical stoichiometry. For the polar surfaces, there are,
however, no stoichiometrical reasons which cause a surface termination by
3.7
Semiconductor microstructures
3.7.1
Heterojunctions
389
Material 1
Material
P
a,
L.
Besides bulk states, there may be stationary states with energy eigenvalues which are allowed in none of the two materials. The wavefunctions of
such states are localized at the interface between the two materials, just like
the bound surface states in the case of clean surfaces. They are called bound
interjuce states. In analogy to the clean surface case, interface resonances
391
may also occur, their energy levels lie in one of the bands and their wave
functions are weakly localized at the interface. If the two materials forming
a heterojunction are composed of chemically similar atoms as in the case
of GaAs and AlAs and in many other cases of practical importance, then
no bound interface states will be possible, because the perturbation at the
interface is too weak.
(3.232)
The difference AE, zz [E,1 - E,2] between the two valence band edges
is called valence band discontinuity or valence band offset of the material
l/material 2 heterojunction. By means of equation (3.232) the valence
band discontinuity AE, is expressed as
(3.233)
Beside the pure bulk contribution [E$ - E&], the band discontinuity also
contains the dipole contribution -e[cpi-pg]. The latter generally depends on
the properties of the interface. If one were to ignore this dipole contribution,
then AE, would obey the so-called transitivity rule which states that the
AE,-values for a succession of two heterojunctions 1/2 and 213 should
add up to the AE,-value of the combination 1/3. In practice this rule is
rarely fulfilled, which points up the importance of the dipole contribution to
A E,.
The dipole contribution -e[cpl - cpz] may be estimated by means of the
tight binding method developed in section 2.6. The bonding energy level
b considered there forms a rough measure for the average valence band
energy. In formula (2.315) b is given for a zincblende type semiconductor.
The corresponding bonding orbitals IbtR) of equation (2.316) describes the
charge transfer from the cation c (there denoted by an upper index 1)to the
anion a (there denoted by an upper index 2) in terms of the hybrid energy
difference EL - 2. If the hybrid energies were equal, no charge transfer
would occur. A semiconductor material i composed of cations ci and anions
ai, forms a big molecule of the average hybrid energy 2
; = (2 &/2.
Thus the charge transfer between material 1 and material 2 is governed by
the average hybrid energy difference S i - 2;. If this difference is non-zero,
valence electron charge will be transferred into the material with the lower
value of 2;. As pointed out above, the charge transfer will result in an
electrostatic potential difference at the interface. The potentials on either
side of the interface will add to the average hybrid energies and diminish their
difference. Charge will flow until the average hybrid energy difference has
been equilibrated by the potential jump. This is described by the relation
-1
ch - ecpi = F;
ecp2,
(3.234)
393
i.e. the average hybrid energy levels on the two sides of the heterojunction
align, and the potential contribution -e[cpl- 1,721 to the valence band offset
matches IS; - T i ] .
At this point it is advisable to recall a result concerning deep levels of
transition metal (TM) atoms obtained in section 3.5. There, it was shown
that the difference between the position E+M of a deep level of a particular
TM atom in a semiconductor material 2, and the position E&M of the same
deep level in a semiconductor material 1, equals the average hybrid energy
difference between the two materials,
- E&M = 7; - Sk.
ETM
2
(3.235)
Taking the deep level positions to be measured with respect to the valence
band edges E& of the two infinite bulk materials, we denote these positions
by E ) ~ , i = 1 , 2 , so that
E
(3.236)
=
~ E k M - EVi.
0
[ETM
- 61 ~
(3.237)
This relation reduces the experimental determination of valence band discontinuities to measurements of deep level positions of TM atoms with respect to
the valence band edge. Other experimental methods rely on measurements
of photoemission, optical or transport properties of heterojunctions. Experimental values for valence band discontinuities of several heterojunctions
are shown in Table 3.10. Often, different values are obtained by different
methods or even by different authors using the same method. This points
up the experimental difficulties in determining valence band discontinuities,
and also to the dependence of the discontinuities on the preparation of the
heterojunctions. In order to obtain theoretical values for valence band discontinuities, electronic structure calculations are required with an accuracy
of less than 0.1 eV. Such an accuracy is difficult to achieve so that, in many
cases, calculated valence band discontinuities have also considerable uncertainty. To this day, a great deal of effort is devoted to the task of obtaining
more reIiable experimental and theoretical data for AE,, even in the case of
the most thoroughly explored heterojunction, that being between GaAs and
(Ga,Al)As alloys.
O t h e r band discontinuities
Knowing the lineup of the valence band edge at I? for a particular heterojunction, the lineups of all other energy levels at I' and off I' can be determined from the known bulk band structures of the two materials. Below we
Junction
Ge/GaAs
InAs/GaSb
GaAs/ZnSe
HgTe/CdTe
InP/CdS
AE,(eV)
0.54
-0.46
0.96
0.02
1.63
AE,(eV)
0.23
0.81
0.29
1.41
-0.39
AEg(eV)
0.77
0.35
1.25
1.43
1.24
+ E ~ I , Ec2 = E v 2 + Eg2,
(3.238)
where E,l,2 are the gap energies of the two materials. For the conduction
band discontinuity AE, = E,2 - E,1 of the material l/material 2 heterojunction, it follows that
AE,
Ec2 - Ecl =
-AEv
+ AEg,
(3.239)
where AE, = E,2 - E,1 is the discontinuity of the energy gap. If AE,
and AE, are known, AE, can be calculated by means of relation (3.239).
The gap discontinuity AE, follows from the gap energies Egl,Eg2 of the two
infinite bulk materials. The lineup thus obtained for the conduction band
edge of a GaAs/Gal_,Al,As heterojunction with x < 0.42 is schematically
plotted in Figure 3.58a, together with the lineup of the valence band edge.
Often, the two band offsets AE, and AEc are expressed in percent of the
gap discontinuity AE, by means of the ratios Qv = AE,/AE, and Q c =
AE,/AE,. Because of AE, A E , = AE,, one has Q , Qc = 1.
In the second case, i.e. for GaAs/AlAs or GaAs/Gal-,Al,As
heterojunctions with alloys having an indirect gap (z > 0.42), the conduction band
395
2.0
1.6
-.+ ---.-
Fe(1+/2+)
0.4
Composition x
Ec2
X
r
tc2
Ev2
Ev2
Figure 3.58: Conduction and valence band lineups for a GaAs/Gal-,Al,As heterojunction. In part (a) ( x < 0.42) the alloy gap is direct, and in part (b) (x > 0.4!2),
the alloy gap is indirect. In the latter case, the r and X levels forming the conduction band edges in, respectively, GaAs and Gal-,AI,As are also plotted in the
respective other material where they do not form the conduction band edge.
Consider two direct gap semiconductors with the valence and conduction
band edge at I?. There are several qualitatively different possibilities for the
lineup of the two band edges E,i and Eci, i = 1 , 2 (see Figure 3.59). The
conduction band edge of one material, say of materiai 1, may lie below the
conduction band edge of material 2, while simultaneously the valence band
edge of material 1 may lie above that of material 2. This case is referred
to as heterojunction of type I. In this case the gap of material 1 is located
entirely within the gap of material 2. If both the conduction and valence
band edges of a particular material, say, again 1, are, respectively, below the
two edges of material 2, one has a heterojunction of type II. The staggered
type II heterojunction occurs when we have Ec2 < E,1, and also E,1 < Ec2,
E,1 < Eva holds, and the misaligned case applies if Ec2 > E,1. In the
staggered case the heterojunction still has a gap, while in the misaligned
case the conduction band of material 1 overlaps with the valence band of
material 2 so that the gap of the heterojunction disappears. Sometimes, type
ZZI heterojunctions are defined. These are heterojunctions of type I with zero
energy gap in material 1. A look at Table 3.10 shows that the heterojunction
Ge/GaAs is of type I, CdS/InP of type 11- staggered, InAs/GaSb of type
11-misaligned, and HgTe/CdTe is of type 111.
3.7.2
Heterojunctions cannot be fabricated by simply putting together two separately made semiconductor samples with plane surfaces. If one would proceed in such a way the result would be a completely rough, polluted interface,
incapable of hosting electron and hole states which extend upon both materials, a property which has been assumed above and which turns out to be
crucial for the electronic behavior of a heterojunction. In actual practice,
the second material must be placed on top of a crystal made from the first by
continuing the crystal growth, a process referred to as epitaxial growth. In
397
Type
staaaered
+
>r
P
W
EC2
Ev2
tvi
misaligned
A
15
0
material 1
material 2
2
material 1
material 2
Epitaxial growth
LIQUID NITROGEN
COOLED SHROUDS
FLUORESCENT
SCREEN
399
RHEED GUN
MAIN SHUTTER
TO VARIABLE
SPEED MOTOR
ANDSUBSTRATE
HEATER SUPPLY
Semiconductor microstructures
40 1
a)
d)
Type II-staggered
Type
II - misaligned
P
Q,
5
c
Figure 3.62: Double heterostructure of type I acting as a quantum well both for
electrons and holes (a), and of type I1 acting as a quantum well for electrons and
a barrier for holes (b, c). In the misaligned case of type I1 (c) an energy region
occurs where electrons and holes may coexist. Parts (d), (e) and (f) of the figure
show the correponding superlattices giving rise to minibands on top of the bulk
band structure. Further discussion is given in the main text.
perlattices of the staggered type I1 shown in Figure 3.62e; solely the gap
becomes indirect in coordinate space in this case. For misaligned type I1
superlattices shown in part f of Figure 3.62, Bloch states are formed from
the electron and hole states of the respective wells.
It is not surprising that the layer thicknesses must lie in the nanometer
range for the confinement and tunneling effects discussed above to occur:
1 nm = 10 A is close to the distance between nearest neighbor atoms in a
natural crystal (in GaAs the nearest neighbor distance is about 2.5 A). Heterostructures with such thin layers possess, so to speak, an artificial atomic
superstructure which is likely to result in a modified electronic structure.
3.5
403
_ .-
MnSe
3.0
h
2
g
2.0
>r
1.5
1.0
2.5
P
w
0.5
0.0
5.4
5.6
5.8
6.0
6.2
Lattice constant (
6.4
6.6
LI
Figure 3.63: Energy gap versus lattice constant for diamond and zincblende type
semiconductors. Full lines represent alloys with direct gaps, and dashed lines represent alloys with indirect gaps.
their lattice constants. Such a diagram is shown in Figure 3.63 for elemental
and compound semiconductors of diamond and zincblende structure. Lines
connecting two different materials indicate the gap energies of alloys made of
these materials. The composition of an alloy is related to its lattice constant
by means of Vegard's rule which linearly interpolates between the lattice constants of the two alloy components. For two direct gap materials 1 and 2, the
gap difference AE, provides some hints about their band edge discontinuities. If the gap discontinuity AE, vanishes, then band edge discontinuities
may, but need not, occur. If a gap discontinuity does exist, then at least one
of the two band edges must also exhibit a discontinuity. In general, both the
valence and conduction band discontinuities AE, and AE, differ from zero
and their magnitudes reflect the gap discontinuity to a certain extent.
Lattice mismatch
As the materials shown in Figure 3.63 are all of zincblende type, their main
2 4 4
= cll(i)zz(i)
+ c12(i)[err(i) + yy(i)],
(3.240)
with cll(i) and clz(i) being the elastic stiffness constants of the cubic layer
of material i. Because m t z ( i ) = 0, it follows from (3.240) that
405
(3.241)
For the parallel strain components e z Z ( i ) and e y 3 / ( i ) one has
4 2 ) - &(1) = tyy(2)- E v y ( l ) = t,
-a2
. - ai
ai
(3.242)
The total strain E defined in equation (3.242) equals the lattice misfit f
divided by 100%. Using the stress and strain components, the total elastic
energy Eda of the two layers may be calculated, with the result
(3.243)
where
ci =
[Cll(i)
+ (1 - Ki)Cl2(i)l.
(3.244)
(3.245)
According to these relations, the thin layer is more heavily strained than
the thick one, provided the elastic constants of the two layer materials are
comparable. This means, in particular, that the substrate will be almost
unstrained and the epitaxial layer will accommodate almost the whole misfit
strain. This was anticipated in the discussion above.
Another general point to be mentioned in the context of lattice mismatched
heterostructures concerns the effect of strain on the band structure. From
theoretical considerations and experimental studies it is well known that such
effects can be quite large (Bir, Pikus, 1974). To give an example, we consider
a Si layer on top of a Ge substrate. The strain components in the Si layer
are e X x = cyy = 0.04, corresponding to a lattice misfit of 4% (see below), and
e Z z = -0.03, using the elastic stiffness constants c11 = 16.1 x 1011 dyn/crn2
and c12 = 6.4 x 1011 dynlcm of Si. The degenerate heavy-light hole valence
band maximum of Si splits by 0.31 el/ under this strain. To produce the
same splitting by means of uniaxial strain applied from outside, a pressure of
73 Kbar would be necessary. This example shows that considerable changes
of the band structure are to be expected because of the lattice mismatch
strain in heterostructures. In this context, energy levels in different materials
or at different points of the first B Z can shift in dserent ways. One may take
advantage of this to adjust the band discontinuities of a heterostructure to
specified conditions. The strain becomes, so to speak, an additional degree
of freedom for tailoring the properties of a heterostructure.
The Inl-,Ga,As/GaSbl-yAsy
material system forms heterostructures of
type 11. The two band edges are lower in Inl-,Ga,As than in GaSbl-,Asy
for all compositions x and y. For small values of x and y, especially for
InAs/GaSb with x = y = 0, one has a misaligned type I1 heterostructure.
The conduction band edge of InAs is 0.14eV below the valence band edge
of GaSb. Superlattices based on InAsfGaSb heterostructures have been
the subject of intensive basic research. Under certain conditions, these SLs
exhibit metallic behavior. For Inl-,Ga,As/GaSbl-yAsY heterostructures
with larger x and y, including the trivial GaAs/GaAs homostructure, one
has staggered type I1 heterostructures, for which the conduction band edges
are above the valence band edges in both materials. Perfect lattice matching
is achieved if y = 0 . 9 1 8 ~ 0.082.
Si and (Si,Ge)
The lattice constants of Si and Ge are, respectively, 5.43 A and 5.65 A. This
gives the lattice mismatch of 4 %, which has already been used above. The
critical thickness d, amounts to about 30 A which is rather small. Therefore,
one also considers heterostructures between Si as substrate and Sil-,Ge,
alloys with low Ge content (x < 0.5) as epitaxial layers. In this case d, can
be 100 A or larger depending on the actual value of x. (Si, Ge) alloys may
also be used as substrates. Consider, for example, a Sil-,Gey substrate on
which a Sil-,Ge, alloy layer is grown first, followed by a pure Si layer. Both
layers are strained in this case. For 0 < y < x, the strain in the alloy layer
407
HgTe and CdTe are lattice matched because the lattice misfit is smaller than
0.4 %. Being a zero gap material, HgTe forms a type I11 heterostructure with
CdTe. The valence band offset turns out to be rather small ( M 0.02 e v ) . In
Hgl-,Cd,Te alloys the gap opens at 2 = 0.15. For 0.15 < 2 < 1 the alloys
are direct gap materials with the valence and conduction band edges at.'I
In this range, Hgl-,Cd,Te/CdTe forms type I heterostructures which, for
sufficiently small 2 , can be applied in infrared detectors.
(Pb,Sn)Te/PbTe
The lattice misfit between PbTe and SnTe is about 2 %, so that strain effects
in Pbl-,Sn,Te/PbTe heterostructures are not negligible. Pbl-,Sn,Te alloys
are direct gap materials with the conduction and valence band edges on the
first B Z boundary at L. For PbTe, the L$ state forms the valence band
edge, and the Lg-state forms the conduction band edge (see Figure 2.35).
For SnTe, this level ordering is inverted. Thus, the gap of Pbl-,Sn,Te alloys
must go through zero for some composition TO. At 77 K , one has zo M 0.4.
Thus, Pbl-, Sn,Te/PbTe heterostructures are of type I for z < 20,of type
I11 for z = zo, and again of type I for 2 > 2 0 , however, with inverted band
edges of the well material in this case. Depending on strain, type I1 situations
also seem to be possible. Microstructures based on Pbl-, Sn,Te/PbTe have
a potential for applications in infrared laser diodes.
Doping microstructures
nipi-structures
Semiconductor samples with alternating n- and p-type doped layers may exhibit properties similar to those of superlattices composed of two different
materials (Esaki, Tsu, 1970; Dohler, 1972). Such doping superlattices may
be thought of as periodic arrays of pn- and np-junctions with alternating negatively and positively charged intrinsic (i) regions between the neutral n- and
p-layers. In this context they are sometimes referred to as nipi - structures.
The oscillating space charge distribution gives rise to an oscillating electrostatic potential which modulates the valence and conduction band edges as
shown in Figure 3.64a. If the modulation period approaches the nanometer
range, minibands arise just as in the compositional superlattices considered
above. Doping superlattices are principally of type I1 because the two band
edges, at a given position, are shifted by the same amount of energy, namely
the electrostatic energy of an electron. Their periods, generally, cannot be
made smaller than 10 nm because of the unavoidable diffusion of dopant
atoms. The most common nipi-structures are those based on GaAs.
409
m
+
a)
-1
b)
+
+
+
+
+
+
+
+
+
+
+
+
Figure 3.64: Valence and conduction band lineup for doping microstructures in a
direct gap material for a) a nipi-structure and b) a n-type &doping structure.
&doping structures
3.7.3
Bulk methods
We consider SLs composed of two zincblende type materials as, e.g., GaAs
and AlAs. The interface is taken to be parallel to a (001) lattice plane. A
GaAs layer of the SL contains a number 2n of (001) lattice planes alternatively occupied by Ga and As atoms, and the AlAs layer contains a number
2m of (001) lattice planes alternatively occupied by Al- and As atoms. The
notation (GaAs),/(AlAs), is used for such a SL. The primitive unit cell of
[IIO]
0 Ga
OAs
8At
27r
B1
= -(ez
a
27r
- ey), B2 = -(ez
a
+ ey),
4n
B~ =
(n
+m)a
ez.
(3.246)
41 1
1".
to
3 -1
A
ol
!i
-2
-3
-4
-F;
GaAs
413
A1As
Figure 3.68: Localization of the lowest valence and conduction band states of
the SL at the center r of the first SL B Z (left), and the symmetry character of
these states (right), for a (GaAs)7/(A1As)7SL. While the valence band states are
well localized in the GaAs layers and are dominantly composed of GaAs-r-states,
conduction band states extend over both layers, and are composed of both AlAsX-states and GaAs-I'-states. h l = -0.11 eV,h2 = -0.19 e V , el = 1.75 eV,e2 =
1.77 eV, e3 = 1.88 eV.(After Rucker, 1985.)
of critical points. There are, however, also differences between the shallow
level problem and the microstructure problem. The perturbation potential,
i.e. the difference of the periodic oneelectron potentials of the two m a t e
rials of a heterostructure, is far from being smooth on the atomic length
scale. Even if one averages out the microscopic potential fluctuations over
a unit cell, the average potential difference has still an abrupt change at an
interface. For this reason one has to determine at the outset whether or not
the effective mass theory developed in section 3.3 can in fact be applied to
artificial microstructures. We will address this question below (for a detailed
discussion see, e.g., Burt, 1992).
The two envelope functions F,l(x) and Fv2(x) obey the effective mass equations
x in m a t e r i a l 1,
(3.248)
F V ~ ( X=) E , F , ~ ( x ) , x in m a t e r i a l 2.
(3.249)
F,l(x) = E,F,i(x),
Formally, these equations may be written as one equation for the envelope
function
415
(3.250)
of the whole heterostructure by introducing a z-dependent effective mass
m * ( z ) and a z-dependent band edge E,(z) defined, respectively, by
(3.252)
The true wavefunction $,(x) of the heterostructure is given by the expression
(3.253)
(3.254)
is the Bloch factor of the heterostructure. Clearly, equation (3.252) holds
only within the two material half spaces, but not at the interface. Thus it
determines the possible envelope function solutions for z < 0 and z > 0. At
z = 0, the solutions for the two half spaces must be connected in an appropriate way to form an envelope eigenfunction for the entire heterostructure.
To do so, matching conditions are required, which we will now discuss.
Firstly, we derive a condition for the change of the envelope function
across the interface. To this end we consider the interface behavior of the
Bloch factor ud(x) in equation (3.253) for the total wavefunction &,(x). If
this factor was continuous at the interface, the envelope function F,(x) also
would have to be continuous there since the total wavefunction @,(x) must
be continuous everywhere. Thus the condition
Fv(z,Y, 0 - 6 ) = F,(z, Y, 0
+6)
(3.255)
should hold in the limit 6 --+ 0. Unfortunately, no general proof exists for the
continuity of the Bloch factor at a heterostructure interface. The only case in
which such a continuity is assured is that of a 'heterostructure' composed of
two identical materials. Having this obvious result in mind, it is commonly
417
_-
h2
2m*(z)
v2
-.+
h2
--v.-
v.
m*(z)
(3.25 7)
+ +
(3.259)
For F u ( z ) the Schrodinger equation follows as
where EL is given by
EL = E , -
h2
-k i
2m*,
(3.261)
+ m;T1)/2.
[-~g$r(z)aaaat + ~ v ( z ) 6 m m iFumt(x)
]
= ~ v ~ v r n ( x z) ,# 0,
m a d
(3.263)
with DE$(z) = DYZAl for z < 0, and Dg$l(z) = D;ELl for z > 0. The
solution of equation (3.263) may again be taken as a plane wave
(3.264)
=+
F U (z)lz=-6 = F u
(3.265)
(z)I~=+6
must hold for 6 + 0. To determine the condition for the derivative with
respect to z , one defines the Hermitian Hamiltonian matrix
(3.266)
($
which exists for any value of z including z = 0 and equals H 1 for z < 0 and
e
H z for z > 0. Integrating the effective mass equation for this generalized
Hamiltonian with respect to z over a small interval across the interface leads
to the conclusion that the expressions
419
(3.268)
k(z)=
--a
[y2(zj(k: - k i ) - 2i73(z)k,kg] .
(3.272)
XJ
Figure 3.69: Geometry of the SL whose stationary electron and holes states
are calculated in the text.
Electron states
Effective mass equation and its solutions
For the zincblende type materials we are considering, the conduction bands
are non-degenerate and isotropic in the vicinity of I. The two effective
electron masses m:l and mE2 generally differ, but we will initially assume
that they are equal, denoting their common value by m;. In the case of
SLs composed of GaAs and Gal_,Al,As this is a reasonable approximation.
Later, we discuss the modifications which occur if m;l and m:2 are different.
The effective mass equation for the electron envelope function Fc(x)of the
421
Ec(z) =
Egl
+AE,
f o r Id
f o r Id
+ d l < z < (1 + l ) d .
(3.274)
<
<
00.
The envelope function F,(x) may be taken in the form Fckl, (x)of equation
(3.260) describing a plane wave of wavevector kll parallel to the layers. The
z-dependent factor F,(z) of Fckl,(x)obeys the equation
where E; is given by
EA = E , -
(3.276)
Fc(ld
+ d l - 0 ) = F,(ld + d l + 0 ) ,
-Fc(ld
dz
d
-F,(ld
dz
+d l
d
- 0) = -F,(ld
dz
+ 0),
(3.277)
= Fck(z) = Lfe ii k r U c k ( z ) ,
(3.278)
where k is the quasi-wavevector component parallel to z , and U c k ( z )is the superlattice Bloch factor. The component k varies within the (1-dimensional)
first B Z of the superlattice between - r / d and r l d . It must have the form
( 2 a / L d )x (0, f l , f 2 , . . .) in order to guaranty the periodicity of F&) with
respect to the periodicity interval. The Bloch factor U c k ( z ) of the superlattice has to be distinguished from the Bloch factor uco(x) of the two bulk
(3.279)
The effective mass equation (3.275), the matching conditions (3.277), and
the Bloch condition (3.278) define an energy eigenvalue problem which differs
from the well-known Kronig-Penney problem only by notation. The latter
problem constitutes an exercise of elementary quantum mechanics and is
commonly used to demonstrate the existence of energy bands. In the case
of SLs this exercise takes on real physical meaning. In the following we
sketch its solution. We restrict ourselves to energy values 0 < E' < A E ,
or E,1 < E < E,1
AE, (7i2/2mr)ki, meaning energies below the lowest
allowed energy in material 2 for a given value of kll.
According to classical mechanics, an electron with such an energy cannot
penetrate within the superlattice layers made of material 2, it will be confined
to layers consisting of material 1. If it hits the interface to material 2, then
it will be reflected back to the interior of its material 1 layer. In quantum
mechanics, the reflection is not complete, there is a certain probability for the
electron to tunnel through a material 2 layer and reach the next neighboring
material 1layer. Although the probability for an electron to stay in material
1 is not unity, as in classical mechanics, but smaller, it is still much larger
than that for material 2. As has already been mentioned, one uses the
terms quantum wells for the layers of material 1, and bamiers for the layers
of material 2. In the wells and barriers the wavefunction F,]F(z)is given
by different expressions. For the wells 0 < z < d l , Schrodinger's equation
(3.276) yields
+ ble-iKz,
(3.280)
with K as a real number which determines the energy E' by means of the
relation
E ,I = - h2
K2
(3.281)
2 m ~ '
+ b2e-nz,
dl
< z < d,
(3.282)
KE.=
2mE
-(AE,
Ti2
423
(3.283)
- EL).
For this set to have a non-trivial solution, the determinant of the matrix of
exponentials must vanish. This yields the secular equation
cos(Kd1) cosh(tid2)
tc2 - K~
+sin(Kd)l sinh(~d2)= cos(kd).
2Kn:
(3.285)
Em(k) = E&(k)
h2
+ Egi + -kll
2mr
(3.286)
Figure 3.70: Mini- and subbands of a superlattice. Also shown are the two
limiting cases of extremely wide and/or high barriers and of narrow and/or
flat barriers. In the lower part of the figure the wavefunctions are illustrated.
Multiple quantum well versus superlattice behavior
Figure 3.70 also shows two enlightening limiting cases. If the thickness d2
of the barriers between the wells becomes large, and/or the barriers AE,
are high, then the hyperbolic terms in equation (3.285) dominate over the
cos(kd)-term on the right hand side which is responsible for the k-dispersion.
Neglecting the latter, the minibands degenerate into discrete energy levels,
termed sublevels. These levels are just the discrete energy eigenvalues of a
single quantum well. Obviously, the superlattice decomposes into isolated
quantum wells in the limiting case under consideration. The electron states
are almost completely confined within a well, and practically zero within the
barriers. One uses the term multiple quantum well for a superlattice in this
limit.
For infinitely high potential walls the secular equation (3.285) becomes
sinKd1 = 0. The solutions of this equation are K-values of the general form
K = (7r/dl)nwhere n is an integer. They give rise to the energy eigenvalues
2
Elcn =
2mE
, n = 1 , 2 , .. . ,
(3.287)
425
(3.288)
The confinement to the well region is complete in this limit. Assuming
d l = 100 A and mE = 0.067 m , one gets a value of 57 meV for the first energy
level ELl. This means that the ground state of the quantum well is shifted
up by this energy amount as compared to the conduction band bottom of
the infinite bulk crystal. This shift is a consequence of the localization of an
electron within the quantum well which leads to a momentum uncertainty
because of Heisenberg's uncertainty principle.
We consider next the opposite limiting case, that of extremely narrow
and/or flat barriers (see Figure 3.70). In zero-th approximation the secular
equation (3.285) takes the form k = K . In this limit the minigaps vanish
completely, and the superstructure which distinguishes the superlattice from
a natural crystal has no effect at all. In the first non-vanishing approximation
with respect to the SL potential, gaps are opened at the Bragg reflection
points k = ( n n / d ) of the 1-dimensional reciprocal lattice of the SL. For
the energy splitting between the two bands E&+l(n/d) and E & ( n / d ) , the
perturbation theory developed in section 2.4 yields
E&+1
(:)
-E L
(:)
= -AEc
7rn
sin ( y d g ) .
(3.289)
According to this expression, the minigaps are proportional to the conduction band discontinuity AE,. They decrease with band number n. Thus,
for large n, there are no substantial energy gaps even if sufficiently wide and
high barriers exist. One may say that for minigaps to occur the product of
wavevector k = ( x n / d ) and superlattice period d should not be large compared to 1. In other words, the superlattice period d should not be large
in comparison with the electron wavelength X = 2 r / k corresponding to k.
This proves and specifies the expectation stated at the outset of this section,
that substantial changes of electronic structure are to be expected if the
superstructure occurs on a sufficiently small length scale. The length scale
turns out to be that of the de Broglie wavelength X of an electron. For a
conduction band electron of silicon, having an energy equal to the average
thermal energy $kT at room temperature, one gets A =" 20 nm. Similar
values are obtained for other semiconductors. Thus the layer thicknesses
of superlattices have to be on the order of, or smaller than, 10 nm in order that superlattice effects on free carriers may become important at room
temperature.
The limit of narrow and flat barriers is the true superlattice case, as
distinguished by the coupling of quantum wells with one another. The en-
(3.291)
Each subband n gives rise to a step-like partial DOS, being zero below the
band bottom E L E g l , and non-zero but constant above. Such behavior
of the DOS of an electron which is free to move only in two dimensions
was already found in section 2.5. The summation in expression (3.291) for
p ~ over
~ all
w subbands results in a staircase-like DOS as schematically
shown in Figure 3.71. The heights of the steps have the constant value
( m * / n 2 h 2 ) ( n / d )while
,
the step widths depend on n. If the barriers of
the QW are taken to be infinite, then the energy levels EAn are given by
( h . 2 / 2 m E ) ( n ~ / d 1 and
) 2 , the step widths of the staircase scale with ( n / d ~ ) ~ .
For very wide QWs, i.e. in the limit d l -+ 00, the staircase-like DOS transforms into the smooth square-root-like DOS p 3 ~ ( Eof
) an electron free to
move in three dimensions. Equation (3.291) yields
(3.292)
427
0
1
0 2
In the above treatment of the electron states of a SL, the effective masses
mzl and mE2 were assumed to be the same in the two materials. If this is not
the case, the following modifications will occur. First, in the boundary conditions (3.277) for the derivative ( d F , / d z ) , different mass factors will appear
on the two sides of the interface and they will not cancel. For the secular
equation (3.285) this means that all K ' s except those which are factors in
arguments of trigonometric functions, have to be replaced by (m&'mal)K.
Second, in solving the secular equation, the relations (3.281) and (3.283)
between K and E i and, respectively, n and E' must be written with m;
and mE2 instead with mE. Finally, the kll-dispersion of the total energy
eigenvalue in relation (3.286) becomes different in the two material layers.
~ the
This results in an additional contribution (h2/2)[mZ-'(z) - E ~ - ' ] k to
SL potential, as may be seen from equation (3.260). The additional term
introduces a kll-dependence of the envelope function F& which otherwise
depends only on the z-component k .
Another simplification made in the above treatment of the electron states
of a SL, was the assumption that no electrons are available to occupy the SL
0.5 x102cm-2
6 xlO*cm-*
1.5 x102cm-2
Figure 3.72: Self-consistently calculated potential profiles and energy levels for
a Alo.aGao..rAs QW of 100 A width containing different numbers of electrons per
cm-2.
bands. If there are such electrons present, the potential seen by an electron
will change due to its interaction with other electrons in the SL bands. As in
the case of bulk semiconductors, this potential change may be decomposed
into a Hartree and an exchange-correlation part. The expressions derived for
these potentials in Chapter 2 also apply here if the stationary states of the
bulk crystal are replaced by the stationary states of the SL. As the latter
states are only known after the effective mass equation has been solved,
the inclusion of the Hartree and exchange-correlation potentials must be
performed in a self-consistent way. The net effect of the two potential parts
is repulsive, thus the SL potential well will flatten. For a given number of
electrons in SL states, this flattening will be enhanced by greater localization
of the electrons in the well regions. It will be particularly pronounced for
isolated QWs. In Figure 3.72 we show self-consistently calculated potential
profiles and energy levels for a Alo.3Gao.7As QW of 100 A width, containing
. well bottom bends up with
different numbers ng of electrons per ~ r n - ~The
rising n s , and the energy levels are shifted to higher energies.
Hole states
We consider a QW structure composed of zincblende type well and barrier
materials which are both described within the Luttinger-Kohn model. The
effective mass equation for the hole states of such a QW with energies close to
the well bottom follows from equation (3.268) by installing there the valence
band edge profle E,(z):
forO<z<d
AE,
f o r z < 0 and z
E,(z) =
> d.
(3.293)
429
In this way, a set of second order differential equations for the four hole
envelope function components arises. The solutions must be normalized
and must guarantee for the continuity of the expressions (3.273) at the two
interfaces. Solutions of this kind can be found in different ways. One can,
for example, numerically calculate normalized solutions for various values
of E and kll, valid in one of the three regions -m < 0, 0 < z < d and
z < +m. Matching these solutions in such a way that the continuity of
the expressions (3.273) is guaranteed for a given value of kll, will only be
possible for certain discrete energy values E,,(kll). These are the subband
energy eigenvalues which are sought. Subband structures calculated in this
way are shown in Figure 3.73 for a A10.3Ga0.7As QW of 100 A width. The
lowest subband ( h h l ) in the vicinity of kll = 0 arises mainly from hole states
of GaAs with the heavy hole mass parallel to z , and the second level ( I h l )
mainly from hole states with the light hole mass parallel to z. Perpendicular
starts stronger than that of
to z , the curvature of the hhl subband at
the Ihl subband. Thus the two bands would cross, if this were allowed. In
reality, the bands repel each other leading to the anticrossing behavior seen
in Figure 3.73. The alternating heavy- and light-hole-like curvatures parallel
and perpendicular to z are consequences of the Luttinger-Kohn Hamiltonian
(2.374).
If the subbands of the QW are partially filled by holes, then hole-hole inter-
:::
50
r-type &well
p-type &well
8.0x 10 cm-*
-50
-250
-200 -100
100
z (4
200
-200 -100
100
200
z (A)
Figure 3.74: Potential pofiles and energy levels of an isolated p-type &doping layer
(left hand side) and an isolated n-type-&dopinglayer (right hand side) of the same
sheet dopant concentration 8 x 10l2 ~ r n - ~The
. Fermi energy is shown by dotted
lines. (After Sipahi, Enderlein, Scolfaro, and Leite, 1996.)
action effects become important which, as in the case of electrons, require
self-consistent calculations.
&doping structures
Whereas in compositional microstructures free carriers may, but need not,
be present (they were omitted above), &doping microstructures do not exist
without such carriers because the doping layer would be neutral if none of
the doping atoms would be ionized. This implies that potential profiles and
energy levels of doping microstructures cannot be calculated without taking
carrier-carrier interaction into account. The main effect of this interaction
turns out to be the screening of the Coulomb potential created by the ionized
dopant atoms. In Figure 3.74 we show the resulting potential wells for an
isolated p-type &doping layer (left hand side) and an isolated n-type 6doping layer (right hand side), together with the sublevel and Fermi level
energies. The dopant atoms are taken to be completely ionized. The lower
Figure 3.75: Calculated current-voltage characteristics of various GaAs/ AlAs SLs with the
electric field parallel to the SL
axes. Curves 1to 4 correspond to
SLs of different miniband widths
A. (After Lea, Horing, and Cui,
1991.)
43 1
100
>a
80
60
40
>
20
10 20 30 40
SO 60 70
E [kV/cml
well depth in the p-type case as compared to the n-type case results mainly
from the fact that heavy holes are more strongly localized at the ionized
dopant sheet and thus are more effective in screening out the sheet potential
then the lighter electrons.
300K
>-
BIAS (VI
until it reaches the inflection point of Eil(k). Thereafter, its effective mass
becomes negative, causing deceleration instead of acceleration. Since the
electron is inevitably scattered by phonons and impurities, its velocity takes
a stationary value in a dc field. The latter increases with increasing field
strength E if the corresponding k-value is below the inflection point, and
it decreases with increasing E if the k-value of the electron is above this
point. However, a superlattice contains a gas of many electrons, populating
the miniband E:(k). For low field strengths, most of the electrons have low
k-momenta and positive effective masses. Thus, the total current density of
the gas increases if E is further increased. Simultaneously, the electron gas
is heated up by the electric field, which implies that more electrons populate
k-points with a negative effective mass and less with a positive one. This
reduces the current increase if E raises further because a larger subgroup of
electrons is decelerated instead of being accelerated by the field increase. If
the subgroup of electrons having negative effective mass is sufficiently large,
further increase of the field strength will actually decrease the current. This
explains the negative differential drift velocity seen in Figure 3.75.
An example of an optical experiment is shown in Figure 3.77. There,
the absorption spectra of GaAs/Al,Gal-,As QW structures of different wen
widths are shown. The staircase-like shape of these spectra reflects the DOSs
of the electron and hole subbands of a QW which above have been shown
to be staircase-like as well. Also other features of the QW band structure
observed above, as the widening of the fundamental energy gap and the
shift of the steps towards higher energies with decreasing well width, can be
clearly seen in the spectra of Figure 3.77. The enhancement of absorption
at the step edges, which correspond to optical transitions from the tops of
hole subbands to the bottoms of electron subbands, is due to the Coulomb
433
3.8
3.8.1
V(x)= e E . x.
(3.294)
435
{ E y ( - i V ) + e E . x } F V ~ , ( x )= EvFv~,(x).
(3.296)
(3.297)
=E ~ X , ~ ~ ( Z ) ,
(3.298)
with
E , = E
li2
+ -2m
k:l.
(3.299)
For the conduction and valence bands, in particular, equation (3.298) reads
(3.300)
(3.301)
The second equation can be interpreted as the effective mass equation for
a hole with (positive) mass Im:l and energy (-.,).
As the equation shows,
the charge of the hole is positive, i.e. fe, in contrast to the electron with
charge -e.
The spectra of energy eigenvalues of equations (3.300) and (3.301) are
continuous, and vary between -co and fco as a direct consequence of the
unrestricted motion of the electron or hole in the field direction. This allows
one to use the energies E , as continuously varying quantum numbers of the
,
to assume normalization in terms of Diracs
stationary states X V C v ( z )and
&function with respect to these energies, thus
(3.303)
(3.304)
with
(3.305)
437
0.4
C
0
.u
g3
0.3
(D
rn
0.2
-e
N
Lc
0.1
0.0
-0
P
c
2
rt
v)
-0.1
-0.2
-Eg/eE
Figure 3.78: Envelope functions of stationary electron and hole states in an electric
field, taken at energy cC = E, = 0. Oc = OU = 8.
3.8.2
(3.306)
$v(x, t ) = ~ V O ( X ) F V ( Xt ,) .
at
(3.307)
In solving this equation we assume that the electron was in a Bloch state
(p,k(x) at t = 0. Then the initial condition for F,(x,t ) may be stated as
FJX, t = 0) s F,/C(X, t = 0) =
-.ik.x
'
(3.308)
Here, the envelope function at t = 0 has been chosen such that its normalization integral with respect to the infinite interval over which the crystal
in field direction extends, is given by a &function with respect to k. The
solution F,k(x, t ) of equations (3.307) and (3.308) is given by
(3.309)
with
kt = k - ZEt.
fi
(3.3 10)
This is readily verified by a straightforward calculation. Owing to the envelope function (3.309) and the previously made approximation U&
Uuk,
the total non-stationary wavefunction & , k ( x , t ) evolving from a Bloch state
(pvk(x)at t = 0 remains a Bloch state for t > 0 also, however, with a
time-dependent quasi-wavevector kt,
(3.311)
This important result may also be established without approximating the
Bloch factor U v k by u,o as is done in effective mass theory. The only approximation needed is the neglect of interband transitions induced by the
electric field.
Equation (3.311) states that the quasi-wavevector k of a Bloch state in
the presence of an electric field changes with time in just the same way as
does the ordinary wavevector of a free electron. Differing from the latter,
the quasi-wavevector is restricted to the first B Z . However, relation (3.310)
results in kt-values which can also lie outside the first B Z . This means
that in the above derivation we inadvertently changed from the reduced
to the extended zone scheme. According to the general considerations of
Chapter 2, one may recover the description in terms of the first B Z by
subtracting a suitable reciprocal lattice vector K(t). For electric fields in
symmetry directions of the first B Z , there will be a particular point in time
t = T(E) at which, with increasing t , the reduced wavevector kT - K(T)
equals the wavevector kt at t = 0 for the first time. This means that the
time-dependent Bloch state (3.311) returns, at t = T , to the Bloch state
at t = 0 (not counting a phase factor). The same holds for time points
2T,3T etc. The time development of Bloch states in an electric field is
therefore periodic, hence the term Bloch oscillations. The period T of these
oscillations may be obtained from (3.310) as
(3.312)
The Bloch oscillations affect not only the wavefunctions but also the pertinent energies E,(kt), meaning that Bloch electrons execute periodic motion
within their energy bands.
Oscillations also occur in the velocity of Bloch electrons, defined by the
quantum mechanical expectation value < ( d x / d t ) >= (qJvhI(dX/dt)IVvk,)of
439
the velocity operator ( d x l d t ) with respect to the timedependent Bloch function Pvk,. using the identity (dxldt) = ( l / m ) p , the velocity < ( d x / d t ) >
follows by means of the previously derived relation (2.193) for the expectation value ((p,hJpJ(p,h)of the momentum operator p. The timeaverage of
the velocity < ( d x / d t ) > obtained in this way, taken over a Bloch period,
turns out to be zero. The same result holds for the timeaveraged current of
the electrons of a crystal in an external electric field, it also vanishes. That
this is actually not the case is due to perturbations of the Bloch oscillations
by collisions of electrons with phonons, impurities and other point perturbations. To prove this statement we consider a field strength of lo4 V / c m and
a primitive reciprocal lattice vector K typical of semiconductors. The Bloch
period T is of the order of magnitude lo-'' s in such circumstances. The
time rv between two collisions is substantially smaller than T , typically of
the order of magnitude
s. Thus, an electron starting a Bloch oscillation will soon be scattered and its momentum will be distributed randomly
over all k-space. In other terms, no periodic motion can develop in the presence of collisions. We conclude that collisions are not only responsible for
the fact that the current of a free electrons, which would otherwise be infinitely large, remains finite, but also for the fact that an otherwise vanishing
average current of a Bloch electron does not actually go to zero.
There is still another reason that Bloch oscillations cannot be observed
in ordinary semiconductor crystals. It is due to the fact that the time dependent Bloch states (3.311) decay in time by themselves since the exponential
factor oscillates with increasing frequency. This may easily be seen for quasiwavevectors k close to a critical point. For zero wavevector component k,
in the field direction, the phase of the exponential contains a term (1/3)e:t3
(0, is the electro-optic frequency of equation 3.305). If (1/3)0:t3 is considerably larger than 1, the exponential oscillates so fast that the time dependent
Bloch states (3.311) average out to zero almost completely. One may say
that these states have a finite lifetime 0;' due to the electric field. For nottoo-large field strengths, e;' is small compared with the period T of Bloch
oscillations, just as the time T, between two collisions above was found to be
small with respect to T . This implies that, in actual crystals and under normal conditions, Bloch oscillations are not observable even without collisions,
just due to the finite field induced lifetime.
For the artificial superlattices considered in the previous section 3.7, the
primitive lattice vectors and thus the periods T are much shorter than those
in natural crystals. In such circumstances T may become larger than T~ and
OF1,and Bloch oscillations may in fact be observed. The negative differential
electric conductivity of a superlattice parallel to its axis, reported in section
3.7, also may be understood as a manifestation of Bloch oscillations.
If one considers only k-vectors close to critical points of the band structure, then the energy bands E,(k) can be taken in parabolic approximation.
(3.313)
With the neglect of collisions, an equation similar to Newtons law of motion
m t -d 2 < x > = -eE,
dt2
(3.314)
follows from (3.313). It differs from the ordinary Newtons equation in that
the free electron mass is replaced by the effective mass of the respective band
v. For electrons from the valence band the effective mass is negative. The
pertinent particles with positive mass, the holes, behave like particles with
positive charge e.
3.8.3
Interband tunneling
In the effective mass equation (3.296), the possibility of electric field induced coupling between the valence and conduction bands is neglected. In
reality such coupling exists due to the non-vanishing interband matrix elements of the potential V(x) = e E . x. Taking account of these off-diagonal
elements yields a description of electrons undergoing quantum mechanical
transitions between the valence and conduction bands. For reasons of energy
and momentum conservation, the initial and final state energies E,, Ec and
wavevector components k,l, k,l perpendicular to the field have to be equal
in such a transition. According to equation (3.299) this means E, = cC. The
quantum mechanical probability W , for field induced interband transitions
is proportional to the overlap integral between the two envelope functions
F,,vkvl(x) and FECkci(x) with identical values for E, tC and k w l ,kc-. If,
in particular, transitions of electrons between the two band edges, i.e. with
E, = ec = 0 and k
,l = k,l = 0, are considered, the overlap integral has
to be taken with respect to the product Ai([E,+ e E ~ ] / A e , ) A i ( [ - e E z ] / A e , ) .
This integral is non-zero since valence and conduction band states having
the same energies E, = eC = 0 differ from zero simultaneously for almost all
values of z , as may be seen from Figure 3.106. The reason for the non-zero
overlap is the tunneling of the valence and conduction band electrons into
their respective barriers. This explains why field induced interband transitions are referred to as i n t e r b a n d t u n n e l i n g (the term Z e n e r t u n n e l i n g is also
used).
Performing the overlap integral of A i ( [ E , e E z ] / ~ e , ) A i ( [ - e E z ] / A e , )
with respect to z one arrives at an expression proportional to the square of
the Airy-function Ai3(E,/he,) where 8, is a frequency analogous to the
electro-optic frequencies e , , of equation (3.305), formed here, however with
the reduced effective mass mT, = mZ,m:/(mz m:) of electrons and holes
441
instead of their simple effective masses m,*,,. One thus obtains the interband
transition probability W , as
W,, oc A i 2
(z)
.
(3.315)
Therefore,
(3.316)
for the Airy-function applies. To understand what this result means physically we may argue as follows. Expression (3.315) for the transition probability W , was derived using the stationary electron states in an electric
field of subsection 3.8.2. The same expression follows if one uses the nonstationary states of subsection 3.8.3. In this, the electro-optic energy Ti@,,
may be understood as the energy uncertainty of an electron-hole pair due
to its finite lifetime in the presence of an electric field. Without such an
uncertainty interband tunneling would not be allowed by reason of energy
conservation: the final state in the conduction band would differ in energy
from the initial state in the valence band by the gap energy E,. The energy
uncertainty relieves the need to satisfy energy conservation and permits the
interband transition probability W , to be nonzero. The non-stationary
approach also provides a rough estimate of the proportionality factor in relation (3.315). The probability W,, for an electron to tunnel per second
from the valence into the conduction band at k,l = kl = 0, may be obtained from Ai2(E,/7iB,,) by multiplying this expression with the frequency
1/T of Bloch oscillations. This is plausible since this frequency indicates
how often a valence band electron, per second, reaches the upper band edge,
from which it can tunnel into the conduction band. It is just the probability for this tunneling step which the Airy-function expression A i 2 ( E g / W , )
represents. The complete expression for W , thus reads
W , = -Ai2
1
(2).
(3.317)
<< Eg
(3.318)
3.8.4
(3.319)
In an external electric field, the optical transition probability and, hence, the
optical absorption coefficient, is also non-zero for photon energies below the
gap. It decays exponentially as hw decreases. This phenomenon is called
the Franz-Keldysh effect (Franz, 1958; Keldysh, 1958; Boer, Hansch and
Kummel, 1959). Above the gap, the absorption coefficient with electric field
oscillates with photon energy about the absorption coefficient without field,
and we have Franz-Keldysh oscillations (Tharmalingan, 1963).
Field induced changes occw not only in the absorption coefficient, but
also in other optical constants including the real part of the complex refractive index. Besides the changes at the fundamental absorption edge considered above, one also has changes of optical constants at any higher van Hove
singularity of the joint density of states of the interband energy Ec(k)-E,(k)
- ( E,-ho)/eE
443
Figure 3.79: The same envelope functions as in Figure 3.78, but with the energy
of the electron exceeding that of the hole by the photon energy iiw.
(Aspnes, 1967; Enderlein and Keiper, 1967). This fact is exploited in electrorefiectance spectroscopy, in which field induced changes of the reflectivity
of a semiconductor sample are measured by means of a modulation technique. Photon energies which give rise to strong electroreflectance signals
correspond to optical transitions at critical points of the interband energy
(see Figure 3.80). By measuring the electroreflectance spectra of semiconductors one obtains experimental data relating to their valenceconduction
band separation at critical points. These can be used as input parameters
for empirical band structure calculations (Cardona, 1969) as well as for the
characterization of semiconductor heterostructures, including superlattices
and quantum wells (Pollak, 1994; Enderlein, 1996).
3.9
Like electric fields, macroscopic magnetic fields also give rise to effects in
semiconductors which provide experimental data concerning effective masses
and other microscopic properties of electrons and holes. Transport properties, particularly galvanomagnetic phenomena such as the Hall effect, magnetoresitivity and cyclotron resonance, as well as magneto-optic properties,
are of particular importance in this context. The basis for theoretical understanding of these phenomena are the stationary one-electron states of a
semiconductor crystal in the presence of a magnetic field. For most purposes
-121
1.0
1,5
20
25
30
35
400
Energy ( e V )
4s
3.9.1
Initially, spin will be omitted from our considerations (it will be taken into
account later). In these circumstances, the role of the magnetic field in the
Hamiltonian is fully subsumed in the kinetic energy operator (p2/2m) with
the canonical momentum operator p replaced by the kinetic momentum
operator p (e/c)A(x). The vector potential A(x) is determined by the
magnetic induction vector B, save for the gradient of an arbitrary gauge
445
(3.321)
As in the derivation of the effective mass equation for perturbing scalar potentials in section 3.3, we use the k . p-perturbation theory, i.e. we represent
the Schrodinger equation in the approximate Bloch basis Iuk)', whence
'(vk(H
u'k'
+ LmcA ( x . p ) +
(:)
A2(x)IvAk')l '(vhk'lg)
=E
'(vkI$).
(3.322)
with H = (p2/2m) + V ( x ) as Hamiltonian of the crystal without a magnetic
field. Initially, we consider a non-degenerate band extremum at k = 0, and
assume the form (3.60) for '(vkl$). In order to satisfy the Schrodinger
equation (3.322) with this Ansatz, the component with u = vo must obey
the relation
l(vok(H
k'
+ emAc ( x ) .
f)
A2(x)(uok')'Fw(k') = EF,,(k).
(3.323)
(vkl A2Ivk)
6
1
,(
klA21k) .
(3.325)
(3.326)
This relation finally confirms the vanishing of the v # vo-components of the
left hand side of Schrodingers equation (3.323) and, thus, of (vklq) with
v
# vo.
Since
1
A.p= -L.B
2
with L = [xx p] as angular momentum operator, we have
(vokOleA. pIvok0) = pB(vokO(Ti-LIvokO) . B.
mc
(3.327)
(3.328)
(3.329)
Here all terms depending on the magnetic field have been neglected since
they only give rise to quadratic terms with respect to B in the matrix elements (3.328). Substituting relation (3.329) in expression (3.328) for these
elements, and using the notation L , = zap^ - @pa for angular momentum
components (c.Pr signifies the ordered triplet z y z or a cyclical permutation
of it), we obtain
447
(3.330)
In this, we used the fact that the interband matrix elements of the momentum operator are diagonal with respect to k and k. The matrix elements
(vokO)L,lvokO) do not depend on k, so we may use the relation
The quantity L, is an antisymmetric tensor or, in other terms, a pseudovector L,. This antisymmetric tensor bears a close relation to the reciprocal
effective mass tensor M&La of equation (2.337). Apart from a constant factor, the two quantities may be understood as real and imaginary parts of
the same complex tensor, namely
(3.332)
In section 3.3 we found that the symmetric tensor M,Lp describes the kdependence of the band-index- and k-diagonal Bloch matrix elements of the
Hamiltonian (see equation (3.63)). Here we find, that the antisymmetric
tensor L,
determines the B-dependence of these elements. The latter
evidently describe the interaction of the magnetic moment of the orbital
motion of a Bloch electron with the magnetic induction vector B. In order
that it differ from zero, the angular momentum of the Bloch state IvoO)
must differ from zero, too. In principle, this is possible because the motion
of a Bloch electron cannot in general be reduced to a translation, since it
contains a rotational part about the atoms which form the crystal. The
tensor L, measures the average angular momentum of this rotation in the
In cubic materials it vanishes for symmetry reasons, at
Bloch state 1~00).
least as long as only non-degenerate Bloch states IvoO) at the center of the
first BZ are considered and spin and spin-orbit interaction are omitted from
consideration, as we do here. The situation is comparable with that for
s-states of free atoms.
Second step
Beside the k-diagonal term discussed above, which describes the coupling of
the magnetic field to the rotational part of the motion of a Bloch electron, the
+1
2m
[k,e(k(Aa(k)
kp(kja,jk)] . (3.334)
4
Summing the three terms (3.63), (3.334) and (3.335) obtained in rewriting
the Schrodinger equation (3.323), and simultaneously transforming from kspace to coordinate space, the following equation results for the envelope
function Fvo(x):
6kk,h-1pBB. L,
(3.336)
This equation changes if spin and spin-orbit interaction are taken into account. The assumption of a non-degenerate band vo will be adopted, initially.
Spin a n d spin-orbit interaction
Considering the role of spin and spin-orbit interaction, the Schrodinger equation (3.321) may be written as
449
= EF,(x,s).
(3.338)
In obtaining (3.338), we used the fact that the diagonal elements '(voaklH,,
Ivoa'k)' of the spin-orbit interaction operator vanish at k = 0 for symmetry
reasons, while the non-zero k-linear terms in these elements are negligible.
The matrix elements L, of the angular momentum operator emerge from
the corresponding expressions (3.330), (3.331) without spin-orbit interaction
by replacing p in the latter with the spin-dependent operator ?i of equation
(2.353), obtaining
In accordance with their definition, the L,, are matrices in the two-component spin space. In regard to their transformation properties in coordinate
space, they are components of a pseudovector. Apart from a constant factor,
there is only one such pseudovector in the case of systems with the symmetry
group oh, namely the vector of Pauli's spin matrices a. This means that
L, is a multiple g,Acr' of cr'. The constant gvo is determined by the matrix
1
elements of (3.339). With
= guo 29,
we may write
h-lLm
1
+ 2g
cr' = i g & d .
(3.340)
Using this expression, the effective mass equation (3.338) takes the form
The factor gEo is called an effective g-factor. Just like the effective mass,
it represents a characteristic of an energy band vo at a particular critical
point. In Table 3.11 experimental g:,-values for the conduction band minimum of several diamond and zincblende type semiconductors are listed.
As one may see, all values gE are less than 2, the g-factor of the free electron. The deviation from 2 is determined by the induced magnetic moment
of the orbital-motion component of a Bloch electron. As in a free atom,
this moment is always negative, in other words it results in a diamagnetic
IC
Bornstein, 1982.)
GaAs
-0.42
3.41
1.20
0.06
0.04
GaSb
InSb
0.13
0.15
CdTe
ZnSe
contribution to the total magnetic moment of the Bloch electron. For Si,
the effective electron g-value is close to 2, indicating that the diamagnetic
contribution is small in this case. A negative effective g-value and hence a
larger diamagnetic contribution occurs in Ge. This is to be expected since
the diamagnetic moment is determined by the average angular momentum
L
, which according to expression (3.339), and similar to the effective electron mass of expression (2.338), increases if the gap decreases. The same
behavior of the effective g-factor for decreasing gap is observed among the
111-V compound semiconductors: InSb, the material with the smallest gap,
has the largest negative value of g:.
To describe the top of the valence band of diamond and zincblende type
materials, we need an effective mass equation for degenerate bands, and will
address this below.
Degenerate bands
We consider the valence band of a diamond and zincblende type semiconductor in the vicinity of the BZ center at k = 0 . Spin and the spin-orbit
interaction will be omitted initially. Then the degeneracy is %fold. The
Bloch states at k = 0 are IvmO),m = 2,y, z , and the pertinent LuttingerKohn functions read (vmkO). The eigenfunction $(x) is given in terms of
*
the envelope function vector F , (x)= (Fwz(x),
F w ( x ) , F,,(x)) by equation
(3.67). The elements MGip of the effective mass tensor are 3 x 3 matrices
04
45 1
It follows that the envelope function vector of the degenerate valence band
in a magnetic field satisfies the equation
(3.345)
with K being a constant which has to be determined from band structure
calculation or by means of experimental investigation. The same result for
9
H M follows for the I'l5-valence band of zincblende type semiconductors.
The above consideration shows that, in the case of degenerate bands,
there is coupling of the orbital motion to the magnetic field which does not
occur for non-degenerate bands. The source of it is the non-vanishing of
angular momentum matrix elements between the degenerate valence band
states. Of course, the average angular momentum of all valence band states
also vanishes in the degenerate case. However, the various angular momentum states, averaging to zero in the absence of a magnetic field, are
r;,
(3.346)
where K and q are again constants which must be determined either from
band structure calculations or experimentally. That the presence of spin
involves not only K , but also a second constant q, is understandable because
the orbital and spin motions have different gyromagnetic ratios. In most
(z
cases q is negligibly small (see Table 3.11). The result (3.346) for H M which
has been derived here for diamond type semiconductors, holds unchanged
for materials with zincblende structure also.
3.9.2
453
because for this gauge the vector potential A = (--By, 0,O) also commutes
with the momentum operator p. The same holds for the more complex
effective mass equation (3.343) for degenerate bands without as well as with
spin. Using the present gauge, the effective mass equation (3.336) for a
non-degenerate band without spin takes the form
2am:,
(Pz + $Y)2
+Pp
+pq
F d X ) =
( E , - E,o)F,(x).
(3.347)
(3.348)
F vo (x)= -ez(kSzfkZa)X
vo(Y),
27T
yo = rCk,,
liC
r z = -.
eB
(3.350)
(3.351)
Equation (3.349) describes a harmonic oscillator with center at yo, mass lm&1
and eigenfrequency wcw. It is well-known that in this case the eigenstates
can be described by integer quantum numbers n = 0 , 1 , 2 . . . ,00, with the
corresponding eigenenergies E L given by
EL,
E &=
~ FAW,,,
( + a> . sgn(m&).
n
(3.352)
These energies are called Landau levels. For the eigenfunctions Xvo(y),we
have
(3.353)
with H , a Hermitian polynomial of order n. Thus the electron executes an
oscillation in y-direction with angular frequency wc, about its equilibrium
position yo. The pertinent frequency is given by
(3.354)
For mC, = 0.2 m and B = 1 Tesla, this frequency amounts to 140 G H z , i.e.
it lies in the microwave region. According to equations (3.352) and (3.353),
the spectrum of allowed energy values forms a set of 1-dimensional energy
bands, the Landau subbands,
+ i).
(I
455
v)
.-
% 4
Q-
4
6
E - Eg(hwCc)
determines the equilibrium position of the harmonic oscillations along the yaxis. Since the eigenenergies Evan(kz)of these oscillations do not depend on
the equilibrium positions, a high degree of degeneracy exists with respect to
k,, and every arbitrary linear combination of eigenstates having different k,values is again an eigenstate. If the vector potential would have been chosen
in the form (3.320) which is symmetric with respect to I and y, then such
linear combinations would have resulted in eigenstates which also execute
oscillations with frequency wcvo in the 2-direction, but phase shifted by 7rj2
with respect to the oscillations in the y-direction. This signifies circular
motion in the plane perpendicular to the magnetic field, known as cyclot.ron
m o t i o n Such circular motion is a consequence of the axial symmetry of the
magnetic field. In classical mechanics, one also has circular orbits for the
motion of an electron in a magnetic field, projected on I - y plane.
The radius of the cyclotron orbit depends on the quantum number n,
larger values of n and, with this, also larger energies ELan mean larger cyclotron radii. By absorbing an energy quantum of an electromagnetic radiation field, an electron can make a transition from the n-th to the (n l)-th
cyclotron orbit. Such transitions are involved in cyclotron resonance: At
a fixed frequency, the absorption of microwave radiation is measured as a
function of magnetic induction B. Varying the magnetic field, the cyclotron
frequency can be adjusted to match the fixed frequency of the microwave ra-
457
Chapter 4
Electron system in
t herrnodynamic equilibrium
4.1
458
where H and
denote, respectively, the Hamiltonian and the total particle
number operators of the many-electron system, in the so called occupation
number representation This representation is well suited to describe Slater
determinants in a more compact way than was done in Chapter 2. Before
proceeding further, we will introduce this representation (for more detail see
Appendix C). We number the possible one-particle states by an integer i
which can take the values 1,2,. . . ,m. If a particular one-particle state i
occurs in a Slater determinant one says that it is occupied or that its occupation number Ni is 1. If the state i does not occur, then Ni is 0. A
given one-particle state cannot occur more than once, otherwise the Slater
determinant vanishes in conformance with the Pauli exclusion principle for
electrons. The occupation numbers Ni can therefore only take the values 0
and 1. Using occupation numbers, the Slater determinants can be described
459
as vectors IN1, N 2 , . . . , N,) in the occupation number space. These vectors have, altogether, as many Ni different from zero as the total number of
electrons in the system. The IN1, N z , . . . , N,) may be understood as eigenvectors of particle number operators i?i for the eigenvalue Ni (see Appendix
C ) . Therefore,
fiiIN1, N27... N w )
(4.2)
00
i=l
(4.3)
The occupation number vectors IN1, N z , . . . , N,) form a complete basis set
in Hilbert space of the many-electron system. These vectors may be used to
represent an arbitrary many-electron operator. If only extended oneparticle
states are involved, like in the case of an ideal crystal, the occupation number
representation of k reads
W
EiNi.
H=
(4.5)
i=l
Using equations (4.2) to (4.6) one may easily show that the occupation
number space matrix representation of the equilibrium density matrix of
equation (4.1) is in fact diagonal with respect to the basis set of Slater
determinants "1, N z , . . . , N,), with the diagonal elements given by
460
The average value < fii > of the particle number operator fii, like the
average of any other operator of an observable quantity of the many-electron
system, is formed in accordance with the relation
Here, the symbol Tr[. .I stands for 'Trace' and means the summation over
all diagonal elements of the operator within the brackets with respect to
any basis in the Hilbert space of the many-particle system. One may easily
demonstrate that the value of the trace is independent of the particular basis
chosen. Of course, it is advantageous to use the set of Slater determinants
IN1, N z , . . . , N,) as basis, since the matrices of 6 and Ni are known in this
represent ation.
4.2
4.2.1
where we have set p = l / k T for brevity. The two expressions (4.9) and (4.10)
can be written as products of sums over particle numbers for a particular
state j , as indicated in equation (4.10). In the quotient, all factors cancel
except the two with j = i. These remain and give rise to the expression
46 1
Since the particle number Ni may take only the values 0 and 1, it follows
that
(4.12)
The average particle number in the one-particle state i thus depends only
on the energy of the state. If one identifies the chemical potential p with
the Fermi energy E F , and uses the notation
(4.13)
for the Fermi distribution function previously introduced in Chapter 1, we
obtain
(4.14)
We conclude that the average particle number for configuration-independent
one-particle state i is given by the Fermi distribution, just like for an ideal
gas. The distribution function of classical statistics, the B o l t z m a n n distribution
(4.15)
follows from the Fermi distribution for energies E with E - E F >> k T . This
also means f(E)<< 1. Under this condition the average particle number in
the one-particle state is small compared to 1, and thus the state is almost
unoccupied. Substantial deviations from the Boltzmann distribution occur
when E - EF is no longer large in comparison with k T and, therefore, f ( E )
no longer small compared to 1. In this case of substantial occupancy the
limited capacity of a one-particle state to host electrons (remember the Pauli
exclusion principle) becomes important, and the average particle number in
it deviates from that in classical statistics in which the hosting capacity of
states is not limited.
The deviation of the average particle number in a oneparticle state from
the value given by the Boltzmann distribution is called (statistical) degeneracy. A particle system is referred to as non-degenerate if it is described
by the Boltzmann distribution, otherwise it is called degenerate. The appearance of statistical degeneracy is a purely quantum mechanical effect - it
reflects the quantization of energy levels and the indistinguishability of electrons. Statistical degeneracy applies to other elementary particles as well.
Indistinguishability refers to the fact that, in the exchange of two identical elementary particles, the total wavefunction of the many-particle system
transforms either into its negative (this holds in the case of electrons and
leads to the Pauli exclusion principle and with it to the Fermi statistics),
462
or the total wavefunction transforms into itself (then the population of the
oneparticle states is not limited, but the quantization of energy yields a
distribution function which also deviates from the Boltzmann distribution).
While the Fermi distribution emerges in the former case, in the latter case
it is the Bose distribution function, which may be formally obtained from
the Fermi distribution function (4.13) by setting the chemical potential zero
and replacing +lby -1in the denominator.
In the particular case of band states, the quantum number i is the pair
vk of band index v and quasi-wavevector k. In order that the wavefunction
luk) really represent only one quantum state, which is the case here, u must
also describe the spin state, or, alternatively, we must augment v to include a
spin quantum number CT. We use the latter description, i.e. we will write the
band states, henceforth, in the form lvko). With this, we assume that both
the band and spin quantum numbers I/ and u are compatible and meaningful
simultaneously. This is only justified if the spin-orbit interaction plays no
role and the two states (uk $) and lvk have equal energy E,(k),
which we take to be the case here. In such circumstances, equation (4.14 )
yields
1
< fiuk6 > = f(E,(k)), ff = f-,
(4.16)
g).
+ NUk-+
and
(4.17)
The Fermi distribution (4.14) solves the problem of determining the average
equilibrium particle number < I?i > for one-particle states li) having energies
which are independent of the configuration of the many-electron system.
This distribution cannot be applied to configuration-dependent one-particle
states. In the latter case the above calculation of average particle numbers
has to repeated, taking account of the configuration dependencies of o n e
particle energies because of Coulomb repulsion effects.
4.2.2
463
I D a ) l % IDu)2 = p a ) ,
(4.18)
as was the case before, and consequently also only one type of particle number operators Nou, u = fi,Using the pertinent occupation numbers No,
of localized states, the N-particle states of system may be characterized as
IND+rNo-+, NJ, ...Nm), where N3, ..., N, denote the occupation numbers
of ex&nded band states. We use these N-particle states as a basis for the
calculation of the average particle number < N 1 > in localized state IDa)
D2
according to expression (4.8). We have
(4.19)
The factors referring to extended states are the same in the numerator and
denominator of expression (4.16), thus they cancel. In the remaining factors
referring to localized states, the following four configurations are possible:
No; = 0, N0-i = 0, N o + = 1, ND-l= 0, No+ = 0, ND-1= 1, No+ =
2
464
+ .-2P(E2d-d]
[,-P(%-P)
= f-.
2
(4.20)
The average number of particles occupying the center is the sum of < I$
D;i
>
Using this expression, we will now discuss how the average number of particles localized at a center is influenced by configuration dependencies of
one-particle energies due to Coulomb repulsion. In Figure 4.la this number is shown as a function of chemical potential p for T = 0 K . As long
as p < EA, we have < I? 1 f i D - ~> = 0. If p exceeds EL, then
<
D;i
D5
465
. ElD
E':
21
ED= ED
E:
21
ED+w
(4.23)
466
with
(4.24)
becomes the average number of electrons at the acceptor. The average hole
number at the acceptor follows from this as
(4.26)
Here, the one-electron energy E A of the acceptor state represents the binding
energy E B of the hole at this level.
467
<
C fiDnlnlmu
>
1 + C g D n e -P(ED,-P)
[
I-'
gDne-fl(EDn-p),
(4.27)
This expression may be written in the form (4.24) if the ground state energy
ED' is denoted by E D , and if the factor f multiplying the exponential term
in the denominator in (4.24) is replaced by the temperature-dependent factor
(4.28)
468
(4.29)
The validity of this rewriting may be easily demonstrated by inserting y*
of equation (4.28) into equation (4.29). Omitting the excited states in y*,
considering therefore only the term with n = 1 and g D 1 = 2, we obtain the
value for y*,and the average particle number at the donor center is given
by equation (4.24). Considering the excited states, 7*becomes smaller than
provided the temperature T is larger than 0 K . At T = 0 K , y* retains
the value even in the presence of excited states. This value decreases only
slightly with increasing temperature, as long as kT << [ED, - ED]holds for
all excited states n 2 2. This inequality provides a simple criterion which
helps decide whether or not the excited states are essential in the calculation
of the average particle number at the center. The formal evaluation of the
factor y* for hydrogen-like states with E D , = E, - E g / n 2 and g D n = 2n2
at arbitrary temperature T > 0 K leads to the result y* = 0, so that an
apparently strong influence of the excited states occurs at all temperatures
T > 0 K . In reality, however, the orbital radii U D of
~ the excited states
are limited, if not through other factors such as finite sample size, then
by the adjacent donor atoms - their distances from each other must be
large compared to 2 U D n , so that the n - t h electron shell may be formed
unhindered by the adjacent donors. For a donor concentration of 1015 ern-'
this condition yields 2 u <<~1000
~ A. In silicon, 2UDn equals about 100 A x n ,
so that n must clearly be smaller than 10 and the overall influence of the
excited states for T > 0 K is not so very great.
The factor y* significantly influences the average particle number at a
donor center only for weak occupation of the center, which occurs when
y* exp[P(ED - p ) ] >> 1 or E D- p >> kT holds, whence equation (4.29) yields
4,
4,
Since y* < the average particle number at the center is enlarged as compared to the case without excited states where y* =
The excited states
therefore lead to an increase of the average particle number at the center. This is physically understandable because more states are now available for occupancy than without excited states. If, on the other hand,
p - E D >> kT, i.e. the ground state is almost completely occupied, then
one has y*exp[P(ED - p ) ] << 1, and correspondingly
4.
(4.31)
469
so that one gets the same average particle number at the donor center as
without excited states. This is also physically clear immediately - because
the donor can host only 1 electron, and, with high probability it will occupy
the ground state, so excited states play practically no role.
As in the case without excited states, the results for donors can also be
transferred directly to acceptors but we will not write down the corresponding expressions explicitly.
Density of states
4.3
4.3.1
The total electron system is composed of the valence electrons of all atoms
of the semiconductor. We denote their number by N b t a l . This number
determines the average total particle number < fi > of the system,
<N >=
(4.32)
Nbtal.
i)
i)
i)
470
(4.35)
with f(E)as the Fermi distribution function. The localized part in equation
(4.34) can be written as a sum over the average values < k 1 k7-; > for
7F
the various centers 7,because they are independent of each other, whence,
Using the effective distribution function f*(E)for localized states from equations (4.24) and (4.25) we have
Altogether, we obtain
NtOta1=
2 C f ( E u ( k ) ) Cf*(E,).
uk
(4.38)
Ntotal
0 '
(4.39)
nthl
instead of
Ntotal.
4.3.2
471
(4.40)
Introducing a &function we rewrite the sum on k in the following way:
(4.41)
Substituting this expression in equation (4.40) for n i i l yields
(4.42)
where
pid"'(E) = - C 6 ( E - E , ( k ) ) .
(4.43)
uk
is the density of states (DOS) of the ideal semiconductor considered in Chapter 2 (see equation (2.210). The introduction of the density of states in expression (4.42) for
is significant in that it provides a physically meaningful division of the information needed for the calculation of the average
total particle number and other statistical average values into two parts:
first, information about the possible stationary quantum states of the system is contained in the density of states pidea'(E), and second, information
about the statistical occupation of the states of the system is reflected by
contains
the distribution function f(E).One may also say that p*'(E)
information relating to the dynamics of the system (classically, this would
pertain to the solutions of Newton's equation of motion), and f(E)information about the statistics of the system (classically this would reflect on the
statistical distribution of initial conditions or constants of motion). These
two types of information are largely independent of each other and both are
needed to calculate statistical average values in thermodynamic equilibrium.
nh%i
pc
2m* 3/2
B(E - E , ) d G .
( E )= 2x2 (L)
h2
(4.44)
472
ENERGY
GAP
The DOS of the conduction band thus exhibits square root-like behavior at
the band edge (see top part of Figure 4.2). Formally, it continues to follow
a square root law also for energies deep in the band. However, for such
energies, expression (4.44) does not apply. Qualitatively, the shape shown
by the dashed curve in the top part of Figure 4.2 is more realistic.
Expression (2.220) may not be used directly for the valence band, since its
derivation presupposes that the effective mass m t is positive, while for the
valence band it is in fact negative. The calculation above can, however, be
modified easily to the case m: < 0. With v = v, E,o = 0, and mz -mt >
0, we have
(4.45)
The density of states of the valence band is thus seen to have square root-like
dependence at the upper band edge and formally also continues to increase
according to a square-root law for energies deeper in the band (see bottom
part of Figure 4.2). The dashed curve again shows the more realistic shape.
The entire density of states pideaz(E)of an ideal semiconductor having one
valence and one conduction band is given by the expression
473
sequence of the fact that no stationary electron states exist there (see Figure 4.2). Finally, we write down the total electron concentration n&%: of
the ideal semiconductor having one conduction band and one valence band.
From equation (4.42) it follows that
In the case of the elemental semiconductors Si and Ge, the conduction band
edges consist of several valleys, and the iso-energy surfaces of the valleys
are anisotropic. This has consequences for p,(E). For Si, for example, the
conduction band density of states p p ( E ) takes the form
The valence band edges of diamond and zincblende type materials are degenerate. The degree of degeneracy depends on whether spin-orbit splitting
is important or not. In the case of Si, the spin-orbit splitting energy A is
small in absolute units (44 m e V ) , but still large enough to compete with k T
(25 meV for T = 300 K ) . Thus, for temperatures that are not extremely
high, most of the holes are hosted by the I?$ valence band while the spinsplit I?$ band remains practically devoid of holes. This means that there are
two hole states per k-vector (apart from the 2-fold spin degeneracy), one for
heavy holes, and one for light holes. Thus , the corresponding hole density
of states p p of Si, calculated in the isotropic parabolic approximation of
equation (4.45), is given by the expression
4 74
(4.50)
The conduction band density of states p P ( E ) can be derived from equation (4.50) as follows: First of all, the general relation
p p ( E )=
k 2 S ( E - E,(k))
27r
(4.51)
+4
Kane(E)
Pc
27r2
(-)3
2P2
3/2
(.
.-/
5 - 3)
3
(4.52)
If E lies close to E,, then p p ( E ) given by (4.52) takes the form of expression (4.44) for a parabolic band. To see this, the effective mass rn; has
to be replaced by the expression from Table 2.13. That is to be expected,
because the band structure is also almost parabolic for energies close to the
edge within the Kane model.
4.3.3
475
As in the context of the total electron concentration (4.42) of an ideal semiconductor, we introduce here the density of states pTal(E) of the real semiconductor. The contribution arising in the energy gap is denoted by p E $ ( E )
and we set
must be defined by
+ N A ~ ( E- E A ) .
(4.56)
The densities of states of the valence and conduction bands of the real semiconductor will be denoted by pLd(E) and p:"'(E), respectively. The band
states of real semiconductor are, of course, no longer pure Bloch states, but
Bloch states perturbed by scattering from the impurity atoms. Similar to
pure Bloch states, they are, however, still spread out over the entire crystal. Correlation effects therefore play no important role in their occupancy.
This means that the total electron concentration in the bands of the real
semiconductor may be expressed by p;T'(E) in the same way as the total
electron concentration of the ideal semiconductor was expressed by p Z d ( E )
in equation (4.47). Therefore,
,Teal
total =
1,
0
dE P:az(E)
f(-w
Eg
00
+I,dE
PEW
f*(E) .
(4.57)
The band densities of states pLY'(E) of the real semiconductor differ from
p ; Y ' ( E ) in accordance with the considerations of sections 3.4 and 3.5, where
it was shown that localized states occur at the expense of the band states
476
(Levinson's theorem). The number of conduction band states therefore decreases by the number of donor states, and the number of valence band states
decreases by the number of acceptor states, related to the volume unit in
each case. Thus we have
0
dE pLal(E) =
dE p i k d ( E )
- NA,
(4.58)
--M
(4.59)
The entire density of states
l,
m
dEp'"l(E) =
Jrn
dEpiM(E).
-m
(4.61)
Concerning the band state densities p,':'(E), we initially know, apart from
the integral relations (4.58) and (4.59), nothing further than that it must
reproduce p Z z ( E ) if N D and N A are zero. Below, we will see that it is
possible, under certain conditions, to replace the real state densities p T f ( E )
approximately by the ideal state densities p : p z ( E ) even when N D # 0 and
N A f 0.
In the following, we calculate the average electron concentrations in the
energy bands and the donor and acceptor levels separately. Such separate
calculations of the concentrations are necessary because the electrons in these
different parts of the energy spectrum behave quite differently. The electrons
localized at an impurity atom, for example, do not contribute to electric
charge transport. The electrons of the conduction band do, just as do the
electrons of the valence band, the latter, however, provide such a contribution
in a different sense, not as electrons, but as holes. Therefore, facilitate to
calculation of the conductivity of a semiconductor, one needs the average
electron concentrations in the conduction band, in the donor and acceptor
levels, and in the valence band, separately. The same remarks apply to the
calculation of other measurable quantities.
4.4
477
We start with general considerations which refer to both ideal as well as real
semiconductors. The object is to derive a condition which fixes the total
number of electrons of these systems.
4.4.1
Consider (4.53) for the total electron concentration nL%t of a real semiconductor. In this relation, the total concentration n&%! of the ideal semiconductor may be taken from equation (4.47), which we write down for T = 0 K .
Since the Wrmi level of an ideal semiconductor lies in the energy gap according to the considerations of Chapters 1, one has f(E)= O(EF - E ) for
T = 0 K . With this, relation (4.47) takes the form
0
nzdeal
total
(4.62)
S _ _ d E P"Vd"YE).
This equation says that the total concentration nitg: of electrons equals
the total concentration of valence band states of the ideal semiconductor.
This statement is, of course, independent of temperature. The total electron
concentration of the real semiconductor may therefore be written for any
temperature as
0
n Ttotal
fd
-
1,
d E pid"'(E)
+ N o - NA.
(4.63)
(4.64)
On the other hand, we have relation (4.57) for
p z z ( E ) using (4.56) to obtain
nTzL,
where we replace
(4.65)
From this equation we subtract equation (4.64), h d i n g
(4.66)
478
(4.67)
ND+ + N;
= 0.
(4.71)
This expresses the conservation of total electron number for a real semiconductor. The electrons may be redistributed between the four energy regions,
i.e. valence band, acceptor levels, donor levels and conduction band, but
their total number cannot change thereby. One may also understand relation (4.71) as the neutrality condition for a real semiconductor - the negative
charge of the electrons of the conduction band and of the ionized acceptors
must compensate for the positive charge of the holes of the valence band and
the ionized donors.
4.4.2
We now derive an explicit relation between the concentrations n , p of electrons and holes in the bands, as they are given by equations (4.67) and
(4.68), and the Fermi energy EF. The Fermi energy, in turn, is determined
by the neutrality condition (4.71). In the non-degenerate case of an exponential Boltzmann distribution for f(E), the position of the Fermi level
has no influence on the relative distribution of the electron occupancy over
the various available energy levels, only the total electron concentration depends on EF. However for a degenerate electron gas, i.e. in the regime of
quantum statistics, the Fermi distribution function f ( E ) mandates that the
relative distribution of electrons over the energy levels is also determined by
479
the Fermi energy. In particular, this means that the relative distribution
depends on the total particle concentration. By changing the total concentration, the ratios of the average particle numbers in the various energy levels
can be altered. This is a clear manifestation of the statistical correlation between the electrons of an ideal Fermi gas. It also shows that the chemical
potential, or Fermi energy, is much more important in quantum statistics
than in classical statistics.
In evaluating the integrals of (4.67) and (4.68), we note that the occupancy factors f (E) and [l- f ( E ) ]decrease monotonically with, respectively,
increasing or decreasing energy and effectively cut off the integration range.
The cutoff energies are separated by an interval of about kT from the respective band edges. We assume that the densities of states of a real semiconductor, which are required in equations (4.67) and (4.68) for the effectively
contributing energy intervals, may be approximately replaced by the state
densities of the ideal semiconductor. This approximation is justified as long
as the concentrations of donors and acceptors is small compared to the concentration of significantly contributing band states. Furthermore, we assume
that in the contributing energy intervals, the densities of states (4.44) and
(4.45) of the ideal semiconductor, which are predicated on parabolic band
structure, are approximately valid. Thus we obtain
(4.72)
(4.73)
Defining the Fermi integral as
(4.74)
the relations (4.72) and (4.73) may be written as
(4.75)
(4.76)
with
(4.77)
480
In the frequently occurring case that the Fermi level lies not only in the
energy gap, but is separated by at least several k T from the two band edges,
the arguments of the Fermi integrals in (4.75), (4.76) are negative and have
absolute values large compared to 1. For such arguments the Fermi integral
is approximately given by
(4.78)
so that
n = Nce(EF-Eg)/kT, ( E , - E F ) >> k T ,
p = NVevEFfkT , E F
>> k T .
(4.79)
(4.80)
The same result for n would be obtained if the Fermi distribution f(E)for
the electrons of the conduction band were approximated by the Boltzmann
distribution (4.13) from the very beginning, as is possible for ( E - E F ) >> k T .
An analogous approximation for the electrons of the valence band is not
available, since there E < EF holds. One may apply this approximation,
however, to the holes of the valence band, i.e., to
1
- f(E)= e(E~-E)/rCT+ 1 '
(4.81)
481
(4.84)
which is of the order of magnitude of the number of primitive unit cells per
cm3, i.e. M
~ m - ~
With
.
this number, the order of magnitude of the
effective band densities of states is obtained by multiplying it by the ratio
of k T to the whole band width. With a value of k T M 10 meV and a band
follows.
width of 10 e V , N,,, FZ lo1
The values of n and p determined by equations (4.75), (4.76) still depend
on the Fermi energy E F , which is, as yet, unknown. The equation which
determines E F is the neutrality condition (4.71). In this equation all quantities may be considered to be known except the Fermi energy, for which the
equation is to be solved. Even without any calculation it is clear that the
position of the Fermi level will depend on the donor and acceptor concentrations N D and N A . This dependence is transferred directly to n and p. There
is, however, a simple function of the two concentrations, their product np,
which in the case of non-degenerate carrier systems does not depend on E F ,
and hence not on N D and N A . This may be seen using equations (4.79) and
(4.80), from which it follows that
np = ni,
2
(4.85)
ni = (NcNV)3e-iEg/ICT.
(4.86)
with
482
n,
= 300
K (in ~ m - ~ ) .
GaP
GaAs
Si
Ge
Ids
InSb
2.7 x 10'
1.8 x lo6
1.5 x 10"
2.4 x 1013
8.6 x lOI4
1.6 x 10l6
This equation may be understood as the law of mass action for a 'chemical
reaction' in which bound electrons in the valence band and at the donor
atoms on the one hand, and bound holes in the conduction and at the acceptor atoms on the other hand, generate free electrons in the conduction
band and free holes in the valence band. The concentration of the bound
electrons can approximately be set equal to the effective density of states
Nu of the valence band, since in the non-degenerate case, the concentrations of holes and of occupied donors are small in comparison to N,,. A
similar statement holds for the bound holes which, here, also include the
non-occupied states of the conduction band. Their concentration roughly
equals the effective density of states Nc of the conduction band. According
to the mass action law, one then has, for an ideal gas, n p = K ( T ) N , N , , with
K ( T ) = exp[-(gc
g,,)/,kT]as mass action constant. Here, gc and g,, are,
respectively, the free enthalpies of an electron in the conduction band and
of a hole in the valence band. With gc gv = E,, equation (4.86) is indeed
the mass action law, as stated above.
4.4.3
Intrinsic semiconductors
(4.87)
This equation reflects the fact that the electrons of the conduction band
are created solely by thermal excitation of electrons from the valence band,
leaving behind an equal number of holes. If we substitute p = n in equation
(4.85), then we have
n = ni,
p = ni.
(4.88)
This relation was already stated in Chapter 1 on the basis of empirical facts
(see equation 1.20). One refers to ni as intrinsic concentration, because it
483
1
2
EF=
~ -E,+
1
2
-kTh
(4.90)
1
EF=
~ -Eg
2
3
+ -kT
4
(4.91)
For T = 0 K , the Fermi level thus lies exactly in the middle of the gap
between the valence and conduction bands. It remains there even for finite
temperature if rn; = rn:. If rn; # m:, a temperature dependence emerges
which is schematically shown in Figure 4.4. For rn: > rn; or N , > N,, EF(
moves away from the valence band edge, and for rn; > rn; or Nc > N,,
away from the conduction band edge. This may be understood as follows:
If the effective density of states in the valence band is larger than that in
the conduction band, one would have more holes in the valence band than
electrons in the conduction band if the Fermi level were to stay in the middle
of the gap. It must move away from the valence band edge to assure the
equality of n and p . An analogous interpretation can be given in the case of
Nc > N,. In short, one can say that the Fermi level is repelled by regions of
high density of states, and is attracted by regions of low density of states.
We next determine n and p for the case of real semiconductors with
donor and acceptor atoms present. In Chapter 1, the designation eztrznsic
semiconductors was already introduced to describe them.
484
Figure 4.3: Intrinsic concentrations of several semiconductors as functions of temperature (calculated values).
&T/OC
IO~/T/K-
-.
4.4.4
Extrinsic semiconductors
n-type-semiconduct ors
Initially, we assume that there are only donors, and no acceptors are present,
N D # 0 and N A = 0. The neutrality condition (4.71) is then transformed
into an equation for the electron concentration n. In this regard, we need
~ ED)] of ionized donors. The donor
the concentration Ngf = N D ( population probability f * ( E D ) is given by equation (4.24) as
485
(4.92)
where EB denotes the donor binding energy. In general, the donor levels,
unlike band energies, do not lie well above the Fermi level. They may be
close to, or even below it. This means that the degeneracy of the Fermi
distribution, which could be neglected for carriers in the bands, must in
general be taken into account in the effective donor distribution function
!*(ED). The expression (4.92) for f*(Eo)must therefore be used in its
original form. The Fermi energy can be eliminated from it by introducing
the electron concentration n given in equation (4.79), with the result
(4.93)
(4.94)
Apart from the factor 1/2, the quantity nl represents the electron concentration in the conduction band of a non-degenerate ideal semiconductor with
a (fictitious) Fermi level at the position of the donor level. If we replace p
in terms of nT/n using equation (4.85), then the neutrality condition takes
the form
(4.95)
From this equation one may easily derive the magnitude relations
(4.96)
In a semiconductor containing donors but no acceptors, the electron concentration is always larger than the intrinsic concentration ni, and the latter is
larger than the hole concentration p. It is therefore called an n-type semiconductor. The electrons are majority carriers in this case, and the holes
are minority carriers. To solve the neutrality condition (4.95) for n, we
transform it into the third-order polynomial equation
n3
2
+ nln2 - (ni2 + n l N D ) n - ninl
= 0.
(4.97)
486
1) N D <( ni :
Considering the inequalities (4.96), the term proportional to N D in equation
(4.97) may be neglected in comparison with the second term. Thus we have,
approximately,
n3
+ n1n2 - n f n - ninl
2
= 0.
(4.98)
(4.99)
n = ni.
For p , one gets p = p i . This means that in case l),the carrier concentrations
n and p do not differ from those of an intrinsic semiconductor. If the values of
N D are not extremely large, then the inequality N D <( ni may always be fulfilled, simply by making the temperature and therefore also ni large enough.
For sufficiently high temperatures and not overly large N D values, one thus
always has the intrinsic case. Only if the donor concentration becomes sufficiently large, and the temperature so low that the inequality N D << ni does
not hold, then the donors can influence the carrier concentrations n and p
considerably.
We turn now to the other extreme case
2 ) N D >> ni
Because n > nil one then also has N D n >> n:. Under this condition, the
last term in (4.97) may be neglected in comparison with the term n l N D n .
With this, one has the approximate equation
n2
+ n1n - n l N D = 0.
(4.100)
(4.101)
If N D << n l , i.e. for relatively light doping, but sufficiently high temperatures, it follows, approximately, that
n =No.
(4.102)
All donor atoms are ionized and have donated their additional electrons to
the conduction band. Understandably, for this to occur one needs sufficiently
high temperatures and, maintaining the validity of N D >> nil sufficiently
small donor concentrations. The latter restriction is necessary in order to
487
-5
E
"I
-- I
Id6
INTRINSIC RANGE
SLOPE = Eg
!I
:I
- I
$ E
;
c
lo"=
f
SATURATION RANGE
I
I
\
\nl
1 1
10'3-
1 1
insure that all donor electrons can be hosted by the conduction band. One
terms this case the exhaustion of doping centers or the saturation of carrier
concentration.
If, conversely, N o >> n1 holds, which means very high donor concentrations and low temperatures, equation (4.101) approximately yields
n=
JnTNo.
(4.103)
488
EC
ED
20
40
60
80
T/K
<< n1, i.e. for high temperatures and low doping, the
result
(4.105)
and, for N D >> nl, i.e. for low temperatures and high doping, we find that
(4.106)
The dependence of the Fermi level on temperature is shown in Figure 4.6 for
n-type germanium with typical values of N D and EB. At T = 0 K the Fermi
level lies exactly in the middle between the donor level and the conduction
band edge, the donors being only partially ionized. With increasing temperature the degree of ionization increases. In order for this to be possible, the
Fermi level must shift to lower energies. This may again be understood as a
repulsion of the Fermi level by the group of states with higher density - the
effective conduction band density of states is larger than the donor density
of states, which equals the donor concentration.
p-type semiconductors
489
(4.109)
+ p i p 2 - (ni2 + p i N A ) p - n i2p 1 = 0.
(4.111)
(4.112)
(4.113)
P = & K
(4.114)
if NA << p l , and
490
49 1
n:
n---n
n1
n+n1 N D
nf +pin
N A = 0,
(4.115)
where n and p are to be taken from equations (4.79) and (4.80). The relation
(4.115) can be transformed into a fourth-order equation for n or p. Since
this is not solvable in compact form, we consider certain limiting cases which
allow for simple solutions and provide some physical insight. Under the
assumption n << n1, p << pl we find, approximately,
n2
- ni2 - n ( N o + N A ) = 0.
(4.116)
492
Figure 4.8: Calculated temperature dependence of the electron concentration of partially compensated
n-type germanium. ( E B = 10 meV,
mt = 0.25 m ) . An electron concentration N D - N A = 10l6
is
assumed. The various curves correspond to different degrees of com3pensation N A : 1- 0, 2 loL5, 4 - loL6
(After Bonch-
t OI6
E
<
1015
C
104
6 8 1012
IOOT-/K-L
p = (NA- No),
n=
n?
(NA- N D )
if
N A > ND.
(4.119)
4.4.6
493
additional term N D j / ( n n j ) into the neutrality condition, and each additional type of such an acceptor (described by an index k ) gives rise to an
additional term NAkn/(n: n p k ) . The neutrality condition for this more
general case therefore takes the form
n2
n - 2 - z ~ D j - nj
+ ) : N A ~ zpkn
= 0.
n
j
n+nj
ni + P k n
(4.120)
(4.121)
with y* given by equation (4.28).
Formally, the occurrence of excited bound states modifies the neutrality
condition (4.97) in that nl is replaced by
61 =
y*2n1.
(4.122)
(4.23)
in the neutrality condition (4.111). Now, j1 > p l holds, which means that
the number of free holes is increased. This is immediately clear because
494
the acceptors bind fewer holes if there are excited electron states at the
acceptors.
n- p -N
D ~= 0,
(4.124)
one needs the hole concentration Ngf localized at the donors. It is given by
the relation
(4.128)
With this result (and using n!/n for p ) , the neutrality condition (4.124)
takes the form
495
(4.129)
It may be transformed in a fourth-order equation for the determination of
n. Because the difficulty of solving this in closed form, we again consider
only limiting cases. Generally, the doping concentrations ND >> ni should
<< na,
which means that the doping should not be too heavy, nor the temperature
too low, then (4.129) approximately yields
(4.130)
n =~ N D .
In this case, all donors are 2-fold ionized and one has the case of exhausted
doping atoms. To fulfill the condition n << n7, it is necessary that N D <<
7x7. The exhaustion of a doubly ionizable donor is thus characterized by
an inequality similar to that which formerly was found for simply ionizable
donors .
In the limiting case
n >> n;,n$
which means sufficiently strong doping and not-too-high temperatures, equation (4.129) approximately results in
(4.131)
Here, we have set n* = na2/n7 for brevity. The positive solution of this
equation is given by
n = n*
JqI]
(4.132)
With the assumption N D << n*, it yields n = N o , i.e. each donor atom is
simply ionized. One has exhaustion of the doping centers with respect to
1-fold ionization, and reserve with respect to 2-fold ionization. This result
can be understood in the following term: In order to fulfill the conditions
ND << n* and ND >> n7 simultaneously, n* >> ni and with this also nz >> nT
must hold. This means that n* << n$ and ND << na, so that n? << N D << na
496
follows. The condition for donor exhaustion is therefore fulfilled for the
2-fold occupied center, but not for the 1-fold occupied center.
A different situation occurs in the case ND >> n*, in which one finds
from (4.132), approximately,
n=
Jm.
(4.133)
Because nT < n4, the condition ND >> n* also insures that ND >> na. This
means that the reserve case also occurs with respect to the 2-fold occupied
center. In comparison with a one-particle center having an ionization energy
E B equal to the ionization energy EB of the simply occupied two-particle
center, the carrier concentration due to the two-particle center is enlarged by
the factor 2(n$/n;). The 2 is a consequence of the doubling of the electron
number at the non-ionized donors. The factor ( n ; / n ; ) , which exceeds 1,
reflects the fact that the ionization of the 2-fold occupied center has a larger
probability than that of the simply occupied center.
p-type semiconductors
+ 2PIP + P2
(4.134)
with
(4.135)
Here EL and Eil refer to electrons (rather than holes) and denote, respectively, the ionization energies of the acceptor with, respectively, 2 holes (no
electron) or 1 hole (1 electron) at the center. Because Eil > EL,one always
has pT > pz. The neutrality condition will again be solved in limiting cases.
In these cases the acceptor concentration NA will always be assumed to be
large compared to the intrinsic concentration p i = ni of holes, so that the
intrinsic term in (4.134) can be neglected. We first assume that
p
holds, which means not-too-strong doping and low temperatures. The neutrality condition then approximately yields the relation
P
2NA,
(4.136)
497
Figure 4.9: Average electron number q at an Au-atom in Ge as a function of the position of the Fermi level
(After Bonch-Bruevich and Kalashnikov, 1982.)
P = $%lP;,
(4.138)
(4.139)
All acceptors are simply ionized, and there is exhaustion of the acceptors
with respect to 1-fold ionization, and reserve with respect to 2-fold ionization.
Amphoteric doping centers
498
this, on the position of the Fermi level (see Figure 4.9). The latter is codetermined by the concentration of Au atoms, and it also depends, however,
on the concentrations of other donors and acceptors if there are such. In
p-type doped germanium, Au acts as donor because, in this case, the Fermi
level lies just above the valence band edge so that the charge state Au(l+)
occurs. In n-doped germanium, Au forms a double acceptor due to the Fermi
level position lying slightly below the conduction band edge, which results
in the charge states Au(1-) and Au(2-).
499
Chapter 5
Non-equilibrium processes
in semiconductors
In the preceding chapter we considered semiconductors which were spatially
homogeneous in a macroscopic sense and were not subject to external perturbations or fields. The thermodynamic equilibrium state of electrons in
such semiconductors is characterized in terms of thermodynamic variables,
in particular a temperature and a chemical potential, which do not depend
on space and time coordinates. Semiconductors differ from other materials inasmuch as they can easily be driven out of equilibrium. The electron
temperature and chemical potential may then differ relatively strongly from
their equilibrium values, provided such state variables do exist at all. If this
is the rase, they may depend on space and time coordinates. In addition,
there may be relatively large electric fields and currents. Relatively means
with respect to other solid state materials. In metals, by way of comparison,
macroscopic electric fields practically cannot exist, because they are almost
completely screened out over distances of about 1 A by the huge number
of mobile electrons available in these materials. The same may be said for
spatial variations of the chemical potential in metals, which also practically
cannot occur because of the large electron concentration.
Semiconductors have the unique property that external manipulations
can easily bring them into states which deviate strongly from thermodynamic equilibrium. It is on this unique property that essentially all applications of semiconductors in electronic and optoelectronic devices ultimately rest. The semiconductor photoresistor, for example, works because
non-equilibrium electrons and holes are generated by irradiation with light,
greatly changing the resistance of the semiconductor sample. In a field effect
transistor, the resistivity of a particular semiconductor region is changed by
means of an external voltage, and in a light emitting semiconductor diode,
non-equilibrium minority carriers, created at a forward biased pn-junction,
500
recombine with the equilibrium majority carriers accompanied by the emission of light. While the latter two examples refer to semiconductor junctions,
the following considerations will initially deal with bulk semiconductors, i.e.
semiconductors with spatially constant doping and chemical composition.
5.1
Boltzmann equation
We will write this equation down explicitly for the extended one-electron
excitations of the system, i.e. the Bloch states uk. The distribution function
f(uk, x,t ) in this case is the probability that, at a position x and at a time
t , the Bloch state having quasi-wavevector k and band-index u is occupied
by an electron. Since, in reality, the Bloch states are infinitely extended, the
specification at a position x must be understood in the sense of a region
centered at x, whose spatial extension A is large compared to the electron
wavelength X = 2 n / J k J .For the concept of a spacedependent distribution
function f(vk, x, t ) of Bloch states to make sense, f(vk, x, t ) must change
very slowly over a length of the order of magnitude A. A similar condition
holds for the variation of the distribution function with time.
In the absence of external perturbations of the type which drive a current
or energy transfer, the system of Bloch electrons is in thermodynamic equilibrium, such that it may be described by the Fermi distribution function
(4.13) with E = E,(k). To drive the system out of equilibrium, appropriate
external perturbations must be applied, such as particle density gradients or
an external electric field E(x, t ) , as we will assume here. In addition to external perturbations, internal perturbations also act on the Bloch electrons,
such as scattering by phonons and impurities. The term collisions is used
in this context which means that the scattering events take place in a very
short time, practically instantaneous. Both types of perturbations, external
ones and collisions, result in temporal changes of the distribution function,
denoted as (df(vk, x, t ) / d t ) e z t and (df(vk, x, t ) / d t ) , d l , respectively. According to a general theorem of statistical mechanics, the Liouville theorem, the
total change o f f , taken at a position x(t),iik(t) of phase space which moves
together with the particles, must vanish. Thus,
Performing the derivative, the first term of this equation may be written as
[i+
iVkE,(k) . V, - eE(x,t ) .
Cdl
The collision term is in general an integral over products of two distribution functions, one of which, f(vk, x, t ) , involves the same Bloch state vk
occurring on the left hand side, but the other, vk, involves any Bloch state
which can participate in a collision. The characteristic difficulties of the
502
Boltzmann equation are mainly due to its collision term, a statement which
applies both to the derivation as well as to the solution of this equation.
Lets suppose that the Boltzmann equation has been solved. Then, as in the
case of equilibrium, one may use the known distribution function f(vk, x,t )
to determine statistical average values < A > of any one-particle quantity
A of the many-electron system. Such quantities represent sums over corresponding individual electron quantities a. Like the distribution function
itself, the average values are functions < A > (x,t ) of space and time coordinates, such that
(5.5)
uk
In addition to the physical quantities which are of interest in both equilibrium and non-equilibrium, such as the electron concentration and the
energy density, non-equilibrium also involves other quantities to be determined, in particular, currents related to equilibrium quantities such as, for
example, particle or energy currents. The latter differ from zeroonly in
non-equilibrium, and thus are of no interest in equilibrium. Moreover, those
quantities whose average values do not vanish in equilibrium must also be reconsidered, because under non-equilibrium conditions they exhibit temporal
and spatial variations which are not present in equilibrium. As an example,
one may take the total number of free electrons in a spatially homogeneous
semiconductor sample. Under non-equilibrium conditions this number may
have larger or smaller values than in equilibrium. If one withdraws external
influences, equilibrium and its parameter values will be restored by nonequilibrium processes.
The two classes of phenomena, the appearance of currents, on the one
hand, and, of time and space changes of quantities which are non-zero but
constant in equilibrium, on the other hand, constitute the totality of nonequilibrium processes. For semiconductors, both types of processes are important. Their theoretical description can be based on the Boltzmann equation - one solves this equation and then employs the resulting distribution
function f(vk, x,t ) to calculate the average values of the various quantities
of interest. Alternatively it can be based on empirical equations for the average currents and the temporal or spatial change rates. We refer to these
as phenomenological equations. Ohms law for electrical conduction and the
first and second Ficks laws for diffusion are examples.
tions. The latter can be derived from the Boltzmann equation with additional assumptions, but the reverse, i.e. the derivation of the Boltzmann
equation from the phenomenological equations, is not possible. The transition from the Boltzmann equation to phenomenological equations represents,
essentially, a coarse graining of the length and time scales on which the nonequilibrium processes proceed. If processes must be analyzed on the space
scale of the average mean free path length and the time scale of the mean
free flight time, then Boltzmanns equation is required. If a macroscopic
average description suffices, one can use the phenomenological equations. In
semiconductor physics, the latter case often applies. Accordingly, we will r e
strict our considerations to the phenomenological equations in our treatment
of non-equilibrium processes.
These equations are valid only within prescribed limits. One can consider these limits, like the equations themselves, to be empirically given.
However, deeper insight can be gained by using Boltzmanns equation. The
question to be answered in this context is: Under what assumptions can
the phenomenological equations be derived from the Boltzmann equation?
The answer depends on the phenomenon under consideration. Equations as,
for example, that for the temporal change rate of a given physical quantity,
may be obtained by multiplying the Boltzmann equation by the value of this
quantity in a Bloch state vk and subsequently summing over all states. The
simplest equation of this type is that for the temporal change rate of the free
carrier concentration of a single band, which later will explicitly be written
down. To obtain this equation from Boltzmanns equation in the manner
indicated above, simplifying assumptions are necessary concerning the quantum mechanical transition probabilities between bands and localized states
occurring in the collision term: These probabilities must be approximated
by their values in thermodynamic equilibrium.
In the derivation of phenomenological equations for particle currents,
to consider another example, one assumes that the non-equilibrium state
deviates only slightly from a spatially and temporally local equilibrium state.
In the latter state, the distribution function depends on the Bloch states only
through their energies, and the form of the dependence is that of the Fermi
distribution function f ( E ) , just like in global equilibrium. However, the
are understood
temperature T and chemical potential p occurring in f(E),
as functions T(x,t ) and p(x, t) of space and time coordinates. Thus we have,
This particular non-equilibrium state may only be assumed if the temperature and chemical potential are still well defined quantities, albeit in a local
sense, not globally. For this assumption to be valid, adjacent parts of the
504
system with measurably different values of temperature and chemical potential are effectively decoupled from each other spatially and temporally. From
the microscopic theory of non-equilibrium processes it is known that spatial
decoupling (in the statistical sense of this term) occurs over a distance of
the order of magnitude of the mean free path length l f , and temporal decoupling occurs over a time interval of the order of magnitude of the mean free
flight time t f . The characteristic lengths of the spatial inhomogeneities of
T(x, t ) and p(x, t ) may be expressed by the ratio TIIVTI and p/IVpL(, and
the characteristic time of the temporal inhomogeneities of these quantities
is given by T/laT/atl and p/lOp/OtL(. In order for local equilibrium to be
approximately realized, the conditions
If << TIIVTI, P I P P I ?
and
tf << TllaTlatl, P l l a P l a t l
(5.7)
must be satisfied. Typical values for the mean free path length of electrons
and holes in semiconductors are of the order of magnitude 100 A. This means
that local equilibrium is a reasonable zero order description if important
spatial inhomogeneities occur on or above a length scale of 1000 A = 0.1 p m .
The mean free flight time has typical values close to
s. Temporal
changes with frequencies of 10l2 s- can therefore still be accepted within
the local equilibrium framework, although no light frequencies are allowed
The regime of validity of the local
which lie in the region of 1014 s-.
equilibrium assumption is nevertheless still relatively broad.
If the local equilibrium distribution were to describe the non-equilibrium
state exactly, no currents would flow at all since the corresponding average
values would be zero. In the presence of currents, the local equilibrium
distribution function can, therefore, only be a zero-th approximation. It
must be corrected by a perturbation term fl(vk, x, t ) , such that one has
with respect to the electric field E(x, t ) . The vector g(vk, x, t ) is understood
to be field-independent. It may be calculated from Boltzmanns equation in
linear approximation with respect to E. For this approximation to be justified, the field must be sufficiently weak. This represents a second condition
for the validity of Ohms law.
505
The above derivation of this law yields not only a phenomenological equation,
but also an expression for the electrical conductivity involved in it. It relates
this coefficient with characteristic microscopic quantities of the system such
as, for example, its band structure, as well as with external state parameters
like temperature and chemical potential. This is to say that derivation from
the Boltzmann equation yields the phenomenological equation, Ohmss law,
and, simultaneously, a microscopic theory for the conductivity, or, if other
phenomenological equations are considered, for the other respective kinetic
coefficients. If the phenomenological equations are taken as empiricaI laws,
these coefficients are material parameters whose magnitudes and temperature dependencies can only be determined from experimental investigations.
However, by means of Boltzmanns equation the kinetic coefficients can be
predicted. Without the use of such an equation, this opportunity is lost. Beside the somewhat lower precision of the phenomenological equations, due
to simplifications and approximations, the principal loss lies in the inability
to predict the kinetic coefficients, given only phenomenological equations instead of Boltzmanns equation. For the purposes of this book, where we do
not intend to develop a detailed understanding of the magnitudes and state
dependencies of material parameters, we will accept this loss and use the
phenomenological equations to gain insight into non-equilibrium dynamics
quickly.
In the next section we present, first of all, a systematic survey of the nonequilibrium processes which are essential in semiconductors. More detailed
treatment of each process is provided in following sections.
5.2
In this case, all state quantities, particularly the charge carrier concentrations n and p , depend only on time, and their values are different from those
in thermodynamic equilibrium. The latter will be denoted by no and po
henceforth, so that
(5.10)
506
with
(5.11)
Temporal changes of n and p occur because electrons in the conduction
band and holes in the valence band are generated or annihilated. If the
corresponding generation or annihilation rates are denoted by, respectively,
(an/at)gmer and (an/at)annihal as well as (aP/ at)gmer and (aP/at)annihal,
we have
(%)
(5.12)
With the right hand sides known, these equations determine the relaxation
of non-equilibrium charge carrier concentrations.
5.2.2
Spatial inhomogeneity may be introduced in various ways. Firstly, we consider the case in which a homogeneous electric field exists while the charge
carrier concentrations n and p do not depend on x:
1) n = no, p = p o , E = c o n s t . # 0.
The potential energy of an electron is eE . x in this case, i.e. the system
is in fact spatially inhomogeneous despite its homogeneous charge carrier
concentration. An electric current will flow in this case. This phenomenon
is commonly referred to as drift, and the current itself is called drift current.
The pertinent current density is spatially homogeneous and decomposes into
a current density for electrons, j,, and another one for holes, j,. With nn and
a, as conductivities for electrons and holes, respectively, Ohms law yields
j, = onE, j, = apE.
(5.13)
Secondly, we also admit spatial inhomogeneity of the charge carrier concentrations n , p . Then the electric field also becomes inhomogeneous. Accordingly, we consider the case
507
total electron and hole current densities j, and j, obey the phenomenological
equations
+ eD,Vn(x),
(5.14)
(5.15)
jn(x)= o,E(x)
[.(XI
(5.16)
s,
d3xp(x) = 0.
(5.17)
The electric field E(x) due to the charge distribution p(x) follows from the
pertinent potential p(x) by means of the relation
E(x) = -Vp(x),
(5.18)
1
V2p(x) = --p(x)
(5.19)
=0
508
5.2.3
Here, the generation and annihilation of charge carriers is also possible because of the fact that there are sources and sinks of the current densities j n
and j,. This follows from the continuity equations for j n and j , , where the
rates of change of the charge carrier concentrations due to relaxation enter
as source terms. Thus. we have
j p ( x ,t ) = u p E ( x ,t ) - ~ D , ~ P (t )x ,
(5.23)
Diffusion
5.3
509
5 10
free electron, while the transition of a conduction band electron into a deep
level only removes a free electron. A free hole, which could vanish, does not
exist in the latter case. If free electrons and holes are simultaneously generated or annihilated in equal numbers, we speak of bipolar generation and
annihilation processes. If only one type of free charge carrier is generated or
annihilated, we have unipolar processes. We first treat generation processes,
including uni- and bipolar ones.
5.3.1
Generation processes
To excite free charge carriers from deep levels or from bands, an appropriate
amount of energy must be provided. That can be done by exposing the
semiconductor to light or electron beams, and in many other ways. The
creation of free carriers by applying an external voltage to a semiconductor junction, so-called injection, is excluded here because we are confining
our attention to spatially homogeneous semiconductors. Injection is, by its
nature, a unipolar process and it will be treated later. Of the remaining
processes, excitation by means of light will be chosen as a representative
example. We refer to it as optical excitation, and other excitation processes
can be described in similar terms.
Unipolar optical excitation processes of electrons and holes are illustrated
in Figure 5.1. An electron from a deep center, which in thermodynamic equilibrium is in its ground state, is raised into the conduction band by a photon
of proper energy. As a result, a free electron appears in the conduction band.
Alternatively, an electron from the valence band makes a transition into a
deep level with the absorption of a photon. A free hole is generated in the
valence band as a result of this process. We denote the number of photons
absorbed per volume and time unit by, respectively, gn and g p . Then the
rate of unipolar-generated carriers may be expressed as
(g)
= gp.
=gnI
unip.gen.
(5.25)
unip.gen.
In the case of bipolar generation by means of light (see Figure 5.1) one has
gn = g p = g , so that
(5.26)
To estimate the magnitude of the optical generation rates, we consider the
bipolar case. With I as light intensity and a as the absorption coefficient
associated with the excitation of electrons from the valence band into the
conduction band. we have
9=
(E).
(5.27)
511
a)
.
.
.
.
.
Figure 5.1: Unipolar generation of electrons (a), holes (b), and bipolar generation
of electron-hole pairs (c).
For light intensity of 1 Wcrn-, photon energy hw = 1 e V , and absorption
coefficient lo3 ern-, which is not too large for the absorption edge of a
semiconductor, we obtain g = 6 x loz1 ~ r n - ~ s - . This indicates that optical excitation represents a very effective generation process. The size of
g does not tell, however, how large the steady value of the free charge carrier concentration will actually be. This value also depends on the speed of
annihilation processes.
5.3.2
As outlined above, free charge carriers are annihilated when they make a
transition into deep levels. One refers to this process as the capture of free
charge carriers by deep centers (see Figure 5.2). We first consider the capture
of electrons, and assume that the deep center D can bind one and only one
more electron than it has in its ground state. If the center is neutral in its
ground state, this means that the D(-1/0) donor level lies in the gap, while
the 0 ( - 2 / - 1) level does not appear there. For brevity, the ground state of
the deep center will be referred to as unoccupied, and the state with one
electron more at the deep center as occupied. The ionization energy of the
occupied center is denoted by Et.
Electron capture
After an electron of the conduction band is captured by a deep center, there is
a finite probability that it may be re-emitted back into this band. The anniis therefore the net capture rate (dn/dt),pture,
hilation rate -(an/dt),,,ihii
defined as the negative of the difference between the gross capture rate C,
and the emission rate En, i.e.
5 12
.
.
.
.
.
.
.
.
.
-Y
EP
(5.28)
The capture rate C , is, on the one hand, a function of the concentration of
free electrons available for capture, i.e. of n. On the other hand, it depends
on the concentration of deep centers available for a capture. If we denote
by N t the total concentration of deep centers, and by f the probability that
they are already occupied by an electron, then the concentration of available
centers is Nt(1 - f ) . Thus,
c, = c,n(l
- f)Nt.
(5.29)
The proportionality factor c, is called capture coefficient. It has the dimension cm3s-' and describes the capture capability of an unoccupied deep
center.
The emission rate En depends on how many deep centers are already
occupied by electrons, i.e. on Nt f . The concentration of existing free electrons has no influence on En, since the number of unoccupied states in the
conduction band which must host the emitted electron is relatively large.
Therefore,
En = en f Nt.
(5.30)
($)
= - [cnn(l- f ) N t - e n f ~ t ~ .
(5.31)
capture
Consider first the special case of thermodynamic equilibrium. In this state all
macroscopic quantities are independent of time, so that (dn/bt)capture = 0
holds. The concentrations n , p take their equilibrium values no and PO, and
513
the same holds for the occupation probability f of the deep level, which is
given by
(5.32)
where -ytn denotes the ratio of the statistical weights of the unoccupied and
occupied center. Equation (5.31), with vanishing left hand side, yields a
relation between en and cn in thermodynamic equilibrium, namely
with
(5.34)
The quantity nt has a structure similar to that of the concentration nl
previously introduced in Chapter 4. It denotes, apart from the factor Ytn,
the concentration of free electrons in the case when the Fermi energy lies
exactly at the deep level. For an n-type semiconductor, EF is actually found
to be much closer to the conduction band, i.e., nt is, in general, substantially
smaller than the equilibrium concentration no.
The relation (5.33) between emission and capture coefficients is a consequence of the equilibrium between the deep level and conduction band.
The term detailed balance is used in this context, since the two levels are in
equilibrium already themselves.
If there are deviations from equilibrium, the net capture rate (5.31) is
not fully determined inasmuch as the occupation probability f of the deep
center has not yet been further specified. In the general case, this probability
will also deviate from the equilibrium value fo. The exact value which it will
have cannot be predicted a priori. One can only write down an equation
for its temporal rate of change. With the assumption that the exchange of
electrons occurs only between the deep level and the conduction band, this
equation reads
(5.35)
If we also assume that beyond a particular point in time t o no further generation of free charge carriers will take place, but only annihilation processes
will occur through capture , we have
(g) (g)
=
for
t > to.
(5.36)
capture
The relations (5.35) and (5.36) together form a system of non-linear differential equations, which can be solved exactly in closed form. However, we will
514
not exhibit this solution here, restricting our discussion to a linear approximation. In this, we assume that the functions n and f deviate only weakly
from their equilibrium values. Formally, this means that n = no An and
f = f o A f with lAnl << no and lAf I << 1, whence
(g)
capture
= - c n N [-(nt
+ no)Af + An(1- f o ) ] .
(5.37)
Bilinear terms were neglected in this derivation, and equation (5.33) was
used. We consider that the generation of non-equilibrium electrons before
time t o arises from an initial equilibrium distribution with subsequent excitation of electrons from deep levels into the conduction band. The resulting
total concentration of electrons in deep levels and in the conduction band
jointly have the equilibrium value no Ntfo at all points of time t before
and after to. Therefore
An
+ NtAf = 0.
(5.38)
(5.39)
follows from (5.37), where for the sake of brevity we have defined
7,
1
cn[nt
+ no + N t ( 1 -
f o ) ]
(5.40)
(5.41)
where An0 = An(t = t o ) has been used. This result demonstrates that rn
represents the decay time of a non-equilibrium charge carrier distribution.
One also calls rn the lifetime of non-equilibrium electrons with respect to
capture at deep centers. In the case of completely unoccupied deep centers,
i.e. at fo = 0, and for very high concentrations Nt, more exactly for Nt >>
nt no, rn is given by
1
I
rno =
-.
CnNt
(5.43)
515
In accordance with its meaning, rn0 is the time interval in which the totality
of Nt unoccupied deep level per cm3 capture one non-equilibrium electron
from the conduction band. One therefore calls r,,o the capture time. The lifetime r,, and the capture time rno differ because of the occupation dependence
of the capture rate and the occurrence of reemission. If N t ( 1 - fo) >> nt+no,
then one has r,, > rno. This result is immediately understandable because
the lifetime r,, contains contributions only from unoccupied deep centers,
whereas all deep centers, including occupied ones, enter in rno.
Hole capture
As in the case of electrons, deep centers can also capture and reemit holes.
The capture of a hole means that a deep center, which in its ground state
binds a certain number of electrons, transfers one of these electrons to the
valence band and annihilates a hole there. The emission of a hole is to
be understood in the sense that a deep level captures an electron from the
valence band (see Figure 5.1). All considerations involving the capture and
emission of electrons, and all relations derived for these processes, can be
immediately transferred to holes. The capture and emission rates C p and
E p are given by the expressions
In this, cp and e p denote the capture and emission coefficients for holes.
They are related to each other by the equation
ep
= PtCp
(5.45)
(5.46)
The meaning of ytp is analogous to that of y h . The hole lifetime rp obeys
the relation
1
rp =
Cpbt + P O
~~0
+ Ntfol
'
(5.47)
(5.48)
516
Table 5.1: Experimental capture cross sections for particular deep centers in Si
cm2.(Data compiled from various sources.)
and Ge at T = 300 K in units
Si
Center
on
Au(l+)
35-63
Au(0)
1.7-5
Au(1-)
140)
UP
10-110
0.1
In(l-)
170
Cu(1-)
0.1-0.18
500
Cu(2-)
0.0036-0.3
1-3.6
Bi(0)
0.025
Ni(0)
1-8
Bi(1-)
0.6
Ni(1-)
3-6
Bi(2-)
0.6-1
Ni(2-)
Fe(l+)
16
Fe(0)
WO)
3-7
10
Zn(0)
Zn(1-)
0.0001-1
Ga(0)
< 0.01
1000
5 - 20 x 1000
1-10
Fe( 1-)
30
Fe(2-)
100
Zn(1-)
< 0.001
Ga(0)
< 0.01
300
(5.49)
Experimentally determined capture cross sections for some deep centers are
listed in Table 5.1.
517
Finally, we present a short discussion of the microscopic mechanisms responsible for the capture of free charge carriers at deep centers.
Capture mechanisms
We restrict our considerations to electrons, as the transfer of the discussion
to holes is evident. To understand capture processes, the first question that
has to be answered is how an electron, during its transition from the conduction band into the deep level, can transfer away its surplus energy. There
are essentially three possibilities. Firstly, the energy can be emitted as light,
secondly, it can be transferred to the remaining electrons of the semiconductor, and thirdly, it can be transferred to the lattice. The last case is the most
important and most universal. The transfer of electron energy to the lattice involves the interaction of the electrons with phonons. This interaction
was ignored in Chapter 2 by treating the interacting system of electrons and
atomic cores within the adiabatic approximation. In order to account for the
electron-phonon interaction, one must go beyond this approximation. Moreover, it must be recognized that one single phonon is insufficient to make
possible the transfer of energy during capture, and many phonons must
be emitted simultaneously. This is termed a multi-phonon process. Consequently, a perturbation theoretical treatment of the electron-phonon interaction is not adequate for this purpose, and one must employ non-perturbative
calculations of the capture coefficients due to multi-phonon processes at deep
centers (see Peuker, Enderlein, Schenk, and Gutsche, 1982).
5.3.3
5 18
one hole in the valence band. Auger recombination is important mainly for
high charge carrier concentrations, since its probability is determined by the
product of the concentrations of the three carriers involved. Thirdly, the energy released during recombination can be transferred again to the lattice.
This direct band-band recombination with emission of as many phonons
as needed to match the energy gap, proves to be extremely unlikely. The
reasons for this are, on the one hand, that the number of phonons which
would have to be emitted simultaneously is very large, and secondly that
the electron-phonon interaction for electrons and holes in band states is relatively weak. Recombination processes in which deep levels are involved
as intermediate states for an electron making the transition from the conduction band to a hole in the valence band are much more effective. This
mechanism is called Shockley-Read-Hall recombination Auger recombination and Shockley-Read-Hall recombination together are jointly referred to
as non-radiative recombination processes. Because of its dominating role in
electronic devices, we treat Shockley-Read-Hall recombination below in more
detail.
S hockley-Read-Hall recombinat ion
519
Figure 5.3: Capture and emission of an electron and a hole at the same deep center.
Figure 5.4: Effects of deep levels for differing relations between the capture and
emission rates for electrons and holes.
would be impossible. Since the capture of an electron from the valence band
is equivalent to the emission of a hole to this band, the first condition to be
satisfied may be expressed as Cn >> Ep. On the other hand, the electron
captured from the conduction band must make the transition to the valence
band more quickly, or, in other words, the deep center must capture a hole
from the valence band more quickly, than the time it takes the electron captured by the deep center can be re-emitted back to the conduction band.
Thus C p >> En must hold. If different order of magnitude relations exist
between the four rates, the deep center can no longer function as a recombination center, but has other effects (see Figure 5.4). We summarize these
in the following survey: The deep center works as
a)
recombination center
for
Cn >> E p , C p >> En
b)
for
C, >> Ep,
C p << En
c)
for
Cp>>
En,
C,<< Ep
d)
generation center
for
Cp
Verification of the three last cases, which were not explicitly discussed above,
520
is left to the reader. With the background of these considerations, the unipolar annihilation of charge carriers at deep centers appears in a new light. It
is a special case of the more general situation considered here, in which the
deep center can capture and emit both electrons and holes.
Thus far, recombination through deep centers has been considered only
qualitatively. To exhibit a quantitative treatment, we present equations
for the temporal rates of change of the electron concentration n, the hole
concentration p, and the occupation probability f of the deep center. These
equations are generalizations of the relat,ions which were already employed
above in describing the capture of carriers by deep centers. Here, they read
(5.50)
(g)Tmb(Z)
-
recomb
+Nt
(2)
= 0.
(5.52)
By means of relations (5.33) and (5.45) between, respectively, en, cn and ep,
cp, and using notation rno and rPo,introduced above, we have
(5.53)
1
= - [(l- f ) P t - fPl
(%)recomb
(5.54)
TPo
(g)
(5.55)
= 0,
=-(a)
recomb
recomb
(5.57)
52 1
n p - ntpt
+ + Tnoo) +P t )
~ p ~ ( nnt )
(5.59)
The relation
CI
(5.60)
with
p=po+Ap
(5.63)
An#O, A p = O .
It follows that
An
R=-
(5.64)
Tn
with
7,
= no+ntTpO I Po+Pt,nO
(5.65)
PO
PO
as the recombination lifetime of non-equilibrium electrons. In the case of
a n-type semiconductor, i.e. if p o << p t << no holds, then T n >> ~ p o , ~ n o ,
which means that the recombination lifetime is substantially longer than
the capture times for electrons and holes. This result is due to the fact
that the holes necessary for recombination of the non-equilibrium electrons
~
522
An=O,
(5.66)
with
rp =
no
~
+ nt rpo + PO+ pt
(5.67)
TnO
n0
n0
as the recombination lifetime of non-equilibrium holes. For an n-semiconductor one has rp = rPo,and for a p-type semiconductor rp >> rno, rpo.
The above results for the case of unipolar excitation may be summarized
by stating that the recombination lifetimes of minority carriers are substantially smaller than those of majority carriers - provided the deviation from
equilibrium is weak.
Finally, we consider the case of bipolar excitation of non-equilibrium
carriers. In this case,
An = Ap.
The recombination rate is now given by
(5.68)
with
(5.69)
and
r = rpo(no nt)
+ T,O(PO + P t )
(5.70)
no +PO
For an n-type semiconductor one has r = rpo, and for ap-type semiconductor
r = r,~. The recombination lifetime therefore equals the capture rate for
minority carriers. For an intrinsic semiconductor, we have no p o = Pni, so
that the recombination lifetime in this case is given by the expression
(5.71)
523
Material
r, (s)
ozuiv.(cm2)
ni
Si
Gap
indirect
3.9 x lo4
1.5 x i o 1 O
Ge
indirect
0.4
2.4 x 1013
GaP
indirect
3.4 x 10l2
2.7
10-l~
ioP2
5x
5 x 10-21
GaAs
direct
5.4 x lo2
1.8 x lo6
InP
direct
5.8 x 10
8.2 x lo6
10-15
InSb
direct
7.3 x 10-7
1.6 x 1016
10-17
In Table 5.2 typical values of the lifetimes q for some semiconductors are
listed. It should be noted that there is a large variation of q , between
3.4 x 10l2 s for the indirect wide gap semiconductor GaP and 7.3 x
s
for the direct small gap semiconductor InSb. For extrinsic semiconductors,
we have no + P O > 2 n i , so that T < q. This means that the recombination
lifetime through a particular deep center reaches its maximum value when
the semiconductor is in its intrinsic state. Doping reduces the recombination
lifetime of bipolarly excited non-equilibrium carriers.
5.4
Drift current
524
per second, according to equation (3.110), equals the force impressed by the
field on a Bloch electron. whence
[& <
mkkn
>
-eE.
(5.72)
field
In addition to the electric field there are frictional forces acting on the Bloch
electron which arise from the semiconductor itself. The interactions of an
electron with lattice vibrations and impurity atoms or defects result in scatterings which act to brake its acceleration along E, producing a frictional
effect similar to that in a viscous medium. Often, this frictional resistive
force grows stronger in proportion to the average electron speed, so that one
may put the corresponding rate [$ < mkxn
of momentum change per
second due to collisions with phonons and crystal imperfections, proportional
to the average momentum, whence
>Iez
(5.73)
with l / r i as proportionality factor. If there is a momentum excess at initial
time t = 0, < m;xn(0) >, the solution of this equation reads
(5.74)
whence
d
- < mkxn
dt
> = -eE-
- < mn
* kn
>.
(5.76)
r
;
For steady state dc current flow, the left hand side of this equation vanishes
and the concomitant constant value of the average velocity is
525
(5.77)
which is referred to as the drifi velocity.
For a field strength of 1 Vcm- the drift velocity has absolute value
(5.78)
This quantity is a measure of the mobility of an electron per unit applied
force, therefore pn is simply referred to as electron mobility. Using equation
(5.78), the drift velocity of (5.77) may be written in the form
(5.79)
(5.80)
j , = enp,E.
For the electron conductivity
(5.81)
D,,
on = enp,.
(5.82)
This conductivity expression will be used frequently below. It can be transferred to holes without difficulty, reading
Tp =
eppp,
(5.83)
- - y I
(5.84)
mP
j
and the total conductivity
.in
+j p ,
(5.85)
+ up.
(5.86)
526
Table 5.3: Electron and hole mobilities for pure semiconductor materials at T =
300 K . (After Landoldt-Bornstein, 1982.)
1000
Si
200
40
100
N=10"
20
Temperature ("C)
-50
50
100
150
Temperature ("C
U
200
Figure 5.5: Temperature dependence of the electron and hole mobilities of Si for
various doping concentrations.
(5.87)
In Table 5.3 the mobilities of electrons and holes are given for some important
semiconductors at room temperature. That the mobilities of electrons exceed
those of holes, is to be attributed to the smaller effective masses of the
electrons in these materials.
The mobility depends on the interaction of the carriers with phonons through
Sze, 1981.)
52 7
l
o
T- 3004
K
7
=- 103
-5
,102
Doping concentration ( ~ r n - ~ )
the momentum relaxation time, and the same may be said for temperature
dependence. In Figure 5.5 this dependence is shown for Si. The decrease
of mobility with rising temperature is to be attributed to the growth of
the amplitudes of the lattice oscillations or, in other terms, of the number of phonons. In extrinsic semiconductors the mobility also depends on
the doping concentration, because doping atoms constitute collision centers.
Generally, the momentum relaxation times, and through them also the mobilities pn and p p , decrease with rising doping concentrations. Examples of
such dependencies are shown in Figure 5.6.
5.5
528
latter cause spatially inhomogeneous diffusion currents in and ip of, respectively, electrons and holes. Considering first the case of electrons, the diffusion current is given by
in = -DnVn.
(5.88)
-+V.i
at
n - no
n-
(5.89)
7,
-d -nD
at
d2n
nax2
n-no
-
rn
(5.90)
(5.91)
The determination of a unique solution of this equation requires specification
of boundary conditions, which we choose as follows: at x = 0 a particular
non-equilibrium value
n(x = 0) = no
+ An0
(5.92)
of the electron concentration is maintained, and n ( x ) will decay to the equilibrium value no at x = 00. In the interval 0 < x < 00, the solution is
written in the form
(5.93)
529
t
0
X-
+ Ange-"ILn,
(5.94)
where
L,=
&.
(5.95)
The solution (5.94) admits the following simple physical interpretation (see
also Figure 5.7). Diffusion tends to disassemble existing concentration gradients, transferring part of the distribution near z = 0 further to the right.
However, since the lifetime of non-equilibrium charge carriers is limited, this
process cannot be fully accomplished, as electrons are captured during the
transfer process, leading to the exponentially decaying profile of the electron
concentration given in equation (5.94). The characteristic decay length L ,
can be interpreted as the distance to which an electron diffuses, on the average, before it is captured. One refers to L , as the dzfluszon length. L , is the
larger for larger lifetime 7,. One can illustrate the interpretation above by
means of a fluid current moving over porous ground. At a particular point
x = 0 the fluid current level is maintained at a fked height. In the direction
of the current flow, the fluid level drops because fluid is absorbed into the
porous ground. Sufficiently far downstream, practically no fluid will arrive
because most of the h i d was absorbed into the ground further upstream.
Nevertheless, one has a continuous fluid current at x = 0, which provides
exactly as much fluid as is absorbed along the entire pat,h between x = 0
and z = 00. The analogy to the diffusion of electrons with finite lifetime is
obvious. Furthermore, the current flow at z = 0, in this case i,(z = 0), may
be determined from (5.88) and (5.94) as
(5.96)
530
Table 5.4: Diffusion coefficients and diffusion lengths for several pure semiconductors at T = 300 K . Diffusion coefficients are calculated by means of mobility data
of Table 5.3 using Einstein's relation.
Si
Ge
100.8
GaAs
227.5
10.3
10-8
3.2
(5.97)
For boundary conditions analogous to those in equation (5.92), we have
with
(5.99)
as the diffusion length of holes.
In Table 5.4, diffusion lengths of electrons and holes are listed for several
semiconductor materials. The lifetimes r, and rp depend strongly on the
degrees of purity and crystallographic perfection of the semiconductors. In
s was assumed for T,,~. However, in very pure
Table 5.4 a value of
and perfect materials, considerably larger values are possible. The diffusion
lengths are then much larger than those given in Table 5.4, and they can
reach hundreds of microns.
5.6
Complete equilibrium of the total system can only exist then if the inhomogeneities have equalized out in both partial systems. This means also
that the doping of the semiconductor material must have become spatially
homogeneous. The process which brings this about, the diffusion of dopant
atoms, is so very slow, however, as compared to the carrier processes, that
one can completely neglect it. This means that, the subsystem of atomic
cores effectively remains in the non-equilibrium state in which it started;
this state is frozen, so to say. The subsystem of the free charge carriers,
however, will relatively quickly pass into a new stationary state. On the one
hand, this state will differ from the equilibrium state of free charge carriers
in a homogeneous semiconductor - the concentrations n and p of the electrons and holes will depend on position coordinate x,as a consequence of the
inhomogeneity of the dopant concentrations. On the other hand, all current
densities of the free charge carriers will also vanish in the inhomogeneous
case, just as in the homogeneous case. A stationary final state conforming
to this characterization may be understood as a thermodynamic equilibrium
state of the free charge carriers of the spatially inhomogeneous semiconductor. The nature of thermodynamic equilibrium in this sense will be discussed
below.
We consider only spatial inhomogeneities in the x-direction. The com, and j
, of the electric current densities of electrons and holes in
ponents j
this direction are given by the phenomenological equations
(5.100)
j
, = a,E(x)
dP
dx
eDp-.
(5.101)
Here E(x) represents the electric field that occurs in consequence of the
spatially inhomogeneous charge carrier distributions, initially omitted in the
previous section. In equilibrium the current densities of electrons and holes
must vanish. That this must hold for the electron and hole currents separately, and not just for the sum, follows from Boltzmanns equation which
states that in equilibrium the partial currents of the subsystems of electrons
having a fixed energy must vanish separately (not just after summing over
all energy values). The holes are, of course, also a subsystem of the total
electron system with particular energy values, namely the energies of the
unoccupied states of the valence band. Hence, in equilibrium we have
dn
0 = ~ , E ( x ) eDn--,
dx
(5.102)
0 = apE(x) - eDp-.dP
(5.103)
dx
532
The two electron current contributions in (5.102), the drift current v n E and
the diffusion current eD,(dn/dx), need not vanish separately. The same
holds for the hole current contributions a p E and - e D p ( d p / d x ) in equation
(5.103). Only the sum of these contributions must be zero. Thus we arrive
at the conclusion that in thermodynamic equilibrium the drift and diffusion
currents of each type of carrier must exactly compensate each other.
Our object now is to transform this observation into a general condition
for thermodynamic equilibrium in systems with spatially inhomogeneous
doping. To this end we express the field strength E of equations (5.102)
and (5.103) in terms of the electrostatic potential cp, and replace c n and up,
respectively, by the right hand sides of equations (5.78) and (5.83). Thus,
we obtain
o=-
dYJ
dx
+ Dn-ddx [Inn ( x ) ],
(5.104)
dcp
d
- Dp- [ h p ( ~ ) ]
dx
(5.105)
~ n -
O = -pp-
dx
In the case of local equilibrium, the charge carrier concentrations n(x) and
p(x) depend on the chemical potential p(x) in just the same way that n and
p in expression (4.59) depend on E F ,
(5.106)
Substituting these expressions in (5.104) and (5.105) we have
(5.107)
(5.108)
Einstein relation
Because the chemical potential for holes is the same as that for electrons,
we can restrict ourselves to the latter. We assume that at a given position
z = xo there is an infinitely high potential wall for these particles (see Figure
5.8). Electrons can reside on the right side of the wall, but not on the left, as
533
. .
aJ
I
01
I
I
xo
X
they are reflected by the wall if they move toward the left. The situation is
comparable with that of molecules of air above the surface of the earth. As
in the latter case, in which the equilibrium vertical distribution of particles
is given by the law of atmospheres, we have here
D, = -
p P .
(5.112)
9-e d- v = 0
dx
dx
(5.113)
or
p ( z ) - ecp(z) = const
(5.114)
The quantity on the left hand side of this equation is referred to as the
electrochemical potential. In the case of vanishing electric potential, the
electrochemical potential is just the same as the simple chemical potential.
This also remains true if the electric potential has a non-zero but spatially
534
(5.115)
In thermodynamic equilibrium, the electrochemical potential of the free
charge carriers is spatially constant, and the constant value is equal to the
Fermi energy. The chemical potential and the electric potential themselves
can change, however. It is appropriate to bear in mind here that the initially assumed spatial inhomogeneity of the semiconductor is based on the
inhomogeneity of its doping. The formal expression of this inhomogeneity is
the spatial variation of the chemical potential. To properly adjust the thermodynamic equilibrium state, the spatial variation of the electric potential
must just compensate the spatial variation of the chemical potential.
The equilibrium state of equation (5.114) represents an equilibrium in
both the local and in the global senses. That in the local sense was presupposed (approximately) at the outset with the use of phenomenological
equations for the current densities of the charge carriers. Moreover, the equilibrium state described by (5.114) represents a relative equilibrium, because
it was derived under the assumption that the initial spatial inhomogeneity
of doping, and thus the inhomogeneity of the chemical potential, remains
unchanged. Over very long periods of time, this is assumption ceases to be
valid. However, this does not affect the equilibrium condition because it is
only essential that the inhomogeneity of doping does not change during the
relaxation of the charge carriers towards equilibrium, a condition which is
always fulfilled.
As one may expect, the equilibrium condition (5.114) or (5.115) will
play a central role in the treatment of semiconductor junctions in the next
chapter.
535
Chapter 6
Semiconductor junctions in
thermodynamic equilibrium
The distinguished role of semiconductors in the modern world stems from
devices which can be fabricated from these materials and, like transistors
and semiconductor lasers, have opened the door to the present computer
and communication technologies. The operation of semiconductor devices
depends largely on effects which occur at semiconductor junctions, in addition to the properties of homogeneous semiconductors which have been
treated almost exclusively in this book thus far. A semiconductor junction
is understood as a structure composed of two interconnected layers, of which
at least one is a semiconductor material. Various semiconductor junctions
are shown in Figure 6.1 schematically. In case (a) the two layers are made
of the same semiconductor material, e.g. silicon, but one of the layers is
p-doped and the other n-doped. This is called a pn-junction In case (b)
two layers of different semiconductor materials are connected. The doping
type may, but need not be different. This structure is called a semiconductor
heterojunction. We encountered such semiconductor heterostructures above
in section 3.7. If a semiconductor layer is combined with a metal layer,
then one has a metal-semiconductor junction, and if it is connected with
an insulator layer it is an insulator-semiconductor junction Formally, one
can also include semiconductor surfaces here, which correspond to vacuumsemiconductor junctions.
Each of the semiconductor junctions identified above is associated with
an important physical discovery. Several of these discoveries were even honored with Nobel prizes. Moreover, each of these junctions plays a role in
electronic devices, and in most cases a semiconductor junction is crucial
for operation of the device. Electric rectification at a metal-semiconductor
junction, the so-called Schottky contact, was historically the first semiconductor junction to be used technologically. In this particular case, funda-
536
a)
n -type
b)
cn
Semic.
Metal
In general terms, the operation of electronic devices based on semiconductors may be described as follows: The thermodynamic equilibrium of a
semiconductor junction is disturbed by an external perturbation, for exam-
537
6.1. pn-junction
pn-junction
6.1
pn-junctions are fabricated such that either the surface region of a homogeneously doped n (or p)-type sample, is p (or n)-doped by means of diffusion
or ion implantation, or that a p (or n)-type layer is deposited on a n (or
p)-type substrate by means of epitaxy. In the particular case of Si, neutron
bombardment may also be used. Due to the nuclear reaction ;gSi n
:aSi y -+
p, a previously p-doped sample becomes n-doped in the
surface region exposed to the neutrons.
In the following theoretical description of the pn-junction we assume
homogeneously doped p - and n-regions left and right of the plane normal
to the z-axis of a Cartesian coordinate system (see Figure 6.2a). The p region extends to -00 and the n-region to +co. The transition from the p to the n-region takes place abruptly at z = 0. We refer to this position as
the nominal transition. In the transverse plane normal to the z-axis, the
two semi-infinite semiconductor samples are taken to extend to infinity in
all directions. Thus, there are no dependencies on y and z , provided the
semiconductor materials are homogeneous in the transverse plane, which we
take to be the case. With this, the problem is effectively one dimensional.
The carrier concentration distribution along the x-axis before equilibrium is
established is shown schematically in Figure 6.2a. Conceptually, this corresponds to the instant at which the junction is formed, so that the carrier
distribution has not yet had enough time to adjust to form a new equilibrium
state. Considering that the p-region also has electrons which are minority
carriers in it, and that the n-region also has holes as minority carriers there,
one must distinguish the region in which these concentrations are meant.
We indicate this by affixing a subscript to n and p . The concentrations in
the n-region will be denoted by n, and p,, and those in the p-region by np
and p p . Thus, n,, and p p are majority carrier concentrations, while np and
p, are minority carrier concentrations. The different values of the charge
carrier concentrations in the n- and p-regions are related to different values
E F , and E F of
~ the bulk Fermi energies in these regions, such that
---f
538
1--I
D i f f h o n tc Drift
-ecp (x)
C)
Recombination
I
I
I
I
Figure 6.2: Non-equilibrium state of a pn-junction (a), from which the establishment of thermodynamic equilibrium develops (b,c).
6.1. pn-junction
6.1.1
539
The situation shown in Figure 6.2a certainly does not represent an equilibrium state. The charge carrier concentrations are spatially varying functions
n(x) and p ( z ) , and because np << n, and p, << pp, strong gradients of n(x)
and p(x) exist in the vicinity of the nominal transition x = 0. They cause
electrons to diffuse to the left into the p-region, and holes to the right into
the n-region (Figure 6.2b). Consequently, the n-region becomes positively
charged, and the p-region negatively. In the terminology of Chapter 5 , one
can say that a space charge region is formed. It has an associated electric
field E ( z ) and an electric potential cp(z). The dependence of these quantities
on x is shown in Figure 6.2b (right column), illustrating results which will
actually be obtained later by detailed analysis. For our immediate considerations, only their qualitative behavior is important. In Figure 6.2b the
band edges are shown as position-dependent quantities. This representation is surprising initially, since the band edges are energy eigenvalues of
the Schrodinger equation, and as such they do not depend on space coordinates. Such a representation in terms of position-dependent band edges
is nevertheless meaningful in the present case, based on the fact that cp(z)
is a smoothly varying potential in the sense of section 3.3. Therefore, the
effective mass equation can be applied. For the envelope function Xc(x) of
the conduction band electrons this equation reads
and for the envelope function Xv(x) of the valence band holes it has the
form
According to equation (6.3), the electrons at the conduction band edge exhibit a potential energy E, - ep(z), and the holes at the valence band edge
exhibit a potential energy ecp(z). These energy functions are depicted in
Figure 6.2b. In this, one has to recognize that the energy e p ( z ) of the holes
counts negatively, so that on the positive electron energy scale one must
therefore plot -ep (z).
The electric field in the space charge region drives a current in the direction
opposite to that of the diffusion current (Figure 6.2b). The concentrations
540
of carriers which diffuse from the region where they are majority carriers
into the regions where they are minority carriers, are much larger than the
equilibrium minority concentrations there. Thus they will recombine with
the abundantly available majority carriers in these regions (Figure 6 . 2 ~ ) In
.
this process, the concentrations of both types of free charge carriers decrease
in the space charge region, as shown in Figure 5.2. The electric charge balance as well as the electric field and potential distributions are, however,
not influenced by such recombination since the annihilation of electrons and
holes takes place simultaneously, and in the same measure. Overall, a p n junction, in the time interval immediately following its formation, undergoes
a relatively complicated interplay between various non-equilibrium processes
- the diffusion and drift of majority carriers, the formation of a space charge
region and of related electric field and potential distributions, as well as recombination of non-equilibrium minority carriers with equilibrium majority
carriers.
The processes under discussion act in such a way that thepn-junction approaches thermodynamic equilibrium. The state to which the system relaxes
is, however, not a static but a dynamic equilibrium state. Of course, it has no
net current, but only because the drift and diffusion currents fully compensate each other. In regard to the equilibrium state which finally emerges, the
details of the processes which establish equilibrium do not matter, what is
important is only that there are such processes. Independently of the nature
of these processes, specific local values of the chemical potential p ( z ) , electric
potential p(z), and charge carrier concentrations n ( z ) ,p(z) are established
in equilibrium. The latter are given by the relations (5.106) discussed in
Chapter 5, and we display them here once again:
p , P(+w)= Pn.
(6.6)
(6.7)
P ( + m ) = EFn.
(6.8)
6.1. pn-junction
541
Before doing this and solving the Poisson equation, we will first determine a
relatively easily accessible integral quantity, namely the potential difference
U D , which exists between the left and the right end of the pn-junction in
consequence of the charging of the two regions. This potential difference is
called &;fusion voltage, because diffusion is the agent actually responsible for
its existence. The diffusion voltage provides one of the boundary conditions
needed for the solution of the Poisson equation.
6.1.2
Diffusion voltage
cP(m) - cP(-m)
can be determined by writing down the electrochemical potential p ( z ) ecp(z) once at z = -m and again at z = +m, and requiring the two values
to be equal, whence
p(-m) - ecp(-oo)
= p ( + w ) - ecp(+m).
(6.10)
1
- [ E F ~- E ~ p l .
(6.11)
The Fermi levels may be expressed in terms of the majority carrier concentrations nn and p, employing equations (6.5) to (6.8). First of all, we
have
nnPp - N ~ N ,-Eg/kTe ( E F-~E ~ p l / k T .
(6.12)
Introducing ni from formula (4.86) and taking the logarithm, it follows that
(6.13)
542
+-
+-
- +
P
I
:
-+
bl
;:
Metal
-- ++
6.1.3
The electric potential p(z) obeys the Poisson equation (5.19) with p ( z ) taken
from (5.16). A particularly important feature of this equation is that its
charge sources on the right-hand side themselves depend on the potential
p(z). We consider this dependence initially for the free charge carrier part
of the charge sources. One obtains this from equations (6.5) for n ( z ) and
p ( z ) , expressing the chemical potential p ( z ) in terms of the electric potential
p(z) using the relation
543
6.1. pn-junction
whence
(6.16)
and with 6.13) we find
2
(6.17)
n ( x ) p ( z ) = ni
The mass action law, which was written down for spatially homogeneous
semiconductors in equation (4.85), therefore also holds locally for inhomogeneous semiconductors. Such a result is not surprising since this law is
predicated on the existence of thermodynamic equilibrium, and local equilibrium was assumed in the inhomogeneous case considered here.
The concentrations of ionized donors NDf and acceptors N Z depend on
the potential (p(z),at least in the general case. In a frequently occurring
special case, namely that of complete ionization, this dependence no longer
exists. We will assume that this case applies, so that
~ o f ( x=
) N&),
N A ( X )= N A ( Z ) .
(6.18)
Since donor atoms are present only in the n-region, and acceptor atoms
only in the p-region, and since they are homogeneously distributed there in
accordance with our initial assumption, it follows that
(6.19)
N ~ ( X=)
0,
> 0,
N A , x < 0.
(6.20)
(6.21)
We seek a solution of this equation which accounts for the existence of the
diffusion voltage (6.9). In order to get a unique solution one more condition
is needed. This may be obtained from the observation that far from the
nominal transition at x = 0, everything should behave as it would in an infinite bulk sample. This means, in particular, that the electric field strength
has to vanish at x = f c o , whence
(6.22)
544
(6.23)
The explicit determination of the solution of the Poisson equation (6.21) is
a difficult task, mainly because the function p(z) to be calculated enters the
right-hand side of this equation via the charge carrier concentrations n ( z )
and p ( z ) as given by (6.15) in an extremely non-linear way, namely exponentially. It is therefore necessary to resort to approximations or numerical
calculations.
Here, we will discuss the former. First of all, one must consider which
approximations are suitable under existing conditions. The answer may be
obtained from a qualitative estimate of the potential profile p(z) illustrated
in Figure 6.4. The result of the estimate has already been used in Figure
6.2 to construct -ep(z). Far from the nominal pn-transition at z = 0 there
is local charge neutrality, as for a homogeneous semiconductor sample. In
consequence of the Poisson equation and the boundary condition (6.22), the
potential will approximately retain the values p(-m) and p(+oo) which it
has, respectively, at z = --oo and z = +m, in approaching the position
x = 0 from --oo and +m. In the vicinity of the nominal transition at z = 0
a space charge region is formed, as outlined above, and it is here that the
6.1. pn-junction
545
n=
I
N,
Depletion region
----
I+++,+
Bulk region
Figure 6.5: Space charge region, depletion region and bulk region of a pn-junction.
potential will undergo significant variation. We will denote by x , the first
position in the n-region where, in the approach from +CQ, space charge starts
to develop substantially. Correspondingly, the potential starts to undergo
major change at x,, where it is approximately given by cp(x,) = cp(00).
Since the total change of cp(x) over the space charge region amounts to U D ,
and since U D is, in general, substantially larger than k T , the point is quickly
reached where cp(x) has fallen off to the value cp(x,) - k T / e . We denote this
point by x n - A x , , so that
V ( x n - A X , ) = cp(xn) - k T / e .
(6.24)
0,
n,,
<~ n
> x,.
A-x , ,
(6.25)
An analogous conclusion can be drawn for the hole concentration p(x). Using similar notations x p and A x p for holes as were used for electrons, we
approximately have
(6.26)
If it turns out, as is actually the case, that A x , << x , and A x p <<I x p I hold,
then we can make the further approximation of taking the two intervals A x ,
and Axp to be vanishingly small. We will do this, understanding this change
to be made in equations (6.25), (6.26) without rewriting these formulas.
The simplification described in the resulting equations (6.25) and (6.26) is
called the Schottky a p p r o x i m a t i o n In the pn-junction, we can now identify
four regions in which the total charge density has constant values, with
jumps from region to region (see Figure 6.5). For x < x p and x > x , the
546
charge density is zero, because the charge of the ionized impurity atoms is
compensated by the charge of the free electrons and holes. Between x p and
I , , free charge carriers are absent, so for x p < x < 0 we have the charge
density - e N A of the ionized acceptors, and for 0 < x < x , the charge density
e N D of the ionized donors. Because of the absence of free charge carriers
between x p and 0 and between 0 and x,, these two regions are called depletion
regions. Jointly, the two regions together form the space charge region, The
adjacent regions on the left and right hand sides are called bulk regions. The
Poisson equation associated with the charge distribution described above
reads
d2P
--
dx2
{I
-No,
O < X <xn,
NA,
xp
0,
< x < 0,
< xp.
(6.27)
It states that the total positive charge of the n-side of the space charge region
equals the total negative charge of the pside of this region. A second relation
for the determination of x , and x p follows from the boundary condition
(6.22), which will be used later.
In solving the Poisson equation (6.27), attention must be given to the
fact that, notwithstanding discontinuity of the right-hand side at x = 0, the
only admissible potentials p(z) are those which are continuous everywhere
together with their first derivatives. The solution of the boundary value
problem (6.27), (6.9) and (6.22) which meets this requirement is given by
(6.29)
x
< xp.
(6.30)
547
6.1. pn-junction
NA
ND
+ N Aw ,
ND
xp=-
+N A ~ .
(6.31)
(6.32)
The potential values at x, and x p are given by
(6.34)
a)
fi<
d-m,
n - region,
f i <J-J~TT;;,
<
p-region.
(6.35)
[ n;y]
ln-
(6.36)
holds. At sufficiently high doping concentrations the Schottky approximation is therefore always applicable provided N A and N D are of comparable
size, so that the numerical factors d N A / ( N A N O ) and ~ N D I ( N -t
ANg)
548
or
The uniform Fermi level of the junction is uniquely determined by the individual Fermi levels and doping concentrations of the n- and p-regions. The
fact that p - and n-type materials exist at a semiconductor junction despite
the common, uniform Fermi energy, is due to the bending of the conduction
and valence band edges. This bending is such that, on the left-hand side
6.2. Heterojunctions
549
Figure 6.6: Lineup of the band edges and the chemical potentials for a pn-junction
with N D = 2N.4. The chemical potential p ( z ) and the Fermi level are also shown.
of Figure 6.6, the valence band edge is close to the Fermi level so that the
material is p-type there, and on the right-hand side it is the conduction band
edge that is close to the Fermi level so that material is n-type there.
e6.2
Heterojunctions
The pn-junction is a semiconductor structure based on a single semiconductor material, except the state of the semiconductor, specifically, the value of
the chemical potential, is different on the two sides of the junction. In the
case of semiconductor heterojunctions not only are the states different but
also the materials differ - left of the nominal transition at z = 0 one has
a particular material 1, and right of it there is a different material 2. The
material inhomogeneity means that the valence and conduction band edges
E,, E,, the energy gap E,, the effective band densities of states N,, N,, the
Fermi level E F and other quantities have different values on the two sides
550
Material 1
6.2.1
Equilibrium condition
As seen above, the thermodynamic equilibrium state of the free carrier system of an inhomogeneously doped semiconductor is characterized by the
condition that its electrochemical potential is uniform in space. Here, we
generalize this condition to heterostructures, i.e. to semiconductor systems
having not only inhomogeneous doping but also inhomogeneous materials
composition. Again, the characterization of thermal equilibrium is focused
on the vanishing of the net electron and hole currents. However, the phenomenological equations for these currents differ in the case of systems having heterogeneous materials composition from those of systems involving a
single material only. This can be seen as follows. Assume for the moment
that the band edges Ec and Ev of the heterostructure change gradually as
functions of position coordinate 2 at the interface, rather than abruptly.
6.2. Heterojunctions
551
Then, within the framework of effective mass theory, the continuous functions E c ( z )and E,(z) describe position-dependent potential energies of electrons and holes, and one may expect that the gradients VE, and VE, play
the role of driving forces, which result in additional contributions to the
electron and hole currents. The same may be said for the gradients V N , ( z )
and V N v ( z ) of the effective densities of states in contributing to diffusion
current. The exact forms of these current parts are not known a priori.
They can be determined either by means of Boltzmanns equation, or one
can formulate them phenomenologically on the basis of experience as we did
in the case of the electric drift and diffusion currents in equations (5.100)
and (5.101). Using Einsteins relations (5.111) and (5.112) for, respectively,
D , and D,, as well as equations (5.82) and (5.83), the electron and hole
diffusion currents in these equations can be written as u , ( k T / e n ) V n and
- u , ( k T / e p ) V p , respectively. Since the negative gradients of N c and N ,
should play a role similar to that of the gradients of n and p , we write the
generalized phenomenological equations for the electron and hole currents
as
kT
n
- ecp) + -Vn
1
j, = up;
[O(,T~
kT
- ecp) - - 0 p
n
(6.39)
+ -kT
VN,
Nw
(6.40)
Using these expressions for n ( z ) and p ( z ) , the two phenomenological equations (6.39) and (6.40) may be written in the more compact form
Accordingly, the vanishing of the two current densities j, and j, is guaranteed by the fact that the electrochemical potential p ( z ) - ecp(z) is spatially
uniform. The constant value is again referred to as Fermi energy EF. With
this, the spatial constancy of the Fermi energy E F is seen to be a principal feature of thermodynamic equilibrium for any carrier systems, those
of single semiconductor materials with inhomogeneous doping, and those of
5 52
6.2.2
heterojunc-
dio
dx
l2=--00
- 0.
(6.45)
The calculation of the electrostatic potential along the general lines described
above will now be demonstrated in the particular case of a heterojunction
with x < 0.4 (material 2),
between GaAs (material 1) and Gal-,Al,As
illustrated in Figure 6.8. Accordingly, we assume that the relations Ec2 >
Ecl, Ncl = Nc2, and 1 = 2 t are valid. Moreover, we suppose that both
6.2. Heterojunctions
553
(6.46)
We further assume that the donor concentrations in both materials are the
same, so that the Fermi energies E F and
~ E F of~ the two bulk semiconductors
have the same energy separations from the respective conduction band edges
E,1 and Ec2 of the two materials, whence
, consequently
Considering that Ec2 > Eel, we also have E F Z > E F ~and
electrons originating in material 2 diffuse into material 1. There, they accumulate close to the interface, and one thus identifies an accumulation layer.
Correspondingly, in material 2, a depletion layer is formed, and n < N o
holds so that the Schottky approximation can be applied. However, in the
accumulation layer, one has n >> N o , thus the Schottky approximation is
not valid in material 1. The character of the Poisson equation here is fundamentally different from that of the depletion layer - the source term on
the right-hand side depends on the potential p(z) itself through the carrier
concentration n ( z ) . In fact, from relations (6.41) and (6.44), it follows that
554
n(x) becomes n 0 1 , as one should expect. Using (6.48) the Poisson equation
takes the form
(6.49)
which is a non-linear equation that must be solved self-consistently. This
entails finding a solution p(z) which obeys the equation with the source
term depending on cp(x) as shown on the right-hand side.
Some insight into the solution may be gained as follows: We note that
the potential energy difference -e[cp(z) - p(--oo)] of an electron in the accumulation layer and at --oo cannot exceed the conduction band discontinuity
AEc Ec2 - Eel, and also will not fall significantly short of it. In any case,
it is large compared to RT. Thus, in the accumulation layer, the second
term on the right side of (6.49) representing the concentration of free carriers, is substantially larger than N o . In these circumstances it is possible
to make a simplification which is, in a sense, complementary to that of the
Schottky approximation, namely the neglect of the concentration of b e d
charge carriers in comparison with that of the freely mobile ones. We retain
a constant value for N D , nevertheless, in our analysis below because it does
not interfere with the determination of an exact first integral of equation
(6.49), as given by
6.2. Heterojunctions
555
Ec2 - ecp(+w),
(6.52)
5 56
in section 3.7. This means that in a single GaAs/Gal_,Al,As heterojunctions one can expect similar confinement effects as in Gal-,Al,As/GaAs/
Gal-,Al,As quantum well structures. In the well region, the conduction
band splits into subbands. Since the Fermi level lies above the well bottom,
these subbands are partially occupied, and the electrons of the accumulation layer form a 2-dimensional electron gas. As pointed out above, one can
observe the quantized Hall effect in such a 2-dimensional gas (see Figure
6.9).
The formation of subbands has yet another implication. Namely, that
equation (6.41) for the electron concentration n ( z ) ,which is based on the
assumption that the electrons are hosted by a 3-dimensional parabolic band,
is no longer valid. Instead, the 2-dimensional subband structure of the
triangular well has to be considered. The latter can only be determined,
however, after the potential is calculated, which, in turn, cannot be accomplished without knowing the subband structure. Thus, a new type of selfconsistency problem emerges, beside those already considered: the Poisson
equation must be solved self-consistently with the effective mass Schrodinger
equation (6.3).
557
is undoped. In this case the electrons in the GaAs well are not exposed
to ionized donors because there are none. Therefore the mobility of these
electrons is particularly high, it can reach several millions crn2V-ls-'. An
electronic device which exploits this effect is the High Electron Mobility
Transistor (HEMT), which was already mentioned several times in somewhat
different contexts.
6.3
Metal-semiconductor junctions
6.3.1
558
Figure 6.10: Positional dependence of the Fermi levels of a metal and a semiconductor, as well as of the band edges of the semiconductor in a metal-semiconductor
junction at the instant of its inception, before equilibrium is established. Four different cases are shown: n-type semiconductor (a, b), p-type semiconductor (c, d),
E F S > E F M (a, d), and EFS < E F M (b, c).
prior to emission. There are as many different values of Einitfal as there are
occupied energy levels of the metal. Thus the kinetic energies &in span the
whole occupied part of the energy spectrum of the metal, shifted by hw. The
work function represents the lower threshold energy of this distribution.
The energy separation between the conduction band of the semiconductor and the vacuum level is referred to as the electron affinity X . It, too,
is a quantity which can be determined experimentally. One measures the
photothreshold a P h of the semiconductor (see section 3.7), and subtracts the
energy gap Eg from it, obtaining the electron affinity X from the relation
X = QPh - Eg.
559
6.3.2
Electrostatic potential
560
\ *
very thin boundary layer at the interface. The thickness of this layer is about
one A. This value follows from relation (6.34) if one takes the electron concrnV3, and replaces
centration of a metal to be of the order of magnitude
the average thermal energy k T by the Fermi energy of the metal, switching over in this way from the Debye length to the Thomas-Fermi screening
length. Over a distance of an A the potential ~ ( x is) practically constant.
Deeper in the metal, because of screening by electrons in the boundary layer,
the potential is essentially constant anyway. It is therefore reasonable to approximate the potential in the metal as spatially uniform. At the nominal
transition from the metal to the semiconductor, the potential could be understood as in the case of semiconductor heterojunctions considered in
the previous section - to exhibit a discontinuity [cp(+O) - cp(-O)] because of
the formation of a dipole layer. At the outset, we wish to ignore this and
explore the possibility that the potential cp(;c) may be continuous at 2 = 0,
i.e. assume that
(6.56)
Cp(+O) - cp(-0) = 0.
~
With this, it is clear that expression (6.55) for the Schottky barrier Cpg
will be valid even after thermodynamic equilibrium is established at the
junction by the formation of a space charge layer and its associated potential.
Both levels, the Fermi level of the metal and the conduction band edge, are
lifted by the same energy -ecp(O), so that their difference remains unchanged
(Figure 6.12).
Between the metal (z 5 0) and the bulk region of the semiconductor at
x = 00, a potential difference
U D = Cp(m) - Cp(0)
evolves which corresponds to the diffusion voltage of a pn-junction.
(6.57)
The
56 1
E, -elp( x 1
c
X
If the Fermi energy of the semiconductor lies close to the conduction band
bottom, the diffusion voltage approximately equals the Schottky barrier QB.
The latter typically has values of several tenths e V , which are also typical
values that one should expect for the diffusion voltage U D .
The details of the spatial dependence of the potential p(z) depend on the
relative positions of the Fermi levels in the metal and semiconductor. For an
n-type semiconductor, if EFS > EFM holds, then electrons transfer from the
semiconductor into the metal and an electron depletion layer develops in the
semiconductor close to the interface. The diffusion voltage U D is positive,
562
i.e. the two bands are bent up at the interface (Figure 6.12a). One speaks of
a non-ohmic or Schottky contact in this case. If, for a n-type semiconductor,
the reverse ordering E F S < E F M holds, then electrons from the metal transfer into the semiconductor, and an electron accumulation layer develops in
the semiconductor at the interface (Figure 6.12b). The diffusion voltage U D
is negative, i.e., the bands are bent down at the interface, as in the case of
the heterojunction of Figure 6.8, expect that here there is no development
of a potential well in which electrons are confined to two dimensions. This
is called an ohmic contact.
Consider next p-type semiconductors. For EFS > EFM there is a hole
accumulation layer, because electrons transfer from the semiconductor into
the metal or, what is the same, holes transfer from the metal into the semiconductor. The diffusion voltage U D is positive and the bands are bent
up at the interface (ohmic contact, see Figure 6.12d). For EFS < E F M ,
there is a hole depletion layer, U D is negative, and the bands are bent down
(non-ohmic contact, see Figure 6.12~).Below we first consider metal-n-type
semiconductor contacts which are non-ohmic.
Non-ohmic metal-n-type semiconductor contacts.
The potential cp(x) can be calculated in the same way as for a pn-junction. In
particular, the Schottky approximation can, again, be used, provided that
the conditions (6.35) are satisfied. We denote the width of the depletion
layer in the semiconductor by XB. The arbitrary constant of the potential
will be chosen such that cp(x) vanishes deep in the semiconductor.
cp(x) Iz=w= 0.
(6.59)
The two boundary conditions for p(x) are taken to be the same as in the
case of the pn-junction. Firstly, the potential difference c p ( 0 0 ) - cp(0) must
match the diffusion voltage U D in accordance with (6.58). Secondly, the field
strength deep in the semiconductor approaches zero, i.e.
dcp 12=w= 0.
dx
(6.60)
With these conditions the potential cp(x) and the width X B of the depletion
layer can be calculated from the Poisson equation, with the result
(6.62)
563
6.3.3
Schottky barrier
The importance of the Schottky barrier @B for the current-voltage characteristics of a metal-semiconductor junction was pointed out at the beginning
of this section. Under the assumption (6.56) that excludes discontinuities
of the potential at the interface, equation (6.55) is valid for @B. We refer
to it as the Mott relation since Mott was the first to propose it. In Figure
6.13, Schottky barriers are shown for a series of n-type-Si-metal junctions
as a function of the metal work function a. The electron affinity of silicon
is 4.05 e V . The straight line in Figure 6.13 corresponds to the prediction
of Motts relation (6.55). One readily sees that the linear variation of @B
as a function of @ with slope 1, predicted by this relation, is not at all fulfilled. Rather, an approximate independence of the Schottky barrier and
the metal work function can be recognized from the figure. One may conclude from this that the assumption of continuity of the potential at the
metal-semiconductor interface is not valid, just as in the case of a semiconductor heterojunction. Admitting a potential discontinuity to consideration,
equation (6.55) is supplanted by the the generalized relation
@B = @
-X
- e[cp(+O)
- p(-O)].
(6.63)
564
Work Function @ (eV)Below, we will assume that interface states are also present at the metalsemiconductor junction. We denote their energy levels with respect to the
vacuum level by EA. The concentration of the semiconductor interface state
levels will be taken to be sufficiently large to host all electrons which, in
their absence, would otherwise occupy states at higher energies. We will
show that under these circumstances, the energy EL of the interface states
is also the electrochemical potential, i.e. the energy to which the common
Fermi level of the metal-semiconductor junction adjusts in equilibrium The
proof rests on the general relation (4.57) between electron concentration,
Fermi distribution and density of states, which will be applied to the case
under consideration. For simplicity, we consider zero temperature T , but the
conclusions to be derived also hold for T > 0. The semiconductor will be
taken as n-type, and the energy levels EL of the interface states lie below the
Fermi energy of the bulk semiconductor. In reality, the interface levels do
not concentrate exactly at E i , but are spread over a narrow interval about
EL. The interface state density is therefore not an exact &-function, but a
bell curve of finite width.
Conceptually, we regard the process of establishing thermodynamic equilibrium at the metal-semiconductor junction as taking place in two steps,
firstly, the establishment of equilibrium within the semiconductor, i.e. between the semiconductor bulk and the interface at the metal, and secondly,
the establishment of equilibrium across the interface between the semiconductor and metal. In the first step, electrons transfer from the semiconductor
bulk into the interface levels. Thereby, these levels fill starting from below
up to a certain maximum energy. By definition, this maximum energy rep-
565
resents the Fermi energy of the semiconductor. Since it lies within the bell
curve of the state density it practically equals Ei. An increase or reduction
of electron concentration in the semiconductor bulk by changing its doping
will increase or reduce the number of electrons filling the interface levels.
Because of the assumed large density of interface states, however, the position of the Fermi level needs to change only very little to account for such
an increase or reduction. Thus, the Fermi level is fixed at the interface state
energy levels. This is called a pinning of the Fermi level.
In the second step, the establishment of equilibrium across the interface
between the metal and the semiconductor, electrons are transferred across
the interface, strictly speaking between the thin charged boundary layer in
the metal and the interface states of the semiconductor. Thereby, a dipole
layer is formed at the interface. This is accompanied by a potential jump
[cp(+O) - p(-O)] between semiconductor and metal, which in equilibrium is
sufficiently large that the Fermi level of the metal is raised or, respectively,
lowered, to match that of the semiconductor, whence
E F M - ecp(-O) = EL - ecp(+O).
(6.64)
Combining relations (6.63) and (6.64), and using EFM = -@, we obtain
@ B = - E i - X . Referring the interface levels EL to the top of the valence
band rather than to the vacuum level as done so far, the redefined value of
the interface levels is E , = EL - E,. Using - X - E, = E,, these relations
yield
@B = E , - E,.
(6.65)
The latter expression for the Schottky barrier was first derived by Bardeen.
In it, the Schottky barrier, in contrast to Motts expression, is seen to be
independent of the work function of the metal. Figure 6.13 shows that the
experimental data for Si are closer to Bardeens relation than to that of Mott.
This statement also holds for other group-IV semiconductors, such as Ge.
For the 111-V semiconductors, the dependence of the Schottky barrier @ B
on @ is noticeable, although smaller than that in Motts relation, and for IIVI semiconductors it corresponds approximately to Motts linear dependence
with slope 1. These observations suggest a relation for @B which interpolates
between Motts and Bardeens formulas. Accordingly, we write
+ E F S )+ Po,
@B= S(@
(6.66)
566
c------0
I
I
zns
ZnC
I
0.6
)CdS
0.4
0.8
1.2
1
1.6 2. I
Electronegativity difference
In accordance with above remarks about Schottky barriers for the various
semiconductor material classes, the value of S should increase monotonically
starting from the covalent group-IV semiconductors Si and Ge through the
moderately polar 111-V semiconductors up to the strongly polar 11-VI semiconductors. To examine this conjecture, S is plotted in Figure 6.14 as a
function of the electronegativity difference between the two types of atoms
paired in a given semiconductor, which represents a measure of the polarity of the semiconductor material. The results verify the expected behavior
of S. The different properties of the interfaces of various semiconductors
with metals are, therefore, reflected by S in an overall way. Interface effects
are evidently more pronounced in the case of completely or preferentially
covalently bound semiconductors than in the case of more ionically bound
materials.
Experimental investigations show that Schottky barriers also depend on
the chemical and structural imperfections of a particular interface. This
means that, depending on the preparation of the interfaces, different Schottky barriers may be obtained for the same metal-semiconductor junction.
Thus, for a theoretical understanding of Schottky barriers, it is particularly important to study absolutely pure and structurally perfect metalsemiconductor junctions, fabricated under UHV-conditions. From experimental investigations of such junctions as, for example, with GaAs, conclusions have been drawn about the nature of the interface states assumed
in Bardeen's model. In the unified defect-model by Spicer for 111-V semiconductors, it is assumed that these states are due to point defects of the
semiconductor like vacancies or anti-site defects being induced by positioning
the metal on top of the semiconductor. In all, the microscopic understanding
of semiconductor-metal junctions still needs further investigation.
6.4. Insulator-semiconductorjunctions
6.4
567
Insulator-semiconductor junctions
6.4.1
Thermodynamic equilibrium
Just as insulators differ from semiconductors only by the size of the energy
gap, i.e. quantitatively, the differences between semiconductor heterojunctions and insulator-semiconductor junctions are mainly of a quantitative
nature. A certain qualitative difference exists in the occurrence of interface
states which is more likely for insulator-semiconductor junctions than for
semiconductor- heterojunctions. Initially, we will omit interface states from
consideration. The energy diagram of an insulator-semiconductor junction
at the instant of its inception, i.e. before establishing equilibrium, is shown
in Figure 6.15. The depicted situation, in which the energy gap of the semiconductor lies completely within the gap of the insulator, is not the only
possible one, but it is typical. The SiOz/Si-junction is of this type. The
relative positions of the valence bands of the insulator and semiconductor,
and the relative positions of their conduction bands, may be characterized
in the same way as was done in the case of heterojunctions. Again, the
band discontinuities cannot be determined by simply comparing the bulk
values of the band edges, since they are modified by the discontinuity of the
electrostatic potential induced by the dipole layer at the interface.
Immediately after fabrication of the junction between an insulator and
a semiconductor, the chemical potentials of the two materials have different
positions. Equilibrium adjustments involve the exchange of electrons and
holes across the interface. This leads to a space charge region and an electrostatic potential p(r) which equilibrates the inhomogeneity of the chemical
568
Figure 6.15: Lineup of the band edges and the Fermi levels at an insulatorsemiconductor junction (a) before and (b) after equilibrium is established.
potential, such that the electrochemical potential E F = p(z)- ecp(z) is spatially uniform in the final equilibrium state.
We explore the positional dependence of the electrostatic potential p(z)
below. To be specific, we assume a n-type semiconductor, and suppose that
the free carriers in the insulator are predominantly holes. Then we have a
pn-junction which differs, however, from that considered in section 5.1 because it is composed of two different materials. Another particular feature
of this pn-junction is that the acceptor concentration NAI of the insulator is smaller by several orders of magnitude than the donor concentration
Nos of the semiconductor. It is natural to assume that the Fermi level
EFS of the semiconductor lies above the Fermi level E F I of the insulator.
Then, in order to establish equilibrium, electrons have to transfer from the
semiconductor into the insulator. Consequently, the semiconductor becomes
positively charged and the insulator is negatively charged.
The Poisson equation for the concomitant potential cp(z) has the general
form (6.21). The material inhomogeneity of the junction enters this equation
through the relation between the electron concentration TI(.)
and the potential cp(z) as well as through the static dielectric constant which has different
values 1 and c in, respectively, the insulator and the semiconductor. If n ( z )
does not appear explicitly, as occurs within the Schottky approximation,
the material inhomogeneity simply results in the replacement of ( N A / E )by
( N A / E I ) and
,
( N o / ) remains ( N o / ) .
Below, we employ the Schottky approximation and later verify its justification. The normalization and boundary conditions are again chosen according to relations (6.9) and, respectively, (6.22) and (6.23). The continuity
of electric field strength - ( d p / d x ) at x = 0 is replaced by the continuity of
6.4. Insulator-semiconductorjunctions
569
(6.67)
m,
5 70
6.4.2
(6.68)
(6.69)
One can also understand this interface field as being due to the positive space
charge region in the semiconductor below the interface, which in Schottky
approximation gives rise to a field strength F ( x ) that grows linearly towards
the interface, and a potential p ( x ) which decays quadratically. The value of
the potential at the interface,
d o ) = us,
(6.70)
must have just the right magnitude so that the electrochemical potential is
uniform, i.e. that the difference [Ek - E k ] between the chemical potentials
at the interface and in the bulk of the semiconductor is equilibrated. Hence,
- e U S = [ E k - E$]
(6.71)
(6.72)
with
x,=
571
/s.
eND
(6.73)
Combining equations (6.68) and (6.73) yields a relation between the interface
charge density ns and the interface potential U s , which reads
(6.74)
For the interface field F S , it follows that
(6.75)
We use the latter two relations to estimate the orders of magnitude of n s
and F s ,with the resulting relations
ns = 1.05 x 103cm-2[(d/~/cm-3)(1U s I /V)]1/2,
1 F S I = 18.97 x
10-4(V/cm)[(~-1Ng/cm-3)(1
U s I /V)]1/2.
(6.76)
(6.77)
572
6.4.3
Semiconductor surfaces
573
Chapter 7
Semiconductor junctions
under non-equilibrium
conditions
The treatment of semiconductor junctions above was limited to conditions
of thermodynamic equilibrium. In this chapter, non-equilibrium states of
these junctions will be explored. Such states occur when the junctions are
subjected to various external perturbations, which can include the application of an electric voltage, irradiation with light, exposure to pressure, or the
introduction of a temperature gradient. Here, we limit our considerations to
the application of a voltage and irradiation with light, because they are by
far the most important perturb at ions generating non-equili br ium conditions
in semiconductor junctions. In sections 7.1 and 7.2 we treat the pn-junction
subject to an external voltage. The interaction with light is considered in
section 7.2. The influence of a voltage on a metal-semiconductor junction
forms the subject of section 7.3, and in section 7.4 we examine voltage effects
on an insulator-semiconductor junction.
In the introduction to Chapter 6, we pointed out that perturbation of the
equilibrium state of a semiconductor junction provides the basic mechanism
of operation for most semiconductor devices. Examination of the effects of
an applied voltage and interaction with light for such junctions therefore constitutes an exploration of the physical principles of a number of such devices.
In section 7.1, the devices involved are rectifying semiconductor diodes, the
bipolar transistor and tunnel diode; in section 7.2 the photodiode, the solar cell and the semiconductor injection laser. Section 7.3 deals with the
metal-semiconductor rectifier, and section 7.4 with the field effect transistor. In this preview of the content of the present chapter, it is worthwhile
to note that we have omitted perturbation of the equilibrium of semiconductor heterojunctions. There are two reasons for this omission. The first
5 74
575
5 76
7.1.1
holds and the doping concentrations N A and N D are of comparable size. For
positive U equation (7.4) is not always satisfied. It becomes critical when
U approaches the diffusion voltage, and is definitely invalid when U > UD.
For negative external voltages, condition (7.4) may in fact be valid even if
it does not hold in the absence of an external voltage. If it is valid in the
unbiased case, then it is even better satisfied in the presence of a negative
voltage. In the following we will assume that condition (7.4) is fulfilled and
that the doping concentrations N A , N D are comparable. Then the Schottky
(E =
577
-$,,
1.0
95 -
12.1).
7.1.2
That an applied voltage causes current flow through a pn-junction, is obvious. Less evident, however, is the manner in which it happens. Far from the
nominal transition at z = 0, one may assume that current transport does not
5 78
u-0
a1
u>o
Cl
0
0-0
+ l o o o 0 /. . . . ( ~
0 0 0 9
4 0
.to
0
0
.
c
+ ~ o o o o ( . . . .
-0
0-0
.to
o c
Figure 7.3: Mechanism of current flow through a pn-junction, a) pn-junction without voltage, b) Injection of minority carriers by applying a (positive) voltage, c)
Recombination of the injected minority carriers with available majority carriers,
d) Refilling of the emptied band states by majority carriers from the bulk region.
There a majority charge carrier current flows.
differ from that in an infinite semiconductor sample, i.e. the current in the
p-region will be carried mainly by holes, and that in the n-region mainly by
electrons. In this matter, one has the peculiar situation that a hole current
in the left part of the junction passes over into an electron current in the
right part of the junction. The mechanisms which can realize such a transition are the recombination and generation of electron-hole pairs, whence
we may conclude that the latter processes should play an important role in
current flow through the pn-junction.
In our further considerations we assume CJ > 0. In order for recombination to occur, non-equilibrium carriers must be available. These appear in
the p-region in consequence of the fact that electrons move over from the nregion to the p-region because their potential energy in the n-region is lifted
under the applied voltage by the amount eU. Therefore they overcome the
potential barrier between the n- and p-regions more easily. This transfer
of carriers across a potential barrier is called injection In the p-region the
electrons are minority charge carriers. Therefore we have an injection of minority charge carriers into thep-region. An analogous process takes place on
the n-side of the junction. There, holes from the p-region move over into the
n-region, so that one has an injection of minority holes. These relationships
are illustrated in Figure 7.3. The injected minority carriers (Figure 7.3b)
recombine with the already present majority carriers (Figure 7 . 3 ~ ) . The
5 79
states in the bands which become unoccupied in this way will be filled by
majority carriers from the bulk regions of the junction (Figure 7.3d), where
majority carrier currents therefore flow. The magnitudes of these currents
are determined by the speeds of the injection and recombination processes.
In the stationary state both speeds must be equal - all carriers which are
injected must recombine, and all carriers which recombine must have been
previously injected.
If one reverses the sign of the voltage, corresponding to U < 0, then one
has extraction of minority charge carriers instead of injection, and generation
instead of recombination. Since extraction and generation processes consume
energy, in contrast to injection and recombination wherein energy is released,
one must expect the current through the pn-junction to be much smaller for
U < 0 than for U > 0. This is in fact the case, as we will formally prove
below.
The above mechanism for current transport through a pn-junction will
now be formulated quantitatively. The total current density j ( x ) consists,
according to formula (5.85)) of the electron current density jn(x) and the
hole current density jp(x). For the two current constituents the continuity
equations (5.20) and (5.21) hold. In the present case, the generation term
is zero, and the annihilation term is determined by recombination. In the
stationary state one obtains
djp
- = -eR(x).
dx
+ jp(x) zz j
= const.
(7.9)
580
(7.12)
Setting
(a)
21 =
-oo,z2 =
+m, we find
(7.13)
+jp(zn)
+e
X,
d 4 z ) .
(7.14)
XP
We will later see that, under certain conditions, the two minority charge
carrier contributions jn(-oo) and jp(+oo) in (7.13) can be approximately
neglected. The same holds for the recombination contribution in (7.14).
Consequently, equation (7.13) means that the total current, in its essence,
represents a recombination current. Alternatively, in formula (7.14) the total
current is seen to be the sum of the minority carrier currents at the two space
charge boundaries. There, they are determined by the injection of minority
carriers. The total current represents, therefore, an injection current. The
two interpretations are equivalent, but they emphasize different aspects of
the total current.
With these considerations concerning the mechanism of the current transport, we are sufficiently prepared to calculate the spatial profiles of the chemical potentials of the two types of carriers.
7.1.3
581
n(x) = n,
for
> x,,
(7.15)
p(z) = p p
for
< xp.
(7.16)
The minority carrier concentrations in the bulk regions are calculated from
the combined diffusion-recombination equations (5.81) and (5.87). To solve
these equations uniquely, boundary conditions are required, in the case of
electrons at -m and xp,and in the case of holes at x n and +oo. For n(-m)
and p(00) the equilibrium values
-e(UD-u)/kT
npe
eU/kT
(7.18)
(7.19)
For U = 0, the values n ( x p ) and p(;cn) are the minority carrier concentrations np and p , in equilibrium. If U > 0, the factor multiplying n p in
(7.18), and the factor multiplying p , in (7.19) is larger than 1, i.e., the
minority carrier concentrations exceed the values they would have in the
absence of an external voltage. This is the formal expression of the injection of minority charge carriers. How effective injection is can be recognized
through the following estimate. With U = 0.25 V and T = 300 K we obtain
exp(eU/kT) M elo M 2 x lo4. This is to say that the small voltage of 0.25 V
suffices to increase the minority carrier concentrations by more than 10000.
The steady state solutions of the diffusion-annihilation equations (5.90)
for n(x) and (5.97) for p ( z ) , under the respective boundary conditions (7.15),
(7.16) and (7.17), are given by
P(X)
=P,
+ [~(z,)
- P,I~-(x-x*)lLp
> ",
(7.21)
where L , and L, are the diffusion lengths of, respectively, electrons and
holes. For positive external voltages one has n ( x p ) > n, and p(x,) > p,.
582
Figure 7.4: Lineup of the quasi Fermi levels of a pn-junction under an applied
voltage, (a) flow direction (U > 0 ) , (b) blocking direction (U < 0)). Dashed curves
correspond to the unbiased pn-junction. In the space charge region interpolated
values are used. The decay of the quasi Fermi levels in the two bulk regions is
drawn greatly exaggerated.
This means that in the two bulk regions, both minority carrier concentrations
are larger than their respective equilibrium values np and p,. The chemical
potential p n ( z ) of the electrons in the pregion, is therefore shifted to higher
energies with respect to its equilibrium value, and that of the holes in the
n-region is shifted to lower values. The same holds for the pertinent nonequilibrium electrochemical potentials
E F ( ~ =) p n ( x ) - ecp(x),
E;(X)
= p P ( z )- ecpG),
(7.22)
which are also referred to as quasi Fermi levels. In Figure 7.4a the spatial
variation of the two quasi Fermi levels is shown schematically, together with
the valence and conduction band profiles.
For a negative voltage U < 0 one has n ( z p ) < np and p ( z n ) < p,.
The chemical potential of electrons in the p-region therefore lies below the
equilibrium value, and that of the holes in the n-region lies above it. The
same is true again for the quasi Fermi levels (see Figure 7.4b). The elevation
or depression of these levels in the bulk regions is effective up to a distance
from the depletion region which roughly equals the diffusion length of the
pertinent minority carrier. In the depletion region between the two bulk
regions, we cannot make such statements, or any others, because the above
consideration excludes this region. Fortunately, the diffusion lengths are,
as a rule, an order of magnitude larger than the width of the space charge
583
region (see section 5.4), so that this lack of knowledge is not important. The
space charge regions function solely as potential barriers over which nonequilibrium carriers are injected. The spatial expansion of the barriers can
be neglected in a first approximation.
(7.23)
The recombination rate R is given by relations derived above in section 5.2.
According to formula (5.64), in the presence of a small excess concentration
A n ( x ) of electrons (here, of electrons in the p-region), we have
(7.24)
and according to formula (5.66), in the presence of a small excess concentration A p ( z ) of holes (here, such in the n-region), the corresponding relation
is
584
(7.25)
with rn and rp given by (5.65) and (5.67) as minority carrier lifetimes of
electrons and holes, respectively. For An(.) and A p ( x ) we employ relations
(7.20) and (7.21) in the form
~ p ( x=) b ( x n ) - pnle-(z-xJ'Lp,
xn
< 2.
(7.27)
Substituting these formulas in equations (7.24), (7.25) and (7.23) and using
relations (7.18) for n ( x p ) and (7.19) for p ( x n ) , we obtain
j = js(e
eUlkT
- 1)
(7.28)
with
(7.29)
The same result is obtained if one proceeds in accordance with method b,
which will be verified below.
Method b
In equation (7.14) for j we again neglect the recombination integral over the
depletion region. The minority carrier current densities j n ( z p )and j p ( x n )at
the boundaries of the depletion region follow from the general phenomenological equations (5.100) and (5.101), wherein the electric field strength has
to be set zero since the electric potential is constant in the bulk regions.
The charge carrier concentrations n ( x ) and p ( x ) are taken from expressions
(7.20) and (7.21), yielding
(7.31)
Adding these equations and applying relations (7.18) and (7.19), the expression (7.28) follows for j with
(7.32)
The two expressions (7.32) and (7.29) for j , are identical, since Ln =
(see equation 5.95) and L, =
(see equation 5.99).
fi
585
With this observation, the task of calculating the current through a pnjunction under an applied voltage is completely solved, within the framework
of the conditions and approximations set forth above. We will now discuss
the results. The current-voltage characteristic (7.28) is extremely non-linear.
It exhibits the expected asymmetry with respect to a change of the sign of
voltage U . For a positive U of a few tenths of a Volt, j is several orders of
magnitude larger than j,, while for a negative U of same absolute value, j
approaches - j , (see Figure 7.5). The current density j,, which cannot be
surpassed at even larger negative voltages, is called the saturation current
density. To estimate the size of this current density in the case of Si, we
assume typical values for L,,p of 10 pm and for 7n,pof lo-' s. For the
minority carrier concentrations, we obtain, from np = n?/NA and p, =
n!/ND with ni = lo1' cm-3 and N A = N D = 1OI6 cmV3, the values np =
p, = lo4 c ~ L - Using
~.
e = 1.6 x 10-lgA s, it follows that j,
10-l' A/cmz.
The saturation current density is therefore extremely small. Thus, for U < 0
practically no current flows, the pn-junction blocks the current flow. One
says that it is reverse biased or biased in blocking direction, The biasing
U > 0 refers to the forward bias or Bow direction because the current density
in this direction is orders of magnitude larger than j, as we have seen above.
The pn-junction operates as an electrical rectifier. It is called a rectifyingpndiode in this context. In Figure 7.5 a measured current-voltage characteristic
of a pn-diode made of Ge is shown.
7.1.5
Bipolar transistor
586
m
Emitter
Base Collector
PB
(PCI
(7.33)
and, accordingly,
(7.34)
holds. In this case the left pn-junction is biased in the flow direction, and the
right in blocking direction (see Figure 7.8). From the left n-region, electrons
are injected into the p-region in the middle, while the right n-region extracts
electrons from the p-region. One therefore calls the left n-region the emitter,
and the right the collector. The p-region in the middle is called the base.
Accordingly, the np-circuit will be referred to as the emitter-base circuit or,
in short, the base circuit and the npn-circuit as the emitter-collector circuit,
587
or in short, the collector circuit. Let the current in the base circuit be ig,
and that in the collector circuit ic. Then the current i E through the emitter
follows from Kirchhoffs current branching theorem as the sum of the two
currents,
i~ = iB
+ ic,
(7.35)
One can also say that the emitter current splits in two partial currents, one
flowing through the base, and one flowing through the collector (see Figure
7.7 on the right).
Our goal is the calculation of the three currents i ~ iB,
, ic. In this matter,
we can employ the results obtained above for the current flow through an
individual pn-junction, with the valence and conduction band edges of the
npn-transistor lined up as shown in Figure 7.8. The pn-junction on the
emitter side is subject to the flow voltage U g , and the pn-junction on the
collector side is subject to the blocking voltage U c - UB. The emitter current
i~ is the current flowing through the emitter-side pn-junction. As such it is
given by expression (7.12), multiplied by the emitter area A . Accordingly,
it consists of the injection current j n ( z p ) Aof the minority electrons into the
588
base, the injection current jp(x,)A of the minority holes into the emitter,
and the recombination current in the depletion region. Neglecting the latter
contribution as before, and using relations (7.30) and (7.31), we have
Here the signs are opposite to those of relations (7.30) and (7.31) above, due
to the fact that the emitter-base junction has thep-region on the right-hand
side and the n-region on the left, whereas it was the opposite above. If
both regions were expanded infinitely, as we always assumed above in the
treatment of the pn-junction, then the minority charge carrier concentrations n(x) and p(x) in (7.37) could be replaced by the previously derived
expressions (7.20) and (7.21). This procedure would result in expressions
for j,(xp) and j p ( z n )which are just the negatives of those in relations (7.30)
and (7.31). In the transistor, however, only the emitter can still be considered to be infinitely extended, while the width b of the base must be treated
as finite because it is not large in comparison with the diffusion length L, of
the minority charge carriers. Thus, only the injection current density jp(x,)
of holes may be taken from the previously derived expression (7.31). Adjusting this expression to the relationships at the emitter-base junction, we
find along with (7.27),
(7.38)
The injection current density jn(zp)of electrons, however, must be calculated
anew. This can be done on the basis of the following considerations. To
start, it is clear that, because of the finite width of the base, only part of the
injected minority electrons recombine in the p-region, while the remainder
diffuse through this region and reach the depletion region at the collector side
of the pn-junction. The negative electric field there pulls the electrons into
the bulk region of the collector, from which they are sucked up by the applied
positive voltage. The equilibrium value np of the electron concentration is
therefore reached not only at x = 00, but it is already realized at x = b. This
means that the boundary condition An(x = b) = 0 must now be imposed.
The solution of the diffusion-recombination equation (5.91), which accounts
for this new boundary condition, results in
(7.39)
589
j n ( z p ) = enp-
(7.40)
The current ig in the base circuit would be identical with the emitter current
iE, and both currents would equal the current through the emitter-base pn-
junction, if the base were to be infinitely expanded. For finite base width,
however, the emitter current differs from the base current because the latter
only takes contributions from the portion of the electrons injected into the
base which also recombine in the base. We denote the pertinent current
density as jnT. The first term of formula (7.23) expresses it as
(7.41)
The upper boundary of the recombination region, which in (7.23) is located
at infinity, is replaced here by the finite base width b. Furthermore, xn and
x p are interchanged. The recombination rate R(x) in (7.41) can, as before, be
calculated from the minority charge carrier concentration n ( z )= np+An(z)
in the p-region by means of relation (7.24). In this, expression (7.39) has to
be used for An(x), yielding
The base current ig, like the emitter current in (7.36), also involves the
injection current j p ( z n ) Aof holes from the base into the emitter, besides
the electron current j,A. Altogether, we therefore have
The collector current ic need not be calculated separately, because it is already determined by Kirchhoff's law (7.35). The main source of this current
are electrons which leave the emitter and do not recombine in the base but
diffuse further to the collector where they are sucked up. This current contribution is entirely due to the coupling between the collector-base pn-junction
and the emitter-base np-junction; it would not appear at all at a single,
separate pn-junction. Another current which flows through the collectorbase pn-junction is the true pn-current, which represents the current which
would exist if this junction were not coupled to a second one. However, the
collector-basepn-junction is biased in blocking direction, so that the true pncurrent is an extraction or generation current, which because of its smallness
590
Figure 7.9: Characteristics of a bipolar transistor made of Si in the commonemitter configuration. (After Volz, 1986.)
may be completely neglected. This approximation was used above tacitly,
as we identified the current through the emitter-base pn-junction with the
true pn-current of this junction, without adding the hole extraction current
flowing in from the collector-base pn-junction on the right-hand side .
With this, the three currents i E , i g , ic we sought are fully determined.
In Figure 7.9 some characteristic dependencies for i g and ic are shown
in the case of a npn-transistor made of Si. We can now proceed to the
question of what conditions are needed for the bipolar transistor to operate
as an amplifier. As we will see below, significant amplification by a bipolar
transistor results when the base current is only a small portion of the emitter
current. In order for that to be the case, on the one hand, j,,A must be
small. Considering (7.42)) this means that the base width b must be small
in comparison with the diffusion length L , of the minority charge carriers.
On the other hand, the hole injection current j p ( z , ) A is not allowed to be
too large. This can be achieved by low doping of the base in comparison
with the emitter, because jp(z,) is proportional to p, (as may be seen from
(7.38)) and j n ( z p )is proportional to np (see relation 7.39). If the doping
n, of the emitter is substantially higher than the doping p, of the base, i.e.
if nn >> p, holds, then it follows from the mass action law that p, << np.
The hole injection current j p ( z n ) Afrom the base is therefore actually small
compared to the electron emission current j n ( z p ) Afrom the emitter. For
simplicity, we neglect it completely in what follows.
Suppose that the potential p~ of the base is changed by dpg, while
the potentials p~ and p c of the emitter and collector are kept constant.
591
(2)~~ (2%)
= cosh[(b - z p ) / L n ] d u g
uc ,
(7.45)
The current amplification 1,3 is defined as the ratio of the current change in
the collector circuit to the current change in the base circuit. From (7.44)
and (7.45) it follows that
(7.46)
Here, the extent x p of the space charge region in the base has been neglected
in comparison with the entire base width b. If b << L,, a current amplification occurs according to relation (7.46) in the sense that a small current
change in the base circuit leads to a large current change in the collector
circuit.
What is important, however, is not primarily amplification of current,
but amplification of electric power. Initially, one might think that power
amplification would be determined by the ratio of the power change in the
collector circuit to the power change in the base circuit. This ratio would
have, however, no value because in the switching scheme of Figure 7.7 all
the power of the collector circuit is transferred to heat. For electric power
to be useful, a working element must be included in the collector circuit,
say a resistor RL. The voltage drop across it is ~ c R L .Taken together with
the voltage between the emitter and the collector of the transistor, the total
voltage in the collector circuit is U c icRL. The transistor resistance is approximately given by the resistance of the blocked pn-junction between base
and collector. As such, it is practically independent of the base voltage. We
consider, in particular, the case in which R L equals the collector resistance of
the transistor. That means that the voltage drop across the working resistor
R L is the same as the voltage U c between the collector and emitter. What
we have to calculate are the power changes d P L in the working resistor R L
and d P B in the base circuit, if the base voltage U B changes by d U B while
keeping the collector-emitter voltage at a constant value U c . We have
(7.47)
592
Aluminum
I
\
SiOD
..........
..........
..........
.......... n-Silicon ..........
..........
:::::::::A
............
...........
..........................................
I/.._.J
n*-Silicon
p3iG-I
(7.48)
The ratio of the first term in the square bracket of (7.48) to the second is
kT to U g . Under normal conditions this is small, and neglecting it the ratio
of the power changes dPL and dPB takes the simple form
(Z)
= P UUC
B.
(7.49)
Therefore, the power amplification turns out to be proportional to the current amplification. As far as the proportionality factor is concerned, we
note that the voltage drop across the collector-base np-junction (biased in
blocking direction), is larger than the voltage U B across the baseemitter
np-junction (biased in flow direction). Because of this, the collector-emitter
voltage U c is larger than U g , and the the power amplification is in fact
larger than the current amplification. Small changes of electric power in the
base circuit are therefore correlated with large power changes in the working
resistor. Of course, this does not mean amplification of power in the sense
that a large power is generated by a small power. Rather, it means that a
large useful power is tuned by a small control power.
The base circuit is also called the input circuit, and the collector circuit
the output circuit. Using this terminology one can also say that the bipolar
transistor allows the control of a large output power by means of a small
input power. The input resistance of the bipolar transistor, i.e. that of the
base circuit, is relatively low, since the corresponding pn-junction is biased
in the flow direction. The output resistance, i.e. that of the collector circuit,
is relatively large compared to the former, because the second pn-junction
operates in the blocking direction. This particular feature of the bipolar
transistor essentially determines its application in electronic circuits. In
the case of an unipolar transistor like the MOSFET, the relationships are
593
Figure 7.11: Lineup of the valence and conduction band edges at a pn-junction
between moderately doped regions (a) and heavily doped regions (b).
reversed, as we will see below; there the input resistance is high and the
output resistance low.
Due to the planar fabrication technology of microelectronics, the bipolar
transistor, as it is actually used in integrated circuits, is structured differently
than the one used in our theoretical analysis. A more realistic structure is
shown in Figure 7.10.
7.1.6
Tunnel diode
594
).
0
0
02
084
u/v-
595
7.2
7.2.1
An
= Ap = I-g = I-
(E)
(7.50)
where I- is the recombination lifetime of the electron-hole pairs and Tzw is the
photon energy of the radiation (the latter is taken to be monochromatic),
Using equations (7.11) and (7.12), the corresponding quasi Fermi levels p n
and pP of, respectively, electrons and holes may be calculated. Due to the
optical excitation they have different values - that of the electrons is raised
and that of the holes is lowered in comparison with the common equilibrium
value. If a voltage is applied to the sample, the current flow is stronger
when the sample is illuminated than without illumination. This phenomenon
is called the internal photoeffect. Alternatively, if one keeps the current
constant, then the photoeffect lowers the voltage drop across the sample.
The illumination leads, so to say, to a negative pre-voltage of the sample.
The latter is just the difference of the quasi Fermi levels of the electrons and
holes.
In these considerations, a spatially homogeneous distribution of light intensity within the semiconductor sample has been assumed. However, since
the light is absorbed in the excitation of the electron-hole pairs, this assumption is not justified. The intensity decays exponentially in the direction of
light propagation. This leads to a corresponding spatial inhomogeneity of
the electron and hole concentrations, which gives rise to diffusion. If the diffusion coefficients of electrons and holes are the same, then diffusion makes
no contribution to electric charge transport since the electron and hole currents exactly compensate each other. If the diffusion coefficients are different,
however, as often actually happens, either a net current or voltage will arise
in the light propagation direction, depending on whether the two ends of
the sample are electrically connected or not. This phenomenon is called the
Dember effect.
The homogeneous semiconductor photoeffects discussed above are relatively small in comparison with the photoeffects which occur at semicon-
596
(7.51)
This upper limit is reached if all optically excited holes diffuse as far as the
plane 3: = xn, which means that they do not recombine, neither at the surface
nor in the bulk of the sample. We assume this condition to be fulfilled.
The flow of positive charge through the np-junction from its n- to its pside, is just the opposite of what happened at the np-junction in establishing
equilibrium without illumination by light. In the latter case, the transfer of
charge results in the diffusion voltage. Therefore, exposure to light reduces
the diffusion voltage. The voltage change U$ooto is called the photovoltage. In
the case of the open pn-circuit considered here, in which the outer planes of
the p- and n-regions are not electrically connected, the total current density
jgbl in a stationary state must vanish. The latter is composed not only of
the photocurrent density, jphoto, but also of the current density j through the
pn-junction under the photovoltage U$ooto. This voltage takes a value which
597
jgtal.
(7.52)
If the outer planes of the p- and n-regions are electrically connected (see
Figure 7.14), and if the resistance R in this circuit is 0 (short-circuit), then
the voltage drop across the pn-junction is also zero, and the current
tota
.O at = iphoto.
(7.53)
flows. If the resistance in the pn-circuit is neither 0 nor infinity, but has
a finite value R , then the photovoltage takes a smaller value UpRhoto
than
Uphoto, and the total current density takes a smaller value jEtal than j:oM.
The two quantities jEbl and Ugot0 are determined by the equations
(7.54)
598
(7.56)
This relation shows that Urhto is more sensitive to the light intensity I for
smaller saturation current density j, of the pn-junction. In subsection 7.1.4,
j, was estimated to be 10-"A/cm2 for a pn-junction made of Si. From this,
one can conclude that for hw = 1 e V , a light power of 10-l' W / c m 2 will
create a photovoltage of the order of magnitude 10 mV. The upper limit of
photovoltage is given by the diffusion voltage U D . This cannot be surpassed
were to exceed
because the space charge region would cease to exist if Upmbto
Uo. In the latter case, no separation of optically excited electron-hole pairs
could take place, which is crucial for operation of the device.
Photodetectors involving semiconductor junctions are referred to as photodiodes. Beside the pn-photodiode discussed above one has also pin-photodiodes
with particularly wide depletion regions (symbolized by the 'i' in pin). Other
examples are met al-semiconductor or heteroj unction photodiodes.
Photovoltaic element
In the case of the photodetector, the usable electric power is very small and,
in fact, for R = 00 it vanishes. If the illuminated pn-junction is to produce
electric power, then R cannot be made infinitely large. On the other hand,
R also cannot be taken arbitrarily small, because then there would be no
voltage drop at the pn-junction at all and the power would vanish for this
reason. For finite values of R , a non-vanishing electric power is generated,
with light energy converted directly into electric energy. The useful electric
power R(Ajztd)2 can be calculated by solving equations (7.54) and (7.55)
with respect to jzhland Upbto. The calculation shows that the efficiency of
photovoltaic energy conversion can, theoretically, reach up to about 40 %.
In practice, it is not much more than 10 % in most cases currently. Photovoltaic elements have wide and important practical application as solar cells,
particularly those made of silicon.
7.2.2
599
Laser diode
(7.58)
where g is the so-called gain coefficient. The latter is defined as the negative
of the absorption coefficient, strictly speaking, that part of the absorption
coefficient which is due to the excitation of electrons from the valence into
the conduction band. The dimension of g is, therefore, that of a reciprocal
length.
In order for g to be greater than zero, the semiconductor must be put into
an excited state. The energy of this excited state (relative to that of the
ground state) is the energy emitted in stimulated emission when irradiating
the semiconductor with light. If the emitted radiation is to prevail over that
absorbed, the number of stimulated transitions from the excited state to the
ground state must be larger than the number of transitions in the opposite
direction. This case is realized when, in the excited state, the conduction
band edge is occupied by more electrons than the valence band edge. This
is called an occupation inversion In the case of equal occupation of the two
600
d ( n - ntr),
(7.59)
601
Figure 7.15: Heavily doped pn-junction without voltage (a) and with voltage in
flow direction (b). In case (b) occupation inversion occurs in the vicinity of the
pn-boundary at x = 0.
as a function of applied voltage. Here, we need the relation between j and the
photon current density S, taking the electric current density j be given. As
we shall see, the functional dependence of S ( j ) can be calculated just from
the continuity equations for the carrier currents j,, j , and the photon current
density S. For simplicity, we assume a symmetric pn-junction, for which it
602
suffices to consider only the electron current density j,. Also, the active
region is symmetric with respect to x = 0 under this assumption, so that
it extends between -(1/2)x,4 and ( 1 / 2 ) z ~with ;CA being the total length
of the region. The continuity equation for the electron current density j ,
was written down in Chapter 5 in general form. The derived relation (5.20)
will be specialized here in the following way. First of all, we recall that the
total current density j = j n ( z ) +jp(z)is spatially constant in the stationary
state, while the two components j n ( x ) and &,(I) vary with I. On the righthand side of the active region, j n ( z ) is almost identical with j , while &(z)
practically vanishes. On the left-hand side of the active region, jp(x) has
almost the same value as j , while jn(x) is close to zero. Consequently, in
stationary state, we find for the active region the approximate relation
(7.60)
Relaxation processes which change the electron concentration in time include
non-radiative recombination as well as spontaneous radiative and induced
radiative recombinations. For the assumed symmetric pn-junction, the nonradiative recombination rate Rmr is determined by a common lifetime T
for electrons and holes, as follows,
An
R-p=
-.
(7.61)
7-
Applying expressions (7.57), (7.58) and (5.60), (5.61) to the continuity equation (5.20) for the electron current, we have
an
at
j
An
An
(7.62)
gs.
exA
78pDnT
To formulate the continuity equation for the photon current density S , an
expression for the pertinent photon density is needed. According to electrodynamics, this density is (S/c) where c denotes the group velocity of light
propagation. The source density of the photon current is zero since S flows
in y-direction and the pn-junction can be considered as homogeneous in this
direction. The photon density does involve,external sources, however, due to
spontaneous and stimulated radiative annihilation emission processes. Since
each radiatively recombining electron-hole pair creates a photon, the corresponding source density is - apart from a modification which will be detailed
immediately below - given by the sum of the radiative recombination rates,
i.e. it is the negative of the last two terms on the right-hand side of equation (7.62). The modification affects the second term (An/rSmT),which
describes spontaneous emission. Only a small fraction of the spontaneously
emitted photons have the same propagation direction and phase as those
due to stimulated emission. Thus this term must be multiplied by a factor
---=
603
S. The source terms due to spontaneous and stimulated emission must still
be complemented by an additional term, which accounts for light absorption
processes which are not due to transitions of electrons from the valence to
the conduction band (see above). We denote the corresponding absorption
coefficient by a. The pertinent (negative) photon source density is -as. In
a , one can also include losses due to the penetration of photons through the
resonator planes which are unavoidable because otherwise the laser would be
ineffective for the world outside. Finally, the continuity equation for photon
current density takes the form
(7.63)
where the left-hand side is the change of the photon density per second. We
are interested in the stationary state of the laser defined by
an
dS
- = 0.
(7.64)
at
at
We first consider g to be different from a. As we will see, in stationary state,
this necessarily means that g is less than a:
- --
g<cw
The corresponding photon current density S< follows from equation (7.63)
as
(7.65)
Since S<, An and rSpT are positive quantities, only the case g < a is
meaningful, as indicated above. The photon current density S< given by
equation (7.65) is relatively small because the spontaneous radiative lifetime
T~~~~ has relatively large values and P is small compared to one. This is
true on the proviso that g does not approach the value of a very closely, or,
in other words, as long as the electron concentration n is distinctly smaller
than the threshold value nth defined by the relation
0: =
9/(nttZ - nt7-1.
(7.66)
The dependence of the photon current density S< on the carrier current
density j follows from equations (7.62) and (7.63) as
s<=
+ ( l / P ) ( l + rspa~/r)(a- g<)I
ez~[g<
(7.67)
Here, g has been replaced by g< to indicate that the value of g has to be taken
as that which follows from g / ( n - ,ntT) for n < nth. If spontaneous emission
604
(7.68)
This equation relates the electric current density j and the electron concentration n below the laser threshold. The threshold concentration nth
corresponds to a threshold carrier current density j t h as
(7.69)
Laser operation can occur only if the threshold current density j t h is exceeded. We will assume that this is the case. Then, by definition, we have g
equal to a:
The photon current density can now take non-vanishing values if rVar = 00.
This does not contradict the photon current continuity equation (7.63) under
the stationary conditions (7.64) because (g - a ) S now vanishes since (g - a)
does so. The actual value S> of S for a given current density j > j t h may be
obtained from the carrier continuity equation (7.62). In the approximation
rspar = 00, it yields
(7.70)
We will show that S> is in fact considerably larger than Sc. To this end,
we consider S> given by equation (7.70) for j = 2jth, and examine the ratio
of s>(2jth) to s<of (7.67) for j = j l h , obtaining
605
3
\
0
v)
/
Jth
j 1a.u.
Generally, the spontaneous radiative lifetime 7spaT is much larger than the
non-radiative lifetime T. One has (7spaT//37) > 1 since ,B << 1. The ratio
( g < / m ) is a positive number measurably smaller than 1. Therefore, the
right-hand side of equation (7.71) is large compared to 1, meaning that
the value of the photon current density S, above the threshold current
density substantially exceeds the value S< below threshold. This is the
manifestation of the laser effect. One can also say that the pn-junction
functions as a laser diode when j > j t h holds. The functional dependence
of the radiation intensity over the entire current density range is shown
in Figure 7.17 schematically. The characteristic feature of the curve is its
steep rise at the threshold density jth. This point marks the transition from
spontaneous to stimulated emission, thus the onset of the laser regime.
To estimate the order of magnitude of the threshold current and of the
radiation intensity of a laser diode, we choose parameters close to those of
G ~ A SWe
. take ntr =
c m ~ nth
~ ,= 2 . 1 0 ~ ' cmv3, X A = 1 p m , r =
lo-' s, and a = lo3 cm-'. With these values the threshold current density
of (7.69) is j t h = 103A ern-'. The corresponding current strength amounts
to 1 m A if a contact area of 10 x 10 ,urn2 is assumed. For current strength
2 m A the radiation intensity RwS within the resonator is about 10 mW. This
example shows that the radiation intensities of semiconductor laser diodes
are large enough to be employed for practical purposes, in particular in
information processing and communication. Currently, semiconductor laser
diodes are widely used in optical fiber communication, CD-players, scientific
instruments and in many other ways.
The spontaneously emitting diodes (LEDs) have also attained wide practical application, e.g., for signal generation and pattern display. Since the
radiation emitted by laser diodes and LEDs lies in the photon energy range
close to the band gap, one can theoretically cover the entire frequency spectrum from the infrared to the near UV by a proper choice of semiconductor
606
7.3
607
a)
0 - r
Metol
u-0
n- Semiconductor
c)
bl
___.
---I---: L-
---T'
x x x
iI -----I
I
L
XB
UCO
u70
i":
+t+
\
x
A_I
I
I
I
(7.74)
The application of a voltage U causes the space charge barrier of the semiconductor to change, for U > 0 it is lowered (Figure 7.18b), and for U < 0 it
608
)
the semiis increased (Figure 7.18~).Accordingly, the current i s ~ ( Ufrom
conductor into the metal increases (U > 0) or decreases (U < 0). However,
the current from the metal into the semiconductor remains constant since
the potential barrier to be overcome by the electrons, the Schottky barrier
Cpg, does not change with applied voltage. Thus,
iMS(U)= iMS(0).
(7.75)
The total current density i in the presence of the voltage U may then be
written as
(7.76)
i ( U ) = i ~ s ( 0-) i s ~ ( U ) .
We first calculate i ~ s ( 0 )In. this, we consider a metal electron which, after
passing into the semiconductor, has quasi-wavevector k and energy Ec(k).
Since no momentum and energy change takes place in crossing the interface,
the quasi-wavevector of this electron is k and its energy Ec(k) in the metal as
well. The probability to find an electron in the metal with energy E = Ec(k)
is given by the Fermi distribution f i ~ ( Eof) the metal,
(7.77)
The current density i ~ s ( 0is) the statistical average of the speed component
h-'(Ll/LlkX)Ec(k) in the positive 2-direction, per unit volume. According to
the general rule (5.3) for calculating statistical averages we have
The boundaries of the first B Z were put to infinity here, involving only a
slight error. In fM(Ec(k)) we replace -EFM by the work function @ of the
metal. This also indicates that, henceforth, the energy origin will no longer
be taken at the top of the semiconductor valence band, but at the common
vacuum level of the two materials. Since [Ec(k)+@] >> k T , the Fermi distribution (7.77) can be approximated by the Boltzmann distribution, whence
(7.79)
For Ec(k) we use an isotropic parabolic dispersion law with effective mass
m:. Then relation (7.78) takes the form
609
i ~ ~ (=0 )
-OB/kT,
(7.81)
with
(7.82)
as the average speed component in positive z-direction and n as the electron
concentration in the semiconductor. Somewhat surprisingly, through mE and
n quantities characteristic of the semiconductor occur in expression (7.81)
for i ~ s ( Oalthough
),
the current originates in the metal and one might have
expected the appearance of characteristic metal quantities only. The reason
is that for U = 0 (which is the case we are considering now) thermodynamic
equilibrium exists, and the current from the metal into the semiconductor
must equal the current from the semiconductor into the metal. That the
latter depends on semiconductor quantities is obvious.
0 ) the semiconductor
The interpretation of i ~ ~ (as0 a)current i ~ ~ (from
into the metal in thermodynamic equilibrium may also be used to calculate
i s ~ ( Uin) the presence of a voltage. For U = 0, first of all, this interpretation
yields the expression
,eU/kT Z S M ( 0 ) .
I
(7.84)
(7.87)
6 10
For positive voltages U , the current density j grows exponentially with increasing U , while for negative U of increasing magnitudes, j approaches
the saturation current density -js. The latter occurs when 1UI >> kT holds.
The Schottky junction therefore operates as a rectifier, with positive voltages
corresponding to the flow direction, and negative to the blocking direction.
The following estimate shows that j , can assume quite large values. For
T = 300 K , N D = 10l6 cm-3 and @B = 0.25 e V , we have j ,
0.2 A / c m 2 .
From this we may conclude that Schottky junctions are suitable for rectification of relatively strong currents. This distinguishes them from pnjunctions where the saturation current density is commonly much lower (see
section 7.1). The reason for this difference is that the saturation current of
a pn-junction is due to minority carriers, while the saturation current of a
Schottky junction is caused by majority carriers.
The conditions of validity for the current-voltage characteristic of a metalsemiconductor junction derived above will now be examined. The application of formula (7.83) to the calculation of the current density iSM(0) reflects
the assumption that thermionic emission of electrons from the semiconductor into the metal proceeds unhindered. To appreciate the significance of
this assumption one must first recognize that the emitted electrons originate
in the bulk region of the semiconductor, for in the depletion region none are
available. Unhindered emission can only occur, therefore, if the electrons,
during their flight through the depletion region suffer no collisions. That is
assured if the mean free path length Zf is larger than the space charge width
"B,
lf > X B .
(7.88)
For practical Schottky junctions, operating in blocking direction, this condition is often fulfilled, provided the blocking voltage is not too large. For
flow voltages of sufficient magnitude it is always correct. Condition (7.88)
excludes the possibility that the depletion region can even approximately
be in a local equilibrium state. In particular, chemical and electrochemical potentials cannot be meaningfully defined, not even in a local or 'quasi'
sense. This implies that an essential requirement for the applicability of
the phenomenological equations (5.14) and (5.15) for the current densities is
no longer valid. If the inequality (7.88) is satisfied, drift and diffusion lose
their meaning as transport mechanisms in the depletion region of a metalsemiconductor junction. The transport proceeds by electrons flying through
the depletion zone unimpeded, which is termed ballistic transport. If, instead
of l f > ZB, the condition
lf << X g
(7.89)
611
612
row that the electrons can tunnel through the potential barrier, then even
the thermionic emission theory is no longer valid. This occurs in heavily
doped semiconductors. In this case the Schottky contact conducts well in
both directions, and it becomes an ohmic contact. This effect is exploited
in manufacturing contacts in silicon devices. The frequently used contactmetal, aluminum, would in fact result in a non-ohmic Si-metal contact if the
doping were not sufficiently high. The heavy doping of Si makes this contact
ohmic, as is necessary for the proper operation of devices.
7.4
In Chapter 6 we saw that the properties of an insulator-semiconductor junction in thermodynamic equilibrium depend on whether interface states are
present or not. If such states do exist, the bands at the interface are already
bent without an applied voltage, and one has a depletion or accumulation
layer there. These arise in consequence of the spatial redistribution of free
charge carriers in the presence of interface states - some of the carriers are
either captured or generated by these states (see Figure 7.19). The semiconductor as a whole thereby remains electrically neutral, of course. This
changes if a voltage is applied.
7.4.1
Field effect
613
u<o
UO
Figure 7.20: Formation of depletion and accumulation layers at an insulatorsemiconductor junction subject to an applied voltage.
from t, then the charge on the metal contact has to be multiplied by the
factor ( I / ) to get the induced charge. Since, in general, E I is smaller than
t (ferroelectrics are an exception), the induced charge is generally a fraction
of the charge placed on the metal contact. Inducing charge at an insulatorsemiconductor junction by applying a voltage is also referred to as field effect.
This effect has played an important role in the development of semiconductor
physics. The thought of exploiting it to induce a change of semiconductor
resistance by means of an external voltage, hence, to develop a kind of
a transistor, is obvious and was followed by several scientists in the late
twenties and thirties of this century. Initially, the efforts were not successful,
however. Instead of the field effect transistor, the bipolar transistor was
invented in 1949 - in a sense as a byproduct of the search for the former.
The main reason for the failure of the early work on the field effect transistor
614
is the existence of interface states at the junction of the insulator and semiconductor. If such states are present in sufficiently high density, which was
in fact the case in the early work, all of the induced carriers are captured
by them. Since carrier mobility in these states is zero, the resistance of the
semiconductor does not change. Generally, the number of electrons captured depends on how many unoccupied interface states are pushed below
the Fermi level by applying the external voltage (see Figure 7.19c), or how
many previously occupied states are lifted above the Fermi level (see Figure
7.19b). The amount of captured charge represents, in this way, a measure of
the number of interface states in an energy interval at the Fermi level of the
size of the applied voltage. By measuring this charge as a function of applied
voltage U , which can be done, e.g., by measuring the capacitance change of
the insulator-semiconductor junction with U , one may obtain experimental
data about the density of interface states at different energies.
7.4.2
Inversion layers
For a field effect transistor to work well, the density of interface states
must be as small as possible. In the following, we will neglect these states
completely and will study the effect of an applied voltage on an insulatorsemiconductor junction under these circumstances. Qualitatively, it can be
ascertained immediately, that for not too large voltages U , an accumulation
or depletion layer is formed depending on the sign of U and the type of
semiconductor (see Figure 7.20). Below, we assume a p-type semiconductor.
Starting from U = 0, an increase of U bends the bands of the semiconductor
at the interface with the insulator downward. Increasing U further, sooner
or later, a point is reached at which the conduction band energy Eg - ecp(0)
at the interface dips down below the Fermi level EF (see Figure 7.21). This
happens if the voltage drop across the semiconductor Us = cp(0) - cp(00) is
larger than (Eg- E F ) / e 3 Cro, i.e. if
(7.92)
holds. Increasing the voltage U further, the conduction band edge E,-ecp(O)
at x = 0 is pushed further down, and the region of the semiconductor in
which the conduction band edge lies below the Fermi level extends to the
right to a point xi > 0 given by the relation
E g - ecp(zi) = E F .
(7.93)
(see Figure 7.21). In this region the free charge carriers are not predominantly holes, as otherwise in a p-type semiconductor, but rather, they are
electrons. The conduction carrier type is inverted in this region, which is
therefore called an inversion layer. The electrons in the inversion layer of a
615
(7.95)
If the Fermi energy E F lies below the conduction band edge, then the majority of electrons in this band have energy values at and a few kT above the
band edge E, - ecp(x). As long as [E, - ecp(z) - E F ]is substantially larger
616
than kT,f ( E ,x ) is approximately zero for the energy values mentioned. If,
on the other hand, the Fermi level lies above the conduction band edge,
i.e., if the expression [E, - ecp(x) - EF] is negative and its absolute value is
x ) is approximately 1. The
simultaneously much larger than k T , then f(E,
energy interval in which f(E,x ) is neither 1 nor 0 decreases, as compared to
the whole potential interval eU, spanned by the potential e p ( x ) in the semiconductor, as eU, increases in comparison with k T . We therefore suppose
that
k T << eU,
(7.96)
holds. In a sense, this relation represents a generalization of condition (6.35)
for the validity of the Schottky approximation. Then, considering (7.95) for
energy values of the conduction band, we can approximately write
f(E,X ) = B [ E F - E
+ ecp(x)]
E > E,.
(7.97)
(7.99)
Evaluating the energy integral for n ( x ) in (7.98) with the square-root expression (4.44) for the density of states p:&&(E) x p;"'(E), a concentration
n ( z )follows which is proportional to [ e p ( x ) EF - E g ] 3 / 2 .Substituting this
into the Poisson equation (5.24), the solution leads to inverse elliptic functions which are difficult to handle analytically. We therefore replace the true
density of states by a model state density Fc which is constant in the energy
617
interval between E, and EF ecp(x), with the constant 7, of the model determined as follows: We form the average value of the true square-root-like
density of states p?(E) in the occupied energy interval at x = 0, i.e. between Eg and EF eUs. If we were to use this average value for the density
of states throughout the entire inversion layer without any correction, then
the resulting average electron concentration in the inversion layer would be
markedly larger than it really is. Therefore, a correction factor y having a
value between 0 and 1must be introduced in consideration of the decrease of
the upper limit of the averaging interval with increasing I . Thus, we make
the substitution
(7.100)
Using this constant density of states, the electron concentration n(x) in the
inversion layer may be rewritten in the form
where
(7.102)
is the effective density of states of our model. The substitution (7.100) has
the consequence that the true density of states, and with that also the true
electron concentration, is underestimated in the left part of the inversion
layer, and overestimated in the right part, provided that a reasonable choice
of y has been made. Because of this, the potential profile p(x) of the inversion layer, whose calculation we will now discuss, has a curvature in our
model which differs somewhat from that in reality. Qualitatively, however,
no significant differences occur. For a quasi 2-dimensional electron system,
the results of our model even apply exactly, since in this case the density of
states is constant from the outset.
With the resulting concentrations of free charge carriers as derived above,
the Poisson equation (5.24) in the semiconductor takes the form
IO1
-d _<
I:0,
(7.103)
The solution of this equation must satisfy the boundary conditions (7.90) and
(7.91). The arbitrary constant of the potential is chosen such that ( ~ ( 0 0=
) 0
618
holds. From the Poisson equation it immediately follows that the potential
and the field strength also vanish at x = xp, i.e. that
V b P ) = 0,
(7.104)
(7.105)
u - (X+ d)FI,
Uo
-d 5 x
< 0,
0 5 x < xi,
xi
i x < Xp,
(7.108)
with
(7.109)
as the characteristic screening length of the electron gas of the inversion
(7.110)
as the electric field strength in the semiconductor at the interface with the
insulator, and
Ua = U
- FId
(7.111)
(7.112)
619
(7.113)
(7.115)
The number ns of electrons stored in the inversion layer per cm2 may be
obtained from (7.101) and (7.108) as
ns = N,L.
(7.116)
The sheet density ng of electrons in the inversion layer is zero if the voltage
drop Us across the semiconductor equals the threshold voltage Uo marking
the onset of inversion. Above this value, ng has a weak overlinear dependence on Us. Thereby, U , depends on the applied voltage U through the
relation
(7.118)
The ratio of insulator thickness d to the screening length L of the electron
gas of the inversion layer is decisive for the size of the variation of ns as a
function of U . For d >> L , ns is practically independent of U . In order that
the accumulated charge density ns can be tuned by means of U as effectively
as possible, we must have d << L. With U s - UO M 1 V and mz M m , the
corresponding value of L is about 300 A. The insulator layer must therefore
be extremely thin. In the limiting case d << L and Us >> Uo, we have
620
(7.119)
With U M 5 V , this yields (dn,/dU) M 1013 crn-'V-l.
A voltage increase
of 1 V results in about lOI3 electrons being induced into the inversion layer
per cm2.
Electrons in an inversion layer have been the subject of many physical
investigations. Like carriers at a semiconductor heterojunction (see section
6.2), these electrons are confined in a potential well. Just above the bottom
( r i 0.1 e V ) , the width of this well is still smaller than L. Thus, carriers
of such energy are freely mobile only parallel to the well. One has a quasi
2-dimensional electron gas. In it, confinement effects, like the formation of
subbands, can be observed. By means of an applied voltage, the density of
the electron gas can be varied. In this context, the accumulation layer at the
interface between SiOz and Si has been of particular interest. Historically, it
was the 2D gas of this layer in which carrier confinement effects were studied
first, and in which the quantized Hall effect was discovered.
7.4.3
MOSFET
u-+utuv.
(7.120)
For U = 0, the semiconductor region between source and drain, the so-called
channel, has high intrinsic resistivity. If a positive voltage U = UG is applied
between the bulk of the p-semiconductor (substrate) and the metal layer on
62 1
a1
P
Substrat
Figure 7.22: Structure of a MISFET (a). In part (b) the switching scheme of the
MISFET is shown.
E
..
.
n
v)
-0
12
16
20
24
Figure 7.23: Influence of the gate voltage UG on the I s 0 - versus - U s 0 currentvoltage characteristics for a n-channel MOSFET of enhancement-type. (After Vzilz,
1986.)
top of the insulator (gate), large enough to create an inversion layer, then
the channel becomes a good conductor. The current in the source-drain
circuit can be tuned by the voltage UG of the gate-substrate circuit since the
electron density of the inversion layer depends on UG. The voltage UG is
called gate voltage, and the minimum gate voltage UG necessary to a achieve
inversion is the threshold voltage. If one also includes a working resistor
in the source-drain circuit and compares the power changes in this resistor
with those in the gate-substrate circuit, one finds that the former exceeds the
latter appreciably. In this sense, the MISFET operates as an amplifier, just
like the bipolar transistor. Because of the insulating layer below the gate
electrode, the MISFET has a much larger input resistance than the bipolar
622
transistor. The output resistance of the MISFET, i.e. that of the source
drain circuit, is small because of the accumulation layer between source and
drain.
If the MISFET is realized using Si as the semiconductor material and
SiO2 as the insulator, one has the Metal Oxide Semiconductor Field Effect
Transistor, abbreviated MOSFET, which is by far the most important MISFET. The basic requirement that the electrons of the inversion layer shall
not be captured by interface states, are met by the MOSFET extremely
well. Just as the base of a bipolar transistor can be made either of p or
n-material, one also has two possibilities in the case of the unipolar MOSFET - the charge carriers in the conducting channel can either be electrons,
as has been assumed thus far, or holes. In the first case one speaks of a
n-channel MOSFET, and in the second of p-channel MOSFET. The prevoltage U V ,which was introduced above only formally to simulate the intrinsic
state, can really exist for various reasons, e.g., because of positively charged
centers within the oxide arising during its formation. The prevoltage can
take such large values that inversion exists even without an applied gate
voltage. Then the transistor is already in its conducting state at zero gate
voltage. To switch it into its blocking state, one must apply a negative gate
voltage in the case of the n-channel MOSFET, and a positive gate voltage
in the case of the p-channel MOSFET. One says that the transistor operates in the depletion mode. If the transistor is blocked without an applied
voltage and changes into the conducting state by applying a positive (for
n-channel) or negative (for p-channel) gate voltage, one has the enhancement mode. The current-voltage characteristics of the source-drain circuit
of a n-channel-enhancement-mode MOSFET are shown in Figure 7.23 for
different gate voltages. One recognizes how the sourcedrain current at a
h e d value of U s 0 increases with increasing gate voltage UG. Somewhat
unexpectedly, for a k e d gate voltage UG, the current saturates at higher
source-drain voltages USD. This is a consequence of the fact that the effective gate voltage in the vicinity of the drain electrode becomes smaller and
smaller as the drain potential grows larger. This creates a pinch off of the
inversion layer, making the current stay constant.
The MOSFET represents the most important electronic component of
digital circuits in microelectronics. Here it is used as an electrically controllable switch, meaning that its control function is reduced to two states only,
one with maximum output power, corresponding to a binary l,and another
having minimum output power, corresponding to a binary 0. Today, MOSFETs can be made as small that millions can be integrated in one single
Si chip, more than of any other electronic component. It is this ultra-large
scale integration (ULSI) technique which has made modern computer and
communication technologies possible.
623
Appendix A
A.l
A.l.l
Group definition
2) The multiplication obeys the associative law, i.e. for three arbitrary
elements 91, 92, 93 one has the identity
624
that
g.g-'
=;
g -1 - g - e .
(A.3)
Onc may show that it suffices to dcmand the existence of only the left-sided
or only the right-sided inverse. This one-side inverse is thcn uniquely determined and its existcnce automatically leads t o the existence and uniqueness
of the other-siticd iiiverse. T h e other-sidd inverse thusly determined is identical with the original oneside in?-erse. Analogous dstements hold or the
identity dement discussed above.
The inverse (91 . g2)-' of tl product gl . g:! is given by gz1 . gl1: as one
may easily verify by explicit. mult.iplicat.ionof t h e two products.
The existence of the inverse ensures that t.he products g1 . g and g2 . g
of two elements g1 and g~ with an arbitrary group element g are different,
if the elements g1 and 92 differ from each other. The same is true for the
re-ordered products g .g1 and 9 92.
The number N of elements of a group is callrd its order. Depending on
whether N is finite or infinite, one has finiteor iu,~%ittgrotqs. ' h e dements
of infinite groups may be either discrehe or continuous. In the latter rase
one has a coatinuow group.
The group multiplication which i s associative by definition, need not also
be commutative. In general: g1 . $2 differs from g2 . gl. If the multiplication
is also commutative, the group is call& A h e l i a ~ ~
We now introduce some goup-thcoretical concepts which are n d e d in
this book.
A.1.2
Concepts
For finite groups G, the theorem of Lagrange holds, wherein the order 1L"
G I is a divisor ol thp order N of the group. Thc proof of
this theorem will be briefly sketched because it also provides some insight
in other group properties. Let be ,g! = E,g$,. . , g h , the elements of a
subgroup G" of G We denote an arbitrary element of G not contained in
G' by gz. 'I'he products 92 gi, g 2 . gh,. . . , g2 . gh, of g:! with dl elements
of the subgroup G' form a 5et M,<, the elements of which are different from
each other as well as from the dementb of G'. For the sake of uniformity
of notation, w e set C'
M,<. If the two sets M1< and M; together do
not cover trhed i r e group C , WP choose an element 93 of G not contained in
AT,< and M,<. We proceed in the same way with this element as we &d with
of a subgroup
625
92. The thus emerging set M3< contains N' mutually different elements, just
like M,< and M 2 , and none of these elements occurs in M1< and h42. If the
entirety of the sets M:,
and M Z still do not yet cover the full group
G, we choose yet another element 94, and so on. After a finite number N f f
of steps a complete division of the group G into N" distinct subsets with N'
elements in each will be achieved. Therefore, one has N = N' ' N" which
proves the theorem.
Lagrange's theorem allows us to draw several conclusions. For groups
of prime order it means that only trivial subgroups can exist, namely the
identity element, and the group itself. This statement may also be reversed
- the order of a group which has no subgroups apart from the trivial ones,
must be a prime number.
Cyclic subgroups
We consider two groups G and G' of, respectively, orders N and N', and
626
examine the products between all elements g k of G and all elements gh, of
G'. The whole set of these N . N' products g g i , = g k . gLi forms a group G"
of order N . N'. This statement may be easily verified by showing that the
product of two of the elements of GI' is again an element of G". The group
GI' is called the direct product of the two groups G and G I , for which one
writes G" = G x G'.
f = f( 9 ) .
(A.4)
f (91) . f (92)
(A.5)
where g1 and 92 are two elements of G. If the mapping is not only unique but
is also bi-unique, i.e. if not only each element of G corresponds to exactly one
7
627
A.2
Rigid displacements
A.2.1
Definition
Formally one may define rigid displacements as transformations g of 3dimensional space which shift its points x by vectors a(x) in such a way,
x 2 not
that the distance I x1 - x2 I between two arbitrary points ~ 1 ~ does
change. Thus, we have
or
2
(A4
The multiplication dot in (A7) and (A.8) signifies the internal or scalar
product. In order to satisfy equation (A.8), the displacement field a(x)
cannot be of higher order in x than of the first, i.e. a(x) must have the
general form
a(x) = px
+a
(A.9)
628
gx = a x + a,
(A.lO)
(A.ll)
The multiplication 'dot' between the two operators symbolizes the consecutive application of the operations. Since (XI - xp) represents an arbitrary
vector, relation ( A . l l ) implies that
a + . a = 1.
(A.12)
= ,-I.
(A.13)
Linear operators which act in a real vector space (as we assume here) and
satisfy the relations (A.12) or (A.13), are called orthogonal. The transformation they describe are orthogonal transformations.
A.2.2
Translations
tax = x
+ a.
(A.14)
With ta, one can rewrite equation (A.lO) for gx in the form
(A.15)
whence
(A.16)
which is stated as theorem below.
Theorem
629
This theorem remains valid if one reverses the sequence of the orthogonal
transformation and the translation, but one must then use another translation in order to obtain the same displacement g as before, specifically
t,
(A.17)
a = a!.ta-la.
'
(A. 18)
= ei.a e k .
aik(a)
The transformation a itself may be expressed in terms of the dyadic structures e i e k composed of two vectors e i and e k as follows:
(A. 19)
a =C C a i k ( a ) e z e k .
i
Cai&)eiek
'
v.
(A.20)
It follows that
aei
=Caki(a)ek=Ca$(a)ek.
k
(A.21)
x =Cxiei,
(A.22)
where
xi = e i .
(A.23)
(A.24)
630
(A.25)
This is to say that the coordinates themselves transform with the matrix
A ( a ) , in contrast to the basis vectors whose transformation is governed by
the transposed matrix [ A ( a ) I T .
The product 0 1 . a2 of two orthogonal transformations a1 and ag, understood in the sense of sequential application, is represented by a matrix
A ( a 1 . ag) which one can easily determine with the help of relation (A.19),
obtaining
[ A ( a ) l T= [ A ( a ) ] - l .
(A.27)
Real matrices with this property are called orthogonaL Thus orthogonal
transformations are represented by orthogonal matrices with respect to orthonormalized basis sets.
Relations which follow immediately from (A.27) are A . AT = 1 and
A T . A = 1, which may be written in terms of matrix elements as
This means that orthogonal matrices have the property that different rows
are orthogonal to each other, and so are different columns.
The product of two orthogonal matrices is again an orthogonal matrix.
Combining this result with equation (A.26), we arrive at the conclusion
that the product of two orthogonal transformations is also an orthogonal
transformation. The implicit assumption behind this result is that both
transformations have a common point which remains unchanged (the point
x = 0 here).
631
t$
(A.29)
Since only the direction, but not the length, of e is determined by this
relation, equation (A.29) means that all points given by xo = <e with
as arbitrary real number which lie on a straight line in the direction of e
through the origin, are transformed into themselves. One says that they are
fixed points of the transformation a. The orthogonal transformation (Y with
Det[A]= +1 therefore corresponds to a rotation about the axis defined by
xo. A special rotation is the identity transformation a = E .
In the case D e t [ A ]= -1 one has
<
632
ae = -e.
(A.30)
A=
cos $ cosp
sin$ cosp
- cos 6 sin$sinp
+ ws6 cos $ sin9
sin 6 cos p
sin 6 sin p
- sin 6 cos $
cos 6
Using this matrix, a reflection in a plane normal to the direction (sin8 sin$,
- sin 8 cos $, cos 0) may be traced back to a reflection in a plane normal to
the z-axis, which is easily described. The matrix A of the reflection in the
plane normal to the above cited direction thus reads
cos 28 sin $
+ cos $
cos 28 cos $
- sin 28 sin $
- sin 28 sin $
sin 2$ sin 0
sin 28 cos $
+ sin $
sin 28 cos $
- cos 28
(A.32)
A.2.5
633
b)
ax0
= XO,
(A.33)
634
with
x o a = 0.
(A.34)
(A.35)
xo = (2x0 ra + sa-la
(A.36)
635
are glide reflections. These results allow one to express the above stated
theorem in the following, more specific form:
Theorem 2
A n y rigid displacement is either a screw rotation ore a glide reflection, o r a
product of the two.
A.3
A.3.1
Translations which transform crystal lattices into themselves only may have
displacement vectors a equal to lattice vectors R. We will show that the set
of all lattice translations t R forms a group. The product t R 1 . t R , of two
translations tR1 and t R z is defined as the translation which results from the
consecutive application of the two individual translations. One may express
this by writing
which yields
tR1
'
tRz = t R ~ + R ~
(A.38)
The latter relation means that the product of two lattice translations defined
in equation (A.37) is again a lattice translation. This set therefore forms a
group if points 2), 3) and 4) of the general group definition are also fulilled.
That this is so may be demonstrated easily. Using relation (A.4) one can
readily show that the product (A.38) obeys the associative law (point Z),
that an identity element exists, namely the translation through the lattice
vector 0 (point (3)), and that the inverse of each element t R belongs to the
group, namely the translation t-R through the lattice vector -R (point 4).
Each lattice translation can be expressed as a product of translations
through three primitive lattice vectors al,az, ag, so that
(A.39)
Since the sequence in which the primitive translations are executed plays
no role, all other lattice translations also commute with each other, i.e. the
636
A.3.2
Point groups
Definition
Point groups are sets P of orthogonal transformations a , namely of rotations,
reflections, rotation-reflections , rotation-inversion s and the inversion itself,
which transform geometric bodies having planar boundaries, i.e. prisms,
tetrahedrons, cubes or other polyhedrons, into themselves. One may thus
define point groups as the symmetry groups of these bodies. In this context
their elements are called point s y m m e t r y elements or point s y m m e t r y operations. That the set of all point symmetry operations of a geometric body
actually forms a group, may be seen as follows. Firstly, as has already been
demonstrated above, the product a1 . a2 of two orthogonal transformations
a1 and a2,understood as their consecutive application, is itself again an orthogonal transformation. The condition under which this statement holds,
namely the existence of common fixed-points, can be satisfied in the case at
hand. One has to keep in mind, however, that the various symmetry rotation
axes and mirror planes of the geometric body have to be chosen to have at
least one point in common. Secondly it is also clear that the transformation
a1 . a2 maps a body into itself if this is true for a1 and a2 separately. Thus
the product also belongs to the set P of all point symmetry operations as
well. The validity of the associative law (a1.a2).ag= al.(aq.ag)
is evident,
likewise the existence of a identity element and of the inverses. This means
that, with respect to the above defined product, P does in fact form a group.
Since rotations about two different axes and reflections at different planes
do not, in general, commute the point groups are in general non-Abelian.
The number n of the different rotations about a particular axis is finite in
all point groups. From this it follows that the rotation angles may only differ
by multiples of (an/.). Moreover, only a finite number of different rotation
axes and mirror planes occur. Point groups are therefore finite groups. This
follows from the restriction to bodies with planar boundaries. If we would
had also allowed for the sphere as geometrical body, then the symmetry
operations would have included rotations about arbitrary axes and through
arbitrary angles, likewise for reflections at arbitrary planes. Then one would
no longer have, however, a point group, but the full orthogonal group. If
one removes from the latter group all elements which involve a reflection or
inversion and retains only pure rotations the remainder is the full rotation
group. Both groups, the full orthogonal group and the rotation group, are
infinite. The point groups may be understood as finite subgroups of the full
orthogonal group.
Beside geometric bodies having planar boundaries, there are yet other
point sets which are transformed into themselves by point groups. In the
637
main text we have shown that such sets include infinite point lattices. In
this case, however, only 7 special point groups are admissible as symmetry
groups. As far as ideal crystals are concerned, these are neither geometric
bodies nor infinite lattices. Therefore, the symmetry groups of crystals, the
so-called space groups, are not necessarily constituted of point groups. This
is not so for the symmetry groups of equivalent crystal directions, which
were also considered in the main text. These are point groups, but again
only special point groups are possible, altogether 32. One calls them point
groups of equivalent crystal directions or, in short point groups of directions.
Description of point groups
Generating symmetry elements
638
ml
cn
.t
"2
-___---
b)
el
639
?:'
I
Sni
f)
e) A rotation by 2 ~ / about
n
a vertical axis followed by a reflection at the
horizontal plane ffh containing the axis, i.e. a rotation-reflection . The
Schonflies symbol for this element is Sn. By definition we have
sn=Uh'
cn=cn
'
Oh'
(A.40)
The two factors in this product commute because the reflection U h does not
change the directionof the rotation axis and the absolute value of the rotation
angle, if it is applied before instead of after the rotation. The direction to
which the rotation axis points is reversed, of course, however, with this, the
sign of the rotation angle is also changed, so that in both cases we have the
same rotation.
The symbol Sn is also used to denote an n-fold rotation-reflection axzs,
which means the whole set of powers of 27r/n.
640
S,i = I .
c, = c,.
I.
(A.41)
The two factors commute for the same reason as in the case of rotationreflections considered above. Rotation-inversion axes, which are understood
as the set of all powers of S,i, are used in the international system. The
notation for an n-fold rotation-inversion axes is E.
Stereograms
64 1
sphere around the common fixed-point into another point on the surface
of the sphere. The elements of a point group are thus put into one-toone correspondence with points on the surface of a sphere. At best, one
imagines the sphere as a globe. To achieve a planar representation) one
utilizes, as in cartography, the stereographic projection. This is illustrated
in Figure A.5 with the south pole as the reference point. In this case the
northern hemisphere is projected onto the region within the equator, and
the southern hemisphere onto the region outside the equator. In order that
only interior points need to be considered) one takes the north pole as the
reference point for projection of the southern hemisphere. Points of the
northern hemisphere are shown as full circles in the projection, and points
of the southern are blank circles (see Figure A.5). It can be shown that the
stereographic projection preserves circles and angles, i.e. circles (including
straight lines as circles of infinite radius) are projected into circles, and the
angles between intersecting tangents of circular segments also remain the
same in the projection. As for the elements of the point group, one can
also represent rotation axes and mirror planes by means of stereographic
projection. Figure A.6 displays the corresponding graphic symbols. A nfold rotation axis, for example, is depicted as a small regular n-angle placed
at the projection of the intersection of the axis with the northern hemisphere.
A mirror plane is illustrated by the projection of the arc on which the plane
intersects the northern hemisphere. That the equatorial plane is a mirror
plane may be recognized by the fact that for each full circle one has a blank
circle at the same place. The procedure described here allows a 2-dimensional
illustration of the point groups. The result of the projection is called a
stereogram.
The stereograms may also be used to obtain the product of two group
elements. As an exercise, the reader should try to verify, in this way, the
following helpful relations between generating elements:
u2 . U v = U v .u2 = U h ,
u2. U h = U h . u2 = U V ,
where U2 is the horizontal axis given by the intersection of the two planes
uv and Uh. Analogous relations hold with U d replaced by uv. Furthermore,
one has
Uh.c 2 = c 2 Uh = I ,
(A.43)
'
I .Uh
Oh
'
I =c 2 ,
I . uv = U V 'I = u2 ,
c2
'
I = I . c2 = U h ,
u2. I = I
. u2
uv.
6 42
1) Rotations through the same angle with respect to symmetrically equivalent rotation axes are mutually conjugate.
cn (4
The simplest point group of the first kind consists of the n powers CA, C$. . . , Cg of the rotation C, around a vertical axis. These are rotations
),
. 2,. . . , (21r/n) . n = 2 ~ The
.
rotation
through, respectively, ( 2 ~ / n (2n/n)
C c is the unit element E. Using the terminology introduced above one may
643
briefly say that the group consists of an n-fold rotation axis. Thus the notation for the group is C, in the Schonflies system, and n in the international
system. The multiplication of two rotations Ck and C i leads to
c,. c i = c y .
(A.44)
If k + j is larger than n, one may replace C;+j by C,k+j-.. The product thus
belongs to the set of elements CA, C:, . . .. Table A . l shows the complete
multiplication table of the group CJ. The product in its kth line and Zth
column is the result of the multiplication of the lcth element as left factor
and the lth as right. In the group C, the multiplication order plays no role,
because this group is cyclic and thus also Abelian. Each element forms a
class of conjugate elements by itself. The stereogram of the point group C3
is shown in Figure A.7.
Starting from the point group C , one may set up other point groups of
the first kind by complementing the n-fold rotation axis with new generating
elements, along with the products of these elements among themselves and
with the elements of the original group.
D, (n2, n22)
The group C, is first of all complemented by a horizontal rotation U2. This
may also be expressed by saying that a horizontal 2-fold rotation axis is
added, for the identity element is already contained in C,. However, in this
way no group is yet obtained. The multiplication of the powers C$-, k =
1 , 2 , .. . , n, with U2 produces n - 1 new elements, namely U2k = Ck- . U2
(here Uzl is merely another notation for U2). The set of these elements is
identical to the set of elements U2 . Ck- arising by multiplication with U2
from the left, but the numbering is different. The n elements U2k correspond
to n different 2-fold rotation axes. Each pair of adjacent axes forms an an
angle x/n. For odd n all n rotation axes U2k are symmetrically equivalent to
each other, while for even n they fall into two distinct classes of inequivalent
axes, namely U21, U23, .., U2,-1 for odd 1,and U2, U22, .., U2n-2 for even
644
a)
b)
Figure A.8: Stereograms of the point groups D2(222) (a) and D3(32) (b).
Table A.2: Multiplication table of the point group D3
One may interpret the group D , as the symmetry group of all rotations
of a straight prisms through a regular n-angle (the full symmetry group
of this body still contains reflections and is therefore of the second kind).
Figure A.8b shows the stereogram of the group D3, and Table A.2 shows
the multiplication table of D3. The latter may easily be checked using the
stereogram in Figure A.8b. The group D2 with three mutually orthogonal
2-fold axes, called also V , is shown in Figure A.8a.
645
Figure A.9: Rotation symmetry of a tetrahedron (left) and stereogram of the point
group T (23)(right).
We still have to specify the division into classes of conjugate elements for the
point group D,. Applying the first rule for class division derived above one
finds that the n 2-fold rotations form one class for odd n , and two for even
n. The second rule implies that a rotation Ck and its inverse CLk belong
to the same class. These form ( n - 1)/2 classes if n is odd, and n/2 if n
is even. In addition one has the identity element E which forms a class by
itself. Altogether, this yields ( n - 1)/2
2 classes for odd n, and n / 2
3
classes for even n. The group Dz has the 4 classes E , C2, U21 and U22, i.e.
each element forms a class by itself. The group D3 has 3 classes, namely E ,
the three 2-fold horizontal rotations (U21,U22 , U231, and the %fold vertical
rotations (C3, C:} (here and below elements of the same class are shown in
curly brackets).
T (23)
Another point group of the first kind is the group of all rotations which
transforms a tetrahedron into itself (the full symmetry group of a tetrahedron
also contains rotation-reflections , and is therefore of the second kind). The
rotation axes of a tetrahedron are the four %fold vertical axes C3 through
a corner and the center of the opposite face, and the three vertical axes C2
through the centers of opposite edges (see left-hand part of Figure A.9). We
will not name the axes differently. The international symbol of the group
reads '23', and the Schonflies symbol T . The four %fold axes yield a total
of 8 different rotations, 4 of the form C3, and 4 of the form C:. The three
2-fold axes yield a total of 3 elements of the form Cz. Along with the identity
646
element, the group T therefore consists of 12 elements. That the set of these
elements does in fact form a group is most easily verified by means of the
stereogram in Figure A.9. Each of the following sets of elements forms a
class: the 4 rotations C3, the 4 rotations Ci, the 3 rotations C2, and E.
There are, altogether, 4 classes.
0 (432)
The group of all rotations which transform a cube into itself is also a point
group of the first kind (the full symmetry group of a cube is of the second
kind). The group consists of the three 4-fold rotation axes C4 through the
centers of two opposite cube faces, the four %fold vertical axes C3 formed
by the diagonals of the cube, and the six 2-fold axes C2 through the centers
of opposite cube edges. Altogether, this comprises 24 symmetry operations.
They are illustrated in the stereogram of the group in Figure A.lO. The
international symbol of the group is 432, and the Schonflies symbol is 0.
The classes of 0 are the following: all 6 rotations C4 and C: (for the axes
C4 are two-sided), all 3 rotations Cz, all 8 rotations C3 and C i (the C3-axes
are likewise two-sided), and all 6 rotations C2 and E . One therefore has 5
classes. We abbreviate them in the form E , 6C4, 3C2, 8C3, and 6C2, the
symbol for each class being a representative element with the number of
elements in the class positioned in front.
Beside those already mentioned, there is only one other point group of
the first kind, namely the rotation symmetry group Y of a pentagondodecahedron. This refers to a body with twelve faces, each of which forms a
regular pentagon. The group Y , which is also known as the icosahedral
group, contains &fold rotation axes which cannot occur in the symmetry
groups of crystals. Therefore we will not discuss the group Y here further.
647
sn
(z,
ii), n even
The simplest point group of the second kind consists of one n-fold rotationreflection axis s, of even order n. In the Schonflies system this group is
denoted by S,. It contains rotation-reflections S," with k = 1,2, . . . , n.
Since for even n the relation S," = (Uh . C,)" = E holds, there are no other
elements. For odd n one has S," = Oh, which means that the set of the n
powers Sk, with k = 1,2,. . . ,n does not form a group. Thus, we exclude
odd n.
The group S, is evidently cyclic. Each of its elements forms it own class,
for a total of n classes. The even powers Sk describe rotations through
angles ( 2 ~ k / n )= (27rg/?). Thus the n-fold rotation-reflection axis also
automatically generates a (n/2)-fold ordinary rotation axis. This also means
that the group C,/2 is a subgroup of Sn. For even n not divisible by 4, the
group S , contains the inversion I , namely the element S,"12= (Uh.Cn)n'2 =
a h . C2 = I . The group S 2 consists only of I and the identity element E.
One also denotes S 2 as Ci. Each group S, with even n, not divisible by 4,
can be understood as the direct product C,l2 x Ci of the two groups C,/z
and Ci.
In the international notation system one employs, instead of rotationreflections s,, the rotation-inversion s S,i as generating elements. In the
case of even n, the powers Sii with k = 1, 2, . . . , n already form a
group, because then Szi = E holds. For odd n one needs the 2n powers
Ski, k = 1, 2, . . . , 2n, to get a group. In both cases the international
symbol reads 6. If n is odd, the group ii is equal to S2n For even n one
must distinguish between those n which are divisible by 4 and those which
are not. In the former case, we have ii = S,, i.e 4 = S4, 8 = SB etc. In the
latter case the group fi can be generated by a (n/2)-fold rotation axis and
a vertical mirror plane q. The group 2 consists of only this mirror plane,
which implies that it contains only the two elements E and f f h . It is denoted
by C,. Generally, the group +
can
i be written as the direct product C , / 2 x C ,
of C,p and C , if n is even and not divisible by 4.
Applying the above results to our original groups S, with even n, we
recognize that they can all be traced back to 6' groups. One has S2 =
T, S 4 = 1,s6 = 3, s8 = 8, Slo = 5, S12 = Tz etc. Some of these point groups
are illustrated by their stereograms in Figure A.ll.
The group C, gives rise to a new group if one adds the reflection O h at the
plane normal to the rotation axis. It contains besides the n elements C,"
648
@
Figure A.11: Stereograms of the point groups Sz(T) = Ci,
2 = c,.
S4
E,
a)
649
b)
Figure A.13: Stereograms of the point groups C3,(3m) (a) and C4v(4mrn) (b)
direct product C, x Ci of the groups C, and Ci. For odd n, Cnh is equal
to
as shown in the previous subsection. In any case, the group Cnh is
Abelian. Each element forms a class by itself, thus there are 2n classes. The
stereogram of the group C3h is shown in Figure A.12 as an example.
z,
650
i-_)@
0
a)
b)
+ +
Dnh
(---,
n 2 2
2nm2)
mmm
In this case the group Cnh is complemented by the n 2-fold horizontal axes U2
of D,. Multiplying the 2-fold rotations with Th, one automatically obtains
n normal mirror planes, each of which cuts the horizontal plane f f h of Cnh
along one of the n 2-fold rotation axes. This yields 4n elements altogether.
The group can be interpreted as the symmetry group of a straight prism
upon a regular wangle. The Schonflies symbol reads Dnh. For odd n the
= 5 holds. This gives rise to the international symbol z 2 m
identity
in this case. For even n , no uniform relation exists between the subgroup
n and an rotation-inversion axis. Instead, in this case each mirror plane
m
is normal to one of the 2-fold axes. This leads to the international symbol
Among the subgroups of Dnh are, in each case, the two groups D ,
and C,. Their direct product D , x C, is the full group Dnh. For even n,
D,h contains the inversion. Then the group may also be written as the
direct product Dn x Ci of the two groups D , and Ci. Figure A.14 shows the
stereograms of D2h and D3h.
The division of D,h into classes may easily be obtained from that of D,.
Each class of D , is also a class of Dnh. In addition one has the classes which
emerge from those of D , by multiplying them with f f hor, in the case of even
n , also with I . This yields n 3 classes if n is odd, and n 6 if n is even.
$26.
a)
65 1
b)
652
7
are therefore isomorphic. Knowing this, the classes of Dnd may be obtained
directly from those of D2n. According to what we found earlier, the number
of classes of D2,, amounts to 2n/2 3 = n 3, whether n is even or odd.
}, {
The corresponding n 3 classes of D d are E , the n classes {Szn,'"-S,:
sin,s::-'}>,. . . , { SF;', ST$+' 1, { ~ 2 of~powers
) of rotation-reflections
around the two-sided rotation-reflection axis Szn, the class of the n 2-fold
rotations U2, and the class of the n reflections uv.
The group Th may be obtained from the group T of all symmetry rotations
of a tetrahedron by adding the inversion I . The group is therefore the direct
product T x Ci, and contains 24 elements. Besides the inversion these are
the following: In forming the product of the three 2-fold axes C2 with I
one obtains three mirror planes uh perpendicular to these axes. Each of
the four %fold rotation axes gives rise to a 6-fold rotation-reflection axis
sf3 = 3, which means eight new group elements. In Figure A.16 we show
the stereogram of the group. Its international symbol is $3. The classes of
Th follow from those of T : each of the 4 classes of T is also a class of Th, and
in addition there are 4 further classes obtained by multiplying the former by
I . The group T h then has 8 classes altogether.
653
Si
The full symmetry group Oh of a cube may be obtained from the symmetry
group 0 of its rotations by adding the inversion I . Therefore Oh = 0 x Ci.
The 24 new elements include, beside the inversion, the reflections at the
six planes Ud normal to the six 2-fold rotation axes of 0, and the three
mirror planes Uh normal to the three 4-fold rotation axes of 0. The or&-
6 54
nary rotation axes become rotation-inversion axes, i.e. one has four %fold
rotation-inversion axes 3 in the direction of the %fold ordinary axes of 0,
and three 4-fold rotation-inversion axes in the direction of the 4-fold ordinary rotation axes of 0. The four %fold rotation-inversion axes lead to 8
new elements, and the three 4-fold ones to 6 . The 48 elements of O h are
depicted in the stereograms of the group in Figure A.18. The international
symbol of o h is $3$ (abbreviated as m3m).
The six planes Uh of O h may be interpreted as 6 planes uv which contain
the three 4-fold axes C4. If one observes that each rotation-inversion axis 4
is nothing but a 4fold rotation-reflection axis S 4 , then it becomes evident
how those elements of T d arise in o h , which are not already contained in 0
(these are the four %fold rotation axes and the even-numbered powers S t
of 5'4). From this one may conclude that, with 0, T d is also a subgroup of
o h . Since T d does not contain the inversion, one may understand O h as the
direct product T d x I of T d with I .
The classes of o h may be derived either from those of 0 or those of T d .
The 5 classes E , 6C4,3C;, 8C3 and 6C2 of 0 are complemented by the classes
I , 6 1 . C 4 , 3 I . C:, 8 1 . C3 and 6 I . (32 = Oh. If one refers t o T d , one has first of
all the 5 classes E , 6 S 4 = 6 I . C4,3C,2,8C3 and 6u,, which, by multiplication
with I , then yield the 5 missing classes I , 6 1 . S4 = 6 C 4 , 3 I . C:, 8 1 , C3 and
6 1 . uv = 6C2.
A further point group is the full symmetry group Y h of a pentagondodecahedron. It follows from the icosahedral group Y by adding the inversion.
For the description of the symmetries of crystals, Y h is of as little importance
as Y .
It may be proven that the set of the possible point groups is exhausted
by the above list.
A.3.3
Space groups
655
lattices of the same Bravais type, then it becomes evident that the lattice
translation group must be a subgroup of the space group.
Theorem 2 of subsection 2.1, wherein each rigid displacement is either a
screw rotation, a glide reflection or a product thereof, is important for the
construction of the space groups. It means that the generating elements of
these groups are screw rotations and glide reflections. Lattice translations
and orthogonal transformations belong to them as special cases. How space
groups may be set up from these elements is explained in the main text.
Introduction
9C
pW
= (P(g-lx)
(A.46)
By definition we have
(A.47)
656
(8.48)
Applying the operator
g1
657
The applications of group theory in quantum mechanics rest almost exclusively on the theory of representations. The reason for this is that in quantum
mechanical investigations, as a rule, it is not the groups by themselves which
are of interest, but the operations of their elements on wavefunctions. For
this reason the following theorem, already cited in Chapter 3 is of particular
importance.
Theorem 1
T h e subspace of eigenfunctions of a Hamiltonian having the s a m e energy
eigenvalue f o r m s a representation space of the f u l l spatial s y m m e t r y group
of the Hamiltonian
658
(cp2
(A.54)
Because 9 1 and cp2 are two arbitrary vectors of Hilbert space R , the relation
(A.51) must hold in R.
Representation matrices
It is convenient to describe the operators g of a representation as matrices
with respect to a particular orthonormalized basis set cp', p 2 , .. . cpd in the
+
gcp2,.. . gpd are likewise vecrepresentation space R d . Since the vectors, $@
tors in E d , they may be written as linear combinations of basis functions
cpl, p 2 , .. . cpd, whence
(A.55)
with Dpl(g) as expansion coefficients. The Diq(g) are elements of a quadratic
matrix D(g) with d rows and d columns. The operators 9 are thus described
by d-dimensional square matrices D(g). Since the 3 form a representation of
the group G, the same also holds for the matrices D(g) representing them.
The 'product operation' between the elements of the matrix group is ordinary
matrix multiplication. This may be seen by calculating the matrix D ( m )
of a product element 91.92.By definition, we have
(91 .92) cpZ(X)= cDZIl(g1 . g2)cpZI(x).
(A.56)
I'
(A.57)
Comparison of equations (A.56) and (A.57) yields
(A.58)
(A.59)
659
D+(g) . D(g) = 1 or
D+(g) = o-'(g).
(A.60)
Each group allows for several more or less obvious representations. One of
them is the so-called identity representation.
Identity representation
Every group element is associated with the identity operator or the identity
matrix, in this representation space. The 1-dimensional identity representation can be constructed as follows: One considers an arbitrary vector p(x) of
Hilbert space, and forms the new vector
p(g'-'x), where the summation
runs over all elements of the group G . This vector is then a 1-dimensional
representation space of G, and the representation in it is the identity repre
sentation. To prove this statement we apply an arbitrary element g of the
group to Cgl
p(g'-'x). In this, the sum reproduces itself because g'-lg-l
runs through all group elements if g varies over the entire group. Thus we
may write
cgl
(A.61)
According to this relation, all g are identity operators in the 1-dimensional
space
p(g'-lx), and the representation is indeed the 1-dimensional identity representation, as claimed. It is also now clear how one can construct
identity representations of higher dimensions. One selects linearly indepencpI(g'-'x),
dent vectors cpl(x),cp2(x) etc. in such a way that the sums Cgl
Cgl(02(g'-lx) etc. are linearly independent of each other. Then the vectors
Cglpl(g'-'x), Cgl'p2(g'-'x) etc. span a space in which all vectors transforms into themselves under the action of all elements of G , in other words
it forms a multi-dimensional identity representation.
From a known representation V ,additional representations of the same
dimension can be derived. Below, we give the most important ones.
cgl
660
g*cp*(x)= q"(g-1x).
(A.62)
D*(!?)
= [D(g)l*.
(A.63)
Conjugate representation D
(A.65)
One can easily demonstrate that the matrices b ( g ) form actually a representation if the matrices D(g)do so. In the case of a unitary representation,
the conjugate representation equals the complex conjugate representation
because
(A.66)
Equivalent representations
Transforming the representation space Rd by means of an arbitrary nonsingular operator M likewise results in new representations of the same
dimension. A representation D M is said to be equivalent to another representation V if it is associated with the group G M of operators
jM
M . j . M-1.
(A.67)
The group property of G M follows directly from that of the original operator
group G. The associated matrices DM(g)of the equivalent representation
read
DM(g)= M . D(g). M - l ,
(A.68)
where M denotes the representation matrix of the transformation operator A?. Since A? can be any non-singular operator in the representation
661
A.4.2
Irreducible representations
In the above definition of a group representation we did not exclude the possibility that the representation space R d contain several genuine subspaces
which are also representation spaces. If this is the case, the representation
space is called reducible and the representation of the group in it is called a
reducible representation. If the representation space does not contain such
genuine subspaces, which are also representation spaces, one calls it irreducible. The representation of a group in an irreducible space is called an
irreducible representation. The following important theorem can be proved:
Theorem 2
Each reducible unitary representation can be decomposed i n t o irreducible unitary representations.
K,
662
..... 1
The subspace of eigenfunctiom of a given Hamiltonian having the same energy eigenvalue is a representation space for an irreducible representation of
the f i l l spatial symmetry group of the Hamiltonian
663
We will now discuss important relations which hold for matrices of irreducible
representations.
Orthogonality relations
(A.70)
The factor 1/N with N being the number of elements of G is added for
reasons of normalization. Orthogonality in the sense of this scalar product
has the usual meaning (6 1 y ) = 0. With this definition, we have the following
theorem:
Theorem 3
The matrix elements Drk(g) and Dt",(g) of two inequivalent unitary irreducible representations are orthogonal functions over the group, i.e. they
obey the relation
(A.71)
If the two representations Dp and V" are equivalent, then the matrix elements D & ( g ) and D z k , ( g ) are orthonormal functions, such that
(DCk,
1
I DU) - 6ij~6kkf,
-d,
(A.72)
(A.73)
follows.
Important instruments for investigation of the properties of representations are their characters. These will be treated now.
664
Characters of representations
'trace'
(A.74)
has the property that it does not change under a unitary transformation of
the basis with a given matrix U . This is clear if we recall that the matrix
D in the new basis is given by U-IDU and that under the trace symbol
'Tr' the factors of a product of two matrices can be interchanged. We have,
therefore, TrIUDU-l] = T r ( U - l U D ] = T r [ D ] . This property gives rise to
the definition of the character of a representation in the following way. By
means of the relation
(A.75)
each element g is assigned a particular complex number X,(g). This number
is called the character of the element in the representation D,. The totality
of the character values X,(g) for the various elements g of a group constitute
the character X , of the representation One can thus say that the character
is a complex scalar function defined on the group. The character of the ddimensional identity representation has the constant value d over the whole
group.
Two different elements g' and g", which are conjugate to each other in
the sense of section A. 1, i.e. such that there is an element g of the group for
which gN = gg'g-' holds, have the same character X,(g') = X,(g"). This
follows, again, because of the commutativity of the operators under the trace
symbol. One can also say that the character function X,(g) has a uniform
value for each class of conjugate elements. The values change only in passing
between different classes.
In subsection A.4.1, starting from a given representation, a series of other
representations was constructed, among them the complex conjugate representation, the conjugate representation and the equivalent ones. The characters of these representations bear well defined relationships to those of the
original representation. For the character X v * ( 9 ) of the complex conjugate
representation D: of a given representation D, one obtains
(A.76)
Real representations have real characters. From the reality of the character
one cannot, however, deduce the reality of the representation, since complex
representations can also have real characters.
665
6uof
a given rep-
(A.77)
For unitary representations this relation can also be written as
Thus the character X , ( g ) of an element g must be real in such a representation if g is in the same class as the inverse element g-'.
The characters X,M (9)of all equivalent representations D,M of a given
representation D, are equal and coincides with the character of V,. Thus,
If the complex conjugate representation V * is equivalent to the representation V itself then the character of D must be real.
Relation (A.79) is, again, a consequence of the commutativity of matrices under the trace symbol. That equivalent representations do not differ
by their characters underlines the above-made statement that there are no
essential differences between such representations.
Theorems on irreducible representations
There are a number of important theorems relating to the irreducible representations of a group. Here, we will discuss the ones which are of direct
relevance for semiconductor physics and which are used in the main part of
the book.
Theorem 4
666
Ed: = N .
(A.80)
This statement is referred to as Burnside's theorem. An immediate consequence is that the dimension d, of any irreducible representation of a finite
group must also be finite. There is an even stronger statement relating to
dv :
Theorem 6
Using theorems 4, 5 and 6, and considering the fact that among the irreducible representations there is always a 1-dimensional representation (namely
the identity representation), one often can determine the dimensions of the
irreducible representations of a group without knowing the representation
matrices themselves. The group &d, for example, has 8 elements and 5
classes. There is only one way to obtain the group order 8 by adding 5
squares, namely by taking four squares of 1 (12 l2 l2 l2= 4),and one
of 2 (22 = 4). This means that the group D w has four 1-dimensional and
one 2-dimensional irreducible representations. For an Abelian group Burnside's theorem immediately implies that only 1-dimensional representations
are allowed for in such a groups each element forms its own class, so that
the number of classes equals the number N of group elements. Because of
(A.80) then all d , must then be 1.
+ + +
Theorem 7
(A.82)
667
(A.83)
where the sum is extended over all distinct irreducible representations V, of
the group. Using the orthonormality relation (A.81), one can determine the
multiplicities m,r immediately. Upon scalar multiplication of X with Xu(,
it follows that
(A.84)
This relation provides a simple procedure for determining if and how often
a particular irreducible representation V, occurs in a supposedly reducible
representation 2). The reduction of V into VU1smay be written in the
symbolic form
=
Cm,D,.
(A.85)
A.4.3
Products of representations
(A.86)
668
(A.87)
D,xV,=V,x~,.
(A.88)
(A.91)
or
X,xv(g) = &(g)
.X d g )
(A.93)
669
T h e direct product Vz x V, of two unitary representations V*, and V, contains the identity representation exactly i f and only if the representations V p
and V, are equivalent t o each other. T h e identity representation then occurs
exactly once.
The product space R u x , can always be decomposed in this case into two
representation subspaces R t x u and R;,,, where RExu is spanned by the
$(x) . (p;(x)] of the
symmetrical combinations (l/a)[Cp;(x) . $(XI)
products of basis functions, and RExu by the antisymmetric combinations
(l/&)($(x). cpi(x)- (pi(x) . &(x)]. The dimension of REx, is given by
d(d f 1)/2, and that of RExu by d(d - 1)/2. The representation [V,x V,I8
of the group G in RExu is called symmetric product and the representation
[V,x V,],in RExu is the antisymmetric product. For the matrices DL:Jc(g)
of [Vvx V,],we find from (A.89) and (A.90),
(A.95)
and for those of [V,x
V,],,
we have
(A.96)
(A.97)
(A.98)
670
D$kAkl.
(A.99)
kf
(A.100)
A.
from (A.99)
(A.lO1)
k'
(A.102)
or, considering equation (A.99),
671
This means that the matrix elements (&&l(Pr) transform according to the
direct product representation 23; x V Ax D,. Summing over all elements g ,
we obtain from (A.103) the relation
(A.104)
If the identity representation does not occur among the irreducible parts
of the direct product representation V; x DA x D,, then the sum over g
vanishes according to relation (A.73). In this case, one correspondingly has
i.e. all matrix elements ($7kl&(pfl) vanish for symmetry reasons. If the
identity representation is contained in the direct product representation 21: x
DA x D p ,
the sum over g in (A.104) does not vanish and the matrix elements
($&1&1$7r) can be non-zero. Nevertheless, they may vanish for reasons
unrelated to spatial symmetry.
If some of the elements ( & [ A k l p r ) do not vanish by reason of symmetry,
questions remain open as to which of the elements are non-zero and what
symmetry-induced relations exist between them, in other words, how many
independent non-zero matrix element exist. We now proceed to discuss this
question. To this end we start with equation (A.103) and rewrite it using
the representation matrices
(A.106)
($7hI A k I $7);
pxAxu
Dmklmkl(g)($7&
I AkIpF).
(A.107)
lmk
672
(A.109)
Because of the orthogonality relation (A.73), the g - s u m results in a non-zero
value only for i such that Vx(i)is the identity representation. We assume
that this occurs in the matrices DLlKtLiMKL ( 9 ) for i = 1 , 2 , .. . ml. For
these values of i, the DL,,,,,,,,(g)
are diagonal matrices with only one
non-vanishing element. The one non-zero element has the value 1. We denote
the position M'K'L' = M K L of this element by MiKiLi. Then equation
(A.109) may be written as
673
Theorem 10
T h e matrix elements (p$lAklp[) of the operator
vanish f o r s y m m e t r y
reasons i f and only if the direct product representation VE x V A x V, does
n o t contain the identity representation. If the identity representation i s in
VE x V Ax V,, n o t all elements (p&]Aklp[) vanish by reason of s y m m e t r y .
There are as m a n y independent elements among t h e m as the number of t i m e s
the identity representation i s contained in Vi x V A x D,.
From theorem 9, and the commutivity of the factors of a direct product representation, we conclude that the identity representation appears in
VE x V A x V v as often as the representation product V; x V, contains the
irreducible representation VA.
A.5
Although the full rotation group has no direct meaning for crystals, indirectly, however, it plays an important role in their study. Its representations
define the space of electron states in which the operators of quantum mechanics act and, consequently, in which the representations of the symmetry
groups of crystals need to be considered.
The full rotation group is an infinite continuous group. This distinguishes it from the groups considered heretofore, which were finite and discrete. Many concepts and theorems formulated for the latter can be utilized
directly for infinite groups. Among them is the group definition itself and
the concepts of conjugate elements and classes of such elements. In the
full rotation group, all rotations through the same angle, but with respect
to different axes, are conjugate to each other. The corresponding classes,
therefore, contain an infinite number of elements. The number of different
classes is also infinite because the number of the different axes is infinite.
The concept of representations may be used for the infinite rotation group
as well. The dimension of the representation matrices is, however, no longer
necessarily finite, as in case of finite groups. Matrices of any size are allowed,
among them, of course, also finite ones. The reducibility and irreducibility
of representations are defined in analogy with the corresponding definitions
for finite groups.
A number of theorems which were formulated for finite groups lose their
validity, however, or must be modified when applied to infinite groups. That
this happens is then evident, if the group order or the number of classes
enters the theorem directly. For example, one can only speculate whether
the number of inequivalent irreducible representations of the rotation group
is actually infinite, as it follows formally from theorem 4 and the fact that
the number of classes is finite, for it is not clear whether this theorem still
6 74
holds in the case of infinite groups at all. We shall see below that there
is in fact an infinite number of inequivalent irreducible representations of
the rotation group, so that the formal application of theorem 4 is justified.
Generally, one ought to use care in transferring results for finite groups to
infinite ones. Luckily, it turns out that the construction of the irreducible
representations of the rotation group (being the aim of this section) does not
require the theorems derived above.
O n e of the irreducible representations of the full rotation group we know
already, namely that in the 3-dimensional vector space, which is called the
vector representation The corresponding representation matrices follow
from the matrix A of equation (A.31), if the Eulerian angles are allowed
to take all real values. The matrix A represents rotations in a special basis, namely that of the Cartesian coordinate system before the rotation. As
we will see, there exists in each representation space, also including that of
dimension 3, exactly one (inequivalent) irreducible representation. Each 3dimensional irreducible representation may, therefore, be obtained from the
matrix A of equation (A.31) by a unitary transformation. The reason that
we will once again deal with the 3-dimensional vector representation is that
this representation is suitable for introducing a concept which is of importance for irreducible representations of any dimension, namely the concept
of generators of infinitesimal rotations.
A.5.1
(A.111)
(A. 112)
is the rotation matrix. The components Iz,Iyl
I, of I are called generators
of the infinitesimal rotation. They are given by
0 0
I, =
0 0 -i
O i
675
0 -i
0 0 0
Ip=
-i
0 0
0 0
113)
The exponential function in (A.112) is to be understood in the sense of a
power series expansion with respect to -i(I. n)p. In the following we sketch
the proof of assertions (A.lll) and (A.112).
Initially, we consider a rotation about the z-axis through the infinitesimal
angle dp. The coordinates after rotation, obtained using the matrix A of
(A.31), are given by 2 = 2,y = y - zdq,, z = z ydp,. In matrix form
these equations read
(A.114)
with
(A. 115)
D(ex,dp,) = E - iIxdvz,
and with E the 3-dimensional identity matrix. Analogous relations hold for
infinitesimal rotations with respect to the y- and z-axes. If the three infinitesimal rotations are executed one after another, then this corresponds,
apart from higher order corrections, to a rotation about an axis in the direc. With n as unit vector in the
tion of the vector d$ = e,dp,+eydpy+e,dp,
direction of d$, d p as the absolute value of d$, and I = Izex Igey Izez,
the result of the rotation can be written as
(A. 116)
with
D(n, d p ) = E - i ( I . n)dp.
(A. 117)
Now consider a rotation through the finite angle cp with respect to the same
rotation axis n. The pertinent rotation matrix is D(n,p). After this rotation we apply the infinitesimal rotation (A.117). The total rotation is
described by the matrix D(n, p dp). Because of the group property of the
representation matrices, we have
676
On the other hand, expanding D(n, cp+dcp) with respect to the infinitesimal
angle dcp, we obtain
(A.119)
Comparing with (A.117) it follows that
d
-D(n,
(A. 120)
dcp
A.5.2
Equation (A.112) relates the rotation matrix with the three generator matrices I z , Iy,Iz. Through the latter D(n, cp) is uniquely determined. If, in
turn, the rotation matrix D(n, cp) is known, the three generator matrices can
be determined from it as follows
d
I , = i--D(ez,
dcp
cp)lrP=0,
= i-D(e,,
dcp
cp)lrP=o.
(A.121)
These relations may be understood as the defining equations of the generators matrices. As such, they may also be applied to representations of
dimensions other than 3. With this in view we shall consider D(n, cp), and
I%,Iy,Iz as matrices of any dimension henceforth. With a knowledge of the
generator matrices for dimension d, one can then determine the rotation
matrices of a d-dimensional irreducible representation by means of equation (A. 112). In this way, the construction of the irreducible representations
of the rotation group is traced back to the determination of the generator
matrices. To accomplish the latter task, the commutation relations among
the matrices are of great value. In the 3-dimensional case the explicit form
(A.113) of the matrices I,, Iy,
I, may be employed to show that
[I,, Iy] = iI, ,
[Iyr
Iz] = iI, , [Iz,I,] = iIv.
(A.122)
In quantum mechanics, the same commutation relations hold for the components of the angular momentum operator. This reflects a specific relation
between the two sets of quantities, which we will discuss further below. The
definition (A.121) can be used to demonstrate that the commutation relations (A.122) apply to generator matrices of any dimension. One may thus
use them for the construction of these matrices. Before we can do this, we
must address two questions which, thus far, have been ignored the questions
677
which dimensions are allowed for the irreducible representations, and how
many inequivalent representations exist for a given dimension.
Consider an irreducible representation 2) of the rotation group GR in a
d-dimensional space Ed. Being a representation space of GR, Rd will also
simultaneously be a representation space of the subgroup G , of all rotations
about the z-axis. Due to the Abelian type of G,, its representations are
1-dimensional. Thus, D considered as a representation of G,, falls into a
series of 1-dimensional representations. Let the corresponding basis vectors
be b l , b2, . . . , bd. we will relate the rotation matrices D of the representation
D to them in the following. For a rotation through an angle cp with respect
to the z-axis. we have
= 1,2,. . . ,d,
(A. 123)
= 1,2,. . . , d.
(A.124)
+ iIy,
I- = I,
- iIy.
(A.125)
ej-1 = I-bj
ej-2 = I-ej-1
ej-3 = I-ej-2
, ...
(A.126)
, m =j,j
- 1,j- 2 , .
..,
(A.127)
where for the sake of uniformity we have set e j bj. The ej-k are mutually
orthogonal. Also they all lie in the representation space R d , so that not more
than d of them can be different from zero. For a particular value k = T , we
must therefore have
ej-T = I-.bj-T+l = 0.
(A.128)
Similarly, by using the commutation relations, one can show that the operI," satisfies the eigenvalue equation
ator I' = I," i-I:
(A.129)
678
with E as the identity matrix in Rd. To prove this relation, one first verifies
the equation
I+ej = 0
(A.130)
arguing indirectly: If the vector I+ej were to be non-zero, then it would
1. We have already assumed,
be an eigenvector of I , having eigenvalue j
however, that j is the largest eigenvalue of Iz in R d . On the basis of this
contradiction, the assumption that relation (A. 130) does not hold must be
rejected.
Using this result, one first proves equation (A.129) for m = j , by expressing Ix and I, in terms of I+ and I-. To do the proof for arbitrary values
of m , equation (A.126) and the commutativity of I," I; I,"with I- are
sufficient.
Equation (A.129) makes it possible to calculate the effect of I+ on the
basis vectors em with m # j . To this end, one replaces em in terms of
I-e,+l.
For I+I-, the relation I+I- = I 2 I, - I,"may be employed, with
the result
+ +
, m =j
- 1 , j- 2 , .
. . , j - r + l . (A.131)
Owing to above derived relations it is now clear that the matrices I+, I-, Iz
transform the space spanned by e j , e j - 1 , . . . , ej-,.+l into itself. The same
holds, of course, for the generator matrices Ix,Iy,I, and the matrices D
of the irreducible representation 2, of the rotation group defined by them.
According to the initial assumption, the dimension of this representation is
d , whence we have r = d.
There is an intimate relation between d and j , which may be seen as
follows. The diagonal matrix elements (em I 1," I; I," 1 em), may be
evaluated using equation (A.129), with the result
+ +
(em
(A.132)
d-1
(A. 133)
2 Thus j is a half integer if d is even, and an integer if d is odd.
Relations (A.126) to (A.131) define the elements of the matrices I+, I-, 1,
and, therefore, also those of the generator matrices Iz,Iy,I, with respect
to the basis e j , ej-1, ..., e - j . The disadvantage of this basis is that it is
not yet normalized, so that the generator matrices, and the corresponding
j = -
679
with
Nm =
(A.134)
= N m I-6,+1
J(j
(A.135)
+ m + l)(j- m )
(Em
I Iz 1
i m l )
(A.137)
= m6,1.
The generator matrices of a given dimension result in an irreducible representation of the rotation group of the same dimension. This is evident from
equation (A.112), which determines this representation uniquely. One can
show that this is also the only irreducible representation of this dimension.
An irreducible representation of the full rotation group is thus uniquely determined by its dimension d or, in view of equation (A.133), by the parameter
j. One usually employs j and denotes the various irreducible representations
by Dj. The corresponding representation matrices are denoted by D j . Their
elements (im I D j 1 Cm,) may be calculated by using equation (A.112), as has
already been discussed, but other, methods based directly on the differential
equation (A.120) are more practical. We will not discuss this matter here in
detail, but only give the final results. Using Eulerian angles for the rotations
a,one finds
- m ) ! ( j+ m')!
with
p = cos 0.
680
(im 1 D j l i m t ) *
(- 1)m-ml (kLm
1D j l L m t ) .
(A.140)
(A.141)
Sometimes the direct product of two irreducible representations Dj and Dji
is needed. It contains each of the representations Dl with values of 1 between
I j - j ' I and j j' exactly once. Thus the decomposition of the product
reads as
vj x
vjt =
c n.
j+jt
I= ( j - j l
(A.142)
It is also worth mentioning that the generator matrices give rise to an irreducible representation of the rotation group in yet another sense than that
discussed before: they are themselves the basis vectors of a representation.
The dimension of this representation is 3, i.e. the generator matrices transform according to the vector representation of the rotation group. If one adds
inversion to this group, the result is the pseudovector representation. The
reason for this is that the sign of the generator matrices cannot change under
an inversion because of the commutation relations (A. 121). The transformation of the generator matrices as pseudovectors is employed in the invariant
method for construction of the k.p-Hamiltonian (see section 2.7). For that
purpose, the explicit form of these matrices is needed. By means of formulae (A.136) and (A.137), one obtains, in the 2-dimensional case, the Pauli
spin-matrices multiplied by 1/2. For d = 3, it follows that
68 1
(A.143)
( 0
0 -1
( 8 , 11,
I &d)=
id3
Id3
2
$d3
-$&
id3
id3
-i
% o
0 ;
0 0
-4
0 0
(A.144)
0 - 32
Again, transformation of the pseudovector of three generator matrices reflects the close relation between these matrices and the angular momentum
operator J. This relation may be expressed as follows: The generator matrices of dimension d are the matrices which represent the angular momentum
operator in the basis of the irreducible representation of the full orthogonal
group of this dimension. The basis itself may be characterized in this context as the set of simultaneous eigenvectors of the square 5' of the angular
momentum operator J for eigenvalue 7i2j(j 1) with j = (d - 1)/2, and of
the z-component J z of J.
682
A.6
Spinor representations
(A'. 145)
The basis vectors -2 1 , - 2 - ~ which transform according to this representation,
5
Ii), i).
i,
g1 . g2 = g 1
. g2.
(A.146)
where S ( g ) denotes an operator in the 2-dimensional spin space. Since translations do not act on the spin coordinate s, for elements g = t, . a we have
S(g)
S(t,. a ) = D I ( a ) .
2
(A. 148)
For pure translation groups the only representation in spin space is the
identity representation. The group of rigid displacements can therefore be
683
s) in spin space,
(A.149)
S
A.6.2
Representation D+
The two Euler angles cp and $ both vary between 0 and 2n. The rotations a[cp 2a,8,$] and a[cp,8, $.J 2n] are identical with the rotation
a[cp,O,$]. Strangely enough this does not also hold for the pertinent matrices D g ( a [ p 2n,8, $]) and D;(a[cp, 8, $ 2n]) from relation (A.145).
Both equal -Dr(a[cp, 8, $1). As long as one considers only single rotations
2
a [ p ,8, $1 this does not cause any difficulty because one can always restrict cp
and $ to the interval between 0 and 2n, and assign + D i ( a ) to the rotation
3
a. Difficulties arise, however, if two rotations a1 and a 2 are multiplied. Then
it may happen that one of the two angles (o12 or $12 of the product a1 . a 2
takes a value larger than 2n. In this case, the representation (A.145) assigns
the matrix - D ~ ( a 1. a 2 ) to a 1 . a 2 . As it is itself an element as well, a1 . a 2
2
has, however, already been assigned the matrix + D l ( a l . 4 ,which cannot
be changed because the assignment in a representation must be unique. The
only resolution of this difficulty is to allow the multiplication rule for the
two representation matrices D l ( a l ) , D i ( a 2 ) to differ from that of the two
group elements a 1 and a 2 by a minus sign. More precisely, the sign in this
rule must be allowed to depend on the particular combination of the two
elements a1 and a2. It must be a function w(a1, a 2 ) of a 1 and a 2 , with two
possible function values +1 and - 1. The corresponding multiplication rule
reads
684
D.(I) = +1
or
D l ( 1 ) = -1
I
(A.152)
A.6.3
In the spinor Hilbert space R, the two spinor components p(s) are functions p(x,s) of the position coordinate X. It has already been demonstrated
in subsection 1 that the representations of the displacement group G in R
are given by direct products of the representation V L of G and a particular
2
representation of G in the Hilbert space R of scalar functions. While Vi
2
signifies a spinor representation, the representations of G in the ordinary
Hilbert space R mean representations in the ordinary sense, often referred
to as vector representations. The direct product of a spinor representation
with a vector representation is again a spinor representation for the corresponding multiplication rule is of the general form (A.151). From this we
may conclude that any representation of G in the spinor Hilbert space R
will not be an ordinary, but a spinor representation. On the other hand, the
direct product of two spinor representations yields a vector representation
because the multiplication rules for the product representation are of the
general form (A.59).
Of particular interest among the spinor representations of G are, of
course, the irreducible ones. We now set out to show how to determine
them explicitly.
A.6.4
685
686
It may easily be demonstrated that the assignment (A.153) forms an ordinary or vector representation of G'. It also holds that each irreducible
representation space R, of the double group G' simultaneously forms a representation space of the group G because the elements of G cannot map out
of R, if none of the elements of G' does so. Moreover, as a representation
space of G, R, cannot be reducible. Indeed, if R, were reducible, then there
would be at least two irreducible subspaces of R,, one denoted by R,l, and
another denoted by R,2 which forms the complement of R,l in R,. The
two subspaces Rvl and Rv2 would be transformed only into themselves by
all elements g of G. However, R,l and R,2 would also be mapped into
themselves by all elements Q . g , because the application of Q results solely
in multiplication by -1. Thus R, would also be reducible as a representation space of GI, which contradicts the initial assumption. Each irreducible
vector representation of G', therefore, is also an irreducible representation
of G.
The latter may be either a vector representation or a spinor representation, both cases must be taken into account. Since one can also prove the
converse, i.e. that each of the irreducible vector or spinor representations
of G gives rise to a vector representation of the double group GI, it follows
that the set of irreducible vector and spinor representations of the simple
displacement group G is identical to the set of irreducible vector representations of the associated double group GI. Here we are only interested in
the spinor representations of the original displacement group G. These can
easily be selected from the whole set of irreducible vector representations
of G' - they are all those which are not also vector representations of G.
These irreducible vector representations of the double group are called eztra
representations. Using this term one may state the following theorem:
Theorem 11
The irreducible spinor representations of a displacement group G (point or
space group) are the extra representations of its double group GI.
A. 7. Projective representations
687
order n is twesided (see section A.2), the elements Ck and CE-k = Q . Cqk
belong to the same class, and so do the elements Q . Ck and CGk, however,
the two classes are different. If an n-fold rotation axis is not two-sided, then
C;, C i k ,Q . Ck and Q . CLk form four different classes. For n = 2 the two
elements C2 and C,' of the simple group G are identical, but in the double
group G t they differ for it holds C; = Q which results in C2 = Q . C,' and
C',
= Q . C2. The two elements C2 and C,'
3 Q . C2 form one class if
and only if the 2-fold axis is two-sided. An analogous statement holds for
reflections uh. In this case uh and Q . uh belong to one class if the 2-fold
rotation axis C2, associated with ah by the relation uh = I ' C2, is two-sided.
Similarly, u,, and Q . cr, are in one class if the rotation axis U2, associated
with u, by the relation u,,= I . U2, is two-sided.
We illustrate the above results using the tetrahedral group Td as an example. In section A.3 we found 5 classes of the simple group T d : E , 8C3,6S4,3C;
and 60,. From them the following classes of the double group T d are obtained
by means of the rules mentioned above: ( E } , {Q}, {4C3,4Q . Cj}, (4Q .
C3,4C$}, {3C4,3Q.C,3},{3Q.C4,3c,3},
{3c,2,3Q.C,2}, { 6 u v 7 6 Q . ~ v }The
.
48 elements of the double group Ti belong therefore to 8 different classes.
Since the number of classes equals the number of different irreducible representations, and since the simple group T d has 5 classes, there must be 3
irreducible spinor representations of the group T d . The sum of squares of
the dimension of these representations must be 24. There is only one way to
obtain the number 24 by summing three squares, namely 2'
2' 4' = 24.
Thus two of the three spinor representations of Td have the dimension 2, and
one has the dimension 4.
The spinor representations of point and space groups were characterized
above as projective representations with a particular factor system. Later,
we will see that projective representations will also occur in another context,
namely the ordinary vector representations of space groups. These representations may be traced back to the projective representations of point groups
of equivalent directions with a particular factor system determined by the
crystal structure. This demonstrates that projective representations play
an important role in the applications of group theory in solid state physics.
In the following subsection, we discuss properties of these representations
which will be of use later.
+ +
Factor systems
Representations of groups are unique assignments of their elements to elements of groups of linear operators in a particular space. In regard to the
688
representations defined in section A.4, the multiplication rules of the operator group could be derived from that of the original group because the
assignment was given a priori. According to these rules, the product of two
operators assigned to two group elements was equal to the operator assigned
to the product of the two elements. In the projective representations to be
considered below one takes the opposite approach: Not that the operators
ij assigned to the group element g are specified a priori, but rather the multiplication rules of the assigned operator group G are defined at the outset.
The assigned operators are later derived from these rules. The multiplication rules of the operator group are defined as follows. Let gl and 32 be
the two operators assigned, respectively, to the two group elements g1,gz.
Then complex numbers w(g1,gz) depending on g1,g2 are introduced and the
product 31 . 92 is defined as
91.92= w(g1,92) 31
92
(A.154)
The set of factors w(g1, 92) is called the factor system of the representation.
The assignment of G to the operator group G defined by the multiplication
rules (A.154) is referred to as a projective representation or representation
with factor system.
The factors w(gI,g2) are restricted by the group properties of the operator group G. The associative property results in the condition
(A.15 7)
A. 7. Projective representations
689
(A.158)
The entirety of all p-equivalent factor systems is referred to as a class of
factor systems. If the factor system has the particular form
(A.159)
then it is p-equivalent to the identity system w t ( g 1 , g a ) = 1, and the corresponding class is called identity class. In this case the projective representations are p-equivalent to representations in the ordinary sense. The
number of different classes of factor systems of a finite group is also finite.
The numbers and particular forms of these classes are characteristic for each
group. The identity class occurs for all groups. For cyclic groups, it is the
only one possible. Generally, the number of different factor system classes
is a power of 2 , therefore it is either 2O, or 2 l , or 22 etc. Most of the 32
point groups of crystal structures have several classes of factor systems, for
example, D4h has 8, Td has 2, and O h has 4. If there are only 2 factor system classes, then the second class beside the unit class necessarily includes
the factor system of spinor representations because the spinor representations also always exist. In special cases (in particular, for the point groups
c1,c 2 , c 3 , c4,c6,c,,s4, C3h, D3, c3,,),
the spinor representations coincide
with the vector representations. Then only the identity class of factor systems exists and the factor system of the spinor representations is p-equivalent
to the identity system.
A.7.2
Unitary equivalence
Let us consider two projective representations 2, and V M with the same
factor system, and let R, and R,M be the corresponding representation
spaces. The representations two are said to be equivalent to each other if, as
in the case of ordinary representations, a unitary operator M exists which
transforms the two representation spaces R, and R,M into each other. To
690
avoid confusion with the term p-equivalence, one speaks of unitary equivalence rather than just of equivalence, as we did in the case of ordinary
representations.
Reducibility a n d irreducibility
The concepts of reducibility and irreducibility can be defined for projective
representations as well. This can be done in the same way as for ordinary
representations, and we need not repeat the definition here. It also holds
that any unitary projective representation with a particular factor system
may be decomposed into irreducible projective representations having the
same factor system as the original projective representation.
Characters
The operators g representing a group element g in a particular projective
representation may be written as matrices D ( g ) with respect to a particular
basis of the representation space R,. As for ordinary representations, the
character X ( g ) of an element g is defined as the sum of the diagonal elements
of its representation matrix D ( g ) . Also, the characters of unitarily equivalent
projective representations are identical. However, unlike the case of ordinary
representations, for projective representations the character of an element is
no longer only a function of its class. As a consequence of this, the number
of unitarily inequivalent irreducible projective representations of a group
G no longer equals the number of classes of conjugate elements G, but is
smaller than the latter one, unless the factor system class of the projective
representation is that of the identity system. In the case of the irreducible
spinor representations of the point group T d considered above, we already
noted that there are 5 classes of conjugate elements in T d , but only three
spinor representations.
If one knows the character X ( g ) of a projective representation, then the
character X(g) of a p-equivalent representation defined by equation (A.157)
is given by the relation
X(g) = 4 7 ) x ( g ) .
(A ,160)
If the matrices D ( 9 ) form an &dimensional irreducible projective representation with a particular factor system, then it follows from (A.160) that the
matrices D(g) = u ( g ) D ( g ) form an irreducible projective representation of
the same dimension d with the pequivalent factor system given by equation
(A. 15 7).
Regarding the dimensions of irreducible projective representations of
groups G possessing more factor system classes than just the unit class, one
A. 7. Projective representations
69 1
may show that the representations for factor system classes different from
the identity class must be 2- or higher dimensional. That this rule works
may be seen from the irreducible representations of the group T d considered
in the section on spinor representations. This group has two classes of factor
systems and, therefore, one set of non-ordinary representations. These are
the spinor representations of T d , and none of them is 1-dimensional.
Orthogonality
The orthogonality relations (A.71), (A.72) and (A.81) for the matrix elements and characters of ordinary irreducible representations also remain
valid for projective irreducible representations with the same factor system.
Matrix elements and characters of irreducible representations with different factor systems are always orthogonal to each other. Owing to these
properties of the characters, the same procedure for the decomposition of a
projective representation into irreducible parts can be applied as that used
for ordinary representations, namely, in accordance with equation (A.83),
the projection of the character of the representation onto the characters of
the irreducible representations. Being a consequence of the orthogonality
relations, Burnsides theorem holds also for projective representations:
Burnside theorem
The sum of the squares of the dimensions of all unitarily inequivalent irreducible projective representations of a group having the same factor system
equals the order of the group.
Direct product
The direct product of two projective representations is defined as it was for
ordinary representations, i.e. as the representation induced in the product of
the representation spaces. The factor system of the product representation
equals the product of the factor systems of the two projective representations
692
which are multiplied. A particular consequence of this rule is that the factor
system of the product of two spinor representations for which the factors
are either 1 or -1, equals the identity system. This confirms our former
conclusion that the product of two spinor representations forms an ordinary
or vector representation.
The consequences of spatial symmetry for the matrix elements of operators as expressed in theorem 10 are also valid if the wavefunctions transform
according to projective and, in particular, to spinor representations.
Finally, we sketch how irreducible projective representations may be const ruct ed.
A.7.3
A.8
The state of an electron generally depends on time t , i.e. its components cp(x)
in ordinary Hilbert space or cp(x,s ) in spinor Hilbert space are, respectively,
functions cp(x,t ) or cp(x,s,t ) of t. As well as transformations in coordinate
and spin spaces, one can also consider transformations on the time axis.
Examples are translations of time by a constant to, i.e. t becomes t to, and
693
time inversion i.e. the transition from t to -t. The latter transformation is
commonly referred to as t i m e reversal, and is denoted by the symbol K .
A.8.1
(A.161)
If there is a magnetic field, one also has to reverse its direction in order that
lp* satisfy the Schrodinger equation.
In order that the Pauli equation remain valid under time reversal, the
operation K must be represented by an operator K in spinor Hilbert space
defined by
being the y-component of Paulis spin matrices. One can easily show that
for spinors the relation (klp I lp) = 0 holds, i.e. Klp is orthogonal to lp. If
lp is an eigenstate of the Pauli Hamiltonian for a given energy, and if there
is no magnetic field, then Klp forms a second linearly independent eigenstate for this energy. That means that, because of time reversal symmetry,
all eigenvalues of the Pauli equation are at least doubly degenerate in the
absence of a magnetic field.
Without spin it is evident that K 2 = E , and K 2 = -E with spin.
This implies that without spin the unit element E and the time reversal
operation K form a group, while with spin the elements E , K , K 2 and K 3
do so. We denote both groups by 2 . Since K commutes with all elements
of the group G, the direct product G x 2 of G and Z is itself also a group.
We call it the space-time g r o u p and denote it by 6 . We are interested in the
694
A.8.2
We start by solving the first problem. To this end we consider an irreducible representation 2) of G in the Hilbert space of scalar functions p(x)
or of spinors p(x, s) in spinor Hilbert space - both possibilities are admitted below. Let 2) be a vector representation or a spinor representation,
as needed. We denote the representation space in both cases by R D , and
a basis therein by 91,p 2 , . . . , v d . The vectors Kpl,Kpa,. . . , K p d span a
space R p which is likewise an irreducible representation space of G for K
commutes with all elements of G. The representation of G in RD* is the
complex conjugate representation D* of D. This follows from the relation
(kpi I g I K P k ) = (cpi I g 1 p k ) * for the matrices of the two representations,
which holds both with and without spin. If the two spaces R D and RD* are
joined, a representation space RD of G emerges which is generally reducible.
The dimension of any irreducible representation subspace in RD cannot be
smaller than d , or else D and D* are reducible. The dimension 6 of RD can
be at most 2d. The possibility that 6 lies between d and 2d has to be ex-
695
696
Herring
case
Relationship
of R D and Rb
RD=
Rb
Relationship of V
and D*
Degeneracy
l? equivalent to V *
Unchanged
Xb = X D
RD# R b
2) inequivalent to
Xbf
RD#
Rb
V*
Doubled
XD
2) equivalent to
I?*
Doubled
Xb = X D
1
-
c
gEG
X(g2)=
K~ in case
a)
0 in case
b)
- K ~ in case
c),
(A.163)
where K 2 = 1 for vector representations, and K 2 = -1 for spinor representations. We note that cases a) and c) are interchanged if one switches
over from an ordinary to a spinor representation. The value +lon the
right hand side of equation (A.163) means no additional degeneracy for vector representations, but an additional degeneracy for spinor representations,
while the value -1means additional degeneracy for vector representations,
but no additional degeneracy for spinor representations. The definitions of
cases a) and c) were originally made using the values +land -1 rather
than K 2 and - K 2 as we do here (for good reasons). One has to keep this
in mind to avoid confusion.
A.8.3
697
Time reversal symmetry has particularly simple implications for matrix elements of operators A(t) in Hilbert space which are either even or odd
functions of time, such that
KA(t)K-=
nA(t)
with
n=
1 or n = -1.
(A.164)
cpr)
of tensor
As in section A.4, we consider matrix elements (p& 1 A k ( t ) 1
components A k ( t ) of operators A(t) between the basis vectors {py, cpz, . . .,
~p:~} and {cpy, p$, . . . ,
of particular irreducible representations V, and
Dp of the space group G. For the matrix elements of &(t) between the timereversed basis vectors {Kpy, Kp!j, . . . ,Kcpzp} and { K c p y , Kpt, . . ., K p z U }
it follows
(A.165)
This relation implies that the matrix elements with respect to timereversed
basis vectors are completely determined by the original elements involving
the non time-reversed basis vectors. If either or both of the two representations D, and V p belongs to case a) defined above , then { K c p y , Kp;, . . .,
KpzY} are linearly dependent on {cpy, pz, p$} or {Kpy, Kcp!,. . . ,K p S p }
are linearly dependent on py, pg, . . . , p k or there is such a linear dependence for both. Therefore equation (A.165) leads to relations between the
elements
1 Ak(t) I
themselves and the number of independent matrix
elements (qkI A k ( t ) 1 pr) is reduced by time reversal symmetry. Below, we
will outline a procedure which allows one to calculate the reduced number
o i independent elements in the special case p = v.
If neither of the representations V v and Dp belongs to case a), then
equation (A.165) does not impose an additional condition on the matrix
elements (cp& I A k ( t ) I pr), rather it connects these elements with matrix
elements between basis vectors of other representation spaces.
Consider now the mixed matrix elements (Kcp2 I A k ( t ) I pr) and (cpk I
A k ( t ) I I?&)
between one original basis vector and a time-reversed one. For
these, one can easily derive the relations
(cpc
cpr)
(A.167)
I KPK) = .K2(cpF I Am I &4J .
(& I
These equations mean that the number of independent matrix elements
( g p & ( A k ( t ) 1 pr)?as well as the number of independent matrix elements
(pg 1 A&) 1 Kply))is reduced because of time reversal symmetry. The
factors n and K 2 in (A.166) and (A.167) can take, independently of each
698
~ be either +1 or -1.
other, the values +1 or -1, so that the product K K will
The matrices (I?&
I Ak(t) I
and (pg I A k ( t ) I Kpr) are thus either
symmetric (nX2 = +1) or antisymmetric ( & K 2= -1) with respect to the
exchange of the pairs of indices p m and vl. How the number of independent
matrix elements follows from this observation in the general case is shown
in the book by Bir and Pikus (1974).
In the special case p = v , one can apply the procedure developed in
section A.4.3 for the determination of independent matrix elements without
taking account of time reversal symmetry, provided one makes a distinction
between symmetric products [D,x D,],and antisymmetric products [D,x
D,Ia, as defined in section A.4.3. The matrix elements (Rp; 1 A k ( t ) I p?)
transform themselves according to the representation [a,x v,]~
x Dd, if
nK2 = +1, and according to the representation [D,x V,Ia x V A ,if K K =~
-1. Following section A.4.3 the matrix (zpk I A ( t ) I
then has as many
independent elements as the number of times the identity representation
occurs in the product representation [V,x D,],x V A for K K =~ +l, or
in the product representation [D,x V,la x D A for r;K2 = -1. Analogous
statements hold for the matrix (p; I A&) I kp?), where V, is to be
replaced by VE.
pr)
(or)
A.9
A.9.1
Each irreducible representation of a space group G is also necessarily a representation of the translation group T contained in it. As such, it is in
general reducible. Since T is an Abelian group, the irreducible representations of T are 1-dimensional. This means that the representation space
of an irreducible representation of a space group G can be set up from 1dimensional irreducible representation spaces of the translation subgroup T
of G. We mark these spaces by a vector k whose meaning will immediately
become clear. The spaces themselves, which are 1-dimensional as we know,
699
-ik.R
(A. 168)
k = kibi
+ k2b2 + k3b3 ,
(A.169)
with bl, b2, b 3 being primitive lattice vectors of the reciprocal lattice and
k l , k2, k3 being real numbers. The periodicity condition with respect to
the periodicity region causes the components of k to be of the form k l =
(2n/G)ll,k2 = (27r/G)l2, k3 = (2n/G)13with 11,12,/3 as integers. If K is an
arbitrary vector of the reciprocal lattice, then the irreducible representation
v k of the translation group T belonging to k is identical with the irreducible
representation V k f K belonging to k+K, since e x p ( 4 K . R ) = 1. One therefore gets all different irreducible representation of the translation group if k
varies within a primitive unit cell of the reciprocal lattice. Altogether, there
are G3 such vectors and thus also G3 different irreducible representations of
the translation group. This number also follows from the Burnside theorem,
- since the irreducible representations are 1-dimensional, there are as many
inequivalent irreducible representations as there are group elements, i.e. G3.
By definition, the effect of the translation operator t R on the basis function
vk(x) of the irreducible representations is given by
(A.170)
The basis functions (pk(x) may be thought of as Bloch functions, i.e. as
plane waves modulated by a lattice periodic factor u ~ ( x ) ,
(A.171)
Bloch functions (pk(x)of the same wavevector k, but different Bloch factors
give rise to the same representation of the translation group. As long
as one considers only the translation group, no relations exist between the
Bloch factors of the basis functions p k for different values of k - any Bloch
function can be used for a given k. Below, starting from an arbitrarily chosen
(but later h e d ) Bloch function ( p k ( x ) of wavevector k,we will construct basis
functions for an irreducible representation of the space group G. It turns
out that basis functions cpk,(x) will emerge in this process which transform
themselves according to irreducible representations of the translation group
with wavevectors k' # k. The Bloch factors up(x) of the basis functions
Uk(x),
700
~pk~(x)
of these representations cannot, however, be chosen arbitrarily, but
are determined by the Bloch factor U k of ' p k . The determination of the
wavevectors k' and the pertinent Bloch factors Uk' is the essential task which
has to be solved in constructing the irreducible representations of a space
group G from those of its translation subgroup T . The primitive unit cell in
which the wavevector k can vary will be chosen below to be the Wigner-Seitz
cell of the reciprocal lattice, i.e. the first B Z . The reason for this is that the
latter cell, generally, is the only primitive unit cell of the reciprocal lattice
which exhibits the full symmetry of the point group of equivalent directions.
A.9.2
Star of wavevectors
rPk(X)
= Pk(g-lx) = ,-iak.R
(A.172)
cPlc(a-l[x - 5(a)1).
Here, a k denotes the vector to which k transforms if the orthogonal transformation a which originally acted on the lattice vector R, is transferred to
the vectors k of reciprocal space, as was explained in Chapter 2 of the main
text. Rewriting equation (A.172) in the form
it becomes clear that [tT(a) . a] 'pk(x)is a function which transforms according to an irreducible representation of the translation group with wavevector
ak. The set of vectors ak generated as a runs through all elements of the
point group of equivalent directions P , is called the star {k} of k.
Henceforth, we must distinguish between vectors k with and without
symmetry. In the case of non-symmetric k, we have ak # k K for all p
elements a of the point group P , where K is a vector of the reciprocal lattice.
For symmetric k we have a k = k+K for at least one element (Y different from
the identity element of P . Under the action of such an element, the function
[tT(a). a ] ( p k ( x ) transforms according to the same irreducible representation
of the translation group as does the function 'pk(x).
We consider, initially, non-symmetric k. The star {k} has p different
points crlk 3 k = k1,agk = k2,. . . , kp = q,k, where a j , j = 1 , 2 , . . . , p ,
denotes the elements of the point group P of equivalent directions. With
701
(Pg.
We define the
(A.174)
For the Bloch factor
Uki(x)
of
' p k j ( x ) , one
then has
One can easily show that the space R { k } spanned by the p functions ' p k j ( x )
of equation (A.174), forms a representation space for the space group G. To
prove this, we write the elements t R . t T ( a j t ). a y of the space group G in the
form
z= kT(aj,)
. aj'l
'
tajrlR.
kT(oIj) '
a j l -1
(A.176)
where j', at fixed j , takes all values between 1 and p . That (A.176) is indeed
a possible way of writing of the space group elements g can be seen as follows:
first one commutes the translation t R in t R . t T ( a . , ). ajt with [tT(a.,). ajt],
simultaneously replacing t R by tujt-1 R in accordance with equation (A.17).
Multiplying [tT(ajt). aj,]. t ai-1l R with the particular space group element
[t,(,,, . aj]-', one obtains again an element of the space group. If j' runs
through all values between 1 and p , and R through all lattice points, then all
elements will be different, in accordance with the general group properties
derived in section 1. The set of these elements coincides, therefore, with the
space group G. This proves that the space group elements can indeed be
written in the form (A.176).
The application of g in the form (A.176) to one of the basis functions
f , D k j ( x ) of R { k } of (A.174) yields, in view of (A.173), the expression
(A. 177)
The function f,Dk,,(x) and therefore also g'pk2 ( x ) lie in R { k } . This means
that R { k } forms a representation space. R { k } is also irreducible, we omit
formal proof here. For non-symmetric k, the irreducible representations of
the space group can therefore be characterized solely by the k-vector itself
or, more strictly, by the star {k} of this vector. Thus, in this context we
write Z ) { k ] for the irreducible representations. The character x { k ) ( g ) of an
element g in the representation Z ) { k } reads
P
x { k } (9) =
e- ikj-Rj=1
(A. 178)
702
A.9.3
gpkn(X)
(A.179)
3 (Pln(x)= ,-ik.[?(u)+R]
C~mn(Q)Vh(X)
m
(A.180)
where Dmn(a) denote the expansion coefficients. This relation means that
the assumed representation of the small space group assigns to each element
Q of Pk a specific matrix Dmn(a). Below, we will demonstrate that the
matrices Dmn(a) form a group with multiplication rules of the general form
(A.154), and thus, a projective representation of the small point group 9.
To this end we consider two elements g 1 = [tR1 . t7(,l) . 311 and 9 2 = [tRz
t T ( a 2 ) az] of Gk. For brevity we set T ( Q ~ =
) 7-1, and 7 - ( ~ 2=
) 7-2. Equation
(A.180) associates the elements g1 and 9 2 , respectively, with the matrices
Dmn(Qi1)and Dmn(~12).
For the effect of the product element [tR1. t , . a l l .
[ t R z . t, . a21 on pkn, one obtains from (A.179) the relation
(A.181)
mmf
703
(A.182)
- ,-wRl+fi+al(R2+ii)I
CDm,n(al
.
a2)cPkm,(y+
(A.183)
m'
Comparing equations (A.181) and (A.183), one finds that the matrices D(a1),
D ( a 2 ) and D(a1 . a 2 ) are related by the equation
where
Hence, the assertion is proven. The matrices D ( a ) do indeed form a projective representation of the small point group Pk of k. The factor system
is given by equation (A.185). The projective representation of the small
point group Pk arises from the representation of the small space group Gk
assumed at the outset. If the assumed representation Z)k of Gk is irreducible,
the same is true of the pertinent projective representation of Pk. Vice versa,
an irreducible projective representation of PI, gives rise to an irreducible
representation Z)k of Gk.
The factors w ( q , a2) of (A.185) will be referred to as the crystallographic
factor system. We will rewrite these factors in a more convenient form.
The scalar product k . al(R2 ?2) in the exponent may be expressed as
-1
a1 k.(R2+7'2). The exponent in (A.185) then becomes i(k-a11k).(R2+?2).
Here, a1 is an element of the small point group of k, i.e. q k may differ
from k by a reciprocal lattice vector K only. Because exp(-zK . Rz) = 1,
we may omit R2 from the exponent, whence
(A. 186)
For symmorphic space groups, no fractional translations 7' occur, i.e. the
vectors ?(a) are zero for all elements a of the small point group of a symmetric k-vector. Thus, the crystallographic factor system equals the identity
system for all symmetric k-vectors, and the projective representations of the
small point groups become representations in the ordinary sense. For nonsymmorphic space groups, a factor system different from the identity system
may occur, but only for vectors k on the surface of the first B Z . In fact,
if k is an internal point of the first B Z , then ak also lies in the interior of
704
the first B Z and cannot differ from k by a non-zero reciprocal lattice vector
K. For surface points, however, it is possible that ak deviates from k by a
vector K # 0. If that is the case, then factors differing from 1 may occur
in the crystallographic factor system (A.185). The basis ( p k l , ( p k 2 , . . . , (pkd of
the irreducible representation v k of the small space group G k then gives rise
to a projective irreducible representation of the small point group P k of k.
A.9.4
For any irreducible representation of the full space group G one needs a basis
set which contains basis functions for every point of the star of k. These
will be defined now. We denote the number of star points by s ( k ) . With
p ( k ) as the order of the small point group of k, we have s ( k ) = p / p ( k ) .
From the full point group P of equivalent directions we select s ( k ) elements
a1 = E , 122, . . . , a S ( k ) each generating a different star point k l , k 2 , ..., k s ( k ) .
Thus, we can write alk
k = k l , a 2 k = k 2 , . . . , a , ( k ) k = k , ( k ) . The
choice of elements a j , j = 1,2, . . . , s ( k ) is not unique because, together with
a j , each element generated by multiplication with an element ai of the
small point group P k j of k j forms another element which transforms kl into
k j . The choice of these elements made at the outset will be maintained
throughout.
The basis functions of an irreducible representation of the small space
group Gkj will be denoted by ( p k j 1 , ( p k j 2 , .. . ,( p k j d . They can be generated
from the basis functions p k n ( x ) , n = 1,2, . . . ,d, of the irreducible repre
sentation D k of the small space group G k in the same way in which, for
non-symmetric k , the basis functions C p k j ( x ) were obtained from the ( p k ( X )
using relation (A. 174) above. Accordingly, we set
705
allowed values, i.e. j = 1 , 2 , . . . , s(k), and i = 1 , 2 , . . . ,p(k). If, in the products a j . ali . a;' the left factor aj is replaced by a y , and j' and i are
varied within their respective definition regions at a fixed value of j , then
one likewise obtains all elements of P . Each element a of P may, therefore,
be written in the form a = ajl . a l i . c y j ', where j can be arbitrarily chosen
and j ' and i are determined by a and j uniquely. An analogous statement
holds for the elements of the space group. They allow for a representation
of the form
(A.188)
with a fixed value of j which can be arbitrarily chosen and values of j' and i
depending on g . The lattice translation ta;iR in the second angular bracket
of expression (A.188) guarantees that the lattice translation described by the
This means that the resulting functions are again in the space R k d spanned
by the basis functions (pk,n(x),n = 1 , 2 , .. . , d , j = 1 , 2 , .. . , s(k). Moreover,
the general definition (A.46) of the operators 9 ensures that the group multiplication relations 91.92 = g1 . 92 for any two elements g 1 , g 2 of the space
group G hold also in RM. This means that the space R k d forms a representation space. It is irreducible, if the partial basis set for the particular
star points k used at the outset, gives rise to an irreducible projective representation of the small space groups Gk, as we have supposed. Thus, our
assertion above is proved.
The question of whether one obtains all irreducible representations of a
space group for a symmetrical wavevector if one proceeds in the way described above, is answered in the affirmative: By taking all stars {k} of the
first B Z and all irreducible projective representations Dkv of the small point
groups of the respective k-points with the crystallographic factor system, all
irreducible representations D{k}vof the space group follow.
For the character X { k } y ( g ) of an element g = tR++) . a in the space group
representation D{klv, equation (A.189) yields
706
Here ali is that element of the small point group of k = kl associated with
the element a through the relation ali = a;' . a . a j . Furthermore, j is
defined by the equation ak = kj, and the operators aj are the special stargenerating elements introduced above for the point group P of equivalent
directions, and s(k) is the number of different points of the star {k}. With
(A.191)
as the character of the projective representation of the small point group of
k one obtains from (A.190)
A.9.5
707
the projective representations of the small point groups for crystals with
diamond and zincblende structure both with and without spin.
A.9.6
The implications of time reversal symmetry for the irreducible representations of space groups have already been explored in general terms in section
A.8. The procedure discussed there, i.e. the Herring criterion in conjunction
with the Frobenius-Schur theorem is, however, very cumbersome - one must
sum over all elements of the space group including the G3 translations in
order to utilize it. Below, we will apply the concrete form (A.189) of the
representations in order to express the general criterion of section A.8 in a
more transparent way to facilitate its use.
S t a r s {k} and {-k}
We consider an irreducible representation v { k } u of the space group G in a
particular space R{k},. From section A.8 we know that the time-reversed
space KR{k}u gives rise to the complex conjugate irreducible representation
VTklu. Owing to the construction of the space group representations, V*
CkIv
belongs to the star {-k} of the wavevector -k. There are two possible cases,
case 1 with {k} = {-k} and case 2 with {k} # {-k}.
In case 2 the two irreducible representation spaces R{k}, and KR{k}u
are necessarily linearly independent of each other, i.e. case b) of the general
Herring criterion of section A.8 applies, and an additional degeneracy exists
because of time reversal symmetry. The two representation spaces R{k}y and
KRtklu are joined to the same eigenspace of the Hamiltonian by the time
reversal operation.
In case 1, one cannot immediately tell whether the two representation
spaces R{k},, and KR{h}, are linearly independent or not. In order to determine this, the Frobenius-Schur theorem must be applied.
Theorem of Frobenius and Schur for space groups
(A.193)
We apply this result to expression (A.192) for the characters X { q u ( g 2 ) of
representations D{k}p It follows that
708
(A.195)
The R-sum extends over all lattice points of the periodicity region and the
a-sum over all elements of the point group P of equivalent directions. The
R-sum differs from zero only for those a and j for which
a-lk, = -k,
+ K,
(A.196)
holds with K, being a reciprocal lattice vector. The condition (A.196) may
be interpreted in a simple way. For iixed 3 and varying a , the vector a-lk,
runs through all points of the star of k, and, since k, belongs to the same
star as kl k, through all points of the star of k. The condition (A.196) can,
therefore, be fulfilled if among the various points k of the star {k} there are
such which are equivalent to -k. This means that -k belongs to the star of
k, or that the stars of k and -k are identical. This is case 1)defined above.
If the condition (A.196) cannot be fulfilled, then there is no point in the star
of k equivalent to -k, i.e. the stars of k and -k are different. The latter
is case 2) from above. In this case the two representation spaces R { k } , and
K R { k } yare linearly independent, and are joined by time reversal symmetry
to an eigenspace of the Hamiltonian with the same energy eigenvalue. The
degree of degeneracy is doubled, and the representation in KR{k}, forms
the complex conjugate D;,},of the representation D{+ in R{+. While
the star of D:,+ is {-k}, the projective representation of the small point
group of -k belonging to D:,}, is Z)kv* = D;,. If D,& is real, then the
two energy eigenvalues at k and -k, which are degenerate because of time
reversal symmetry, belong to the same projective representation of the small
point group of k, and if z)ky is complex, these eigenvalues belong to different
projective representations complex conjugate to each other.
In case 1) the evaluation of the R-sum yields the value G3 for elements
a which obey condition (A.196). For all other a this sum is zero. One thus
may restrict the a-sum for a particular value of J to those elements a which
obey relation (A.196). We denote the set of these elements a by "5. With
this, equation (A.195) becomes
709
Since the ali run through the set Mkl if the a vary over a set Mki, the expression under the j- sum in (A.197) is independent of j . Moreover x k v ( a f i )
may be replaced by x k , ( ( r 2 ) as has already been mentioned above. One
therefore gets
(r
(A.199)
cx(I,>v(s2)
= 0.
(A.200)
Using the last two relations, the left hand side of the Frobenius-Schur theorem (A.163) for representations of space groups becomes
Ip(k)]-' CaEMk
e-iK'7'(a)Xkp(Lu2) {k}
{-k}
- cx{I,}u(g2)
=
N L l
{k} # {-k)
(A.201)
1 0
Here, we have used the relation N = G3 . s(k) .p(k) for the total number
N of space group elements. For {k} # {-k} one obviously has the Herring
case b ) , as anticipated at the outset. For {k} = {-k}, all three Herring
cases are possible, depending on which of the relations
710
Table A.4: Characterization of the three Herring cases for space groups.
Herring
Characteristics
case
a)
{k} = {-k}
Ip(k)]-l
{k} f {-k}
no further condition
{k} = {-k}
[p(k)]-l
e-iK'P(at X k u ( a 2 ) = K 2
xaEMk
e-iK'P(a) X k , ( a 2 ) = - K 2
(A.203)
Using the above results we will now discuss the implications of time reversal
symmetry for the energy bands E,(k) in a more explicit form.
Additional degeneracy of energy bands
The characterization of the three Herring cases for space group representations is summarized in Table A.4. Unlike the Frobenius-Schur theorem in
its general form (A.136), the sums in Table A.4 are carried out only over the
subset Mk rather than over the whole space group. Let us first consider a
star with {k} = {-k}.
{k} = {-k}.
In cases b) and c) time reversal symmetry joins the irreducible representations R{k}, and K R { k } , = R{k}fi into one eigenspace of the Hamiltonian.
One therefore has E,(k) = E,(k), meaning that each energy band is at
least 2-fold degenerate because of time reversal symmetry. In case b), the
two bands E,(k) and E p ( k ) belong to different irreducible representations
711
of the space group, and in case c) to the same one. For case a), p formally
becomes v, and the relation E,(k) = E,(k) is a tautology, devoid of new
information. For stars with {k} # {-k},
In case ii), i.e. without inversion, one has {-k} # {k}. The Herring case
b) applies, and the two Bloch functions or spinors ' p k and Kpk = ' p I k belong
to the same energy: we have E(k) = E'(-k) where again E(k) and E'(k)
are, respectively, the energies of the states p k and pk. This means that both
with and without spin, time reversal symmetry causes an additional band
overlap.
712
A.9.7
Compatibility
From the main text we know that the Bloch functions p k u and the pertinent
energy eigenvalues E,(k) are continuous functions of wavevector k within
the first B Z . Therefore the irreducible representations z ) ( k } , , of the space
group in the eigenspace of the Hamiltonian for a given energy eigenvalue
E,(k) must likewise be continuous functions of k. From this observation
one obtains compatibility relations between the small representations v k p
for a symmetry line k and the small representations v k o u for a symmetry
center ko, which lies on this line.
Here, we will study these relations in greater detail. To this end we assume that there is an energy eigenvalue E,(ko) at the symmetry center ko.
The corresponding eigenfunctions should belong to the irreducible representation Vh,,of the small point group P k o of ko. The small point group
P k of a symmetry line on which the symmetry center ko is located forms
a subgroup of P b . A particular irreducible representation Vb,,of P b is
therefore simultaneously also a representation of P k , in general, however,
not an irreducible but a reducible one. Let Z ) b / k p , D b / k p ! , . . . be the irreducible parts into which v k o u decomposes as representation of P k . Because of continuity of the energy eigenvalues and the representations with
respect to k, E,,(ko) must split along the symmetry line k into bands
E,(k), E,!(k), . . ., and their corresponding eigenfunctions belong to the irreducible representations Vb/k,, D b / k p , . . .. One says that the representation
D b v at the symmetry center ko must be compatible with the representations V b / k , , v b / k , ! , . . . along the symmetry line k through ko. In particular cases, the set v k , , / k p , v b / k p j , . . . may reduce to the one representation
v b / k p alone. Then no band splitting occurs, and V b / k , is compatible with
PkOU.
Similar results hold for compatibility between representations at a symmetry line and a symmetry plane which is bounded by the symmetry line.
In section A.10 the compatibility relations are given for symmetry centers
and lines of crystals with diamond and zincblende type structure.
A.10
A.lO.l
Character tables
713
t o be considered are listed in Table A.5. We start with the structure of lower
symmetry, that of zincblende type crystals.
Zincblende structure
The symmetry group of crystals with zincblende structure is given by the
symmorphic space group F13m ( T j ) . The irreducible representations of the
space group are therefore associated with vector or spinor representations of
the small point groups of wavevectors k. The point k = 0 or
has the highest symmetry, namely that of the full point group T d of equivalent directions. The star of I' consists only of I' itself.
The 24 elements of T d belong to 5 classes according to section 2: E , 6S4,
6S2, 8C3, 60~. As before, each class is characterized by a representative
element, the number in front indicating the number of different elements in
the class. Since there are as many inequivalent irreducible representations as
there are classes, the number of these representations equals 6 in the present
case. To satisfy the Burnside theorem two of them must be 1-dimensional,
one 2-dimensional and two 3-dimensional. The two 1-dimensional representations are denoted by I'l(A1) and I'2(A2), the 2-dimensional one by I'lz(E),
and the two 3-dimensional ones by I'Is(T2) and r25(Tl). The notations in
parenthesis are the ones which commonly are used to describe the symmetry
of localized one-electron states of of molecules and point perturbations in
Table A.5: Small point groups P and number s of star points for symmetric points
k of the first B Z in zincblende and diamond type crystals. Properties of the time
reversed k-point are also indicated.
714
3T
I--)
22
dz
11
I--)
22
= -[Iz
---[J?;
I.
- iy
t ) + 212 .!)I,
33
I--)
22
= -Ix
fi
l),
(A.205)
t) + Iz l)].
(A.206)
- iy
- iy
For each representation, Table A.6 also gives a basis of linear combinations
of products of the three Cartesian components x,y, z of the position vector
x, or of the components Jz,Jg,Jz of the angular momentum pseudovector
J. According to their interpretation as basis vectors, these components
transform under the action of an orthogonal operator a-' according to the
715
rl
D~ = rr ,
(A.207)
As far as time reversal is concerned, the star {-k} equals the star {k} in
the case of I", and all 8 representations of the space group belong to the
716
Table A.7: Transformation of the position vector components x,y, z with respect
T d and Oh.
Herring case a). This means that time reversal symmetry does not result in
an additional degeneracy of energy bands at I?.
A
The points ( 0 , 0 , 2 ~ < /with
a ) 0 < C < 1 and all points on symmetrically
equivalent directions are denoted by A. The star of A has 6 points. The
small point group is C2,. This group is Abelian, i.e. each of the 4 elements E ,
C2, C T ~c,,2
~ , forms its own class. Thus, there are 4 different 1-dimensional
irreducible representations A,, A2, A3, A4 as shown in Table A.8. In the
717
Table A.8: Irreducible representations of the small point group Caw of A for
zincblende type crystals. In the lower right corner those group elements are given
which transform the basis functions (shown in the last column) according to the
respective irreducible representation. The components z, y, z refer to the cubic axes.
double group one has, in addition, the class Q. The double group therefore
has 5 different irreducible representations (see Table A.8). The one extra
representation As must be 2-dimensional according to Burnside's theorem.
It coincides with the spinor representation V1 of the full orthogonal group
1
taken as a representation of CzV.Moreover, we have
A1 x V l = A2 x Vr.= A3 x 2 ) =
~ A4 x D L= As.
(A.208)
The time reversed star {-k} differs from the star {k} in the case of A (see
Table A.5). Thus, all representations of the space group at A correspond
to the Herring case b). Since all 5 representations of the small point group
are real, time reversal symmetry joins a certain representation at k with the
same representations at -k. At A one therefore has
Eai(k) = EAi(-k),
i = 1,2,3,4,5.
(A.209)
X
The boundary point (0, 0,27r/a)of the first B Z and all symmetrically equivalent points are denoted by X . The star of X has 3 points, because (0, 0,27r/a)
and (0, 0, -27r/a) are equivalent (in the sense of reciprocal lattice translations). The small point group of X is D M . It contains the 8 elements
E , C2, S4, ,943,
odl, ad2, U21, U22 which form the 5 classes { E } ,{Cz}, ( 5 4 , S,"},
(U21, U22}, {adl, a d z } . Consequently, there are 5 irreducible representations
among which 4 are 1-dimensional, namely XI, X2, X3, X4, and one, X5, is 2dimensional (see Table A.9). In the double group one has the class Q in addition, and, instead of (S4, Sj}, the two classes (S4,QS;} and (QS4, S;}. The
718
&d
of X for the
The time reversed star {-k} coincides with the star {k} of X . All representations at X belong to the Herring case a), so that no additional degeneracy
occurs due to time reversal symmetry.
A
The point (./a)(C, C,6) with 0 < C < 1 and all the points on equivalent directions are denoted by A. The star of A has 4 points. The small point group
is C3v. The 6 elements E , C3, Ci, u v l ,av2, a v 3 are distributed among the 3
classes { E } , {C3,C:}, { u v l ,r V 2 , u v 3 } . There are therefore two 1-dimensional
representations, A1 and A2, and one 2-dimensional representation, A3. In
the double group, each of the three classes of the simple group again forms
a class, and each gives rise to an additional class by multiplying it with &.
Thus there are three spinor representations, two 1-dimensional ones, A4 and
As, and one 2-dimensional one, As. Table A.10 shows all six representations.
The spinor representations are p-equivalent to the vector representations. D 1
z
taken as a representation of C3v coincides with As. One has the relations
719
C3v of
A and L
In the case of A, the time reversed star of -k differs from that of k. Thus,
all representations of the space group at A correspond to the Herring case
at the point k are joined
b). The real representations A1, A2, A3 and
by time reversal symmetry with the same representations in the point -k,
and the two complex representations A4, A5 at k with the complex conjugate
representation at -k. From this it follows that at A
L
The point L is the point of A with = 1. Therefore it also has 4 star points,
and the small point group remains also the same, i.e. C3,. Everything said
about the representations at A applies, therefore, to L as well (see Table
A.lO). The only difference between L and A is that for L the star {-k}
does not differ from, but coincides, with the star {k}. The representations
151,L2, LJ and Lg correspond to the Herring case a), i.e. no additional degeneracy occurs between the corresponding bands. In the case of L4 and L5
one has the Herring case b), so that at L the additional degeneracy
720
(A .2 13)
occurs because of time reversal symmetry.
<
The point ( 3 ~ / 2 a ) ( < , < , O0) ,< < 1 and points on equivalent directions
are denoted by C. The point with C = 1 is denoted by K. The number of
star points amounts to 12 in both cases, and the small point group is C,.
Each of the two elements E and ah of C, forms its own class. Thus, there
are two 1-dimensional irreducible representations, C1 and Cz or K1 and Kz
(see Table A.ll). In the double group of C, each class of C, gives rise to yet
another class through multiplication by &. This means that there are also
two 1-dimensional spinor representations, C3 and C4 or K3 and K4 . These
are p-equivalent to the two vector representations. If taken on the point
group C,, Vi becomes reducible and decomposes into the 1-dimensional
5
spinor representations C 3 , C4 or K3, K4, respectively.
The star {-k} differs from the star {k} in the case of C and K , so that
the Herring case b) applies. Thus the following relations hold at C:
Diamond structure
The diamond structure follows from the zincblende structure if the two primitive f.c.c sublattices are occupied with identical, as opposed to different,
721
atoms. The operation which transforms the two sublattices into each other
thus forms an additional symmetry element of the diamond structure. One
can take it in the form of a glide reflection or the inversion joined with a fractional translation. We choose the second possibility. The inversion is carried
out at the center of the line connecting the two atoms of a primitive unit cell,
and is accompanied by a translation along the connecting line by the amount
a / 4 . The point group of equivalent directions for diamond structure follows,
therefore, from the corresponding point group for the zincblende structure,
i.e. T d , by forming the direct product T d x ci = oh,
As for the zincblende structure, we consider the irreducible representations of the space group at the symmetric points I?, A, X , A, L , C and K . The
corresponding small point groups are shown in Table A.5. Adding inversion,
the number of star points either doubles in comparison with the point group
of the zincblende structure, or remains the same while doubling the number
of elements of the small point group. For all symmetry points mentioned
above the stars {k} and {-k} are the same in the diamond structure (see
Table A.5).
Due to the occurrence of fractional translations in the space group of the
diamond structure, not only vector and spinor representations of the small
point groups are required, as in the zincblende case, but also projective
representations with other factor systems. Actually, this applies only to
k-points at the boundary of the first B Z , i.e. to X , L , and K , for only
these k-points can have a crystallographic factor system different from the
identity system. As we will see below, only that of X differs essentially from
the identity system, while that of L is p-equivalent to the latter, and that of
K is equal to it. Those of I',A, A and C are equal to the the identity system
anyway.
The small point groups C3wand CzWoccur in both the diamond and zincblende structures. Since the crystallographic factor systems of the respective
symmetric points A, K , and C are the identity systems in the diamond
structure as well, the same irreducible representations of these groups apply
as in the zincblende case.
On the other hand, there are small point groups in the diamond structure which can be obtained from small point groups in the zincblende structure by multiplying with Ci. In fact only one of the remaining small point
groups of the diamond structure cannot be generated in this way, namely
the point group C4w. In all other cases this construction is possible, one
has oh = 0 x Ci,D4h = D u x Ci and D u = D3 x Ci. If, in these cases,
the crystallographic factor system is the identity system also in the diamond
structure, then the irreducible representations found for zincblende case can
also be used to obtain the representations for diamond case. Let us consider
r: The small point group oh of I? in the diamond structure follows from
the corresponding small point group 0 of I' in the zincblende structure by
722
The five irreducible vector representations of T d generate 10 vector representations of oh, and the three spinor representations of T d give rise to 6
spinor representations of oh. The 16 representations are listed in Table A. 12.
Vector representations for which the inversion is associated with +1 are denoted by the same symbol as for T d . Vector representations with -1 for the
inversion are marked by a prime on the symbol for the corresponding repre
sentation of T d . The two 3-dimensional representations are an exception, in
their case the symbols of the representations are exchanged in the transition
of oh,
from T d to o h : r15 with +I for I becomes the representation
and r15 with -1 for I becomes the representation r 1 5 of oh, while r 2 5 with
$1 for I becomes ri5 and r25 with -1 for I becomes r 2 5 (see Table A.13).
These somewhat strange relations result from the fact that the notations I'15
and r25 were originally introduced for the 3-dimensional representations of
the cubic group 0 rather than the tetrahedral group T d . In the presence
of spin, the representations of oh with fl for I are denoted by an upper
index ' + I , and those with -1 for I by an upper index ' - I on the symbol for
the corresponding representations of T d . Simultaneously, the designations
r6 and r7 are exchanged in the transition from T d to oh.
The representation V1 of the full orthogonal group becomes I:' on oh.
T
In addition, we have the following relations:
rL5
r12x D~ = r$ , r:,
v 1 x r,
-1 a"
0-
+ b 3
(3
3 1 3
I 3
I 3
1
3
I 3
1
3
I 3
k3
C"(h
1
3
01;;
-I$ $
Table A.12: Irreducible representations of the small point group oh of I? for diamond type crystals.
724
Td
r15
Oh
Basis functions
rh5
for
I =1
~ y , y ~ , ~ l :
r15 for
I =i
c , ~ , z
rz5 ri5
for
I=1
J x , J y , Jz
rz5
for
I =I
X(VU - 12).
r$
for
I =1
r;
for
for
I =1
for
I =i
r,
r7
r;
=i
- ax), Z ( X O - W )
~ ( Z Z
t 1 4$ ) , I & J)>
{I T),I 1))
(A .2 15)
A
The star of A has 6 points, and its small point group is C4,,. The 8 elements
are assigned to classes as follows: { E } , {C4,C43}, {Ci}, {cl,c3}, {ff2,c4}.
Thus there are five ordinary irreducible representations, among them four
1-dimensional ones, A l , A2, A3, A4, and a 2-dimensional one, As (see Table
A.14).
In the double group we find the new class {Q}, and the class {C4, C,"}
of the simple group splits into the two classes {C4, Q . Cl} and {Q . C4, Cz}.
There are thus two spinor representations, A6 and A?, of dimension 2. The
representation Vi coincides with A6 on C4,,. We have the relations
A1
A2 x
Vr = A h
V L =a: X
Dl
=As,
Vr = A ? , A5 x V + = & + A ? .
(A.216)
(A .2 17)
The time-reversed star {-k} equals the original star {k} in the case of A.
All representations of the space group in A correspond to the Herring case
a), i.e. no additional degeneracy occurs because of time reversal symmetry.
725
CdV
of A for
X
The star of X has 3 points. Its small point group is D4h. According to
section A.3, we have D4h = D2d x ci. For elements of the subgroup D 2 d of
D4h the crystallographic factor system w ( q , a 2 ) has the constant value 1
because D2d forms a subgroup of T d and none of the elements of T d is joined
with fractional translations in the space group of the diamond structure.
For the remaining elements, the factors w ( c r l , c q ) generally differ from 1
and must be calculated separately. For a(1,I),for example, it follows that
exp[i(4~/a).(a/4)]= -1. It turns out that the crystallographic factor system
of D a is not pequivalent to the identity system in the case of X and the
diamond structure. It belongs to a class different from that of the identity
system.
In constructing the vector representations of D4hr one may take advantage of the identity D4h = D4 x Ci proved in section A.3. Then the 16
elements of D4h can be associated with the 5 classes of D4 as well as 5 further classes, which are generated by the previously mentioned ones through
multiplication by I . Altogether one has, therefore, 10 classes and thus also
10 vector representations of D4h. These are shown in the upper part of Table
A.15 and are denoted by Xp,XH, ..., Xg and Xp', X;', ...,Xg'. For the space
group representations at X without spin the vector representations of D4h
are not helpful, one needs the projective representations of this point group
with the crystallographic factor system discussed above. These are shown
in the middle part of Table A.15. The characters are given for each element
individually because the character is no longer a class function for the projective representation under consideration. Remarkably enough, none of the
726
Table A.15: Projective irreducible representations of the small point group D4h of
X with three different factor systems correspondingto diamond type crystals. The
vector representations (those with unit factor system) are given in the upper part,
the vector representations with crystallographic factor system in the middle part,
and the spinor representations with crystallograhic factor system in the lower part.
-I
2c4
xt"
xz"
x;
x;
xs"
x;'
xz"'
x;'
xi'
XgVI
x1
x2
x3
x4
x5
i
0
-I
-
C4I
c;
cir
U21I
U23'
U22I
D4h.
The time reversed star {-k} equals the original star {k} in the case of
X . Time reversal symmetry does not result in additional degeneracy at X,
u24r
727
Table A.16: Irreducible representations of the small point group D 3 d for diamond
type crystals.
Basis functions
2c3
3U"
zc31
L1
*I;zx
L2
JZ
L3
{ J z - Jz, Ja, - J r )
L;
x i l l i =
L;
.( - U)(# - 2 x 2 - x)
Lb
t=- 2 , u - 2 )
3avI
f uu
with (xu.)
(JzJYJz)
L g - h i s from Tab.A.6
with ( z v z )
0
(JzJYJz)
II t),Ill}
Lq-basis from Tab.A.6
1
1
since all representations of the space group belong to the Herring case a).
A
The star of A has 8 points. The small point group is C B ~and
, the crystallographic factor system (A.186) equals the unit system, just as in the
case of the zincblende structure. Thus the same irreducible representations
apply as those shown in Table A.9 for the zincblende structure. Unlike
the zincblende structure, however, the time reversed star {-k} of A equals
the original star {k} of k in the case of the diamond structure. The repcorrespond to the Herring case a), and the two
resentations A l , A2, h3,
representations A4, A5 to the Herring case b). The energy levels A4 and As
are therefore degenerate because of time reversal symmetry. One has
EA,(k) = EA,(k) at A.
(A.218)
L
The star of L has 4 points and its small point group is D M = D3 x Ci.
728
Therein elements occur which, like the inversion, are joined with fractional
translations in the space group of diamond type crystals. Some of these
elements, among them again the inversion, transform L into an equivalent,
but not identical point, whereby the crystallographic factor system differs
from the identity system. It can, however, be shown that this system is p equivalent to the identity system. The projective irreducible representations
of D3d to be considered here are, therefore, pequivalent to the ordinary
irreducible representations of this point group. Due t o the relation D3d =
D3 x Ci the latter can be generated from the irreducible representations of
D3 by multiplication with the two representations of Ci. The results are
shown in Table A. 16.
The 6 elements of D3 belong to the 3 classes { E } ,{C3, C,},{ U z l , U22, U23}.
In the double group, each of these classes gives rise to a further class. There
fore, two 1-dimensional representations L1, L2, and a 2-dimensional one, L3,
exist without spin, and also two 1-dimensional representations, Lq, L 5 , and
a 2-dimensional one, L6, with spin. In the group D u = D3 x Ci,one has the
vector representations L1, L2, L3, and the spinor representations Lsf, L,f, Lg
corresponding to +1 for the inversion, as well as the vector representations
L i , Lh, L3, and the spinor representations L 4 , L g , L;, corresponding to -1
for the inversion (see Table A.16). The repTesentation D1/2 of the full orthogonal group coincides with L$ on D 3 d . In addition, the following relations
hold:
L3 x V + = L z + L.$+ L.$ ,
L$ x
D zI= L 4 + L ,
+ L;.
(A.219)
The time reversed star {-k} equals the original star {k} in the case of L.
The representations L1, La, L.3, L i , Lh, L i as well as L$ and L; belong to
the Herring case a), and the spinor representations L z , L,: and L 4 , L c to
the Herring case b). Thus, time reversal symmetry joins L4f, L z as well as
L.4, L , to representations in the same energy eigenspace. We conclude that
at L,
(A.220)
The small point groups of C and K are both CzV,and the stars both have
12 points. The crystallographic factor systems (A.186) are identical and in
both cases equal the identity system. Therefore, only ordinary and spinor
representations have to be considered. We formulate the results for C. The
point group CzV has already been treated as the small point group of A
729
Table A.17: Irreducible representations of the small point groups CzVof C and K
for diamond type crystals.
av2
Basis functions
A. 10.2
Multiplication tables
Td
730
Table A. 19:
r with spin.
A. 10.2
Multiplication tables
731
732
x: + x; + xs"
x3"+ x; + xgu
x; + x; + xgv
1-1
11
Ad
A5
A6
733
1-
734
A. 10.3
Compatibility relations
Zincblende structure
Diamond structure
735
736
r; -A;
r;
h2
r; - c2
L2
r;
r;,<
A;
A;
Ah
r; r;,-
ra
A3
r;,
c3
<
c2
c3
L3-
L;
La___
Lg -
737
Appendix B
($'(X)l$(X))
# $.
(B.1)
In section 2.2 it is verified that the factors @(X)at @(x, X)d(X)in the total
wavefunctions $(x,X)$(X)obey a Schrodinger equation with a Hermitian
Hamiltonian as well. Thus they also may be assumed to be orthogonal,
(4'14) = 0,
4 # 4.
(B.2)
--+
X + t = (XI+ t,Xz+ t , .. . , X J +t ) ,
03.4)
thus a displacement of the whole crystal, through the vector t. Then the
eigenfunctions $(x, X)$(X)of the crystal Hamiltonian can simultaneously
738
xi +cj
pi
Pj.
be chosen as eigenfunctions of the total momentum operator
We will do so and also assume that the crystal as a whole is at rest, so that
The third property we will use involves strong localization of the wavefunctions +(X)at atomic cores. This follows from the large masses of cores - in
order for the kinetic energy operator Tc = &(l/Mj)P! in the Schrodinger
equation (2.25) to compete with the potential energy term, large second
derivatives of the core wavefunction +(X),
i.e., strong spatial changes, are
necessary. These can be achieved only through strong localization of these
functions, because the normalization condition requires that d(X) shall decay as X approaches infinity. In the case of a crystal, the core wavefunctions +(X)can have non-zero values only in direct proximity of the lattice
points. In classical terms this corresponds to cores executing small oscillations around their equilibrium positions.
Fourthly and finally, we use that the electron wavefunction $(x,X),
taken as a function of X,is smooth compared with +(X)which was found
to vary rapidly with X. This may be understood as follows: First, we note
that, because of the small mass of electrons, the argument for strong localization of the core wavefunctions +(X) with respect to core positions X is
not applicable to the variation of the electron wavefunctions +(x, X) with
respect to electron coordinates x. The electron wavefunctions @(x, X), taken
as functions of x,are rather spread out more or less uniformly over the entire
crystal. Second, we form the internal product of relation (B.6) with +(X).
Using the relation
(4IPjld) = 0
(B.6)
which holds because of the oscillatory character of the core motion, we arrive
at
The total number of terms on the left hand side of (B.7) is of the same order
of magnitude as the total number of terms on the right hand side, because
the number of (valence) electrons equals, apart of a factor of the order of
magnitude 1, the number of cores. Moreover, the order of magnitude of the
terms with different i is the same throughout, as is the order of magnitude
of the terms with different j . Therefore, the order of magnitude relations
739
follow for all i, j , indicating that $(x, X)is as smooth with respect to
it is with respect to x. From relation (B.8) it follows also that
X as
XI.
(B.9)
The four properties of the wavefunctions $(X)and $(x, X)discussed above
P?$(X, X) M Pj2$(X,
(B.lO)
In rewriting this expression we use the third property concerning the wave
function 4(x) and the fourth concerning the wavefunction $(x, X).Accordingly, the matrix element with respect to the electron states in B.10 depends
only weakly on X, and the factor @*(X)#(X)
differs substantially from zero
only in small environments around the equilibrium positions of cores. Thus,
the matrix element can be approximately evaluated at the equilibrium values
of the core coordinates X and be factored out of the integral over X. In this
way one gets
(@+l4Tc$)
(4 14)($ ITcI$ ) .
(B.ll)
740
(B.15)
The two order of magnitude relations (B.14) and (B.15) are the equations
(2.23) and (2.24) used in section 2.2.
741
Appendix C
Occupation number
representation
Within the oneparticle approximation, Slater determinants @vly...vN(x~,
x2,
. . . ,XN) of an N-electron system describe stationary many-particle states
composed of oneparticle states vi. In a Slater determinant, a particular
oneparticle state is not ascribed to an individual electron, but all electrons
of the system may be associated with this state with the same probability.
This reflects the principle of indistinguishability of elementary particles in
quantum mechanics. Owing to this principle, a many-electron state cannot
be characterized by statements like electron 1 is in one-particle state v1,
electron 2 is in oneparticle state v2, etc. One can only describe the occupation of a particular oneparticle state by electrons, without identifying
any individuality of the occupying particles.
Slater determinants in occupation n u m b e r representation
For electrons, a particular one-particle state vi (of definite spin) can either
be occupied with one particle, or not be occupied at all. This observation is
most naturally described by means of an occupation number Nv - if the state
v is occupied, then one has Nu = 1, and if the state is empty, then Nu = 0
holds. For simplicity, we identify the quantum numbers v of one-particle
state with integers i, where i runs from 1 to 00, as there are idnitely many
one-particle states.
The selection {v} = v1,v2,. . . ,UN of one-particle states entering a particular Slater determinant @iV},determines the entirety of occupation numbers
for all one-particle states, i.e. the sequence N1, N2,. . . ,N,, as follows: the
Ni-value of a state whose quantum number i coincides with one of the quantum numbers vl, v ~ . ,. . , V N ,is 1, and the Ni-value of a state whose quantum
number i differs from all quantum numbers v1, v2,. . . , v ~ is, 0. Conversely,
742
(C-1)
@{V}(Xl, x2r..
To facilitate use of the occupation number representation, one needs operators whose effects on the basis functions I N1, N 2 , . . . , N,) of Fock space are
known. To this end one introduces so-called annihilation operators ai and
creation operators a. They are defined by the relations
ai I N1, N 2 , . . . , N i , . . . ,N,) =
at
I N1, N 2 , . . . ,N i , . . . ,N,)
(N1,N 2 , . . . , 0, . . . , N,),
Ni
1,
0,
Ni = 0.
0,
Ni = 1,
JN1,N 2 , . . . , 1
. . . ,N , ) ,
N.j = 0.
743
$v(xl, x2,. . * 7 X N )
(C.4)
j=l
a:ai
1 N1, N 2 , . . . , N i , . . . , N,)
= Ni
1 N1, N 2 , . . . ,N i , . . . , N , ) .
(C.5)
The states 1 N1, Nq, . . . ,N i , . . . , N,) are therefore eigenvectors of a:ai, and
the particle number Ni of state i is the pertinent eigenvalue. One calls a r a i
the particle number operator of the state i. We abbreviate it by I?i, i.e. we
set
1.
It
Using the definition of the Hermitian adjoint operator (a+)h.a.of a:, we may
write
[ A ,B]+= A B
With this definition we have
+ BA.
744
[a:,
[Ui, U j ] +
= bij,
= [a:, a;]+ = 0.
(C.10)
((2.11)
(C.12)
$ ~ y ~ (xx2 ~
,.,
. ., XN), are the same as the matrix elements of Q between the
pertinent occupation number vectors 1 Ni,N;, . . . , N k ) and 1 N 1 , N 2 , . . . ,
Nm). We have to prove, therefore, that the identity
is valid. As far as the left hand side of this equation is concerned, one
easily verifies that the only non-vanishing matrix elements are those between
Slater determinants which differ solely in one column. This means that in
745
(v;,vi,. . . , v h ) only one component, say vi, can have a value different from
in ( v l ,v 2 , . . . , v ~ ) .The non-vanishing matrix element reads
v1
(@{vO
I Q I @(v})
(v: I q
1~1).
(C.15)
( N i ,N L , . . . ,N L
I a$uu I N i , N 2 , . . . , N m ) .
(C.16)
The matrix element of u$uU in (C.16) differs from zero only when v =
and v = vi hold. Thus, it follows that
v1
747
Bibliography
General literature on quantum mechanics, solid
state physics and semiconductor physics
Ashcroft, N.W., and N.D. Mermin (1976), Solid State Physics, Holt Reinhart
and Winston, New York.
Blakemore, J.S. (1985), Solid State Physics, Cambridge University Press.
Boer, K.W. (1990), Survey of Semiconductor Physics, Van Nostrand Reinhold, New York.
Bonch-Bruevich, V.L., and S.G. Kalashnikov (1982), Halbleiterphysilc, Deutscher Verlag der Wissenschaften, Berlin.
Callaway, J. (1976), Quantum Theory of the Solid State, Academic Press,
New York.
Enderlein, R. und A. Schenk (1991), Grundlagen der H a l b l e i t e p h y s i ~Akademie-Verlag, Berlin.
Harrison, W.A. (1980), Electronic Structure and the Properties of Solids:
The Physics of Chemical Bond, Freeman & Co., San Francisco.
Harrison, W.A. (1980), Solid State Theory, Dover.
Haug, A. (1972), Theoretical Solid State Physics, Pergamon Press, Oxford.
Kittel, C. (1963), Quantum Theory of Solids,John Wiley & Sons, New York.
Kittel, C. (1986), Introduction to Solid State Physics, John Wiley & Sons,
New York.
Landau L.S., and E.M. Lifshitz (1958), Quantum Mechanics, Pergamon
Press, Oxford.
Landoldt-Bornstein (1982), Numem'cal Data and Functional Relationships
an Science and Technology, New Series, Volume 17, Semiconductors, Eds.
Madelung O., Schulz, M., Weiss, H., Springer-Verlag, Berlin, Heidelberg,
New York.
Madelung, 0. (1978), Introduction to Solid State Theory, Springer-Verlag,
Heidelberg.
Messiah, A. (1968), Quantum Mechanics, North -Holland Publishing Company, Amsterdam.
748
Bibliography
Moss, T.S. (Ed.) (1982), Handbook on Semiconductors, Volume 1-4, NorthHolland Publishing Company, Amsterdam.
Chapter I
Adler, M., H. Fritzsche, and S.R. Ovshinsky (Eds.) (1985), Physics of Disordered Materials, Plenum Press, New York, London.
Bonch-Bruevich, V.L., R. Enderlein, B. Esser, R. Keiper, A.G. Mironov, LP.
Zvyagin (1984), Elektronentheorie ungeordneter Halbleiter, Deutscher Verlag
der Wissenschaften, Berlin.
Braun, C.L. (1980), Organic Semiconductors, in Handbook of Semiconductors 3,North Holland Publ. Co., Amsterdam.
Elliot, S.R. (1983), Physics of Amorphous Materials, Longman, London,
New York.
Epifanov, G.I. (1979), Solid State Physics, Mir Publishers, Moscow.
Kleber, W. (1970), A n Introduction to Crystallography, Verlag Technik,
Berlin.
Madelung, 0. (1964) Physics of 111-V-Compounds, J. Wiley & Sons, New
York, London, Sidney.
Bibljography
749
Chapter I1
Abrikosov, A.A., L.P. Gorkov, and I.E. Dzyaloshinski (1963), Methods of
Quantum Field Theory in Statistical Physics, Dover, New York.
Bechstedt, F. (1992), Adv. Sol. State Phys. 32, 161.
Bib, H., and W. Kress (1979), Phonon Dispersion Relations in Insulators,
Springer-Verlag, Berlin.
Bir, G.L., and G.E. Pikus (1974), Symmetry and Strain Induced Effects in
Semiconductors, John Wiley & Sons, New York. London, Sydney.
Bloch, F. (1928), Z. Physik, 52, 555.
Born, M., and K. Huang (1968), Dynamical Theory of Crystal Lattices,
Clarendon Press, Oxford.
Born, M., and I.R. Oppenheimer (1927), Ann. der Phys. 84, 457.
Brillouin, L. (1953), Wave propagation in periodic structures, Dover, New
York.
Callaway, J. (1964), Energy Band Theory, Academic Press, New York, London.
Chadi, D.J., J.P. Walter, M.L. Cohen, Y. Petroff, and M. Balkanski (1972))
Phys. Rev. B 5 , 3058.
Chelikowsky J.R. and M.R. Cohen (1974), Phys. Rev. B 10, 5059.
Economou, E.N. (1979) Greens Functions in Quantum Mechanics, SpringerVerlag, Berlin.
750
Bibliography
Chapter I11
Section 3.4 Shallow levels
Aggarwal, R.L., and A.K. Ramdas (1965), Phys. Rev. 140, A1246.
Bibliography
751
Alves, J.L.A., and J.R. Leite (1986), Phys. Rev. B 34, 7174.
Barraff, G.A., and M. Schluter (1980), Phys. Rev. B 19, 4965.
Beeler, F., O.K. Andersen, M. Scheffler (1985), Phys. Rev. Lett. 55, 1498.
Beeler, F., O.K. Andersen, M. Scheffler (1990), Phys. Rev. B 41, 1603.
Bourgoin, J., and M. Lannoo (1983), Point Defects in Semiconductors 11:
Experimental Aspects, Springer-Verlag, Berlin, Heidelberg, New York.
Caldas, M., A. Fazzio, A. Zunger (1984), Appl. Phys. Lett. 45, 671.
Chadi, D.J. (1992), in Materials Science Forum, 83-87,447.
Dabrowski, J., and M. Scheffler (1992), in Materials Science Forum, 83-87,
735.
Delerue, C., and M. Lannoo (1994), in Materials Science Forum, 143 - 147,
699.
Fazzio, A., M.J. Caldas, A. Zunger (1985), Phys. Rev. B 32, 934.
Fazzio, A., M.J. Caldas, A. Zunger (1984), Phys. Rev. B 30, 3430.
Fleurov, V.N., and K.A. Kikoin (1994), Transition metal impurities in semiconductors, World Scientific.
Grimmeis, H.G., and E. Janzen (1986), in Deep Centers in Semiconductors, S.T. Pantelides (Ed.), Gordon & Breach Science Publishers, New York,
London, Paris.
Haldan, F.D.M. and P. W. Anderson (1976) Phys. Rev. B 13,2553.
Hjalmarson, P.H., P. Vogl, D.J. Wolford, J.D. Dow (1980), Phys. Rev. Lett.
44, 810.
Jaros, M. (1982), Deep levels in semiconductors, Adam Hilger, Bristol.
Lang, V., and R.A. Logan (1977) Phys. Rev. Lett. 39, 635.
Langer, J.M., H. Heinrich (1985), Phys. Rev. Lett. 55, 1414.
Lannoo, M., J. Bourgoin (1981), Point Defects in Semiconductors I: Theoretical Aspects, Springer-Verlag, Berlin, Heidelberg, New York.
Lischka, K. (1986), phys. stat. sol. b 133, 17.
Pantelides, S.T. (Ed.) (1986), Deep Centers in Semiconductors, Gordon &
Breach Science Publishers, New York, London, Paris.
Scherz, U., and M. Scheffler (1993), in Semiconductors and Semimetals 38,
page 1, Eds. R K . Willardson and A.C. Beers, Academic Press, New York.
752
Bibliography
Bibliography
75 3
Lei, X.L., N.J.M. Horing, and H.L. Cui (1991), Phys. Rev. Lett. 66,3277.
Leonard, D., M. Krishnamurthy, C.M. Reaves, S.P. Denbaars, and P.M.
Petroff (1993), Appl. Phys. Lett. 63,3203.
754
Bibliography
Bibliography
755
Chapter IV
Blakemore, J.S. (1962), Semiconductor Statistics, Pergamon Press, London.
Fistul, V.I. (1969), Heavily Doped Semiconductors, Plenum Press, New York,
London.
Landsberg, P.T. (1978) , Thermodynamics and Statistical Mechanics, Oxford
Univ. Press.
Landsberg, P.T. (1982), in Band theory and transport properties, Handbook
on Semiconductors 1,Ed. T.S. Moss, North Holland PubL Co., Amsterdam.
Shklowskii, B.I., and A.L. Efros (1984), Electronic properties of doped semiconductors, Springer-Verlag, Berlin.
Chapter V
Greenaway, D.L. and G. Harbeke (1968), Optical Properties of Semiconductors, Pergamon Press, New York, London.
Haug, H., and S. W. Koch (1994), Quantum Theory of the Optical and
Electronic Properties of Semiconductors, World Scientsc.
Hummel R. E. (1985), Electronic Properties of Materials, Springer-Verlag,
Berlin, Heidelberg, New York, Tokyo.
Kao, K.C., and H. Hwang (1981), Electrical Transport in Solids, International Series in the Science of the Solid State, Volume 14, Pergamon Press,
New York, London.
Klingshirn, C.F. (1995) , Semiconductor Optics, Springer-Verlag, Berlin, Heidelberg, New York.
Peuker, K. (1982), R. Enderlein, A. Schenk, and E. Gutsche, phys. stat. sol.
b 109, 599.
Ftidley, B.K. (1982), Quantum Processes in Semiconductors, Oxford University Press.
Scholl, E. (1987) , Non-equilibrium phase transitions in semiconductors, Springer-Verlag, Berlin.
Singh, J. (1993), Physics of Semiconductors and Their Applications, McGrawHill, New York.
Zawadzki, W. (1980), (1982), in Band theory and transport properties, Handbook on Semiconductors 1,Ed. T.S. Moss, North Holland PubL Co., Amsterdam.
756
Bibliography
Appendix A
Heine, V. (1960), Group theory in Quantum mechanics, Pergamon Press,
New York.
Koster, G.F. (1957) Space groups and their representation, in Solid State
Physics 5, page 174, Academic Press, New York.
Ljubarski, G. J. (1962), Anwendungen der Gruppentheorie in der Physik,
Deutscher Verlag der Wissenschaften, Berlin.
Streitwolf, H.-W. (1967), Gruppentheorie in der Festkorperphysik, Akademische Verlagsgesellschaft Geest & Portig K.-G., Leipzig.
Wigner, E.P. (1959), Group theory and its application to quantum mechanics
of atomic spectra, Academic Press, New York.
757
Index
Abelian group 86, 624, 626, 666
ab initio methods 133
absorption 41, 433
acceptor level 474
acceptor transition 304, 321
acceptor 271, 276, 277, 279, 490,
507
accumulation layer 553, 555, 613
actinides 231
activation enthalpy 239, 240
active region 600
additional degeneracy 710
adiabatic approximation 57, 59, 61,
355, 737
adiabatic potential 60
adiabatic 61, 62, 64
Airy-function 436
all electron problem 134
alloy 27, 403
a-Sn 29, 210
amorphous semiconductor 2, 27
amphoteric center 305, 497
amplification 592
angular momentum basis 192
angular momentum matrix 187
angular momentum operator 681
Angular Resolved UPS (ARUPS)
374, 379
anisotropic effective mass 273
annealing 238
annihilation operator 742
annihilation 506, 527, 529
anthracene 28
anti-bonding energy level 291, 292,
3 14
758
base 586, 588, 589
basis of crystal 17, 23
BenDaniel-Duke boundary condition 417
binary compound 3
binding energy 273, 275, 276, 280,
282
bipolar generation 511
bipolar transistor 4, 50, 536, 573,
585, 590
Bloch electron 501, 524
Bloch factor 92,101,129,135,179,
255, 256, 415, 425, 699
Bloch function 87, 92, 109, 132,
254, 255, 357, 699
Bloch integral 256
Bloch oscillation 438, 439
Bloch state 120, 437
Bloch sum 135, 364
Bloch theorem 85, 87, 89, 92, 130,
356
blocking direction 585, 591
bluegreen laser diode 407
body-centered 10
Bohr radius 267, 270
Boltzmann distribution 42, 461, 480
Boltzmann equation 500, 501
bond orbital approximation 171,173
bonding energy level 291, 292, 314
bonding orbital 166, 172, 177
Born-Oppenheimer approximation
57
boron 45, 46, 47
Bose distribution function 462
bound interface state 390, 391
bound state 268
bound surface band 361, 374
bound surface state 361, 386, 387
Bragg reflection line 358
Bragg reflection plane 101, 102, 103,
104, 105, 358, 410
Brattain 536
Braun 4
INDEX
INDEX
759
correlation 78, 301, 475
Coulomb interaction 56, 72,165
Coulomb potential 70, 119
Coulomb repulsion 462, 463, 464,
465, 466
covalent bonding 174, 175
creation operator 742
critical point 119, 126, 473
critical thickness 404, 406
crystal class 13, 14, 345
crystal field splitting 222, 324
crystal lattice 6
crystal structure 14, 15, 21, 30
crystal system 9, 13, 14, 345
crystal without basis 12
crystal with basis 12
crystal 6
cubic crystal system 10
cubic lattice 10
current-voltage characteristic 559,
585, 594, 610
cyclic subgroup 625
cyclotron motion 455
cyclotron resonance 211, 443, 455,
456
d-electron 53, 134, 321
d-orbital 138
d-shell230, 232, 307
dangling bond 382
dangling hybrid 286, 377
Debye screening length 547, 555
deep center 281,282,284,299,307,
318, 393, 462, 509, 519
Deep Level Transient Spectroscopy
(DLTS) 309
defect molecule model 285, 287, 312,
323, 365, 367
defect molecule 321
defect structure 32
defects 227
degenerate electron gas 615
Dember effect 595
760
density functional theory 79, 136
density of states (DOS) 42, 120,
123, 124, 126, 127, 128,
268, 426, 469, 472, 472
density of states mass 126
depletion mode 622
depletion region 545, 546, 577
detailed balance 513
device 490, 499, 535, 548, 567, 573
diamagnetic 449
diamond structure 15, 17, 18, 19,
116, 118, 720
diamond type semiconductors 148,
198
diamond 3, 16, 29
diffusion coefficient 239, 506, 530
diffusion current 506, 528, 539
diffusion length 529, 530, 581
diffusion theory 611
diffusion voltage 541, 542, 561, 576
diffusion 239, 502, 508, 527, 528,
529, 537
Dimer-Adatom-St acking-Fault (DAS)
model 381, 383
dimerization 382
Diophantin equation 337
direct gap semiconductor 224, 215
direct product 625, 626, 667, 691
displacement groups 627
distribution function 472, 500
divacancy 233
DLTS 310
donor level 270, 474
donor transition 303, 321
donor-acceptor pair 232
donor 270, 271, 276, 486, 490, 507
doping microstructure 402, 408
doping 35, 36, 398, 486, 488
DOS 432,454, 455
double acceptor 464
double donor 304, 463, 465, 494
double group 130, 685, 686, 715
double heterostructure 397, 400, 401
INDEX
drain 620
drift current 506, 523
drift velocity 525
drift 506, 508
DX-center 233, 235, 333
Dyson equation 79
effective density of states 43, 480,
48 1
effective g-factor 449, 450
effective mass anisotropy 274
effective mass equation 252, 253,
258, 259, 266, 272, 418,
420, 434, 444, 449, 452,
539
effective mass tensor 121, 262
effective mass theory 412, 414
effective mass 119, 121, 126, 182,
183, 211, 216, 219, 440,
449, 472
effusion cell 398
Einstein relation 532, 533
EL2 center 231, 333, 234
electric field 45, 433, 440, 499, 501,
506
electric potential 539, 542
electrical conductivity 34, 35, 41,
49
electro-optic frequency 436
electrochemical potential 533, 552
electroluminescence display 31
electron affinity 558
electron concentration 34, 42, 469,
470, 484, 492
electron diffraction 371
Electron Energy Loss Spectroscopy
(EELS) 375
electron gas 478
electron system 70, 72, 457
electron trap 490
electron-core interaction 56, 242
electron-electron interaction 70, 72,
178, 245
INDEX
76 1
762
finite group 624
first Brillouin zone 98, 106, 107,
110, 114, 116, 700
first SL BZ 410, 411
flow direction 585
Fock space 742
folding 107, 108, 411
forbidden zone 39, 108
forward bias 585
Franz-Keldysh effect 442
Franz-Keldysh oscillations 442
free carriers 527, 533
freez-out of carriers 487
Frenkel defect 233
frozen-core approximation 53, 133
full orthogonal group 636
full rotation group 636, 673
Ga-rich surface 388
GaAs 4, 30, 31, 220, 395, 384, 387,
398, 411, 429, 431
gain coefficient 599
GaN 30, 31
gap discontinuity 394, 403
GaP 5, 220
gate voltage 621
gate 621
generating elements 637, 638, 639
generation center 519
generation of free carriers 509
generation 506
generators of infinitesimal rotations
6 74
Ge 2, 29, 36, 218, 444, 456, 488,
492, 497
Gibbs free energy 235
glide-reflection 14,15,632,633,635
grain boundaries 240
grand canonical ensemble 458
Greens function method 295, 368
Greens function 79, 295, 298, 299,
369, 370
INDEX
ground state 70, 73, 274, 279, 468,
469, 491
group IV semiconductors 28, 31,
313, 566
groups 7, 82, 87, 623
group theory 623
GW approximation 134
gyromagnetic ratio 448
Hall constant 49
Hall effect 43, 44, 46
Hartree approximation 68, 66, 76
Hartree energy 248
Hartree potential 69, 70, 77, 241,
245, 247, 265, 428
Hartree-Fock approximation 68, 76,
134
Hartree-Fock equation 77, 81
heavy hole 189, 197, 204, 210
Hellman-Feynman forces 63, 64
HEMT 407, 557, 574
hermiticity 417
Herring cases 710
Herring criterion 695, 707, 711
heterojunction 388, 390, 394, 396,
535, 536, 549, 550, 552,
574
heterojunction bipolar transistor (HBPT)
406
hexagonal lattice 22, 23
HgTe 209, 221
High Electron Mobility Transistor
(HEMT) 402
Hittorf 3
hole capture 515
hole subbands 429
hole trap 490
hole 46, 47, 48, 428, 429, 489
holohedral point group 8, 14
homomorphism 626
Hubbard energy 249,280,305,309,
310, 328, 463
hybrid orbital 168, 285, 286
INDEX
hydrogen atom 119, 141, 267
hydrogen model 266, 273
ideal surface 340, 341, 343, 354,
367
ideal crystal 5, 50, 225, 226
ideal semiconductor 470, 472, 482
ideal wurtzite structure 25
identity element 623
identity representation 659, 669
impact ionization 440
imperfect semiconductor crystal 225
improper rotation 632
impurity atom 35, 50, 226, 230,
237, 238, 250, 251, 312,
474, 501
indirect gap semiconductor 215,221
indirect gap 394
infinite group 624
infinitesimal rotation 674
infrared detector 31
injection current 580
injection 510, 571
inner shell electron 51
inorganic semiconductor 28
input power 592
input resistance 592
insulat or-semiconductor junction 535,
6 12
insulator 1, 34, 41
integrated circuit 622
inter-valley coupling 273, 274
interband transition 438
interband tunneling 440
interface charge density 571
interface field 571
interface potential 571
interface resonance 390
interface state 564, 565, 567, 570,
614
interface 388
internal photoeffect 595
internal transition 303
764
Koster-Slater equation 296, 317
Koster-Slater method 295
Kronig-Penney problem 422
Lagranges theorem 625
Landau level 453
Landausubband454
laser diode 30, 50, 599, 600
lattice constant 10, 16, 18, 25, 64,
403
lattice matched heterostructure 404
lattice misfit 404
lattice mismatched heterostructure
404
lattice mismatch 403
lattice oscillations 64, 65
lattice plane 336, 338
lattice relaxation 234
lattice translation 7, 82, 87, 635
lattice 6, 7, 9, 336
LCAO 135
lead sulfide 4
LEED (Low Energy Electron Diffraction) 372, 373, 386
Levinson theorem 268, 270, 301,
361, 476
lifetime of non-equilibrium electrons
5 14
ligand field theory 324
light emitting diode (LED) 30, 31,
604, 605, 606
light hole 189, 197, 208
line defect 240
line perturbation 227
linear approximation 504
Liouville theorem 501
liquid semiconductor 2
LMTO method 139
local density approximation (LDA)
80, 134
local equilibrium 503, 574
localized state 246, 460, 457, 462
long-range order 5
INDEX
INDEX
765
766
occupation inversion 599, 600
occupation number representation
458, 741
occupation number space 742
occupation number 458, 741
Ohms law 502, 504, 505, 525
ohmic contact 561, 562
ohmic metal-semiconductor contact
6 12
one-electron approximation 355
oneelectron potential 133, 134
one-electron Schrodinger equation
241, 355
oneelectron state 74
oneparticle approximation 57, 66,
68, 71, 73, 77, 741
one-particle energy 72, 73, 309
one-particle excitation energy 73
oneparticle excitation 73, 78, 81
oneparticle Schrodinger equation
66, 70, 71, 74, 76, 77
one-particle state 67, 70, 71, 76,
457, 460, 462
oneparticle wavefunction 66, 67,
71
optical absorption spectrum 38
optical excitation 510
optical fiber communication 31, 407,
605
optical transition 432
optoelectronics 29
OPW functions 135, 136
orbital motion 75
organic semiconductors 28
orthogonal transformation 7, 12, 130,
628, 629, 630
output power 592
output resistance 592
oxidation state 250, 251, 324, 331
p-channel MOSFET 622
p-equivalent 689, 690
p-orbital 138, 143, 150
INDEX
pshell230, 231
p-type semiconductor 47, 50, 489,
496
palladium group 231
partial compensation 490
Pauli principle 39, 42, 67, 76
PbTe 223, 224
Peierls instability 377
Peierls transition 377
periodic continuation 113
periodic potential 94, 100
periodic table 4, 232, 285
periodicity condition 93, 656
periodicity region 54, 55, 57, 66,
85, 656
persistent photoconductivity 332
perturbation theory 98, 99
phenomenological equation 502, 503
phonon assisted tunneling 594
phonons 65, 501
phosphorus 45, 274
photo-conduction 49
photocell 4
photocopying 31
photocurrent 596
photodetector 50, 407, 574, 598
photodiode 598
photoeffect 595
photoemission spectroscopy (PES)
373, 375
photoluminescene 433
photon assisted interband tunneling 442
photoreflectance 572
photothreshold 558
photovoltage 596, 597
pho t ovolt aic element 598
pinning of Fermi level 565
planar defect 240
plane crystal system 342, 343
plane Bravais lattice 342, 343, 358
plane perturbation 227
plane space groups 345, 346, 347
INDEX
plasma oscillations 78
plasmon 78
platinium group 231
pn-diode 585
pn-junction 535, 536, 537, 538, 540,
542, 546, 555, 574, 595
pnp-transistor 585
point contact rectifier 4
point defect 228
point group of equivalent directions
14, 345, 637
point group symmetry 343
point group of directions 637
point group 8, 9, 17, 82, 86, 95,
354, 362, 627, 636, 637,
6412, 647
point perturbation complex 232
point perturbation 227, 239, 241,
242, 474
point symmetry operation 8, 636
point symmetry 7, 9, 83
polyacetylene 28
Poisson equation 507, 508, 546, 554,
362, 570
polar surface 387
precipitate 234
primitive crystal slab 339
primitive lattice vector 6, 8, 17,
338, 699
primitive lattice 6, 7, 9,
primitive unit cell 6, 30
projected bulk band structure 360
projective representation 118, 684,
687, 688, 690, 702, 703
pseudo-wavefunction 137
pseudopotential method 136, 139
pseudopotential 137, 138
quantized Hall effect 536, 556
quantum device 574
quantum dot 402
quantum statistics 479
767
768
representation matrices 658
representations of groups 656
representations with factor system
684, 688
repulsive forces 56
resonance state 301
reverse bias 585
RHEED 398
rigid displacement 7,627,635,654,
682
rocksalt structure 16, 17, 18, 21,
22, 178
rotation group 679
rotation-inversion 7, 14, 640
rotation-reflection 7, 14, 632, 639
rotation 14, 631, 633
Rutherford Backscattering (RBS)
308
Rydberg energy 267
s-atom 146
s-orbital 138, 143, 150
s-shell 230, 231
saturation current density 585,610
saturation 487
scanning tunneling microscopy (STM)
373, 382
Schottky approximation 542, 544,
545, 547, 576
Schottky barrier 559, 563, 564 566,
608, 609
Schottky contact 535, 557, 562, 563,
607, 612
Schrodinger equation 57, 59, 62,
67, 74, 82, 98
Schonflies notation 637
Schonflies symbol 640
screw-dislocation 28, 240, 241
screw-rotation 14, 632, 633, 634
selection rules 670
selenium structure 27, 28
Se 2, 4, 31, 224
self-consistent 74, 133, 134, 428
INDEX
self-energy operator 79
self-interstitial 228
self-organized growth 402
semiconductor device 4, 50, 241
semiconductor heterojunctions 388
semiconductor heterostructures 443,
535
semiconductor microstructures 402
semiconductor optoelectronics 5
semiconductor surfaces 572
semiconductor 1, 34, 41
sensor 574
shallow acceptor 276
shallow center 462
shallow donor 276
shallow level 265, 268, 269, 487
Shockley-Read-Hall recombination
518
Shockley 536
short-range order 5
short-range potential 244,245, 258,
28 1
Shubnikov-de-Haas effect 2 11
S i c 31
silicon wafer 490
silver sulfide 3
simply ionizable acceptor 465, 466
simply ionizable donor 465
single donor 304
single heterostructure 388, 397
SiOz/Si-interface 571
Si02/Si-junction 567
Si 2, 4, 29, 36, 37, 38, 188, 189,
212, 213, 375, 490
slab method 364
Slater determinant 76, 78, 301, 305,
457, 458, 741
small point group 116, 118, 131,
362, 702, 703, 712
smooth potential 252, 258
solar cell 50
solid state shift 52
solid state 5
INDEX
769
770
INDEX
INDEX
2 x 1 reconstruction 376
7 x 7 reconstruction 376, 381, 382
IL x m reconstruction 352
sp2-bonding 333, 334
sp3-bonding impurity atom 314
sp3-hybrid orbital 166, 290, 330
sp3-hybrid 165, 334
&doping structures 409, 430
A-line 111, 112
r-point 112, 118
A-line 112
n-bonded chain model 381,378, 377,
379, 380, 381
r-bonded chain 377
k . p-interaction 182
k . p-perturbation 185
k . p-method 179
k . p-perturbation theory 259, 262
k-space 93
k-vector 93, 95, 105
(100) surface 379, 387, 388
(110) surface 384, 387, 388
(111) surface 376
2-dimensional electron gas 536, 552,
620
2D lattice 337
2D quasi-wavevector 356
3d-TM atom 320
4d-atom 320
4f-level 331
4f-orbital329
5d-level 331
5d-TM atom 320
771