Professional Documents
Culture Documents
Tony Thesis
Tony Thesis
Tony Thesis
This is to certify that I have examined this bound copy of a masters thesis by
Anthony Carlson
Traian Dumitrica
Name of Faculty Adviser
Date
GRADUATE SCHOOL
A THESIS
SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL
OF THE UNIVERSITY OF MINNESOTA
BY
Anthony Carlson
October 2006
Contents
Chapter 1
Introduction
1.1
Fullerenes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.2
Production
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.3
1.4
Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.5
CNT-based applications . . . . . . . . . . . . . . . . . . . . . . . .
1.6
1.7
Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Chapter 2
15
2.1
Bonding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.2
Atomistic Forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.3
Electromagnetic Cohesion . . . . . . . . . . . . . . . . . . . . . . . 18
2.3.1
Repulsive force . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.3.2
2.4
2.3.3
2.3.4
Mathematical models . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.4.1
Empirical . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.4.2
Semi-empirical . . . . . . . . . . . . . . . . . . . . . . . . . 24
2.4.3
First principles . . . . . . . . . . . . . . . . . . . . . . . . . 25
Chapter 3
26
3.1
Graphene . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.2
3.3
3.2.1
3.2.2
Girifalco method . . . . . . . . . . . . . . . . . . . 33
3.3.1.2
Benedict method . . . . . . . . . . . . . . . . . . . 33
3.3.1.3
Zacharia method . . . . . . . . . . . . . . . . . . . 33
3.3.1.4
3.4
Z-axis compressibility . . . . . . . . . . . . . . . . . . . . . . . . . . 34
3.5
Vibrational properties . . . . . . . . . . . . . . . . . . . . . . . . . 35
3.6
3.6.2
3.7
Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Chapter 4
4.1
Empirical . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
4.2
4.3
Chapter 5
48
5.1
5.2
5.3
5.4
Periodic solids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
5.5
5.6
Slater-Koster parametrization . . . . . . . . . . . . . . . . . . . . . 55
5.6.1
Orbital decomposition . . . . . . . . . . . . . . . . . . . . . 55
5.6.2
s-p decomposition . . . . . . . . . . . . . . . . . . . . . . . . 56
5.6.3
5.7
5.8
Fitting TB parameters . . . . . . . . . . . . . . . . . . . . . . . . . 62
5.8.1
5.9
Chapter 6
65
6.1
6.2
System description . . . . . . . . . . . . . . . . . . . . . . . 67
6.2.2
6.2.3
6.2.4
Application to Hydrogen . . . . . . . . . . . . . . . . . . . . 70
6.2.5
6.2.6
6.3
6.2.6.1
6.2.6.2
Slater-Kirkwood approximation . . . . . . . . . . . 75
6.2.6.3
Kirkwood approximation . . . . . . . . . . . . . . . 76
6.3.2
6.3.1.2
6.3.3
Point-Surface Interaction
iv
. . . . . . . . . . . . . . 81
6.3.3.2
6.4
Surface-Surface Interaction . . . . . . . . . . . . . 82
Chapter 7
Damping functions
85
7.1
7.2
Chapter 8
90
8.1
8.2
8.3
Orbital expansion . . . . . . . . . . . . . . . . . . . . . . . . 94
8.3.2
8.3.3
Compressibility calculation . . . . . . . . . . . . . . . . . . . 99
8.4
8.5
Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
Chapter 9
107
9.1
9.2
9.3
9.4
9.5
9.6
9.7
9.8
125
135
. . . . . . . . . . . . . . . . 137
. . . . . . . . . . . . . . . 138
146
152
156
Appendix F Optimization
163
vii
List of Figures
1.1
Graphite to CNT . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.2
Description of chirality . . . . . . . . . . . . . . . . . . . . . . . . .
1.3
1.4
1.5
Nanotube twine . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.6
1.7
2.1
2.2
3.1
3.2
3.3
3.4
3.5
4.1
5.1
5.2
6.1
6.2
6.3
6.4
7.1
8.1
Cutoff function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
8.2
8.3
8.4
8.5
8.6
8.7
8.8
9.1
9.2
. . . . . . . . . . . . . . . . . . . . 35
. . . . . . . . . . . . . . . 96
9.3
Tube-tube orientation
. . . . . . . . . . . . . . . . . . . . . . . . . 111
9.4
9.5
9.6
9.7
9.8
9.9
. . . . . . . . . . . . . . . . 121
List of Tables
3.1
3.2
4.1
6.1
8.1
8.2
9.1
9.2
xi
1
Introduction
1.1
Fullerenes
The element carbon is the first element in group IV of the Periodic Table. It has
the ability to chemically bond with itself and other elements readily via orbital
hybridization. The whole of organic chemistry is dedicated to the study of the
millions of carbon-based molecules. There is a diverse variety of carbon solids
including a few crystalline allotropes and many amorphus and semi-crystalline
solids (such as lonsdaleite, and chaoite), but the most recognized forms are the
crystalline forms of diamond and graphite.
In the diamond crystal, each carbon atom is tetrahedrally bonded to its four
1
nearest neighbors via sp3 hybridization. The most stable allotrope of carbon is
graphite, which is nearly iso-energetic with diamond (E 0.014 eV/atom [172]).
Graphite has an interesting layered structure composed of planar sheets, called
graphene. In graphene, carbon atoms are trigonally bonded to their three nearest
neighbors by strong sp2 bonds, forming a hexagonal network (see left side of Figure
1.1). To build graphite, graphene sheets stack on top of one another. There is no
chemical bonding between layers; rather these layers are weakly bound by van der
Waals (vdW) forces [133]. This distinction between chemical and physical (vdW)
bonding will be addressed later in 2.3.
While these two allotropes of carbon have been known for quite some time, new
forms, termed fullerenes, have been identified. Fullerenes are hollow, all carbon
structures in the form of balls, tubes and other closed structures with walls that
are one-atomic layer thick. A notably stable soccer ball-like fullerene is C60 , commonly referred to as a bucky ball. It is named after Richard Buckminster Fuller,
an American author and architect whose geodesic dome had a very similar construction. Tubular fullerenes are referred to as carbon nanotubes (CNTs). These
elongated molecules with nanometer-size diameters have hemispherical capped
ends, and length-to-diameter ratios typically of about 1000, which makes them
truly one-dimensional structures. The wall of these tubes share the same hexagonal structure as that of individual graphene sheets, except that they are figuratively
rolled up as is shown in Figure 1.1. Figuratively is emphasized because it describes their structure, not their manufacture. This figurative connection is useful
also because many of the physical properties of nanotubes can be derived from
graphites properties. In addition, adjacent tubes interact via vdW forces rather
than chemical bond just as the adjacent sheets of graphene in the graphite crystal
Ignoring now the issue of the capped ends, tubes are classified based on their
curvature and chirality. In an unwrapped representation of the nanotube, the
chirality describes how the hexagonal structure of the graphene lattice is orientated
with respect to the axis of the tube. Figure 1.2 again shows a graphene lattice 1 .
The point in the top left is the origin. Picking this point up and curling the sheet
onto one of the labeled atoms that fall between the two vectors creates a unique
CNT. Due to the symmetry of the graphene lattice only those atoms between the
two vectors need to be considered for a unique tube. The vector pointing from
the origin to the atom picked to describe the CNT is known as the chiral vector
~ h ). This vector is conveniently defined in terms of the graphene lattice vectors
(C
labeled ~a and ~b, shown in Figure 1.2, as
Figure 1.2: The chirality is described by the nanotube wrapping angle of the
graphene sheet
1
The lattice vectors and unit cell shown Figure 1.2 are different but equivalent to those
described later in 3.1
~ h = n ~a + m ~b (n, m),
C
(1.1)
where n and m are integers. The angle this vector forms with respect to vector
~a is the chiral angle (). Tubes with = 0 are known as zigzag tubes and
are described by the chiral vector (n, 0). The moniker of zigzag comes from the
sawtooth like structure that is found around the circumference of the tube as can
be seen in Figure 1.3. The other extreme is = 30 , or armchair tubes. These
tubes are defined by the case when n = m, otherwise noted by (n, n). All other
tubes that fall between these limits are referred to as chiral tubes (n, m).
A, increasing with decreasing tube diameter [84]. MWCNTs with diameters larger
than 300
A in diameter, comprising up to 100 SWCNTs, are not uncommon [160].
As mentioned, the discovery of fullerenes is quite recent. Smalley et. al, in 1985
were the first to experimentally verify the existence of the C60 fullerene [99]. The
discovery of a MWCNT, via high-resolution transmission electron microscopy (HRTEM), goes to to Iijima in 1991 [74]. (This distinction is being debated as discussed in a recent editorial by Monthioux et al. [121] who points out that in 1952
a Russian journal published a TEM image of a MWCNT [141], and later in 1976
Endo et al. also published a TEM image of a MWCNT [129]. Neither of these articles made a clear distinction of the nature of their photographic results as Iijima
did.) The first evidence of a SWCNT was published two years later in two papers
[12, 76].
1.2
Production
While trace amounts of fullerenes are found in nature, i.e. in the soot of a candle
flame, hight quality CNTs are produced artificially. Production methods include
electric arc discharge between two graphite electrodes [37, 75], laser vaporization
of graphite [57], chemical vapor deposition (CVD) of hydrocarbon over metallic
catalysts seeds [28, 155], and other novel methods. All these methods have different yields, some preferentially produce SWCNTs (regarded as higher quality, as
opposed to MWCNTs). Differing lengths and diameters are present and the product usually needs a post-production purification step. Early production of CNTs
resulted in tubes that where quite short in length, < 10 m. Advanced methods
have been developed to produce longer CNTs, with reports of several centimeters
for some SWCNTs [179].
1.3
After Iijimas 1991 paper in Nature there was an explosion of interest in carbon
nanotube research that continues today. Figure 1.4 shows the nearly exponential
growth of published papers on nanotubes, with the year 2005 averaging 7 peerreviewed articles published a day.
2500
Number of papers
2000
1500
1000
500
1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005
Year of publication
Figure 1.4: Graph showing the rapid increase in the number of published papers on nanotubes. Data points are the number of papers returned from a
search of nanotube, from the extensive ISI Web of Knowledge database at
http://portal.isiknowledge.com
Why is there such an intense interest in the field of carbon nanotubes? Compared
to other nano-scale objects CNTS are well defined elongated structures that are
basically one-dimensional systems. There were early theoretical predictions of
unique properties which were proposed before they were measured. These predictions included exotic mechanical and electronic properties that have since spurred
much new theoretical and experimental work on these unique objects.
1.4
Properties
An early and fascinating prediction of CNT electronic properties was that depending on tube chirality and diameter, CNTs would be metallic, semiconducting, or
insulating [60, 149]. This theoretical prediction was experimentally verified a few
years later [158]. Metallic MWCNTs have been shown to have amazing electrical transmission properties, experimentally carrying 109 A/cm2 compared to
105 A/cm2 for some metals [171]. An excellent in-depth review of other unique
CNT electronic phenomena including field emission, optical, and contact properties is summarized in [3].
The mechanical properties of CNTs have attracted as much or more interest than
their electronic properties. The carbon-carbon sp2 bond found in graphite, and
hence in CNTs, is one of the strongest in nature. The in-plane elastic constant
for graphite is very stiff C11 = C22 1.06 TPa [2, 169]. Axially, CNTs share this
same stiffness as evident by their high Youngs modulus. Computational results
include a tight binding study that reports a Youngs modulus (E) of 1.22 TPa,
and a DFT study that reports E = 1.1 TPa (tight binding and DFT methods will
be discussed further in 2.4) [69].
It is possible to perform tensile loading experiments on CNTs between two atomic
force microscopy (AFM) tips, monitored in real time with electron microscopy
[15, 173, 174]. Measurements on SWCNTs have yielded Youngs moduli around
1 TPa, ultimate tensile strength (UTS) of 66 GPa and strain at failure (f )
of 2 19%2 . The average experimental Youngs moduli match quite well with
theoretical studies mentioned above. The CNT specimens that were tested were
quite short and were possibly defect-free. A detailed theoretical study of failure
of ideal CNTs shows that brittle or ductile failure (depending mostly on chirality)
occurs at strain levels above 17%. [34]. The introduction of defects into CNTs
can greatly reduce their their ultimate tensile strength [124].
Only when one compares these quoted CNT material properties to other materials
7
does one realize their unique mechanical qualities. One can compare them to steels
material properties (e.g. high strength tempered 4340 steel alloy, E = 12 TPa,
UTS = 1.7 GPa, f = 12% [22]). It is wiser to compare CNTs material properties to other fiber-like materials like carbon fiber rather than three-dimensional
solids such as steel. Carbon fibers are polycrystalline graphitic fibers used heavily
in polymer-matrix composites for aerospace and other high-performance applications. Typical values for the Youngs modulus, tensile strength and strain at
failure are E = 0.3-0.7 TPa, UTS = 1.5-4.8 GPa, f = 0.1-2.0 % [19, 22]. For
the high end of these quoted values carbon nanotubes are 1.4, 13.6, 9.5 times
greater respectively, and considering that carbon fibers are 1.5 times denser
than carbon nanotubes (cnt = 1.3 g/cm3 [19]), the specific strength and stiffness
properties of CNTs are even more impressive.
Currently the mechanical properties of individual carbon nanotubes do not fully
transfer to macroscopic bulk applications. It is possible to take the short length
sections of CNTs and spin them into a nanotube yarn as shown in Figure 1.5
A, and this yarn can be twisted or braided into rope like structures as shown in
Figure 1.5 B and C [176]. The mechanical properties of these braided fibers have
an ultimate tensile strength an order of magnitude less than the individual CNTs.
While stiff and strong axially, CNTs are quite compliant radially. They are flexible enough to be curled into tight circles (as seen in bucky paper discussed below), kinking if bent too much, but reversibly and elastically popping back when
straightened, seemingly without any defects [43, 169]. Compared to carbon fibers,
which are extremely brittle in bending, CNTs show remarkable bending resilience.
2
The discussion of material properties such as the ultimate tensile strength or the Youngs
modulus, necessitates a definition of cross sectional area. For carbon nanotubes the cross sectional area is not well defined. Since SWCNTs are an atomic layer thick and basically circular
in cross section the area can be defined as A = 2R R where R is the radius of the tube and R
is the wall thickness. Should this value be the diameter of the carbon nuclei, or some function
of the electronic cloud spatial extent? Traditionally the value of R = 3.4
A has been employed.
Figure 1.5: Carbon nanotubes twisted and braided into ropes, from [176]
1.5
CNT-based applications
the possibility of CNT bearings [29, 94, 176]. Both rotational and linear bearings
have been modeled and tested. Relative linear translation of an inner tube being
partially pulled out of its shell have been performed with an AFM tip attached to
the inner tube while the outer tube is held rigid. Repeated in and out motion of
the inner tube did not deform or induce defects in the structure as viewed under
an SEM. During this same experiment it was noted that the inner tube quickly
retracted inside its parent SWCNT when the tube broke free of the AFM tip.
This rapid retraction induced by vdW forces led to the idea of a MWCNT system
that could sustain mechanical oscillation of an inner tube in the GHz range. This
proposed system consists of a short capped carbon nanotube nested in a lengthier
counterpart [104, 177, 178].
1.6
The unique mechanical properties of carbon nanotubes indirectly lead to an unusual role of the forces between them. Like adjacent sheets of graphite, nanotubes
only occasionally form chemical bonds between one another and instead interact
via vdW forces. These supramolecular interactions are normally weak and, for
most molecular species, are easily overcome by thermal agitation (see the case of
proteins). However, they turn out to be significant in case of nanotubes, building
up very strong attractive forces over the extensive aligned contacts.
Because of their hollow structure and large aspect ratio, forces or interaction between them can cause deformations of bending, torsion, flattening polygonization.
A purified form of raw CNTs known as bucky paper is shown in Figure 1.6 (a).
The spaghetti-like agglomeration with highly curved structures is particularly intriguing in view of the strong covalent carbon-carbon bonding, which confers to
CNTs high mechanical stiffness. This characteristic feature is attributed not to
thermodynamic fluctuations, but rather to vdW forces between tubes, which prevent these loops from unfolding. Another outstanding example of this balance
10
of the strain energy from bending and the van der Waals attraction is shown in
Figure 1.5 (b) [113]. Here we see closed loops of SWCNT bundles turned onto
themselves and stuck in a stable loop. It was reported that there is a critical
radius of 0.03 m for SWCNT loops at room temperature. We will address this
phenomenon later with some sample calculations.
(a)
(b)
Figure 1.6: (a) Bucky paper composed of numerous CNT bundles, from [77] (b)
Stable curled CNT bundles
The supramolecular vdW forces in CNT systems also cause SWCNTs to selforganize in ropes or bundles [80, 160]. The tubes in a bundle organize into a
triangular lattice (see Figure 1.7) with an approximate wall to wall distance of 3
A [160]. Interestingly these bundles seem to grow together with all tubes being
essentially the same length, and it has been noted that all the tubes constituting
a bundle have essentially the same diameter and only one or two chiralities [28].
Another interesting manifestation of vdW interactions is the experimental finding
that large diameter SWCNTs can be found collapsed, losing their circular cross
section and having opposite inner faces sticking together [125]. For two SWCNTs
larger than 20
A in diameter there is a flattening of the once circular cross section
along the edge of contact between one another as predicted by an empirical model
[145], or on a flat surface such as graphite [70]. For nanotube bundles with tube
diameters larger than 20
A the circular cross section can even become hexagonal
11
Figure 1.7: Typical CNT bundle showing cross sectional packing pattern [80]
1.7
Motivation
Nanometer scale devices, while currently not outside the realm of manipulation or
measurement, pose unique hurdles to the fuller understanding or their functionality due simply to their diminutive size. Computational models of nano-devices
can be utilized to perform digital experiments that offer further insight or motivation for experimental work. The atomistic simulation of carbon nanotubes is
quite common at many levels of theory, but almost all of these models, even some
of the most advanced, lack the vdW attraction. We have seen the importance of
vdW forces in many of the examples discussed above. In order to model and gain
insight for example into the possibility of efficiently gluing nanotubes by weak
supramolecular forces in a self-assembly process or the performance of MWCNT
bearings and oscillators it is important to have a detailed understanding of these
forces and an accurate model to describe them.
As nanomechanical systems typically involve a substantial number of atoms, it is
computationally convenient to adopt a classical description of the covalent bonding. However, there are several reasons to favor a quantum mechanical treatment:
12
binding plus dispersion model is presented in the last chapters along with a study
of intertube interactions in MWCNTs and CNT bundles, and is finished with an
implementation of tight-binding plus dispersion molecular dynamics.
14
All things
are made of
2
Nature of bonding and mathematical
models
15
2.1
Bonding
A bond in the everyday macroscopic sense is something that fasten things together. It is quite the same on the atomistic scale and can be described as an
energetically favorable state. This state is described by the stable and finite separation of at least two atoms under the mutual influence of one another. In order
to understand some fundamental aspects of a bond, consider the simple example
of two interacting rare gas atoms. The solid line plotted in Figure 2.1 shows the
generic form of the interaction energy of this two-atom system as a function of
internuclear separation. The interaction energy is defined as the energy in excess
or recess of this two-atom system with respect to the energy of the two atoms isolated from each other ( i.e. the energy of isolated atoms is defined as zero). There
2
1.5
Repulsion
1
Energy
0.5
r
0
0.5
Ecoh
Attraction
1.5
2
0.4
0.5
0.6
0.7
0.8
0.9
Internuclear Separation
Figure 2.1: Characteristic energy vs. internuclear separation for a simple rare gas
dimer
is a distinct minima in this plot. The energy at this point is lower than that
of the atoms at infinite separation and corresponds to a energetically favorable
state, which is a bond by our previous definition. The magnitude of the energy
at this minima (Ecoh ) is the cohesive energy, and can be thought of as the energy
required to break this bond. The distance r at which this minima occurs is simply
the equilibrium bond length.
The nature of a bond intrinsically contains two complementary components: an
16
2.2
Atomistic Forces
The starting point in a discussion of relevant forces at the atomic scale is a list
of the fundamental forces as given by the standard model: strong nuclear, weak
nuclear, gravitation, and electromagnetic forces. To what extent do these forces
play a role in bonding between atoms? The strong nuclear force is responsible for
the cohesion of the nucleus and has an extremely short range, as does the weak
nuclear interaction. Both of these forces act on the scale of 103
A, which is a few
orders of magnitude less than even one of the smallest interatomic bond length of
0.8
A for H2 [115]. These forces play no role in the bonding between atoms.
Gravitation on the other hand acts over longer ranges than the strong and weak
nuclear force. Understood in Newtons classical sense, the gravitational interaction
energy between bodies separated by a distance r is
Ug (r) =
Gm1 m2
.
r
17
(2.1)
The gravitational energy is quite small on the atomistic scale. Take for example
two argon atoms (6.63 1026 Kg) separated by 5
A, which is approximately the
nearest neighbor distance in solid argon [4]. At this distance the gravitational
potential energy between the two atoms is 5.8 1052 J, whereas experimentally
the potential energy at this same distance is 5.9 1022 J [89]. The gravitational potential energy is thirty orders of magnitude less than the experimental
measured energy. By inductive reasoning we are left with the electromagnetic
force as the dominant force in atomic bonding [89]. The manner in which the
numerous charged particles that constitute atoms interact electromagnetically is
diverse and will be addressed in the following section.
2.3
Electromagnetic Cohesion
Atomic cohesion due to the interplay of electronic charge can generally, but not
exactly, be classified into chemical and physical bonding. The general length scales
for these bonds are 1.5-3.0
A, and 3.0-5.0
A respectively.
The chemical, or covalent bond, is quantum mechanical in nature and is characterized by a directional (i.e. anisotropic) short range bond that is immensely strong.
In this chemical bond there is an overlap of the electronic clouds and sharing of
electrons. Details on the quantum mechanics and computational modeling of covalent bonds are described in later in Chapter 5. The rest of this present chapter
is dedicated to further explanation of the physical bond.
The physical bond, while still electromagnetic in nature, is a weaker, generally
isotropic bond that equilibrates at a greater distance than covalent bonds and
generally does not contain any charge overlap. Before discussing more about
the physical bond, the repulsive contribution to bonding for both chemical and
physical bonding is addressed.
18
2.3.1
Repulsive force
For neutral species the origin of the repulsive force that occurs at small distances
is based on the Pauli exclusion principle which states that no two fermions can
occupy precisely the same quantum state [4]. Phenomenologically explained there
is a huge energy penalty when two particles are very close to one another. As two
atoms are brought together their electron clouds tend to diminish along the internuclear axis and the electronic screening of the nucleus that was supported before
degrades and a repulsive Coulomb force between the exposed nuclei pushes away.
This is a very short-range interaction and energetically decays approximately as
1/r 12 . Of course another form of repulsion is simple Coulomb repulsion between
like charged particles that interact over long distances according the inverse square
law Fc q1 q2 /r 2 .
2.3.2
The attractive component of a physical bond can be classified into the attraction
due to Coulomb interaction of ions and that between neutral species. The physical attraction between neutral species is more nuanced than the simple Coulomb
repulsion/attraction. Current understanding of this physical attraction has its
historical roots in the study of gasses.
2.3.3
The ideal gas law, used to describe the properties of gasses, fails when the gas is
not sufficiently dilute. Gas particles in a dense gas are more concentrated, and
hence closer together. In denser gasses the gaseous constituents spend more time
in the range of physical bonds, and deviations from the ideal gas law occur due to
this interaction. In this regime the effects of physical bonding must be considered
to properly model the gas. In 1873 Johannes van der Waals proposed a new
equation of state that captures the ideal gas deviations attributed to the physical
19
a
P + (V b) = kB T.
V
(2.2)
In this equation P is the pressure, V is the molar volume, kB and T are Boltzmanns constant and temperature respectively. These four quantities are found
in the ideal gas law. The van der Waals equation has two parameters in addition
to these: a and b. Where a is a measure of the physical attraction between the
molecular constituents, and b is a measure of the molecular size. The attractive
physical force described by a has been given the general moniker van der Waals
force. The physical vdW force between neutral atoms/molecules is classifiable
into three distinct groups with equally distinct mechanistic origins.
2.3.4
The three effects that are collectively referred to as van der Waals forces can
be classified as orientational (Vo ), inductive (Vi ), and dispersive (Vd ) interactions
[116]. Their sum is referred to as the total van der Waals energy as shown in equation (2.3). In their simplest incarnation are expressed in terms of the internuclear
separation r of two atoms/molecules
(2.3)
and the total vdW energy for a system of N particles is expressed as a pairwise
summation over all constituents
UvdW =
1X
VvdW (rij ),
2 i,j
(2.4)
where rij is the distance between atoms i and j and the factor 1/2 accounts for
20
double counting.
The orientational term (Vo ) arises if both molecules have a permanent dipole
. Dipoles can favorably rotate into a head to tail state, and come together to
lower their interaction energy. Figure 2.2 (A) schematically shows two dipoles
favorably orientated. In the gas phase these molecules are rotating, and at high
temperature they are spinning so rapidly that the dipole effectively disappears
and the interaction energy will tend toward zero. In 1912 W.H. Keesom showed
that at finite temperature, since the probability of a given orientation of dipoles
can be determined by a Boltzmann factor, one can integrate over all orientations
and come to [116]
Vo (r) =
221 22
1
.
r 6 (4o )2 (3kB T )
(2.5)
1 and 2 are the magnitudes of the dipoles on atoms 1 and 2 that are separated
by a distance r, o is the permittivity of free space, and kB and T are Boltzmanns
constant and temperature respectively. We see that the dependance is 1/r 6 and in
the limit of high temperature the orientational dipole-dipole energy tends toward
zero.
A
+
C
+
Vi (r) =
1
1
(21 2 22 1 ).
6
r (4o)2
(2.6)
Finally, the motivation for a third vdW term in addition to the orientation and
inductive effects was to explain the presence of non-ideal gas behavior in non-polar
gasses. The dispersion effect, the last of the three to be elucidated, turned out to
be a completely quantum mechanical phenomenon that is more or less correctly
explained by being a collective interaction of instantaneous dipoles that are mutually induced. Further details on the quantum mechanics of this phenomenon will
be presented in Chapter 6. We will see there that the general relationship and
dominant term for this dispersive interaction energy takes the form
Vd (r)
C
,
r6
(2.7)
22
2.4
Mathematical models
We have introduced chemical and physical bonds and now address the methods
of modeling them as they pertain to a set of atoms. For the general purpose
of atomistic models Schrodingers wave formulation of non-relativistic quantum
mechanics is a sufficient level of theory to capture almost all physical properties.
In 1928 Paul Dirac said of this equation
The underlying physical laws necessary for the mathematical theory of a large part of physics and the whole of chemistry are thus
completely known, and the difficulty is only that the exact application
of these laws leads to equations much too complicated to be soluble.
2.4.1
Empirical
(r) = 4
r
r
r
r
(2.8)
where and are constants, as are C6 and C12 in the second form. These constants
have different values for different atoms.
Empirical potentials are computationally tractable. The functions can be readily
calculated and, for the same cpu time, a much larger system can be considered than
with higher order formulations of energy calculation. The number one drawback
is the lack of predictive power.
2.4.2
Semi-empirical
2.4.3
First principles
First principles, or ab initio calculations represent the pinnacle of electronic structure calculations. Starting with the fundamental constants and Schrodingers
equation as a postulate these methods proceed to describe the nature of atomistic
systems to a degree that is almost irrefutable. Computational resources and methods have come a long way since the time of Paul Diracs quote, and some of these
complicated equations referred to have in fact become readily soluble. But the
shear complexity of the many electron problem governed by Schrodingers equation remains burdensome for large atoms or several 10s of main group elements.
While these methods are rigorous in describing physical effects they are computationally expensive. The methods applied in solving Schrodingers equation break
into two main types: Hartree-Fock (HF) based methods and density functional
theory (DFT) methods. While both make approximations to make calculations
possible, they represent the best available methods for atomistic modeling. As
we will see latter the most crucial approximations in these methods entail the
electron-electron interactions. These interactions are the basis of van der Waals
attraction and the more common of these first principles methods fail to capture
this effect as further discussed in 3.6.
25
3
Graphite structure and interlayer
properties
26
3.1
Graphene
As mentioned earlier the graphene plane consists of carbon atoms trigonally bonded
to neighbors in a sp2 hybrid network of covalent bonds. The in-plane hybridization results in bonds. Normal to the sp2 plane are half filled un-hybridized pz
orbitals, that form weak bonds between neighboring sites in-plane. The contribution of these bonds to the in-plane binding energy is very small and are
usually referred to as non-bonding [83]. The bonds in graphene is one of the
strongest bonds known [148]. Figure 3.1 (a) shows a more detailed structure of
the graphene hexagonal lattice shown in the introduction. The solid light lines
between atoms in this figure represent the chemical bonds. For both graphite and
carbon nanotubes the carbon-carbon bond length is acc = 1.42
A [7, 51]. The
dark arrows denote the primitive lattice vectors and together with the other two
heavy lines form the unit cell. The primitive lattice vectors, ~a1 and ~a2 , defined in
terms of the cartesian coordinate system are given by
1
a
~a1 =
3
and
1
a
~a2 =
3
2
0
(3.1)
3 acc = 2.46
A. Each primitive unit cell contains two atoms labeled 1 and
2 shown in Figure 3.1 (a). The position of any of the atoms of type 1 or 2 can
be referenced with respect to the tip of the primitive lattice vectors by their basis
~ 1 = (1/3)~a1 + (2/3)~a2 and
vectors. The basis vectors for atoms 1 and 2 are, B
~ 2 = (2/3)~a1 + (1/3)~a2 , and the positions of an arbitrary atom of type 1 or 2 is
B
~ 1 = n1~a1 + n2~a2 + B
~ 1 , and R
~ 2 = n1~a1 + n2~a2 + B
~ 2 , where n1 , n2 Z.
R
The structure of graphene sheets is known through direct measurement via numerous experimental methods. One of the more illustrative measurements is an
atomic force microscope (AFM) scan of a graphite surface, as shown in figure 3.1
27
y
x
acc
~a1
2
1
~a2
(a)
(b)
Figure 3.1: (a) 2 dimensional graphene layer. Heavy dark line indicates primitive
unit cell composed of two atoms 1 and 2. (b) AFM scan of graphite surface with
overlay of a orientation of a single hexagonal ring, from [67]
(b). The AFM measures force between the AFM tip and the sample. High force
feedback indicates a localization of electrons, and hence the force contours can
approximately be interpreted as electron densities. With this interpretation the
figure shows a strong localization of electrons along bonds and nearly no electrons
in the open areas of the hexagons. This is in direct accordance with the nature of
covalent bonding and the sharing of electrons along the bond direction.
3.2
B
Zeq
Simple Hexagonal
Bernal
Figure 3.2: Graphite stacking patterns. AAA on the left and ABA on the right.
Dashed lines denote atoms that are aligned with the z-axis.
the other half are situated such that directly above and below them is the open
center of a hexagon. If one starts with an AAA stacking pattern and translates
every other layer by twice the bond length along one of the carbon-carbon bond
directions the result is an ABA stacking pattern. The ABA stacking pattern is
sometimes referred to as the Bernal form, the namesake of its founder [11]. A
third stacking pattern, which is not illustrated here, is the ABC or rhombohedral
stacking. In ABC stacking half of the carbon atoms are directly below a open area
in the above hexagon and directly above a carbon, and vice versa for the other
half of the carbon atoms. Of course there are graphitic structures in which there
is no apparent stacking pattern, where the various layers are randomly rotated or
offset. These disorder stacking patterns are referred to in literature as turbostatic
[8, 47].
Graphite found in nature is usually graphitic carbon. Graphitic carbon is characterized by small domains of graphite that are lumped together in random orientations. Some samples of large natural graphite crystals have been collected and
tested to find that approximately 90 percent of the graphene layers are stacked
in the ABA pattern, and mixed in with it is the less common rhombohedral or
29
ABC stacking [23]. A very pure form of graphite termed highly orientated pyrolytic graphite (HOPG) is synthetically manufactured. These graphite crystals
are composed of nearly 100 % ABA stacked planes [83].
For HOPG the interlayer spacing is well set at Zeq = 3.354
A for room temperature
[7, 8, 51], and shrinks slightly at low temperature (3.336
A at 4 K) [7]. For other
less refined natural graphite and disordered turbostatic graphite the interlayer
A [24, 47].
spacing increases slightly to Zeq = 3.43
3.2.1
The stacking patterns discussed in the previous section highlight that the ABA
stacking seems to be the most energetically favorable. The AAA stacking on the
other hand has not been experimentally identified and is the least energetically
favorable stacking (as shown by DFT-LDA calculations [24]). The reason for
this can be attributed mainly to the nature of the pz orbitals sticking out of
the graphene planes. In the AAA stacking all of these pz orbitals would be at
a maximum amount of overlap possible. Any other stacking pattern than AAA
would have less pz overlap and would be more energetically favorable. Another
AFM scan of a graphite surface, this time shown from an oblique angle, is shown
in Figure 3.3. The discrete nature of the pz orbitals is evident and will be referred
to as a corrugated surface.
Figure 3.3: AFM scan of graphite showing Pz orbital corrugation, from [67]
30
3.2.2
The graphite unit cell is similar to that of graphene but with a third primitive
lattice vector given by
~a3 = 2Zeq
0 .
1
(3.2)
The ABA graphite unit cell has four atoms. The first two share the same defini~ 3 = (1/3)~a1 +
tion as those of graphene and the other two have basis vectors B
~ 4 = ~a1 + (1/2)~a3 . The reciprocal lattice vectors of the
(2/3)~a2 + (1/2)~a3 and B
direct hexagonal lattice are
~b1 = 2 1
a
3
0
~b2 = 2
1
a
3
0
0
~b3 = 0 ,
Zeq
1
(3.3)
where the primitive and reciprocal lattice vectors must satisfy ~bi ~aj = ij [4]. The
first Brillouin zone is given by the Wigner-Seitz unit cell of the reciprocal lattice
and is shown in Figure 3.4 (b) with some high symmetry points labeled.
A graphene sheet by itself is a zero gap semiconductor as can be seen by the
touching of the bands at K [165]. When the graphene sheets are brought together
the interaction between layers is such that these bands split and graphite becomes
a semi-metal [165].
3.3
Due to the dichotomy in the magnitude of intra and interlayer bonding (i.e. very
strong and very weak) it makes sense to talk about the two separately. Linus
Pauling said of graphite, each of the layers is a giant molecule, and the superim31
(a)
(b)
Figure 3.4: (a) 2 dimensional graphene layer. Heavy dark line indicates primitive
unit cell composed of two atoms 1 and 2. (b) AFM scan of graphite surface with
overlay of a orientation of a single hexagonal ring, from [67]
posed layer molecules are held together only by weak van der Waals forces [133].
If we restrict ourselves to looking only at the interlayer properties we can gain
some insight into the dispersion energy between layers. By treating the layers as
extended molecules the interaction energy between them can be plotted versus the
interlayer spacing. This energy curves looks very similar in form to the diatomic
energy curve shown early in Figure 2.1. In these type of plots, which we will see
more of, the energy is referred to as the interlayer energy per atom and is dee
fined as E(Z)
= [E(Z) E()]/Natom , where the tilde denotes interlayer energy.
The location of the minima of this curve corresponds to the equilibrium interlayer
energy to refer to the energy to disassociate all carbon atoms in graphite rather
than just the layers.
32
3.3.1
2
A ) the
density of carbon atoms in a graphene plane ( = 4/ 3a2 = 2.46 atoms/
exfoliation energy per atom via the heat of wetting method is 42.5 meV/atom.
under ultra high vacuum [175]. The chemical structure of PAHs is very similar to
that graphite, sharing similar orbital hybridization and bond lengths. Large PAHs
are simply small flecks of graphite with hydrogen bonds on the edges. This group
utilized an Arrhenius rate equation to study the activation energies were found
for successively larger PAHs based on their experimental data. In their analysis
the effects of the hydrogen on the PAHs was subtracted and a generalization of
52 meV/atom was found to be the binding exfoliation energy.
Method
Heat of wetting
Tube collapse
Thermal desorption
Girifalco [54]
Benedict [9]
Zacharia [175]
3.4
Z-axis compressibility
The z-axis compressibility (kz ) is related to the c33 elastic constant via [161]
c33 = kz1 .
(3.4)
This property is described by the curvature of interlayer energy curve around the
34
equilibrium separation. The formal definition of this property and its calculation
are shown later in 8.3.3. Experimental results on HOPG via ultrasonic testing
have found the compressibility to be 2.74(0.0075)1012 cm2 dyne1 at standard
temperature and pressure [16, 128]. The compressibility is a strong function of
pressure and temperature [50]. For example at a pressure of 20 kbar and standard
temperature it decreases to 1.85 1012 cm2 dyne1 . At standard pressure and
low temperature (4 K) it decreases to 2.46 1012 cm2 dyne1 .
3.5
Vibrational properties
Two vibrational modes in graphite highlight the dichotomy of intra and interlayer
binding and the latter gives us another more insight into the nature of interlayer
properties. A representation of the atomic motion for two active Raman modes
in question are shown in Figure 3.5. In plane the strong covalent bonds lead to
a high frequency E2g (2) mode that has a frequency of 46.3 THz, shown in Figure
3.5 (A) [162].
(A)
(B)
Figure 3.5: Raman active modes: E2g (1), and E2g (2). The relative motion shown
here is adopted from [127]
The more interesting Raman active mode to us, is the rigid interlayer shearing
mode E2g (1). This mode is dominated by the weaker dispersion interactions and
35
overlap of Pz orbitals. Figure 3.5 (B) illustrates this motion. This mode is obviously of a much lower frequency than the latter due to the weaker physical bonding
involved and has been repeatedly measured at 42 cm1 = 1.26 THz 1 [2, 62, 119].
The E2g (1) shear mode is governed by the curvature of the energy landscape described by the relative translation of every other graphene sheet relative to the
others.
3.6
3.6.1
The defining idea of Hartree-Fock methods lies treating the electron-electron interaction in a mean field manner. A particular electron sees the average field of
the other electrons but is unaware of their instantaneous positions. In real systems
electrons are instantaneously correlated with each other, leading to a lowering of
total energy. The difference between the HF-SCF energy and the true energy
E o , is known as the correlation energy.
Ecor = EHF E o .
(3.5)
36
term and thus do not model dispersion forces [5]. Corrections to HF theory to
account for electron correlations include Moller-Plesset many-body perturbation
theory (MP2/MP4), coupled cluster (CC), and configuration interaction methods
(CI) [64]. These HF corrections are very expensive and scale unfavorably. The
application of corrected HF-SCF theory to graphite is restricted due to issues of
imposing periodic boundary conditions in HF methods [?]. The closest simulation
to graphite using corrected HF methods is the calculation of the binding energy
of large PAH molecules due to vdW interactions [126].
3.6.2
known.
Unfortunately the exact universal form of the XC function is not known. Much
research on its form, has been done since the mid-sixties, and many forms have
been proposed. Some forms are extremely accurate for certain states of condensed
matter including the local density approximation (LDA), and varieties of the gradient corrected methods (GGA). Both of these XC functions, and others used
fail to properly model long range interaction in rare gasses [135]. The reason for
this failure is in the fact that most XC functions are dependant on only the local density of electrons and are not a function of electron densities at appreciable
distances . Graphite is a system that has a electron distribution that is sparse compared to most condensed matter but much less so than gasses. A DFT treatment
of graphite shows some strengths but also some failings. Table 3.2 summarizes
results from a number of DFT calculations of graphite.
Method
Ref.
LDA
LDA
LDA
LDA
GGA
LDA-nl
Exp (4 K)
Exp (300 K)
[31]
[24]
[151]
[161]
[63]
[146]
a
(
A)
Zeq
(
A)
kz
12 cm2
(10 dyne )
Exf
(meV/atom)
Eco
(eV/atom)
0.97
...
4.11
7.69
0.01
7.69
2.46b
2.74c
110
20
25
30
3
24
...
Table 3.1
...
...
8.80
8.60
7.87
...
...
...
...
2.8
2.45
3.3
2.451 3.36
2.453 3.44
2.461
4.5
2.47
3.76
2.46a 3.336a
2.46a 3.354a
Table 3.2: Summary of DFT calculations of graphite including the lattice constant, equilibrium spacing, z-axis compressibility, exfoliation energy and in-plane
cohesive energy. a Ref. [7], b Ref. [50], c Ref. [13]
All methods seem to treat the in plane properties quite well. For example the lattice constant a is in very good accordance to experiment and the various in-plane
cohesive energies (Eco ) agree quite well even between XC functions but unfortunately doesnt have an experimental measurement to compare to. These methods
seem to have trouble with interlayer properties. The GGA XC shows almost zero
38
interlayer binding energy and does not account for even some of the dispersion
energy between layers. Its lack of interlayer binding has been noted in literature
[52]. This failing of the GGA XC function probably coincides with the paucity
of published information on its application to graphite. The LDA methods show
some dispersive binding energy that is in the neighborhood of experimental values.
This dispersive energy rapidly goes to zero beyond Zeq and is generally exponential rather than 1/r 6 [151]. The LDA methods also generally show the equilibrium
separation around the experimental value, though there are some outliers. All
examples shown do not accurately calculate the z-axis compressibility.
More recently there have been attempts to rectify the failing of XC functions for
long range interactions [91, 147]. An application of the method in [147] has been
applied to graphite and is shown in the table listed under LDA+nl, where the
nl stands for the non-local correction to the LDA [146]. While this represents
a more physical treatment by including non-local electron density information in
the XC function it still preforms poorly in the description of interlayer properties.
One interesting point that may be fortuitus, is the energy difference between
between AAA and ABA stackings at Zeq (referred to as EABAAAA ) as calculated
by different DFT methods. Charlier reported 17 meV/atom under LDA [24], and
Kolmogorov reported 15 meV/atom for both the LDA and GGA approximations
[96]. This point will be referenced a few times in later sections.
3.7
Summary
In this chapter we have seen the some aspects of interlayer graphite properties that
are known to a high degree from experiments including the equilibrium separation,
z-axis compressibility, and the E2g (1) mode. The exfoliation energy measured
in three experiments gives a clustering around 45 meV/atom but a much wider
margin experimental disagreement than the three previous properties. We have
also seen the general disagreement of first principles calculations with respect to
39
40
4
Existing models for dispersion
interactions in graphite
This chapter reviews previous work in the modeling of graphite interlayer properties and ends with the motivation for what will be done to address shortcomings
of the various models.
4.1
Empirical
The empirical models in this section only address the calculation of interlayer
properties. There are also intralayer empirical models for carbon ([17]) that are
sometimes used in conjunction with these for a fuller description of graphite.
41
The most quoted empirical model is the Lennard-Jones model proposed by Girifalco et al. [54, 53]. Recall that there are two constants in the LJ potential, given
in equation (2.8). In Girifalcos model the attractive C6 constant was calculated
with the Kirkwood method (explained in 6.2.6.2) from experimental data given
6
12
C6 = 15.4 eV
A and C12 = 22.5 103 eV
A .
We have implemented these two empirical models to study the interlayer properties
they produce as a function of the interlayer separation, and are shown in Figure
4.1, for both the ABA and AAA stacking. The details of this implementation
(unit cell, periodic boundary conditions etc.) will be discussed later.
30
giri AAA
giri ABA
ulbr AAA
ulbr ABA
32
34
36
38
40
42
44
46
48
50
3.1
3.2
3.3
3.4
3.5
3.6
3.7
Figure 4.1: Interlayer energy scans for the empirical models of Girifalco (giri) and
Ulbricht (ulbr) for ABA and AAA stacking of graphite around Zeq
42
Table 4.1 summarizes the results from this evaluation, which includes the energy
difference in the stackings EABAAAA , the exfoliation energy(Exf ), the E2g (1)
phonon frequency, and the z-axis compressibility (kz ) of graphite in the ABA
stacking (details of calculating the latter two quantities are addressed in 8.3.2
and 8.3.3 respectively)
Model
Girifalco
Ulbricht
Experiment
Exf
[meV/atom]
kz
[cm dyne1 ] 1012
43.41
46.6
30-50
3.18
3.17
2.74
0.9
1.1
17-23
Both of these models reproduce the correct interlayer equilibrium distance (they
were fitted to do so), and predict exfoliation energies (Exf ) that are in the range
of experimental values summarized in Table 3.1. For both models there is about a
16 % error with respect to the experimental z-axis compressibility. The percent
error in the calculated E2g (1) mode frequency is much larger, 72% for both
models. The large discrepancy in this mode comes from modeling the repulsive
interaction isotropically. As was discussed in 3.2.1 the Pz orbitals that stick out
of the graphene plane play a major role in the repulsive overlap between planes
and are very anisotropic in nature, as was illustrated in the AFM scan in Figure
3.3. This deficiency in modeling corrugation can also be seen in the relatively
small energy difference between the AAA and ABA stacking as compared to the
approximate DFT-LDA numbers. If this energy difference between stacking tends
to zero the E2g (1) mode will become non-existent in the model.
There is a third empirical model which tackles this shortcoming. The registry
dependant empirical model of Kolmogorov [96], rectifies this problem by intro1
There is a slight discrepancy ( 2.1% difference) between the exfoliation energy quoted in
[54] (42.5 meV/atom) and what I calculated here. They utilized an analytical lattice summation
technique whereas I am using a small unit cell with periodic boundary conditions
43
ducing an anisotropic repulsion term that mimics the nature of the Pz orbital.
The attractive dispersive contribution is the same as previous models (C6 /r 6 ).
The model contains 7 free parameters and is fitted to Zeq , Exf = 48 meV/atom,
kz , and the energy difference between ABA/AAA stacking from a DFT-LDA anal6 . As to whether this
ysis. The value of C6 in the model is given as 10.24 eV A
coefficient was fitted along with the 6 other parameters in the repulsive term or
calculated separately is not made clear. The E2g (1) mode was not in the database
for fitting and its value as calculated by the fitted model is not reported. It
should be in the neighborhood of the experimental value because the model is
fitted to reproduce the energy difference between ABA and AAA stacking. We
have discussed the limited knowledge of the exfoliation energy, and the unknowns
associated with the ab-initio based interalyer energy calculations (both of which
this model used in fitting), so this model has to be viewed as a step in the right
direction in modeling corrugation, but may be limited in its predictive powers.
4.2
(4.1)
where E f p/se is the energy computed from either a first principles (f p) or semiempirical (se) method. In these models it is assumed that the short range behavior
in the region of atomic overlap is modeled accurately, whereas the long range
dispersion is either completely missing (as in HF and TB methods) or only partially
accounted for (DFT). The form of E dispersion is generally a pairwise C6 /r 6 term
44
example of a tight binding plus dispersion model is that of Palser [131]. In a similar
fashion to Elstner, Palser added a damped dispersion energy to a non-orthogonal
tight-binding model. This model was focused specifically on modeling graphitic
structures. Palsers model has three free parameters that were fit to reproduce
Zeq , Girifalcos experimental interlayer energy, and the DFT based EABA/AAA
stacking difference of 17 meV/atom from [24]. The dispersion coefficient in this
6 . The z-axis compressibility that was not fitted to, is 9.4
fitting came to 11 meV A
% different than the experimental value. Palser did not report on a calculation of
E2g (1). Kwon and Tomanek have a similar model fitted to the same ABA graphite
properties as Palser [100].
4.3
These tight-binding plus dispersion methods unlike the empirical models discussed
implicitly model the corrugation of the Pz orbitals overlapping at close distances.
More importantly from a standpoint of modeling nanomechanical systems is the
quantum mechanical treatment of the inplane bonding that is not addressed with
the empirical models. This fact allows one to apply molecular dynamics to study
the evolution of interacting graphitic structures under the influence of repulsive
overlap forces and the attractive dispersion forces, all the while having access to
the electronic structure which is important in modeling NEMs. The computational
efficiency of tight-binding plus dispersion as compared to the DFT plus dispersion
of Hasegawa, allows a much greater freedom in the simulations of larger systems
over longer time scales. For these reasons the tight-binding plus dispersion method
is deemed a valid avenue for modeling the subjects proposed in the introduction,
and will be developed here.
The following three chapters lay the theoretical ground work for a physically motivated and accurate model of graphite based on this tight-binding plus dispersion
model. The dispersion force and covalent bonding that have been mentioned so
46
47
5
Modeling electronic and repulsive
interaction with a tight binding formalism
5.1
n
X
ci i .
(5.1)
The refers to the particular state ( = 1 is the ground state), c is the expansion
coefficient to be determined, and is the atomic orbitals for the n orbitals. These
atomic orbitals are known as the basis functions and usually are not the exact
atomic orbitals. Rather they usually only have atomic like character known from
the solution of the hydrogen atom e.g. Slater type orbitals (STOs).
5.2
Schr
odinger equation in LCAO approximation
The governing equation for the wavefunction in a molecule or solid is given by the
Schrodinger equation
= .
H
(5.2)
is the quantum mechanical Hamiltonian for the molecule, and is the energy
H
of the state . Within a LCAO framework the solving of this equation is equivalent to finding the set of expansion coefficients (ci ) for each state defined in
equation (5.1) that minimizes the energy that particular state. To do this the
variational method (Rayleigh-Ritz method) is applied in the minimization of the
energy functional given by
49
i
h |H|
.
h | i
(5.3)
Bracket notation as discussed in section B.2 is employed in this definition. Inserting the LCAO expansion for into equation (5.3) and applying the variational
principle with
= 0,
ci
(5.4)
Hc = Sc.
(5.5)
j i and S is the
H is the Hamiltonian matrix with elements given by Hij = hi|H|
overlap matrix with elements defined by Sij = hi|j i, and c is a vector containing
the expansion coefficients. If the basis is chosen such that all the basis functions
|i i are orthonormal, the overlap matrix will be equal to the identity matrix and
equation (5.5) will reduce to a regular eigenvalue problem
Hc = c.
(5.6)
5.3
When solving the Schrodinger equation via the variational method with a LCAO
the size of the basis must obviously be finite due to limited computational re50
sources. One ideally wants to include as few atomic orbitals as necessary to increase computational speed, but enough to capture the essence of the system. The
atomic orbitals included define the basis of the system, for covalent solids typically
the S, Px , Py , and Pz are sufficient to describe the ground state properties.
When solving a problem with atoms dispersed in space, each basis is centered at
the nucleus. The value of the basis wave function at an arbitrary point ~r in space
depends on the vector distance between this point and the nucleus at which it is
~ where R
~ is the
based. The vector connecting these two points is simply ~r R,
position vector of the nucleus taken from the same global origin as ~r. The notation
used is
~
(~r R).
(5.7)
Where labels the orbital type, (S, Px , Py , Pz ) in our 4 orbital basis. We can
rewrite our one-electron LCAO wave function now in the form
(~r) =
X
i,
~ i ).
ci (~r R
(5.8)
is an index describing the eigenstate, the sum on i goes over the n atoms in the
system, and the sum on over the basis.
We can also rewrite the matrix elements for the Hamiltonian and overlap matrices
that make our generalized eigenvalue problem from equation (5.5). Instead of Hij
we now adopt the notation Hij . Meaning the matrix element is an integral
between orbitals of type and on atoms i and j. Using our new notation these
are
Hij
j i =
= hi |H|
~ i) H
j (~r R
~ j ) d,
i (~r R
51
(5.9)
Sij = hi |j i =
5.4
~ i ) j (~r R
~ j ) d.
i (~r R
(5.10)
Periodic solids
~ = eikR~ (~r).
(~r + R)
(5.11)
~ i ) = N 1
2
ki (~r R
X
~
R
i
~
~ i ),
eikRi i (~r R
(5.12)
~ i represents the set of all translation vectors for atom i. i.e. all integer
where R
permutations for the ws given above. ki will be now referred to as a crystallike wavefunction. Using this as our basis for periodic solids we can write our
Hamiltonian and overlap matrices in terms of wavevector k as:
~ i )| H
|k (~r R
~ j )i ,
Hkij = hki (~r R
j
Hkij
=N
~
~ ,R
R
i
j
~ R
~ )
ik(R
i
j
j (r Rj )d.
i (r Ri )H
52
(5.13)
(5.14)
Hkij =
eikRji Hij .
(5.15)
~
R
j
Skij =
eikRji Sij .
(5.16)
~
R
j
At this point we defined the form of the integrals for the Hamiltonian and overlap
matrix elements given in real space by equations (5.9), and (5.10), and in k-space
by equations (5.15) and (5.16). Now we just need to evaluate them. These integrals
turn out to be very difficult to evaluate, but by applying some approximations to
the full form of the matrix elements the problem can be reduced to a manageable
form. The next few section discuss the main approximations of tight-binding and
introduce the parameters that make this method semi-empirical.
5.5
V (r) =
N
X
i
Vi (r Ri ).
(5.17)
The potential Vi is located on atom i. These potentials are assumed to be spherically symmetric. Plugging this potential into the Hamiltonian and then into the
53
Hij =
i (r
1 2
+ V1 (r R1 ) + . . . + VN (r RN ) j (r Rj )d.
Ri )
2m
(5.18)
In order to better understand the individual terms in this expression we can write
the above as
Hij = hi | T |j i + hi | Vi |j i + hi | Vj |j i +
k6=i,j
hi | Vk |j i . (5.19)
The first term is the kinetic energy and the rest are three classes of potential
energy terms. From this form we can classify the contribution to an individual
matrix element, Hij by making the following definitions
1. When all three locations, i,j and k are on the same atom we have an on-site
integral
2. When the potential k is on the same site as one of the wave functions i,
and the other wave function is at another site (i 6= j) we have a two-center
integral
3. When all three are on different cites (i 6= j 6= N) we have three-center
integrals
The three-center integrals are small compared to the two-center integrals. The
negation of three-center integrals (i.e. the two center approximation) greatly
simplifies the evaluation of these terms. Later when we discuss the parametrization
it will be seen how some of the error incurred in ignoring three-center integrals
are absorbed in the the fitting of the parameters.
54
2c |j i ,
Hij = hi | H
(5.20)
2c = 1 2 + Vi (r Ri ) + Vj (r Rj ).
H
2m
(5.21)
Within the two-center approximation the matrix elements essentially become a directionally dependant pair potential that are a function of internuclear separation,
|Ri Rj |, and the angular momenta of the orbitals and .
5.6
Slater-Koster parametrization
The tight binding or simplified linear combination of atomic orbitals method was
first described by Slater and Koster in 1954 [154]. They describe their method as a
manner of interpolation of finer resolved first principle calculations. This original
paper was concerned with calculating energy band properties of bulk crystalline
materials. The basis of the method is to decompose the integrals found in the
Hamiltonian and overlap matrices by means of orbital symmetry and replace the
direct evaluation of these simplified integrals by a parametrization based on the
two-center approximation.
5.6.1
Orbital decomposition
The orientation between arbitrarily aligned s-p, or p-p orbitals can be decomposed into and bonds, as is discussed in the next section. Our basis orbitals
have atomic like character and this character is described by spherical harmonics
known form the full analytical solution of the hydrogen atom. The integrals over
55
space of the products of these spherical harmonic functions (i.e. elements in the
Hamiltonian and overlap matrices) are always zero due to symmetry for certain
angular momenta and orientation. Just considering our sp basis, the orientations
that have none-zero space integrals are only 4 and are a shown in Figure 5.1.
Figure 5.1: Relative orientation of s and p orbitals that have a non-zero 3 space
integration between them. Shaded region is + and white region is -
5.6.2
s-p decomposition
2c |jp i = hS|H
2c |P i .
his |H
(5.22)
The right hand side introduces a simplified notation that will be used in this
section. It simply means an s type orbital on atom 1 and a p type orbital with
angular momenta on atom 2.
In order to simplify the evaluation of this matrix element we start by decomposing
the p orbital into components that are parallel () and perpendicular () to the
internuclear unit vector d~ that is shown in figure 5.2.
56
Figure 5.2: S and Pz orbital decomposition. Vector ~r connects the two atoms, d~
is a unit vector of ~r.
To facilitate this calculation we define the vector ~a, which is a unit vector along
the cartesian axes corresponding to the type of P orbital considered. For a Pz
The vector ~n, is a unit vector normal to d,
~ and
orbital ~a is simply 0i + 0j + 1k.
in the plane described by d~ and ~a. Both of these vectors are shown in figure 5.2.
It is now easy to describe the arbitrary P orbital as a sum of and parts as
|P i = ~a d~ |P i + ~a ~n |P i .
(5.23)
2c |P i = hS| H
2c | ~a d~ |P i + ~a ~n |P i i ,
hS|H
(5.24)
which expands to
~ hS|H
2c |P i = (~a d)
2c |P i + (~a ~n) hS|H
2c|P i .
hS|H
(5.25)
We gave the arguments in the previous section that the second term on the right
hand side of equation (5.25) is zero. With this consideration equation (5.25)
reduces to
57
~ hS|H
2c |P i = (~a d)
2c |P i .
hS|H
(5.26)
~ is
Since ~a is defined as a unit vector along one of the cartesian coordinates, (~a d)
simply the directional cosine in the direction of . Where the directional cosines
are defined by
dx =
~r x
,
|~r|
dy =
~r y
,
|~r|
dz =
~r z
,
|~r|
(5.27)
with (
x, y, z) being the global axis unit vectors. Using these definitions of directional cosines equation (5.26) can be rewritten as
2c |P i = d hS|H
2c |P i = d tsp (r).
hS|H
(5.28)
2c |P i .
tsp (r) = hS|H
(5.29)
This tsp (r) is the parameterized SK function that replaces the integral between
an S and P orbital in a configuration as a distance dependant function. In
5.8 we will talk about the form and fitting of these functions. The methodology
of decomposing this general SP integral into and components with the help
of directional cosines is applied to the other integrals of interest in the sp basis.
The resulting integrals can all be expressed via four SK functions (t(r)s) and the
directional cosines. The required t(r)s for our basis are defined as
The reduced form of the other integrals (for ss and pp orbitals) are shown in
equation (5.31) in the following section.
58
2c |Si
tss (r) = hS|H
2c |P i
tsp (r) = hS|H
2c |P i
tpp (r) = hP |H
2c |P i
tpp (r) = hP |H
5.6.3
For each atom pair i, j all matrix element terms between the orbitals on i and j are
calculated using the SK parameterized functions. Our S, Px , Py , Pz basis leads to
4 4 sub-matrices for each atom pair, labeled Hsub
ij . The matrix below shows the
2c |sj i )
what integral each element of the sub-matrix represents ( i.e. si sj = hsi | H
Hsub
ij
si sj
si pjx
si pjy
si pjz
pix sj
pix pjx
pix pjy
pix pjz
piy sj
piy pjx
piy pjy
piy pjz
piz sj
piz pjx
piz pjy
piz pjz
(5.30)
There are two types of these sub-matrices, on-site and hopping sub-matrices. The
latter occurs when the integral is defined between orbital on different sites ( i.e.
i 6= j). This 4 4 hopping sub-matrix is defined as
Hsub
ij
tss
dx tsp
dy tsp
dz tsp
=
d t
dy dx (tpp tpp )
d2y tpp + (1 d2y )tpp
dy dz (tpp tpp )
y sp
dz tsp
dz dx (tpp tpp )
dz dy (tpp tpp )
d2z tpp + (1 d2z )tpp
(5.31)
For on-site integrals, when i = j, the contribution to the sub-matrix is simply the
energies of the individual s and p orbitals along the diagonal.
59
Hsub
ij
E
0 0 0
s
0 Ep 0 0
0 0 Ep 0
0 0 0 Ep
(5.32)
These sub-matrices are then inserted into their respective positions of the global
Hamiltonain matrix Hi,j . To illustrate this assume the 4 4 sub-matrices are
indexed by x, and y. Their position in the global matrix is defined as
Hi,j 4(i 1) + x, 4(j 1) + y .
Until this point all discussion of the tight-binding matrices have been in terms
of the Hamiltonian matrix. The procedure for the overlap matrix S follows the
same route. The integrals between orbitals discussed in the previous section are
decomposed the same and the the results presented in the construction of the submatrices is the same except the overlap has different SK functions defined by t(r)s.
The on-site sub-matrix is the identity matrix instead of the orbital energies. The
global overlap matrix is constructed from its sub-matrices just as was described
above.
For periodic solids each element is multiplied by the summation pre-factor
eikRji
~
R
j
5.7
Up to now we have seen the tight binding formalism and now address calculating
the total energy of a system of atoms. This energy contains two parts: band
structure energy (EBS ), and a short range repulsive interaction (Erep ) [137]. This
division of the energy is understood in light of earlier approximations. Under
the Born-Oppenheimer approximation all energies calculated via Schrodingers
60
equation are electronic energies (EBS ). To account for the total energy the corecore ionic repulsion of the nuclei must be accounted for. The total tight-binding
energy is expressed as
E T B = EBS + Erep .
(5.33)
The repulsive contribution is between all atomic cores and is most often taken as
a two body potential (rij ) summed over all interactions
Erep =
1X
(rij ).
2 i,j
(5.34)
rij is the distance between atoms i and j. The form of the potential is not explicitly
known but is selected and fitted to reproduce points in the data base.
The second contribution to the total energy is the band structure energy. The
Hamiltonian and overlap matrices matrices are constructed for a configuration of
atoms and for a given wave vector k. These matrices are used in the generalized
eigenvalue problem given in equation (5.5) restated here in a form showing the
wave vector dependance
The solution to this generalized eigenvalue problem results in a spectrum of eigenvalues (k) corresponding to the energy of the different states, and the eigenvectors
c(k) constitute the expansion coefficients. The band structure energy is defined
as
EBS =
X Z
occ
f (k) d3 k.
F BZ
61
(5.35)
The sum is over the occupied states, and the occupation number of a state
is given by f {0, 1, 2}. The integral is over the first Brillouin zone (FBZ).
Computationally this integration is approximated by a weighted sum over discrete
k-points in the Brillouin zone expressed by
EBS =
Nk
X X
wi f (ki ).
(5.36)
occ i=1
The wi is the weight assigned to the specific k-point, and must satisfy the condition
Nk
X
wi = 1.
(5.37)
i=1
It is possible to uniformly sample the Brillouin zone at discrete points and assign
a uniform weight of w = 1/Nk in evaluating 5.36.
5.8
Fitting TB parameters
We have seen the ground work of tight binding this far but now must consider how
the various parameters are fit. Slater and Koster originally used first-principles calculations of energy eigenvalues at k-points corresponding to high symmetry points.
By using a least squares method a set of parameters can be found such that the
mean squared error of the eigenvalues at the high symmetry points is minimized.
Of course fitting only to the band structure for one allotrope is the easiest. This
procedure for carbon can be carried out by simultaneously fitting/reproducing the
band structure for diamond and graphite. Material properties such as the bulk
modulus and other elastic constants can be used in the database for the fitting of
the parameters. The development of tight-binding methodology given above carried along the overlap matrix. Many early parameterizations were designed such
that all basis functions were orthogonal and thus the overlap matrix is simply
the identity matrix. These orthogonal models have half the parameters as non62
orthogonal models to fit (i.e. no t(r)s for the overlap). The extra parametrization
and flexibility of non-orthogonal models has been noted to improve the transferability of tight binding models.
5.8.1
The tight-binding parameters used in our code are those of Porezag et. al. [137].
They developed a SK parametrization for carbon based on DFT, that has proven
versatile and predictive. They started by writing the Kohn-Sham orbital ks
as a linear combination of atom centered pseudoatomic wavefunctions , that
are sums of Slater-type orbitals and spherical harmonics. These pseudoatomic
wavefunctions contain parameters that are found through self-consistently solving
the Kohn-Sham equation given by
h
i
T + V psat (r) = Epsat ,
psat
The term
N
r
ro
VxcLDA [n(r)]
(5.38)
r
ro
N
(5.39)
band structure fitting is improved. This has consequences that will be addressed
in 8.2.1.
The obtained by solving equation (5.38) are used as the basis functions in
a LCAO treatment of the systems and are used next in calculating the matrix
elements H and S. For the Hamiltonian matrix elements the two-center approximation is utilized and the potential is a Kohn-Sham potential of a neutral pseudoatom. These elements are evaluated explicitly at discrete distances using the
method described in [40]. The tabulated values for the matrix elements as a function of internuclear separation are fit to 10th order Chebyshev polynomials. Its
63
LDA
Urep (R) = Etot
(R) EBS (R).
(5.40)
These points are similarly tabulated and fitted, in this case to a sum of polynomials.
5.9
64
6
Quantum mechanical origin of dispersion
forces
This chapter contains theories of dispersion forces that are elucidated and applied
to the model graphitic system. The 1/r 6 dependance in the dispersion energy will
be brought to light, along with calculation of C6 coefficient for graphite utilizing
different methods.
Theories of dispersion forces can be generally lumped into two camps: microscopic
and macroscopic theories. The microscopic theory is a bottom-up approach based
on the atom-atom dispersion interaction of London that has been mentioned several times thus far. In these microscopic theories the total interaction is thought of
65
as a sum of contributing part of the microscopic constituents. Under the assumption of pairwise addivity (discussed in the next section) the microscopic dispersive
interactions add up to describe the total interaction between macroscopic bodies
(i.e. molecules/solids).
In an opposite top-down fashion macroscopic theories start with the dispersive
energy between macroscopic bodies. Again under the assumption of additivity,
the effective microscopic contributions can be backed out of these theories. The
theories of Lifshitz and Hamaker are introduced in the eventual extraction of C6
for graphite via these methods.
6.1
Before discussing the physical basis of the dispersion theories in detail there are
some important points to keep in mind. There are some major assumptions that
are commonly made in dealing with dispersion forces that are rarely mentioned
in the first principles plus dispersion papers cited in 4.2. These assumptions are
addivity, isotropicity and non-retardation [110, 111].
The addivity assumption is the idea that the total dispersion energy can be expressed as a pairwise summation of interaction between the atomic constituents.
The isotropicity assumption says that the dispersion energy is isotropic in nature.
This turns out to be only true for some special cases and as a general rule does not
hold. Retardation is an effect that arises at far separation. Since dispersion interactions are electromagnetic in nature the instantaneous dipole that is realized on
one molecule takes a finite amount of time to register with the other molecule due
to the speed of light. At farther separations these systems can no longer correlate
with each other for a net lowering of energy. The 1/r 6 energy distance dependance
decays to a 1/r 7 dependance and finally disappears at further distances [110].
In later sections these assumptions will be further addressed in conjunction with
the development of the theories.
66
6.2
6.2.1
System description
Ha a = Ea a ,
(6.1)
Hb b = Eb b ,
(6.2)
where the Hamiltonians contain terms associated with the isolated atoms. This
system is said to be unperturbed and its wave function is simply the product of
the individual wavefunctions, and its hamiltonian and energy just the sum of the
two, i.e.
67
(0) = a b = a b ,
(6.3)
H (0) = Hab = Ha + Hb ,
(6.4)
E (0) = Ea b = Ea + Eb .
(6.5)
The superscripts 0 mean the zeroth order and stand for the unperturbed solution. The system of two non-interacting atoms now can be written as
6.2.2
(6.6)
When the two systems are separated by a finite distance rather than infinite,
the wavefunctions a and b are nearly the same but are slightly distorted due
to the presence of one another. If the distortion is slight enough one can apply
perturbation theory to approximate the change in the system.
Perturbation theory starts by assuming the full knowledge of the zeroth-order
unperturbed solution to the Schrodinger equation given in equation (6.6). With
this knowledge its possible to describe the solution of the same system that is
slightly perturbed. This system is described by a new Hamiltonian that has the
same base Hamiltonian H (0) plus a small perturbation V
H = H (0) + V.
(6.7)
The perturbation V can have many orders of contribution as can be see in Appendix E. Here we do not show higher order perturbations because they do not
68
add anything to the description of our system and it helps keep the notation
cleaner.
6.2.3
The correction to the energy and wavefunction in perturbation theory are added
to the zeroth order energy and wavefunction, and the actual wavefunction and
energy are expressed as
(6.8)
(6.9)
The superscripts on the energy and wavefunction denote the order of the correction. E (1) and E (2) are referred to as the first and second order corrections to the
energy respectively, and as mentioned earlier E (0) corresponds to the unperturbed
energy.
The derivation for the first and second order corrections to the energy and wavefunctions are shown in Appendix E. The first order correction to the energy, in
state k is
(1)
Ek = Vkk .
(0)
(6.10)
(0)
(2)
Ek
X
n
|Vkn |2
(0)
(0)
Ek En
69
(6.11)
(1)
(0)
(0)
Ea b = ha b | V |a b i ,
(2)
Ea b =
X
X
n1
6.2.4
n2
(0)
2
(0)
(0)
ha b | V |n1 n2 i
(0)
(0)
(6.12)
(0)
(6.13)
Application to Hydrogen
If the atoms A and B are hydrogen and the quantum numbers correspond to the
ground state we know, from the full analytical solution of the wave functions in
the ground state that the charge distributions are spherically symmetric.
~r1
~
R
+
~r2
y
B
A
x
~ with
Figure 6.1: Two hydrogen atoms A and B. The nuclei are separated by R
electrons referenced from the nuclei
Figure 6.1 shows the system of hydrogen atoms, separated by the internuclear
~ Atom As electron is denoted by ~r1 and the other electron is denoted as
vector R.
~r2 for atom B. The perturbation in our case is the columbic interactions of charges
in atom A and those in atom B expressed by
70
nA nB
e e
n e
e n
A
}| B { z A}| B { z A}| B {
z}|{ z
2
e2
e2
e2
e
+
V =
.
~ + ~r2 ~r1 | |R
~ + ~r2 | |R
~ ~r1 |
r
|R
(6.14)
~ in the first term, and the overbraces denote the interaction (e.g. nA
r = |R|
eB is the term associated with the Coulomb interaction between nucleus A and
electron in B). Assuming the nuclei are sufficiently far apart the interaction can
be expanded in a Taylor series. For the general case of an arbitrary number of
electrons this expansion is referred to as a multipole expansion and is described in
Appendix D ( see Equation D.9 and table D.1). In this derivation the internuclear
~ is taken as one of the cartesian directions ( e.g. x direction in figure 6.1).
vector R
For the case of non-ionic interaction, the first five terms of the expansion in table
D.1 are zero. The electrostatic energy between these neutral charge distribution,
begins with the dipole-dipole interaction energy and continues up into the higher
order terms. The perturbations first non-zero term is given by
~ 3
V =R
X
ij
(6.15)
and termed the dipole-dipole operator. The sum goes over the i charges in atom
A and the j charges in atom B. The cartesian components of ~ri are xi , yi and
zi with origins centered at the nuclei, and qi is the magnitude of the charge. For
hydrogen, using atomic units (i.e. e = 1) the dipole-dipole interaction can be
written as
V =
1
(x1 x2 + y1 y2 2z1 z2 ).
r3
(6.16)
With this perturbation we can calculate the corrections to the energy of the isolated hydrogens due to their mutual interaction. The change in energy between
the isolated states is simply
71
(1)
(2)
E = E0 + E0 + . . . h.o.t s.
(6.17)
(1)
The first order correction to the energy given in equation (6.12) is zero (E0 = 0).
This is because the ground state wavefunctions are spherically symmetric and the
integral described by them will always be zero.
For the second order correction to the energy we plug our perturbation from
(2)
(6.16) into the definition for E0 given in equation (6.13). With the ground state
described by a = b = 0 the second order correction to the energy is
(2)
E0 =
1
r6
X
X
n1
n2
2
(0) (0)
(0) (0)
h0 0 | x1 x2 + y1 y2 2z1 z2 |n1 n2 i
(0)
(0)
(0)
(0)
(6.18)
The 1/r 3 term from the perturbation given in equation (6.16) is squared in the
numerator of the second order correction and is brought front as 1/r 6 .
(3)
E =
C6
.
r6
(6.19)
(0)
E0 < En for all n in the sum, and the numerator is always positive so C6 will
always be negative. For excited states this condition is not necessarily true and
there can be a net repulsion.
72
6.2.5
We have truncated the perturbation at the second order and only considered the
dipole-dipole perturbation. It is wise to consider the magnitude of both the higher
order perturbations and multipole terms.
This change in energy will be referred to as the dispersion energy. If higher order
multipole terms were included at in the perturbation in (6.16) we would see that
the dispersion energy takes the form
disp
X C2n
C6 C8 C10
.
= 6 + 8 + 10 + . . . h.o.t s =
r
r
r
r 2n
n=3
(6.20)
The right hand side shows the general form of the infinite expansion.
6.2.6
The next few sections describe the approximate evaluation of C6 for graphite by
simplifying the double summation in equation 6.18.
= E .
(6.21)
The static polarizability, , has units of [C2 m2 J1 ]. Polarizability is usually reported in the form of the polarizability volume given by
73
,
4o
(6.22)
3
with units of volume [
A ]. While the polarizability is properly a tensorial quantity,
here it is assumed that the measured value used in Londons approximation is the
average static polarizability over all orientations.
Londons approximation is based upon applying the closure approximation ( see
Appendix E.3) in evaluating the sum in (6.18), the form is [108]
Lond
C6ij
3
=
2
Ei Ej
Ei + Ej
i j .
(6.23)
The E is the average energy difference between all excited states and the ground
state as explained in E.3. This term has been proposed to be approximately
either the ionization potential (I), and the resonance or first transition energy (1 )
[97]. Another interpretation will be discussed in the following section. Using the
interpretation of the average energy difference (Ei ) as the ionization potential,
the London form of C6 between atoms i and j is
Lond
C6ij
3
=
2
Ii Ij
Ii + Ij
i j .
(6.24)
While this is a rough approximation due to the assumptions made, this formula
provides a good starting point to evaluate C6 with experimental data.
In order to evaluate this expression for carbon in graphite one needs the static
polarizability, and the ionization potential in the graphitic state. The last point,
in the graphitic state is important. One can find these data points for carbon
in its isolated form, to its many forms of hybridization, and when it is hybridized
to other species.
The ionization potential for carbon in graphite measured experimentally is 11.22
74
eV [136]. The static polarizability, on the other hand is not as easy to measure.
The static polarizability for an atom is not just a function of its atomic number,
but it is also a function of its state of bonding or hybridization [82, 118]. Using
this logic one can see that since there isnt such a thing as isolated carbon with
the properties of a carbon atom in a graphitic state, this is not a directly measurable quantity. Fortunately there are empirical combinations rules for describing
molecular polarizability in terms of atomic polarizability and hybridization that
work very well [82, 118]. The most extensively tested of these is Millers [118]. His
combination rules correctly reproduce experimental polarizabilities of over 400
compounds with in a average error of 2.2%. For carbon in the graphitic state
only bonded to other carbons, Miller finds that the static atomic polarizability is
3
= 1.896
A . Using this polarizability value with the ionization value quoted
6
above the C6 for graphite using equation (6.24) is 30.25 eV
A
Ei = (Ni /i )1/2 .
(6.25)
sk
C6ij
i j
3
.
=
1/2
2 (i /Ni) + (j /Nj )1/2
(6.26)
Halgren has applied a similar method as Miller for the combination rules of dis75
kw
C6ij
i j
.
= 6me c
i /i + j /j
2
(6.27)
me is the mass of the electron, and c is the speed of light. Using this form, Kraus
6 for graphite from data provided in [6].
[98], calculated C6 = 15.2 eV A
6.3
The other way to tackle the problem of describing dispersion forces is to start
with a description of macroscopic bodies interacting through dispersion forces.
These theories are rigorously expressed with quantum electrodynamics (QED)
[102]. They have been reformulated in semi-classical terms and in this context
the dispersion energy can be defined as the change in the zero-point energy of
the electromagnetic field modes (obtained by solving Maxwells equations) when
the latter are perturbed by the molecules through coupling of the field with the
polarization currents induced on the molecules [110]. The general results and
implications of this macroscopic theory of dispersion forces will be briefly summarized and stated in the next few sections.
76
6.3.1
The dispersion interaction between two semi infinite slabs was described by Lifshitz, who developed a general theory of dispersion interactions between macroscopic bodies [106, 36]. Before the general form Lifshitz proposed is explained, a
necessary quantity, (i), which is utilized by this theory must be discussed.
D = E.
(6.28)
(6.29)
The Lifshitz theory of dispersion forces presented in the next section requires
the function (i), which is the real part of the dielectric function evaluated at
purely complex frequencies = i, and will be referred to as the Lifshitz dielectric
function from here on. The dielectric permeability for an imaginary frequency is
related to the imaginary part of the dielectric permeability via the Kramer-Kronig
relation [72]
77
2
(in ) = 1 +
Z
0
()
d.
2 + n2
(6.30)
Thus if the imaginary part of the dielectric function evaluated at real frequencies
is known the Lifshitz dielectric function can be attained. One path to (in ) can
start with the experimental data n() and k(), progress to () via equation
(6.29), and finally to (in ) via the Kramer-Kronig relation in equation (6.30).
Another route is the direct calculation of () from a first-principles or semiempirical technique and then the application of the Kramer-Kronig relation.
L
Figure 6.2: Two semi-infinite slabs 1 and 2 separated by a distance L and a
medium 3.
Z
kb T X
k ln(1 12 32 e2kL ) dk.
U123 (L) =
2 n=0
0
78
(6.31)
kj =
k (in ) j (in )
,
k (in ) + j (in )
(6.32)
n = n
2kbT
~
(6.33)
Mahanty and Ninham have developed this theory beyond Lifshitzs two slab problem to other geometries such as sphere-sphere, cylinder-cylinder etc. along with
theories for anisotropic dielectric functions [110].
6.3.2
Uss (L) =
A
,
12L2
(6.34)
one can solve for the Hamaker constant A [10, 48, 72]. The derivation of this
energy relationship and how the Hamaker constant A is related to C6 will be
discussed in the next section. Here we note two case in which the evaluation
of the Lifshitz dispersion relation has been applied to graphite and a Hamaker
constant has been identified, in addition to this a single experimental evaluation
of the Hamaker constant has been found in literature.
Dagastine et al. calculated the Hamaker constant for graphite from experimental
79
spectroscopic data [30]. The isotropic assumption of the dielectric function mentioned in 6.3.1.1, does not hold for graphite. The in plane dielectric function
(xx = yy ) is quite distinct to that normal to the plane (zz ). Spectroscopic data
from both in plane and normal to the plane measurements was used in the con
struction of () with the relation in (6.29), and used equation (6.30) to convert
this to (in ). Using a modified version of the Lifshitz equation in (6.31) that
accounts for non-isotropic dielectric functions they calculated a Hamaker constant
of 2.53 1019 J.
Li et al. have preformed a similar analysis on graphite but attained the dielectric
response via a tight binding model under linear response theory [105]. Unlike
Dagastine they assumed an isotropic dielectric response and there resulting long
wavelength Hamaker constant is reported as 2.2 1019 J.
Parfitt and Picton reported a experimental Hamaker constant for graphite from
some experiments on colloidal suspensions of graphite flakes. Due to numerous
uncertainties in the analysis a rather broad range of 2.1 - 5.9 1019 J is reported
[132].
6.3.3
In conjunction with the last section this section further develops the ideas of
Hamaker in describing the interaction between macroscopic bodies and calculations of effective C6 dispersion constants are preformed utilizing the Hamaker
constants reported in above. Hamakers ideas came out of the context of colloidal
science. The main idea of the Hamaker energy relation between macroscopic bodies is an assumption of addivity of dispersion forces and a treatment of the atoms
in a solid as a continuum [61]. With these assumptions the total interaction energy can then be expressed as volume integrals over the interacting bodies. These
calculations can be applied to any arbitrary geometric bodies (e.g. sphere-sphere,
cylinder-sphere etc). The derivation for the two slab problem is given below in
80
Upp (r) =
C6
.
r6
(6.35)
dx
dy
x=0
r
X
y
x=L
Figure 6.3: Point-surface interaction diagram. The wavy line represents a visual
truncation of the infinite solid. The infinitesimal ring is shown from the side and
the dotted line represents its position in the solid into and out of the page
81
Figure 6.3 is a diagram of the system in question. The point particle is located at
x = 0, a distance L from the surface. The differential element is a infinitesimal
ring that is equidistant from the point. The volume of this ring multiplied by the
number density () is the number of atoms at this distance r,
N(r) = (2y)dxdy.
(6.36)
r is simply r = (x2 + y 2 )1/2 . By evaluating the dispersion energy with equation (6.35) with this distance r, and multiplying it by the number of atoms at
this distance given by N(r), gives the dispersion energy contribution from this infinitesimal ring. Integrating over all rings results in the total dispersion interaction
energy between a point and a half-slab in the form
Ups (L) =
Z Z
(2y)
(x2
C6
dy dx.
+ y 2 )6/2
(6.37)
x=L y=0
The ps stands for point-half slab interaction. For the geometry shown in Figure
6.3 this reduces to
Ups (L) =
C6
.
6L3
(6.38)
(6.39)
dx
x
and the surface-surface total interaction is just these surface elements integrated
from x = L
Uss (L) =
Use (x) dx =
C6
dx.
6x3
(6.40)
This simplifies to
Uss (L) =
A
C6 2
=
.
12L2
12L2
(6.41)
The right hand side is the same form that was shown earlier in equation (6.34).
As can be seen from this equation the Hamaker constant A is defined as
A = 2 2 C6 .
(6.42)
The graphite dispersion constant C6 can be backed out of the Hamaker constant
with knowledge of the number density of carbon atoms in graphite which is =
83
3
1/(A0 Zeq )= 0.114 atoms/
A . Using this with equation (6.42) the three reported
6.4
Later when we report our fitted value of C6 we must refer back to this the results
of this chapter to see if the value found is reasonable with what was presented
here in its direct calculation. The results of the various methods, including the
experimental Hamaker value, are tabulated in table 6.1. This table shows a decent
spread with the Lifshitz based methods clustered at the lower end and the other
methods showing a larger value. It is interesting to note that the one experimental value reported (calculated from the Hamaker decomposition method) almost
perfectly spans the range of calculated values.
6
C6 [eV
A]
30.25
30.89
16.34
15.2
12.32
10.71
10.22 28.72
Method
London
Slater-Kirkwood
Slater-Kirkwood
Kirkwood
Lifshitz/Hamaker
Lifshitz/Hamaker
Hamaker (experiment)
84
7
Damping functions
In the previous chapter Londons perturbative description of the dispersion attraction between hydrogen atoms was discussed amongst other theories. This analysis
was based on two approximations that breakdown when the two hydrogen atoms
come closer together. The first is related to the main assumption in perturbation
theory, that says the perturbation must be small enough such that perturbed system is close to the unperturbed state. When the hydrogen atoms, or any other
system of atoms, are brought close enough to one another the resulting wavefunctions are very different than those of the isolated system, and thus perturbation
theory fails. The second breakdown in Londons derivation at close proximity
is from the definition of the perturbation operator via the multipole expansion.
85
This was derived through a Taylor series expansion under the assumption that the
internuclear separation was much larger than the average radius of the electron
cloud. At small internuclear separation this expression for the perturbation losses
its meaning.
In order to properly treat this region were Londons assumptions break down
the idea of a damping function is introduced. The deviation of the dispersion
energy from the asymptotic limit description, given in equation (6.20), at close
internuclear separation can be described if the true dispersion energy is known.
The damping function is defined simply as the ratio of the true dispersion energy
and the London dispersion energy
f (r) =
Eodisp
ELdisp
(7.1)
Eodisp is the true dispersion energy and ELdisp is the London dispersion energy. If the
damping function is known then the true dispersion energy is simply the damping
function multiplied by the London dispersion energy,
Eodisp = f (r)ELdisp .
(7.2)
There are some general features of this damping function which can be pointed
out. One is that it acts as a switching function and has a range between 0 and
1. As r approaches zero f (r) must necessarily go to zero at least as fast as the
asymptotic dispersion energy. This can be understood by saying the dispersion
energy must be finite at all distances r, without the damping function it would
approach as r tends to zero. In the limit of large r, the asymptotic limit
of the London derivation is valid and f (r) must be unity. The nature of how
this damping function switches from 0 to 1 is the key point to consider. As in
the development of the 1/r 6 dispersion energy dependance the hydrogen dimer
86
7.1
The only damping function which is known to a very high degree is that of hydrogen. Koide, Meath and Allnatt (KMA) have preformed pseudo-state evaluation
of the dispersion energy [93]. These calculations are cited as the most accurate
analysis of the dispersion energy between hydrogen atoms, and are the benchmark
for proposing a general form of damping functions [157, 166].
7.2
All of the first principles plus dispersion models discussed in 4.2 employ a damping function. There are a handful of damping functions that are used often. A
early form was proposed by Ahlrichs et al. [1]
(7.3)
2
fdamp (r) = 1 exp[c(r/rm )3 ] .
(7.4)
Again c and rm are fitted for the particular interaction. Fermi functions, which
also have two parameters to fit are also used as damping functions in some models.
Yet another damping function is that of Tang and Toennis [157]. They developed
87
a semi-universal damping function that has been shown to correctly describe the
functional form of the damping function for a number of interactions. This damping function has a single parameter, , which describes the range of the overlap
of the interacting species and has the form
2n
fdamp
(r) = 1
2n
X
(r)k
k=0
k!
er .
(7.5)
Each term in the dispersion series given in equation (6.20) has an individual damping function. n = 3 in equation (7.5) corresponds to f6 which is the damping
function used for the dipole-dipole dispersion term C6 /r 6 . The total damped dispersion series in Tangs form is then
disp
Edamped
2n
fdamp
(r)
n=3
C2n
.
r 2n
(7.6)
This damping function has a few positive qualities: its apparent universality, and
single parameter formulation. Tang and Toennies have shown that their damping
function does a very good job in reproducing the form of the damping function
as calculated by Koide et al. described above. Figure 7.1 shows Koides data set
for hydrogen as dots and the fitted damping function as a solid line for the first 8
terms in the dispersion series. Similar good fits are shown for He-He, Ar-Ar, NaK and Li-Hg based on the best available data for these interaction. Their paper
shows the shortcoming of the other discussed damping functions in describing these
interactions as compared to their own. This damping function will be adopted for
use in our formulation of the damped dispersion based on these arguments.
88
Figure 7.1: Plot of Tangs damping function for b = 1.67 a.u. for n=3-10 as solid
line, and Koides hydrogen dispersion energies as dots [93]
89
8
Tight-binding plus dispersion
parametrization
This chapter details the marriage of the tight-binding model with a damped dispersion energy and our parametrization of the free parameters to model both the
inter and intra-planar properties graphite. The notation for the interlayer energy
e defined in 3.3 is used in this and following sections. A rhomper atom (E)
bohedral supercell of four unit cells containing a total of sixteen atoms is used.
8.1
The total energy for the system is defined in the same manner as was discussed
in 4.2 and is repeated here:
E tot = E T B + E disp .
(8.1)
TB
Nk
X X
wi f (ki ) +
occ i=1
1X
(rij ).
2 i,j
(8.2)
E disp =
1X
C6
fdamp (rij ) fcut (rij ) 6 .
2 i,j
rij
(8.3)
In this dispersion energy expression, fdamp (rij ) is the damping function. We discussed in 7.1 that we are using the Tang and Toennis damping function due to
its simplicity and seeming semi-universality. [157] Its form for the damping of
the C6 term (n=3 in equation (7.5)) is
fdamp (r) = 1
6
X
(r)k
k=0
k!
er .
(8.4)
The term fcut (r) in equation 8.3 is a cutoff function. Cutoff functions are employed
to make the neighbor search routine more efficient, and improve the stability of
molecular dynamics simulations. Cutoff functions effectively define a radii beyond
which contributions are considered negligible and thus ignored. We have utilized
a Fermi function for the cutoff function within the defined cutoff distance Rcut ,
and 0 otherwise.
91
fcut (r) =
1
c
exp( rr
)
rw
+1
if r Rcut
(8.5)
if r > Rcut
The Fermi function is equal to unity and switches to zero with continuous derivatives, and in our case the dispersion energy also has continuous derivatives. For
molecular dynamics having continuous derivatives of the energy at the cutoff
negates the possibility of spikes of high force when atoms enter each others domains defined by the cutoff distance. This Fermi function is defined by two parameters; rc determines the distance at which the function is equal to 1/2, and
rw controls the smoothness of the cutoff. We simply choose a value of rw = 0.1
and find a corresponding rc such that the function is essentially zero at the cutoff,
i.e. f (Rcut ) = 1 105 . Figure 8.1 shows the cutoff functions for various values
of Rcut . The seeming arbitrariness in the definition of the cutoff function will be
addressed and rectified later in 8.4.
1
R
=5
= 10
= 15
cut
0.9
cut
0.8
cut
0.7
0.5
cut
(r)
0.6
0.4
0.3
0.2
0.1
0
10
r [A]
15
20
92
8.2
The tight-binding energy expression given in (8.2) requires the summation of energy over the Brillouin zone. The Brillouin zone was sampled with the special kpoints (ki ) and weights (wi ) of Monkhourst-Pack [120] as tabulated by [101]. The
density of k-points was increased to 370 to satisfy an energy convergence criteria
of 0.1 meV. A summary of this convergence is shown in Table 8.1 for graphite
at its equilibrium interlayer spacing. This tight convergence is need because the
magnitude of the interlayer energy is on the order of 50 meV. 8.3.2 discusses an
exception to the use of the Monkhourst-Pack points for certain calculations.
# k-points
1 ( )
28
133
370
610
ET B
-46641.143
-47341.229
-47344.471
-47344.801
-47344.882
Figure 8.2 shows a plot of the tight-binding interlayer energy per atom as a function
of interlayer spacing. As expected there is no minima due to the lack of dispersion
energy in the formalism of tight-binding and the resulting energy is of a purely
repulsive nature.
It should be noted that in the figure the interlayer energy effectively goes to
zero before Zeq . Though the tight-binding cut-off distance is around 6
A, there
is no interaction energy at this distance. This situation needs to be rectified
on the basis that in order for there to be a minima at Zeq , and thus a stable
graphite structure, there must be a non-zero tight-binding interlayer energy at Zeq .
This can be understood by a simple argument. Both energy terms monotonically
approach zero at large distances, but from different sides of zero. In order for a
minima to occur the first derivative of the total interlayer energy must be zero, and
consequently the sum of the first derivatives of the tight binding and dispersion
93
200
180
160
140
TB
At Zeq, E
< 1 10
meV/atom
120
100
80
60
40
20
0
20
2.5
3.5
interlayer separation [A ]
Figure 8.2: Purely repulsive tight-binding interlayer energy
interlayer energy must sum to zero. The derivative of the attractive dispersion
term will always be positive, implying that there must be a negative-valued first
derivative of the tight-binding interlayer energy in order for the minima to occur.
Due to its decreasing monotonic nature the tight-binding energy must necessarily
be non-zero at the interlayer spacing at which the minima occurs, in order for its
derivative to be non-zero.
8.2.1
Orbital expansion
One may wonder why the DFT fitted tight-binding parametrization fails to produce a repulsive overlap energy at the equilibrium interlayer spacing while it naturally must, in accordance to the previous argument. This result can be directly
traced to the artificial orbital compression described in 5.8.1, done at the time in
the name of celerity. To undo this limitation on the proper modeling of graphite
interlayer properties one can simply inflate the compressed orbitals. This orbital
inflation is done in a simple manner. To inflate a particular pair of interacting
orbitals one can simply contract the distance in which they are separated when
94
evaluating the hopping integrals in the hopping sub-matrix. This is done by introducing a scaling parameter which is multiplied by the real internuclear distance
rij to get a reduced distance rijred = rij which is used in the integral evaluation. In
order to preserve the in-plane properties of graphite (which are already properly
described with the unmodified TB code) we must introduce a numbering system
for the bodies interacting via dispersion forces. A layer of graphite, or later on,
a single tube is numbered and the distance scaling is only done between different
numbered systems so that in the case of infinite separation of the systems the
in-plane energetics are the same as with out the orbital stretching addition.
We have previously stressed the importance of the Pz orbital in the repulsive overlap interaction so it would make sense to only inflate this orbital when evaluating
hopping integrals between Pz orbitals on differing sheets. This works fine until we
bring our graphite dispersion model to use on the curved surfaces of carbon nanotubes. In this case there is no uniform z direction in which to inflate Pz orbitals
on differing nanotubes. Another formulation is to uniformly inflate all the orbitals
in the graphite formulation and fitting, thus bypassing this directional problem
when describing tube-tube interactions.
The introduction of this scaling parameter produces the desired repulsive energetic
effect, as is illustrated in Figure 8.3. With a decreasing we see the increased
repulsion and our necessary non-zero tight-binding interlayer energy at Zeq .
8.3
At this point there are 3 free parameters, the scaling parameter , the dispersion
constant C6 , and the damping function parameter . This situation is similar to
the one with which Palser dealt with ( 4.2). Two of the data points Palser used
in fitting have a relatively high uncertainty: the experimental interlayer cohesive
energy, and the DFT stacking difference between AAA and ABA. It seems that
a more logical choice in fitting these parameters would be to fit to the more well
95
300
= 1.00
= 0.95
= 0.90
= 0.85
250
200
150
Zeq
100
50
0
2.5
3.5
interlayer separation [A ]
Figure 8.3: Tight-binding interlayer energy for different s. Arrow shows trend
of decreasing , and the associated orbital inflation
known E2g (1) shear mode frequency and the z-axis compressibility, in addition to
the equilibrium interlayer separation Zeq . These three data points constitute a
much firmer database for our fitting than Palsers.
The fitting routine consist of the simultaneous functional minimization of the three
experimental points (Zeq , kz , Eex ) minus those calculated in the tight-binding plus
dispersion code. This procedure is preformed via the hybrd.f routine provided
in the MINPACK library from Argonne National Laboratory [122]. This routine
is an implementation of Powells hybrid method, based on the conjugate gradient
search routine [138].
In order to proceed in computing the free parameters via the aforementioned
method it is necessary to construct functions to calculate the equilibrium spacing,
phonon frequency and the compressibility given a set of parameters , C6 , . The
next three sections address the computational details of attaining these values.
96
8.3.1
Zeq
edisp
E
+
Z
= 0,
(8.6)
Zeq
which is simply a statement that the repulsive tight-binding interlayer force must
balance the attractive dispersion force at the equilibrium interlayer spacing. These
derivatives are evaluated through a finite central difference derivative evaluated
at a dilated interlayer spacing Zeq + Z and a compressed spacing Zeq Z. For
the tight-binding term this is
eT B
E
Z
Zeq
eT B (Zeq + Z) E
eT B (Zeq Z)
E
.
2 Z
(8.7)
8.3.2
The E2g (1) phonon frequency is evaluated via the frozen phonon method, modeling the interlayer sliding as approximately harmonic about the equilibrium ABA
stacking [114]. Under this assumption, the natural frequency of the sliding layers
can be described by
2fE2g (1)
q
exp
e cc
/mc .
= K
(8.8)
Here fE2g (1) is the frequency which corresponds to the E2g (1) mode (1.26 THz),
exp
e cc
K
is the experimental effective spring constant per atom along the carbon-
carbon bond length, and mc is the mass of a single carbon atom (12 amu =
1.9926 1026 Kg). Solving for the effective spring constant in the units of our
code
97
2
exp
e cc
K
= 7.795 102 (eV/
A )/atom.
(8.9)
This effective one atom spring constant can be calculated for graphite in our tightbinding code using the relation of a harmonic oscillator.
e cc d 2 .
e tot (dcc ) = 1 K
E
cc
2
(8.10)
e tot (dcc ) in this case represents the energy per atom in ABA graphite under
E
uniform translation (dcc ) of every other layer in one of the carbon-carbon bond directions. This motion corresponds to the relative translation of layers in the E2g (1)
mode described in 3.5. The energy is evaluated at Zeq and dcc is the magnitude
of the interlayer sliding (dcc = 0 corresponds to a perfect ABA stacking).
e tot (dcc ) is evaluated at three points (dcc = 0, dcc ) and fitted to a quadratic
E
form via a least squares method 2 .
etot (dcc ) = a d2 .
E
cc
(8.11)
The value a is the leading coefficients of the quadratic polynomial. Equating the
right hand sides of this equation and equation (8.10) and canceling leads to
e cc = 2a,
K
(8.12)
exp
e cc
2a K
= 0.
(8.13)
98
exp
e cc
Where the value of K
is that reported above in (8.9).
8.3.3
Compressibility calculation
kzT B+disp
Ao d E (Z)
=
Zeq
dZ 2
2
e tot
Zeq
(8.14)
The second derivative of the interlayer energy is with respect to the interlayer
separation evaluated at the equilibrium ABA spacing. Ao is the area per atom in
2
A /atom).
the graphene sheet (Ao = 3 a2 /4 = 2.62
This second derivative was calculated with the second order central difference
approximation [65]
etot (Z)
d2 E
dZ 2
=
Zeq
e tot (Zeq + Z) 2E
e tot (Zeq ) + E
e tot (Zeq Z)
E
.
Z 2
(8.15)
(8.16)
A subtle point was discovered in evaluating these phonon frequencies by this method. There
e tot (dcc ) about the ABA stacking. Specifseemed to be some discrepancies in the symmetry of E
ically it was not found to be harmonic as expected. A closer look revealed that the MonkhorstPack special k-points are generated under the consideration of certain group theoretical considerations that are broken as the perfect hexagonal unit cell is disturbed. This problem was
remedied by population of the Brillouin zone with a sufficient number of k-points (15 15 3)
e tot (dcc ) about the ABA
to attain the convergence mentioned previously. With this treatment E
stacking showed the expected harmonic behavior, and this brute force population is used in all
calculations of the E2g (1) mode.
99
8.4
Optimization results
Functions were created to evaluate the objective equations given in (8.6), (8.13)
and (8.16), detailed in the previous three subsections. These three objective functions are utilized by the previously discussed hybrd.f routine in the parameter
optimization. There is a fourth semi-free parameter, the cutoff distance Rcut for
the dispersion energy. This parameter was found iteratively and effectively canceled the need for a cutoff function, as will be shown. The logic behind picking this
distance is based on a few ideas. Since we are assuming pairwise addivity in this
model the long range interactions are collectively contributing to the total energy,
but the contribution to the total energy drops off quickly as 1/r 6 . My logic was
to apply the parameter optimization procedure with progressively larger cutoff
distances until the parameters converged to steady values. This was done, and at
a cutoff distance of 12.25
A the three free parameters are changing less by than
1%. Table 8.2 shows a summary of the optimized parameters. Note that for the
optimized parameter values, the dispersion energy contribution at Rcut = 12.25
A
without the Fermi function cutoff, is 7.3 106 meV, which is quite small. Even
though this energy contribution without the Fermi function is small, the Fermi
function is retained to remain consistent and completely zero out any force spikes
while preforming MD.
C6
(eV
A6 ) (
A1 )
24.8
1.45 0.905
Table 8.2: Optimized parameters for the tight-binding plus dispersion model with
a cutoff of 12.25
A
The parameters shown in Table 8.2 result in an essentially zero error fit to the
database values (all less than 0.2% difference). The fitted TB+disp interlayer
energy is plotted and compared to the Girifalco empirical model in Figure 8.4.
The energy curves look similar except for the scale and curvature around Zeq .
Figure 8.5 shows more definitively the difference between the two models. In
100
this figure the interlayer energy as a function of layer separation is shown for
both the ABA and AAA sacking of the TB+disp model and Girifalcos empirical
model. Note the small difference between these curves for the empirical model as
compared to the TB+disp model. Finally Figure 8.6 shows the energy landscape
of graphite, at Zeq , under a shear transition from an ABA stacking (the minimas)
to AAA stacking (the maximum points). This landscape shows the corrugated
nature of graphite that was discussed earlier and expressed well in Figure 3.3.
The comprable landscape for the LJ models is much flatter as noted in 4.1.
10
15
20
25
30
35
40
TBD ABA
giri ABA
45
50
55
3.5
4.5
Interlayer separation [A ]
Figure 8.4: Fitted tight binding plus dispersion interlayer energy for both ABA
and AAA stacking plotted along with Girifalcos empirical model
8.5
Discussion
It is necessary to discuss these results to make sure that the values of the parameters found are in accordance to what is known. In this fitting procedure the
database consisted of three experimental data points that are all found at the
equilibrium spacing. At this point we have both repulsion and attraction between
the layers. Shortly beyond the equilibrium distance there is practically no repulsion and only dispersive attraction. The database had no data points to constrain
the magnitude of C6 , which dictates the strength of interactions beyond the equi101
15
giri AAA
giri ABA
TBD AAA
TBD ABA
20
25
30
35
40
45
50
55
3.1
3.2
3.3
3.4
3.5
3.6
3.7
Interlayer separation [A ]
Figure 8.5: Tight-binding interlayer energy for AAA and ABA stacking, compared
to Girifalcos LJ model
AAA
ABA
25
30
35
2
40
1
45
50
2
0
1.5
0.5
X [A]
1
0
0.5
1.5
Y [A]
(a)
(b)
Figure 8.6: (a) Graphite energy landscape (b) ABA graphite showing corresponding X,Y orientation
102
in this phonon frequency. We have calculated the frequency for discrete values
of interlayer spacing and plotted them in Figure 8.7 as the circled line. This is
a semi-static frozen phonon calculation done in the same manner as described
in equation (8.13). Experimental data on this phonon frequency and interlayer
spacing for the 0 to 14 GPa range has been gathered and fitted by Hanfland et al.
[62]. The dashed line in Figure 8.7 is the representation of this experimental work
and is expressed by
(z) = o
"
o o
"
z
zeq
1 +1
(8.17)
o and zeq are the STP values of the E2g (1) phonon frequency, and equilibrium
spacing (1.26 THz, and 3.35
A respectively). The other four values are fitted to
the experimental measurement of graphite under pressure from [62] ( = 10.89,
o = 35.7 GPa, o = 0.110 GPa1 , = 0.43). The top abscissa in Figure 8.7,
shows the associated pressure (with zero referencing STP) to attain the interlayer
spacing shown on the bottom abscissa. Our curve, while not falling on top of the
experimental data, shows the expected trend. A difference is expected since our
TB method doesnt calculate the electronic charge density in a self consistent way,
an issue that becomes important as the layers are squeezed together.
We have implemented our model into a molecular dynamics environment and applied it to some simple systems. The tight-binding code had an existing molecular
dynamics subroutine. The force contribution from the vdW term was added to
the existing code. Details on the MD method and the vdW force term are given
in Appendix A. Our test case was two graphene sheets with periodic boundary
conditions in only the in-plane directions. We want to check if the predicted equilibrium spacing of the graphene layers is mirrored in the MD simulation. The 0
K static interlayer energy scans for both the AAA and ABA system are shown
as the dashed and solid black lines in Figure 8.8. Their equilibrium interlayer
A respectively. This simulation was prespacings are approximately 3.4 and 3.6
104
Figure 8.7: Hydrostatic pressure effects on the E2g (1) shear mode. STP is referenced as zero pressure. Dashed line is experimental data from [62], circled line is
our TBD prediction
formed at 600 K for 1500 femtoseconds with a time step of 0.1 fs. We ran this MD
without the vdW contribution, and as expected, saw the layers repel each other.
Turning on the vdW force we ran the simulation again starting with the layers
at 3.7
A and in a slightly offset ABA stacking. The red line in Figure 8.8 plots
the average distance between the layers as a function of time, which is shown on
the right-hand ordinate. As expected the layers oscillate around the 0 K predicted
equilibrium. Measuring the average time between peaks we estimate the frequency
of this oscillation is 2.27 THz.
With this successful fitting of our TB+disp model for graphite and testing in a
molecular dynamics simulation we are now able to examine the interaction between
carbon nanotubes.
105
Figure 8.8: Static 0 K energy scans of two graphene layers in AAA and ABA
orientations in black and the results of a molecular dynamics simulation showing
the oscillation around the equilibrium interlayer spacing in red
106
9
TBD description of carbon nanotube
interactions
This chapter outlines the application of our optimized tight-binding plus dispersion model applied to carbon nanotubes interactions. In the introduction of this
work we discussed the motivation for modeling interactions of collections of nanotubes for various nanomechanical based applications. Here we start with the
modeling of CNTs on a graphene substrate, followed by calculations of axially
parallel CNTs taken two at a time, 7 tube bundles, multiwalled tubes, and finally
the case of a C60 outside and inside of a CNT. We discuss some generalized results
of these static energy calculations and fit a universal binding curve for graphitic
107
9.1
Tube-graphene interactions
Graphite
= 60
CNT
Figure 9.1: Birds eye view of the various lock-in orientations on graphite for a
CNT
position as a function of tube graphene separation. The tubes considered are (n,n)
type tubes of n = 5, 10, 12, and 20. The diameter of these tubes are 3.39, 6.78,
8.14, 13.56
A respectively. The tubes sections are created with the code provided
in [148], and periodic boundary conditions along the axis are implemented to
model infinitely long tubes. We use 3 unit cells for each tube resulting in tubes
with 60, 120, 144, and 240 atoms apiece. The graphene substrate is created to
satisfy proper boundary conditions and orientated in a lock-in position relative
to the tube. A k-point mesh of 7x2x2 shows a convergence of 0.1 meV for the
smallest system and is used for all tube calculations here. Figure 9.2 shows the
tube graphene interaction energy per unit length of the tube for the various tubes
orientated in one of their lock-in positions vs the wall to wall separation (dww ).
The general trend of increasing equilibrium spacing and higher binding energies
with larger tubes can be seen here. These results are summarized in the first four
rows in Table 9.1.
9.2
Tube-tube interactions
100
150
200
250
300
350
400
450
(5,5)
(10,10)
(12,12)
(20,20)
500
550
600
2.8
3.2
d
ww
3.4
3.6
[A ]
dz
dww
Figure 9.3: (12,12) and (5,5) tubes showing adopted orientational notation. is
a relative twist angle, dww is the wall to wall tube separation, and dz is a vertical
offset distance
200
205
210
215
220
2
225
0
10
15
[degrees]
20
25
30
35
dz [A ]
Figure 9.4: Tube-tube , dz landscape for two (10,10) tubes at a wall to wall
distance of dww = 3.1
A
111
dispersion model.
For all the , dww landscapes calculated the dww scans corresponding to the most
favorable and disfavorable s are taken and fit via least squares to a modified
Lennard-Jones curve (see equation (9.2)). Using these fits, a bisection method is
aa
used to calculate the equilibrium spacing (dab
ww and dww ) through the analytical
derivative of the fit. The cohesive energies are reported in energy per unit length
(Eab , Eaa ) and are evaluated with the fitted curves at the minimas. Table 9.1
reports the results of all the tube combinations of the 4 different tubes studied.
The energy scans (for Eab and dab
ww ) as a function of wall separation around the
equilibrium spacing for each tube-tube combination are shown in Figure 9.5.
Eab
Tubes
meV/
A
(5,5)kgraphene
-284.44
(10,10)kgraphene -375.12
(12,12)kgraphene -393.69
(20,20)kgraphene -495.31
(5,5)k(5,5)
-168.16
(5,5)k(10,10)
-187.83
(5,5)k(12,12)
-195.49
(5,5)k(20,20)
-208.95
(10,10)k(10,10)
-220.92
(10,10)k(12,12)
-230.61
(10,10)k(20,20)
-256.30
(12,12)k(12,12)
-241.09
(12,12)k(20,20)
-270.99
(20,20)k(20,20)
-312.34
Eaa
meV/
A
...
...
...
...
-157.03
-177.17
-182.49
-197.28
-205.24
-214.13
-240.98
-224.07
-254.34
-292.96
Eaa Eab
meV/
A
...
...
...
...
11.12
10.65
13.00
11.67
15.68
16.48
15.32
17.02
16.65
19.38
dab
ww
A
3.00
3.04
3.05
3.10
3.02
3.07
3.07
3.10
3.09
3.10
3.11
3.10
3.11
3.12
daa
ww
A
...
...
...
...
3.10
3.14
3.16
3.18
3.20
3.21
3.20
3.22
3.21
3.22
160
180
55
510
512
1010
1012
1212
520
1020
1220
2020
200
220
240
260
280
300
320
340
2.95
3.05
3.1
3.15
3.2
3.25
3.3
3.35
( ) =
a
+ c.
b
(9.1)
113
160
(55)
180
(510)
(512)
200
(520)
220
(1010)
(1012)
(1212)
240
(1020)
260
(1220)
280
300
(2020)
320
340
0.1
0.2
0.3
0.4
sum of curvature [A
0.5
0.6
Figure 9.6: Eab from Table 9.1 plotted against the sum of the curvatures of the
interacting tubes
9.3
One can nicely summarize the last two sections on CNT-CNT and CNT-graphene
interactions into a universal binding energy relation (UBER) similar to the original
work of Rose et. al [143]. An UBER curve is a systematic rescaling of the energy vs.
distance curves in a manner which shows a single curve. This has been attempted
for graphitic systems by Girifalco [53]. Using our data set of tube-graphene and
tube-tube interaction energies we first normalized the well depths by dividing
the interaction energies by their cohesive energies ((d)/|(do)| where (d) is the
energy per atom at a wall to wall separation of d and do = dab
ww the equilibrium
separation). The distance scaling utilized in Girifalcos analysis is (d )/(do )
where is a system dependent scaling parameter. In Girifalcos representation
was an adjustable parameter that was tuned such that all interaction energy curves
fell on top of one another. In his formulation the value of for two (10,10) tubes
is different than that for two graphene sheets, etc. This factor partially negates
the true universality of an UBER curve. We tried a few distance scaling methods
with the most promising being a definition of as the sum of the curvatures ( )
of the interacting bodies times a constant with units of
A2 ( = ). Figure 9.7
114
(d) =
A6 B12
+ 12 + C.
d6
d
(9.2)
The fitted parameters are A6 = 0.7198, B12 = 0.3565, and C = 0.6366 with an
associated SSE of 2.24 104 , and is shown as the solid line in Figure 9.7. This
universal binding curve is a nice summary for tube-tube-graphene interactions.
Using this fitted curve one may back out estimates of cohesive energies for other
systems that were not considered explicitly here. It should be noted that the
effects of tube flattening for d > 20
A were not considered, so this will only be
valid for the d = 0-20
A regime.
0.84
0.86
0.88
(d)/ (do)
0.9
0.92
0.94
0.96
0.98
1
1.02
0.9
0.95
1.05
(d)/(d )
1.1
1.15
115
9.4
Nanotube loops
For a comparison we have computed the ab stacking energy with Girifalcos model.
We obtained a (10,10)k(10,10) intertube cohesive energy of 1.83 eV/nm. At this
point It is interesting to consider the implications of a full classical approach in
comparison with the TB + vdW one. Consider for instance the case of a (10,10)
CNT ring with the smallest bending radius of Rbend = 0.03 m mentioned in the
introduction [113]. The ring configuration is stable because the bending energy
penalty Ubend = 0.5k(1/Rbend)2 is compensated by the van der Waals attractions.
The bending stiffness k of a single-walled CNT of radius R is k = CR3 , where C
is the in-plane CNT stiffness [170]. For a (10,10) CNT, the TB model gives C =
423 J/m2 [69]. Taking R = 0.7 nm then k = 455 1018 J nm. Next, from the
balance of resulting bending energy (Ubend = 1.58 eV/nm) and our computed vdW
attraction (i.e., 2.21 eV/nm given in Table 2) we conjuncture that the observed
CNT rings of 0.03 m in radius can be formed from (10,10) CNT of 661 nm in
length, with an overlap portion of 472 nm. On the other hand, a substantially
different estimate is obtained based on the widely used combination of classical
covalent-bonding [18] (leading to C = 236 J/m2 , k = 254 1018 J nm, and Ubend
= 0.88 eV/nm) and Girifalcos vdW treatment [53]. It follows that the smallest
observed rings can be formed from (10,10) CNTs of at least 365 nm in length with
an overlap portion of 174 nm.
9.5
Nanotube bundles
We have also performed energy scans on two seven-tube CNT bundles. The two
different bundles modeled were comprised of (10,10) and (12,12) tubes. As in
the tube-tube interactions, PBCs were employed to negate end effects and model
infinitely long tubes. A section of the bundle is shown in Figure 9.8 (a). The
tubes are packed in a hexagonal shape as shown in Figure 9.8 (b). For the (12,12)
bundle the tubes are rotated and shifted such that they are in their optimal ab
116
configurations with respect to one another, as was found above. This is possible
for the (12,12) bundle because (n,n) tubes have rotational symmetry every 2/n
radians. Since 2/12 is a multiple of /3 (i.e. half the interior angle of a hexagon)
there is a possibility of an optimal (12,12) ab bundle. We have made an educated
guess at the configuration of such a bundle by using our optimized results from
the tube-tube minimas found above. Figure 9.8 (b) illustrates the ab like configuration in which we orientate the tubes in. This possible optimal configuration
is not the case for the (10,10) bundle, which will remain partially frustrated. For
this (10,10) bundle the tubes are orientated such that they are in their optimal
ab configuration only along the horizontal axis as shown by the bold ab in Figure
9.8 (b).
The energy scans are performed by dilating the tube bundle and uniformly increasing dww with the energy points plotted in Figure 9.9. The cohesive energy and
equilibrium wall-to-wall separation are summarized in Table 9.2. Included in this
table are the results of Biro et al. whom preformed similar calculations with their
tight-binding plus dispersion model [156]. The cohesive energy per unit length of a
bundle has not to our knowledge been extracted experimentally. Measurements on
bundles comprised of hundreds of SWCNTs of average radius 6.9 0.1
A (slightly
larger than (10,10)s) have shown a wall-to-wall separation of 3.15
A [160]. This
observation, while on a slightly larger and less perfect structure, is very close to
our ideal 7 tube bundle calculations.
(10,10)
Separation Cohesive energy
A
eV/
A
Model
TB+disp
3.11
-2.479
Biro [156]
3.12
-3.201
(12,12)
Separation Cohesive energy
A
eV/
A
3.12
-2.802
3.13
-3.500
117
ab
a
a
b
a
a b
a
b
a
b
dww
ab
(a)
(b)
Figure 9.8: (a) Seven tube bundle of (12,12) tubes (b) Bundle cross section showing
wall to wall distance (dww )
2.2
(10,10)
2.3
(12,12)
2.4
2.5
2.6
2.7
2.8
2.9
3
3.05
3.1
3.15
3.2
d
ww
[A]
3.25
3.3
3.35
3.4
Figure 9.9: Energy scan on CNT bundles around their equilibrium position
118
9.6
Multiwalled tube
dww = 3.39
A
y
x
z
dz
Figure 9.10: Nested (5,5)k(10,10) CNT pair. The dz direction referred to in Figures
9.11 and 9.12 is along the tube axis labeled as z
DFT LDA approach [25]. Another first principles DFT LDA study of the rotational energy barrier of this system has been reported as 7.75 meV/
A [96]. Our
model shows more corrugation and a smaller sliding barrier than these models. In
spite of the larger contact area, the nested tube corrugation is smaller than when
they are placed side-by-side (Table 9.1). Such a reduction in the corrugation due
to the difference in radii between layers is in qualitative agreement with earlier
results [96, 131].
We have preformed similar energy scans for a (10,0)k(20,0) system which is of
comparable size to the (5,5)k(10,10) system. As expected we see a similar energy
landscape but with the high and low energy barriers for and dz switched around.
Thus making (n,0) MWCNTs better candidates for rotational bearings and (n,n)
MWCNTs a superior translational bearing. MWCNTs composed of chiral tubes
are expected to have screw like energy landscapes.
For our (5,5)k(10,10) system we have also calculated a shear modulus of 0.62 GPa,
and a rigid shear mode frequency of 57 MHz which is the analogous shear mode
of E2g (1) = 1.26 THz discussed previously.
9.7
We have not discussed the C60 molecule since the introduction but here we make
some calculations of its interaction energy within and outside of a carbon nanotube. We first do a static interaction energy scan of an infinitely long (10,10)
tube with a C60 molecule. The two are aligned to be in a ab like stacking configuration as can be seen in the left hand side of Figure 9.13. The right hand side shows
the relative displacement of the two bodies. Figure 9.14 shows the interaction
energy as a function of wall-to-wall separation. The maximum cohesive energy
is -101 meV/atom at a separation of 2.99
A. For the same system with the C60
nested in the tube we calculate an interaction energy of -560 meV/atom. There
has been discussion of a small energy barrier at the entrance to an open CNT
120
1.06
1.062
1.064
1.066
1.068
1.5
20
1
d [A]
15
10
0.5
[degrees]
Figure 9.12: Nested (5,5)k(10,10) energy landscape with TB+disp model showing
the translational energy change as the circled line
121
[163], but its interesting to note that it is 5 times more favorable energetically for
a C60 molecule to be inside of a tube than on the outside. This is supported by
the direct observation of stable peapod structures. Peapods are a CNTs filled
with C60 molecules as shown in Figure 9.15 [56].
9.8
Tube-tube MD simulation
20
30
40
50
60
70
80
90
100
110
2.5
3.5
dww [A ]
123
tubes slightly rotating and sliding into their equilibrium configuration. Instead of
this we saw a slight rotation and sliding but also a general flattening. Figure 9.16
(a) shows the before shot and (b) shows the two tubes after 1 ps. There is a slight
flattening along the tubes face.
(a)
(b)
Figure 9.16: Two (5,5) tubes (a) shows configuration at t=0, (b) shows configuration after 1 ps.
124
10
Conclusion and future work
We have undertaken the study of the model system of graphite in order to gain
insight to the nature of vdW attraction and repulsion between layers of sp2 bonded
carbon. With the data garnered from experimental insights and first principles
simulations we have developed a tight-binding plus dispersion model which accurately describes both the intra and inter-layer properties of graphite. In this
pursuit we have considered the various physical models that describe dispersion
forces and used them to calculate the C6 dispersion constant for sp2 hybridized
carbon. Our model was extended to describe the interactions between carbon
nanotubes in a molecular dynamics environment. The model was compared to
literature values of equilibrium orientations and cohesive energies of nanotubes.
125
126
A
Tight-binding + dispersion molecular
dynamics overview
mi
~i
d2 R
= F~ i ,
2
dt
(A.1)
1
R(t + dt) = R(t) + V(t)dt + dt2 A(t),
2
A(t) + A(t + dt)
.
2
(A.2)
(A.3)
Where R, V and A are the position, velocity and acceleration matrices respectively (dimensions of 3 x number of atoms). The time step is denoted by dt. The
~ i = F~ i /mi . We will discuss the force calculaaccelerations are attained through A
tions below, but now start with how to start a MD simulation and the subsequent
steps in running one.
A molecular dynamics simulation needs a few things to start, namely the initial
positions of the atoms to be simulated and initial velocities. The initial velocities
are assigned as a function of the desired temperature. This is done by assigning
random velocities with a Maxwell-Boltzmann distribution, and scaling the total
kinetic energy to reflect the desired temperature. The net momentum of the
system is also zeroed. After these setup steps are complete the program enters
the MD loop where forces are calculated and the system is evolved in time. These
128
two steps are looped for a desired time period and then the simulation is stopped.
These general steps are illustrated in a flow chart in Figure A.1.
Initial nuclear
coordinates (Ro)
Assign initial
velocity fM axwell (T )
no: t = t + dt
Is t > trun?
yes
Post process:
Stop
thermodynamic averages
A.1
Calculation of Forces
We have discussed the MD technique and have seen the need to calculate the
forces on the ions in order to evolve the system. Our system has three distinct
129
force contributions and we can start by defining the force on an ion i in the
direction where is one of the cartesian directions x, y, z.
i
Ftot,
=
Etot
Ri
(A.4)
The forces defined by the tight-binding method are of two distinct types in accordance with the definition of the energy. Equation (5.33) showed us that the
total energy was the sum of the band structure energy and the repulsive energy,
and analogously the total force is simply the sum of the band structure force
and the repulsive force. We have added a van der Waals energy term which also
contributes to the total force. We can expand equation (A.4) to reflect this
i
Ftot,
Erep EvdW
EBS
+
+
=
Ri
Ri
Ri
i
i
i
= FBS,
+ Frep,
+ FvdW,
(A.5)
The repulsive ionic and van der Waals forces can be calculated analytically and
are treated differently than the band structure forces; we will cover them first.
A.1.1
Since the repulsive and vdW terms are two body potentials with a known form
the force between two ions is expressed simply as the negative derivatives of their
energy, we call these rep and vdW . Both of these forces are simply a function of
distance between two interacting atoms. The analytical negative derivative of our
vdW energy term (equation (??)) with our fitted values substituted and expanded
is shown in equation (8.3) on the last page of this Appendix. The ionic repulsive
force is the negative derivative of its its fitted chebyshev polynomial detailed in
[137].
After finding the force between two interacting bodies it is a simple manner to
decompose the force into the cartesian directions with use of the directional cosine
130
introduced in 5.6.2.
Fi = d (d)
(A.6)
where d is the distance between the atom i and the neighbor considered and d is
the directional cosine for direction .
A.1.2
While the repulsive and vdW forces were quite easy to describe the band structure forces prove a little more difficult to conceptualize. The Hellmann-Feynman
theorem can be used to calculate the forces due to the electronic portion of the energy in a quantum mechanical manner [66, 44]. The Hellmann-Feynman theorem
posits that the change in energy of a quantum system with respect to a change of
a system dependant parameter P is given by [39].
H
dE
= h|
|i
dP
P
(A.7)
Fi =
X
h | H | i
=
Ri
Ri h |i
(A.8)
With our non-orthogonal tight-binding basis we must account for the dependance
of the overlap on the force and the term that represents the change in the expansion
coefficients with respect to the ionic coordinate, the latter term is referred to as
the Pulay force [140]. Appendix C shows the derivation of this force term and is
simply given here as
131
Fi
c
p cq
p,q
Hpq
Spq
E
Ri
Ri
(A.9)
Fi
A.2
Spq
Hpq
+ Tr
= T r
Ri
Ri
(A.10)
(A.11)
132
Figure A.2: Schematic of periodic boundary conditions. Center solid box is the
unit cell surrounded by 8 copies of itself in two dimensions. The circle sees not
only the triangle in its own box (long arrow) it also sees its images (short arrow).
133
vdW
vdw (d) = dEdd
248
(1e1.45 d 1.45 de1.45 d 1.05 d2 e1.45 d 0.51 d3 e1.45 d 0.18 d4 e1.45 d 0.05d5 e1.45 d 0.013 d6 e1.45 d )e10 d110.99
2
(e10.0 d110.99 +1) d6
134
148.8 1e
1.45 d 1.45 de1.45 d 1.05 d2 e1.45 d 0.508 d3 e1.45 d 0.18 d4 e1.45 d 0.053 d5 e1.45 d 0.013 d6 e1.45 d
10
24.8 1.010
B
Quantum Mechanics Overview
The following is a very brief review of the relevant quantum mechanical postulates
that are needed as back ground to the work contained in this document.
The quantum mechanical wave function for a single particle, (x, y, z, t) or (~r, t),
describes the temporal and spatial evolution of a quantum mechanical particle.
The product of (~r, t) (~r, t) is the probability density function of a quantummechanical particle (*denotes the complex conjugate). (~r, t) (~r, t)d is the
probability of finding a particle in the differential volume d = dxdydz. Therefore
Z
135
(B.1)
if a wave function (~r, t) fulfills equation (B.1), then (~r, t) is called a normalized
wave function. Equation (B.1) is the normalization condition and implies the fact
that the particle must be located somewhere in 3 space. Another restriction on
wave functions is that they be single valued and continuous [116]. Wave function
are easily normalized by applying a normalization constant which is found by
An =
!1/2
(B.2)
n (~r, t) = An (~r, t)
B.1
(B.3)
For every observable dynamic variable there exists a Hermitian operator in quantum mechanics. Table B.1 shows a few dynamical variables and their corresponding quantum mechanical operators.
Observable
r
p
E
Symbol
r
p
E
Operator
r
i~
i~ t
Table B.1: Position, momentum, and total energy and their corresponding oper
(B.4)
B.2
Dirac Notation
The integrals in equation (B.4) show up often in quantum mechanics and are given
a short hand notation.... and is denoted by the brackets h...i. We have introduced
the Dirac bra-ket notation for representing the integration. Equation
hi =
B.3
h|op |i
h|i
(B.5)
Time-dependant Schr
odinger equation
E=
p2
+ V (r)
2m
(B.6)
where p is the momentum, m is the particles mass and V (r) is the potential energy
as a function of position. If we replace the classic variables in equation (B.6) with
the corresponding operators from table B.1 operating on the wave function we
arrive at the single-particle time-dependant Schrodinger equation.
i~
(~r, t)
sp (~r, t)
=H
t
(B.7)
2
sp = ~ 2 + V (~r)
H
2m
(B.8)
note that there is nothing in (B.7) that accounts for spin or relativistic effects.
B.4
Time-independent Schr
odinger equation
If the potential V is independent of time, the total wave function can be separated
into temporal and spatial terms.
(~r, t) = (~r)(t)
(B.9)
(~r)i~
d(t)
sp (~r)
= (t)H
dt
(B.10)
Moving time dependant terms to the left and spatial terms to the right
1
i~ d(t)
Hsp (~r)
=
(t) dt
(~r)
(B.11)
1 d(t)
iE
=
(t) dt
~
(B.12)
sp (~r) = E(~r)
H
(B.13)
(t) = eiEt/~
(B.14)
(B.15)
Equation (B.13) is the the time-independent Schrodinger equation. The timeindependent Schrodinger equation is in the form of an eigenvalue equation.
motivation for solving. Solving the Schrodinger equation for the wave function
of a general system of electrons and nuclei, leads the modeler to any property of
interest. This is of course great interest to the scientist and philosopher. But
actually solving the Schrodinger is a beast of a problem.
B.5
General Hamiltonian
So far we have been looking at a single particle problem with no reference to the
type of particle or the type of potential. In a condensed matter system we are
dealing with collections of many electrons and nuclei. In order to describe this
general system we must define the many particle hamiltonian, which in a simplified
form is
(B.16)
~ 1, R
~ 2, . . . , R
~I}
where r = {~r1 , ~r2 , . . . , ~ri } are the electron coordinates, and R = {R
are the nuclear coordinates.
P I
is the kinetic energy of the nuclei
Tn (R) = ~2
2MI
2
~2
Te (r) = 2m
e
P
i
VeN (r, R) =
VN N (R) =
Vee (r) =
1
2
1
2
P ZI e 2
1
2
Ii
I6=J
i6=j
e2
Rij
RIi
ZI ZJ e 2
RIJ
~ 1, R
~ 2, . . . , R
~I)
= (~r1 , ~r2 , . . . , ~ri , R
B.6
(B.17)
Complexity
Solving the Schrodinger equation for this system and describing the eigenstates of
this many body problem is enormously complex. The complexity of this problem
is somewhat elucidated by walking through the analytic solution for the simplest
atomic system, the hydrogen atom, see [116]. In fact this is the only atomic system
that can be treated analytically, moving up to helium, or even a hydrogen dimer
proves impossible to solve. In order to make these and other more complex systems
tractable within a quantum mechanical treatment, a series of approximations must
be applied, the two most important are the Born-Oppenheimer and one-electron
approximation.
B.7
Born-Oppenheimer approximation
The Born-Oppenheimer approximation underlies almost all atomistic models. Colloquially it is explained by considering the nuclei electron mass ratio (which is
approximately 1820 : 1). At any given time the nuclei are moving at a much
slower pace relative to the electrons. The electronic wave function is assumed to
instantaneously (adiabatically), adjust to any nuclear movement. This thinking
lead Born and Oppenheimer to posit a decoupling of electronic and ionic degrees
of freedom [14].
140
nuclei
electrons
}|
{
z }| { z
({ri}, {RI }) ({Ri }) ({ri }; {RI })
(B.18)
Here we see that the electronic wave function depends on the electronic coordinates
and parametrically on the nuclear coordinates, denoted by the semi-colon. The
electronic hamiltonian for the electronic wave function is
(B.19)
The nuclear part of the total wavefunction ({Ri }) is not typically solved via
the Schrodiger equation. Rather the nuclei are usually treated as classic particles
coulombically interacting with one another. This approximation is good if the de
Broglie wavelength () of the nuclei is much less then the average nearest neighbor
distance (a), where the de Broglie wavelength is given as
2~2
MkB T
(B.20)
B.8
One-Electron Approximation
B.9
(B.21)
Variational Method
o = o o
H
(B.22)
where o and o are the exact ground state wave function and energy. Multiplying
both sides of equation (B.22) by o then integrating over space and solving for
the ground state energy we get
o =
oi
ho |H|
ho |o i
(B.23)
trial o
(B.24)
The equality holds only if = o . The trial function may contain variational
parameters. A particularly intuitive and productive trial function consists of a
linear combination of atomic orbitals (LCAO). The trial function generally takes
142
the form
n
X
ci i
(B.25)
i=1
Where n is the number of basis functions, the cs are the n unknown expansion
coefficients and i are atomic like orbitals. The goal is to minimize trial and thus
attain the best emulation of the true wave function for the assumed basis. This is
done by by the method of variations and leads to n simultaneous linear equations
of the form
trial
=0
cn
B.10
(B.26)
Extended example
By walking through an explicit example of this problem we will see the methodology and elucidate its transformation into a generalized eigenvalue problem. To
keep things simple we will consider the simplest non-trivial trial function. This
wave function has the form
= c1 1 + c2 2
(B.27)
Taking this trial wave function and plugging it into equation (B.23).
trial
R
1 1 + c2 2 )d
(c1 1 + c2 2 )H(c
R
=
(c1 1 + c2 2 )(c1 1 + c2 2 )d
(B.28)
trial =
(B.29)
Hij and Sij are referred to as Hamiltonian and overlap matrix elements respectively, and are are defined as
Hij =
Sij =
j d = hi |H|
ji
i H
(B.30a)
i j d = hi|j i
(B.30b)
trial =
n
P
i,j=1
n
P
ci cj Hij
i,j=1
(B.31)
ci cj Sij
n
P
n
P
cj Hij
ci cj Hij
n
X
trial
j=1
j=1
= P
cj Sij = 0
2
n
n
ci
P
j=1
ci cj Sij
ci cj Sij
j=1
(B.32)
j=1
Note that this leads to n equations and the sums is now over j only. Rearranging
n
P
and multiplying both sides by
ci cj Sij
j=1
n
X
cj Hij =
j=1
n
P
j=1
n
P
j=1
ci cj Hij
n
X
cj Sij
(B.33)
ci cj Sij j=1
Notice that the fraction on the right hand side of equation (B.33) is simply the
energy expectation value from equation (B.23).
n
X
cj Hij = trial
j=1
n
X
j=1
144
cj Sij
(B.34)
At this point to put the equations in a more understandable form we can expand
them with our sample basis function. These n = 2 equations are
(B.35a)
(B.35b)
(we could show how the rearrangement of these equations leads to the secular
determinant, and then substituting in to find the as) Putting these equations in
matrix format shows the form of the generalized eigenvalue problem.
H11
H21
H12 c1
S11 S12 c1
= trial
c2
S21 S22
c2
H22
Hc = Sc
(B.36)
(B.37)
Equation B.37 is simply a generalized eigenvalue problem. H and S are the Hamiltonian and overlap matrices, and c is a column vector of the expansion coefficients.
There will be as many eignevalues and eigenvectors as there are terms in the expansion n.
It is now clear how minimizing the energy function by the method of variations is
equivalent to solving the generalized eigenvalue problem.
145
C
Non-orthogonal Hellmann-Feynman
forces
C.1
Model Review
We begin this derivation with a little restatement of the nuts and bolts of the
system we are working with. Since we are working with in the framework of the
variational principle we write our wave functions as linear combinations as
146
cp p
(C.1)
The wave-function corresponds to state , the index p in our case goes over
number of electrons size of basis. cp is just the element of the eigenvector
corresponding to the th state. Also keep in mind the non-orthogonality of our
basis and the definition of the overlap matrix elements.
hp |q i = Spq
(C.2)
After the diagonalization of the generalized eigenvalue problem we have all necessary information to proceed with calculating the forces via the Hellmann-Feynman
theorem which follows.
C.2
Force derivation
E = h |H|i
(C.3)
E =
c
p cq Hpq
(C.4)
p,q
Taking the partial derivative with respect to a cartesian direction = (x, y, z),
for a particular atom i (Ri ) we get
h |H|i
E
=
Ri
Ri
147
(C.5)
Hpq
=
Hpq cq + cp
c + cp Hpq i
Ri
Ri
Ri q
R
p,q
(C.6)
X
p,q
c
p
X cq X
X
Hpq X c
p
+
c
+
H
c
Hpq c
pq
q
p
i
i
Ri q
R
R
q
p
q
p
(C.7)
Shows that the two terms underlined are simply the left hand side of equation
(B.34) repeated here for convenience
Hpq cp = E
Spq cp
(C.8)
Switching these underlined terms with their counter parts just noted leaves us
with
X
X cq
X
E X Hpq X c
p
=
c
c
+
E
E
S
c
+
Spq c
pq
p
q
p
i q
i
i
Ri
R
R
R
p,q
p
q
q
p
(C.9)
Consider the last two terms on the right hand side. Moving the sums to the left
and isolating the expansion coefficients and their derivatives leaves
E X Hpq X
=
cp
c +
E Spq
Ri
Ri q
p,q
p,q
!
cq
c
p
c +
c
Ri q Ri p
{z
}
|
(C.10)
j
c
p cq
Ri
As noted by the underbrace the term in the brackets is simply the partial derivative
of the product of the expansion coefficient and its complex conjugate with respect
to the ionic degree of freedom. Now putting the double sum out front we have a
148
p,q
(C.11)
hi |j i = ij
(C.12)
Considering the non-trivial example (i.e. i=j), and using the definition of the
wave-function given in equation (C.1) we have
h | i =
c
p cq Spq = 1
(C.13)
p,q
Now taking the partial derivative of the overlap with respect to the ionic coordinate
Ri , and not fully expanding the term in the expansion coefficients we have
#
"
X c
S
pq
p cq
Spq + c
=0
p cq
i
R
Ri
p,q
Rearranging this we have
149
(C.14)
X
p,q
Spq
X
c
p cq
Spq
=
c
p cq
Ri
Ri
p,q
(C.15)
Taking a look back at equation (C.11) we see that we can replace the partial
over the expansion coefficients with the partial over the overlap matrix elements.
Making this substitution equation (C.11) is now written as
E X Hpq
Spq
=
cp
c E cp cq
Ri
Ri q
Ri
p,q
(C.16)
E X Hpq
Spq
=
cp cq
E
Ri
Ri
Ri
p,q
(C.17)
or more compactly
These Hamiltonian and overlap derivatives are evaluated via a finite difference
method in our code. We can now express the force on atom i in the direction
with the negative of the derivative of the energy as
Fi
X X
E
Hpq
Spq
f
cp cq
f i =
E
i
R
R
Ri
p,q
(C.18)
p,q =
f c
p cq
(C.19)
E f c
p cq
(C.20)
p,q =
150
Fi
Spq
Hpq
+ Tr
= T r
Ri
Ri
151
(C.21)
D
Multipole Expansion
We seek a power series expansion of the Columbic energy between all charges in
one molecule and those in another. This expansion is known as the multipole
expansion and we present it here in the formalism of Margenau [112].
Molecule A has i charged particles with position ~ri and charge qi with a origin at
its nucleus, and Molecule B has j charged particles with position ~rj and charge qj
~ Components of
with a origin at its nucleus. The vector between the nuclei is R.
~ are X, Y and Z and for the charge coordinates the components of
the vectors R
~rj will be referenced as xj , yj and zj .
The total charge on molecules A and B is simply the sum of their charges.
152
~ri
~r1
~rj
~
R
~r2
~r1
~r2
q=
qi
(D.1)
qj
(D.2)
q =
The prime denotes molecule 2 were as no prime is simply molecule 1. This is used
through out this derivation.
The dipole vector is defined as
p1 =
qi~ri
(D.3)
p1z =
qi zi
(D.4)
With these definitions of the system we can start by first defining the Columbic
potential seen at the origin of molecule 2 due to the charges in molecule 1. This
potential can be written as
153
X
i
qi
~ ~ri |
|R
(D.5)
!
1X
1 XX
Y X
ZX
=
qi + 2
qi xi +
qi yi +
qi zi +
r i
r
r i
r i
r i
"
X
X
X
1 3Y 2
1 3Z 2
1 1 3X 2
2
2
1
1
1
qi xi +
qi yi +
qi zi2 +
2
2
r3 2
r2
2
r
2
r
i
i
i
#
3XY X
3XZ X
3Y Z X
q
x
y
+
q
x
z
+
qi yi zi + . . . h.o.t s (D.6)
i
i
i
i
i
i
2
2
2
r
r
r
i
i
i
~
r is the magnitude of the vector R.
Now we write the energy terms associated with each charge in molecule 2 interacting with the potential developed in the previous equation. This is written
as
V =
X
j
~ + ~rj |)
qj (|R
(D.7)
V =
X
j
qj +
X
j
+
+ yj
+ zj
qj xj
X
Y
Z
2
2
1 X
qj (x2j
+
.
.
.
+
2y
z
) + . . . h.o.t s (D.8)
i i
2! j
X 2
Y Z
This expression can be simplified considerably if one picks the internuclear axis
154
along one of the cartesian axis. We will do this and state that Z = r and X =
Y = 0. Equation (D.9) is the form and the terms are given in table D.1.
V =
Pn
(D.9)
n
1
2
3
4
5
6
7
..
.
Pn
1
r 4
P
ij
r qq
r 2 (q pz qpz )
r 3 (q w3 + qw3 )
r 4 (q w4 + qw4 )
r 5 (q w5 + qw5 )
P
r 3 qi qj (xi xj + yi yj 2zi zj )
ij
wn
wn
qi rin Pn (cosi )
(D.10)
qj rjn Pn (cosj )
(D.11)
155
E
Perturbation Theory
(E.1)
The eigenstates |ni forms a complete orthonormal basis. The perturbation in this
derivation will be done for the ground state energy and wave functions n = 0, but
the result is true for any n.
If this system is slightly perturbed by for example an external field it is assumed
156
that the solution is only slightly different that the unperturbed solution.
We introduce which is....
= 0 H
(0) + 1 V (1) + 2 V (2) + . . .
H
(0)
(1)
(0)
(1)
(2)
0 = 0 0 + 1 0 + 2 0 + . . .
(2)
E0 = 0 E0 + 1 E0 + 2 E0 + . . .
(E.2)
(E.3)
(E.4)
Now we use these definitions in the Schrodinger equation H0 = E0 0 , and grouping terms by in their contribution in i we get
(0)
(0) (0)
0 H (0) 0 E0 0 +
(0) (1) + V (1) (0) E (0) (1) E (1) (0) +
1 H
0
0
0
0
0
0
(1)
(0)
(0) 0(2) E0(1) 0(1) E0(0) 0(2) E0(2) 0(0) + . . . = 0
1 V (1) 0 + V (2) 0 + H
(E.5)
Each term in this series has to be zero individually because of the arbitrariness of
the variable i . This can be written as i equations, which are just each term in i
equating to zero.
(E.6)
(0)
(1)
(1)
(0)
(0)
(1)
H E0 0 = E0 V
0
(E.7)
157
(0) E0(0) 0(2) = E0(2) V (2) 0(0) + E0(1) V (1) 0(1)
H
E.1
(E.8)
(1)
0 =
an n(0) =
an |ni
(E.9)
X
n
(0) E0(0) |ni = E0(1) V (1) |0i
an H
(E.10)
X
n
(0) |ni
an H
X
n
(0)
an E0 |ni
(E.11)
The first term is simply the left hand side of the the Schrodinger equation summed
over all states n and can be substituted with the right hand side of the Schrodinger
equation summed over all states
X
n
(0) |ni =
an H
X
n
an En(0) |ni
(E.12)
X
n
an
En(0)
(0)
E0
|ni =
(1)
E0
(1)
|0i
V
(E.13)
X
n
(1)
(0)
an En(0) E0 h0|ni = E0 h0|0i h0|V (1) |0i
(E.14)
Considering the left hand side first, we note that because of the orthonormality of
the set
ha|bi = ab
(E.15)
The only remaining non-zero term due to orthogonality in the sum is when n = 0,
(0)
but the difference of the eigenvalue E0 with its self is of course zero, leaving the
whole left hand side zero. The right hand side of (E.14) is then rearranged and
we see that the first order correction to the energy is
(1)
E0 = h0|V (1) |0i
(E.16)
(1)
(1)
E0 = H00
(E.17)
This is simply the expectation value of the perturbation acting on the unperturbed
state |0i.
E.2
We have in the previous section proposed the form for the corrected wavefunction
as a linear combination of the unperturbed state, see equation (E.9). We now
need to find a definition for the expansion coefficients an .
Starting with equation (E.13), we multiply both sides by the bra hk|.
159
X
n
(1)
(0)
an hk| En(0) E0 |ni = E0 V (1) |0i
(E.18)
Again invoking the orthonormality constraint shown in equation (E.15), the only
surviving term of the sum in the previous equation is the k th
(0)
(0)
(1)
ak Ek E0
= E0 hk|0i hk|V (1) |0i
(E.19)
now considering the case when k 6= 0, the first term on the right hand side is zero
and we can solve the previous equation for ak .
(1)
ak =
Hk0
(0)
(E.20)
(0)
E0 Ek
writing the
(1)
0 =
X
k
(1)
Hk0
(0)
E0
(0)
Ek
(0)
(E.21)
E.3
following an analogous process for the first order correction to the energy we define
the second order correction to the wavefunction again as a linear combination of
the unperturbed state.
(2)
0 =
bn n(0) =
X
n
bn |ni
160
(E.22)
(0) E0(0) 0(2) = E0(2) V (2) 0(0) + E0(1) V (1) 0(1)
H
X
n
bn
X (1)
(2)
(0)
(1)
(2)
(0)
|ni
|0i +
an E0 V
H E0 |ni = E0 V
(E.23)
X
n
(0) E (0) h0|ni =
bn H
0
(2)
X
n
(1)
an h0| E0 V (1) |ni (E.24)
The left hand side is zero via the same argument used in the first order correction
to the energy. The first term on the right is simply the second order correction to
the energy because of orthonormality. The last term on the right hand side can
be simplified significantly. First lets take it apart by isolating the n = 0 term in
the sum and taking the sum over everything but zero.
X
n
(1)
(1)
|ni =
an h0| E0 V
a0
|
(1)
E0
X
X
(1)
(1)
h0| V |0i +
an h0| V (1) |ni (E.25)
an E0 h0|ni
{z
}
n
{z
}
|n
=0
=0
With these simplifications we can rewrite equation (E.24) and solve for the second
order correction to the energy.
161
(2)
(E.26)
Using the the definition of the expansion coefficients derived earlier and stated in
equation (E.20) and the shorthand matrix notation equation (E.26) can be stated
as
(2)
E0
(2)
V00
X
n
(1)
Vn0
(0)
E0
(1)
(0)
En
V0n
(E.27)
(1) (1)
(1)
If the operator V is hermitian than Vn0 V0n = |V0n |2 and we can write our final
(2)
E0
(2)
V00
X
n
162
(1)
|V0n |2
(0)
(0)
E0 En
(E.28)
F
Optimization
A major part of this work was in the fitting of the van der Waals energy term. This
process utilized the hybrid method of Powell as implemented in the MINPACK
library from Argonne National Laboratory. Here we give the basic outline of how
this method works. First we start with the general steps to a minimization routine.
~
x
<
In the steepest decent minimization algorithm the search direction is simply the
gradient of the function evaluated at the initial guess. This is followed by a line
minimization to find k . This procedure guarantees that the gradient at the
~k . This makes this
new point ~xk+1 is conjugate to the previous search direction S
method simple in that the search direction is known and preforming step 3 can be
done analytically with an assumed quadratic form. Yet this method is inefficient
and has very slow convergence. A more robust search algorithm is the conjugate
gradient search algorithm which is basically the same but finds its search directions
in another fashion.
We can understand this method through an example. Imagine that we have preformed the first step of the steepest descent method and the new position is ~xk+1 .
~k , ~u. We want to find a new search direction
We will call the old search direction, S
~v such that the gradient remains conjugate to ~u as we move along this new
direction ~v [139]. We can find this direction by satisfying this condition
0 = ~u A ~v
(F.1)
Were A is the Hessian matrix of the function at the current solution vector ~xk .
This is defined by
2
[A]ij =
~xi ~xj
(F.2)
~
x=~
xk
ods which both avoid computing the Hessian matrix. Another conjugate gradient
scheme is Powells method [45]. Starting with a set of directions ~ui initialized to
the basis vectors
~ui = ei i = 1 . . . N
(F.3)
N is the dimension of the system. The recipe for this algorithm is the following
steps.
Powell proved that it takes N iterations of the above method and N(N + 1) total
line minimizations to exactly minimize a quadratic form.
165
Bibliography
[9] L. Benedict, N. Chopra, M.L. Cohen, A. Zettl, S. Louie, and V Crespi, Microscopic determination of the interlayer binding energy in graphite, Chemical Physics Letters 286 (1998), no. 5, 490496.
[10] L. Bergstrom, Hamaker constants of inorganic materials, Advances in Colloid and Interface Science 70 (1997), 125169.
[11] J.D. Bernal, Proceeding of the royal society of london series A 160 (1924),
749.
[12] D.S. Bethune, C.H. Klang, M.S. de Vries, G. Gorman, R. Savoy, J. Vazquez,
and R. Beyers, Cobalt-catalysed growth of carbon nanotubes with singleatomic-layer walls, Nature 363 (1993), 605607.
[13] O.L. Blakslee, D.G. Proctor, E.J. Seldin, G.B. Spence, and T. Weng, Elastic constants of compression-annealed pyrolytic graphite, Journal of Applied
Physics 41 (1970), no. 8, 33733382.
[14] M. Born and R. Oppenheimer, Zur quantentheorie der molekeln, Ann. Phys.
84 (1927), 457.
[15] D. Bozovic, M. Bockrath, J.H. Hafner, C.M. Lieber, H. Park, and M. Tinkham, Plastic deformations in mechanically strained single-walled carbon
nanotubes, Physical Review B 67 (2003), no. 3, 033407.
[16] R.O. Brennan, The interlayer binding in graphite, Journal of Chemical
Physics 20 (1952), no. 1, 4048.
[17] D.W. Brenner, O.A. Shenderova, J.A. Harrison, S.J. Stuart, B. Ni, and S.B
Sinnott, A second-generation reactive empirical bond order (rebo) potential
energy expression for hydrocarbons, Journal of Physics: condensed matter
14 (2002), no. 4, 783802.
[18] D.W. Brenner, O.A. Shenderova, J.A. Harrison, S.J. Stuart, B. Ni, and S.B.
Sinnott, A second-generation reactive empirical bond order (rebo) potential
167
Christian
Burger,
Figure
of
rolled
nanotube,
http://ms.cc.sunysb.edu/ cburger.
[22] W.D. Callister, Materials science and engineering an introduction, 6 ed.,
Wiley, 2003.
[23] J.C. Charleir, X. Gonze, and J.P. Michenauid, Graphite interplanar bonding:
electronic delocalization and van der waals interaction, Europhysics letters
28 (1994), no. 6, 403408.
[24] J.-C. Charlier, X. Gonze, and J.-P. Michenaud, First-principles study of the
stacking effect on the electronic properties of graphite(s), Carbon 32 (1994),
no. 2, 289 299.
[25] J.-C. Charlier and J.-P. Michenaud, Energetics of multilayered carbon
tubules, Physical Review Letters 70 (1993), no. 12, 18581861.
[26] L. Chen, C.L. Cheung, P.D. Ashby, and C.M. Lieber, Single-walled carbon nanotube afm probes: Optimal imaging resolution of nanoclusters and
biomolecules in ambient and fluid environments, Nano Letters 4 (2004),
no. 9, 17251731.
[27] B. Chu, Molecular forces, Interscience publishers, 1967.
168
[28] J.F. Colomer, J.M. Benoit, C. Stephan, S. Lefrant, G. Van Tendeloo, and
J. B.Nagy, Characterization of single-wall carbon nanotubes produced by ccvd
method, Chemical Physics Letters 345 (2001), 1117.
[29] J. Cumings and A. Zettl, Low-friction nanoscale linear bearing realized from
multiwall carbon nanotubes, Science 289 (2000), no. 5479, 602604.
[30] R.R. Dagastine, D.C. Prieve, and L.R. White, Calculations of van der waals
forces in 2-dimensionally anisotropic materials and its application to carbon
black, Journal of Colloid and Interface Science 249 (2002), 7883.
[31] D.P. DiVincenzo, E.J. Mele, and N.A.W, Density-functional study of interplanar binding in graphite, Physical Review B 27 (1983), no. 4, 24582469.
[32] T. Dumitrica and R. Allen, Femtosecond-scale response of gaas to ultrafast
laser pulses, Physical Review B 66 (2002), no. 081202, 14.
[33] T. Dumitrica, A. Burzo, Y. Dou, and R.E. Allen, Response of si and insb to
ultrafast laser pulses, Physica Status Solidi (B) 10 (2004), no. 2331.
[34] T. Dumitrica, M. Hua, and B.I. Yakobson, From the Cover: Symmetry-,
time-, and temperature-dependent strength of carbon nanotubes, PNAS 103
(2006), no. 16, 61056109.
[35] T. Dumitrica and R.D. James, (unpublished).
[36] I.E. Dzyaloshinskii, E.M. Lifshitz, and L.P. Pitaevskii, The general theory
of van der waals forces, Adv. Phys. 10 (1961), no. 38, 165209.
[37] T.W. Ebbesen and P.M. Ajayan, Large-scale synthesis of carbon nanotubes,
Nature 358 (1992), 220221.
[38] M. Elstner, P. Hobza, T. Frauenheim, S. Suhai, and E. Kaxiras, Hydrogen bonding and stacking interactions of nucleic acid base pairs: A densityfunctionsl-theory based treatment, Journal of Chemical Physiscs 114 (2001),
no. 12, 51495155.
169
[39] S.T. Epstein, The force concept in chemistry, Van Nostrand Reinhold Company, 1981.
[40] H. Eschrig, Readily computable expressions for lcao two-centre integrals over
slater-type orbitals with arbitrarily high quantum numbers, Physica Status
Solidi (B): Basic Research 96 (1979), no. 1, 329 342.
[41] M. R. Falvo, J. Steele, R. M. Taylor, and R. Superfine, Gearlike rolling motion mediated by commensurate contact: carbon nanotubes on hopg, Physical Review B 62 (2000), no. 16, 1066510667.
[42] M. R. Falvo, R. M. Taylor, A. Helser, V. Chi, F.P. Brooks, S. Washburn,
and R. Superfine, Nanometre-scale rolling and sliding of carbon nanotubes,
Nature 397 (1999), 236238.
[43] M.R. Falvo, G.J. Clary, R.M. Taylor, V. Chi, F.P. Brooks, S. Washburn, and
R. Superfine, Bending and buckling of carbon nanotubes under large strain,
Nature 389 (1997), 582584.
[44] R. P. Feynman, Forces in molecules, Physical Review 56 (1939), no. 340.
[45] R. Fletcher, Powell, and M.J. Davidson, A rapid convergent descent method
for minimization, Computer Journal 6 (1964), no. 2.
[46] Rene Franchy, Surface and bulk photochemistry of solids, Rep. Prog. Phys.
61 (1998), no. 41, 691753 (English).
[47] R.E. Franklin, The structure of graphitic carbons, Acta Crystallographica 4
(1951), 253261.
[48] R.H. French, Origins and applications of london dispersion forces and
hamaker constants in ceramics, Journal of the American Ceramic Society
83 (2000), 21172146.
170
[49] M.E. Garcia, T. Dumitrica, and H.O. Jeschke, Laser induced coherent
phonons in graphite and carbon nanotubes: model and simulations, Applied
Physics A 79 (2004), 855857.
[50] W. B. Gauster and I. J. Fritz, Pressure and temperature dependences of
the elastic constants of compression-annealed pyrolytic graphite, Journal of
Applied Physics 45 (1974), no. 8, 33093314.
[51] Maohui Ge and Klaus Sattler, Vapor-condensation generation and stm analysis of fullerene tubes, Science 260 (1993), no. 5107, 515 518.
[52] P. Giannozzi, R. Car, and G. Scoles, Oxygen adsorption on graphite and
nanotubes, Journal of Chemical Physics 118 (2003), no. 3, 10031006.
[53] L.A. Girifalco, M. Hodak, and R.S. Lee, Carbon nanotubes, buckyballs, ropes
and a universal graphitic potential, Physical Review B 62 (2000), no. 19,
1310413110.
[54] L.A. Girifalco and R.A. Lad, Energy of cohesion, compressibility, and the potential energy functions of the graphite system, Journal of Chemical Physics
25 (1956), 693.
[55] J.S. Graves, Electronic and structural response of semiconductors to ultraintense laser pulses, Ph.D. thesis, Texas A&M University, 1997.
[56] L. Guan, K. Suenaga, Z. Shi, Z. Gu, and S. Iijima, Direct imaging of the
alkali metal site in k-doped fullerene peapods, Physical Review Letters 94
(2005), no. 4.
[57] T. Guo, P. Nikoleav, A.G. Rinzler, D. Tomanek, D.T. Colbert, and R. Smalley, Self-assembly of tubular fullerenes, Journal of Physical Chemistry 99
(1995), 1069410697.
[58] J.H. Hafner, C.L. Cheung, and C.M. Lieber, Growth of nanotubes for probe
microscopy tips, Nature 398 (1999), 761762.
171
172
[68] J. Hepburn and G. Scoles, A simple but reliable method for the prediction of
intermolecuar potentials, Chemical Physiscs Letters 36 (1975), no. 4, 451
456.
[69] E. Hernandez, C. Goze, P. Bernier, and A. Rubio, Elastic properties of c
and bx cy nz composite nanotubes, Physical Review Letters 80 (1998), no. 20,
45024505.
[70] T. Hertel, R.E. Walkup, and P. Aouris, Deformation of carbon nanotubes by
surface van der waals forces, Physical Review B 58 (1998), 1387013873.
[71] P. Hohenberg and W. Kohn, Inhomogeneous electron gas, Physical Review
136 (1964), no. 3B, 864871.
[72] D. Hough and L.R. White, The calculation of hamaker constants from lifshitz
theory with applications to wetting phenomena, Advances in Colloid and
Interface Science 14 (1980), 341.
[73] M. Hua and T. Dumitrica, (unpublished).
[74] S. Iijima, Helical microtubules of graphitic carbon, Nature 354 (1991), 5658.
[75] S. Iijima, P. M. Ajayan, and T. Ichihashi, Growth model for carbon nanotubes, Physical Review Letters 69 (1992), no. 21, 31003103.
[76] S. Iijima and T. Ichihashi, Single-shell carbon nanotubes of 1-nm diameter,
Nature 363 (1993), 603605.
[77] Carbon
Nanotechnologies
Inc.,
Company
website
accessed
6-2-06,
www.cnanotech.com.
[78] Nantero Inc., Company website accessed 6-17-06, www.nantero.com.
[79] E.R. Johnson and A.D. Becke, A post-hartree-fock model of intermolecular
interactions, The Journal of Chemical Physics 123 (2005), no. 024101, 17.
173
174
[91] W. Kohn, Y. Meir, and D. Makarov, Van der waals energies in density
functional theory, Physical Review Letters 80 (1998), no. 19, 41534156.
[92] W. Kohn and L.J. Sham, Self-consistent equations including exchange and
correlation effects, Physical Review 140 (1965), no. 4A, 11331138.
[93] A. Koide, W.J. Meath, and A.R. Allnatt, Second order charge overlap effects and damping functions for isotropic atomic and molecular interactions,
Chemical Physics 58 (1981), 105119.
[94] A.N. Kolmogorov and V.H. Crespi, Smoothest bearings: Interlayer sliding
in multiwalled carbon nanotubes, Physical Review Letters 85 (2000), no. 22,
47274730.
, Nanotube-substrate interactions: distinguishing carbon nanotubes by
[95]
, Registry-dependant interlayer potential for graphitic systems, Physical Review B 71 (2005), no. 235415, 16.
[97] H.L. Kramer and D.R. Herschbach, Combination rules for van der waals
force constants, The Journal of Chemical Physics 53 (1970), no. 7, 2792
2800.
[98] G. Kraus, Progress report no. 2, Army air force contract af33(610) 231,
University of Cincinnati, 1953.
[99] H.W. Kroto, J.R. Heath, S.C. OBrien, R.F. Curl, and R.E. Smalley, c60 :
Buckminsterfullerene, Nature 318 (1985), 162163.
[100] Y.K. Kwon and D. Tomanek, Orientational melting in carbon nanotube
ropes, Physical Review Letters 84 (2000), no. 7, 14831486.
[101] Naval
Research
crystalline
Laboratory,
calculations:
Pre
defined
hexagonal
www.nrl.navy.mil/bind/kpts/hex/index.html.
175
k-point
lattices,
sets
for
http://cst-
[102] Dieter Langbein, Theory of van der waals attraction, Springer Tracts in
Mondern Physics, vol. 72, Springer-Verlag, 1974.
[103] C. D. Latham, M. Haugk, R. Jones, T. Frauenheim, and P. R. Briddon,
Density-functional calculations of carbon diffusion in gaas, Physical Review
B 60 (1999), no. 15117.
[104] S.B. Legoas, V.R. Coluci, S.F. Braga, P.Z. Coura, S.O. Dantas, and D.S
Galv
ao, Gigahertz nanomechanical oscillators based on carbon nanotubes,
Nanotechnology 15 (2004), 184189.
[105] J.L. Li, J. Chun, N.S. Wingreen, R. Car, I.A. Aksay, and D. Saville, Use of
dielectric functions in the theory of dispersive forces, Physical Review B 71
(2005), no. 235412, 16.
[106] E.M. Lifshitz, The theory of molecular attractive forces between solids, Sov.
Phys. JETP 2 (1956), 7383.
[107] F. London, Zur theorie und systematic der molekularkrafte, Z. Phys. 63
(1930), 245279.
[108]
[109] M.J. Lopez, A. Rubio, J.A. Alonso, L.C. Qin, and S. Iijima, Novel polygonized single-wall carbon nanotube bundles, Physical Review letters 86 (2001),
no. 14, 30563059.
[110] J. Mahanty and B.W. Ninham, Dispersion forces, Academic Press, 1976.
[111] G.C. Maitland, M. Rigby, E.B. Smith, and W.A. Wakeham, Intermolecular
forces: Their origin and determination, Clarendon Press, 1981.
[112] H. Margenau and N.R. Kestner, Theory of intermolecular forces, Pergamon
Press, 1969.
176
[113] R. Martel, H.R. Shea, and P. Avouris, Rings of single-walled carbon nanotubes, Nature 398 (1999), 299.
[114] R.M. Martin, Electronic structure: basic theroy and practical methods, Cambridge University Press, 2005.
[115] J. McMurry, Organic chemistry, 6 ed., Brooks and Cole, 2004.
[116] Donald McQuarrie and John Simon, Physical chemistry: a molecular approach, Viva Books, 2005.
[117] E.J. Meijer and M. Sprik, A density-functional study of intermolecular interactions of benzene, Journal of Chemical Physics 105 (1996), no. 19, 8684
8689.
[118] Kenneth J. Miller, Additivity methods in molecular polarizability, Journal of
the American Chemical Society 112 (1990), no. 23, 8533 8542.
[119] T. Mishina, K. Nitta, and Y. Masumoto, Coherent lattice vibration of interlayer shearing mode of graphite, Physical Review B 62 (2000), no. 4,
29082911.
[120] H.J. Monkhorst and J.D. Pack, Special points for brillouin-zone integrations,
Physical Review B 13 (1976), no. 5188-5192.
[121] M. Monthioux and V.L. Kuznetsov, Who should be given the credit for the
discovery of carbon nanotubes?, Carbon 44 (2006), 16211623.
[122] J. J. More, B. S. Garbow, and K. E. Hillstrom, User guide for MINPACK-1,
Tech. Report ANL-80-74, Argonne National Laboratory, Argonne, IL, 1980.
[123] M Morin, N. J. Levinos, AL. Harris, and M. A. Gomez, Vibrational energy
transfer of CO/Cu(100): Nonadiabatic vibration/electron coupling, Journal
of Chemical Physics 96 (1992), 39503956.
177
[124] M.B. Nardelli, B. I. Yakobson, and J. Bernholc, Brittle and ductile behavior
in carbon nanotubes, Physical Review Letters 81, no. 21, 46564659.
[125] G.C. Nasreen, L.X. Benedict, V.H. Crespi, M.L. Cohen, S.G. Louie, and
A. Zettl, Fully collapsed carbon nanotubes, Nature 377 (1995), 135138.
[126] R. Neitola, H. Ruuska, and T.A. Pakkanen, Ab-initio studies on nanoscale
friction between graphite layers: effect of model size and level of theory,
Journal of Physical Chemistry B 109 (2005), 1034810354.
[127] R.J. Nemanich, G. Lucovsky, and S.A. Solin, Infrared active optical vibrations of graphite, Solid state communications 23 (1977), 117120.
[128] R. Nicklow, N. Wakabayashi, and H.G. Smith, Lattice dynamics of pyrolytic
graphite, Physical Review B 5 (1972), no. 12, 49514962.
[129] A. Oberlin, M. Endo, and T. Koyama, Filamentous growth of carbon through
benzene decomposition, J. Cryst. Growth 32 (1976), 335349.
[130] F. Ortmann, F. Bechstedt, and W.G. Schmidt, Semiempirical van der waals
correction to the density functional description of solids and molecular structures, Physical Review B 73 (2006), no. 205101, 110.
[131] A.H. Palser, Interlayer interactions in graphite and carbon nanotubes, Phys.
Chem. Chem. Phys. 1 (1999), 44594462.
[132] G.D. Parfitt and N.H. Picton, Stability of dispersion of graphitized carbon
blacks in aqueous solutions of sodium dodecyl sulphate, Transactions of the
Faraday Society 64 (1968), 19551964.
[133] Linus Pauling, The nature of the chemical bond, 3 ed., Cornell University
Press, 1960.
[134] H. Y. Peng, N. Wang, Y. F. Zheng, Y. Lifshitz, J. Kulik, R. Q. Zhang, C. S.
Lee, and S. T. Lee, Smallest diameter carbon nanotubes, Applied Physics
Letters 77 (2000), no. 18, 28312833.
178
[145] R.S. Ruoff, J. Tersoff, D.C. Lorents, S. Subramoney, and B. Chan, Radial
deformation of carbon nanotubes by van der waals forces, Nature 364 (1993),
514516.
[146] H. Rydberg, M. Dion, N. Jacobson, E. Schrodier, P. Hyldgaard, S.I. Simak,
D.C. Langreth, and B.I. Lundqvist, Van der waals density functional for
layered structures, Physical Review Letters 91 (2003), no. 2, 14.
[147] H. Rydberg, B.I Lundqvist, D.C. Langreth, and M Dion, Tractable nonlocal
correlation density functionals for flat surfaces and slabs, Physical Review
B 62 (2000), 69977006.
[148] R. Saito, G. Dresselhaus, and M.S. Dresselhaus, Physical properties of carbon
nanotubes, Imperial College Press, 1998.
[149] R. Saito, M. Fujita, G. Dresselhaus, and M. S Dresselhaus, Electronic structure of chiral graphene tubules, Applied Physics Letters 60 (1992), no. 18,
22042206.
[150] S. Sapmaz, Ya. M. Blanter, L. Gurevich, and H. S. J. van der Zant, Carbon
nanotubes as nanoelectromechanical systems, Physical Review B (Condensed
Matter and Materials Physics) 67 (2003), no. 23, 235414.
[151] M.C. Schabel and J.L. Martins, Energetics of interplanar binding in graphite,
Physical Review B 46 (1992), no. 11, 71857188.
[152] A. Sieck, D. Porezag, T. Frauenheim, M. R. Pederson, and K. Jackson, Structure and vibrational spectra of low-energy silicon clusters, Physical Review
A 56 (1997), no. 4890.
[153] J.C. Slater and J.G. Kirkwood, The van der waals forces in gases, Physical
Review 37 (1931), 682697.
[154] J.C. Slater and G.F. Koster, Simplified lcao method for the periodic potential
problem, Physical Review 94 (1954), 1498 1524.
180
[155] M. Su, B. Zheng, and J. Liu, A scalable cvd method for the synthesis of singlewalled carbon nanotubes with high catalyst productivity, Chemical Physics
Letters 322 (2000), 321326.
[156] A. Szabados, L.P. Biro, and P.R. Surjan, Intertube interactions in carbon
nanotube bundles, Physical Review B 73 (2006), no. 195404, 19.
[157] K.T. Tang and J.P. Toennies, An improved simple model for the van der
waals potential based on universal damping functions for the dispersion coefficients, Journal of Chemical Physiscs 80 (1983), no. 8, 37263741.
[158] S.J. Tans, A. Verschueren, and C. Dekker, Room-temperature transistor
based on a single carbon nanotube, Nature 393 (1998), 49.
[159] J. Tersoff and R.S. Ruoff, Structural properties of a carbon nanotube crystal,
Physical Review Letters 73 (1994), no. 5.
[160] A. Thess, R. Lee, P. Nikolaev, H. Dai, P. Petit, J. Robert, C. Xu, Y.H Lee,
S.G. Kim, A.G. Rinzler, D.T. Colbert, G.E. Scuseria, D. Tomanek, J.E.
Fischer, and R.E. Smalley, Crystalline ropes of metallic carbon nanotubes,
Science 273 (1996), 483487.
[161] S.B. Trickey, F. M
uller-Plathe, and G.H.F. Diercksen, Interplaner binding
and lattice relaxation in a graphite dilayer, Physical Review B 45 (1992),
no. 8, 44604468.
[162] F. Tuinstra and J.L. Koenig, Raman spectrum of graphite, Journal of Chemical Physics 53 (1970), no. 3, 11261130.
[163] H. Ulbricht, G. Moos, and T. Hertel, Interaction of c60 with carbon nanotubes
and graphite, Physical Review Letters 90 (2003), no. 9.
[164] Loup Verlet, Computer experiments on classical fluids. i. thermodynamical
properties of leonard-jones molecules, Physical Review 159 (1967), no. 1.
181
[165] P.R. Wallace, The band theory of graphite, Physical Review 71 (1947), 622
534.
[166] R.J. Wheatley and W.J. Meath, Dispersion energy damping functions, and
their relative scale with interatomic separation, for (h,he,li)-(h,he,li) interactions, Molecular Physics 80 (1993), no. 1, 2554.
[167] Qin Wu and Weitao Yang, Empirical correction to density functional theory
for van der waals interactions, Journal of Chemical Physics 116 (2002),
no. 2, 515 524.
[168] X. Wu, M.C. Vargas, S. Nayak, V. Lotrich, and G. Scoles, Towards extending the applicability fo density functional theory to weakly bound systems,
Journal of Chemical Physiscs 115 (2001), no. 19, 87488757.
[169] B. I. Yakobson, C. J. Brabec, and J. Bernholc, Nanomechanics of carbon tubes: Instabilities beyond linear response, Physical Review Letters 76
(1996), no. 14, 25112514.
[170] B.I. Yakobson and L.S. Couchman, Persistence length and nanomachanics
of random bundles of nanotubes, Journal of nanoparticle research 8 (2006),
105110.
[171] Z. Yao, C.L. Kane, and C. Dekker, High-field electrical transport in singlewall carbon nanotubes, Physical Review Letters 84 (2000), no. 13, 2941
2944.
[172] M. T. Yin and Marvin L. Cohen, Structural theory of graphite and graphitic
silicon, Physical Review B 29 (1984), no. 12, 69966998.
[173] M.F. Yu, B.S. Files, S. Arepalli, and R.S. Ruoff, Tensile loading of ropes
of single wall carbon nanotubes and their mechanical properties, Physical
Review Letters 84 (2000), no. 24, 55525555.
182
[174] M.F. Yu, O. Lourie, M.J. Dyer, K. Moloni, T.F. Kelly, and R.S. Ruoff,
Strength and breaking mechanism of multiwalled carbon nanotubes under
tensile load, Science (2000), no. 287, 637640.
[175] R. Zacharia, H. Ulbricht, and T. Hertel, Interlayer cohesive energy of
graphite from thermal desorption of polyaromatic hydrocarbons, Physical
Reivew B 69 (2004), no. 155406, 17.
[176] X. Zhao, Y. Liu, S. Inoue, T. Suzuki, R. O. Jones, and Y. Ando, Smallest
carbon nanotube is 3 [angstrom] in diameter, Physical Review Letters 92
(2004), no. 12, 125502.
[177] Q. Zheng and Q. Jiang, Multiwalled carbon nanotubes as gigahertz oscillators, Physical Review Letters 88 (2002), no. 4, 045503.
[178] Q. Zheng, J.Z. Liu, and Q. Jiang, Excess van der waals interaction energy
of a multiwalled carbon nanotube with an extruded core and the induced core
oscillation, Physical Review B 65 (2002), no. 24, 245409.
[179] H.W. Zhu, C.L. Xu, D.H. Wu, B.Q. Wei, R. Vajtai, and P.M. Ajayan, Direct
synthesis of long single-walled carbon nanotube strands, Science 296 (2002),
no. 5569, 884886.
[180] U. Zimmerli, M. Parrinello, and P. Koumoutsakos, Dispersion corrections
to density functionals for water aromatic interactions, Journal of Chemical
Physiscs 120 (2004), no. 6, 26932699.
183