Professional Documents
Culture Documents
Unified Mechanics Theory
Unified Mechanics Theory
Introduction
to Unified
Mechanics Theory
with Applications
Second Edition
Introduction to Unified Mechanics Theory
with Applications
Y
0.999
=
5 years old 100 years old
with stage4
cancer
0
X
5 years old
Z
Time
Cemal Basaran
Introduction to Unified
Mechanics Theory
with Applications
Second Edition
Cemal Basaran
Department of Civil, Structural, and Environmental Engineering
University at Buffalo, SUNY
Buffalo, NY, USA
© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland
AG 2021, 2022
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether
the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of
illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and
transmission or information storage and retrieval, electronic adaptation, computer software, or by
similar or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication
does not imply, even in the absence of a specific statement, that such names are exempt from the relevant
protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this
book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or
the editors give a warranty, expressed or implied, with respect to the material contained herein or for any
errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional
claims in published maps and institutional affiliations.
This Springer imprint is published by the registered company Springer Nature Switzerland AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Foreword
A new scientific truth does not triumph by convincing opponents and making them see the
light, but rather because its opponents eventually die, and a new generation grows up that is
familiar with it.
v
Preface
This book is the culmination of my research over the last 28 years at the University at
Buffalo, The State University of New York. Most of the book relies heavily on the
work of my PhD students and our joint publications. This second edition is based on
the first edition. However, there are two new chapters in this second edition, Chap. 9
on fatigue and Chap. 10 on corrosion-fatigue interaction. Also, there are more
example problems added. Chapter 4 on unified mechanics theory has also been
augmented with more and better explanations.
The traditional [Newtonian] continuum mechanics portion of this book is pri-
marily based on the seminal textbook by Malvern (1969). Since I consider Prof.
Lawrence Malvern’s textbook one of the best written on continuum mechanics, I did
not find any reason to change the formulation or his presentation of the basics. Yet,
when I was a student, I found his book hard to follow because it was written at a high
level. Therefore, I have tried to simplify many of the formulae derivation steps to
make them easier for students to understand. While I did change the order of the
topics which I copied from Malvern (1969), I have tried to stay true to the message
Prof. Malvern had in his formulation.
I have tried to explain the unified mechanics theory in the simplest terms possible.
To accomplish this objective, I found it necessary to include the entire paper by
Sharp and Matschinsky (2015), which is the English translation of Boltzmann’s
(1877) original paper. While I acknowledge that including papers from a journal in
its entirety is a very unorthodox practice, it was necessary, because Boltzmann’s
original paper is misunderstood. Therefore, I had to present his entire derivation
annotated with my explanations and comments. To this day, many physics books
still refer to Boltzmann’s second law equation to be valid for gasses only, with no
reference to his derivation. Boltzmann did not make any such assumption in his
formulation that would have invalidated the applicability of his formulation to
solids. For over a quarter century, my PhD students and I have provided in the
literature the experimental proof of Boltzmann’s second law equation for solids.
Sharp and Matschinsky (2015) have also addressed this major misunderstanding in
their paper. I quote Sharp and Matschinsky (2015) directly. The fact that Boltzmann
vii
viii Preface
does not consider the interaction between the particles only increases the possible
number of complexions, as Boltzmann defines them. Interparticle interactions, like
friction or molecular bonding, would only reduce the number of possible complex-
ions. Therefore, Boltzmann’s formulation can be considered an upper bound limit of
possible permutations. However, Boltzmann’s mathematical derivation is valid for
solids, gasses, and liquids. It was not possible to include Boltzmann’s paper without
Max Planck’s (1900) paper because they complement each other.
In introducing the basics of thermodynamics, I have relied heavily on Callen
(1985). While there are many books on thermodynamics within the literature, I
found the work of Callen (1985) to be the most comprehensive and understandable
from a graduate student’s perspective. I also kept in mind that unfortunately most
mechanics students never take a basic course on thermodynamics at the undergrad-
uate level. Therefore, the chapter on thermodynamics is based primarily on the work
of Callen (1985).
The rest of the chapters are all taken from my work with my former PhD students
to whom I am eternally grateful. My former PhD students who contributed to the
development of this theory over the years are Drs. ChengYong Yan, Rumpa
Chandaroy, Hong Tang, Ying Zhao, Yujun Wen, Shihua Nie, Juan Gomez, Eray
Mustafa Gunel, Shidong Li, Wei Yao, Hua Ye, Ming Hui Lin, Michael Sellers,
Mohammed Abdul Hamid, Yong Chang Lee, Noushad Bin Jamal, and Hsiao Wei
Lee. I am also indebted to Ms. Shifan Cheng for the graphics in the book.
I am especially grateful to the US Navy, Office of Naval Research, for believing
in me and sponsoring most of the work reported in this book. Their continuous
support, [starting in 1997 with the ONR Young Investigator Award Program under
program director Dr. Roshdy Barsoum], has been an immense help, and without it,
this idea would have never come to a realization.
References
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 What Is the Mechanics of Continuous Medium? . . . . . . . . . . . . 1
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2 Stress and Strain in Continuum . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.1 Newton’s Universal Laws of Motion . . . . . . . . . . . . . . . . . . . . 5
2.1.1 First Universal Law of Motion . . . . . . . . . . . . . . . . . . 5
2.1.2 Second Universal Law of Motion . . . . . . . . . . . . . . . . 8
2.1.3 Third Universal Law of Motion . . . . . . . . . . . . . . . . . . 9
2.1.4 Range of Validity of Newton’s Universal
Laws of Motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.5 Relation to the Thermodynamics
and Conservation Laws . . . . . . . . . . . . . . . . . . . . . . . . 12
2.2 Stress . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
2.2.1 Definitions of Stress and Traction . . . . . . . . . . . . . . . . 12
2.2.2 Stress Vector on an Arbitrary Plane . . . . . . . . . . . . . . . 15
2.2.3 Symmetry of Stress Tensor . . . . . . . . . . . . . . . . . . . . . 19
2.2.4 Couple Stresses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.2.5 Principal Stresses and Principal Axes . . . . . . . . . . . . . . 24
2.2.6 Stress Tensor Invariants . . . . . . . . . . . . . . . . . . . . . . . 27
2.2.7 Octahedral Plane and Octahedral Stresses . . . . . . . . . . 30
2.3 Deformation and Strain . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
2.3.1 Small Strain Definition . . . . . . . . . . . . . . . . . . . . . . . . 34
2.3.2 Small Strain and Small Rotation Formulation . . . . . . . . 38
2.3.3 Small Strain and Rotation in 3-D . . . . . . . . . . . . . . . . . 47
2.4 Kinematics of Continuous Medium . . . . . . . . . . . . . . . . . . . . . 48
2.4.1 Material (Local) Description . . . . . . . . . . . . . . . . . . . . 48
2.4.2 Referential Description (Lagrangian Description) . . . . . 48
2.4.3 Spatial Description (Eulerian Description) . . . . . . . . . . 49
2.4.4 Material Time Derivative in Spatial Coordinates
(Substantial Derivative) . . . . . . . . . . . . . . . . . . . . . . . 49
ix
x Contents
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 517
Abbreviations1
γ Shear strain
δij Kronecker delta
ε Total strain tensor
W Vorticity tensor
ε_ p Plastic strain rate tensor
ρ Mass density
σ Total stress tensor
S Deviatoric stress tensor
σO First Piola-Kirchhoff stress tensor
σ~ Second Piola-Kirchhoff tensor
φ Dissipation potential
Φ Thermodynamic state index
TSI Thermodynamic state index
Ψ Helmholtz-free energy
b Body force per unit mass
B-1 Cauchy deformation tensor
C Green deformation tensor
D Rate of deformation tensor
E Lagrange strain tensor
E Euler strain tensor
ETotal Total energy
F Deformation gradient tensor
F Plasticity yield surface
F~ Second Piola-Kirchhoff force
g Gibbs function
1
Most of the time I tried to use the same character to define a variable. However, due to the large
number of variables, some characters were used more than once to define different variables. To
clarify the variables, they are re-defined after each equation.
xvii
xviii Abbreviations
h Enthalpy
I Impulse
I 1, I 2, I 3 Total stress tensor invariants
J2D Second invariant of the deviatoric stress tensor
J3D Third invariant of the deviatoric stress tensor
k, or kB Boltzmann’s constant
K Kinetic energy
V Potential energy
L Spatial gradient of velocity
m mass
mij the distributed moment in couple stress theory
nij Direction cosines
p Momentum
q Heat flux vector
Q Plastic potential
Q Total heat input
s Entropy
t Time
u, v, w Displacements along spatial coordinates, x, y, z, respectively
i, j, k Unit vectors along spatial coordinates, x, y, z, respectively
U Strain energy
~v Velocity
A Area
V Volume
Chapter 1
Introduction
The traditional definition of the continuous medium also makes three assump-
tions (Malvern, 1969):
Continuity A material is continuous if it fills the space that it occupies, leaving no
pores or empty spaces, and if furthermore, its properties are describable by contin-
uous functions.
Homogeneity A homogenous material has identical properties at all points. The
size is larger than the Representative Volume Element (RVE) of the material. RVE is
the smallest volume over which a property measurement can be made that will yield
a value representative of the whole (Hill, 1965).
Isotropy A material is isotropic with respect to certain properties if these properties
are the same in all directions in space.
Only the first assumption is needed to define concepts of stress and strain. The last
two assumptions are needed when we introduce stress-strain relations (constitutive
relations) into continuum mechanics equations.
Of course, anisotropic, inhomogeneous, and discontinuous systems can also be
analyzed by continuum mechanics by representing them as discrete pieces, as it is
done in computational mechanics, such as the finite element method.
Chapter 2 of this book covers only the general principles of continuum mechan-
ics, and associated definitions, such as stress and strain. The fundamentals of
continuum mechanics, presented in Chap. 2, have been well established by Isaac
Newton (1687), Leonhard Euler (1736), Cauchy, Green, Truesdell, and Toupin
(1960), Truesdell and Noll (1965), Truesdell (1965, 1966), and others. Maugin
(2016) published a comprehensive account of the foundations of continuum mechan-
ics as well as a complete list of references to earlier work. Therefore, there is no
attempt in this book to cover earlier fundamental developments.
Continuum mechanics cannot be discussed without including material constitu-
tive equations. While material behavior modeling is a very important topic of major
current research efforts, it is not the focus of this book. It is covered in the
applications of the unified mechanics theory sections, Chaps. 5, 6, 7, 8, and 9.
Chapter 3 of the book covers the basics of thermodynamics. This chapter starts
with a comprehensive literature survey of the use of thermodynamics in continuum
mechanics in the last 150 years. While we do not claim to have included every paper
published on the topic, the survey covers all significant developments. Energy
dissipation and degradation of materials and structures are discussed in the context
of thermodynamics. Since not all engineering students are required to take the course
in thermodynamics, the reader has assumed a beginner, as such a piece of elementary
information is included.
Chapter 4 presents the formulation of the unified mechanics theory. Great effort
has been made to include all the details of the formulation. Chapter 4 also discusses
the motivation behind the development of the unified mechanics theory. Essentially,
the present-day mechanics equations are all based on the three laws of motion of
Isaac Newton. However, these three laws are “incomplete,” because they do not
include the laws of thermodynamics. The following examples can help us explain
References 3
References
It is essential to start the discussion about continuum mechanics, with the universal
laws of motion as given by Isaac Newton. Understanding their implications in
historical context is essential to understanding these simple and probably the most
famous physics equations. Of course, Newton’s universal laws of motion are very
well known to most readers of this book; however, their historical context and
evolution are not. Therefore, we find it necessary to quote these basics from Sir
Isaac Newton’s own Wikipedia website with their sources. The information below is
quoted directly with little or no editing.
Philosophiæ Naturalis Principia Mathematica (Latin for Mathematical Principles
of Natural Philosophy), often referred to as simply the Principia, is a work in three
books by Isaac Newton, in Latin, first published 5 July 1687. After annotating and
correcting his copy of the first edition, Newton published two further editions, in
1713 and 1726 (Harvard UP, 1972). The Principia states Newton’s universal laws of
motion, forming the foundation of classical mechanics; Newton’s law of universal
gravitation; and a derivation of Kepler’s laws of planetary motion (which Kepler first
obtained empirically) (Cohen, 2002; Galili & Tseitlin, 2003; Kleppner & Kolenkow,
1973; Plastino & Muzzio, 1992; Resnick et al., 1992) (Fig. 2.1).
The law of inertia was discovered by different philosophers and scientists inde-
pendently, including Thomas Hobbes (1588–1679) in his Leviathan (Hobbes, 1651).
The seventeenth-century philosopher and mathematician René Descartes
(1596–1650) also formulated the law, although he did not perform any experiments
to confirm it (Hellingman, 1992; Resnick & Halliday, 1977).
The first law, also known as the law of inertia, states that if the net force (the vector
sum of all forces acting on an object) is zero, then the velocity of the object is
constant. Velocity is a vector quantity, which expresses both the object’s speed and
the direction of its motion; therefore, the statement that the object’s velocity is
constant is a statement that both its speed and the direction of its motion are constant.
The first law can be stated mathematically, when the mass is a non-zero constant,
as,
d~v
F = 0, =0
dt
where F is the total net force acting on the object, d~v=dt is the time derivative of
velocity. Consequently, an object that is at rest will stay at rest unless an external
force acts upon it. An object that is in motion will not change its velocity unless an
external force acts upon it. This is known as the uniform motion. An object continues
to do whatever it happens to be doing unless a force is exerted upon it. If it is at rest, it
continues in a state of rest (demonstrated when a tablecloth is skillfully whipped
from under dishes on a tabletop and the dishes remain in their initial state of rest),
where the inertia of dishes balances the force exerted by the table cloth. If an object is
moving, it continues to move without turning or changing its speed. This is evident
in space probes that continuously move in outer space. Change in motion must be
imposed against the tendency of an object to retain its state of motion. In the absence
of a total net force, a moving object tends to move along a straight-line path
indefinitely.
Newton stated the first law of motion to establish a frame of reference for which
the other laws are applicable. The first law of motion postulates the existence of at
least one frame of reference called a Newtonian or inertial frame of reference,
relative to which the motion of a particle, not subjected to a net force, is a straight
line at a constant speed (Newton, 1999). Newton’s first law is often referred to as the
law of inertia. Thus, a condition necessary for the uniform motion of a particle
relative to an inertial frame of reference is that the total net force acting on it is zero.
In this sense, the first law can be restated as (Motte’s 1729 translation):
In every material universe, the motion of a particle in a preferential reference frame X is
determined by the action of forces whose total vanished for all times when and only when the
velocity of the particle is constant in X. That is, a particle initially at rest or in uniform
motion in the preferential frame X continues in that state unless compelled by forces to
change it.
8 2 Stress and Strain in Continuum
“Newton’s first and second laws are valid only in an inertial reference frame. Any
reference frame that is in uniform motion with respect to an inertial frame is also
inertial, i.e., Galilean invariance or the principle of Newtonian relativity” (Resnick &
Halliday, 1977). If someone kicks a ball on a moving train car, the acceleration the
ball will gain will be with respect to the train not with respect to the ground.
(Modern translation) Law II: In an inertial reference frame, the vector sum of the
forces “F” on an object is equal to the mass “m” of that object multiplied by the
acceleration “a” of the object:
According to modern ideas of how Newton was using his terminology, this is
understood, in modern terms, as an equivalent of:
The change of momentum, ½d ðm~vÞ, of a body, is proportional to the impulse (Fdt) impressed
on the body and happens along the straight line on which that impulse (Fdt) is impressed.
This may be expressed by the formula F = p′ where p′ is the time derivative of the
momentum ½p = d ðm~vÞ. This equation can be seen in the Wren Library of Trinity
College, Cambridge, UK, in a glass case in which Newton’s manuscript is open to
the relevant page.
Motte’s 1729 translation of Newton’s Latin manuscript continued with Newton’s
commentary on the second law of motion, reading:
If a force generates a motion, a double force will generate double the motion, a triple force
triple the motion, whether that force be impressed altogether and at once, or gradually and
successively. And this motion (being always directed the same way with the generating
force), if the body moved before, is added to or subtracted from the former motion, according
to as they directly conspire with or are directly contrary to each other; or obliquely joined,
when they are oblique, so as to produce a new motion compounded from the determination
of both.
The scientific intention in which Newton used his terminology, and how he
defined the second law and intended it to be understood, has been extensively
studied by historians of science.
2.1 Newton’s Universal Laws of Motion 9
The second law states that the rate of change of momentum of a body is directly
proportional to the force applied, and this change in dp
dt momentum takes place in the
direction of the applied force, F,
dp dðm~vÞ
F= =
dt dt
d~v
F=m = ma
dt
I = Δp = mΔ~v
this relation between impulse and momentum is closer to Newton’s definition of the
second law.
(Modern translation) Law III: When one body exerts a force on a second body
F12 = F21, the second body simultaneously exerts a force F21 equal in magnitude and
opposite in direction to the first body.
Newton’s explanatory comment to this law:
Whatever draws or presses another is as much drawn or pressed by that other. If you press a
stone with your finger, the finger is also pressed by the stone. If a horse draws a stone tied to
a rope, the horse (if I may so say) will be equally drawn back towards the stone: for the
distended rope, by the same endeavor to relax or unbend itself, will draw the horse as much
towards the stone, as it does the stone towards the horse, and will obstruct the progress of the
one as much as it advances that of the other. If a body impinges upon another, and by its
force changes the motion of the other, that body also (because of the equality of the mutual
pressure) will undergo an equal change, in its own motion, toward the contrary part. The
changes made by these actions are equal, not in the velocities but in the motions of the
bodies; that is to say if the bodies are not hindered by any other impediments. For, as the
motions are equally changed, the changes of the velocities made toward contrary parts are
reciprocally proportional to the bodies. This law takes place also in attractions, as will be
proved in the next scholium (Fairlie & Cayley, 1965).
F 12 = F 21
In the above, as usual, motion is Newton’s term for momentum, hence his careful
distinction between motion and velocity.
Newton used the third law to derive the law of conservation of momentum
(Cohen, 1995), from a deeper perspective; however, conservation of momentum is
the more fundamental idea (derived via Noether’s theorem from Galilean invari-
ance), and holds in cases where Newton’s third law appears to fail, for instance,
when force fields, as well as particles, carry momentum, and in quantum mechanics.
An example illustration of Newton’s third law is in which two people push
against each other. The first person on the left exerts a normal force F12 on the
second person directed towards the right, and the second person exerts a normal
force F21 on the first person directed towards the left. The magnitudes of both forces
are equal, but they have opposite directions, as dictated by Newton’s third law.
The third law states that all forces between two objects exist in equal magnitude
and opposite direction: if one object A exerts a force FA on a second object B, then B
simultaneously exerts a force FB on A, and the two forces are equal in magnitude and
opposite in direction: FA = - FB. The third law means that all forces are interactions
between different bodies, or different regions within one body, and thus that there is
no such thing as a force that is not accompanied by an equal and opposite force. In
some situations, the magnitude and direction of the forces are determined entirely by
one of the two bodies, say Body A; the force exerted by Body A on Body B is called
the “action,” and the force exerted by Body B on Body A is called the “reaction.”
This law is sometimes referred to as the action–reaction law, with FA called the
“action” and FB the “reaction.” In other situations, the magnitude and directions of
the forces are determined jointly by both bodies, and it is not necessary to identify
one force as the “action” and the other as the “reaction.” The action and the reaction
are simultaneous, and it does not matter which is called the action and which is called
the reaction; both forces are part of a single interaction, and neither force exists
without the other (Cohen, 1967).
2.1 Newton’s Universal Laws of Motion 11
The two forces in Newton’s third law are of the same type (e.g., if the road exerts
a forward frictional force on an accelerating car’s tires, then it is also a frictional
force that Newton’s third law predicts for the tires pushing backward on the road).
From a conceptual standpoint, Newton’s third law is seen when a person walks:
they push against the floor, and the floor pushes against the person. Similarly, the
tires of a car apply a shear force on the road while the road pushes back on the tires
with a re-action shear force—the tires and road simultaneously push against each
other. The reaction force is responsible for the motion of the person or the car. These
forces depend on friction; however, if a person or car is on ice, there will be no
motion because of failure to produce the needed reaction force.
It is important to point out that Robert Hooke published his law in 1676. Hooke’s
law which is more sophisticated than Newton’s third law, for deformable bodies as
F = ku where k is the stiffness of a one-dimensional spring and u is the displacement
response in the direction of the force. It is easy to see the action–reaction concept in
Hooke’s law. “Hooke stated in his 1678 work that he was aware of the law in 1660,
which is 17 years before Newton published his Principia in 1687” (Robert Hooke’s
Wikipedia page).
In their original form, Newton’s laws of motion are not adequate to characterize the motion
of rigid bodies and deformable bodies. Leonhard Euler in 1750 introduced a generalization
of Newton’s laws of motion for rigid bodies called Euler’s laws of motion, later applied as
well for deformable bodies assumed as a continuum. If a body is represented as an
assemblage of discrete particles, each governed by Newton’s laws of motion, then Euler’s
laws can be derived from Newton’s laws. Euler’s laws can, however, be taken as axioms
describing the laws of motion for extended bodies, independently of any particle structure
(Lubliner, 2008; Leonard Euler’s Wikipedia page).
Newton’s laws were verified by experiments and observations for over 200 years, and they
are excellent approximations at the scales and speeds of most of the motions we observe in
daily life. Newton’s laws of motion, together with his law of universal gravitation and the
mathematical techniques of calculus, provided for the first time a unified quantitative
explanation for a wide range of physical phenomena.
These three laws hold for macroscopic objects under everyday conditions. However,
Newton’s laws are inappropriate for use in certain circumstances, most notably, at very high
speeds (in special relativity, the Lorentz factor must be included in the expression for
momentum along with the rest mass and velocity) or in very strong gravitational fields.
Therefore, Newton’s laws cannot be used to explain phenomena such as conduction of
electricity in a semiconductor, optical properties of materials, errors in non-relativistically
corrected GPS systems, and superconductivity. Explanation of these phenomena requires
more sophisticated physical theories, including general relativity and quantum field theory.
However, in quantum mechanics, concepts such as force, momentum, and position are
defined by linear operators that operate on the quantum state; at speeds that are much lower
than the speed of light, Newton’s laws are just as exact for these operators as they are for
classical objects.
12 2 Stress and Strain in Continuum
In modern physics, the laws of conservation of momentum, energy, and angular momentum
are of more general validity than Newton’s laws, since they apply to both light and matter,
and both classical and non-classical physics. Because force is the time derivative of
momentum, the concept of force is redundant and subordinate to the conservation of
momentum, and is not used in fundamental theories (e.g., quantum mechanics, quantum
electrodynamics, general relativity, etc.). Other forces, such as gravity, also arise from
momentum conservation. Newton stated the third law within a world-view that assumed
instantaneous action at a distance between material particles. However, he was prepared for
philosophical criticism of this action at a distance, and it was in this context that he stated the
famous phrase ‘I feign no hypotheses.’ The discovery of the second law of thermodynamics
in the 19th century showed that not every physical quantity is conserved over time, thus
disproving the validity of inducing the opposite metaphysical view from Newton’s laws.
Hence, a ‘steady-state’ worldview based solely on Newton’s laws and the conservation laws
does not take entropy into account.
This brings us to the main topic of this book. In unified mechanics, theory entropy
generation is incorporated into Newton’s universal laws of motion.
2.2 Stress
The external surface force per unit area applied to the body is called the traction, t. It
is a distributed external load. The vector ΔF acting at an imaginary point in the
interior surface is called the internal force vector (Fig. 2.2).
ΔA is the area of the internal imaginary point Q; ΔF is the force vector acting on
this area as a result of external loads applied to the body. We choose to use the term
imaginary point intentionally because later we will assume that the area of this point
goes to zero to be able to define stress at a point.
Fig. 2.2 Definition of stress at an imaginary point Q in the internal surface at cross-section cut a-a
2.2 Stress 13
The average stress σ acting on ΔA is the vector ΔFΔA , ΔF is the vector sum of the
forces acting at point Q, and ΔA is the area of the chosen imaginary point. The
average stress is also a vector having the same direction as ΔF.
Assuming that ΔA is selected so that it represents a point with zero volume as the
value of ΔA approaches zero.
lim
ΔA → 0
of course, ΔA can never be equal to zero. If it were zero, stress value would be
infinite. We let ΔA approach zero to be able to define stress at an imaginary point.
ΔF
σ = lim
ΔA → 0 ΔA
Because the area corresponding to internal force is approaching zero, the numer-
ator, ΔF also approaches zero. However, the fraction, in general, approaches a finite
limit
ΔF
σ = lim
k → 1 ΔA k
where k represents the number of points in the body. This is a basic postulate of
continuum mechanics that such a limit exists and is independent of the area used
(Malvern, 1969).
It is important to explain what we mean by an “imaginary point” with zero
volume, as the area ΔA, it is natural to assume that this point represents an atom.
It cannot be smaller than an atom. However, this point does not represent an atom.
Because at the atomic level fundamental definitions of continuum do not exist, and
this definition of stress at the atomic level is not true. This “imaginary point”
therefore represents a point in the continuum, but not an atom or any other quantum
mechanics’ matter. Since quantum mechanics is outside the scope of this book, we
will refer the reader to a textbook on quantum mechanics.
In x, y, z Cartesian coordinate system stresses at point Q on the cross-section that
has x axis as its normal can be defined by
ΔF x dF x
σ xx = lim =
ΔAx → 0 ΔAx dAx
ΔF y dF y
σ xy = lim =
ΔAx → 0 ΔAx dAx
ΔF z dF z
σ xz = lim =
ΔAx → 0 ΔAx dAx
14 2 Stress and Strain in Continuum
Subscript x in dAx indicates that x axis is normal to area dAx (Fig. 2.3).
In the same fashion, we can define stress vectors acting on other surfaces, y, and
z. As a result, the state of stress at a point is defined by a second-order tensor with
nine vector components
2 3
σ xx σ xy σ xz
6 7
σ = 4 σ yx σ yy σ yz 5
σ zx σ zy σ zz
Normal stresses, σ ii, are positive when the vector is in the positive axis direction.
Shear stresses σ ij are positive when the vector component is in the positive direction
on the positive face of the block and the negative direction on the negative face of the
block. Shear stresses σ ij are negative when the vector component is in the negative
direction on the positive face of the block and the positive direction on the negative
face of the block.
In this book, tensile stress is defined to be positive, and compression is defined as
negative. However, we realize that this definition is not universal. For instance, in
geo-mechanics tension is negative and compression is positive.
While tension and compression lead to completely different results when applied
to any object, negative shear stress and positive shear stress are considered to be the
same kind of loading in opposite directions from a physical point of view. There is
no compression shear or tension shear.
2.2 Stress 15
Let’s define a plane ABC at an arbitrary slope passing through point Q (Figs. 2.4 and
2.5).
To obtain the equations governing the state of stress at an arbitrarily oriented
plane ABC, we will use the conservation of momentum principle. Forces acting on
the tetrahedron are five vectors representing the resultant force on each of the faces
(ΔAx, ΔAy, ΔAz, ΔA) and the resultant body force ρbΔV, where ρ is mass density, b is
body force per unit mass, and ΔV is the volume of the tetrahedron.
σ (n) is the average value of stress on the oblique face which has n as its normal.
σ is the average stress on the area ΔAx that has x axis as its normal, similarly σ ( y)
(x)
Fig. 2.5 (a) An arbitrary tetrahedron QABC at point Q (b) free-body diagram of
tetrahedron QABC
Z Z
~vρdV = ~vdm
ΔV Δm
where dm is the mass of the tetrahedron, dV is the volume of ρ is mass density, and ~v
is the velocity of the particle.
The time rate of change of the total momentum is given by
d~v d~v
Δm = ðρΔV Þ
dt dt
It is assumed that Δm does not change over time. The conservation of momentum
principle yields the following equilibrium equation for the free body shown in
Fig. 2.6.
d~v
σ ðnÞ ΔA þ ρbΔV - σ ðxÞ ΔAx - σ ðyÞ ΔAy - σ ðzÞ ΔAz = ðρΔV Þ
dt
We need to calculate the height and volume of the arbitrary tetrahedron at point
Q. Then we let the height h go to zero, to be able to define the imaginary point Q,
h h h
cos α = , cos β = , cos γ =
OA OB OC
nx = cosα, ny = cosβ, nz = cosγ
Three direction cosines, nx, ny, nz, are also bound by the following relation:
As a result
h = OA nx = OB ny = OC nz
1
ΔV = hΔA
3
1 1 1
ΔV = ðOA cos αÞΔA = ðOB cos βÞΔA = ðOC cos γ ÞΔA
3 3 3
Note that we can define the direction cosines also using the relationship between
the areas of the faces of the tetrahedron
ΔAx
cosα =
ΔA
because ΔAx is the projection of the inclined area of interest ΔA on y - z plane with
x axis its normal. Hence
1
ΔV = OA ΔAx
3
18 2 Stress and Strain in Continuum
1 1
ΔV = OB ΔAy and ΔV = OC ΔAz
3 3
substituting ΔV, ΔAx, ΔAy, and ΔAz in the conservation of momentum equation,
we obtain
1 1 d~v
σ ðnÞ ðΔAÞ þ ρb hΔA = σ ðxÞ ΔA nx þ σ ðyÞ ΔA ny þ σ ðzÞ ΔA nz þ ρ hΔA
3 3 dt
1 1 d~v
σ ðnÞ þ ρb h = σ ðxÞ nx þ σ ðyÞ ny þ σ ðzÞ nz þ ρ h
3 3 dt
dt = 0, then, we
Let’s assume, there is no body force b = 0, and no acceleration d~
v
can write
ðnÞ
σx = σxx nx þ σyx ny þ σzx nz
ðnÞ ð2:2Þ
σy = σxy nx þ σyy ny þ σzy nz
σðnÞ
z = σxz nx þ σyz ny þ σzz nz
where σ ðxnÞ is resultant vector component on n plane in the (x) direction, and we have
the same definition for y and z directions. We can write a second-order tensor, σ ij,
Cauchy stress tensor from these algebraic equations. In indicial notation algebraic
equations can be written as
2.2 Stress 19
ðnÞ
σ j = σ ij ni i = x,y,z j = x,y,z
in matrix notation {σ (n)} = {n}[σ]. Remember that { } defines a vector and [ ] defines
a matrix.
Cauchy stress tensor in any other Cartesian coordinate system can easily be
obtained. Assume that the new coordinate system is rotated with respect to the
original coordinate system by α, β, γ angles. The components of the new rotated
stress tensor σ ij can be defined in terms of the three stress vectors acting across the
three planes normal to the new coordinates.
Components of the new stress tensor σ ij will be
ðnÞ
σ i = σ ij nj ∙
σ ij = nki nlj σ kl
or in matrix notation
where [N] is the matrix of direction cosines nki = cos ðxi , xk Þ of the angles between
the new and old axes. [N]T is its transpose. The matrix of direction cosine is given by
2 3
cos ðx, xÞ cos ðx, yÞ cos ðx, zÞ
6 7
½N = 4 cos ðy, xÞ cos ðy, yÞ cos ðy, zÞ 5
cos ðz, xÞ cos ðz, yÞ cos ðz, zÞ
Normal stresses act orthogonal to the surface they are acting on. However, shear
stresses act parallel to the plane on each surface of the cubic control volume, which
represents a point in space with zero volume.
Shear stresses are assumed to be positive when they are on the positive face of the
cube and acting in the positive coordinate axis direction. On the negative face, shear
stress is positive if the direction of the stress is in the negative direction of the axis.
20 2 Stress and Strain in Continuum
Fig. 2.7 (a) Linear shear stresses acting at point Q. (b) x-y plane view of point Q
σ xy = σ yx , σ xz = σ zx , σ yz = σ yz
σ xz = σ zx and σ zy = σ yz
Couple stress at a point can exist due to external loads, like a magnetic field, due to
materials microstructure, or due to the small size of the structure which can cause
very large deformation gradients. If there are the couple-stresses mxz on x plane and
myz on y plane and we write the moment equilibrium with respect to z axis,
σ xy dydz dx þ mxz - σ yx dxdz dy þ myz = 0
σxy x
σyx
myz
dz
dx
z
22 2 Stress and Strain in Continuum
For the contribution of forces in x direction to the resultant force on the inclined
plane (n), we can write the following relation, where A represents the area along the
inclined surface:
σðnÞ
x = σxx nx þ σyx ny
σðnÞ
x = - 5 MPa cos 45 - 2 MPa cos 45 = - 4:95 MPa
Projection of ABC plane on x, y, z planes is the same shown below, OBC, OAB,
OAC
2.2 Stress 23
y
σ yy
σ yz σy
x
B
σ xy
σ zy
σ xx
σ zz σ xz x O A
σ zx x
C
z
z
B
B
1 1
√2 √2 √2 Area of OBC = 2
1 ∙1 = 2
1
OBC = OAB = OAC
C A O A
√2 1
Area of ABC = 1
2
√ 2 ∙ √ 2 Cos 30 = 0.866
y
( )
B
( )
ABC Plane x
A
2 ðnÞ
3 2 3
σx 0:5774
6 7 6 7
σðnÞ = nT ∙ 4 σðnÞ
y 5 = ½0:5774 0:5774 0:5774 ∙ 4 12:703 5 = 31:00 MPa
σðnÞ
z
40:418
2 2 2
σðnÞ = σðnÞ
n þ σðnÞ
s
2 2 2 2
σðnÞ = σðnÞ
x þ σðnÞ y þ σðnÞ z = 1795:5685
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
σs = σðnÞ2 - σ2n = 28:879 MPa
The stress tensor given in the original Cartesian coordinate system changes values if
we change the orientation of the coordinate system, i.e., simply rotating it around the
origin O. The inclined plane (Fig. 2.9) represents the orientation of a new coordinate
system. Therefore, the stress tensor becomes a function of the rotation angle with
respect to the original coordinate system. One of these orientations will have the
maximum normal stress values, also called principal stresses.
For a three-dimensional stress tensor, these principal values and principal orien-
tation can be identified by eigenvalue analysis,
Normal and shear stress components acting on the inclined face of the tetrahedron
are a function of the orientation of the inclined plane. If the stress tensor is
symmetric, we can choose an orientation of the new coordinate axes such that the
shear stress components vanish in this new coordinate system. These special axes are
called principal axes or principal directions. The three planes that are orthogonal to
three principal axes are called the principal planes. On principal planes, all shear
stresses are zero. There are only normal stresses on the principal planes. Maximum
2.2 Stress 25
normal stress is defined as the algebraically largest of the three principal stresses.
The minimum normal stress is defined as the algebraically smallest of the three
principal stresses. There is always such a set of three mutually orthogonal directions
at any point if the state of stress at a point is a symmetric second-order tensor. This is
a property common to all symmetric second-order tensors.
The principal planes and principal stresses can be found by traditional eigenvalue
analysis.
Let [I] be a matrix of unit vectors in one of the arbitrary orientations of the
coordinate system. Let λ be the principal stress components in the new orientation,
whose normals are given by ½n.
Since there is no shear stress on the principal planes, the stress vector on the
principal plane will be parallel to the normal
ðnÞ
σ i = λni
where i = x, y, z.
In matrix notation
n o
σ ðnÞ = λfng
Since
n o
σ ðnÞ = fng½σ
26 2 Stress and Strain in Continuum
f ng ½ σ = λ f n g
fng is a row matrix {n1, n2, n3} where ni defines the direction cosine. λ is the
principal stress value sought. Thus
2 3
λ 0 0
6 7
λ½I = 4 0 λ 05
0 0 λ
Equation (3.3) has a solution only if the determinant of the matrix is equal to zero
since the direction cosines vector cannot be zero, and because n21 þ n22 þ n23 = 1 must
be satisfied.
Hence,
½½σ - λ½I = 0
ðσ xx - λÞ σ xy
σ xz
σ yx σ yy - λ σ yz
=0
σ zx σ zy ðσ zz - λÞ
This is a cubic equation for the unknown λ. If the stress matrix is symmetric and
has real stress values (as opposed to imaginary values), the three roots of the cubic
equation
λ3 þ aλ2 þ bλ þ c = 0
λ1 = σ 1 , λ2 = σ 2 , and λ3 = σ 3 :
2.2 Stress 27
fng½½σ - λ½I = 0
λ 3 - I 1 λ2 þ I 2 λ - I 3 = 0 ð2:4Þ
28 2 Stress and Strain in Continuum
where I1, I2, and I3 are the first, second, and third invariants of the stress tensor.
Invariants are given by the following equations:
I 1 = σ xx þ σ yy þ σ zz = trσ ð2:5Þ
Or in indicial notation
I 1 = σ ii
I 2 = σ xx σ yy þ σ yy σ zz þ σ zz σ xx - σ 2xy - σ 2xz - σ 2yz
Or in indicial notation
1
I2 = σ ii σ jj - σ ij σ ij ð2:6Þ
2
where i = x, y, z j = x, y, z
h i
I 3 = σ xx σ yy σ zz þ 2σ xy σ yz σ xz - σ 2xz σ yy - σ 2yz σ xx - σ 2xy σ zz
or
2 3
σxx σxy σxz
6 7
I 3 = 4 σyx σyy σyz 5 = det j σ j ð2:7Þ
σzx σzy σzz
In the principal planes, these are no shear stresses; therefore the invariant terms
become simpler:
I 1 = σ1 þ σ2 þ σ3 ð2:8Þ
I 2 = σ1σ2 þ σ2σ3 þ σ3σ1 ð2:9Þ
I 3 = σ1σ2σ3 ð2:10Þ
Stress invariants are mostly used in the constitutive modeling of materials. Any
function constructed with invariants is also invariant with respect to coordinate
systems.
2.2 Stress 29
We can represent any stress tensor as a summation of the hydrostatic stress tensor
and the remainder, which is called the deviatoric stress tensor.
The hydrostatic stress part for any stress tensor is given by
1
p= σ þ σ yy þ σ zz ð2:11Þ
3 xx
Using this definition, we can split the stress tensor into two parts.
2 3 2 3
p 0 0 ðσ xx - pÞ σ xy σ xz
6 7 6 7
½σ = 4 0 p 0 5 þ 4 σ yx σ yy - p σ yz 5 ð2:12Þ
0 0 p σ zx σ zy ðσ zz - pÞ
In Eq. (2.12) the first tensor is called the hydrostatic stress tensor, and the second
tensor is called the deviatoric stress tensor. Hydrostatic tensor is also called spherical
stress tensor. We will use Sij to represent deviatoric stress tensor,
2 3
Sxx Sxy Sxz
6 7
Sij = 4 Syx Syy Syz 5
Szx Szy Szz
where
1
Sxx = σ xx - σ þ σ yy þ σ zz Sxy = σ xy , Sxz = σ xz , Syz = σ yz
3 xx
In the same fashion Syy and Szz can be written. In Cartesian form, the deviatoric
stress tensor is given by
1
Sij = σ ij - σ δ
3 kk ij
if i = j δij = 1, if i ≠ j δik = 0
Principal deviatoric stresses are in the same planes as the total stress tensor principal
planes. Following the same procedure we used for the total stress tensor, we can
obtain the characteristic [eigen] equation of the deviatoric stress tensor as follows:
2 3
ðSxx - λÞ Sxy Sxz
6 7
4 Syx Syy - λ Syz 5=0
Szx Szy ðSzz - λÞ
λ3 - J 2D λ - J 3D = 0
where J2D and J3D are the second and third invariants of the deviatoric stress tensor,
respectively. The first invariant of the deviatoric stress tensor is zero J1D = 0,
1
J 2D = Sij Sij
2
h i
1 2 2
= σ xx - σ yy þ σ yy - σ zz þ ðσ xx - σ zz Þ2 þ σ 2xy þ σ 2xz þ σ 2yz
6
h i
1
J 2D = S2xx þ S2yy þ S2zz þ S2xy þ S2xz þ S2yz
2
1
J 3D = Sij Sjk Ski
3
Sxx Sxy Sxz
J 3D = Syx Syy Syz
Szx Szy Szz
It is important to point out that the second invariant of the deviatoric stress tensor
is a representation of shear stress; however physical interpretation of the third
invariant is more complex.
The plane that makes equal angles with the three principal directions is called the
octahedral plane. The projection of the stress vector on the octahedral plane is called
the octahedral shear stress. The magnitude of the normal stress on the octahedral
plane is constant.
2.2 Stress 31
1
σ oct = σ þ σ yy þ σ zz
3 xx
The second invariant of the deviatoric stress tensor is related to octahedral shear
stress by the following relation:
3 2
J 2D = τ
2 oct
In most metals plasticity happens mainly due to shear stresses; hence deviatoric
stress tensor invariants and octahedral stresses are utilized often in defining yield
surfaces.
Example 2.3 Figure 2.3E
Stresses acting on the point shown above are given below:
Fig. 2.3E (a) Stresses acting on the point. (b) Principal directions
32 2 Stress and Strain in Continuum
(i) Calculate the normal and shear stresses acting on the octahedral plane [i.e., 1,1,1
plane with respect to the principal axes] using the stress transformation equa-
tions, and then compare those values with the formula given below:
1
σ oct = σ þ σ yy þ σ zz
3 xx
h i h i
1 2 2 2 2 2
τ2oct = σ xx - σ yy þ σ yy - σ zz þ ðσ xx - σ zz Þ2 þ σ xy þ σ yz þ ðσ zx Þ2
9 3
First, we need to calculate principal stresses and principal directions using the
Eigen equations,
ð4 - λÞ 3 2
3 ð5 - λÞ 1 =0
2 1 ð6 - λÞ
This yields a third-order polynomial in λ. The principal stresses are these roots.
Principal stresses are λ1 = σ 1 = 9.0000 MPa, λ2 = σ 2 = 4.7321 MPa, λ3 = σ 3 =
1.2679 MPa
Direction cosines for octahedral plane [111] normal with respect to the principal
directions were calculated earlier in Example 2.2.
2 3
0:5774
6 7
n = 4 0:5774 5
0:5774
2
σðnÞ = ðσ1 n1 Þ2 þ ðσ2 n2 Þ2 þ ðσ3 n3 Þ2
= 0:57742 9:00002 þ 0:57742 4:73212 þ 0:57742 1:26792
= 35:0061
2.3 Deformation and Strain 33
Also
2
σðnÞ = σ2oct þ τ2oct
Hence
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
τoct = σ 2 - σ 2oct = 3:1623 MPa
And now using the formula given above we can recalculate the stresses on the
octahedral plane
rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
h i h i
1 2
τoct = ð 4 - 5 Þ 2 þ ð 5 - 6Þ 2 þ ð 4 - 6Þ 2 þ ð 3 Þ 2 þ ð 1 Þ 2 þ ð 2 Þ 2
9 3
= 3:1623 MPa
1
σ oct = ½4 þ 5 þ 6 = 5:0000 MPa
3
h i
1 2 2
J 2D = σ xx - σ yy þ σ yy - σ zz þ ðσ xx - σ zz Þ2 þ σ 2xy þ σ 2xz þ σ yz
2
6
(iii) h i
1
= ð4 - 5Þ2 þ ð5 - 6Þ2 þ ð4 - 6Þ2 þ 32 þ 22 þ 12 = 15
6
Verify
3 2
J 2D =τ
2 oct
3
15 = 3:16232 = 15
2
to the new configuration. Therefore, we must establish a strain definition. There are
many definitions of strain. We can choose any definition of strain and calculate the
strain in the deformable body. The point we are trying to emphasize is that strain is a
definition (a human construct) calculated using measured displacement quantities.
Therefore, strain cannot be measured directly.
Strain formulation without any constraints on the magnitude of the strain gives a
more accurate representation of the deformation. However, the mathematical
description of finite strain (large strain) is quite complex. Therefore, for many
engineering problems, the small-strain theory is used, which is satisfactory with a
reasonable degree of error.
First, we will present the small strain formulation and then a more complex large
strain (finite strain) formulation. Rigid body motion does not lead to any strain.
There is no magic number that defines the boundary between small-strain theory and
large strain theory. However, any strain value less than 2% is usually, but not always,
considered a small strain. The decision to use the small or large strain formulation
depends on the problem at hand and the material properties. Now, we will define
strains with respect to the initial (undeformed) configuration.
At the unit defined by ABCD, corners B and C are stretched to new locations B′ and
C′ (Fig. 2.11).
As a result, small-strain Exx is the change in length per unit of initial length in the
x-direction. In the same fashion, we can define the unit extension per unit length for
the y and z directions Eyy and Ezz
2.3 Deformation and Strain 35
Δu Δv Δw
εxx = ε = ε =
Δx yy Δy zz Δz
The shear strain is defined as the change in the initial right angle at point A
(Fig. 2.12).
π
γ xy = - Ψ = θ1 þ θ2
2
tan θ = θ
Δv Δu
tan θ1 = θ1 = and tan θ2 = θ2 =
Δx Δy
Δv Δu
γ xy = θ1 þ θ2 = þ radian
Δx Δy
36 2 Stress and Strain in Continuum
Since we define our strain at a point in the limit dimensions, zero volume of the
differential element Δx and Δy approach zero. Hence, we can define strain in
differential form as
∂u ∂v ∂u ∂v
Exx = Eyy = γ = þ
∂x ∂y xy ∂y ∂x
3 mm 1 mm
tanα = α = 0:01 rad tanβ =
ð300 mm þ 1 mmÞ ð400 mm þ 3 mmÞ
β = 0:0025 rad
h i h i
π π 400 π 300
γ xy = - π - - atan - 0:01 - π - - atan - 0:0025
2 2 300 2 400
= α þ β = 0:0125 rad
∂u ∂v 1 mm 3 mm
γ xy = þ = þ = 0:0125 rad
∂y ∂x 400 mm 300 mm
Let us assume the cantilever beam shown in Fig. 2.13 is supported at point A and
subjected to vertical and axial loads at point B. Point C in the beam will be subjected
to rigid-body displacement and rigid-body rotation, but it will not experience any
strain, because point C undergoes rigid body motion, only.
The point C undergoes displacement in x and y axes directions as well as rotation
around z axis. However, these motions will not lead to any strain or stress at point C
(Fig. 2.14).
We will start with defining spatial coordinate and material coordinate systems. Let’s
say f(u, t) is a function that defines the displacement of point A located at a particular
point A′ in space at time t.
This spatial description is called Eulerian description. Another description is
material field description, also called Lagrangian description in many textbooks. In
this case function g(r, t) defines the displacement of point A at time t regardless of
where the point A is located in space. This is called material or local description
(Figs. 2.15 and 2.16).
2.3 Deformation and Strain 39
In Fig. 2.17 we define a coordinate system that is shaped in the original shape of the
line AB. We place the origin of our local coordinate axis r at point A, where r = 0,
and at point B the value of the local coordinate is r = 1. By normalizing the
coordinate between zero and one, we can define the displacement of any point
between A and B with respect to initial coordinates easily. Of course, we can define
displacement of point A using the global Cartesian coordinate system x, y and time,
or we can use the material (local) coordinate system (which is usually called the local
coordinate system in finite element method terminology). Local coordinate r is in the
axis direction of the initial one-dimensional element shown in Fig. 2.18. The origin
of the local coordinate system is at point A, or it can be located at any point between
A and B. As a result, if point A moves with respect to the initial location due to
loading origin A also moves. Therefore coordinate r will define the relative dis-
placement of point A with respect to B at any time regardless of where the point A is.
We will use the terminology local coordinates rather than material coordinates.
For a two-dimensional case let x - y axes define our global (spatial) Cartesian
coordinate system and let r define the local coordinate system. Let dr define an
arbitrary infinitesimal line element vector (Fig. 2.18).
Where r is the material (local) coordinate axis. It is always in the natural direction
of the member, u is the displacement vector along the spatial axis x, and v is the
displacement vector along the spatial y axis.
du du dx du dy
= þ
dr dx dr dy dr
ð2:13Þ
dv dv dx dv dy
= þ
dr dx dr dy dr
It is important to point out that the left-hand side of this equation represents strain
components with respect to the initial length. Equation (2.13) can be written in a
matrix form:
2 3 2 3
du du du 2 dx 3
6 dr 7 6 dx dy 776 dr 7
6 7 6 76 7
6 7=6 6 7
6 7 6 76 7
4 5 6
4 dv
74 5
dv dv 5 dy
dr dx dy dr
Or
du
~
=J n
dr
where dudr is the column vector on the left-hand side containing relative displacement
vector components, ñ is the column vector on the right-hand side containing the
direction cosines, Cosα = dx dr , Cosβ = dr . J is called the displacement–gradient
dy
matrix or the Jacobian matrix, or the unit relative displacement matrix. It may be
considered as an operator which operates on a direction vector ~ n to yield the unit
relative displacement components for an infinitesimal line AB. This process may be
considered akin to the transformation from spatial coordinate system x, y to local
2.3 Deformation and Strain 41
For convenience, we will define shear strain Exy as half the decrease in the angle
γ xy to be able to define strain vectors as components of a second-order strain tensor ε.
1
Exy = γ
2 xy
As a result, the first matrix is the strain matrix, and the second matrix is the
rotation matrix:
1 ∂u ∂v 1 ∂v ∂u
Ωxy = - Ωyx = - = - Ωxy
2 ∂y ∂x 2 ∂x ∂y
In our stress and strain definitions, the first subscript defines the plane they belong
to and the second subscript defines the direction. Now we can show that the strain
matrix leads to deformations (shape change) and the rotation matrix leads to rigid
body motion only with no shape change (strain) (Fig. 2.19).
We can write earlier transformation equations between local (material) coordi-
nates and spatial coordinates to find the unit relative displacement of B with respect
to A.
42 2 Stress and Strain in Continuum
2 3
2 3 du 1 du dv 2 3
du þ dx
6 7
6 dr 7 6 dx 2 dy dx 76 dr 7
4 5=6 74 5
dv 4 1 dv du dv 5 dy
dr þ dr
2 dx dy dy
dx
Assume γ 1 is very small cos γ 1 = = cos 0 = 1,
dr
and
dy π
= cos = 0
dr 2
Thus for AB
du dv
Exx = , E =
dr yx dr
It is important to point out that these equations are only valid for small deforma-
tions. Similarly, for AD we can write
Assuming γ 2 is very small.
dx π dy
= cos = 0, = cos 0 = 1
dr 2 dr
2.3 Deformation and Strain 43
2 3
du
6 dr 7 Exx Exy 0 Exy
4 5= =
dv Eyx Eyy 1 Eyy
dr
du dv
= Exy = Eyy
dr dr
Displacement of corner point C to point C′ is plotted in detail Figs. 2.20, 2.21, and
2.22.
Small strain along local axis r can be written as
Exx dx þ γ xy dy cos θ þ Eyy dy sin θ
Err =
dr
1
cos 2 θ = ð1 þ cos 2θÞ
2
1
sin 2 θ = ð1 - cos 2θÞ
2
2 sin θ cos θ = sin 2θ
Fig. 2.23 Unit vectors in local (material) r-s coordinates. And spatial coordinates x, y
l = i cos θ þ j sin θ
m = - i sin θ þ j cos θ
In s direction,
CC0 m = Exx dx þ γ xy dy i þ Eyy dy j ∙ ½ - i sinθ þ jcosθ
= - Exx dx þ γ xy dy sinθ þ Eyy dycosθ
CC0 l
Err =
dr
and
CC 0 m
Ess =
ds
46 2 Stress and Strain in Continuum
We can also obtain the displacement of point C along s axis from the arc length.
Assuming small deformations and small strains
αdr = Eyy dy cos θ - Exx dx þ γ xy dy sin θ
dy dx dy
α = Eyy cos θ - Exx þ γ xy sin θ
dr dr dr
α = Eyy sin θ cos θ - Exx cosθ þ γ xy sin θ sin θ
Rotation of s axis (β) can be found by substituting θ þ π2 in the equation for α.
In Fig. 2.25, it is assumed that the shear strain is positive. Therefore α is positive
when it is counterclockwise, and β is positive when it is clockwise.
2.3 Deformation and Strain 47
γ rs = ðα - βÞ
= Eyy sin θ cos θ - Exx sin θ cos θ - γ xy sin 2 θ
- - Eyy sin θ cos θ þ Exx sin θ cos θ - γ xy cos 2 θ
γ rs = - 2 Exx - Eyy sin θ cos θ þ γ xy cos 2 θ - sin 2 θ
Using the derivation, we used for a two-dimensional case we can derive the
formulation for a three-dimensional case. In three dimensions, we will use u for
displacement along x, v for displacement along y, and w for displacement along
z axis. r axis is again our local (material) coordinate axis
2 3
8 9 ∂u ∂u ∂u 8 dx 9
>
> du >
> 6 ∂x > >
> >
>
> >
> 6 ∂y ∂z 7
7>> dr >
>
< dr >
> = 6 7>< > =
dv 6 ∂v ∂v ∂v 7 dy
=6 7
>
> dr >
> 6 ∂x ∂y ∂z 7> dr >
>
>
> >
> 6 7>> dz >
> >
: dw >
> ; 4 ∂w ∂w ∂w 5>: > ;
dr ∂x ∂y ∂z dr
J=E þ Ω
48 2 Stress and Strain in Continuum
where
2 3
∂u 1 ∂u ∂v
1 ∂u ∂w
6 þ þ
6 ∂x 2 ∂z ∂x 7
2 ∂y ∂x
6 77
6 1 ∂u ∂v ∂v 1 ∂v ∂w 7
E=6 þ þ 7
6 2 ∂y ∂x ∂y 2 ∂z ∂y 7
6 7
4 1 ∂u ∂w 1 ∂w ∂v ∂w 5
þ þ
2 ∂z ∂x 2 ∂y ∂z ∂z
2 3
1 ∂u ∂v 1 ∂u ∂w
6 0 - -
6 2 ∂y ∂x 2 ∂z ∂x 7
6 7
7
6 1 ∂u ∂v 1 ∂v ∂w 7
Ω=6 - - 0 - 7
6 2 ∂y ∂x 2 ∂z ∂y 7
6 7
4 1 ∂u ∂w
1 ∂v ∂w
5
- - - - 0
2 ∂z ∂x 2 ∂z ∂y
The variables of this coordinate system are the particle at time t, regardless of where
the point is in spatial coordinates. Note that not the position of the particle but the
particle itself is the variable.
material description to be the same. It is assumed that the origin of the coordinate
system is attached to the particle.
The spatial description is defined by the current location x, y, z of the object at time t,
which are considered the independent variables. This is the most common coordinate
description used in fluid mechanics. Because spatial description uses a fixed region
of space rather than a body of matter [a particle]. For most students, it is usually
difficult to visualize the different descriptions. Here we will quote an explanation
from Malvern (1969) “The spatial description is especially useful in fluid mechanics,
where we may observe a flow in a wind tunnel or a channel, a fixed region [window]
in space. We can imagine perfect instrumentation with which an observer records the
fluid velocity at a field point in space [a window] as a function of time. If this is done
for every point in the region, we have the spatial description of the velocity ~v as a
function of position x,y,z and time.”
A Comparison of Lagrangian and Eulerian Descriptions
“In computational solid mechanics, all particles in a body must be defined by their
location in space from the initial unstressed (undeformed) state to the final config-
uration. Therefore, Lagrangian (material) description is preferred. This approach
stands in contrast to Eulerian formulation, which is usually used in the analysis of
fluid mechanics problems, in which attention is focused on the motion of the fluid
through a stationary control volume (a fixed window). Considering the analysis of
solids and structures, a Lagrangian formulation usually represents a more natural and
effective analysis approach than an Eulerian formulation. For example, using an
Eulerian formulation of a structural problem with large displacements, new control
volumes have to be created (because the boundaries of the solid change continuously
[due to large displacements])” (Bathe, 1996). Of course, the transformation of
coordinates between these descriptions can easily be accomplished.
Of course, the transformation between these descriptions can be done with a
simple transformation formulation (Fig. 2.28).
The material time derivative is the time derivative with the material coordinates held
constant. If we know the complete motion of all particles, x = f(r, t), we can calculate
any particle’s velocity simply by taking the partial time derivative with material
coordinates r, s, t held constant.
50 2 Stress and Strain in Continuum
2
∂ ∂x ∂~v ∂
~v = Gðr, t Þ = f ðr, t Þ and a = f ðr, t Þ
∂t ∂t r ∂t r ∂t
The partial time derivative with the material coordinates r, s, t held constant is
completely different than the partial time derivative with spatial coordinates x, y,
z held constant. In spatial description velocity ~v = gðx, t Þ is given by, hence
∂~v ∂
gðx, t Þ
∂t x ∂t
~v = g½xðr, t Þ, t
where
iis particle
number. And by the chain rule
∂~vi ∂~vi ∂~vi ∂xj
∂t
= ∂t þ ∂xj ∂t (summation on repeated indice j only)
r x r
2.4 Kinematics of Continuous Medium 51
∂x
∂xj
Knowing that ~v = ∂t r
in Cartesian coordinates ~vj = ∂t r
, substituting it above
in the chain rule yields
ai = ð∂~
vi
∂t r
Þ = ð∂~
vi
∂t x
Þ þ ~vj ∂~
vi
∂xj
(summation on repeated indice j only)
This equation can be written in matrix notation as
d~v ∂~v
a= þ v~ grad~v
dt ∂t
where grad~v is the gradient with respect to the spatial coordinates, (x, y, z) which is
sometimes denoted ∇x~v to distinguish it from the gradient ∇r ~v which is with respect
to the material coordinates (r, s, t)
This material derivative can also be used for any other properties. For example, if
the spatial description of density is given by the scalar function ρ(x, t), then the rate
of change of the density in the neighborhood of the particle instantaneously at x is
given by
dρ ∂ρ
= þ v~ gradρ
dt r ∂t x
∂ρ ∂ρ
= þ ~vj
∂t x ∂xj
∂ρ
The first term ∂t x
gives the local rate of change of the density in the neigh-
∂ρ
borhood of the point (x, y, z), while the second term ~vj ∂x j
gives the convective rate of
change of the density in the neighborhood of a particle as it moves to a place with a
different density. Hence we can write the following equation as the material deriv-
ative operator:
d ∂
= þ ~v grad
dt ∂t
which can be applied to a scalar, a vector, or a tensor function of the spatial position
(x, y, z and t).
The material derivative has also been described very succinctly, [in MIT 2.20
Marine Hydrodynamics lecture notes], by quantifying the time rate of change of a
fluid property as it travels through a given flow field [flow window]. The material
derivative is the time derivative of a property x of a particle; hence it is Lagrangian.
However, we can write the material derivative in the Eulerian frame of reference.
The time rate of change of property, x defined in the Eulerian frame of reference
can be given by
52 2 Stress and Strain in Continuum
Fig. 2.26 Material (local), spatial, and referential coordinate system description
Writing Taylor’s series expansion of this variation and considering that ṽδt = δx
yields
∂f ðx,tÞ
f ðx þ ~vδt,t þ δtÞ = f ðx,tÞ þ δt þ δx ∇ fðx,tÞ þ higher order terms
∂t
From these two equations, we conclude that the material derivative of a property
f as experienced by a particle traveling with a velocity of ṽ is
df ∂f
= þ ~v ∙ ∇f
dt ∂t
d ∂
þ ~v ∇
dt ∂t
½Lagrangian ½Eulerian
2.5 Rate of Deformation Tensor and Rate of Spin Tensor 53
Where r and s are local (material) coordinate system axes that move with the
particle. O′ is the origin of the referential (Lagrangian) coordinate system. O is the
origin of the spatial description (Eulerian) coordinate system (Fig. 2.26).
The rate of deformation tensor, D, is also called the stretching tensor or velocity of
strain (strain rate) tensor. The spin tensor, W, is also called the vorticity tensor.
We realize that the rate of deformation tensor is defined differently by different
authors. We will subscribe to Malvern’s (1969) definition, and use his formulation
and derivation with few modifications (Fig. 2.27).
The relative velocity components of d~vi of point, A relative to point B can be
given by
∂~vi
d v~i = dxm
∂xm
or in matrix form
or in tensorial notation
d~v = L dx = dx LT
where L is Lim = ~vi,m LT im = ~vm,i m and i can have values of 1, 2, and 3.
The components of tensor L are spatial gradients of the velocity. Lim = ~vi,m can be
written as the sum of a symmetric tensor D, which we will call the rate-of-
54 2 Stress and Strain in Continuum
L=D þ W
1 1
D = L þ LT and W = L - LT
2 2
or in Cartesian coordinates
1 1
Dim = ð~vi,m þ ~vm,i Þ and W im = ð~vi,m - ~vm,i Þ
2 2
W spin matrix is different from the Ω rotation matrix. They should not be
confused with each other. The same thing is true, for the rate of deformation tensor
D which is not a strain-rate matrix. The rate of deformation tensor, D, allows us to
define the relative velocity of point A with respect to point B. As a result, Eq. (2.14)
can be given by
∂~vi d ∂u
= ð iÞ
∂xj dt ∂xj
2.5 Rate of Deformation Tensor and Rate of Spin Tensor 55
The time derivative of the strain tensor is with respect to local coordinates
(material coordinates-Lagrangian description), while the rate of deformation tensor
is defined with respect to spatial coordinates x, y, z (Eulerian description). For small
displacement and small strain problems, both tensors are the same. However, D, the
rate of deformation tensor is necessary for large displacement and large strain
problems.
Where i = 1, 2, 3, j = 1, 2, 3. It is important to point out that for indicial notation
axes are represented by x, or by xi. However, when convenient x, y, z spatial
coordinates and r, s, t are also used throughout the book.
L - L0
dEE =
L0
in incremental form as
Z L
dL E 1
dEE = E = dL
L0 L0 L0
However, if we use the instantaneous length (new length after deformation) in the
denominator, we obtain the true strain (natural or logarithmic strain):
dL
dεT =
L
Here the increment of true strain is defined by the change in length per unit of
instantaneous (new) length. To find the total true strain, we can integrate the
increment
ZL ZL
dL L L0 þ ΔL
dE = E =
T T
= ln = ln = ln 1 þ EE
L L0 L0
L0 L0
56 2 Stress and Strain in Continuum
where ln is the natural logarithm. The relation between true strain and engineering
strain can also be written as
eE = 1 þ εE or εE = eε - 1
T T
X1
xk x2 x3
ex = =1 þ x þ þ þ ⋯...
k=0
k! 2! 3!
As a result, we can write the following relation between true strain and engineer-
ing strain:
2 3
ET εT
εT þ þ þ ⋯:: = εE
2! 3!
In three dimensions for the small strain, this can be generalized as follows:
dεTij = Dij dt
There are many definitions of finite (large) strain. They can be categorized into two
classes:
1. Defining strain with respect to undeformed original configuration and geometry.
This approach is called Lagrangian formulation.
2. Defining strain with respect to deformed configuration and geometry. This
approach is called the Eulerian formulation.
Large (finite) strain formulation is the easiest to define in terms of a deformation–
gradient tensor, F. However, the deformation gradient tensor includes the strain
tensor and the rotation tensor. As a result, this can make it tricky to be used in
material modeling. On the other hand, strain tensor only includes strains where x, y,
z are global Cartesian, coordinates (referential description) and r, s, t is the local
coordinate (material description) system (Fig. 2.28).
In Fig. 2.28 the vector joining point A in the undeformed location and deformed
location is the displacement vector. The displacement vector can be defined in the
Lagrangian description as follows:
where c(t) is the rigid body motion. In the finite element method, the origin of the
material (local) coordinate system is usually in the center or at a corner of an
element. However, for some special-purpose elements, it is placed at a special
location, but always inside the element.
The partial derivative of the displacement vector with respect to material coordi-
nates yields the material displacement gradient tensor,
∂ui ∂x
= i - δik = F ik - δik
∂r ∂r k
∂uj ∂rj
= δjk - = δjk - F jk- 1
∂x ∂xk
The deformation gradient tensor, F, describes the stretches and rotations that the
material point has undergone from time t0 to t.
The deformation equation for a point can be defined in the local (material)
coordinate system and can be mapped onto the spatial coordinate system. For
example, we can assume the following relationship (Figs. 2.28 and
2.29), between the material and spatial coordinates,
x = xA þ ðxB - xA Þr
y = yA þ ðyB - yA Þr
we will derive the finite strain formulation using Malvern’s (1969) formulation and
notation with few changes if any. We will define deformation–gradient tensor F as a
tensor whose components are the partial derivatives with respect to material coordi-
nates in the Lagrangian formulation. F, deformation–gradient tensor is defined in
terms of the undeformed configuration. x, y, z spatial coordinates are represented by
bold x. Local (material) coordinates r, s, t are represented by bold r.
The relation between the spatial and local (material) description deformations is
given by employing the deformation–gradient tensor F, which is defined as the
tensor whose rectangular Cartesian components are the partial derivatives ∂xi=∂ri and
which operates on an arbitrary infinitesimal material vector dr located at r, to
associate it with a vector dx located at point x as follows:
dx = F dr or dx = dr FT
∂xi
fdxi g = dr j
∂r j
2.6 Finite Strain and Deformation 59
Therefore
2 3
∂x ∂x ∂x
6 ∂r ∂s ∂t 7
6 7
∂xi 6 ∂y ∂y ∂y 7
F ij = =6
6 ∂r
7
∂r j 6 ∂s ∂t 7
7
4 ∂z ∂z ∂z 5
∂r ∂s ∂t
In Fij the first index identifies the row, and the second index identifies the column.
In Fig. 2.28 we see that the deformation is the difference between the position
vector before and after deformation, where c is the rigid body motion.
x=r þ u
2 3
∂u ∂u ∂u
6 ∂r ∂s ∂t 7
6 7
6 ∂v ∂v ∂v 7
F=6
6 ∂r
7þI
6 ∂s ∂t 7
7
4 ∂w ∂w ∂w 5
∂r ∂s ∂t
dr = F - 1 dx
∂r i
fdr i g = dxj
∂xj
From the last equation, we can infer that the spatial deformation gradient, F-1 at
point x is the inverse of the material deformation–gradient tensor F,
Therefore,
F F-1 = I
can be written between the material deformation–gradient tensor and the spatial
deformation–gradient tensor, where I is the unit matrix.
60 2 Stress and Strain in Continuum
Strain is a human construct. It is not a real physical quantity that can be measured
directly, like displacement or time. It is defined to be able to formulate the response
of a continuum using mathematics. Because strain is not a direct physical quantity, it
can be defined in many ways. For large strain formulation the strain will be defined
as one-half the change in squared length of the local (material) vector AB as follows
(Fig. 2.30):
" 2 # " # " #
1 AB j - jAB 1 2drAB þ dr
0 2 2 2 2
1 jAB þ dr j - jABj dr 1 dr 2
2
= 2
= 2
= þ
2 jABj 2 jABj 2 jABj AB 2 AB
1 2
Axial strain = Er þ E
2 r
where dS = AB is the initial undeformed length and ds = AB′ is the deformed length
(Fig. 2.30). In indicial notation the last equation can be given by
in indicial notation
For the quadrilateral element shown above material (local) coordinates of the
nodes [1, 2, 3, 4] are given as follows:
1
Ni = ð1 þ r i r Þð1 þ si sÞ
4
where i is the node number and ri, and si are the local coordinates of node i. Hence,
we can define the relation between spatial and material coordinates by
Xi = 4 Xi = 4
xðr, sÞ = N x , yðr, sÞ =
i=1 i i i=1
N i yi
1 1
xðr, sÞ = ð1 þ r Þð1 þ sÞ 4 þ 0 þ 0 þ ð1 þ r Þð1 - sÞ4 = 2 þ 2r
4 4
1 1 3 3
yðr, sÞ = ð1 þ r Þð1 þ sÞ 3 þ ð1 - r Þð1 þ sÞ 3 þ 0 þ 0 = þ s
4 4 2 2
1 1
xðr, sÞ = ð 1 þ r Þ ð 1 þ sÞ 2 þ 0 þ 0 þ ð 1 þ r Þ ð 1 - sÞ 2 = 1 þ r
4 4
1 1 1 3 r
yðr, sÞ = ð1 þ r Þð1 þ sÞ 3 þ ð1 - rÞð1 þ sÞ 2 þ 0 þ ð1 þ r Þð1 - sÞ 1 = þ þ s
4 4 4 2 2
The non-zero off-diagonal terms reflect shear, and 1.0 diagonal terms reflect zero
normal strain.
2.6 Finite Strain and Deformation 63
1 1 1 3 1
xðr, sÞ = ð1 þ r Þð1 þ sÞ 3 þ ð1 - r Þð1 þ sÞ 1 þ 0 þ ð1 þ r Þð1 - sÞ2 = þ r þ s
4 4 4 2 2
1 1 1 3 r
yðr, sÞ = ð1 þ r Þð1 þ sÞ 3 þ ð1 - r Þð1 þ sÞ 2 þ 0 þ ð1 þ r Þð1 - sÞ 1 = þ þ s
4 4 4 2 2
The non-zero off-diagonal terms reflect shear, and 1.0 diagonal terms reflect zero
normal strain.
Green deformation tensor C and Cauchy deformation tensor B-1 are related to the
strain tensors. Instead of giving one-half the change in squared length per unit
squared initial length, Green deformation tensor, C, refers to the undeformed
configuration, and its tensor entries provide the new squared length (ds)2 of the
element into which the given initial vector dr is deformed, while the Cauchy
deformation tensor, B-1, gives the initial squared length (dS)2 of a vector dx defined
in the deformed configuration.
As a result, we can establish the Green deformation, C tensor, C, as follows:
ðdsÞ2 = dr C dr
64 2 Stress and Strain in Continuum
or in indicial notation as
ðdsÞ2 = dr i Cij dr j
ðdSÞ2 = dx B - 1 dx
or in indicial notation as
-1
ðdSÞ2 = dxi Bij dxj
2E = C - I or 2Eij = C ij - δij
In the same manner, comparing Eulerian strain tensor E and Cauchy deformation
tensor B-1 we observe that
-1
2E = I - B - 1 or 2E ij = δij - Bij
There is no special reason for defining the Cauchy deformation tensor as B-1. It
could be named just about any character. However, we are following the notation
used by Cauchy in 1827. In the formulation given above, both the Green deforma-
tion tensor, C, and Cauchy deformation tensor, B-1, reduce to unit tensor when
strain is zero (Malvern, 1969).
ðdsÞ2 = dx dx
where dx is the vector in the deformed configuration. On the other hand relation
between the dx and the undeformed vector dr in the local (material) coordinates is
given by
dx = F dr
2.6 Finite Strain and Deformation 65
ðdSÞ2 = dr dr
Let’s compare these later equations with our strain definition equations where
Green deformation tensor C is given by
ðdsÞ2 = dr C dr
C = FT F
ðdSÞ2 = dx B - 1 dx
∂xk ∂xk
C ij =
∂r i r j
and
∂r k ∂r k
Bij- 1 =
∂xi ∂xj
2E = C - I
Since C = FT F
1 T
E= F F-I
2
or indicial notation
1 ∂xk ∂xk
Eij = - δij
2 ∂r i ∂r j
2E = I - B - 1
h T i
Since B-1 = F-1 F-1
h T i
1
E = I - F-1 F-1
2
In indicial notation
1 ∂r ∂r k
E⋆
ij = δ - k
2 ij ∂xi ∂xj
Both Green deformation tensor C and Lagrangian strain tensor E are symmetric
tensors. Therefore, they both have three eigenvalues (principal values) in Eigen
directions (principal directions). Also, principal directions of C and E coincide,
because in principal directions off-diagonal terms are zero in Green deformation
tensor C. Then they have to be zero in strain tensor E. Of course, the same arguments
can be made between Cauchy deformation tensor B-1 and Euler strain tensor E.
However, principal (Eigen) directions of [B-1 and E] and [C and E] will not
coincide.
For the sake of simplicity, we will assume that material (local) coordinate axes and
spatial coordinate axes are parallel (Fig. 2.31).
xi = spatial coordinate, ri = local (material) coordinate.
Displacement ui is expressed in local (material) coordinates, ui = ui(r, s, t, time).
2.6 Finite Strain and Deformation 67
1 ∂xk ∂xk
Eij = - δij
2 ∂r i ∂r j
or
" 2 2 #
2
∂u 1 ∂u ∂v ∂w
E11 = þ þ þ
∂r 2 ∂r ∂r ∂r
Similarly
" 2 2 #
2
∂v 1 ∂u∂v ∂w
E 22 = þ þ þ
∂s 2 ∂s∂s ∂s
" 2 2 #
2
∂w 1 ∂u ∂v ∂w
E 33 = þ þ þ
∂t 2 ∂t ∂t ∂t
68 2 Stress and Strain in Continuum
or
1 ∂u ∂v 1 ∂u ∂u ∂v ∂v ∂w ∂w
E12 = þ þ þ þ
2 ∂s ∂r 2 ∂r ∂s ∂r ∂s ∂r ∂s
1 ∂u ∂w 1 ∂u ∂u ∂v ∂v ∂w ∂w
E13 = þ þ þ þ
2 ∂t ∂r 2 ∂r ∂t ∂r ∂t ∂r ∂t
1 ∂v ∂w 1 ∂u ∂u ∂v ∂v ∂w ∂w
E 23 = þ þ þ þ
2 ∂t ∂s 2 ∂s ∂t ∂s ∂t ∂s ∂t
1 1 1
x= ð1 þ rÞð1 þ sÞ 4 þ ð1 - r Þð1 þ sÞð1Þ þ ð1 - rÞð1 - sÞð1Þ
4 4 4
1
þ ð1 þ r Þð1 - sÞ ð3Þ
4
1 1 1
y = ð1 þ rÞð1 þ sÞ ð3:5Þ þ ð1 - r Þð1 þ sÞ ð3Þ þ ð1 - r Þð1 - sÞ ð1Þ
4 4 4
1
þ ð 1 þ r Þ ð 1 - s Þ ð 1Þ
4
1 T
E= F ∙F-I
2
X4 X4 X4 X4
x= xN
i=1 i i
y= yN
i=1 i i
u= uN
i=1 i i
v= vN
i=1 i i
∂x X4 ∂N i ∂y X4 ∂N i
= x
i = 1 i ∂r
= y
∂r ∂r i = 1 i ∂r
∂x X4 ∂N i ∂y X4 ∂N i
= x
i = 1 i ∂s
= y
∂s ∂s i = 1 i ∂s
1 ð10 þ 2sÞ ð2 þ 2r Þ
F=
8 ð 1 þ sÞ ð9 þ r Þ
1 T
E= F F-I
2
We can find strain at any point by substituting the local coordinates of the point
r = ri, s = si, for example, if we want to know the strain at node 1 where r = 1 and
s = 1, we substitute these coordinates for r and s.
d d d
ðdsÞ2 - ðdSÞ2 = ð2dr E drÞ
dt dt dt
dr and dS are constant with respect to time because they represent the initial
dimensions. Therefore, we can write
d dE
ðdsÞ2 = 2dr dr
dt dt
2.6 Finite Strain and Deformation 71
d
ðdsÞ2 = 2dx D dx
dt
Also, we have already shown that dx = F dr. Hence, we write the following
relation:
d
ðdsÞ2 = 2ðdrFT Þ D ðF drÞ
dt
= 2dr ðFT D FÞ dr
dE
2dr dr - 2dr FT D F dr = 0
dt
dE T
2dr - F D F dr = 0
dt
dE
= FT D F
dt
dE ij ∂xm ∂x
= Dmn n
dt ∂r i ∂r j
The relationship between Green deformation rate tensor C and strain rate tensor
E_ can be given by
dC dE
=2
dt dt
dE
=D
dt
in indicial notation
dE ij
= Dij
dt
72 2 Stress and Strain in Continuum
or dx = F dr
∂xi
dxi = dr j
∂r j
dv = L dx
Hence
∂~vi
Lij = ~vi,j =
∂xj
∂xi
If we write a time derivative of F = ∂r j
we obtain
_F = d ∂xi = ∂_xi = ∂_xi ∂xm
dt ∂r j ∂r j ∂xm ∂r j
_F = ∂vi ∂xm and since ∂vi = d ∂xi ∂r j
∂xm ∂r j ∂xm dt ∂r j ∂xm
F_ = L F and L = F_ F - 1
Relations between Euler strain rate tensor E, the rate of deformation tensor D,
and the spatial gradients of velocity tensor L can be given by
E_ = D - E L þ LT E
When strain is finite (large), the symmetric and skew-symmetric parts of the
displacement–gradient matrix, J, cannot be represented by additive decomposition
of a pure strain matrix and a pure rotation matrix. However, other types of
2.8 Compatibility Conditions in Continuum Mechanics 73
multiplication decompositions are possible, where one of the two tensors will
represent a rigid body rotation and the second tensor will be a symmetric positive-
definite.
When the displacements are known, the strain field can easily be calculated. For the
case of small strain, relations are given by
1 ∂ui ∂uj
Eij = þ
2 ∂r j ∂r i
When displacements and strains are small, the distinction between referential and
spatial descriptions is negligible for small strain calculations. Assuming small strain
and small deformations, we can define small strain in spatial x, y, z coordinate system
as follows:
∂u
Exx =
∂x
∂v
Eyy =
∂y
∂w
Ezz =
∂x
1 ∂u ∂v
Exy = þ
2 ∂y ∂x
1 ∂u ∂w
Exz = þ
2 ∂z ∂x
1 ∂v ∂w
Eyz = þ
2 ∂z ∂y
2
∂ E
If we assume that ∂x∂yxy exists
" #
2
∂ Exy 3
1 ∂ u
3
∂ v
= þ
∂x∂y 2 ∂2 y∂x ∂x2 ∂y
we can also write second derivatives of Exx and Eyy and sum them up
2
∂ Exx ∂ Eyy
2 3 3
∂ u ∂ v
þ = 2 þ 2
∂y 2 ∂x 2 ∂y ∂x ∂x ∂y
2.8 Compatibility Conditions in Continuum Mechanics 75
Hence
2 2
∂ Exx ∂ Eyy ∂ Exy
2
þ =2
∂y 2 ∂x 2 ∂x∂y
We can repeat this process for Exz and Eyz. Then we have
" #
2 2 2
∂ Exy ∂2 Ezx 1 ∂ ∂u ∂v ∂ ∂w ∂u
þ = þ þ þ
∂x∂z ∂y∂x 2 ∂x∂z ∂y ∂x ∂y∂x ∂x ∂z
3 3 3 3
1 ∂ u ∂ v ∂ w ∂ u
= þ þ þ
2 ∂x∂y∂z ∂x2 ∂y ∂x2 ∂y ∂x∂y∂z
2 2
1 ∂ ∂u ∂ ∂v ∂w
= 2 þ 2 þ
2 ∂y∂z ∂x ∂x ∂y ∂y
" #
2
1 ∂ Exx ∂ Eyz
2
= 2 þ :
2 ∂y∂z ∂x2
∂ ∂Exy ∂ 1 ∂Eyz
2
∂ Exx
= þ -
∂y∂z ∂x ∂z ∂y 2 ∂x
A similar process can be repeated for other shear strain pairs Eyz and Exy, Eyz and
Exz.
These equations prove the integrability of the strain field. Based on this derivation
the following six St. Venant’s compatibility equations can be given:
2 2
∂ Exx ∂ Eyy ∂ Exy
2
þ -2 =0
∂y 2 ∂x 2 ∂x∂y
2 2
∂ Eyy ∂2 Ezz ∂ Eyz
þ -2 =0
∂z2 ∂y 2 ∂y∂z
2 2 2
∂ Ezz ∂ Exx ∂ Ezx
þ -2 =0
∂x2 ∂z2 ∂z∂x
2
∂ Exx ∂ ∂Eyz ∂Ezx ∂Exy
- þ - þ þ =0
∂y∂z ∂x ∂x ∂y ∂z
2
∂ Eyy ∂ ∂Eyz ∂Ezx ∂Exy
- þ - þ =0
∂z∂x ∂y ∂x ∂y ∂z
∂ ∂Eyz ∂Ezx ∂Exy
2
∂ Ezz
- þ þ - =0
∂x∂y ∂z ∂x ∂y ∂z
76 2 Stress and Strain in Continuum
Cauchy stress tensor is defined in the spatial coordinates x, y, z (in the deformed
configuration). Euler strain is also defined in spatial position in the deformed
configuration. Therefore, using the Euler strain tensor definition with the Cauchy
stress tensor is appropriate.
On the other hand, the Lagrangian formulation is defined in material coordinates
r, s, t (in the undeformed configuration) or any other reference state configuration
that does not change. In this case, it is also more appropriate to define the Piola–
Kirchhoff stress tensor in local (material) reference coordinates. Two Piola–Kirch-
hoff stress definitions are using the undeformed (or any other stationary reference)
state as the basis for formulation.
The First Piola–Kirchhoff stress tensor is also called the Lagrangian stress tensor.
The derivatives are defined with respect to material (local) coordinates. However,
this stress tensor is not symmetric, even when there is no distributed body moment
(non-polar case) in the system.
Stress is defined as a force per unit undeformed area. Keep in mind that stress is a
human construct not a directly measurable physical quantity. Thus, it is possible to
define it as we please. The first Piola–Kirchhoff stress tensor is defined by the
following equation (Fig. 2.33):
dF
n0 σ 0 =
dA0
where σ 0 is the first Piola–Kirchhoff stress tensor and dF is the actual force acting
on the deformed configuration dA; however, the force is divided by the initial
undeformed area dA0.
2.9 Piola–Kirchhoff Stress Tensors 77
~ = F - 1 dF
dF
Remember that material (local) coordinates and spatial coordinates are also
related by
dr = F - 1 dx
~
dF
ðn0 σ~Þ =
dA0
F - 1 dF
ðn0 σ~Þ =
dA0
78 2 Stress and Strain in Continuum
At the same time, dF can be also defined in terms of Cauchy stress tensor as
T
F - 1 ðn σ ÞdA ðn σ Þ F - 1 dA
ðn0 σ~Þ = =
dA0 dA0
ρ0 - 1 T T
σ~ = F σ F - 1 and σ~ = σ 0 F - 1
ρ
Details of the long derivations for these relations are provided by Malvern (1969).
The second Piola–Kirchhoff stress tensor is usually preferred in finite-strain
elasticity problems.
Newtonian equation of motion in undeformed (reference) state using the first
Piola–Kirchhoff stress tensor is given by
Z Z Z
d2 x
n0 σ dA þ
0
ρ0 b0 dV = ρ0 dV
dt 2
A0 V0 V0
∂σ 0ij d2 x
þ ρ0 b0j = ρ0 2i
∂r i dt
where ri is the material (reference) coordinate system axis. Since the relation
between the first and second Piola–Kirchhoff stress tensors is given by
σ 0 = σ~ FT
or in indicial notation
d 2 xk
½˜σij xk,j i þ ρ0 b0k = ρ0
dt 2
Using the divergence theorem (Gauss’s theorem) the integral over a closed
surface can be converted to an integral over a volume bounded by the closed surface
by the following equation:
Z Z
n × ~v dA = — × ~vdV
A V
Equating the time rate of change in mass to the rate of inflow through area A
yields
Z Z
∂ρ
dV = - — ðρ~vÞ dV
∂t
V V
The negative on the right-hand side is because normal of the surface is defined as
positive outward. Therefore, inflow is in the negative normal direction.
Hence, we can write
Z
∂ρ
þ — ðρ~vÞ dV = 0
∂t
V
Here the integral must be zero for an arbitrary volume. Therefore the integrand
must be equal to zero.
∂ρ
þ — ðρ~vÞ = 0
∂t
When the material is incompressible, we assume that density does not change over
time,
dρ
=0
dt
2.13 Conservation of Momentum Principle 81
Therefore
must be satisfied. It is important to point out that in the theory of plasticity during
plastic deformation (flow), some materials, like metals, are considered to be
incompressible.
Momentum, p, is defined by
Z
p= ρ ~v dV
where ρ is mass density, ~v is velocity vector, and dV is the volume. The time rate of
change of the total momentum of a portion of the particles can be given by the
material derivative of the integral, which is defined as the time rate of change of any
quantity for a portion of material. Hence, the time rate of change of the total
momentum is
Z
dp d
= ρ~vdV
dt dt
External forces acting on the particles can be given by the summation of surface
tractions and body forces
Z Z
σ ðnÞ dA þ ρbdV
A V
ðnÞ
σ i = σ ij nj
Or in Cartesian coordinates
σ ðxnÞ = σ xx nx þ σ xy ny þ σ xz nz
σ ðynÞ = σ yx nx þ σ yy ny þ σ yz nz
σ ðznÞ = σ zx nx þ σ zy nz þ σ zz nz
2.13 Conservation of Momentum Principle 83
where nx,nx,nx, are direction cosines between the normal to the arbitrary plane and
spatial coordinates. In matrix notation, this can also be written as
n o
σ ðnÞ = fng½σ
2 3
n oT n oT σxx σxy σxz
ðnÞ ðnÞ ðnÞ ðnÞ 6 7
σ = σx σy σz fng = fnx ny nz g ½σ = 4 σyx σyy σyz 5
σzx σzy σzz
[σ] is the second-order Cauchy stress tensor, which is a linear vector function.
Using the divergence theorem, we can write the following conversion:
Z Z
ðnÞ ∂σ ji
σ i dA = dV
A V ∂xj
For any arbitrary volume, we can write Cauchy’s equation of motion in Newto-
nian mechanics as follows:
∂σ ji d~v
þ ρbi = ρ i
∂xj dt
Of course, for static equilibrium right-hand side of this equation is equal to zero.
Hence
∂σ ji
þ ρbi = 0
∂xj
84 2 Stress and Strain in Continuum
In any other Cartesian coordinate system rotated with respect to the original
coordinate system stress tensor can be defined using the tensor transformation
equations.
½σ = ½N T ½σ ½N
where [N] is the matrix of direction cosines nik = cos ðxr , xi Þ of the angles between
the axes of the original and rotated coordinate systems. Superscript T is used to
denote transpose. A bold character indicates a matrix or a vector. Sometimes
brackets are used to distinguish a matrix from a vector.
The time rate of change of the total moment of momentum for a collection of masses
is equal to the vector sum of the moments of the external forces acting on these
masses. Assuming there are no distributed moment couples, we can write
Z Z Z
d ðnÞ
ðr × ρ~vÞdV = r×σ dA þ ðr × ρbÞdV
dt
v A V
a × b = eijr aj br = ejri aj br
ðnÞ
σ i = σ ij nj
We can also transform the surface integral to a volume integral using the relation
given by the divergence (Gauss) theorem
Z Z
∂
eijk nj~vk dA = eijk ð~vk ÞdV
∂xj
A V
2.14 Conservation of Moment of Momentum Principle 85
where ρ is the mass density of the material per unit volume. We can transform the
moment of momentum principle into the following form:
Z Z
d ∂ xj σ kr
eijr x ~v ρdV = eijr þ xj ρbr dV
dt j r ∂xk
V V
The time derivative of displacement is velocity; hence we can substitute ~vi = dxi
dt
Z Z
d~v ∂σ kr
eijr ~vj~vr þ xj r ρdV = eijr xj þ ρbr þ σ kr δjk dV
dt ∂xk
V V
eijr ~vj~vr = 0 because ~vj~vr is symmetric for indices jr, while eijr is anti-symmetric.
Knowing that Cauchy’s equation of motion is given by
∂σ kr d~v
þ ρbr = ρ r
∂xk dt
which is equal to
Z
eijr σ jr dV = 0
V
However, the last equation is equal to zero. Because for an arbitrary volume, V, at
each point
eijr σ jr = 0
for
• i = 1: σ 23 = σ 32
• i = 2: σ 31 = σ 13
• i = 3: σ 12 = σ 21
This proves that the stress tensor is symmetric when there are no couple (moment)
stresses. This proof also establishes the symmetry of the stress tensor in general
without any assumption of equilibrium or uniformity of the stress distribution.
86 2 Stress and Strain in Continuum
L=T -V
where T is the kinetic energy of the system, and V is the potential energy of the
system. The minus sign in the equation is counterintuitive to the total energy of the
system. So Lagrangian is not the total energy. There is no unique Lagrangian for all
physical systems. Any function which yields the equation of motion can be defined
as a Lagrangian. Also, for dissipative forces, another function must be introduced
alongside L, if Newtonian mechanics is used that dissipative force is usually an
empirical function. In unified mechanics theory, we will introduce the thermody-
namic state index into the Lagrangian; hence, there is no need for an empirical
dissipation/degradation constraint functional alongside the Lagrangian.
Lagrangian mechanics can only be applied to systems whose constraints, if any, are all
holonomic. Three examples of nonholonomic constraints are, when the constraint equations
are nonintegrable when the constraints have inequalities, or with complicated
non-conservative forces like friction. In contrast, the unified mechanics theory does not
have this restriction. Unified mechanics theory applies to all systems, holonomic and
nonholonomic. Because there are no constraint functionals in the unified mechanics theory.
Nonholonomic constraints require special treatment, and one may have to revert to Newto-
nian mechanics, or use other methods (Hand & Finch, 2008).
The Lagrange equations of the first kind for a single particle are given by
X n
∂L d ∂L ∂f i
- þ λi =0
∂x dt ∂x_ i
∂x
where r is the vector of a point in the material configuration space. The spatial
position coordinates x, y, z, can be represented as functions of the material (gener-
alized) coordinates r, s, t. . . . and time. Generalized coordinates can be more than
3 that is the reason for additional dots. However, for a simple force deformation
mechanics problem, they can be the minimum r, s, t.
Example 2.7
For a mass m attached to the tip of a linear-elastic 1-dimensional spring, Fig. 2.7E,
obtain the equilibrium equation using the Lagrangian mechanics.
For this problem, we can use the Euler-Lagrange equation. Assume that gener-
alized coordinates and position coordinates have the same origin and we have small
deformations. Then we can write
∂L d ∂L
- =0
∂x dt ∂x_
1 1
L=T -V = m_x2 - kx2
2 2
- kx - m€x = 0
f = λc_xx
X
m
88 2 Stress and Strain in Continuum
- kx - m€x - c_x = 0
It is important to point out that the constraint function is not unique but empirical
based on test data.
We do not intend to make this section comprehensive, but just introductory. The
concept has been discussed in detail by Morin (2007). We intend to quote a summary
from Morin (2007).
Let’s define a quantity called the action, S, with dimensions of (energy) × (time)
and given by the following equation:
Z t2
S= Ldt
t1
where L is the Lagrangian and dt is the time interval. Let’s assume we are dealing
with a simple one-dimensional problem, and x is the only coordinate.
Let’s pose the following question: What function x(t) yields a stationary value of
S. A stationary value is a local minimum, the minimum, or a saddle point. The
answer is given by the following theorem.
Theorem If the function x0(t) yields a stationary value of S, then the following
relation must be satisfied.
d ∂L ∂L
=
dt ∂_x0 ∂x0
It is assumed that we are considering only functions that have their endpoints
fixed. Meaning
This principle is equivalent to F = ma because the above theorem shows that if and only if
we have a stationary value of S, then the Euler-Lagrange equation holds. And Euler-
Lagrange equations are equivalent to F = ma. Therefore. “Stationary action” is equivalent
to F = ma (Morin, 2007).
References 89
Example 2.8
Consider a ball being thrown from a balcony to the ground. There are many possible
paths the ball can take as shown in Fig. 2.8E. The vertical distance of the ball from
the balcony at any time is defined by x(t). Each path has a different function x(t). We
can substitute each one of the x(t) functions in Lagrangian to yield an S value.
Z t2
S= Ldt
t1
However, only one of the paths xðt Þ = 12 gt 2 will yield a stationary value of the
action, S.
Here we conclude the discussion on basic concepts of continuum mechanics.
References
[In Latin] Isaac Newton’s Philosophiae Naturalis Principia Mathematica: the Third edition (1726)
with variant readings, assembled and ed. by Alexandre Koyré and I Bernard Cohen with the
assistance of Anne Whitman (Cambridge, MA, 1972, Harvard UP). Among versions of the
Principia online: Volume 1 of the 1729 English translation is available as an online scan; limited
parts of the 1729 translation (misidentified as based on the 1687 edition) have also been
transcribed online.
Bathe, K. J. (1996). Finite element procedures. Prentice-Hall.
Beatty, M. F. (2006). Principles of engineering mechanics volume 2 of principles of engineering
mechanics: Dynamics-the analysis of motion (p. 24). Springer. ISBN 0-387-23704-6.
Cohen, I. B. (1967). Newton’s Second Law and the concept of force in the principia. In The annus
Mirabilis of Sir Isaac Newton 1666–1966. The MIT Press.
Cohen, I. B. (1995). Science and the Founding fathers: Science in the political thought of Jefferson,
Franklin, Adams, and Madison (p. 117). W.W. Norton. ISBN 978-0-393-24715-2.
Cohen, I. B. (P. M. Harman, & A. E. Shapiro, Eds.). (2002). The investigation of difficult things:
Essays on Newton and the history of the exact sciences in honor of D.T. Whiteside (p. 353).
Cambridge University Press. ISBN 0-521-89266-X.
Fairlie, G., & Cayley, E. (1965). The life of a genius (p. 163). Hodder and Stoughton.
90 2 Stress and Strain in Continuum
Galili, I., & Tseitlin, M. (2003). Newton’s first law: Text, translations, interpretations and physics
education. Science & Education., 12(1), 45–73. Bibcode:2003Sc&Ed..12...45G. https://doi.org/
10.1023/A:1022632600805
Hand, L. N., & Finch, J. D. (2008). Analytical mechanics. Cambridge University Press.
Hellingman, C. (1992). Newton’s third law revisited. Physics Education, 27(2), 112–115.
Bibcode:1992PhyEd..27..112H. https://doi.org/10.1088/0031-9120/27/2/011
Hobbes, T. (1651). Leviathan. Oxford University Press.
Kleppner, D., & Kolenkow, R. (1973). An introduction to mechanics (pp. 133–134). McGraw-Hill.
ISBN 0-07-035048-5.
Lubliner, J. (2008). Plasticity theory (Rev. ed.) (PDF). Dover Publications. ISBN 0-86-46290-0.
Archived from the original (PDF) on 31 March 2010.
Malvern, L. (1969). E introduction to the mechanics of continuous medium. Prentice-Hall.
Morin, D. (2007). Introduction to classical mechanics: With problems and solutions. Cambridge
University Press.
Newton, I. (1999). The principia. A new translation by I. B. Cohen and A. Whitman, University of
California Press, Berkeley.
Newton, I. (1687). Philosophiæ Naturalis Principia Mathematica (Newton’s personally annotated
1st edition)
Newton, I. Principia, Corollary III to the laws of motion
Plastino, A. R., & Muzzio, J. C. (1992). On the use and abuse of Newton’s second law for variable
mass problems. Celestial Mechanics and Dynamical Astronomy. Netherlands: Kluwer Aca-
demic Publishers, 53(3), 227–232. Bibcode:1992CeMDA..53..227P.
Resnick, & Halliday. (1977). Physics (3rd ed., pp. 78–79). Wiley. Any single force is only one
aspect of mutual interaction between two bodies.
Resnick, R., Halliday, D., & Krane, K. S. (1992). Physics (Vol. 1, 4th ed., p. 83). Wiley.
The Mathematical Principles of Natural Philosophy. (1687). Encyclopædia Britannica
Thornton, M. (2004). Classical dynamics of particles and systems (5th ed., p. 53). Brooks/Cole.
ISBN 0-534-40896-6.
Wikipedia Sir Isaac Newton webpage. (2001). https://en.wikipedia.org/wiki/Isaac_Newton
Chapter 3
Thermodynamics
When we treat a continuum system (solid or fluid), we will assume that is a closed
system. This means that for a given period (time increment) there is no exchange of
matter or energy with its surroundings. This can be considered as a fixed control box
in space and time. The control box is insulated, to be able to satisfy thermodynamic
conservation laws.
All macroscopic systems have “memory.” This memory is always with respect to
their stress-free state, which is the minimum energy state. All macroscopic systems
tend to evolve toward a minimum energy point, which is defined by their intrinsic
properties. Evolution towards these minimum energy points can be slow or fast,
depending on internal and external factors.
These minimum energy points of the system are called thermodynamic equilib-
rium states. These states usually are asymptotic states. Hence, they are static. Callen
(1985) provides a good description of thermodynamic equilibrium from the atomic
point of view, “the macroscopic thermodynamic equilibrium state is associated with
incessant and rapid transitions among all the atomic states consistent with the given
boundary conditions. If the transition mechanism among the atomic states is suffi-
ciently effective, the system passes rapidly through all representative atomic states in
the course of a macroscopic observation; such a system is in thermodynamic
equilibrium. In actuality, few systems are in absolute and true thermodynamic
equilibrium. In absolute thermodynamic equilibrium, from the atomic point of
view, all radioactive materials would have decayed completely and nuclear reactions
would have transmuted all nuclei to the most stable of isotopes. Such processes
would take cosmic times to complete, and generally can be ignored. A system that
has completed the relevant [significant] processes of spontaneous evolution and that
can be described by a reasonably small number of parameters can be considered to
be in metastable thermodynamic equilibrium. Such a limited thermodynamic equi-
librium is sufficient for the application of [laws of] thermodynamics.” This is the
definition of thermodynamics equilibrium we use in the rest of the book.
From a practical point of view, a system is in equilibrium if it satisfies the laws of
thermodynamics. However, this justification is circular. That is to say if a system
satisfies the laws of thermodynamics it is said to be in equilibrium.
A succinct definition of thermodynamics with an example is given by Callen
(1985) “Thermodynamics is the determination of the equilibrium state that eventu-
ally results in a closed system” (Fig. 3.1).
Let us assume we have a container separated into two sections with a moveable rigid wall.
The container wall is assumed to be impermeable to matter, and adiabatic (heat does not
enter or leave the container). This container is the definition of a closed system in thermo-
dynamics. Initially, the separation wall is fixed. If we release the separation wall, it will move
to a new location due to the gradient of pressure on each side of the wall.
3.2 First Law of Thermodynamics 93
Then if we remove the adiabatic coating from the separation wall, the heat can freely flow
between two sections of the container. Now we drill holes in the separation wall, the matter
can flow freely between two sections depending on the concentration gradient. Every time a
constraint is removed, a spontaneous process will take place that will result in a new
equilibrium state. Initial values of pressure, volume, temperature, chemical concentration,
and all other parameters in each chamber will take on new thermodynamic equilibrium
values. The primary problem in thermodynamics is the computation of these parameters at
the thermodynamic equilibrium.
It is important to point out again, what we mean by a closed system. Because our
thermodynamic equilibrium is defined for a closed system. According to thermodynamics,
a system is considered to be closed if it cannot exchange any energy, matter, or anything
whatsoever with its surrounding. This assumption does not prevent us from solving any
engineering problem with fluctuating loads or masses. Because the problem is always solved
in incremental time steps. At each step, equilibrium must be satisfied (Callen, 1985).
This law of thermodynamics states that total energy is conserved in every process.
Therefore, it is sometimes referred to as the law of conservation of energy. The law is
independent of the path taken between the initial and final states of the system.
This law is generalized from experimental observations. Therefore, it is an
empirical conclusion. While mathematically it may be difficult to prove this law
universally, we assume its validity because there is no way to disprove it mathemat-
ically or experimentally [at least until now]. Now, we can formulate the first law of
thermodynamics for application in continuum mechanics. The conservation of
energy can be written as
d
ðu þ kÞ = W input þ Qinput ð3:1Þ
dt
94 3 Thermodynamics
where u is the internal potential energy, k is macro-level kinetic energy, Winput is the
work done by the system, due to external loads, and Qinput is energy added to the
system by heat transfer. Both Winput and Qinput do not have exact differentials.
Therefore,
I
W input dt ≠ 0 ð3:2Þ
and
I
Qinput dt ≠ 0 ð3:3Þ
However
I
W input þ Qinput dt = 0: ð3:4Þ
The work done on the system by external (forces) surface tractions and body forces
can be given by
Z Z
W input = σ ðnÞ ~v dA þ ρb ~v dV ð3:6Þ
A V
where σ (n) is the external surface traction per unit area, ~v is the velocity field in the
body, and b is the per unit mass body forces, and ρ is the mass density. Concentrated
point loads are also represented by surface tractions. We can write this equation in
indicial notation as
Z Z
ðnÞ
W input = σ j ~vj dA þ ρ bj~vj dV ð3:7Þ
A V
3.2 First Law of Thermodynamics 95
ðnÞ
The relation between external surface traction σ j and internal stresses σ ij are
given by
ðnÞ
σ j = σ ij ni ð3:8Þ
d~v
vj ρ dtj can be written as
d~vj d 1
~vj ρ = ρ~v ~v ð3:13Þ
dt dt 2 j j
Hence,
Z h i Z
d 1 ∂~vj
W input = ρ~v ~v dV þ σ ij dV ð3:14Þ
dt 2 j j ∂xi
V V
∂v
where ∂xji is the spatial gradient of velocity.
96 3 Thermodynamics
∂~vi
=L=D þ W ð3:15Þ
∂xj
where D is the rate of deformation tensor and W is the rate of spin tensor.
We have shown that for non-polar cases the stress matrix σ ij is symmetric; on the
other hand, the rate of spin tensor Wij is skew-symmetric. Therefore, we can write the
following relations:
The first term represents the kinetic energy, and the second term represents the
internal (strain) energy of the system. Here we should point out that this equation
is based on Cauchy’s equilibrium equation, which is based on Newtonian
mechanics laws.
The heat input, Q, has two parts. One is heat conduction through the surface coming
from outside, and the second part is the distributed internal heat generation with a
source strength of r per unit mass.
Internal heat generation is due to internal scattering, friction, or chemical reac-
tions. These two components of the heat travel in opposite directions. One is coming
from outside through the surface, and the second part is directly generated by the
material
Z Z
Qinput = q ndA - ρrdV ð3:18Þ
A V
where q is the heat flux vector, and n is the surface normal vector. The negative sign
due to internal heat generation is in the opposite direction of the heat coming from
outside, and it is outward. More importantly, r is generated internally by the system
due to atomic level scattering and internal friction.
3.2 First Law of Thermodynamics 97
We can substitute all these terms in the first law of thermodynamics equation. For
the general case, the total energy of the system will be the summation of the kinetic
energy and internal potential energy. Kinetic energy here is associated with the
macroscopic level velocity of the matter. It does not refer to the kinetic energy of the
atoms. That is included in the internal potential energy, which refers to many forms
of energy stored in the lattice.
For arbitrary values of Winput and Qinput we can write the following relations:
where u is the internal strain energy per unit mass. We can write the first law of
thermodynamics by remembering that Qinput is negative when heat enters the system
from outside and Winput is positive when the work is done by the system.
dK
Since d U_ = Qinput þ W input - ð3:21Þ
dt
2 3
Z Z Z
d 6 7
ρudV = 4 - q ndA þ ρrdV 5
dt
V A V
2 3
Z Z Z
6d 1 7 d 1
þ4 ρ~v ~vdV þ σ : DdV 5 - ρ ~v ~vdV ð3:22Þ
dt 2 dt 2
V V V
Kinetic energy terms cancel each other. Using the divergence theorem, we can
transform surface integral to volume integral:
Z Z
q ndA = ∇ qdV ð3:23Þ
A V
We finally collect all terms on one side and set it to equal to zero:
Z
du
ρ þ ∇ q - ρr - σ : D dV = 0 ð3:24Þ
dt
V
98 3 Thermodynamics
where u is the internal potential energy per unit mass. The left side of the equation is
the rate of increase in the internal potential energy. The right side of the equation
represents the input power that is converted into internal work as mechanical work
and as heat generation, however, excluding the kinetic energy at the macro level. The
last term is negative because it represents heat inflow per unit volume through the
boundaries.
We should point out that this equation is a very simple form of the first law of
thermodynamics because it assumes that only Winput is due to surface traction
(distributed loads). If we have distributed fields such as electro-magnetic and
chemical loads, they must be included in this equation. These load cases will be
covered in Chap. 8. Moreover, the first law of thermodynamics given above does not
include energy loss terms, because mechanical work terms are defined according to
Newtonian mechanics. In the next chapter, the same equation will be derived for
unified mechanics theory where energy loss will be included.
There are several ways to define the second law; however, we believe Kelvin-
Planck’s statement is the most concise and classical description. According to this
statement, “it is impossible to construct an engine that will produce no other effect
than the extraction of heat from a single heat reservoir and the performance of an
equivalent amount of work.” In other words, we can state that it is not possible to
construct an engine that has an efficiency of 100%, meaning that the input energy
and output energy for the intended work cannot be equal. There will always be
energy loss for unintended work. This energy loss will only occur in the positive
direction, meaning that we can only dissipate energy, but cannot gain energy in a
closed system. This law of nature is expressed mathematically as an inequality
stating that the internal entropy production is always nonnegative and is positive
for an irreversible process. This inequality is called Clausius–Duhem inequality.
Over simplified definition of entropy is that entropy quantifies how much energy
is unavailable for work. Another way of stating the second law of thermodynamics is
that the entropy of all-natural processes increases. The concept of entropy will allow
us to tie probability and statistics to continuum mechanics.
In the first law of thermodynamics, we saw interconvertibility of energy between
different forms such as heat and mechanical work. In Newtonian mechanics, kinetic
energy and potential energy may be converted from one to other with no energy loss
[Fdt = d(mv)]. The transformation can proceed in either direction, like a pendulum
3.3 Second Law of Thermodynamics 99
3.3.1 Entropy
The term “entropy” was first used by Clausius (1850). The second law of thermo-
dynamics is related to a variable called entropy, s, which is a mathematical quantity.
Therefore, entropy allows us to quantify the second law of thermodynamics. While
the concept of entropy is not easily understood from a continuum mechanics point of
view, it can be explained by utilizing molecular behavior.
First, we should clarify that the second law of thermodynamics cannot be proven
mathematically, but it is a generalization from observations (inductive reasoning).
There are several simplified descriptions of entropy while they are not mathematical;
however, they help in explaining an abstract quantity (Fig. 3.3).
Assume we have a ball at point A. It has a potential energy of mgh. Once the ball
is slightly pushed, it rolls down and travels along the flat surface. At the point, B the
ball will have no potential energy but just kinetic energy. Finally, at some point C the
ball will come to a stop. It will have zero energy.
As the ball is traveling from point A to point C, the amount of energy available for
work continually decreases. Hence, the amount of energy unavailable for work
(entropy) increases. This latter one is called entropy. In other words, “production
of entropy is associated with dissipation (or consumption) of the capacity for
spontaneous change when any process occurs” (DeHoff, 1993). The entropy of the
closed system increases or remains constant in all processes. Therefore, entropy
always increases. Energy and entropy are governed by different laws. Energy can
enter or exit a body but it cannot be created inside a body. However, entropy can be
created inside the body.
Callen (1985) formulates the interpretation of entropy in a set of postulates
depending upon a posteriori justification. These postulates are based on the basic
principle that the minimization of energy function happens at equilibrium. In the
following section, we will quote these postulates as defined by Callen (1985).
Postulate I
There exist particular states, called equilibrium states, of simple systems that mac-
roscopically are characterized completely by the internal energy and mole numbers
of the chemical components.
Postulate II
There exists a function, called the entropy (s), of the extensive parameters of any
composite system, defined for all equilibrium states and having the following
property. The values assumed by the extensive parameters in the absence of an
internal constraint are those that maximize the entropy over the manifold of
constrained equilibrium states.
An extensive parameter in a homogeneous system is proportional to the total mass
of the system.
An intensive variable is defined as a variable that has the same value everywhere
in a homogeneous system. The densities of extensive variables (e.g. u,s,~v ) are
intensive variables.
This postulate assumes the existence of a function called entropy only for the
thermodynamic equilibrium states. In the absence of constraints, the system is free to
select anyone number of states; however, the state of maximum entropy is always
selected by the system.
As stated earlier, the basic problem of thermodynamics is “The single,
all-encompassing problem of thermodynamic is the determination of the equilibrium
state that eventually results after the removal of internal constraints in a closed
composite system” Callen (1985). This basic problem can be solved if the entropy
of the system is known as a function of the intrinsic parameters of the system. The
relation that gives the entropy as a function of the system parameters is known as a
thermodynamic fundamental relation. It, therefore, follows that if the fundamental
relation of a particular system is known, all conceivable thermodynamic information
about the system can be obtained from it. Essentially if the fundamental relation of a
system is known, every thermodynamic attribute is completely and precisely deter-
mined.” Callen (1985) did an outstanding job in establishing the postulator basis for
thermodynamics.
Postulate III
The entropy of a composite system is additive over the constituent subsystems. The
entropy is continuous and differentiable and is a monotonically increasing function
of the energy.
The additive property of entropy is just a summation of the entropies of the
constituent subsystems.
Xn Xn
s= s =
i=1 i
s ðU i ,
i=1 i
V i, Ni, . . . Nr Þ ð3:26Þ
the extensive parameters are multiplied by a constant, say λ, the entropy is multiplied
by the same constant.
Postulate II implies that the partial derivative of entropy with respect to internal
energy is a positive quantity.
∂s
>0 ð3:28Þ
∂U V,N i
U = U ðs, V, N 1 , . . . N r Þ ð3:29Þ
where V is the volume and N1, . . .Nr are the entire system parameters.
Postulate IV
At zero Kelvin temperature, the entropy of any system vanishes in the state for which
∂U
=0 ð3:30Þ
∂s
Essentially this postulate states that when atoms have no vibrational energy, it is
not possible to generate entropy. Therefore, at zero Kelvin temperature systems have
zero entropy generation.
Postulate IV is an extension of the Nernst postulate or also known as the third law
of thermodynamics.
These four postulates are the rational basis for the development of thermodynam-
ics. Based on these postulates, we can solve any thermodynamics problem. Since
everything organic or inorganic is a thermodynamics system, these postulates are
applicable. Using these postulates, we can solve any problem that we can derive the
thermodynamic fundamental relation, in the following procedure.
Let us assume the thermodynamic fundamental equations governing each of the
constituent systems (mechanisms) are known. These fundamental governing equa-
3.3 Second Law of Thermodynamics 103
Carnot in 1824 defined entropy for a hypothetical reversible engine, where he shows
the mechanical equivalence of heat energy. Entropy is a state variable defined by
dQ
ds = - ðfor a reversible processÞ ð3:31Þ
T
We should point out that neither heat energy Q nor temperature T is a perfect
differential of any function. However, ds = dQ T is a perfect differential of a function.
The fact that entropy is a state function can easily be proven for an ideal gas. Entropy
will return to its initial value whenever the temperature returns to its initial value, for
any reversible process. Of course, a reversible process does not exist in real life. It is
just an imaginary idealization created for expedience in the formulation.
104 3 Thermodynamics
U = U ðS, V, N 1 , . . . . . . . . . N r Þ ð3:33Þ
The formal thermodynamic definition of temperature (at the macro level) agrees
with the intuitive definition given by (∂U
∂S
= T). We should point out that this is the
macro definition of temperature since at the atomic level temperature is defined by
the amplitude of intensity of vibrational energy of atoms. In addition, the definition
of pressure given above agrees with the mechanics’ definition of pressure. [-PdV] is
called quasi-static work.
dW m = - PdV ð3:39Þ
The term quasi-static here used is in contrast to a dynamic load where a load is
applied very fast; as a result, inertia forces are not negligible.
It is necessary to discuss the negative sign in quasi-static work. The quasi-static
work is assumed positive if it increases the total energy of the system.
3.3 Second Law of Thermodynamics 105
dW M = - Pð - dV Þ = þ PdV ð3:40Þ
Heat flux can be defined quantitatively with the help of quasi-static work. For a
quasi-static process at a constant number of moles, the heat energy dQ can be defined
by
dQ = dU - dW m ð3:41aÞ
or
dQ = dU þ PdV ð3:41bÞ
In thermodynamics terms, heat and heat flux are used interchangeably. In ther-
modynamics terminology “heat” like “work” is only a form of energy. It is assumed
that once the energy is transferred to a system either in the form of heat or as
mechanical work, it is indistinguishable from any energy that may have been
transferred directly to the system. It is important to remind that Joule is the unit
for energy and work. Therefore, there cannot be a distinction between variables with
the same units.
The total energy U of a state is not just a sum of work (dW) and heat (dQ).
Because dWM and dQ are imperfect differentials, which means that the differential is
path-dependent. [Exact differential is path independent.]
The integrals of dWM and dQ is the work and heat for that particular path of the
process. Their sum is the total energy difference ΔU. However, ΔU is independent
of the path taken.
We should point out that when the fundamental equation is written as U = U(S,
V, N1, . . .Nr) the total energy U is a dependent variable, and entropy S is an
independent variable. This is the fundamental equation unified mechanics theory is
based on. Since entropy is an independent variable, it can only be defined in space on
its axis. The relation U = U(S, V, N1, . . .Nr) is also named the energetic fundamental
relation. However, S = S(U, V, N1, . . .Nr) is said to be entropic fundamental relation,
which is referred to as thermodynamic fundamental relation in this book.
The remaining terms in dU equation represent an increase of internal energy
associated with the addition of matter to a system. These terms are called quasi-static
electro-chemical work. The term matter here does not exclude electron flow as a
matter due to an electrical bias. The quasi-static electro-chemical work is given by
Xr
dW c = μN
i=1 i i
ð3:42Þ
dU = dQ þ dW M þ dW C ð3:43Þ
Following the same differentiation process and using the same definitions we
used earlier in this section for the energy equilibrium equation and differentiating
with respect to λ yields
∂U ∂ðλSÞ ∂U ∂λX i
þ þ ⋯ = U ðS, X 1 , . . . X r Þ ð3:45Þ
∂ðλSÞ ∂λ ∂ðλX i Þ ∂λ
Assuming λ = 1. Then
∂U Xt ∂U
Sþ X =U
i = 1 ∂X i i
ð3:46Þ
∂S
Xr
U = TS þ PX
i=1 i i
ð3:47Þ
U = TS - PV þ μ1 N 1 þ ⋯ þ μr N r ð3:48Þ
The last two equations are called Euler relations. A differential form of the
relation among the intensive parameters can be obtained directly from the Euler
relation and is known as the Gibbs–Duhem relation.
Taking differentiation of Euler relation given by Eq. (3.47) leads to
Xr Xr
dU = TdS þ SdT þ P dX i þ
i=1 i i=1
X i dPi ð3:49Þ
We should note that in the derivation of the last equation, we separated quasi-
static mechanical work and quasi-static electro-chemical work. Here they are both
represented in one term:
3.3 Second Law of Thermodynamics 107
dU = dQ þ dW M þ dW C ð3:51Þ
Xr
P dX i
i=1 i
ð3:52Þ
The Gibbs–Duhem relation presents the relationship among the intensive param-
eters in differential form. The number of linearly independent intensive parameters
that have independent variations is called the number of thermodynamic degrees of
freedom of any system, which are mapped onto the thermodynamic state index axis
in the unified mechanics theory, which is discussed in the next chapter.
At this point, it is necessary to restate the thermodynamic formalism in energy
representation. This fundamental equation is
U = U ðS, V, N 1 , . . . N r Þ ð3:55Þ
T = T ðS, V, N 1 , . . . N r Þ
P = PðS, V, N 1 , . . . N r Þ
ð3:56Þ
μ1 = μ1 ðS, V, N 1 , . . . N r Þ
μr = μr ðS, V, N 1 , . . . N r Þ
where u is the internal energy density (specific internal energy), s is entropy for unit
volume, P is the thermodynamic pressure, and v is the specific volume of the fluid.
The cantilever beam shown in Fig. 3.4 is subjected to linear elastic static cyclic
loading. Let us assume that the force is small enough to produce a stress level well
below the macro yield stress. A student observing this beam will see that after every
load removal, the beam returns to the original shape. To the observer, this gives the
impression that deformation under elastic loading is a perpetual reversible process.
However, after a certain number of cycles, the cantilever beam will fatigue and fail
regardless of how small the stresses are. It will probably break away from the support
where stresses are maximum. This simple experiment shows that while we observe
macro behavior to be reversible, however, our observation at the macro level is not
accurate. If the linear elastic loading-unloading cycle were reversible, the beam
would never fatigue, and deflections from one cycle to another cycle would not
change in magnitude. In earlier pages, we cited the fatigue failure of Big Ben [clock
in London] after 117 years in service.
According to the second law of thermodynamics, reversible processes are imag-
inary. They cannot happen in the real domain. What happens in the lattice is when
the beam is mechanically loaded, there is strain in the lattice. However, when the
load is removed, atoms do not go back to their original lattice site. They go to a
lattice site with a lower energy level, to minimize their energy level. In the process,
they leave behind a vacancy. As a result, the process is not reversible. Keep in mind
that the thermodynamically reversible process is where the initial and the final states
are the same. If an atom moves to a new lattice site with lower energy, state and is left
behind a vacancy that is a different thermodynamic state than the initial one. There is
a positive entropy generation, which is a quantitative measure of dissipation in the
system. DeHoff (1993) states that “changes in the real world are always accompa-
nied by friction and something [energy] that is dissipated. When the pebble is
dropped into a pond its kinetic energy at impact is dispersed by the wave motion
throughout the pond and the [initial kinetic energy] is dissipated in the absorbing
water of the pond.” In this irreversible process, the energy cannot be recovered. “The
entropy production in a system is a quantitative measure of this energy dissipation
[mechanisms].”
Eddington (1958) referred to entropy as “entropy is time’s arrow,” The total
entropy of any system plus its surroundings always increases with increasing time.
However, the entropy-time relationship is not linear, and each one is linearly
independent. Time can change without a change in entropy, as in at zero Kelvin or
in an undisturbed system. However, time cannot change unless there is a change in
entropy. If we had a process where entropy is destroyed, then the direction of the
time arrow must be reversed for that process to happen. However, this is not possible
in a real system. A reader may question the accuracy of the theory of elasticity where
all mechanisms are assumed to be reversible. Because of the second law of thermo-
dynamics, the theory of elasticity provides an approximate solution to an imaginary
system that cannot exist in reality. In the words of DeHoff (1993) “All real processes
have a finite rate in response to finite influences; such real processes are called
irreversible to emphasize the contrast with imaginary reversible processes. Real
processes are irreversible and suffer dissipations that result in the production of
entropy and thus a permanent (irreversible) change in the system.” Therefore, for an
irreversible process, we can write
Z2
dQ
ΔS = S2 - S1 = >0 ð3:58Þ
T irreversible
1
There are different approaches to deriving this inequality. We will quote the deri-
vation given by Malvern (1969). This inequality is considered a simple quantitative
expression of the second law of thermodynamics. The thermodynamic theory
110 3 Thermodynamics
explains the relationship between energy flowing into a system and the entropy
generation in the system and its surroundings (Fig. 3.5).
Heat input in a system is given by the difference between the heat conduction
coming from outside through the surface A and distributed internal heat generation
r per unit mass:
Z Z
Qinput = - q ndA þ ρrdV ð3:59Þ
A V
We are assuming that the system is closed. If the system is open in a control
surface, [fixed in space], additional entropy input due to mass flux would be
accounted for by
Z
- ρsV ndA ð3:61Þ
A
where s is entropy per unit mass. According to the second law of thermodynamics,
entropy increase rate in the system ≥ entropy input rate
Z Z Z
d q r
ρsdV ≥ - ndA þ ρdV ð3:62Þ
dt T T
V A V
This is the integral form of the Clausius–Duhem inequality. This equation means
that in an irreversible process internal entropy production is always greater than
input. In this equation, outward normal is a positive sign. An equal sign ensures that
the equation holds for the reversible [imaginary] process.
We can transform the surface integral into a volume integral. Since volume can be
an arbitrary value, we can write the local version of the Clausius–Duhem inequality
that must be satisfied at each point in any given volume as follows:
ds r 1 q
≥ - div ð3:63Þ
dt T ρ T
or
ds r 1 q
γ - þ divq - 2 gradT ≥ 0 ð3:64Þ
dt T ρT ρT
ds r 1 1
- þ divq ≥ 0 - q gradT ≥ 0 ð3:65Þ
dt T ρT ρT 2
where the first inequality represents the local entropy production by the system and
the second inequality represents entropy production by heat conduction.
The Truesdell and Noll (1985) interpretation of the second law of thermodynam-
ics is important for the unified mechanics theory because it is assumed that disorder
in the system increases only due to internal entropy production.
Entropy production by heat conduction in a free (unconstrained) system, Fig. 3.6,
cannot lead to irreversible disorder in the system because the system travels between
stress-free equilibrium states. Of course, the entropy of the entire surrounding
including the system increases. The interatomic equilibrium distance in a free
(unconstrained) system is a function of temperature; therefore due to the heat
conduction temperature of the system increase (atoms have higher vibrational
energy) the equilibrium inter-atomic distance increases. However, there will be no
strain (or stress) exerted on the atoms. Therefore, internal work will be zero; as a
result, internal entropy production will be zero. If the atoms move from one thermal
equilibrium to another thermal equilibrium, it is assumed that there will be no
internal entropy generation. Of course, this assumption is not 100% true. When
there is heat transport in a lattice, there is always scattering with phonons. We are
assuming this scattering-induced internal entropy generation is so small that we can
ignore it.
When there are no external constraints blocking atoms moving freely at each
temperature, they have a stress-free equilibrium interatomic separation distance r0.
When the interatomic distance is r0, attraction/repulsion atomic bond energy is at an
extremum (maximum), and the force acting on the atoms is zero (Fig. 3.7). There-
fore, internal work is not possible, because internal work and internal entropy
production are proportional to the dislocation of an atom from the equilibrium lattice
site and force acting on the atom. Of course, when a system is at equilibrium and the
force acting on the atom is zero, the work done must be zero. However, this
argument assumes that the system is free to move and boundary conditions do not
impose any constraint on the system that prevents it from moving freely. If the
cantilever shown in Fig. 3.6 is fixed on both sides, of course, there will be internal
work and internal entropy generation.
Clausius–Duhem inequality is just a quantitative statement of the second law of
thermodynamics; hence, it must be satisfied by every process in organic or inorganic
systems. The coordinate system assumed in the derivation of the formulation in
previous pages is arbitrary; however the positive definite property of the inequality is
not arbitrary. It must be satisfying for all and any coordinate system.
The unified mechanics theory uses entropy as a linearly independent state variable.
Furthermore, entropy (thermodynamics state index) defines a new additional 5th
linearly independent axis in addition to Newtonian space-time axes. This is needed
to be able to define the location of a system in a space-time-thermodynamic state
index coordinate system. This is needed to be able to define the coordinates of a
system in a space-time-thermodynamic state index coordinate system. While there
are similarities between the traditional interpretation of entropy in Newtonian
(continuum) mechanics and unified mechanics, there are also major differences. In
Newtonian (continuum) mechanics, the derivative of displacement with respect to
3.3 Second Law of Thermodynamics 113
entropy is taken as zero. In the unified mechanics theory, derivatives with respect to
entropy are not zero. When derivatives with respect to entropy are taken as zero,
unified mechanics theory collapses to Newtonian mechanics. We assume that the
derivative of time with respect to entropy is small enough to be considered zero for
most engineering problems.
The traditional use of entropy in Newtonian (continuum) Mechanics is considered
extensively by Malvern (1969), Coleman (1964), and Coleman and Mizel (1964,
1967). Here we summarize their work. “In Newtonian continuum mechanics, the use
of the Clausius-Duhem inequality differs from the usual second law of thermody-
namics, where the entropy is a state function determined by the instantaneous values
of the other state variables. In Newtonian continuum mechanics, entropy is not
explicitly postulated to be a state function. As a result, in Newtonian continuum
mechanics entropy maximization (and minimization of entropy generation rate) does
not affect system state variables or system properties. In simple terms, in Newtonian
continuum mechanics, dissipation and degradation of the system are not directly
included in the laws of Newton, but as empirical test data curve-fitting constraints.
Traditionally many researchers and authors of thermodynamics have postulated that
entropy is a state function of all the state variables including possibly some variables
that are not observable by macro-scale specimen testing.”
We must clarify what we mean by “state variables.” For a simple ideal gas,
pressure, temperature, and volume are thermodynamic variables. However, the
internal energy and the entropy are state variables (also called state functions).
State variables are expressed employing thermodynamics variables. For a variable
to be classified as a state variable (also called state function), it must have an exact
differential, based on a vanishing integral in an arbitrary cycle. This definition
assumes that the material is capable of going through an arbitrary cycle and being
restored to its initial state. As an example, we can plastically deform a steel beam,
and then melt it or anneal it and return it to its original state. Therefore, the total
energy of the system is a state variable because it has an exact differential:
I
Pinput þ Qinput dt = 0 ð3:66Þ
s = f ðu, F Þ ð3:67Þ
u = uðs, V i , . . . , V n Þ ð3:68Þ
s = sðu, V 1 , . . . , V n Þ ð3:69Þ
Order and disorder are relative terms, meaning that they are defined with respect to
an initial reference state. For example, in Fig. 3.8a, the initial location of each piece,
such as A7, can be considered in the ordered state. In general, the initial or strain-free
state of a system is considered as ordered. Of course, this does not mean that the
strain-free state has no disorder at the atomic scale. It is just a benchmark [reference]
state. Any change from the initial reference state is an increase in disorder. For a
quantifiable disorder to happen in a system, there must be some inhomogeneity. For
example, if we have a box (Fig 3.8b) with four identical baseballs, regardless of how
much we shake the box, the final “disordered” state will be identical to the initially
ordered state. As a result, there is no way to quantify a disorder.
There are 24 distinct configurations possible for these balls in the container.
However, the positions the balls could take are not observable at the macroscale,
because the balls are identical. Fortunately, most systems and materials at the micro
aggregate scale are not made up of identical crystals. Now assume that we mark the
balls as A, B, C, D (Fig. 3.8c), and initially we place them in the box in alphabetical
order. We assume that the initial alphabetical order configuration is a reference
“ordered” state. In this case, after we shake the box among 4 ! = 24 possible
configurations, the initially ordered state is only possible 1/24 of the time. That
means 23/24 times it will be a disordered state. If we have 26 baseballs marked A to
Z, the possibility of getting the ordered state is one in millions. All organic and
inorganic systems are made up of a large number of atoms (or molecules). When
subjected to any external load (disturbance), they move from their initial ordered
3.3 Second Law of Thermodynamics 115
ln ða bÞ = ln a þ ln b ð3:70Þ
s = k ln w ð3:71Þ
is used to define temperature on the Kelvin scale and to ensure consistency of units
on both sides of the equation. R = 8:31 molK
Joule
is the gas constant; NA = 6.022 × 1023 mol-
ecules/mol is the Avogadros number. Equation (3.71) is considered the basis for
statistical mechanics. Callen (1985) refers to Eq. (3.71) as a postulate that is dramatic
in its brevity, simplicity, and completeness.
According to Boltzmann, the equation calculating the natural logarithm of the
number of microstates available to the system and multiplying with a constant
k yields the entropy. However, the opposite is also true to calculate the number of
microstates. In Unified Mechanics Theory, this second approach is used, since
entropy is a function of internal state variables and all active mechanisms; it can
directly be calculated because entropy is the most general caloric equation [funda-
mental equation] of state. Callen (1985) refers to the Boltzmann equation as the
statistical mechanics in the microcanonical ensemble.
Entropy calculations in continuum mechanics are not readily available for all
mechanisms. Of course, very often we do not know what these mechanisms are. This
is expected to be an active research field soon. Entropy in thermodynamics and
statistical physics are the same thing. Statistical mechanics interpretation of entropy
where natural progress towards more disorder establishes a concrete and understand-
able concept of entropy. However, in statistical mechanics to be able to calculate the
3.3 Second Law of Thermodynamics 117
entropy all active mechanisms and processes taking place in the material must be
accounted for. Then entropy generation for each process must be calculated. Unfor-
tunately, this is a very primitive field in the mechanics of materials, because all
material constitutive models at continuum scale are based on empirical functions
obtained by testing of materials. There are no explicit formulas for entropy gener-
ation due to thermo-mechanical-electrical-chemical-radiation loads. This is a wide-
open research topic and a very challenging one, to say the least, because the material
modeling field has always been based on empirical observations. For example, we
crash a concrete cylinder, and model it with macro measurement variables, such as
stress, strain, and curve fitting. However, macroscale testing does not give us much
information about the actual micro-mechanisms that are responsible for degradation
and final failure. These are the micro-mechanisms that we need to know in detail to
compute entropy generation. It is always much easier to curve fit to a test data and
use it than to understand the actual mechanisms and processes leading to failure.
Theoretical physicist and mathematician Freeman Dyson in Scientific American
September (1954) suggested that heat is disordered energy. The author gave an
example of a flying rifle bullet. The bullet has kinetic energy but no disorder. After
the bullet hits a steel plate target, its kinetic energy is transferred to random motions
of the atoms in the plate and bullet. This disordered energy makes itself felt in the
form of heat. However, the author ignores other mechanisms, because heat is not the
only product of the initial kinetic energy. There are plastic deformations, melting,
phase change, thermomigration, and other entropy generation mechanisms that
could result from an impact.
Freeman Dyson (1954) also adds that the quantity of disorder is measured in
terms of the mathematical concept called entropy; of course, entropy is a function of
energy. Therefore, we can compute disorder in terms of energy using Boltzmann’s
equation. It is important to point out that disorder is the perfect way to express the
degradation of all organic and inorganic systems. Because as the entropy increases
the amount of disorder from the initial “ordered,” reference state increases. The
degradation of the material is nothing but another disordered state. Realignment of
atoms in any system under loading just takes one of the possible disordered states. Of
course, because of a very large number of possible disordered states, the system
cannot get into every possible disordered configuration before entropy becomes
maximum, because of the closed-isolated system requirement. This is easier to
explain with an example. Say we have a box with 26 baseballs marked A to Z.
Initially they are all in alphabetical order. Say the box is 10 kg. We are allowed to
spend 100 Joule energy to shake the box. Assume we have probably 10 chances to
shake the box. Of course, for our closed system when 100 joules are consumed, the
maximum entropy is reached. However, we could not discover all possible disorder
states that 26 baseballs could take. In the same fashion when a material fails in a
fatigue loading or monotonic loading, the failure is always through a different path.
Yet the maximum entropy, the value to reach the failure is always the same.
Experimental verifications have been provided by Naderi et al. (2009), Yun and
Modarres (2019), and Tu and Gusak (2019). Failure or increase in disorder in a
system can be considered travel over an energy terrain. A soccer ball sitting in a
118 3 Thermodynamics
valley after a kick will travel. However, there are many paths it can take. Each one of
them is possible. Each one represents a disorder path. However, the ball will finally
come to stop at a valley. We can write the fundamental relation. Of course, we have
to know what path the ball will take to use the properties relevant to that path. Then
we can exactly predict in which valley the ball will stop (Fig. 3.9).
Any physical phenomenon can be defined with the appropriate choice of ther-
modynamic variables. Any physical process defined by thermodynamics variables is
admissible if it satisfies all laws of thermodynamics.
It is essential to point out that when degradation is introduced into Newton’s laws
of motion, Lemaitre and Chaboche (1990) opined that there is no objective way of
choosing the internal variables best suited to study a phenomenon. This is an opinion
in which the choice is dictated by empirical experience (laboratory observations),
“physical feeling,” and very often by the type of application. This approach is the
only option when Newtonian mechanics is used for Kachanov-type empirical dam-
age mechanics models. However, in the unified mechanics theory, all thermody-
namics variables must be chosen objectively to be able to assemble the fundamental
equation based on physics and mathematics. They must represent all active micro-
mechanisms responsible for internal entropy production. In Lemaitre and
Chaboche’s (1990) approach, which is based on Newtonian Mechanics, thermody-
namics variables are up to the scientist/engineer to decide so that experimental data
can be fit into the same curve. In unified mechanics theory, all micro-mechanisms
that are responsible for entropy generation must be included, and they must be
physically and mathematically formulated in the fundamental equation without a
curve fitting a function to a test data.
Observable Thermodynamics Variables
1. Temperature
2. Space coordinates
While some authors define total strain or stress as observable variables, we do not
subscribe to this school of thought, because strain and stress are human constructs,
and definitions, not physical quantities. We can only measure displacement and then
calculate strain and stress. If something is calculated, it is not a directly observable
variable.
Independent Internal Thermodynamic Variables
1. Entropy
2. Internal energy
3. Crystal structure (phase)
Formulation of Thermodynamic Potential
Postulate: There exists a thermodynamic potential function defined by thermody-
namic variables. The thermodynamic state laws are derived from this potential.
Thermodynamic potential must be concave [f ′′(X) < 0] with respect to temperature
and convex [f ′′(X) > 0] with respect to other variables. These convex and concave
requirements ensure the thermodynamic stability requirement imposed by the
Clausius–Duhem inequality. Malvern (1969) proposes the following thermody-
namic potentials given in Table 3.1.
The Helmholtz free specific energy potential Ψ is the portion of the internal
energy that is available for doing work at a constant temperature. In continuum
120 3 Thermodynamics
Ψ = u - sT ð3:73Þ
Enthalpy, h, is the portion of the internal energy, u, that can be released as heat
when the thermodynamic tensions are held constant. Enthalpy can also be given by
h = u - τ i vi ð3:74Þ
Assuming state variables defined by the subscripts are held constant, the follow-
ing relationships can be written:
∂u
T= ð3:80Þ
∂s vi
∂u
τi = ð3:81Þ
∂vi s
∂Ψ
s= - ð3:82Þ
∂T vi
3.3 Second Law of Thermodynamics 121
∂Ψ
τi = ð3:83Þ
∂vi T
∂h
T= ð3:84Þ
∂s τi
∂h
vi = - ð3:85Þ
∂τi s
∂g
s= - ð3:86Þ
∂T τi
∂g
vi = - ð3:87Þ
∂τi T
Then we can write the following relation based on the deformation gradient tensor
which is a summation of elastic and plastic parts:
∂Ψ ∂Ψ ∂Ψ
= - ð3:89Þ
∂De ∂D ∂Dp
122 3 Thermodynamics
_ = ∂Ψ : De þ ∂Ψ T_
Ψ ð3:90Þ
∂De ∂T
ds r 1 q gradT
- þ divq - : 2 ≥ 0 ð3:91Þ
dt T ρT ρ T
Since
q divq q gradT
div = - ð3:93Þ
T T T2
Ψ = u - Ts ð3:95Þ
dΨ du ds dT
= -T -s ð3:96Þ
dt dt dt dt
We can write
dΨ dT ds du
- þs =T - ð3:97Þ
dt dt dt dt
∂Ψ dT gradT
σ : D-ρ þs -q ≥0 ð3:98Þ
dt dt T
We should point out that in the conservation of energy time derivative of internal
energy is multiplied by density (ρ), which is dropped in this derivative for consis-
tency of units. Now we can substitute dΨ
dt in the fundamental inequality. Meanwhile,
we can split the deformation–gradient tensor, D, into elastic and plastic components:
D = De þ D p ð3:99Þ
De þ DP = ε_ e þ ε_ p ð3:102Þ
∂Ψ
sþ =0 ð3:103Þ
∂T
However, Eq. (3.103) assumes that τidvi= 0, which is not true. Because entropy
generation happens due to many mechanisms, not just temperature change. More-
over, the reversible process is imaginary; it cannot happen in real life, without
violating the second law of thermodynamics. There is always entropy generation.
Therefore, thermo-elastic laws are defined as an imaginary process where τidvi=
0 ε_ p = 0,grad T = 0 and T is arbitrary. Then the following relations define thermo-
elastic laws in Newtonian mechanics.
∂Ψ ∂Ψ
σ =ρ and s = - ð3:104Þ
∂εe ∂T
The stress-Helmholtz free energy relation in Eq. (3.104) assumes that stress is
independent of change in entropy, of course, which is not true.
124 3 Thermodynamics
Thermodynamic Forces
Thermodynamic potential Ψ(D, T, Vi, . . .Vr) is a function of thermodynamic state
variables. In constitutive modeling, the concept of thermodynamic force is very
convenient for material modeling. It is defined by
∂Ψ
Ai = ρ ð3:105Þ
∂V i
Ψ = Ψðε, s, T, V k Þ ð3:106Þ
_ V_k , q=T
φ e, ð3:107Þ
The state variable Vk is used to define any internal state variable. Dissipation
potential is assumed to have a zero value at the origin of the state variables.
However, this is not a mathematical requirement. It is possible to define a dissipation
function with a non-zero value at the origin. The complementary relationships
between internal state variables and associated variables can be defined by
employing dissipation potential. The normality rule requires that the first derivative
of the dissipation potential with respect to state variables is normal to the potential
surface and directed outwards:
∂φ ∂φ → ∂φ
σ= Ak = - g=- ð3:108Þ
∂_E ∂Vk ∂ðq=T Þ
3.3 Second Law of Thermodynamics 125
Table 3.2 Dissipation flux The flux of state variables Dual variables
variables and dual variables
e_ σ
(Lemaitre and Chaboche
1990) -Vk Ak
-
→
q g = grad T
T
∂φ
e_ p = ð3:109Þ
∂σ
∂φ
- V_k = ð3:110Þ
∂Ak
q ∂φ
- = ð3:111Þ
T ∂g
The potentials φ and φ must be non-negative, convex, and zero at the origin to
satisfy Clausius–Duhem inequality. According to Lemaitre and Chaboche (1990),
the normality rule is sufficient to ensure the satisfaction of the second principle of
126 3 Thermodynamics
∂φ1 ∂φ1
σ : eP - Ak V_ k = σ : þ Ak ≥0 ð3:114aÞ
∂σ ∂Ak
q ∂φ
-g - -g 2 ≥0 ð3:114bÞ
T ∂g
3.3 Second Law of Thermodynamics 127
When we were discussing the laws of thermodynamics, it was stipulated that energy
exchange could not happen instantaneously. However, in constitutive modeling of
materials assuming that the material response is independent of the rate simplifies the
modeling and analysis significantly. Therefore, such a simplification yields the
theory of plasticity. As stated by Lemaitre and Chaboche (1990) when the dissipa-
tion potential φ e_ p , V_k is a positive, homogeneous function of degree one, its dual
function φ(σ, Ak) is non-differentiable.
Convexity of the dissipation potential φ(σ, Ak) is checked with an indicator
function:
When
φ = 0 if f < 0 E_ P = 0 ð3:115aÞ
φ = þ 1 if f = 0 E_ P ≠ 0 ð3:115bÞ
The indicator function f (also referred to as yield criteria) must be convex and
defined by the same dual variables,Ak f(σ, Ak). Based on the normality rule, it is
possible to show that
(
p ∂F _ f_ = 0
e_ = λ if ð3:116Þ
∂σ f =0
f_ = 0:
∂F _ ∂f
e_ P = λ_ =λ ð3:117Þ
∂σ ∂σ
∂F ∂f
- V_k = λ_ = λ_ ð3:118Þ
∂Ak ∂Ak
when the plastic strain increment is not normal to F function (yield surface). Then
the plastic strain is normal to a new function Q, plastic potential. However, this is
called non-associative plasticity, because the increment of the plastic strain is not
normal to the yield function F but to a plastic potential function Q.
128 3 Thermodynamics
∂Q Q=0
e_ P = λ_ if ð3:119Þ
∂σ Q_ = 0, if < 0 → E_ P = 0
Earlier in the chapter, we discussed separating internal entropy production into two
parts, namely, mechanical work dissipation and the dissipation due to heat
conduction.
Specific entropy, s, the equation was derived earlier as
ds 1 q 1 du
þ div - - σ ij Dij þ qi,i ≥ 0 ð3:120Þ
dt ρ T T dt
du
ρ = σ ij Dij þ ρr - qi,i ð3:121Þ
dt
ds 1 r 1
= σ D þ - q ð3:122Þ
dt ρT ij ij T ρT i,i
ds r 1 q
γ - þ q - i gradT ð3:123Þ
dt T ρT i,i ρT 2
1 q
γ= σ D - gradT ð3:124Þ
ρT ij ij ρT 2
Thus, internal entropy generation is separated into two parts; the first part is due to
mechanical work dissipation, and the second part is due to irreversible heat conduc-
tion. The strong form of Clausius–Duhem inequality requires
1
σ D >0 ð3:125Þ
ρT ij ij
3.3 Second Law of Thermodynamics 129
qi
- gradT > 0 ð3:126Þ
ρT 2
From here onward, we will use Malvern’s (1969) description of Onsager recip-
rocal relations, as follows:
“The terms in the internal entropy production are called generalized irreversible
forces X and fluxes J. Selection of force and flux term is arbitrary. However, it is
assumed that the dot product of the generalized irreversible force X and the gener-
alized flux vector J must give the dissipation power, per unit mass”. Applying this
concept to the formulation above, we can assign:
1
Generalized irreversible forces, ρT σ ij and - ρT1 2 gradT:
Generalized fluxes Dij and qi
Hence, we can write
γ = X i ∙ J = X im Jm ð3:127Þ
Constitutive equations give the fluxes as functions of the forces or vice versa.
Hence, we can write a phenomenological relation between generalized irreversible
fluxes and forces as follows:
J m = Lmk X ik ð3:128aÞ
X ik = akm J m ð3:128bÞ
“It is pointed out that in certain cases the Onsager reciprocal relations must be
replaced by amk(b) = akm(-b), or amk = - akm, for example in systems affected by a
magnetic field b” (de Groot 1952; Malvern 1969). The symmetry of the constitutive
matrix may also be affected by other factors such as material anisotropy or length
scale affects. Substitution of the phenomenological constitutive relations into the
entropy production yields two quadratic equations
1
γ= L X i X i ≥ 0 and γ = akm J k J m ≥ 0 ð3:130Þ
T mk m k
Table 3.3 Examples of different thermodynamic forces and corresponding fluxes in some dissi-
pative processes
Thermodynamic force, Thermodynamic
Primary mechanism X flux Examples
Heat conduction Temperature gradient Heat flux Fatigue, creep,
wear
Plastic deformation of Stress Plastic strain Fatigue, creep,
solids wear
Chemical reaction Reaction affinity Reaction rate Corrosion, wear
Mass diffusion Chemical potential Diffusion flux Wear, creep
Electrochemical reaction Electrochemical Corrosion rate Corrosion
potential density
After Imaninan and Modarres (2016)
Q = akm J k J m ð3:131Þ
Based on the above definition, the generalized irreversible force can be given by
the following relation:
1 ∂Q
X ik = ð3:132Þ
2 ∂J k
Ziegler (1963) has shown that these phenomenological relations, Eqs. (3.128a,
3.128b, 3.131) and Onsager relation if they have symmetric dissipation function,
follow from a principle of the maximum rate of entropy production or maximum
dissipation power, which he deduces for “quasi-static processes” by a statistical
mechanics approach modified to include irreversible processes” (Malvern 1969).
Table 3.3 lists some of the thermodynamic forces and fluxes for some dissipation
mechanisms.
Here we conclude our discussion of some fundamental concepts of thermody-
namics as it relates to continuum mechanics. For further understanding of thermo-
dynamics concepts, readers are encouraged to refer to thermodynamics books listed
in the References section of this chapter.
References
Truesdell, C., & Noll, W. (1985). The non-linear field theories of mechanics. Springer.
Truesdell, C. and Toupin, R.A. (1960) The Classical Field Theories. In: Flugge, S., Ed., Handbuck
der Physik, Vol. III/I, Springer-Verlag, Berlin.
Tu, K. N., & Gusak, A. M. (2019). A unified model of mean-time-to-failure for electromigration,
thermomigration, and stress-migration based on entropy production. Journal of Applied Phys-
ics, 126, 075109.
Yun, H., & Modarres, M. (2019). Measures of entropy to characterize fatigue damage in metallic
materials. Entropy, 21(8), 804.
Chapter 4
Unified Mechanics Theory
The term unified is used to describe the unification of universal laws of motion of
Newton and laws of thermodynamics ab initio level. As we discussed in earlier
chapters, Newton’s laws do not account for energy loss or degradation. They only
govern what happens to a system in the initial moment a load is applied to a brand-
new structure. However, the laws of thermodynamics control what happens after the
initial moment over time. Historically, continuum mechanics is based on the laws of
Newton only, and phenomenological test data fit empirical models that are supposed
to satisfy thermodynamics’ laws introduce energy loss, dissipation, and degradation.
In the next section, we will discuss the earlier work on using thermodynamics in
continuum mechanics.
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2022 133
C. Basaran, Introduction to Unified Mechanics Theory with Applications,
https://doi.org/10.1007/978-3-031-18621-9_4
134 4 Unified Mechanics Theory
laws. Quoting from Haddad’s (2017) seminal work on the literature review of
thermodynamics, “In an attempt to generalize classical thermodynamics to
non-equilibrium thermodynamics, Onsager (1931, 1932) developed reciprocity the-
orems for irreversible processes based on the concept of a local equilibrium that can
be described in terms of state variables that are predicated on linear approximations
of thermodynamic equilibrium variables. Onsager’s theorem pertains to the thermo-
dynamics of linear systems, wherein a symmetric reciprocal relation applies between
forces and fluxes.
In particular, the force exerted by the thermal gradient causes a flow or flux of
matter in thermo-diffusion. Conversely, a concentration gradient causes a heat flow,
an effect that has been experimentally verified for linear transport processes involv-
ing thermo-diffusion, thermo-electric, and thermo-magnetic effects. Classical irre-
versible thermodynamics as originally developed by Onsager characterizes the rate
of entropy production of irreversible processes as a sum of the product of fluxes with
their associated forces, postulating a linear relationship between the fluxes and
forces. The thermodynamic fluxes in the Onsager formulation include the effects
of heat conduction, the flow of matter (i.e., diffusion), mechanical dissipation (i.e.,
viscosity), and chemical reactions. This thermodynamic theory, however, is only
correct for near thermodynamic equilibrium processes wherein a local and linear
instantaneous relation between the fluxes and forces holds.”
“Building on Onsager’s classical irreversible thermodynamic theory, Prigogine
(1955, 1961, 1968) developed a thermodynamic theory of dissipative
non-equilibrium structures. This theory involves kinetics describing the behavior
of systems that are far away from thermodynamic equilibrium states. Prigogine’s
thermodynamics lacks the thermodynamic fundamental equation] function of the
system, and hence, his concept of entropy for a system away from [thermodynamic]
equilibrium does not have a total differential. Furthermore, Prigogine’s characteri-
zation of dissipative structures is predicated on a linear expansion of the entropy
function about a particular [thermodynamic] equilibrium, and hence, is limited to the
neighborhood of the [thermodynamic] equilibrium. This is a severe restriction on the
applicability of this theory. In addition, his entropy cannot be calculated nor deter-
mined” (Prigogine & Herman, 1971; Prigogine & Nicolis, 1977; Haddad, 2017).
Prigogine’s work (Prigogine & Defay, 1954; Prigogine, 1955, 1957, 1961;
Prigogine & Herman, 1971; Prigogine & Nicolis, 1977) on dissipative structures
and their role in thermodynamic systems far from equilibrium won him the Nobel
Prize in Chemistry in 1977. In engineering mechanics, most of our states are near the
thermodynamic equilibrium point. Therefore, Prigogine’s work is important to
understand. Quoting from Prigogine’s own Wikipedia webpage, “Prigogine proved
that dissipation of energy in chemical systems results in the emergence of new
structures due to internal self-re-organization [in the previous sentence if we replace
“chemical” with “mechanical”, it still holds]. Prigogine (1955) drew connections
between dissipative structures and the Rayleigh-Bénard instability, and the Turing
mechanism. Turing’s (1952) mechanism describes how patterns in nature such as
stripes and spots can arise naturally out of a homogeneous uniform state. Rayleigh-
Bénard instability is a type of natural convection, occurring in a plane horizontal
4.1 Literature Review of Use of Thermodynamics in Continuum Mechanics 135
layer of fluid heated from below, in which the fluid develops a regular pattern of
convection cells known as Bénard cells (Getling, 1998; Koschmieder, 1993).”
“Prigogine’s dissipative structure theory led to pioneering research in self-
organizing systems, as well as philosophical inquiries into the formation of com-
plexity on biological entities and the quest for a creative and irreversible role of time
in the natural sciences. Prigogine’s formal concept of self-organization was used also
as a “complementary bridge” between General Systems Theory and thermodynam-
ics, conciliating the cloudiness of some important systems theory concepts with
scientific rigor.” “Definitions of self-organization and general systems theory are
described as, ‘self-organization’, also called (in the social sciences) spontaneous
order, is a process where some form of overall order arises from local interactions
between parts of an initially ‘disordered’ system. The term ‘disorder’ here defines
the initial state. The process can be spontaneous when sufficient energy is available,
not need control by any external agent. It is often triggered by random fluctuations,
amplified by positive feedback. The resulting organization is wholly decentralized,
distributed over all the components of the system, [i.e., cooling molten metal yields a
solid polycrystal metal is a good example]. As such, the organization is typically
robust and able to survive or self-repair substantial perturbations. Chaos theory
discusses self-organization in terms of islands of predictability in a sea of chaotic
unpredictability. General systems theory is the interdisciplinary study of systems. A
system is a cohesive conglomeration of interrelated and interdependent parts that is
either natural or fabricated. Every system is delineated by its spatial and temporal
boundaries, surrounded, and influenced by its environment, described by its struc-
ture and purpose or nature, and expressed in its functioning. In terms of its effects, a
system can be more than the sum of its parts if it expresses synergy or emergent
behavior. Changing one part of the system usually affects other parts and the whole
system, with predictable patterns of behavior. For systems that are self-learning and
self-adapting, the growth and adaptation depend upon how well the system is
adjusted to its environment. Some systems function mainly to support other systems
by aiding in the maintenance of the other system to prevent failure. The goal of
systems theory is systematically discovering a system’s dynamics, constraints, and
conditions and elucidating principles (purpose, measure, methods, tools, etc.) that
can be discerned and applied to systems at every level of nesting, and in every field
for achieving optimized equifinality” (Beven, 2006) quoting from Wikipedia to
define, “equifinality is the principle that in open systems a given end state can be
reached by many potential means. It also means that a goal can be reached in many
ways. The same final state may be achieved via many different loading paths.
However, in closed systems, a direct cause-and-effect relationship exists between
the initial condition and the final state of the system. Biological and social systems
are open systems, however, operate quite differently.” Simply, the idea of
equifinality implies that the same results may be achieved with different initial
conditions and in many ways.
Entropy has also been used beyond positive sciences. The seminal paper on the
topic was published by Jaynes (1957) who proved that statistical mechanics, which
is based on Boltzmann’s entropy formulation, could be generalized to information
136 4 Unified Mechanics Theory
different yield surface for the same material, using different constants. Therefore, it
would be more accurate to call these thermodynamic forces pseudo-thermodynamics
forces, since they are empirical, and many are not directly related to the laws of
thermodynamics. While Rice (1971) interprets, fαδξα as the work of increments of
internal variables, in practice when we obtain a yield surface from experiments many
of our exponents, or “material constants” have no physical meaning to justify as an
internal thermodynamic variable, because two different scientists can have signifi-
cantly different number of constants to define a yield surface for the same material.
Our argument becomes clearer in the following formulation by Rice (1971).”
Free energy ϕ and its Legendre transform ψ (complementary energy) are given by
the following relations:
ϕ = ϕðE, θ, ξÞ = u - θη
∂ϕ ∂ϕ
ψ = ψ ðS, θ, ξÞ = E : -ϕ=E : - u þ θη ð4:2Þ
∂E ∂E
Based on definitions given in Eq. (4.2) and assuming that entropy is constant, the
variation of complementary energy can be written as
1
δψ = E : δS þ f α δξα þ ηδθ ð4:3Þ
V0
∂ψ ðS, θ, ξÞ ∂ϕðE, θ, ξÞ
E= and S = ð4:4Þ
∂S ∂E
∂ψ ðS, θ, ξÞ ∂ϕðE, θ, ξÞ
f α = V0 = - V0 ð4:5Þ
∂ξα ∂ξα
Using Maxwell’s relations, which are a set of equations derivable from the
symmetry of second derivatives of potentials. Rice (1971) arrives at
∂E ðS, θ, ξÞ 1 ∂f α ðS, θ, ξÞ
= 0 ð4:6Þ
∂ξα V ∂S
4.1 Literature Review of Use of Thermodynamics in Continuum Mechanics 139
Equation (4.6) serves as the central equation in the rest of Rice (1971) in the
development of the theoretical foundations of inelastic constitutive theory. A flow
potential for the inelastic strain rate is proposed by postulating that at any given
temperature and pattern of internal rearrangement within the material, the rate at
which any specific structural rearrangement occurs is fully determined by the
thermodynamic forces. The concept, used by Rice (1971), is identical to the basis
for the unified mechanics theory where pseudo-dynamic forces are replaced with the
thermodynamic fundamental equation of the material. It is assumed that any specific
microstructural configuration can be fully determined by the thermodynamic force
associated with that microstructural rearrangement, as follows:
Rice (1971) further postulates that the current temperature and pattern of internal
microstructural rearrangement may enter the kinetic equations as parameters, but the
influence of the macroscopic stress state on a given microstructural rearrangement
appears only through the fact that the associated force is dependent on stress. The
author acknowledges that this is not the most general class of kinetic equations;
however, it does represent conventional metal plasticity behavior where the associ-
ated shear stress governs slip on a crystallographic system or at the discrete dislo-
cation. Based on Newtonian mechanics, it is assumed that the force on a given
segment of the dislocation line governs its motion. Rice (1971) recast the kinematic
equations in an integral form as follows:
Z f
∂
ξ_ β = ξ_ α ðf , θ, ξÞ df α ð4:8Þ
∂f β 0
where the integral is carried out at fixed values of θ and ξ and defines a point function
since each term in the integrand is an exact differential. Rice (1971) further postu-
lates that thermodynamic forces may be viewed as functions of the macroscopic
stress, S, temperature θ,and internal variables ξ and then defines a function
Z f
1
ΩðS, θ, ξÞ = ξ_ α ðf , θ, ξÞ df α ð4:9Þ
V0 0
∂ΩðS, θ, ξÞ 1 ∂f ðS, θ, ξÞ
= 0 ξ_ α ðf , θ, ξÞ α ð4:10Þ
∂S V ∂S
Rice (1971) elegantly shows that the right-hand side of this equation is equal to
the inelastic strain rate in the following way.
Let δE be the difference (variation) in strain between the neighboring constrained
equilibrium states, differing by δS, δθ, δξ. It is assumed that the variations in the
140 4 Unified Mechanics Theory
∂EðS, θ, ξÞ 1 ∂f α ðS, θ, ξÞ
ðδE Þp = δξα = 0 δξα ð4:11Þ
∂ξα V ∂S
2 2
∂ ψ ∂ ψ
ðδE Þe = : δS þ δθ ð4:13Þ
∂S∂S ∂S∂θ
An analogous expression for the elastic portion can be written from Eq. (4.13).
Therefore, Ω is called a flow potential, (or more commonly known as a yield
surface). According to the normality rule, it can be shown that the inelastic portion of
the strain-rate vector is normal to the surface of constant flow potential in stress
space.
p ∂ΩðS, θ, ξÞ
E_ = ð4:16Þ
∂S
Z t 2 Z Z
T z_ dA þ qdA dt = V 0 u2 - u1 ð4:17Þ
t1 A A
Z t 2 Z
ðq=θÞdA dt ≤ V 0 η2 - η1 ð4:18Þ
t1 A
where T is the surface stress vector (traction), q is the heat supplied to the surface per
unit area (positive inward), z_ is the deformation rate, u is the internal energy, and η is
the entropy. Time integral from t1 to t2 follows the irreversible process. It is
important to point out that elastic deformation here is considered a reversible
process; hence, fatigue under elastic stresses is not possible. Alternatively, it is
assumed that during a molecular dynamics simulation under elastic loading atoms
return to their original lattice site. Based on earlier work by Kestin and Rice (1970),
following the classical approach to irreversible processes for a macroscopically
homogeneous system with internal variables, Rice (1971) views all actual processes
as a sequence of constrained equilibrium states. The work of surface stresses can be
written in terms of internal stresses and strains as V 0 S : E_ for the constrained
equilibrium states and the temperature is assumed uniform in the material. Then
the first and second laws of equilibrium thermodynamics can be generalized for any
actual homogenous process as follows:
S : E_ þ Q_ = u_
Z
1
Q = 0 q dA ≤ θη_ ð4:19Þ
V A
where Q is the total heat supply rate per unit reference volume. Rice (1971) further
defines entropy generation rate due to internal variables and conjugate thermody-
namic forces as
1
σ= f α ξ_ α ≥ 0 ð4:20Þ
θV 0
Then, Rice (1971) defines the total change in entropy as, the summation of Q, the
total heat supply rate per unit reference volume plus the entropy generation due to
internal variables and thermodynamic forces as
Q þ θσ = θη_ ð4:21Þ
∂ϕðE, θ, ξÞ
ξ_ α ≥0 ð4:22Þ
∂ξα
∂ϕðE, θ, ξÞ
f α = - V0 ð4:23Þ
∂ξα
Rice (1971) concludes “That is, the rates of internal arrangement occurring during
a process must be such that the free energy would decrease if strain and temperature
were held fixed at current values.”
In Newtonian mechanics, Eqs. (4.22) and (4.23) are primarily used to require that
flow potential must be convex and smooth surface but are not included in Newton’s
universal laws to degrade the energy or material. Degradation is imposed by
empirical formulation utilizing a damage potential obtained from experimental
data. Dissipation is imposed by an empirical constant, like damping, or by an
empirical function.
Later Rice (1977) derived thermodynamic restrictions on the quasi-static growth
or healing of Griffith cracks. However, as presented they were nothing beyond
global restrictions on the detailed molecular kinetics of crack growth. Therefore,
there was no attempt to integrate these restrictions into Newton’s universal laws of
motion. Other work by Rice (1977) on the use of thermodynamics in the theory of
plasticity and fracture mechanics was based on or a derivative of his work
discussed here.
Bazant (1972) used thermodynamics for modeling the mechanics of interfaces
in concrete structures, where a thermodynamically consistent formulation of
interacting continua with surfaces based on surface thermodynamics was proposed.
Creep, shrinkage, and delayed thermal dilation and their interaction with adsorbed
water layers confined between two solid adsorbent surfaces were accounted for in
the formulation.
Kijalbaev and Chudnovsky (1970) and Chudnovsky (1973, 1984) proposed a
probabilistic model of the fracture process unifying the phenomenological study of
long-term strength degradation of materials, fracture mechanics, and statistical
approach to fracture mechanics. Chudnovsky (1984) used irreversible thermody-
namics to model the deterministic side of the failure phenomenon and stochastic
calculus to account for the failure mechanisms controlled by “chance,” particularly
the random roughness of fracture surfaces. Kijalbaev and Chudnovsky (1970) and
Chudnovsky (1973, 1984) derived the entropy production for an elastic medium
with damage. Citing Swalin (1972) on the invariance of the entropy jump during a
phase transition with respect to stresses and temperature, Chudnovsky (1973)
hypothesized that the entropy jump invariance can be used to predict the local
failure. The author proposed pairing of a phenomenological critical damage param-
eter and entropy jump, ΔS, requirement as a local failure criterion. It is important to
point out that Chudnovsky proposed calculating entropy generation in a fracture
4.1 Literature Review of Use of Thermodynamics in Continuum Mechanics 143
S = SðU, G, A, M Þ ð4:24Þ
∂S ∂S ∂S ∂S
dS = dU þ dG þ dA þ dM ð4:25Þ
∂U ∂G ∂A ∂M
And
[Note: In the derivation of Eq. (4.26) Klamecki assumes U = ST.] where the
temperature, internal energy, microstructural energy, surface energy, and chemical
potential are represented by T, U, ϕ, γ, and μ, respectively. Klamecki (1980a, b,
1984) defined the conservation of energy by
dU = δQ þ δW ð4:27Þ
where Q represents heat and W represents mechanical work. δ is used to imply the
equation is for a process [variation]. Work, W, is assumed to be due to plastic
deformation only, and then taken as stress multiplied by plastic strain. The heat,
Q, has two components: one is assumed to be due to internal heat generation, R, in
the system, and the other one heat flows into the system due to temperature gradient.
As a result, internal energy can be written as
where K is the thermal conductivity and ρ is the mass density. Substituting internal
energy given by Eq. (4.28) in entropy Eq. (4.26) leads to
As Klamecki (1984) points out this definition of entropy is very valuable since it
contains, or may be further generalized to contain, all the energy dissipation mech-
anisms in sliding bodies. Of course, there is nothing in the equation that limits it to
siding bodies only. Klamecki (1984) speculated that there are three regimes of
144 4 Unified Mechanics Theory
system behavior, which are of interest when the intent is to describe changes in the
system by the use of entropy generation. At thermodynamic equilibrium, the system
entropy is at maximum, and the change in entropy with time is zero. When the
system is not in equilibrium because of a continual supply of energy to it, it can be
near equilibrium or far from the equilibrium state. Klamecki (1984) asserted that the
process occurring in the nonequilibrium states could be analyzed by studying the
entropy production in the system. The entropy created in the system is given by
X
n
S= Fi J i ð4:31Þ
i=1
n
dS X dJ dF
S_ = = Fi i þ J i i ð4:32Þ
dt i=1
dt dt
Here, Klamecki (1984) postulates that the first term in the summation represents
the entropy generation rate far from the equilibrium state where thermodynamic
force does not change over time. The second term represents the entropy generation
near equilibrium. As a result, the summation can be summarized as follows:
S_ = S_ J þ S_ F ð4:33Þ
S_ F = 0 ð4:34Þ
Klamecki (1984) attributes this principle, Eq. (4.33), to Glansdorff and Prigogine
(1971a, b). As stated by Klamecki (1984) far from the thermodynamic equilibrium,
no widely accepted entropy production evolution criterion has been formulated so
far. Laws of thermodynamics are valid around equilibrium points only. Since in
engineering mechanics most of our problems are around equilibrium points and
4.1 Literature Review of Use of Thermodynamics in Continuum Mechanics 145
course, that is not accurate, because using plastic strain alone as a metric violates the
second law since if there is no increase in plastic strain but loading under the same
stress level, there will be no entropy generation according to this model.
Naderi et al. (2010) were able to prove experimentally that total cumulative
entropy generation is constant at the time of failure and is independent of sample
geometry, load, and loading frequency. They named this critical entropy value
Fatigue Fracture Entropy (FFE). Figure 4.2 shows experimental fatigue fracture
entropy invariance with respect to geometry and loading frequency. Total accumu-
lated entropy of metals, undergoing repeated cyclic load as it reaches the point of
fracture is a constant value, independent of load amplitude, geometry, size of the
specimen, frequency, and stress state; Naderi et al. (2010) experimentally validated
the invariance of fatigue fracture entropy with respect to loading path and displace-
ment loading amplitude for Al 6061-T6 samples.
In Fig. 4.2 fracture fatigue entropy remains at about 4 MJm-3 K-1 for both
tension-compression and bending fatigue. Displacement amplitude is varied from
25 to 50 mm. A filled square represents tension-compression; a filled circle repre-
sents bending; a filled star represents torsion data.
Amiri and Khonsari (2012) and Jang and Khonsari (2018) also did extensive
experiments to show that fatigue fracture entropy remains constant independent of
loading condition, loading frequency, and sample geometry. Their samples were
AISI 1018 carbon steel and Al 7075-T6. Liakat and Khonsari (2015) measured
fatigue fracture entropy of un-notched and V-notched specimens. They observed that
fatigue fracture entropy remains constant.
Imanian and Modarres (2015, 2018) and Yun and Modarres (2019) experimen-
tally proved the concept of using entropy as a degradation metric; authors discussed
the entropic characterization of the corrosion-fatigue degradation mechanism. They
proposed an entropy-based damage prognostics and health management technique
for integrity assessment and remaining useful life prediction of aluminum 7075-
T651 specimens. Their experimental validations proved that using entropy as a
4.1 Literature Review of Use of Thermodynamics in Continuum Mechanics 147
Fig. 4.2 Fatigue fracture entropy versus the number of cycles to failure for different bending
fatigue tests of Al 6061-T6 with different specimen thickness, frequencies, and displacement
amplitudes. Fatigue fracture entropy remains at roughly 4 MJm-3K-1, regardless of thickness,
load, and frequency. Displacement amplitude varied from 25 to 50 mm. Filled circle, thickness =
6.35 mm, f = 10 Hz; filled diamond, thickness = 3.00 mm, f = 10 Hz; filled star, thickness =
4.82 mm, f = 10 Hz; unfilled circle, thickness = 6.35 mm, f = 6.5 Hz; unfilled triangle, thickness =
4.82 mm, f = 12.5 Hz; unfilled star, thickness = 6.35 mm, f = 6.5 Hz; unfilled diamond, thickness =
6.35 mm, f = 12.5 Hz. (After Naderi et al. (2010))
law of entropy increase in the general form. They also provided extensive experi-
mental validation for the theory of what they called “mechanothermodynamics.”
They stated, “It is shown that mechanothermodynamics—a generalized physical
discipline—is possible by constructing a bridge between Newtonian mechanics and
thermodynamics. The entropy is the bridge between thermodynamics to mechanics.”
The first and the second principles of mechanothermodynamics were presented.
They formulated the foundation of the general theory of degradation evolution of
mechanothermodynamic systems for:
• Energy theory of limiting states
• Energy theory of damage
• Foundations of the theory of electrochemical damage
They provided mathematical fundamentals of the theory of interaction between
damage caused by loads of different nature (mechanical, thermal, etc.). The authors
proposed a single function for the critical damage (limiting) state of metals and
polymeric materials operating in different conditions. The analysis of 136 laboratory
experiment results showed that logarithmic function is fundamental: it is valid for
low-, average-, and high-strength pure metals, alloys, and polymers over a wide
range of temperatures of medium (from helium temperature to 0.8 TS, where TS is
the material melting temperature) and mechanical stresses (up to the strength limit
for single static loading) while the fatigue life was of the order of 106–108 cycles.
Mechanothermodynamics uses the same idea as the unified mechanics theory to
unify Newtonian mechanics and thermodynamics. That is using entropy as a bridge
to connect Newtonian mechanics and thermodynamics, which was first published in
Basaran and Yan (1998). Sosnovskiy and Sherbakov (2015, 2016) used an empirical
logarithmic function to fit the experimental evolution of degradation. It is not clear
why they do not use the Boltzmann equation, since their logarithmic function can be
obtained directly from the Boltzmann equation. The term tribo-fatigue-entropy in
their work refers to entropy generation in tribology [which is the focus of their
research] and the fatigue process. They surmise that if an analogy between light and
strain energy is justified, then the strain energy absorption law may be like
Bouguer’s light absorption law. This law, which is also exponential, becomes the
basis for their empirical degradation function formulation.
Haddad (2017) published probably one of the most comprehensive reviews of the
history of thermodynamics from its classical to its postmodern forms. Haddad et al.
(2005) and Haddad (2019) also provided a general systems theory framework for
thermodynamics which attempts to harmonize thermodynamics with classical New-
tonian mechanics. The main idea used by Haddad (2019) to unify mechanics and
thermodynamics is attributed to Basaran and Yan (1998). Haddad stated, “The
dynamical system’s notion of entropy proposed by Haddad et al. (2005, Haddad
(2019), Basaran and Yan (1998), Basaran and Nie (2004)), Sosnovskiy and
Sherbakov (2016) involving an analytical description of an objective property of
matter can potentially offer a conceptual advantage over the subjective quantum
expressions for entropy proposed in the literature (e.g., Daróczy entropy, Hartley
entropy, Rényi entropy, von Neumann entropy, infinite-norm entropy) involving a
4.1 Literature Review of Use of Thermodynamics in Continuum Mechanics 149
2
voltage levels. (After 0.075
1
Srate(WK−1)
0.000
0 10 20 30 40 50
t(s)
150 4 Unified Mechanics Theory
Cuadras et al. (2017) proposed a method to assess the degradation and aging of
light-emitting diodes (LEDs) based on an irreversible entropy generation rate.
Researchers degraded several LEDs and monitored their entropy generation rate in
accelerated tests. They compared the thermo-electrical results with the optical light
emission evolution during degradation. They found a good relationship between
aging and entropy generation rate because they both are related to device parameters
and optical performance. They proposed a threshold of entropy generation rate as a
reliable damage indicator of LED end-of-life that can avoid the need to perform
optical measurements to assess optical aging. The method is far more physics-based
and beyond the typical empirical statistical models based on curve fitting to test data
for lifetime prediction provided in the literature. Cuadras et al. (2017) tested different
LED colors and electrical stresses to validate the electrical LED model and analyzed
the degradation mechanisms of the devices to validate the model.
There has been significant interest in using entropy generation rate as a damage
metric to predict degradation, fracture, and fatigue life prediction, such as Basaran
and Chandaroy (2002); Basaran and Tang (2002); Basaran et al. (2003, 2005,
2008a, b); Tang and Basaran (2003); Gomez and Basaran (2005, 2006); Lin and
Basaran (2005); Gomez et al. (2006); Li et al. (2008); Li and Basaran (2009); Gunel
and Basaran (2010, 2011a, b); Basaran and Lin (2007a, b, 2008); Basaran and Nie
(2007); Bin Jamal et al. (2020); Sherbakov and Sosnovskiy (2010); Sosnovskiy
(1987, 1999, 2004, 2005, 2007, 2009); Sosnovskiy and Sherbakov (2012, 2017,
2019); Temfack and Basaran (2015); Wang and Yao (2019); Yao and Basaran
(2012, 2013a, b, c); Yun and Modarres (2019); Wang and Yao (2017), Guo et al.
(2018), Zhang et al. (2018), Wang et al. (2019), and Osara and Bryant (2019a, b).
Young and Subbarayan (2019a, b) and Suhir (2019) used the Boltzmann-Arrhenius-
Zhurkov equation to predict the evolution of time to failure, while this approach
essentially is based on Basaran and Yan’s (1998) concept. Suhir advocates using the
Boltzmann equation independent of entropy and Newtonian mechanics. Unfortu-
nately, this approach reduces to using Boltzmann distribution just as an evolution
function for an empirical model curve-fit to test data. Hsiao and Liang (2018)
developed a sensor that monitors entropy generation in real-time and that can give
real-time information on system aging and prediction for further estimating the
failure of electrical or any mechanical system.
We do not claim to have included a comprehensive survey of the literature on the
topic; however, we tried to give a brief historical perspective of the efforts.
Laws of unified mechanics theory are not new laws of nature. They are the unifica-
tion of the existing laws of Newton and thermodynamics at the ab initio level.
The first law of thermodynamics is also known as the law of conservation of
energy. Newton’s first law is about a state where the summation of all externally
applied forces is zero. Newton’s universal first law of motion can be summarized as
4.2 Laws of Unified Mechanics Theory 151
“An object at rest stays at rest and an object in motion stays in motion with the same
speed and the same direction unless acted upon by an unbalanced force.” Newton’s
first law of motion is also known as the law of inertia, exemplified by the simple
example of pulling the tablecloth swiftly from the table while leaving all cups and
plates in place due to their inertia forces.
The first law is intended for a system with no dissipative, or unbalanced forces
acting on the system. Therefore, we will start numerating laws of unified mechanics
with the second law.
The second law of unified mechanics theory is the ab initio level unification of the
second law of Newton and the second law of thermodynamics in Boltzmann’s
formulation.
“The change of momentum of a body is proportional to the impulse impressed on
the body and happens along the straight line on which that impulse is impressed.
Degradation of the input impulse takes place according to the second law of
thermodynamics. The rate of degradation of impulse is directly proportional to the
entropy generation rate in the system along the path chosen. The entropy generation
rate of the system can be mapped onto a linearly independent non-dimensional axis
called the Thermodynamic State Index (TSI) which can have coordinate values only
between 0 and 1 (Fig. 4.4). Thermodynamic state index (1 - Φ) is the normalized
non-dimensional form of the second law of thermodynamics in Boltzmann entropy
formulation.” As a result, unification of the second laws in the Newtonian sense can
be given by
F dt ð1 - ΦÞ = dðmvÞ ð4:35Þ
Generalizing Eq. (4.35) for multiple sources of impulse, [i.e., mechanical, mag-
netic, electric, chemical, etc.] and assuming that the mass is not a function of time,
and there are n number of different impulses acting on the body, including both
conservative and non-conservative, then the second law of the unified mechanics
theory becomes
n Z
X
½F ð1 - ΦÞdt i = mdv ð4:36Þ
i
Combining the second laws of Newton and thermodynamics requires the modi-
fication of the Newtonian space-time coordinate system. A new axis must be added
to be able to define the thermodynamic state. As a result, the motion of any particle in
the universe can be defined only in a five-dimensional space that has five linearly
independent axes. None of these axes can represent the information of other axes.
152 4 Unified Mechanics Theory
The thermodynamic state index is the normalized form of the second law of
thermodynamics as given by the Boltzmann entropy formulation. Derivation of the
(1 - Φ) term is provided in the following section. The additional axes Thermody-
namics State Index (TSI) is necessary to locate the thermodynamic state of a particle.
Space-time coordinates of a point can be defined by Newton’s universal laws of
motion. However, thermodynamic state index axis coordinate cannot be defined by a
space-time coordinate system. We find it necessary to give the following example.
Let us assume there is a 5-year-old boy in Istanbul and a 100-year-old man in
New York. Using the space-time Cartesian coordinate system, we can define their
location by x, y, z coordinates and their age on the time axis. However, this does not
give any information about their thermodynamic state. Let us assume that a 5-year-
old boy has a terminal illness and is expected to die in a few days and 100-year-old is
expected to die in a few days. This information cannot be represented in a space-time
coordinate system. However, on the TSI axis, both persons will be at Φ = 0.999
coordinate.
Another example is, if a soccer ball is given an initial acceleration with a kick, F,
it will have an initial acceleration of a = F/m but eventually, it will come to a stop.
Let’s assume the ball was stationary initially in the terrain profile shown in Fig. 4.6.
Depending on the path it follows, it will come to a stop at one of the valleys. The
amplitude and direction of the initial acceleration of the ball are governed by the
second law of Newton and the slowing down process of the ball is governed by the
laws of thermodynamics, which is represented by (1 - Φ) term.
4.3 Evolution of Thermodynamic State Index (Φ) 153
“To every action, there is always opposed an equal reaction. The initial reaction of a
body in response to action will change over time as the stiffness of the reacting
system degrades over time according to the second law of thermodynamics and its
thermodynamic fundamental equation.” The reaction will be governed by the third
law of the unified mechanics theory, given by
h i
dU 21 d 1
F 12 = F 21 = = k 21 ½1 - Φu221 ð4:37Þ
du21 du21 2
dΦ
For problems where du 21
is assumed to be negligible in each increment, then we
can write the following simplified form of the third law:
where U21 is the strain energy of the reacting member, Φ is the Thermodynamic
State Index, k21 is the stiffness of the reacting member, and u21 is the displacement in
the reacting member.
It is noteworthy that the third law of the unified mechanics theory includes both
Hooke’s law (1678) and Newton’s third law (1687).
have been misinterpreted.” The following section about Boltzmann’s work is a direct
quotation from Sharp and Matschinsky (2015):
The translation provided here is for Boltzmann (1877). Previous work of Maxwell and
Boltzmann’s derivations was based on mechanical laws of motion and particle interaction of
gasses. However, this work by Boltzmann is much more general. His formulation requires
only that particles can exchange kinetic energy, but they do not specify the mechanism. As
Boltzmann predicted in the final sentences of this paper, his approach applied to not just
gasses, but liquids and solids of any composition. Indeed, the Boltzmann distribution has
also passed almost unchanged into the quantum world.
“Boltzmann (1877) also clearly demonstrates that these are two distinct contri-
butions to entropy generation, one arising from the distribution of heat (kinetic
energy of atoms) and the distribution in space of atoms or molecules.” In the initial
effort to understand the nature of entropy, Carnot, Clausius, Maxwell Kelvin, and
others focused almost entirely on the contribution from heat only. “Boltzmann
unified the entropy due to thermal aspects with spatial distribution entropy into
one statistical mechanic’s formulation.” Boltzmann also discovered the third funda-
mental contribution to entropy, namely, radiation by deriving the Stefan-Boltzmann
Law (1884).
“Careful reading of Boltzmann (1877) is enlightening about some apparent
paradoxes subsequently encountered in the development of statistical mechanics.”
Unfortunately, many terms in Boltzmann’s equations and derivation have been
misinterpreted in many physics textbooks. “First, the variable, Ω, is not the loga-
rithm of a probability.” That would be obtained by dividing the number of com-
plexions p for a given state distribution, by the total number of complexions, J.
Boltzmann gives this a different symbol, w, from the first letter of the German word
for probability [wahrscheinlichkeit], but he does not use it. Confusingly, Planck later
chose to write Boltzmann’s equation for entropy as Planck (1901).
S = k ln w þ constant ð4:39Þ
k is Boltzmann’s constant, w is the probability that the system will exist in the state it
is in relative to all the possible states it could be in (Halliday & Resnick, 1966). On
the other hand, in most thermodynamics’ textbooks, Boltzmann’s equation is given
by
S = k ln Ω ð4:40Þ
was so concerned with. He thus avoids at the outset the considerable difficulty Gibbs
had accounting for changes in entropy with time. Boltzmann gave us, for the first
time, a definition of entropy applicable to every state (distribution) at equilibrium or
not. “Then the entropy of the initial and final states is not defined, but one can still
calculate the quantity which we have called the permutability measure. By exten-
sion, every complexion can then be assigned an entropy, using the permutability
measure of the state distribution to which that complexion belongs, Lebowitz
(1993), opening the door to the statistical mechanics of non-equilibrium states and
irreversible processes” (Sharp & Matchinsky, 2015).
Boltzmann’s work has historically been misinterpreted, assumed to apply to
gasses only, and ignored in the continuum mechanics field. Therefore, we feel
compelled to include his original paper in this book. The following section is the
English translation of Boltzmann (1877) by Sharp and Matschinky (2015) in its
entirety.
The relationship between the second fundamental theorem [i.e., the second law of
thermodynamics] and calculations of probability became clear for the first time when
I demonstrated that the theorem’s analytical proof is only possible based on prob-
ability calculations. (I refer to my publication “Analytical proof of the second
fundamental theorem of the mechanical theory of heat derived from the laws of
equilibrium for kinetic energy” Wien. Ber. 63, p 8 reprinted ass Wiss. Abhand. Vol I,
reprint 20, pp 295 and my “Remarks about several problems in the mechanical
theory of heat” 3rd paragraph, Wiss. Abhand. Vol II, reprint 39.) [Boltzmann refers
to thermodynamics by the term “The mechanical theory of heat.”] [Wiss. Abhand
refers to Boltzmann’s collected works.] This relationship is also confirmed by
demonstrating that an exact proof of the fundamental theorem of the equilibrium
of heat is most easily obtained if one demonstrates that a certain quantity—which I
wish to define again as E—has to decrease as a result of the exchange of the kinetic
energy among the gaseous molecules and therefore reaching its minimum value for
the state of the equilibrium of heat. (Compare my “Additional studies about the
equilibrium of heat among gaseous molecules” Wiss. Abhand. Vol I, reprint 22, p
316.) The relationship between the second fundamental theorem and the laws of the
equilibrium of heat is made even more compelling in light of the developments in the
second paragraph of my “Remarks about several problems of the mechanical theory
of heat.” There, I mentioned for the first time the possibility of a unique way of
calculating the equilibrium of heat using the following formulation. It is clear that
every single uniform state distribution which establishes itself after a certain time
4.3 Evolution of Thermodynamic State Index (Φ) 157
given a defined initial state is equally as probable as every single non-uniform state
distribution. Comparable to the situation in-game of Lotto [a board game] where
every single quintet is as improbable as the quintet 12345. The higher probability
that the state distribution becomes uniform with time arises only because there are far
more uniform than non-uniform state distributions. Furthermore: “It is even possible
to calculate the probabilities from the relationships of the number of different state
distributions. This approach would perhaps lead to an interesting method for the
calculation of the equilibrium of heat.” It is thereby indicated that it is possible to
calculate the state of the equilibrium of heat by finding the probability of the different
possible states of the system. The initial state in most cases is bound to be highly
improbable and from it, the system will always rapidly approach a more probable
state until it finally reaches the most probable state, i.e., that of the heat equilibrium.
If we apply this to the second basic theorem [i.e., the second law of thermodynam-
ics], we will be able to identify that quantity which is usually called entropy with the
probability of the particular state. Let’s assume a system of bodies that are in a state
of isolation with no interaction with other bodies, e.g., one body with higher and one
body with lower temperature and one so-called intermediate body which accom-
plishes the heat transfer between the two bodies; or choosing another example by
assuming a vessel with absolutely even and rigid walls one half of which is filled
with the air of low temperature and pressure whereas the other half is filled with the
air of high temperature and pressure. The hypothetical system of particles is assumed
to have a certain state at time zero. Through the interaction between the particles, the
state is changed. According to the second fundamental theorem, this change must
take place in such a way that the total entropy of the particles increases. This means
according to our present interpretation that nothing changes except that the proba-
bility of the overall state for all particles will get larger and larger. The system of
particles always changes from an improbable state to a probable state. It will become
clear later what this means. After the publication of my last treatise regarding this
topic, the same idea was taken up and developed further by Mr. Oskar Emil Meyer
totally independent of me [Die Kinetische Theorie der Gase, Breslau 1877, Seite
262]. He attempts to interpret, in the described manner, the equations of my
continued studies concerning the equilibrium of heat particles. However, the line
of reasoning of Mr. Meyer remained entirely unclear to me, and I will return to my
concerns with his approach on page 172 (Wiss. Abhand. Vol. II).
We have to take a different approach because it is our main purpose not to limit
our discussion to thermal equilibrium but to explore the relationship of this proba-
bilistic formulation to the second theorem of the mechanical theory of heat. We want
first to solve the problem which I referred to above and already defined in my
“Remarks on some problems of the mechanical theory of heat” [Wiss. Abhand.
reprint 39], namely, to calculate the probability of state distributions from the
number of different distributions. We want first to treat as simple a case as possible,
namely, a gas of rigid elastic spherical molecules trapped in a container with
absolutely elastic walls. (Which interact with central forces only within a certain
small distance, but not otherwise, the latter assumption, which includes the former as
a special case, does not change the calculations in the least.) Even in this case, the
158 4 Unified Mechanics Theory
application of probability theory is not easy. The number of molecules is not infinite,
in a mathematical sense, yet the number of velocities each molecule is capable of is
effectively infinite. Given this last condition, the calculations are very difficult; to
facilitate understanding, I will, as in earlier work, consider a limiting case.
Kinetic Energy Has Discrete Values
We assume initially that each molecule is only capable of assuming a finite number
of velocities, such as
1 2 3 p
0, , , ,⋯, ð4:41Þ
q q q q
where p and q are arbitrary finite numbers. Upon colliding, two molecules may
exchange velocities, but after the collision, both molecules still have one of the
above velocities, namely, [The fact that Boltzmann allows the exchange of energy
between molecules automatically makes his formulation general, i.e., irrespective of
the state of matter gas, liquid, or solid.]
1 2 p
0, or , or , etc till ð4:42Þ
q q q
This assumption does not correspond to any realistic mechanical model, but it is
easier to handle mathematically, and the actual problem to be solved is re-established
by letting p and q go to infinity.
Even if at first sight, this seems a very abstract way of treating the problem, it
rapidly leads to the desired objective, and when you consider that in nature all
infinities are [nothing] but limiting cases one assumes each molecule can behave in
this fashion only in the limiting case where each molecule can assume more and
more values of the velocity.
To continue, however, we will consider the kinetic energy, rather than the
velocity of the molecules. Each molecule can have only a finite number of values
for its kinetic energy. As a further simplification, we assume that the kinetic energies
of each molecule form an arithmetic progression, such as the following:
We call P the largest possible value of the kinetic energy, pE. Before impact, each
of the two colliding molecules shall have a kinetic energy of
This means that after the collision, each molecule still has one of the above values
of kinetic energy. The number of molecules in the vessel is n. If we know how many
of these n molecules have a kinetic energy of zero, how many have a kinetic energy
of E, and so on, then we know the kinetic energy distribution. If at the beginning there
4.3 Evolution of Thermodynamic State Index (Φ) 159
is some state distribution among the gas molecules, this will in general be changed
by the collisions. The laws governing this change have already been the subject of
my previous investigations. But right way, I note that this is not my intention here,
instead, I want to establish the probability of state distribution, regardless of how it is
created, or more specifically, I want to find all possible combinations of the ( p + 1)
kinetic energy values allowed to each of the n molecules and then establish how
many of these combinations correspond to each state distribution. [The term “state
distribution” is better translated/interpreted as the distribution of states into English.
However, I am keeping the original translation. Because the term refers to what
Boltzmann refers to as complexion, which is the distribution of [discrete] kinetic
energies of molecules]. The latter number [( p + 1)] then determines the likelihood of
the relevant state distribution, as I have already stated in my published “Remarks
about several problems in the mechanical theory of heat” (Wiss. Abhand. Vol II,
reprint 39, p 121).
As a preliminary, we will use a simpler schematic approach to the problem,
instead of the exact case. Suppose we have n molecules. Each of them can have
kinetic energy
and suppose these energies are distributed in all possible ways among the
n molecules, such that the total energy is a constant, e.g., λE = L. Any such
distribution, in which the first molecule may have a kinetic energy of, e.g., 2E, the
second may have 6E, and so on, up to the last molecule, we call a complexion and so
that each individual complexion can be easily enumerated. We write them in
sequence (for convenience we divide through by E), specifying the kinetic energy
of each molecule. We seek the number P of complexions where w0 [number of]
molecules have kinetic energy 0, w1 molecules have kinetic energy E, w2 have kinetic
energy 2E, up to the wp which have kinetic energy pε. We said, earlier, that given
how many molecules have kinetic energy 0, how many have kinetic energy E, etc.,
this distribution among the molecules specifies the number of P of complexions for
that distribution; in other words, it determines the likelihood of that state distribution.
Dividing the number P by the number of all possible complexions, we get the
probability of the state distribution. [Boltzmann never performs this division. It is
done later by Planck (1901).]
Since the distribution of states does not determine kinetic energies exactly, the
goal is to describe the state distribution by writing as many zeros as molecules with
zero kinetic energy (w0), w1 ones for those with kinetic energy E, etc. All these zeros,
ones, etc. are the elements defining the state distribution. It is now immediately clear
that the number P for each state distribution is the same as the number of permuta-
tions of which the elements of the state distribution are capable and that is why the
number P is the desired measure of the permutability of the corresponding distribu-
tion of states. Once we have specified every possible complexion, we have also all
possible state distributions, the latter differing from the former only by immaterial
permutations of molecular labels. All those complexions which contain the same
160 4 Unified Mechanics Theory
number of zeros, the same numbers of ones, etc., differing from each other merely by
different arrangements of elements, will result in the same state distribution, the
number of complexions forming the same state distribution, and which we have
denoted by P, must be equal to the number of permutations which the elements of the
state distribution are capable of. To give a simple numerical example, take =7, λ = 7
p = 7 so L = 7E, P = 7E. With seven molecules, there are eight possible values for the
kinetic energy 0, E, 2E, 3E, 4E, 5E, 6E, 7E to distribute in any possible way such that the
total kinetic energy = 7E. There are then 15 possible state distributions. We enu-
merate each of them in the above manner, producing the numbers listed in the second
column of the following table of state distributions (Table 4.1). The numbers in the
first column label the different state distributions.
In the last column, under the heading P is the number of possible permutations of
members for each state. The first state distribution, for example, has six molecules
with zero kinetic energy, and the seventh has kinetic energy 7E. So w0 = 6, w7 = 1,
w2 = w3 = w4 = w5 = w6 = 0 [w1 = 0 is also zero. However, Boltzmann does not
include it]. It is immaterial which molecule has kinetic energy 7E.
So, there are seven possible complexions, which represent this state distribution.
Denoting the sum of all possible complexions, 1716 by J then the probability of the
first state distribution is 7/J, similarly, the probability of the second state distribution
is 42/J, and the most probable state distribution is the tenth, as its elements permit the
greatest number of permutations. Hereon, we call the number of permutations the
relative likelihood of the state distribution; this can be defined in a different way,
which we next illustrate with a specific numerical example since generalization is
straightforward. Suppose we have an urn containing an infinite number of paper
slips. On each slip is one of the numbers 0, 1, 2, 3, 4, 5, 6, 7; each number is on the
same number of slips and has the same probability of being picked. We now draw
the first septet of slips and note the numbers on them. This septet provides a sample
state distribution with a kinetic energy of E times the number written on the first slip
for molecule 1 and so forth. We return the slips to the urn and draw a second septet,
which gives us a second state distribution, etc. After we draw a very large number of
septets, we reject all those for which the total does not equal seven. This still leaves a
large number of septets. Since each number has the same probability of occurrence
and the same elements in a different order from different complexions, each possible
complexion will occur equally often. By ordering the numbers within each septet by
4.3 Evolution of Thermodynamic State Index (Φ) 161
size, we can classify each into one of the 15 cases tabulated above. So, the number of
septets that fall into the class 0000007 relative to the 0000016 class will be 7:42,
similarly for all the other septets. The most likely state distribution is the one, which
produces the most septets, namely, the 10th.
(Boltzmann’s footnote: If we divide the number of septets corresponding to a
particular state by the total number of septets, we obtain the probability of distribu-
tion. Instead of discarding all septets whose total is not 7 we could, after drawing a
slip, remove from the urn all those other slips for which a total of 7 is now
impossible, e.g., on drawing a slip with 6 on it, all other slips except those with
0 or l would be removed. If the first 6 slips all had 0 on them, only slips with 7 on
them would be left in the urn. One more thing should be noted at this point. We
construct all possible complexions. If we denote by w0 the arithmetic mean of all
values of w0 which belong to different complexions and form analogous expressions
w1 , w2 , . . . in the limit these quantities would also form the same state distribution.)
[Translators Sharp and Matschinsky’s note: Boltzmann’s comments on the results
of Mr. Oskar Meyer beginning “Ich will hier einige Worte € uber die von. Hrn. Oskar
Meyer. . .” on p 172 (Wiss. Ab.) and ending with “. . .Bearbeitung des allgemeinen
Problems zur€uckkehren,” on p 175 (Wiss. Ab.) are of historical interest only and are
omitted.]
The first task is to determine the permutation number, previously designated by P,
for any state distribution. Denoting by J the sum of the permutations P for all
possible state distributions, the quotient P/J is the state distribution’s probability,
henceforth denoted by W. [Boltzmann uses W for the initial of the German word,
Wahrscheinlichkeit.] We would first like to calculate the permutations P for the state
distribution characterized by w0 molecules with kinetic energy 0, w1 molecules with
the kinetic energy E, etc. It must be understood that
w0 þ w1 þ w2 þ ⋯ þ wp = n ð4:46Þ
w1 þ 2w2 þ 3w3 þ ⋯ þ pwp = λ ð4:47Þ
Because the total number of molecules is n, and the total kinetic energy is λE = L.
Describing the state distribution as before, a complexion has w0 molecules with zero
energy, w1 [is the number of molecules] with one unit [of energy], and so on. The
permutations, P, arise since of the n elements w0 are mutually identical, similarly,
with w1, w2 etc., elements. The total number of permutations is well known.
n!
P= ð4:48Þ
w0 !w1 ! . . .
The most likely state distribution will be for those w0, w1, . . . values for which
P is a maximum or since the numerator is a constant, for which the denominator is a
minimum. The values w0, w1 must simultaneously satisfy the two constraints (4.46)
and (4.47). Since the denominator of P is a product, it is easiest to determine the
minimum of its logarithm, that is the minimum of
162 4 Unified Mechanics Theory
which is identical to (4.49a) for integer values of w0, w1, . . . . We then get the
non-integer values which for constraints (4.46) and (4.47) maximize M1.2 The
solution to the problem will in any case be obtained if for w0, w1, etc., we select
the closest set of integer values. If here and there a deviation of a few integers is
required, the nearest complexion is easily found.
The minimum of M1 is found by adding to both sides of the equation for M1
equation (4.46) multiplied by the constant h and Eq. (4.47) multiplied by the
constant k and setting the partial derivatives with respect to each of the variables
w0, w1, w2, . . . to zero. We thus obtain the following equations:
d ln Γðw0 þ 1Þ
þ h = 0, ð4:50aÞ
dw0
d ln Γðw1 þ 1Þ
þ h þ k = 0, ð4:50bÞ
dw1
d ln Γðw2 þ 1Þ
þ h þ 2k = 0 ð4:50cÞ
dw2
⋮ ⋮ ⋮
d ln Γ wp þ 1
þ h þ pk = 0, ð4:50dÞ
dwp
which leads to
d ln Γðw1 þ 1Þ d ln Γðw0 þ 1Þ
- ð4:50eÞ
dw1 dw0
d ln Γðw2 þ 1Þ d ln Γðw1 þ 1Þ
- ð4:50fÞ
dw2 dw1
1
Translators’ footnote: The ambiguous symbol “l” [used by Boltzmann] for [natural] logarithm in
the original text has been replaced throughout by “ln .”
2
Translators’ footnote: The original text reads as “maximized but should mean minimized”
[because Boltzmann’s objective is to maximize P].
4.3 Evolution of Thermodynamic State Index (Φ) 163
d ln Γðw3 þ 1Þ d ln Γðw2 þ 1Þ
= - ⋯ ð4:50gÞ
dw3 dw2
The exact solution of the problem through evaluation of the gamma function
integral is very difficult; fortunately, the general solution for arbitrary finite values of
p and n does not interest us here, but only the solution for the limiting case of a larger
and larger number of molecules. Then the numbers w0, w1, w2 etc. become larger and
larger, so we introduce the function3
1
ϕðxÞ = ln Γðx þ 1Þ - xð ln x - 1Þ - ln 2π: ð4:51Þ
2
Similarly, for the other equations of (4.50). It is also well known that
1 1
ϕð xÞ = - ln x þ þ ... ð4:53aÞ
2 12x
pffiffiffiffiffi
This series is not valid for x = 0, but here x! and 2π ðx=eÞx should have the same
value, and ϕ(x) = 0. Therefore, the problem of finding the minimum of w0 ! w1 ! w2 ! . . .
is replaced by the easier problem of finding the minimum of
pffiffiffiffiffi w0 w0 pffiffiffiffiffi w1 w1 pffiffiffiffiffi w2 w2
2π 2π 2π
e e e
Providing w is not zero, even at moderately large values of p and n both problems
have matching solutions. From Eq. (4.53a) it follows
dϕðw0 Þ 1 1
=- - - ... ð4:53bÞ
dw0 2w0 12w20
which for larger and larger values of w0 or lnw0 vanishes, the same also applies to the
other w′s, so Eq. (4.52) can be written as follows:
ln w1 - ln w0 = ln w2 - ln w1 ð4:54aÞ
Or
3
Boltzmann approximates ln x! by x ln x - x þ 12 ln ð2π Þ rather than x þ 12 ln x - x þ 12 ln ð2π Þ
as is now usual. For x ≪ 30,the relative difference is small.
164 4 Unified Mechanics Theory
w1 w2
= ð4:54bÞ
w0 w1
w2 w3 w4
= = = ... ð4:54cÞ
w1 w2 w3
One sees immediately that by neglecting the expression (4.53b) the minimum of
the denominator of
pffiffiffiffiffi nn
2π e
pffiffiffiffiffi w w0 pffiffiffiffiffi w
2π e 0
2π we1 1 . . .
is found instead of the minimum of the denominator of (4.48). So, for problems
involving w!, the use of a well-known papproximation
ffiffiffiffiffi (see Schlömilch’s
Comp. S. 438) amounts to the substitution of 2π ðw=eÞw for w!.
If we denote the common value of the quotient (4.54c) by, x we obtain
One sees immediately that these equations differ negligibly from Equation
(42) and the preceding ones from my earlier work “Study of the thermal equilibrium
of gas molecules.” [Boltzmann refers to Eq. (42) in his earlier study, not Eq. (42) in
this derivation]
We can use the last equation to write
xpþ1 - 1
w0 =n ð4:58aÞ
x-1
d xpþ1 - 1
w0 x =λ ð4:58bÞ
dx x - 1
pxpþ1 - ðp þ 1Þ xp þ 1
w0 x =λ ð4:59Þ
ð x - 1Þ 2
pxpþ2 - ðp þ 1Þ xpþ1 þ x λ
= ð4:60aÞ
ðxpþ1 - 1Þ ðx - 1Þ n
Or
One can see immediately from Descartes’ theorem4 that this equation cannot have
more than three real positive roots, of which two are = 1. Again, it is easy to see that
both roots are not solutions to Eqs. (4.56) and (4.57) and do not solve the problem,
but that they showed up in the final equation merely because of multiplying by the
factor (x - 1)2. [This appears to be a typographic error. The ( p + 2)2 power makes
mathematical sense, Sharp and Matschinsky (2015)]. To be convinced of this, one
needs only to derive the final equation directly by dividing Eqs. (4.56) and (4.57).
Following this division and having removed the variable x from the denominator and
collecting powers of x throughout, we get the equation
which is an equation of pth degree, and whose roots supply the solution to the
problem. Thus, Eqs. (4.60a, 4.60b) cannot have more positive roots than the solution
requires. Negative or complex roots have no meaning for the solution to the problem.
We note again that the largest allowed kinetic energy P = pE is very large compared
to the mean kinetic energy of a molecule
L λE
= ð4:62Þ
n n
From this, it follows that p is very large compared to λ/n. The polynomial
Eq. (4.61), which shares the same real roots with Eqs. (4.60a, 4.60b), is negative
for x = 0, x = 1; however, it has the value
p λ
nð p þ 1 Þ - , ð4:63Þ
2 n
which is positive and very large, since p is very large compared to n. The only
positive root occurs for x between 0 and 1, and we obtain it from the more convenient
Eq. (4.60b). Since x is a proper fraction, then the pth and ( p + 1)th powers are
smaller and can be neglected, in which case we obtain
4
Cardano’s formula.
166 4 Unified Mechanics Theory
λ
x= ð4:64Þ
nþλ
This is the value to which x tends for large p, and one can see the important fact
that for reasonably large values of p the value of x depends almost exclusively on the
ratio λ/n, and varies little with either λ or n providing their ratio is constant. Once one
has found x, it follows from Eq. (4.58b) that
1-x
w0 = n ð4:65Þ
1 - xpþ1
And Eq. (4.55) gives the values of the remaining w′s. It is seen from the quotients
w0 w1 w2
, , ,etc:
n n n
that the probabilities of the various kinetic energy values for larger p are again
dependent almost exclusively on the mean energy of the molecule. For infinitely
large p we obtain the following limiting values:
n2 n2 λ n2 λ 2
w0 = w1 = w2 = etc: ð4:66Þ
nþλ ðn þ λÞ2 ðn þ λÞ3
1 1 1
w ð ln w - 1Þ - ln w - ln ð2π Þ þ þ etc:
2 2 12w
and neglecting terms, which have second or higher powers of w in the denominator,
obtain
ðδw0 Þ2 ðδw1 Þ2
δ2 M = þ þ ... ð4:67Þ
w0 w1
Therefore, we do have a minimum. I also want to remark on the size of the term
previously designated J. One easily finds that J is given by the following binomial
coefficient:
λþn - 1
J= λ
ð4:68Þ
λþn - 2 1
1 ð λ þ n - 1Þ
J = pffiffiffiffiffi ð4:69Þ
2π ðn - 1Þn - 2 λλþ2
1 1
λ μ
= ð4:70Þ
n E
2n - 1
λ 1þ 2λ
n-1
ðλ þ n - 1Þλþn - 2 = = λλþn - 2 = λλþn - 2 en - 1
1 1 1
1þ ð4:71Þ
λ
[Note: double equal is a relational operator used to compare two variable values
whether they are equal or not.] Therefore
1 λ n - 1 en - 1
J = pffiffiffiffiffi 1 , ð4:72Þ
2π ðn - 1Þn - 2
λ 1 1
ln J = n ln þ n - ln λ þ ln n - 1 - ln ð2π Þ ð4:73Þ
n 2 2
It goes without saying that these formulas are not derived here solely for finite
p and n values, because these are unlikely to be of any practical importance, but
rather to obtain formulas that provide the correct limiting values when p and
n become infinite.
Nevertheless, it may help to demonstrate, with specific examples of only moder-
ately large values of p and n that these formulas are quite accurate, and though
approximate, are of some value even here.
We first consider the earlier example, where n = λ = 7, i.e., the number of
molecules is seven, and the total kinetic energy is 7E, and so the mean kinetic energy
is E. Suppose first, that p = 7, so each molecule can only have 0, E, 2E, 3E, . . ., 7E of
kinetic energy. Then Eq. (4.60b) becomes
1 7 8
x= þ x - 3x9 : ð4:75Þ
2 2
168 4 Unified Mechanics Theory
Since x is close to 12, we can set x = 12 in the last two very small terms on the right-
hand side, and obtain
1 1 1 1
x= þ ð7 - 3Þ = þ 7 = 0:5078125 ð4:76Þ
2 29 2 2
You could easily substitute this value for x back into the right side of Eq. (4.75)
and obtain a better approximation for x; since we already have an approximate value
for x, a more rapid approach is to apply the ordinary Newton iteration method to
Eq. (4.74) which results in
x = 0:5088742 . . .
w0 = 3:4535 w4 = 0:2316
w1 = 1:7574 w5 = 0:1178
w2 = 0:8943 w6 = 0:0599
w3 = 0:4551 w7 = 0:0304
w0 þ w1 þ w2 þ w3 þ w4 þ w5 þ w6 þ w7 = 7, ð4:77aÞ
w1 þ 2w2 þ 3w3 þ 4w4 þ 5w5 þ 6w6 þ 7w7 = 7, ð4:77bÞ
which are minimum, incidentally because the first of Eq. (4.77a) coincides with the
minimum of ðw0 Þw0 ðw1 Þw1 . . .. This provides only an approximate solution to our
problem, which asks for so many (w0) zeros, so many (w1) ones, etc. with as many
permutations as the resulting complexion permits, while the w′s simultaneously
satisfies the constraints Eqs. (4.77a, 4.77b). Since p and n here are very small, one
hardly expects any great accuracy, yet you already get the solution to the permuta-
tion problem by taking the nearest integer for each w, except for w3, for which you
have to assign the value of 1 instead of 0.4551. In this manner, it is apparent
and we saw in the previous table that the complexion of 0001123 has the most
permutations. We now consider the same special case with n = λ = 7, but set p = 1;
that is, the molecules may have kinetic energies of 0, 1, 2, 3, . . ., 1. We know then
4.3 Evolution of Thermodynamic State Index (Φ) 169
that the values of the variables w will vary little from those of the former case. In fact,
we obtain
1 7 w w1
x= ; w0 = = 3:5, w1 = 0 = 1:75, w2 = = 0:875,etc:
2 2 2 2
We consider a little more complicated example. Take n = 13, λ = 19, but we only
treat the simpler case where p = 1. Then we have
x = 19=32 w4 = 0:6560
w0 = 5:28125 w5 = 0:38950
w1 = 3:13574 w6 = 0:23133
w2 = 1:86815 ⋮
w3 = 1:10493 ⋮
Already from the fact that w0 + w1 + . . . should = 13, it is seen that again one of
the w′s must be increased by one unit. From those w′s that is set to = 0, w5 differs
least from the next highest integer. We want therefore w5 = 1, and obtain the
complexion
0000011122345
whose digit sum is in fact = 19. The number of permutations this complexion is
capable of is
13! 13! 1
=
5!3!2! 4!3!2! 5
A complexion whose sum of digits is also = 19, and which one might suppose is
capable of very many permutations, would be the following:
0000111222334
13! 13! 1
=
4!3!3!2! 4!3!2! 6
170 4 Unified Mechanics Theory
This is less than the number of permutations of the first complexion we found
from the approximate formula. Likewise, we expect that the number of permutations
of the two complexions
0000111122335
And
0000111122344
13! 13! 1
=
4!4!2!2! 4!3!2! 8
Other possible complexions are capable of still fewer permutations, and it would
be quite superfluous to follow these up here. It is seen from the examples given here
that the above formula, even for very small values of p and n gives values of w within
one or two units of the true values. In the mechanical theory of heat, we are always
dealing with extremely large numbers of molecules, so such small differences
disappear, and our approximate formula provides an exact solution to the problem.
We see also that the most likely state distribution is consistent with that known from
gases in thermal equilibrium. According to Eq. (4.66), the probability of having a
kinetic energy sE is given by
s
n2 λ
ws = ð4:78Þ
nþλ nþλ
Since λE/n is equal to the average kinetic [energy] of a molecule μ, which is finite,
so n is very small compared to λ. So, the following approximations
n2 n2 nE λ n E
= 1 - = e-λ = e-μ
n
= = , ð4:79Þ
nþλ λ μ nþλ λ
nE - Esμ
ws = e , ð4:80Þ
μ
n!
w0 !w1 !w2 ! . . .
Again, the most likely state distribution, which corresponds to thermal equilib-
rium, is defined by the maximum of this expression, that is, when the denominator is
minimized. We use again the reasonable approximation of Section I, replacing w!
with the expression
pffiffiffiffiffi w w
2π
e
172 4 Unified Mechanics Theory
pffiffiffiffiffi
We can omit the term 2π since it is a constant factor in the minimization; the
key again is to replace minimization of the denominator with minimization of its
logarithm; then we obtain the condition for thermal equilibrium that
M = w0 ln w0 þ w1 ln w1 þ w2 ln w2 þ . . . - n ð4:82Þ
n = w0 þ w1 þ w2 þ . . . ð4:83Þ
L = Ew1 þ 2Ew2 þ 3Ew3 þ . . . ð4:84Þ
which are identical to Eqs. (4.46) and (4.47) of Section I. Using Eq. (4.81) here, we
replace the variables w by the function f and obtain thereby
Since n and E are constant, (because E has the same value for all possible
complexions, and is constant between different state distributions), one can
minimize
instead. As E is made still smaller, the allowed values of kinetic energy approach a
continuum. For vanishingly small E, various sums in Eqs. (4.86, 4.87, 4.89) can be
written in the form of integrals, leading to the following equations:
Z 1
0
M = f ðxÞ ln f ðxÞdx ð4:90Þ
0
Z 1
n= f ðxÞdx, ð4:91Þ
0
4.3 Evolution of Thermodynamic State Index (Φ) 173
Z 1
L= xf ðxÞdx, ð4:92Þ
0
The functional form of f(x) is sought which minimizes expression (4.90) subject
to the constraints (4.91) and (4.92), so one proceeds as follows: To the right side of
Eq. (4.90) one adds Eq. (4.91) multiplied by a constant k, and Eq. (4.92) multiplied
by a constant h. The resulting integral is
Z 1
½f ðxÞ ln f ðxÞ þ kf ðxÞ þ h xf ðxÞdx
0
where x is the independent variable, and f is the function to be varied. This result in
Z 1
½ ln f ðxÞ þ k þ hx δf ðxÞdx
0
Setting the quantity, which has been multiplied by δf(x) in square brackets =
0, and solving for the function f(x), we obtain
f ðxÞ = Ce - hx ð4:93Þ
Here the constant e-k - 1 is denoted by C for brevity. The second variation of M′
Z 1
0 ½δf ðxÞ2
δ M =
2
dx, ð4:94Þ
0 f ðxÞ
is necessarily positive, since f(x) is positive for all values of x lying between 0 and
1. By the calculus of variations M′ is a minimum. From Eq. (4.93), the probability
that the kinetic energy of a molecule lies between x and x + dx at thermal equilibrium
is
f ðxÞdx = Ce - hx dx ð4:95Þ
The probability that the velocity of a molecule lies between ω and ω + dω would
be
hmω2
Ce - 2 mωdω ð4:96Þ
where m is the mass of a molecule. Equation (4.96) gives the correct state distribu-
tion for elastic disks [bold emphasized by this author] moving in two dimensions,
for elastic cylinders with parallel axis moving in space, but not for elastic spheres,
which move in space. For the latter, the exponential function must be multiplied by
ω2dω not ωdω. To get the right state distribution for the latter case, we must set up
174 4 Unified Mechanics Theory
the initial distribution of paper slips in our urn in a different way. To this point, we
assumed that the number of paper slips labeled with kinetic energy values between
0 and E is the same as those between E and 2E. As also for slips with kinetic energies
between 2E and 3E, 3E and 4E, etc.
Now, however, let us assume that the three velocity components along the three
coordinate axes, rather than the kinetic energies, are written on the paper slips in the
urn. The idea is the same: There are the same number of slips with u between 0 and E,
v between 0 and ξ, and w between 0 and η. The number of slips with u between E and
2E, v between 0 and ξ, and w between 0 and η is the same. Similarly, the number for
which u is between E and 2E, v is between ξ and 2ξ, w is between 0 and η. Generally,
the number of slips for which u, v, w are between the limits u and u + E, v and v + ξ,
w and w + η are the same. Here u, v, w have any magnitude, while E, ξ, η are
infinitesimal constants. With this one modification of the problem, we end up with
the actual state distribution established in gas molecules.
(LB footnote: We can of course, instead of using finite quantities E, ξ, η and then
taking the limit as they go to zero, write du, dv, dw from the outset, then the
distribution of paper slips in the urn must be such that the number for which u, v,
w are between u and u + du, v and v + dv, w and w + dw are proportional to the
product dudvdw and independent of u, v, and w. The earlier distribution of slips in
the urn is characterized by the fact that although E could be replaced by dx, kinetic
energies between 0 and dx, dx and 2dx, 2dx and 3dx, etc. occurred on the same
number of slips.)
If we now define
as the number of molecules of any complexion for which the velocity components lie
between the limits aE and (a + 1)E, bζ and (b + 1)ζ, cη and (c + 1)η, the number of
permutations, or complexions of these elements for any state distribution, becomes
n!
P = Qa = þp Qb = þq Qc = þr ð4:98Þ
a= -p b= -q c= -r wabc !
where we first assume u adopts only values between -pE and +pE, v between -qζ
and +qζ, w between -rη and +rη. Where again, the most likely state distribution
occurs when this expression, or if you will, its logarithm, is maximum. We again
substitute
pffiffiffiffiffi n n pffiffiffiffiffi w w
n! by 2π and w! by2π
e e
pffiffiffiffiffi
where you can again immediately omit the factors of 2π as they simply contribute
additive constants - 12 ln 2π to ln P ; omitting also the constant n ln n the term, the
requirement for the most probable state distribution is that the sum
4.3 Evolution of Thermodynamic State Index (Φ) 175
= þp bX
aX = þq cX
= þr
- wabc ln wabc
a= -p b= -q c= -r
aX
=p bX
=q X
c=r
n= wabc ð4:99Þ
a= -p b= -q c= -r
m X X X 2 2
a=p b=q c=r
L= a E þ b2 ζ 2 þ c2 η2 wabc ð4:100Þ
2 a= -p b= -q c= -r
Substituting for wabc using Eq. (4.97), one immediately sees that the triple sums can
in the limit be expressed as definite integrals; omitting an additive constant, the
quantity to be maximized becomes
The variable Ω, which differs from the logarithm of the number of permutations only
by an additive constant, is of special importance for this work, and we call it the
permutability measure [bold emphasized by L. B.] I note, incidentally, that
suppression of the additive constants has the advantage that the total permutability
measure of two bodies is equal to the sum of the permutability measures of
each body.
Thus, it is the maximum of the quantity (4.101), subject to the constraints (4.102)
and (4.103), that is sought. No further explanation of this problem is needed here; it
is a special case of the problem I have already discussed in my treatise “On the
thermal equilibrium of gases on which external forces act”5 in the section which
5
Wien. Ber. (1875) 72:427–457 (Wiss. Abhand. Vol. II, reprint 32).
176 4 Unified Mechanics Theory
immediately precedes the appendix. There I provided evidence that this state distri-
bution corresponds to the condition of thermal equilibrium. [Boltzmann’s use of the
term thermal equilibrium is important. In mechanics, this corresponds to the equi-
librium of energy, irrespective of the state the matter is in.] Thus, one is justified in
saying that the most likely state distribution corresponds with the condition of
thermal equilibrium. If an urn is filled with slips of paper labeled in the manner
described earlier, the most likely sampling will correspond to the state distribution
for thermal equilibrium. We should not take this for granted, however, without first
defining what is meant by the most likely state distribution. For example, if the urn
were filled with slips labeled originally, then the statement would be incorrect.
The reasoning needed to arrive at the correct state distribution will not escape
those experienced in working with such problems. The same considerations apply to
the following circumstance: If we group all the molecules whose coordinates at a
particular time lie between the limits
and let these molecules collide with other molecules under specific conditions, after
a certain time their coordinates will lie between the limits
dξ dη dζ du dv dw = dΞ dH dZ dU dV dW: ð4:108Þ
This is a general result. If at time zero the coordinates and velocity components of
arbitrary molecules (material points) lie between the limits (4.104) and (4.105) and
unspecified forces act between these molecules, [the fact that Boltzmann does not
make any assumption about the type of forces between the molecules, therefore,
shear forces between molecules are not excluded]. So that at time t the coordinates
and velocity components lie between the limits (4.106) and (4.107), then Eq. (4.108)
is still satisfied. [Note that here Boltzmann uses a 5-axes coordinate system to locate
the molecules, the 5th axis being the momentum.]
If instead of the velocity components, one uses the kinetic energy x and the
velocity direction defined by the two angles α and β, to describe the action of the
forces, these variables would initially lie between the limits
4.3 Evolution of Thermodynamic State Index (Φ) 177
And then after the action of the forces lie between the limits
And so
pffiffiffi pffiffiffiffi
dξ dη dζ x dx φðα, βÞdα dβ = dΞ dH dZ X dX
φðA, BÞdA dB ð4:109Þ
where C, h, β, γ are constants. This is in fact the state distribution for gas at thermal
equilibrium at a certain temperature, not at rest, but moving with a constant net
velocity. You can treat similar problems such as the rotation of gas in the same
manner, by adding in the appropriate constraint equations, which I have discussed in
my essay “On the definition and integration of the equations of molecular motion in
gases” (Wiss. Abhand. Vol II, reprint 36).
178 4 Unified Mechanics Theory
Some comment regarding the derivation of Eq. (4.101) from Eq. (4.98) is
required here. The formula for x! is
pffiffiffiffiffiffiffiffi x x 1 þ⋯
2πx e12x :
e
1 1
ln P = n þ ln n þ þ ⋯ - ðp þ q þ r þ 1Þ ln 2π -
2 12n
= þp bX
aX = þq cX
= þr ð4:112Þ
1 1
- wabc þ ln wabc þ þ⋯
a= -p b= -q c= -r
2 12wabc
First note, that in determining the magnitude of P , in the limit of a very large
number of molecules, n (and thus also of wabc), other small quantities such as E, ζ, η
can be treated as infinitesimals. So, all terms which have n or wabc in the denominator
can be neglected, and the 12 in the term wabc þ 12. The terms containing wabc scale with
the total mass of the gas, while the related 12 terms refer only to a single molecule. So,
the latter quantities can be neglected as the number of molecules increases. We then
get
= þp bX
aX = þq cX
= þr
ln P = n ln n - ðp þ q þ r þ 1Þ ln 2π - wabc ln wabc : ð4:113Þ
a= -p b= -q c= -r
aX
= þp bX
= þq
ln P = n ln n - ðp þ q þ r þ 1Þ ln 2π - n ln ðEζηÞ-
a= -p b= -q
cX
= þr
Eζηf ðaE, bζ, cηÞ ln f ðaE, bζ, cηÞ: ð4:114Þ
c= -r
One sees that aside from the triple sum, the terms on the right-hand side are
constant, and so can be omitted. We also let E, ζ, η decrease while p, q, r increase
infinitely, so the triple sum goes over into a triple integral over limits -1 to +1 and
from ln P we arrive immediately at the expression given by Eq. (4.101) for the
permutability measure Ω. The critical condition is that the number of molecules is
4.3 Evolution of Thermodynamic State Index (Φ) 179
very large; this means that wabc is large compared to 12 ; also that the velocity
components between the limits aE and (a + 1)E, bζ and (b + 1)ζ, cη and (c + 1)η
are identical to those between the limits u and u + du, v and v + dv, w and w + dw.
This may appear strange at first sight, since the number of gas molecules is finite
albeit large, whereas du,dv, dw are mathematical differentials. But, on closer delib-
eration this assumption is self-evident. All applications of differential calculus to the
theory of gases are based on the same assumption, namely, diffusion, internal
friction, [bold emphasized by this author], heat conduction, etc. In each infinitesimal
volume element dxdydz there are still infinitely many gas molecules whose velocity
components lie between the limits u and u + du, v and v + dv, w and w + dw. The
above assumption is nothing more than that very many molecules have velocity
components lying within these limits for every u, v, w.
Consideration of Polyatomic Gas Molecules and External Forces
I will now generalize the formulas obtained so far, by first extending them to
so-called polyatomic gas molecules and then including external forces and thereby
finally beginning to extend the discussion to any solid and liquid. [Bold emphasized
by this author] In order not to consider too many examples, I will in each case deal
with the most important case, where, aside from the equation for kinetic energy,
there is no other constraint.
The first generalization can be applied to our formulas without difficulty. So far,
we assumed each molecule was an elastic sphere or a material point so that its
position in space was entirely defined by three variables (e.g., three orthogonal
coordinates). We know that this is not the case with real gas molecules. We shall
therefore assume that three coordinates are insufficient to completely specify the
position of all parts of a molecule in space; rather r variables will be necessary
p1 , p2 , p3 , . . . , pr ,
the so-called generalized coordinates. Three of them, p1, p2, p3 are the orthogonal
coordinates of the center of mass of the molecule; the others can be either the
coordinates of the individual atoms relative to the center of mass, the angular
direction, or whatever specifies the location of every part of the molecule. We will
also remove the restriction that only one type of gas molecule is present. We assume
instead, that there exists a second type whose every molecule has the generalized
coordinates
If there are v + 1 types of molecules, the generalized coordinates of the final type
are
The first three coordinates are always the orthogonal coordinates of the center of
mass. Of course, the necessary assumption is that many molecules of each type are
present. Let l be the total kinetic energy of the first type of gas; χ is its potential
energy6 (so that χ + l is constant if internal forces only are acting). Furthermore
q 1 , q2 , q3 , . . . , q r
are the momentum coordinates corresponding to p1, p2, . . ., pr. [Bold emphasized
by C.B. Note that, here Boltzmann defines momentum as a new additional axis’
coordinates.] We can think of l in terms of the coordinates p1, p2, . . ., pr and their
derivatives with respect to time
p_ 1 , p_ 2 , p_ 3 , . . . , p_ r ,
is given by
6
The quantity χ called “Kraftfunktion” or “Ergal” by Boltzmann is translated as potential energy.
4.3 Evolution of Thermodynamic State Index (Φ) 181
variables p4, p5, . . ., qr at some time zero lies between the limits (4.115), after a lapse
of sometime t, the values of the same variables lie between the limits
p4 , p5 , . . . , q r
The product of their differentials does not change during a constant time interval.
Therefore, we must now imagine v + 1 urns. In the first are slips of paper, upon
which are written all possible values of the variables p4, p5, . . ., qr; and the number of
slips that have values within the limits of Eq. (4.115) is such that when divided by
the product dp4, dp5. . .qr it is a constant.
Similarly, for the labeling of the slips with the variables p04 , p05 , . . . , q0r in the
second urn, except that for the latter the constant can have a different value. The
same applies to the other urns. We draw from the first urn for each molecule of the
first type, from the second urn for molecules of the second type, etc. We now
suppose that the values of the variables for each molecule are determined by the
relevant drawings. It is of course entirely chance that determines the state distribu-
tions for the gas molecules, and we must first discard those state distributions, which
do not have the prescribed value for total kinetic energy. It will then be most likely
that the state distribution described by Eq. (4.116) will be drawn, i.e., that one
corresponding to thermal equilibrium. The proof of this is straightforward. So, the
results found in the first two sections can be readily generalized to this case.
We want to generalize the problem further, assuming that the gas is composed of
molecules specified exactly as before. But now so-called external forces are acting,
e.g., those like gravity, which originate outside the gas. [Of course, external forces
are not limited to gravity; they can be friction forces exerted by other molecules.] For
details on the nature of these external forces, and how to treat them, see my treatise
“On the thermal equilibrium of gases on which external forces act.”7 The essence of
the solution to the problem remains the same. Only now, the state distribution will no
longer be the same at all points of the vessel containing the gas; therefore
dp1 dp2 dp3 = dP1 dP2 dP3 will no longer hold. We will now understand the
7
Wien. Ber. (1875) 72:427–457.
182 4 Unified Mechanics Theory
molecules whose variables p1, p2, . . ., qr lie between limits 0 and α, 0 and β, 0 and γ,
etc. Furthermore, exactly
Molecules have the same variables within the limits α and 2α, 0 and β, 0 and γ,
etc., and generally
These limits are so close that we can equate all the values in between, then it is as if
the variable p1 could only take the values 0, α, 2α, 3α, etc., variable p2 could take the
values 0, β, 2β, 3β, etc. Let n be the total number of molecules of the first type. We
again distinguish the variables for the other gases by the corresponding accents, so
that
n ln n þ n0 ln n0 þ . . . nðvÞ ln nðvÞ
pffiffiffiffiffi
By N, so we can also replace n! by 2π ðn=eÞn and then immediately take the
logarithm
hX X i
ln P = N - C ln 2π - wab... ln wab... þ w0a0 b0 ... ln w0a0 b0 ... þ . . . : ð4:123Þ
The sums are to be understood in the same sense as the products above. 2C is the
number of factorials in the denominator of Eq. (4.122) minus v + 1. Let us now
substitute the expression (4.121) for the variables w into Eq. (4.123) and then take
the limit of infinitesimal α, β, γ. . . .
Omitting unnecessary constants, the magnitude we obtain for the permutability
measure, denoted by Ω, is
Z Z
- . . . f ðp1 , p2 , . . ., qr Þ ln f ðp1 , p2 , . . ., qr Þdp1 dp2 , . . . ,dqr
Ω= Z Z
þ . . . f p01 , p02 , . . ., q0r0 ln f p01 , p02 , . . ., q0r0 dp01 dp02 , . . . ,dq ′ r0 þ . . .
ð4:124Þ
The integration is to extend over all possible values of the variables. I have in my
paper “On the thermal equilibrium of gases on which external forces act” demon-
strated that the expression in the square brackets is at a minimum for a gas in a state
of thermal equilibrium, including, of course, the kinetic energy constraint equation.
On the Conditions for the Maximum of the Power-Exponent Free Product
Determining the State Distribution Function
Before I go into the treatment of the second law, I want to concisely treat a problem
whose importance I believe I have shown in Section I, in the discussion of the work
of Mr. Oskar Emil Meyer on this subject, namely, the problem of finding the
maximum of the product of the probabilities of all possible states. However, I
want to deal with this problem only for mono-atomic gases, and with no other
constraint than the equation for the kinetic energy. We first consider the simplest
184 4 Unified Mechanics Theory
case where only a discrete number of kinetic energy values, 0, E, 2E, . . ., pE are
possible, and to start we use kinetic energies, not velocity components, as variables.
We again denote by w0, w1, w2, . . ., wv the number of molecules with kinetic energy
0, E, 2E, . . ., pE.
If we treat the subject in the usual way, the following relationship holds: The
quantity
B = w0 w1 w2 ⋯ wp ð4:125Þ
ln B = ln w0 þ ln w1 þ ln w2 þ ⋯ þ ln wp ð4:126Þ
n = w0 þ w1 þ w2 þ w2 þ ⋯ þ wp ð4:127Þ
And
L = w1 þ 2w2 þ 3w3 þ ⋯ þ pwp E: ð4:128Þ
If to Eq. (4.126) we add Eq. (4.127) multiplied by h and add Eq. (4.128) multiplied
by k, then set the partial derivatives of the sum with respect to w0, w1, w2. . . equal to
zero, we obtain the equations
1 1 1
þ h = 0, þ h þ k = 0, þ h þ 2k = 0 etc: ð4:129Þ
w0 w1 w2
1 1 1 1 1 1
- = - = - = ... ð4:130Þ
w1 w0 w2 w1 w3 w2
Or
1 1 1 1
= a, = a þ b, = a þ 2b,⋯ = a þ pb: ð4:131Þ
w0 w1 w2 wp
Substituting these values into Eqs. (4.127) and (4.128), the two constants a and b can
be determined:
4.3 Evolution of Thermodynamic State Index (Φ) 185
1 1 1 1
n= þ þ þ⋯þ ð4:132Þ
a a þ b a þ 2b a þ pb
E 2E 3E pE
L= þ þ þ⋯þ ð4:133Þ
a þ b a þ 2b a þ 3b a þ pb
The direct determination of the two unknowns a and b from these equations
would be extremely lengthy. The method of Regula falsi would provide a more rapid
solution for each special case; I have not troubled myself with such calculations but
will give here only a general discussion of how the expected solutions can be easily
obtained, keeping in mind that these methods can only provide an approximation
solution to the problem, since only positive integers are allowed, but fractional
values are not. The first point to note is that the problem ceases to have any meaning
as soon as the product p ( p + 1)/2 is greater than L/E. Because then it necessarily
follows that one of the w′s, and so also the product B, is zero.
Then there is no question of a maximum value for B. For the problem to make any
sense, an excessive value for the kinetic energy cannot be possible. If
pþ1 L
p = ð4:134Þ
2 E
Then all the w′s from w0 onwards must be equal to one for B to be non-zero. A
greater variation in values can occur only if smaller values of p are chosen. Then,
when n is large the above equations provide usable approximations. First, a will be
significantly smaller than b, so w0 is very large, and w1 will be much smaller; w2 will
be close to w1/2, w3 will be close to 2w2/3, etc. In general, the decrease in the variable
w with an increasing index will be fairly insignificant when the maximum of
w0 w1 w2. . . is sought, rather than the maximum of ww0 0 ww1 1 ww2 2 . . . . Given
much smaller p values, the value of a is not much less than b, so w0 is also not
that much larger than the other w′s; then w2 is greater than w1/2, w3 is greater than
(2/3)w2, etc. The decrease of w with increasing index is even less. Decreasing p still
further, a will dominate, and there will be hardly any decrease in w with increasing
index. Finally, b becomes negative, and the size of w will even increase with
increasing index. The following cases provide examples, for each of the integer
values of the w′s which maximize B are given.
186 4 Unified Mechanics Theory
Let us now turn to the case where the value of the kinetic energy is continuous;
first, consider the kinetic energy x as the independent variable [bold emphasized by
this author], so the problem, in our view is the following: The expression
ZP
Q= ln f ðxÞdx ð4:136Þ
0
ZP ZP
n= f ðxÞdx and L= xf ðxÞdx ð4:137Þ
0 0
are constant. P is also constant. I have purposefully set the upper integration limit to
P, not 1. It is then still straightforward to allow P to increase more and more.
Proceeding accordingly, we obtain:
ZP ZP
1
δ ½ ln x þ h f ðxÞ þ kx f ðxÞ dx = þ h þ kx dx δf = 0 ð4:138Þ
f
0 0
1 1
f=- = ð4:139Þ
h þ kx a þ bx
ZP
1 a þ bP
n= f ðxÞ dx = ln , ð4:140Þ
b a
0
ZP
P a a þ bP
L= xf ðxÞ dx = - ln , ð4:141Þ
b b2 a
0
P P
L þ αn = , bn = ln 1 þ ð4:142Þ
b α
And also
P
ðL þ αnÞ ln 1 þ = Pn: ð4:143Þ
α
P
ln 1 þ ð4:144Þ
α
Could be expanded in powers of P/α, and the Eq. (4.143) would yield a finite
value for P/α. There remains only the possibility that P/α is very large. Since
αn Pn
ln
Pn αn
a
= pe - L ,
nP
α= ð4:145Þ
b
188 4 Unified Mechanics Theory
P p2 nP
b= , a = e- L ð4:146Þ
L L
By the approach used in this section, using the mean kinetic energy of a molecule,
these equations show that in the limit of increasing p, W, L the probability of
dispersion in kinetic energy remains indeterminate. We now want to consider a
second, more realistic problem. We take the three velocity components u, v,
w parallel to the three coordinate axes as the independent variables [bold empha-
sized by this author], and find the maximum of the expression
In addition, its direction is given by the two angles θ and φ (length and breadth8);
we have as is well known
Hence
ZP
Q = 4π ln f ðϱÞϱ2 dϱ ð4:152Þ
0
8
Translators note: Altitude and azimuth?
4.3 Evolution of Thermodynamic State Index (Φ) 189
ZP
n = 4π ϱ2 f ðϱÞ dϱ ð4:153Þ
0
ZP
L = 4π ϱ4 f ðϱÞ dϱ ð4:154Þ
0
If there are no external forces [like friction or body forces], then clearly f(u, v, w) is
independent of the direction of the velocity. Instead of integrating to infinity, we
intentionally integrate to a finite value of P. Evaluating f(ϱ) just as we did for f(x)
earlier, we obtain
1 1
f ð ϱÞ = - = ð4:155Þ
h þ kϱ2 a2 þ b2 ϱ2
where we set -h = a2 and -k = b2. The two constants a and b are to be determined
from Eqs. (4.153) and (4.154) which become, given the value of f(ϱ),
ZP
ϱ2 dϱ P a bP
n = 4π = 4π - arctg ð4:156Þ
a 2 þ b2 ϱ2 b 2 b3 a
0
ZP
ϱ4 dϱ 4πP3 a2
L = 4π = - 2n ð4:157Þ
a2 þ b ϱ2
2
3b2 b
0
1
f ð ϱÞ = ,
a2 - b2 ϱ2
190 4 Unified Mechanics Theory
P a a þ bP
n = 4π - 2 þ 3 ln
b 2b a - Pb
4πP3 a2
L= -
þ 2n
3b2 b
L 2
an a a þ bP
n=3 2 - 2 2 1- ln ð4:159bÞ
P bP 2bP a - Pb
From Eqs. (4.159a) and (4.159b), one first has to calculate the ratio a/b; and by the
same means by which Eq. (4.143) was analyzed, we first determine whether bP/a is
infinitely small, finite, or infinitely large, the only difference being in Eq. (4.159a)
every infinitesimal variation of L/nP2 occurs. However, I will not discuss the point
further, except to note that as n and P grow larger; one also cannot get a result, which
depends only on the average kinetic energy. Also, I will not discuss in detail those
cases where there are other constraint equations besides the equation for kinetic
energy, as this would lead me too far afield. [C.B. Having other constraints would
not lead to any more permutations, but actually less number of permutations.]
To provide a demonstration of how general the concept of the most probable state
distribution of gas molecules is, here I supply another definition for it. Suppose again
that each molecule can only have a discrete number of values for the kinetic energy,
0, E, 2E, 3E, . . ., 1. The total kinetic energy is L = λE. We want to determine the
kinetic energy of each molecule in the following manner: We have in an urn just as
many identical balls (n) as molecules present. Every ball corresponds to a certain
molecule. We now make λ draws from this urn, returning the ball to the urn each
time. The kinetic energy of the first molecule is now equal to the product of E and the
number of times the ball corresponding to this molecule is drawn. The kinetic
energies of all other molecules are determined analogously. We have produced a
distribution of the kinetic energy L among the molecules (a complexion). We again
make λ draws from the urn and produce a second complexion, then a third, etc. many
times (J ), and produce J complexions. We can define the most probable state
distribution in two ways: First, we find how often in all J complexions a molecule
has kinetic energy 0, how often the kinetic energy is E, 2E, etc., and say that the ratios
of these numbers should provide the probabilities that a molecule has kinetic energy
0, E, 2E, etc. at the thermal equilibrium. Second, for each complexion, we form the
corresponding state distribution. If some state distribution is composed of P com-
plexions, we then denote the quotient P =J as the probability of the state distribution.
At first glance, this definition of a state distribution seems very plausible. However,
we shall presently see that this should not be used, because, under these conditions,
the distribution whose probability is the greatest would not correspond to thermal
equilibrium. [C. B.: Boltzmann does not justify or explain this statement. However,
later Planck does show that the highest probability is the equilibrium and uses P/J.] It
is easy to cast the hypothesis that concerns us into formulas. First of all, we want to
discuss the first method of probability determination. We consider the first molecule,
and assume that λ draws were made; the probability that the first molecule was
4.3 Evolution of Thermodynamic State Index (Φ) 191
picked in the first draw is 1/n; however, the probability that another ball was drawn is
(n - 1)/n. Thus the probability that on the 1st, 2nd, 3rd, . . ., kth draws the molecule
corresponding to the first ball has been picked, and then a different ball for each of
the following is given by
λ-k λ
1 k
n-1 n-1 1 k
= ð4:160Þ
n n n n-1
Likewise, the probability that the ball corresponding to the first molecule is
picked on the 1st, 2nd, 3rd, . . ., (k - 1)th, and then (k + 1)th draws, etc. The
probability that the ball corresponding to the first molecule is picked for any arbitrary
k draws and not for the others is
λ
λ! n-1 1 k
wk = : ð4:161Þ
ðλ - k Þ!k! n n-1
This probability that a molecule has then kinetic energy kE is exactly the same for
all the other molecules. Using again the approximation formula for the factorial, we
obtain
rffiffiffiffiffi k
λ
1 n-1 1 λ-k
wk = λ qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
ffi ðλ - k Þ - λ , ð4:162Þ
1 - kλ k ðn - 1Þk
2π n
This shows that the probability of the larger kinetic energies is so disproportion-
ately important that the entire expression does not approach a clearly identifiable
limit with increasing k, λ, 1/E, and n. We will now proceed to the second possible
definition of the most probable state distribution. We need to consider all
J complexions that we have formed by J drawings of λ balls from our urn. One of
the various possible complexions consists of λ drawings of the ball corresponding to
the first ball. We want to express this complexion symbolically by mλ1 m02
m03 . . . m0n . A second complexion, with λ - 1 draws of the ball corresponding to
the first molecule, and one draw of the ball corresponding to the second molecule we
want to express as
We see that the different possible complexions are expressed exactly by various
components; the sum of these appears as the power series
ðm1 þ m2 þ m3 þ ⋯ þ mn Þλ ð4:163Þ
192 4 Unified Mechanics Theory
Finally omit from this product the upper indexes, which then generate a term
exactly proportional to the polynomial coefficient. Then the symbols m01 m003
000
m7 . . . we understand that the first pick corresponded to the first molecule, the
second pick corresponded to the third molecule, on the third pick the ball
corresponding to the seventh molecule was picked out, etc. All possible products
of the variables m01 ,m001 ,m02 etc. represent equi-probable complexions. We want to
know how often among all the terms of the power series (4.163) (whose total number
is nλ); there occur terms whose coefficients contain any one state distribution. For
example, consider the state distribution where one molecule has all the kinetic
energy, and all others have zero kinetic energy. This state distribution appears to
correspond to the following members of the power series (4.163)
mλ1 m02 m03 , . . . ,m01 mλ2 m03 , . . . ,m01 m02 mλ3 . . . etc:
with “undivided” λ. Similarly, for the state distribution in which w0 molecules have
kinetic energy zero, w1 molecules have kinetic energy E, w2 molecules have kinetic
energy 2E, etc., there are
λ!
w0 !w1 !w2 ! . . . wλ !
members of the power series (4.163). Each of these elements has the same polyno-
mial coefficient, and that is identical to
λ!
ð0!Þw0 ! ð1!Þw1 ! ð2!Þw2 ! . . . ðλ!Þwλ !
ðλ!Þ2 1 1
nλ w0 !w1 !w2 ! . . . wλ ! ð0!Þw0 ð1!Þw1 ð2!Þw2 . . . ðλ!Þwλ !
However, the maximization of this quantity also does not lead to the state
distribution corresponding to thermal equilibrium.
4.3 Evolution of Thermodynamic State Index (Φ) 193
Ω=
Z Z Z Z Z Z
- f ðx, y, z, u, v, wÞ ln f ðx, y, z, u, v, wÞdxdydzdudvdw,
ð4:164Þ
And
2N
pV = T: ð4:168Þ
3
p is the pressure per unit area, the entropy of the gas is then
Z
dQ 2 3
= N ln V T 2 þ C: ð4:169Þ
T 3
It follows that for each so-called reversible change of state, wherein in the infinites-
imal limit the gas stays in equilibrium throughout the change
R of state, the increase in
the Permutability measure Ω multiplied by 23 is equal to dQ/T taken over the state
change, i.e., it is equal to the entropy increment. Whereas in fact when a very small
amount of heat dQ is supplied to a gas, its temperature and volume increase by dT
and dV. Then it follows from Eqs. (4.167) and (4.168)
2N
dQ = NdT þ TdV ð4:171Þ
3V
dV 3N dT
dΩ = þ N þ : ð4:172Þ
V 2 T
It is known that if in a system of bodies many reversible changes are taking place,
then the total sum of the entropy of all these bodies remains constant. On the other
hand, if among these processes are ones that are not reversible, then the total entropy
R
of all the bodies must necessarily increase, as is well known from the fact that dQ/T
integrated over a non-reversible cyclic process is negative. According to Eq. (4.170),
the sum of the permutability measures of the bodies ΣΩ and the total permutability
measure must have the same increase. Therefore, at thermal equilibrium the magni-
tude of the permutability measure times a constant is identical to the entropy, within
an additive constant; but it also retains meaning during a non-reversible process,
continually increasing.
We can establish two principles: The first refers to a collection of bodies in which
various state changes have occurred, at least some of which are irreversible, e.g.,
where some of the bodies were not always in thermal equilibrium. If the system was
4.3 Evolution of Thermodynamic State Index (Φ) 195
in a state of thermal equilibrium before and after all these changes, then the sum of
the entropies of all the bodies can be calculated before and after those state changes
without further ado, and it is always equal to 2/3 times the permutability measure of
all the bodies. The first principle is that the total entropy after the state changes is
always greater than before it. The same is of course true of the permutability
measure. The second principle refers to a gas that changes state without requiring
that it begin and end in thermal equilibrium. Then the entropy of the initial and final
states is not defined, but one can still calculate the quantity, which we have called the
permutability measure; and that is to say that its value is necessarily larger after the
state change than before. We shall presently see that the latter proposition can be
applied without difficulty to a system of several gases, and it can be extended as well
to polyatomic gas molecules, and when external forces are acting. For a system of
several gases, the sum of the individual gas permutability measures must be defined
to be the permutability measure of the system; if one introduces on the other hand the
number of permutations itself, then the number of permutations of the system would
be the product of the number of permutations of the constituents. If we assume the
latter principle applies to anybody, then the two propositions just discussed are
special cases of a single general theorem, which reads as follows:
We consider any system of bodies that undergoes some state changes, without requiring the
initial and final states to be in thermal equilibrium. Then the total permutability measure for
the bodies continually increases during the state changes and can remain constant only so
long as all the bodies during the change of state remain infinitely close to thermal equilib-
rium (reversible state changes).
[C.B.: it is important to point out that here Boltzmann drops the term “gas,” in his theorem,
because there is nothing in his formulation that restricts it to gasses. The liquid and solid state
of the matter will introduce additional constraints but not less than gasses. As a result, his
permutability measure is an upper bound for all possibilities.]
To give an example, we consider a vessel divided into two halves by a very thin
partition. The remaining walls of the vessel should also be very thin so that the heat
they absorb can be neglected, and surrounded by a substantial mass of other gas.
One-half of the vessel should be completely filled with gas, while the other is
initially completely empty. Suddenly pulling away from the divider, which requires
no significant work, causes that gas to spread at once throughout the vessel.
Calculating the permutability measure for the gas, we find that this increases during
this process, without changes in any other body. Now the gas is compressed in a
reversible manner to its old volume by a piston. To achieve this manipulation, we
can if we want assume that the piston is created by a surrounding dense gas enclosed
in infinitely thin walls. This gas is unchanged except that it moves down in space.
Since the permutability measure does not depend on the absolute position in
space, the permutability measure of the gas driving the piston does not change. That
of the gas inside the vessel decreases to the initial value since this R gas has gone
through a cyclic process. However, since this was not reversible, dQ/T integrated
over this cycle is not equal to the difference between the initial and final values of the
entropy, but it is smaller due to the uncompensated transformation in the expansion.
196 4 Unified Mechanics Theory
In contrast, heat is transferred to the surrounding gas. So, for this process, the
permutability measure of the surrounding gas is increased by as much as that of
the enclosed gas in the vessel during the first process. Since the latter mass of gas
went through a cyclic process, its entropy decreasedR by as much during the second
process as it increased during the first, but not by dQ/T; and because the second
process was reversible, the entropy of the surrounding gas increased as much as the
enclosed gas’s decreased. The result is, as it has to be, that the sum of the
permutability measures of all bodies of gas has increased. For a gas that moves at
a constant speed in the direction of the x-axis9
4T ððu - αÞ þv þw Þ :
N - 3m 2 2 2
f ðx, y, z, u, v, wÞ = V qffiffiffiffiffiffiffiffiffiffiffiffi
e ð4:173Þ
4πT 3
3m
If we substitute this expression into Eq. (4.164), we get exactly Eq. (4.166) again.
Thus, the translational movement of a mass of gas does not increase its permutability
measure. And, the same is true for the kinetic energy arising from any other net mass
movement (molar movement), because it arises from the progression of the individ-
ual volume elements and their deformations and rotations which are of a higher
order—infinitely small—and therefore entirely negligible. Here we obviously ignore
the changes in permutability measure due to internal friction or temperature changes
connected with those molecular motions. The temperature T of the moving gas is
understood to mean half of the average value of m[(u - α)2 + v2 + w2]. So, if
frictional [bold emphasis by CB] and temperature changes are not present (e.g., if a
gas, together with its enclosing vessel falls freely) a net mass movement does not
affect the permutability measure until its kinetic energy is converted into heat, which
is why molar motion is known as the heat of infinite temperature.
Let us now move on to a mono-atomic gas on which gravity acts. The
permutability measure is represented by the same equation (4.124) but instead of
the generalized coordinates, we again introduce x, y, z, u, v, w. Equation (4.124) thus
gives us a value for Ω which is exactly the same as Eq. (4.164). In the case of thermal
equilibrium, one has
2
f ðx, y, z, u, v, wÞ = Ce - 2T
3
gzþmw2
ð4:174Þ
where ω2 = u2 + v2 + w2. The constant C is determined by the density of the gas. One
has, e.g., a prismatic-shaped vessel of height h, with a flat, horizontal bottom surface
with area =q. Further, let N be the total number of gas molecules in the vessel, and
z denote the height of a gas molecule from the bottom of the vessel, then
9
Translator note: Here V should appear in the denominator, cf the equation after (4.165).
4.3 Evolution of Thermodynamic State Index (Φ) 197
N N
C= Z h = 3 ð4:175Þ
4πT 32 3gz 4πT 2 2T 3gh
q 3g 1 - e - 2T
3m q e - 2T dz 3m
0
It is immediately apparent from the last formula that when a mass of gas falls a bit
lower, without otherwise changing, that does not change its Ω value a bit. (Of course,
gravity acts as a constant downward force, and its increase with the approach to the
earth’s center is neglected, which is always the case in heat theory problems.)
Let us go now to the most general case of an arbitrary gas on which any external
force acts, so we again apply Eq. (4.124). However, so that the formulas are not too
lengthy, let only one type of gas be present in the vessel. The permutability measure
of a gas mixture can then be found without difficulty since it is simply equal to the
sum of the permutability measures each component would have if present in the
vessel alone. For thermal equilibrium, then
- hðχþLÞ
Ne
f=Z Z do dw, ð4:177Þ
e - hðχþLÞ
where χ is the potential energy, L is the kinetic energy of a molecule, and N is the
number of molecules in the vessel, where
It therefore follows
Z Z Z Z
Ω= - f ln fdo dw = - N ln N þ N ln e - hðχþLÞ do dw þ hN χ
rN
þ ð4:179Þ
2
Z Z
Z Z χe - hðχþLÞ dodw
1
χf dodw = Z Z ð4:180Þ
N
e - hðχþLÞ dodw
(In respect of the above, see the already cited book of Watson, pp 36 and 37). The
second term on the right of Eq. (4.179) can be further transformed if one introduces
instead of q1, q2, . . ., qr the variables s1, s2, . . ., sr, which have the property that the
term L is reduced to s21 þ s22 þ ⋯ þ s2r . We then denote by Δ the Jacobian
So then
Z
Z Z r Z Δχe - hχ do
- hðχþLÞ π - hχ
do, χ = Z
2
e do dw = Δe ð4:182Þ
h
Δe - hχ do
And so also
Z
Nr rN
Ω = N ln Δe - hχ do - ln h þ hNχ þ ð1 þ ln π Þ - N ln N ð4:183Þ
2 2
To this expression, one can compare Equation (18) of my paper “Analytical proof of
the second law of thermodynamics from the approach to equilibrium in kinetic
energy” (Wiss. Abhand. Vol I, reprint 20), or Equation (95) of my “Further
Studies” (Wiss. Abhand. Vol I, reprint 22) by replacing p1, p2, . . . with x1, y2, . . .,
q1, q2, . . . with u1, v2, . . .; s1, s2, . . . with
rffiffiffiffi rffiffiffiffi
m m
u1 , v ...,
2 2 1
r
4.3 Evolution of Thermodynamic State Index (Φ) 199
Boltzmann assumes that particles are independent rather than an ensemble like in a
molecular chain or crystal. He also assumes that there is no group interdependent
interaction. However, when his formulation is used in conjunction with conservation
of energy, conservation of mass, and Newton’s laws, it automatically makes a solid
basis for statistical mechanics for all phases of matter. Because even if the molecules
he used in his virtual experiments were no single gas atoms and were interacting
chains, the number of microstates could not change for the ensemble. Essentially gas
atoms can be replaced by an ensemble of molecules in his formulation and his
statistical basis would not change because he does not make any assumptions to
preclude forces between molecules or external forces. However, internal friction and
inter-dependent group interaction between an ensemble of atoms would make his
200 4 Unified Mechanics Theory
S = k ln Ω ð4:185Þ
S = k ln w þ constant ð4:186Þ
10
O. Lummer and E. Pringsheim, Transactions of the German Physical Society 2 (1900), p. 163.
11
H. Rubens and F. Kurlbaum, Proceedings of the Imperial Academy of Science, Berlin, October
25, 1900, p. 929.
12
H. Beckmann, Inaugural dissertation, Tübingen 1898. See also H. Rubens, Weid. Ann.
69 (1899), p. 582.
13
M. Planck, Ann. d. Phys. 1 (1900), p. 719.
14
Compare with Eq. (4.205).
202 4 Unified Mechanics Theory
analysis,15 however, it appeared to me that there must be other expressions that yield
the same result and that in any case, one needs another condition to be able to
calculate S uniquely. I believed I had found such a condition in the principle, which
at the time seemed to me perfectly plausible, that in an infinitely small irreversible
change in a system, near thermal equilibrium [bold emphasis by this author], of
N identical resonators in the same stationary radiation field, the increase in the total
entropy SN = NS with which it is associated depends only on its total energy UN = NU
and the changes in this quantity, but not on the energy U of individual resonators.
This theorem leads again to Wien’s energy distribution law. But since the latter is not
confirmed by experience, one is forced to conclude that even this principle cannot be
generally valid and thus must be eliminated from the theory.16
Thus, another condition must now be introduced which will allow the calculation
of S, and to accomplish this it is necessary to look more deeply into the meaning of
the concept of entropy. Consideration of the untenability of the hypothesis made
formerly will help to orient our thoughts in the direction indicated by the above
discussion. In the following, a method will be described that yields a new, simpler
expression for entropy and thus provides also a new radiation equation that does not
seem to conflict with any facts so far determined.
Calculations of the Entropy of a Resonator as a Function of Its Energy
1. Entropy depends on the disorder and this disorder, according to the electromag-
netic theory of radiation for the monochromatic vibrations of a resonator when
situated in a permanent stationary radiation field, depends on the irregularity with
which it constantly changes its amplitude and phase, provided one considers time
intervals large compared to the time of one vibration but small compared to the
duration of a measurement. If amplitude and phase both remained absolutely
constant, which means completely homogeneous vibrations, no entropy could
exist, and the vibrational energy would have to be completely free to be converted
into work. The constant energy U of a single stationary vibrating resonator
accordingly is to be taken as time average, or what is the same thing, as a
simultaneous average of the energies of a large number N of identical resonators,
situated in the same stationary radiation field, and which are sufficiently separated
so as not to influence each other directly. It is in this sense that we shall refer to the
average energy U of a single resonator. Then the total energy
U N = NU ð4:187Þ
15
M. Planck, loc. cit., pp. 730 ff.
16
Moreover one should compare the critiques previously made of this theorem by W. Wien (Report
of the Paris Congress 2, 1900, p. 40) and by O. Lummer (loc. cit., 1900, p. 92).
4.3 Evolution of Thermodynamic State Index (Φ) 203
SN = NS ð4:188Þ
UN = P E ð4:190Þ
17
L. Boltzmann, Proceedings of the Imperial Academy of Science, Vienna, (II) 76 (1877), p. 428.
204 4 Unified Mechanics Theory
Now it is evident that any distribution of the P energy elements among the
N resonators can result only in a finite, integral, definite number. Every such form
of distribution we call, after an expression used by L. Boltzmann for a similar idea, a
“complex,” [actually Boltzmann calls this complexion]. If one denotes the resona-
tors by the numbers 1, 2, 3, . . ., N, and writes these side by side, and if one sets under
each resonator the number of energy elements assigned to it by some arbitrary
distribution, then one obtains for every complex a pattern of the following form:
ð4:192Þ
N! = N N ð4:194Þ
ðN þ PÞNþP
R= ð4:195Þ
N N PP
4. The hypothesis which we want to establish as the basis for further calculation
proceeds as follows: for the N resonators to possess collectively the vibrational
energy UN, the probability w must be proportional to the number R of all
possible complexes formed by the distribution of the energy UN among the
N resonators; or in other words, any given complex is just as probable as any
other. Whether this actually occurs in nature one can, in the last analysis, prove
only by experience. But should experience finally decide in its favor it will be
possible to draw further conclusions from the validity of this hypothesis about
the particular nature of resonator vibrations, namely, in the interpretation put
forth by J. v. Kries18 regarding the character of the “original amplitudes,
18
Joh. v. Kries, The Principles of Probability Calculation (Freiburg, 1886), p. 36.
4.3 Evolution of Thermodynamic State Index (Φ) 205
n o
U U U U
S=k 1þ log 1 þ - log ð4:198Þ
E E E E
E dλ = θ5 ψ ðλθÞ dλ ð4:199Þ
19
W. Wien, Proceedings of the Imperial Academy of Science, Berlin, February 9, 1893, p. 55.
20
M. Thiesen, Transactions of the German Physical Society 2 (1900), p. 66.
21
Perhaps one should speak more appropriately of a “white” radiation, to generalize what one
already understands by total white light.
206 4 Unified Mechanics Theory
In addition, from this, we see, among other things, that as is well known, the
radiant energy u λ3 at a given temperature and frequency is the same for all
diathermic media.
8. To go from the energy density u to the energy U of a stationary resonator
situated in the radiation field and vibrating with the same frequency, ν, we use
the relation expressed in equation (34) of my paper on irreversible radiation
processes:22
ν2
K= U ð4:203Þ
c2
8πK
u= ð4:204Þ
c
22
M. Planck, Ann. d. Phys. 1 (1900), p. 99.
4.3 Evolution of Thermodynamic State Index (Φ) 207
8πν2
u= U ð4:205Þ
c3
where now c does not appear at all. In place of this we may also write
U
θ=ν f ð4:207Þ
ν
1 dS
= ð4:208Þ
θ dU
We then obtain
dS 1 U
= f ð4:209Þ
dU ν ν
And integrated
U
S=f ð4:210Þ
ν
ε = hν ð4:211Þ
And consequently
n o
U U U U
S=k 1þ log 1 þ - log ð4:212Þ
hν hν hν hν
1 k hν
= log 1 þ ð4:213Þ
θ hν U
hν
U= ð4:214Þ
ekθ - 1
hν
and, from Eq. (4.205) there then follows the energy distribution law sought
for
8π h ν3 1
u= hν ð4:215Þ
c3 ekθ - 1
8πch 1
E= ch ð4:216Þ
λ5 ekλθ - 1
I plan to derive elsewhere the expressions for the intensity and entropy of
radiation progressing in a diathermic medium, as well as the theorem for the
increase of total entropy in nonstationary radiation processes.
Numerical Values
11. The values of both universal constants h and k may be calculated rather precisely
with the aid of available measurements. F. Kurlbaum,23 designating the total
energy radiating into the air from 1 square centimeter of a black body at
temperature t °C in 1 s by St, found that
From this one can obtain the energy density of the total radiation energy in
the air at the absolute temperature 1:
4 7:31 105
= 7:061 10 - 15 erg=cm3 deg4 ð4:218Þ
3 105 3734 - 2734
On the other hand, according to Eq. (4.215), the energy density of the total
radiant energy for θ = 1 is
23
F. Kurlbaum, Wied. Ann. 65 (1898), p. 759.
4.3 Evolution of Thermodynamic State Index (Φ) 209
Z 1 Z 1
8πh v3 dv
u = u dv = 3 ð4:219aÞ
0 c 0 ehv=k - 1
Z 1
8πh
u = ν3 e - k þ e - þ e-
hν 2hν 3hν
k k þ . . . dν ð4:219bÞ
c3 0
If we set this equal to 7.061 10-15, then, since c = 3 1010 cm/s, we obtain
k4
= 1:1682 1015 ð4:220Þ
h3
12. O. Lummer and E. Pringswim24 determined the product λmθ, where λm is the
wavelength of maximum energy in the air at temperature θ, to be 2940
microndegree. Thus, in absolute measure
On the other hand, it follows from Eq. (4.216), when one sets the derivative
of E with respect to θ equal to 0, thereby finding λ = λm
ch
1- ech=kλm θ = 1 ð4:222Þ
5kλm θ
ch
λm θ = ð4:223Þ
4:9651 k
Consequently
h 4:9561 0:294
= = 4:866 10 - 11 ð4:224Þ
k 3 1010
From this and Eq. (4.220) the values for the universal constants become
24
O. Lummer and E. Pringsheim, Transactions of the German Physical Society 2 (1900), p. 176.
210 4 Unified Mechanics Theory
S = k ln w ð4:225Þ
4.3 Evolution of Thermodynamic State Index (Φ) 211
NA
s= k ln w ð4:226Þ
ms
As a result, the unit for specific entropy s is J/kg K. Initially, let the probability of
a matter being in a completely “ordered ground state” [a reference state] be w0.
Under any external loading/disturbance, the system will move from the initial
[reference] configuration to a new microstate defined by s and w
s0 → s
ms S0 ms S
w0 = e N A k → w = e N A k ð4:227Þ
According to the second law, in the final stage, the system will reach maximum
entropy and maximum disorder (zero entropy generation rate) state.
4.3 Evolution of Thermodynamic State Index (Φ) 213
s → s max
ms s ms s max
w=e NA k
→ w max = e NA k
ð4:228Þ
During this travel over the energy terrain, we can define the thermodynamics state
of the system at any point as a dimensionless variable that defines the distance from
the origin, [or any reference state] as follows:
w - w0
0≤Φ= ≤1 ð4:229Þ
w
ms s ms s0
h s - s0 i
eN A k - e N A k
= 1 - e NA ð k Þ
- ms
Φ= ms s ð4:230Þ
eN A k
Boltzmann constant k can also be given by = NRA , where R is the gas constant.
Finally, the thermodynamics state index (TSI) can be given in (Fig. 4.4)
h ms Δs
i
Φ = 1 - e- R ð4:231Þ
Fig. 4.7 Initial acceleration of the ball slows down according to laws of thermodynamics
214 4 Unified Mechanics Theory
1 2
U= kδ ð1 - ΦÞ ð4:232Þ
2
Taking the first derivative of strain energy with respect to deformation and
ignoring dΦ
dδ for the sake of simplicity, we obtain
F
δ= ð4:233Þ
k ð1 - ΦÞ
Of course, the degradation of elastic strain energy storage capacity is due to the
degradation of stiffness of the spring material. However, we should point out that in
∂½12kδ2
Eq. (4.233) we assumed that ∂Φ ∂δ
is smaller than ∂δ which is true for high cycle
fatigue, but it is not true for low cycle fatigue.
It is important to remind readers that entropy is essentially energy unavailable for
work. Therefore, the total available energy of a closed-isolated system degrades
along the TSI axis between 0 and 1. Multiplication of stiffness k with (1 - Φ) is due
to the simplicity of the example chosen. Essentially, the TSI coordinate maps the
entropy generation rate [thermodynamic fundamental equation] of any system to a
linearly independent axis.
The question may be asked what happens to a system moving from one stable
equilibrium to another and getting an external energy boost during the travel due to
an external factor. Because entropy is an additive property, and we have to maintain
conservation of energy, we can include the new addition in the system in an
incremental form. Actually, in computational mechanics, we solve differential
equations in incremental format. Therefore, using the thermodynamic state index
works very well for incremental solution procedures.
The ramification of the TSI coordinate is the fact that entropy generation rate
becomes a nodal unknown in computational mechanics in addition to other nodal
unknowns, for example, displacements in a thermo-mechanical analysis. However,
to reduce the amount of computation, it is easier to calculate the TSI at Gauss
integration points based on the results of the previous step. The amount of error is
negligible if the increments are small.
4.4 Experimental Verification Example 215
Now that the Thermodynamic State Index (TSI) is defined, we will use two cases to
calculate TSI using the experimental data. First, a fully reversed uniaxial cyclic
loading on a steel sample is considered, and then a monotonic loading on a steel
sample. The experiments were performed in the same conditions and on identical
specimens (Fig. 4.1) for the two cases.
Now that the Thermodynamic State Index (TSI) is defined, we will use two
different experiments to calculate TSI using the measured data. First, a fully reversed
uniaxial cyclic
loading on a steel sample is considered, and then a monotonic loading case on a
steel sample. The experiments were performed in the same conditions and on
identical specimens (Fig. 4.9) for the two cases.
To evaluate the Thermodynamic State Index (TSI) evolution in both cases, only
the effects of plastic deformation are taken into account, for the sake of simplicity.
We assume that heat generated during cycling loading is insignificant, because the
frequency of loading is very small, and the temperature of the specimen remains
constant. Furthermore, because the load is uniaxial, the plasticity is reduced to one
dimension. The internal entropy generation is then reduced to
Z t
σ : ε_ p
Δs = dt ð4:234Þ
t0 ρT
X ðσ Δεp Þ
Δsi = : ð4:235Þ
ρT
occurs after 80 cycles. Since we assume a small strain in the model, plastic strain is
calculated as follows:
σ
εp = ε - εe , εp = ε - : ð4:236Þ
E
Figure 4.10 shows the entire engineering stress and strain diagram obtained from
the experiment, which is a fully reversed uniaxial cyclic loading. Figure 4.11 depicts
the TSI evolution as a function of number cycles. As expected, TSI is initially zero
and finally approaches the value of one. Because TSI is an exponential function, it
never reaches exactly one. Readers can find more fatigue examples in the papers
cited in the references section.
4.5 Dynamic Equilibrium Equations in Unified Mechanics Theory 217
For this case, the sample is identical to the one used for the cyclic loading. The only
difference is that this time it is monotonically loaded until failure [separation of the
sample into two pieces] occurs. The loading is uniaxial tension; the experimental
data is obtained similarly and treated in the same way as before. Figure 4.12 shows
the stress–strain diagram obtained from the monotonic tension test. Figure 4.13
presents the evolution of the TSI for the specimen. TSI starts from zero and
approaches one (asymptotically) at the end. We can see that the TSI remains very
small for a while before it starts to increase. This domain corresponds to the elastic
range of the material. Because we assumed that only plastic deformation could cause
degradation in the material, there is no entropy generated in the elastic range
according to our assumption. Of course, there is entropy generation due to elastic
response; however, it is smaller by orders of magnitude.
In this section, we will derive the dynamic equilibrium equations using the unified
mechanics theory. For further details, readers are referred to Bin Jamal et al. (2021).
Newtonian mechanics does not include a term for energy loss; an empirical
damping term “C” is used in the dynamic equilibrium equation. However, in unified
218 4 Unified Mechanics Theory
1
0.9
0.8
Damage Parameter
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
0 1000 2000 3000 4000 5000 6000 7000 8000 9000
steps
mechanics theory energy loss is automatically included because they are part of the
universal laws. One-dimensional free vibration analysis, with frictional dissipation,
is used to compare the unified mechanics theory results with that of the Newtonian
mechanics solution.
The action of a material point, over a time increment, Δt can be defined by the
following integral:
Z t2
I= L dt ð4:237Þ
t1
where L represents the Lagrangian of the system. After we introduce TSI, the
Lagrangian of the system can be given by
_ Φi Þ
L = L ðt, u, u, ð4:238Þ
J ð E Þ = I ðq þ E ζ Þ ð4:240Þ
Let’s assume the functional “I” has a local minimum at u = q, and the function
J has a local minimum at E = 0. Hence the following equation must hold:
Z t2
dJ dL
= dt = 0 ð4:241Þ
dE E=0 t1 dE E=0
Taking the total derivative of Lagrangian, L, yields a new additional term when
compared to Newtonian mechanics formulation,
dL ∂L du ∂L d u_ X ∂L dΦi
n
= þ þ ð4:242Þ
dE ∂u dE ∂u_ dE i = 1 ∂Φi dE
In Eq. (4.242) that the derivative of Lagrangian with respect to the new axis,
Thermodynamic State Index, is non-zero. The displacement is represented as
follows:
u=q þ E ζ ð4:243Þ
u_ = q_ þ E ζ_ ð4:244Þ
Using Eqs. (4.243) and (4.244), we can write Eq. (4.242) as follows:
∂L _ X ∂L dΦi
n
dL ∂L
= ζþ ζþ ð4:245Þ
dE ∂u ∂u_ i=1
∂Φi dE
dΦ ∂Φ du ∂Φ du_
= þ ð4:246Þ
dE ∂u dE ∂u_ dE
dL ∂L ∂L ∂Φ ∂L ∂L ∂Φ _
= þ ζþ þ ζ ð4:247Þ
dE ∂u ∂Φ ∂u ∂u_ ∂Φ ∂u_
The second term on the right-hand side can be integrated by parts; we get the
following equation:
Z t2
∂L ∂L ∂Φ d ∂L ∂L ∂Φ
þ - þ ζ dt
t1 ∂u ∂Φ ∂u dt ∂u_ ∂Φ ∂u_ E=0
t 2
∂L ∂L ∂Φ
þ þ ½ζ E = 0 =0 ð4:249Þ
∂u_ ∂Φ ∂u_ t1
By the definition of ζ, the second term in Eq. (4.249) vanishes at the boundary of
the integral domain. Hence, using Eqs. (4.243) and (4.244) in Eq. (4.249) and by
applying the principle of local state, we can write
∂L ∂L ∂Φ d ∂L ∂L ∂Φ
þ - þ =0 ð4:250Þ
∂u ∂Φ ∂u dt ∂u_ ∂Φ ∂u_
The governing equation of motion for a mass-spring system, Fig. 4.14, is derived
using the unified mechanics theory and compared with that of a Newtonian mechan-
ics equation of motion.
Lagrangian of the system is given by
L=K -V ð4:252Þ
where K and V are the total kinetic energy and total potential energy of the system,
respectively. In this example, mass is constant as a function of time. For a mass m,
excited by an external force, the total kinetic energy of a conservative system
(without damping) is given by the following equation:
1 _2
K= m~u ð4:253Þ
2
1 2 1 _2
mu_ = m~u ð1 - ΦÞ ð4:254Þ
2 2
1 _2 1 2 1 1 2 1 1 2
L= m~u - k~u = mu_ - ku ð4:255Þ
2 2 ð1 - Φ Þ 2 ð1 - ΦÞ 2
222 4 Unified Mechanics Theory
∂L
= - ð1 - ΦÞ - 1 ku ð4:256Þ
∂u
∂L 1 1 2 1 1 2
= mu_ - ku ð4:257Þ
∂Φ ð1 - ΦÞ2 2 ð1 - ΦÞ2 2
∂Φ ms ð - ΔSmRs Þ ∂ΔS ms ∂ΔS
= e = ð1 - ΦÞ ð4:258Þ
∂u R ∂u R ∂u
Note: Eq. (4.258) is the derivative of entropy with respect to displacement. This
term, ∂Φ
∂u
, is taken as zero in Newtonian mechanics, even if it is used in a thermody-
namically consistent formulation, because there is no TSI axis.
∂L
= ð1 - ΦÞ - 1 mu_ ð4:259Þ
∂u_
d ∂L d 1
= mu_ ð1 - ΦÞ - 1 þ m€u ð4:260Þ
dt ∂u_ dt ð1 - ΦÞ
dΦ
d ∂L 1
= mu_ dt
þ m€
u ð4:261Þ
dt ∂u_ ð1 - ΦÞ 2 ð1 - ΦÞ
dΦ ms dΔS
= ð1 - ΦÞ ð4:262Þ
dt R dt
d ∂L ∂Φ d ∂L ∂Φ ∂L d ∂Φ
= þ ð4:263Þ
dt ∂Φ ∂u_ dt ∂Φ ∂u_ ∂Φ dt ∂u_
! !
d ∂L d 1 1 2 d 1 1 2
= mu_ - ku ð4:264Þ
dt ∂Φ dt ð1 - ΦÞ2 2 dt ð1 - ΦÞ2 2
!
1 2d 1 1 d 1 2 1 d
= mu_ þ mu_ - ku2
2 dt ð1 - ΦÞ2 ð1 - ΦÞ2 dt 2 2 dt
!
1 1 d 1 2
- ku ð4:265Þ
ð1 - Φ Þ2 ð1 - ΦÞ2 dt 2
!
d ∂L 1 2 1 2 1 ms dΔS 1
= mu_ - ku 2 þ
dt ∂Φ 2 2 ð1 - ΦÞ 2 R dt ð1 - ΦÞ2
ðm€u - kuÞu_ ð4:266Þ
4.5 Dynamic Equilibrium Equations in Unified Mechanics Theory 223
∂Φ ms ∂ΔS
= ð1 - ΦÞ ð4:267Þ
∂u_ R ∂u_
d ∂Φ m d ∂ΔS
= s ð1 - ΦÞ ð4:268Þ
dt ∂u_ R dt ∂u_
ms ∂ΔS d ms d ∂ΔS
= ðð1 - ΦÞÞ þ ð1 - ΦÞ ð4:269Þ
R ∂u_ dt R dt ∂u_
or
d ∂Φ m d ∂ΔS m 2 ∂ΔS dΔS
= s ð1 - Φ Þ - s ð1 - ΦÞ ð4:270Þ
dt ∂u_ R dt ∂u_ R ∂u_ dt
d ∂L ∂Φ
dt ∂Φ ∂u_
( ! )
1 2 1 2 1 ms dΔS 1
= mu_ - ku 2 þ u - kuÞu_
ðm€
2 2 ð1 - ΦÞ2 R dt ð1 - ΦÞ2
( )
ms ∂ΔS 1 1 2 1 1 2
ð1 - ΦÞ þ mu_ - ku
R ∂u_ ð1 - ΦÞ2 2 ð1 - ΦÞ2 2
ms d ∂ΔS m 2 ∂ΔS dΔS
ð1 - ΦÞ - s ð1 - Φ Þ ð4:271Þ
R dt ∂u_ R ∂u_ dt
where the function, ℊ, is the summation of all the terms as a function of ∂ΔS
∂u_
.
Newtonian mechanics governing equation of motion for the 1-D sliding friction
system shown in Fig. 4.14 is given by
ΔS, and represented along the thermodynamic state index axis, Φ. The entropy
generation function during the sliding friction is derived in the following section.
W Td = ð1 þ κ d ÞW m ð4:275Þ
where W Td is the total dissipation work and κ d is the fraction of work dissipated as
heat, acoustic losses, diffusion, etc. The initial potential energy of the spring under
free vibration will be dissipated as work of friction, heat, acoustic losses, diffusion,
etc. However, this conversion cannot be perfect, and there will be energy lost to
unintended work. Hence, the specific entropy generated is given by the following
equation:
Z t2
1
ΔS = ð1 þ κd Þψ s mμg j du j ð4:276Þ
mT t1
where ψ s represents the inefficiency factor, as the ratio between the maximum
entropy at which TSI, Φ tends to unity (say, at Φ = 0.99) and the maximum entropy
that the mass-spring system could generate, for initial potential energy in the spring,
if it had zero inefficiencies. Hence, by substituting Eq. (4.276) in Eq. (4.272), we get
the following equation:
4.5 Dynamic Equilibrium Equations in Unified Mechanics Theory 225
n o
1 1 m 1 1 2 1 2
m€u þ ku þ s ku þ mu_
ð1 - ΦÞ ð1 - ΦÞ R ð1 - ΦÞ 2 2
h i
1
ð1 þ κd Þψ s mμg sgn ðu_ Þ = F ð4:277Þ
mT
where Δt represents the duration for which the momentum is calculated and λ is the
thermodynamic friction coefficient. The inefficiency factor, ψ s, represents the dissi-
pation of initial strain energy during a free vibration. Once the TSI, Φ, is close to
unity (say 0.99), the maximum entropy for a value of, assuming Φc = 1, can be
calculated from TSI as follows:
h i
Φ = Φcritical 1 - e - ΔS R
ms
ð4:279Þ
When ΔS0u = 4:605 mRs, Φcritical = 0.99. Keep in mind that Eq. (4.279) is an exponen-
tial function; hence, Φcritical = 1.0 only when ΔS = 1.
ΔS0u is the maximum entropy, for which TSI is close to unity. For any initial
potential energy, the maximum entropy generation cannot be more than the initial
potential energy; hence it is given by the following equation:
1 1 2
ΔSu = ku ð4:280Þ
mT 2 0
where u0 is the initial excitation displacement. Hence the inefficiency factor is given
by the following equation:
ΔS ′ u
ψs = ð4:281Þ
ΔSu
226 4 Unified Mechanics Theory
Free vibration analysis is done for a single degree of freedom system under frictional
damping. Equations (4.273) and (4.277) are solved by using Newmark’s method in
MATLAB. The model parameters used in the present study are listed in Table 4.2.
Simulation results are plotted in Figs. 4.15, 4.16, 4.17 and 4.18 for the input
parameters given in Table 4.2. The solution of the equation of motion in unified
mechanics theory and Newtonian mechanics for damped oscillation of the mass is
shown in Fig. 4.15 for displacement time history and Fig. 4.16 for velocity time
history, respectively. It can be readily noticed that both models capture damped
oscillation. However, the decay in amplitude is linear for the Newtonian mechanics
model, and in unified mechanics, the response is nonlinear. Experimental results on
similar models show that the amplitude decay is nonlinear. Since the response of the
mass-spring system in the unified mechanics equilibrium equation is dependent on
the entropy evolution within the system, a detailed study is required to get the
entropy generation function, also known as the thermodynamic fundamental equa-
tion. However, this simple example is limited to the derivation of the governing
equation of motion using the fundamental principles of the unified mechanic theory.
Figure 4.17 shows the response of the mass-spring oscillator subjected to friction,
along the Thermodynamic State Index axis. In the present context, the TSI axis
represents the thermodynamic state of oscillation. Hence, the interpretation of TSI in
this example is limited to the dissipation of initial strain energy stored in the spring.
As shown in Fig. 4.17, the oscillations dampen, and the strain energy and kinetic
energy of the mass-spring system decays. Even though the Newtonian equation of
motion given for the mass-spring system accounts for damping due to friction
utilizing a pseudo linear force, the equation does not directly deal with the
Table 4.2 List of parameters used to solve the proposed model and the Newtonian mechanics
equation
Parameter Value Unit
Spring stiffness, k 6.3 × 105 N/m
Mass, m 2 kg
Critical TSI, Φcri 0.99 Unitless
Friction coefficient, μ 0.2 Unitless
Thermodynamic friction coefficient, λ 0.003 Unitless
Molar mass for iron, ms 55.84 × 10-3 kg/mol
Initial spring displacement at t = 0, u0 6.25 × 10-4 m
4.5 Dynamic Equilibrium Equations in Unified Mechanics Theory 227
0.0002
0.0001
0.0000
-0.0001
-0.0002
-0.0003
-0.0004
-0.0005
-0.0006
-0.0007
0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40
Time (s)
Fig. 4.15 Displacement time history for a damped oscillator in the presence of contact surface
friction
0.4
Velocity, v (UMT model)
Velocity, v (Newton's model)
0.2
Velocity, v (m/s)
0.0
-0.2
-0.4
0.0 0.1 0.2 0.3 0.4
Time (s)
Fig. 4.16 Velocity time history time for a damped oscillator in the presence of contact surface
friction
228 4 Unified Mechanics Theory
0.0006
Displacement, u (UMT model)
Velocity, v (UMT model) 0.4
0.0004
0.2
Displacement, u (m)
Velocity, v (m)
0.0002
0.0000 0.0
-0.0002
-0.2
-0.0004
-0.4
-0.0006
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
TSI
Fig. 4.17 The response of the mass-spring oscillator along with the Thermodynamic State Index
(TSI) axis
Fig. 4.18 Displacement amplitude versus time for various static friction coefficient values
4.18 show that the rate of amplitude decay in the unified mechanics theory-based
models is high in the initial stage of oscillation when compared with the final stage.
This could be due to the macro-level model approximations for the entropy gener-
ation equation assumed in Eq. (4.276). A detailed study is required for the identifi-
cation of all the entropy generation sources that are active in the contact surfaces
of the friction block model. Such a study can help in reproducing the nonlinear
response of friction block experiments.
The unified mechanics theory captures the time at which oscillation stops, as
predicted by using the Newtonian mechanics equation of motion. However, the
thermodynamic fundamental equation used in this simple example is very crude with
several oversimplifications
Test data published in the literature by Feeny and Liang (1996) is used to compare
the predictions using the dynamic equilibrium of the unified mechanics’ theory
(UMT) and Newton’s equilibrium equation. Feeny and Liang (1996) experimental
setup consists of a mass-spring single degree of freedom system with dry friction
(Imanian & Modarres, 2015) subjected to an initial displacement. A linear voltage
differential transformer is used to sense the motion of mass. A combined effect of
Coulomb friction and viscous damping on a single degree of freedom system is
investigated by using a similar experimental setup Liang (2005). A list of the model
parameters used in this comparative study is given in Table 4.3.
Simulations and test data for displacements and displacement-amplitude of vibra-
tion are shown in Figs. 4.19 and 4.20, respectively. Both the unified mechanics
theory and Newtonian mechanics capture the damped oscillations under Coulomb
friction. The amplitude variations, as shown in Fig. 4.20, show a different trend for
the unified mechanics theory when compared with the predictions using Newton’s
equation of motion. Newton’s equation of motion predicts results closer to the test
data. However, it is to be noted that Newton’s model parameter involves experi-
mental curve-fit data, representing the frictional force. Nevertheless, the unified
mechanics theory-based model involves TSI, which evolves with the
Table 4.3 Parameters used to compare UMT, Newtonian mechanics, and experimental data
Parameter Value Unit
Spring stiffness, k 54 N/m
Critical TSI, Φc 1 –
Thermodynamic friction coefficient, λ 0.003 –
Mass, m 1.51 kg
Sliding friction coefficient, μ 4.23 × 10-4 –
Molar mass for iron, ms 55.84 × 10-3 kg/mol
Displacement at t = 0, u0 4.25 × 10-3 m
230 4 Unified Mechanics Theory
0.005
0.001
0.000
-0.001
-0.002
-0.003
-0.004
-0.005
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Time (s)
Fig. 4.19 Comparison between simulations predictions and test data for the displacement time
history. (After Bin Jamal et al. (2020))
0
0 1 2 3 4 5 6 7 8 9 10 11 12
Time (s)
for displacement. While comparing the duration of free vibration, UMT predicts a
closer result to the test data. Since the equilibrium equation in UMT is a function of
the thermodynamic fundamental equation, the model requires the computation of
entropy accurately to get a solution matching the test data.
Here we conclude the formulation of the unified mechanics theory. In the
following chapter applications of the theory are discussed in greater detail.
References
Amiri, M., & Khonsari, M. M. (2012). On the role of entropy generation in processes involving
fatigue. Entropy, 14(1), 24–31.
Basaran, C., & Chandaroy, R. (2002). Thermomechanical analysis of solder joints under thermal
and vibrational loading. Transactions of the ASME: Journal of Electronic Packaging, 124(1),
60–67.
Basaran, C., & Lin, M. (2007a). Electromigration induced strain field simulations for
nanoelectronics lead-free solder joints. International Journal of Solids and Structures, 44,
4909–4924.
Basaran, C., & Lin, M. (2007b). Damage mechanics of electromigration in microelectronics copper
interconnects. International Journal of Materials and Structural Integrity, 1(1/2/3), 16–39.
Basaran, C., & Lin, M. (2008). Damage mechanics of electromigration induced failure. Mechanics
of Materials, 40, 66–79.
Basaran, C., & Nie, S. (2004). An irreversible thermodynamic theory for damage mechanics of
solids. International Journal of Damage Mechanics, 13(3), 205–224.
Basaran, C., & Nie, S. (2007). A thermodynamics based damage mechanics model for particulate
composites. International Journal of Solids and Structures, 44, 1099–1114.
Basaran, C., & Tang, H. (2002). Implementation of a thermodynamic framework for damage
mechanics of solder interconnects in microelectronic packaging. International Journal of
Damage Mechanics, 11(1), 87–108.
Basaran, C., & Yan, C. Y. (1998). A thermodynamic framework for damage mechanics of solder
joints. Transactions of the ASME: Journal of Electronic Packaging, 120, 379–384.
Basaran, C., Lin, M., & Ye, H. (2003). A thermodynamic model for electrical current induced
damage. International Journal of Solids and Structures, 40(26), 7315–7327.
Basaran, C., Zhao, Y., Tang, H., & Gomez, J. (2005). A damage mechanics based unified
constitutive model for solder alloys. Transactions of the ASME: Journal of Electronic Packag-
ing, 127(3), 208–214.
Basaran, C., Li, S., & Abdulhamid, M. (2008a). Thermomigration induced degradation in solder
alloys. Journal of Applied Physics, 103, 123520.
Basaran, C., Nie, S., & Hutchins, C. (2008b). Time-dependent behavior of a particle filled
composite PMMA/ATH at elevated temperatures. Journal of Composite Materials, 42(19),
2003–2025.
Bazant, Z. P. (1972). Thermodynamics of interacting continua with surfaces and creep analysis of
concrete structures. Nuclear Engineering Structures and Design, 20, 477–505.
Beven, K. (2006). A manifesto for the equifinality thesis. Journal of Hydrology, 320(1), 18–36.
Bin Jamal, M. N., Kumar, A., Rao, C. L., & Basaran, C. (2020). Low cycle fatigue life prediction
using unified mechanics theory in Ti-6Al-4V alloys. Entropy, 22(1), 24.
Bin Jamal, M. N., Lee, H. W., Lakshmana Rao, C., & Basaran, C. (2021). Dynamic equilibrium
equations in unified mechanics theory. Applied Mechanics, 2, 63–80.
Callen, H. B. (1985). Thermodynamics and an Introduction to Thermostatistics, Wiley.
232 4 Unified Mechanics Theory
Chalmers, M. (2016). Second law of thermodynamics “broken”. New Scientist. Retrieved 2016-
02-09.
Chudnovsky, A. (1973). Fracture of solids. In Scientific papers on Elasticity and plasticity, N9
(pp. 3–43) (Russian).
Chudnovsky, A. (1984). Statistics and thermodynamics of fracture. International Journal of
Engineering Science, 22(8–10), 989–997.
Cuadras, A., Crisóstomo, J., Ovejas, V. J., & Quilez, M. (2015). Irreversible entropy model for
damage diagnosis in resistors. Journal of Applied Physics, 118, 165103.
Cuadras, A., Romero, R., & Ovejas, V. J. (2016). Entropy characterization of overstressed capac-
itors for lifetime prediction. Journal of Power Sources, 336, 272–278.
Cuadras, A., Yao, J., & Quilez, M. (2017). Determination of LEDs degradation with entropy
generation rate. Journal of Applied Physics, 122, 145702.
David Halliday and Robert Resnick (1966) Physics, John Wiley & Sons
De Groot, S. R., & Mazur, P. (1962a). Nonequilibrium thermodynamics. North-Holland.
Dehoff, R. T., “Thermodynamics in Materials Science,” 2nd Edition, Mc Graw-Hill, New York,
1993.
Ehrenfest, P., Trkal, V. Deduction of the dissociation-equilibrium from the theory of quanta and a
calculation of the chemical constant based on this. Ann. Phys., 1921, 65, 609–628.
Evans, D. J., Cohen, E. G. D., & Morriss, G. P. (1993). Probability of second law violations in
shearing steady states. Physical Review Letters, 71(15), 2401–2404.
Feeny, B., & Liang, J. (1996). A decrement method for the simultaneous estimation of coulomb and
viscous friction. Journal of Sound and Vibration, 195, 149–154.
Gerstner, E. Second law broken. Nature (2002). https://doi.org/10.1038/news020722-2
Getling, A. V. (1998). Rayleigh–Bénard convection: Structures and dynamics. World Scientific.
ISBN 978-981-02-2657-2.
Glansdorff, P., & Prigogine, I. (1971a). Thermodynamic theory of structure, stability and fluctua-
tions. Wiley.
Glansdorff, P., & Prigogine, I. (1971b). Thermodynamics theory of structure, stability, and fluctu-
ations. Wiley-Interscience.
Gomez, J., & Basaran, C. (2005). A thermodynamics based damage mechanics constitutive model
for low cycle fatigue analysis of microelectronics solder joints incorporating size effect.
International Journal of Solids and Structures, 42(13), 3744–3772.
Gomez, J., & Basaran, C. (2006). Damage mechanics constitutive model for Pb/Sn solder joints
incorporating nonlinear kinematic hardening and rate dependent effects using a return mapping
integration algorithm. Mechanics of Materials, 38, 585–598.
Gomez, J., Lin, M., & Basaran, C. (2006). Damage mechanics modeling of concurrent thermal and
vibration loading on electronics packaging. Multidiscipline Modeling in Materials and Struc-
tures, 2(3), 309–326.
Gunel, E. M., & Basaran, C. (2010). Stress whitening quantification in thermoformed of mineral
filled acrylics. ASME Journal of Engineering Materials and Technology, 132, 031002-11.
Gunel, E. M., & Basaran, C. (2011a). Damage characterization in non-isothermal stretching of
acrylics: Part I theory. Mechanics of Materials, 43(12), 979–991.
Gunel, E. M., & Basaran, C. (2011b). Damage characterization in non-isothermal stretching of
acrylics: Part II experimental validation. Mechanics of Materials, 43(12), 992–1012.
Guo, Q., Zaõri, F., & Guo, X. (2018). An intrinsic dissipation model for high-cycle fatigue life
prediction. International Journal of Mechanical Sciences, 140, 163–171.
Haddad, W. M. (2017). Thermodynamics: The unique universal science. Entropy, 19, 621.
Haddad, W. M. (2019). A dynamical systems theory of thermodynamics. Princeton University Press.
Haddad, W. M., Chellaboina, V., & Nersesov, S. G. (2005). Thermodynamics: A dynamical systems
approach. Princeton University Press.
Halliday, D. and Resnick R., Physics, John Wiley & Sons, Inc. 1986, New York, NY.
Hsiao, C.-C., & Liang, B.-H. (2018). The generated entropy monitored by pyroelectric sensors.
Sensors, 18, 3320.
References 233
Imanian, A., & Modarres, M. (2015). A thermodynamic entropy approach to reliability assessment
with applications to corrosion fatigue. Entropy, 17(10), 6995–7020.
Imanian, A., & Modarres, M. (2018). A thermodynamic entropy-based damage assessment with
applications to prognosis and health management. Structural Health Monitoring, 17(2),
240–254.
Jang, J. Y., & Khonsari, M. M. (2018). On the evaluation of fracture fatigue entropy. Theoretical
and Applied Fracture Mechanics, 96, 351–361.
Jaynes, E. T. (1957). Information theory and statistical mechanics. Physics Review, 106, 620–615.
Jaynes, E. (1992). The Gibbs paradox. In C. Smith, G. Erickson, & P. Neudorfer (Eds.), Maximum
entropy and Bayesian methods (pp. 1–22). Kluwer Academic Publishers.
Kestin, J., & Rice, J. R. (1970). A critical review of thermodynamics (E. B. Stuart et al., Eds.,
p. 275). Mono Book Corp.
Kijalbaev, D., & Chudnovsky, A. (1970). On fracture of deformable solids. Journal of Applied
Mechanics and Technical Physics, 3, 105.
Klamecki, B. E. (1980a). A thermodynamic model of friction. Wear, 63, 113–120.
Klamecki, B. E. (1980b). Wear – An entropy production model. Wear, 58, 325–330.
Klamecki, B. E. (1984). An entropy-based model of plastic deformation energy dissipation in
sliding. Wear, 96, 319–329.
Koschmieder, E. L. (1993). Bénard cells and Taylor vortices. Cambridge University Press. ISBN
0521-40204-2.
Lebowitz, J. (1993). Boltzmann’s entropy and time’s arrow. Physics Today, 46, 32–38.
Li, S., & Basaran, C. (2009). A computational damage mechanics model for thermomigration.
Mechanics of Materials, 41(3), 271–278.
Li, S., Abdulhamid, M., & Basaran, C. (2008). Simulating damage mechanics of electromigration
and thermomigration. Simulation: Transactions of the Society for Modeling and Simulation
International, 84(8/9), 391–401.
Liakat, M., & Khonsari, M. M. (2015). Entropic characterization of metal fatigue with stress
concentration. International Journal of Fatigue, 70, 223–234.
Liang, J.-W. (2005). Identifying Coulomb and viscous damping from free-vibration acceleration
decrements. Journal of Sound and Vibrations, 282, 1208–1220.
Lin, M., & Basaran, C. (2005). Electromigration induced stress analysis using fully coupled
mechanical-diffusion equations with nonlinear material properties. Computational Materials
Science, 34(1), 82–98.
Naderi, M., Amiri, M., & Khonsari, M. M. (2010). On the thermodynamic entropy of fatigue
fracture. Proceedings of the Royal Society A, 466, 423–438.
Onsager, L. (1931). Reciprocal relations in irreversible processes, I. Physical Review, 37, 405–426.
Onsager, L. (1932). Reciprocal relations in irreversible processes, II. Physical Review, 38,
2265–2279.
Osara, J. A., & Bryant, M. D. (2019a). Thermodynamics of fatigue: Degradation-entropy genera-
tion methodology for system and process characterization and failure analysis. Entropy,
21(7), 685.
Osara, J. A., & Bryant, M. D. (2019b). A thermodynamic model for lithium-ion battery degradation:
Application of the degradation-entropy generation theorem. Inventions, 4, 0023.
Ostoja-Starzewski, M. (2016). Second law violations, continuum mechanics, and permeability.
Continuum Mechanics and Thermodynamics, 28(1–2), 489–501.
Ostoja-Starzewski, M., & Raghavan, B. V. (2016). Continuum mechanics versus violations of the
second law of thermodynamics. Journal of Thermal Stresses, 39(6), 734–749.
Planck, M. (1901). “Ueber das Gesetz der Energieverteilung im Normalspektrum”. Annalen der
Physik. 309 (3): 553–563. Bibcode:1901AnP...309..553P. doi:10.1002/andp.19013090310
Prigogine, I. (1955). Introduction to thermodynamics of irreversible processes. Charles C. Thomas
Publisher.
Prigogine, I. (1957). The molecular theory of solutions. North-Holland Publishing Company.
234 4 Unified Mechanics Theory
Tang, H., & Basaran, C. (2003). A damage mechanics based fatigue life prediction model. Trans-
actions of the ASME: Journal of Electronic Packaging, 125(1), 120–125.
Temfack, T., & Basaran, C. (2015). Experimental verification of a thermodynamic fatigue life
prediction model. Materials Science and Technology, 31(13), 1627–1632.
Turing, A. M. (1952). The chemical basis of morphogenesis. Philosophical Transactions of the
Royal Society of London B, 237(641), 37–72. Bibcode:1952RSPTB.237...37T. https://doi.org/
10.1098/rstb.1952.0012.%20JSTOR%2092463
Valanis, K. C. (1971). Irreversibility and existence of entropy. International Journal of Non-Linear
Mechanicsm, 6(3), 337–360.
Van Kampen, N. G. (1984). The Gibbs paradox. In W. E. Parry (Ed.), Essays in theoretical physics
in honour of Dirk ter Haar (pp. 303–312). Pergamon.
Valanis and Komkov (1980). Irreversible thermodynamics from the point of view of internal
variable theory /A Lagrangian formulation, Archiwum Mechaniki Stosowanej, vol. 32, no. 1,
1980, p. 33–58.
Wang, G. M., Sevick, E. M., Mittag, E., Searles, D.J. & Evans, D.J. Experimental Demonstration of
Violations of the Second Law of Thermodynamics for Small Systems and Short Time Scales.
Physical Review Letters 89, 050601, (2002)
Wang, J., & Yao, Y. (2017). An entropy based low-cycle fatigue life prediction model for solder
materials. Entropy, 19, 503.
Wang, J., & Yao, Y. (2019). An entropy-based failure prediction model for the creep and fatigue of
metallic materials. Entropy, 21(11), 1104.
Wang, T., Samal, S. K., Lim, S. K., & Shi, Y. (2019). Entropy production based full-chip fatigue
analysis: From theory to mobile applications. IEEE Transactions on Computer-Aided Design of
Integrated Circuits and Systems, 38(1), 84–95.
Whaley, P. W. (1983). A thermodynamic approach to metal fatigue. In Proceedings of ASME.
International conference on advances in life prediction methods, Albany, NY, pp. 18–21.
Yao, W., & Basaran, C. (2012). Electromigration analysis of solder joints under ac load: A mean
time to failure model. Journal of Applied Physics, 111(6), 063703.
Yao, W., & Basaran, C. (2013a). Computational damage mechanics of electromigration and
thermomigration. Journal of Applied Physics, 114, 103708.
Yao, W., & Basaran, C. (2013b). Electrical pulse induced impedance and material degradation in IC
chip packaging. Electronic Materials Letters, 9(5), 565–568.
Yao, W., & Basaran, C. (2013c). Electromigration damage mechanics of lead-free solder joints
under pulsed DC loading: A computational model. Computational Materials Science, 71,
76–88.
Young, C., & Subbarayan, G. (2019a). Maximum entropy models for fatigue damage in metals with
application to low-cycle fatigue of aluminum 2024-T351. Entropy, 21(10), 967.
Young, C., & Subbarayan, G. (2019b). Maximum entropy models for fatigue damage aluminum
2024-T351. Entropy, 21, xx.
Yun, H., & Modarres, M. (2019). Measures of entropy to characterize fatigue damage in metallic
materials. Entropy, 21(8), 804.
Zhang, M.-H., Shen, X.-H., He, L., & Zhang, K.-S. (2018). Application of differential entropy in
characterizing the deformation inhomogeneity and life prediction of low-cycle fatigue of metals.
Materials, 11, 1917.
Chapter 5
Unified Mechanics of Thermomechanical
Analysis
5.1 Introduction
This chapter describes the implementation of the unified mechanics theory (UMT)
for inelastic thermomechanical analysis.
σ_ = C : ε_ e ð5:1aÞ
where ε_ ,_εp , and ε_ θ are total strain rate, inelastic strain rate, and thermal strain rate
vectors, respectively, and C is the elastic constitutive tensor for the virgin material.
In Eq. (5.1) “:” represents the inner product between the fourth-order constitutive
tensor C and the elastic strain rate vector, ε_ e .
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2022 237
C. Basaran, Introduction to Unified Mechanics Theory with Applications,
https://doi.org/10.1007/978-3-031-18621-9_5
238 5 Unified Mechanics of Thermomechanical Analysis
1 2
2σ xx = k2
6
In a simple uniaxial tension case, yielding starts when the applied stress, σ xx, is
σ
equal to the yield stress of the material, σ xx = σ y. Hence, pyffiffi3 = k:
Yield stress of most metals during loading and unloading cycles increases, called
isotropic hardening, and the origin of the yield surface moves in the stress space,
called kinematic hardening. Considering the general hardening case, for most
metals, an elastic-plastic domain can be defined according to the following yield
function of the von Mises type:
rffiffiffi
2
F ðσ, αÞ = kS - Xk - K ðαÞ kS - Xk - RðαÞ ð5:2Þ
3
where F(σ, α) is a yield surface separating the elastic from the inelastic domain, σ is
the second-order stress tensor, α is a hardening parameter that specifies the evolution
of the radius of the yield surface, X is the deviatoric component of the back-stress
tensor describing the position of the center of the yield surface in stress space, S is
the deviatoric component of the stress tensor given by S = σ - 13 pI whereqpffiffi is the
hydrostatic pressure and I is the second-order identity tensor, and RðαÞ 2
3 K ðαÞ
is the radius of the yield surface in stress space and k k represents the norm operator.
A schematic representation of the yield surface is described in Fig. 5.1.
5.2 Unified Mechanics Theory-Based Constitutive Modeling 239
qffiffiffi
Fig. 5.1 Yield surface in the principal stress space. 2
3 σ y is the yield stress coordinate on the π
plane
Flow Rule
The evolution of the plastic strain is represented by a general associative flow rule of
the form
∂F
ε_ p = γ γb
n ð5:3Þ
∂σ
with bn = ∂F∂σ
being a vector normal to the yield surface in the stress space and
specifying the direction of plastic flow [this is also referred to as the normality
rule], ε_ p are the plastic strain rate vector, and γ is a nonnegative consistency
parameter, which is derived later in the chapter.
Isotropic Hardening
Isotropic hardening describes the increasing yield stress of the material, which is
represented by the radius of the yield surface defined in Eq. (5.2). Chaboche (1989)
proposed the following evolution empirical function for isotropic hardening in
metals:
rffiffiffi
2
K ðαÞ = σ þ R1 ð1 - e - cα Þ ð5:4aÞ
3 y0
240 5 Unified Mechanics of Thermomechanical Analysis
where α is the plastic strain trajectory evolving according to Eq. (5.4b), σ y0 is the
initial yield stress in uniaxial tension, R1 is an isotropic hardening saturation value,
and c is an isotropic hardening rate material parameter:
rffiffiffiffiffiffiffiffiffiffiffiffiffi
2 p p
α_ = ε_ ε_ ð5:4bÞ
3
Using Eqs. (5.3) and (5.4b), we can write the standard definition of equivalent
plastic strain trajectory as follows:
Z t1
rffiffiffiffiffiffiffiffiffiffiffiffiffi
2 p p
α= ε_ ε_ dt ð5:4cÞ
t0 3
X_ = c1 ε_ p - c2 Xα_ ð5:5Þ
In Eq. (5.5) the first term represents the linear kinematic hardening rule as defined
by Prager (1955). The second term is a recall term, often called a dynamic recovery
term, which introduces the nonlinearity between the back-stress X and the actual
plastic strain. When c2 = 0, Eq. (5.5) reduces to the Prager (1955) linear kinematic
hardening rule. The NLK equation describes the rapid changes in the yield stress due
to the plastic flow during cyclic loadings and plays an important role even under
stabilized conditions (after saturation of cyclic hardening). The NLK rule given by
Eq. (5.5) considers the transient hardening effects in each stress-strain hysteresis
loop during cycling loadings, and after unloading, dislocation remobilization is
implicitly described due to the back-stress effect and the larger plastic modulus at
the beginning of the reverse plastic flow.
Consistency Parameter, γ
In Eqs. (5.3) and (5.4b), γ is a nonnegative consistency parameter representing the
irreversible character of plastic flow. The consistency parameter must satisfy the
following requirements:
h∅ðF Þi
γ= ð5:8Þ
η
n p
AD0 Eb hF i b - Q=Rθ ∂F
ε_ =
vp
e ð5:9Þ
kθ E d ∂σ
Rabotnov (1969) introduced the effective stress concept, and in the following, we
introduce the formulation of Lemaitre and Chaboche (1990) and Lemaitre (1996).
To introduce the effective stress concept, it is useful to consider a representative
volume element (RVE) of a material loaded by force F. At a point M oriented by a
→ →
plane defined by its normal direction n and its abscissa x along the direction n
→
(Fig. 5.2), the nominal uniaxial stress is σ = F/δS, where F = n F; δS is the area of
the intersection of the plane with the RVE.
The effective area of all micro-cracks and micro-cavities that lie in δS is
represented by δSDx. It is assumed that no forces are carried by the micro-cracks
and micro-cavities. It is convenient to introduce an effective stress concept related to
the surface that effectively carries the load, (δS - δSDx), namely, for the n axis, the
equation can be given by Rabotnov (1969):
F
σ~x = ð5:11Þ
δS - δSDn
The effective stress σ~ is higher than the nominal stress, because stresses are
carried by the undamaged [intact] material only.
Thermodynamics enables us to define the state at any point by a set of continuous
state variables. This postulate means that the constitutive equations written for the
surface of (δS - δSDn) are not modified by the damage in the material. Hence, the
true stress is the effective stress and not the nominal stress. As a result, we can
postulate the following strain equivalence principle by Lemaitre and Chaboche
(1990): “Any strain constitutive equation for a damaged material may be derived
in the same way as for an intact material except that the usual stress is replaced by the
effective stress.”
Based on the third law of the unified mechanics theory, the strain energy density rate
can be given by
1 1
U_ = σ_ ε_e = ð1 - ΦÞC : ½ε_ e 2 ð5:12Þ
2 2
σ_ = ð1 - ΦÞ C : ε_ e ð5:13Þ
Of course, ignoring the derivative of Φ with respect to strain rate is not accurate.
However, for any incremental procedure, the ignored portion is smaller by orders of
magnitude because of the square over strain rate. Moreover, our objective here is to
introduce the simplest implementation of unified mechanics theory. For small strain
problems, the elastic strain rate can be calculated from the total strain rate vector by
subtracting the inelastic and thermal components:
σ_ = ð1 - ΦÞC : ε_ - ε_ vp - ε_ θ ð5:14Þ
rffiffiffi
2
F = S - XΦ - ð1 - ΦÞ K ðαÞ S - X Φ - ð1 - ΦÞRðαÞ ð5:15Þ
3
Φ
X_ = ð1 - ΦÞðc1 ε_ vp - c2 Xα_ Þ ð5:16Þ
It is important to point out that in unified mechanics theory, we only care about
degradation along the thermodynamics state index axis. The kinematic hardening
function given in Eq. (5.16) is empirical and hence can be defined in many other
forms.
initial approximation to the solution. The approximated solution is then used in the
implicit method to improve the prediction. The following trial (elastic predictor)
state can be written as
where G is the shear modulus and Δen+1 is the deviatoric strain increment vector.
The increment of the back-stress can then be computed using Eq. (5.16):
Φ
dX Φ vp Φ
nþ1 = ð1 - ΦÞ c1 dεnþ1 - c2 Δγ βX n þ ð1 - βÞX nþ1 ð5:18Þ
qffiffi
where c02 = 2
3 c2 and a generalized midpoint rule for the recall term with the
extreme values of β = 0 and β = 1 corresponds to the backward and forward
Euler methods, respectively. Utilizing Eqs. (5.3) and (5.14), incremental form of
the viscoplastic strain can be written as
Snþ1 - X Φ
dεvp
nþ1 = Δγ
nþ1
ð5:19Þ
Snþ1 - XΦnþ1
Substitution of Eq. (5.19) into Eq. (5.18) yields any value of the integration
parameter β between 0 and 1:
" #
Snþ1 - X Φ c02 Φ
dX Φ
nþ1 = anþ1 Δγ
nþ1
- X ð5:20Þ
Snþ1 - X Φ
nþ1
c1 n
c1 ð1 - ΦÞ
with anþ1 = 1þc02 ð1 - ΦÞð1 - βÞΔγ .
Using the flow rule expressed in Eq. (5.3) allows us to express Eq. (5.17) as
Snþ1 - X Φ
Snþ1 = Strnþ1 - Δγ ð1 - ΦÞ2G nþ1
ð5:21Þ
Snþ1 - X Φ
nþ1
Φ
And introducing the relative stress tensor in ξD
nþ1 = Snþ1 - X nþ1 yields
Snþ1 - X Φ
ξΦ Φ
nþ1 = Snþ1 - X nþ1 Snþ1 - Δγ ð1 - ΦÞ2G
tr nþ1
- XΦ Φ
n - dX nþ1 ð5:22Þ
Snþ1 - X Φ
nþ1
Snþ1 - XΦ
Snþ1 - X Φ
nþ1 þ Δγ ½ð1 - ΦÞ2G þ anþ1
nþ1
= Bn ð5:23Þ
Snþ1 - XΦ
nþ1
c0
where Bn = Strnþ1 - X Φ n þ bnþ1 ΔγX n and bnþ1 = c1 anþ1 .
2
The normal to the yield surface can be expressed in terms of the initial values of
the stress; the state variables and the strain increment at each step can be defined as
follows:
Snþ1 - X Φ Bn
n_ nþ1 nþ1
= ð5:24Þ
Snþ1 - X Φ
nþ1
k Bnk
Snþ1 - X Φ
nþ1 þ Δγ ½ð1 - ΦÞ2G þ anþ1
n
Sn - X Φ
2
þ ð1 - ΦÞ2GΔenþ1 þ bnþ1 ΔγX Φ
2
n n ð5:25Þ
=
þ 2 Sn - X Φ Φ 1=2
n : ð1 - ΦÞ2GΔenþ1 þ bnþ1 ΔγX n g
Using Eq. (5.7) for the rate-independent case or Eq. (5.8) for the rate-dependent
case results in the following nonlinear scalar equation for the consistency parameter
γ which can be solved by a local Newton method:
gðΔγ Þ
n
2 2
Sn - X Φ
n þ ð1 - ΦÞ2GΔenþ1 þ bnþ1 ΔγX Φn
Φ
Φ 1=2
= þ 2 Sn - Xrn : ð1 - ΦÞ2GΔenþ1 þ bnþ1 ΔγX n g
ffiffiffi rffiffiffi !
2 2 Δγη
- ð1 - ΦÞ K αn þ Δγ - Δγ ½ð1 - ΦÞ2G þ anþ1 - Θ
3 3 Δt
ð5:26Þ
After solving Eq. (5.26) for Δγ, the following updating schemes are needed:
pffiffiffiffiffi
αnþ1 = αn þ 2=3Δγ ð5:27Þ
Bn
εvp
nþ1 = εn þ Δγ
vp
ð5:28Þ
kBn k
" #
Snþ1 - XΦ c0
XΦ Φ
nþ1 = X n þ anþ1 Δγ
nþ1
Φ - 2 XΦ ð5:29Þ
Snþ1 - Xnþ1 c1 n
Bn
ξΦ
nþ1 = ð1 - ΦÞK ðαnþ1 Þ ð5:30Þ
kBn k
246 5 Unified Mechanics of Thermomechanical Analysis
Snþ1 = ξΦ Φ
nþ1 þ X nþ1 ð5:31Þ
Bn θ
σ nþ1 = ҡð1 - ΦÞtrðεnþ1 Þ I þ 2Gð1 - ΦÞ enþ1 - εvp - γ - e ð5:32Þ
n nþ1
kB n k nþ1
Differentiating Eq. (5.32) with respect to the total strain but not entropy at the end of
the step yields
O ∂Δγ O ∂b
nnþ1
dσ nþ1 = ð1 - ΦÞ C - 2G b
nnþ1 - 2GΔγ : dεnþ1 ð5:33Þ
∂εnþ1 ∂εnþ1
∂Δγ b
n
= nþ1 ð5:34Þ
∂εnþ1 K3
K3 = K1 þ K2
K0 anþ1
K1 = 1 þ þ
3G 2Gð1 - ΦÞ
a0nþ1 Δγ b
n bnþ1 1 ∂Θ
K2 = þ nþ1 ½b ð1 - βÞΔγ - 1 : XΦ
n þ
2Gð1 - ΦÞ 2Gð1 - ΦÞ nþ1 2Gð1 - ΦÞ ∂Δγ
∂b
nnþ1
In addition, ∂εnþ1
can be obtained from Eq. (5.24) as
∂b
nnþ1 ∂bnnþ1 ∂Bn 1 O ∂Bn
= → I -b
nnþ1 b
nnþ1 : ð5:35aÞ
∂εnþ1 ∂Bn ∂ε nþ1 k Bnþ1 k ∂ε nþ1
5.3 Return Mapping Algorithms 247
and
∂Bn O O ∂Δγ
b - 1I
= 2μð1 - ΦÞ Π I þ b0nþ1 Δγ þ bnþ1 Xn
∂εnþ1 3 ∂εnþ1
where Π b is fourth-order unit tensor mapped into the unit matrix and I second-order
identity tensors mapped into unit column vector and K′ denotes the derivative with
respect to the argument in K(α).
Defining K 4 = b0nþ1 Δγ þ bnþ1 and substituting ∂Bn in Eq. (5.35a) yields
∂εnþ1
∂b
nnþ1 ∂bnnþ1 ∂Bn 2Gð1 - ΦÞ b O 1 O
= Π-b
nnþ1 b
nnþ1 - I I
∂εnþ1 ∂Bn ∂εnþ1 kBn k 3
1
þ
kBn k
O O
K4
I-b nnþ1 b
nnþ1 : b
nnþ1 XΦ
n ð5:35bÞ
K3
Using Eqs. (5.34) and (5.35b) into Eq. (5.33) results in the following algorithmic
version of the material Jacobian coupling the effects of damage and rate dependency:
O O
b - 1I
nþ1 = ð1 - ΦÞҡI
CEVPD I þ 2Gð1 - ΦÞδnþ1 Π
3
I
O 2Gð1 - ΦÞ O
- 2Gð1 - ΦÞθnþ1bnnþ1 b
nnþ1 - Δγ Πb -b
nnþ1 b
nnþ1
kBn k
O
4
K
: b
nnþ1 XΦn ð5:35cÞ
K3
Δγ2Gð1 - ΦÞ 1 Δγ2Gð1 - ΦÞ
δnþ1 = 1 - and θnþ1 = -
kB n k K3 kBn k
αnþ1 = αn þ 3Δγ
2
ξΦ
nþ1 ð1 - ΦÞK ðα nþ1 Þ Bn
kBn k
Snþ1 = ξΦ Φ
nþ1 þ X nþ1
σ nþ1 = ҡð1 - ΦÞtrðεnþ1 ÞbI þ 2Gð1 - ΦÞ enþ1 - εvp θ
n - γ nþ1 kBn k - enþ1
Bn
Table 5.2 Local Newton iteration for the consistency parameter-classical theory
Let Δγ (0) ← 0
αn + 1(0) ← αn
Start Iterations
DO_UNTIL |g(Δγ)| < tol
k←k+1
Compute nΔγ (k + 1)
2 2
gðΔγ Þ = Sn - X Φn þ ð1 - ΦÞ2GΔenþ1 þ bnþ1 ΔγX Φ
n þ 2 Sn - X Φ
n :
1=2
ð1 - ΦÞ2GΔenþ1 þ bnþ1 ΔγXΦ
n
qffiffi qffiffi
- ð1 - ΦÞ 23K αn þ 23Δγ - Δγ ½ð1 - ΦÞ2G þ anþ1 - Θ Δγη Δt
∂gðΔγ ðkÞ Þ
dg Δγ ðkÞ = ∂Δγ ðkÞ
ðkþ1Þ ðkÞ gðΔγ ðkÞ Þ
Δγ ← Δγ - dgðΔγðkÞ Þ
END DO_UNITL
of the complete system. No net decrease in entropy of all the related systems occurs. This is
yet another way of stating the second law of thermodynamics. (DeHoff, 1993)
The concept of entropy has far-reaching implications that tie the order/disorder of our
universe to probability and statistics. Imagine a new deck of cards in order by suits, with
each unit in numerical order. As the deck is shuffled, no one would expect the original order
to return. There is a probability that the randomized order of the shuffled deck would return
to the original format, but it is exceedingly small. An ice cube melts, and the molecules in the
liquid form have less order than in the frozen form. An infinitesimally small probability
exists that all of the slower-moving molecules will aggregate in one space so that the ice cube
will reform from the pool of water. The entropy, or disorder, of the universe increases as hot
bodies cool and cold bodies warm. Eventually, the entire universe will be at the same
temperature so the energy will be no longer usable. (DeHoff, 1993)
where ρ is the density (mass per unit volume). If no mass is created or destroyed
inside V, this quantity must be equal to the rate of the matter flow into the volume
V through its surface Ω:
Z Z
∂ρ
V dV = Ω ρ ~v dΩ ð5:37Þ
∂t
where ~v is the velocity and dΩ is a vector with magnitude |dΩ| normal to the surface
and defined as positive from the inside toward the outside. The quantities ρ and v are
all functions of time and space coordinates. Applying Gauss’s theorem to the surface
integral in equation (5.37), we obtain
∂ρ
= - ρ div ~ν ð5:38Þ
∂t
Equation (5.38) is valid for any arbitrary volume V, which expresses the fact that
the total mass is conserved, i.e., the total mass in any volume element of the system
can only change if matter flows into (or out of) the volume element. This equation
252 5 Unified Mechanics of Thermomechanical Analysis
has the form of a so-called balanced equation: the local change of the density is equal
to the negative divergence of the flow of mass. The continuity equation in the vector
form of equation (5.38) is independent of any choice of coordinate system.
The conservation of mass equation can also be written in an alternative form by
introducing the substantial time derivative (Mazur & De Groot, 1962):
d ∂
= þ ~ν grad ð5:39Þ
dt ∂t
The substantial time derivative is the rate of change of a quantity (i.e., mass) as
∂
measured by an observer moving along with the flow of matter. The first term ∂t
accounts for the time rate of change according to a stationary observer, and the
second term ð~ν gradÞ accounts for the movement of the observer.
With the help of Eq. (5.39), we can rewrite Eq. (5.38) as follows:
dρ
= - ρ div ~v ð5:40Þ
dt
d~v
ρ = div ~v ð5:41Þ
dt
Finally, the following substantial time derivative relation is valid for an arbitrary
local property a that may be a scalar or a component of a vector or tensor:
da ∂aρ
ρ = þ div aρ~v ð5:42Þ
dt ∂t
which is a consequence of Eqs. (5.38) and (5.39). We can verify Eq. (5.42) directly.
According to Eq. (5.39), the left-hand side of Eq. (5.42) is
da ∂a
ρ =ρ þ ρ~v grad a ð5:43Þ
dt ∂t
∂aρ ∂ρ ∂a
þ div aρev = a þρ þ a div ρev þ ρev grad a
∂t ∂t ∂t
∂a ð5:44Þ
= að - div ρevÞ þ ρ þ a div ρev þ ρev grad a
∂t
∂a
=ρ þ ρev grad a
∂t
The momentum principle for a collection of particles states that the time rate of the
change in the total momentum for a given set of particles equals the vector sum of all
the external forces acting on the particles of the set provided Newton’s Third Law of
action and reaction governs the initial forces (Malvern, 1969). Consider a given mass
of the medium, instantaneously occupying a volume V bounded by surface A and
acted upon by external surface tractions σ (n) and body force b. Then the momentum
principle can be expressed as follows:
Z Z Z
d
A σ ðnÞ dAþ V ρ b dV = V ρ ~v dV ð5:45Þ
dt
or in rectangular coordinates:
Z Z Z
d
A σ i dAþ V ρbi dV = V ρ v~i dV ð5:46Þ
dt
Equation (5.47) must be true for any arbitrary volume V. At each point, we have
d~vi ∂σ ji
ρ = þ ρbi ð5:48Þ
dt ∂xj
σ ij = σ ji ði, j = 1, 2, 3Þ ð5:49Þ
d~v
ρ = div σ þ ρb ð5:50Þ
dt
From a microscopic point of view, the stress tensor σ results from the short-range
interactions, [strain] between the particles of the system, whereas b contains the
external forces as well as a possible contribution from long-range interactions in the
system.
254 5 Unified Mechanics of Thermomechanical Analysis
Using Eq. (5.46), the equation of motion Eq. (5.50) can also be written as
∂ρ~v
= - div ðρ~v~v - σÞ þ ρb ð5:51Þ
∂t
N
where ~v~v = ~v ~v is an ordered (dyadic) product. This equation also has the form of a
balance equation for the momentum density ρ~v . One can interpret the quantity
ðρ~v~v - σÞ as a momentum flow with a convective part ρ~v~v , and the quantity ρb as
a source of momentum, but there is no entropy generation part.
It is also possible to derive from Eq. (5.48) a balance equation for the kinetic
energy of the center of gravity motion by multiplying both members with the
components of ~v and then summing over i:
d 12 ~v2 X3 ∂ X3 ∂
ρ = i,j = 1 ∂xj
σ ji ~
v i - σ v~ þ ρbi v~i
i,j = 1 ji ∂xj i
ði = 1, 2, 3Þ ð5:52Þ
dt
or in tensorial notation:
d 12 ~v2
ρ = divðσ ~vÞ - σ : L þ ρb ~v ð5:53Þ
dt
where L = grad~v is the spatial gradient of the velocity. L can be written as the sum of
a symmetric tensor D, called the rate of deformation tensor and a skew-symmetric
tensor W called the spin tensor (or the vorticity tensor) as follows:
L=D þ W ð5:54Þ
where D = 12 L þ LT , and W = 12 L - LT .
Since W is skew-symmetric, while σ is symmetric, it follows that
We can also establish the relationship between the strain rate dε/dt and the rate of
the deformation tensor D:
dε
= FT D F ð5:56Þ
dt
dε
≈D ð5:57Þ
dt
5.5 Conservation Laws 255
∂ 12 ρ~v2 1
= - div ρ~v2 ~v - σ ~v - σ : D þ ρb ~v ð5:58Þ
∂t 2
For the conservative body forces that can be derived from a potential Ψ indepen-
dent of time, we can write (Mazur & De Groot, 1962)
∂Ψ
b = - gradΨ, =0 ð5:59Þ
∂t
We can now establish an equation for the rate of change of the potential energy
density ρΨ. It follows from Eqs. (5.38) and (5.59) that
∂ρΨ ∂ρ ∂Ψ
=Ψ þρ = Ψð - div ρevÞ
∂t ∂t ∂t ð5:60Þ
= - div ρΨev þ ρev gradΨ = - div ρΨev - ρb ev
Adding Eqs. (5.59) and (5.60) for the rate of change of the kinetic energy 12 ρ~v2
and the potential energy ρΨ:
n o
∂ρ 12 ~v2 þ Ψ 1
= - div ρ ~v2 þ Ψ ~v - σ ~v - σ : D ð5:61Þ
∂t 2
This equation shows that the sum of kinetic and potential energy is not conserved,
since an entropy source term appears on the right-hand side. However, this entropy
source term is one of many. There are far more entropy source terms in
thermomechanical loading. It is covered in more detail in the following chapter.
Moreover, assuming the potential to be time-independent significantly simplified the
formulation, albeit unrealistically.
The first law of thermodynamics relates the work done on the system and the heat
energy flowing into the system to the change in total energy of the system. Suppose
that the only energy transferred to the system is by mechanical work done on the
system by surface tractions and body forces, by heat exchange through the boundary
surface, and the heat generated within the system by external agencies (e.g., induc-
tive heating). According to the principle of conservation of energy [the first law of
thermodynamics], the total energy within an arbitrary volume V in the system can
only change if energy flows into (or out of) the volume considered through its
boundary A, which can be expressed as
256 5 Unified Mechanics of Thermomechanical Analysis
Z V Z V Z A Z V
d ∂ρe
ρ e dV = dV = - Je dA þ ρ r dV ð5:62Þ
dt ∂t
where e is the energy per unit mass, Je is the energy flux per unit surface and unit
time, and r is the distributed internal heat source of strength per unit mass. We will
refer to e as the total specific energy because it includes all forms of energy in the
system. Similarly, we will call Je the total energy flux. With the help of Gauss’s
theorem, we can obtain the differential [local form] of the law of conservation of
energy as follows:
∂ρe
= - div Je þ ρr ð5:63Þ
∂t
To relate this equation to the previously obtained Eq. (5.61) for the kinetic energy
and potential energy, we must specify what are the various contributions to the total
specific energy e and the total energy flux Je. The total specific energy e includes the
specific kinetic energy 12 v2 , the specific potential energy Ψ, and the specific internal
energy u (Mazur & De Groot, 1962):
1
e = ~v2 þ Ψ þ u ð5:64Þ
2
From a macroscopic point of view, this relation can be considered as the defini-
tion of internal energy, u. From a microscopic point of view, u represents the energy
of thermal agitation [atomic vibrations] as well as the energy due to the short-range
atomic interactions, such as the strain energy.
Similarly, the total energy flux includes a convective term ρe~v , an energy flux
ðσ ~vÞ due to the mechanical work performed on the system, and finally a heat flux
Jq (Mazur & De Groot, 1962):
Je = ρe~v - σ ~v þ Jq ð5:65Þ
This equation may be also considered as defining the heat flux Jq. Then the heat
flow rate per unit mass is given by
dq
ρ = - div Jq ð5:66Þ
dt
where q is the heat flowing into the system per unit mass. If we subtract Eq. (5.61)
from Eq. (5.63) and use Eqs. (5.64) and (5.65), we get the balance equation for the
internal energy u:
∂ρu
= - div ρu~v þ Jq þ σ : D þ ρr ð5:67Þ
∂t
5.6 Entropy Law: Second Law of Thermodynamics 257
It is apparent from equation (5.67) that the internal energy u is not conserved. A
dissipation source term appears which is equal but of opposite sign to the source term
of the balance equation (5.61) for kinetic and potential energy.
With the help of Eqs. (5.42), (5.67) may be written in an alternative form as
follows:
du
ρ = - div Jq þ σ : D þ ρr ð5:68Þ
dt
The total stress tensor σ can be split into a scalar hydrostatic pressure part p and a
deviatoric stress tensor S:
σ = S - pI ð5:69Þ
du
ρ = - div Jq - p div ~v þ S : D þ ρr ð5:70Þ
dt
X3 ∂ X3 ∂
I : D = I : grad ~v = δ
i:j = 1 ij
~vi = v~ = div ~v
i = 1 ∂xi i
ð5:71Þ
∂xj
Utilizing Eq. (5.41), the time rate of change of internal energy can be written in
the following form:
du 1 1
= σ : D þ r - div Jq ð5:72Þ
dt ρ ρ
Historically, thermodynamics in the traditional sense was concerned with the study
of reversible transformations [which are imaginary]. For an irreversible process in
which the thermodynamic state of a matter changes from some initial state to a
current state, it can be assumed that such a process can occur along an imaginary
reversible isothermal path. The processes defined in this way will be thermodynam-
ically admissible if, at any instant of evolution, the Clausius-Duhem inequality is
satisfied. According to the principles of thermodynamics, two more new variables,
temperature T and entropy S, are introduced for any macroscopic system. The
entropy [disorder/energy unavailable for work] of the universe, taken as a system
258 5 Unified Mechanics of Thermomechanical Analysis
plus whatever surroundings are involved in producing the change within the system,
can only increase. Everything that happens in the real world is always an irreversible
process, which results in the production of positive entropy and thus a permanent/
irreversible change in the universe (DeHoff, 1993).
The variation of the entropy dS may be written as the sum of two and only two
terms for a closed system (Mazur & De Groot, 1962):
where dSe is the entropy generated by the transfer of heat from external sources
across the boundary of the system and dSi is the entropy produced inside the system
and by the matter. The second law of thermodynamics states that dSi must be zero for
any reversible process and positive for the irreversible transformation of the system
from one state to any other state (Mazur & De Groot, 1962):
dSi ≥ 0 ð5:74Þ
The entropy supplied externally, dSe, on the other hand, may be positive, zero, or
negative, depending on the interaction of the system with its surroundings.
For an irreversible process in which the thermodynamic state of a matter changes
from some initial state to a different state, it is assumed that such a process can occur
along an imaginary reversible isothermal path which consists of a two-step sequence
(Krajcinovic, 1996). This is the so-called local equilibrium assumption, which
postulates that the thermodynamic state of a material at a given point and instant is
completely defined by the knowledge of the values of a certain number of state
variables at that instant. The method of local state implies that the laws which are
valid for the macroscopic system remain valid for infinitesimally small parts of it,
which agrees with the point of view currently adopted in the macroscopic description
of a continuous system. However, we need to clarify what we mean by “infinites-
imally small,” which is relative. We mean that small enough for laws of continuum
mechanics to be applicable but not smaller. This method also implies, on a micro-
scopic model, that the local macroscopic measurements performed on the system are
measurements of the properties of small parts of the system, which still contain many
of the constituting particles. In general, it is assumed that representative volume
element (RVE) is the smallest unit of a continuum. This hypothesis of “local
equilibrium” can, from a macroscopic point of view, only be justified by the validity
of the conclusions derived from it. Ultrarapid phenomena for which the time scale of
the evolutions is in the same order as the atomic relaxation time for a return to
thermodynamic equilibrium are excluded from this theory’s field of application
(Lemaitre & Chaboche, 1990). As we discussed in Chap. 3, all physical processes
can be described with precision by utilizing the proper number of thermodynamic
state variables. The processes defined in this way will be thermodynamically admis-
sible if, at any instant of evolution, the Clausius-Duhem inequality is satisfied.
5.6 Entropy Law: Second Law of Thermodynamics 259
where s is the specific entropy per unit mass, JS, tot is the total entropy flux which is a
vector that coincides with the direction of entropy flow and has a magnitude equal to
the entropy crossing unit area perpendicular to the direction of flow per unit time,
and γ is the entropy generation rate per unit volume and per unit time.
Utilizing Eqs. (5.75)–(5.77), and Gauss’s theorem, Eq. (5.73) may be written in
the following form (Mazur & De Groot, 1962):
Z V
∂ρs
þ div JS,tot - γ dV = 0 ð5:78Þ
∂t
where the divergence of JS, tot simply represents the net entropy leaving unit volume
per unit time. Equation (5.78) must be true for any arbitrary volume V; hence, we can
write
∂ρs
= - div JS,tot þ γ ð5:79Þ
∂t
γ≥0 ð5:80Þ
These last two equations are the local forms of Eqs. (5.73) and (5.74), i.e., the
local mathematical expressions for the second law of thermodynamics. Equation
(5.79) is formally a balance equation for the entropy density s with an entropy
generation rate γ which satisfies the important inequality (5.80). With the help of
equation (5.42), equation (5.79) can be rewritten in a slightly different form as
follows:
ds
ρ = - div JS þ γ ð5:81Þ
dt
260 5 Unified Mechanics of Thermomechanical Analysis
where the entropy flux JS is the difference between the total entropy flux JS, tot and a
convective term ρs~v:
φ = u - Ts ð5:83aÞ
Differentiating this and with the help of the law of conservation of energy, we
have the following relations:
where q is the total heat flowing into the system per unit mass, including the
conduction through the surface and the distributed internal heat source, w is the
total work done on the system per unit mass by external loads and body forces, wd is
the dissipated energy associated with the total work, which may be dissipated in the
form of heat, and we is the elastic energy associated with the total work. For the
quantitative treatment of entropy for irreversible processes, let us introduce the
definition of entropy for irreversible processes:
δq þ δwd
ds = ð5:84Þ
T
With the help of Eq. (5.84), we can write the following relation:
This is the Gibbs relation which combines the first and second laws of thermo-
dynamics. From the definition of entropy, we also have
The Helmholtz free energy, φ, is the isothermal recoverable elastic energy available
for work. The specific elastic energy we, namely, the work stored in the system per
unit mass during a process, is not a function of the process path [path independent]. It
depends only on the end state of a process for a given temperature. The elastic energy
is frequently also referred to as the available energy of the process. The elastic
energy is the maximum amount of work that could be produced by a system between
any given two states if the entropy generation is zero, which is not possible in any
process.
To find the explicit form of the entropy balance equation, Eq. (5.81), we insert
Eq. (5.72) for du
dt into Eq. (5.85) with the time derivatives given by Eq. (5.39):
ds div Jq 1 ρ dwe ρr
ρ =- þ σ : D- þ ð5:87aÞ
dt T T T dt T
Noting that
div Jq Jq 1
= div þ 2 Jq gradT ð5:87bÞ
T T T
it is easy to cast equation (5.87) into the form of a balance equation, Eq. (5.81):
ds Jq 1 1 ρ dwe ρr
ρ = - div - 2 Jq gradT þ σ : D - þ ð5:88Þ
dt T T T T dt T
From comparison with equation (5.81), it follows that the expressions for the
entropy flux and the entropy production rate are given by
Jq
JS = ð5:89Þ
T
1 ρ dwe 1 ρr
γ= σ : D- - 2 Jq gradT þ ð5:90Þ
T T dt T T
Jq = - k gradT ð5:92Þ
where k is the thermal conductivity with units of Btu/fthr°F and Jq is the heat flux.
This law of heat conduction was stated first by Fourier who based it on experi-
mental observation. Equation (5.92), also known as Fourier’s law, expresses a linear
relation between the heat flux vector Jq and its dual variable gradT. Since solid,
opaque bodies are of primary interest in this chapter, heat is transferred from point to
point within this body solely by conduction. The field equation of the boundary
value problem will, therefore, always be some form of the Fourier heat conduction
equation. Of course, heat may be transferred to the surface of the body by other
modes of heat transfer. Then the expression for the internal entropy generation rate
for thermomechanical problems can be simplified as
1 ρ dwe k ρr
γ= σ : D- þ 2 jgradT j2 þ ð5:93aÞ
T T dt T T
The specific entropy production rate for small deformation and small strain
thermomechanical problems in metals may be simplified as
dsi γ σ : ε_ p k r
= = þ 2 jgradT j2 þ ð5:93cÞ
dt ρ Tρ T ρ T
5.7 Fully Coupled Thermomechanical Equations 263
ε = ε e þ εp ð5:94Þ
The relations between the energy, stress tensor, and strain tensor can be obtained
using the formalism of thermodynamics. Here we choose the specific Helmholtz free
energy, φ, which depends on observable variables and internal state variables:
φ = ðε, T, εe , εp , V k Þ ð5:95Þ
For small strain formulation, the strain appears only in the form of their additive
decomposition, so that
∂φ ∂φ ∂φ
= =- p ð5:97Þ
∂εe ∂ε ∂ε
and the following expressions define the relation between Helmholtz free energy and
thermodynamic state variables:
∂φ
σ=ρ ð5:98Þ
∂εe
∂φ
s= - ð5:99Þ
∂T
∂φ
Ak = ρ ð5:100Þ
∂V k
where Ak is a thermodynamic force associated with the internal variables Vk. The
vector formed by the variables is the gradient of the function φ in the space of the
variables T, εe, and Vk. This vector is normal to the surface φ = constant.
Let us return to the equation of the conservation of energy for small strains
(utilizing Eqs. (5.57) and (5.68)):
And utilizing φ_ and s_ by their expression as a function of the state variables with
the help of equations (5.98–5.100):
∂φ e ∂φ _ ∂φ _ 1
φ_ = : ε_ þ Tþ V k = σ : ε_ e - sT_ þ Ak V_ k ð5:103Þ
∂εe ∂T ∂V k ρ
2 2 2
∂ φ ∂ φ_ ∂ φ _ 1 ∂σ e ∂s _ 1 ∂Ak _
s_ = - : ε_ e - T- Vk = - : ε_ þ T- V
∂εe ∂T ∂T 2 ∂V k ∂T ρ ∂T ∂T ρ ∂T k
ð5:104Þ
We obtain
∂s _ ∂σ e ∂Ak _
- div Jq = ρT T - σ : ðε_ - ε_ e Þ þ Ak V k - ρr - T : ε_ þ V k ð5:105Þ
∂T ∂T ∂T
∂s
C=T ð5:106Þ
∂T
This is the fully coupled thermomechanical equation, which can simulate the
evolution of temperature due to mechanical work with properly imposed boundary
conditions. Ak V_ k represents the non-recoverable energy in the matter corresponding
to other dissipation mechanisms. If we simplify the problem to thermomechanical
loading at small strain and small strain rates on pure metals, we may be able to ignore
other dissipation terms, and then we can write
Ak V_ k ≈ 0 ð5:109Þ
∂σ e
k∇2 T = ρC T_ - σ : ε_ p - ρr - T : ε_ ð5:110Þ
∂T
Equation (5.110) also allows us to calculate heat flux Jq generated due to elastic
and/or inelastic work in a solid body. This will be covered in more detail later in the
chapter on fatigue.
For the isotropic linear thermoelastic materials, the stress-strain relationship in
Newtonian mechanics is given by
vE E
λ= ; G= ð5:112Þ
ð1 þ vÞð1 - 2vÞ 2ð1 þ vÞ
If the internal heat generation is neglected, equation (5.108) defines the response
of isotropic linear thermoelastic materials:
266 5 Unified Mechanics of Thermomechanical Analysis
The last term represents the interconvertibility of thermal and mechanical energy.
With the help of Eqs. (5.87) and (5.108), the total specific entropy generation rate
for small strains is given by
ds c ∂T 1 ∂σ e ∂Ak
= - : ε_ þ : Vk ð5:114Þ
dt T ∂t ρ ∂T ∂T
dsi γ σ : ε_ p r k
= = þ - 2 jgrad T j2 ð5:115Þ
dt ρ Tρ T T ρ
The equations governing the temperature, stress, deformation, and entropy pro-
duction rate in a continuum have been derived in the previous chapters. However,
these quantities are all interrelated and must be determined simultaneously in
incremental form.
In most quasi-static problems, the effect of stresses and deformations on the
temperature distribution is quite small and may be neglected. This procedure allows
the determination of the temperature distribution resulting from prescribed thermal
boundary conditions to become the first step of a thermal stress analysis. The second
step is then the determination of the stresses, deformations, and thermodynamic
stress index field due to this temperature distribution. Of course, performing
thermomechanical analysis in one step is possible, but it is far more computationally
intensive.
Entropy change caused by the heat transfer between the system and its surround-
ings does not influence the degradation of the material if the temperature field in the
entire body is uniform because of this exchange. If this heat exchange leads to a
nonuniform temperature field, then heat coming from outside can lead to irreversible
entropy generation in the material as in bending, thermomigration, and other mech-
anisms. Only the internal entropy generation, namely, the entropy created in the
system, should be used as a basis for the systematic description of the irreversible
processes, which can be given by
5.8 Numerical Validation of the Thermomechanical Constitutive Model 267
Z Z t Z t
t
σ : ε_ p k r
Δs = Δsi = dt - 2
jgrad T j dt þ dt ð5:116Þ
t0 Tρ t0 T ρ
2
t0 T
Equation (5.116) shows that the entropy generation is a function not only of the
loading or straining process but also of the temperature. However, a uniform increase
in temperature in a stress-free field does not cause any change in TSI. While this
fundamental equation accounts for strain rate, it does not account for all mechanisms
under thermomechanical loading. If the strain rate is very high, there are far more
entropy generation mechanisms such as phase change, elastic dissipation, melting,
and other mechanisms that can be significant entropy generation sources that this
equation does not account for.
Thermodynamic state index earlier was given by the following equation:
h ms ðΔsÞ
i
Φ = 1 - e- R ð5:117Þ
Two problems are selected to validate the model described in this chapter. First, the
model results are compared with testing performed on specimens of thin layer solder
joints of Pb37/Sn63 under monotonic and fatigue shear testing at different strain
rates and temperatures.
Table 5.3 lists the strain rate and temperature ranges for testing and simulations.
Monotonic and cycling shear tests were performed on a thin layer of the solder joint
(Pb37/Sn63) in pure shearing mode. Figure 5.3 shows the specimen attached to
copper plates. Material parameters used for the numerical simulation are shown in
Table 5.4. Figure 5.4 shows a comparison between numerical simulation results and
test data under monotonic shear testing for different temperatures at a strain rate of
1.67 × 10-3/s. Figure 5.5 shows monotonic shear testing under strain rate of
1.67 × 10-3/s, at 22 °C and inelastic strain range (ISR) = 0.005.
Figures 5.6, 5.7, 5.8, and 5.9 show a comparison between the cyclic shear
simulation results and test data at 22 °C and strain rate of 1.67 × 10-3/s. Figure 5.10
shows simulation results for isothermal fatigue testing at strain rate 1.67 × 10-4/s at
22 °C with an inelastic strain range of 0.022.
268 5 Unified Mechanics of Thermomechanical Analysis
Fig. 5.4 Monotonic shear testing under strain rate 1.67 × 10-3/s at different temperature
Figure 5.11 shows the evolution of the thermodynamic state index under cycling
fatigue loading with an inelastic strain range of 0.022.
The computational simulations are in good agreement with the experimental
results. The differences are due to the imperfect microstructure (such as voids) of
the solder joint, while the numerical model assumes a perfect continuum with no
defects. Moreover, entropy generation due to internal heat generation was ignored
for the sake of simplicity in this example.
More examples of unified mechanics theory for thermomechanical problems are
provided in detail in the references listed at the end of the chapter.
Classical continuum mechanics formulation does not include a term for the size
effect. We should clarify here what is meant by “size.” It is not the size of the system,
such as 150 stories high building versus a very small bunker. “Size” refers to the
dimensions of the microstructure of the material. When the size of the microstructure
becomes a dominant factor in the response of the material, in addition to strains,
gradients of strains are also included in the continuum mechanics formulation. The
size effect is known to be a big factor, especially in the plastic zone of the material
response. The incremental theory of plasticity is called strain gradient plasticity
theory when strain gradients are also included in the yield surface formulation in
addition to strains. Strain gradient plasticity is especially needed when traditional
5.9 Thermomechanical Analysis of Cosserat Continuum: Length Scale Effects 273
where α is a geometrical factor that depends on the type and arrangement of the
interacting dislocations, G is the shear modulus, b is Burger’s vector, ρSSD is the
density of statistically stored dislocations, and ρGND is the density of geometrically
necessary dislocations.
There are different methods to account for the size effect in continuum mechan-
ics. One of these methods is the use of the Cosserat continuum. In this section, we
will discuss (Cosserat continuum) couple stress-based strain gradient theory without
274 5 Unified Mechanics of Thermomechanical Analysis
The motivation behind Cosserat’s couple stress theory also referred to as the
Cosserat continuum, Fig. 5.12, stems from the following arguments. In classical
continuum mechanics, it is assumed that a material point occupies no space (has zero
volume) and then the size of the material microstructure is not part of the formula-
tion. To include the effects of the finite size of a material’s microstructure while
retaining a mathematical continuum, classical continuum mechanics have been
extended or generalized. In the Cosserat continuum, each material point is enriched
with higher-order kinematic degrees of freedom. In particular, the specification of
the position requires also the definition of a rotation. In the more general Cosserat
theory, this new kinematic quantity, the micro rotation, is independent of the
classical theory rotation vector θk = 12 eijk uj,k where eijk is the alternating pseudo-
tensor and a coma represents a derivative with respect to rectangular Cartesian
coordinates. This type of theory has been termed as “indeterminate couple stress
theory” by Eringen (1968) and “generalized couple stress theory” by Fleck and
Hutchinson (1997). The following discussion elaborates further upon this theory.
There are different interpretations of the Cosserat continuum. In Cosserat and
Cosserat (1909) couple stress theory, a differential material element is subjected to
not only normal and shear stresses but also couple stress components as shown in
Fig. 5.12. For linear elastic behavior, the traditional stress components are functions
of the strains and the couple stresses are functions of the strain gradients. Two
distinct theories are identified in the work of Cosserat and Cosserat (1909). First,
5.9 Thermomechanical Analysis of Cosserat Continuum: Length Scale Effects 275
there is a reduced couple stress theory with the kinematic degrees of freedom being
the displacement ui and an associated material rotation θi tied to the displacements
by the kinematic constraint in Eq. (5.119) and with the definitions of strain and
curvatures expressed in Eqs. (5.120) and (5.121):
1
θi = e u ð5:119Þ
2 ijk k,j
1
εij = ui,j þ uj,i ð5:120Þ
2
xij = θi,j ð5:121Þ
Second, there is a general couple stress theory in terms of a micro rotation, ωi,
which is regarded as an independent kinematic degree of freedom. The traditional
continuum mechanics rotation and Cosserat continuum micro rotation are related by
a relative rotation tensor αij = eijkωk - eijkθk. For the choice of ωk = θk, the general
couple stress theory reduces to the reduced couple stress theory. Equilibrium of the
differential element shown in Fig. 5.12, after neglecting body forces and body
couples [in Newtonian mechanics], yields
where σ ij and τ ij are the symmetric and antisymmetric components of the Cauchy
stress tensor, respectively, and mij is the couple’s stress tensor. Surface stress
tractions and surface couple stress tractions are given by
276 5 Unified Mechanics of Thermomechanical Analysis
ðnÞ
σ i = σ ij þ τij nj ð5:124Þ
qi = mij nj ð5:125Þ
ð nÞ
where σ i is the traction vector, qi is the couple traction vector, and ni is a surface
outward normal vector. For a linear elastic solid, the strain energy density is given by
Eq. (5.126) where l is a length scale associated with elastic behavior and may be
regarded to be of the order of inter-atomic distances (Koiter, 1964).
E v
W= ε ε þ εij εij þ l2 xij xij ð5:126Þ
2ð1 þ vÞ 1 - 2v ii jj
∂W
σ ij = C ijkl εkl ð5:127Þ
∂εij
∂W
mij = Dijkl xkl ð5:128Þ
∂xij
where Cijkl and Dijkl are elastic constitutive tensors relating stress and couple stress
components to elastic strains and elastic curvatures, respectively. The principle of
virtual work for the reduced Cosserat theory can be given by [in Newtonian
mechanics]
ZZZ ZZZ ZZ ZZ
ðnÞ
σ ij δεij dV þ mij δxij dV = σ i δui dA þ qi δθi dA ð5:129Þ
V V A A
where σ ij is the symmetric Cauchy stress tensor and τijk is the higher-order stress
tensor for a generally solid body (Fig. 5.13). Surface tractions and surface couple
tractions (moments) are given in Eqs. (5.124) and (5.125), respectively. The higher-
order stresses correspond to force couples per unit area:
ðnÞ
σ i = σ ik - τijk:j ni þ ni nj τijk Dp np - Dj ni τijk ð5:131aÞ
r k = ni nj τijk ð5:131bÞ
where the operators Dj and D are defined as Dj = (δjk - njnk)∂k and D = nk∂k.
Generalized strain-displacement and strain gradient-displacement relations can be
given by
1
εij = ui:j þ uj,i ð5:132aÞ
2
ηijk = uk,ij ð5:132bÞ
1
W= λε ε þ μεij εij þ a1 ηijj ηikk þ a2 ηiik ηkjj þ a3 ηiik ηjjk þ a4 ηijk ηijk þ a5 ηijk ηkji
2 ii jj
ð5:133Þ
where λ and μ are Lamè constants and ai are material constants with units of force. A
constitutive relationship can be directly derived out of Eq. (5.133) such that
∂W
σ ij = C ijkl εkl ð5:134aÞ
∂εij
∂W
τijk = Dijklmn ηlmn ð5:134bÞ
∂ηijk
278 5 Unified Mechanics of Thermomechanical Analysis
where Cijkl and Dijklmn are elastic constitutive tensors relating stress and higher-order
stress components to elastic strains and elastic strain gradients, respectively. The
principle of virtual work can be written by equating an external work increment and
an internal increment of strain energy and can be given by [in Newtonian mechanics]
ZZZ ZZZ ZZ ZZ
ðnÞ
σ ij δεij dV þ τijk δηijk dV = σ i δui dA þ r i ðDδui ÞdA ð5:135Þ
V V A A
In this section, we derive the equilibrium equations for two different classes of the
Cosserat continuum. Traditionally, the notation used to define a solid body occupy-
ing a volume Ω and bounded by a surface ∂Ω defined by an outward normal vector
n as schematically is shown in Fig. 5.13. In our formulation, we will use dV for
volume differentials and dA for surface area differentials.
There are several different interpretations of the Cosserat continuum. In the
Cosserat and Cosserat (1909) couple stress theory, a differential material element
has not only normal and shear stresses but also couple stress (moment) components
as shown in Fig. 5.12. For linear elastic behavior, the usual stress components are
functions of the strains and the couple stresses are functions of the strain gradients.
Two distinct theories are identified in the Cosserat and Cosserat (1909) as described
by Aero and Kuvshinsky (1961), Mindlin (1964), de Borst and Muhlhaus (1992), de
Borst (1993), and Shu and Fleck (1999). First, there is a reduced Cosserat couple
stress theory with the kinematic degrees of freedom being the displacement ui and an
associated material rotation θi tied to the displacements by a kinematic constraint
given in Eq. (5.119) and with the definitions of strain and curvatures expressed in
Eqs. (5.120) and (5.121), respectively.
According to the reduced Cosserat continuum theory, the continuum is assumed
to possess bending stiffness allowing for the introduction of additional stress mea-
sures in the form of moments per unit area. The presence of the couple stresses
(moments) renders the Cauchy stress tensor asymmetric; however, only the sym-
metric component generates work upon deformation. Second, there is a general
Cosserat continuum couple stress theory in terms of a micro rotation ωi which is
regarded as an independent kinematic variable. The rotation and micro rotation are
related by a relative rotation tensor αij = eijkωk - eijkθk. For the case of ωk = θk, the
general Cosserat continuum couple stress theory reduces to the reduced Cosserat
continuum couple stress theory. This general theory assumes that at a material point,
there is also an embedded micro-volume giving rise to the micro rotation ωi.
Different interpretations have been postulated depending on the deformation prop-
erties assumed for the embedded micro-volume (see Mindlin and Tiersten (1962),
Toupin (1962), and Mindlin (1964, 1965) for a review of the different approaches).
5.9 Thermomechanical Analysis of Cosserat Continuum: Length Scale Effects 279
Fig. 5.14 Relative deformation in the couple stress theory by Mindlin (1964)
For instance, Fig. 5.14 shows the deformation state in a material point including the
micro-volume for the case of pure shear in the solid proposed by Mindlin (1964).
Figure 5.15 shows the case of reduced Cosserat continuum couple stress theory
where the micro-volume is assumed rigid.
Newtonian mechanics equilibrium of the differential element shown in Fig. 5.12
(which is valid for both theories of the Cosserat continuum), after neglecting body
forces and body couples, yields
where σ ij and τ ij are the symmetric and antisymmetric components of the Cauchy
stress tensor, respectively, and mij is the couple’s stress tensor. Surface stress
tractions and surface couple stress (moment) tractions are given by
σ ðnÞ i = σ ij þ τij nj ð5:138Þ
qi = mij nj ð5:139Þ
280 5 Unified Mechanics of Thermomechanical Analysis
where σ (n)i is the surface traction vector, qi is the surface couple tractions vector, and
ni is a surface normal vector. To establish a general framework for elastic material
behavior, Cosserat couple stress continuum can be obtained after postulating a strain
energy density function dependent on strains and curvatures (rotation gradients).
5.9 Thermomechanical Analysis of Cosserat Continuum: Length Scale Effects 281
Toupin (1962) and Mindlin (1965) extended this theory to include also stretch
gradients. In particular, they considered invariants of the strain and strain gradients
in the strain energy density function in terms of the following generalized von Mises
strain invariant which is given by Fleck and Hutchinson (1997):
2 0 0
E2 = ε ε þ c1 η0iik η0jjk þ c2 η0ijk η0ijk þ c3 η0ijk η0kij ð5:140Þ
3 ij ij
where prime superscript denotes deviatoric component, ηijk = uk, ij are the strain
gradients and the c1,c2,c3 are additional material constant with dimensions of length
squared (L2). Equation (5.140) is the basis for the most general strain gradient
plasticity theory, and it essentially reveals the phenomenological coupling between
the densities of statistically and geometrically stored dislocations. The reduced
couple stress theory is just a special case of Eq. (5.140) where only rotation gradients
are considered.
The following elastic constitutive relationships can be written for the symmetric
Cauchy stress tensor, the asymmetric stress tensor, and the couple stress tensor:
where Cijkl is a tangential constitutive tensor relating strains to Cauchy stresses, Dijkl
is a constitutive tensor relating curvatures to couple stresses, and Dijkl is a constitu-
tive tensor relating relative rotations to the antisymmetric component of the Cauchy
stress tensor, and l is characteristic length. The total potential energy functional ∏ for
the general elastic Cosserat continuum can be written by considering separately the
contributions from the symmetric and antisymmetric stress tensors and the couple
stress tensor as follows:
Z Z
Q 1 1
ðui , ωi Þ = C ijkl εkl ðui Þεij ðui ÞdV þ Dijkl xij ðωi Þxkl ðωi ÞdV
2 2
VZ V Z Z ð5:144aÞ
1 ðnÞ
þ Dijkl αij ðui , ωi Þαkl ðui , ωi ÞdV - σ i ui dA - qi ωi dA
2 A A
V
282 5 Unified Mechanics of Thermomechanical Analysis
where
1
aðu, uÞ - f ðuÞ
F ð uÞ =
2
1
GðωÞ = bðω, ωÞ - gðωÞ
2
Z
1
H ðu, ωÞ = Dijkl αij ðui , ωi Þαkl ðui , ωi Þ dV ð5:144bÞ
2
V
R R
with aðu, uÞ C ijkl εij εkl dV and bðω, ωÞ Dijkl xij xkl dV being symmetric bilinear
Ω V
forms and f(u), g(ω) corresponding to the boundary terms in Eq. (5.144a). For the
particular case of the reduced couple stress theory where the stress tensor is sym-
metric and satisfies the constraint in Eq. (5.119), αij (or equivalently H(u, ω))
vanishes and Eq. (5.144b) reduces to Eq. (5.145a):
Z Z
1 1
Π ð ui Þ = C ijkl εkl ðui Þ εij ðui Þ dV þ Dijkl xij ðui Þ xkl ðui Þ dV
2 2
V V
Z Z
ðnÞ
- σ i ui dA - qi ωi dA ð5:145aÞ
A A
where now the strain energy contribution from the curvatures becomes a function of
the translational degrees of freedom only. Note that Eq. (5.145a) is analogous to the
total potential energy functional for the particular case of the so-called Timoshenko
beam theory. Using the alternative notation, (5.145a) can be written as
continuum. This implies that Eq. (5.145a) is written in terms of additional indepen-
dent rotational degrees of freedom but with the additional requirement imposed by
the constraint between translational and rotational degrees of freedom. In this case,
the reduced theory can also be described by the functional Πb ðu, ωÞ that follows:
Z Z
b ð ui , ω i Þ = 1
Π C ijkl εkl ðui Þεij ðui ÞdV þ
1
Dijkl xij ðωi Þxkl ðωi ÞdV
2 2
V V
Z Z
ðnÞ
- σ i ui dA - qi ωi dA ð5:145cÞ
A A
Consider the case of the general couple stress continuum where there is no constraint
set but instead, there is a space of admissible functions with translational and
rotational degrees of freedom. The rotational degrees of freedom are linearly inde-
→
pendent and belong to the space Q alternatively, and we can define displacements
→ → → →
and rotations as elements of the single space V × Q . The product V × Q results in a
→ → → →
third space with elements being all the ordered pairs of the form u , ω 2 V × Q
because translational and rotational degrees of freedom are linearly independent. In
terms of the introduced notation, this is equivalent to the following variational
→ → → → → → → →
problem; find u , ω 2 V × Q such that for any v , φ 2 V × Q , Π assumes
→ → → →
its minimum value at u, ω where V × Q is now the corresponding space of
admissible functions.
The generalized form of the principle of virtual displacements for the general
couple stress theory can be given by
Z Z Z Z
ðnÞ
σ ij εij ðvi ÞdV þ mij xij ðφi ÞdV þ τij αij ðvi , φi ÞdV - σ i vi dA
A
V V V
Z
- qi φi dA = = 0 ð5:146Þ
A
The principle of virtual displacements for a reduced couple stress theory can be
given by
Z Z Z Z
ðnÞ
σ ij εij ðvi ÞdV þ mij χ ij ðvi ÞdV - σ i vi dA - qi φi ðvi ÞdA = 0 ð5:147Þ
V V A A
In contrast to the case defined in Eq. (5.146), the present principle of virtual
→
displacements shows that the only kinematic variable is displacement u which
defines the strains εij and the curvatures χ ij. This is the definition of the reduced
couple stress theory. There are no linearly independent rotational degrees of freedom
and there is no constraint to be enforced. However, the displacement approximation
(shape) function needs to be C1 continuous in the finite element implementation.
However, when there is no kinematic constraint between displacements and rota-
tions, mesh-dependent results are obtained in finite element analysis. This is a major
obstacle.
The first two terms on the right-hand side of Eq. (5.148a) correspond to virtual
work done by the stresses and couple stresses, respectively. The third term corre-
sponds to the virtual work done by the asymmetric component of the stress tensor.
Equation (5.148b) is the weak enforcement of the constraint of vanishing relative
5.9 Thermomechanical Analysis of Cosserat Continuum: Length Scale Effects 285
rotation. This formulation was proposed by Xia and Hutchinson (1996) to solve the
mesh dependency problem that happens in the displacement-based finite element
method when Eq. (5.147) is used.
Finite Element Method Implementation Details
This section discusses the discretization of the equations derived above starting from
their corresponding variational equations. In each case, we will schematically show a
typical finite element with its associated vector of nodal point degrees of freedom.
The nodal degrees of freedom may be translational only, translational and rotational
degrees of freedom, or translational and rotational degrees of freedom with addi-
tional Lagrange multipliers. To distinguish between the values of the degrees of
freedom at any point within the element and its nodal point value, we will use the
following notation. For instance, if displacement is being considered, the value at
any point within the element will be denoted by the vector u, and its corresponding
nodal point value will be denoted by b u. The value of any degree of freedom at any
point within the element is obtained via interpolation functions from the known
nodal point values. Strains and curvatures are usually calculated at Gauss integration
points from the nodal point displacements and rotations values using interpolation
functions. We will denote such a function by a subscripted symbol where the
subscript indicates the variable that is being interpolated. For instance, we will
denote the displacement-curvature transformation matrix by Bχ where the curvature
at any point within a given element is obtained out of the nodal point displacement
vector as χ = Bχ b
u where b u may have translational or translational and rotational
degrees of freedom.
In the case of the general couple stress theory, the continuum is free of constraints as
→ →
a result, and u , ω are linearly independent degrees of freedom. In the finite element
formulation, C0 continuity of shape functions is enough to satisfy the displacement
compatibility requirement. The nodal point displacement degrees of freedom for an
n-nodded two-dimensional quadric-lateral element have the following general form:
uTe = ½u1 v1 ω1 . . . un vn ωn as shown in Fig. 5.16. To describe the finite element
b
discretization, we recall that for a general couple stress theory, principle of virtual
work is given by
Z Z Z Z
ðnÞ
σ ij εij ðvi ÞdV þ mij χ ij ðφi ÞdV þ τij αij ðvi , φi ÞdV - σ i vi dA
V V V A
Z
- qi φi dA = 0 ð5:149Þ
A
286 5 Unified Mechanics of Thermomechanical Analysis
u = Nu
ðnÞ b bT
uLetting i,, ε = Bεui,, χ = Bχ ui,, α = Bαui, and σ i = N t where t =
t 1 t v1 t ω1 . . . t un t vn t ωn , where subscript n represents node number. Using the constitutive
relationships given by Eqs. (5.141)–(5.143), we can write the following discretized
form after eliminating the virtual variables:
2 3
Z Z Z Z
4 BTε CBε dV þ BTχ DBχ dV þ BTα DBα dV 5ui = N Tbt dA ð5:150Þ
V V V A
Z Z Z Z
ðnÞ
σ ij εij ðvi ÞdV þ mij χ ij ðvi ÞdV - σi vi dA - qi φi ðvi Þ dA = 0 ð5:152Þ
V V A A
5.9 Thermomechanical Analysis of Cosserat Continuum: Length Scale Effects 287
Z Z Z Z
ðnÞ
σ ij εij ðvi ÞdV þ mij χ ji ðφi ÞdV þ τij αij ðvi , φi ÞdV = σ i vi dA
V V V A
Z
þ qi φi dA ð5:155aÞ
A
288 5 Unified Mechanics of Thermomechanical Analysis
Z
ρij αij ðui , ωi ÞdA = 0 ð5:155bÞ
A
Up to this point, all our derivations have been based on Newtonian mechanics. In the
Cosserat continuum, the side effects are introduced by enhancing the definition of
equivalent plastic strain with the addition of an equivalent plastic curvature, which is
a gradient of strain. This approach leads to a straightforward extension of the classic
flow theory (also known as the theory of incremental plasticity) which allows the
treatment of cyclic loading and monotonic loading. First, we will start with the
formulation that presents the flow theory equations for the undamaged rate-
independent case. A key feature is the coupling between the Cauchy stress and a
5.10 Cosserat Continuum Implementation in Unified Mechanics Theory 289
couple of stress components. This coupling is not present in the initial elastic
material but progressively appears with the accumulation of plastic curvatures. In
the case of degradation, additional terms appear in the expression for the Helmholtz
free energy determination. On the other hand, consideration of rate-dependent
effects assumes the validity of the same rules for the plastic curvatures as for the
plastic strains. This assumption allows for using a single creep law for both compo-
nents (strain and curvature) and is justified by the kinematic constraint present in the
reduced couple stress continuum.
The relationship between the symmetric part of the Cauchy stress tensor and the
elastic strains and the couple stress tensor and the elastic curvatures can be written in
rate form as follows:
where Dijkl = Gδikδjl, where G is the shear modulus. The strains and curvatures are
decoupled into elastic and inelastic components which results in
ε_ ij = ε_ elij þ ε_ pl
ij ð5:158Þ
ℓ χ_ ij = ℓ χ_ elij þ ℓ χ_ pl
ij ð5:159Þ
where Sij is the deviatoric stress tensor and mij is the couple stress tensor, which is
deviatoric (it does not have a spherical component). Similarly, the generalized strain
tensor norm can be given by
1=2
Ε = εij εij þ ℓ χ ij ℓ χ ij ð5:161Þ
Next, we need to introduce a yield surface separating the elastic and inelastic
domains per classical plasticity theory. To account for the Bauschinger
effect observed in metals, a back-stress tensor βij and a couple of back-stress tensors
ℓ -1ηij defining the movement of the yield surface in stress space need to be defined.
The difference fij = Sij - βij between the back-stress and the deviatoric component of
290 5 Unified Mechanics of Thermomechanical Analysis
the symmetric part of the Cauchy stress tensor is the relative stress tensor fij. In an
analogous form for the couple back-stress, there follows that the relative couple
b ij which is defined by
back-stress ℓ - 1 C
b ij = ℓ - 1 mij - ℓ - 1 ηij :
ℓ - 1C
The generalized relative stress kξk can be described in terms of the relative
stresses and given by
→ h i1=2
b ij C
ξ = f ij f ij þ ℓ - 2 C b ij ð5:162Þ
We can now define a yield surface to distinguish between elastic domain and
inelastic domain. In the classical theory of incremental plasticity, the yield surface is
defined in terms of a hardening parameter that can be shown to be proportional to the
equivalent plastic strain or plastic work. In the couple stress-based strain gradient
plasticity theory introduced here, the hardening parameter also incorporates the size
effects via the equivalent plastic curvatures. The generalized yield criteria can
therefore be given by
rffiffiffi
-1
2
F σ, ℓ m, α = ξ - K ðαÞ ð5:163Þ
3
where α is the generalized isotropic hardening parameter and K(α) represents the
radius of the yield surface. To complete the constitutive model, it is necessary to
define the rate-independent plastic flow rules (i.e., evolution equations for the plastic
strains and curvatures) and hardening laws (i.e., the evolution of the hardening
parameter and back-stress components). Here it is assumed that the flow rule
obeys associative plasticity. Considering a more general stress space with normal
stresses and couple stresses, the yield surface can be considered as a hypersphere in a
stress space with normal N b to the yield surface, F, defined as
b = ∂F ½b
N n, bv ð5:164Þ
→
∂Σ
b
ℓ-1
∂F f ij ∂F C ij
where b
n= ∂σ
and bv = ∂l - 1 m
.
ξij ξij
∂F f ij
ε_ pl
ij = γ γ γb
n ð5:165aÞ
∂σ ij ξij
5.10 Cosserat Continuum Implementation in Unified Mechanics Theory 291
∂F b ij
C
ℓ χ_ pl
ij = γ -1
γℓ - 1 γb
ν ð5:165bÞ
∂ℓ mij ξij
The constitutive model is completed with the evolution equations for the back-
stresses:
2
β_ ij = H 0 γb nij ð5:167aÞ
3
2
ℓ η_ ij = H 0 γb
νij ð5:167bÞ
3
which implies Ε_ = γ
pl
Z t rffiffiffi
2 _ pl
α ðt Þ = Ε ðτÞ dτ ð5:169Þ
3
0
∂F ∂F ∂F ∂F
F_ = : σ_ þ - 1 : ℓ - 1 m
_ þ p : ε_ pl þ : χ_ pl ð5:172Þ
∂σ ∂ℓ m ∂ε ∂ℓχ pl
Using Hooke’s law together with the plastic flow rule yields
F_ = ½b
n : C : ε_ þ bv : D : ℓχ_
∂F ∂F
b b b
- γ ½n : C : n þ v : D : v - b b
:nþ : bv ð5:173Þ
∂εpl ∂ℓχ pl
And imposing the consistency condition yields the consistency parameter γ in the
following form:
n : C : ε_ þ bv : D : ℓ χ_
b
γ= ð5:174Þ
b ∂F ∂F
n:C:b
n þ bv : D : bv - ∂εpl
:b
n þ ∂ℓχ pl : b
v
yields
rffiffiffi rffiffiffi
∂F 2 0 2 εpl S ∂F 2 0 2 ℓχ pl ℓ - 1 m
:b
n= - K : and : bv = - K :
∂εpl 3 3 α Σ ∂ℓχ pl 3 3 α Σ
ð5:176Þ
∂F ∂F
:b nþ
∂εpl ∂ℓχ pl
rffiffiffi
2 0 2 1 pl 2
: bv = - K ε : S þ ℓχ pl : ℓ - 1 m = - K 0 ð5:177Þ
3 3 α Σ 3
b
n : C : ε_ þ bv : D : ℓ χ_
γ= ð5:178Þ
b
n:C:b n þ bv : D : bv þ 23 K 0
Using
S:S
b
n : C = 2Gb
n and b
n:C:b
n = 2G
k Σk 2
and
ℓ - 1m : ℓ - 1m
bv : D = 2Gbv and bv : C : bv = 2G
kΣ k2
1S : ε_ þ ℓ - 1 : ℓ χ_ 1 S : ε_ þ ℓ - 1 : ℓ χ_
γ= = ð5:179Þ
Σ
K0
1 þ 3G Σ Kb
2G O 2G O
σ_ = C : ε_ - b
n b
n : ε_ - b
n bv : ℓχ_ ð5:180aÞ
b
K Kb
2G O 2G O
ℓ - 1m
_ = D : ℓ χ_ - b
n bv : ε_ - bv bv : ℓ χ_ ð5:180bÞ
Kb Kb
Using
O a
c 1 O a
c
C = κI I þ 2G - I I and D = 2G ð5:181Þ
3
2 O N N 3
N ‘
b - 1I b b
n n b
n bv
6 κI I- - 2G
bN 7
Iþ2G
3 N b
M ep = 6
4
K K 7 ð5:182Þ
b
n bv ‘
b b
v v 5
- 2G 2G -
b
K b
K
The plastic flow theory just described can be written in the following alternative
form, which is useful in the numerical treatment of the problem. Using the general-
ized vector and matrix notation, for the simple case of isotropic hardening, we make
the following definitions:
h i h i h i
σ ε
Σ= ℓ -1
m
, Ε= ℓχ
, M= C
0
0
D
ð5:184Þ
and then the constitutive model equations can be written as Σ = M : Εel. The yield
criteria and flow rule follow:
rffiffiffi
2
F σ, ℓ - 1 m, α = Σ0 - K ðαÞ ð5:185Þ
3
E_ = γ N
b
pl
ð5:186Þ
then
∂F _ ∂F _ pl
F_ = Σþ Ε ð5:187Þ
∂Σ ∂Εpl
F_ = N b þ ∂F γ N
b : M : Ε_ - γN : M : N b ð5:188Þ
∂Εpl
N : M : Ε_
pl
∂F b 2
γ= or after using N = - K0 ð5:189Þ
b
N:M:N b - ∂Fpl N
b ∂Ε pl 3
∂Ε
we have
5.10 Cosserat Continuum Implementation in Unified Mechanics Theory 295
b : M : Ε_ pl
N
γ= ð5:190Þ
b :M:N
N b þ 2 K0
3
Using
N bT, N
b : M = 2GN b :M:N
b = 2G, b : M : Ε = 2GN
N b Ε
yields
b Ε_
N
γ= ð5:191Þ
b
K
The constitutive tensor given by Eq. (5.192) is equivalent to Eq. (5.183) where
the coupling between strains and curvatures becomes evident in the off-diagonal
terms in the generalized constitutive tensor.
hφðF Þi
γ= ð5:193Þ
η
F = ΘðγηÞ ð5:194Þ
θ
Σ_ = ð1 - ΦÞM e : Ε_ - Ε_ - Ε_
vp
ð5:197Þ
Flow rule
Ε_ = γ ð1 Pξ
vp
- ΦÞ
Hardening laws
qffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
Φ
α_ = 23γ ð1 - ΦÞ ξΦ PξΦ b
X_ = γ 32 H 0 ðαÞð1 - ΦÞN
T
Consistency parameter
γ = hφðηFÞi
θ
where E_ is the total strain rate, Ε_ is the viscoplastic strain rate, and Ε_ is the
vp
Φ 2
β_ ij = ð1 - ΦÞ H 0 γ b
nij ð5:199Þ
3
2 0
ℓη_ Φ
ij = ð1 - ΦÞ 3 H γ b
νij ð5:200Þ
In Eq. (5.198) P is a constant matrix; hence, PξΦ gives the deviatoric component
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
of ξΦ and 23 ξΦ P ξΦ is the von Misses equivalent stress for the generalized relative
T
stress vector. K(α) represents the radius of the yield surface. The direction of plastic
flow and the hardening laws are given for the associative plasticity case as follows:
Pξ
Ε_ = γ
vp
ð5:201Þ
ð1 - ΦÞ
rffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
2
γ ð1 - ΦÞ ξΦ PξΦ
T
α_ = ð5:202Þ
3
Φ 2 b
X_ = γ H 0 ðαÞð1 - ΦÞN ð5:203Þ
3
stress continuum back-stresses evolve in the same manner as the symmetric stress
tensor back-stress. However, this is not essential for the formulation, because they
can be different. The couple back-stress is needed to allow for the uniform move-
ment of the yield surface in the enhanced (Cosserat continuum) stress space.
Viscoplastic Creep Law
The evolution of the generalized viscoplastic strain rate is specified by a function
representative of the viscoplastic flow micro-mechanisms in the material. In the
following formulation, the same creep law is assumed to hold for both the strains and
the strain gradients. The following viscoplastic strain rate function is one of the most
generalized models for low melting point solder alloys. It accounts for multiple creep
micro-mechanisms in the polycrystalline alloys as well as accounting for coarsening
of the crystals:
n p
AD0 Eb hF i b ∂F
ε_ vp
ij = e - Q=Rθ ð5:204Þ
kθ E d ∂σ ij
n p
AD0 Eb hF i b ∂F
ℓ χ_ vp
ij = e - Q=Rθ - 1 ð5:205Þ
kθ E d ∂ℓ mij
n p
_Ε vp = AD0 Eb hF i b
e - Q=Rθ
∂F
ð5:206Þ
kθ E d ∂Σ
In the Cosserat continuum couple stress theory, the internal energy can be given by
de
ρ = σ Sij Dij þ mji χ ij þ ρr - qi,i ð5:208Þ
dt
where σ Sij is the symmetric part of the Cauchy stress tensor, Dij is the rate of
deformation tensor, mji is the couple stress tensor, and χ ij are the corresponding
5.10 Cosserat Continuum Implementation in Unified Mechanics Theory 299
curvatures. The rate of change of the Helmholtz free energy φ is written in terms of
the symmetric part of the Cauchy stress tensor; thus,
dφ
ρ = σ Sij Deij ð5:209Þ
dt
Combining Eqs. (5.208) and (5.209), the difference between the changes in the
internal energy and the Helmholtz free energy with respect to a reference state in the
presence of couple stresses can be given by
In the material nonlinear finite element method, the solution is achieved in small
increments with at every increment Newton-Raphson iteration method is used to find
the equilibrium. At every increment, the problem may be regarded as analogous to a
strain-controlled test. At the beginning of the time step, the total and viscoplastic
strain fields and the internal state variables are considered to be known. Assuming
that displacement increment is known, the basic problem is to update the field
variables to their new values at the end of the time step in a manner consistent
with the constitutive model. Below we present the integration algorithm. First, the
case of a rate-independent model without degradation using both a radial return and
return mapping scheme is presented. Then the rate-dependent model with degrada-
tion case with the return mapping algorithm is presented in Table 5.6.
Radial Return Algorithm: Rate-Independent Model with Linear Isotropic
Hardening
Using Eqs. (5.148a) and (5.148b) and the backward Euler finite difference integra-
tion scheme, we can write
b nþ1
Εpnþ1 = Εpn þ Δγ N
rffiffiffi
2 ð5:211Þ
αnþ1 = αn þ Δγ
3
Now consider the following trial state obtained after freezing the plastic flow:
Σtrnþ1 = Σn þ 2GΔΕnþ1
ð5:212Þ
Σnþ1 = Σtr - 2GΔγ Nb nþ1
nþ1
300 5 Unified Mechanics of Thermomechanical Analysis
and
b nþ1 = Σnþ1
tr
N ð5:213bÞ
Σtrnþ1
using
dΔγ 1b
= N ð5:217Þ
b nþ1
dΕnþ1 K
and
b nþ1
dN 2G
= H nþ1 ð5:218Þ
dΕnþ1 Σtrnþ1
where
2‘ O O 3
b - 1I I-b
n b
n 0
H nþ1 = 4 3 nþ1 5 ð5:219Þ
nþ1 ‘
d N
0 -b
v bv
nþ1 nþ1
thus,
5.10 Cosserat Continuum Implementation in Unified Mechanics Theory 301
2 3
N
dΣnþ1 b
6N b nþ1
N Δγ2G 7
= M - 2G4 nþ1 þ H nþ1 5 ð5:220Þ
dΕnþ1 b
K Σtrnþ1
M ep
nþ1
2 O O 3
N ‘b - 1I N 2G
6 κI Iþ2Gδnþ1 I - 2Gθnþ1bnnþ1 bnnþ1 - b
n bvnþ1 7
=4 3 O b nþ1
K 5
2G ‘
b N
- b
nnþ1 bvnþ1 2Gδnþ1 - 2Gθnþ1 v bv
b
K
nþ1 nþ1
ð5:221Þ
The standard operator split technique defines the following trial state:
Σtr
nþ1 = Σn þ ð1 - ΦÞMΔΕ nþ1 ð5:225Þ
Σnþ1 = Σtr
nþ1 - MΔγPξnþ1 ð5:226Þ
nþ1 = Σnþ1 - X n
ξtr tr
ð5:227Þ
From (5.226) and (5.227), updated relative stress can be obtained in terms of the
algorithmic consistency parameter Δγ:
1
ξnþ1 = ΞðΔγ Þ M - 1 ξtr ð5:228Þ
1 þ 23 H 0 Δγ ð1 - ΦÞ nþ1
302 5 Unified Mechanics of Thermomechanical Analysis
where
" #-1
ΔγP
ΞðΔγ Þ = M - 1 þ 2 0
ð5:229Þ
1 þ 3 H ð1 - ΦÞΔγ
where
ΞðΔγÞPξnþ1
Ν = qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
ffi ð5:232Þ
ξTnþ1 PΞðΔγÞPξnþ1
~β = ð1 - ΦÞ 2θ1 b 2 θ2 dΘ b 1
f nþ1 ðK 0 θ1 þ H 0 θ2 Þ þ 1 f nþ1 T ð5:233Þ
3θ2 θ2 dΔγ ξnþ1 PΞðΔγ ÞPξnþ1
2
θ1 = 1 þ H0 ð1 - ΦÞΔγ
3
2 0
θ2 = 1 - K ð1 - ΦÞΔγ
3
rffiffiffi
2
F ðΔγ Þ = bf nþ1 - K ðαnþ1 Þ = 0 ð5:234Þ
3
5.10 Cosserat Continuum Implementation in Unified Mechanics Theory 303
1
ξnþ1 = ΞðΔγ Þ M - 1 ξtr ð5:235Þ
1 þ 23 H 0 Δγ ð1 - ΦÞ nþ1
Solution of Eq. (5.230) with a local iteration requires the “Jacobian” J(Δγ) to be
defined. In the Newton-Raphson iteration, the Jacobian represents the derivative of
the function; hence,
rffiffiffi
dF dΘ dbf 2 dK dΘ
J ðΔγ Þ = - nþ1 - - ð5:236Þ
dΔγ dΔγ dΔγ 3 dΔγ dΔγ
h i-1
2
ξnþ1 = I þ H 0 Δγ ð1 - ΦÞI þ ΔγPM ξtr
nþ1 ð5:237Þ
3
db
f nþ1
To find dΔγ , the symmetric matrices P and M are diagonalized. Since both
matrices share the same characteristic space, the following relationships can be
written:
P = QΛP QT ð5:238Þ
M = QΛM QT ð5:239Þ
and then
PM = QΛP ΛM QT
h i-1
2
ξnþ1 = 1 þ ΔγH0 I þ ΔγQΛP ΛM QT ξtr
nþ1 ð5:242Þ
3
where
ΓðΔγ Þ
2 1 1 1 3
, , ,
6 1þ
2 2 2
ð1 - ΦÞH 0 þ 2G Δγ 1 þ ð1 - ΦÞH 0 þ 2G Δγ 1 þ ð1 - ΦÞH 0 Δγ 7
6 7
= DIAG6 3 3 3 7
6 1 1 1 7
4 , , 5
2 2 2
1 þ ð1 - ΦÞH 0 þ 2G Δγ 1 þ ð1 - ΦÞH 0 þ 2G Δγ 1 þ ð1 - ΦÞH 0 þ 2G Δγ
3 3 3
ð5:245Þ
db
f nþ1
Using Eqs. (5.244), (5.245), and (5.246), dΔγ can be defined as
dbf nþ1 b
g
= nþ1 ð5:247Þ
dΔγ b
f nþ1
where
T
b
gnþ1 = ξtr
nþ1 QGðΔγ ÞQ PQΓðΔγ ÞQ ξnþ1
T T tr
ð5:248Þ
5.10 Cosserat Continuum Implementation in Unified Mechanics Theory 305
dΓðΔγ Þ
GðΔγ Þ = = DIAG½G11 , G22, G33 , G44 , G55 , G66 with
dΔγ
2 0
3 ð1 - ΦÞH þ 2G
G11 = G22 = G44 = G55 = G66 = - 2
1 þ 23 ð1 - ΦÞH0 þ 2G Δγ
0
3 ð1 - ΦÞH
2
G33 = - 2
1 þ 23 ð1 - ΦÞH0Δγ
Using the definition of K(α), Eq. (5.247) and θ2 from Eq. (5.231) yield the
Jacobian needed for the local Newton iteration:
dF ðΔγ Þ b
g 2
= θ2 nþ1 - K 0bf nþ1 ð5:249Þ
dΔγ bf 3
nþ1
nþ1 = Σnþ1 - X n
ξtr tr
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffi
Compute trial yield function
nþ1 = ξtrT
nþ1 Pξ nþ1 - 3K ðαn Þ
tr
Ftr 2
X nþ1 = X n þ Δγ 23 H0 ð1 - ΦÞξnþ1
Σn + 1 = ξ n + 1 + Xn + 1
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
αnþ1 = αn þ Δγ ð1 - ΦÞ 23 ξTnþ1 Pξnþ1
Pξ
Εvp
nþ1 = Ε n þ Δγ ð1 - ΦÞ
vp nþ1
FðΔγ ðkÞ Þ
Δγ ðkþ1Þ ← Δγ ðkÞ - JðΔγðkÞ Þ
pffiffiffiffiffiffiffiffi
αnþ1 ðkþ1Þ ← αnþ1 ðkÞ þ 2=3bf nþ1 ð1 - ΦÞ Δγ ðkÞ
END DO_UNITL
References
Aero, E., & Kuvshinsky, E. (1961). Fundamental equations of the theory of elastic media with
rotationally interacting particles. Soviet Physics Solid State, 2, 1272.
Armstrong, P., & Frederick, C. (1966). A mathematical representation of the multiaxial
Bauschinger effect. CEGB Report RD/N731.
Basaran, C., Zhao, Y., Tang, H., & Gomez, J. (2005). A damage mechanics based unified
constitutive model for Pb/Sn solder alloys. ASME Journal of Electronic Packaging, 127(3),
208–214.
Chaboche, J. (1989). Constitutive equations for cyclic plasticity and viscoplasticity. International
Journal of Plasticity, 3, 247–302.
Cosserat, E., & Cosserat, F. (1909). Theorie des corps deformables. A Hermann & Fils.
De Borst, R. (1993). A generalization of J2-flow theory for polar continua. Computer Methods in
Applied Mechanics and Engineering, 103, 347–362.
De Borst, R., & Muhlhaus, H. (1992). Gradient dependent plasticity: Formulation and algorithmic
aspects. International Journal for Numerical Methods in Engineering, 35, 521–539.
DeHoff, R. T. (1993). Thermodynamics in materials science. McGraw Hill.
Eringen, A. C. (1968). Theory of micropolar elasticity. In H. Leibowitz (Ed.), Fracture, and
advanced treatise (pp. 621–729). Academic Press.
Fleck, N., & Hutchinson, J. (1997). Strain gradient plasticity. Advances in Applied Mechanics, 33,
295–361.
Fung, Y. C., & Tong, P. (2001). Classical and computational solid mechanics. Advanced series in
engineering science (Vol. 1). World Scientific.
Jean-Louis Chaboche. (1990). Mechanics of Solid Materials, Cambridge University Press; 1st
edition
Kashyap, B., & Murty, G. (1981). Experimental constitutive relations for the high-temperature
deformation of a Pb-Sn eutectic alloy. Material Science and Engineering, 50, 205–213.
Koiter, W. T. (1964). Couple of stresses in the theory of elasticity. Proceedings of the Koninklijke
Nederlandse Akademie van Wetenschappen, Series B: Physical Sciences, 67, 17–44.
Krajcinovic, D. (1996). Damage mechanics. North-Holland series in applied mathematics and
mechanics. Elsevier.
Lee, Y., & Basaran, C. (2011). A creep model for solder alloys. Transactions of the ASME: Journal
of Electronic Packaging, 133(4), 044501-1.
References 307
6.1 Introduction
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2022 309
C. Basaran, Introduction to Unified Mechanics Theory with Applications,
https://doi.org/10.1007/978-3-031-18621-9_6
310 6 Thermomechanical Analysis of Particle-Filled Composites
3ð1 - v0 Þðk 1 - k 0 Þφ
k = k0 1þ ð6:1Þ
3ð1 - v0 Þk0 þ ð1 - φÞð1 þ v0 Þðk 1 - k 0 Þ
15ð1 - v0 Þðμ1 - μ0 Þφ
μ = μ0 1þ ð6:2Þ
15ð1 - v0 Þμ0 þ ð1 - φÞð8 - 10v0 Þðμ1 - μ0 Þ
where φ is the particle volume fraction and v0 is Poisson’s ratio of the matrix.
If the effects due to interparticle interactions are included for the two-phase elastic
composites with randomly located spherical particles, the solutions for the effective
bulk modulus k and shear modulus μ are provided by Ju and Chen (1994b):
30ð1 - v0 Þφð3γ 1 þ 2γ 2 Þ
k = k0 1þ ð6:3Þ
3α þ 2β - 10ð1 þ v0 Þφð3γ 1 þ 2γ 2 Þ
30ð1 - v0 Þφγ 2
μ = μ0 1 þ ð6:4Þ
β - 4ð4 - 5v0 Þφγ 2
with
k0 μ0
α = 2ð5v0 - 1Þ þ 10ð1 - v0 Þ - ð6:5Þ
k1 - k0 μ1 - μ0
μ0
β = 2ð4 - 5v0 Þ þ 15ð1 - v0 Þ ð6:6Þ
μ1 - μ0
And
5φ 8α
γ1 = ð 13 - 14v Þv - ð 1 - 2v Þ ð1 þ v Þ ð6:7Þ
8β2
0 0
3α þ 2β 0 0
1 5φ 6α
γ2 = þ 25 - 34v þ 22v 2
- ð 1 - 2v Þ ð 1 þ v Þ ð6:8Þ
2 16β2 0 0 3α þ 2β 0 0
9k μ
E = ð6:9Þ
3k þ μ
3k - 2μ
v = ð6:10Þ
6k þ 2μ
The experimental validation of the Ju and Chen model was provided by Nie and
Basaran (2005), by comparing the analytical predictions with the experimental data
on the particulate composite prepared using lightly cross-linked poly-methyl meth-
acrylate (PMMA) filled with alumina trihydrate (ATH). All filler particles are
assumed spherical and both the matrix and the filler are isotropic elastic. The
material properties involved are as follows:
6.1 Introduction 311
12
6
Noninteracting solution
4
0
0 0.1 0.2 0.3 0.4 0.5 0.6
Particles Volume Fraction
Fig. 6.1 Effective Young’s modulus as a function of particle volume fraction. (After Nie (2005))
and 75% of the tensile strength of the composite. Therefore, the thermal stress
associated with the CTE mismatch between the matrix and particles is an important
factor in the failure of particulate composites subjected to thermomechanical loads.
where V is the volume of an RVE, V0 is the volume of the matrix, Vq is the volume of
the qth-phase particles, and n denotes the number of particulate phases [fillers] of
different materials (excluding the matrix).
Similarly, the volume-averaged strain tensor is defined as
2 3
Z Z Xn Z h Xn i
1 16 71
ε= εðxÞdx = 4 εðxÞdx þ εðxÞdx
5 V 0 ε0 þ V ε
V V q=1 V q=1 q q
V V0 Vq
ð6:12Þ
where C0 is the stiffness tensor of the matrix and ε0 is the uniform elastic strain field
induced by far-field loads for a homogeneous matrix material only. εTq is its Eigen
6.2 Ensemble-Volume Averaged Micromechanical Field Equations 313
strain associated with the qth particle, and εq is the fictitious equivalent Eigen strain
by replacing the qth particles with the matrix material. ε′(x) is the perturbed strain due
to distributed Eigen strain εT and ε is associated with all the particles in the RVE.
The stress tensor for the matrix is given in Newtonian mechanics by
σ0 = C 0 : ε 0 ð6:14Þ
The strain at any point within an RVE is decomposed into two parts, the uniform
strain and the perturbed strain due to the distributed Eigen strain. It is emphasized
that the Eigen strain ε and εT are nonzero in the particle domain and zero in the
matrix domain, respectively. The perturbed strain field induced by all the distributed
Eigen strains ε and εT can be expressed by (Mura, 1987)
Z
ε0 ðxÞ = Gðx - x0 Þ : εT ðx0 Þ þ ε ðx0 Þ dx0 ð6:15Þ
V
Gijkl ðx - x0 Þ
1
= F ð - 15, 3v0 , 3, 3 - 6v0 , - 1 þ 2v0 , 1 - 2v0 Þ ð6:16Þ
8π ð1 - v0 Þr 3 ijkl
where r x - x′, r j rj, and v0 is Poisson’s ratio of the matrix. The components of
the fourth-rank tensor F are defined by
F ijkl ðBm Þ = B1 ni nj nk nl þ B2 δik nj nl þ δil nj nk þ δjk ni nl þ δjl ni nk þ B3 δij nk nl
ð6:17Þ
þ B4 δkl ni nj þ B5 δij δkl þ B6 δik δjl þ δil δjk
where
-1
Aq = Cq - C0 C0 ð6:19Þ
Z
ε ð xÞ = ε 0 þ ε 0 ð xÞ = ε0 þ Gðx - x0 Þ : εT ðx0 Þ þ ε ðx0 Þ dx0 ð6:20Þ
V
where s is a constant tensor for unidirectionally aligned and similar ellipsoidal filler
particles. If the linear elastic matrix material is isotropic and all inclusions are
spherical, then the s takes the form of the Eshelby tensor S:
1
Sijkl = ð5v0 - 1Þδij δkl þ ð4 - 5v0 Þ δik δjl þ δil δjk ð6:22Þ
15ð1 - v0 Þ
The effective elastic properties can be obtained, in principle, from Eqs. (6.18),
(6.21), and (6.23) since the variables are σ,ε,ε0 ,εq. Essentially, one needs to solve the
relationship between ε and εq, which involves the solution of the integral Eq. (6.18).
εq depends on interparticle interactions, particle-matrix interactions, and microstruc-
ture (i.e., particle sizes, orientation, shapes, volume fractions, locations, configura-
tions, and probability functions) of a composite system. For randomly dispersed
particles, one needs to obtain the ensemble-volume averaged relation between ε and
εq by averaging all possible solutions of the integral Eq. (6.18) for any particle
configurations generated according to the specified probability function.
Taking the ensemble-volume average of Eq. (6.18) overall qth-phase particles, we
obtain
6.2 Ensemble-Volume Averaged Micromechanical Field Equations 315
where
Z Z
1
ε0q = Gðx - x0 Þ : εT ðx0 Þ þ ε ðx0 Þ dx0 dx ð6:25Þ
Vq
Vq V
If all particles in the qth phase have the same ellipsoidal shape and orientation,
then εq 0 can be given by
ε0q = ε0p
q þ Sq : εq þ εq
T
ð6:26Þ
with
8 9
Z >
< Z >
=
1 XN q
ε0p
q = Gðx - x0 Þ : εT ðx0 Þ þ ε ðx0 Þ dx0 dx ð6:27Þ
Vq i=1 >
: >
;
Ωiq V - Ωiq
representing the interparticle interaction effects, where Ωiq is the domain of the qth
filler particle in the qth-phase domain Vq and Nq is the number of the [filler] phase
q particles dispersed in V. Sq is the Eshelby tensor associated with the qth particle.
From Eqs. (6.24) and (6.26), we arrive at
- Aq - Sq : εq = ε0 - εT þ Sq : εT þ ε0p
q ð6:28Þ
To solve Eqs. (6.29)–(6.31) and obtain the effective elastic properties of the
composite, it is essential to express the qth-phase average Eigen strain εq in terms
of the volume average strain ε . Namely, one must solve the integral Eq. (6.18)
exactly for each phase, which involves details of random microstructure.
316 6 Thermomechanical Analysis of Particle-Filled Composites
where K(e p) is the isotropic hardening function of the matrix-only material. Further-
more, H(σD) signifies the square of the deviatoric stress norm in the matrix only:
H σD = σ : I D : σ ð6:33Þ
D 1 1
I ijkl = - δij δkl þ δik δjl þ δil δjk ð6:34Þ
3 2
To solve the elastoplastic response exactly, the stress at any local point must be
known and used to determine the plastic response through the local yield criterion for
all possible configurations. This approach is in general infeasible due to the com-
plexity of the statistical and microstructural information. Therefore, a framework in
which an ensemble-averaged yield criterion is constructed for the entire composite.
Total stress σ(x) at any point x in the matrix can be given by the superposition of the
far-field stress σ0 and the perturbed stress σ′ induced by the inclusion particles:
σ ð xÞ = σ 0 þ σ 0 ð xÞ ð6:35Þ
According to the Eshelby theory, the perturbed stress σ ′ at any point in the matrix
due to the presence of the particles can be given by
Z
0
σ ðxÞ = C0 : Gðx - x0 Þ : εT þ ε ðx0 Þ dx0 ð6:36Þ
V
6.3 Noninteracting Solution for Two-Phase Composites 317
where ε(x′) denotes the fictitious elastic Eigen strain in the particle-induced by
replacing the inhomogeneity [filler] with the matrix material. εT is the Eigen strain of
the inhomogeneity. Assuming εT is uniform in the particles, G(x - x′) is the fourth-
rank tensor of Green’s function defined by Eq. (6.16).
According to the Eshelby theory, the Eigen strain ε(x′) due to a single ellipsoidal
inclusion is uniform for the interior points of an isolated (noninteracting) inclusion.
Therefore, the perturbed stress for any matrix point x due to a typically isolated
inhomogeneity centered at x1 takes the form
σ0 ðxjx1 Þ = C0 Gðx - x1 Þ : εT þ ε0 ð6:37Þ
where
Z
Gðx - x1 Þ = Gðx - x0 Þdx0 For x=
2 Ω1 ð6:38Þ
Ω1
ρ3 1
G ð x - x1 Þ = H þ ρ2 H2 ð6:39Þ
30ð1 - v0 Þ
where r x - x1, r j rj, ρ = a/r, and a is the radius of a spherical particle. The
components of the fourth-rank tensor F are given by Eq. (6.17). In Eq. (6.37),
ε0 denotes the solution of the Eigen strain ε for the single inclusion problem,
which is given (from Eq. (6.31) when ε0p
q is dropped):
ε0 = - ðA þ SÞ - 1 : ε0 þ ðA þ SÞ - 1 ðI - SÞ : εT ð6:42Þ
where
A = ðC1 - C0 Þ - 1 C0 ð6:43Þ
We define H(x| ℘) as the square of the current deviatoric stress norm at the local
point x, which determines the plastic strain in a particulate composite for a given
phase configuration ℘. Since there is no plastic strain in the elastic particles or voids,
H(x| ℘) can be written as (Ju & Tseng, 1997)
318 6 Thermomechanical Analysis of Particle-Filled Composites
(
σ ðxj℘Þ : ID : σðxj℘Þ x in the matrix
H ðxj℘Þ = ð6:44Þ
0 otherwise
Here a is the radius of the particle, P(x1) denotes the probability density functions
for finding a particle centered at x1, and H0 corresponds to the far-field deviatoric
stress norm in the matrix:
H 0 = σ0 : I D : σ0 ð6:46Þ
N
Pðx1 Þ = ð6:47Þ
V
with the help of Eqs. (6.35), (6.37), (6.44), and (6.46), plus using the following
identities:
Z
4πr2
ni nj dA = δ ð6:49Þ
3 ij
Aðr Þ
Z
4πr2
ni nj nk nl dA = δij δkl þ δik δjl þ δil δjk ð6:50Þ
15
Aðr Þ
6.3 Noninteracting Solution for Two-Phase Composites 319
Z Z
dr H 1 ðrÞdA = 0 ð6:51Þ
r>a Aðr Þ
Z Z
dr H 2 ðrÞdA = 0 ð6:52Þ
r>a Aðr Þ
Z Z
dr GðrÞdA = 0 ð6:53Þ
r>a Aðr Þ
After some lengthy yet straightforward derivations, one can obtain the ensemble-
averaged current stress norm at any matrix point x as:
where
σT = AC1 : εT ð6:56Þ
with
φ
3T 1 þ 2T 2 = 200ð1 - 2v0 Þ2 ð6:59Þ
ð3α þ 2βÞ2
1 φ
T2 = þ 23 - 50v0 þ 35v20 2 ð6:60Þ
2 β
φ
3T 1 þ 2T 2 = 200ð1 - 2v0 Þ2 ð6:61Þ
ð3α þ 2βÞ2
φ
T 2 = 23 - 50v0 þ 35v20 2 ð6:62Þ
β
α and β are given by Eqs. (6.5) and (6.6), and φ is the filler particle volume fraction.
320 6 Thermomechanical Analysis of Particle-Filled Composites
Using Eqs. (6.63), (6.64), and (6.42), the relation between the far-field stress
σ0 tensor and the macroscopic stress σ takes the form
σ = P : σ0 - Q : σT ð6:65Þ
where
where
With the help of Eq. (6.65), we arrive at the alternative expression for the
ensemble-averaged current stress norm at a matrix point x as:
hH im ðxÞ = σ : T : σ þ 2σ : T : σT þ σT : T : σT ð6:76Þ
6.3 Noninteracting Solution for Two-Phase Composites 321
where the positive definite fourth-rank tensors T, T , and T are defined as follows:
T
T = P-1 T P-1 ð6:77Þ
T
T = P - 1 T P - 1 Q - T P - 1 ð6:78Þ
T
T = P - 1 Q T P - 1 Q þ T - 2T P - 1 Q ð6:79Þ
where the components of T,T , and T are
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:80Þ
Tijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:81Þ
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:82Þ
with
3T 1 þ 2T 2
3T 1 þ 2T 2 = ð6:83Þ
ðaφ þ 1Þ2
T2
T2 = ð6:84Þ
ðbφ þ 1Þ2
3T 1 þ 2T 2
3T 1 þ 2T2 = - ð6:85Þ
ðaφ þ 1Þ2
T2 1
T2 = - þ ð6:86Þ
ðbφ þ 1Þ 2 2 ð bφ þ 1Þ
3T 1 þ 2T 2
3T 1 þ 2T 2 = ð6:87Þ
ðaφ þ 1Þ2
T2 bφ - 1
T2 = þ ð6:88Þ
ðbφ þ 1Þ2 2ðbφ þ 1Þ
If the particle interaction for the two-phase composite is ignored, Eq. (6.30) becomes
ε = ε0 þ φS : εT þ ε0 ð6:90Þ
With the noninteracting solution ε0 of the Eigen strain given by Eq. (6.42), we
arrive at
h i
ε = I - φSðA þ SÞ - 1 : ε0 þ φSðA þ SÞ - 1 ðA þ IÞ : εT ð6:91Þ
ð6:92Þ
Therefore, we have
As a result of Eq. (6.91), the average internal stresses of particles can be expressed
as
h i-1
σ1 = C1 AðA þ SÞ - 1 I - φSðA þ SÞ - 1 :ε
h i-1
- C1 AðA þ SÞ - 1 φSðA þ SÞ - 1 ðA þ IÞ I - φSðA þ SÞ - 1 þ ðI - SÞ : εT
= U : ε - V : εT
ð6:95Þ
where
6.4 Pairwise Interacting Solution for Two-Phase Composites 323
h i-1
U = C1 AðA þ SÞ - 1 I - φSðA þ SÞ - 1 ð6:96Þ
h i-1
V = C1 AðA þ SÞ - 1 φSðA þ SÞ - 1 ðA þ IÞ I - φSðA þ SÞ - 1 þ ðI - S Þ ð6:97Þ
By carrying out the lengthy algebra, the components of the positive definite
fourth-rank tensor U and V are explicitly given by
U ijkl = U 1 δij δkl þ U 2 δik δjl þ δil δjk ð6:98Þ
V ijkl = V 1 δij δkl þ V 2 δik δjl þ δil δjk ð6:99Þ
where
Within the present two-sphere context, the integral equation governing the
distributed Eigen strain ε for a given filler particle configuration and remote strain
field ε0 can be written as
Z h i
- A : εðiÞ ðxÞ = ε0 - εT ðxÞ þ Gðx - x0 Þ : εT ðx0 Þ þ εðiÞ ðx0 Þ dx0
Ωi
Z h i
þ Gðx - x0 Þ : εT ðx0 Þ þ εðjÞ ðx0 Þ dx0 ði ≠ j, i, j = 1, 2 Þ
Ωj
ð6:105Þ
Ωj
ð6:106Þ
where
ρ3 1
G 1 xi - xj = H þ ρ2 H2 ð6:109Þ
30ð1 - v0 Þ
6.4 Pairwise Interacting Solution for Two-Phase Composites 325
ρ3 1
G 2 xi - xj = H þ 2ρ2 H2 ð6:110Þ
30ð1 - v0 Þ
In addition, 0(ρ6) denotes the terms, which are higher than the order of ρ6, where
r xi - xj, r j rj, ρ = a/r, and a is the radius of a spherical filler particle. The
components of H1 and H2 are given by Eqs. (6.40) and (6.41).
Furthermore, we observe that
ðiÞ ðjÞ
d =d =d ð6:111Þ
where
T xi - xj = 30ð1 - v0 Þ A þ S þ G1 xi - xj ð6:113Þ
T - 1 = K - 1 þ ρ3 L þ ⋯ ð6:114Þ
where
where N/V is the density of filler particles in a composite. Accordingly, the ensemble
integration of Eq. (6.118) can be written as
8 2 39
>
< Z1 Z Z1 Z >
=
N 6 27
d =- K-1 4 ρ3 H1 dAdr þ 2 ρ5 H dAdrH 5 : εT þ ε0
2
V>: >
;
2a A 2a A
2 3
Z1 Z
N6 7
- 4 ρ 6
L H1 dAdr5 : εT þ ε0 þ 0 ρ8
V
2a A
ð6:120Þ
where A denotes the spherical surface of a particle with radius r. After a lengthy
but straightforward mathematical manipulation, the final expression takes the
following form:
where
5φ 8α
γ1 = ð 13 - 14v Þv - ð 1 - 2v Þ ð 1 þ v Þ ð6:124Þ
8β2
0 0
3α þ 2β 0 0
5φ 6α
γ2 = 25 - 34v0 þ 22v0 - 2
ð1 - 2v0 Þð1 þ v0 Þ ð6:125Þ
16β2 3α þ 2β
The total stress at any point x in the matrix is given by the superposition of the
far-field stress σ0 and the perturbed stress σ′ induced by the filler particles. Assuming
the elastic Eigen strain ε in the filler particle is uniform, the perturbed stress at
matrix point x takes the form (Ju & Chen, 1994a)
σ0 ðxjx1 Þ = C0 Gðx - x1 Þ : εT þ ε ð6:126Þ
where εT is the Eigen strain caused by the CTE mismatch between the matrix and the
filler particle. Using the ensemble-volume averaged Eigen strain given in
Eq. (6.121), the stress perturbation can be given by the following relation:
σ0 ðxjx1 Þ = C0 Gðx - x1 Þ Γ : εT þ ε0 ð6:127Þ
where ε0 denotes the solution of the Eigen strain for the single inclusion problem,
which is given by Eq. (6.42). Therefore,
h i
σ0 ðxjx1 Þ = - C0 G ΓðA þ SÞ - 1
h i
: ε0 þ C0 G ΓðA þ SÞ - 1 ðA þ IÞ : εT ð6:128Þ
Following the same procedure used for the noninteracting solution, we obtain the
ensemble-averaged current stress norm at any matrix point x as:
where σT is given by Eq. (6.56), and the components of the positive definite fourth-
rank tensors T and T read:
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:130Þ
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:131Þ
with
ð3γ 1 þ 2γ 2 Þ2
3T 1 þ 2T 2 = 200ð1 - 2v0 Þ2 φ ð6:132Þ
ð3α þ 2βÞ2
1 4γ 2
T2 = þ 23 - 50v0 þ 35v20 22 φ ð6:133Þ
2 β
328 6 Thermomechanical Analysis of Particle-Filled Composites
ð3γ 1 þ 2γ 2 Þ2
3T 1 þ 2T 2 = 200ð1 - 2v0 Þ2 φ ð6:134Þ
ð3α þ 2βÞ2
4γ 2
T 2 = 23 - 50v0 þ 35v20 22 φ ð6:135Þ
β
σ = P : σ0 - Q : σT ð6:136Þ
where
with
Using Eq. (6.136), we arrive at the alternative expression for the ensemble-
averaged current stress norm in a matrix point x:
hH im ðxÞ = σ : T : σ þ 2σ : T : σT þ σT : T : σT ð6:147Þ
6.4 Pairwise Interacting Solution for Two-Phase Composites 329
where the positive definite fourth-rank tensors T, T , and T are defined as
T
T = P-1 T P-1 ð6:148Þ
T
T = P - 1 T P - 1 Q - T P - 1 ð6:149Þ
T
T = P - 1 Q T P - 1 Q þ T - 2T P - 1 Q ð6:150Þ
where the components of T, T , and T are
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:151Þ
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:152Þ
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:153Þ
with
3T 1 þ 2T 2
3T 1 þ 2T 2 = ð6:154Þ
ðaφ þ 1Þ2
T2
T2 = ð6:155Þ
ðbφ þ 1Þ2
3T 1 þ 2T 2
3T 1 þ 2T 2 = - ð6:156Þ
ðaφ þ 1Þ2
T2 1
T2 = - þ ð6:157Þ
ðbφ þ 1Þ2 2ðbφ þ 1Þ
3T 1 þ 2T 2
3T 1 þ 2T 2 = ð6:158Þ
ðaφ þ 1Þ2
T2 bφ - 1
T2 = þ ð6:159Þ
ðbφ þ 1Þ2 2ðbφ þ 1Þ
ε = ε0 þ φS : εT þ ε ð6:161Þ
With the noninteracting solution Eigen strain ε0 given by Eq. (6.42), we arrive at
h i h i
ε = I - φSΓðA þ SÞ - 1 : ε0 þ φ SΓðA þ SÞ - 1 ðA þ IÞ : εT ð6:163Þ
ð6:164Þ
Combining with Eq. (6.163), the average internal stress of filler particles can be
expressed as
h ih i-1
σ1 = C1 I - SΓðA þ SÞ - 1 I - φSΓðA þ SÞ - 1 :
h ih i-1
ε - C1 φSΓ I - SΓðA þ SÞ - 1 I - φSΓðA þ SÞ - 1 ðA þ SÞ - 1 ðA þ IÞ
h i
-1
- SΓðA þ SÞ ðA þ IÞ - I : εT = U : ε - V : εT
ð6:168Þ
6.5 Noninteracting Solution for Three-Phase Composites 331
where
h ih i-1
U = C1 I - SΓðA þ SÞ - 1 I - φSΓðA þ SÞ - 1 ð6:169Þ
h ih i-1
V = C1 φSΓ I - SΓðA þ SÞ - 1 I - φSΓðA þ SÞ - 1 ðA þ SÞ - 1 ðA þ IÞ
h i
- SΓðA þ SÞ - 1 ðA þ IÞ - I
ð6:170Þ
By carrying out the lengthy algebra, the components of the positive definite
fourth-rank tensor U and V can be obtained as
U ijkl = U 1 δij δkl þ U 2 δik δjl þ δil δjk ð6:171Þ
V ijkl = V 1 δij δkl þ V 2 δik δjl þ δil δjk ð6:172Þ
where
0 φq P2 1
15ð1 - v0 Þ
q = 1 βq
μ = μ 0 @1 þ P φ
A ð6:180Þ
1 - 2ð4 - 5v0 Þ 2q = 1 β q
q
with
k0 μ0
αq = 2ð5v0 - 1Þ þ 10ð1 - v0 Þ - ð6:181Þ
kq - k0 μq - μ0
μ0
βq = 2ð4 - 5v0 Þ þ 15ð1 - v0 Þ ð6:182Þ
μq - μ0
According to the Eshelby theory, the perturbed stress σ ′ at any point in the matrix
due to the presence of the particles and voids can be written as
Z Z
0 0
σ ð xÞ = C 0 : G ðx - x Þ : ε1 þ εT1 0
dx þ C0 : G ð x - x0 Þ
V V
0
: ε2 þ εT2 dx ð6:183Þ
6.5 Noninteracting Solution for Three-Phase Composites 333
The Eigen strain in a single ellipsoidal inclusion is uniform for the interior points
of an isolated (no interacting) inclusion. Therefore, the perturbed stress for any
matrix point x due to a typical isolated q-phase inhomogeneity centered at xq takes
the following form:
σ0 xjxq = C0 G x - xq : ε0
q þ εq
T
ð6:184Þ
where G x - xq is given by Eq. (6.39), and
-1 0 -1
ε0
q = - Aq þ S : ε þ Aq þ S ðI - SÞ : εTq ð6:185Þ
We denote by H(x| ℘) the square of the current stress norm at the local point x,
which determines the plastic strain for a given phase configuration ℘. Since there is
no plastic strain in the elastic particles or voids, H(x| ℘) can be written as
σðxj℘Þ : I D : σðxj℘Þ if x is in the matrix
H ðxj℘Þ = ð6:186Þ
0 otherwise
where r1 = j x - x1j, r2 = j x - x2j P(x1) and P(x2) denote the probability density
functions for finding a particle centered at x1 and a void centered at x2, respectively.
For simplicity, P(x1) and P(x2) are assumed to be statistically homogeneous,
isotro-
pic, and uniform. Using the properties of the fourth-order tensor G x - xq , we
obtain the ensemble-averaged current stress norm at any matrix point x as:
Z Z
N1
h H im ð xÞ = H 0 þ dr 1 ðσ0 ðxjx1 Þ : Id : σ0 ðxjx1 ÞÞdA
V
r1 > a Aðr1 Þ
Z Z ð6:188Þ
N2
þ dr 1 ðσ0 ðxjx2 Þ : Id : σ0 ðxjx2 ÞÞdA
V
r2 > a Aðr2 Þ
334 6 Thermomechanical Analysis of Particle-Filled Composites
where
σT = A1 C1 : εT ð6:190Þ
with
X2 φq
3T 1 þ 2T 2 = 200ð1 - 2v0 Þ2 q=1
2 ð6:193Þ
3αq þ 2βq
1 X 2 φq
T2 = þ 23 - 50v0 þ 35v20 ð6:194Þ
2 q=1 β2q
φ1
3T 1 þ 2T 2 = 200ð1 - 2v0 Þ2 ð6:195Þ
ð3α1 þ 2β1 Þ2
φ
T 2 = 23 - 50v0 þ 35v20 2 ð6:196Þ
β1
Using Eqs. (6.197), (6.198), and (6.185), the relation between the far-field stress
σ0 and the macroscopic stress σ is given by
σ = P : σ0 - Q : σT ð6:199Þ
where
6.5 Noninteracting Solution for Three-Phase Composites 335
X2 -1
P=I þ φ ðI - SÞ
q=1 q
Aq þ S ð6:200Þ
Q = φ1 ðI - SÞðA1 þ SÞ - 1 ð6:201Þ
where
X2
3P1 þ 2P2 = aφ
q=1 q q
þ1 ð6:204Þ
1 X2
P2 = b φ þ1
q=1 q q
ð6:205Þ
2
3Q1 þ 2Q2 = a1 φ1 ð6:206Þ
1
Q2 = b φ ð6:207Þ
2 1 1
20ð1 - 2v0 Þ
aq = ð6:208Þ
3αq þ 2βq
ð7 - 5v0 Þ
bq = ð6:209Þ
βq
Using Eqs. (6.199) and (6.189), we arrive at the alternative expression for the
ensemble-averaged current stress norm in a matrix point x as:
hH im ðxÞ = σ : T : σ þ 2σ : T : σT þ σT : T : σT ð6:210Þ
where the positive definite fourth-rank tensors T, T , and T are defined as
T
T = P-1 T P-1 ð6:211Þ
T
T = P - 1 T P - 1 Q - T P - 1 ð6:212Þ
T
T = P - 1 Q T P - 1 Q þ T - 2T P - 1 Q ð6:213Þ
where the components of T ,T , and T are
336 6 Thermomechanical Analysis of Particle-Filled Composites
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:214Þ
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:215Þ
T ijkl = T 1 δij δkl þ T 2 δik δjl þ δil δjk ð6:216Þ
with
3T þ 2T 2
3T 1 þ 2T 2 = P 1 2
ð6:217Þ
2
q = 1 a q φq þ 1
T2
T2 = P 2
ð6:218Þ
2
q = 1 bq φq þ 1
3T þ 2T 2
3T 1 þ 2T 2 = - P 1 2
ð6:219Þ
2
q = 1 a q φq þ 1
T2 1
T2 = - P 2
þ P2 ð6:220Þ
q = 1 b q φq þ1
2
q = 1 b q φq þ1 2
3T þ 2T 2
3T 1 þ 2T 2 = P 1 2
ð6:221Þ
2
q = 1 a q φq þ 1
P2
T2 q = 1 bq φ q -1
T2 = P 2
þ P2 ð6:222Þ
q = 1 b q φq þ 1
2
q = 1 bq φ q þ 1
2
X2
ε = ε0 þ φS
q=1 q
ε0
q þ εq
T
ð6:224Þ
With the noninteracting solution Eigen strain ε0 given by Eq. (6.185), we can
write
h X2 - 1i 0
ε= I- q=1
φ q S Aq þ S : ε þ φ1 SðA1 þ SÞ - 1 ðA1 þ IÞ : εT1 ð6:225Þ
With the help of Eqs. (6.227) and (6.185), Eq. (6.226) can be simplified as
= U : ε - V : εT1
ð6:229Þ
where
h X2 - 1i - 1
U = C1 A1 ðA1 þ SÞ - 1 I - q=1
φ q S Aq þ S ð6:230Þ
n
C1 A1 ðA1 þ SÞ - 1 φ1 SðA1 þ SÞ - 1 ðA1 þ IÞ½I
V= X2 -1 -1 ð6:231Þ
- φ S Aq þ S
q=1 q
þ ðI - SÞg
By carrying out the lengthy but simple algebra, the components of the positive
definite fourth-rank tensors U and V are explicitly obtained as follows:
338 6 Thermomechanical Analysis of Particle-Filled Composites
U ijkl = U 1 δij δkl þ U 2 δik δjl þ δil δjk ð6:232Þ
V ijkl = V 1 δij δkl þ V 2 δik δjl þ δil δjk ð6:233Þ
where
β1 - 2ð4 - 5v0 Þ
U2 = h P i μ1 ð6:235Þ
φ
β1 1 - 2ð4 - 5v0 Þ 2q = 1 β q
q
The composite sphere assemblage (CSA) model was first proposed by Kerner
(1956) and Van der Poel (1958) as shown in Fig. 6.3. Smith (1974, 1975),
Christensen and Lo (1979), and Hashin and his co-workers (1962, 1963, 1968,
1990, 1991a, b) improved on the CSA model. The CSA model assumes that the
filler particles are spherical and that the action on the particle is transmitted through a
spherical interphase shell. The overall macro-behavior is assumed isotropic and is
thus characterized by two effective moduli: the bulk modulus k and the shear
modulus μ. In this section, we summarize the theoretical solution for effective
thermomechanical properties of the CSA model consisting of an elastic spherical
filler particle and an elastic interphase layer with a thickness of δ. In the following
formulas, k represents the bulk modulus, μ represents the shear modulus, and α
represents the coefficient of thermal expansion. The subscripts i, f, and m refer to the
interphase, filler particle, and matrix, respectively.
The effective bulk modulus k for the CSA model as obtained by Hashin (1962) is
given below:
φ
k = ki þ ð6:238Þ
1
kf - ki þ 33kð1i þ4μ
- φÞ
i
where
340 6 Thermomechanical Analysis of Particle-Filled Composites
r 3
φ= ð6:239Þ
rþδ
The effective coefficient of thermal expansion α for the CSA model as obtained by
Levin (1967) is given as
αf - αi 1 1
α = αi þ 1 - ð6:240Þ
kf - ki
1 k ki
where
A = 8 μf =μi - 1 ð4 - 5vi Þη1 φ10=3 - 2 63 μf =μi - 1 η2 þ 2η1 η3 φ7=3
þ252½μf =μi - 1η2 φ5=3 - 50 μf =μi - 1 7 - 12vi þ 8vi η2 φ þ 4ð7 - 10vi Þη2 η3
2
ð6:242Þ
B = - 4 μf =μi - 1 ð1 - 5vi Þη1 φ10=3 þ 4 63 μf =μi - 1 η2 þ 2η1 η3 φ7=3
ð6:243Þ
- 504½μf =μi - 1η2 φ5=3 þ150 μf =μi - 1 ð3 - vi Þvi η2 φ þ 3ð15vi - 7Þη2 η3
6.6 Effective Thermomechanical Properties 341
D = 4 μf =μi - 1 ð5vi - 7Þη1 φ10=3 - 2 63 μf =μi - 1 η2 þ 2η1 η3 φ7=3
2 ð6:244Þ
þ252½μf =μi - 1η2 φ5=3 þ25 μf =μi - 1 vi - 7 η2 φ - ð7 þ 5vi Þη2 η3
with
η1 = μf =μi - 1 49 - 50vf vi þ 35 μf =μi vf - 2vi þ 35 2vf - vi ð6:245Þ
η2 = 5vf μf =μi - 8 þ 7 μf =μi þ 4 ð6:246Þ
η3 = μf =μi ð8 - 10vi Þ þ ð7 - 5vi Þ ð6:247Þ
where
h i
α = 4Pð7 - 10vi Þ - Sφ7=3 ½Q - ð8 - 10vi ÞðM - 1Þφ
2
- 126PðM - 1Þφ 1 - φ2=3 ð6:249Þ
with
M = μf =μi ð6:252Þ
P = 7 þ 5vf M þ 4 7 - 10vf ð6:253Þ
Q = ð8 - 10vi ÞM þ ð7 - 5vi Þ ð6:254Þ
S = 35 7 þ 5vf M ð1 - vi Þ - Pð7 þ 5vi Þ ð6:255Þ
Effective Young’s modulus and Poisson’s ratio for the CSA model can be calculated
from the well-known expressions:
9k μ
E = ð6:256Þ
3k þ μ
3k - 2μ
v = ð6:257Þ
6k þ 2μ
To illustrate the effect of imperfect interface and interphase thickness on the overall
effective mechanical and thermal properties, we consider a special case of CSA that
consists of spherical alumina trihydrate (ATH) particle and interphase with the
following properties:
For particles:
E f = 70 GPa
vf = 0:24
αf = 13 × 10 - 6 = ° C
vi = 0:31
αi = 70 × 10 - 6 = ° C
δ=r = 0:01 and δ=r = 0:001
15
14.5
δ/ r = 0.01
14
α*
13.5
δ/r = 0.001
13
12.5
12
100 102 104 106
Interface Parameter q
Fig. 6.4 Variation of effective CTE with the interface parameter, q, Nie (2005)
0.8
δ/r = 0.01
0.6
m*/mf
δ/ r = 0.001
0.4
0.2
0
100 102 104 106
Interface Parameter q
Fig. 6.5 Variation of effective shear modulus with the interface parameter q, Nie (2005)
344 6 Thermomechanical Analysis of Particle-Filled Composites
0.8
δ/r = 0.01
0.6
E*/Ef
0.4 δ/ r = 0.001
0.2
0
100 102 104 106
Interface Parameter q
Fig. 6.6 Variation of effective Young’s modulus with the interface parameter q, Nie (2005)
0.8
δ/r = 0.01
0.6
k*/kf
δ/ r = 0.001
0.4
0.2
0
100 102 104 106
Interface Parameter q
Fig. 6.7 Variation of effective bulk modulus with the interface parameter q, Nie (2005)
6.7 Micromechanical Constitutive Model of the Particulate Composite 345
0.4
0.35
0.3
ν* δ/r = 0.01
0.25
0.2
δ/r = 0.001
0.15
0.1
0.05
0
100 102 104 106
Interface Parameter q
Fig. 6.8 Variation of effective Poisson’s ratio with the interface parameter q, Nie (2005)
modulus, Young’s modulus, and bulk modulus. As the interface becomes thinner for
the same Ei value, the effective shear modulus increases. On the other hand, for the
same interphase thickness, as the elastic modulus value Ei of the interphase
decreases, the effective shear modulus decreases. This is also true for both the
effective bulk modulus and effective Young’s modulus. Figure 6.8 shows the effect
of the interphase bond modulus and thickness on effective Poisson’s ratio. Effective
Poisson’s ratio seems to be independent of the interphase stiffness when q ≤ 102 and
q ≥ 104.
Analytical and numerical evaluation of the effective elastic properties and the
thermal expansion coefficients of the composite sphere assemblage show that the
bulk and shear moduli are insensitive to the value of Poisson’s ratio of interphase.
Numerical results show that the nature of the interphase has a significant effect on the
stiffness and the thermal expansion coefficient of the CSA.
The nature of the bond between particles and the matrix has a significant effect on the
mechanical behavior of particulate composites. Most analytical and numerical
models assume that the bond between the filler and matrix is perfect and can be
modeled using the continuity of tractions and displacements across the interface.
346 6 Thermomechanical Analysis of Particle-Filled Composites
Based on Ju and Chen’s (1994a, b) solution for noninteracting particles, the effective
properties of a two-phase composite with imperfect bonding can be obtained by
substituting the properties of the particle with the properties of composite sphere
assemblage:
( )
3ð1 - vm Þðk - k m Þφf
k = km 1þ ð6:258Þ
3ð1 - vm Þkm þ 1 - φf ð1 þ vm Þðk - km Þ
( )
15ð1 - vm Þðμ - μm Þφf
μ = μm 1þ ð6:259Þ
15ð1 - vm Þμm þ 1 - φf ð8 - 10vm Þðμ - μm Þ
Similarly, the pairwise interacting solution for the effective properties of the
two-phase composite with the imperfect bond can be given by
6.7 Micromechanical Constitutive Model of the Particulate Composite 349
30ð1 - vm Þð3γ 1 þ 2γ 2 Þφf
k = km 1þ ð6:260Þ
3β1 þ 2β2 - 10ð1 þ vm Þð3γ 1 þ 2γ 2 Þφf
30ð1 - vm Þγ 2 φf
μ = μm 1 þ ð6:261Þ
β2 - 4ð4 - 5vm Þγ 2 φf
where
k μ
β1 = 2ð5vm - 1Þ þ 10ð1 - vm Þ m - m ð6:262Þ
k - k m μ - μm
μm
β2 = 2ð4 - 5vm Þ þ 15ð1 - vm Þ ð6:263Þ
μ - μm
and
5φf 8β1
γ1 = ð13 - 14vm Þvm - ð1 - 2vm Þð1 þ vm Þ ð6:264Þ
8β2 2 3β1 þ 2β2
1 5φf 6β1
γ2 = þ 25 - 34vm þ 22vm -
2
ð1 - 2vm Þð1 þ vm Þ ð6:265Þ
2 16β2 2 3β1 þ 2β2
where k and μ are the effective bulk modulus and shear modulus of the composite
spherical-filler assemblage and φf is the filler particle volume fraction.
Example
Nie (2005) studied lightly cross-linked poly-methyl methacrylate (PMMA) filled
with alumina trihydrate (ATH). The following phase properties were used for the
numerical analysis:
Particle: Ef = 70GPa, vf = 0.24
Matrix: Em = 3.5GPa, vm = 0.31
The particle volume fraction is φf = 0.48
Nondimensional interface parameter is defined as q = Ef/Ei.
Three kinds of composites with different interfacial properties were prepared in
the lab. Composite A has the strongest interfacial adhesion due to the addition of a
special adhesion-promoting additive, where the value of the nondimensional inter-
face parameter q = Ef/Ei can be assumed one. The interfacial adhesion strength of
composite C is the weakest due to the addition of a debonding promoting additive.
The value of the nondimensional interface parameter q for composite C can be
assumed to be a very large number. The interfacial strength of composite B is
moderate. The value of nondimensional interface parameter q for composite B can
be assumed to be some value between that for composite A and composite C.
350 6 Thermomechanical Analysis of Particle-Filled Composites
12
Composite A [Test Data]
4 Noninteracting Solution
0
100 101 102 103 104 105 106
Interface Parameter q
Fig. 6.13 Effective Young’s modulus vs. interface parameter at a volume fraction of 48% (the
thickness of interphase is 1/10 of the diameter of particles), Nie (2005)
Viscoplastic flow occurs only in the matrix because filler particles have a very high
melting temperature, and they are brittle and deform linear elastic. The magnitude of
the current equivalent stress norm of the matrix can be used to determine viscoplastic
behavior. When the ensemble-volume average stress norm in the matrix reaches a
certain level, the effective yield function for the composite can be given by
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
f ðσ, αÞ = σ - σ T : T : σ - σT
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi ð6:266Þ
- T 1 þ 2T 2 σ y ðαÞ
6.7 Micromechanical Constitutive Model of the Particulate Composite 351
12
Composite A [Test Data]
Composite B [Test Data]
Effective Young’s Modulus (Gpa) 10
Pairwise Interacting Solution
6
Noninteracting Solution
0
100 101 102 103 104 105 106
Interface Parameter q
Fig. 6.14 Effective Young’s modulus vs. interface parameter at a volume fraction of 48% (the
thickness of interphase is 1/100 of the diameter of particles), Nie (2005)
12
Composite A [Test Data]
Composite B [Test Data]
Effective Young’s Modulus (Gpa)
10
Pairwise Interacting Solution
Noninteracting Solution
6
4
Composite C [Test Data]
0
100 101 102 103 104 105 106
Interface Parameter q
Fig. 6.15 Effective Young’s modulus vs. interface parameter at a volume fraction of 48% (the
thickness of interphase is 1/1000 of the diameter of particles), Nie (2005)
352 6 Thermomechanical Analysis of Particle-Filled Composites
where σ is the average stress in the particulate composite; σT is the stress caused by
the CTE mismatch between the matrix and the particle which was derived earlier in
Eq. (6.190); T is the fourth-order tensor which is given by Eq. (6.211); T1 and T2 are
given by Eqs. (6.217) and (6.218), respectively; σ y ðαÞ is the isotropic hardening
function of the composite materials; and α is the equivalent viscoplastic strain that
defines the isotropic hardening of the composite. Its rate is given by
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiqffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
-1
α_ = T 1 þ 2T 2 ε_ : T : ε_
vp vp
ð6:267Þ
The factors in the effective yield and the effective plastic strain increment
equations are chosen so that the effective stress and effective plastic strain incre-
ments are equal to the uniaxial stress and uniaxial plastic strain increment in a
uniaxial monotonic tensile test. It should be noted that the effective yield function
is spherical pressure-dependent now and not of the von Mises type anymore. So the
particles and hydrostatic pressure have a significant effect on the viscoplastic
behavior of the matrix.
Unified mechanics theory provides a basic framework to introduce degradation
evolution intrinsically without an empirical evolution function. According to the
strain equivalence principle, the effective yield stress function can be given by
rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
σ σ pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
f ðσ, αÞ = - σT : T : - σT - T 1 þ 2T 2 σ y ðαÞ ð6:268Þ
1-Φ 1-Φ
The degradation increases the equivalent stress norm of the composite, which
tends to amplify the viscoplastic flow.
If the kinematic hardening behavior is included, let β define the center of the yield
surface
of the composite in the stress space, and the relative stress can be defined as
σ - β . Assume the filler particles have the same effect on the stress norm of the
matrix as on the kinematic behavior of the matrix; the stress norm defining the
viscoplastic behavior of the matrix can be updated as
hH im ðxÞ = σ - β - σT : T : σ - β - σT ð6:269Þ
Therefore, the effective yield function for the composite can be given as
rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
σ σ
f ðσ, qÞ = - β - σT : T : - β - σT
1-Φ 1-Φ
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
- T 1 þ 2T 2 σ y þ K ðαÞ ð6:270Þ
where q = α, β is chosen as the internal viscoplastic variable.
6.7 Micromechanical Constitutive Model of the Particulate Composite 353
∂f 1
ε_ vp = γ = γn ð6:271Þ
∂σ 1 - Φ
where
T : 1 -σ Φ - β - σT
n = qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
ð6:272Þ
σ
1-Φ - β - σ
T
: T : 1 -σ Φ - β - σT
hf i hf i
γ= = ð6:273Þ
η 2μτ
where H′ is the kinematic hardening modulus. The factor in Eq. (6.276) is chosen so
that the effective stress and effective viscoplastic strain increments are equal to the
uniaxial stress and uniaxial plastic strain increment in a uniaxial monotonic
ensile test.
354 6 Thermomechanical Analysis of Particle-Filled Composites
Φ = Φcr 1 - e - R Δs
ms
ð6:277Þ
where Φcr is the critical value of the TSI; however, this term can also be used as a
function to map elastic modulus degradation, onto the TSI axis [however, using Φcr
is not necessary, and can be taken as one, especially if all entropy generation
mechanisms are included in the fundamental equation. In the other examples in
this book, it is taken as 1], and R is gas constant, ms is molar mass, and Δs is the
change in entropy and must be calculated from the thermodynamic fundamental
equation at each Gauss integration point. For thermomechanical analysis of partic-
ulate composites, the following thermodynamic fundamental equation represents the
majority, but not all, of the entropy generation mechanisms:
Z Z t0 Z t
σ : ε_
t vp
k 2 r
Δs = dt þ jgradT j dt þ dt ð6:278Þ
t0 Tρ t T ρ
2
t0 T
1
Δεvp = - C - 1 Δσ ð6:280Þ
1-Φ
Assuming that irreversible entropy generation is only due to plastic work and heat
generation, from Eq. (6.278) for an isothermal process at each increment, we can
write
σ : ε_
vp
r
Δ_s = þ ð6:282Þ
Tρ T
If the heat generated within the system is negligible, the distributed internal heat
source of strength per unit mass is zero [which is not true, especially if the rate of
loading is high]. Using Eqs. (6.280) and (6.282), Eq. (6.281) becomes
1 Φcr ms m
ΔΦ = - exp - s Δs σC - 1 Δσ ð6:283Þ
1 - Φ TρR R
We should point out that in Eq. (6.282) we ignored many prominent irreversible
entropy generation mechanisms, such as the relative motion between the filler
particles and the matrix, the entropy generation in the filler polycrystals, and aging
in PMMA molecular chains. Moreover, if the strain rate of the loading is very high,
there will be additional entropy generation mechanisms. However, here the goal is to
demonstrate the process as simple as possible.
From Eqs. (6.271), (6.274), and (6.276), we also have
1
εvp
nþ1 = εn þ
vp
Δγ n ð6:284Þ
1-Φ
1 pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
αnþ1 = αn þ T 1 þ 2T 2 Δγ ð6:285Þ
1-Φ
1:5
T 1 þ 2T 2
βnþ1 = βn þ 2 2
ΔHn ð6:286Þ
T 1 þ 2T 2 þ 2T 1
Let
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
m1 = T 1 þ 2T 2 ð6:287Þ
1:5
T 1 þ 2T 2
m2 = 2 2
ð6:288Þ
T 1 þ 2T 2 þ 2T 1
356 6 Thermomechanical Analysis of Particle-Filled Composites
Φcr ms m
m3 = - exp - s Δs ð6:289Þ
TρR R
Also, we have
∂f 1
= n ð6:290Þ
∂σ 1 - Φ
2
∂ f 1
= N ð6:291Þ
∂σ 2
ð1 - ΦÞ2
∂f
= -n ð6:292Þ
∂β
2
∂ f
2
=N ð6:293Þ
∂β
∂f n:σ
= ð6:294Þ
∂D ð1 - ΦÞ2
∂f
= - m1 K 0 ð6:295Þ
∂α
∂n
= -N ð6:296Þ
∂β
∂n 1
= N ð6:297Þ
∂σ 1 - Φ
∂n 1
= N:σ ð6:298Þ
∂D ð1 - ΦÞ2
∂σ y
where K 0 = ∂α
is the isotropic hardening modulus.
N
T-n n
N = qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
σ ffi ð6:299Þ
σ
1-Φ - β - σ : T 1 - Φ - β - σT
T
m1
Rα = - αnþ1 þ αn þ Δγ ð6:300Þ
1-Φ
2μτΔγ
Rγ = - hf i ð6:301Þ
Δt
1
Rε = - εvp
nþ1 þ εn þ 1 - Φ Δγn
vp
ð6:302Þ
m1 m1 m3
Rα - Δαnþ1 þ Δ2 γ þ ΔγσC - 1 Δσ = 0 ð6:304Þ
1-Φ ð1 - ΦÞ 3
2 0
0 0Rα m1 K τ
n Δσ - nΔβ = Rf þ m1 K þ þ 2μ Δ2 γ ð6:305Þ
1-Φ Δt
Δ2 γ
P : Δσ - Q : Δβ þ n þ Rε = 0 ð6:306Þ
1-Φ
m1 m2 0Δ2
R : Δσ - S : Δβ þ H γn þ m2 H 0 Rα n þ Rβ = 0 ð6:307Þ
1-Φ
where
" #
1 Δγ Δγm3 Nσ O
-1 -1
P= C þ Nþ þn σC ð6:308Þ
1-Φ 1-Φ ð1 - ΦÞ2 1 - Φ
Δγ
Q= N ð6:309Þ
1-Φ
" #
m2 m3 O
0 -1
R= ΔHN þ ðΔHNσ þ m1 H ΔγnÞ σC ð6:310Þ
1-Φ ð1 - ΦÞ2
S = I þ m2 ΔHN ð6:311Þ
" #
1 m3
n0 = nþ nσ - m1 2 K 0 Δγ σC - 1 ð6:312Þ
1-Φ ð1 - ΦÞ 2
h in o n o n o
Δσ Δ2 γ
P Q
þ n m m1 H 0 þ m H 0 RRεnþR = 0 ð6:313Þ
R S - Δβ 1-Φ 1 2 2 α β
where
h i
A-1 = P
R
Q
S
ð6:315Þ
h~ ~
i
A= P
~
R
Q
~
S
ð6:316Þ
358 6 Thermomechanical Analysis of Particle-Filled Composites
~ = P - Q S-1 R -1
P ð6:317Þ
~ = - P - Q S-1 R -1 Q S-1
Q ð6:318Þ
~ = - S-1 R P - Q S-1 R -1
R ð6:319Þ
~ = S-1 þ S-1 R P - Q S-1 R -1 Q S-1
S ð6:320Þ
So we have
8 9
n o >
< Δ2 γ >
=
nþ Rε
Δσ
= -A 1 - Φ ð6:321Þ
> m1 Δ γ
2
- Δβ
: þ Rα m2 H 0 nþRβ >
;
1-Φ
1
Δεvp = - C - 1 Δσ ð6:322Þ
1-Φ
σΔεvp
Δ2 s = ð6:323Þ
Tρ
Δsnþ1 = Δsn þ Δ2 s ð6:324Þ
ΔΦnþ1 = - m3 ðσ nþ1 Δε Þ vp
ð6:325Þ
m1 m1 m3
Δαnþ1 = Rα þ Δ2 γ - ΔγσΔεvp ð6:326Þ
1-Φ ð1 - ΦÞ2
Then, the viscoplastic strain, consistency parameter, stress, TSI and entropy
production rate, and effective equivalent viscoplastic strain are updated at each
Gauss integration point and iterated until the norms of residual functions
(Eqs. 6.300–6.303) are smaller than a predefined tolerance. Normally, the conver-
gence is achieved when the norms of these residual functions are all equal to or less
than 1 × 10-5. The value of the tolerance is problem-dependent.
The procedure summarized above is simply a systematic application of Newton’s
method to the system of Eqs. (6.270)–(6.278) that results in the computation of the
closest point projection from the trial state onto the yield surface.
It should be noted that the general return mapping algorithm is unconditionally
stable, and the convergence of the algorithms toward the final value of the state
variable is obtained at a quadratic rate. Further information on the general return
mapping algorithm is given in Simo and Hughes (1998) and Ortiz and
Martin (1989).
It is necessary to point out that the formulation presented above calculates the TSI
at Gauss integration points. In the finite element formulation, it is possible to define
TSI at nodal points and solve for TSI and other nodal unknowns. However, this
would significantly increase the computational cost. It is much more cost-effective to
calculate TSI at Gauss integration points from thermodynamic variables obtained in
the previous step. Because the solution process is incremental, this simplification
introduces very little error.
6.7 Micromechanical Constitutive Model of the Particulate Composite 359
An important advantage of the algorithm lies in the fact that it can be exactly
linearized in closed form. This leads to the notion of consistent elastic-viscoplastic
tangent moduli.
Let C be the consistent elastic-viscoplastic tangent moduli, and then the incre-
mental stress-strain relationship based on the third law of the unified mechanics
theory can be given by
dΦ = - m3 σdεvp ð6:328Þ
Δγ Δγ dΔγ
dεvp = WNdσ - WNdβ þ Wn ð6:329Þ
ð1 - Φ Þ2 1-Φ 1-Φ
where
Δγm3 Nσ O
W-1 = I þ 2 1-Φ
þn σ ð6:330Þ
ð1 - ΦÞ
m1 m1 m3
dα = dΔγ - Δγσdεvp ð6:331Þ
1-Φ ð1 - ΦÞ2
m2 m3 ΔH O m ΔH
dβ = - Nσ σ dεvp þ 2 Ndσ - m2 ΔHNdβ
ð1 - ΦÞ 2 1-Φ
þ m2 nH 0 dα ð6:332Þ
1 m3
ndσ - ndβ - nσ - m1 2 K 0 Δγ σdεvp
1-Φ ð1 - ΦÞ 2
τ m 2K0
= 2μ þ 1 dΔγ ð6:333Þ
Δt 1 - Φ
Wn
p : dσ - q : dβ þ dΔγ = dε ð6:334Þ
1-Φ
where
1 Δγ
p= C-1 þ WN ð6:335Þ
1-Φ 1-Φ
Δγ
q= WN ð6:336Þ
1-Φ
m1 m2 H 0
r : dσ - s : dβ þ WndΔγ = 0 ð6:337Þ
1-Φ
where
1 Δγ
r= m ΔHN þ VWN ð6:338Þ
1-Φ 2 1-Φ
Δγ
s = I þ m2 ΔHN þ VWN ð6:339Þ
1-Φ
h O i
m mm Nσ
V = - 1 2 32 H 0 Δγ þn σ ð6:340Þ
ð1 - ΦÞ 1-Φ
n
dσ - n dβ = m4 dΔγ ð6:342Þ
1-Φ
where
6.8 Verification Examples 361
m3 Δγ
n = n - 3
nσ - m1 2 K 0 Δγ ðσWNÞ ð6:343Þ
ð1 - ΦÞ
m3 τ m 2K0
m4 = nσ - m1 2 K 0 Δγ ðσWnÞ þ 2μ þ 1 ð6:344Þ
ð1 - ΦÞ 3 Δt 1 - Φ
where
h i
a-1 = p
r
q
s
ð6:346Þ
h i
~ ~
a= p
~r
q
~s
ð6:347Þ
1
~dε -
dσ = p ~ þ ~r WndΔγ
p ð6:348Þ
1-Φ
0
-50 0 50 100 150 200
Temperature
The interphase around the particle is also regarded as an isotropic elastic material. In
addition, it is reasonable to assume that Poisson’s ratio and CTE of the interphase are
the same as that of the matrix (PMMA); however, Young’s modulus of the inter-
phase is less than that of PMMA. For simulations, the thickness of the interphase is
taken as 1% of the diameter of filler particles; Young’s modulus is half of that of
PMMA, except where specified differently.
Poisson’s ratio
0.3
0.25
0.2
0 20 40 60 80 100
Temperature
6.E-05
CTE
4.E-05
2.E-05
0.E+00
0 20 40 60 80 100 120 140
Temperature
ν = 0:008 T þ 0:334
α = 3:035 × 10 - 7 T þ 2:347 × 10 - 5
When T ≥ 90 ° C:
α = 1:0992 × 10 - 6 T - 5:012 × 10 - 5
364 6 Thermomechanical Analysis of Particle-Filled Composites
0.8
0.4
0
0 20 40 60 80 100 120
Temperature
Viscosity η is the ratio of the loss modulus to the angular frequency, which is
determined during the forced harmonic oscillation test (Nielsen & Landel, 1994).
The viscosity relaxation time τ is defined by Simo and Hughes (1998):
η
τ=
2μ
where μ is the shear modulus. It is important to realize that the controlling factor in
the relaxation process is the relative time t/τ. The absolute clock time t is regarded as
short or long only when compared with τ. The concept of relaxation time is
explained in greater detail by Simo and Hughes (1998). From dynamic mechanical
testing, the viscosity relaxation time τ is determined as shown in Fig. 6.19.
When T ≤ 90 ° C:
When T ≥ 90 ° C:
The elastic modulus of composite A can be determined from the properties of the
filler particle, the matrix, and the interphase according to the micromechanical model
presented earlier in this chapter. The elastic modulus of composite A is also
determined by uniaxial tensile tests for strain rates and temperatures as shown in
Fig. 6.20:
0
0 50 100 150
Temperature
40
20
0
0 20 40 60 80 100
Temperature
where T is the temperature in Celsius. Φcr is not an essential parameter. As long as all
entropy generation mechanisms are included in the thermodynamic fundamental
equation and no simplifications are made in derivation, if not used, nothing in the
formulation changes. In this specific example [some entropy generation mecha-
nisms, such as polymer aging and heat generation, are not included], an Φcr is
determined using the statistical methods by matching the elastic modulus degrada-
tion determined from test data with the thermodynamic state index (TSI) evolution
calculated from the thermodynamic fundamental equation during uniaxial tensile
tests. This process allows us to map elastic modulus degradation onto TSI axis. If
Φcr = 1 is used, nothing changes in the formulation or the results. In the unified
mechanics theory, we only care about the dissipation along the thermodynamic state
index axis, which provides the entropy generation. The system will always have
many entropy generation mechanisms and degrade along the TSI axis, and the
entropy generation rate eventually always goes to zero.
366 6 Thermomechanical Analysis of Particle-Filled Composites
The stress-strain response obtained from the simulations was compared with exper-
imental data as shown in Figs. 6.22 and 6.23 at 24 and 75 °C, respectively. These
simulation results are obtained based on the assumption that the thickness of
interphase is 1% of the diameter of the particle and the elastic modulus of interphase
is 50% of that of PMMA. Furthermore, the CTE mismatch effects are also included,
where the temperature when residual stresses begin to build up is assumed to be
100 °C. It is seen that the degradation effects must be accounted for at relatively large
strains.
The comparison of the TSI from the simulations with elastic modulus degradation
that was measured during experiments is given in Fig. 6.22.
Mapping the elastic modulus degradation onto the TSI axis is not necessary if the
user is not seeking the exact stress-strain data cycle to cycle from simulations.
However, if it is needed to map the elastic modulus degradation onto the TSI axis,
the following process can be used. However, according to UMT, only the degrada-
tion along the TSI axis is what matters not the degradation along the elastic modulus
or any other material property that is a human construct.
plasticity model
20
Experimental data
10
0
0.00 0.005 0.01 0.015 0.02
Strain
6.8 Verification Examples 367
0.20
T.S.I
0.16
0.12
0.08
0.04
0.00
0.000 0.002 0.004 0.006 0.008
Strain
0
0 10 20 30 40 50
Time (s)
-0.5
-1
The critical TSI Φcr is temperature-dependent and can be determined from the test
data as follows: First, determine the elastic modulus degradation in cyclic tests.
Second, calculate the dissipated plastic strain energy during cyclic testing,
(Fig. 6.24) using the stress-strain hysteresis loop, (Fig. 6.25) and then determine
the TSI. This process is also equivalent to calculating the fatigue fracture entropy of
the material. From cyclic test data, Φcr can be determined as a function of temper-
ature as follows:
368 6 Thermomechanical Analysis of Particle-Filled Composites
Stress (MPa)
simulation vs. experimental Test
40
data for cycle 5 at room Simulation
temperature with a strain
amplitude of 0.6% 20
0
-0.009 -0.006 -0.003 0.000 0.003 0.006 0.009
-20 Strain
-40
-60
-80
0
-0.009 -0.006 -0.003 0.000 0.003 0.006 0.009
-20 Strain
-40
-60
-80
Φcr (T) is not necessary, if all material properties are defined as a function of
temperature and all entropy generation mechanisms are formulated properly in the
thermodynamic fundamental equation, because the entropy generation is always
according to the second law of thermodynamics and defined by the TSI coordinate.
Other examples of thermomechanical loading applications where Φcr = 1 are given
by Bin Jamal et al. (2020, 2021a, b, c, 2022), Lee and Basaran (2021a, b), and Lee
et al. (2022, b, c, d) (Figs. 6.26, 6.27, 6.28, 6.29, 6.30, 6.31, and 6.32).
6.8 Verification Examples 369
Stress (MPa)
simulation vs. experimental 40
data for cycle 104 at room
Test
temperature with a strain Simulation
amplitude of 0.6% 20
0
-0.010 -0.005 0.000 0.005 0.010
-20
Strain
-40
-60
-80
0
-0.009 -0.006 -0.003 0.000 0.003 0.006 0.009
-10 Strain
-20
-30
-40
0
-0.009 -0.006 -0.003 0.000 0.003 0.006 0.009
-10 Strain
-20
-30
-40
370 6 Thermomechanical Analysis of Particle-Filled Composites
Stress (MPa)
simulation vs. experimental Test 20
data for cycle 495 at 75 °C
with a strain amplitude of Simulation
0.6% 10
0
-0.009 -0.006 -0.003 0.000 0.003 0.006 0.009
-10 Strain
-20
-30
-40
0.15
0.1 Simulation
0
0 30 60 90 120
Number of Cycles
0.2
Simulation
0.1
Experimental Measurement
0
0 100 200 300 400 500 600
Number of Cycles
References 371
References
Basaran, C., & Nie, S. (2004, July). An irreversible thermodynamic theory for damage mechanics of
solids. International Journal of Damage Mechanics, 13(3), 205–224. #41.
Basaran, C., & Nie, S. (2007). A thermodynamics based damage mechanics model for particulate
composites. International Journal of Solids and Structures, 44, 1099–1114. #63.
Bin Jamal, M. N., Kumar, A., Lakshmana Rao, C., & Basaran, C. (2020). Low cycle fatigue life
prediction using unified mechanics theory in Ti-6Al-4V alloys. Entropy, 22, 24.
Bin Jamal, N., Rao, L., & Basaran, C. (2021a). A semi-infinite edge dislocation model for the
proportionality limit stress of metals under high strain rate. Computational Mechanics, 68(3),
545–565. https://doi.org/10.1007/s00466-020-01959-2
Bin Jamal, N., Rao, L., & Basaran, C. (2021b). A unified mechanics theory-based model for
temperature and strain rate dependent proportionality limit stress of mild steel. Mechanics of
Materials, 155, 103762.
Bin Jamal, N., Lee, H. W., Rao, L., & Basaran, C. (2021c). Dynamic equilibrium equations in
unified mechanics theory. Applied Mechanics, 2(1), 63–80.
Bin Jamal, N. M., Rao, L., & Basaran, C. (2022). Unified mechanics theory based flow stress model
for the rate-dependent behavior of bcc metals. Materials Today Communications, 31, 103707.
Cheng, W.-M., Miller, G. A., Manson, J. A., Hertzberg, R. W., & Sperling, L. H. (1990a).
Mechanical behavior of poly(methyl methacrylate)–Part 1: Tensile strength and fracture tough-
ness. Journal of Materials Science, 25, 1917–1923.
Cheng, W.-M., Miller, G. A., Manson, J. A., Hertzberg, R. W., & Sperling, L. H. (1990b).
Mechanical behavior of poly(methyl methacrylate)–Part 2: Temperature and frequency effects
on the fatigue crack propagation behavior. Journal of Materials Science, 25, 1924–1930.
Cheng, W.-M., Miller, G. A., Manson, J. A., Hertzberg, R. W., & Sperling, L. H. (1990c).
Mechanical behavior of poly(methyl methacrylate)–Part 3: Activation processes for fracture
mechanism. Journal of Materials Science, 25, 1931–1938.
Christensen, R. M., & Lo, K. H. (1979). Solutions for effective shear properties in three-phase
sphere and cylinder models. Journal of the Mechanics and Physics of Solids, 27, 315–330.
Eshelby, J. D. (1957). The deformation of the elastic field of an ellipsoidal inclusion, and related
problems. Proceedings of the Royal Society of London, Series A, Mathematical Science,
241(1226), 376–396.
Hashin, Z. (1962). The elastic moduli of heterogeneous materials. Transaction of the ASME,
Journal of Applied Mechanics, 29, 143–150.
Hashin, Z. (1968). Assessment of the self consistent scheme approximation: Conductivity of
particulate composites. Journal of Composite Materials, 2(3), 284–301.
Hashin, Z. (1990). Thermoelastic properties and conductivity of carbon/carbon fiber composites.
Mechanics of Materials, 8, 293–308.
Hashin, Z. (1991a). The spherical inclusion with imperfect interface. Transaction of the ASME,
Journal of Applied Mechanics, 58, 444–449.
Hashin, Z. (1991b). Thermoelastic properties of particulate composites with imperfect interface.
Journal of the Mechanics and Physics of Solids, 39(6), 745–762.
Hashin, Z., & Shtrikman, S. (1963). A variational approach to the theory of the elastic behavior of
multiphase materials. Journal of the Mechanics and Physics of Solids, 11, 127–140.
Ju, J. W., & Chen, T. M. (1994a). Micromechanics and effective moduli of elastic composites
containing randomly dispersed ellipsoidal inhomogeneities. Acta Mechanica, 103, 103–121.
372 6 Thermomechanical Analysis of Particle-Filled Composites
Ju, J. W., & Chen, T. M. (1994b). Effective elastic moduli of two-phase composites containing
randomly dispersed spherical inhomogeneities. Acta Mechanica, 103, 123–144.
Ju, J. W., & Tseng, K. H. (1996). Effective elastoplastic behavior of two-phase ductile matrix
composites: A micromechanical framework. International Journal of Solids & Structures,
33(29), 4267–4291.
Ju, J. W., & Tseng, K. H. (1997). Effective elastoplastic algorithms for ductile matrix composites.
Journal of Engineering Mechanics, 123(3), 260–266.
Kerner, E. H. (1956). The elastic and thermoelastic properties of composite media. The Proceedings
of Physical Society, 69B, 808–813.
Lee, H. W., & Basaran, C. (2021a). A review of damage, void evolution and fatigue life prediction
models. Metals, 11, 609.
Lee, H. W., & Basaran, C. (2021b). Predicting high cycle fatigue life with unified mechanics theory.
Mechanics of Materials, 104, 116, ISSN 0167-6636.
Lee, H. W., Basaran, C., Egner, H., Lipski, A., Piotrowski, M., Mroziński, S., Noushad Bin Jamal,
M., & Rao, C. L. (2022a). Modeling ultrasonic vibration fatigue with unified mechanics theory.
International Journal of Solids and Structures, 236–237, 111313.
Lee, H. W., & Basaran, C. (2022). Predicting high cycle fatigue life with unified mechanics theory.
Mechanics of Materials, 164, 104116.
Lee, H. W., Fakhri, H., Ranade, R., Basaran, C., Egner, H., Lipski, A., Piotrowski, M., &
Mroziński, S. (2022c). Modeling corrosion fatigue with unified mechanics theory. Material &
Design, 224, 111383.
Lee, H. W., Fakhri, H., Ranade, R., Basaran, C., Egner, H., Lipski, A., Piotrowski, M., &
Mroziński, S. (2022d). Modeling fatigue of pre-corroded bcc metals with unified mechanics
theory. Materials & Design, 111383. https://doi.org/10.1016/j.matdes.2022.111383
Levin, V. M. (1967). Thermal expansion coefficients of heterogeneous materials. Mechanics of
Solids, 2(1), 58–94.
Mura, T. (1987). Mechanics of elastic and inelastic solids: Micromechanics of defects in solids (2nd
ed.). Martinus Nijhoff Publishers.
Nie, S., & Basaran, C. (2005). A micromechanical model for effective elastic properties of
particulate composites with imperfect interfacial bonds. International Journal of Solids &
Structures., 42(14), 4179–4191. #62.
Nie, S., Basaran, C., Hutchins, S., & Ergun, H. (2006). Failure mechanisms in PMMA/ATH acrylic
casting dispersion. Journal of Mechanical Behavior of Materials, 17(2), 79–95. #39.
Nielsen, L. E., & Landel, R. F. (1994). Mechanical properties of polymers and composites (2nd
ed.). Marcel Dekker, Inc.
Ortiz, M., & Martin, J. E. (1989). Symmetry-preserving return-mapping algorithms and incremen-
tally extremal paths: A unification of concepts. International Journal for Numerical Methods in
Engineering, 28, 1839–1853.
Prager, W. (1956) A New Method of Analyzing Stresses and Strains in Work-Hardening Plastic
Solids. Journal of Applied Mechanics, 23, 493–496.
Simo, J. C., & Hughes, T. J. R. (1998). Interdisciplinary applied mathematics, mechanics and
materials, computational inelasticity. Springer.
Simo, J. C., & Taylor, R. L. (1985). Consistent tangent operators for rate-independent
elastoplasticity. Computer Methods in Applied Mechanics and Engineering, 48, 101–119.
Smith, J. C. (1974). Correction and extension of van der Poel’s method for calculating the shear
modulus of a particulate composite. Journal of Research of the National Bureau of Standards-A.
Physics and Chemistry, 78A(3), 355–361.
References 373
Smith, J. C. (1975). Simplification of van der Poel’s formula for the shear modulus of a particulate
composite. Journal of Research of the National Bureau of Standards-A. Physics and Chemistry,
79A(2), 419–423.
Van der Poel, C. (1958). On the rheology of concentrated suspensions. Rheologica Acta, 1, 198.
Walpole, L. J. (1966). On bounds for the overall elastic moduli of inhomogeneous systems-I.
Journal of the Mechanics and Physics of Solids, 14, 151.
Ziegler, H. (1959). A Modification of Prager’s Hardening Rule. Quarterly of Applied Mathematics,
17, 55–65.
Chapter 7
Unified Micromechanics of Finite
Deformations
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2022 375
C. Basaran, Introduction to Unified Mechanics Theory with Applications,
https://doi.org/10.1007/978-3-031-18621-9_7
376 7 Unified Micromechanics of Finite Deformations
x = χðr, t Þ ð7:1Þ
∂x
F = ∇xðχ Þ = ð7:2Þ
∂r
J = detðFÞ > 0 ð7:3Þ
~vðx, t Þ = x_ ðx, t Þ ð7:4Þ
∂x_
L = gradð~vÞ = ð7:5Þ
∂r
F_ = L F ð7:6Þ
F = Fe Fp ; J = J e J p ð7:7Þ
J e = detðFe Þ > 0 ð7:8Þ
J = detðF Þ > 0
p p
ð7:9Þ
symmetric ðLÞ = D; asymmetricðLÞ = W; L = D þ W ð7:10Þ
7.1 Introduction to Finite Deformations 377
Using Eq. (7.7), the elastic and plastic part of the spatial gradient of velocity
tensor L can be defined as
-1
L = Le þ Fe Lp Fe ð7:11Þ
where
Le = F_ Fe
e -1
ð7:12Þ
Lp = F F _ p p-1 ð7:13Þ
Stretching rate tensor D and spin tensor W for elastic and plastic parts can be
derived similarly from Eq. (7.10), where superscripts “e” and “p” represent elastic
and plastic parts of corresponding quantity, respectively:
Right stretch tensor U and left stretch tensors V and rotation tensor R can be
found from the right and left polar decompositions of the deformation gradient
tensor F as follows:
F = RU ð7:16Þ
F = VR ð7:17Þ
U and V stretch tensors are positive definite symmetric tensors and R are proper
orthogonal tensors. Cauchy tensor C and Almansi tensor B can be formulated as
follows:
C = FT F = UU ð7:18Þ
B - 1 = F - TF - 1 = V - 1V - 1 ð7:19Þ
In the same fashion, the following relations can be written for elastic and plastic
parts of the deformation gradient tensor F:
Fe = Re Ue ð7:20Þ
F =V R
e e e
ð7:21Þ
T
Ce = Fe Fe = Ue Ue ð7:22Þ
-1 -T -1 -1 -1
Be = Fe Fe = Ve Ve ð7:23Þ
378 7 Unified Micromechanics of Finite Deformations
Fp = Rp Up ð7:24Þ
F =V R
p p p
ð7:25Þ
T
Cp = Fp Fp = Up Up ð7:26Þ
-1 -T -1 -1 -1
Bp = Fp Fp = Vp Vp ð7:27Þ
Wp = 0 ð7:28Þ
Jp = 1 ð7:29Þ
J = Je ð7:30Þ
F_ = Dp Fp
p
ð7:31Þ
Because of the incompressible plastic flow and irrotational plastic flow assump-
tions, we have the following relations:
Lp = Dp ð7:32Þ
trðL Þ = trðD Þ = 0
p p
ð7:33Þ
-1
L = Le þ F e D p F e ð7:34Þ
Fp ðr, 0Þ = I ð7:35Þ
F ðr, 0Þ = I
e
ð7:36Þ
7.2 Frame of Reference Indifference 379
The material constitutive model must be invariant with respect to the frame of
reference (coordinate system). Therefore, a quantity or an equation is frame indif-
ferent (or objective) if it is invariant to changes in the frame of reference.
Let the rigid body motion of any material point χ(r, t) be defined by a proper
orthogonal rotation tensor Ω(t) and a vector ro(t) for any duration t as follows:
ΩT Ω = I ð7:38Þ
Here we define a stress tensor as objective if it does not depend on the frame of
reference. In Eq. (7.37), Ω(t) represents rigid body rotation and ro(t) represents rigid
body translation. In this context, frame indifference of a vector a and a second-order
tensor A are defined in terms of transformation rules as follows:
a = Ωa ð7:39Þ
A = ΩAΩT ð7:40Þ
F = ΩF ð7:41Þ
Therefore, the deformation gradient tensor is not objective (it is not the frame of
reference indifferent) according to Eq. (7.40). Using Eq. (7.41) Cauchy tensor C in
the new transformed configuration can be given by
T
C = F F = FT ΩT ΩF = FT F = C ð7:42Þ
Since the Cauchy tensor is a Lagrangian tensor referring to the original frame of
reference, the Cauchy tensor should not be expected to change with the current
coordinate system (frame of reference) according to Eq. (7.40). Therefore, Cauchy
tensor C is not objective, but still invariant with respect to changes in the current
frame of reference in the deformed configuration. Using Eq. (7.41) and the right
polar decomposition rule in Eq. (7.16), we can write
F = RU = ΩF = ΩRU ð7:43Þ
380 7 Unified Micromechanics of Finite Deformations
R = ΩR ð7:44Þ
U=U ð7:45Þ
Therefore, right stretch tensor U and rotation tensor R are not objective. How-
ever, similar to the Cauchy tensor, the right stretch tensor refers to the original frame
of reference, and hence right stretch tensor is invariant to changes in the current
frame of reference. Similarly, using Eq. (7.19), Almansi tensor B in the transformed
configuration, we can write
T
B = FF = ΩFFT ΩT = ΩBΩT ð7:46Þ
Therefore, Almansi tensor B is objective. Using Eq. (7.41) and the left polar
decomposition rule in Eq. (7.17) yields
F = VR = ΩF = ΩVR ð7:47Þ
According to the transformation of rotation tensor in Eqs. (7.44) and (7.47) can be
rewritten as
F = VR = ΩVΩT ΩR ð7:48Þ
Since left polar decomposition is also unique, Eq. (7.48) implies that
V = ΩVΩT ð7:49Þ
Therefore, the left stretch tensor V is objective. Taking the time derivative of
Eq. (7.41), it can be shown that the material time derivative of deformation gradient
tensor F is not objective:
:
_ þ ΩF_
F = ΩF ð7:50Þ
Using Eq. (7.50) and the definition of velocity gradient tensor given in Eq. (7.6),
we can write
:
_ þ ΩF_ = ΩFF
F = LF = ΩF _ -1 T _ - 1 ΩT ΩF
Ω þ ΩFF ð7:51Þ
Equation (7.51) implies that the velocity gradient tensor is not objective as shown
below:
_ T
L = ΩLΩT þ ΩΩ ð7:52Þ
7.2 Frame of Reference Indifference 381
Using definitions of stretch rate tensor and spin tensor given in Eq. (7.10)
_ T
L = D þ W = ΩðD þ WÞΩT þ ΩΩ ð7:53Þ
D = ΩDΩT ð7:54Þ
_ T
W = ΩWΩT þ ΩΩ ð7:55Þ
Therefore, stretch rate tensor D is objective, while spin tensor W is not objective.
Using the multiplicative decomposition definition of deformation gradient given in
Eq. (7.7), we can write
F = Fe Fp = ΩFe Fp ð7:56Þ
Fe = ΩFe ð7:57Þ
Fp = Fp ð7:58Þ
Therefore, elastic and plastic deformation gradients are not objective. Since
plastic deformation gradient refers to the original and intermediate frame of refer-
ence, plastic deformation gradient is not objective but invariant to changes in the
current frame reference. Using definitions for elastic and plastic rotation tensors, left
stretch tensor, right stretch tensor, Cauchy tensor, and Almansi tensor in Eqs. (7.20),
(7.21), (7.22), (7.23), (7.24), (7.25), (7.26), and (7.27), the following relations can be
written:
Fe = Re Ue = ΩRe Ue ð7:59Þ
Re = ΩRe ð7:60Þ
Ue = Ue ð7:61Þ
Fe = Ve Re = ΩVe ΩT Ω Re ð7:62Þ
Ve = ΩVe ΩT ð7:63Þ
Ce = Ue Ue = Ue Ue = Ce ð7:64Þ
Be = Ve Ve = ΩVe ΩT Ω Ve ΩT = ΩBe ΩT ð7:65Þ
Fp = Rp Up = ΩRp Up ð7:66Þ
Rp = ΩR p
ð7:67Þ
382 7 Unified Micromechanics of Finite Deformations
Up = Up ð7:68Þ
Fp = Vp Rp = ΩVp ΩT Ω Rp ð7:69Þ
Vp = ΩVp ΩT ð7:70Þ
Cp = Up Up = Up Up = Cp ð7:71Þ
Bp = Vp Vp = ΩVp ΩT ΩVp ΩT = ΩBp ΩT ð7:72Þ
Therefore, elastic and plastic rotation tensors Re, Rp are not objective. Cauchy
tensors, elastic, and plastic right stretch tensors and Ce, Cp, Ue, Up are also not
objective, but invariant to changes in the current frame of reference. Almansi tensors,
elastic, and plastic left stretch tensors Be, Bp, Ve, Vp are all objective. Using elastic
and plastic velocity gradient definitions in Eqs. (7.12) and (7.13) with transformation
rules for elastic and plastic deformation gradient in Eqs. (7.57) and (7.58) yields
:
-1
Le = Fe Fe _ e Fe - 1 ΩT = ΩLe ΩT þ ΩΩ
= ΩFe þ ΩF _ T ð7:73Þ
:
-1
= F_ Fp = Lp
p -1
Lp = Fp Fp ð7:74Þ
Equation (7.73) shows that the elastic component of the velocity gradient is not
objective. According to (7.74) plastic component of the velocity, the gradient is not
objective either, but it is invariant to changes in the current frame of reference,
because the plastic velocity gradient refers to original and intermediate reference
frames, but does not refer to the current frame of reference. Using definitions of
elastic stretch rate tensor and spin tensor given by Eq. (7.14), we can write
_ T
Le = De þ We = ΩðDe þ We ÞΩT þ ΩΩ ð7:75Þ
De = ΩDe ΩT ð7:76Þ
_ T
We = ΩWe ΩT þ ΩΩ ð7:77Þ
Therefore, elastic stretch rate tensor De is objective, while elastic spin tensor We
is not objective. Similarly, using the plastic stretch rate definition given in Eq. (7.15)
and applying the irrotational plastic flow assumption in Eq. (7.28), we can write the
following relations:
Lp = Dp = Dp ð7:78Þ
Dp = Dp ð7:79Þ
Therefore, plastic stretch rate tensor Dp is not objective but invariant to changes
in the current reference frame. To summarize, it has been shown that
_
F,R,F,L,W,F ,R ,R ,L ,We are not objective; C,U, Fp, Ue, Ce, Up, Cp, Lp, Dp are
e e p e
not objective but invariant to changes in the current reference frame; and B,V,D, Ve,
Be, Vp, Bp, De are objective.
ψ = u - θs ð7:80Þ
The first law of thermodynamics (also known as the conservation of energy law)
states that energy can be transported, or converted from one form to another, but
cannot be destroyed or created. Accordingly, the internal energy of a system can
change by heat flow into or out of the system (δq), heat generated within the system
due to internal scattering and friction (r), mechanical work done on the system by the
external load (δw), or all other kinds of work done on the system (δw′) during any
process, including but not limited to electrical, thermal, chemical, etc. Considering
only the thermomechanical effects, rate of change in internal energy, u,_ can be given
by
u_ = Q þ l þ r ð7:81Þ
where Q is the rate of net heat flow into the system and l is the rate of net work done
on the system and r is the heat generated in the system, due to internal friction,
scattering, and chemical reactions. The second law of thermodynamics states that
there exists a thermodynamic state function, entropy, which increases in the universe
for all types of real processes due to irreversible entropy production. Unlike energy,
entropy (s) is not only transferred across boundaries of a system, but it may also be
created in the system which is called entropy production. Clausius-Duhem inequality
describes a condition imposed by the second law of thermodynamics in terms of a
384 7 Unified Micromechanics of Finite Deformations
nonnegative entropy production rate (γ) per unit volume for any kind of irreversible
process which is defined as
Js is net entropy flux into the system. Heat flows into the body can be defined in
terms of heat flux as follows:
ρQ = - div Jq ð7:83Þ
Substituting the time derivative of Eq. (7.80), internal energy definition given by
Eq. (7.81), and heat flow equation in Eq. (7.83) into Eq. (7.82), internal entropy
production rate density can be rewritten as
div Jq ρ r ρ
_ >0
γ = divðJs Þ - þ þ l - ψ_ - θs ð7:84Þ
θ θ θ
r is the internal heat generation source strength and ρ is the mass density.
The main premise of the unified mechanics theory is the derivation of the thermo-
dynamic fundamental equation, which must contain all entropy generation terms for
all active mechanisms. Of course, the amount of entropy generated in each micro-
mechanism will be different. After a comparison of all mechanisms, it is reasonable
to ignore mechanisms that generate entropy in smaller amounts by orders of mag-
nitude. For dual-mechanism rate-dependent constitutive modeling of polymers, the
material response can be resolved into two components which necessitate multi-
mechanism generalization of multiplicative decomposition in Eq. (7.7) and descrip-
tion of different Helmholtz free energy functions and associated fundamental equa-
tion assuming that linear addition is applicable for Eq. (7.80) Accordingly,
Eqs. (7.12), (7.13), (7.14), (7.15), (7.20), (7.27), and (7.31) hold for each component
of the material’s resistance mechanism. Subscripts “I” and “M” will be used to
designate the component of a quantity in the Intermolecular mechanism and Molec-
ular network mechanism, respectively. For a description of dissipation inequality,
total Helmholtz free energy density in (original) reference configuration is written as
a summation of defect energy (ΨD) and elastic energy stored in an intermolecular
structure (ΨI) and molecular network structure(ΨM):
Ψ CeI , CeM , A, θ = ΨI EeI , θ þ ΨM CeM , θ þ ΨD ðA, θÞ ð7:85Þ
Defect energy (ΨD) is assumed to depend on a stretch like tensor A and temper-
ature (θ), elastic energy stored in an intermolecular structure (ΨI) is assumed to
7.3 Unified Mechanics Theory Formulation for Finite Strain 385
depend on logarithmic elastic strain in the intermolecular structure EeI and temper-
ature (θ), and elastic energy stored in molecular network structure (ΨM) is assumed
to depend on elastic Cauchy tensor in molecular network structure CeM and temper-
ature (θ). If a similar decomposition also holds for specific entropy and specific
Helmholtz free energy, we can write the following relation:
s CeI , CeM , A, θ = sI EeI , θ þ sM CeM , θ þ sD ðA, θÞ ð7:86Þ
ψ CeI , CeM , A, θ = ψ I EeI , θ þ ψ M CeM , θ þ ψ D ðA, θÞ ð7:87Þ
_ Ce , Ce , A, θ
ρψ_ CeI , CeM , A, θ = J - 1 Ψ ð7:89Þ
I M
where ρo and ρ are mass densities in reference configuration and deformed config-
uration, respectively. Note that since eigenvalues of elastic Cauchy tensor CeI and
logarithmic elastic strain tensor EeI corresponding to the intermolecular structure are
related through Eq. (7.90) and eigenvectors of these tensors are identical, it is
possible to consider Helmholtz free energy density (ΨI) and specific entropy (sI)
associated with intermolecular structure as functions of temperature (θ) and elastic
Cauchy tensor CeI ; hence, we can write the following relations:
1
eigenvalue EeI = ln eigenvalue CeI ð7:90Þ
2
e
ΨI EI , θ ΨI CeI , θ ð7:91Þ
sI EeI , θ sI CeI , θ ð7:92Þ
According to the principle of virtual work, rate of work done per unit volume of
the deformed body (external work) is balanced with internal work, while total work
done on the system is stored as elastic strain energy (represented by the first two
386 7 Unified Micromechanics of Finite Deformations
terms in Eq. (7.95) and dissipated as plastic work (represented with the last two terms
in Eq. (7.95)):
ρl = w_ int ð7:94Þ
w_ int = ΓeI : LeI þ ΓeM : LeM þ J - 1 ΓpI : LpI þ J - 1 ΓpM : LpM ð7:95Þ
where ΓeI , ΓeM , ΓpI , and ΓpM are stress measures conjugate to the rate of deformation
measures LeI, LeM, LpI, and LpM , which were defined in Eqs. (7.12) and (7.13). Noting
-1
that J - 1 = J e , the J-1 the multiplier in front of the last two terms in Eq. (7.95)
recovers work definitions from intermediate configuration to deformed configura-
tion. The requirement of frame indifference for internal work can be given by the
following relation:
:
w_ int = w ð7:96Þ
int
Using transformation rules for elastic and plastic velocity gradients in Eqs. (7.73)
and (7.74), we can write the following relations:
ΓI :ðΩLeI ΩT þΩΩ
_ T ÞþJ - 1 Γ :Lp þ
e p
ΓeI :LeI þJ - 1 ΓpI :LpI þ
þΓeM :LeM þJ - 1 ΓpM :LpM
= þΓM :ðΩLeM ΩT þΩΩ
e I I
_ T ÞþJ - 1 Γp :Lp ð7:98Þ
M M
or
" #
ðΩT ΓI ΩÞp:LeI þ
e
ðΩT ΓMpΩÞp:LeM þ
e
ΓeI :LeI þJ - 1 ΓpI :LpI þ
þΓeM :LeM þJ - 1 ΓpM :LpM
= þJ ΓI :LI þJ - 1 ΓM :LM þ
-1 p
ð7:99Þ
e
_ T þΓe :ΩΩ
þΓI :ΩΩ _ T
M
The first two terms on the right-hand side of Eq. (7.99) indicate that stress
measures corresponding to elastic work ΓeI , ΓeM are objective as shown in
Eqs. (7.100) and (7.101). Since ΩΩ _ T is a skew-symmetric tensor (Eqs. (7.55)
and (7.77)), the last two terms on the right-hand
side
of Eq. (7.99) imply that stress
measures corresponding to elastic work ΓeI , ΓeM are symmetric (Eqs. (7.102) and
(7.103)). The third and fourth terms on the right-hand side of Eq. (7.99) show that
stress measures corresponding to plastic work ðΓpI , ΓpM Þ are not objective but
invariant to changes in the current frame of reference as shown in Eqs. (7.104) and
(7.105). Hence,
7.3 Unified Mechanics Theory Formulation for Finite Strain 387
e
ΓI = ΩΓeI ΩT ð7:100Þ
e
ΓM = ΩΓeM ΩT ð7:101Þ
e eT T
ΓI = ΓI ; ΓeI = ΓeI ð7:102Þ
T
e e T
ΓM = ΓM ; ΓeM = ΓeM ð7:103Þ
p
ΓI = ΓpI ð7:104Þ
p
ΓM = ΓpM ð7:105Þ
Using symmetry property of stress tensor in Eqs. (7.102) and (7.103), and
irrotational plastic flow definition given in Eq. (7.28), total internal work over the
whole volume of a system can be written as
Z Z
_ int =
W w_ int dV = ΓeI : DeI þ ΓeM : DeM þ J - 1 ΓpI : DpI þ J - 1 ΓpM : DpM dV
V V
ð7:106Þ
Total external work acting on the system can be described in terms of surface
tractions on boundaries of the system and body forces acting on the system as
follows:
Z Z Z
_ ext =
W ρ l dV = σ ðnÞ
χ_ dA þ b χ_ dV ð7:107Þ
V A V
where A represents the surface and V represents volume. Consider the principle of
virtual work for a special case defined in Eq. (7.34) as
The principle of virtual work can then be rewritten for this special case from
Eqs. (7.106) and (7.107) as
Z Z Z
ðnÞ
σ _
χdA þ _
b χdV = ½ΓeI : DeI þ ΓeM : DeM dV ð7:110Þ
A V V
Z Z Z
σðnÞ χdA
_ þ _
b χdV = _
½ðΓeI þ ΓeM Þ : gradðχÞdV ð7:111Þ
A V V
388 7 Unified Micromechanics of Finite Deformations
Z Z Z
σðnÞ χdA
_ þ _
b χdV = div½χ_ ðΓeI þ ΓeM ÞdV
A V V
Z
- divðΓeI þ ΓeM Þ χ_ dV ð7:112Þ
V
Since Eq. (7.112) is true for any choice of V and gradðχ_ Þ, from the first terms on
the left- and right-hand side, we can write
σ ðnÞ = ΓeI þ ΓeM n ð7:113Þ
which is essentially Cauchy’s stress theorem describing the relation between stress
tensor and surface tractions. From the second terms on the left- and right-hand side
of Eq. (7.112), we can write
div ΓeI þ ΓeM þ b = 0 ð7:114Þ
Consider the principle of virtual work for a second special case defined from
Eq. (7.34) such that
-1 -1
L = gradðχ_ Þ = LeI þ FeI DpI FeI = LeM þ FeM DpM FeM = 0 ð7:118Þ
or
-1
LeI = - FeI DpI FeI ð7:119Þ
-1
LeM = - FeM DpM FeM ð7:120Þ
Accordingly, the principle of virtual work can be rewritten for this special case
using Eqs. (7.106) and (7.197) as
Z -1
_ int = ΓeI : - FeI DpI FeI - 1 ΓpI :Dp pI þp
þJ
W þΓeM : - FeM DpM FeM
-1
þJ - 1 ΓM :DM
dV ð7:121Þ
V
or
7.3 Unified Mechanics Theory Formulation for Finite Strain 389
_ int
W
Z h i
-T -T
J - 1 ΓpI - FeI ΓeI FeI : DpI þ J - 1 ΓpM - FeM ΓeM FeM
T T
= : DpM dV
V
ð7:122Þ
Since the velocity gradient is assumed to be zero for this special case, the velocity
field
will be also equal to zero. Therefore, external work will be equal to zero
_ ext = 0 . As a result, individual terms inside the parenthesis in Eq. (7.122) should
W
be equal to zero for arbitrary selection of V, DpI , and DpM :
-T
J - 1 ΓpI - FeI ΓeI FeI
T
=0 ð7:123Þ
-T
J - 1 ΓpM - FeM ΓeM FeM = 0
T
ð7:124Þ
Using definitions of stress measures in Eqs. (7.116) and (7.117), it can be shown
that
T -T
ΓpI = JFeI TI FeI ð7:125Þ
T -T
ΓpM = JFeM TM FeM ð7:126Þ
whereas since the trace of plastic stretch rate is equal to zero due to incompressible
plastic flow, assumption stress conjugate to plastic stretch rate should be a deviatoric
tensor. Therefore, we can write
ΓpI = dev MeI ð7:129Þ
ΓpM = dev MeM ð7:130Þ
-1 -T
SeM = JFeM TM FeM ð7:132Þ
Using definitions of elastic Mandel stress given by Eqs. (7.127) and (7.128) and
symmetric second Piola-Kirchhoff stress given by Eq. (7.131) and Eq. (7.132), we
can write the following relation:
Jq
divðJs Þ - div =0 ð7:135Þ
θ
Jq
Js = ð7:136Þ
θ
1 ρr
γ ther = - div Jq ∇x ðθÞ þ > 0 ð7:137Þ
θ 2 θ
e e
∂ΨI EI , θ ∂ΨM CM , θ ∂ΨD ðA, θÞ
J -1 þ þ þ ρs θ_ = 0 ð7:138Þ
∂θ ∂θ ∂θ
e e
e e - 1 ∂Ψ EI , CM , A, θ
ρs EI , CM , A, θ = - J ð7:139Þ
∂θ
e e
∂ψ EI , CM , A, θ
s EeI , CeM , A, θ = - ð7:140Þ
∂θ
e
1 ∂ΨI EI , θ
: C_ I =
e
ΓeI : LeI - J - 1 ð7:141Þ
θ ∂CI e
e
- 1 ∂ΨI EI , θ
: C_ I
e
ΓI : LI = TI : LI = TI : DI = J
e e e e
ð7:142Þ
∂CeI
e
1 e-1 _ - 1 ∂ΨI EI , θ
e-T
: C_ I
e e
TI : DI =
e
F TI FI : CI = J ð7:143Þ
2 I ∂CeI
-1
e-T ∂ΨI EeI , θ
e
J FI TI FI = SI = 2
e
ð7:144Þ
∂CeI
7.3 Unified Mechanics Theory Formulation for Finite Strain 391
e
1 - 1 ∂ΨM CM , θ
: C_ M =
e
ΓM : LM - J
e e
ð7:145Þ
θ ∂CM e
e
- 1 ∂ΨM CM , θ
: C_ M
e
ΓM : LM = TM : LM = TM : DM = J
e e e e
ð7:146Þ
∂CeM
e
1 e
_ - 1 ∂ΨM CM , θ
: C_ M
e e
TM : DM = FM TM FM : CM = J
e e
ð7:147Þ
2 ∂CeM
-1
e-T ∂ΨM CeM , θ
J FM TM FM = SM = 2
e e
ð7:148Þ
∂CeM
1 ∂ΨD ðA, θÞ _
γ mech = J - 1 ΓpI : LpI þ J - 1 ΓpM : LpM - J - 1 : A >0 ð7:149Þ
θ ∂A
Equation (7.136) relates entropy flux to heat flux. Equation (7.137) defines
irreversible entropy production per unit volume in a deformed configuration associ-
ated with heat conduction which is always positive according to Fourier’s law
(Eq. (7.150)). Equation (7.140) provides the relation between specific entropy and
specific Helmholtz energy. Equations (7.144) and (7.148) describe stress measures
derived from Helmholtz free energy functions. Equation (7.149) defines irreversible
entropy production due to mechanical dissipation per unit volume in a deformed
configuration. The reason for the appearance of J-1 term in front of Eq. (7.149) is
that all stress measures and their conjugate rate measures refer to intermediate
configuration (relaxed configuration), while irreversible entropy production rate
density refers to deformed configuration. Heat flux is given by
e
∂ J - 1 ∂ΨI EI , θ ∂ΨM CeM , θ ∂ΨD ðA, θÞ
c= -θ þ þ ð7:154Þ
∂θ ρ ∂θ ∂θ ∂θ
F = FI = FM ð7:155Þ
T = TI þ TM ð7:156Þ
Intermolecular Resistance, I
7.3 Unified Mechanics Theory Formulation for Finite Strain 393
n 1
2 2
o
ΨI EeI , θ = G devðEIe Þj2 þ K - G tr EeI - - 3Kαðθ - θo ÞtrðEeI Þ ð7:157Þ
2 3
1
EeI =
ln CeI ð7:158Þ
2
T
1
EeI = ln FeI FeI ð7:159Þ
2
Utilizing Eq. (7.144) symmetric second Piola-Kirchhoff stress tensor SeI and Cauchy
stress tensor TI can be obtained from Helmholtz free energy density function
corresponding to intermolecular resistance as follows:
∂ΨI EeI , θ
SeI =2 ð7:160Þ
∂CeI
Relation between elastic Mandel stress MeI and elastic logarithmic strain EeI can
be obtained from Helmholtz free energy function (Eq. (7.157)) and second Piola-
Kirchhoff stress tensor SeI , and the definition is given in Eq. (7.160); hence, we can
write
394 7 Unified Micromechanics of Finite Deformations
MeI = 2Gdev EeI þ K tr EeI - 3αðθ - θo Þ I ð7:163Þ
∂ΨðA, θÞ
Mback = 2dev A = B ln ðAÞ ð7:165Þ
∂A
A_ = DpI A þ ADpI - γA ln ðAÞνpI ð7:166Þ
AðX, 0Þ = I ð7:167Þ
The equivalent plastic stretch rate is given by Eq. (7.169), effective equivalent
shear stress is given by Eq. (7.170), and mean normal pressure is defined in
Eq. (7.171), and they are all defined in terms of tensorial variables as follows:
pffiffiffi
νpI = 2 DpI ð7:169Þ
1
τI = pffiffiffi jMeff j ð7:170Þ
2
1
pI = - tr MeI ð7:171Þ
3
F_ I = DpI FpI
p
ð7:172Þ
7.3 Unified Mechanics Theory Formulation for Finite Strain 395
Only the effective equivalent shear stress drives the plastic flow, and it is the
source of plastic dissipation. Some part of plastic work is stored as energy associated
with back-stress. In polymers, once the effective shear stress level reaches a critical
level so that the energy barrier to molecular chain segment rotation is exceeded,
plastic flow takes place. This is in contrast to the dislocation slip mechanism in metal
plasticity. It is important to point out that the concept of dislocation and dislocation
motion-induced plasticity model does not apply to amorphous polymers because the
concept of dislocation cannot be justified in amorphous polymers.
According to the cooperative model, viscous flow in a solid amorphous polymer
may take place only when several polymer segments move cooperatively which also
accounts for the significance of activation volume during the yield process. The flow
rule for amorphous polymers is essentially based on the energy distribution statistics
of individual segments (Fotheringham and Cherry 1978). In simple terms, the
cooperative model flow rule is based on the average probability of simultaneous
occurrence of n thermally activated transitions across an energy barrier (activation
energy, Q) inducing a macroscopic strain increment of νo (Fotheringham et al. 1976;
Fotheringham and Cherry 1978). Plastic flow characteristics of amorphous polymers
are strongly temperature- and rate-dependent. According to the strain rate-
temperature superposition principle, an increase in temperature will have the same
effect on the yield stress as a decrease in strain rate (Francisco et al. 1996).
Equivalence of time and temperature describes that the yielding of amorphous
polymers at low temperatures is comparable to that at high strain rates. Therefore,
Eyring plots (yield stress-temperature ratio vs. plastic strain rate curves) for various
temperatures can be shifted vertically and horizontally with respect to a reference
temperature (θref) to obtain a master curve describing yield stress behavior over a
wide range of temperatures and strain rates.
Richeton et al. (2006) proposed that both horizontal shift (ΔHh) and vertical shift
(ΔHv) should follow Arrhenius-type temperature dependence. The resulting yield
stress definition relates yield behavior of polymer with β mechanical loss peak at
temperatures below the glass transition temperature θg through introducing activa-
tion energy at β-transition temperature, i.e., yield behavior in polymers is controlled
by segmental motions of polymer chains, and the reference state for yielding is
chosen as β-transition. An increase in yield stress due to an increase in strain rate is
attributed to a decrease in molecular mobility of molecular chains, while a slow
deformation rate allows polymer chains to slip past each other, resulting in lower
resistance to flow. At low temperatures near secondary transition temperature (θβ),
secondary molecular motions are restricted, and chains become stiffer which also
increases the yield stress, while the increase in temperature provides more energy to
polymer chains facilitating relative motion between polymer molecular chains. For
temperatures above the glass transition temperature θg, characteristic plastic strain
rate equation was modified by Williams-Landel-Ferry (WLF) parameters (c1, c2).
Although characteristic plastic strain rate definitions at temperatures below and
above the glass transition temperature, θg, are continuous functions of temperature
in separate domains, piece-wise definition with respect to the glass transition tem-
perature, θg, results in an unrealistic change in plastic flow behavior around glass
transition temperature, i.e., the derivative of plastic strain rate equation is
396 7 Unified Micromechanics of Finite Deformations
discontinuous at θg. Srivastava and Anand (2010) proposed a modified version of the
flow rule in intermolecular structure which incorporates different values of activation
energy for the glassy region and rubbery region, but still, the constitutive model
makes an abrupt change in the activation energies at the glass transition temperature,
θg, which creates a discontinuity problem in computational mechanics simulations.
To provide a smoother transition in flow characteristics around glass transition
temperature, θg, characteristic plastic strain rate given by Eq. (7.174) and equivalent
shear plastic stretch rate given by Eq. (7.175) can be modified in the following
forms:
" !#
Q ln ð10Þc1 θ - θg
ν = νoI exp - I 1 þ exp ð7:174Þ
kB θ c2 þ θ - θ g
nI
p τI V
νI = ν sinh ð7:175Þ
2kB θ
where νoI is the pre-exponential factor, QI is the activation energy for plastic flow in
intermolecular structure, kB is Boltzmann’s constant, c1 and c2 are WLF parameters,
nI is the number of thermally activated transitions necessary for plastic flow, V is the
activation volume, and τI is the net effective stress which is defined as
τI = τI - SI - αP pI ð7:176Þ
state plastic flow occurs and the plastic flow rate becomes equal to the applied strain
rate. The evolution equation for order parameter is defined as
Fig. 7.3 Comparison of different viscoplastic models in literature in terms of the temperature
dependence of normalized characteristic viscoplastic shear strain rate νp =νpðθ = θg Þ , Gunel (2010)
according to Anand’s model, there is no rubbery region, but the material response is
liquid-like as the viscoplastic strain rate increases by six orders of magnitude in a
narrow temperature interval (2 °C) around the glass transition temperature. On the
other hand, in Richeton’s model, there is also a remarkable change in viscoplastic
strain rate due to the piece-wise definition with respect to the glass transition
temperature, but it also causes a discontinuous derivative of viscoplastic rate func-
tion at the glass transition temperature. Accordingly, the material response predicted
by Anand or Richeton viscoplastic models has a significant (abrupt) change in stress
value at the glass transition temperature. An improved version of the dual-
mechanism constitutive model presented in this chapter provides a gradual transition
in material response with respect to the temperature around the glass transition.
Though Anand’s model has a continuous definition of activation energy in the
temperature domain, a remarkable difference between activation energies in glassy
and rubbery regions still produces an abrupt change in the polymer’s response.
According to Fig. 7.3, Anand’s viscoplastic model is invariant of temperature above
glass transition temperature, while Richeton’s model provides a relatively gentler
transition in response. As a result of this rapid change in viscoplastic response in
Anand’s model, it cannot predict material behavior accurately under non-isothermal
conditions. These models were further studied by Gunel and Basaran (2010) in terms
of predictions for creep strain rate in a creep test with a stress level of 0.6 MPa which
is conducted at different temperatures as depicted in Fig. 7.4.
Figure 7.4 supports previous observations just discussed. All models predict the
same creep strain rates at temperatures below the glass transition. For assurance of
accurate and realistic modeling of material response around the glass transition
temperature, every aspect of material property definitions that describe hardening-
softening behavior and flow function must be continuous in the temperature domain
and must have continuous derivatives with respect to temperature.
7.3 Unified Mechanics Theory Formulation for Finite Strain 399
Fig. 7.4 Comparison of different viscoplastic models in literature in terms of creep strain rates at
different temperatures in response to applied stress of 0.6 MPa
1 I -3
ΨM CeM , θ = - μM I M ln 1 - 1 ð7:182Þ
2 Im
where μM and IM are temperature-dependent rubbery shear modulus and limit the
extensibility of polymer chains, respectively. Since volume change in a polymer is
considered in the elastic deformation gradient associated with intermolecular struc-
ture, it is essential to define Gent free energy
in terms of distortional elastic
deformation gradient in network structure, FeM d , as given by Eq. (7.183) which
produces no change in volume. I1 is the first invariant of elastic distortional Cauchy
tensor in network structure, CeM d :
-1
FeM d = J =3 FeM ð7:183Þ
det FeM d = 1 ð7:184Þ
e T
CM d = FeM d FeM d ð7:185Þ
I 1 = tr CeM d ð7:186Þ
Second Piola-Kirchhoff stress tensor SeM and Cauchy stress tensor TM can be
derived from Gent free energy as follows:
∂ΨM CeM , θ
SeM =2 ð7:187Þ
∂CeM
- 1h i
I1 - 3 1 e e - 1
SeM = J - 2=3 μM 1 - I- tr CM d CM d ð7:189Þ
IM 3
-1
I1 - 3
TM = J - 1 μ M 1 - dev BeM d ð7:190Þ
IM
Elastic distortional Almansi tensor, BeM d , is defined in terms of distortional
elastic deformation gradient FeM d Þ as
T
BeM d
= FeM d FeM d ð7:191Þ
7.3 Unified Mechanics Theory Formulation for Finite Strain 401
For elastic Mandel stress tensor MeM and equivalent shear stress τM and equivalent
plastic shear strain rate νpM for a molecular relaxation in-network, the structure can be
given by the following relations:
F_ M = DpM FpM
p
ð7:196Þ
FpM ðr, 0Þ = I ð7:197Þ
where νoM is the pre-exponential factor, QM is the activation energy for molecular
relaxation in network structure, kB is Boltzmann’s constant, nM is a strain-rate
sensitivity parameter, and SM is a stress measure describing the resistance of network
structure to relaxation which increases with increasing plastic stretch rate as defined
as follows:
S_ M = hM ðλpM - 1Þ SM - SM νpM ð7:199Þ
Plastic deformation gradient evolution given by Eqs. (7.196) and (7.197) com-
pletes the constitutive modeling of material behavior in molecular network structure.
According to Eq. (7.199), network resistance will increase continuously as plastic
stretch ðλpM Þ in polymer chains increases and reaches a constant value SM which
depends on temperature and stretch rate. Plastic stretch-dependent evolution of
resistance to plastic flow also ensures the correct prediction of elastic recovery in
the unloading path. In Anand’s model, net driving stress for plastic flow in
intermolecular mechanism includes an additional resistance term accounting for
dissipative resistance to plastic flow ðSb or S2 Þ taking place at large deformations.
According to Anand’s model, this dissipative resistance evolves with plastic stretch
in the intermolecular mechanism or total stretch in the intermolecular mechanism. In
the model derived in this chapter, dissipative resistance (SM) at large deformations in
molecular network mechanism evolves with plastic stretch in molecular network
branch which controls material response at large deformations (post-yield region). In
Anand’s model, a third mechanism must be added for the true prediction of elastic
recovery during the unloading and cooling of a pre-heated sample. It was argued that
the third mechanism for molecular network structure was introduced due to a
necessity driven by the experimentally observed complex response. However, the
constitutive model presented in this chapter involves non-isothermal conditions
without introducing the additional third mechanism.
Finally, utilizing the relation given by Eq. (7.196) and specific Helmholtz free
energy definitions (Eqs. (7.157), (7.164), (7.182)), the temperature-dependent
governing constitutive equation can be given as
where the first two terms are heat conduction representing heat transfer within the
material during transient state and heat source due to passive heating (external
heating) or active heating (internally generated heat by internal friction or scattering,
etc.). The last two terms represent heat induced due to intrinsic dissipation and
thermoelastic effect representing conversion between mechanical and thermal
energy in the elastic range. In the case of coupled thermomechanical loading,
temperature increase due to mechanical work and temperature change due to heat
transfer between material and surroundings are all included in Eq. (7.203). Based on
descriptions of stress components and strain rate measures, irreversible entropy
production due to mechanical dissipation per unit volume in deformed configuration
can be rewritten from Eq. (7.149) as follows:
2 3
∂ΨD ðA, θÞ
devð Þ - 2dev
J-1 6 A :DI þ 7p
MeI
γ mech = 6 ∂A 7>0 ð7:204Þ
θ 4 ∂Ψ D ð A, θ Þ 5
þγ A : ln ðAÞνpI þdevðMeM Þ:DpM
∂A
h i
J-1 1
γ mech = τI þ γBj ln ðAÞj2 νpI þ τM νpM > 0 ð7:205Þ
θ 2
DpI Meff
NpI = p = ð7:206Þ
DI j M eff j
DpM MeM
NpM = p = ð7:207Þ
DM MeM
There are two major irreversible entropy production mechanisms, thermal dissipa-
tion (Eq. (7.137)) due to heat exchange between the system and surroundings and
mechanical work dissipation formulated by Eq. (7.205). Critical entropy (Scr) [also
referred to as fracture entropy in the literature] is a characteristic value of a material.
The link between irreversible material degradation and the amount of heat generated
due to non-conservative dissipation mechanisms (i.e., plastic work, internal friction,
scattering mechanisms, chemical reactions, etc.) or entropy production due to
mechanical work is well established. Heat conduction within metals is usually
very fast leading to negligible thermal gradients [at low-frequency loads, also
referred to as pseudo-static in some books] and leading to quick dissipation of heat
due to the high thermal diffusivity of metals. In the case of materials with low
thermal diffusivity such as polymers, heat transfer may take place over a prolonged
404 7 Unified Micromechanics of Finite Deformations
When TSI reaches a critical value Φcr, corresponding to a critical entropy level
(Scr) that can be defined as failure or Φ = 1 can be defined as failure, depending on
the application. [Because TSI is an exponential function, it never reaches 1; com-
monly 0.999 is considered as the final value, depending on the computing
resources.] The critical TSI value depends on the critical entropy level (Scr), and it
is a characteristic property of material [like toughness] and can be calculated or
measured. Hence, we can calculate Φcr from the critical entropy value:
h ½Scr - So
i
Φcr = 1 - e - ms R ð7:209Þ
where So is initial internal entropy value, which may be taken as zero. Any irrevers-
ible process induces degradation in microstructure, and internal entropy increases
according to the second law of thermodynamics. Total entropy production due to
mechanical and thermal dissipation can be calculated at any time step as follows:
Zt
Smech = Smech jt = to þ γ mech dt ð7:210Þ
to
Zt h i
J-1 1 2 p p
Smech = Smech jt = to þ τI þ γBj ln ðAÞj νI þ τ M νM dt ð7:211Þ
θ 2
to
Zt
Sther = Sther jt = to þ γ ther dt ð7:212Þ
to
Zt
1 ρr
Sther = Sther jt = to þ - div J ∇ ð θ Þ þ dt ð7:213Þ
θ2
q x
θ
to
Material properties are vital for constitutive models. Material properties must be
defined over a large temperature and rate of loading range and should be continuous
and smooth over the transition region. Functions for material properties presented
herein are mathematical tools to describe the influence of temperature and loading
rate in a continuous form. Formulations for these material properties representing
temperature dependence are in a form that provides smooth continuity in the
temperature domain and has a continuous first derivative with respect to temperature.
Some material properties can be obtained by conducting isothermal tests at
different temperatures (both above and below the glass transition temperature)
and different loading rates such as E, ν, I m , νoI , QI , nI , Bg , X B , V, αp , γ . Some
material parameters are difficult to measure directly such as
406 7 Unified Micromechanics of Finite Deformations
hI , b, g, νoM , QM , hM , nM , μM , ϕ , SM , yet these properties/parameters can be
obtained by statistical methods.
According to the free-volume theory of William et al. (1955), the plastic flow rule
(yield function) can be constructed for equivalent plastic shear strain rate at temper-
atures above the glass transition temperature θg using Williams-Landel-Ferry (WLF)
equations. Similarly, rate dependence of glass transition temperature can be consid-
ered in terms of temperature-time equivalence of glass transition as follows:
8 g
> ν
< θref þ c2 log =νref ν > νref
θg =
g
cg1 - log ðν=νref Þ ð7:217Þ
>
:
θref
g ν≤ν ref
where cg1 and cg2 are WLF parameters associated with θg, νref is the reference stretch
rate, and ν is the equivalent stretch rate which is defined by
pffiffiffi
ν= 2 j D j ð7:218Þ
where Eg and Er are for glassy and rubbery elastic modulus corresponding to
temperatures confining the glass-rubber transition region. X gE and X rE represent the
rate of change of elastic modulus with respect to temperature in glassy and rubbery
domains, respectively. sE is the rate of loading sensitivity of elastic modulus, while
θE and ΔE define origin temperature and width of glass-rubber transition window,
respectively. Experimental studies on temperature and rate dependence of storage
modulus and elastic modulus of poly-methyl methacrylate (PMMA) indicate that
PMMA is highly sensitive to the rate of loading and temperature at constant
humidity. Modulus of PMMA continuously decreases with increasing temperature
with a remarkable drop around the glass transition temperature θg over a 10 C–20 C
temperature window, depending on the frequency of loading (Gunel 2010).
Poisson’s ratio ðνÞ is assumed to be only temperature-dependent and defined as
7.5 Definition of Material Properties 407
1 1 θ - θg þ θ E
ν = νg þ νr - νg - νr tanh ð7:221Þ
2 2 ΔE
where νg and νr are for glassy region and rubbery region Poisson’s ratios, respec-
tively. Shear modulus (G) and bulk modulus (K ) are defined as follows:
E
G= ð7:222Þ
2ð1 þ νÞ
E
K= ð7:223Þ
3ð1 - 2νÞ
Temperature and rate dependence of rubbery modulus (μM) is modeled like the
elastic modulus function as follows:
2 3
1 g 1 g θ - θg þ θμ h i
ν
μM = 4 2 μM þ μM - 2 μM - μM tanh
r r
Δμ 5 1 þ sμ log
þX μ ½θ - ðθg þθμ Þ
νref
ð7:224Þ
(
X gμ θ ≤ θg þ θμ
Xμ = ð7:225Þ
X rμ θ > θg þ θμ
1 1 θ - θg þ θϕ
ϕ = ϕg þ ϕr - ϕg - ϕr tanh þ X ϕ θ - θg þ θϕ
2 2 Δϕ
ð7:226Þ
(
X gϕ
θ ≤ θg þ θϕ
Xϕ = ð7:227Þ
θ > θg þ θϕ
X rϕ
1 g 1 g θ - θg þ θM
I M = I M þ I M - I M - I M tanh
r r
þ X M θ - θg þ θM
2 2 ΔM
ð7:228Þ
(
X gM θ ≤ θg þ θμ
XM = ð7:229Þ
X rM θ > θg þ θμ
408 7 Unified Micromechanics of Finite Deformations
1 g 1 g θ - θg þ θS
SM= SM þ SM - SM - SM tanh
r r
2 2 ΔS
þ X S θ - θg þ θS ð7:230Þ
g
X S θ ≤ θg þ θS
Xs = ð7:231Þ
X rS θ > θg þ θS
θ - θg
B = Bg 1 - tanh þ X B θg - θ ð7:232Þ
ΔB
(
X gB θ ≤ θg
XB = ð7:233Þ
0 θ > θg
1 1 θ - θg þ θg
g= g þ gr - g - gr tanh
2 g 2 g Δg
þ X g θ - θg þ θg ð7:234Þ
g
Xg θ ≤ θg þ θg
Xg = ð7:235Þ
0 θ > θg þ θg
!b3
νpI
b = b1 exp ðb2 θÞ p ð7:236Þ
νref
380
θ.g(ν)
(°K)
378
376
374
10 20
10−5 ν (S-1) 25
410 7 Unified Micromechanics of Finite Deformations
1800
1600 H series
1400
Elasc Modulus (MPa)
M series
1200
1000 L series
800
600
400
200
0
0 20 40 60 80 100 120 140 160
Temperature (°C)
Fig. 7.6 Temperature- and rate-dependent elastic modulus of PMMA, Nie (2005)
5.0
4.8
Limited Chain Extensibility
4.6
4.4
4.2
M90
H series
4.0
3.8 M series
3.6
L series
3.4 H90
3.2
3.0
0 20 40 60 80 100 120 140 160
Temperature (°C)
Figure 7.6 depicts the elastic modulus as a function of temperature for different
strain rates, H series (loading rate of 0.9 mm/s), M series (0.09 mm/s), and L
(0.009 mm/s).
Figure 7.6 indicates that the temperature sensitivity of elastic modulus becomes
smaller at slower loading rates. Parameters E g , E r , θE , ΔE , X gE , X rE , sE as a func-
tion of temperature and rate can be obtained from test data by statistical methods.
Temperature dependence of limited chain extensibility (IM) can be obtained from
fracture strain values in isothermal tests. Figure 7.7 shows the limited chain exten-
sibility as a function of temperature and strain rate.
Two data points (H90 and M90
values) are outliers. Using test data, parameters I gM , I rM , X gM , X rM , θM , ΔM can be
obtained as a function of temperature and loading rate.
7.6 Applications of Finite Deformation Models 411
Fig. 7.8 The ratio of yield stress to temperature vs. plastic strain rate plot of PMMA, Nie (2005)
1 1
Δð log ðνpI ÞÞ = H h - ð7:237Þ
θ θref
σ
y 1 1
Δ = Hv - ð7:238Þ
θ θ θref
According to Richeton et al. (2005a, b), if θref is selected as θg, horizontal shift
(Hh) and vertical shift (Hv) factors can be related to material properties through the
following relation:
QI
Hh = ð7:239Þ
kB ln ð10Þ
γ
H v = Bg - X gB θg = ð7:240Þ
3
412 7 Unified Micromechanics of Finite Deformations
where nI is the number of thermally activated transitions necessary for plastic flow,
V is the activation volume, and αp is the pressure sensitivity parameter. Temperature-
dependent characteristic viscoplastic flow rate (ν) is defined in Eq. (7.175), which is
derived from the flow rule of the theory of plasticity defined in Eq. (7.174). Effective
stress ðτI Þ at the glass transition temperature, θg, can be approximated from
Eq. (7.176) for a one-dimensional case. In this example, it is assumed that back-
stress (Mback) and plastic flow resistance in intermolecular structure (SI) vanish
around glass transition temperature, whereas applied stress at yielding is equal to
yield stress and normal pressure is one-third of applied stress:
σ1 = σy ð7:242Þ
Mback ffi 0 ð7:243Þ
SI ffi 0 ð7:244Þ
τI ffi σ 1 = σ y ð7:245Þ
1 1
pI = σ = σ ð7:246Þ
3 1 3 y
αp
τI = 1 - σ ð7:247Þ
3 y
Using regression analysis for fitting a master curve to experimental data with shift
factors Hh = 4900 K and Hv = - 40MPa material parameters, (νoI, QI, nI, Bg, XB, V,
αp and γ) can be calculated. Activation volume (V ) and activation energy (QI) were
assumed to be constant. Back-stress modulus asymptotically approaches zero around
the glass transition temperature. The remaining parameter is the back-stress modulus
definition ΔB which controls transition temperature range which was selected as 5 C
to ensure a smooth change in hardening characteristics of the material in
non-isothermal simulations (Fig. 7.9).
Implementing a 1-D version of the constitutive model in software, like
MATLAB, with isothermal conditions is an expedient way to determine the
remaining parameters (hI, b, g) in intermolecular
structure and the
parameters asso-
ciated with molecular network resistance νoM , QM , hM , nM , μM . These parameters
cannot be directly observed in macroscale experiments. Hence, it is necessary to run
simple 1-D simulations to estimate values for these parameters. Influence of param-
eters (hI, b, g) which control hardening-softening characteristics of response on
7.6 Applications of Finite Deformation Models 413
200
180
160
Back Stress Modulus (MPa)
140
120
100
80
60
40
20
0
0 20 40 60 80 100 120 140 160
Temperature (°C)
40 40
30 b 30
stress
stress
20
g
20
10 10
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
strain (b)
(a) strain
40 50
hM
40
30
30
hI
stress
20 stress
20
10
10
0 (d) 00
(c) 0 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1
strain strain
40 40
m no
M
30 M
30
stress
stress
20 20
10 10
0 0
(e) 0 0.2 0.4 0.6 0.8 1 (f) 0 0.2 0.4 0.6 0.8 1
strain strain
100 40
n
M
80
Q 30
M
60
stress
stress
20
40
10
20
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
(g) strain (h) strain
Fig. 7.10 Influence of material parameters b, g, hI , hM , μM , νoM , QM , nM on stress-strain curves
of PMMA. Gunel (2010)
7.7 Numerical Implementation of Dual-Mechanism Model 415
Table 7.1 Material parameters for PMMA constitutive model, Gunel (2010)
Parameters Parameters Parameters
θref
g ðK Þ
373 ϕg 0.001 gr 9.55
cg1 ðK Þ 32.58 ϕg 0 Δθ (K )
g
5
cg2 87.5 θϕ(K ) 0 θg (K ) 5
c1(K ) 9 Δϕ(K ) 10 X gg K - 1 0.07
c2 116 X gϕ ð1=K Þ -4.5 × 10-6 b1(MPa) 2.72 × 1010
-1 -2
ν (s )
ref
1.73 × 10 X rϕ ð1=K Þ 0 b2(1/K ) -4.58 × 10-2
Eg(MPa) 500 I gM 7.04 b3 4.04 × 10-2
Er (MPa) 1.5 I rM 4.6 νpref ðs - 1 Þ 0.001
θE(K ) -8 θM(K ) -20 νoI ðs - 1 Þ 2.43 × 1012
ΔE(K ) 15 ΔM(K ) 12 QI(J/K) 1.56 × 10-19
sE 0.06 X gM ð1=K Þ 0 V(m3) 1.39x10-27
X gE ðMPa=KÞ -15.05 X rM ð1=K Þ 0.001 αp 0.21
X rE ðMPa=KÞ 0 SgM ðMPaÞ 35 nI 2.17
νg 0.31 SrM ðMPaÞ 0.2 hI 40.42
νr 0.49 θS(K ) 5 γ(MPa) 60
μgM ðMPaÞ 9 ΔS(K ) 5 νoM ðs - 1 Þ 4.33 × 109
μrM ðMPaÞ 0.5 X gS ðMPa=KÞ -0.01 QM(J/K) 1.30 × 10-19
θμ(K ) 5 X rS ðMPa=KÞ 0 hM 14.43
Δμ(K ) 14 Bg (MPa) 10 nM 5
sμ 0.03 X gB ðMPa=KÞ 2.15 ms(g/mol) 100.13
X gμ ðMPa=KÞ -0.4 ΔθB (K ) 5
X rμ ðMPa=KÞ 0 gg 7.97
including the first two terms of series representation of the exponential function.
Rate dependency of some material properties (E, μM, θg) is implemented in a numer-
ical algorithm such that property values are calculated based on strain rates in the
previous time increment and strain rates are updated based on initial and final
deformation gradients at the end of the time increment. Therefore, equilibrium is
satisfied at discrete time increments assuring thermodynamic consistency.
Numerical integration of evolution equations of plastic deformation gradients
FpI ,FpM and stretch-like internal variable A based on exponential mapping is obtained
from Eqs. (7.172), (7.196), and (7.166) as presented below:
ðFpI Þnþ1 = exp Δt ðDpI Þnþ1 ðFpI Þn ð7:248Þ
ðFpM Þnþ1 = exp Δt ðDpM Þnþ1 ðFpM Þn ð7:249Þ
Anþ1 = An exp Δt ðDpI Þnþ1 þ exp Δt ðDpI Þnþ1 An - γAn ln ðAn Þ
exp Δt ðDpI Þnþ1
× pffiffiffi ð7:250Þ
2
12
10
True Stress (MPa)
Δt=0.0001
6
Δt=0.001
4
Δt=0.01
2 Δt=0.1
0.0445
0.0440
0.0435
Strain Rate (s-1 )
0.0420
0.0415
0.0410
0.0405
1.E-05 1.E-04 1.E-03 1.E-02 1.E-01 1.E+00
(b) me increment (Δt, sec)
Fig. 7.11 Convergence study results for time increment in the viscoplastic model in terms of (a)
true stress-strain curves and (b) strain rates
equivalent viscoplastic strain rate ðνpI , νpM Þ, and equivalent stretch rate (d) must be
monitored.
True stress-strain curves presented in Fig. 7.11a indicate a fast convergence even
for the largest time increment of Δt = 0.1 s. Convergence of equivalent plastic strain
rates ðνpI , νpM Þ requires a small time increment, which also indicates a slow conver-
gence of internal state variables (SI, SM, ϕ. . .). Another interesting point that can be
observed from Fig. 7.11b is that equivalent viscoplastic strain rate in an
intermolecular network ðνpI Þ is larger than the equivalent stretch rate (d ) which
might seem like an error. However, this is due to the numerical method that is used to
418 7 Unified Micromechanics of Finite Deformations
12 0.050
0.045
10
0.040
0.035
True Stress (MPa)
6 0.025
0.020
4
σ11 0.015
νpI
0.010
2 νpM
d 0.005
0 0.000
0 10 20 30 40 50 60
me (sec)
1.E+05
1.E+04
1.E+03
1.E+02
1.E+01
0 2 4 6 8 10 12
(a) seed size
7
True Stress (MPa)
5 seed size=10
4
seed size=5
seed size=2.5
3
seed size=1.25
2 seed size=0.625
1
seed size=3
0
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
(b) True Strain
0.070
0.065
0.055
Strain Rate (s-1)
0.050
0.045
0.040
0.035
0.030
0 2 4 6 8 10 12
(c) seed size
Fig. 7.13 Mesh sensitivity of the ABAQUS model in terms of (b) true stress-strain curves and (c)
strain rates (a) for different mesh densities
7.7 Numerical Implementation of Dual-Mechanism Model 421
(a) 40
(b)
50 ISO-M30 (exp.)
ISO-H30 35
45 (exp.) ISO-M60 (exp.)
40 ISO-H60 30
(exp.) ISO-M90 (exp.)
35
Stress (MPa)
Stress (MPa)
ISO-H90 25
30 (exp.) ISO-M30 (sim.)
ISO-H30 20 ISO-M60 (sim.)
25 (sim.)
20 15 ISO-M90 (sim.)
15
10
10
5 5
0
0 0.1 0.2 0.3 0.4 0.5 0
0 0.2 0.4 0.6 0.8 1 1.2
Strain Strain
35
(c)
30 ISO-L30 (exp.)
ISO-L60 (exp.)
25
Stress (MPa)
ISO-L90 (exp.)
20
ISO-L30 (sim.)
15
ISO-L60 (sim.)
10
ISO-L90 (sim.)
5
0
0 0.5 1 1.5 2
Strain
Fig. 7.14 Comparison of Abaqus fea simulation and testing results for isothermal testing of
PMMA in (a) H series, (b) M series, and (c) L series tests
Fig. 7.15 Axial force histories in non-isothermal experiments and simulations of PMMA for H
series (0.9 mm/s forming rate)
Axial force histories of H, M, and L series with respect to normalized time (tn) are
presented in Figs. 7.15, 7.16, and 7.17. Simulation and experiment results in terms of
axial force histories are reasonably in good agreement. A unified mechanics formu-
lation of plastic flow rule for intermolecular structure and appropriate material
property definitions ensures smooth transition of response around the glass transition
temperature.
Here we conclude our discussion of the large deformation analysis of polymers.
424 7 Unified Micromechanics of Finite Deformations
Fig. 7.16 Axial force histories in non-isothermal experiments and simulations of PMMA for M
series (0.09 mm/s forming rate)
References 425
Fig. 7.17 Axial force histories in non-isothermal experiments and simulations of PMMA for L
series (0.009 mm/s forming rate)
References
Ames, N. M., et al. (2009). A thermo-mechanically coupled theory for large deformations of
amorphous polymers. Part II: Applications. International Journal of Plasticity, 25(8),
1495–1539.
Anand, L. (1986). Moderate deformations in extension-torsion of incompressible isotropic elastic
materials. Journal of the Mechanics and Physics of Solids, 34, 293–304.
Anand, L., & On, H. (1979). Hencky’s approximate strain-energy function for moderate deforma-
tions. Journal of Applied Mechanics, 46, 78–82.
Anand, L., et al. (2009). A thermo-mechanically coupled theory for large deformations of amor-
phous polymers. Part I: Formulation. International Journal of Plasticity, 25(8), 1474–1494.
Arruda, E. M., & Boyce, M. C. (1993). Evolution of plastic anisotropy in amorphous polymers
during finite straining. International Journal of Plasticity, 9(6), 697–720.
Arruda, E. M., Boyce, M. C., & Jayachandran, R. (1995). Effects of strain rate, temperature and
thermomechanical coupling on the finite strain deformation of glassy polymers. Mechanics of
Materials, 19(2–3), 193–212.
Bergström, J. S., & Boyce, M. C. (1998). Constitutive modeling of the large strain time-dependent
behavior of elastomers. Journal of the Mechanics and Physics of Solids, 46(5), 931–954.
Boyce, M. C., Socrate, S., & Llana, P. G. (2000). Constitutive model for the finite deformation
stress-strain behavior of poly(ethylene terephthalate) above the glass transition. Polymer, 41(6),
2183–2201.
Fotheringham, D. G., & Cherry, B. W. (1978). The role of recovery forces in the deformation of
linear polyethylene. Journal of Materials Science, 13(5), 951–964.
426 7 Unified Micromechanics of Finite Deformations
Fotheringham, D., Cherry, B. W., & Bauwens-Crowet, C. (1976). Comment on “the compression
yield behavior of polymethyl methacrylate over a wide range of temperatures and strain rates”.
Journal of Materials Science, 11(7), 1368–1371.
Francisco, P., Gustavo, S., & Élida, B. H. (1996). Temperature and strain rate dependence of the
tensile yield stress of PVC. Journal of Applied Polymer Science, 61(1), 109–117.
Gent, A. N. (1996). A new constitutive relation for rubber. Rubber Chemistry and Technology,
69(1), 59–61.
Gunel, E. M., & Basaran, C. (2009). Micro-deformation mechanisms in thermoformed alumina
trihydrate reinforced poly(methyl methacrylate). Materials Science and Engineering: A,
523(1–2), 160–172.
Gunel, E.M., & Basaran, C. (2010). Stress whitening quantification of thermoformed mineral filled
acrylics. Journal of Engineering Materials and Technology, 132(3), 031002–031011.
Gunel, E. M., & Basaran, C. (2011a). Damage characterization in non-isothermal stretching of
acrylics: Part II experimental validation. Mechanics of Materials, 43(12), 992–1012.
Gunel, E. M., & Basaran, C. (2011b). Damage characterization in non-isothermal stretching of
acrylics: Part I theory. Mechanics of Materials, 43(12), 979–991.
Hashin, Z., & Shtrikman, S. (1963). A variational approach to the theory of the elastic behavior of
multiphase materials. Journal of the Mechanics and Physics of Solids, 11(2), 127–140.
Mustafa Eray Gunel (2010), Large Deformation Micromechanics of Particle Filled Acrylics at
Elevated Temperatures, Dissertation submitted to Department of Civil, Structural and Environ-
mental Engineering, University at Buffalo, SUNY
Mandel, J. (1972). Plasticite classique et viscoplasticite (Lecture Notes). International Center for
Mechanical Sciences.
Nie, S. (2005). A micromechanical study of the damage mechanics of acrylic particulate composites
under thermomechanical loading. (Ph.D. Dissertation). In Civil, structural, and environmental
engineering. State University of New York at Buffalo.
Palm, G., Dupaix, R. B., & Castro, J. (2006). Large strain mechanical behavior of poly(methyl
methacrylate) (PMMA) near the glass transition temperature. Journal of Engineering Materials
and Technology, 128(4), 559–563.
Richeton, J., et al. (2005a). A formulation of the cooperative model for the yield stress of
amorphous polymers for a wide range of strain rates and temperatures. Polymer, 46(16),
6035–6043.
Richeton, J., et al. (2005b). A unified model for stiffness modulus of amorphous polymers across
transition temperatures and strain rates. Polymer, 46(19), 8194–8201.
Richeton, J., et al. (2006). Influence of temperature and strain rate on the mechanical behavior of
three amorphous polymers: Characterization and modeling of the compressive yield stress.
International Journal of Solids and Structures, 43(7–8), 2318–2335.
Richeton, J., et al. (2007). Modeling and validation of the large deformation inelastic response of
amorphous polymers over a wide range of temperatures and strain rates. International Journal
of Solids and Structures, 44(24), 7938–7954.
Srivastava, V., & Anand, L. (2010). A thermo-mechanically-coupled large-deformation theory for
amorphous polymers in a temperature range which spans their glass transition. International
Journal of Plasticity, 26(8), 1138–1182.
Weber, G. & Anand, L. (1990). Finite Deformation Constitutive Equation and a Time Integration
Procedure for Isotropic, Hyperelastic-Viscoplastic Solids. Computer Methods in Applied
Mechanics and Engineering, 79, 173–202.
Williams, M. L., Landel, R. F., & Ferry, J. D. (1955). The temperature dependence of relaxation
mechanisms in amorphous polymers and other glass-forming liquids. Journal of the American
Chemical Society, 77(14), 3701–3707.
Chapter 8
Unified Mechanics of Metals Under High
Electrical Current Density:
Electromigration and Thermomigration
8.1 Introduction
When a metal is subjected to an electrical potential gradient, the current enters from
the anode side and travels to the cathode side, and the electrons travel from the
cathode to the anode side. Electromigration is a mass diffusion-controlled phenom-
enon. When a conductor is subject to a high current density, the so-called electron
wind transfer part of its momentum to the atoms (or ions) to make the atoms (or ions)
move in the direction of the current. As a result, the degradation of the metal
conductor occurs mainly in two forms: in the anode side, the atoms will accumulate
and finally form hillocks and the vacancy concentration on the cathode side will
form voids. Both hillocks and voids will cause the degradation of the material and
eventual failure. The damage evolution due to electromigration can be modeled as an
irreversible mass transport process. The purpose of this chapter is to present the
formulation of modeling the electromigration and thermomigration-induced material
degradation process using the unified mechanics theory
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2022 427
C. Basaran, Introduction to Unified Mechanics Theory with Applications,
https://doi.org/10.1007/978-3-031-18621-9_8
428 8 Unified Mechanics of Metals Under High Electrical Current Density. . .
electrons. Scattered electrons collide with metal atoms and push them in the direc-
tion of electron flow (Seith 1955). On the other hand, atoms move in the opposite
direction of the applied electric field when they are ionized, the latter mechanism, the
static force of the electric field, is considered the direct field force. The net effect of
these two forces is the so-called electrical field driving force of electromigration.
Stress Gradient
When atoms diffuse from the cathode side to the anode side, they leave behind
vacancies on the cathode side and mass accumulation on the anode side. As a result,
there is tensile stress on the cathode side and compressive stress on the anode side.
This stress differential is responsible for the stress gradient.
Of course, the mechanical stress gradient influences the electromigration process.
There is an interaction between the stress gradient and the electromigration driving
forces. Stress gradient can counteract or enhance the electromigration process,
depending upon the interaction among all diffusion driving forces. Essentially, stress
gradient is another driving force of mass transport.
However, in thermomigration in the absence of external stress gradients, the mass
moves from the hot side to the cold side, and compression on the cold side and
tension on the hot side develop, which counters the thermal gradient-induced force.
Thermal Gradient
The Joule heat generated under high current density is highly localized. Hence, there
is a thermal gradient in the medium, which leads to thermomigration. The thermal
gradient is one of the strongest driving forces of diffusion. Thermomigration cannot
be ignored especially when the thermal gradient is large, in which case the
thermomigration can be the dominant diffusion driving force.
The physical explanation behind thermomigration is not well understood. How-
ever, recent literature shows that thermomigration could play a significant role in the
electromigration-induced failure.
Atomic Vacancy Concentration Gradient
It has been estimated that the mass flux due to vacancy concentration gradient is
small compared to those induced by electrical field forces, stress gradient, and
thermal gradient. The atomic vacancy concentration gradient is usually small at
the beginning compared to the other diffusion driving forces. However, as the mass
migration progresses, vacancy concentration gradient increases significantly. At a
certain point in time, the vacancy concentration gradient becomes large enough to
slow the diffusion.
AI Silicon Die
Ni UBM
100 μm
140 μm
Underfill
Solder
Cu
PCB
Ni UBM
Underfill
Solder
Cu Plate
40µm700X
Fig. 8.1 Electronics solder joint profile after electromigration and thermomigration failure
8.2 Physics of Electromigration Process 431
J = - D∇C ð8:1Þ
where:
J is the mass flux, which is defined as a vector quantity, with units of (molm-2s-1).
Q
D is diffusivity, with units of m2/s; it is a function of temperature D = D0 exp - RT d
∂C
= D∇2 C ð8:2Þ
∂t
DF
vD = ð8:3Þ
kB T
where vD is the drift velocity, unit m/s; D is the diffusion coefficient; F is the driving
force of the diffusion, unit Newton; kB is Boltzmann’s constant; and T is absolute
temperature, Kelvin.
From Eq. (8.3) we can easily deduce that for self-diffusion, the driving force due
to concentration gradient has the following form:
F = - k T∇C ð8:4Þ
∂C D∇F
= ð8:5Þ
∂t kT
where F is the driving force of the diffusion. In the presence of electrical current
density, the driving force has four primary components:
432 8 Unified Mechanics of Metals Under High Electrical Current Density. . .
!
X
F= Fi = F em þ F tm þ F σ þ F s ð8:6Þ
i
where Fem is the electron wind force, Ftm is the driving force due to temperature
gradient, Fσ is the stress gradient driving force, and Fs is the driving force due to
chemical potential gradient. Each driving force will be introduced separately in the
following sections.
These forces also lead to grain coarsening and phase changes in the metals.
However, we are primarily interested in the mechanical implications in here.
→ → →
F em = F wind þ F direct
→
= ð - Z wind þ Z el Þe E
→
ð8:7Þ
= Ze E
= Z eρj
8.2 Physics of Electromigration Process 433
→
where e is the electron charge, E is the electric field potential (V/cm), ρ is the
resistivity of the metal (ohmmeter (Ωm)), j is the current density (Amp/cm2), and Z*
is a dimensionless quantity called the effective charge (also called an effective
nuclear charge or the affective valence), which is the combined net attractive positive
charge of nuclear protons acting on valence electrons. When a charged particle
penetrates through condensed matter, it polarizes the medium, which in turn reacts,
back slowing down the penetrating particle. To introduce a measure of the inertia of
ions, Brandt and Kitagawa (1982) introduced the concept of effective charge. The
effective charge, Z*, relates to how a diffusing atom interacts with the conduction
band electrons. The interaction with the conduction band electrons is a complex
function of the electronic structure, a quantum mechanical effect. The effective
charge is defined in terms of the stopping power of the charge Z(S) and the power
of the proton moving in the same medium at the same velocity (Sp) (Barberan and
Echenique 1986):
1
Z = S=Sp 2 ð8:8Þ
Momentum,k
The effective charge, Z*, is determined by the effective mass, m*, of electrons
available for interaction (scattering) in which the energy produced is not exceeding
the Fermi energy, Ef, which is the total energy difference between the lowest
occupied conduction band and the highest conduction band, of uncharged metal at
absolute zero temperature. The effective mass m* is proportional to the second
derivative in the energy-versus-momentum, E - k relationship, a quantum mechan-
ical effect near Fermi level energy E = Ef (Lloyd et al. 2004). Above the inflection
point, m* is negative; below it m* is positive (Fig. 8.4).
8.2 Physics of Electromigration Process 435
Thermomigration has been known since Ludwig (1856). As Chen et al. (2012)
stated, when an inhomogeneous binary alloy is annealed at an elevated isotropic
temperature field with no thermal gradient, it will become homogeneous. However,
when a homogeneous binary alloy is annealed under a temperature gradient, the
alloy will become inhomogeneous. This de-alloying phenomenon is called Soret
effect after Soret (1879). The phenomenon is caused by mass transport due to
temperature gradient.
The role of thermomigration in the presence of high current density in micro-
electronics solder joints has been ignored until recently. It has been shown by Ye
et al. (2003a, b, c, d, e, f), Abdulhamid et al. (2008), and Basaran et al. (2008) that
thermomigration plays a significant role in the presence of high current density.
Diffusion driving force due to thermal gradient can be bigger than electron wind
forces. If they are in the opposite directions, the thermal gradient can negate the
electron wind force and if they are in the same direction can hasten the failure
process significantly.
The temperature gradient-induced diffusion driving force is usually represented
by Huntington (1972):
Q
F tm = ∇T ð8:9Þ
T
where Q* is the heat of transport, which is the isothermal heat transmitted by the
moving atom in the act of jumping a lattice site less the intrinsic enthalpy, and T is
the temperature in Kelvin.
The most accepted thermomigration theory is very much based on the same
concepts as electromigration. In electromigration, the force acting on the diffusing
atom is the momentum exchange due to the collisions (scattering) of diffusing atoms
and scattered electrons in the conduction band. However, in thermomigration, in the
absence of electrical current, the driving force in the direction of the temperature
gradient results from the fact that the energy, therefore the momentum of the atoms
(phonons) at higher temperatures is greater than that of the lower temperature ones.
This gradient in the momentum produces a driving force for mass transport from the
hot side to the cold side.
Based on this argument, Q* for metal with a high Z* value should also be high.
Similar to Z*, the sign of Q* can either be positive or negative depending on the
effective mass of the charge carriers.
According to Huntington (1972), thermomigration driving force can be divided
into an intrinsic part and a part coupled to carrier electrons and phonons:
The intrinsic considerations include the energy transported by the moving atom,
the energy required to prepare a place [a lattice site] to receive it, and, for the vacancy
diffusion mechanism, the formation energy for the counter-moving vacancy, which
is given by
Q = βEm - E f ð8:11Þ
where Em and Ef are the energies of motion and formation for the moving atoms,
respectively, and β is a dimensionless fraction less than 1 which gives the fraction of
the kinetic energy carried by the moving atom. Q* can be either positive or negative
depending on the magnitude of the kinetic energy and the formation energy.
According to Campbell and Huntington et al. (1969), β was reported to be about
0.81; hence, Q* is estimated to be about 1ev, which is in accordance with the
experimental measurement of 0.23 ev.
Due to mass transport from cathode to anode, and from the high-temperature side to
low-temperature side, there are more vacancies on one side and more mass accu-
mulation on the other side. Consequently, on the mass accumulation side, atoms are
subjected to compression, and on the vacancy accumulation side, atoms are
subjected to tension. Therefore, there is a stress gradient between two sides of the
continuum. This stress gradient is also another diffusion driving force. Tensile stress
makes diffusion easier and compressive stress makes diffusion more difficult to
happen.
The role of mechanical stress gradients as a diffusion driving force in conjunction
with electromigration was first explored in a series of experiments by Blech (1976)
and Blech and Herring (1976). It was shown that a critical current density threshold
exists below which mass transport is stopped in a thin-film aluminum sample. This
threshold was found to be inversely proportional to the thin-film Al stripe length. It
was suggested that electromigration in a short segment tends to induce back-stress
when they did an electromigration test on a set of short Al thin-film strips deposited
on a substrate as depicted in Fig. 8.5.
Extrusion Extrusion
Length
Void Void
AI AI AI
e ~ TIN e~
Fig. 8.5 Schematic of Al thin-film strips of different lengths showing stress gradient influence due
to electromigration
8.2 Physics of Electromigration Process 437
Figure 8.5 shows the effect of stress gradient on Al thin film on a substrate. The
Al thin-film strips are deposited on a substrate of TiN. It can be observed that the
longer the length, the larger the depletion, and extrusion on the cathode and anode
sides, respectively, due to electromigration. When the thin film is below a “critical
length,” there is no observable depletion or extrusion. Stress gradient diffusion
driving force is neutralizing the electron wind force-driven diffusion.
The dependence of mass transport on the strip length was explained by the effect
of the stress gradient. When the electromigration process transports Al atoms from
cathode to anode, the anode will be in compression, while the cathode will be in
tension. Based on the Nabarro-Herring model of equilibrium vacancy concentration
in a stressed solid, the tensile region has more, and the compressive region has fewer,
vacancies than the unstressed region. As a result, there exists a vacancy concentra-
tion gradient between the cathode side and the anode side. The stress gradient
induces an atomic flux of mass diffusing from anode to cathode, and it opposes
the mass flux driven by electromigration from the cathode side to the anode side.
Blech’s critical length comes into effect when a thin film on a substrate is
subjected to current density with blocking boundary conditions. A stress gradient
provides a mass transport force in opposition to electromigration. If the current
density is sufficiently low, a stress gradient will be generated that can stop electron
wind force-driven diffusion.
The stress gradient depends on the length of the strip; the shorter the thin film
strip, the greater the gradient. At a certain length defined as the “critical length,” the
stress gradient is large enough to counterbalance electromigration force so no
depletion at the cathode or extrusion at the anode occurs. The critical length can
be calculated when the net force due to electromigration and stress gradient is zero,
as shown in Eq. (8.12). The force equilibrium equation in the x-direction for an atom
can be rewritten as
∂σ
F = F e þ F σ = - Z eρj þ f Ω =0 ð8:12Þ
∂x
where f is the atomic vacancy relaxation ratio, the ratio of atomic volume to the
volume of an atomic vacancy, Ω is the atomic volume, and σ is the spherical stress.
Solving Eq. (8.12),
∂σ Z eρj
=
∂x fΩ
ð8:13Þ
fΩ
j∂x = ∂σ
Z eρ
The integration of Eq. (8.13) produces one of the most important characteristics
of electromigration. Given maximum stress that can be sustained by the conductor,
the product of current density and the length of the conductor determines whether
electromigration diffusion can occur or not. This product was discovered by I. A.
Blech and is referred to as the “Blech product” (Blech 1976; Blech and Herring
1976):
438 8 Unified Mechanics of Metals Under High Electrical Current Density. . .
fΩ
jlB = ðσ - σ 0 Þ ð8:14Þ
Z eρ m
where lB is known as Blech’s critical length, σ m is the maximum spherical stress that
can be supported by the conductor, while σ 0 is the initial spherical stress in the
conductor. If the product of the current density and the length is exceeded,
electromigration mass transport can occur; if it is not, no electromigration can
happen.
If all thin-film conductors could be designed so that jlB stays below the critical
value, electromigration can be avoided since mass transport comes to a
complete halt.
However, this scenario is only true for a pure metal thin film on a substrate,
because it assumes that metal is not an alloy and there is no temperature gradient
acting on the system.
∂C v → →
= -∇ q þG ð8:15Þ
∂t
" → #
→ → Cv Z e → Cv → C v Q ∇T
q = - Dv ∇C v þ -ρ j - ð - f ΩÞ∇σ þ ð8:16Þ
kT kT kT T
where:
Initial State
After Δt
qx qx+Δ
x
Δx+ έxΔtΔx
Δy+ έyΔtΔy
Δ
Δz+ έzΔtΔz
z
V+ ΔV
Δ c+Δc
y
Δ
x
έxΔtΔ
x
Q
Dv = vacancy diffusivity, Dv = Do e - kT
Z = effective charge number
e = electron charge
ϕ = electric potential
j = current density (vector)
f = vacancy relaxation ratio, ratio of atomic volume to the volume of a vacancy
Ω = atomic volume
k = Boltzmann’s constant
T = absolute temperature in Kelvin
σ sp = spherical part of the stress tensor, σ sp = trace(σ ij)/3
Q = heat of transport, the isothermal heat transmitted by the moving atom in the
process of jumping a lattice site less the intrinsic enthalpy
G = vacancy generation/annihilation rate can be expressed by (Sarychev et al. 1999)
Cv - C ve
G= - ð8:20aÞ
τs
ð1 - f ÞΩσ sp
Cve = Cv0 e kT ð8:20bÞ
To solve Eq. (8.15) using the finite element method (method of weighted
residuals), we can write the following relationship:
Z
∂c
δc C V0 þ ∇ q - G dV = 0 ð8:21Þ
V ∂t
ð1 - f ÞΩσ sp
Z Z exp -c
∂c kT
δc CV0 dV þ C V0 δc dVþ
∂t τs
V
Z V
Ze cf Ω cQ
δc∇ DV C V0 ∇c þ ∇ϕ c þ ∇σ sp þ þ 2 ∇T dV = 0
V kT kT kT
ð8:22Þ
8.3 Laws of Conservation 441
The concentration vector can be represented in terms of nodal values, that is,
δc = N T δci ð8:23Þ
Using Eq. (8.23) into Eq. (8.22), after simple organization, we can derive the
stiffness matrices. However, first, we have to define the terms in Eq. (8.22).
If we define C = Cv/Cv0 as the normalized concentration, then the vacancy
diffusion equation could be rewritten as
∂C v
∂t
→ → →
Z eρ → fΩ → Q →
= Dv ∇2 C þ ∇ C j þ ∇ C∇σ þ 2 ∇ C∇T
kT kT kT
G
þ ð8:24Þ
C vo
1 → →
ε_ m
ij = f Ω∇ q δij ð8:25Þ
3
1
ε_ gij = ð1 - f Þ ΩGδij ð8:26Þ
3
where δij is the Kronecker delta. Thus, the total volumetric strain rate due to electrical
current is
h i
Ω → →
ε_ elec _m
ij = ε _ gij =
ij þ ε f ∇ q þ ð1 - f ÞG δij ð8:27Þ
3
The volumetric strain caused by the current stressing is superimposed onto the
total strain tensor with strains due to other loadings; thus, total strain [for small strain
case] can be given by
442 8 Unified Mechanics of Metals Under High Electrical Current Density. . .
ε_ total
ij = ε_ mech
ij þ ε_ therm
ij þ ε_ elec
ij ð8:29Þ
where ε_ total
ij is the total strain rate tensor, ε_ mech
ij is the strain rate due to mechanical
loading, ε_ ij
therm
is the strain rate due to thermal load, and ε_ elec
ij is the volumetric strain
rate due to electromigration.
Following the standard procedure to obtain the finite element method stiffness
matrices leads to
Z
1 1
K cc = NT NdV ð8:30Þ
V Δt
2 3
Z ∂ fΩ sp Q ∇T nþ1
2 ∂N T 6 ∂x þ kT nþ1 ∇σ nþ1 þ kT 2 þ 7
K cc = DV 6
4
nþ1 sp 7 NdV ð8:31Þ
V ∂x Z e
c f Ω ∂∇σ nþ1 5
þ ∇ϕnþ1 þ nþ1
kT nþ1 kT nþ1 ∂cnþ1
2ð1 - f ÞΩ sp 3
Z ð1 - f ÞΩσ nþ1 ∂σ sp
3 exp nþ1
∂cnþ1
- 1
K cc = - N T 4 5 NdV
kT nþ1 kT nþ1
ð8:32Þ
V τs
Eventually, we have
cc
K nþ1 = K 1cc þ K 2cc þ K 3cc ð8:33Þ
sp
Z ð1 - f ÞΩσ
2
nþ1
ð1 - f ÞΩ e kT nþ1 ∂σ sp
K cu = - NT nþ1
ΒdV ð8:36Þ
V kT nþ1 τs ∂εnþ1
And we can obtain the coupled stiffness matrix of concentration and mechanical
displacement by the following:
cu
K nþ1 = K 1cu þ K 2cu ð8:37Þ
The proper approach would be also to take a derivative of the potential with
respect to entropy and find the related stiffness matrices. However, that will add
entropy as a nodal unknown to our equations. For sake of computational simplicity,
we take the entropy generation rate values from the previous step and calculate them
at the Gauss integration point. Assuming the increments are small, the error should
be small. However, this simplification is not mathematically true and does contribute
to error.
The force equilibrium equation must be written according to the laws of unified
mechanics theory. However, here we try to provide a simple implementation, which
ignores the derivative of displacement with respect to entropy. The thermodynamic
state index is introduced into the formulation later on. In the absence of body forces,
the Newtonian mechanics force equilibrium equation has the following form:
σ ij,j = 0 ð8:38Þ
For small strains, the elastic strain-stress constitutive model can be established as
where:
εvp is the viscoplastic strain vector
εD is strain vector due to diffusion
εTE is strain vector due to thermal expansion
C is the tangential constitutive tensor
Using the principle of virtual work, we can write
Y Z Z
= δε σ dV = δε C ðε - εvp - εD - εTE ÞdV ð8:40Þ
u V V
After the standard derivation process, we end up with the mechanical stress-
induced deformation-related stiffness matrices as follows:
444 8 Unified Mechanics of Metals Under High Electrical Current Density. . .
Z
nþ1 =
K uu BT C B dV ð8:41Þ
V
∂T
ρC p - ∇ðkh ∇T Þ - ρQ = 0 ð8:44Þ
∂t
where:
ρ = mass density of the material
Cp = specific heat
kh = coefficient of heat transfer
Q = heat generated within the body, which can be expressed as
Q = QJ þ QP þ QV ð8:45Þ
where:
QJ = Joule heating, which can be written as
Z
1
QJ = QJ ðt Þdt ð8:46Þ
Δt Δt
1 1 1 1
QJ = E nþ1 E - E nþ1 ΔE nþ1 þ ΔE nþ1 ΔEnþ1 ð8:47Þ
R nþ1 R 3 R
∂ϕ
E= - ð8:48Þ
∂x
E 1 ∂ϕ
j= =- ð8:49Þ
R R ∂x
QP = σ nþ1 : ε_ pl
nþ1 ð8:50Þ
where ε_ pl is the plastic strain rate. The entire plastic work is not dissipated as heat.
Some part is stored in dislocations which contribute to hardening. Equation (8.50) is
an acceptable simplification because plastic work in electromigration is negligible.
QV is the heat due to vacancy flux which can be expressed by
QV = q : F k ð8:51Þ
where:
q is vacancy flux
Fk is the effective driving force which has the following form:
Q kT
F k = - Z e∇ϕ þ f Ω∇σ sp þ ∇T þ ∇c ð8:52Þ
T c
QV
Q ∇T nþ1
2
cnþ1 kT nþ1
= Dv C V ∇cnþ1 þ Z e∇ϕnþ1 þ f Ω∇σ sp
nþ1 þ
kT nþ1 cnþ1 T nþ1
ð8:53Þ
Z
∂T
δT ρC p - ∇ðkh ∇T Þ - ρQ dV = 0 ð8:54Þ
V ∂t
By taking the derivative of Eq. (8.56) with respect to temperature, T, we can find
the heat capacity matrix and heat transfer matrix which can be separately expressed
as
Z
CP
nþ1 =
CTT NT
Δt
NdV ð8:57Þ
V
Z
∂N T kh ∂N
K 1TT = dV ð8:58Þ
V ∂x ρ ∂x
Z
∂Qnþ1
J
K 2TT = NT NdV
∂T nþ1
Z V
1 ∂R ∂ϕnþ1 ∂ϕnþ1 ∂ϕnþ1 ∂Δϕnþ1
= N
T
- 2 - ð8:59Þ
R ∂T nþ1 ∂x ∂x ∂x ∂x
V
1 ∂Δϕnþ1 ∂Δϕnþ1
þ NdV
3 ∂x ∂x
Z Z
3 ∂Qnþ1
V
DV C V cnþ1
K TT = NT NdV = NT
∂T nþ1 k
" V V #
k∇cnþ1 ∂σ sp Q ∂N Q ∇T nþ1
k
F nþ1 N - 2F nþ1
k
N þ f Ω nþ1 N þ - N
cnþ1 ∂T nþ1 T nþ1 ∂x T 2nþ1
dV
T 2nþ1
ð8:60Þ
Eventually, we can get the temperature “stiffness” matrix by adding them all
together:
8.6 Electrical Conduction Equations 447
TT 1 2 3
nþ1 = Cnþ1 þ K TT þ K TT þ K TT
K TT ð8:61Þ
The term stiffness is normally used for the mechanical force-displacement rela-
tionship. Here we are using the term “stiffness” as an analogy. The coupled term
concerning concentration field-induced temperature change can be obtained by
taking the derivative of the total potential equation with respect to c which yields
Z Z
∂Qnþ1
V
DV C V k 2 ∂σ sp ∂N
Tc
K nþ1 =- NT NdV = - F nþ1 N - 2F nþ1
k
cnþ1 f Ω nþ1
∂cnþ1 kT nþ1 ∂cnþ1 ∂x
V V
kT nþ1 ∂N
þ cnþ1 - ∇c N
c2nþ1 ∂x
ð8:62Þ
h i Z Z
T ∂Qnþ1
J
1 ∂N
Tϕ
K nþ1 = - N NdV = = N T J - ΔJ dV ð8:64Þ
V ∂ϕ nþ1 V 3 ∂x
where:
A is the surface of a control volume
n is the outward normal to surface
j is the electrical current density defined in Eq. (8.49)
448 8 Unified Mechanics of Metals Under High Electrical Current Density. . .
The divergence theorem can be used to convert the surface integral into a volume
integral which yields to
Z
∂
jdV = 0 ð8:66Þ
V ∂x
Using first the chain rule and then the divergence theorem, this statement can be
rewritten as
Z
∂δϕ
jdV = 0 ð8:68Þ
V ∂x
The influence of temperature change on the current field can be taken into account
by adding the following term:
h i Z
∂N T 1 ∂R
K ϕT = E NdV ð8:76Þ
nþ1
V ∂x R2 ∂T
nþ1
2 38 → 9 8 9
½K uu
nþ1 ½Kucnþ1 ½K uT
nþ1 0 < u = < fu =
½
½K nþ1 fug = 4 ½K nþ1
K cu
½Kccnþ1 ½K cT
nþ1 0 5 cv
= qqc ð8:73Þ
nþ1 ½KTc
nþ1 ½K TT
nþ1 ½ nþ1
K Tϕ : T ; : T;
Tu
0 0 ½K ϕT
nþ1 ½Kϕϕnþ1 ϕ j
It is easy to notice that the finite element nodal point has displacement, vacancy
concentration, temperature, and electrical potential as nodal unknowns. An accurate
analysis, according to unified mechanics theory, would also include entropy gener-
ation rate as a nodal unknown. Of course, it does complicate the computation
process.
∂σ sp
nþ1
∂σ sp ∂σ sp nþ1 ∂σ nþ1
dev
∂ΔεPnþ1
Terms ∂cnþ1 , ∂Tnþ1
nþ1
, ∂ε nþ1
, ∂ε nþ1
, and ∂εnþ1
in Eq. (8.73) can be derived individually
once we introduce the viscoplastic model.
In this section, the thermodynamic fundamental equation accounting for most but
not all entropy production mechanisms is derived in a way appropriate for imple-
mentation in the finite element method. Ignored mechanisms included but are not
limited to grain coarsening and phase change and are assumed to account for a small
portion of the total entropy generation. Of course, including their contribution is
more accurate. In the derivation below, for the sake of completeness, and clarity,
some obvious thermodynamics relations are restated.
According to the second law of thermodynamics for any macroscopic system, the
entropy of the system is a state function.
The differential of the entropy, dS, may be written as the sum of two terms:
where dSe is the entropy supplied to the system and dSi is the entropy produced
inside the system. The second law of thermodynamics states that dSi must be equal to
zero for reversible and positive for irreversible processes:
dSi ≥ 0 ð8:75Þ
Z
dSe
=- Js,tot dΩ ð8:77Þ
dt
Ω
Z
dSi
= γdV ð8:78Þ
dt
V
where s is the specific entropy per unit mass, Js, tot is the total entropy flow per unit
area and unit time, and γ is the entropy source strength or internal entropy production
per unit volume per unit time.
Using the local form of entropy, we can get the local form of the variation of
entropy as follows:
∂ρs
= - divJs,tot þ γ ð8:79Þ
∂t
γ≥0 ð8:80Þ
With the help of the material time derivative of the volume integral, which is
given by
d ∂
= þ v grad ð8:81Þ
dt ∂t
ds
ρ = - divJs þ γ ð8:82Þ
dt
where the entropy flux, Js, is the difference between the total entropy flux and a
convective term:
Based on the above definitions, the degradation of the system is directly related to
the rate of change of the entropy (thermodynamic state index). This enables us to
8.7 Thermodynamic Fundamental Equation for Electromigration and Thermomigration 451
obtain more explicit expressions for the entropy flux and internal entropy
production rate.
With the assumption of a caloric equation of state, Helmholtz free energy, Ψ, in
differential form can be written as
where u is internal energy and T is the temperature (here we use upper case, Ψ, to
distinguish from the electrical potential ϕ). Rearranging terms in Eq. (8.84) leads to
To get the expression for free energy, we need to specify a function with a scalar
value concave with respect to temperature T and convex with respect to other
variables. Here we use
Ψ = Ψðε, εe , εp , T, V k Þ ð8:86Þ
where Vk can be any internal state variable determined by the specific application at
hand and physical processes that is leading to generating entropy for our specific
definition of failure.
In small strain theory, the strain can be written in the form of their additive
decomposition as follows:
εe = ε - εp ð8:87Þ
so that
Ψ = Ψðεe , T, V k Þ ð8:88Þ
dΨ ∂Ψ dεe ∂Ψ dT ∂Ψ dV k
= e: þ : þ : ð8:89Þ
dt ∂ε dt ∂T dt ∂V k dt
ds du dΨ dT
T = - -s ð8:90Þ
dt dt dt dt
du X
ρ = - divJq þ σ : gradð~vÞ þ Jk Fk ð8:91Þ
dt k
where u is the total internal energy, Jq is the heat flux, σ is the stress tensor, ~v is the
rate of deformation, Jk is the diffusional flux of component k, and Fk is the body
force acting on the mass of component k.
Using the conservation of energy equation in the form given by Eq. (8.91) and
Eqs. (8.82) and (8.83), we can write the rate of change of entropy density as follows:
ds 1 X ∂Ψ dεe ∂Ψ dT
ρ = - divJq þ σ : gradðvÞ þ Jk Fk - ρ : þ :
dt T ∂εe dt ∂T dt
k ð8:92Þ
∂Ψ dV k dT
þ : -s
∂V k dt dt
where
σ : gradð~vÞ = σ : ðD þ WÞ ð8:93Þ
and D (symmetric) and W (skew-symmetric) are the rate of deformation tensor and
spin tensor, respectively.
Due to the symmetry of σ, we obtain the following relation:
σ : ðD þ W Þ = σ : D ð8:94Þ
dε dεe dεp
σ : D=σ : =σ : þ ð8:95Þ
dt dt dt
After rearranging Eq. (8.92) and comparing it with Eq. (8.82), we can obtain
Js = T1 Jq and the following entropy production rate term:
σ = ρð∂Ψ=∂εe Þ ð8:97Þ
∂Ψ
s= - ð8:98Þ
∂T
If we assume that entropy generation due to elastic strain rate is much smaller
compared to other mechanisms, we can simplify Eq. (8.96) as follows:
1 1X 1 ρ ∂Ψ dV k
γ= - J q grad ð T Þ þ Jk Fk þ σ : e_ p - : ð8:99Þ
T 2 T k
T T ∂V k dt
1
Jq = CjGradðT Þj2 ð8:100Þ
T2
where C is the thermal conductivity tensor, the simplified entropy production rate
can be given by
1 1X 1 ρ ∂Ψ dV k
γ= - CjGradðT Þj2 þ J F þ σ : ε_ p - : ð8:101Þ
T 2 T k k k T T ∂V k dt
We identify Jk in Eq. (8.101) as the flux, and Fk as the effective driving force
term, as defined in the Onsager relation:
→ Q → kT →
Fk = Z ejρ þ ð - f ΩÞ∇σ - ∇T - ∇C ð8:102Þ
T C
In Eq. (8.101) the irreversible dissipation includes two parts: the first term is
called heat dissipation caused by conduction inside the system, while the second,
third, and fourth terms account for other irreversible processes in the system, and we
will call it intrinsic dissipation.
It is important to point out that the simplification we did in the entropy generation
rate term is not necessarily accurate. However, it simplifies computation. An accu-
rate derivation should include all entropy-generating micro-mechanisms in the
thermodynamic fundamental equation. Ignoring the entropy generation due to elastic
deformations as well, using Eqs. (8.101) and (8.102), we can write total entropy as
Z t1
Δs = γ dt ð8:103Þ
to
454 8 Unified Mechanics of Metals Under High Electrical Current Density. . .
Δs
Zt → 2
1 CD Q∇T kT →
CjgradðT Þj2 þ v effective Z l eρj - f Ω∇σ þ þ ∇C
T2 kT 2 T C
=t0
1 ρ ∂Ψ dV k
þ σ : εp - : dt
T T ∂V k dt
ð8:104Þ
Fig. 8.7 Cross-section of test module and solder joints (Ye et al. 2003a)
h i
Φ = 1 - e - Δs R
ms
8.7 Thermodynamic Fundamental Equation for Electromigration and Thermomigration 455
Fig. 8.8 TSI evolution (a) in three different solder joints at -20 °C (b) in solder #7 at different
temperatures
456 8 Unified Mechanics of Metals Under High Electrical Current Density. . .
80
Experimental
70 Simulational
60
Time to Failure (hour)
50
40
30
20
10
0
-50 -45 -40 -35 -30 -25 -20 -15 -10 -5 0
Ambient Temperature (ć)
8.8 Example
The solder joint in the electronic package shown in Fig. 8.7 was studied using the
formulation given in this chapter.
Ye et al. (2003a, b, c, d, e, f) published laboratory test data and finite element
simulation results of time to failure shown in Table 8.1 for solder joints under
different current density levels. Material properties used in the analysis are given
in Table 8.2, Figs. 8.8 and 8.9.
In these example simulations, critical TSI is set for Φcr = 0.094. Here we
conclude our discussion on electromigration and thermomigration. More examples
on this topic are provided in the publications listed in the references section.
References 457
References
Abdulhamid, M., Li, S., & Basaran, C. (2008). Thermomigration in lead-free solder joints.
International Journal of Materials and Structural Integrity, 2(1/2), 11–34.
Barberan, N., & Echenique, P. M. (1986). Effective charge of slow ions in solids. Journal of
Physics B: Atomic and Molecular Physics, 19(3), L81.
Basaran, C., Lin, M., & Ye, H. (2003). A thermodynamic model for electrical current induced
damage. International Journal of Solids and Structures, 40(26), 7315–7327.
Basaran, C., Li, S., & Abdulhamid, M. (2008). Thermomigration induced degradation in solder
alloys. Journal of Applied Physics, 103, 123520.
Black, J. R. (1969). Electromigration failure modes in aluminum metallization for semiconductor
devices. Proceedings of the IEEE, 57(9), 1587–1594.
Blech, I. A. (1976). Electromigration in thin aluminum films on titanium nitride. Journal of Applied
Physics, 47(4), 1203–1208.
Blech, I. A., & Herring, C. (1976). Stress generation by electromigration. Applied Physics Letters,
29(3), 131–133.
Boltzmann, L. (1877). Sitzungberichte der Kaiserlichen Akademie der Wissenschaften.
Mathematisch-Naturwissen Classe. Abt. II, LXXVI, 373–435.
Bosvieux, C., & Friedel, J. (1962). Electrolysis of metallic alloys. Physics Chemistry Solids, 23,
123–136.
Brandt, W., & Kitagawa, M. (1982). Effective stopping-power charges of swift ions in condensed
matter. Physical Review B, 25, 5631. Erratum Phys. Rev. B 26, 3968 (1982).
Campbell, D. R. & H.B. Huntington. (1969). Thermomigration and Electromigration in Zirconium.
Physical Review, 179(3):601–601.
Chen, C., Hsiao, H.-Y., Chang, Y.-W., Ouyang, F., & Tu, K. N. (2012). Thermomigration in solder
joints. Materials Science and Engineering: R: Reports, 73(9–10), 85–100.
Fick, A. (1855). On liquid diffusion. Philosophical Magazine and Journal of Science, 10, 31–39.
Genoni, T. C., & Huntington, H. B. (1977). Transport in nearly-free-electron metals.
IV. Electromigration in zinc. Physical Review B, 16, 1344.
Huntington, H. B., & Grone, A. R. (1961). Current-induced marker motion in gold wires. Journal of
Physics and Chemistry of Solids, 20(1–2), 76–87.
Huntington, H. B. (1972). Electro-and thermomigration in metals. Diffusion, papers presented at a
seminar of the American Society of Metals.
Kirchheim, R. (1992). Stress, and electromigration in Al-lines of integrated circuits. Acta
Metallurgica et Materialia, 40(2), 309–323.
Kuz’menko, P. F., & Osirovskii, L. F. (1962). Electric transfer of silver in copper. Izvestiya
Vysshikh Uchebnykh Zavedenii, Chernaya Metallurgiya, 11, 146–149.
Landauer, R., & Woo, J. W. F. (1974). Driving force in electromigration. Physical Review B,
10, 1266.
Lloyd, J. R., Tu, K. N., & Jasvir Jaspal, J. (2004). The physics and materials science of
electromigration and thermomigration in solders, chapter 20. In K. J. Puttlitz & K. A. Stalter
(Eds.), Handbook of lead-free solder technology for microelectronic assemblies. Taylor &
Francis.
Lodding, A. (1965). Current induced motion of lattice defects in indium metal. Journal of Physics
and Chemistry of Solids, 26(1), 143–151.
Ludwig, C. (1856). Akademie der Wissenschaften. Mathematisch-Naturwissenschaftliche Klasse,
20, 539.
Sarychev, M. E., et al. (1999). General model for mechanical stress evolution during
electromigration. Journal of Applied Physics, 86(6), 3068–3075.
Seith, W. (1955). Diffusion in Metallen, Platzwechselreaktionen. Springer.
Singh, P., & Ohring, M. (1984). Tracer study of diffusion and electromigration in thin tin films.
Journal of Applied Physics, 56(4), 899–907.
458 8 Unified Mechanics of Metals Under High Electrical Current Density. . .
Fatigue life prediction of materials has been widely studied. An extensive literature
survey of fatigue life prediction models is available in Lee and Basaran (2021);
hence, it is not necessary to repeat a literature survey herein. However, most of the
research is based on empirical curve fitting models under the framework of Newto-
nian mechanics. Fatigue test data is used for curve fitting a degradation evolution
function using stress, strain, or energy as a variable. Unified mechanics theory
(UMT), on the other hand, unifies the universal laws of motion of Newton by
incorporating the second law of thermodynamics directly into Newton’s laws at
the ab initio level. UMT introduces an additional linearly independent axis called
thermodynamic state index (TSI), which can have values between 0 and 1. Evolution
along the TSI axis follows Boltzmann’s entropy formulation and the thermodynamic
fundamental equation of the material. As a result, governing differential equations of
any system automatically include energy dissipation and degradation evolution.
When UMT is used, there is no need for an empirical degradation evolution function
obtained by curve fitting to fatigue test data. However, UMT does require deriving
the analytical thermodynamic fundamental equation of the material. In this chapter,
thermodynamic fundamental equations for several different fatigue problems are
presented.
The theory of elasticity assumes that there is no irreversible deformation during
elastic loading. However, it is well-known that under cyclic loading in a linear elastic
regime, any metal will still fatigue. Therefore, the second law of thermodynamics is
not violated. There is always irreversible entropy generation in any closed system,
which is ignored in the Newtonian mechanics-based theory of elasticity. If we apply
a cyclic load on metal within its elastic range [below the yield stress], the
corresponding strain would always be the same at every cycle, according to the
theory of elasticity, because the second law of thermodynamics is not included in the
universal laws of Newton directly. In summary, there is no TSI axis in the Newto-
nian space-time coordinate system. Moreover, in Newtonian mechanics, the laws of
thermodynamics are satisfied separately from than laws of Newton [not in the same
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2022 459
C. Basaran, Introduction to Unified Mechanics Theory with Applications,
https://doi.org/10.1007/978-3-031-18621-9_9
460 9 Fatigue of Materials
Fatigue life prediction with the unified mechanics theory eliminates the need for
curve fitting an empirical evolution function to fatigue test data. In unified mechanics
theory, the degradation evolution occurs along with the linearly independent ther-
modynamic state index (TSI) axis which is quantified by the thermodynamic fun-
damental equation of the material. The coordinate value of TSI starts at 0 and ends at
1. However, because TSI is an exponential function, computations are usually
stopped before it reaches 1.00, which is possible only if the power term is infinite.
The thermodynamic fundamental equation provides the entropy generation quantity
in the material under a given load. Based on this entropy value, a new TSI coordinate
is calculated, which is the normalized form of Boltzmann’s entropy formulation of
the second law of thermodynamics. In the formulation, the free energy of the system
is degraded by the entropy generated.
In this section, we derive the thermodynamic fundamental equation for high cycle
fatigue where the sample is subjected to a nominal stress level that is well below the
yield stress of the material and the loading frequency is quasi-static. The following
assumptions are made in the derivation:
1. Applied maximum nominal stress is below the yield stress of the metal; hence, no
macroscopic plastic deformation is expected to happen.
2. A mechanism called microplasticity is expected to happen at some micro-defect
sites at the micro-level (Lemaitre et al., 1999; Doudard et al., 2005;).
3. Point defects, including atomic vacancies, interstitials, and impurities, can be
built-in with the original crystal growth or created by energy input during the
fatigue process.
4. Input mechanical energy increases atomic vacancies and dislocation densities.
However, the increasing dislocation density only causes hardening in the micro-
level and never induces macroscopic plastic deformation as the maximum applied
nominal stress is below the metal’s yield stress value. The vacancy generation/
diffusion and dislocation motion (e.g., cross-slip) around inclusions induce irre-
versible deformations at a micro-level in elastic cyclic loads, (Marti et al., 2020; ;
Ho et al., 2017; Mughrabi, 2009).
9.1 Predicting High Cycle Fatigue Life of Metals 461
Six mechanisms generate entropy during high cycle metal fatigue at quasi-static
frequencies. These are:
I. Configurational entropy, ΔSc
II. Vibrational entropy, ΔSvib
III. The entropy generation due to vacancy concentration gradient-driven diffusion,
ΔSd
IV. The entropy from heat conduction, ΔST
V. The entropy generation due to atomic-friction-generated heat, ΔSr
VI. The entropy generation due to microplasticity ΔSμp.
Because entropy is an additive property, we can write the thermodynamic
fundamental equation for the total entropy production as follows:
N!
ð9:2Þ
ðN - nÞ!n!
where N is the number of lattice sites and n is the number of vacancies. Replacing the
disorder parameter W in Boltzmann-Planck entropy equation yields the following
form of configurational entropy (Kelly et al., 2012; Abbaschian & Reed-Hill, 2009):
N!
Sc = kB ln ð9:3Þ
ðN - nÞ!n!
Atomic defects are atoms missing from their proper lattice sites. These vacancies
are rearranged due to the thermally activated mass transport. The change in config-
urational entropy ΔSc from the temperature state of 1 to the temperature state of 2 is
given by
462 9 Fatigue of Materials
N! N!
ΔSc = kB ln - k B ln ð9:4Þ
ðN - n2 Þ!n2 ! ðN - n1 Þ!n1 !
where n1 and n2 are the number of vacancies at the temperature state of 1 and the
temperature state of 2, respectively. Equation (9.4) can also be written in the
following form:
1 - C v2 1 - C v1
ΔSc = kB N C v2 ln - Cv1 ln ð9:5Þ
C v2 C v1
where Cv1 and Cv2 are the thermodynamic equilibrium vacancy concentrations at the
temperature state of 1 and the temperature state of 2, respectively.
II. Vibrational Entropy Generation
The vibrational entropy is also a concept from statistical thermodynamics that
replaces disorder parameter W in the Boltzmann-Planck entropy equation with the
phase state of the atoms as they vibrate, which is defined by momentum and position
coordinates (Fultz, 2010; Atkins et al., 2018; Wollenberger, 1996; Laughlin &
Hono, 2014; Kelly et al., 2012; Abbaschian & Reed-Hill, 2009; Fultz, 2010).
The vibrational state of the atoms changes when vacancies are created. There are
various models proposed to precisely calculate vibrational entropies when a certain
number of vacancies are removed (Mishin et al., 2001).
We can assume that the change in vibrational entropy is the same when each
atomic vacancy is created (Abbaschian & Reed-Hill, 2009; Burton, 1972). There-
fore, the total vibrational entropy in the system can be given by
In which n is the number of atomic vacancies created and ΔSv is the change in
vibrational entropy when one atomic vacancy is created. Equation (9.6) is valid
because of the additive property of entropy.
The variation ΔSvib is given by
temperature increase in the sample is about 15 °C. The purpose of this example
where ΔT = 100 °C is used is to show the magnitude of entropy generation for a
hypothetical case.
III. Entropy Generation due to Vacancy Concentration Gradient-Driven
Diffusion
During the fatigue process concentration of vacancies in the metal are not uniform.
As a result, there is a vacancy concentration gradient. Vacancy concentration
gradient-driven diffusion and thermomigration are governed by the vacancy conser-
vation equation (or mass conservation) (Basaran & Lin, 2007; Ye et al., 2004; Ye
et al., 2006).
464 9 Fatigue of Materials
The concentration of atomic vacancies should be higher around the edge and
lower around the center for a specimen with a round cross-section (cracks usually
develop from outside to inward); hence, they cause the vacancy diffusion. The
entropy generation from vacancy gradient-driven diffusion ΔSd is extensively stud-
ied by Basaran and Lin (2008), Li et al. (2009), and Yao and Basaran (2013a, b, c). It
is given by
Z 2 !
t
C v Dv Q ∇T kB T
ΔSd = þ ∇c dt ð9:8Þ
t0 kB T 2 T c
TΔSv - Δhf
Cv = exp ð9:9Þ
kB T
C v,398 K = 2:35 × 10 - 13
ð9:10Þ
C v0 = C v,298 K = 6:1 × 10 - 18
C 0:1C v,388 K
cedge = v,398 K , ccenter =
C v0
Cv0 ð9:11Þ
∇c = cedge - ccenter =1 cm = 3:5 × 106 m - 1
and R = 8:3145 moleJ K [63]. At 398 K, Dv = D0 exp -RTQd ≈ 3 × 10 - 37 ms .
2
m2 2
2:35 × 10 - 13 × 3 × 10 - 37
1:38 × 10 - 23 J=K × 398 K
_
Sd = s 3:5 × 10 m
6 -1
1:38 × 10 - 23 J=K × ð398 KÞ2 3852
= 8 × 10 - 91 JK - 1 s - 1
ð9:13Þ
From this calculation, it is clear that the entropy generation due to mass transport
is negligible compared to other entropy generation mechanisms.
466 9 Fatigue of Materials
section to the grip section (Fig. 9.3). The thermal conductivity of steel is K h = 50 Km
w
.
After several cycles, the temperature increases from T0 = 298 K to Ti = 398 K and
then remains in a steady-state until near failure at around 1.4 × 105 cycles (operating
frequency at 30 Hz). The evolution of entropy generation due to heat conduction
ΔST for a specimen is shown in Fig. 9.4.
V. Entropy Generation due to Atomic-Friction (Scattering)-Generated Heat
The entropy generation due to atomic-friction-generated heat is given by
Z t
ρr
ΔSr = dt ð9:15Þ
t0 T
Fig. 9.4 Entropy production due to thermal conduction (JK-1 m-3) versus the number of cycles
9.1 Predicting High Cycle Fatigue Life of Metals 467
500
450
400
350
300
250
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
time(picosecond)
Fig. 9.5 Temperature variation with time under uniaxial compression under different strain rates
300
temperature (K)
280
260
240
220
200
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
time(picosecond)
0.001/ps 0.005/ps 0.01/ps
Fig. 9.6 Temperature variation with time under uniaxial tension for different strain rates
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi rffiffiffi
2 μ
f = ðSμ - αμd Þ : ðSμ - αμd Þ - σ =0 ð9:21Þ
3 y0
where Sμ is the microscopic deviatoric stress, αμd is the deviatoric part of the
microscopic back-stress of kinematic hardening, eμp is the microscopic equivalent
plastic strain, σ μy0 is the microscopic yield stress, (σ μ - αμ) is the translational
direction of the microscopic yield surface under Ziegler’s rule, and h is the kinematic
linear hardening modulus defined by the slope of the stress-strain curve for a finite
plastic strain value as follows: h = σ μ - σ μy0 =εμp .
The entropy generation due to microplasticity is then given by
Z t σ μ : ε_ μp
ΔSμp = ϕf v dt ð9:23Þ
t0 T
in which σ μ and ε_ μP are micro-stress and strain rate tensors. During the microscopic
plastic deformation, some plastic work is stored as dislocation stored energy, which
is accounted for as the hardening of the material in Eq. (9.22). fv is a constant that
establishes a relation between the macro-stresses and micro-stresses utilizing the law
of localization and homogenization. It is the ratio of the activated volume of the
micro-defects VD to the elastic matrix Vmatrix (Charkaluk & Constantinescu, 2009):
VD
fv = ð9:24Þ
V matrix
Figure 9.8 depicts the visualization of microplasticity, inside the hexagon region,
microscopic stress and microplastic strain. In Table 9.2 material constants used for
simulation are given.
The microscopic yield stress in this microplasticity model is the mean fatigue
limit of the specimen. We should clarify that a fatigue limit does not mean that there
is no fatigue failure if the stress is below the fatigue limit. Not resulting in fatigue at
any stress level would violate the second law of thermodynamics. It just indicates
that no microplasticity occurs below the macroscopic fatigue limit stress. However,
all other entropy generation mechanisms are still active, regardless of stress level and
all materials’ fatigue at stress levels below the fatigue limit.
In Eq. (9.23), TSI, (ϕ), is a metric of entropy evolution. It is assumed that the
increase in the percentage of activated dislocation slip planes follows the second law
of thermodynamics, which is represented by TSI. However, each microplasticity site
is considered to be independent and has a constant fv. Microplasticity sites are not
expected to coalesce before the onset of final failure (Fig. 9.8).
472 9 Fatigue of Materials
Fig. 9.9 Dislocation slip bands in ferrite grains on the surface of the specimen activated at 250 MPa
stress amplitude. N = 107 cycles, ΔT < 15 ° C. (After Torabian et al. (2017))
There is no fv data available in the literature obtained by TEM or SEM for DP600
steel at hand. However, Torabian et al. (2017) provided fatigue test data on DP600
steel and the results of fractography studies. The appearance of slip bands in ferrite
grains on the surface of the specimen around the gage part under 30 Hz fatigue
loading is shown in Fig. 9.9. The arrows indicate the slip bands, which are estimated
to be around 10% of the observed surface. We use fv=10% as the initial value for the
calculations. Of course, fractography studies for individual cases are necessary, since
the defect ratio in materials is the highly variable depending quality of the material
and manufacturing methods.
The microplasticity calculation in this section is based on the following
assumptions:
1. The hardening behavior of the material is approximated by a bilinear kinematic
hardening curve. Hardening modulus is defined as the slope between (lower-
yielding strength) LYS and UTS (ultimate tensile strength) from the stress-strain
curve, as a first approximation.
2. The microscopic hysteresis loops are stabilized, and they are the same at each
defect site. The evolution of TSI, ϕ, from 0 to 1 only induces microplasticity at
more defect sites, without any macroplasticity.
3. The energy dissipation due to kinematic hardening at microplasticity sites is small
enough to be ignored (Naderi et al., 2010)
9.1 Predicting High Cycle Fatigue Life of Metals 473
Fig. 9.10 Micro-stress-strain curve for each inclusion (defect) site under 300 MPa nominal stress
amplitude
Fig. 9.11 Entropy generation (MJK-1 m-3) due to microplasticity versus the number of cycles
under 300 MPa nominal stress amplitude
two orders of magnitude bigger. Hence, we conclude that the total entropy produc-
tion, the thermodynamic fundamental equation, can be simplified as
Incorporating the TSI into microplastic work accounts for the increased proba-
bility of activated micro-defect sites that emerge in the specimen during the fatigue
process.
TSI is an exponential function, hence never reaches unity. In practice, we
determine a critical value ϕcr as a threshold. In this example, the specimen is
assumed to fail when ϕ reaches ϕcr = 0.995, since the probability of reaching
maximum entropy at this state is 99.5%.
The temperature of the specimen plays an important role in high cycle fatigue life.
From the entropy point of view, the cumulative entropy production and TSI calcu-
lation are related to the temperature of the specimen. Inspecting Eq. (9.26), if the
temperature is controlled (i.e., the specimen is air-cooled, e.g., near room tempera-
ture), the resulting entropy production will be larger, hence resulting in shorter
fatigue life. However, self-heating during loads at low frequencies (i.e., 30 Hz)
does not exceed ΔT = 15 K. This amount of temperature difference will not change
the fatigue life significantly. However, as the frequency of the loading increases, ΔT
increases. In ultrasonic frequencies, ΔT reach several hundred degrees.
The temperature of the specimen is governed by the fully coupled
thermomechanical equation derived from classical continuum mechanics as follows:
∂σ e ∂Ak _
k h ∇2 T = ρC T_ - σ : ε_ p - ρr - T : ε_ þ Ak - T Vk ð9:27Þ
∂T ∂T
∂s
where ρ is the mass density; C = T ∂T is specific heat; εe and εP are elastic and plastic
strain vectors, respectively; σ is the stress tensor; r is the strength per unit mass of the
internal distributed heat source; and Ak is a thermodynamic force associated with the
internal thermodynamic variables, Vk.
Using Eq. (9.27), we can calculate the evolution of temperature in the specimen
due to mechanical work. To simplify this equation, we can ignore plastic strain ε_ p ,
internally distributed heat source r, and other thermodynamic variables, Ak V_ k. These
assumptions are justified, because of the following reasons:
1. The sample is subjected to elastic loads only, as such, there is no uniform
macroplastic strain. However, the contribution of microplasticity is not neglected.
2. Based on molecular dynamics simulations, the contribution of r is negligible at
30 Hz loading.
3. Ak - T ∂Ak V_ k represents the non-recoverable energy corresponding to the
∂T
internal coupling source (such as grain coarsening, phase transformation, etc.).
However, for metals, this non-recoverable energy only represents 5–10% of the
mechanical dissipation (plastic work) and is often negligible for high cycle
fatigue.
476 9 Fatigue of Materials
The temperature in the sample is calculated at each cycle using Eq. (9.30).
Figure 9.12 shows the temperature evolution for DP600 steel operating at 30 Hz
for various stress amplitudes obtained using Eq. (9.30). The entropy generated due to
microplasticity is strongly affected by the stress amplitude. A rapid climbing stage
followed by a steady state is observed. However, Eq. (9.29) cannot capture the
abruptly raising temperature when a specimen is near failure when the temperature
spikes.
Fig. 9.12 Temperature evolution versus the number of cycles at various stress amplitudes
9.1 Predicting High Cycle Fatigue Life of Metals 477
Figure 9.13 shows accumulated entropy at failure for various stress amplitudes.
Accumulated total entropy at failure is a constant value and is sometimes also
referred to as the fatigue fracture entropy (FFE), which is known to be independent
of stress amplitude.
Figure 9.14 shows the thermodynamic state index as a function of the number of
cycles. TSI, ϕ, reaches 1 around 8.5 × 104 cycles under 325 MPa applied stress. For
stress amplitude of 300 MPa, TSI, ϕ, reaches 1 at 1.4 × 105 cycles. For stress
amplitude of 275 MPa, TSI, ϕ, reaches 1 at around 3.6 × 105 cycles.
If we relate the stress amplitude with the number of cycles to failure, a bilinear
curve can be plotted, in Fig. 9.15.
In Fig. 9.14, it is easy to observe that the TSI increases slowly in the early stage of
cycling and then suddenly begins to evolve rapidly after a certain critical threshold.
This trend can be explained by Eq. (9.28), in which TSI is included in the
microplasticity entropy production equation. In the early stages, there are too few
microplasticity sites. Because microplasticity is the largest contributor to entropy
generation, the corresponding TSI calculated by the entropy production is small at
the early stages. After a certain number of cycles, enough micro-defects are acti-
vated, and more entropy is produced.
Fig. 9.13 Accumulated total entropy production versus the number of cycles for various stress
amplitudes [this is also referred to as the fatigue fracture entropy]
478 9 Fatigue of Materials
Fig. 9.14 TSI evolution versus the number of cycles for various stress amplitudes
Fig. 9.15 Fatigue life prediction with UMT for stress amplitude versus the number of cycles for
various stress amplitudes [when TSI reaches ϕcr = 0.995, the specimen is considered as failed]
9.1 Predicting High Cycle Fatigue Life of Metals 479
Fig. 9.16 Comparison of fatigue data and unified mechanics theory predictions for tension-
compression fatigue tests on dual-phase steel (DP600) without any pre-strain [the experimental
SN curve is obtained from Torabian et al., 2017]
In Fig. 9.16, simulation results are compared with test data. It shows that the
predicted cycles to failure obtained from unified mechanics theory and the test data
match reasonably well. Test data in Fig. 9.16 shows expected scatter, especially at
lower stress levels. It is very well-known that fatigue test data is always stochastic.
Unified mechanics theory, which is based on Boltzmann’s entropy formulation, also
yields stochastic expected life.
In this section, unified mechanics theory is used for predicting the high cycle
fatigue life of DP600 steel. The thermodynamic fundamental equation of metals
under high cycle fatigue is derived. In the fatigue life prediction, no empirical curve
fitting function is needed. The entropy generation due to atomic vacancy configura-
tion, atomic vibration, and mass transport is extremely small compared to entropy
generation due to microplasticity. It is also assumed that entropy generation due to
grain coarsening during high cycle fatigue is negligible.
The entropy generation due to microplasticity is the most dominant entropy
generation mechanism. This mechanism can be visualized as some microplastic
inclusions at localized defect sites inside an elastic matrix which has its micro-
stress and micro-strain based on laws of localization and homogenization. Energy is
dissipated through the microplastic work at the locations of these inclusions. The
number of inclusions increases as TSI ϕ increases from 0 to ϕcr. Metal high cycle
fatigue life prediction results based on the theory of unified mechanics are compared
with experimental test data. It is shown that the prediction and test data match
very well.
480 9 Fatigue of Materials
Low cycle, high cycle, very high cycle, and ultrahigh cycle fatigue occur on different
entropy generation mechanisms. In each one, the magnitude of entropy generation
for each mechanism is different. For example, during high cycle fatigue, internal
heat generation is usually less than ≈15 ° C temperature change in a test sample,
while in very high cycle, fatigue temperature change can go to ≈150 ° C or much
higher. On the other hand, while microplasticity is the dominant entropy generation
mechanism in a high cycle and very high cycle fatigue, it is assumed that it does not
happen in ultrahigh cycle fatigue, which happens when the stress value is below the
nominal fatigue limit of a metal.
In this section, predicting the very high cycle fatigue life of metals with the
unified mechanics theory is discussed. Simulation results are compared with the
fatigue life test data. It is shown that the unified mechanics theory can predict very
high cycle fatigue life very well, without relying on traditional curves fitting an
empirical function to fatigue test data.
The objective is to predict the fatigue life of metals subjected to ultrasonic
vibration where the maximum nominal stress is below the yield stress of the material.
The nomenclature used in this section is given in Table 9.4.
Table 9.4 Nomenclature used for ultrasonic vibration fatigue life model
Nomenclature
ϕ Thermodynamic state index ρ Mass density
ϕcr Critical thermodynamic state index r Internal heat generation [IH]
Δs Change in total specific entropy rdrag IH due to drag mechanism
ms The molar mass of the material rdis IH due to dislocation motion
R Universal gas constant Bdrag Drag coefficient
ST Thermal conduction entropy v Dislocation velocity
Sr Internal friction entropy μ Shear modulus
Sμp Microplasticity entropy b The magnitude of the Burger’s
vector
kh Thermal conductivity ϱ Total dislocation density
σμ Microscopic stress tensor ϱm Mobile dislocation density
εμe The microscopic elastic strain tensor αH Taylor’s constant
εμP The microscopic plastic strain tensor γ_ Shear strain rate
Σ Macroscopic stress tensor a Lattice constant
E Macroscopic strain tensor E Young’s modulus
fv The volume fraction of activated micro- υ Poisson’s ratio
defects
Sμ Microscopic deviatoric stress α Thermal expansion coefficient
αμd Deviatoric of microscopic back-stress cV Specific heat capacity
eμp Microscopic equivalent plastic strain θ Total change in temperature
h Hardening modulus τeq Equivalent conduction time
σy Microscopic yield stress (fatigue limit) t eq Fraction of heat conducted
Ψf Frequency coefficient
9.2 Predicting Ultrasonic Vibration Fatigue Life 481
During the very high cycle fatigue of metals, six prominent mechanisms generate
entropy. These are:
I. Configurational entropy Sc.
II. Vibrational entropy, Svib
III. Entropy generation due to diffusion Sd
IV. Entropy generation due to heat conduction ST
V. Entropy generation due to internal heat generation Sr
VI. Entropy generation due to microplasticity, Sμp
Among these six mechanisms, the first three are too small compared to other
mechanisms (Lee & Basaran, 2021). Therefore, we discuss the entropy generation
due to the last three mechanisms. Entropy is an additive property; hence, we can
write the following equation for the total entropy to obtain the thermodynamic
fundamental equation:
In Eq. (9.32), the amount of entropy generation due to heat conduction is strongly
dependent on the test duration. To calculate the order of magnitude of this entropy
generation mechanism for ultrasonic vibration fatigue, the following boundary
conditions are used:
According to measurements during ultrasonic vibration testing, the temperature at
the gage section of the steel [EN 1.0577] test specimen raises from 300 K to about
450 K, while the nominal normal stress amplitude is 400 MPa. The thermal
conductivity of the material is kh = 50WK-1 m-1. Assume the temperature varies
parabolically along the centerline of the sample from the gage section to the
grip section, as T = C1x2 + C2x + C3. If the boundary conditions are set to be
Tgage(t) = T(x = 0, t), Tgrip(t) = T(x = ± L/2, t) where L2 = 30 mm is the length
between gage and grip sections. The one-dimensional thermal gradient
that changes
as the number of cycles increases can be obtained as ∇T = L8x2 T grip ðt Þ - T gage ðt Þ.
II. Entropy Generation due to Microplasticity
During very high cycle fatigue, microplasticity happens at defect sites. The concept
of microplasticity was introduced in the previous section of this chapter. Hence, we
just present the final equation. The entropy generation equation due to
microplasticity is given by
Z !
t
σ μ : ε_μP
ΔSμp = Ψf ϕf v dt ð9:33Þ
t0 T
the generation of vacancies in persistent slip bands and their diffusion toward the
matrix favor their emergence. Marti et al. (2020) concluded that cross-slip activation
is the main mechanism that leads to early slip markings.
For the ultrasonic vibration of BCC metals, Torabian et al. (2017) reported that
there is a transition between material deformation mode from thermally activated
regime at stress amplitudes below the fatigue limit to athermal mode at stress
amplitudes above the fatigue limit. Since in this example the macroscopic stress
amplitude is above the fatigue limit, the athermal regime where the mobilities of
screw and edge dislocations are equivalent and screw dislocations can cross-slip is of
interest. The cross-slip probability P is given by Marti et al. (2020):
l δt τ - τcsR
P=β exp css V ð9:34Þ
l0 t 0 kB T
where β is a normalization coefficient ensuring that 0 < P < 1, l/l0 is the ratio of the
length of a screw dislocation segment to a reference length l0 = 1 mm, δt/t0 is the
ratio of the considered time to a reference time, V is the activation volume associated
with cross-slip, (τcss - τcsR) compares the resolved shear stress τcss on the cross-slip
system to a threshold critical stress τcsR required to activate cross-slip, kB is
Boltzmann’s constant, and T is the absolute temperature. The duration in which
τcss is greater than τcsR in one cycle is given by Marti et al. (2020):
0 1
τcsR
2 arcsin τcss
Δt 1cycle = @1 - A1 ð9:35Þ
π f
where f is the frequency of cyclic loading. Assuming all the parameters in Eq. (9.34)
are frequency insensitive, Eq. (9.35) shows that the time to activate cross-slip is
inversely related to the frequency. For two different loading frequencies, 30 Hz and
20 KHz at the same stress amplitude, the following equation must be satisfied to
obtain the same cumulated probability of cross-slip activation:
where TA is the duration when τcss is greater than τcsR and N is the number of cycles.
According to Eqs. (9.35) and (9.36), at a given stress amplitude, the material
subjected to f1 loading frequency reaches the same cumulative probability of acti-
vating initial micro-defects (early slip marking) as the material under f2 loading
frequency when
N1 f 1
= ð9:37Þ
N2 f 2
484 9 Fatigue of Materials
Z !
t
σ μ : ε_μP
ΔSμp = Ψf ϕf v dt ð9:38Þ
t0 T
8
< 1, if f p < 100 Hz ðtypical servo - hydraulic machineÞ
Ψf = 100 ð9:39Þ
: , if f p is in the order of KHz ðultrasonic vibrationÞ
fp
The frequency coefficient Ψf = fr/fp is used to account for the fact that at higher
frequencies, more cycles are needed to activate the same volume of dislocation slip
bands. It indicates that at a given stress amplitude, under the same number of cycles,
the specimen produces lower cumulative microplasticity-generated entropy at high
frequency than at low frequency, because under the same number of cycles, the
probability of generating microplasticity is smaller at high frequency than at low
frequency. The high and low frequency mentioned here refer to the ultrasonic
vibration frequency and traditional servo-hydraulic machine frequency, respectively.
Equation (9.39) is only applicable to up to the frequencies in the order of kHz,
because at higher frequencies (e.g., MHz or GHz [ultrahigh cycle fatigue]), different
micro-mechanisms and fatigue failure occur (Marti et al., 2020; Torabian et al.,
2017). We should also point out that Eq. (9.39) is also applicable only for the class of
materials where dislocation slip band formation dominates the crack initiation
process (Mughrabi, 2002).
In the literature, many micromechanics models for very high cycle fatigue life
prediction utilize cyclic irreversible slip, which is equivalent to fv in the formulation
presented above.
III. Entropy Generation due to Atomic-Friction (Scattering)-Generated Heat
The entropy change due to internal atomic-friction (scattering)-generated heat is
given by
Z t
ρr
ΔSr = dt ð9:40Þ
t0 T
At the atomic level, the temperature is defined as the intensity of atomic vibra-
tions. The internally generated heat per unit mass, r, is the increasing intensity of
atomic vibrations in the lattice due to events such as the breaking of atomic bonds,
phonon-phonon scattering, phonon-electron scattering, electron scattering (referred
to as internal friction here), and dislocation motions (Gao, 1997; Ragab & Basaran,
2009; Chu et al., 2015). We should emphasize that this term is different than the heat
generated through the macroscopic observable plastic work or thermoelastic sources.
9.2 Predicting Ultrasonic Vibration Fatigue Life 485
2. The dislocation motion during the plastic deformation (Huang et al., 2008; Parvin
& Kazeminezhad, 2016). The internal heat generation equation for this mecha-
nism is given by
1 2 pffiffiffi
ρrdis = μb ϱ_ - αH μb ϱγ_ ð9:42Þ
2
where ρ is the mass density and Bdrag and v represent the effective drag coefficient
and velocity of dislocation, respectively. The terms μ, b, ϱ, and αH are the shear
modulus, the magnitude of Burger’s vector, total dislocation density, and Taylor’s
hardening parameter, respectively. In Eq. (9.41), the velocity of dislocation is related
to the applied shear strain rate, γ_ , as follows:
γ_
v= ð9:43Þ
ϱm b
where ϱm is the density of mobile dislocations. Hence, the entropy change due to
internal friction can be given by
Z Z t drag pffiffiffi
t
ρr ϱB v v þ 12 μb2 ϱ_ - αH μb ϱγ_
ΔSr = dt = dt ð9:44Þ
t0 T t0 T
Using Eq. (9.44), the evolution of ΔSr for the specimen under ultrasonic vibration
with 400 MPa stress amplitude is shown in Fig. 9.20.
Thermodynamic Fundamental Equation
Finally, the thermodynamic fundamental equation can be given in the
following form:
Z t Z t !
∇T ∙ ∇T σ μ : ε_μP
Δs = - kh dt þ Ψf ϕf v dt
t0 ρT 2 t0 ρT
Z t drag pffiffiffi
ϱB v v þ 12 μb2 ϱ_ - αH μb ϱγ_
þ dt ð9:45Þ
t0 ρT
486 9 Fatigue of Materials
To calculate the TSI evolution versus the number of cycles, material constants
need to be obtained beforehand. The temperature evolution in the sample is com-
puted using a fully coupled thermal-mechanical equation. To verify the computed
temperature from the fully coupled thermal-mechanical equation, a thermographic
camera is used to record the temperature profile of the specimen focusing on the gage
center during fatigue testing.
Modal Analysis
An ultrasonic vibration fatigue test is usually carried out in the resonance frequency
of the sample in the axial direction. Therefore, modal analysis is necessary to
determine the exact natural frequencies of the tested specimen. To compute the
natural frequencies of the test specimen, usually finite element analysis is utilized.
The tension-compression mode [in the axial direction] resonance frequency of the
sample was determined to be 19,864 Hz.
Then, for the axial direction resonance frequency, the stress distribution in the
sample must be determined for selected levels of the vibration amplitude applied at
the support end of the sample. A linear relationship between the vibration amplitude
and the average stress in the center of the sample is obtained:
σ = 20,510 A ð9:46Þ
where σ is the average normal stress in the center of the sample in MPa and A is the
vibration amplitude in mm. Finite element analysis results indicate that stress is
greater on the surface of the specimen and smaller in the center. These stresses are in
the range of about ±1% of the average stress in the narrow gage section.
The average stress is σ = 380 MPa for vibration amplitude of A = 0.0185 mm
(Fig. 9.17).
It is shown that a temperature gradient between the specimen’s gage section and
grip section increases slowly in phase 2 and then rises rapidly when a macro crack
forms in phase 3.
Thermodynamic State Index (TSI) Computation
Material properties used in the example are given in Table 9.5.
The microscopic yield stress is the mean fatigue limit of the specimen; it is
assumed that no microplasticity occurs below the macroscopic fatigue limit.
It should be noted that the microplasticity model in this subsection is based on the
following assumptions:
1. The hardening behavior of the material is approximated by a bilinear kinematic
hardening. Hardening modulus is defined as the slope between (lower yield
strength) LYS and UTS (ultimate tensile strength) from the test stress-strain
curve, as a first approximation.
2. The microscopic hysteresis loops are stabilized and they are the same at each
defect site. The evolution of TSI, ϕ, from 0 to 1 only induces microplasticity at
more defect sites; however, they never coalesce until the final rupture.
9.2 Predicting Ultrasonic Vibration Fatigue Life 487
Fig. 9.17 (a) Sample dimensions along the axis, (b) normal stress distribution along with the
sample, (c) displacement distribution along the specimen axis for average stress of σ = 380 MPa.
(After Lee et al. (2022))
where kbk is the magnitude of Burger’s vector, a is the lattice constants in units of an
angstrom (Å), and h, k, l are the components of Burger’s vector.
The parameters in Eqs. (9.41)–(9.43) can be obtained by discrete dislocation
dynamics. In the literature, these values are available for the material used in this
example, as follows:
1. Burger’s vector values are provided in Dingley and Hale (1966). The terms
related to Burger’s vector are summed according to their proportion.
2. Taylor’s hardening parameter αH is a dimensionless parameter ranging from 0.05
to 2.6. It is used to characterize the relation between the plastic flow stress of a
pffiffiffi
material and its dislocation density, τ = τ0 þ αμb ρ . Taylor’s parameter from
Davoudi and Vlassak (2018) is used, since it has a similar stress-strain curve as
the material of interest in this example.
3. The initial dislocation density value for mild steel is given by Bin Jamal
et al. (2021).
4. Moving dislocations experience a drag due to their interaction with the crystal
structure, and this drag coefficient Bdrag determines the dislocation glide time
between obstacles (Blaschke, 2019; Blaschke et al., 2019). In terms of the
dominant drag mechanism, it is assumed that the phonon scattering with dislo-
cations [phonon-defect scatter] constitutes the dominant mechanism. Bdrag value
is provided by Blaschke et al. (2019).
5. The evolution of dislocation density during plastic deformation has been exten-
sively studied in the literature. It is widely accepted that when nominal macro-
stress is well below the yield stress, the microplastic deformation that results in
stabilized hysteresis micro-stress-strain loops happens only at micro-defect sites.
Therefore, a constant dislocation generation rate Δϱ at each defect site is
assumed.
The material parameters needed in Eq. (9.45) are summarized in Table 9.6.
Figure 9.19 shows entropy generation, Eq. (9.44), due to internal heat generation.
9.2 Predicting Ultrasonic Vibration Fatigue Life 489
The temperature increase due to mechanical work is governed by the fully coupled
thermomechanical equilibrium equation given below:
_ ∂σ e ∂Ak _
ρcV T - k∇ T = σ : ε_ þ T
2 p
: ε_ þ ρr þ T Ak - Vk ð9:48Þ
∂T ∂T
where the first term is the rate of heat storage, the second term is the heat conduction,
the third term is the energy dissipation due to macroplastic work, the fourth term is
the thermoelastic coupling source, and the fifth term is the heat generation due to
internal frictions, and the last term is non-recoverable energy corresponding to the
∂s
internal source. cV = T ∂T is defined as specific heat capacity; εe and εP are elastic
and plastic strain tensors, respectively; σ is the stress tensor; r is the strength per unit
mass of the internal distributed heat generation; and Ak is a thermodynamic force
associated with the internal thermodynamic variable, Vk.
Equation (9.48) yields the evolution of temperature due to mechanical work with
properly imposed boundary conditions. To simplify this equation, we can ignore
plastic strain ε_ p and other thermodynamic variables Ak V_ k . These assumptions are
valid, because
1. We are investigating mechanical response under elastic loads only, as such, there
is no uniform plastic strain at the macro-level. However, the contribution from
microplasticity is not neglected.
2. Ak - T ∂Ak V_ k represents the non-recoverable energy corresponding to the
∂T
internal sources (e.g., sound waves, grain coarsening, phase transformation,
etc.). However, for metals, this non-recoverable energy only represents 5% or
less of the total dissipation.
490 9 Fatigue of Materials
where ρ is the mass density and Bdrag and v represent the effective dislocation-drag
coefficient and velocity of dislocation, respectively. The terms kh, ϕ, and cV are
coefficient of heat conduction, the thermodynamic state index, and the specific heat
capacity, respectively. The relative surface ratio covered by activated micro slip
bands is represented by fv. The terms μ, b, ϱ, and αH are the shear modulus of the
bulk material, the magnitude of Burger’s vector, total dislocation density, and
Taylor’s hardening coefficient, respectively.
If we ignore the thermal fluctuation due to thermoelastic dissipation and assume
that the temperature field is uniform, we can simplify the conduction term
as - k h ∇2 T ffi ρcV τθeq (Torabian et al., 2017).
Where the terms, θ, and τeq are the total change in temperature (Tn - T0) and an
equivalent conduction time used to characterize the heat transfer between the
specimen and the surroundings (Charkaluk & Constantinescu, 2009; Zhang et al.,
2013; Chrysochoos et al., 2012).
Then, the following equation is obtained:
h i
ðT - T 0 Þ
= Ψf ϕf v σ μ : ε_μP þ ϱBdrag v ∙ v þ μb2 ϱ_
1
ρcV T_ þ n
τeq 2
pffiffiffi
- αH μb ϱγ_ ð9:50Þ
μ
Δt Δt Δt γ_ p nþ1 2
ΔT nþ1 = - ðT n - T 0 Þ þ Ψf ½ϕf v ðσ μ : ˙εμP Þ þ ϱBdrag
τeq ρcV ρcV ϱm b
Δt 1 2 Δϱnþ1 Δt pffiffiffi
þ μb - α μb ϱγ_ = 0:
ρcV 2 Δt ρcV H
ð9:51Þ
After rearranging the terms in Eq. (9.51), we get the following equation for
calculating the change in temperature:
h i μ
Ψf γ_ p nþ1 2
ϕf v σ μ nþ1 ε_μP Δt þ
1
ΔT nþ1 = - t eq ðT n - T 0 Þ þ ϱBdrag Δt
ρcV ρcV ϱm b
1 1 2 1 pffiffiffi
þ μb Δϱnþ1 - α μb ϱγ_ μp Δt
ρcV 2 ρcV H
ð9:52Þ
9.2 Predicting Ultrasonic Vibration Fatigue Life 491
Fig. 9.18 Measured surface temperature at the gage center of the 11 monitored samples versus the
number of cycles (in millions) during fatigue tests. (After Lee et al. (2022))
Δt
In Eq. (9.52) dimensionless time factor, t eq = τeq , represents the fraction of heat
conducted. Equivalent conduction time, τeq, the value measured and reported by
Charkaluk and Constantinescu (2009), is used in this example.
Temperature evolution in the material is calculated at each cycle using Eq. (9.52).
It should be emphasized that Eq. (9.52) is used to simulate the phase 1 and phase
2 temperature evolution. Equation (9.52) cannot capture the abrupt temperature rise
in phase 3 when a macro crack initiates, because the mechanisms of macroplasticity,
macro crack formation, and crack propagation are not included in the formulation.
Figure 9.18 shows the measured surface temperature at the gage center versus the
number of cycles, and Fig. 9.19 shows the simulations for the same based on
Eq. (9.52).
492 9 Fatigue of Materials
Fig. 9.19 Surface temperature predictions at gage center versus the number of cycles under various
stress amplitudes based on Eq. (9.52). Compared to test data (Fig. 9.18). (After Lee et al. (2022))
The entropy generation due to heat conduction, microplasticity, and internal frictions
during phase 1 and phase 2 is obtained using Eq. (9.45). The measured temperature
rise during phase 1 (shown in Fig. 9.18, up to 2.5 × 105 cycles) matches the
predictions shown in Fig. 9.19. Some researchers reported a change in temperature
in phase 1, ΔTphase1, to be a constant value (Guo et al., 2015), analogous to the elastic
modulus of a material. The temperature rise during phase 2 follows a parabolic
function. When the simulation results shown in Fig. 9.19 are compared with test data
shown in Fig. 9.18 [before the onset of phase 3], we observe a small difference at the
end of phase 2. This discrepancy may be due to the uncertainty of the formation of
cross-slip mechanisms [which leads to microplasticity], introduced by the frequency
effects. However, the difference between the prediction and measured temperature is
very small for all practical purposes.
Figure 9.20 shows that the entropy generation due to microplasticity is small at
the beginning and starts to increase after a critical threshold. This is because the
evolution of activated micro-defects follows the TSI. As more micro-defects are
activated, more entropy is produced.
9.2 Predicting Ultrasonic Vibration Fatigue Life 493
Fig. 9.20 Total entropy generation (MJK-1 m-3) versus the number of cycles under 400 MPa
nominal normal stress, for different mechanisms. (After Lee et al. (2022))
ð9:53Þ
494 9 Fatigue of Materials
The computation of TSI requires a summation of the entropy generation for all
mechanisms. The material properties necessary for calculating TSI are presented in
Tables 9.5 and 9.6. The material properties can be regarded as constant between
25 and 150 °C based on test data for EN 1.0577 steel, which shows a small change in
material properties in this temperature window. The temperature in the specimen
during the ultrasonic vibration test does not exceed 130 °C (around 400 K) before
rapid temperature rise at phase 3 when a macro crack initiates. Therefore, material
constants are used without a temperature correction.
Using Eq. (9.53), we can compute the TSI evolution for EN 1.0577 steel
operating at 20 kHz. The entropy generation and TSI evolution for different stress
amplitudes are shown in Figs. 9.21 and 9.22, respectively. The cumulative specific
entropy at failure [fatigue fracture entropy] is validated to be a material constant
independent of stress amplitude, specimen’s dimensions and geometry, and loading
frequency. It is commonly used as a damage indicator to predict the number of cycles
to failure. Figure 9.21 shows that fatigue fracture entropy (FFE) is the same
(a material constant) regardless of the stress level during testing. This is similar to
test results reported widely by Naderi et al. (2010), Yun and Modarres (2019), and
Ribeiro et al. (2020). Figure 9.22 shows the TSI evolution between 0 and 1. The
evolution of TSI is based on the cumulative specific entropy of the system; therefore,
for the same critical TSI (i.e., ϕcr = 0.999), the cumulative specific entropy will be
the same regardless of the stress amplitude, frequency of loading, or geometry of the
sample (Fig. 9.21, Fig. 9.23).
The comparison of the unified mechanics theory-based model simulation S-N
curve and the experimental S-N curve is presented in Fig. 9.24.
From Fig. 9.21, it is observed that the cumulative entropy generation for 1.0577
steel under various stress amplitudes is the same (4.1 MJK-1 m-3) at failure. This
value is a material constant regardless of the test frequency, loading path, loading
type, geometry of the specimen, or stress state. However, the time it takes to reach
the fatigue fracture entropy will be different in each case.
9.2 Predicting Ultrasonic Vibration Fatigue Life 495
Fig. 9.21 Entropy generation versus cycles for 1.0577 steel under various stress amplitudes. (After
Lee et al. (2022))
Fig. 9.22 TSI evolution versus cycles for 1.0577 steel under various stress amplitudes. (After Lee
et al. (2022))
a self-heating term due to internal frictions is derived. However, the derived thermal-
mechanical equation cannot capture the abrupt temperature rise in phase 3 when
macro cracks initiate because the mechanisms of macroplasticity, crack formation,
and crack propagation are not included in the model.
Very high cycle fatigue life prediction results based on the unified mechanics
theory are compared with experimental test data. The comparison shows that the
prediction and test data match very well.
In low cycle fatigue, where the nominal stress is above the yield limit, the plastic
work is the dominant dissipation mechanism. Entropy generation in the plastic
dissipation process is given by
Z t2
1
Δs = σ : dεp ð9:54Þ
ρT t1
9.3 Predicting Low Cycle Fatigue Life 497
Fig. 9.23 Unified mechanics theory simulated S-N data points for 1.0577 steel under various stress
amplitudes. (After Lee et al. (2022))
where ρ is the mass density of the material and σ and ε p are the stress and plastic
strain, respectively. T represents the temperature. Integral limits t1 and t2 represent
the time bounds of the mechanical loading. For one-dimensional case, the total
plastic strain, ε p(t), is calculated as follows:
σ ðt Þ
εp ðt Þ = εtotal ðt Þ - ð9:55Þ
E
where εtotal(t) is the total strain at time t and σ(t) and E are to the stress at time t and
Young’s modulus, respectively.
The true stress-strain graph published by Carrion et al. (2017) for Ti-6Al-4V
alloy, is used for the comparison between experimental data and the numerical
predictions by Bin Jamal et al. (2020). Material constants used are given in
Table 9.8. Assuming that the gage section of a dog bone AST standard sample
experiences uniform strain, 5 mm section was considered in the computational
model to reduce the computational cost. The diameter of the specimen is 6.35 mm.
In ABAQUS, linear brick elements, C3D8R are used. One end of the sample is
defined with a zero displacement (fixed) boundary condition and the other end is
498 9 Fatigue of Materials
Fig. 9.24 UMT simulated S-N data points for 1.0577 steel compared with experimental data. The
blue line and squares are simulated results from UMT. (After Lee et al. (2022))
1.2x109
A
9
1.0x10
Test data
2.0x108
Numerical simulation
0.0
0.00 0.01 0.02 0.03
Strain
Fig. 9.25 Comparison between monotonic tensile stress-strain graphs obtained from the test data
(Carrion et al., 2017) and numerical simulation (Bin Jamal et al., 2020)
1.5x109
1.2x109 B
1.12x109
True Stress (Pa)
1.10x109
True Stress (Pa)
8 1.08x109
9.0x10 1.06x109
1.04x109
1.02x109
1.00x109
0.010
0.015
0.020
0.025
8
6.0x10
Strain
Enlarged view of B
3.0x108
Test data
Numerical (Seed size - 0.64 mm)
0.0 Numerical (Seed size - 0.85 mm)
Numerical (Seed size - 1.0 mm)
Fig. 9.26 Comparison between test data (Carrion et al., 2017) and numerical simulation (Bin Jamal
et al., 2020), for monotonic compressive stress-strain
500 9 Fatigue of Materials
1.5x109 1.5x109
Hysteresis loops at 1.2% strain amplitude Hysteresis loops at 1.2% strain amplitude
1.0x109 1.0x109
Stress (Pa)
Stress (Pa)
0.0 0.0
-5.0x108 -5.0x108
-1.5x109 -1.5x109
-0.015 -0.010 -0.005 0.000 0.005 0.010 0.015 -0.015 -0.010 -0.005 0.000 0.005 0.010 0.015
Strain Strain
a b
Fig. 9.27 Predictions for engineering stress-strain hysteresis loops for 1.2% strain amplitude. (a)
Hysteresis loops at 1.2% strain amplitude for 50 cycles; (b) comparative hysteresis plot for the first
cycle and 50th cycle (Bin Jamal et al., 2020)
1.0
Damage (TSI)
0.5
1.2% Strain
0.0 1% Strain
0.8% Strain
0
0
0
00
00
00
00
00
00
00
00
00
00
00
00
00
-50
50
10
15
20
25
30
35
40
45
50
55
60
65
70
Number of Cycles
Fig. 9.28 Predicted damage for different strain amplitudes during cyclic loading (Bin Jamal et al.,
2020)
Here we conclude our discussion on fatigue life prediction with the unified
mechanics theory. Further details may be found in the references cited below.
References 501
0.024
0.018
0.016
0.014
0.012
0.010
0.008
Fig. 9.29 Comparison between the test data (Carrion et al., 2017) and predictions (Bin Jamal et al.,
2020) for low cycle fatigue life (Nf) at different strain amplitudes
References
Abbaschian, R., & Reed-Hill, R. E. (2009). Physical metallurgy principles. Van Nostrand, ISBN
9780495438519.
Atkins, P., Paula, J., et al. (2018). Physical chemistry (11th ed.). Oxford University Press, ISBN
9780198769866.
Basaran, C., & Lin, M. (2007). Electromigration induced strain field simulations for nanoelectronics
lead-free solder joints. International Journal of Solids and Structures, 44(14–15), 4909–4924.
Basaran, C., & Lin, M. (2008). Damage mechanics of electromigration induced failure. Mechanics
of Materials, 40(1–2), 66–79.
Bin Jamal, M. N., Kumar, A., Lakshmana Rao, C., & Basaran, C. (2020). Low cycle fatigue life
prediction using unified mechanics theory in Ti-6Al-4V alloys. Entropy, 2020(22), 24.
Bin Jamal, M. N., Rao, C., & Basaran, C. (2021). A unified mechanics theory-based model for
temperature and strain rate dependent proportionality limit the stress of mild steel. Mechanics of
Materials, 155, 103762.
Blaschke, D. (2019). Properties of dislocation drag from phonon wind at ambient conditions.
Materials, 12(6), 948.
Blaschke, D., Mottola, E., & Preston, D. (2019). Dislocation drag from phonon wind in an isotropic
crystal at large velocities. Philosophical Magazine, 100(5), 571–600. https://doi.org/10.1080/
14786435.2019.1696484
Blaschke, D. N., Mottola, E., & Preston, D. L. (2020). Dislocation drag from phonon wind in an
isotropic crystal at large velocities. Philosophical Magazine, 100, 571–600.
Burton, J. J. (1972). Vacancy-formation entropy in cubic metals. Physical Review B, 5, 2948.
Charkaluk, E., & Constantinescu, A. (2009). Dissipative aspects in high cycle fatigue. Mechanics of
Materials, 41(5), 483–494, ISSN 0167-6636.
Chrysochoos, A., Boulanger, T., & Morabito, A. (2012). Dissipation and thermoelastic coupling
associated with fatigue of materials. Mechanics, Models, and Methods in Civil Engineering,
147–156. https://doi.org/10.1007/978-3-642-24638-8_7
Chu, Y., Gautreau, P., Ragab, T., & Basaran, C. (2015). Temperature dependence of Joule heating
in zigzag graphene nanoribbon. Carbon, 89, 169–175.
502 9 Fatigue of Materials
Cugy, P., & Galtier, A. (2002). Microplasticity and temperature increase in low carbon steels. In
A. F. Blom (Ed.), Proceedings of the 8th international fatigue congress, 3–7 June 2002
(pp. 549–556). EMAS, Barnsley.
Davoudi, K., & Vlassak, J. (2018). Dislocation evolution during plastic deformation:
Equations vs. discrete dislocation dynamics study. Journal of Applied Physics, 123(8),
085302. https://doi.org/10.1063/1.5013213
De Hosson, T. M., Roos, A., & Metselaar, E. D. (2001). Temperature rise due to fast-moving
dislocations. Philosophical Magazine A, 81(5), 1099–1120.
Dingley, D., & Hale, K. (1966). Burgers vectors of dislocations in deformed iron and iron alloys.
Proceedings of the Royal Society of London Series A. Mathematical and Physical Sciences,
295(1440), 55–71. https://doi.org/10.1098/rspa.1966.0225
Doudard, C., Calloch, S., Cugy, P., Galtier, A., & Hild, F. (2005). A probabilistic two-scale model
for high-cycle fatigue life predictions. Fatigue & Fracture of Engineering Materials & Struc-
tures, 28(3), 2005.
Fan, J., Zhao, Y., & Guo, X. (2018). A unifying energy approach for high cycle fatigue behavior
evaluation. Mechanics of Materials, 120, 15–25, ISSN 0167-6636.
Fultz, B. (2010). Vibrational thermodynamics of materials. Progress in Materials Science, 55(4),
247–352.
Galligan, J., McKrell, T., & Robson, M. (2000). Dislocation drag processes. Materials Science And
Engineering: A, 287(2), 259–264.
Gao, S. (1997). Quantum kinetic theory of vibrational heating and bond breaking by hot electrons.
Physical Review B, 55(3), 1876. https://doi.org/10.1103/PhysRevB.55.1876
Guo, Q., Guo, X., Fan, J., Syed, R., & Wu, C. (2015). An energy method for rapid evaluation of
high-cycle fatigue parameters based on intrinsic dissipation. International Journal of Fatigue,
80, 136–144. https://doi.org/10.1016/j.ijfatigue.2015.04.016
Ho, H. S., Risbet, M., & Feaugas, X. (2017). A cyclic slip irreversibility based model for fatigue
crack initiation of nickel base alloys. International Journal of Fatigue, 102, 1–8, ISSN
0142-1123.
Huang, M., Rivera-Díaz-Del-Castillo, P. E. J., Bouaziz, O., & Van Der Zwaag, S. (2008). Irrevers-
ible thermodynamics modeling of plastic deformation of metals. Materials Science and Tech-
nology, 24, 495–500. https://doi.org/10.1179/174328408X294125
Kelly, A., & Knowles, K. (2012). Crystallography and crystal defects. Wiley, ISBN
9780470750148.
Kröner, E. (1961). Zur plastischen verformung des vielkristalls. Acta Metallurgica, 9(2), 155–161.
Laughlin, D., & Hono, K. (2014). Physical metallurgy (5th ed.). Elsevier, ISBN 9780444537713.
Lee, H. W., & Basaran, C. (2021). A review of damage, void evolution, and fatigue life prediction
models. Metals, 11(4), 609.
Lee, H. W., & Basaran, C. (2022). Predicting high cycle fatigue life with unified mechanics theory.
Mechanics of Materials, 164, 104116.
Lee, H. W., Basaran, C., Egner, H., Lipski, A., Piotrowski, M., Mroziński, S., Noushad Bin Jamal,
M., & Rao, C. L. (2022). Modeling ultrasonic vibration fatigue with unified mechanics theory.
International Journal of Solids and Structures, 236–237, 111313.
Lemaitre, J., Sermage, J., & Desmorat, R. (1999). A two scale damage concept applied to fatigue.
International Journal of Fracture, 97, 67.
Li, S., Abdulhamid, M. F., & Basaran, C. (2009). Damage mechanics of low temperature
electromigration and thermomigration. IEEE Transactions on Advanced Packaging, 32(2),
478–485.
Marti, N., Favier, V., Gregori, F., & Saintier, N. (2020). Correlation of the low and high frequency
fatigue responses of pure polycrystalline copper with mechanisms of slip band formation.
Materials Science and Engineering: A, 772, 138619, ISSN 0921-5093.
Mughrabi, H. (2002). On ‘multi-stage’ fatigue life diagrams and the relevant life-controlling
mechanisms in ultrahigh-cycle fatigue. Fatigue & Fracture of Engineering Materials & Struc-
tures, 25(8–9), 755–764.
References 503
Mishin, Y., Sørensen, M. R., & Voter, A. F. (2001). Calculation of point-defect entropy in metals.
Philosophical Magazine A, 81(11), 2591–2612.
Mughrabi, H. (2009). Cyclic slip irreversibilities and the evolution of fatigue damage. Metallurgical
and Materials Transactions A: Physical Metallurgy and Materials Science, 40(6), 1257–1279.
Naderi, M., Amiri, M., & Khonsari, M. M. (2010). On the thermodynamic entropy of fatigue
fracture. Proceedings of the Royal Society A, 466, 423.
Parvin, H., & Kazeminezhad, M. (2016). Modeling the temperature rise effect through high-
pressure torsion. Materials Science and Technology (United Kingdom), 32, 1218–1222.
Patricio E. Carrion, Nima Shamsaei, Steven R. Daniewicz, Robert D. Moser. (2017). Fatigue
behavior of Ti-6Al-4V ELI including mean stress effects, International Journal of Fatigue, 99,
87–100.
Ragab, T., & Basaran, C. (2009). Joule heating in single-walled carbon nanotubes. Journal of
Applied Physics, 106, 063705.
Ribeiro, P., Petit, J., & Gallimard, L. (2020, July). Experimental determination of entropy and
exergy in low cycle fatigue. International Journal of Fatigue, 136, 105333.
Torabian, N., Favier, V., Dirrenberger, J., Adamski, F., Ziaei-Rad, S., & Ranc, N. (2017). Corre-
lation of the high and very high cycle fatigue response of ferrite based steels with strain rate-
temperature conditions. Acta Materialia, 134, 40–52, ISSN 1359-6454.
Wollenberger, H. J. (1996). Physical metallurgy (4th ed.). Elsevier, ISBN 9780444898753.
Yao, W., & Basaran, C. (2013a). Electromigration damage mechanics of lead-free solder joints
under pulsed DC: A computational model. Computational Materials Science., 71, 76–88.
Yao, W., & Basaran, C. (2013b). Computational damage mechanics of electromigration and
thermomigration. Journal of Applied Physics, 114, 103708.
Yao, W., & Basaran, C. (2013c). Damage mechanics of electromigration and thermomigration in
lead-free solder alloys under alternating current: An experimental study. International Journal
of Damage Mechanics, 23, 203–221.
Ye, H., Basaran, C., & Hopkins, D. C. (2004). Deformation of solder joint under current stressing
and numerical simulation––I. International Journal of Solids and Structures, 41(18–19),
4939–4958.
Ye, H., Basaran, C., & Hopkins, D. C. (2006). Experimental damage mechanics of micro/power
electronics solder joints under electric current stresses. International Journal of Damage
Mechanics, 15(1), 41–67.
Yun, H., & Modarres, M. (2019). Measures of entropy to characterize fatigue damage in metallic
materials. Entropy, 21(8), 804.
Zhang, L., Liu, X., Wu, S., Ma, Z., & Fang, H. (2013). Rapid determination of fatigue life based on
temperature evolution. International Journal of Fatigue, 54, 1–6. https://doi.org/10.1016/j.
ijfatigue.2013.04.002
Chapter 10
Corrosion-Fatigue Interaction
10.1 Corrosion
In this section, first, we summarize the basics of the metal corrosion process. The
information presented below is a summary of the corrosion chapter in Callister and
Rethwisch (2014).
Metals experience chemical interaction with their environment. As a result of this
interaction, they experience material loss by dissolution and by the formation of
nonmetallic scale or film.
Corrosion is an electrochemical process, where there is a transfer of electrons
from one material to another. Metals have a metallic bond where positive ions float
in a sea of free negatively charged valence electrons. During the corrosion process,
metals give up electrons in what is called an oxidation reaction. For example, an iron
atom (Fe) has two valence electrons in its electronic configuration
[1s22s22p63s23p63d64s2]. Fe can experience oxidation by giving up these two
valence electrons; as a result, it becomes (2+) a positively charged ion. This can
be represented in chemistry notation as follows:
yields
Fe → Fe2þ þ 2e - ð10:1Þ
where e- symbolizes an electron. The site where oxidation happens is called the
anode. The term anodic reaction is also used to refer to the oxidation process. The
electrons given up by a metal must be physically transferred to other chemical
species, in what is termed a reduction reaction. Given up electrons become part of
the other chemical species. The site where reduction reaction occurs is called the
cathode.
An electrochemical reaction contains one oxidation reaction (anodic site) and a
reduction reaction (cathodic site). The total rate of oxidation equals the total rate of
reduction, or, in other words, all electrons released by the oxidation site must be
absorbed by the reduction site. As an example, we can formulate rusting of iron, Fe,
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2022 505
C. Basaran, Introduction to Unified Mechanics Theory with Applications,
https://doi.org/10.1007/978-3-031-18621-9_10
506 10 Corrosion-Fatigue Interaction
in water with dissolved oxygen, which takes place in two separate steps. Electron
configurations of hydrogen and oxygen are given by [1s1], [1s22s22p4].
Step 1—Fe is oxidized Fe2+, as Fe(OH)2, in chemistry notation:
1 yields yields
Fe þ O2 þ H2 O → Fe2þ þ 2OH - → FeðOHÞ2 ð10:2Þ
2
1 yields
2FeðOHÞ2 þ O2 þ H2 O → 2FeðOHÞ3 ð10:3Þ
2
The compound Fe(OH)3 is called rust. Readers are referred to Callister and
Rethwisch (2014) for a more in-depth study of corrosion.
1 Xm
ΔSact = J ∙ Pi
i-1 i
ð10:4Þ
T
10.2 Thermodynamic Fundamental Equation of Corrosion-Fatigue 507
1
~ þ J c - αc A
~
ΔSact = J a αa A ð10:5Þ
T
~
α A
J a = j0 exp a -1 ð10:6aÞ
RT
~
- αc A
J c = - j0 exp -1 ð10:6bÞ
RT
where Ja and Jc are the fluxes (current densities) of anodic flux (direct) and cathodic
flux (reverse) half-reactions, respectively. Ja and Jc are defined by Butler-Volmer
equation, which formulates the relationship between the electrical current passing
through an electrode and the voltage difference between the electrode and the
electrolyte, j0 is the exchange current in a dilute solution in the equilibrium state
~ = 0 and Ja + Jb = 0), and αa and αc are the charge transfer coefficients of the
(A
~ is the electrochemical
electrochemical reactions, anodic and cathodic, respectively. A
Pn
~= -
affinity of the total reaction which is expressed as A vi μi, in which vi are the
i=1
stoichiometric coefficients, μi are the chemical potentials, and n is the total number
of components (initial substances and products). For an electrochemical reaction, the
electrochemical affinity A~ is usually determined by replacing μi with electrochemical
potential μ
~i :
μ
~i = μi þ zi Fη ð10:7Þ
where zi is the effective charge number of the ith chemical specie, F is Faraday’s
constant, η is the overpotential defined by η = (E - Ecorr) in which Ecorr is the
equilibrium electrode potential (open circuit potential, corrosion potential), and E is
the working electrode potential on the metal specimen.
For the simple electrochemical half-reaction
Fe $ Fe2þ þ 2e - ð10:8Þ
~ = zFη.
the electrochemical affinity is given by A
Butler-Volmer equation is given by
h i
α zFη α zFη
J = J a þ J c = j0 exp a - exp - c ð10:9Þ
RT RT
The exchange flux, j0, is a function of the concentrations of the reduced and
oxidized species:
j 0 = F ðk c C O Þα ðk a C R Þ1 - α ð10:10Þ
508 10 Corrosion-Fatigue Interaction
where ka and kc are the anodic and cathodic reaction rate constants, respectively. CO
and CR are the concentration of oxidized and reduced species, respectively. There-
fore, Eq. (10.5) can be rewritten as
1 ~ þ J c αc A
~
ΔSact = J αA ð10:11Þ
T a a
1 ~ M þ J M,c ð1 - αM ÞA ~ O þ J O,c ð1 - αO ÞA
~ M þ J O,a αO A
~ O ð10:12Þ
ΔSact = J M,a αM A
T
where JM, a and JM, c are the irreversible anodic and cathodic activation fluxes
(thermodynamic fluxes) for oxidation reaction and JO, a and JO, c are the anodic and
cathodic activation fluxes (thermodynamic fluxes) for reduction reaction, respec-
tively. αM and αO are the charge transport coefficients for oxidation reaction and
~ M and A
reduction reaction, respectively. A ~ O are the electrochemical affinity (ther-
modynamic force) for oxidation and reduction reactions induced by the electro-
chemical potential, respectively. Figure 10.1 explains the four different fluxes
formulated in Eq. (10.12).
The flux can be related to current density by using Faraday’s second law as
follows:
where IM, a is the anodic current density for the metal dissolution, zM is the number of
electrons involved in the metal dissolution reaction, JM, a is the irreversible anodic
activation flux, and F is Faraday’s constant.
Imanian and Modarres (2015) included three additional entropy production
mechanisms in addition to the activation overpotential. These three mechanisms are:
1. The entropy generation due to chemical reaction overpotential, ΔSreact.
2. The entropy generation due to diffusional overpotential, ΔSconc
3. The entropy generation due to ohmic loss, ΔSΩ
Entropy generations due to the diffusional overpotential and the ohmic loss are
negligible when the solution is well mixed (very low chemical concentration gradi-
ent) and when the electrolyte has a strong conductivity, respectively (Onsager,
1931). Therefore, these two mechanisms are trivial
II. Entropy Generation due to Chemical Reaction Overpotential, ΔSreact
Chemical reaction overpotential entropy generation is given by
10.2 Thermodynamic Fundamental Equation of Corrosion-Fatigue 509
Fig. 10.1 Schematic showing the anodic and the cathodic parts of the iron corrosion reaction in a
neutral electrolyte. (Adapted from Popov (2015) The red dashed line in the middle constructs the
Tafel plot)
1 Xr
ΔSreact = υA
j=1 j j
T
Usually, during a corrosion test conditions are imposed to limit entropy genera-
tion due to activation overpotential and chemical reaction overpotential only. There-
fore, the thermodynamic fundamental equation can then be simplified to include
only these two mechanisms:
1 ~ M þ J M,c ð1 - αM ÞA ~ O þ J O,c ð1 - αO ÞA
~ M þ J O,a αO A ~O
ΔScorr = J M,a αM A
T
1 X
r
þ v A
T j=1 j j
ð10:14Þ
where j0M and j0R are the exchange fluxes for the oxidation reaction and reduction
reaction, respectively. These exchange fluxes can also be expressed in the
following form: j0M = j0Fe , j0R = j0O2 . αM,a, αM,c, αO,a, αO,c are the charge transfer
coefficients. A ~ M and A
~ O are chemical affinities related to the overpotential, for
oxidation and reduction, respectively.
It is usually assumed that the redox reaction during the electrochemical corrosion
only involves the metal oxidation and the reduction of dissolved oxygen, and they
are both single-step reactions that follow the symmetry property. Therefore,
10.2 Thermodynamic Fundamental Equation of Corrosion-Fatigue 511
The effective charge coefficients αM, a and αO, C can be calculated based on the
slope of the Tafel plot:
2:303RT
βa = ð10:20aÞ
αa zF
2:303RT
βc = ð10:20bÞ
αc zF
where coefficients βa and βc are the slopes of the two branches of the Tafel plot (also
known as Tafel slope, or Tafel constant).
Exchange current density is a function of the concentrations of the reduced and
oxidized species. They are usually in the following range depending on the test
conditions (Popov, 2015; Chen et al., 2018):
Entropy is an additive property. Therefore, we can add the entropy generation due to
fatigue and entropy generation due to corrosion.
The entropy generation during ultrasonic vibration fatigue has six entropy gen-
eration mechanisms during very high cycle fatigue. However, only three are dom-
inant entropy generation mechanisms:
where ΔST is the entropy generation due to thermal conduction, ΔSr is the entropy
generation due to internal heat, and ΔSμp is the entropy generation due to
microplastic work. They are given by the following equations:
Z t
∇T ∙ ∇T
ΔST = - kh dt ð10:23Þ
t0 T2
512 10 Corrosion-Fatigue Interaction
Z t Z t drag pffiffiffi
ρr drag þ ρr dis ϱB v v þ 12 μb2 ϱ_ - αH μb ϱγ_
ΔSr = dt = dt ð10:24Þ
t0 T t0 T
Z t !
σ μ : ε_μP
ΔSμp = Ψf ϕf v dt ð10:25Þ
t0 T
Fig. 10.2 Stress vs. the number of cycles to failure test data for the uncorroded and saltwater-
corroded ASTM A656 structural steel samples subjected to 20 KHz vibrations and corrosion. (After
Lee et al. (2022))
10.4 Comparing Simulation Results and Test Data 513
1 1
Δs = ðΔScorr þ ΔSmec Þ = ΔSact þ ΔSreact þ ΔST þ ΔSr þ ΔSμp
ρ ρ
1 ~ M þ J M,c ð1 - αM ÞA
~ O þ J O,c 1 - αO ÞA
~ M þ J O,a αO A ~O
Δs = J M,a αM A
ρT
Z t Z t drag
1 Xr ∇T ∙ ∇T r þ r dis ð10:26Þ
þ υ A - k dt þ dt
ρT j = 1 j j
t0
h
ρT 2
t0 T
Z t !
μ
σ : εP _
μ
þ Ψf ϕf v dt
t0 ρT
Using the formulation presented above, the cumulative entropy generation calcu-
lated is shown in Fig. 10.3. Fatigue fracture entropy (FFE) is calculated and
measured to be 4.1 MJ K-1 m-3. The TSI evolution is presented in Fig. 10.4. TSI
value during mechanical loading starts from the endpoint of the TSI value due to
corrosion, instead of 0. Because entropy is an additive property, the contributions
from corrosion and fatigue are added.
Stress versus the number of cycles to failure curves for the uncorroded samples
and corroded samples are presented in Fig. 10.5. Corrosion degradation leads to a
decrease in fatigue life. At the stress amplitude of 400 MPa, the fatigue life decreases
from 2.79∙106 cycles to 1.58∙106 cycles due to the corrosion degradation, which is
around a 40% reduction in fatigue life.
It is important to point out that in Fig. 10.5, the test data fluctuates around the
unified mechanics theory solution, because Boltzmann’s formulation of the second
law is for all possible complexions. If we did an infinite number of experiments, the
data will always fluctuate around Boltzmann’s solution. A simple explanation can be
given by flipping a coin; we will get 50% head and 50% tail if we flip the coin an
infinite number of times.
Here we conclude our discussion of corrosion-fatigue interactions.
514 10 Corrosion-Fatigue Interaction
Fig. 10.3 Cumulative entropy production in the corroded samples subjected to ultrasonic vibration
3 because samples were first
MK
fatigue. The cumulative entropy production starts from 0.25 Km
corroded and then subjected to fatigue loading. (After Lee et al. (2022))
Fig. 10.4 TSI evolution of the corroded samples subjected to ultrasonic vibration fatigue. (After
Lee et al. (2022)). TSI due to corrosion is 0.2; hence, the curves start from 0.2
References 515
Fig. 10.5 Unified mechanics theory-based simulated stress vs. number of cycles to failure curves
for the corroded and uncorroded samples, compared with experimental data. (After Lee et al.
(2022))
References
Bockris, J., & Nagy, Z. (1973). Symmetry factor and transfer coefficient. A source of confusion in
electrode kinetics. Journal of Chemical Education, 50(12), 839. https://doi.org/10.1021/
ed050p839
Chen, L., Hu, J., Zhong, X., Zhang, Q., Zheng, Y., Zhang, Z., & Zeng, D. (2018). Corrosion
behaviors of Q345R steel at the initial stage in an oxygen-containing aqueous environment:
Experiment and modeling. Materials, 11(8), 1462. https://doi.org/10.3390/ma11081462
Gutman, E. (1998). Mechanochemistry of materials. Cambridge International Science Publishing.
Imanian, A., & Modarres, M. (2015). A thermodynamic entropy approach to reliability assessment
with applications to corrosion fatigue. Entropy, 17(12), 6995–7020. https://doi.org/10.3390/
e17106995
Lee, H. W., Basaran, C., Egner, H., Lipski, A., Piotrowski, M., Mroziński, S., Bin Jamal, N., & Rao,
L. (2022). Modeling ultrasonic vibration fatigue with unified mechanics theory. International
Journal of Solids and Structures, 236–237, 111313. https://doi.org/10.1016/j.ijsolstr.2021.
111313
Onsager, L. (1931). Reciprocal relations in irreversible processes. I. Physical Review, 37(4),
405–426. https://doi.org/10.1103/physrev.37.405
Popov, B. (2015). Electrochemical kinetics of corrosion. Corrosion Engineering, 93–142. https://
doi.org/10.1016/b978-0-444-62722-3.00003-3
William D. Callister, Jr. and David G. Rethwisch (2014), Materials Science and Engineering: An
Introduction, 9E, Wiley, 2014
Index
E
C Electromigration, 3, 427–456
Composite material, 3, 211, 263, 309, 352, 428 Electronics packaging, 428
Continuum mechanics, 1–3, 5, 13, 73–76, 81, Energy, 2, 3, 12, 86, 88, 91–94, 96–115,
89, 91, 93, 98–100, 112–114, 116, 118, 117–124, 126, 127, 133–138, 140–145,
119, 123, 125, 130, 133–145, 147–150, 148, 150, 153–156, 158–161, 165–167,
251, 258, 260, 263, 272–275, 295, 309, 170–177, 179–181, 183–188, 190–193,
460, 475 196–209, 211, 213, 214, 217, 218, 221,
Corrosion, 3, 130, 147, 505–507, 509–514 224–226, 228, 230, 238, 241, 243,
Corrosion fatigue, 3, 147, 506, 513 249–251, 254–257, 259–261, 263–266,
Couple stress, 20, 21, 96, 274–276, 278–290, 276–278, 280–282, 289, 298, 299, 367,
297–299 383–385, 391, 393–403, 412, 419, 431,
434–436, 451, 452, 459, 460, 463, 465,
470, 472, 475, 479, 481, 489, 495
D Entropy, 3, 12, 91, 98, 100–105, 107–119, 121,
Damage, 119, 142, 145–150, 242, 247, 263, 123, 125, 126, 128–130, 134–138,
346, 404, 405, 419, 421, 422, 427, 428, 141–157, 194–196, 199–203, 205, 207,
494, 500 208, 210–212, 214, 215, 217, 218,
Deformation gradient tensor, 113, 121, 254, 221–226, 229, 231, 246, 249–251, 254,
375–380, 415 255, 257–263, 266, 267, 272, 296,
© The Editor(s) (if applicable) and The Author(s), under exclusive license to 517
Springer Nature Switzerland AG 2022
C. Basaran, Introduction to Unified Mechanics Theory with Applications,
https://doi.org/10.1007/978-3-031-18621-9
518 Index
298–299, 354, 355, 358, 365, 367, 368, Modeling, 2, 3, 28, 57, 117, 124, 127, 142, 309,
383–385, 391, 403–405, 419, 421, 422, 346–348, 371, 375, 384, 392, 398, 399,
443, 449–454, 459–482, 484, 485, 488, 402, 427, 428
492–496, 506, 508–511, 513, 514
Eshelby, 309, 312, 314–317, 332
Euler equation, 106, 107 N
Eulerian descriptions, 38, 49, 55–57, 59 Newtonian mechanics, 81, 83, 86, 91, 96, 98,
113, 119, 123, 124, 126, 133, 136, 138,
139, 142, 148, 150, 217, 219–223, 226,
F 229, 237, 238, 253–255, 263, 265, 275,
Failure, 11, 99, 108, 115, 117, 133, 135, 142, 276, 278, 279, 288, 313, 366, 429,
146, 147, 149, 150, 210, 211, 215, 217, 443–444, 459
312, 404, 405, 411, 419, 421, 422,
427–430, 435, 451, 454, 456, 466, 470,
471, 476, 477, 479, 482, 484, 494, 512, O
513, 515 Onsager reciprocal relations, 128, 129, 506
Fatigue, 3, 99, 108, 117, 125, 130, 133, 137,
141, 145–148, 150, 215, 216, 263,
265–272, 346, 367, 421, 422, 459–501, P
511, 513, 514 Particle filled, 3
Fatigue corrosion interaction, vii, 3, 505–515 Particle filled composite, 3, 311, 338, 371
Field theories, 11 Piola–Kirchoff stress, 76–78, 389, 390, 393,
Finite deformations, 375–425 400
Finite strain, 3, 34, 57–73, 78, 136, 383–403, Plasticity, 31, 81, 124, 126–128, 136, 137, 139,
422 142, 215, 237, 238, 240, 241, 263, 272,
Flow theory, 237–241, 288, 294 273, 281, 288–295, 297, 309, 392, 395,
Fluid mechanics, 18, 49 403, 412, 468
Fracture, 133, 142, 145, 147, 150, 263, 367, Polymer, 3, 148, 210, 321, 362, 365, 375, 384,
403, 410, 422, 494, 513 392, 393, 395, 397–400, 402, 403, 405,
407, 422, 423
Poly-methyl-meth-acrylate (PMMA), 310, 311,
G 349, 354, 355, 362, 366, 397, 406,
Gibbs–Duhem relation, 104–107 409–411, 413–425
H R
Homogenizations, 309, 468, 470, 479 Rational mechanics, 428
Reliability, 150
L
Lagrangian description, 38, 48, 54, 57 S
Lagrangian mechanics, 86, 87, 91, 133 Solder alloys, 273, 298
Large strain, 34, 55, 60, 273, 366 Solder joints, 3, 267–272, 428, 430, 435,
Laws of motion, 2, 5, 11, 12, 50, 81, 91, 119, 455, 456
133, 142, 152, 154, 459 Solid mechanics, 18, 39, 73, 251
Life prediction, 146, 150, 459, 460, 478, 479, Strain, 1–3, 33–37, 40, 41, 43, 44, 46, 52–57,
484, 496, 500 60, 62–76, 96, 97, 108, 111, 113, 117,
119, 123, 125–127, 130, 136, 137, 139,
140, 143, 145, 148, 214, 216, 226, 237,
M 239–246, 253, 254, 256, 262–267,
Manufacturing, 346, 472 272–278, 280–282, 284, 285, 288–291,
Metal corrosion, 505, 506 295–299, 301, 303, 309, 312–317,
Microplasticity, 469–475, 477, 479, 492–495 322–324, 327, 329, 330, 333, 336, 337,
Index 519
352–355, 358, 364, 367–370, 385, 393, Thermodynamics, 2, 3, 12, 86, 91–93, 97–102,
395–399, 401, 403, 406, 409–411, 413, 104, 105, 107–116, 118–128, 130,
416–420, 439, 441–443, 445, 451–453, 133–138, 140, 142, 144, 145, 148–153,
459, 467–471, 475, 480, 485, 488, 489, 155–157, 198, 201, 210, 211, 213, 226,
497, 498, 500, 501, 512 228, 229, 242, 243, 249–251, 255,
Stress, 1, 2, 12–15, 18–32, 35, 37, 41, 73, 257–264, 266, 267, 272, 296–298, 354,
76–79, 83–85, 95, 96, 99, 108, 111, 117, 358, 368, 383–392, 403–405, 416, 419,
119, 121, 123, 126, 130, 137–142, 146, 422, 430, 439, 443, 449–451, 454,
150, 217, 238–245, 253, 257, 261, 263, 459–462, 464, 471, 474, 475, 477, 480,
266, 273–282, 284, 288–290, 295–299, 485–487, 489, 490, 493–496, 506, 508,
301, 309, 311–314, 316–323, 327–338, 512, 513
350, 352, 353, 358, 366, 378, 379, Thermomechanical analysis, 272–288
386–396, 398–403, 408, 411–413, 415, Thermomigration, 3, 117, 266, 427–456,
416, 418, 419, 428, 429, 432, 436–440, 461, 463
447, 452, 455, 459, 460, 464, 468–489,
492–497, 512, 513, 515
V
Very high cycle fatigue, 480–482, 484, 493,
T 495, 496, 511
Thermal mechanical analysis, 237–306 Viscoplasticity, 126, 241, 263, 295, 350–353
Thermodynamic fundamental equation, 3, 102,
107, 113, 134, 139, 147, 153, 210, 212,
214, 220, 224–226, 228, 229, 231, Y
249–250, 260, 263, 354, 365, 368, 384, Yield surface, 31, 124, 127, 136–138, 140,
405, 421, 428, 449–456, 459–461, 474, 238–240, 245, 246, 272, 289, 290, 295,
479, 481–488, 495, 506, 509, 510, 513 297, 298, 352, 353, 358, 470