Rosas-Ortiz: 1 Linear Spaces

z
Introduction to Classical Mechanics

Linear Spaces and Vectors
Summer Propaedeutic Course
rti
2021
Prof. Oscar Rosas-Ortiz
1 Linear Spaces
s-O
Definition 1. A linear or vector space V is a tetrad {V, F, +, “·00 }, with V a nonempty
set (the underlying set), F a scalar field, “+” a mapping V × V → V called addition and,
“·” a mapping F × V → V called multiplication by a scalar1 . The latter operations satisfy
the following conditions:
i) v1 + v2 = v2 + v1 for all v1 , v2 ∈ V .
ii) v1 + (v2 + v3 ) = (v1 + v2 ) + v3 for all v1 , v2 , v3 ∈ V .

osa
iii) There exists a unique element 0 ∈ V , called the neutral additive element, such that
0 + v = v for every v ∈ V .
iv) Associated with each v ∈ V is a unique element −v ∈ V , called the inverse additive,
such that v + (−v) = 0.
v) α(βv) = (αβ)v for all α, β ∈ F and all v ∈ V .

rR
vi) 1v = v for all v ∈ V .
vii) 0v = 0 for all 0 ∈ V .
viii) α(v1 + v2 ) = αv1 + αv2 for all α ∈ F and all v1 , v2 ∈ V .
ix) (α + β)v = αv + βv for all α, β ∈ F and all v ∈ V .
The elements of V are called vectors and the elements of the field F are called scalars.
ca
• If F = R we say that V is a vector space over the field of real numbers (or real vector
space for short).
• If F = C we say that V is a vector space over the field of complex numbers (or
complex vector space for short).
• If F = Rn we say that V is a real vector space of dimension n.
Os
• If F = Cn we say that V is a complex vector space of dimension n.

1
Formally we write α · v = v · α for all α ∈ F and all v ∈ V . However, for simplicity and to avoid
confusion, hereafter for the multiplication by a scalar we write α · v ≡ αv = vα.
z
NOTE: The elements of vector spaces are denoted in diverse forms, in agreement with the
context in which they are studied. They may be written as letters crowned by an arrow
~a, ~b, ~c, by boldface letters a, b, c, or by kets |ai, |bi, |ci, among other options. Notation
rti
is not relevant if there exists no ambiguity, so avoiding confusion between vectors and
scalars you are free to use any symbol to denote the elements of a linear space. For
present purposes, dealing with either F = Rn or F = Cn , we shall use ~a (and occasionally
a) to denote the n-tuple:
~a = (a1 , a2 , . . . , an ), ak ∈ F.
| {z }
s-O
n-positions
Example: Rn and Cn constructed as the Cartesian products of the scalar fields R and C
are by themselves vector spaces.
Homework. To show the above statement.
Example: Let V = Lp [0, T ], 1 ≤ p < ∞, the set of all real (or complex) valued functions
v defined on [0, T ] such that
osa
Z T
|v(t)|p dt < ∞, (1)
0
where the integral is the Lebesgue integral. We may define vector addition and scalar
multiplication as follows. For any v1 , v2 ∈ Lp [0, T ] we write
(v1 + v2 )(t) = v1 (t) + v2 (t), for all t ∈ [0, T ]. (2)
Also, for any α ∈ F and v ∈ Lp [0, T ], then
rR
(αv)(t) = α(v(t)), for all t ∈ [0, T ]. (3)
Homework. If v1 and v2 are two elements of Lp [0, T ] that differ only on a set of measure
zero, then they are still different points in the linear space. Show that Lp [0, T ] is a vector
space with the vector addition and scalar multiplication defined above.
Definition 2. Let V be a linear space and {vk }k∈I a set of elements of V. Let {αk }k∈I ,
αk ∈ F, the expression
ca
X
v= αv vk = α1 v1 + α2 v2 + · · · ,
k∈I
is called linear combination of the vectors v1 , v2 , . . . The scalars α1 , α2 , . . . are called the
coefficients of the linear combination.
Os
Definition 3. Let V be a linear space and {vk }nk=1 a set of elements of V. We say that
this set spans V if there exists a set of scalars {αk }nk=1 , αk ∈ F, such that
n
X
v= αv vk for all v ∈ V.
k=1
z
Definition 4. Let V be a linear space and W ⊆ V. If W satisfies the properties
i) Given v, w ∈ W then v + w ∈ W.
rti
ii) For any v ∈ W and α ∈ F then αv ∈ W.
iii) the element 0 ∈ W is also in W.
Then W is by itself a vector space, we say that it is a linear subspace of V.
Proposition 1. The set of all linear combinations of {vk }k∈calI , vk ∈ V, is a linear

subspace of V.
s-O
Proof: Homework.
Definition 5. The set of vectors {vk }k∈I is linearly independent if and only if
X
αk v k = 0
k∈I
implies αk = 0 for all k ∈ I. Otherwise the set is linearly dependent.

osa
Example. Let V = Rn and consider the vectors êk ∈ V defined as the n-tuples:
ê1 = (1, 0, 0, . . . , 0), ê2 = (0, 1, 0, . . . , 0), ên = (0, 0, 0, . . . , 1).
Then n
X
~v = αk êk = (α1 , α2 , . . . , αn ) = ~0 ≡ (0, 0, . . . , 0) ⇒ αk = 0 ∀k.
k=1
Definition 6. If the vectors vk ∈ V, k = 1, 2, . . . , n, are linearly independent and span V

rR
we say that the set {vk }nk=1 is a basis of V. We also say that the vectors vk form a basis
of V.
Definition 7. Let V be a vector space and {vk }nk=1 a basis of V. Then any v ∈ V can be
written as a linear combination v = α1 v1 + α2 v2 + · · · + αn vn . We say that (α1 , α2 , . . . , αn )
are the coordinates of v with respect to the basis vk and that αk is the kth-coordinate.
Pn
Example. ~v = αk êk = (α1 , α2 , . . . , αn ).
ca
k=1
Example. Find the coordinates of ~v = (0, 1) with respect to the vectors ~a = (1, 1) and
~b = (−1, 2). Solution: ~v = α~a + β~b implies α = β = 1/3.
Theorem 1. Let V be a vector space. Assume that the set {vk }m k=1 spans V. Let
Os
w1 , w2 , . . . , wn be elements of V and n > m. Then the vectors w1 , w2 , . . . , wn are linearly

dependent.
Proof: Let us look for a set of scalars αk , k = 1, 2, . . . , n, such that
α1 w1 + α2 w2 + · · · αn wn = 0.
z
As {vk }m
k=1 spans V, for every w` , ` = 1, 2, . . . , n, we have
w` = β`,1 v1 + β`,2 v2 + · · · + β`,m vm .
rti
Therefore
α1 w1 + α2 w2 + · · · αn wn = α1 (β1,1 v1 + β1,2 v2 + · · · + β1,m vm )
+α2 (β2,1 v1 + β2,2 v2 + · · · + β2,m vm )
···
s-O
αn (βn,1 v1 + βn,2 v2 + · · · + βn,m vm ).
So that we arrive at the system
α1 β1,1 + α2 β2,1 + · · · + αn βn,1 = 0
···
α1 β1,m + α2 β2,m + · · · + αn βn,m = 0
which admits no trivial solution since n > m. Hence, the vectors w1 , w2 , . . . , wn are
linearly dependent.
osa
Theorem 2. Let V be a vector space. Let {vk }nk=1 and {wk }m
k=1 be two different bases
of V. Then n = m.
Proof. Homework.
Definition 8. Let V be a vector space. Consider the set {vk }nk=1 of elements of V and an
nonnegative integer number r such that r ≤ n. We say that {vk }rk=1 is a maximal subset
rR
of linearly independent elements if (i) the vectors v1 , v2 , . . . , vr are linearly independent

and (ii) given any v` with ` > r, the elements v` , v1 , v2 , . . . , vr are linearly dependent.
Theorem 3. Assume that the vectors {vk }nk=1 span the linear space V and let {vk }rk=1 be
a maximal subset of linearly independent elements with r ≤ n. Then {vk }rk=1 is a basis
of V.
Proof. Homework.
ca
Definition 9. Let v1 , v2 , . . . , vn be linearly independent elements of a vector space V. We

say that they form a maximal set of linearly independent elements of V if, given w ∈ V,
the elements w, v1 , v2 , . . . , vn are linearly dependent.
Theorem 4. Let V be a linear space and {vk }nk=1 be a maximal set of linearly independent
elements of V. Then {vk }nk=1 is a basis of V.
Os
Proof. Homework.
Theorem 5. Let V be a linear space. Assume that a given maximal set of linearly
independent elements of V has n elements and another one has m elements. Then n = m.
z
Proof. Homework.
Definition 10. Let V be a vector space with a basis of n elements. We say that n is the
rti
dimension of V. If V includes only the neutral additive element then V has not a basis
and we say that its dimension is equal to zero.
Theorem 6. Let V be a linear space of dimension n. Let v1 , v2 , . . . , vn be linearly

independent elements of V. Then v1 , v2 , . . . , vn form a basis of V.
Proof. Homework.
s-O
Theorem 7. Let V be a linear space of dimension n and W be a linear subspace of V
the dimension dimension of which is equal to n. Then W = V.
Proof. Homework.
Definition 11. A set S of elements of a linear space V is said to be a Hamel basis of V

if (1) S is linearly independent and (2) V = span{S}.
osa
NOTE: The Hamel basis is the natural concept of basis for spaces that have linear struc-
ture only. The above definitions and properties of vector spaces, including the notion of
a “basis”, are indeed taking into account the concept of a Hamel basis.
Example. In R2 , a set S containing any two non-collinear vectors is a Hamel basis for
the plane.
NOTE: As we have seen in Theorem 5, all Hamel bases of a linear space V contain the
rR
same number of elements. This property allows to distinguish between finite and infinite-
dimensional linear spaces. For if you recall that two sets have the same cardinal number
provided they can be put into a one-to-one correspondence with one another, then the
following results are quite natural:
Theorem 8. If V1 and V2 are Hamel bases for a linear space V, then V1 and V2 have the
same cardinal number.
ca
Definition 12. The cardinal number of any Hamel basis of a linear space V is said to be
the dimension of V. We denote the dimension of V by Dim(V).
As you can see, the above results are a refinement of Theorem 5 and Definition 10,
respectively. We are now able to talk about infinite-dimensional spaces, according to the
cardinal number of any Hamel basis of the vector space we are dealing with.
Os
We can go a step further by recalling the notion of isomorphism:
Definition 13. The linear spaces V and W over the same scalar field F are said to be
isomorphic if there exists a one-to-one linear mapping φ of V onto W. The mapping φ is
z
then said to be an isomorphism of V onto W.
It is useful to realize that there is an important constraint for two spaces to be iso-
morphic:
rti
Theorem 9. If V and W are linear spaces over the same scalar field, then V and W are
isomorphic if and only if Dim(V) = Dim(W).
Proof. Homework.
An important result is obtained when F is either Rn or Cn . It deals with the identi-
s-O
fication of ~v ∈ V with the n-tuple (v1 , v2 , . . . , vn ), and is given by:
Corollary T9. If V is a finite-dimensional linear space over the scalar field F, where
Dim(V) = n, then V is isomorphic to F n , the linear space made uo of ordered n-tuples
of scalars.
In other words, all n-dimensional real vector spaces are isomorphic to Rn , and all
n-dimensional complex vector spaces are isomorphic to Cn .
osa
2 Normed and Metric Spaces
Definition 14. A real-valued function || · ||, defined on a vector space V, is said to be a
norm on V if, for any v, w ∈ V, and any α ∈ F, the following properties are true
i) ||v|| ≥ 0 (positivity)
rR
ii) ||v + w|| ≤ ||v|| + ||w|| (triangle inequality)
iii) ||αv|| = |α| ||v|| (homogeneity)
iv) ||v|| = 0 if and only if v = 0 (positive definiteness)
The number ||v|| is referred to as the norm of, or length of v ∈ V.
Example. The Euclidean (or canonical) length of any vector ~x = (x1 , x2 ) denoting a
ca
point in the real Euclidean plane R2 is given by

q
||~x|| = x21 + x22 . (4)
It may be shown that the above length satisfies the conditions of Definition 14, so it is a
Os
norm in R2 .
Definition 15. A normed linear space is a pair (V, || · ||), where V is a linear space and
|| · || is a norm defined on V. When no confusion is possible we shall write V for simplicity.
z
Definition 16. A metric space is a pair (X, d), where X is a set called the underlying set,
and d(x, y) is a real-valued function, called the metric, defined for x, y ∈ X and satisfying
the following axioms. For all x, y, z ∈ X:
rti
i) d(x, y) ≥ 0 and d(x, x) = 0 (positive)
ii) If d(x, y) = 0 then x = y (strictly positive)
iii) d(x, y) = d(y, x) (symmetry)
iv) d(x, y) ≤ d(x, z) + d(z, y) (triangle inequality)
s-O
Example. Function (4), defined in the previous example, generates a distance function
or metric, which makes the canonical length the archetypal example of a norm for a
linear space. To be precise, the Euclidean (or canonical) distance between two points
~x = (x1 , x2 ) and ~y = (y1 , y2 ) is given by
p
||~x − ~y || = (x1 − y1 )2 + (x2 − y2 )2 . (5)
Example. The Euclidean plane R2 equipped with the canonical distance (5) is a metric
space.
osa
Notice that any normed linear space (V, || · ||) can be always equipped with at least
one distance d(x, y) –the canonical one–. In this form, a normed linear space V is also a
metric linear space (under the canonical distance).
Comment. Remark that the notion of distance is very important in physics, par-
ticularly in Newtonian mechanics. This provides the mathematical structure of the
rR
theory with a “rule” to measure relations between the positions of material bodies
(and physical systems in general).
We have seen that such relations are fundamental to study motion in the Newtonian
picture, no matter the Newtonian space is absolute! On the other hand, the notion of
distance is a very geometric concept (recall Descartes) which can be associated with
the properties of space by itself, with no reference to any material body (Newton).
Concerning vectors, at the present stage you can realize that these are immediately
ca
useful to localize different points (positions) in space. In this form, the relationships
between the positions of the material bodies under study can be expressed in terms
of relationships between the vectors that localize such positions (position vectors).
In such a picture the mathematical properties of vectors refer to the geometric
properties of space, and they must be such that the (physical) laws of motion
Os
obeyed by material bodies are correctly represented. Otherwise, any mathematical

model constructed for describing the properties of space and motion would be not
useful.
3 Inner Product Spaces
z
Definition 17. Let V be a linear space. An inner product on V is a mapping that
rti
associates to each ordered pair of vectors v, w, an element of the field, denoted (x, y), that
satisfies the following properties. For any v, w, z ∈ V and any α ∈ F:
i) Additivity: (v + w, z) = (v, z) + (w, z)
ii) Homogeneity: (v, αw) = α(v, w)
iii) Symmetry: (v, w) = (w, v)
s-O
iv) Positive definiteness: (v, v) > 0 for v 6= 0.
Hereafter z means the complex conjugate of z ∈ Z (we also use z ∗ to denote complex
conjugation). Other notations for inner product are, for instance, ~v · w,
~ v · w, and hv|wi.
Comment. The notion of inner product is very important in the mathematical

structure of any physical theory. This permits to introduce a form of comparing (in
geometrical terms) physical properties that are represented by vectors.
osa
For instance, assume that ~v ∈ R3 represents either the velocity, the linear momen-
tum, or the angular momentum of a given system. One may wonder about “how
much” this vector resembles a concrete vector quantity ~u that is used as a standard.
The answer is easily obtained by calculating the inner product between ~v and ~u.
We may write ~v · ~u, which is in this case a real number (the field is R), and say that
this number is a measure of “how much” the vector ~v resembles the standard ~u.
If ~v ·~u = 0 then there is no relationship between ~v and ~u. In this case the description
rR
of ~v can be done without any reference to ~u and we say that ~v represents a property
that is completely independent of the standard ~u. Geometrically we say that ~v
cannot be projected onto ~u.
For ~v · ~u 6= 0 the vector ~v can be projected onto ~u (and viceversa). In other words,
the property represented by ~v is connected to the properties of ~u.
NOTE that (0, 0) = (0, w) = (w, 0) = 0 for all w ∈ V.

ca
Lemma D17. If (v, w) = 0 for all w ∈ V, then v = 0.
Example. In Cn we may introduce the rule (called canonical inner product on Cn ):

 
a1
Os
 a2 
~a · ~b = (a∗1 , a∗2 , · · · , a∗n )  ..  = |a1 |2 + |a2 |2 + · · · + |an |2 . (6)
 
 . 
an
Homework: Verify that the above rule satisfies the properties of Definition 17.
z
Example. Consider the vectors ê1 , ê2 , ê3 ∈ R3 . We have the inner product products
êj · êk = δjk , with δjk the Kronecker delta. That is ê1 cannot be projected onto neither ê2
nor ê3 (cyclically), which means that the properties of ê1 are completely independent of
rti
the properties of either ê2 or ê3 (cyclically).
Example. Consider the vectors êk ∈ Cn . For any ~v ∈ Cn we have êk · ~v = vk and
~v · êk = vk∗ (Homework: Verify such a rule). That is, if vk 6= 0 then the properties of
~v are connected with the properties of êk (equivalently, ~v can be projected onto êk and
viceversa). Otherwise, the description of ~v is independent of the properties of êk .
s-O
Definition 18. An inner product space is the pair (V, (·, ·)), where V is a linear space
and (·, ·) an inner product defined on V.
Example. The linear space Cn equipped with the canonical inner product (6) is an inner
product space.
NOTE that any inner product space can be equipped with a norm
osa p
||v|| = |(v, v)|, (7)
called the canonical norm, such that V is a normed space (and thus, it is also a metric
space).
Homework: Verify that the above rule satisfies the properties of Definition 14.
Example. Using the canonical inner product (6) we have the canonical norm on Cn :
rR
p
||~v || = |a1 |2 + |a2 |2 + · · · + |an |2 (8)
Example. The canonical norm of the vectors êk ∈ Cn is given by ||êk || = 1. That is, the
vectors êk ∈ Cn have norm equal to the unity and we say that they are unitary vectors.
Lemma D18. (Schwarz Inequality) Let (v, w) be an inner product on a linear space
V. Then
ca
|(v, w)| ≤ ||v|| ||w||. (9)
Proof. Homework.
Theorem 10. (Parallelogram Law) Let X be an inner product space, then for all
x and y in X we have
Os
||x + y||2 + ||x − y||2 = 2||x||2 + 2||y||2 .
Proof. Homework.
z
Definition 19. Two vectors v and w in an inner product space are said to be orthogonal
if (v, w) = 0.
rti
Theorem 10. (Pythagorean Theorem). If v and w are orthogonal in an inner
product space X, then
||v + w||2 = ||v||2 + ||w||2 .
Proof. Homework.
s-O
Definition 20. Let X be an inner product space. The set of elements {vk }k∈I in X is
said to be orthogonal if (vk , v` ) = 0 for k 6= `.
Definition 21. Let X be an inner product space. The set of elements {vk }k∈I in X is
said to be orthonormal if (vk , v` ) = δk,` .
Theorem 11. Let X be an inner product space and {vk }k∈I in X an orthonormal set of
elements in X is linearly independent.
Proof. Homework.
osa
Definition 22. An orthonormal set {vk }k∈I in an inner product space X is maximal if
there is no unit vector w0 in X such that {vk }k∈I ∪ {w0 } is an orthonormal set.
Lemma D22. An orthonormal set {vk }k∈I in an inner product space X is maximal if
and only if (w, vk ) = 0 for all k implies that w = 0.
rR
Definition 22. A maximal orthonormal set {vk }k∈I in an inner product space X is
referred to as an orthonormal basis for X.
Homework. Consider the linear space R3 . Verify the following properties:

1) ~a · ~b = ||~a|| ||~b|| cos θ
2) ~a × ~b = ||~a|| ||~b|| sin θ
3) ~a × ~b = −~b × ~a
ca
4) ~a × (~b + ~c) = ~a × ~b + ~a × ~c
5) ~a × (~b × ~c) = (~a · c)~b − (~a · b)~c
Hint: use the relation ~a ×~b = ~x, with ck = ai bj ijk , where ijk is the Levi-Civita tensor
Os
 
 +1(even) 
ijk = +1(odd) permutation of i, j, k,
0(no)
 
with ikl ikn = 2δnl , and ikl imn = δmk δnl − δnk δml .
z
Comment. We have two different notion of basis for linear spaces: Hamel basis
and orthonormal basis.
rti
The concept of a Hamel basis is purely algebraic and is particularly useful (and
intuitive) for linear combinations of finite (or denumerable) elements. Although this
may be applied to study infinite-dimensional linear spaces, in practical situations
it is preferred to look for other (more appropriate) kinds of bases.
The orthonormal basis is a very geometric concept that can be also expressed in
algebraic (and useful) terms. As we have seen, these bases permit to identify the
s-O
(possible) independence of a given vector from a (set of) standard(s). Thus, in
contraposition to the Hamel bases, the orthonormal bases represent a powerful
and versatile hand-tool to describe (not only) the motion (but the general physical
properties) of the systems studied in (physics, particularly in) Newtonian mechanics.
The notion of independence in the studying of a given property represented by a
vector v is therefore translated to the notion of orthogonality (a very geometric
one), and then to the notion of linear independence! Thus, a fundamental physical
property (the independence among different properties of a given physical system)
osa
is directly connected to a geometrical property of space (the linear independence of
a set of vectors that are used to represent physical states)!
The latter is a very impressive connection between physics and geometry (and
algebra) that was anticipated by Galileo, delineated by Descartes, formalized by
Newton and improved by Einstein.
rR
ca
Os
4 Description of mechanical particles
z
The position of a mechanical particle (particle for short) is given by a (position) vector
rti
in R3 :
X3
~r = êk xk = ê1 x1 + ê2 x2 + ê3 x3 (≡ êk xk . Einstein notation)
k=1
(10)
≡ r = (x1 , x2 , x3 ), xk ∈ R.
In general the coordinates xk are time-dependent xk = xk (t), with t a real parameter that
s-O
represents time. Position is measured in length units, we write [~r] = [L]. In the MKS
system we have [~r] = m.
The velocity is a vector that results from the time-derivative of position:
3 3
d X d X
~v = ~r = êk xk ≡ êk ẋk = ṙ ≡ ~ṙ. (11)
dt k=1
dt k=1
Velocity is measured in units of length over units of time, we write [~v ] = [L][T ]−1 . In the
MKS system we have [~v ] = ms−1 .
osa
The acceleration is a vector that results from the time-derivative of velocity:
d d2
~a = ~v = 2 ~r. (12)
dt dt
Acceleration is measured in units of length over units of squared time, we write [~a] =
[L][T ]−2 . In the MKS system we have [~a] = ms−2 .
rR
In particular, if the position does not depend on time ~r 6= ~r(t) we have no motion:
~r 6= ~r(t), ~v = ~0, ~a = ~0.
so that the particle is at rest.

On the other hand, let us assume that the coordinates are xk are elements of C 1 (R).
Then:
d
~r = ~r(t), ~v = ~r (with ~v 6= ~v (t)), ~a = ~0.
dt
ca
Thus, the particle moves with constant velocity (recall: the latter means fixed magni-
tude and fixed direction) and describes a straight-line trajectory.
Now let us assume that the coordinates are xk are elements of C 2 (R). Then:
d d2
Os
~r = ~r(t), ~v = ~r (with ~v = ~v (t)), ~a = ~r (with ~a 6= ~a(t)).

dt dt2
In this case the particle moves with constant acceleration (recall: the latter means
fixed magnitude and fixed direction).
z
For constant acceleration the coordinates ak do not depend on the parameter t. After
a time-integration one gets
Z t Z t
rti
d
~adt = ~v dt = ~v |tt0 = ~v (t) − ~v (t0 )
t0 t0 dt (13)
= ~a|tt0 = ~at − ~at0 .
That is,
~v (t) − ~v0 = ~a(t − t0 ), (14)
s-O
where ~v0 = ~v (t0 ). In other words, for constant acceleration the variation of velocity is
lineal in time and along the direction of the acceleration. A second time-integration yields
~r(t) − ~r0 = 21 ~a(t2 − t20 ) + (~vo − ~at0 )(t − t0 ), (15)
with ~r0 = ~r(t0 ). That is, for constant acceleration the variation of position is quadratic
in time.
Summary 1: (1) To describe accelerated particles the position-vector ~r must be at least

C 2 (R). If ~r is only C 2 (R) then this can be associated at most with uniformly accelerated
osa
motion. (2) Motion in a straight-line requires a position-vector ~r ∈ C 1 (R) (3) The resting
state requires a position-vector that is independent of the parameter t.
Homework. Use the above results to describe motion in one-dimension (for instance, free
falling bodies) and parabolic shot (two and three dimensions). Recover the expressions
for average velocity and average acceleration. Discuss on the meaning of speed and
its connection with velocity. Discuss the difference between displacement and distance.
Is it possible to get average velocity equal to zero but average speed different from zero?
rR
Justify your answer.
The mass is a scalar quantity that measures the amount of matter, it refers to the prop-
erty of the particle which resists acceleration (inertial mass) as well as to the property
of the particle which determines how strongly it will be pulled by a gravitational field
(gravitational mass). The units of mass are denoted by [M ], in the MKS system we
have [M ] = k. In general, the mass is parameterized by t and may be function of any of
the physical quantities that characterize the dynamical state of the particle.
ca
The linear momentum is a vector that results from the multiplication of velocity
with mass
d
p~ = m~v = m ~r. (16)
dt
Linear momentum is measured in units of velocity times units of mass, we write [~p] =
Os
[L][M ][t]−1 . In the MKS system we have [~p] = kms−1 .

For instance, considering a time-dependent mass, the time-variations of the linear
momentum are given by
d
p~ = ṁ~v + m~a. (17)
dt
z
What about a position-dependent mass?
The angular momentum is a vector that results from the vector multiplication of
position with linear momentum
rti
~ = ~r × p~ = m~r × ~v = m (~r × ~v ) ,
L L` = kj` rk pj , k, j, ` ∈ {1, 2, 3}. (18)
Angular momentum is measured in units of position times units of linear momentum
~ = [M ][L]2 [T ]−1 . In the MKS
(usually referred to as units of action), we write [L]
−1
system we have [~p] = kms .
s-O
Homework. Show that the time-variation of the angular momentum is given by the
expression
d~
L = ṁ~r × ~v + m~r × ~a. (19)
dt
Forces are vectors parameterized by t that may be functions of any of the physical
quantities that characterize the dynamical state of the particle. They are measured in
units of mass times units of acceleration, we write [F~ ] = [M ][L][T ]−2 . In the MKS system
we have [F~ ] = N .
osa
The torque is a vector defined by a force F~ that is applied on a given point of
material bodies. The point of application is localized by a position-vector (lever arm) ~b.
Mathematically, the torque is expressed as the vector multiplication
~τ = ~b × F~ , τ` = kj` bk Fj , k, j, ` ∈ {1, 2, 3}. (20)
Torque is measured in units of length times units of force, we write [~τ ] = [M ][L]2 [T ]−2 .
In the MKS system we have [~τ ] = N m.
rR
Dealing with rigid bodies the lever arm ~b is usually taken to be a constant vector. How-
ever, in a more general situation it is nothing but a position-vector, so it is parameterized
by t. Then, in general Eq. (20) reads ~τ = ~r × F~ .
Considering the mass m as the proportionality factor between acceleration and applied
forces, the variation of angular momentum (19) may be rewritten as follows

d~ d ~ + ~τ .
L = ṁ~r × ~v + ~τ ≡ (ln m) L (21)
dt dt
ca
Clearly, for constant mass the time-variation of the angular momentum is equal to the
torque.
In turn, (considering the mass m as the proportionality factor between acceleration
and applied forces) the time-variation of velocity for time-dependent mass (17) can be
rewritten as
Os
d
p~ = ṁ~v + F~ . (22)
dt
From the above result, if m = const then
d
p~ = F~ (valid for m= const only) (23)
dt
z
What about a position-dependent mass?
REMARK. The description of the position-vector and all its derivate quantities is not
rti
restricted to conventional cartesian basis of R3 defined by the unitary vectors êk , with
~r = (x, y, z). For if we take another basis of R3 , namely êξk with ~r = (ξ1 , ξ2 , ξ3 ), we know
that all the 3-dimensional linear spaces are isomorphic to R3 , so that there is an isomor-
phism permitting the transformation of the coordinates (ξ1 , ξ2 , ξ3 ) into the cartesian ones
(x, y, z), and vice versa. As an example consider the relationship between the cartesian
coordinates (x, y, z) and the spherical ones (r, θ, φ) given by
s-O
x = r sin θ cos φ, y = r sin θ sin φ, z = r cos θ, (24)
where
r ≥ 0, 0 ≤ φ < 2π, 0 ≤ θ ≤ π. (25)
Formally we write
x = x(r, θ, φ), y = y(r, θ, φ), z = z(r, θ, φ), (26)
so that the position-vector
osa
~r = ~r(x, y, z) = xê1 + yê2 + zê3 (27)
can be rewritten as
~r = ~r(r, θ, φ) = r sin θ cos φ ê1 + r sin θ sin φ ê2 + r cos θê3 . (28)
Thus, the position-vector ~r, originally parameterized in cartesian coordinates, is now
parametrized by the real variables r, θ, and φ. The path described by ~r as a result of the
rR
time-variations of x, y and z, is now parameterized by the time-variations of the spherical

variables r, θ, φ.
To get an appropriate set of unitary vectors in the new representation we first introduce
the scale factors
∂
hα = || ∂α ~r||, α = r, θ, φ. (29)
From (28) we immediately get
hr = 1, hθ = r, hφ = r sin θ. (30)
ca
It is straightforward to verify that the unitary vectors

1 ∂
êα = ~r, α = r, θ, φ, (31)
hα ∂α
are tangent to the path defined by ~r in the direction α. Explicitly,
Os
êr = sin θ cos φ ê1 + sin θ sin φ ê2 + cos θ ê3 ,
êθ = cos θ cos φ ê1 + cos θ sin φ ê2 − sin θ ê3 , (32)
êφ = − sin φ ê1 + cos φ ê2 .

z
Notice that the position-vector ~r = rêr coincides with (28) provided that (32) is true.
Homework. (a) Show that {êr , êθ , êφ } is an orthonormal set in R3 . (b) Solve the system
rti
(32) for the cartesian unitary vectors and express the vector ~r in the basis {êr , êθ , êφ }. (c)
Determine the scale factors and unitary vectors for cylindrical coordinates. (d) Express
the position vector ~r in the basis of cylindrical unitary vectors.
Homework. Verify the following relationships:

d
s-O
êr = θ̇êθ + φ̇ sin θêφ . (33)
dt
d
êθ = −θ̇êr + φ̇ cos θêφ . (34)
dt
d
êφ = −φ̇(sin θêr + cos θêθ ). (35)
dt
osa
Notice that, in contraposition with the cartesian vector basis ê1 , ê2 , ê3 , the unitary
vectors êr , êθ and êφ , have no definite orientation. Indeed, êr , êθ and êφ change
their orientation as θ and φ change, but they are independent of the variable r.
Remark that the magnitude of the spherical basis vectors is constant and equal to
1, no matter the values of θ, and φ.
On the other hand, it is a matter of substitution to verify the following properties:

rR
a) êr × êθ = êφ , êθ × êφ = êr , êφ × êr = êθ .

∂ ∂
b) ê
∂θ r
= êθ , ê
∂φ r
= sin θêφ .
∂ ∂
c) ê
∂θ θ
= −êr , ê
∂φ θ
= cos θêφ .
∂ ∂
d) ê
∂θ φ
= 0, ê
∂φ φ
= − sin θêr − cos θêθ .
Homework: Verify the above properties.

The differential element (differential displacement) d~r in spherical coordinates is there-
ca
fore given by
∂~
r ∂~
r ∂~
r
d~r = ∂r dr + ∂θ dθ + ∂φ dφ
=α
~ 1 dr + α
~ 2 dθ + α
~ 3 dφ
(36)
= (hr êr )dr + (hθ êθ )dθ + (hφ êφ )dφ
Os
= drêr + rdθêθ + r sin θdφêφ .

Consequently, the velocity is expressed as
~v = ~ṙ = ṙêr + r(θ̇êθ + φ̇ sin θêφ ). (37)
z
Homework: Consider a particle localized by the vector ~r = rêr . Show that the velocity
associated with this position-vector is given by (37). Calculate the expression for the
acceleration.
rti
At the present stage we may introduce a new kind of velocity and acceleration. For if
we write
~vr = ṙêr , ~vθ = rθ̇êθ , ~vφ = r sin θφ̇êφ , (38)
from (37) we write velocity as a superposition of three orthogonal vectors ~v = ~vr +~vθ +~vφ ,
with ~vr , ~vθ and ~vφ the velocities along the direction of the unitary vectors êr , êθ and
s-O
êφ , respectively. Notice that these velocities change their orientation according to the
changes of θ and φ, and change their magnitude as r, θ, and φ change. We say that ~vr ,
~vθ , ~vφ , are the radial and angular components of the velocity ~v in spherical coordinates,
respectively.
Notice that θ̇ and φ̇ have units of radians times frequency, we shall write θ̇ = ωθ ,
φ̇ = ωφ , and say that they are respectively the angular speed along θ and φ (if no
confusion is possible the sub-label is unnecessary). Remark: In conventional textbooks
both, θ̇ and φ̇, are referred to as angular velocity, however it is a misconception that
produces confusion since these quantities are indeed scalar. Throughout the present notes
osa
we shall refer to these quantities as angular speed to emphasize that they are just numbers
equipped with units but lacking direction.
Homework: Identify the radial and angular components of the acceleration ~a in spherical
coordinates.
Example. For r = r0 = const the position-vector (28) localizes the points on the 2-sphere
S 2 ⊂ R3 of radius r (without loss of generality we can assume that the sphere is centered
rR
at the origin of R3 ):
~r = r0 sin θ cos φ ê1 + r0 sin θ sin φ ê2 + r0 cos θê3 ≡ r0 êr .
The velocity (37) has not radial component (cause such a component is equal to the
neutral additive element of the vector space):
~vr = ~0, ~vθ = r0 θ̇êθ , ~vφ = r0 φ̇ sin θêφ . (39)
That is, the motion of the particle is described on the 2-sphere in such a form that the
ca
velocity is always tangent to the sphere! Notice however that the acceleration has both,
components that are tangent to the sphere as well as a component which is normal to the
surface:
~ar = −r0 θ̇2 + φ̇2 sin2 θ êr ,

~aθ = r0 θ̈ − φ̇2 sin θ cos θ êθ , (40)
Os

~aφ = r0 2θ̇φ̇ cos θ + φ̈ sin θ + φ̇2 cos θ sin θ êφ .
Moreover, the component normal to the surface of the 2-sphere is directed along the radius
and towards the center of S 2 .
z
Example. Considering the above example, assume now that θ = θ0 = const. As θ is
the azimuthal angle, the position-vector (28) localizes the points on a circle of radius
R = r0 sin θ0 , centered at the z-axis, and parallel to the xy-plane. The velocity has only
rti
the angular component along the direction defined by the unitary vector êφ :
~vr = ~0, ~vθ = ~0, ~vφ = Rωφ êφ . (41)
On the other hand, for the acceleration one has

R2 ωφ2
~aθ = −Rωφ2 cos θêθ , ~aφ = Rαφ + Rωφ2 cos θ êφ .

~ar = −
s-O
êr , (42)
r0
In this case the angular speed ωφ can be understood as arising from the arc length s =
Rφ subtended as “tangential distance” by the particle in a given interval of time while
describing the circle-path of radius R. This leads by necessity to the notion of the angular
speed ωφ = φ/t. Because φ = s/R, we can substitute this value in the equation for ωφ and
obtain ωφ = R1 st . However, st is simply the speed with which a particle in linear motion is
lifted within the time-interval defined by t. Thus, writing vT = st to denote “tangential
speed”, we have
osa
~vφ = vT êφ , vT := st = Rωφ , (43)
where “tangential” refers to the motion of the particle on a circle. In similar form we
have
vT2 vT2 vT2

~ar = − êr , ~aθ = − cot θêθ , ~aφ = aT − cot θ êφ , aT = Rαφ . (44)
r0 r0 r0
Remark Without loss of generality we can take θ = π/2, so that R = r0 and the above
rR
results are reduced to the familiar form

vT2
~ar = − êr , ~aθ = ~0, ~aφ = aT êφ .
R
Homework: Show that

d
~ × êα ,
êα = ω α = r, θ, φ. (45)
dt
ca
Then,
d d
~r = ṙêr + r êr = ṙêr + r(~ω × êr ) = ṙêr + ω
~ × ~r. (46)
dt dt
Thus, we can identify three different terms in the transition from a fixed (laboratory)
frame to a rotational frame:
Os
d
~r = velocity with respect to laboratory,
dt
ṙêr = velocity with respect to the rotational frame, (47)
~ × ~r = the price we have to pay for the transition!

ω
z
Homework: Show that
d2 ~ × ~r.
~r = r̈êr + 2ṙ(~ω × êr ) + ω
~ × (~ω × r) + ω̇ (48)
dt2
rti
The above result is known as the Theorem of Coriolis, with
d2
~r = acceleration with respect to laboratory,
dt2 (49)
r̈êr = acceleration with respect to the rotational frame.
s-O
The remaining terms correspond to the price we have to pay for the transition and include
the Coriolis acceleration
2ṙ(~ω × êr ), (50)
which is present whenever there is motion in the rotating frame (ṙ 6= 0), the centripetal
acceleration
~ × (~ω × r),
ω (51)
which is addressed towards a rotation axis, and a remanent term
~ × ~r
osa ω̇ (52)
which cancels for ω 6= ω(t). Therefore, in the rotating frame,
~aRot = ~a − ω ~ × ~r.
~ × (~ω × r) − 2~ω × ~vRot − ω̇ (53)
Then, assuming that the mass is the factor of proportionality between the applied force
F~ and the acceleration ~a, one has
m~aRot = F~ − m~ω × (~ω × r) − 2m~ω × ~vRot − mω̇
~ × ~r. (54)
rR
The second and third elements at the right hand side of the previous equation correspond
to the centrifugal force and the Coriolis force respectively.
Example. For a body on the earth’s surface, assuming ω 6= ω(t),the acceleration ~a = ~g0
leads to
~ × (~ω × r) −2~ω × ~vRot .
~aRot = ~g0 − ω (55)
| {z }
~ge =effective acceleration
The gravitational force that we measure on a body of mass m at rest (spherical pendulum)
ca
is indeed m~ge . The term −~ω × (~ω × r) radically points out of the axis of rotation of the
earth. So that, in the northern hemisphere, ~ge points towards the south.Then,
m~aRot = m~ge − 2m~ω × ~vRot . (56)
| {z }
Coriolis force
Os
• Now let us comeback to Eq. (36). Consider the inner product

3
X
ds2 = d~r · d~r = gk,` duk du` , (57)
k,`=1
z
∂~
r ∂~
r
~k · α
with gk,` = α ~` = ∂uk
· ∂u`
the components of the metric tensor. That is
gk,` = (huk êuk ) · (hu` êu` ) = huk hu` (êuk · êu` ) . (58)
rti
For instance, in spherical coordinates
g11 = h2r = 1, g22 = h2θ = r2 , g33 = h2φ = r2 sin2 θ, gk,` = 0, k 6= `. (59)
The spherical coordinates are orthogonal curvilinear coordinates
s-O
 
1 0 0
[gk,` ] =  0 r2 0 .
2 2
0 0 r sin θ
As Det [gk,` ] = r4 sin2 θ we have

 
k,` 1 0 0
1
g =  0 r2
0 .
1
0 0
osa r2 sin2 θ
Therefore ds2 = dr2 + r2 dθ2 + r2 sin2 θdφ2 . Besides, ds1 = dr, ds2 = rdθ, ds3 = r sin θdφ.
rR
ca
Os

Rosas-Ortiz: 1 Linear Spaces

Uploaded by

Copyright:

Available Formats

You might also like

Rosas-Ortiz: 1 Linear Spaces

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Rosas-Ortiz: 1 Linear Spaces

Uploaded by

Copyright:

Available Formats

z

Introduction to Classical Mechanics

Prof. Oscar Rosas-Ortiz

ii) v1 + (v2 + v3 ) = (v1 + v2 ) + v3 for all v1 , v2 , v3 ∈ V .

v) α(βv) = (αβ)v for all α, β ∈ F and all v ∈ V .

vi) 1v = v for all v ∈ V .

vii) 0v = 0 for all 0 ∈ V .

viii) α(v1 + v2 ) = αv1 + αv2 for all α ∈ F and all v1 , v2 ∈ V .

ix) (α + β)v = αv + βv for all α, β ∈ F and all v ∈ V .

• If F = Cn we say that V is a complex vector space of dimension n.

Homework. To show the above statement.

(αv)(t) = α(v(t)), for all t ∈ [0, T ]. (3)

Proposition 1. The set of all linear combinations of {vk }k∈calI , vk ∈ V, is a linear

implies αk = 0 for all k ∈ I. Otherwise the set is linearly dependent.

Definition 6. If the vectors vk ∈ V, k = 1, 2, . . . , n, are linearly independent and span V

w1 , w2 , . . . , wn be elements of V and n > m. Then the vectors w1 , w2 , . . . , wn are linearly

w` = β`,1 v1 + β`,2 v2 + · · · + β`,m vm .

of linearly independent elements if (i) the vectors v1 , v2 , . . . , vr are linearly independent

Definition 9. Let v1 , v2 , . . . , vn be linearly independent elements of a vector space V. We

Theorem 6. Let V be a linear space of dimension n. Let v1 , v2 , . . . , vn be linearly

Definition 11. A set S of elements of a linear space V is said to be a Hamel basis of V

We can go a step further by recalling the notion of isomorphism:

ii) ||v + w|| ≤ ||v|| + ||w|| (triangle inequality)

iii) ||αv|| = |α| ||v|| (homogeneity)

iv) ||v|| = 0 if and only if v = 0 (positive definiteness)

The number ||v|| is referred to as the norm of, or length of v ∈ V.

point in the real Euclidean plane R2 is given by

obeyed by material bodies are correctly represented. Otherwise, any mathematical

Comment. The notion of inner product is very important in the mathematical

NOTE that (0, 0) = (0, w) = (w, 0) = 0 for all w ∈ V.

Lemma D17. If (v, w) = 0 for all w ∈ V, then v = 0.

Example. In Cn we may introduce the rule (called canonical inner product on Cn ):

|(v, w)| ≤ ||v|| ||w||. (9)

||x + y||2 + ||x − y||2 = 2||x||2 + 2||y||2 .

Homework. Consider the linear space R3 . Verify the following properties:

~r 6= ~r(t), ~v = ~0, ~a = ~0.

so that the particle is at rest.

~r = ~r(t), ~v = ~r (with ~v = ~v (t)), ~a = ~r (with ~a 6= ~a(t)).

Summary 1: (1) To describe accelerated particles the position-vector ~r must be at least

Justify your answer.

[L][M ][t]−1 . In the MKS system we have [~p] = kms−1 .

time-variations of x, y and z, is now parameterized by the time-variations of the spherical

It is straightforward to verify that the unitary vectors

êr = sin θ cos φ ê1 + sin θ sin φ ê2 + cos θ ê3 ,

êφ = − sin φ ê1 + cos φ ê2 .

Homework. Verify the following relationships:

On the other hand, it is a matter of substitution to verify the following properties:

a) êr × êθ = êφ , êθ × êφ = êr , êφ × êr = êθ .

Homework: Verify the above properties.

= drêr + rdθêθ + r sin θdφêφ .

~vr = ~0, ~vθ = ~0, ~vφ = Rωφ êφ . (41)

On the other hand, for the acceleration one has

results are reduced to the familiar form

Homework: Show that

~ × ~r = the price we have to pay for the transition!

• Now let us comeback to Eq. (36). Consider the inner product

g11 = h2r = 1, g22 = h2θ = r2 , g33 = h2φ = r2 sin2 θ, gk,` = 0, k 6= `. (59)