Note20feb Merged
Note20feb Merged
Contents
1 Index notation 4
1.1 Summation convention . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2 Kronecker delta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3 Dot product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.4 Permutation symbol . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.5 Cross product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.6 Differentiation notation . . . . . . . . . . . . . . . . . . . . . . . . . 9
2 Algebraic structures 10
2.1 Group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.2 Field . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1
CONTENTS
7 Tensor product 48
7.1 Matrix representation of tensor products . . . . . . . . . . . . . . . 50
7.2 Tensor spaces: a general multilinear function . . . . . . . . . . . . 50
7.3 Tensor multiplication . . . . . . . . . . . . . . . . . . . . . . . . . . 51
7.4 Contraction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
7.4.1 Double contraction as a bilinear functional . . . . . . . . . . 55
7.4.2 Physical components of tensors . . . . . . . . . . . . . . . . 56
7.5 Transposition of a tensor . . . . . . . . . . . . . . . . . . . . . . . . 56
7.5.1 Symmetry and skew-symmetry of second order tensor . . . . 57
7.6 Invariants in 3D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
7.6.1 Cofactor tensor . . . . . . . . . . . . . . . . . . . . . . . . . 58
8 Orthogonal tensors 58
9 Skew-symmetric tensors 60
12 Tensor calculus 72
12.1 Normed space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
12.2 Affine space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
12.3 Differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
12.4 Partial derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
12.5 Product rule for bilinear mapping . . . . . . . . . . . . . . . . . . . 77
12.6 Product rule for any mapping . . . . . . . . . . . . . . . . . . . . . 79
12.7 Chain rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
12.8 Affine space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
12.9 Gradient and divergence . . . . . . . . . . . . . . . . . . . . . . . . 81
12.9.1 Gradient . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
12.9.2 Divergence . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
12.9.3 Curl . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
12.10Divergence theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
or
or
In the index notation, they are represented by: aij , aij and aij respectively, where
each of the indices (i, j) takes values 1 and 2.
1. The convention does not apply to numerical indices. For instant a2 x2 stands
for single term.
2. The repeated index may be replaced by any other index. For instance ai xi =
at xt . For this reason, the repeated index is called a dummy index.
3. If an index is not dummy, it is called free index. Thus in the expression aik xi ,
k is the free index.
Some examples:
1. If n = 3, ak xk = a1 x1 + a2 x2 + a3 x3 .
∂ϕ ∂ϕ ∂ϕ
2. If ϕ is a function of x1 , x2 , x3 , ..., xn , then dϕ = ∂x1
dx1 + ∂x2
dx2 +...+ ∂xn
dxn =
∂ϕ
∂xi
dxi
3. Let yi = αit xt and zi = βit yt . We can rewrite yt = αtk xk and now we can
substitute back as zi = βit (αtk xk ) = [βit αtk ]xk = γik xk . Here γik = βit αtk ,
where t is the dummy index and i, k are the free index.
4. Akmn Bkn
l
= (A1mn B1n
l
) + (A2mn B2n
l
) + ... + (AN l 1 l 1 l
mn BN n )=(Am1 B11 + Am2 B12 +
..) + (A2m1 B21
l
+ A2m2 B22
l
+ ...) + ...
5. Consider the summation (a1j xj )2 + (a2j xj )2 + ... + (anj xj )2 . This expression
is written as (ais xs )(ait xt ) or ais ait xs xt .
Note that,
ei · (ej × ek ) = ϵijk
Basically we represents a = aj êj and b = bk êk . So we can also write the compo-
nent form as:
So we can write,
δi1 δi2 δi3 δp1 δp2 δp3
ϵijk ϵpqr = det δj1 δj2
δj3 det δq1
δq2 δq3
δk1 δk2 δk3 δr1 δr2 δr3
δi1 δi2 δi3 δp1 δq1 δr1
= det δj1 δj2 δj3 det δp2 δq2 δr2 ∵ det[A] det[B] = det[A] det[B]t
δk1 δk2 δk3 δp3 δq3 δr3
δi1 δi2 δi3 δp1 δq1 δr1
= det δj1 δj2 δj3 δp2 δq2 δr2 ∵ det[A] det[B] = det([A][B]t )
δk1 δk2 δk3 δp3 δq3 δr3
δim δpm δim δqm δim δrm δip δiq δir
= det δjm δpm
δjm δqm δjm δrm = det δjp δjq δjr
δkm δpm δkm δqm δkm δrm δkp δkq δkr
Consider,
a × b = ϵmij ai bj em , a × b = ϵpqr aq br ep .
Then
(a × b) · (a × b) = (ϵmij ai bj em ) · (ϵpqr aq br ep )
= ϵmij ϵpqr ai bj aq br (em · ep )
= ϵmij ϵpqr ai bj aq br (δmp )
= ϵmij ϵmqr ai bj aq br
= (δiq δjr − δir δjq ) ai bj aq br
= δiq δjr ai bj aq br − δir δjq ai bj aq br
= aq br aq br − ar bq aq br
= (aq aq )(br br ) − (ar br )(aq bq )
= (a · a)(b · b) − (a · b)2
a = a(x1 , x2 , x3 )
ai = ai (x1 , x2 , x3 )
aij = aij (x1 , x2 , x3 )
Then
∂a
a, i = = ∂i a
∂xi
∂ai
ai, j = = ∂ j ai
∂xj
∂aij
aij, k = = ∂k aij (10)
∂xk
Example 3 Show that
∇ × ϕu = ∇ϕ × u + ϕ∇ × u
2 Algebraic structures
Consider the set X , equipped with some binary operations. A binary operation,
∗, takes two elements of the set and combine them to produce one element of the
same set. One can write
∗ : X × X → X,
A set with some binary operations and some rules creates an algebraic structure.
2.1 Group
A group is a set G together with a binary operation ∗ and satisfy the following set
of rules/axioms:
1. Closer: a, b ∈ G, then a ∗ b ∈ G
2. Associativity: If a, b, c ∈ G, then a ∗ (b ∗ c) = (a ∗ b) ∗ c.
1. Closer: a + b ∈ Z
2. Associativity: a + (b + c) = (a + b) + c.
where θ is the angle of rotation. The binary operation ∗ is the matrix multiplica-
tion.
3. Identity element:
1 0
R(0) =
0 1
Note that all the matrices are orthogonal. Moreover all of them have determinant
+1. This group is commonly called as Special Orthogonal Group of 2 × 2 matrices
or SO(2). The general form is SO(n). If we remove the constraint that the
determinant is +1, then we call it Orthogonal Group O(n).
2.2 Field
Suppose F is equipped with two binary operations called addition and multiplica-
tion and denoted by + and ·, respectively. This means ∀a, b ∈ F, we have a+b ∈ F
and a · b ∈ F. This algebraic structure (F, +, ·) is called a field if the following
postulates are satisfied :
Figure 3: Essential elements of a field and the closer properties under addition and
multiplication. The boundary of the set in the diagram is just to schematically
represent a set.
since aij + bij = bij + aij ∈ F as aij , bij ∈ F. Similarly, you can verify the
rest of the following relations.
Let Fn (x) denote the set of all polynomials of x of degree n over a field F. Let us
consider the following polynomials:
Then,
since ai + bi = ci ∈ F as ai , bi ∈ F.
8. 1f (x) = f (x)
Note that the polynomial are nonlinear with respect to x but the polynomial space
is linear!
x = a1 x 1 + a2 x 2 + · · · + an x n = ai x i
a1 x1 + a2 x2 + · · · + an xn = 0, ai ∈ F, 1 ≤ i ≤ n
a1 e1 + a2 e2 + · · · + an en = 0.
So,
a1 0 0 0 a1 0
0 a2 0 0 a2 0
.. + .. , · · · .. = .. =⇒ .. = ..
. . . . . .
0 0 an 0 an 0
Figure 4: (a) Two mutually perpendicular vectors and (b) two oblique vectors.
The first figure is obvious. Let’s examine the second case. We write,
a1 b1 0 a1 b 1 α 0
α +β = , i.e, = . (12)
a2 b2 0 a2 b 2 β 0
If the determinant of the coefficient matrix is not equal to zero, then we always get
α = β = 0. Then a and b are linearly independent. However, if the determinant
is zero, then
a2 b2
a1 b2 − a2 b1 = 0, or = .
a1 b1
This means a and b have the same direction (Fig. 5). We can express any of them
with a scalar multiplication of the other vector, say b = λa, where λ belongs to
the field and represents a scaling.
Example 13 Let
1 0 −1
S= 2 , 3 , 0 ⊂ V = R3 .
0 1 1
This gives,
a−c = 0
2a + 3b = 0
b + c = 0.
Span (B) = a1 g 1 + a2 g 2 + · · · + ap g p
1. B is linearly independent
2. Span(B) = V
form a basis of V = R3 .
We further write,
1 2 1 a 0
2 1 −1 b = 0
1 0 2 c 0
Now det A = −11 ̸= 0, and the only solutions are a = b = c = 0. So the vectors
are linearly independent. Next we need to show that they also span the space.
Consider an arbitrary vector
k1
k2 ∈ V = R3 .
k3
Then,
1 2 1 a k1
2 1 −1 b = k2 .
1 0 2 c k2
As detA ̸= 0, for any arbitrary k1 , k2 , k3 , the system is solvable uniquely. So, the
considered vectors span R3 , and they are a basis.
Note that they are linearly independent. This particular basis is known as standard
basis in Rn .
Example 16 What is the dimension of the space V = {0}, i.e., space only with
the zero element?
First check that (by yourself) V = {0} is a vector space. Since {0} is the only
element, the only possible basis is B = {0}. Then,
α0 = 0
always satisfies for any α ̸= 0, i.e., the considered space can not have any basis.
So dim V = 0.
Theorem 3.1 The necessary and sufficient condition for a nonempty subset W
of a vector space V over field F to be a subspace is
a, b ∈ F, and x, y ∈ W =⇒ ax + by ∈ W. (14)
Necessary condition
Consider W is a subspace of V. Then W must be closed under scalar multiplication
and vector addition. This means
a ∈ F, x ∈ W =⇒ ax ∈ W
b ∈ F, y ∈ W =⇒ by ∈ W
Sufficient condition
Now suppose W is a non-empty subset of V satisfying the given condition ax+by ∈
W for all a, b ∈ F, and x, y ∈ W. One can show that the proposed conditions
satisfy all the requirements for the vector space. As ax + by ∈ W,
1. let a = 1, b = 1, then 1.x + 1.y = x + y ∈ W . Thus W is closed under
vector addition.
Thus,
W1 + W2 := {w1 + w2 ; w1 ∈ W1 , w2 ∈ W2 }
1. W1 + W2 = V
2. W1 ∩ W2 = {0}
We then write
tr (α1 A + α2 B) = α1 tr A + α2 tr B, A, B ∈ V, α1 , α2 ∈ F.
Then,
Z b Z b Z b
I(α1 f + α2 g) = [α1 f (x) + α2 g(x)]dx = α1 f (x)dx + α2 g(x)dx
a a a
= α1 I(f ) + α2 I(g).
The extension of this definition to functionals of more than two variables are
simple, and such functions are called multilinear functionals.
g :V ×V →R
1. g⟨u, v⟩ is bilinear,
3. g⟨u, u⟩ > 0, if u ̸= 0.
We call g⟨u, v⟩ the inner product of u and v and use the notation of ‘dot product
as
g⟨u, v⟩ = u · v
g⟨g i , g j ⟩ = g i · g j = gij
and
A vector space V equipped with the inner product is called an inner product space
and from now our vector spaces are all inner product space.
g i · g j = δji
where δji is called the Kronecker delta. The set {g 1 , g 2 , · · · , g n } becomes another
basis of the same vector space V again. Such a basis is known as reciprocal basis.
g i · g j = δji . (16)
Then {g 1 , . . . , g n } is a basis of V.
Proof: Consider the following linear combination
α1 g 1 + α2 g 2 + · · · + αn g n = 0
Now,
(α1 g 1 + α2 g 2 + · · · + αn g n ) · g j = 0 · g j = 0
αi g i · g j = 0
αi δji = 0
=⇒ αj = 0 for j = 1, · · · , n
v · g i = (v j g j ) · g i = v j (g j · g i ) = v j δji = v i .
uj = gij ui .
u = ui g i = ui g i .
We call
and
Now, we define
g⟨g i , g j ⟩ = g ij = g i · g j .
and
gi · gj = gj · gi, =⇒ g ij = g ji .
Then
uj = g ij ui .
gij : uj → ui , g ij : uj → ui
g i = gij g j , or g i = g ij g j .
Therefore, lowering or raising the index for the reciprocal basis can be made in
the same manner. Note that
g k · g i = gij g k · g j
δik = gij g kj = gij g jk .
A vector space equipped with such a norm is called Euclidean vector space.
ds2 = dx · dx
r = x1 e1 + x2 e2 + x3 e3
The correspondence between points (x1 , x2 , x3 ) and the three scalar parameters
We assume that the mappings are bijective and have continuous derivatives, so
that the correspondence between (x1 , x2 , x3 ) and (u1 , u2 , u3 ) is unique.
If we only vary one local coordinate, say u1 , while the other two local coordi-
nates, (u2 , u3 ), are fixed, then we call the locus of the points (x1 , x2 , x3 ) a coordinate
line. So
P P ′ = OP ′ − OP
= ϕ1 (u1 + δu1 , c2 , c3 )e1 + ϕ2 (u1 + δu1 , c2 , c3 )e2 + ϕ3 (u1 + δu1 , c2 , c3 )e3
∂r ∂r
Similarly, g 2 = ∂u 2 and g 3 = ∂u3 . These three local tangents vectors
n ∂r ∂r ∂r o
{g 1 , g 2 , g 3 } := 1
, , (17)
∂u ∂u2 ∂u3
∂r 1 ∂r ∂r
dr(u1 , u2 , u3 ) = 1
du + 2 du2 + 3 du3
∂u ∂u ∂u
1 2 3
= du g 1 + du g 2 + du g 3 . (19)
So,
Figure 10: Covariant or tangent basis vectors {g 1 , g 2 } for the polar coordinates.
Example 21 Find ds2 for the polar coordinate system (Fig. 10)
We identify u1 = r and u2 = θ. Then,
Therefore,
∂r
g1 = = cos θe1 + sin θe2
∂r
∂r
g2 = = −r sin θe1 + r cos θe2
∂θ
We compute
∂ψ 3 ∂ψ 3 ∂ψ 3
∇ψ 3 (x1 , x2 , x3 ) = e 1 + e 2 + e3
∂x1 ∂x2 ∂x3
or
3 ∂u3 ∂u3 ∂u3
g = e1 + 2 e2 + 3 e3 .
∂x1 ∂x ∂x
Then g 3 = ∇u3 (x1 , x2 , x3 ) is perpendicular to the tangent plane 2 at P (Fig. 11).
2
Recall that gradient of a constant surface is normal to the tangent plane. Suppose we have a
surface xy 3 = z + 2. To find the normal to the surface at (1, 1, −1), we consider f = xy 3 − z = 2.
Then ∇f = y 3 e1 + 3xy 2 e2 − e3 = e1 + 3e2 − e3 is the normal vector at (1, 1, −1).
(1,1,−1)
Now,
∂u3 ∂u3 ∂u3 ∂x1 ∂x2 ∂x3
g3 · g3 = e +
1 1
e 2 + e3 · e 1 + e2 + e3 .
∂x ∂x2 ∂x3 ∂u3 ∂u3 ∂u3
∂u3 ∂x1 ∂u3 ∂x2 ∂u3 ∂x3
= + +
∂x1 ∂u3 ∂x2 ∂u3 ∂x3 ∂u3
∂u3
=
∂u3
= 1
Similarly,
∂u3 ∂u3 ∂u3 ∂x1 ∂x2 ∂x3
g3 · g2 = e +
1 1
e 2 + e3 · e 1 + e2 + e3 .
∂x ∂x2 ∂x3 ∂u2 ∂u2 ∂u2
∂u3 ∂x1 ∂u3 ∂x2 ∂u3 ∂x3
= + +
∂x1 ∂u2 ∂x2 ∂u2 ∂x2 ∂u2
∂u3
=
∂u2
= 0
g i · g j = δji (21)
Example 22 Find the reciprocal basis {g 1 , g 2 } for the polar coordinates, as shown
in Fig. 10.
We first find
p x2
r= (x1 )2 + (x2 )2 , θ = tan−1
x1
So,
∂r ∂r
g 1 = ∇r = e1 + e2
∂x1 ∂x2
x1 x2
= p e1 + p e2
(x1 )2 + (x2 )2 (x1 )2 + (x2 )2
= cos θe1 + sin θe2
and
∂θ ∂θ
g 2 = ∇θ = 1
e1 + 2 e2
∂x ∂x
1
x x2 x2 1
= − p − 1 2 e1 + p − 1 e2
(x1 )2 + (x2 )2 (x ) (x1 )2 + (x2 )2 (x )
2 1
x x
= −p e1 + p e2
(x1 )2 + (x2 )2 (x1 )2 + (x2 )2
sin θ cos θ
= − e1 + e2
r r
1
Moreover, g 1 · g 1 = g 11 = 1, g 1 · g 2 = g 12 = g 21 = 0, g 2 · g 2 = g 22 = r2
.
v = v r g r + v θ g θ + v z g z [LT −1 ]. (22)
Now,
∂r ∂r ∂r
gr = [·], g θ = [L], g z = [·].
∂r ∂θ ∂z
and so the components have the following dimensions
v r [LT −1 ], v θ [T −1 ], v z [LT −1 ].
Note that v r and v z have same units of velocity, but v θ has units of time inverse.
Moreover, if the dimensionless basis vectors are not unit vectors, the magnitude of
the components are not consistent with the observed or measured physical quan-
tities.
The physical components of a vector are the components that have physically
meaningful units and magnitudes. It is often convenient to derive the governing
equations for a problem in terms of vector components but to solve with the
physical components.
We now introduce the following nondimensional basis
gr gθ gz
ĝ r = , ĝ θ = , ĝ z =
|g r | |g θ | |g z |
or
g g g
ĝ r = √ r , ĝ θ = √ θ , ĝ z = √ z
grr gθθ gzz
So we write,
v = vr gr + vθ gθ + vz gz
√ √ √
= (v r grr )ĝ r + (v θ gθθ )ĝ θ + (v z gzz )ĝ z
= v̂ r ĝ r + v̂ θ ĝ θ + v̂ z ĝ z
Here,
√ √ √
v̂ r = v r grr , v̂ θ = v θ gθθ , v̂ z = v z gzz
are the physical components of v. We know that for cylindrical polar coordinate
system,
then
v̂ r = v r , v̂ θ = rv θ , v̂ z = v z
Here, g(ii) means not summed over i. It simply says g(ii) = g i · g i (again, not
summed). Similarly,
gi p
v = vi g i = v̂i ĝ i , ĝ i = p , v̂i = vi g (ii)
g (ii)
Here, too, g (ii) means not summed over i. It simply says g (ii) = g i · g i .
si · sj = δij
α1 s 1 + · · · + αn s n = 0
Taking dot product with sk , we write αk |sk |2 = 0. Note that k is not summed,
rather it is just one term. As |sk |2 = 1, we can further write αk = 0. So, Sm is
linearly independent.
Moreover, If v ∈ V, then v = αk sk = (v · sk )sk (k = 1, n). So any arbitrary
vector in V could be spanned by S.
Thus W ⊥ is the set of all vectors in ·V which are orthogonal to every vector in W.
p · s = 0, q · s = 0.
Let α1 , α2 ∈ F. Then,
(α1 p + α2 q) · s = α1 p · s + α2 q · s = 0.
W ⊕ W ⊥ = V.
h ⊥ {f 1 , · · · , f m } and h ∈ W ⊥ .
Now,
m
X m
X
v =h+ (v · f i )f i , where h ∈ W ⊥ , and (v · f i )f i ∈ W
i=1 i=1
W + W ⊥ = V. (23)
W ∩ W ⊥ = {0}. (24)
W ⊕ W ⊥ = ·V. (25)
v = w + w⊥ .
T :V→U
1. T (x + y) = T x + T y, x, y ∈ V.
2. T (ax) = aT x, a ∈ F, x ∈ V.
This means, we first apply the operation on some elements in V and then map the
resulting element into U. It will be the same if we map those elements first into U
and then perform the operations in U.
A linear transformation T : V → U is called isomorphic if the mapping is
bijective3 , and we write V ∼
= U. T is known as a second order tensor that maps
vectors to vectors.
L = Iω.
The mass moment of inertia I is a second order that maps the vector ω to L.
These two vectors may not necessarily be in the same direction. A scalar multipli-
cation with a vector also gives a vector. However, like a tensor operation, scalar
multiplication can not change the direction of a vector. The above relation also
follows the linear relationship (26), i.e.,
Example 25 The traction vector t, and the unit normal n to a plane (cut) at a
point are connected as
t = σn
where σ is the Cauchy stress and a second order tensor. The above relation also
follows the linear relationship (26).
T (α1 v 1 + α2 v 2 ) = α1 T v 1 + α2 T v 2
= 0+0=0
T : V → U.
T : V → U.
5.7 Isomorphism
When T : V → U is bijective, we call it isomorphism. Moreover, T is isomorphic
if
1. T is nonsingular
2. dim V = dim U
5.8 Invertible T :
When T : V → U is an isomorphism, then T is invertible. We write,
T −1 : U → V
T : V → V.
I : V → V, such that Iv = v
5.11 Endomorphism
When T is an operator and isomorphic, i.e., T : V → V is bijective, and R(T ) = V.
T : V → U.
Similarly, if we express
T g j = T ij f i , thenf i · T g j = T ij . (29)
1. (T 1 + T 2 )v = T 1 v + T 1 v, where v 1 , v 2 ∈ F
So, T 2 ◦ T 1 ∈ Hom (V, W), and we simply write the product T 2 T 1 , i.e, (T 2 T 1 )v =
T 2 (T 1 v). Commutative diagrams are very useful to represent composition map-
pings. We can represent the above mentioned composition mapping as
T1 /
V U . (33)
T2
T 2 ◦T 1 =T 2 T 1
W
v ⊗ u : U → V.
such that
(v ⊗ u)w = (u · w)v
Linearly independent
Consider cij (g i ⊗ f j ) = 0 ⊗ 0 ∈ Hom (U, V). Then
cij (g i ⊗ f j )f m = (0 ⊗ 0)f m
cij [f j · f m ]g i = 0
cij δjm g i = 0
cim g i = 0
(v ⊗ u)w = [u · w]v = uj wj v = uj wj v i g i
We select w = wj f j for the convenience of dot product to avoid the metric (fij or
f ij ) of U. Moreover,
(g i ⊗ f j )u = [f j · u]g i = uj g i
v ⊗ u = v i uj g i ⊗ f j . (34)
4
Take an element αw1 + βw2 ∈ U and proceed
T = T ij g i ⊗ f j .
We can also write the same tensor T as
T = Tij g i ⊗ f j ∈ V ⊗ U
= Ti·j g i ⊗ f j ∈ V ⊗ U
= T·ji g i ⊗ f j ∈ V ⊗ U (35)
Note that Ti·j means i is the first index. A dot (·) is used to denote the second
index. T ij , Tij , Ti·j , and T·ji are the different components with respect to the
associated basis of a same tensor. Also note that Tj·i ̸= T·ji .
u⊗v
(u⊗v )◦(v 1 ⊗w )=(u⊗v )(v 1 ⊗w )
U
If p ∈ W, then
(u ⊗ v)(v 1 ⊗ w)p = (u ⊗ v) ◦ (v 1 ⊗ w)p = (u ⊗ v)((v 1 ⊗ w)p)
= (u ⊗ v)([w · p]v 1 )
= u[v 1 · v][w · p]
= (u[w · p])[v · v 1 ]
= ((u ⊗ w)p)[v · v 1 ]
= (u ⊗ w)[v · v 1 ] p
A tensor of type (r, 0) is called contravariant tensor and one of type (0, s) is called
covariant tensor. Suppose {g i } and {g i } are the contravariant and covariant bases
of V respectively, then
g i1 ⊗ · · · ⊗ g ir ⊗ g j1 ⊗ · · · ⊗ g js
form a basis of Tsr . Therefore an (r, s)-type tensor T can be uniquely expressed as
T = T·ij11...i j1 js
...j s g i1 ⊗ · · · ⊗ g ir ⊗ g ⊗ · · · ⊗ g ∈ Ts
r r
Note that this is one particular format that is often useful. You have to start with
covariant basis and contravariant basis will come after them. You can not have a
suffling of covariant and contravariant basis.
T ⊗ S ∈ Tsr11+s
+r2
2
This means if
i ...i
T = T· j11 ...jr1s1 g i1 ⊗ · · · ⊗ g ir1 ⊗ g j1 ⊗ · · · ⊗ g js1 ∈ Tsr11
and
m ...m
S = S· n11 ...nsr22 g m1 ⊗ · · · ⊗ g mr2 ⊗ g n1 ⊗ · · · ⊗ g ns2 ∈ Tsr22
then
i ...i m ...m
T ⊗ S = T· j11 ...jr1s1 S· n11 ...nsr22 g i1 ⊗ · · · ⊗ g ir1 ⊗ g m1 ⊗ · · · ⊗ g mr2
⊗g j1 ⊗ · · · ⊗ g js1 ⊗ g n1 ⊗ · · · ⊗ g ns2 ∈ Tsr11+s
+r2
2
.
A ⊗ (B ⊗ C) = (A ⊗ B) ⊗ C ∈ Tsr11+s
+r2 +r3
2 +s3
A ⊗ (B + C) = A ⊗ B + A ⊗ C ∈ Tsr11+s
+r2
2
(A + B) ⊗ C = A ⊗ C + B ⊗ C ∈ Tsr11+s
+r2
2
4. For each α ∈ F:
α(A ⊗ B) = αA ⊗ B = A ⊗ αB ∈ Tsr11+s
+r2
2
The vector space equipped with such an addition and multiplication is called tensor
algebra.
7.4 Contraction
Consider tensors of type Tsr or (r, s) such that r, s ≥ 1.
r−1
such that a tensor space Tsr becomes Ts−1 . For example, let us consider the tensor
T = T·kmp g m ⊗ g p ⊗ g k ∈ T12
Note that:
One needs to be specific about the contracting basis elements if multiple options
are there. Otherwise, one should start contracting from the last.
A double contraction is a mapping
r−2
such that a tensor space Tsr becomes Ts−2 . The symbol ‘:’ is used when two
contractions take place. Consider,
mp
T = T·kl g m ⊗ g p ⊗ g k ⊗ g l ∈ T22
Then
mp mp m
: T = T·kl [g m · g k ][g p · g l ] = T·kl [δk ][δlp ] = T·kl
kl
∈ T00
A n-contraction is a mapping
r−n
[•] : Tsr → Ts−n
r−n
such that a tensor space Tsr becomes Ts−n . Unfortunately, the symbol ‘•’ (or
sometimes :) is used in most of the time beyond double contraction. You need to
understand it from the context.
Single contraction between two tensors are defined in the following way. First
we will consider for simplicity a pure contravariant (r1 , 0) and a pure covariant
tensor (0, s2 ) such that
−1
[·] : T0r1 × Ts02 → Tsr21−1
A single dot ‘·’ is used for a single contraction. For example, if T = T ijk g i ⊗g j ⊗g k
and S = Sqr g q ⊗g r , then, a single contraction between r and k (start from extreme
right) is given by
T · S = (T ijk g i ⊗ g j ⊗ g k ) · (Sqr g q ⊗ g r )
= T ijk Sqr g i ⊗ g j ⊗ g q [g k · g r ]
= T ijk Sqr g i ⊗ g j ⊗ g q [δrk ]
= T ijk Sqk g i ⊗ g j ⊗ g q
T : S = (T ijk g i ⊗ g j ⊗ g k ) : (Sqr g q ⊗ g r )
= T ijk Sqr g i [g j · g q ][g k · g r ]
= T ijk Sqr g i [δjq ][δrk ]
= T ijk Sjk g i
T • S = (T ijkl g i ⊗ g j ⊗ g k ⊗ g l ) : (Sqr g q ⊗ g r )
= T ijkl Sqr g i ⊗ g j [g k · g q ][g l · g r ]
= T ijkl Sqr g i ⊗ g j [δkq ][δlr ]
= T ijkl Skl g i ⊗ g j
(u ⊗ v ⊗ w ⊗ x ⊗ y)(a ⊗ b ⊗ c) = u ⊗ v ⊗ w ⊗ x ⊗ b ⊗ c[y · a]
For a, b, c, d, x, y ∈ V we write
1. (a ⊗ b)c = (b · c)a
2. (a ⊗ b) : (c ⊗ d) = (a · c)(b · d)
3. (a ⊗ b)(c ⊗ d) = (a ⊗ d)(b · c)
4. (a ⊗ b ⊗ c)d = (a ⊗ b)(c · d)
5. (a ⊗ b ⊗ c) : x ⊗ y = a(b · x)(c · y)
We already know from (35), T ∈ V ⊗ V, can be written in four possible ways.
Among those four, we consider elements for the tensor spaces in the form T02 , T20
and T11 . Then,
T = T ij g i ⊗ g j ∈ T02
= Tij g i ⊗ g j ∈ T20
= T·ji g i ⊗ g j ∈ T11 (40)
Note that Ti·j g i ⊗ g j does not belong to any tensor space. T ij and T·ji are also
called associate tensor components of Tij .If we consider T = T ij g i ⊗ g j , then
T g k = T ij (g i ⊗ g j )g k = T ij (g j · g k )g i = T ij δjk g i = T ik g i
=⇒ g m · T g k = g m · T ik g i = T ik g m · g i = T ik δim = T mk
In summary,
Similarly,
g i · T g j = T ij , (g im g m ) · T (g jn g n ) = T ij , g im g jn (g m · T g n ) = T ij
=⇒ g im g jn Tmn = T ij .
So, knowing the components with respect to a basis, one can directly calculate
components with respect to other basis through gij and g ij . In summary,
u ⊗ v : a ⊗ b = [u · a][v · b].
g = gij g i ⊗ g j .
g = g ij g i ⊗ g j
= δ·ji g i ⊗ g j = g i ⊗ g i
You operate g on any vector v i g i or vi g i , you will get that vector back, i.e., gv = v.
So g is indeed an identity tensor!
g gi
ĝ i = √ i , ĝ i = p .
g(ii) g (ii)
Then the physical component of tensors could be identified as follows:
√ √
T = T ij g i ⊗ g j = (T ij g(ii) g(jj) )ĝ i ⊗ ĝ j = T̂ ij ĝ i ⊗ ĝ j
p p
= Tij g i ⊗ g j = (Tij g (ii) g (jj) )ĝ i ⊗ ĝ j = T̂ij ĝ i ⊗ ĝ j
√ p
= T·ji g i ⊗ g j = (T·ji g(ii) g (jj) )ĝ i ⊗ ĝ j = T̂·ji ĝ i ⊗ ĝ j
The notion of transpose is similar to what we studied in the dual space. How-
ever, we have only one vector space V. Consider,
T :V→V
such that v and T v belong to V. Let a and u be the any other two vectors such
that v · a = T v · u. Then the transpose is the mapping T t u = a, and satisfies the
relation (Fig. 17)
v · T t u = T v · u. (42)
Example 29 Show that (a⊗b)t = (b⊗a), where a, b ∈ V and a⊗b ∈ Hom (V, V).
v · (a ⊗ b)t u = (a ⊗ b)v · u
= (b · v)a · u
= (a · u)b · v
= (b ⊗ a)u · v
= v · (b ⊗ a)u (43)
T t = (T ij g i ⊗ g j )t = T ij g j ⊗ g i = T ji g i ⊗ g j = T t ij g i ⊗ g j , =⇒ T t ij = T ji
T t = (Tij g i ⊗ g j )t = Tij g j ⊗ g i = Tji g i ⊗ g j = Tijt g i ⊗ g j , =⇒ Tijt = Tji
T t = (T·ji g i ⊗ g j )t = T·ji g j ⊗ g i = T·ij g i ⊗ g j = Tit ·j g i ⊗ g j .
Note that the component for of the last expression is not admissible as g i ⊗ g j
does not belong to any admissible tensor space.
This means the symmetry is not define due to the exchange of covariant and
contravariant basis. For a mixed basis, the transpose creates elements which do
not reside in the initial tensor space!
7.6 Invariants in 3D
Consider T ∈ Hom (V, V) such that dim V = 3. We can immediately write the
three invariants as
[T u, v, w] + [u, T v, w] + [u, v, T w]
= I1 = tr T (44)
[u, v, w]
[T u, T v, T w]
= I3 = det T (46)
[u, v, w]
where [a, b, c] = a · (b × c).
Cof T (u × v) := T u × T v. (47)
8 Orthogonal tensors
Let R ∈ Hom (V, V) and dim V = 3 such that Rt = R−1 or alternatively
RRt = Rt R = I. (49)
2. It preserves the angle between two vectors after transformation (Fig. 18):
Ru · Rv = u · Rt Rv = u · Iv = u · v. So,
Ru · Rv u·v
cos θ = =
|Ru||Rv| |u||v|
Qu
v
θ
Qv
θ u
(a) (b)
Figure 18: θ and the lengths of the vectors remain the same after an orthogonal
transformation.
1. for a vector v: v ′ = Rv
9 Skew-symmetric tensors
u
Wu = w × u
1. u · W v = W t u · v = −v · W u.
If ω = (ω1 , ω2 , ω3 )t , then
0 −ω3 ω2
[W ]ij = ω3 0 −ω1 (50)
−ω2 ω1 0
T n = λn or (T − λg)n = 0. (51)
where g is the identity tensor. As we know that there are three possibilities to
express T and g, i.e.,
T = T ij g i ⊗ g j , T = Tij g i ⊗ g j , T = T·ji g i ⊗ g j .
g = g ij g i ⊗ g j , g = gij g i ⊗ g j , g = δ·ji g i ⊗ g j = g i ⊗ g i ,
Note that the matrix representations of the last two forms of I are diagonal. Now
going back to the eigenvalue problem, we have to make sure that both T and I
should be expressed with respect to the same basis for the subtraction operation.
Let us consider the form T = Tij g i ⊗ g j (also Tij = Tji ), then we should consider
I = gij g i ⊗ g j . Let n = nk g k (we can also consider n = nk g k ), then
h i
(T − λg)n = (Tij − λgij )g i ⊗ g j nk g k
= (Tij − λgij )nk δkj g i
= (Tij − λgij )nj g i
The above form is known as generalized eigenvalue problem, i.e., in the form of
An = λBv. However, solving a simple eigenvalue problem is convenient. So we
start with T = T·ji g i ⊗ g j ∈ T11 and g = δ·ji g i ⊗ g j ∈ T11 and n = nk g k such that
det(T·ji − λδ·ji ) = 0
P(λ) = λ3 − I1 λ2 + I2 λ − I3 ,
T 3 − I1 T 2 + I2 T − I3 g = 0.
Operating on n, one can write (λ3 −I1 λ2 +I2 λ−I3 )n = 0 or λ3 −I1 λ2 +I2 λ−I3 = 0
as n ̸= 0. So, T satisfies its own characteristic equation λ3 − I1 λ2 + I2 λ − I3 = 0.
Let m = mk g k be the unit left eigenvector with an eigenvalue µ. We then
write, by definition,
m · T = µm or (T t − µg)m = 0. (52)
Theorem 10.1 Let T ∈ Hom (V, V), and dim V = 3. If T has three distinct eigenvalues,
and n(i) = nk(i) g k is the right eigenvector corresponding to i-th eigenvalue, then
{n(1) , n(2) , n(3) } is a basis of V. The eigenvectors n(i) are covariant in nature and
they need not to be orthonormal.
Consider,
Now n(3) is not a null vector, and all the eigenvalues are distinct, i.e., λ1 ̸= λ2 ̸= λ3 .
This means c3 = 0. Similarly, starting with the operators (T −λ2 g), and (T −λ3 g)
give c1 = 0. And starting with the operators (T − λ1 g), and (T − λ3 g) give c2 = 0.
This means {n(1) , n(2) , n(3) } is linearly independent. As theh dimension of the
space is 3, we have {n(1) , n(2) , n(3) } is a basis.
Similarly, we can write
Theorem 10.2 Let T ∈ Hom (V, V), and dim V = 3. If T has three distinct eigenvalues,
(i)
and m(i) = mk g k is the right eigenvector corresponding to i-th eigenvalue, then
{m(1) , m(2) , m(3) } is a basis of V. The eigenvectors m(i) are contravariant in
nature and they need not to be orthonormal.
Theorem 10.3 Let T ∈ Hom (V, V), where dim V = 3. Then any two distinct
left and right eigenvectors are biorthonormal. If T has three distinct eigenvalues,
then
i.e., the left eigenvectors are the reciprocal basis of the right eigenvalues.
Let T n(i) = λi n(i) . Now,
We write,
As λi ̸= λj , we get m(j) ⊥ n(i) . For i = j, from above we can write m(i) · n(i) ̸= 0
(not summed), i.e., m(i) · n(i) can take any value. We select m(i) · n(i) = 1. This
is a choice, not autometic. So we write,
as reciprocal basis of {n(1) , n(2) , n(3) }. Note that if we already normalized the left
and right eigenvectors, to satisfy m(i) · n(i) = 1, we need to rescale the magnitude
again and after that they need not to be a unit vector again. As direction is the
only concern of the eigenvectors, such a rescaling of the magnitude does not affect
the results.
Theorem 10.4 Let T ∈ Hom (V, V), where dim V = 3 and T ∈ Sym . Then
any two distinct left and right eigenvectors are equal (covariant). Moreover, the
eigenvectors are mutually orthonormal. If T has three distinct eigenvalues, then
n(i) · n(j) = δij . That is m(i) = n(i)
We have
T n = λn and T t m = λm ⇒ T m = λm
This shows m = n, i.e, left and right eigenvectors are equal and covariant type.
Now, consider,
T n(i) = λi n(i) , and T n(j) = λj n(j) , (i ̸= j)
Now,
n(j) · T n(i) = n(j) · λi n(i) , ⇒ T t n(j) · n(i) = λi n(j) · n(i) ⇒ T n(j) · n(i) = λi n(j) · n(i) ,
⇒ λj n(j) · n(i) = λi n(j) · n(i) , ⇒ (λj − λi )n(j) · n(i) = 0.
We write,
λj n(j) · n(i) = λi n(j) · n(i) , ⇒ (λj − λi )n(j) · n(i) = 0.
As λi ̸= λj , we get n(j) ⊥ n(i) . For i = j, from above we can write n(i) · n(i) =
1 (not summed), as we already normalized the eigenvectors. Hence the result
n(i) · n(j) = δij .
can check that T n(i) = λi n(i) and n(i) · T = λi n(i) . One of the advantages, among
many, of spectral decomposition is the following:
T 2 = (λ1 n(1) ⊗ n(1) + λ2 n(2) ⊗ n(2) + λ3 n(3) ⊗ n(3) )(λ1 n(1) ⊗ n(1)
+λ2 n(2) ⊗ n(2) + λ3 n(3) ⊗ n(3) )
= λ21 [n(1) (n(1) · n(1) ) ⊗ n(1) ] + λ22 [n(2) (n(2) · n(2) ) ⊗ n(2) ] + λ23 [n(3) (n(3) · n(3) ) ⊗ n(3) ]
= λ21 n(1) ⊗ n(1) + λ22 n(2) ⊗ n(2) + λ23 n(3) ⊗ n(3)
In general,
T n = λn1 n(1) ⊗ n(1) + λn2 n(2) ⊗ n(2) + λn3 n(3) ⊗ n(3) , (n ∈ R). (56)
We also have,
This means that any linear combinations of the eigenvectors n(2) and n(3) will
be an eigenvector with the eigenvalue λ. Moreover, n(2) and n(3) need not to be
orthonormal to each other. Note that the choice of n(1) is unique and it has to
be perpendicular to the plane containing n(2) and n(3) (Fig. 20). The spectral
decomposition in this case becomes
n(3)
n(1) 2
1 n(2)
Figure 20: Any vector in the 2 − 3 plane, i.e., linear combination of n(2) and n(3)
is an eigenvector.
T = g,
P = ni·(j) g i ⊗ g j = n(j) ⊗ g j
= n(1) ⊗ g 1 + n(2) ⊗ g 2 + n(3) ⊗ g 3 .
Λ = λ1 g 1 ⊗ g 1 + λ2 g 2 ⊗ g 2 + λ3 g 3 ⊗ g 3
We now perform
and write the diagonalization of a tensor with respect to the selected basis as
Λ = P −1 T P also T = P ΛP −1 (57)
i.e., considering left eigenvectors or the reciprocal basis of {n(1) , n(2) , n(3) }
(P −1 T P )(P −1 T P )...(P −1 T P ) = ΛΛ · · · Λ}
| {z
| {z }
k times k times
P −1 T k P = Λk .
So we write
Λk = P −1 T k P also T k = P Λk P −1 . (58)
Then
or
k
λ1 0 0
[Λk ]i·j = 0 λk2 0 .
k
0 0 λ3 g ⊗g j
i
This is true for any k (negative, fraction, etc.). For fraction value of k, we will
only consider positive definite tensors, which will be discussed next. A tensor
polynomial f (T ) is defined by
n
X
f (T ) = ak T k
k=i
n
X n
X
−1
= ak P Λ P k
= P (ak Λk )P −1
k=i k=i
Xn
= P( ak Λk )P −1
k=i
= P (f (Λ))P −1
Then
n
X n
X n
X
k
f (Λ ) = (ak λk1 )g 1 ⊗g + 1
(ak λk2 )g 2 ⊗g + 2
(ak λk3 )g 3 ⊗ g 3
k=i k=i k=i
or
Pn
k=i ak λk1 P 0 0
n
[Λk ]i·j = 0 k=i ak λk2 Pn 0
.
k
0 0 k=i ak λ3 g i ⊗g j
10.2.2 Series
We can express a series as
∞
X X∞
f (T ) = k
ak T = P ( ak Λk )P −1 = P (f (Λ))P −1
k=i k=i
Then
∞
X ∞
X ∞
X
f (Λ) = (ak λk1 )g 1 ⊗g + 1
(ak λk2 )g 2 ⊗g + 2
(ak λk3 )g 3 ⊗ g 3
k=i k=i k=i
or
P∞ k
k=i a k λ1 P 0 0
∞
[Λ]i·j = 0 k
k=i ak λ2 P 0
.
∞ k
0 0 k=i ak λ3 g ⊗g j
i
Then
∞ ∞ ∞
X λk 1
X λk 2
X λk 3
eΛ = g1 ⊗ g1 + g2 ⊗ g2 + g3 ⊗ g3
k=i
k! k=i
k! k=i
k!
λ1 1 λ2 2 λ3 3
= e g1 ⊗ g + e g2 ⊗ g + e g3 ⊗ g
or
λk1
P∞ λ
k=i k! 0 0 e 1 0 0
λk2
[Λ]i·j =
P∞
= 0 e λ2 0 .
0 k=i k! 0
0 0
P∞ k
λ3 0 0 eλ3 g ⊗g j
k=i k! g i ⊗g j i
v · T v ≥ 0, v · T v = 0 =⇒ v = 0.
v · T v = v · (T s + T ss )v = v · T s v + v · T ss v = v · T s v + 0.
So, even for a general tensor, it is sufficient to study the symmetric part of it.
When a symmetric tensor T is positive definite, we say T ∈ Psym .
Theorem 10.5 Let T ∈ Hom (V, V), and also T ∈ Psym , where dim V = 3.
Then all the eigenvalues of T are strictly positive.
Let T n = λn. Then,
n · Tn
n · T n = λn · n, ⇒ λ = .
n·n
By definition, as T ∈ Psym , n · T n > 0. Also n · n > 0. These two facts prove
that λ > 0.
T = RU = V R
We first consider the factor T = RU and start with the fact that T t T ∈ Sym and
u · T t T u = T u · T u = |T u|2 > 0 for any nonzero vector u. Then t
√ t T T ∈ Psym
and so, all the eigenvalues are positive. We can define U = T T ∈ Psym by
considering the positive square roots of the eigenvalues of T t T . Now, R = T U −1 .
Then,
Rt R = (T U −1 )t T U −1 = U −t T t T U −1
= U −1 (T t T )U −1
= U −1 (U 2 )U −1 = U −1 (U U )U −1 = I
So R ∈ Orth. You can check √ that RRt = I. You can now establish the other
relation by starting with V = T T t .
Then the vector space is called normed linear space of simply normed space.
Example 1: If V = R, then ||v|| = |v|, v ∈ R.
p
Example 2: If V = En (Euclidean), then ||v|| = v12 + · · · + vn2 .
Example 3: If V = E2 (Euclidean), then we can define many norms. For example,
the length of the vector v. Note that normed space does not always mean an inner
product space, but an inner product space can always have a norm. Norms always
give rise to a metric, which is the distance between two points. Here the points
means the members of the set. The distance function between any two points are
written as
This particular norm, among many other possible norms, known as metric.
12.3 Differentiation
Let V and U are the two inner product spaces with a metric norm. Moreover, D
is an affine space of V. Let v ∈ V and a map f (v) ∈ U. Note that f needs not
to be a linear transformation. It is a mapping with the argument v. f could be a
scalar valued, vector valued or a tensor valued functions. For example,
and so on. Let f be defined at the neighborhood of 0 ∈ V. If f (v) approaches 0 ∈ U faster than v,
we write
||f (v)||
f (v) = o(v), defined as lim = 0.
v →0 ||v||
For example, consider f (t) = ta for a > 1, then f (t) = o(t). Let D be a subset of
f (x + v) − f (x)
Note that v is fixed and α is a variable. This means o(αv) approaches zero faster
than α, i.e., o(αv) = o(α) and we write
||f (x + αv) − f (x) − D x f (x)[αv]||
lim = 0
α→0 ||α||
f (x+αv )−f (x)
|| α
− D x f (x)[v]||
lim |α| = 0
α→0 |α|
f (x + αv) − f (x)
|| lim − D x f (x)[v]|| = 0.
α→0 α
The final form becomes
f (x + αv) − f (x)
D x f (x)[v] = lim , (62)
α→0 α
and we call D x f (x) directional derivative or Gateaux derivative. If we further
expand f (x + αv) in a Taylor series, we get
∂f (x) 1 ∂f (x) 2
f (x + αv) = f (x) + (αvi ) + (α vi vj ) . . . .
∂xi 2 ∂xi ∂xj
Now,
d ∂f (x)
f (x + αv) = (vi )
dα α=0 ∂xi
d2 ∂f (x)
f (x + αv) = (vi vj )
dα2 α=0 ∂xi ∂xj
..
. (63)
d
D x f (x)[v] = f (x + αv) (64)
dα α=0
ϕ(u + v) = (u + v) · (u + v) = u · u + u · v + v · u + v · v
= ϕ(u) + 2u · v + o(v).
Then,
= (u · v + 2v · u + 2αv · v)
α=0
= 2u · v
Recall that
det(A − λI) = −λ3 + I1 λ2 − I2 λ + I3
For λ = −1, we can write
det(A + I) = 1 + I1 + I2 + I3 .
Now selecting V ∈ Hom (V, V)
ϕ(A + V ) = det(A + V ) = det A(I + A−1 V ) = det A det(I + A−1 V )
= det A(1 + tr (A−1 V ) + o(V ))
= det A + det A tr (A−1 V ) + o(V ).
= det A + det A tr (A−1 V ) + o(V ).
So,
D A ϕ(A)[V ] = det A tr (A−1 V )
= (det A)A−t : V
So,
So,
If V = R, and t, α ∈ R then
D t h(t)[α] = π(D t f (t)[α], g(t)) + π(f (t), D t g(t)[α])
D t h(t)α = π(D t f (t)α, g(t)) + π(f (t), D t g(t)α)
= απ(D t f (t), g(t)) + απ(f (t), D t g(t)) as π is bilinear
and
d
D t h(t) = h(t) = ḣ(t) = π(ḟ (t), g(t)) + π(f (t), ġ(t)). (71)
dt
We have the following product rules for different π:
d
(ϕv) = ϕ̇v + ϕv̇
dt
d
(u · v) = u̇ · v + u · v̇
dt
d
(u ⊗ v) = u̇ ⊗ v + u ⊗ v̇
dt
d
(T S) = Ṫ S + T Ṡ
dt
d
(T : S) = Ṫ : S + T : Ṡ
dt
d
(T v) = Ṫ v + T v̇
dt
Note that V = R. Let f (t) = A(t), and g = det() such that g(f (t)) = det A(t).
So, following equation (66) we write
d ˙ = (det A)A−t : Ȧ
(det A(t)) = D A(t) (det A(t))[A(t)] (75)
dt
AE 6104 80 Krishnendu Haldar
12.8 Affine space
12.9.1 Gradient
1. Consider a vector valued scalar function
h(x) : D → R.
then we write
h(x) : D → V
then
3. In general, if
then
Example 35 Show that grad (ϕw) = w⊗grad ϕ+ϕ grad w, where ϕ(x) ∈ ·F =
R, and w(x) ∈ ·G.
7
The notation ⊗n V = V ⊗ V ⊗ · · · ⊗ V means n copies of V.
we write
So,
12.9.2 Divergence
Divergence will make sense when T (x) is a vector or higher order tensor. Consider
T (x) ∈ ⊗n V Then
12.9.3 Curl
Consider the map
ϕ : Rn → R3 .
Generalizing curl in any dimension and any order requires exterior calculus, which
is out of scope in this course.
T n dA = div T dV (81)
∂D D
You are already familiar with (79) and (80) from your elementary vector calculus
course. So, we will prove only (81).
Let a be a constant vector. Then
Z Z Z
a· T n dA = a · T n dA = T t a · n dA
∂D Z∂D ∂D
Z
t
= div (T a) dV = a · div (T ) dV
DZ D
= a· div (T ) dV
Z Z D
=⇒ T n dA = div T dV
∂D D
R R
Example 37 Show that ∂D
(u ⊗ n) dA = D
grad u dV
= a· grad u dV
Z Z D
=⇒ (u ⊗ n) dA = grad u dV
∂D D
∂g i
= Γijk g k
∂uj
and
∂g i
· g = Γijk (u). (84)
∂uj k
We call Γijk as Christoffel symbols of 1nd kind. Note that
∂g i ∂g i m ∂g i m
Γijk = j
· g k = j
· gmk g = g mk j
· g = gmk Γm
ij . (85)
∂u ∂u ∂u
Also note that
∂g i k ∂ ∂x k ∂ ∂x ∂g j k
Γkij = · g = · g = · g k
= · g = Γkji
∂uj ∂uj ∂ui ∂ui ∂uj ∂ui
and
∂g i ∂ ∂x ∂ ∂x ∂g j
Γijk = j
· gk = j i
· gk = i j
· gk = · g k = Γjik
∂u ∂u ∂u ∂u ∂u ∂ui
∂gi
Example 38 Show that ∂uj
= −Γimj g m
We take advantage the relation g i · g m = δm
i
by taking derivative with respect to
j
u we write
∂g i ∂g
j
· g m + g i · mj = 0
∂u ∂u
∂g i
·g = −Γimj
∂uj m
∂g i
j
= −Γimj g m (87)
∂u
∂
∂k gij = (∂k g i ) · g j + g i · (∂k g j ) (where ∂k ≡ )
∂uk
= Γikj + Γjki . (88)
1
Γijk = − ∂k gij + ∂i gjk + ∂j gki (91)
2
Similarly,
1 mi
Γm
jk
mi
= g Γijk = g − ∂k gij + ∂i gjk + ∂j gki (92)
2
This means that knowing the metric, we will be able to compute Christoffel sym-
bols.
ϕ = D → R.
and we have
for a scalar function ϕ(u). The directional derivative along g i could be written as
D u ϕ(u)[g i ] = grad ϕ(u) · g i . (93)
From (64) we further write
d
D u ϕ(u)[g i ] = ϕ(u + αg i )
dα α=0
d
= ϕ(u1 , u2 , . . . , ui + α, . . . , un )
dα α=0
∂ϕ
=
∂ui
where u = uk g k . So we write from (93)
∂ϕ
grad ϕ(u) · g i =
∂ui
which implies
∂ϕ i
grad ϕ(u) = g. (94)
∂ui
Note that grad ϕ(u) is a contravariant vector.
∂ϕ
grad ϕ(u) = ⊗ gi. (96)
∂ui
If ϕ = ϕi g i , then
∂ϕ ∂ k ∂ϕk k ∂g k
= (ϕ g k ) = g k + ϕ
∂ui ∂ui ∂ui ∂ui
k
∂ϕ
= g + ϕk Γm ki g m
∂ui k
So, we write
∂ϕ ∂ϕm
i
grad ϕ(u) = i
⊗g = i
+ ϕ Γki g m ⊗ g i = ϕm g m ⊗ g i .
k m
∂u ∂u i
∂ϕm
ϕm = + ϕk Γm
ki
i ∂ui
∂ϕ ∂ k ∂ϕk k ∂g k
= (ϕk g ) = g + ϕk
∂ui ∂ui ∂ui ∂ui
∂ϕk k
= g − ϕk Γkmi g m , from equation (87)
∂ui
So, we write
∂ϕ i
∂ϕ
m k
m i m i
grad ϕ(u) = i
⊗ g = i
− ϕ k mi g ⊗ g = ϕm g ⊗ g .
Γ
∂u ∂u i
∂T ∂T ij
k tj i it j k ij
grad T (u) = ⊗ g = + T Γkt + T Γkt g i ⊗ g j ⊗ g = T gi ⊗ gj ⊗ gk .
∂uk ∂uk k
where
∂T ij
T ij = k
+ T tj Γikt + T it Γjkt
k ∂u
If T = Tij g i ⊗ g j , we write
∂T k
∂T
ij t t
i j k ij
grad T (u) = k
⊗ g = k
− T Γ
it jk − Tjt ik g ⊗ g ⊗ g = T
Γ gi ⊗ gj ⊗ gk .
∂u ∂u k
where
∂Tij
Tij = − Tit Γtjk − Ttj Γtik (98)
k ∂uk
Example 39 Show that the covariant derivative of the metric coefficient gij is
always zero.
Replacing gij in place of Tij in equation (98)
∂gij
gij = − git Γtjk − gjt Γtik
k ∂uk
∂gij
= − Γjki − Γikj
∂uk
∂gij
= − (Γjki + Γikj )
∂uk
∂gij ∂gij
= − from equation (88)
∂uk ∂uk
= 0 (99)
∂g
Example 40 Show that Γiik = 12 g ij ∂uijk .
∂gij
= git Γtjk + gjt Γtik
∂uk
∂gij
g ij k = g ij git Γtjk + g ij gjt Γtik
∂u
= δtj Γtjk + δti Γtik
1 ∂gij
= Γjjk + Γiik = 2Γiik =⇒ Γiik = g ij k (100)
2 ∂u
√
∂ g
Example 41 Show that Γiik = √1 .
g ∂uk
Then
div ϕ(u) = grad ϕ(u) : g = tr (grad ϕ(u)) = ϕi
i
i
∂ϕ
= + ϕk Γiki
∂ui √
∂ϕi k 1 ∂ g
= +ϕ √ from (101)
∂ui g ∂uk
√
1 ∂ gϕk
= √ (102)
g ∂uk
grad T (u) = T ij g i ⊗ g j ⊗ g k .
k
Then
= T ij g i (δjm )(δm
k
)
k
ij
= T g i δjk
k
= T ik g i
k
You can consider any form among the four possible options. You will get the same
final results. However, intermediate steps will be different.
3. Show that
√
g 1 · (g 2 × g 3 ) = g
4. Show that
g2 × g3 g3 × g1 g1 × g2
g1 = √ , g2 = √ , g3 = √
g g g
5. Show that
g i × g j = eijk g k
√
where eijk = gεijk , and εijk is the permutation with respect to the standard
basis.
6. Show that
√ √ √ 1
g1 = g(g 2 × g 3 ), g2 = g(g 3 × g 1 ), g3 = g(g × g 2 )
7. Show that
g i × g j = eijk g k
√
where eijk = εijk / g, and εijk is the permutation with respect to the stan-
dard basis.
1
8. Show that the differential volume element dV of an infinitesimal paral-
lelepiped with sides g 1 du1 , g 2 du2 and g 3 du3 , where (u1 , u2 , u3 ) are the local
(curvilinear) coordinates is given by
√
dV = gdu1 du2 du3
x1 = r cos θ,
x2 = r sin θ,
x3 = x 3 .
2
(b) Spherical (Fig. 2): (r, θ, ϕ) implies (radial, polar, azimuthal)
x1 = r cos θ sin ϕ,
x2 = r sin θ sin ϕ,
x3 = r cos ϕ.
x1 = (R − r cos θ) cos ϕ,
x2 = (R − r cos θ) sin ϕ,
x3 = r sin ϕ.
3
Figure 3: Toroidal coordinate system. Read (X, Y, Z) as (x1 , x2 , x3 ).