Tensor Calculus
Tensor Calculus
September 2002
The NASA STI Program Office... in Profile
Since its founding, NASA has been dedicated to • CONFERENCE PUBLICATION. Collected
the advancement of aeronautics and space papers from scientific and technical
scie:nce. The NASA Scientific and Technical conferences, symposia, seminars, or other
Information (STI) Program Office plays a key part meetings SponSored or
cosponsored by
in helping NASA maintain this important role. NASA.
Langley Research Center, the Lead Center for technical, or historical information from
NASA's scientific and technical information. The NASA programs, projects, and rrussions,
NASA STI Program Office provides access to the often concerned with subjects having
NASA STI Database, the largest collection of substantial public interest.
aeronautical and space science STI in the world.
The Program Office is also NASA's institutional • TECHNICAL TRANSLATION. English
-
mechanism for disseminating the results of its language translations of foreign scientific
research and development activities. These results and technical material pertinent to NASA's
are published by NASA in the NASA STI Report mission.
Series, which includes the following report types:
Specialized services that complement the STI
TECHNICAL PUBLICATION. Reports of Program Office's diverse offerings include
completed research or a major significant creating custom thesauri, building customized
phase of research that present the results of databases, organizing and publishing research
NASA programs and include extensive data results. even
.
providing videos.
.
or theoretical
analysis. Includes compilations
of significant scientific and technical data and For more information about the NASA STi
information deemed to be of continuing Program Office, see the following:
reference value. NASA's counterpart of peer -
reviewed formal professional papers but • Access the NASA STI Program Home Page
has lessstringent limitations on manuscript at http://www.sti.nasa.gov
length and extent of graphic presentations.
• E -mau your question via the Internet to
• TECHNICAL MEMORANDUM. Scientific [email protected]
and technical findings that preliminary or
are
of specialized interest, e.g., quick release • Fax your question to the NASA Access
reports, working papers, and bibliographies Help Desk at 301-621-0134
that contain minimal annotation. Does not
contain extensive analysis. •
Telephone the NASA Access Help i)es at
301-621-0390
• CONTRACTOR REPORT. Scientific and
technical findings by NASA-sponsored • Write to:
contractors and grantees. N ASA Access Help Desk
NASA Center for AeroSpace information
7121 Standard Drive
.lIanover, Mi) 21076
NASA / TM-2002-2 11716
September 2002
Available from
Joseph C. Kolecki
National Aeronautics and Space Administration
Glenn Research Center
Cleveland, Ohio 44135
Tensor analysis is the type of subject that can make even the best of students shudder. My own
post -graduate instructor in the subject took away much of the fear by speaking of an implicit
rhythm in the peculiar notation traditionally used, and helped me to see how this rhythm plays its
way throughout the various formalisms.
Prior totaking that class, I had spent many years "playing" on my own with tensors. I found the
going to be tremendously difficult, but was able, over time, to back out soi hysica1 and
geometrical considerations that helped to make the subject a little more transparent. Today, it is
in terms of tensors and their associated concepts.
For those students who wish t ond this humble start, I can only recommend my
professor's wisdom: find th rhythm n the mathematics and you will fare pretty well.
Beginnings
At the heart of all mathematics are numbers.
If I were to ask how many marbles you had in a bag, you might answer, "Three." I would find
your answer perfectly satisfactory. The 'bare' number 3, a magnitude, is sufficient to provide the
information I seek.
If I were to ask, "How far is it to your house?" and you answered, "Three," however, I would
look at you quizzically and ask, "Three what?" Evidently, for this question, more information is
required. The bare number 3 is no longer sufficient; I require a 'denominate' number -
a number
with a name.
Suppose you rejoindered, "Three km." The number 3 is now named as representing a certain
number of km. Such numbers are sometimes called scalars. Temperature is represented by a
If I were next to ask "Then how do I get to your house from here?" and you said, "Just walk
three km," again I would look at you quizzically. This time, not even a denominate number is
sufficient; it is necessary to specify a distance or magnitude, yes, but in which direction?
NASA!TM-2002-2 11716
"Just walk three km due north." The denominate number 3 km now has the
required additional
directional information attached to it. Such numbers are called Velocity is a vector since
vectors.
it has a magnitude and a direction; so is momentum. Quite often, a vector is represented by
components. If you were to tell me that to go from here to your house I must walk three blocks
east, two blocks north, and go up three floors, the vector extending from "here" to "your house"
would have three spatial components:
• Three blocks east,
• Two blocks north,
• Three floors up.
Physically, vectors are used to represent locations, velocities, accelerations, flux densities, field
quantities, etc. The defining equations of the gravitational field in classical dynamics (Newton's
Law of Universal Gravitation), and of the electromagnetic field in classical electrodynamics
(Maxwell's four equations) are all given in vector form. Since vectors are higher order quantities
than scalars, the physical realities they correspond to are typically more complex than those
represented by scalars.
A Closer Look at Vectors
The action of a vector is equal to the sum of the actions of its components. Thus, in the example
given above, the vector from "here" to "your house" can be represented as
1
V 3 blocks east + 2 blocks north + 3 floors up
Each component of V contains a vector and a scalar part. The scalar and vector components of V
canbe represented as follows:
• Scalar: Let a =
3 blocks, b =
2 blocks, and c =
3 floors be the scalar components; and
unit vector pointing up. (N.B.: Unit vectors are non -denominate, have a magnitude of
unity, and are used only to specify a direction.)
Then the total vector, in terms of its scalar components and the unit vectors, can be written as
V =
ai + bj + ck.
This notation is standard in all books on physics and engineering. It is also used in books on
introductory mathematics.
Next, letus look at how vectors combine. First of all, we know that numbers
may be combined
in various ways to produce new numbers. For example, six is the sum of three and three or the
product of two and three. A similar logic holds for vectors. Vector rules of combination include
vector addition, scalar (dot or inner) multiplication, and (in three dimensions) cross
multiplication. Two vectors, U and V, can be added to produce a new vector W:
W=U+V.
1
Theappropriate symbol to use here is
'"
rather than
""
since the 'equation' is not a strict vector
identity. However, for the sake of clarity, the "=" notation has been suppressed both here and later on,
and "=" signs have been used throughout. There is no essential loss in rigor, and the meaning should be
clear to all readers.
NASA/TM-2002-2 11716
Vector addition is often pictorially represented by the so-called parallelogram rule. This rule is a
pencil and straightedge construction that is strictly applicable only for vectors in Euclidean
space, or for vectors in a curved space embedded in a Euclidean space ofhigher dimension,
where the parallelogram rule is applied in the higher dimensional Euclidean space. For example,
two tangent vectors on the surface of a sphere may be combined via the
parallelogram rule
provided that the vectors are represented in the Euclidean 3 -space which contains the sphere. In
formal tensor analysis, such devices as the parallelogram rule are generally not considered.
Two vectors, U and V can also be combined via an inner product to form a new scalar 1. Thus
UV=ii.
Example: The inner product offorce and velocity gives the scalar power being delivered into (or
being taken out of) a system:
f(nt) v(mls) =
p(W).
Example: The inner product of a vector with itself is the square of the magnitude (length) of the
vector:
U. U= u2.
Two vectors U and V in three-dimensional space can be combined via a cross product to form a
UxV=S
where S is perpendicular to the plane containing U and V and has a sense (direction) given by the
right-hand rule.
Example: Angular momentum is the cross product of linear momentum and distance:
L(kg m2/s).
Finally, a given vector V can be multiplied by a scalar number a to produce a new vector with a
different magnitude but the same direction. Let V Vu where u is a unit vector. Then
=
aV =
aVu =
(aV)u =
m(kg) a(mls2)
where the force and the acceleration share a common direction.
We have just seen be multiplied by scalars to produce new vectors with the same
that vectors can
sense or direction. In
general, we can specify a unit vector u, at any location we wish, to point in
any direction we please. In order to construct another vector from the unit vector, we multiply u
by a scalar, for example X, to obtain ?u, a new vector with magnitude ? and the sense or
direction of ti.
NASAITM-2002-2 I 1716
Notice that the effect of multiplying the unit vector by the scalar is to change the magnitude from
unity to something else, but to leave the direction unchanged. Suppose we wished to alter both
the magnitude and the direction of a iven vector.
su icient. Forming the cross product with anO er vector is also not sufficient, unless we wish to
limit the change in direction to right angles. We must find and use another kind of mathematical
'entity.'
Let's pause to introduce some terminology. We will rename the familiar quantities of the
previous paragraphs in the following way:
This terminology is suggestive. Why stop at rank 1? Why not go onto rank 2, rank 3, and so on.
Dyad: of rank 2.
Tensor (magnitude and two directions 32
- =
9 components)
Triad: Tensor of rank 3. (magnitude and three directions 33 - =
27components)
Etcetera...
We will now merely state that f weform the inner product of a vector and a tensor of rank 2,a
dyad, the result will be another vector with both a new magnitude and a new direction. (We will
consider triads and higher order objects later.)
A tensor of rank 2 is defined as a system that has a magnitude and two directions associated with
it. It has 9 components. For now, we will use an example from classical electrodynamics to
illustrate the point just made.
The magnetic flux density B in volt -sec/rn2 and the magnetization H in Amp/rn are related
through the permeability t in H/rn by the expression
B =
density and the magnetization in free space differ in magnitude but not in direction. In some
exotic materials, however, the component atoms or molecules have peculiar dipole properties
that make these terms differ in both magnitude and direction. In such materials, the scalar
permeability is then replaced by the tensor permeability j, and we write, in place of the above
equation,
B =
H.
The permeability I! is a tensor of rank 2. Remember that B and H are both vectors, but they now
differ from one another in both magnitude and direction.
The classical example of the use of tensors in physics has to do with stress in a material object.
Stress has the units offorce-per-unit-area, or nt/rn2. It seems clear, therefore, that (stress) x
(area) should equal (force); i.e., the stress -area product should be associated with the applied
forces that are producing the stress. We know that force is a vector. We also know that area can
be represented as a vector by associating it with a direction, i.e., the differential area dS is a
vector with magnitude dS and direction normal to the area element,
pointing outward from the
convex side.
NASAITM-2002-2 11716
Thus, the stress in the
equation (force) (stress) x (area) must be either a scalar or a tensor. If
=
it is, in fact, another tensor of rank 2 and that the force must be an inner product of stress and
-
area.
The force dF due to the stress T acting differential surface element dS is thus
on a
given by
dF=IdS.
The right-hand side can be
integrated any surface within the material under consideration,
over
as is actually done, for
example, in the analysis of bending moments in beams. The stress tensor
T was the first tensor to be described and used by scientists and engineers. The word tensor
-
-
detail. Dyad products were the mathematical precursors to actual tensors, and, although they are
somewhat more cumbersome to use, their relationship with the physical world is somewhat more
intuitive because they directly build from more traditional vector concepts understood by
physicists and engineers.
Ji constructing a dyad product from two vectors, we form the term -by -term
product of each of
their individual components and add. If U and V are the two vectors under consideration, their
dyad product is simply UV. The dyad product UV is neither a dot nor a cross product. It is a
distinct entity unto itself. If U =
u1i +
u2j + u3k and V =
v1i +
v2j + v3k, then
UV =
u1v1ii +
u1v2ij + u1v31k + u2v1ji +•
where i, j, and k are unit vectors in the usual sense and ii, ij, ik, etc. are unit dyads. In forming
the product UV above, we simply "did what came naturally" (a favorite phrase of another of my
professors!) from our knowledge of multiplying polynomials in elementary algebra. Notice that,
by setting u1v1 =
Ifl, UIV2
=
-t12, etc., this dyad can be rewritten as
13V =
siiii +
JI21J + 13ik + i2ji +...
and that the scalar components can be arranged in the familiar configuration of a 3x3 matrix:
NASA!TM-2002-2 11716
p32 P33
Using the known rules of matrix multiplication, we can, by extension, write the rules associated
with dyad multiplication.
The product of a matrix j and a scalar a is commutative. Let the scalar components of M be
thought of as the same array of numbers shown above). Then for any scalar a, we find
aM[ai] =[pa]=Ma.
Similarly, the product of a dyad UV and a scalar a is defined as
a(UV) =
(aU)V =
(Ua)V =
U(aV) =
U(Va) =
(UV)a.
In this case, the results of pre- and post -multiplication are equal.
The inner product of a matrix and a vector, however, is not commutative. Let V =
(Vi) be a
where the summation is over the second matrix index j. It is clear that 'U" ~ 'U.
Similarly, the inner product of the dyad UV with another vector S is defined to be
S (UV)
when we pre -multiply, and
(UV) S
when we post -multiply. As with matrices, pre- and post -multiplication do make a difference to
the resulting object. To maintain consistency with matrix -vector multiplication, the dot
"attaches" as follows:
S UV =
(S U)V =
where i =
S U. The result is a vector with magnitude and sense (direction) determined by V.
But
NASA!TM-2002-2 11716
UV• S =
U(V S) =
UX =
2U
Tensors of Rank> 2
Tensors of rank 2 result from dyad products of vectors. In an entirely analogous way, tensors of
rank 3 arise from triad products, UVW, and tensors of rank n arise from "n-ad" products of
vectors, UVW...AB. In three-dimensional space, the number of components in each of these
systems is 3. The rules governing these higher rank objects are defined in the same way as the
rules developed above.
We now extend the properties and rules of operation for familiar objects -
1. All scalars are tensors, although all tensors of rank O are scalars (see below).
not
2. All vectors are tensors, although all tensors of rank 1 are vectors (see below).
not
3. All dyads or matrices are not tensors, although all tensors of rank 2 are dyads or
matrices.
4. We have examined, in some detail, properties and operating rules for scalars, vectors,
dyads, and matrices.
5. We now extend these rules to tensors per se. We assert that:
6. Tensors can be multiplied by other tensors to form new tensors.
7. The product of a tensor and a scalar (tensor of rank 0) is commutative.
8. The pre -multiplication of a given tensor by another tensor produces a different result
from post -multiplication; i.e., tensor multiplication in general is not commutative.
9. The rank of a new tensor formed by the product of two other tensors is the sum of their
individual ranks.
10. The inner product of a tensor and a vector or of two tensors is not commutative.
11. The rank of a new tensor formed by the inner product of two other tensors is the sum of
their individual ranks minus 2.
12. A tensor of rank n in three-dimensional space has 3 components.
style of a mathematician writing a proof than a physicist or engineer solving a problem. While
the approach is formal, the conclusions are physically as well as mathematically valid.
Let's begin with the magnetic field. We use the tensor form
B =
H
and represent the tensor permeability by a dyad j,, UV without concern for the specUIc natures
=
NASA!TM-2002-21 1716
physical natures of U and V, we understand that a second rank tensor and a dyad are equivalent
provided the vectors U and V are appropriately chosen. We make this assumption without loss of
generality. From the physicist/engineer perspective, it is only significant that
• the dyadUV represents a physical quantity i.e., permeability j; and
-
• the rules developed in the previous section can be applied to the dyad representation in a
B= UV H= U(V H) =
liA. =
XU
where A. =
V H is a scalar and U a vector. It is clear that the direction of B depends only on the
direction of U, not H. Since we specified nothing about the nature of U, U cannot be restricted in
its magnitude or direction by H in any way. Therefore, we conclude that the direction of B must
be independent of the direction of H.
In tensor (or, in this case, matrix) notation, we might represent the scalar components of the
B, =
J.tst H
with summation occurring over the repeated index, t. This last representation has become the
standard in the literature.
for the specific nature of U and V), then the inner product I dS, can be represented as
IdS=(UV)dS=U(VdS)=UdÇ
where dÇ is the scalar differential resulting from the inner product V dS. The term U d is a
vector (tensor of rank 1) and is, in fact, the differential force dF acting on the area element
U d =
dF.
It is inevitable in an article of this type that we must do some mathematics. The previous section
used a more (less intuitive) approach
formal to demonstrate one role that tensors
play in physics
and engineering. In this
section, we will stay with the formal approach and define yet another,
perhaps somewhat peculiar, tensor operation, which will be left without much physical (intuitive)
consideration. The student, who so wishes, can skip over this section without loss.
We begin by summarizing the relationship between the type of vector product being used and the
rank of the resulting object. These results are already implicit in the material given above.
• A vector-scalar product results in a vector: there is no change in rank.
• A vector-vector dyad product results in a dyad: there is an increase in rank from rank 1
(vector) to rank 2.
• A vector -vector inner product usually results in a scalar: there is a decrease in rank from
rank 1 (vector) to rank O (scalar).
• And so on...
NASAITM-2002-21 1716
Except for the inner product, the rank of the resulting quantity is the sum of the ranks of the
quantities being combined. So, if we form a triad UVW, its components comprise a tensor of
rank 3. If we form a "tetrad," its components comprise a tensor of rank 4. And so on...
JVW...ABCJ
n vectors
might choose to introduce a dot between U and V, or V and W, etc. This process is called
contraction, and results in a new tensor with rank (n 2). -
And there is a reduction in rank by two. A special case of interest is the dyad UU.
Contraction of the dyad gives the squared magnitude of the vector U: U• U U2. =
• If we introduce a dot into an existing triad, the triad is contracted to a vector. Given the
triad UVW, we can introduce a dot in one of two ways, forming either U VW or UV
W, either of which is a vector, since
ThVW=(UV)W=cxWwhere(a= U.V)
or
2). If we were to
form the force -velocity dyad FV, as might be done in formulating the general equations of fluid
dynamics, we could always find the rate of energy dissipated in the fluid (the power) by
contracting the dyad to a scalar. Thus
dE/dt =
F .
V.
Essentially, given a tensor equation of rank n, it is possible to extract information from the
equation in a variety of ways. The ability of tensor equations both to store information and to
permit its simple manipulation should be coming clear by now!
Tensors are
typically defined by their coordinate transformation properties. The transformation
properties of tensors can be understood by realizing that the physical quantities they represent
must appear in certain ways to different observers with different points of view.
Suppose, for example, that I measure the temperature (°C) at a given point P at a given time. You
also the temperature (°C) at P at the same time but from a different location that is in
measure
motion relative to my location. Would it make any sense if you and I acquired different
NASA!TM-2002-2 I 1716
magnitudes; i.e., if my thermometer measured 25°C and yours measured 125°C? No. We must
both obtain the same quantity from our respective measurements.
Put another way, suppose that I call my point of view (coordinate system or reference frame) K
and yours K*. Let T be the temperature (°C) measured at P in K and T* be the temperature (°C)
measured in K*. We then require
T=T*.
This expression is an example of a coordinate transformation law between K and K* for the
scalar temperature T. Only scalars that transform like this are to be admitted into the class of
tensors of rank O. In fact, letting T stand for any scalar quantity we wish, the equation T T* can =
Now let T be the frequency of light emanating from a monochromatic source at P. Again, let two
observers, K*, measure the frequency of the light at P at the same time using the same
K and
units of inverse seconds. If I am stationary relative to the source, the light will have a certain
frequency, for example T oo. If, on the other hand, you are moving toward or away from the
=
source when you take your measurement, the light will be red or blue shifted with
frequency T*
uo ± Au. Obviously T ~ T* in this case, and although the frequency thus observed is a scalar, it
=
A similar argument holds for vectors. As was the case with scalars, not all vectors are tensors of
rank 1. Suppose that a vector quantity V exists at point Again,
a P. assume two reference frames,
K and K*. Let V be the vector observed (measured) in K, and V' be the same vector observed in
K* at the same time. As with the temperature example, we again require that
V=v*
since, after all, K and K* are both observing the self-same vector. Any other result would not
make physical sense. Any vector that transforms according to the expression V V is defined =
to be a tensor of rank 1. We usually say that the transformation law T T*, or V V, requires = =
While the vector itself is coordinate independent, its individual components are not. Thus, in the
vector transformation law V =
V*, the components of the vector vary from system to system, but
do so in such a way that the vector quantity itself remains unchanged. This truth is evident when
we realize that the components in any coordinate system arenothing more than the projections of
the vector onto the local coordinate axes.
Many representations exist for vectors in Euclidean 3 -space, the space of our school algebra and
geometry, including the familiar V a i + f3 j + y k in which V is the vector being represented; a,
=
Í, and y are its scalar components along the x, y, and z axes of a Cartesian reference system,
respectively; and i, j, and k are unit vectors along those same axes. Another representation of V
is as a triad of numbers, V (a, 13, ). =
were a vector in E or
R (Euclidean or Riemannian n -space) it would be written as
V=(vi,v2,... ,v)
orsimply,V=(v1),i=1, ...,n.
NASAJTM-2002-211716 lo
Now, let V be the position vector extending from the origin of K to a particular point P, and V
be the position vector extending from the origin of K* to that same point. Assume that the
origins of K and K* do not coincide; then V ~ V*. The position vector is very definitely
coordinate dependent and is not a tensor because it does not satisfy the condition of coordinate
independence.2
But suppose that V1 and V2 were position vectors of points P1 and P2 in K, and that V1 and V2
were position vectors to the same points P1 and P2 in K*. The vector extending from P1 to P2
must be the same vector in both systems. This vector is V2 -
V1 in K and Vf -
V2 -
Vi =
V2 -
V1,
i.e., while the position vector itself is not a tensor, the difference between any two position
vectors is a tensor of rank 1! Similarly, for any position vectors V and V*, dV =
dV*; i.e., the
differential of the position vector is a tensor of rank 1.
This result may seem a little strange, but it provides strong motivation for exercising care in
working with physical vector quantities.
A
Digression:
Coordinate Systems and Mathematical Spaces
Now, for one brief chapter,
we are going to sidestep the main theme of this article to consider a
subject that is extremely important but all too often ignored. Students who study such disciplines
as General Relativity should especially appreciate the ideas introduced here.
So far, except for
a few brief allusions, we have
tacitly assumed that we were operating in the
same Euclidean space as we encountered in our high school and college mathematics and physics
without so much as a second thought as to what we were doing or why. In fact, the choice of a
mathematical space whether Euclidean or non-Euclidean is every bit as important as the
-
-
the distinction between coordinate systems per se and space. Since we are considering
physical/tensorial quantities that exist in space and are coordinate independent, it behooves us to
take a closer look at this distinction.
A line is example of a Euclidean 1 -space. It has one dimension, extends to ± co, and has a
an
metric (e.g.: the unit interval). The coordinate system associated with the line is defined by the
unit interval, chosen for convenience then copied repeatedly, end -to -end along the entire line in
both directions from the starting point. A line thus marked, with numbers added for reference, is
called a real number line.
2
This
argument depends on the definition of the position vector as the vector extending from the origin of
a given coordinate system to a point that it is said to locate. Thus, for any point P in space, the position
vectors in two systems K and K* whose origins do not coincide will, by definition, be different. If V is the
position vector in K, then it is also a vector in K* but not a position vector, and the coordinate
transformations apply to it in the usual way. Since, however, V is a position vector only in one system, not
in both, it cannot represent the same thing in both; hence, it is fundamentally different than other vector
quantities whose character is the same in all reference frames.
NASA/TM-2002-211716 11
Aplane is an example of a Euclidean 2 -space. It has two dimensions, extends to infinity in all
directions, and has a linear metric (the unit interval) and an areal metric (the unit square). It also
has an intrinsic geometry defined by the Greek, Euclid (c. 300 BC).
In the geometry of Euclid, objects, such as triangles, squares, or circles, can be moved about in
the plane without deformation and, therefore, compared to one another using such relationships
as similarity or congruence. Also in the geometry of Euclid, parallel lines extend forever without
meeting, and so on. In the plane, the coordinate system of choice is the Cartesian system,
comprising two real number lines that meet at right angles. Other systems are also possible.
The physical analogue of such a space is a region in which material objects and/or beams of light
can be moved about without deformation. But since gravity permeates all space and time, no
such region exists in the universe at large. Thus it was that Einstein abandoned Euclidean space
as a basis for his General Relativity and
adopted a differentially metric non-Euclidean space
instead.
A sphere is an of an elliptic 2 -space. Like the plane, the sphere also has two
example
dimensions. Unlike the plane, however, the sphere does not extend to infinity; it fact, the sphere
is a closed, finite surface. The sphere has a differential linear metric and a differential areal
metric. It also has a geometry, though one quite different from that of Euclid.
would have to be curved to fit into the surface. Such metric, of course, could be defined; but
a
many theorists prefer to use differential quantities that, in the limit of 'smallness,' behave as
though they were Euclidean. One reason is that a simple algebraic metric can be written for
differential quantities.
In the plane, algebraic metric is Pythagoras' theorem: s2 x2 + y2, describing the relationship
the =
between the length of the hypotenuse, s, and the two sides, x andy, of a right triangle. Since the
plane is flat, differential quantities are not a concern.
In a sphere, the
corresponding relationship would have additional terms: +
j3y2 + yxy.
=
Such a metric is certainly approachable, but in the limit of smallness, Pythagoras theorem
reappears: ds2 dx2 + dy2, where ds, dx, and dy are differential lengths. This situation is much
=
Navigators use just this type of geometry when traveling across the face of our earth. For them,
metrics on the order of a few km are small enough to be considered 'flat,' given that the earth has
a radius of 6,400 km.
In the geometry of the sphere, the elliptic geometry, objects again can be moved without
deformation (since the surface is of uniform curvature), and, therefore, compared in the same
sorts of relationships as in the plane. There are no parallels in the sphere, however, because there
are no Euclidean straight lines, and all pairs of curves that
approximate lines (the so-called great
circles whose radii equal that of the sphere itself) always meet at two antipodal points. Neither is
there a Cartesian coordinate system in the sphere. Coordinate systems in the sphere can be
constructed using great circles, but these systems have no unique origin.
NASAITM-2002-211716 12
An egg is another example of an elliptic 2 -space. It has two dimensions, is closed and finite. It
has a differential metric like the sphere. Unlike the sphere, the egg cannot
support relationships
like similarity or congruence since objects cannot be moved without deformation
(except for
some special cases; the is
egg differentially curved in one direction but not the other). Local
coordinate systems are possible in the egg, at least over regions small
enough that variations in
curvature can be ignored. But a global system, like that of the
sphere, is not entirely tractable.
There are
higher dimensional analogs of the plane, the sphere, and the egg, and of any number of
other shapes that might happen to come to mind (including the saddle of hyperbolic geometry).
Each comprises a mathematical space in terms of being a point set with certain specially defined
characteristics. In each space, different kinds of coordinate systems are possible. In the plane, we
spoke of the Cartesian system; but there is also the polar system, the triangular system, and so
on. All of these systems can be used to
map the same plane; yet, all are different.
Physical quantities existing in the plane must be independent of the particular coordinate system
chosen. These quantities are not necessarily independent of the space that contains them,
however. The same idea applies to all other spaces and coordinate systems as well.
Any triangle in the plane has the property that the sum of its interior angles adds to 1800. Not so,
the sphere. Any triangle in the sphere has the property that the sum of its interior angles is
greater than 180°, the more -so the larger the triangle. Consider a triangle on the earth comprising
one -quarter of the equator with two more
legs extending toward and meeting at one of the poles.
This triangle has three right angles for interior angles, giving a grand total of 270°!
Tensor analysis takes account of coordinate independence and of the peculiarities of different
kinds of spaces in one grand sweep. Its formalisms are structurally the same regardless of the
space involved, the number of dimensions, and so on. For this reason, tensors are very effective
tools in the hands of theorists working in advanced studies. For this same reason, tensors are also
very effective tools for setting up systems of equations in "everyday" physics or engineering
applications. The systems themselves may not be easy to solve, but they are usually obtained
with expedience.
Coordinate Curves and Coordinate Surfaces
Let's now return to Euclidean space and consider the idea of coordinate systems a little more
closely. What we learn here can be immediately extended to other types of spaces and/or to
higher numbers of dimensions.
We begin with a 2 -dimensional Cartesian system in a Euclidean space. The system consists of an
x- and a y-axis that are
orthogonal. These two axes determine a unique point of intersection. This
point is
designated the origin of the system and is given the special label x 0, y 0. Whole = =
numbers are then placed along each of the axes by establishing a unit interval and using it
repeatedly to mark off additional intervals.
axes, in the following manner: Through P, two lines are constructed parallel to the individual
axes. (These lines are often referred to as a local coordinate
system or local axes at P.) The
numerical values, x x0 and
=
y y, on
=
the Cartesian axes where these lines intersect designate
the coordinates of the point. The shorthand notation is
NASA/TM-2002-2 11716 13
P =
(xo, yo).
In a 3 -dimensional Cartesian system, there are three orthogonal axes (x, y, and z) and three
coordinate planes (xy, xz, and yz). Any point P is uniquely specified by the number triple
P=(x,y,z).
In an n -dimensional Cartesian system, by extension, there are n orthogonal axes and (n -i)!
coordinate planes. Any point P is uniquely specified by a number n -tuple
P=(xi,x2,x3,...,x)
where the change to subscripted notation in necessitated for purposes of generality.
Suppose we were to relax these conditions. We would obtain statements to the effect that:
The coordinate axes are general curves defined to intersect at least once. A point of
intersection can be chosen origin.
as the
The coordinate axes are not necessarily mutually orthogonal.
Pairs of coordinate axes uniquely determine curvilinear coordinate surfaces as product
spaces.
Next, consider a straight line and a circle. Let the circle touch the line so that its radius is
perpendicular to the line and the line
perpendicular is plane to the of the circle. Now slide the
circle along the full extent of the line. The result is that a cylinder will be swept out. The cylinder
is the product space of the circle and the line in the configuration specified. If coordinates are
marked on the line and the circle, then a unique pair of numbers will specify every point in the
cylinder.
Now, consider two circles of equal radius. Let the circles be perpendicular to one another so that
one circle touches the other at each of the opposite ends of a diameter. Again, sweep one of the
circles around the other to
produce a sphere. The sphere
is the product space of the two circles in
theconfiguration specified. If coordinates are marked on each of the circles, then a pair of
numbers will uniquely specify every point in the sphere.
Similarly, a torus is the product space of two circles (not necessarily of equal radius) in a
NASAJTM-2002-2 11716 14
Finally, start with any two curves. Let the curves intersect at one point. Mark the curves with
coordinates, analogously to the coordinates on a Cartesian axis. Slide one curve along the other
to produce a surface. Then a pair of numbers from the curves will
specify any point on the
surface in perfect analogy with the Cartesian plane. If the curves are called u and v, we then
say
that we have a u -axis and a v -axis. These axes together produce a uv -surface (a coordinate
surface) as a product space. Any point P in this surface is specified by a pair of numbers, u uo
and v
=
P =
(us, vo).
Such a system is called
generalized (or curvilinear) coordinate system. We need not limit this
a
discussion to Euclidean spaces, for the technique described can be carried almost directly into
non-Euclidean spaces as well. We will stay concerned with Euclidean spaces for the remainder
of this article.
Often in physics or engineering, such systems are necessary to solve problems. For example, a
NASA engineer whom I know was solving fluid dynamic equations for airflow over aircraft
compressor blades. He chose to let the blade surfaces themselves represent coordinate surfaces
and specified coordinate axes to fit. He wrote tensor equations in this somewhat complicated
system and produced beautiful theoretical flow patterns, some of which are still hanging framed
in our Administration Building and in other places around the laboratory!
a, b, c ...
, .....
Additionally:
• We canspecify local coordinate wces at any point P in the system just as we can specify
local Cartesian axes at any point in a Cartesian system.
•
Similarly, we can specify local coordinate surfaces at any point P in the system.
• We can use the local coordinate curves and the local coordinate surfaces to specify
unique sets of unit vectors at P.
• We can write any vector quantity V at P as a linear combination of these local unit
vectors.
Now, imagination is required. Let's return to the 3 -dimensional Cartesian system. At any
some
point P, specify three local axes and three local planes determined by these axes. In
we can
accordance with strict definitions, the axes must be mutually perpendicular and, by extension, so
must the planes. Now, choose three unit vectors at P such that each vector is tangent to one of
the axes. Such a triple is usually designated (I, j, k). Any vector V at P can then be written
V =
cd +
J3j +
yk
NASAITM-2002-211716 15
where a, 3, and 7 are the usual x, y, and z scalar components of the vector.
Now suppose that we had chosen unit vectors perpendicular to each of the planes rather than
tangent to each of the coordinate axes. Let's do so and call the resulting triple (j*, k*). Again,
any vector V at P can be written
V =
a*i* +
I3*i* +
where a*, Í3*, and are the scalar components of the vector referred to the i, j, k* triple.
There isnothing surprising in what we have just done, and our representation is satisfactory
provided we ensure that
ai + +
yk =
a*i* + +
But, you might argue that what we have done is trivial since it is apparent from geometry that the
two unit vector triples comprise the same set; i.e., that
i =
i*
i=i*
k=k*.
Still, we used two distinct approaches to defining a unit vector triple at P. Should we expect
these approaches to produce so tidy a result in all cases? The answer is very definitely "NO "!
To understand why the answer is "NO," let's modj5i our Cartesian system so that the axes are no
longer mutually orthogonal for example, so that they meet at 600. In this case, the origin lies at
-
a vertex of a tetrahedron, and the axes lie along three of the edges. (Such coordinate
systems are
actually used in engineering and crystallography and are called triangular coordinate systems.) It
should be intuitive that (i, i, k) and (j*, k*) are now two different sets of unit vectors.
Specifically, i and i" now meet at an angle of 60°, as do j andj*, and k and k*. Thus, while they
are all unit vectors, they specify different sets of directions, and the choice of which set to use in
In tensor analysis, the same logic must be applied in generalized coordinate systems. At any
point P in a generalized system, with an associated set of local axes and coordinate surfaces, we
can specify two related but distinct sets of unit vectors: (1.) a set
tangent to the local axes, and
(2.) another set perpendicular to the local coordinate surfaces. The first set is given the name
contravariant; the second set is given the name covariant. The vector V can be referred to either
set, and is called contravariant when referred to the contravariant unit vectors or covariant when
referred to the covariant unit vectors. As before, the choice of which to use is strictly a matter of
expediency. The vector V is obviously not affected by the choice.
Reciprocal Sets of Vectors
Let's return to the 3 -dimensional Cartesian system of our previous discussion. The unit vectors
(i, J k) are a contravariant set. The unit vectors (1*, k*) are a covariant set. The vector V has
the contravariant representation
V =
ai +
f3j +
'yk.
It also has the covariant representation
V =
a*i* + +
NASAITM-2002-211716 16
cLi+ I3i +yk=a*j* +f*j*+y*k*.
Let's further explore the last relationship. First of all, know that in this
we special case,
i =
i*
j=i*
k =
k*
so that
i .
i* = =
1 j.i*j*.j0 k i =
i k =
O
i k =
k* .
i =
O j k* =
k* j =
O k k* =
k* k =
Making a change in notation will help us to summarize this relationship very succinctly. Let
UI
=
U2
=
k =
u3
=
111*
=
fl;*
Then
Uj* =
uf ii. =
l when i=jj or [O when i~jl.
The vector sets u and U.* are called, by definition, reciprocal vector sets. If we set ö =
[1 when
i =jj or LO when i ~j], then we can write
UiUj*Uj*Uijj.
6 is a component of a second rank tensor called Kronecker's delta after the mathematician
Leopold Kronecker (1923-91) who first inaugurated its use. All vector sets satisfying this
relationship are called reciprocal. The covariant and contravariant unit vector sets in all systems
will always be (or, more generally, can always be chosen to be) reciprocal vector sets.
U uf = •
Ui
=
ou
still hold with the provision that the magnitudes of the given pairs whose inner product is unity
are reczprocal quantities. Let's consider a generalized 3 -dimensional coordinate system, u -v -w,
in Euclidean 3 -space. We can refer the u-, v-, and w- Cartesian
a axes to a x -y -z system in the
same space by transformation equations of the form
u=u(x,y,z) x=x(u,v,w)
v v(x, y, z)
=
y
=
y(u, v, w)
w=w(x,y,z) z=z(u,v,w)
NASAITM-2002-211716 17
as is done in basic calculus and analytic geometry. We require that the functions u, v, and w be
linearly independent and that x, y, and z also be linearly independent. Thus, no one coordinate
axis in either system can be written as a linear combination of the other two, and the system is
truly 3 -dimensional.
We can then choose point P in the system, and specify coordinate curves and surfaces in both
a
coordinate systems. In the generalized coordinate system, we can specify a contravariant basis
set as
e1 =
ar/au, e2 =
Jr/av, and e3 =
ar/aw
where, by convention, the contravariant vectors are superscripted rather than subscripted, and the
vector r is simply the position vector
r=xi+yj +zk
in the Cartesian system. Please note: The parentheses around the superscripts indicate "which"
base vector is being referred to; they do not denote tensor notation. We can also specify a
covariant basis set as
where, again by convention, the covariant vectors are shown subscripted3. Both sets are basis
sets; neither set necessarily comprises unit vectors; and the two sets are reciprocal. To see the
reciprocity, we must form the individual inner products:
e" e(1)
=
(ar/au) (Vu) =
(axlau)(au/ax) + (ay/au)(au/ay) + (az/au)(au/az)
=
au/au
e1 e(2)
=
(ar/au). (Vv) =
(ax/au)(av/ax) + (ay/au)(av/ay) +
(az/au)(av/az)
=
av/au
=0
and so on. The partial derivatives and chain rule used above should be familiar from basic
calculus.
We can write the vector V in its contravariant and its covariant forms as follows:
V =
v'e" + v2e2 + v3e3 =
vle(l) +
v2e(2) +
v3e(3).
If we now wish to find the magnitude of V, we can form the inner product V V. If, further, we
use both the contravariant and the covariant representations of V and take advantage of the
reciprocity between the two different sets of base vectors, we obtain a particularly nice result:
V V =
(v1e" + v2e2 +
v3e3) (vle(l) + v2e(2) + v3e(3))
=
v1v1 + v2v2 + v v3
=v2
The covariant representation retains its use of subscripts while the contravariant representation switches
from subscripts to superscripts. This change in notation helps particularly in the case of mixed tensors
(resulting from certain types of dyads and higher order products) where some of the vectors comprising
the product UVW.. .ABC are covariant while the rest are contravariant.
NASAITM-2002-211716 18
The summation involves both contravariant and covariant indices. The shorthand for this
process
is
V V =
L vv (also =
j vv') V2. =
From here on, we will always take advantage of the reciprocity between the contravariant and the
covariant base vector sets when constructing sums of the type given above. Summations will
always be done over a contravariant-covariant pair of indices.
The Cartesian Fundamental Tensor
Let's again return to the 3 -dimensional Cartesian system of our previous discussion. Please recall
that the unit vectors(i, k) contravariant set, and that the unit vectors (j*,
are a
k*) are a
covariant set. Recall also that the vector V has the contravariant representation
V =
ai + +
yk
and the covariant representation
V -
ai + +
y*k*.
This time, we will
again these results to solve for the covariant components of V in
use terms of
its contravariant components (or vice -verse).
a(i j*) +
(j j*) +
y(k j*) =
a*(i* j*) +
j3'(j' j*) +
7*(k* j*)
.
We already know how to deal with the left-hand side of this equation. The right-hand side is
taken care of when we recognize that
i i =
1 i* =
1* j* =
O k* i* =
i k* =
O
j* =
k* =
U3*.
Then
ui*. ui* =
ui* ui
=
gij
=
where gij is a component of a second rank tensor called the fundamental tensor, which, in this
case,just happens to be equal to ö (and can be called the Cartesian fundamental tensor). In the
general case, the last equality does not hold.
NASAITM-2002-21 1716 19
Using these relationships in the example above, we find that
13=*.
In this case, the same type of equality holds for the other vector components as well.
Let's see what happens when we use the base vectors and e(1) defined above. Again, we have
V =
v1e" + v2e2 + v3e3 =
vle(l) +
v2e(2) +
v3e(3).
When we wanted to find the magnitude of V, we formed the inner product V V using both the
contravariant and the covariant representations of V. Suppose that we had just chosen one or the
other. Suppose we had chosen the contravariant representation. Then
=
V (v1e1 + v2e2 + v3e3) (v'e1 + v2e2 + v3e3)
V =
22 23
g2lvv +g22(v) +g23vv +
gi v3v' + v3v2 + g33 (v3)2.
The shorthand for this process is
V'V= gv'v=V2.
While this relationship is perfectly correct, it lacks the simplicity of the previous relationship in
that it involves both a double summation and extra terms (the gij). A similar argument can be
made for using just the covariant indices.
These arguments can be extended directly to the case of a vector in a non -orthogonal, non-linear
n -dimensional
coordinate system. For the moment, let's just stick to a 3 -dimensional system. The
extension to n -dimensions should be intuitively clear.
Using local coordinate axes and surfaces at the point P. let a vector V at P again be represented
in two different ways:
V =
a1e" + a2e2 + a3e3 =
ale(l) +
a2e(2) + a3e3.
Notice that the same change in notation as before has again been introduced. Since the covariant
and contravariant basis sets are reciprocal sets, we must have
e(j)
=
eu) e =
o11.
Notice that Kronecker's delta is now a component of a mixed tensor of rank 2 in the general
case; i.e., the index (superscript) j is index, while the index (subscript) i is a
a contravariant
covariant index. (Notice also that the
superscript 'i' in the inner
products becomes a covariant
index in the delta, and that the subscript 'j' in the inner products becomes a contravariant index
in the delta. Again, recall that the letters in parentheses are not tensorial indices.)
Next let's write out the fundamental tensor in its covariant and contravariant forms:
e0 eW =
gij (covariant) and e(I) eu)
=
(contravariant).
NASA!TM-2002-2 11716 20
(Notice again that the covariant and contravariant fundamental tensors arise from the
superscripted and subscripted sets of unit vectors, respectively. Now the going becomes more
direct. These last two are the only cases in which this peculiar switch takes
place.)
Now, let's solve for the contravariant components of V in terms of the covariant components.
We will introduce yet another shorthand notation, the so-called Einstein summation convection.
Notice that V can be written as
v =
c e =
j Lj eu).
Einstein noticed that summation always occurs over a repeated index so that it is not strictly
necessary to write out the summation operator ('E1'or 'j') each and every time. Using this
convention, we have the compact notation
V =
cx e =
aj eu)
where summation is understood to i in the middle and j the
now occur over on right-hand side.
Using this convention and applying the results of the previous paragraphs, we first form the inner
product
V' e(S) =
a1 (e(». e(5)) =
a (eu) e(5))
where a new index 's' is introduced because no summation is intended. Next, we notice that the
middle term reduces to
S
a (e e(S)) =
a =
as
as the reader can show by writing out the expression in full, and that the right-hand terms reduce
to
c
(eu)' e(5)) =
giS.
Thus, we conclude that
as =
a g15
where the summation on theright-hand side is over the index 'j'. Thus, the contravariant
components of V are linear combinations of the covariant components. Expanded, this same
expression becomes
a' =
cx, gflg3' + a2 g2' + a3
a2=a, g12+a2g22+a3g32
a3=a, g'3+a2g23+a3g33
Notice that the free index 's' takes on sequential values (1, 2, 3) while the repeated index 'j'
represents summation for each new value of 's.'
Similarly, the covariant components can be expressed as linear combinations of the contravariant
components
NASAITM-2002-21 1716 21
as =
Uj giS
and
a= a' g,-.
One final note on the fundamental tensor is that it is symmetric: i.e., gjj =
gtS To
understand why this is so, let's return to the definitions
e(» eW =
g, and C(i) eU)
=
gU.
We know from our basic vector analysis that the inner product is symmetric, i.e., that
eW e(=
e» and e() e(1)
=
e(j) e(s).
From this, the symmetry of the fundamental tensor follows directly.
We now have the mathematical tools that enable us to achieve our stated objective of
approaching Line 1, Page 1 of any standard text on tensor analysis. These texts typically begin
by stating that tensors obey particular transformation laws whose forms are then specified with
little
or no motivating arguments. We have already looked at tensors as representing physical
quantities and agreed that such quantities must appear the same to all observers. If we now
formally replace the word 'observer' with the word 'coordinate system' we can paraphrase our
previous statement to read that tensors are quantities that must be invariant under a coordinate
transformation; i.e., they must retain a certain character no matter how we look at them. Their
components might vary from system to system, but their overall structure must remain the same.
Let's now consider the simplest case, the scalar. Let S be any scalar quantity observed in a
5 =
Any quantity satisfying this transformation law is called a tensor of rank O. This concludes our
study of scalars.
Let's next consider a vector. Let V be a vector quantity observed in a system K and V be the
same vector quantity observed in K*. Then, for V to represent a real physical quantity, we must
require that
V=V*.
Any quantity satisfying this transformation law is called a tensor of rank 1. In order to derive
transformation laws, we can represent V in each system in either its covariant or contravariant
form. Let's use the base vector system introduced earlier and consider the covariant case in some
detail.
First, let's review: Recall that we had a generalized 3 -dimensional coordinate system u -v -w in a
Euclidean 3 -space. Let this system now become the system K above.
We referred the u-, v-, and w- axes to a Cartesian x -y -z system in the same space by
transformation equations of the form
NASA!TM-2002-211716 22
u=u(x,y,z) x=x(u,v,w)
v v(x, y, z)
=
y
=
y(u, v, w)
w w(x, y, z)
=
z =
z(u, v, w)
We now change this notation as follows:
u=u1 x=x'
v=u2 y=x2
w=u3 z=x3
so that the coordinate transformation equations reduce to
u'=u(x) xsxs(ut)
with i,j, s, t,
=
1,2,3. We then chose a point Pin the system, and specified coordinate curves
and surfaces at P. In the generalized coordinate system, we specified a contravariant basis set as
e" =
ar/eu, e2 =
ar/v, and e3 =
e(l)
=
Vu, e(2) =
Vv, and e(3) =
Vw.
Now, let's move on: We can make identical specifications for K*, first with coordinate
transformations
u* =
u* (x) xS =
xS (ut*)
and then with base vectors
e* =
ar/aui* and eU)* =
Vu*.
V =
V e =
v e* =
V*
ViVu=Vj*VuJ*
But4 Vu =
Fui*IxS1 =
(J*/uk) [duk/xsJ (J*/uk) Vuk. Substituting, we acquire
=
V Vu =
V (Ji*/auk) Vuk
or, changing summation indices on the right-hand side
V1 Vu =
(ii*/u) Vu.
Finally, we write
(V -
Vj* (uJ*/Ju')) Vu =
O
NOTE: We are using the shorthand notation [u/xs] [u'Thx1, au'Vx2, u'Thx3] to represent the
gradient of u.
NASAITM-2002-21 1716 23
vi Vj'' (uJ*/u')
=
with summation over the index j. This expression is the general transformation law for a
covariant tensor of rank 1. A similar calculation can be completed for the transformation of the
contravariant components. The result is
V =
Vi* (ui/&t*)
The two equations
V =
subject.
Many important theories physics are developed using tensors because of their succinctness
in
and their relative "ease" in utility. The resulting algebraic and/or differential equations can be
extremely difficult to solve, but the procedure for arriving at them is usually very direct. Among
the most famous of the theoretical developments using tensors are the theory of
electromagnetism and the special and general theories of relativity.
Acknowledgments
1. Thanks to Marlos Jacob de Melo, a civil engineer, who lives in Recife, Brazil. This article was
written at his request. He appreciates physics and uses his computer to visit the NASA Glenn
LTP website (www.grc.nasa.gov/WWW/K- 12/Numbers/Math/Mathematical_Thinking!). Manos
and I corresponded extensively on this article.
2. Thanks to Dr. Ken DeWitt, my tensor analysis professor, who read the text and made valuable
corrections. Dr. DeWitt is the one who pointed out the rhythm in mathematics. If a shared insight
can change a life, his certainly did!
3. Thanks especially to Ruth Petersen, my mentor in distance learning who edits all of my Learning
Technologies Project (LTP) work, and who had both the courage and the stamina to slog through
this article not once but several times as it developed.
4. Thanks to Dr. Tom Morton who reviewed an early draft and made several helpful suggestions.
Tom kept me from getting too lazy in the really sticky parts!
5. Thanks to Dr. Harold Kautz who first described the permeability tensor to me as an example of a
quantity capable of changing both the magnitude and direction of a vector during one of our
famous walks in the park many years ago.
6. Thanks to Dr. Norman Grier who introduced me to Shokolnikoff's text on tensor analysis. I still
consider it one of the finest.
7. Thanks to Dr. Aaron Snyder whose beautiful computer renderings of air-flow over aircraft turbine
blades graphically demonstrates the power of advanced mathematics.
NASAITM-2002-21 1716 24
REPORT DOCUMENTATION PAGE
Public reporting burden tar this collection at information lo estimated to
average 1 hour per response, including the time tar reviewing lootructiono, searching existing data astecas.
gathenng and maintaining the data needed, and completing arid reviewing the collection at information. Send corsments regarding this burden estimate or any Other aspect of this
collection si intormalion, including suggestions for reducing this burden, to
Washington Headquarters Services, Directorate for Information Operations and Repass, 1215 Jefferson
Davis Highway, Suite 1204, Arlington, VA 222024302, and to the Otfice ot
Management and Budget. Paperwork Reduction Project (07040158), Washington, DC 20503.
1. AGENCY USE ONLY (Leave blank) 2. REPORT DATE 3. REPORT TYPE AND DATES COVERED
WU -332 1-00-00
6. AUTHOR(S)
Joseph C. Kolecki
Unclassi(jed -Unlimited
Subject Categories: 59 and 67 Distribution: Nonstandard
30
Tensor analysis: Introduction physiscs; Engineering 16. PRICE CODE
17. SECURITY CLASSIFICATION 18. SECURITY CLASSIFICATION 19. SECURITY CLASSIFICATION 20. LIMITATION OF ABSTRACT
OF REPORT OF THIS PAGE OF ABSTRACT