Vineet PDF
Vineet PDF
A Thesis in
Aerospace Engineering
by
Vineet Ahuja
Doctor of Philosophy
December 1995
We approve the thesis of Vineet Ahuja.
Date of Signature
Lyle N. Long
Associate Professor of Aerospace Engineering
Thesis Adviser
Chair of Committee
Philip J. Morris
Boeing Professor of Aerospace Engineering
Raymond J. Luebbers
Professor of Electrical Engineering
Charles L. Merkle
Professor of Mechanical Enginnering
Dennis K. McLaughlin
Professor of Aerospace Engineering
Head of the Department of Aerospace Engineering
ABSTRACT
iii
realistic geometries like a metallic trapezoidal wing, and an aircraft engine inlet.
iv
TABLE OF CONTENTS
ACKNOWLEDGMENTS : : : : : : : : : : : : : : : : : : : : : : : : : : : xiii
VITA : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : xiv
1 INTRODUCTION : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 1
1.1 Scattering Problems in Electromagnetics : : : : : : : : : : : : : : : : 4
1.2 Advances in Computational Electromagnetics : : : : : : : : : : : : : 4
1.2.1 Structured Grids : : : : : : : : : : : : : : : : : : : : : : : : : 8
1.2.2 Unstructured : : : : : : : : : : : : : : : : : : : : : : : : : : : 9
1.2.3 Hybrid Methods : : : : : : : : : : : : : : : : : : : : : : : : : : 10
1.3 Parallel Computing : : : : : : : : : : : : : : : : : : : : : : : : : : : : 11
1.4 Thesis scope and outline : : : : : : : : : : : : : : : : : : : : : : : : : 15
2 NUMERICAL MODEL : : : : : : : : : : : : : : : : : : : : : : : : : : 17
2.1 Governing Equations : : : : : : : : : : : : : : : : : : : : : : : : : : : 17
2.2 The Electromagnetic-Acoustic Analogy : : : : : : : : : : : : : : : : : 21
2.3 Integral Form of the Maxwell's Equations : : : : : : : : : : : : : : : : 24
2.4 Scattered Field Formulation : : : : : : : : : : : : : : : : : : : : : : : 26
2.5 Dual Grid : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 27
2.6 Discretization and Time Integration : : : : : : : : : : : : : : : : : : : 29
v
2.7 Dispersion, Dissipation and Articial Dissipation : : : : : : : : : : : 36
2.8 Time Step Calculation : : : : : : : : : : : : : : : : : : : : : : : : : : 41
2.9 Boundary Conditions : : : : : : : : : : : : : : : : : : : : : : : : : : : 44
2.9.1 Surface Boundary Condition : : : : : : : : : : : : : : : : : : : 44
2.9.2 Outer Radiation Boundary Condition : : : : : : : : : : : : : : 47
4 PARALLELIZATION ISSUES : : : : : : : : : : : : : : : : : : : : : : 63
4.1 Data Parallel and CM Performance : : : : : : : : : : : : : : : : : : : 63
4.2 Message Passing : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 69
4.3 Message Passing Libraries : : : : : : : : : : : : : : : : : : : : : : : : 70
4.4 Implementation in MPL : : : : : : : : : : : : : : : : : : : : : : : : : 72
4.5 Domain Decomposition Strategies : : : : : : : : : : : : : : : : : : : : 75
4.6 Message Passing Performance : : : : : : : : : : : : : : : : : : : : : : 81
4.7 Performance Results : : : : : : : : : : : : : : : : : : : : : : : : : : : 83
5 RESULTS : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 86
5.1 Wave Propagation : : : : : : : : : : : : : : : : : : : : : : : : : : : : 87
5.2 Scattering from a sphere : : : : : : : : : : : : : : : : : : : : : : : : : 88
5.3 Scattering from an ogive : : : : : : : : : : : : : : : : : : : : : : : : : 101
5.4 Scattering from an ellipsoid : : : : : : : : : : : : : : : : : : : : : : : 107
5.5 Scattering from the NASA almond : : : : : : : : : : : : : : : : : : : 110
5.6 Scattering from a trapezoidal wing : : : : : : : : : : : : : : : : : : : 120
vi
5.7 Scattering from an engine inlet : : : : : : : : : : : : : : : : : : : : : 121
REFERENCES : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 133
APPENDIX : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 143
A Dierent forms of Maxwell's equations : : : : : : : : : : : : : : : : : 143
vii
LIST OF FIGURES
xi
LIST OF TABLES
xii
ACKNOWLEDGMENTS
I would like to start by thanking Dr S Venkateswaran for injecting the condence
in me to start working in a eld I had little knowledge about. Words do fail me, in
expressing my heartfelt gratitude to Prof L.N. Long for his encouragement, advise,
support, guidance and patience during the course of this work. I would also like to
thank Prof R.J. Luebbers for his many useful suggestions that have been invaluable
in charting the direction of this thesis. Many thanks are also due to Prof P.J. Morris
and Prof C.L. Merkle for their comments and suggestions.
I would also like to extend my thanks to members of the Computational Aero-
Acoustics (CAA) group at Penn State for their input in aiding this work.
I cannot help but applaud the computational support provided by Kevin Mo-
rooney and Jerey J Nucciarone of the Center for Academic Computing at Penn
State.
My sincerest appreciation to all my oce-mates, past and present, for their
help, support and patience in putting up with my unpredictable, iconclastic nature.
In particular, I would like to mention Mr Yusuf Ozyoruk and Major Je Little for the
technical assistance they provided me with.
Finally, I would like to thank Mr Joe Schuster and Mr Branko Kosovic for
reviewing my thesis and for their technical collaboration during the entire course of
this thesis.
xiii
VITA
Vineet Ahuja was born on the banks of the river Yamuna in Delhi, India on
September 19, 1966. In June 1988, he obtained a B.A.(hons) degree from St. Stephens
College, Delhi University. He graduated from the Indian Institute of Technology, in
Bombay, with a Master of Science in Applied Mathematics in July 1990.
xiv
Chapter 1
INTRODUCTION
Five decades ago, the ENIAC computer, armed with its vacuum tubes and relay
memories, did battle in the elds of xed-point arithmetic. Five relentless revolutions
later, Massively Parallel Processing(MPP) marches on to combat the grand challenge
applications. The evolution of the computer from a serial mechanical device to a
multitude of high speed processors working in unison has led to rapid advances in the
eld of scientic computing. Problems exhibiting a fair amount of complexity like
weather forecast modelling, articial intelligence, design of VLSI circuits, turbulence
modelling involving Large Eddy Simulation, and stealth technology are being tackled
with existing computer technology [1].
Like all the other grand challenge problems, stealth and vehicle signature tech-
nology requires systems speeds approaching the tera
op regime and a memory capac-
ity of hundreds of gigabytes [2], [3], [4]. Fundamental to the stealth technology is the
scattering problem in electromagnetics, which has generated a fair amount of interest,
since Lord Rayleigh [5] investigated the scattering of electromagnetic waves from a
sphere. Since Radar Cross Section (RCS) is physically described as the fraction of
the electromagnetic radiation scattered back to the radar with respect to the imping-
ing radar signals, scattering processes have become important in the identication of
objects. Radar based identication of objects based on the scattering patterns that
emanate from them has widespread uses in military and space exploration. Reduction
of RCS of ghter aircrafts, missiles and spy satellites is of paramount importance to
1
2
Computational Domain
scattered waves
CCCCCCCC
CCCCCCCC farzone surface
incident
wave
CCCCCCCC
Target
CCCCCCCC
CCCCCCCC
value problem involving Maxwell's equations in the time domain for an isotropic me-
dia. This novel approach of utilizing digital computers to solve Maxwell's equations
in the time domain deviated from the trend of solving a set of ordinary dierential
equations in the frequency domain. Yee's algorithm, as it is now known, utilised the
leapfrog method to integrate the equations on a completely staggered grid i.e. all
the electric eld and magnetic eld components were located at completely dierent
positions on the grid. The electric eld components lie in the middle of the edges
and the magnetic eld components are located at the center of the faces (g 1.4).
Yee's approach is notable in that it was the rst paper that used nite dierences in
electromagnetics, thus giving birth to the discipline that came to be known as Com-
putational Electromagnetics (CEM). The staggered nature of his scheme is especially
suited to scattering type problems as it helps in easily evaluating boundary conditions
at the surface of the scatterer. This is primarily because only those eld components
lie on the surface for which the boundary conditions can be analytically determined.
Some of the advantages that accrued by using Yee's leapfrog algorithm include
the use of a uniform size staggered mesh thereby impeding the production and propa-
gation of spurious waves, eciency for parallel computation and simplicity in specify-
ing boundary conditions on the surface of targets. Over the past three decades, Yee's
algorithm has been used to solve various types of scatterers with complicated shapes
and materials [11]. However, this scheme has some serious pitfalls [24]. It is prone to
staircasing errors and requires the use of a large number of grid points for geometries
like the ogive and the NASA almond that have a high degree of curvature or have
corners and sharp edges included in them. In fact, FDTD has diculty modelling
7
z
Hz
Ez
Hx Hy
Ex
Ey
Body-tted structured grid algorithms, are curvilinear orthogonal grids that
map the surface of the body exactly. In this case there exists a transformation between
the physical domain and the computational domain through a set of metrics. The
use of body-tted grids in Computational Electromagnetics evolved as a result of
technology transfer from Computational Fluid Dynamics. There are several nite
dierence and nite volume methods that are employed in solving CEM problems.
The algorithms used by Shankar [2], Shang [17] are characteristic based methods
that use one-sided dierencing based on the direction of wave propagation. Shang
9
[17] has used a
ux-vector splitting Alternating Direction Implicit (ADI) scheme
by Douglas and Gunn for his three-dimensional characteristic formulation. Shankar
[2] uses Osher's Riemann solver for
ux linearization with a Lax-Wendro scheme.
Both of the above approaches use collocated E and H eld points, hence failing to
exploit the duality of Maxwell's equations. Goorjian [10] proposed that a staggered
formulation could be used on curvilinear body-tted grids. However, the nature of the
staggering is dierent from Yee staggering (g 1.4) in that all the E-eld components
are collocated as are all the H-eld components. Such a staggering procedure keeps
the advantages of Yee staggering with respect to simplicity in implementation of
the boundary conditions, without the limitation of a uniform Cartesian mesh and
staircasing errors. Noack and Anderson [15] extended the Goorjian staggering to a
nite volume formulation. Body conformal structured algorithms are rather ecient
in computational time and memory as compared to unstructured or hybrid methods.
They are relatively easy to parallelize and load balancing between processors is not
a problem.
1.2.2 Unstructured
Unstructured grid methodology has become very popular in the CFD commu-
nity in the last few years mainly due to the relative ease with which unstructured grids
can be generated around complex congurations [28], [29]. Unstructured grid solvers
are replacing structured solvers for steady state aerodynamic problems [30] [31] [32],
particularly those that need to solve the Euler equations [33] [34] [30]. Since Maxwell's
equations can be readily put in the form of the Euler equations unstructured grids
seemed to be promising in CEM too. However, unstructured methodology in CFD is
10
still largely restricted to problems of steady state and there has been little develop-
ment of time dependent type wave propagation problems. This is largely attributed
to problems of
ux reconstruction and accuracy. Although there is tremendous time
savings in generating unstructured grids it seems to be oset by the fact that the
computer time required for unstructured
ow solvers is considerably more than that
required by structured solvers. Unstructured grid methods also require more memory
because of the storage of grid information regarding grid connectivity between the
faces and vertices of the grid cells. Many nite volume and nite element models [19],
[20], [35] fall in this category. However, most of the nite volume methods employed
here have problems of accuracy especially for time dependent problems. The algo-
rithms in this category are computationally very intensive. There is an additional
overhead cost of maintaining connectivity arrays and inverting large matrices in the
case of nite element methods.
rectangular mesh thereby removing the need to carry out any sort of interpolation.
However, grid generation in itself has become a tedious process, and the advantage
gained by using an unstructured mesh seems to be lost in the process. Paralleliza-
tion of these hybrid methods is quite dicult and load balancing becomes an issue of
serious concern.
Processors
P1 P2 Pi Pn
Connecting Network
Memory Blocks
Processors
P1 P2 Pi Pn
Memory Blocks
MB1 MB2 MBi MBn
Connecting Network
boundaries are equally divided between processors to achieve near perfect load bal-
ancing. Data parallel type algorithms are better suited to shared memory systems,
although not restricted to them. Data parallel algorithms do not show any specic
preference for zonal grid type algorithms. However, data parallel type algorithms
carry a large overhead cost with processes that are inherently serial in nature. Pro-
gramming using a data parallel approach is relatively easy as data partitioning is
usually transparent to the user.
In the recent past, there has been a considerable eort made in utilizing both
the data parallel and message passing paradigms to solving FDTD/FVTD problems
in computational electromagnetics. In [37] parallelization issues related to the Yee al-
gorithm are discussed for the data parallel paradigm. Parallelization issues have also
been discussed for FVTD type algorithms in [14] for the CM-200/CM-5 machines.
Certain drawbacks of the data parallel paradigm are the overhead costs incurred in
performing the near to far eld transformation and computation of the outer ra-
diation boundary condition. Shang et al [38] addresses the domain decomposition
strategies for message passing type electromagnetic algorithms. Rowell et al [16] have
also incorporated a zone based parallel strategy on the Intel Paragon for their
ux
dierence algorithm. Nguyen and Hutchinson [12] have used the bicharacteristic form
of the Maxwell's equations on parallel architectures supporting message passing. In
[39] the authors have presented a dual grid based approach where the message passing
is reduced due to the fact that only the E-eld variables need to be passed across
processors, thus making the algorithm very ecient on parallel architectures that
support message passing.
Code development for the data parallel part of the current work was done on
the Thinking Machines Corporation CM-200 computer at The Pennsylvania State
15
University and most of the results shown herein were obtained on the CM-5 at NASA
Ames Research Center. The CM-200 is a SIMD type machine consisting of Weitek
oating point processors while the CM-5 has vector unit processors and can function
in both the SIMD and MIMD modes.
Table 1.1. Penn State CM-200 and NASA Ames CM-5 characteristics
Machine No. of Processors Peak Speed Memory
CM-200 64 1.2 Giga
ops 0.256 GBytes
CM-5 128 16.4 Giga
ops 4.096 GBytes
Message passing code development was done on the IBM 9076 Scalable Power
parallel system (SP-2) which comprises of a group of RISC system/6000 processors
connected by a low latency high bandwidth switch. For the purpose of the present
work both the SP-2 at The Pennsylvania State University and that at the Maui High
Performance Computing Center (MHPCC) were utilized.
Table 1.2. Penn State and MHPCC SP-2 characteristics
Machine Nodes Memory
PSU-SP2 32 RS/6000 model 390 128 MBytes/proc
16 RS/6000 model 370 128 MBytes/proc
1 RS/6000 model 590 1 GByte/proc
MHPCC-SP2 400 RS/6000 model 590 64-1024 Mbytes/proc
avoids the production of stair-stepping errors that are typical of Yee's leapfrog
algorithm.
The governing equations, the numerical algorithm, spatial and temporal dis-
cretization are discussed in Chapter 2. This chapter also deals with issues regarding
accuracy and numerical dispersion and dissipation errors. Boundary conditions are
also treated in this chapter. The surface boundary conditions are derived and dis-
cussed in this chapter. Also the outer radiation boundary condition is dealt with
here.
Chapter 3 discusses the near-to-far-zone transformation and the post-processing
that is required to calculate the Radar Cross Section of targets.
Chapter 4 is devoted to the parallelization of the algorithm. Both the message
passing and data parallel implementations are discussed in detail.
Chapter 5 describes the scattering results for dierent bodies. Comparisons of
RCS with exact solution and other FDTD computations are shown in this part of the
thesis.
Chapter 6 summarizes the work described in this thesis and presents recommen-
dations for future work.
Chapter 2
NUMERICAL MODEL
17
18
Wire loops
I I
V = , d
dt
m
(2.1)
The negative sign in the equation forces the relationship between EMF and current
directions in accordance with the right hand rule (Fig 2.1).
However, the total magnetic
ux associated with the circuit can be written as
Z Z
m =
S
B dS (2.2)
and
I
V = E dl (2.3)
19
where the integration in 2.3 is over the path that encloses the surface S. Combining
equations 2.1, 2.2 and 2.3 we obtain
I Z Z
E dl = , @t@ S
B dS (2.4)
The line integral on the left hand side of equation 2.4 can be viewed as the electric
circulation around a path S that encloses the surface S.
Applying Stokes' theorem to the left hand side of equation 2.4 we have
Z Z Z Z
(r E) dS = , @ B dS (2.5)
S @t S
@B + r E = 0 (2.6)
@t
Maxwell's second law can be derived from Ampere's law which states that the
line integral of magnetic eld around a closed contour is equal to the sum of the
conduction and displacement currents enclosed.
I Z
H dl = S (Jcond + Jdisp) dS (2.7)
Jcond = E (2.8)
The left hand side of equation 2.10 represents the magnetic circulation around the
curve S and the curve S is a magnetic contour.
In an analogous fashion to Maxwell's rst equation, by using Stokes' theorem
on the left hand side of equation 2.10 we have
!
Jcond + @@tD dS
Z Z Z Z
(r H) dS = , (2.11)
S S
@D + J = r H (2.12)
cond
@t
The nal two equations of Maxwell stem from Gauss's law relating to the conservation
of the electric and magnetic
ux densities over a closed surface.
The electric
ux density through a closed surface is equal to the volume integral
of the charge density enclosed by that surface
Z Z Z Z Z
S
D dS = V
dV (2.13)
21
The dierential form can be obtained from the above equation 2.13 by applying the
divergence theorem to the term on the left hand side and assuming an innitesimal
volume element
rD= (2.14)
rB=0 (2.16)
is also travelling along the same direction. Mathematically, the electromagnetic wave
can be written by assuming Ex = Ez = Hx = Hy = 0 in the equations 2.6 and 2.12.
Equation 2.6 reduces to
@Hz = , 1 @Ey (2.17)
@t @x
Similarly, equation 2.12 reduces to
@ 2 E , c 2 r2 E = 0 (2.19)
@t2 y y
@ 2 p , C 2 r2 p = 0 (2.20)
@t2
where
p is the induced perturbation to the pressure,
C is the speed of sound in the medium.
It should be noted here that the above equation is for an ideal compressible
uid
[44] with negligible dissipation processes and constant ambient pressure and density.
The other important acoustic equations that helps in establishing the electromagnetic
-acoustic analogy include the equation of conservation of mass 2.21 , the adiabatic
23
@
s = , @x (2.21)
where
is the
uid particle displacement,
s is the condensation variable dened as
s = ( , 0) (2.22)
0
p = as (2.23)
@ 2 = , 1 @p (2.24)
@t2 0 @x
Comparing equation 2.19 with 2.20 and equation 2.17 with 2.24 a correspondence can
be drawn for the primitive variables in acoustics with those in electromagnetics. This
isomorphism between the acoustic and electromagnetic variables, along with some of
the other physical quantities that are constituted by a combination of these variables,
is shown in table 2.1.
In the next section Maxwell's equations are retraced and put in its integral form
which is suitable for computation.
24
Table 2.1. Isomorphism of variables between an acoustic plane wave and a linearly
polarised electromagnetic wave
Electromagnetic Acoustic
Electric Field Ey pressure p
Magnetic Field Hz velocity _
Inverse Permittivity 1 Adiabatic bulk modulus a
Magnetic permeability Ambient density 0
Magnetic Energy / vol. 2 Hz kinetic energy / mass 12 0 _2
1 2
Electric Energy / vol. 21 Ey 2 potential energy / massq 12 p2 a
Speed of light c = 1=p Speed of Sound C = a =0
EM Impedance Z = c Acoustic Impedance Z = 0 C
Poynting vector Sx = Ey Hz Intensity I = p_
@ B = ,r E (2.25)
@t
@D = r H , j (2.26)
@t
For computational purposes equations 2.25 and 2.26 can be integrated over a volume
to obtain
Z Z Z @B Z Z Z
V @t dV = , r E dV
V
(2.27)
Z Z Z @ D dV = Z Z Z
r H dV (2.28)
V @t V
Applying Gauss Divergence Theorem to the right hand side of the above set of equa-
tions we obtain
Z Z Z
@ B dV = , Z Z n E dS (2.29)
@t
V S
25
Z Z Z @ D dV = Z Z
n H dS (2.30)
V @t S
The surface integral on the right hand side of equation 2.29 can be interpreted as the
electric vorticity over the surface S and similarly, the right hand side of equation 2.30
is the magnetic vorticity over the surface S.
In the case of homogeneous, linear, isotropic, and time-invariant materials the
following constitutive relationships can be dened
D = E (2.31)
B = H (2.32)
where the electrical permittivity and the magnetic permeability can be considered
to be material dependent properties.
Assuming the present formulation to be for free space and combining equations
2.29 and 2.30 with equations 2.31 and 2.32 we have
of a staggered grid where the electric and magnetic eld components can easily be
obtained on dierent grids. This is discussed in detail in section 2.5. The next section
deals with the use of the scattered eld formalism to solve for the Maxwell's equations
and the advantages over the total eld formalism.
As is dened in [11] the incident eld here is assumed to travel in free space
throughout the domain of the problem and the total eld is supposed to propagate in
free space outside the scatterer and in the media of the scatterer when propagating
within the scatterer. The scattered eld is dened as in the equations above as
the resultant of the incident elds subtracted from the total elds. Scattered elds
emanate from the scattering object when the incident waves impinge upon them.
Separating the two elds has some inherent advantages in solving scattering
problems. The incident eld in this case can be specied analytically. Separating the
elds provides the
exibility of making dierent waveforms incident on the scatterer
with relative ease. The only precaution to be taken in this case is that the specied
27
eld should be Maxwellian, i.e. they should satisfy Maxwell's equations exactly.
Analytically specifying the incident eld makes treatment of the outer boundary
condition easier to implement since only the outgoing scattered elds have to be dealt
with at this boundary. If a total eld formulation had been used, one would have
to keep track of the incoming and outgoing waves and boundary treatment would be
more complicated. Analysing scattered elds in situations where the scattered eld
amplitude is much lower than the incident eld amplitude is easier with a separate
eld formalism. In order to compute Radar Cross Section (RCS), scattered elds
are explicitly needed as the ratio of the far-eld scattered to incident amplitudes
are required. Thus the use of the separate eld formalism is again preferred, as less
post-processing work is required.
CCCC
CCCCCCCCCCCCC
CCCC H
CCCCCCCCCCCCC
E H
CCCC
CCCCCCCCCCCCC
CCCCCCCCC
CCCC
CCCCCCCCCCCCC
CCCCCCCCC
H CCCC
H
CCCCCCCCC
CCCC
CCCCCCCCC
CCCCH
E CCCCCCCCC
CCCC
CCCCCCCCC
CCCC
CCCCCCCCC
E
CCCC H
E E
H
H
H
H
xE
H
H
H H
E
E
E
E
xH
E
E
E E
integration method is chosen because of its high order of accuracy and its successful
application in modelling time dependent phenomena in aero-acoustic type problems
[45] and [46].
The system is solved using a four-stage Runge-Kutta time integration method
given by
Qmi =1 = Qni ,
1tR0i
Qmi =2 = Qni ,
2tR1i
Qmi =3 = Qni ,
3tR2i
Qmi =4 = Qni ,
4tR3i
Qni +1 = Qmi =4
The time step is denoted by n and each stage of the Runge-Kutta method by m,
where the coecients are
m = 41 ; 31 ; 12 ; 1 respectively.
2 3 2 3
6
6
Hx 77 6
6
Ex 77
Q1 = 6666 7
Hy 777 Q2 = 6666 7
Ey 777 (2.37)
4 5 4 5
Hz Ez
CCCCCCCCCCCCC
CCCCCCC
CCCCCCCCCCCCC
a CCCCCCC
b
ξ
S
Figure 2.5. Surface area calculation.
where
ζ
E
η
E
CCCCCCC
CCCCCCCCCCCCCCCCCCC
CCCCCCC
CCCCCCCCCCCCCCCCCCC
CCCCCCC
CCCCCCCCCCCCCCCCCCC
x
CCCCCCCCCCCCC
CCCCCCC
CCCCCCCCCCCCCCCCCCC
CCCCCCCCCCCCC
CCCCCCC
CCCCCCCCCCCCC
CCCCCCC
x
CCCCCCCCCCCCC
CCCCCCC
x
CCCCCCCCCCCCC
CCCCCCC
CCCCCCCCCCCCC
CCCCCCC
ξ
E
Figure 2.6. Illustration of points for
ux calculation.
It will be noticed that the electric eld points (Figure 2.6) used in the evaluation
of the above
uxes lie in the center of the cell faces and are not computed directly
from the integration process. Instead they are extrapolated from the electric eld
points that make up the corners of the cell faces. For example from Figure 2.7
Kordulla and Vinokur [47] have proposed an ecient method of calculating cell
volumes for three-dimensional
ow predictions. The volume V is calculated by
partitioning the hexahedron cell into ve tetrahedra and computing the volume of
each tetrahedron separately. This decomposition helps avoid gaps and overlaps in
computing cell volumes and is computationally very ecient. Figure 2.8 shows the
breakup of a hexahedra into its respective tetrahedral elements.
34
CCCCCCC
CCCCCCCCCCCCCCCCCCC
CCCCCCC
CCCCCCCCCCCCCCCCCCC
CCCCCCC
CCCCCCCCCCCCCCCCCCC
E(i, j, k+1) E(i, j+1,
CCCCCCCCCCCCC
k+1)
CCCCCCC
CCCCCCCCCCCCCCCCCCC
CCCCCCCCCCCCC CCCCCCC
CCCCCCCCCCCCC CCCCCCC
CCCCCCCCCCCCC CCCCCCC
CCCCCCCCCCCCC CCCCCCC
CCCCCCCCCCCCC
E(i, j, k) CCCCCCC E(i, j+1, k)
ξ
E
(i,j+1/2,k+1/2)
The expressions Sd ; Sd ; Sd represent the projected surface areas of constant
; ; faces of the dual grid respectively. It must be pointed out the subscripts i,j,k
used in the calculation of the above
uxes are subscripts on the dual grid.
It should be worth commenting on at this point that the dual grid based nite
volume scheme just described can be easily extended to higher order spatial accu-
racy with the help of a Gaussian quadrature technique [48]. This technique involves
35
55555555 55555555
55555555 55555555
55555555
55555555 55555555
55555555 55555555
55555555 55555555
55555555 55555555
55555555 55555555
55555555 55555555
55555555
55555555
5555555555555
5555555555555
5555555555555
5555555555555
5555555555555
5555555555555
5555555555555
5555555555555
5555555555555
5555555555555 555555555
55555555
555555555
5555555555555 55555555
555555555
5555555555555 55555555
555555555
5555555555555
5555555555555 55555555
555555555
5555555555555
5555555555555 55555555
555555555
5555555555555
5555555555555 55555555
555555555
5555555555555
5555555555555 55555555
555555555
5555555555555
5555555555555 55555555
555555555
5555555555555
5555555555555 555555555
5555555555555
5555555555555 555555555
5555555555555
5555555555555 555555555
5555555555555
Figure 2.8. Decomposition of a hexahedron cell into ve tetrahedra
36
determining the elds at the Gaussian points using interpolation and obtaining the
pointwise values of the solution using a reconstruction process.
The u and v variables were staggered in space so that the system can be written
as
2 3 2 3 2 3
@ U = 66 0 0 7
7 Uj ,1 + 6
c
0 x 77 6 0 , c 7
x 7 U
@t j 4 c 5
6
4 5 Uj + 64 5 j +1 (2.41)
x 0 , cx 0 0 0
where 2 3
6 u 7
U = 64 7
5
v
1
Figure 2.9. Dissipation for a Runge-Kutta with central dierencing.
cfl=0.5
cfl=0.75
cfl=0.9
0.999 cfl=1.0
0.998
Dissipation - |G|
0.997
0.996
0.995
0.994
0.993
0 0.5 1 1.5 2 2.5 3 3.5
phase angle - (k dx)
37
38
and inserting it in the equation 2.41 a relationship is obtained for the time discretiza-
tion
U^jn+1 = G()U^jn (2.43)
where G is called the amplication matrix and for the system in equation 2.41 using
four stage Runge-Kutta comes out to be
2 3
6 1 , 22 (1 , 32 sin2( k2 x ))(sin2 ( k2 x )) (1 , 232 sin2 ( k2 x ))(eikx , 1) 77
G = 64 5
(1 , 232 sin2( k2 x ))(1 , e,ikx) 1 , 22(1 , 32 sin2( k2 x ))(sin2( k2 x ))
where is the CFL number and can be dened as ct=x. The eigenvalues of the
matrix G are
s
1 (18,362sin2 ( kx )+124sin4 ( kx ) ,1442 + 2 cos2(kx)sin2 ( kx ))
= 18 2 2 2
The dissipation characteristics obtained from the spectral radius j+j of the
amplication matrix G using the four stage Runge-Kutta time integration is plotted
against the phase angle (kx)in gure 2.11. As is evident, the low wavenumber com-
ponents propagate without being damped. However, the high frequency components
are damped due to the grid induced dissipation caused by the staggering of the spa-
tial derivatives. In most cases, this amount of grid induced dissipation is enough to
suppress any high frequency numerical oscillations that may arise. These type of dis-
sipation characteristics are quite commonly seen with implicit and explicit methods
39
that utilize central dierencing [49] and have added articial dissipation terms.
Apart from improving the dissipation characteristics, spatial staggering is seen
to have an impact on the dispersion characteristics too. In gure 2.10 a comparison is
made between the dispersion error obtained for the one-dimensional wave equation on
a non-staggered grid using second order central dierences and a system of equations
on a staggered grid for various phase angles (kx) at a CFL number of 1. The
dispersion error for the staggered system is obtained by
Four stage Runge-Kutta time integration has been used in both cases. It is seen
that there is signicant improvement of the staggered system over the non-staggered
equation and that the staggered approach can be utilised to resolve 12 cells per
wavelength at 99 percent accuracy compared to the 20 cells per wavelength needed
for a standard second order central dierencing algorithm.
It should be pointed out that the above analysis is for a one-dimensional case
on a uniformly spaced mesh with perfect staggering. In reality, however, body-tted
grids are usually clustered and may have imperfections like skewed cells or cells with
high aspect ratios. Also, the staggering discussed in section 2.5 in three dimensions is
not completely perfect. All the above reasons contribute in the production of spurious
diusion [25] for the high wavenumber components that are not completely damped
by the grid induced dissipation. Hence the need for adding articial dissipation.
When required, fourth order dissipation [50], [51], [52] is added
DQ = D Q + D Q + D Q (2.45)
1
R-K non-staggered
Figure 2.10. A comparison of the dispersion characteristics.
0.8
0.7
dispersion error
0.6
0.5
0.4
0.3
0.2
0.1
-0.1
0 0.5 1 1.5 2 2.5 3 3.5
phase angle
40
41
D Q = di;j+ 21 ;k , di;j, 21 ;k
D Q = di;j;k+ 21 , di;j;k, 21
0.95 cfl=1.0
0.9
Dissipation - |G|
0.85
0.8
0.75
0.7
0 0.5 1 1.5 2 2.5 3 3.5
phase angle - (k dx)
42
43
t = min (t)i;j;k
where
h i
(t )i;j;k (t )i;j;k (t )i;j;k
(t)i;j;k = h i (2.47)
(t )i;j;k (t )i;j;k + (t )i;j;k (t )i;j;k + (t )i;j;k (t )i;j;k
or
(t)i;j;k = 1 (2.48)
1 1 1
(t )i;j;k + (t )i;j;k + (t )i;j;k
and
(V )ijk
(t )i;j;k = CFL rh i2 h i2 h i2 (2.49)
C Sx + Sy + Sz
where
h
Sx 1i
= 2 Sx i+1=2;j;k + Sx i,1=2;j;k
h i 1
Sy = 2 Sy i+1=2;j;k + Sy i,1=2;j;k
h i 1
Sz = 2 Sz i+1=2;j;k + Sz i,1=2;j;k
n Etotal = 0
form as
Z Z Z
(r D) dV = 0 (2.50)
V
or
Z Z
(n D) dS = 0 (2.51)
S
We need to apply Gauss's electric law to each cell that lies on the boundary.
Assuming that the grid points that lie on the surface of the conductor do not con-
tribute anything to the dot product in Gauss's law for those sides of the cell that are
perpendicular to the conducting surface (Figure 2.13).
Combining this condition with the fact that the tangential components of the
total elds are equal to zero we can write explicit expressions for the scattered eld
components of the electric elds at the boundaries.
Extotal normal = 2
Sx (T ) (2.52)
[Sx ] + [Sy ]2 + [Sz ]2
Sy (T )
Eytotal normal = (2.53)
[Sx ]2 + [Sy ]2 + [Sz ]2
CCCCCCCCCCCPEC Surface
PEC Surface CCCCCCCCCCC
CCCCCCCCCCC
CCCCCCCCCCC
incident
flux for the shaded surface is = − S ηx E
CCCCCCC
o
CCCCCCC
o
CCCCC
CCCCC
o
CCCCC
CCCCC
CCCCC
o
CCCCC x
CCCCCnormal
PEC Surface
CCCCC
CCCCC
PEC Surface
CCCCC
x
@B + r E = 0 (2.55)
@t
@D + J = r H (2.56)
cond
@t
For a source free lossless media Jcond = 0, we can combine the two equations
after taking the curl of each of the two equations. This yields the second order wave
48
@ 2 H , c 2 r2 H = 0 (2.57)
@t2
@ 2 E , c2r2 E = 0 (2.58)
@t2
Expressing the above equations for the time-harmonic case we have
k2 H + r2H = 0 (2.59)
k2 E + r2E = 0 (2.60)
where v
u !
u
L = ikt1 + 1 @2 + @2 (2.62)
k2 @y2 @z2
For the absorption of waves travelling along the negative x direction at a plane surface
located at x=0 we can apply the following condition
!
@ ,L E =0 (2.63)
@x
49
In order to model the boundary condition computationally the square root term in
equation 2.62 needs to expanded in its Taylor series approximation. For example, the
rst order approximation for a time dependent problem would reduce to solving
!
@ , @ E=0 (2.64)
@x @t
Equation 2.64 is also the rst order approximation boundary condition introduced
by Enquist and Majda [55]. In the eld of computational electromagnetics this is
also known as the Mur boundary condition [54]. For this boundary condition to be
eective it has been assumed that the waves incident on the plane represented by x=0
are plane waves travelling along the x-direction. This boundary condition produces
a re
ection of waves for angles of incidence that are not normal to the boundary.
Analytically the re
ection coecent can be written as
!
Aref = Ainc cos , 1 (2.65)
cos + 1
where is the angle between the direction of the incident wave and the normal to the
boundary. For a curved boundary the boundary condition can be modied to become
!
@ , @ + 1 E=0 (2.66)
@r @t 2r
X
(x; t) = f (ct , xcosi ) (2.67)
i
Furthermore, if the solution (x; t) represented a single plane wave, then the value
of at (t + t) at a position x can be determined from the value at a point located
at (x-c t/cos ) at a time t, if the angle of the outgoing wave is known.
X
(x; t + t) = f (c(t + t) , xcosi ) (2.68)
i
X
(x; t + t) = f (
i + i) (2.69)
i
where
i = ct , (x , ct)cosi
i = ct(1 , cosi)
f ( + ) = f ( + ) , f ( )
m f (
+ ) = m,1 f (
+ ) , m,1 f (
)
51
The discrete form of a linear interpolation of the Liao boundary condition 2.71
can be rewritten by replacing the functional f by the electric eld E as
where
T11 = (2 , s)(1
2
, s) T12 = s(2 , s)
T13 = s(s 2, 1)
and
s = c44nt where 02
52
The eectiveness of the boundary condition was tested for the case of scattering
of a wideband pulse from a perfectly conducting sphere. This case is taken up in
detail in chapter 5. However, at this point, the eect of truncating the domain with
the Liao boundary condition is investigated. The grid used for this case was an
O-O type grid and consisted of 37x46x73 grid points. The sphere is of radius 0.1
meters. There are 46 grid points in the direction running from the scatterer to the
outer boundary. In order to test the eectiveness of the outer boundary condition the
computational domain was truncated in the radial direction from 46 grid points in the
original case to 23 grid points. Figure 2.15 is the comparison of the two solutions as
the scattering process evolved in a plane perpendicular to the direction of propagation
of the incident wave. Figure 2.14 is the comparison of the scattered elds in the two
domains in the plane of propagation of the incident wave. As can be clearly seen the
truncation of the domain did not lead to any change in the solution due to spurious
wave re
ections at the boundary.
53
Figure 2.14. Comparison of the electric eld in a plane parallel to direction of incidence
54
Maxwell's equations are solved in the time domain close to the body of the scat-
terer since the computational domain ends typically a few wavelengths away from the
surface of the scatterer. After which, it is normally truncated by the outer radiation
boundary condition. This region between the scatterer and the outer boundary is
normally termed the near zone or the near eld. Estimation of radar cross section
(RCS) requires the intensity of scattered wave at innity. This is normally termed the
far zone or the far eld solution. The far eld solution can be obtained as frequency
domain transformations, that are applicable at single frequencies [3]. In [11] the au-
thors have discussed the implementation of the far eld solution in nite dierence
algorithms in great detail for both time harmonic solutions and wideband excitations.
The solution at the far eld is usually calculated from the one in the near zone with
the help of a Green's function [57]. A closed surface is rst dened somewhere in
the computational domain that encloses the scatterer. The equivalent electric and
magnetic currents are then computed on the surface by the tangential electric and
magnetic elds. The Green's function transformation is then utilised to obtain the
scattered elds in the far eld that are radiated by the currents on the surface.
This method of estimating the far eld solution, based on a near to far eld
transformation, is analogous to the Kirchho method that is used to predict the
linear acoustic far eld in aerodynamic noise related problems [58] and [59]. In that
case, a control surface is dened in such a manner that it encompasses all nonlinear
55
56
ow eects and noise sources, although the surface itself must lie in the linear region.
The complete set of nonlinear equations are solved in the region enclosed by the
surface and the surface integral of the solution over the control surface is utilised
in analytically determining the acoustic far eld. A comprehensive review of this
method and its application to acoustics is found in [60].
One major drawback of the Yee algorithm is that due to the staircased grids that
are employed, close to the surface of the scatterer there appear to be high frequency
errors generated. However as the computational domain is traversed away from the
scatterer these errors seem to get smoothed out. This is quite apparent in gure
3.1 which depicts the process of scattering from a sphere as viewed along a plane
tangential to the direction of propagation of the incident wave. The positioning of
the far zone integration surface in the computational domain is constrained by the
staircasing errors in problems that employ the Yee algorithm. Body-conformal grids
on the other hand have the advantage that the far zone integration surface can be
dened close to the scatterer.
Equations 3.1 and 3.2 can be solved to obtain the values of A and .
The time harmonic electric eld can be dened as
E (!) = Aei
A similar treatment is carried out for the magnetic eld components at all grid points
that lie on the farzone surface. Since the magnetic eld components lie on the dual
grid rather than the actual grid they are obtained by averaging from one dual grid
layer above and the one below the actual surface.
The time harmonic equivalent scattered surface currents are then computed from
58
the time harmonic scattered elds at the surface. These can be written as
Next the time harmonic vector potentials are obtained by integrating over the
farzone surface. They are obtained from the surface currents by the following expres-
sions:
Z
The time harmonic far zone electric elds can be obtained by utilizing the time
harmonic vector potentials as
Time differential
Point of reference
^
−r
x −
r
x
js = n H (3.9)
ms = E n (3.10)
In [61] the authors show that the retarded potentials w and u can be computed from
the scattered surface currents using the following formulas:
js (t) ds (3.11)
@t s
1
ms (t) ds
(3.12)
@t s
where:
t = the elapsed time
Depending on the type of mesh employed i.e. O-O type, O-H type the integration
surface is appropriately constructed. Assuming that an O-O type grid is being used
61
the surface = constant then denotes the integration surface. In that case, the far
zone transform equations can be recast in discretized form as follows:
At each time step the contribution of each cell surface that makes up the in-
tegration surface = constant is calculated and placed in its corresponding w bin
whose index is computed by subtracting the time delay factor from the elapsed time
and dividing the result by the time step t. As a result, the size of the potential
arrays will not correspond to the number of iterations. The size of the potential
arrays would be slightly larger, the increase in size being dependent on the relative
distances of points on the far zone surface from the reference point. If Ra represents
the closest point on the far zone surface to the far zone eld point and Rb the furthest
point from the far eld point then
M = cRat + N + cRbt
where cRat is representative iteration dierential for startup between point A and the
reference point. Similarly, cRat is indicative of the time delay between point B and the
reference point. The surface currents on the far zone surface will aect the radiation
in the far eld for M number of iterations.
The far zone scattered elds can be computed from the above potentials as
E = ,Zw , u E = ,Zw + u
(r0) 2
scatt
4(r0)2 E
= rlim
0 !1 (3.15)
E inc
PARALLELIZATION ISSUES
63
64
are the CM-5 and the CM-200 produced by Thinking Machines Corporation. The
CM-5 has an architecture that comprises of 32 - 1024 processors with each proces-
sor supporting 4 vector units. Each vector unit has 8 megabytes of memory. The
language used in code development on these machines is CM-Fortran which is simi-
lar to the High Performance Fortran standard. It diers from Fortran 90 in mainly
two ways. Firstly, there exist compiler directives that specify how the data is dis-
tributed over processors. This is done with the help of the CMF LAYOUT command.
This command instructs the compiler as to how the arrays should be laid out in the
computers memory. For example the compiler directive
would force A to be spread across all the parallel processors. However, the compiler
directive
CMF $ LAY OUT A(: news; : news; : serial)
would distribute the elements of array A across all processors in such a way that each
processor has a 2-D array of the size of the rst two dimensions.
Secondly, there exist constructs for data parallelism like the FORALL state-
ment. These constructs are array processing features that provide the programmer a
little more latitude and control in inducing parallelism. For example, the FORALL
statement helps in controlling data motion, thereby enabling the programmer to:
exercise DO LOOP type control in CM-Fortran
dene a masking operation, much like an embedded IF in a DO LOOP
perform complicated permutations on multi-dimensional arrays.
operate in a complicated manner on irregularly shaped parts of an array.
65
A=B+C
do j=1,jmax
do i=1, imax
a(i,j)=b(i,j)+b(i+1,j)
enddo
enddo
can be performed by issuing the command
However, multiple cshifts have been found to be very expensive. Therefore additional
arrays are dened to cut down communication costs. For example
A = cshift(cshift(b; 1; 1); 2; 1)
The more dicult parts of code conversion lie in the implementation of the outer ra-
diation boundary condition. The Liao boundary conditions use 3-dimensional arrays
but one of the dimensions is less than the other two. Additional arrays are dened
that store the electromagnetic variables at previous time steps and a masking func-
tion is used to perform the boundary condition operations only at the boundaries.
This method is not very ecient since most of the processors are sitting idle while
the boundary operations are being performed.
However, the most signicant issue is that of the implementation of the far zone
67
transformation. The far zone transformation is essentially a serial process since the
time delay factor dictates the index of the potential arrays where the contribution
of the surface arrays have to be accounted for. This is being achieved by a process
called send with add and the use of indirect array referencing, where the time delays
of dierent points on the far zone surface are stored in an array that acts like a
set of pointers for the potential arrays. Since the electric and magnetic currents
on the far zone transformation surface are cross products of surface areas and the
time derivatives of the electromagnetic eld components they are computed as two-
dimensional quantities.
Once the components of the magnetic and electric eld currents are computed,
their contribution is added to the potential arrays which are essentially one-dimensional
quantities. Hence the current arrays are converted to one-dimensional quantities using
the RESHAPE function. For example :
For the currents corresponding to each grid point on the far zone transformation
surface a time delay factor is computed that determines the appropriate potential
array into which the contribution of the current is added. In other words the time
delay factor array acts as a pointer array to the currents array.
5000
4500
Serial FVTD
Parallel FVTD
4000
3500
Retarded Time Steps
2000
1500
1000
500
0 2500 3000
2
1.5
0.5
-0.5
-1
-1.5
E_theta Amplitude
Here WPOT is a potential array, and MM3 is a pointer array. The contribution
of dcurntmx is added to the appropriate potential array block by the masking con-
dition provided by the pointer array MM3. This, essentially, is the logic behind the
SEND-WITH-ADD process [62].
A run on the 32 node CM , 5 took 19.92 micro-sec/grid point/time-step.
69
This time included the overhead of reading/writing les. With the far zone pro-
cess excluded, an equivalent run on the 32 node CM , 5 took 17.38 micro-sec/grid
point/time-step.
Although a data parallel FVTD algorithm results in an eective speedup of over
70 percent in turnaround time over the serial version of the code, this approach has
some deciencies. It should be noted here that the parallel version performed at
approximately 2 percent of the peak speed of the machine. Firstly, implementation
of the outer radiation boundary condition proves to be quite costly since most of
the processors wait while a few compute the outer boundary. Secondly, the farzone
transformation, although eciently implemented still produces a large overhead due
to its inherently serial nature. These deciencies prompted the consideration of the
message passing paradigm that is discussed in the next section.
the code to MPL and issues regarding implementation of the algorithm to a message
passing type paradigm is delineated in 4.4.
The eciency of a message passing algorithm is gauged by its ability to mini-
mize the communication-to-computation ratio and to balance the workload eectively
by partitioning the data structure evenly. Many dierent program/data decomposi-
tion techniques are used to achieve this end. The three commonly used types of
decomposition techniques are domain decomposition, control decomposition and ob-
ject decomposition. Domain decomposition techniques are best suited for structured
data problems and will be discussed at length in section 4.5. Control decomposition is
best adapted to situations where the data cannot be well organised, hence parallelism
can be enforced by distribution of the
ow of control of computation rather than the
partition of the computational domain. An object oriented programming model can
be utilized as a parallelizing tool leading to object based decomposition.
MPICH has been developed at the Argonne National Laboratory in conjunction with
Mississipi State University and MPI-F is a product of the research division of IBM.
MPL is the message passing library developed by IBM to parallelize applications on
the Scalable Powerparallel Systems (SP1 and SP2). It is a robust library and has
provided the foundation for the development of the MPI class of libraries. PVM
is one of the oldest of this class of message passing libraries and was developed at
the Oak Ridge National Laboratory and the University of Tennessee. The versions
supported on most SP platforms is an optimised version called PVMe. The Numer-
ical Aerodynamic Simulation Program at NASA Ames Research Center carried out
an extensive performance analysis comparing the dierent message passing libraries.
Table 4.1 shows the latency and bandwidth comparison of the various libraries.
While MPI and MPL have given reasonably good performance, PVM has been
found to be lacking in performance on the SP2. PVM, however, has the
exibility of
dynamically adding and deleting tasks during execution of a parallel application. It
should be pointed out here that PVM is structurally dierent from the other libraries
and cannot take advantage of asynchronous message passing that is performance
enhancing if applicable to the applications. Although MPI is extremely portable and
can just as easily be run over a cluster of workstations as on the SP2, MPL was chosen
as the library for implementation of the FVTD algorithm. Since MPL was developed
72
H−field grid
was performed on the SP-2 using the Message Passing Library (MPL). MPL uses the
Single Program Multiple Data (SPMD) model wherein the same program resides on
all processors and is executed by each one of the processors. However, each processor
has its identier processor number and this permits MPL to have processor based
conditional statements. The reason for using a SPMD model is that in a distributed
memory system, memory and address space is local to each processor and the only
way data can be shared among processors is by passing messages. Each processor has
it's own processor domain to work with. Each processor domain is either bounded by
pseudo boundaries or a combination of physical and pseudo boundaries.
Figure 4.3 illustrates the SPMD mode and the manner in which the algorithm is
implemented in MPL. A pre-determined processor (with processor ID=0) acts as the
control processor and carries out the pre-processing tasks involved in setting up the
parallel environment and all parameters required for message passing. It reads in the
grid and distributes it to the various processors depending upon the type of domain
decomposition. Each processor then generates its own dual grid from the information
it has about its own grid. The grid and the constructed dual grid together constitute
the processor domain of the processor. The processor then computes the volumes and
surface areas of the cells that are located in its processor domain. It also calculates the
time step required for computation on its own processor domain. This information
about the minimum time step on each processor domain is then communicated to the
processor with ID = 0. The processor with ID = 0 in turn determines the minimum
time step for the entire computational domain and broadcasts it to all the processors.
Message passing between processors is carried out by issuing explicit MPL calls.
For example
74
will send the array \qeast" of size (jmax kmax 3 r4size) to the processor with
a processor identication number equal to (taskid+1). Similarly, the call
will receive the array \qeast" of size(jmax kmax 3 r4size) from the processor
with a processor identication number equal to (taskid-1).
Since the whole simulation is a time dependent process, all processors are forced
to be synchronized after each Runge-Kutta stage with the MPL call:
CALL MP SY NC (allgrp)
which blocks all execution on all processors until all processors have made the cor-
responding call. For the far zone transformation to be implemented each processor
is required to compute the surface currents for that part of the far zone integration
surface that lies on it. Some processors have no part of the surface lying in their pro-
cessor domain whereas others may have a signicant part of the surface. This creates
minor problems of uneven load balancing. For two contiguous processors that have
the intersecting nodes lying on them, the far zone currents are computed from that
processor that has a lower ID. Once each of the relevant processors computes the mag-
netic and electric surface currents on the far zone integration surface, it then sums
their contributions to the potential arrays depending on the retarded times. Each
75
Processors
PID=0
PID=i Pre−processing
PID=0
Calculate time step Gather time steps
Calculate min time step
PID=0
Send potentials to PID=0
Calculate farzone Sum potentials based
potentials on retarded times
No Done
?
Yes
End
1
kmax
1
k
j 1−D Parallelization
2−D Parallelization
kmax
1
k
j
3−D Parallelization
k
j
Zone 6
Zone 3
Next the performance of the code was evaluated after switching on articial dis-
sipation. Fourth order articial dissipation requires that additional message passing
be performed since the fourth order derivative stencils require that an additional eld
point be communicated from adjacent domains on either side. Table 4.3 shows the
computational and communication times utilised in this case. In comparison to table
4.2 it is seen that there is a 5.8 percent increase in the computational time required
whereas the communication requirement has increased by 10.2 percent. The relative
increase in the amount of communication over the computations has lead to a slight
deterioration in the eciency ratio.
of the far zone integration surface dened in their respective processor domains.
13
11
10
3
4 6 8 10 12 14 16
No of processors
though the FDTD algorithm is much faster than the FVTD algorithm , it may not
be that attractive in the overall scheme of things.
It is also found from the data presented in table 4.5 that the message passing
paradigm is better suited to the FVTD algorithm over the data parallel mode. A
16-node SP-2 execution of the FVTD code was 5 times faster than a 32-node run
on the CM-5. This clearly indicates that the FVTD algorithm is more ecient as a
message passing based algorithm.
Chapter 5
RESULTS
The purpose of this chapter is to apply the various tools that have been devel-
oped and discussed in the previous chapters to simulate the process of wave propa-
gation and scattering from a variety of targets. The algorithm developed in chapter
2 is rst used to model the propagation of TEM (Transverse Electromagnetic) waves
in a rectangular domain. The validity, accuracy and eectiveness of the algorithm
is tested in calculating the scattered elds from a perfectly conducting sphere. The
results are compared with those obtained from the standard FDTD algorithm that
uses Yee stencils and the leapfrog method in time. Since the main reason for the
development of the algorithm was to analyse scattering from complex three dimen-
sional aircraft-like congurations, sections 5.3, 5.4, 5.5, 5.6 and 5.7 are devoted to
examine the eects of scattering from various components of an aircraft. In sections
5.3 and 5.4 the RCS is computed for an ogive and ellipsoid. These congurations
are typical of shapes used to represent the nose sections of aircrafts. In section 5.5
the elusive case of scattering from the NASA almond is considered. The results are
compared with available experimental data at all angles of incidence. Section 5.6 is
based on qualitatively analysing scattering for a trapezoidal wing. Lastly, the process
of scattering from an engine inlet is investigated in section 5.7
86
87
@Hx
= , 1 @Ez (5.3)
@t @y
@Hy
@t
= 1 @E
@x
z
(5.4)
For the case under consideration, the domain is bounded by perfectly conducting
boundaries at x = ,5 and x = 5. A Gaussian pulse having an Hy component was
excited over 20 cells in the middle of the domain.
= 21 e,6(x,ct) (5.5)
2
Hy
The Gaussian disturbance gives rise to two waves that are transmitted in the
domain in opposite directions. The left travelling wave has an Ez component and a
Hy component that are in phase whereas the right travelling wave has a Ez component
and a Hy component that are opposite in phase. A wave of this type, with the E and
H transverse to the direction of propagation is called a Transverse Electromagnetic
(TEM) wave. Figures 5.1 - 5.6 show the left and right travelling branches of the Ez
component solution at every 40th iteration. As is seen in gure 5.3 the waves reach
88
0.5
-0.5
-1
100
0 50
50
100 0
teristics of the Hy component after the wavefront reaches the re
ecting boundaries.
The TEM wave solution obtained with the Runge-Kutta algorithm is compared with
that obtained with the Yee algorithm. Figures 5.1 - 5.5 show that there is excellent
agreement between the two schemes for the Ez component..
0.25
0.2
0.15
0.1
0.05
0
-0.05
-0.1
-0.15
-0.2
-0.25
100
0 50
50
100 0
0.25
0.2
0.15
0.1
0.05
0
-0.05
-0.1
-0.15
-0.2
-0.25
100
0 50
50
100 0
0.25
0.2
0.15
0.1
0.05
0
-0.05
-0.1
-0.15
-0.2
-0.25
100
0 50
50
100 0
0.25
0.2
0.15
0.1
0.05
0
-0.05
-0.1
-0.15
-0.2
-0.25
100
0 50
50
100 0
0.1
0.05
-0.05
-0.1
100
0 50
50
100 0
0.0015
0.001
0.0005
100
0 10 50
20 30 40 50 60 70 80 90 0
0.0007
0.0006
0.0005
0.0004
0.0003
0.0002
0.0001
0
-0.0001
100
0 10 50
20 30 40 50 60 70 80 90 0
0.0007
0.0006
0.0005
0.0004
0.0003
0.0002
0.0001
0
-0.0001
100
0 10 50
20 30 40 50 60 70 80 90 0
0.0007
0.0006
0.0005
0.0004
0.0003
0.0002
0.0001
0
-0.0001
100
0 10 50
20 30 40 50 60 70 80 90 0
0.0007
0.0006
0.0005
0.0004
0.0003
0.0002
0.0001
0
-0.0001
100
0 10 50
20 30 40 50 60 70 80 90 0
0.0015
0.001
0.0005
100
-0.0005
0 10 50
20 30 40 50 60 70 80 90 0
0.15
0.1
0.05
-0.05
-0.1
-0.15
-0.2
-0.25
-5 -4 -3 -2 -1 0 1 2 3 4 5
0.25
Yee
R-K
0.2
0.15
0.1
0.05
-0.05
-0.1
-0.15
-0.2
-0.25
-5 -4 -3 -2 -1 0 1 2 3 4 5
0.15
0.1
0.05
-0.05
-0.1
-0.15
-0.2
-0.25
-5 -4 -3 -2 -1 0 1 2 3 4 5
0.15
0.1
0.05
-0.05
-0.1
-0.15
-0.2
-0.25
-5 -4 -3 -2 -1 0 1 2 3 4 5
0.06
0.04
0.02
-0.02
-0.04
-0.06
-0.08
-0.1
-5 -4 -3 -2 -1 0 1 2 3 4 5
= t , (~r r^) =c
and
0 < < 2 t
!2
=
4
t
and is a user dened parameter that dictates the width of the pulse. The
value of used for the FVTD run in this case was 756. It should be mentioned at
this point that the FVTD results are being compared to the results obtained with
the PSU-FDTD code [11] that employs the Yee algorithm. An equivalent of 64 was
used in the FDTD run. (The FDTD run was made with a cell size of 0.01m or ten
cells per radii) The dierence in the value can be attributed to the dierence in
95
incident
x x θ x
direction
algorithm captures the creeping wave [64] behind the main component of the re
ected
wave. Figure 5.21 show the comparison of the Ez eld component at locations 0.04m
from the surface of the sphere and = 90 and 180 respectively. All four near zone
comparisons are found to be in very good agreement.
Figure 5.22 is a comparison of the backscatter cross-section against frequency
of the FVTD and FDTD algorithms and the exact solution. The exact solution for
the monostatic radar cross section can be written as
j E s j2 1 (,1)n (2n + 1) 2
2 X
3d = rlim 2
!1 4r jEi j2 = 4 n=1 H (5.7)
and
@ H^ n(2) (kR)
H^ n(2) (kR) =
0
@ (kR)
where
H^ n(2) (kR) is the spherical Hankel function of the second k ind
is the wavelength
k is the wavenumber
R is the radius of the sphere
The comparison between the exact solution and the FVTD solution is excep-
tionally good especially at low frequencies and is seen to be an improvement over the
FDTD solution.
The time evolution of the scattering process is depicted in gures 5.23 and 5.24.
Figure 5.23 shows the scattering process evolving in a plane perpendicular to the
direction of incidence whereas gure 5.24 is along a tangential plane to the direction
of incidence. Both gures depict the scatter waves emanating from the surface of the
sphere and travelling outwards towards the outer boundary.
98
1.5
1
E_z Amplitude
0.5
-0.5
-1
-1.5
-2
-2.5
0 1 2 3 4 5 6
TIME (x 1.0e-9 secs)
0.5
E_z Amplitude
-0.5
-1
-1.5
-2
0 1 2 3 4 5 6
TIME (x 1.0e-9 secs)
Figure 5.20. Scattered eld comparison for the sphere 0.02m and 0.04m in the backscat-
ter direction.
99
1.5
0.5
0
E_z
-0.5
-1
-1.5
-2
-2.5
-3
0 0.5 1 1.5 2 2.5 3 3.5 4
time (Nano secs)
0.5
0
E_z
-0.5
-1
-1.5
-2
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
time (Nano secs)
Figure 5.21. Scattered eld comparison for the sphere at = 90 and = 180 in the
backsc atter direction.
Backscatter RCS vs Frequency for PEC Sphere
10
Exact Soln
FDTD Soln
FVTD Soln
0
-10
-20
-30
RCS (Dbsm)
-40
-50
Figure 5.23. Scattering from the sphere in a plane perpendicular to the direction of
incidence.
103
Figure 5.24. Scattering from the sphere in a plane parallel to the direction of incidence.
104
= t , (~r r^) =c
and
0 < < 2 t
!2
=
4
t
where is chosen such that the frequency content of the incident pulse extends up
to 5GHz. The direction of propagation of the incident pulse is parallel to the major
axis of the ogive. In gure 5.27 the co-polarized RCS is plotted versus frequency for
the FVTD and FDTD algorithms. The FDTD results have been obtained in this
106
5
4.5
FDTD
FVTD
4
Backscatter RCS vs Frequency for PEC Ogive
3.5
Frequency (GHz)
2
1.5
1
0.5 2.5 3
-20
-25
-30
-35
-40
-45
-50
-55
-60
-65
-70
RCS (Dbsm)
X Y
X Y
-25
ellipsoid
ogive
-30
-35
-40
RCS (Dbsm)
-45
-50
-55
-60
-65
-70
0.5 1 1.5 2 2.5 3 3.5
Frequency Ghz
109
110
Up to this point in the thesis, all the geometries considered have been relatively
simple three dimensional shapes. In the next few sections geometries with more
complexity will be studied for the scattering eects produced when subject to elec-
tromagentic excitation. The rst of these is the NASA almond which will be taken
up in the following section.
,0:416667d x 0
x 2 ] 21 cos
y = 0:193333d [1:0 , ( )
0:416667
x
z = 0:064444d [1:0 , ( )2] 12 sin
0:416667
0 x 0:583338d
x 2 , 0:96] 21 cos
y = 4:83345d [1:0 , ( )
2:08335
x
z = 1:611148d [1:0 , ( )2 , 0:96] 21 cos
2:08335
111
0.53338 d 0 -0.41667 d
d = 9.936 in
Figure 5.35. Grid depicting loss of orthogonality around the NASA almond.
o PEC o
0 180
Almond phi
different directions
of incidence
-10
EXPERIMENTAL
FVTD
-15
-20
-25
RCS (Dbsm)
-30
-35
-40
-45
-50
0 20 40 60 80 100 120 140 160 180
angle (phi)
117
118
in gure 5.46 depicts the zonal distribution in this case along a planar slice. As is
evident outside the cowl this grid is rather coarse. This grid has been used in the
solution of aeroacoustics and aerodynamic type problems. However, there are certain
limitations when the grid is used to solve for electromagnetic elds. Low frequency
excitations would lead to the formations of creeping waves, that would not be cap-
tured as the grid does not completely enclose the engine. High frequency excitation,
on the other hand would be dicult to resolve as one moves away from the inlet and
the center body.
A sinusoidal excitation of 500 Mhz was used in this case. Compared to the
diameter of the inlet cowl and the grid used, this excitation is to be high frequency.
The incident wave travels along the axial direction of the engine and is linearly po-
larized along the transverse direction. Although the geometry is axisymmetric the
scattering problem is not. Figure 5.47 is a Fore Looking Aft view of the scattering
patterns on equi-angular planes along the rotational direction at three dierent axial
locations of the grid that are marked out in Figure 5.47. As is illustrated in Figure
16 the intensity of the scattered elds is greater in planes that have a component of
the incident polarisation tangential to them.
Figure 5.48 shows the scattered Ez eld pattern along a planar slice of the grid.
The scattered eld pattern is clearly seen close to the center body and around the
inlet. Beyond that the grid is very course and the disturbance appears to be damped.
There appear to be strong interference eects from within the cavity between the
center body and the shroud, and currents seem to oscillate back and forth between
the center body and the fan.
126
3 8
1 7
2 1
4 3
6 5
-2 -1 0 1 2 3
Figure 5.47. Scattered Ez elds for dierent zones of the engine inlet.
128
10
0
-5 0 5
Figure 5.48. Scattered Ez eld for a planar slice of the engine inlet.
Chapter 6
was found to be more attractive. The message passing approach provided more con-
trol and
exibility in distributing data over the computational resource and load
balancing. The far eld transformation too, was better suited to a message passing
environment. The dual nature of the grid aided in parallelization, since only one of
the elds had to be communicated between processors. There is, however, an overlap
of one layer of cells between domains that is a computational overhead due to the
elds being computed for the very same dual grid points in contiguous domains. The
savings in message passing more than osets this marginal increase in computational
work.
Unlike algorithms that use stair-stepped grids in computational electromagnet-
ics, the algorithm developed and discussed in this thesis has shown to give good RCS
predictions for bodies with varying degrees of curvature, including those bodies with
sharp and rounded edges. In certain cases, favourable results have also been obtained
for grids that show poor orthogonality at the surface of the scatterer.
Although the nite volume algorithm is found to be more expensive than con-
ventional FDTD algorithms that are leapfrogged in time and space and use Cartesian
meshes, the number of cells required to model moderately complex to complex bod-
ies is normally much less for the nite volume algorithm, thus making them more
attractive in the overall scheme of things.
Perhaps, it should be pointed out that the purpose for the development of the
FVTD algorithm was to complement the FDTD eort. Simple shapes can be very
eciently modelled using FDTD and the FVTD algorithm would be a very expensive
and impractical option in such cases. The Cartesian meshes required for FDTD mod-
elling are relatively easier to generate than the curvilinear meshes needed for FVTD
modelling. Wave propagation properties are better simulated on regular Cartesian
131
[69] [70] [71] [72] [57] [60] [73] [56] [74] [1] [36] [75] [62] [76] [77] [66] [78] [46] [79]
[19] [20] [18] [35] [80] [25] [81] [82] [54] [55] [83] [73] [48]
REFERENCES
[4] Shankar, V., Hall, W. F., Mohammadian, A., Rowell, C., and Palaniswamy, S.
Advances in time-domain cem using structured/unstructured formulations and
massively parallel architectures. AIAA Paper 95-1963. 26th AIAA Plasmady-
namics and Lasers Conference, 1995.
[5] Rayleigh, J. W. S. Theory of Sound Vol 1 and 2. Dover Publications Inc., 1945.
[6] Fuhs, A. E. The No-See-Um Book Radar Cross Section Lectures. American
Institute of Aeronautics and Astronautics, 1988.
133
134
[9] Moore, J., and Pizer, R. Moment Methods in Electromagnetics. Wiley, New
York, 1984.
[11] Kunz, K. S., and Luebbers, R. J. The Finite-dierence Time Domain Method
for Electromagnetics. CRC Press, 1993.
[12] Nguyen, B. T., and Hutchinson, S. A. The upwind leapfrog algorithm scheme for
3-d electromagnetic scattering and its implementation on two massively parallel
computers. 10th Annual Review of Progress in Computational Electromagneti
cs, 1994.
[14] Ahuja, V., and Long, L. N. A fvtd algorithm for maxwell's equations on mas-
sively parallel machines. 11th Annual Review of Progress in Computational
Electromagneti cs, 1995.
[15] Noack, R. W., and Anderson, D. A. Time domain solutions of maxwell's equa-
tions using a nite volume formulation. AIAA Paper 92-0451. 30th AIAA
Aerospace Sciences Meeting and Exhibit, 1992.
135
[16] Rowell, C., Shankar, V., Hall, W. F., and Mohammadian, A. Advances in
time-domain cem using massively parallel architectures. 11th Annual Review
of Progress in Computational Electromagneti cs, 1995.
[18] Ambrosiano, J. J., Brandon, S. T., Lohner, R., and DeVore, C. R. Electro-
magnetics via the taylor-galekin nite element method on unstructured grids.
Journal of Computational Physics, 110 , pp. 310{319, 1994.
[20] Lee, R. L., and Madsen, N. K. A mixed nite element formulation for maxwell's
equations in the time domain. Journal of Computational Physics, 88 , pp. 284{
304, 1990.
[26] Yee, K. S., and Chen, J. S. Hybrid nite-dierence time domain and nite volume
time domain in solving maxwell's equations. 11th Annual Review of Progress in
Computational Electromagneti cs, 1995.
[29] Lohner, R., and Parikh, P. Three dimensional grid generation by the advancing
front method. Int Journal of Numerical Methods in Fluids, No 8, pp. 1135{1149,
1988.
[30] Frink, N. T., Parikh, P., and Pirzadeh, S. A fast upwind solver for the euler equa-
tions on three-dimensional unstructured meshes. AIAA 91-0102. 29th Aerospace
Sciences Meeting and Exhibit, 1991.
[31] Weinberg, Z., and Long, L. N. A massively parallel solution of the three dimen-
sional navier-stokes equations on unstructured adaptive grids. AIAA 94-0760.
32nd Aerospace Sciences Meeting and Exhibit, 1994.
137
[32] Barth, T. J., and Jespersen, D. C. The design and application of upwind schemes
on unstructured meshes. AIAA 89-0366. 27th Aerospace Sciences Meeting and
Exhibit, 1989.
[33] Halt, D. W., and Agarwal, R. K. Compact higher order characteristic-based euler
solver for unstructured grids. AIAA Journal, Vol 30 No 8, pp. 1993{1999, 1992.
[36] Freeman, T. L., and Phillips, C. Parallel Numerical Algorithms. Prentice Hall,
1992.
[37] Liu, Z. M., Mohan, A. S., Aubrey, T. A., and Belcher, W. R. Parallelized fdtd
for antenna radiation pattern calculations. 11th Annual Review of Progress in
Computational Electromagneti cs, 1995.
[38] Shang, J. S., Calahan, D. A., and Vikstorm, B. Performance of a nite vol-
ume cem code on multicomputers. AIAA Paper 94-0236. 32nd AIAA Aerospace
Sciences Meeting and Exhibit, 1994.
[39] Ahuja, V., and Long, L. N. A message passing nite volume algorithm for
maxwell's equations on parallel machines. AIAA Paper 95-1967. 26th AIAA
Plasmadynamics and Lasers Conference, 1995.
138
[42] Ramo, S., Whinnery, J. R., and Duzer, T. V. Fields and Waves in Communica-
tion Electronics. Wiley, 3 edition, 1993.
[44] Pierce, A. D. Linear acoustics. Encyclopedia of Applied Physics, Vol 1, pp. 137{
182, 1991.
[45] Chyczewski, T. S., and Long, L. N. A higher order accurate parallel algorithm
for aeroacoustic applications. AIAA Paper 94-2265, 1994.
[46] Ozyoruk, Y., and Long, L. N. A navier-stokes/kirchho method for noise radi-
ation from ducted fans. AIAA Paper 94-0462. 32th AIAA Aerospace Sciences
Meeting and Exhibit, 1994.
[50] Jameson, A., Schmidt, W., and Turkel, E. Numerical solutions of the euler
equations by nite volume methods using runge-kutta time-stepping schemes.
AIAA Paper 81-1259. 14th Fluid and Plasma Dynamics Conference, 1981.
[51] Neumann, J. V., and Richtmyer, R. D. A method for the numerical calculations
of hydrodynamical shocks. Journal of Mathematical Physics, 21, pp. {, 1950.
[53] Agarwal, R. K., and Deese, J. E. Transonic wing-body calculations using euler
equations. AIAA 83-0501. 21st Aerospace Sciences Meeting and Exhibit, 1983.
[54] Mur, G. Absorbing boundary conditions for the nite-dierence approximation
of the time-domain electromagnetic-eld equations. IEEE Transactions on Elec-
tromagnetic Compatibility, EMC-23 No. 4 , pp. 377{382, 1981.
[55] Engquist, B., and Majda, A. Absorbing boundary conditions for the numerical
simulation of waves. Mathematics of Computation, 31 No. 139 , pp. 629{651,
1977.
[56] Liao, Z. P., Wong, H. L., Yang, B., and Yuan, Y. A transmitting boundary for
transient wave analyses. Scientia Sinica, 27 No. 10 , pp. 1063{1076, 1984.
[57] King, R. W. P., and Jr., C. W. H. Antennas and Waves: A Modern Approach.
The MIT Press, 1969.
[59] Williams, F., and Hawkings, D. L. Sound generated by turbulence and surfaces
in arbitrary motion. Philosophical Transactions of the Royal Society, 264A ,
pp. 321{342, 1969.
[61] Luebbers, R. J., Kunz, K. S., Schneider, M., and Hunsberger, F. A nite-
dierence time-domain near zone to far zone transformation. IEEE Transactions
on Antennas and Propagation, 39, No. 4 , pp. 429{433, 1991.
[62] CM Fortran Libraries Reference Manual - Version 2.1. Thinking Machines Cor-
poration, 1994.
[63] Wesley, R., Wu, E., and Calahan, D. A. A massively parallel navier-stokes
implementation. AIAA Paper 89-1940, 1989.
[66] Woo, A. C., Wang, H. T. G., and Schuh, M. J. Benchmark radar targets for
the validation of computational electromagnetics programs. IEEE Antennas and
Propagation Magazine, 35, No. 1 , pp. 84{89, 1993.
[67] Long, L. N., Khan, M. M. S., and Sharp, H. T. Massively parallel three-
dimensional euler/navier-stokes method. AIAA Journal, vol 29, no. 5, pp. 657{
666, 1991.
141
[69] Wang, H. T. G., Sanders, M. L., and Woo, A. Radar Cross Section Measurement
Data on Low-Cross-Section Targets Part-1. Report NWC TM 7002, 1991.
[71] Hirsch, C. Numerical Computation of Internal and External Flows Volume 1 and
2. John Wiley and Sons, 1988.
[72] Mott, H. Polarization in Antennas and Radar. John Wiley and Sons, 1986.
[73] Freund, J. B., Lele, S. K., and Moin, P. Calculation of the radiated sound eld
using an open kirchho surface. CEAS/AIAA 95-061. First Joint CEAS/AIAA
Aeroacoustics Conference, 1995.
[75] Koelbel, C. H., Loveman, D. B., Schreiber, R. S., Jr., G. L. S., and Zosel, M. E.
The High Performance Fortran Handbook. The MIT Press, 1994.
[76] IBM AIX Parallel Environment Programming Primer - Version 2.0. IBM Cor-
poration, 1993.
[77] IBM AIX Parallel Environment Parallel Programming Reference - Version 1.0.
IBM Corporation, 1994.
142
[79] Bhattacharyya, A. K., and Sengupta, D. L. Radar Cross Section Analysis and
Control. Artech House, 1991.
[80] Ta
ove, A., and Umashankar, K. R. The nite-dierence time-domain (fd-td)
method for electromagnetic scattering and interaction problems. Journal of Elec-
tromagnetic Waves and Applications, 1 , pp. 243{267, 1987.
[81] Chew, W. C. Waves and Fields in Inhomogeneous Media. Van Nostrand Rein-
hold, 1990.
[83] Ozyoruk, Y., and Long, L. N. A navier-stokes/kirchho method for noise radi-
ation from ducted fans. AIAA 94-0462. 32nd Aerospace Sciences Meeting and
Exhibit, 1994.
Appendix A
@Q + @E + @F + @G = S (A.1)
@t @x @y @z
where the solution vector Q can be written as
2 3
66 H x 77
66 77
66 H y 77
66 77
6 H 77
Q = 66 z
77 (A.2)
66 E 77
66 x
77
66 E 77
64 y
5
E z
66 7 66 7 66 77
66 , 1 Ez 777 66 0 777 66 1
E x 77
66 77 66 7 66 7
6 1
E 77 6 , 1 Ex 777 6 0 777
E = 66 y
7 F = 66 7 G = 66 77 (A.3)
66 0 777 66 , 1 Hz 777 66 1
H 77
66 77 66 7 66 y
7
66 1
H 77 66 0 777 66 , 1 Hx 777
64 z
5 64 5 64 5
,H 1
y
1
H
x 0
143
144
while the source vector on the right hand side can be written as
2 3
66 0 77
66 7
66 0 777
66 7
6 0 777
S = 66 7 (A.4)
66 , 1 Jx 777
66 7
66 , 1 Jy 777
64 5
,J 1
z
@ Q~ + @ E~ + @ F~ + @ G~ = S~ (A.5)
@t @ @ @
where
Q~ = QJ (A.6)
E~ = J1 ( E + F + G)
x y z (A.7)
F~ = J1 ( E + F + G)
x y z (A.8)
G~ = J1 ( E + F + G)
x y z (A.9)
and J = @x;y;z
@;;
is the determinant of Jacobian geometric transformation matrix from
the physical to the computational space.
For an arbitrary direction of wave propagation n the system of Maxwell's equa-
tions can be written as
145
@Q + @F = S (A.10)
@t @n
where Q is dened as earlier and F is now dened as
2 3 2 3
66 H
x 77 66 1
(Ez ny , Ey nz ) 77
66 77 66 77
66 H
y 77 66 1
(Exnz , Ez nz ) 77
66 77 66 77
6 H 77 6 1
(Ey nx , Exny ) 77
Q = 66 z
77 F = 66
77 (A.11)
66 E 77 66 1
(Hy nz , Hz ny ) 77
66 x
77 66
77
66 E 77 66 1
(Hz nx , Hxnz ) 77
64 y
5 64
5
E
z
1
(Hxny , Hy nx)
The Jacobian matrix of F with respect to the independent variables in Q can
be written as 2 3
66 0 0 0 0 ,nz ny
77
66 7
66 0 0 0 nz 0 ,nx 777
66 7
66 0 0 0 ,ny nx 0 777
A=6 77 (A.12)
66 0 nz ,ny 0 0 0 77
66
77
66 ,nz 0 nx 0 0 0 77
64
5
ny ,nx 0 0 0 0
" #
@ c @ D H + @H c @D = ,J y x x
(A.14)
@t @x c @y @z z z