1993.Optimal Nonuniform Signaling for Gaussian Channels
1993.Optimal Nonuniform Signaling for Gaussian Channels
Abstract- Variable-rate data transmission schemes in which Maxwell-Boltzmann distributions [3], [4] or as Gibbs ensem
constellation points are selected according to a nonuniform prob bles [5], is the focus of this paper.
ability distribution are studied. When the criterion is one of
Nonuniform signaling is closely related to the notion of
minimizing the average transmitted energy for a given aver
age bit rate, the best possible distribution with which to select
constellation shaping in coded modulation (as described in,
constellation points is a Maxwell-Boltzmann distribution. In e.g., [2], [6]-[9]). Constellation shaping can provide an energy
principle, when constellation points are selected according to savings called shaping gain in addition to the usual coding
a Maxwell-Boltzmann distribution, the ultimate shaping gain gain provided by lattice- or trellis-coding. Indeed, the gain G
(1fe/6 or 1.53 dB) can be achieved in any dimension_ Nonuni provided by a coded modulation system (relative to a simple
pulse amplitude modulation baseline) operating at a bit rate f3
form signaling schemes can be designed by mapping simple
variable-length prefix codes onto the constellation. Using the
HutTman procedure, prefix codes can be designed that approach (bits per two-dimensional channel-use) is well-approximated
the optimal performance. These schemes provide a fixed-rate by writing
primary channel and a variable-rate secondary channel, and are
easily incorporated into standard lattice-type coded modulation
schemes.
Index Terms-Signal constellations, maximum entropy princi where "Ie and "Is denote, respectively, the coding gain [10]
ple, shaping gain, coded modulation, Huffman codes. and shaping gain [2] of the scheme in question. As will be
shown, the discretization factor 1 .:.c 2-/3 properly adjusts the
gain for finite bit rates. The connection between shaping and
1. INTRODUCTION nonuniform signaling arises from the fact that, in schemes
that employ shaping, a nonuniform distribution is induced on
I
N THE CONVENTIONAL apprQach to data transmission,
the points of the low-dimensional constituent constellation. By
each point in a given constellation is equally likely to be
applying nonuniform signaling directly (rather than indirectly
transmitted. While tllis approach yields the maximum bit rate
via constellation shaping), nonuniform signaling can, in any
for a given constell�tion size, it does not take into account the
dimension, achieve the ultimate shaping gain-7re/6 or 1.53
energy cost of the various constellation points. In this paper,
dB-attainable with uniform signaling only in the limit of
the idea of choosing constellation points with a nonuniform
infinite dimension [2]. Indeed, one of the principal results
probability distribution is explored. Such nonuniform signaling
of this paper is a method of designing simple nonuniform
will reduce the entropy of the transmitter output, and hence
signaling schemes that approach this ultimate level.
the average bit rate. However, if points with small energy
It is important to note at the outset, however, that prac
are chosen more often than points with large energy, energy
tical implementation of direct nonuniform signaling will be
savings may (more than) compensate for this loss in bit rate.
hampered by the variable transmission rate of such schemes.
It follows immediately from the maximum entropy prin
Transmitting data obtained from a fixed-rate source requires
ciple (see, e .g., [1, ch. 11D, or by variational calculus as
data buffering at the transmitter and the receiver, which leads
in [2, Section IV-B], that the probability distribution that
to the problem of coping with buffer over- or underflow.
maximizes entropy for a fixed average energy is one in which
Furthermore, since the transmitted signals represent variable
a constellation point r, with energy IIr112, is chosen with
probability p(r) <X exp (->-llrI12), where the nonnegative
numbers of bits, channel errors may cause the insertion and
deletion of bits in the decoded data, causing potential losses
parameter >- governs the trade-off between bit rate and average
of synchronization. Although these system problems will tend
energy. Nonuniform signaling with this family of distributions,
to limit the broad applicability of nonuniform signaling, we
well known in statistical mechanics and thermodynamics as
do not attempt to provide solutions to these problems in this
Manuscript received December 28, 1990; revised August 20, 1992. This paper.
work was supported in part by the Natural Sciences and Engineering Research Instead, our aim in this paper is 1) to provide insight
Council of Canada, and by the Government of Ontario through an Ontario into nonuniform signaling schemes and how they relate to
Graduate SchOlarship. This paper was presented in part at the 1991 Tirrenia
International Workshop on Digital Communications, Tirrenia, Italy, September conventional signaling schemes, 2) to assess the potential gains
8-;12, 1 991, and at the 16th Biennial Symposium on Communications, that nonuniform signaling may provide, and '3) to provide
Kingston, ON, Canada, May 27-29, 1992. a method (via the Huffman algorithm) by which simple,
The authors are with the Department of Electrical Engineering, University
of Toronto, Toronto, ON, Canada M5S J A4.
near-optimal, nonuniform signaling schemes may be designed.
IEEE Log Number 9205742. Such nonuniform signaling schemes provide a standard, fixed-
001 8-9448/93$03.00 © 1993 IEEE
914 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 39, NO.3, MAY 1993
is usually finite, but not necessarily so (as in the case of an where, as in [2), we have normalized to 2-D. For the special
infinite lattice). Often, 0 will be obtained as the intersection case of a uniform probability distribution over a finite N
of a lattice A (or a translate a + A) with a finite region IR, D constellation 0, the bit rate is (2/N) IOg 2 101 bits/2-D
in which case we denote the constellation by O(A, IR). The channel-use, and this is the maximum bit rate for the given
energy (squared norm) of a point rEO is denoted by Ilr112. We constellation. A signaling scheme with a normalized bit rate
shall usually assume that the transmitter produces a sequence of f3 can send roughly f3 bits/s/Hz when implemented with a
of symbols drawn independently from the constellation and QAM (quadrature amplitude modulation) modem (which sends
that the symbols are selected at some regular symbol rate. sequences of 2-D signalsV
The probability with which the transmitter selects a point Transmitter power is proportional to the average energy
r E O is denoted by per). As usual in the study of data per transmitted symbol. The normalized average energy per
transmission schemes, we are concerned with trade-offs among symbol per two dimensions is given by
three parameters: reliability, bit rate, and transmitter power. 2 2
E �
A. Reliability, Bit Rate, Transmitter Power
N LP(r)l!rI1
rEO
. (4)
(2) where
to note that N, unlike d�in' is affected by the probability with (This latter factor is easily evaluated via a convenient approxi
which constellation points are selected. mation for Q-l(x) due to Hastings [22) given in Abramowitz
In information-theoretic terms, the transmitter is a discrete and Stegun [23, section 26.2.23).) For small values of x,
memoryless source whose output alphabet is the set of points
(6)
in the constellation. The (average) bit rate is equal to the
entropy of the transmitter in bits per transmitted symbol. In and hence the asymptotic gain
formal terms, a scheme with a constellation n, in which the
symbol r is selected independently with probability p(r), has G � lim G(p) = CFMl/CFM2
p--->o
bit rate
is the ratio of constellation figures of merit, and is not degradation is even less. As mentioned, we prefer to focus on
affected by error coefficient values. At nonzero values of p, the asymptotic gain, which is unaffected by variations in the
the asymptotic gain will be affected by the IN factor in (5). error coefficient, but we caution the reader to note that the
In particular, when N 1 > N2, the asymptotic gain will be error coefficient must be accounted for in estimating the gain
reduced. When p is small, we can again use (6) and write at nonzero error probabilities.
Suppose now that the constellation n is obtained from an
infinite ND lattice A, and that a point r E n is selected with
to estimate this effect. probability p( r). If n I- A, it is convenient to extend the dis
For bit rates (3 � 2 bits/2-D channel-use, we take as a tribution p(r) to all the points of A, simply by assigning zero
baseline for comparison the CFM obtainable with a simple 1- probability to any points not in n. Let VeAl be the volume of
D PAM (pulse amplitude modulation) constellation. Assuming a fundamental region [24] of A, i.e., the volume of N-space
that constellation points are selected with equal probability, associated with each lattice point, and let 2N(3(p)/2V (A) be
this baseline has figure of merit given by CFMEll 6/(2(3 -1). the "entropy volume" of A with distribution per). We can
then write G(d�in' (3, E) (8) as
=
we wish to study bandwidth-efficient schemes, i.e., those with is small, the total gain is approximately separable into the
large (3, this restriction will pose no problem, and we assume product of a coding gain and a shaping gain. The coding gain
it to hold throughout this paper. Ic (A), a geometric property of the lattice A studied in the
While we will be interested primarily in asymptotic gain in coded modulation literature and elsewhere [10], [11], [24], is
this paper; we can use (5) to estimate the effect of the error independent of the probability distribution per) used to select
coefficient at nonzero error probability values. In Appendix constellation points and therefore not of central interest in this
A, we show that a cubic N-D constellation based on 7LN paper. The shaping gain IS(P) is largely independent of the
under uniform signaling and supporting a bit rate (3 (Le., underlying lattice A, except insofar as the lattice restricts the
an N-D baseline constellation) has error coefficient NEll =
distribution p(r); this is why we have suppressed an explicit
2N(1- 2-f3/2). Thus, in two dimensions and for large values .dependence on A in our notation. In general [2], 'Ys(p) ;:;:: 7re/6.
of (1, NA ,:::; 4. Using (7), we estimate that every factor of As we shall see, by choosing the distribution p(r) to be
two increase in the error coefficient (over the baseline of 4) the Maxwell-Boltzmann distribution, 18(P) can be made to
reduces the asymptotic gain by about 0.2 dB for p on the order approach the ultimate shaping gain of 7re/6 in any dimension.
of 10-6, in agreement with the rule of thumb given by Forney
[10, p. 1142]. (A plot of IN in dB versus log2 N suggests C. Other Constellation Parameters
that a loss of 0.22 dB per error coefficient doubling is a fairly Often, higher-dimensional constellations n are obtained as
accurate rule, up to about N = 64, at p =10-6. Similarly, subsets drawn from Cartesian products of lower-dimensional
at p = 10-8, the loss is about 0.17 dB per doubling in error "constituent" constellations. When the dimension N of n is
coefficient.) even, we may define the constituent 2-D constellation of n
It is important to note that when the inner points of a as the smallest 2-D constellation n2 such that n c n�/2,
constellation are selected morc' often than the outcr points, where n �/2 is the Nl2-fold Cartesian product of n
the error coefficient will increase, because the inner points The constituent 2-D constellation O plays an important 2 with itself.
2 role
tend to have greater Nmin than the outer points. For example, when the signaling scheme is to be implemented with a QAM
consider a constellation drawn from the 7L2 lattice, for which modem, since all transmitted signals are obtained as sequences
N :S 4. In light of the previous paragraph, we can estimate of QAM signals drawn from n .
the maximum loss in gain by 2
If n is odd-dimensional, then n2 is even-dimensional and
can be implemented with a QAM modem. This suggests
that we may define the constituent 2-D constellation of n
for error rates on the order of 10-6. For /3 � 2, when the as the constituent 2-D constellation of n2. In particular, if
baseline has N = 2, this maximum possible degradation is N 1, this implies n2
= n2. For later use, we note that the
=
quite large (0.22 dB) relative to the maximum achievable constituent 2-D constellation of IBN(R), an N-ball of radius
shaping gain (1.53 dB); however, for (3 ,:::; 6, the maximum R centered at the origin, is a 2-D disk 1B2(R) when N is even,
degradation is only about 0.04 dB. At smaller error rates, the and is a square lBi(R) of side 2R when N is odd.
KSCHISCHANG AND PASUPATHY: OPTIMAL NONUNIFORM SIGNALING FOR GAUSSIAN CHANNELS 917
As discussed in [2] and [25], an important parameter in where the partition function Z(A) is chosen to normalize the
the design of a signaling scheme is the 2-D "constellation distribution, i.e.,
expansion ratio"
Z(,X) g L exp( -AllrW), ,X ::::: O. (12)
rEn
C A
2 AE('x)
11.2 such that 11.2 C AIJ . Similarly the constituent 2-D region = log2 Z(>.) +�. (14)
of IR is the smallest 2-D region 1R2 such that R2 C IRIJ. If N
N 1, then 11.2
= 11.2 and 1R2 1R2. It is clear that n2 is
= =
J 0.3
Z(A) = II Z;(A), (15)
i=l
4 6
where Zi(A) is the partition function over the ith factor con 6 (bits/2D
2.:r,Ef!, exp (-AllrdI2). When (15) .
channel-use)
stellation, i.e., Zi(A) =
holds, the Maxwell-Boltzmann distribution with parameter A Fig. 2. Normalized gain of spherical constellations drawn from various
dense lattices with signal point selection performed accor d i ng to the
is separable into the product of Maxwell-Boltzmann distri Maxwell-Boltzmann distribution.
butions over the factor constellations, each with parameter A.
In practice, this means that optimal nonuniform signaling can
be implemented on separable constellations by indepcndently the separability property of the Maxwell-Boltzmann distribu
implementing nonuniform signaling on each of the factor tion, the results we obtain for these spherical constellations are
constellations. From (13) and (14) it follows that E and (3 also applicable to those nonspherical constellations that can be
can be obtained by a weighted average of the corresponding expressed as Cartesian products of spherical constellations. For
factor constellation quantities, i.e., E 2.::=1 RiNi/N and
=
example, N -cube shaped constellations based on the integer
lattice lLN are Cartesian products of simple 1-D "spherical"
f3 2.:{=1 (3iN;jN.
constellations based on lL.
=
signaling. An infinite lattice can support any positive bit rate corresponds to thc "classical" case of a uniform distribution.)
and any positive average energy with a Maxwell-Boltzmann Curves corresponding to constellations drawn from the same
distribution. latticc but extending further to the right, i.e., to larger bit rates,
correspond to larger constellations. Also plotted in Fig. 2 is
the function
V. SPHERICAL CONSTELLATIONS
In this section, we apply the Maxwell-Boltzmann dis U((3) � (1fe/6)(1- T(3). (16)
tribution to spherical constellations based on the densest
known lattices in various numbers N of dimensions. In one As we shall explain in Section VI, for bit rates /3 greater than
dimension, the constellations are based on the integer lattice lL, about 2.5, U(fJ) forms the "upper envelope" of the gain curves,
while in two dimensions, the constellations are based on A2, to good approximation.
thc hexagonal lattice. In higher dimensions, the constellations Fig. 2 has several noteworthy features. First, notice that the
are based on the lattices denoted D4, E6, Ell, and K12; the gain obtained by nonuniform signaling with a constellation can
subscript in this notation displays the number of dimensions significantly exceed that provided under uniform signaling, at
N. Basis vectors and extensive theta series tables for these the expense of a reduction in bit rate. The additional gain
lattices are given in [24, ch. 4J. Each constellation is obtained provided under nonuniform signaling is called the "biasing
by taking some number M of the points of smallest energy in gain" [7].
the lattice, where M is chosen so as to include some integral The curves corresponding to large constellations tend to
number of lattice shells. Equivalently, we may think of the merge with curves corresponding to smaller constellations as
constellations as being obtained by forming the intersection A is increased. This happens because the smaller constellations
of the infinite lattice with an N -sphere centered at the origin; are subconstellations of the large constellations. As A is
hence the term spherical constellations? We note that, due to increased, the outer points of the large constellations are
selected very infrequently, so that, in effect, these outer points
3 Caution: some authors reserve this term La refer to constellations in which
all points are on the surface of a sphcre; in this paper, spherical constellations can be neglected and the large constellation "shrinks" into a
do, in general, include interior points as well. smaller constellation.
KSCHISCHANG AND PASUPATHY: OPTIMAL NONUNIFORM SIGNALING FOR GAUSSIAN CHANNELS 919
Note also that each curve tends to merge with the U((3) small a, we expect the summation on the right-hand side of
curve. Comparing (16) to (9), we see that this merging implies (17) to be a good approximation to the integral on the left
that the shaping gain under nonuniform signaling approaches hand side of(17). Setting A aA (a small), we obtain from
= *
the ultimate limit of 7r e /6 as A becomes large, and that this (17) the general continuous approximation
limit is obtained independently of dimension.
Fig. 2 also illustrates the "law of diminishing returns" L
rEO(A, R)
fer) � V(A)-l �fCr)dYer). (18)
governing the biasing gain. Recall that the rightmost point
of each curve (A = 0) in the graph corresponds to uniform For example, setting fer) = 1, yields the approximation
signaling. We see th�t, for constellations having dimension
greater than unity, some "initial" gain is available under In(A, R)I =
L 1 � V(II\l)/V(A), (19)
uniform signaling. Furthermore, the initial gain increases with rEO(A, R)
increasing dimension. This initial gain is, of course, the
where VCR) denotes the volume of the region R. This is
shaping gain of the N-sphere under uniform signaling, which
"Proposition I" of Forney and Wei [2].
increases with N and ultimately approaches the value 7re/6. For nonuniform signaling with a Maxwell-Boltzmann distri
This forces the ultimate biasing gain (the difference between
bution, the average energy and bit rate are determined by the
the ultimate shaping gain and the shaping gain of the N
partition function Z(>.) (12). The continuous approximation
sphere) to decrease with dimension.
(18) yields
�
Qualitatively, this law of diminishing returns arises due to
a phenomenon known as the "sphere hardening effect" (see, Z(A) � V(A)-l exp (->'llrI12)dYer). (20)
e.g., [31]). In a many-dimensional sphere, almost all of the
volume is located near the surface of the sphere; consequently, Combining approximation (20) with (13) we approximate E,
almost all constellation points lie near or on the surface as well. the normalized average energy (4), by
Since these points all have the same energy, i.e., the same
cost, signaling with a Maxwell-Boltzmann distribution will E (R, A) � � �IIrl12 fer, >.)dYer), (21)
cause, these points to be selected equally often. Thus, uniform
signaling with spherical constellations becomes increasingly where
effective as the dimension increases, ultimately approaching exp (-AllrI12)
fCr, A) = (22)
IR exp (->'llrIl2)dyer)'
the performance of nonuniform signaling.
lim �
L.,;
f(r)V(aA'), (17) 6E( IR, A)
JR
R
=
0:-40
rEO(<>A',R)
[ d;;'in ]. [2(2/N)H(R,,\)]
As in (9), we have grouped the gain G(IT\l, A) (24) into three IT\l, and V(A2) is the fundamental volume (actually area) of
factors-the first being the coding gain 'Yc(A)�d�in/V(A )2/N A2, the constituent 2-D lattice of A. Combining this with our
of the lattice A [10], the second being a continuous approxi approximation (23) for the bit rate, we obtain
mation for the shaping gain, namely, 2 2
20 V(A ) /N/V(A2) · [V(IT\l2)/2( /N)H(R, A) .
CER ( ) ;::; [ l l
Ll. 2(2/N)H(R,A)
'Ys(IT\l, A) =
fiE(IR,A) ,
Here, as in [2], we identify two independent factors: the coding
(25)
constellation expansion ratio of A, CER2c(A) � V(A)2/NjV(A2)'
and the third being the discretization factor and the shaping constellation expansion ratio, CER2s(R A) �
V(IT\l2)/2(2/N)H(R,A). Again, as in the case of gain, one
component, the coding constellation expansion factor, is a
The discretization factor was omitted in the analysis of Forney geometric property of the lattice A and is unaffected by the
and Wei [2], who were interested in asymptotic (/3 ---> 00) prohahility distribution with which the constellation points are
limits for the shaping gain. However, for most practical values selected. The other component, the shaping constellation ex
of /3, this factor is not insignificant and should not be omitted. pansion ratio, depends both on the region IT\l and the parameter
Indeed, as will become evident, including this factor provides A.
accurate estimates for gain, even for relatively small values of When ), 0, we obtain the special case of uniform sig
=
(
gain that nonuniform signaling can provide over uniform
signaling with the same constellation. Setting A 0 reduces We may write CER2. IT\l , A) in terms of CER2s(R, 0) as
=
[
2(2/N)HCR,A) ] [ ] E(IR,O)
where p(R,),) (27) is the normalized redundancy under
'Ys(lR, A) 'Ys(IT\l, 0)·
2(2/N)H(R,0)
.
E(IR, nonuniform signaling. We see' that in addition to the
] [E(IR,O) ]
=
[
A) constellation expansion due to uniform signaling, we have
2.!:ICR,A) incurred an additional constellation expansion factor due to
,
= 'Ys(lR 0)· 2Mf/.,0) . E(IR, A) the loss in rate caused by nonuniform signaling. Thus, for
a fixed constellation, CER2s(lIi!, 0), the shaping constellation
= 'Ys(lR, 0) . TP(Il, A) . gE(IR, A), (26) expansion ratio induced by uniform signaling, is a lower bound
In (28), we have identified two separate factors that charac PAR2(1Ii!, A) PAR2(IIi!, O)gE(IIi!, A),
=
terize the biasing gain. The "energy savings factor" where PAR2(1Ii!, 0) denotes the PAR2 under uniform signaling.
gE(IR,),) � E(iR, O)/E(IT\l, ),) 2: 1, (29)
D. Applying the Continuous Approximations
accounts for the energy savings that result when constellation In applying these continuous approximations to actual con
points of low energy are selected more often that points of stellations, one is confronted with a certain flexibility in
large energy. Of course, selecting points with a nonuniform the choice of approximating region 1Ii!. For example, to ap
distribution results in a loss of entropy and hence a drop in proximate the behavior of an !vI point, symmetric PAM
the baseline average energy. The "energy loss factor" 2-p(R, >.) constellation based on lL, one would choose iii! [-R, Rj,
=
accounts for this drop. a ID "sphere" of radius R; however, it is not clear which
choice for radius R is best. Indeed, the "best" choice for R
C. CER2 and PAR2 Approximations depends both on the constellation parameter-be it average
We now provide continuous approximations for CER2 and energy, entropy, or whatever-that one is trying to estimate,
PAR2, two constellation parameters defined in Section III. and on the value of the Maxwell-Boltzmann parameter ),. The
From (19), we estimate 1021 ;::; V(IT\l2)/V(A2 ) for the size same flexibility in choice of sphere radius R occurs when one
of the constituent 2-D constellation, where V(IT\l2) is the attempts to approximate the behavior of an N-D spherical
volume (actually area) of IT\l2, the constituent 2-D region of constellation with an N -sphere 8N(R).
KSCHISCHANG AND PASUPATHY: OPTIMAL NONUNIFORM SIGNALING FOR GAUSSIAN CHANNELS 921
n:/ [6'Y® (N)], where 'Ye;(N) (39) denotes the shaping gain M
of the N -sphere under uniform signaling. Since 'Y0 (1'1) --+ I)-Ii 1. = (30)
7re/6 monotonically from below as N --+ 00, 'Yb approaches i= l
unity as the dimension increases, thus confirming the "law of
diminishing" returns discussed at the end of Section V. Although we will nOt always explicitly refer to them as such,
Curves showing the trade-offs between CER2s and shaping all prefix codes considered in this paper are complete.
gain or between PAR2 and shaping gain for ND spherical
constellations are easily obtained from the continuous approx A. Matched Codes
imations derived in Appendix B. However, as asserted by The simplest example of the idea of mapping a prefix code
Forney and Wei [2], the best possible trade-offs are achieved to a constellation is probably the following.> The output of
>
by 2-D spherical constellations, i.e., by regions shaped as a binary equiprobable memoryless source (the data source)
discs in two dimensions. Recall that Cartesian products of is parsed into a sequence of blocks drawn from the set
922 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 39, NO. 3, MAY 1993
10 o 11 1000 1 100
• •
• • . ,.
o
1001 000 010 1101
Fig. 4. A simple nonuniform PAM scheme. • • • •
1010 010 1 1 10
• • • 1011 001 011 1111
• • • •
2
a signaling scheme with an overall normalized average bit /3 = (2/N) Li li2-1i and E = (2/N) Li IITi I1 2-1i , subject
rate /3 (2/N) L;� 1 li2-1i . The fixed primary channel rate
= to the constraint (30), Unfortunately, short of searching all
is /3p = 2h /N, while the variable secondary channel rate is complete binary prefix codes with Inl codewords, we know of
/3
.• = /3 - /3p . It is quite possible to have /3 > /3r, so the names
.. no general method for finding the optimal prefix code,
primary and secondary do not necessarily refer to relative Rather than attempting to find the optimal code, we have
bit rates. In most practical circumstances, we will select taken the approach of finding approximations to the optimal
KSCHISCHANG AND PASUPATHY: OPTIMAL NONUNIFORM SIGNALING FOR GAUSSIAN CHANNELS
TABLE I TABLE II
PERFORMANCE OF HUFFMAN-CODED SIGNAL CONSTELLATIONS PERFORMANCE OF HUFFMAN-CODED SIGNAL CONSTELLATIONS
BASED ON 7J2 BASED 0>1 (1 + 1 / 2 )2
(3s 'Ys'Yd (dB) Neff PAR2 CER2 Inl {ip S, "fs"fd (dB) Neff PAR2 CER2
It follows from standard arguments in information theory A signaling scheme based on a coset code has two com
(e.g., [1, Section 5.4]) that the redundancy of the optimal code ponents as shown in Fig. 1. A coset code C, based on the
KSCHISCHANG AND PASUPATHY: OPTIMAL NONUNIFORM SIGNALING FOR GAUSSIAN CHANNELS 925
of the cosets' of A' in A by {Cl ' C2 , ' . . , cd. Each time the
=
of sets of channel symbols, drawn from the alphabet of the
cosets of A' in A_ The actual transmitted constellation point
memoryless signal point selector § is presented with ith coset
is determined by the signal point selector §. As discussed in
C; + A', it selects some point for transmission. The set of
Section II, both the coset code C and the signal point selector
all possible points drawn from. the ith coset forms the ith
S contribute to the transmission of data. It is important to note
constellation !1i.
that, as nonuniform signaling is a shaping technique, for the
Given that S is presented with the ith coset, if a point
schemes we propose only the signal point selector § is affected_
ri E Ci + A' is selected with probability p eri ), then
The coset code C is unchanged relative to well-known schemes
we can determine a normalized average bit rate f3i =
B A B A
C. Continuous Approximations • • • •
11 0 0 11
Suppose now that each constellation l1i is obtained from
the intersection of the ith coset of A' with the same finite D C D C
region �. Further, suppose that the memoryless signal point • • • •
10 0 0 10
selector § sclects each point in 11; with a Maxwell-Boltzmann
distribution, Le., given that § is presented with the ith constel A B
lation, a point ri E l1i is selected with probability p(ri ) • •
10 10
=
exp (-..\ llri I12)/Zi(..\)' where ..\ is fixed for all constellations.
Using the same continuous approximation principles as in Fig. 9. The sequence of subconstellations {A, B, c, D} is determined by
Section VI, the average energy Ei and bit rate (3i for each an Ungerbocck code. Signal point selection within each subconstellation is
subconstellation can be estimated via (21) and (23) from a performed using a simplex prefix code.
continuous Gaussian distribution, truncated to the region R
I t follows that each subconstellation supports approximately D. Memoryless Signal Point Selection with Huffman Codes
the same bit rate, at the same cost in average energy. Thus, As in the case of uncodcd transmission, probably the
independently of the coset code C and the probability with simplest method of selecting points from the coset sequences
which each coset is selected, K(S ) � ,Bi ( JR , ..\) and E � generated by a coset code is through a complete binary
Ei ( � , ..\) . prefix code. Because we would like our constellations to be
Using these estimates for bit rate and average energy, the polydiscs, and we have restricted our attention to binary coset
shaping gain "Is(S) is estimated by codes, we focus on coset codes based on the 2-D lattice 7/.. 2
22 H(R, >.)/N
A simple example of combining a nonuniform memoryless
"Is (JR, ..\)
signal point selector with a trellis code is shown in Fig. 9.
6E ( �, ..\) ,
=
approximately the produce of a coding constellation expansion inner point at squared Euclidean distance 1/2 from the origin
ratio CER2e( C) and a shaping constellation expansion and two outer points at squared Euclidean distance 5/2 from
ratio CER2s (S). In terms of the normalized redundancy the origin. By using the prefix code {O, 10, ll}, the signal
p(C ) 2r/N of the binary encoder C, we have
ll.
point selector chooses the inner point with probability 1/2 and
each outer point with prohability 1/4. The transmitted rate (3
A 2/
CER2 .(C) = 2P (C) V ( ) N for this scheme is 2.5 bits with a primary channel rate of 2
V (A2 ) ,
c
bits and a secondary channel rate of 0.5 bits. The shaping gain
"Is 0,994 dB. The overall gain .G "Ic"ls"ld 3.16 dB.
The four-way partition 7/.. 2/27/.. 2 , translated as in Fig. 9,
= = =
while
( JR 2 )
.:; - 2p(R, >.) VV( )2/N
yields four cosets with identical weight distributions. The
CER2., (<0)
_
since these achieve a given shaping gain with minimum As before, very satisfactory shaping gains are achievable via
CER2s and minimum PAR2. Therefore, a nonuniform signal these prefix codes, with Neff numbering in the hundreds of
ing scheme based on a multidimensional lattice A is perhaps dimensions for the larger constellations.
best implemented as a coset code involving the constituent Although the prefix codes for the smaller constellations
2-D lattice A 2 , in which the ith constellation l1i is circular have smaller Neff, as discussed at the end of Section VII,
and the signal point selector § chooses from 11; according to applying the Huffman procedure to Cartesian products of these
a Maxwell-Boltzmann distribution. subconstellations can improve the gain. Note, however, that the
KSCHISCHANG AND PASUPATHY: OPTIMAL NONUNIFORM SIGNALING FOR GAUSSIAN CHANNELS 'Y27
= APPENDIX A
enumerators ERROR COEFFICIENTS FOR BASELINE CONSTELLATIONS
8A(x) = x1 /2 + 2x25/2 + 2X41/2 + x 49/ 2 + . . . In this appendix, we compute the average nearest neighbor
8B(x ) = X5/2 + X13/2 + X29/2 + x 37/ 2 + . . . multiplicity (error coefficient) for a cubic constellation of side
8c(x) = x9/ 2 + 2x 17/2 + x25/2 + 2X 65/ 2 + . . . , M drawn from the lattice 7L N. This problem is easily solved
using the notion of a nearest neighbor enumerator.
respectively. Class A consists of four cosets, class B consists Given a finite constellation n, recall that Nmin (r) denotes
of eight cosets, and class C consists of four cosets. We have the number of points of n at distance dmin from the point r.
928 IEEE TRANSACTIONS O N INFORMATION THEORY, VOL. 39, NO. 3, MAY 1993
Define A (x) � L rH! XN min (r); then A(x) is a polynomial Setting .\ = 0 yields
with integer coefficients that we call the nearest neighbor enu
merator for O. For example, a simple 1-D PAM constellation H(IBN (R), 0) = log2 [V(IBN (R))]
with M points has A (x) 2:c + (M - 2)X2, indicating that
= = log2 [(11'R2)N/ 2/r(N/2 + 1 )] .
two points of the constellation have a single nearest neighbor,
while M - 2 points have two nearest neighbors. Combining these expressions gives an estimate for the nor
Assuming uniform signal point selection, it is easy to sec malized redundancy (27), namely,
that average nearest neighbor multiplicity N is given by
'\R2
A' ( I ) /A ( I ) where A'(x) dA(x)/dx. Thus for the M-PAM
=
fold Cartesian product of simple I-D PAM constellations, we (28) gives an cstimate for the biasing gain. Multiplying the
have N 2N(1 - 1M) for such constellations. Furthermore,
= biaSIng gain by "i0 (N) � 'if(N/2 + 1 )/ [6r(N/2 + l ) z/N] ,
since M 2(3 / 2 , we obtain Nffi 2N(1 - 2-,3/ 2 ) for our
= = the shaping gain of an N -sphere under uniform signaling [2],
N -D baseline constellations. yields an estimate for the shaping gain under nonuniform
signaling. Explicitly, the shaping gain "is ( IBN (R), .\) is
ApPENDIX B
'if P(N/2 + 1, '\R2)
CONTINUOUS APPROXIMATIONS FOR
SPHERlCAL CONSTELLATIONS
"is( IBN (R), .\) = "6 exp
( P(N/2, '\R2) )
P(N/2, .\R2)( 2 /N)+1
In this appendix, we specialize the continuous approx (36)
imations derived in Section VI to the case where IIil =
P(N/2 + 1, '\R2 ) .
IBN (R), an N-baIl of radius R centered at the origin. Cubic
constellations can be considered to be Cartesian products of PAR2: The constituent 2-D constellation of IBN (R) (as
1 D "spherical constellations" and so are (by the separability defined in Section III) is a 2-D disc B 2 (R) (with peak energy
of Gaussian densities) a special case. By letting R -> 00, we R2) when N is even, and a square lBi(R) of side 2R (with
obtain continuous approximations for the case of an infinite peak energy 2R2) when N is odd; this may be expressed
constellation. compactly by writing r;,ax (IBN (R)) [3 - (-1 ) N ] R2 /2.
=
Many of the expressions derived in this Appendix may Since the normalized average energy under uniform signaling
be written in terms of the (normalized) incomplete Gamma is 2R2 / (N + 2), we have
function pen, :e ) , defined in [23] as
(3 - ( - l ) N )(N + 2)/4 .
rta)l"t'L-le-tdt, PAR2 ( BN ( R) , 0) =
P(a, x) �
Under nonuniform signaling this PAR2 is increased by a factor
where we note that limx->oo P( a, x) 1. =
of gE so that
For a finite N-D spherical constellation 0, in our estimates
we choose the spherical radius R so that V(BN (R)) = (37)
InlV(A), Le., so that approximation (1 9) holds with equality.
Energy: From (21) we obtain
.\
sion ratio of CER2s r ( N/2 + 1)2 /N (see [2]), while a
=
Since E ( IB N ( R), 0) 2 R z/(N + 2), we find that the energy large odd-dimensional spherical constellation has CE R2. =
Limiting Case: When R -> 00, the truncated Gaussian [7] A. R. Calderbank and L. H. Ozarow, "Non-equiprobable signaling on the
random variable approaches a standard (untruncated) Gaussian Gaussian channel," IEEE Trans. lIiform Theory, vol. 36, pp. 726-740,
July 1990.
random variable. For large R, the normalized average energy [8] G. D. Forney, Jr., "Trellis shaping," IEEE Trans. Inform. Theory, vol.
E (IBN(R), A) -> l/A, independently of N. Siniilarly, the en 38, pp. 281-300, Mar. 1992.
[9] F. R. Kschischang and S. Pasupathy, "Optimal shaping properties of the
ergy savings factor gE(IBN(R), A) -> 2AR2 /(N + 2), the nor
truncated polydisc," IEEE Trans. Inform. Theory, Mar. 1992, submitted
malized redundancy p(IBN (R), A) -> log ( A R2/ef(N/ 2 + for publication.
N 2 [10] G. D. Forney, Jr., "Coset codes I: Introduction and geumetrical classi-
1 )2/ ). Thus the biasing gain
fication," IEEE Trans. Inform. Theory, vol. 34, pp. 1 123-1151, Sept.
N 1988.
i'b (BN(R), A) -> 2er(N/2 + 1 ) 2/ / (N + 2) [11] __ , "Coset codes II: Binary lattices and related codes," IEEE Trans.
1l'e/[6'Y'0 (N)], Inform. Theory, vol. 34, pp. 1152-1 187, Sept. 1988.
[12] A. R. Calderbank and N. J. A. Shine, "New trellis codes based on lat
=
tices and eosets," IEEE Trans. Inform. Theory, vol. IT-33, pp. 177-195,
where Mar. 1987.
[13] G. D. Forney, Jr., R. O. Gallager, G. R. Lang, F. M. Longstaff, and
N
A 0 (N) � 1l'(N/2 + 1)/[6 r(N/2 + 1 )2/ ] , (39) S. U. Qureshi, "Efficient modulation for band-limited channels," IEEE
J.
Select. Areas Commun., vol. SAc-i, pp. 632-647, Sept. 1984.
[14] R. G. Gallager, "Source coded modulation system," U.S. Patent 4 586
is the shaping gain of the N -sphere under uniform signaling. 182, Apr. 29, 1986.
The shaping gain i's (IBN(R), A) -> 7re/6, independently of [15] J. N. Livingston, "Shaping using variable-size regions," IEEE Trans.
Inform Theory, pp. 1347-1353, July 1992.
the dimension N. Of course, the biasing gain depends on [16] A. Chouly and H. Sari, "Block-coded modulation: Novel design tech
N, and approaches unity as N -> 00. For large values of niques and rotational invariance," Philips J. Res., vol. 45, no. 2, pp.
R, CER2(IBN (R) , A) -> AR2/e for even-dimensional spher 127-155, 1990.
[17] G. R. Lang, G. D. Forney, S. Qureshi, F. M. Longstaff, and C. H. Lee,
ical constellations, and CER2(IBN (R) , A) -> 4AR2/(1l'e) for "Signal structures with data encoding/decoding for QCM modulations,"
odd-dimensional spherical constellations. Similarly, for even U.S. Patent 4 538 284, Aug. 27, 1985.
dimensional spherical constellations, PAR2(BN(R) , A) -> [18] T. Armstrong, "Secondary channel signaling in a QAM data point
constellation," U.S. Patent 4 630 287, Dec. 16, 1 986.
AR2 independently of N. Of course, this is to be expected
,
[19] R. D. Gitlin and J.-J. Werner, "Inbaod coding of secondary data," U.S.
because the peak energy value is approximately R2, and Patent 4 644 537, Feb. 17, 1987.
the average energy is approximately 1/ A. Similarly, for odd [20] H. K. Thapar, "Inband coding of secondary data," U.S. Patent 4 651
320, Mar. 17, 1987.
dimensional spherical constellations, PAR2(IEE N (R) , A) -> [21] R. D. Gitlin, H. K. Thapar, and J. J. Werner, "Au inband coding method
2 AR2 because the peak energy value (in two dimenSIons) is for the transmission of secondary data," in Conf. Rec. IEEE InL Conf.
Commun., June 1988, pp. 3.t.i-3.1.5. '
approximately 2R2 . [22] C. Hastings, Jr., Approximations for Digital Computers. Princeton, NJ:
Princeton University Press, 1955.
[23] M. Abramowitz and I. A. Stegun, Handbook ofMathematical Functions.
ACKNOWLEDGMENT
New York: Dover, 1965.-
[24] J. H. Conway and N. 1. A. Sloane, Sphere Packings, Lattices and Groups.
We are grateful to G. D. Forney, Jr., for his detailed
New York: Springer-Verlag, 1988.
comments on earlier versions of this paper, for his insightful [25] L.-F. Wei, "Trellis-coded moduiation with multidimensional constel
suggestions, and for providing us with some related reference lations," IEEE Trans. Inform. Theory, vol. IT-33, pp. 41b-501, July
1987.
material. Comments made by the anoriymous reviewers proved [26] T. Berger, "Minimum entropy quantizers and permutation codes," IEEE
most helpful as well. We would also like to thank P. R. Trans. Inform. Theory, vol. iT-28, pp. 149-157, Mar. 1982.
Stubley for pointing out that the Huffinan algorithm minimizes [27J I. Ingemarsson, "Optimized permutation modulation," IEEE Trans.
Inform. Theory, pp. 1098-1 100, Sept. 1990.
the relative entropy between a distribution and its dyadic [28] J. A. Hertz, R. G. Palmer, and A. S. Krogh, Introduction to the Theory
approximation. of Neural Computation. Reading, MA: Addison-Wesley, 1991.
[29] S. Kirkpatrick, C. D. Gelatt, Jr., and M. P. Vecehi, "Optimization by
simulated anncaling," Science, vol. 220, pp. 671-680, May 13, 1983.
REFERENCES [30] N. J. A. Sloane, "Tables of sphere packings and spherical codes," IEEE
Trans. Inform. Theory, vol. IT-27, pp. 327-338, May 1981. .
(1] T. M. Cover and J. A. Thomas, Elements of Information Theory. New [31] 1. M. Wozencraft and I. M. Jacobs, Principles of Communication Engi
York, Wiley, 1991. neering. New York� Wiley, 1965.
[2] G. D. Forney, Jr., and L.-F. Wei, "Multidimensional constellations--Part [32] W'. Rudin, Function Theory in Polydiscs. New York: W. A. Benjamin,
I: Introduction, figures of merit, and generalized cross constellations, 1969.
IEEE J. Select. Areas Commun., vol. 7, pp. 877-892, Aug. 1989. [33] R. E. Blahut, Digital Transmission of Information. Reading, MA:
[3] E. SchrMinger, Statistical Thermodynamics. Cambridge; Cambridge Addison-Wesley, 1 990.
University Press, 1962. [34] P. R. Stubley and I. F. Blake, "On a discrete probability distribution
[4] R. K. Pathria, Statistical Mechanics. Elmsford, NY: Pergamon, 1972. matching problem," J. Algorithms, June 1991, submitted' for publication.
[5] D. Ruelle, "Thermodynamic formalism," in Encyclopedia of Mathe [35] D. A. Huffman, "A method for the construction of minimum redundancy
matics and Its Applications, vol. 5. Reading, MA: Addison-Wesley, codes," Proc. IRE, vol. 40, pp. 1098-1 101, 1952.
1978. [361 R. W. Hamming, Coding and Information Theory. Englewood Cliffs,
[6] G. D. Forney, Jr., "Multidimensional constellations--Part II: Voronoi NJ: Prentice-Hall, 1986.
constellations," IEEE J. Select. Areas Commun., vol. 7, pp. 941-958, [37] G. Ungerboeck, "Channel coding with multilevel/phase signals," IEEE
Aug. 1989. Trans. Inform. Theory, vol. IT-28, pp. 55-67, Jan. 1982.