+ All Categories
Home > Documents > Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with...

Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with...

Date post: 03-Aug-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
21
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003 2381 Joint Tx-Rx Beamforming Design for Multicarrier MIMO Channels: A Unified Framework for Convex Optimization Daniel Pérez Palomar, Student Member, IEEE, John M. Cioffi, Fellow, IEEE, and Miguel Angel Lagunas, Fellow, IEEE Abstract—This paper addresses the joint design of transmit and receive beamforming or linear processing (commonly termed linear precoding at the transmitter and equalization at the receiver) for multicarrier multiple-input multiple-output (MIMO) channels under a variety of design criteria. Instead of considering each design criterion in a separate way, we generalize the existing results by developing a unified framework based on considering two families of objective functions that embrace most reasonable criteria to design a communication system: Schur-concave and Schur-convex functions. Once the optimal structure of the transmit-receive processing is known, the design problem simplifies and can be formulated within the powerful framework of convex optimization theory, in which a great number of interesting design criteria can be easily accommodated and efficiently solved, even though closed-form expressions may not exist. From this perspective, we analyze a variety of design criteria, and in particular, we derive optimal beamvectors in the sense of having minimum average bit error rate (BER). Additional constraints on the peak-to-average ratio (PAR) or on the signal dynamic range are easily included in the design. We propose two multilevel water-filling practical solutions that perform very close to the optimal in terms of average BER with a low implementation complexity. If cooperation among the processing operating at dif- ferent carriers is allowed, the performance improves significantly. Interestingly, with carrier cooperation, it turns out that the exact optimal solution in terms of average BER can be obtained in closed form. Index Terms—Array signal processing, beamforming, joint transmit-receive equalization, linear precoding, MIMO channels, space-time filtering, water-filling. I. INTRODUCTION M ULTIPLE-INPUT multiple-output (MIMO) channels arise in many different scenarios such as when a bundle of twisted pairs in digital subscriber lines (DSLs) is treated as Manuscript received March 5, 2002; revised February 20, 2003. This work was supported in part by the European Comission under Project IST-2000-30148 I-METRA; Samsung Advanced Institute of Technology; the Spanish Government (CICYT) TIC2000-1025, TIC2001-2356, TIC2002-04594, FIT-070000-2000-649 (MEDEA A105 UniLAN); and the Catalan Government (DURSI) 1999FI 00588, 2001SGR 00268. Part of this work was presented at the 40th Annual Allerton Conference on Communication, Control, and Computing, Monticello, IL, Oct. 2002. The associate editor coordinating the review of this paper and approving it for publication was Prof. Iaon Tabus. D. Pérez Palomar is with the Department of Signal Theory and Communi- cations, Universitat Politècnica de Catalunya (UPC), 08034 Barcelona, Spain (e-mail: [email protected]). J. M. Cioffi is with the Department of Electrical Engineering, Stanford Uni- versity, Stanford, CA 94305-9515 USA (e-mail: [email protected]). M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona, Spain (e-mail: [email protected]). Digital Object Identifier 10.1109/TSP.2003.815393 a whole [1], when multiple antennas are used at both sides of a wireless link [2], or simply when a frequency-selective channel is properly modeled by using, for example, transmit and receive filterbanks [3]. In particular, MIMO channels arising from the use of multiple antennas at both the transmitter and at the receiver have recently attracted significant interest because they provide an important increase in capacity over single-input single-output (SISO) channels under some uncorrelation conditions [4], [5]. In terms of spectral efficiency, a MIMO system should be designed to approach the capacity of the channel [6], [2], [7]. In light of this observation, a frequency-selective MIMO channel can be dealt with by taking a multicarrier approach, which is a well-known capacity-lossless structure and allows us to treat each carrier as a flat MIMO channel [2], [8]. A ca- pacity-achieving design dictates that the channel matrix at each carrier must be diagonalized, and then, a water-filling power allocation must be used on the spatial subchannels (or channel eigenmodes) of all carriers [6], [2], [7]. Note that this requires channel state information (CSI) available at both ends of the link, which we assume in the rest of the paper. In theory, this solution has the implication that an ideal Gaussian code should be used on each spatial eigenmode and carrier according to its allocated power [6]. In practice, however, each Gaussian code is substituted by a simple (and suboptimal) signal constellation and a practical (and suboptimal) coding scheme (if any). The complexity of such a solution is still significative since each channel eigenmode requires a different combination of signal constellation and code, depending on the allocated power. To reduce the complexity, the system can be constrained to use the same constellation and code in all channel eigenmodes (possibly optimizing the utilized bandwidth to transmit only over those eigenmodes with a sufficiently high gain), i.e., an equal-rate transmission. Examples of this pragmatic and simple solution are found in the European standard HIPERLAN/2 [9] and in the U.S. standard IEEE 802.11 [10] for wireless local area networks (WLANs). Assuming that the specific signal constellations and coding schemes for all the substreams have been selected either after some bit distribution method or simply by taking a simple uni- form bit distribution, it is then possible to further optimize the system to improve the quality of each of the communication links. In particular, we consider the joint design of linear pro- cessing at both ends of the link (commonly referred to as linear precoder at the transmitter and equalizer at the receiver), ac- cording to a variety of criteria, as we now review. In [11]–[13] 1053-587X/03$17.00 © 2003 IEEE
Transcript
Page 1: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003 2381

Joint Tx-Rx Beamforming Design for MulticarrierMIMO Channels: A Unified Framework

for Convex OptimizationDaniel Pérez Palomar, Student Member, IEEE, John M. Cioffi, Fellow, IEEE, and Miguel Angel Lagunas, Fellow, IEEE

Abstract—This paper addresses the joint design of transmitand receive beamforming or linear processing (commonlytermed linear precoding at the transmitter and equalization atthe receiver) for multicarrier multiple-input multiple-output(MIMO) channels under a variety of design criteria. Instead ofconsidering each design criterion in a separate way, we generalizethe existing results by developing a unified framework basedon considering two families of objective functions that embracemost reasonable criteria to design a communication system:Schur-concave and Schur-convex functions. Once the optimalstructure of the transmit-receive processing is known, the designproblem simplifies and can be formulated within the powerfulframework of convex optimization theory, in which a greatnumber of interesting design criteria can be easily accommodatedand efficiently solved, even though closed-form expressions maynot exist. From this perspective, we analyze a variety of designcriteria, and in particular, we derive optimal beamvectors in thesense of having minimum average bit error rate (BER). Additionalconstraints on the peak-to-average ratio (PAR) or on the signaldynamic range are easily included in the design. We propose twomultilevel water-filling practical solutions that perform very closeto the optimal in terms of average BER with a low implementationcomplexity. If cooperation among the processing operating at dif-ferent carriers is allowed, the performance improves significantly.Interestingly, with carrier cooperation, it turns out that the exactoptimal solution in terms of average BER can be obtained inclosed form.

Index Terms—Array signal processing, beamforming, jointtransmit-receive equalization, linear precoding, MIMO channels,space-time filtering, water-filling.

I. INTRODUCTION

M ULTIPLE-INPUT multiple-output (MIMO) channelsarise in many different scenarios such as when a bundle

of twisted pairs in digital subscriber lines (DSLs) is treated as

Manuscript received March 5, 2002; revised February 20, 2003. This work wassupported in part by the European Comission under Project IST-2000-30148I-METRA; Samsung Advanced Institute of Technology; the SpanishGovernment (CICYT) TIC2000-1025, TIC2001-2356, TIC2002-04594,FIT-070000-2000-649 (MEDEA+ A105 UniLAN); and the CatalanGovernment (DURSI) 1999FI 00588, 2001SGR 00268. Part of this work waspresented at the 40th Annual Allerton Conference on Communication, Control,and Computing, Monticello, IL, Oct. 2002. The associate editor coordinatingthe review of this paper and approving it for publication was Prof. Iaon Tabus.

D. Pérez Palomar is with the Department of Signal Theory and Communi-cations, Universitat Politècnica de Catalunya (UPC), 08034 Barcelona, Spain(e-mail: [email protected]).

J. M. Cioffi is with the Department of Electrical Engineering, Stanford Uni-versity, Stanford, CA 94305-9515 USA (e-mail: [email protected]).

M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions deCatalunya (CTTC), 08034 Barcelona, Spain (e-mail: [email protected]).

Digital Object Identifier 10.1109/TSP.2003.815393

a whole [1], when multiple antennas are used at both sides of awireless link [2], or simply when a frequency-selective channelis properly modeled by using, for example, transmit and receivefilterbanks [3]. In particular, MIMO channels arising fromthe use of multiple antennas at both the transmitter and at thereceiver have recently attracted significant interest becausethey provide an important increase in capacity over single-inputsingle-output (SISO) channels under some uncorrelationconditions [4], [5].

In terms of spectral efficiency, a MIMO system should bedesigned to approach the capacity of the channel [6], [2],[7]. In light of this observation, a frequency-selective MIMOchannel can be dealt with by taking a multicarrier approach,which is a well-known capacity-lossless structure and allowsus to treat each carrier as a flat MIMO channel [2], [8]. A ca-pacity-achieving design dictates that the channel matrix at eachcarrier must be diagonalized, and then, awater-filling powerallocation must be used on the spatial subchannels (or channeleigenmodes) of all carriers [6], [2], [7]. Note that this requireschannel state information (CSI) available at both ends of thelink, which we assume in the rest of the paper. In theory, thissolution has the implication that an ideal Gaussian code shouldbe used on each spatial eigenmode and carrier according to itsallocated power [6]. In practice, however, each Gaussian codeis substituted by a simple (and suboptimal) signal constellationand a practical (and suboptimal) coding scheme (if any). Thecomplexity of such a solution is still significative since eachchannel eigenmode requires a different combination of signalconstellation and code, depending on the allocated power. Toreduce the complexity, the system can be constrained to usethe same constellation and code in all channel eigenmodes(possibly optimizing the utilized bandwidth to transmit onlyover those eigenmodes with a sufficiently high gain), i.e., anequal-rate transmission. Examples of this pragmatic and simplesolution are found in the European standard HIPERLAN/2 [9]and in the U.S. standard IEEE 802.11 [10] for wireless localarea networks (WLANs).

Assuming that the specific signal constellations and codingschemes for all the substreams have been selected either aftersome bit distribution method or simply by taking a simple uni-form bit distribution, it is then possible to further optimize thesystem to improve the quality of each of the communicationlinks. In particular, we consider the joint design of linear pro-cessing at both ends of the link (commonly referred to as linearprecoder at the transmitter and equalizer at the receiver), ac-cording to a variety of criteria, as we now review. In [11]–[13]

1053-587X/03$17.00 © 2003 IEEE

Page 2: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2382 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

and [3], the sum of the mean square error (MSE) of all channelsubstreams (the trace of the MSE matrix) was used as the objec-tive to minimize under an average power constraint. This crite-rion was generalized by using a weighted sum (weighted trace)in [14]. In [3], a maximum signal to interference-plus-noise ratio(SINR) criterion with a zero-forcing (ZF) constraint was alsoconsidered. For these criteria, the original complicated designproblem is greatly simplified because the channel turns out to bediagonalized by the transmit-receive processing. In [15], the de-terminant of the MSE matrix was minimized, and the diagonalstructure was found to be optimal as well. In [16], the resultswere extended to the case of a peak power constraint (maximumeigenvalue constraint) with similar results.

We remark that the channel-diagonalizing property is of para-mount importance in order to be able to solve the problem. Themain interest of the diagonalizing structure is that it allows ascalarizationof the problem (meaning that all matrix equationsare substituted with scalar ones) with the consequent great sim-plification. In light of the optimality of the channel-diagonal-izing structure in all the aforementioned examples (includingthe capacity-achieving solution), one may wonder whether thesame holds for other criteria. Examples of other reasonable cri-teria to design a communication system are the minimizationof the maximum bit error rate (BER) of the substreams, theminimization of the average BER, or the maximization of theminimum SINR of the substreams. In these cases, it is not clearwhether one can assume a diagonal structure, as was obtainedin the previous cases.

In this paper, we consider different design criteria based onoptimizing the MSEs, the SINRs, and also the BERs directly.Instead of considering each design criterion in a separate way,we develop a unifying framework and generalize the existingresults by considering two families of objective functions thatembrace most reasonable criteria to design a communicationsystem: Schur-concave and Schur-convex functions that arisein majorization theory [17]. For Schur-concave objectivefunctions, the channel-diagonalizing structure is always op-timal, whereas for Schur-convex functions, an optimal solutiondiagonalizes the channel only after a very specific rotationof the transmitted symbols. Once the optimal structure of thetransmit-receive processing is known, the design problem sim-plifies and can be formulated within the powerful frameworkof convex optimization theory, in which a great number ofinteresting design criteria can be easily accommodated andefficiently solved, even though closed-form expressions maynot exist. We analyze a variety of criteria, and in particular, wederive optimal beamvectors in the sense of having minimumaverage BER. A convex optimization approach for the simplecase of utilizing a single spatial eigenmode (in other words,using a single beamforming per carrier) was also taken in [18].Additional constraints on the peak-to-average ratio (PAR) oron the signal dynamic range are easily included in the designwithin the convex optimization framework. We propose twomultilevel water-filling practical solutions that perform veryclose to the optimal in terms of average BER with a low im-plementation complexity. If cooperation among the processingoperating at different carriers is allowed, the performanceimproves significantly. Interestingly, with carrier cooperation,

it turns out that the optimal solution in the sense of minimumaverage BER can be obtained in closed form.

The paper is structured as follows. In Section II, a brief pre-liminary description of convex optimization problems and ofmajorization theory is given. The signal model is introducedin Section III. The main result of the paper (the optimal struc-ture for Schur-concave and Schur-convex objective functions)is given in Section IV. Section V is devoted to the systematicdesign of beamforming under the framework of convex opti-mization theory. In Section VI, additional constrains to controlthe PAR are considered. Simulation results are given in Sec-tion VII. The final conclusions of the paper are summarized inSection VIII.

The following notation is used. Boldface upper-case lettersdenote matrices, boldface lower-case letters denote column vec-tors, and italics denote scalars. The superscripts, , and

denote transpose, complex conjugate, and Hermitian op-erations, respectively. (also ) and denote the( th, th) element andth column of matrix , respectively. By

, we mean that is positive semidefinite. Thetrace, determinant, and Frobenius norm of a matrix are denotedby Tr , , and , respectively. By diag , we denotea block-diagonal matrix with diagonal blocks given by the set

. The gradient of a function with respect tois written as

. We define .

II. PRELIMINARIES

In Section V, a variety of objective functions are consideredunder the powerful framework of convex optimization theory[19]–[21]. For this purpose, we first give an overview in Sec-tion II-A of the potential and advantages of this framework.Roughly speaking, one can say that once a problem has beenexpressed in convex form, it has been solved. However, beforebeing able to express the different criteria in convex form, a sim-plification of the problem is necessary. Majorization theory [17]provides us with useful tools to simplify many matrix-valuedproblems, which we review in Section II-B.

A. Convex Optimization Problems

A generalconvex optimization problem(convex program) isof the form [19], [21]:

s.t.

where is the optimization variable,are convex functions, and are linear func-tions (more exactly affine functions). The functionis theob-jective functionor cost function. The inequalities arecalled inequality constraints, and the equations arecalledequality constraints. When the functions and arelinear (affine), the problem is called alinear program(LP) andis much simpler to solve.

Many analysis and design problems arising in engineeringcan be cast (or recast) in the form of a convex optimization

Page 3: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2383

problem. In general, some manipulations are required to con-vert the problem into a convex one (unfortunately, this is not al-ways possible). The interest of expressing a problem in convexform is that although an analytical solution may not exist andthe problem may be difficult to solve (it may have hundredsof variables and a nonlinear nondifferentiable objective func-tion), it can still be solved (numerically) very efficiently, bothin theory and practice [21]. Another interesting feature of ex-pressing a problem in convex form is that additional constraintscan be straightforwardly added, as long as they are convex. Forexample, in the problem addressed in this paper, it is very simpleto add constraints to control the dynamic range of the poweramplifier [22] and the PAR of the transmitted signal (c.f. Sec-tion VI).

Convex programming has been used in related areas suchas FIR filter design [23], antenna array pattern synthesis [24],power control for interference-limited wireless networks [25],and transmit downlink beamforming in a multiuser scenariowith a multiantenna base station [22].

Solving Convex Optimization Problems:In some cases,convex optimization problems can be analytically solved usingthe Karush–Kuhn–Tucker (KKT) optimality conditions, andclosed-form expressions can be obtained. In general, however,one must resort to iterative methods [19], [21]. In the last tenyears, there has been considerable progress and developmentof efficient algorithms for solving wide classes of convexoptimization problems. Recently developedinterior-pointmethodscan be used to iteratively solve convex problemsefficiently in practice by dealing with the constrained problemas a sequence of unconstrained problems in which a Newtonmethod can be efficiently used. This was an important break-through achieved by Nesterov and Nemirovsky in 1988. Theyshowed that interior-point methods (initially proposed only forlinear programming by Karmarkar in 1984) can, in principle,be generalized to all convex optimization problems. In [26], avery general framework was developed for solving convex op-timization problems using interior-point methods. In addition,the difference between the objective value at each iteration andthe optimum value can be upper bounded using duality theory[19], [21]. This allows the utilization of nonheuristic stoppingcriteria such as stopping when some prespecified resolution hasbeen reached. Another interesting family of iterative methodsarecutting-plane methods[21].

B. Majorization Theory

We introduce the basic notion of majorization and state someimportant results (see [17] for a complete reference of the sub-ject). Majorization makes precise the vague notion that the com-ponents of a vector are “less spread out” or “more nearlyequal” than the components of a vector.

Definition 1: For any , let

denote the components of in decreasing order (also termedorder statistics of ).

Definition 2 [17, 1.A.1]: Let , . Vector is ma-jorized by vector (or majorizes ) if

and represent it by .Definition 3 [17, 3.A.1]: A real-valued function defined on

a set is said to be Schur-convex on if

on

Similarly, is said to be Schur-concave onif

on

As a consequence, if is Schur-convex on , then is Schur-concave on and vice versa.

It is important to remark that the sets of Schur-concave andSchur-convex functions do not form a partition of the set of allfunctions. In fact, neither are the two sets disjoint (the intersec-tion is not empty), nor do they cover the entire set of all func-tions.

Lemma 1 [17, 9.B.1]:Let be an Hermitian matrixwith diagonal elements denoted by the vectorand eigenvaluesdenoted by the vector. Then

Lemma 2 [17, p. 7]: Let and denote theconstant vector with . Then

Lemma 3 [17, 9.B.2]:For any , there exists a realsymmetric (and therefore Hermitian) matrix with equal diagonalelements and eigenvalues given by.

III. SIGNAL MODEL

We consider a communication system with transmit andreceive dimensions. This gives rise to a MIMO channel that

can be represented by a channel matrix. Many different commu-nication channels can be expressed under the unified notationof a channel matrix such as a frequency-selective channel em-ploying transmit and receive filterbanks [3], a bundle of twistedpairs in DSL [1], or a wireless multiantenna system [27], [2]. Al-though the results in this paper are valid for any MIMO channel,we focus on a wireless multiantenna system to gain insight intobeamforming issues traditionally associated with arrays of an-tennas.

To deal easily with the frequency-selectivity of the channel,we take a multicarrier approach without loss of optimality (sinceit is known to be a capacity-lossless structure [2], [8]):

(1)

where denotes the carrier index, is the number of carriers,is the transmitted vector, is the

Page 4: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2384 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

(a)

(b)

Fig. 1. Matrix processing and multiple beamforming interpretations of the communication system. (We assume for the clarity of the figure thatL = L 8k.) (a)Matrix processing interpretation at carrierk. (b) Multiple beamforming interpretation at carrierk.

channel matrix, is the received signal vector, andis a zero-mean circularly symmetric complex

Gaussian noise vector with arbitrary covariance matrix ,i.e., . The channel is assumed fixed duringthe transmission of a block and known at both sides of the com-munication link as well as the noise covariance matrix.

At each carrier , the matrix channel haschannel eigenmodesor spatial subchannels(i.e., nonvanishingsingular values of the channel matrix) [2]. We can use themas a means of spatial multiplexing [28] to transmit simultane-ously symbols by having established substreams. Noticethat established substreams and spatial subchannels (or channeleigenmodes) are different concepts that may or may not coin-cide, depending on whether the channel is diagonalized or not(c.f. Section IV). Although the notation in this paper allowsfor arbitrary values of , in a practical system, we will typi-cally have to have an acceptable performance. Thetransmitted vector at theth carrier after linear precoding is[see Fig. 1(a)]

(2)

where represents the transmitted symbols(we assume zero-mean unit-energy uncorrelated (white) sym-bols,1 i.e., ), is the transmit

matrix processing, , and . We canthink of each column of as a different beamvector corre-sponding to each transmitted symbol, giving rise to a multiplebeamforming architecture [see Fig. 1(b)]. Note that if only onesymbol is transmitted per carrier ( ), then (2) reducesto a classical beamforming structure with a single beamvector:

1White symbols account, for example, to having independent bit streams. Inthe case where we have colored symbols due, for example, to a coded trans-mission, a prewhitening operation can be performed prior to precoding at thetransmitter, and the corresponding inverse operation can be performed after theequalizer at the receiver.

. The transmitter is constrained in its average totaltransmit power:2

(3)

where is the power in units of energy per block-transmis-sion (or, equivalently, per OFDM symbol). The power in unitsof energy per symbol period is given by , and thepower in units of energy per second is , where is thesymbol period. In Section VI, a separate power constraint perantenna is considered. Note that a power constraint per carrier

can be readily incorporated into the problem for-mulation.

The received vector at theth carrier after the equalizer is

(4)

where is the receive matrix processing, andis the estimation of . Again, each column of

can be interpreted as a beamvector adapted to each spa-tial channel substream at carrier, i.e., [seeFig. 1(b)].

Hitherto, only an independent processing at each carrierhas been considered, and we call it thecarrier-noncooper-ative approach[see Fig. 2(a)]. This scheme, however, canbe further generalized by allowing cooperation among car-riers, which we term thecarrier-cooperative approach[seeFig. 2(b)]. The signal model is obtained [similarly to (2)–(4)]by stacking the vectors corresponding to all carriers (e.g.,

), by considering global transmit andreceive matrices (the transmit power con-straint reduces to ) and ,

2Equation (3) is a short-term power constraint (for each channel state) asopposed to a less restrictive long-term power constraint that would allow thetransmit power to exceedP for some channel states, as long as it is compen-sated by some other channel states (this constraint, however, requires knowledgeof the channel statistics or at least of some future realizations of the channel).

Page 5: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2385

(a)

(b)

Fig. 2. Carrier-cooperative versus carrier-noncooperative approaches. (We assume for clarity of the figure thatL = L 8k.) (a) Carrier-noncooperative approach.(b) Carrier-cooperative approach.

where is the total number of trans-mitted symbols, and by defining the global channel as

diag . This general block pro-cessing scheme was used in [2] to obtain a capacity-achievingsystem. This model can easily cope with intermodulation terms,unlike the noncooperative model of (1), that implicitly assumesthe carriers to be completely orthogonal. The carrier-noncoop-erative processing model (2)–(4) can be obtained from the moregeneral carrier-cooperative model by setting diagand diag , i.e., by imposing a block-diagonalstructure on and . In fact, it is this block-diagonal structurethat makes the carrier-noncooperative scheme less general and,therefore, have a worse performance than the carrier-coopera-tive one. From an intuitive point of view, the reason why thisgeneralized model has a potential better performance is that itcan reallocate the symbols among the carriers in an intelligentway (e.g., if one carrier is in a deep fading, it will try to useother carriers instead), whereas the noncooperative scheme willalways transmit symbols through theth carrier, no matterwhat the fading state of the carriers is. From a mathematicalpoint of view, however, the carrier-noncooperative model ismore general since the carrier-cooperative scheme is obtainedby particularizing (a single carrier). Thus, in the sequel,the carrier-noncooperative matrix signal model is consideredwithout loss of generality (w.l.o.g.).

IV. OPTIMALITY OF THE

CHANNEL-DIAGONALIZING STRUCTURE

The joint transmit-receive matrix design is, in general, acomplicated nonconvex problem. As previously mentioned,for some specific design criteria, the original complicatedproblem is greatly simplified because the channel turns outto be diagonalized by the transmit-receive processing, whichallows ascalarizationof the problem (meaning that all matrixequations are substituted with scalar ones). Examples are theminimization of the (weighted) sum of the MSEs of all channelspatial substreams [13], [3], [14], the minimization of thedeterminant of the MSE matrix [15], and the maximizationof the mutual information [6], [2], [7]. Recall that for otherinteresting design criteria (such as the minimization of theaverage/maximum BER or the maximization of the minimumSINR), it is unknown whether the channel-diagonalizingstructure is optimal.

In the following, we generalize these results by developing aunified framework. Instead of analyzing each design criterion ina separate way, we consider that the design is based on the mini-mization of some arbitrary objective function of the MSEs of allchannel substreams MSE , where MSE is the MSEof the th spatial substream at theth carrier (objective functionsof the SINRs and of the BERs are readily incorporated as we

Page 6: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2386 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

show next). In particular, we will obtain that for a wide familyof functions (Schur-concave and Schur-convex functions), thechannel matrix is either fully diagonalized or diagonalized upto a very specific rotation of the data symbols.

The objective function is an indicator of how well thesystem performs. As an example, if two MIMO systems areidentical except in one of the substreams for which one ofthe systems outperforms the other, any reasonable functionshould properly reflect this difference. Therefore, it sufficesto consider only thesereasonable functions.3 Mathematically,this is equivalent to saying that the objective functionmustbe increasing in each one of its arguments while having the restfixed.

A. Optimum Receive Matrix

To design the system, we first easily derive the optimum re-ceive matrices ’s, assuming the transmit ones ’s fixed, andthen deal with the difficult part, which is the derivation of the op-timal transmit matrices ’s (this two-step derivation has beenindependently used in [16]). The MSE matrix at theth carrieris defined as the covariance matrix of the error vector (given by

):

(5)

where . The MSE ofthe ( th, th) substream is theth diagonal element of , i.e.,

MSE

(6)

where (resp. ) is the th column of (resp. ). Ex-pression (6) is mathematically intractable since it is nonconvexin . However, for a given , MSE is convex in

and independent of the other columns of and of theother carriers, which means that each can be independentlyoptimized. To obtain the optimal receive matrix in a moredirect way, it suffices to find such that the diagonal elementsof are minimized. This can be done regardless of the spe-cific choice of the objective function since we know it is in-creasing in each argument. Alternatively, we can obtainso that , which in particular im-plies that the diagonal elements are minimized (in fact, both cri-teria are equivalent as shown in [29]). In other words, we wantto solve

Setting the gradient of Tr to zero

Tr

3Given anunreasonableobjective function, it is always possible to redefineit in a reasonableway so that it better reflects the system performance.

and particularizing for all the vectors of the canonical base, itfollows that

(7)

Expression (7) is the linear minimum MSE (LMMSE) receiveror Wiener filter[30]. Using the optimal receive matrix , weobtain the following concentrated error matrix:

(8)

where we have used the matrix inversion lemma,4 and wehave defined (note that the eigenvectorsand eigenvalues of are the right singular vectors and thesquared singular values, respectively, of the whitened channel

).However, many objective functions are naturally expressed as

functions of the SINR of each substream. The SINR at thethcarrier and theth spatial substream is

SINR (9)

where is theinterference-plus-noise covariance matrix seen by the (th, th)substream, the inequality comes from Cauchy–Schwarz’s in-equality [31] [with vectors and ],and the upper bound is achieved by

, i.e., the Wiener filter again. Noting that the MSEcan be expressed as

MSE

(10)

the SINR can be easily related to the MSE as5

SINRMSE

(11)

Maximizing the SINR is clearly equivalent to minimizing theMSE.

The performance of a digital communication system is ul-timately given by the fraction of bits in error or bit error rate(BER). Under the Gaussian assumption, the symbol error proba-bility can be analytically expressed as a function of the SINR[32]:

SINR SINR (12)

where and are constants that depend on the signalconstellation, and is the -function defined as

[32]. It is sometimes conve-nient to use the Chernoff upper bound of the tail of the Gaussiandistribution function [33] to approximate

4Matrix Inversion Lemma:(A+BCD) = A �A B(DA B+C ) DA .

5Note that0 < MSE � 1.

Page 7: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2387

Fig. 3. Convexity of the BER as a function of the MSE for the range of BER�2 � 10 .

the symbol error probability as SINR

(which becomes a good approximation for high values of theSINR). The BER can be approximately obtained from thesymbol error probability (assuming that a Gray encoding isused to map the bits into the constellation points) as

BER (13)

where is the number of bits per symbol, and isthe constellation size.

It is important to remark that both the exact BER function,and the Chernoff upper bound are convex decreasing functionsof the SINR (see Appendix H). In addition, they are also convexincreasing functions of the MSE for sufficiently small values ofthe argument (interestingly, for BPSK and QPSK constellations,this is true for any value of the argument), as can be observedfrom Fig. 3 (see Appendix H for a formal proof). Note that min-imizing the BER is tantamount to minimizing the MSE and tomaximizing the SINR. As a rule of thumb, the exact BER func-tion and the Chernoff upper bound are indeed convex in the MSEfor a BER less than . Note that this is a mild assump-tion since practical systems have, in general, an uncoded BER6

less than . Therefore, for practical purposes, we canassume the exact BER and the Chernoff upper bound as convexfunctions of the MSE.

Summarizing, the Wiener filter has been obtained as the op-timum linear receiver in the sense that it minimizes each of theMSEs, maximizes each of the SINRs, and minimizes each of theBERs (in terms of capacity, the Wiener filter is capacity-loss-less and simplifies the signal model). In addition, noting thatthe SINR can be expressed as a function of the MSE by (11)and that the BER can be expressed as a function of the SINR by(12) and (13), it suffices to focus on objective functions of theMSEs without loss of generality.

6Given an uncoded bit error probability of at most 10and using a propercoding scheme, coded bit error probabilities with acceptable low values such as10 can be obtained.

B. Optimum Transmit Matrix

To obtain the set of transmit matrices , we now considerthe minimization of an arbitrary objective function of the diag-onal elements of (8). As we now show, for Schur-concave andSchur-convex objective functions, the problem is scalarized andsimplified (see Fig. 4). In particular, the complicated nonconvexmatrix function is simplified into a setof simple decoupled scalar expressions. We first consider thesingle-carrier case and then extend the results to the multicar-rier case.

Theorem 1: Consider the following constrained optimizationproblem:

s.t. Tr

where matrix is the optimization variable,is the vector of diagonal elements of the MSE matrix

[the diagonal elements ofare assumed in decreasing order w.l.o.g.], is apositive semidefinite Hermitian matrix, and: isan arbitrary objective function (increasing in each variable). Itthen follows that there is an optimal solutionof at most rank

rank with the following structure.

• If is Schur-concave, then

(14)

where has as columns the eigenvectorsof corresponding to the largest eigenvalues in in-creasing order, and diaghas zero elements, except along the rightmost main diag-onal (which can be assumed real w.l.o.g.).

• If is Schur-convex, then

(15)

where and are defined as before, andis a unitary matrix such that

has identical diagonal elements. This rotation can be com-puted using the algorithm given in [34, Sect. IV-A] or withany rotation matrix that satisfies ,such as the discrete Fourier transform (DFT) matrix or theHadamard matrix (when the dimensions are appropriatesuch as a power of two [33, p. 66]).

Proof: See Appendix A.For the simple case in which only one symbol per carrier is

transmitted at each transmission, i.e., a single spatial eigenmodeis utilized, Theorem 1 simplifies, and the diagonal struc-

ture simply means that the spatial subchannel (eigenmode) withhighest gain is used [35], [18].

For Schur-concave objective functions, the global commu-nication process including pre- and post-processingis fully diagonalized [see Fig. 4(b)] as well as the MSE ma-trix . Among the established substreams, onlyare asso-ciated to nonzero channel eigenvalues, whereas the remainder

Page 8: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2388 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

(a)

(b)

(c)

Fig. 4. Scheme of a MIMO communication system with linear Tx-Rx processing. (a) Original matrix system. (b) Fully diagonalized system. (c) Diagonalized(up to a rotation) system.

are associated with zero eigenvalues. The globalcommunication process is7

or, equivalently

where diag , the ’s are the largesteigenvalues of in increasing order, the ’s represent theallocated power, and is a normalized equivalent white noise.The MSE matrix is , and thecorresponding MSEs are given by

MSE (16)

7Note that A = (HBB H + R ) HB = R HB(I +B H R HB) .

Similarly, the SINRs are given using (11) by

SINR (17)

Note that if rank (equivalently, ), then thesubstreams associated with zero eigenvalues have an MSE

equal to 1 or a zero SINR (which implies a BER equal to 0.5).Therefore, for Schur-concave objective functions, a communi-cation system should be designed such that rank inorder to have an acceptable performance.

For Schur-convex objective functions, the global communi-cation process including pre- and post-processing isdiagonalized only up to a very specific rotation of the data sym-bols [see Fig. 4(c)], and the MSE matrixis nondiagonal withequal diagonal elements (equal MSEs). In particular, assuminga pre-rotation of the data symbols at the transmitterand a post-rotation of the estimates at the receiver ,the same diagonalizing results of Schur-concave functions apply[see Fig. 4(c)]. Since the diagonal elements of the MSE matrix

are equal whenever the appropriate ro-tation is included, the MSEs are identical and given by

MSE Tr

(18)

Page 9: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2389

Similarly, the SINRs are given using (11) by

SINR (19)

Note that during the design process, the rotation matrix can beinitially ignored since the minimization can be based directlyon the MSE expression in (18). The rotation can be computedat a later stage of the design, as explained in Theorem 1. Ob-serve that for Schur-convex functions (unlike for Schur-con-cave ones), it is possible to have rank (equivalently,

) and still obtain an acceptable performance. This is be-cause the symbols are transmitted over thenonzero eigen-values in a distributed way (as opposed to the parallel and in-dependent transmission of the symbols for fully diagonalizedsystems).

In both cases of Schur-concave and Schur-convex objectivefunctions, the expressions of the MSEs have been scalarizedin the sense that the original complicated matrix expressionshave been reduced to simple scalar expressions [see (16) and(18)]. For Schur-concave functions, the specific power distri-bution among the established substreams will depend on theparticular objective function . Interestingly, for Schur-convexfunctions, the power distribution is independent of the specificchoice of since both the MSE expression in (18) and the ro-tation matrix to make the diagonal elements of the MSE matrixequal are independent of .

It is worth pointing out that there is a set of functions thatare both Schur-concave and Schur-convex, such as Tr. Suchfunctions happen to be invariant with respect to post-rotationsof and vice versa (this can be easily proved using the sameideas of the proof of Theorem 1).

Theorem 1 is easily extended to the multicarrier case as fol-lows. For any carrier , consider the matrices corresponding tothe rest of the carriers fixed, and Theorem 1 can be di-rectly invoked to show the optimal structure for .

V. JOINT TX-RX BEAMFORMING DESIGN: A CONVEX

OPTIMIZATION APPROACH

In this section, using the optimal receive matrix given by (7)and the unified framework obtained in Theorem 1, we systemat-ically consider a variety of design criteria. The potential of theproposed framework is made evident by showing that a greatvariety of interesting and appealing objective functions are ei-ther Schur-concave or Schur-convex, and thus, Theorem 1 canbe applied to scalarize and simplify the design. The aim of thissection is to express each problem in convex form so that thewell-developed body of literature on convex optimization theory[19]–[21] can be used to obtain optimal solutions very effi-ciently in practice using, for example, interior-point methods(c.f. Section II-A). In fact, it is possible in many cases to ob-tain simple closed-form solutions by means of the KKT opti-mality conditions that can be efficiently implemented in prac-tice (see [36] for simple practical implementation algorithmsderived from the KKT conditions and also for more design cri-teria).

Some of the considered design criteria have also been usedin [35] and [18] for the simple case of single beamforming. For

simplicity of notation, we define and .[Note that for Schur-concave functions with rank ,the substreams associated with zero eigenvalues aresimply ignored in the optimization process.]

A. MSE-Based Criteria

In the following, we optimize the MSEs by minimizing thearithmetic, geometric, and maximum means of the MSEs. Wealso show the equivalence of the minimization of the geometricmean, the minimization of the determinant of the MSE matrix,and the maximization of the mutual information.

1) Minimization of the ARITH-MSE:The minimization ofthe (weighted) arithmetic mean of the MSEs (ARITH-MSE)was considered in [13], [3], and [14]. We deal with the weightedversion, as was extended in [14] under the unified framework ofTheorem 1. The objective function is

MSE MSE (20)

Lemma 4: The function (assuming) is minimized when the weights are in increasing

order , and it is then a Schur-concave function.Proof: See Appendix B.

By Lemma 4, the objective function (20) is Schur-concave oneach carrier . Therefore, by Theorem 1, the diagonal structureis optimal, and the MSEs are given by (16). The problem inconvex form (the objective is convex and the constraints linear)is8

s.t. (21)

This particular problem can be solved very efficiently be-cause the solution has a nice water-filling interpretation (fromthe KKT optimality conditions):

(22)

where is thewater-levelchosen to satisfy the power con-straint with equality.

2) Minimization of the GEOM-MSE:The objective functioncorresponding to the minimization of the weighted geometricmean of the MSEs (GEOM-MSE) is

MSE MSE (23)

Lemma 5: The function (assuming) is minimized when the weights are in increasing

order , and it is then a Schur-concave function.Proof: See Appendix C.

8Note that it is not necessary to explicitly include the constraints corre-sponding to MSE � MSE in the convex problem formulation since anoptimal solution always satisfies them.

Page 10: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2390 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

By Lemma 5, the objective function (23) is Schur-concave oneach carrier . Therefore, by Theorem 1, the diagonal structureis optimal, and the MSEs are given by (16). The problem inconvex form (since the objective is log-convex, it is also convex[21]) is

s.t. (24)

This problem also has a water-filling solution (from the KKToptimality conditions):

(25)

where is the water-level chosen to satisfy the power con-straint with equality. Note that for , (25) becomes theclassical capacity-achieving water-filling solution9 [6], [2].

3) Minimization of : The minimization of the deter-minant of the MSE matrix was considered in [15]. We nowshow how this particular criterion is easily accommodated inour framework as a Schur-concave function of the diagonalelements of the MSE matrix . [For the carrier-noncooperativecase, simply consider the global MSE matrix defined as

diag .]Using the fact that , it follows that

is minimized for the choice of the receive matrix given by (7).From (8), it is clear that does not change if the transmit ma-trices ’s are post-multiplied by a unitary matrix (a rotation).Therefore, we can always choose a rotation matrix so thatisdiagonal without loss of optimality (as we already knew from[15]), and then

(26)

Therefore, the minimization of is equivalent to the mini-mization of the (unweighted) product of the MSEs as in Sec-tion V-A2.

4) Maximization of Mutual Information:The maximizationof the mutual information can be used to obtain a capacity-achieving solution [6]

(27)

where is the transmit covariance matrix. Using the fact thatand that [from (2)], the mu-

tual information can be expressed (see [29] for detailed connec-tions between the mutual information and the MSE matrix) as

(28)

and therefore, the maximization ofis equivalent to the mini-mization of treated in Section V-A3.

Hence, the minimization of the unweighted product of theMSEs, the minimization of the determinant of the MSE matrix,

9Under the constraint of using�L substreams on each carrierk.

and the maximization of the mutual information are all equiva-lent criteria with the solution given by a channel-diagonalizingstructure and the classical capacity-achieving water-filling forthe power allocation:

(29)

5) Minimization of the MAX-MSE:In general, the overallperformance (average BER) is dominated by the substream withhighest MSE. It makes sense then to minimize the maximum ofthe MSEs (MAX-MSE) [37]. The objective function is

MSE MSE (30)

Lemma 6: The function is a Schur-convex function.

Proof: See Appendix D.By Lemma 6, the objective function (30) is Schur-convex on

each carrier . Therefore, by Theorem 1, the optimal solutionhas a nondiagonal MSE matrix with equal diagonal elementsgiven by (18), which have to be minimized (scalarized problem).Recall that after minimizing the MSEs, we must still obtain theoptimal rotation matrices so that the diagonal elements of theMSE matrices ’s are identical. The scalarized problem inconvex form (the objective is linear and the constraints are allconvex) is

s.t.

(31)

This problem has a multilevel water-filling solution (from theKKT optimality conditions):

(32)

where are multiple water levels chosen to satisfy theconstraints on and the power constraint all with equality. Forthe case of single beamforming (i.e., ), the solution sim-plifies to

(33)

as was obtained in [35] and [18]. For the single-carrier case (ormulticarrier cooperative approach), problem (31) simplifies tothe minimization of the unweighted ARITH-MSE consideredin Section V-A1 with solution .

B. SINR-Based Criteria

In the following, we optimize the SINRs by maximizing thearithmetic, geometric, harmonic, and minimum means of theSINRs. We also consider the maximization of the product of the

Page 11: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2391

terms SINR and its connection to the capacity-achievingsolution. We can now define the objective function to minimizeas a function of the SINRs SINR MSE .Since, in Theorem 1, we assumed MSE MSE , theSINRs are in increasing order SINR SINR .

1) Maximization of the ARITH-SINR:The objective func-tion to be minimized for the maximization of the (weighted)arithmetic mean of the SINRs (ARITH-SINR) is

SINR SINR (34)

which can be expressed as a function of the MSEs using (11) as

MSE MSE

MSE (35)

Lemma 7: The function (as-suming ) is minimized when the weights arein increasing order , and it is then a Schur-concavefunction.

Proof: See Appendix E.By Lemma 7, the objective function (35) is Schur-concave on

each carrier . Therefore, by Theorem 1, the diagonal structureis optimal and the SINRs are given by (17). The problem ex-pressed in convex form (it is actually an LP since the objectiveand the constraints are all linear) is10

s.t. (36)

The optimal solution is to allocate all the available power tothe substream with maximum weighted gain (oth-erwise, the objective value could be increased by transferringpower from other substreams to this substream). Although thissolution indeed maximizes the weighted sum of the SINRs, itis a terrible solution in practice due to the extremely poor spec-tral efficiency (only one substream would be conveying infor-mation). This criterion gives a pathological solution and shouldnot be used.

2) Maximization of the GEOM-SINR:The objective func-tion to be minimized for the maximization of the (weighted)geometric mean of the SINRs (GEOM-SINR) is

SINR SINR (37)

which can be expressed as a function of the MSEs using (11) as

MSE MSE

MSE (38)

10Note that it is not necessary to explicitly include the constraints corre-sponding to SINR � SINR in the convex problem formulation sincean optimal solution always satisfies them.

Note that the maximization of the product of the SINRs is equiv-alent to the maximization of the sum of the SINRs expressed indecibels.

Lemma 8: The function (as-suming ) is minimized when the weightsare in increasing order , and it is then a Schur-con-cave function.

Proof: See Appendix F.By Lemma 8, the objective function (38) is Schur-concave on

each carrier , provided that MSE (this is a mildassumption since a MSE greater than 0.5 is unreasonable for apractical communication system). Therefore, by Theorem 1, thediagonal structure is optimal, and the SINRs are given by (17).The problem expressed in convex form (the weighted geometricmean is a concave function11 [20], [21]) is

s.t. (39)

where , and it is assumed that(otherwise, the problem has trivial solution

). The solution is easily obtained from the KKT optimalityconditions as

(40)

Particularizing for a uniform weighting , theproblem reduces to the maximization of the geometric meansubject to the arithmetic mean:

s.t. (41)

where . From the arithmetic-geometric meaninequality (with equality if and onlyif ) [31], it follows that the optimal solution is auniform power allocation

(42)

Note that the uniform power distribution is commonly used dueto its simplicity, e.g., [38].

3) Maximization of the HARM-SINR:The maximization ofthe harmonic mean of the SINRs (HARM-SINR) was consid-ered in [35] for the case of single beamforming. Using the uni-fied framework of Theorem 1, we can extend this result to the

11The concavity of the geometric mean is easily verified by showing that theHessian matrix is positive semidefinite for positive values of the arguments.The extension to include boundary points (points with zero-valued arguments)is straightforward either by using a continuity argument to show thatf(�x +(1 � �)y) � �f(x) + (1 � �)f(y) for 0 � � � 1 or by considering theepigraph of the function and using [19, Prop. 2.7.4].

Page 12: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2392 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

case of multiple beamforming. The objective function to be min-imized is

SINRSINR

(43)

which can be expressed as a function of the MSEs using (11) as

MSEMSE

MSE(44)

Lemma 9: The function (for) is a Schur-convex function.

Proof: See Appendix G.By Lemma 9, the objective function (44) is Schur-convex on

each carrier . Therefore, by Theorem 1, the optimal solutionhas a nondiagonal MSE matrix with equal diagonal ele-ments given by (18), which have to be minimized. The scalar-ized problem in convex form is

s.t.

(45)The problem has a multilevel water-filling solution

(46)

where are multiple water levels chosen to satisfythe lower constraints on the ’s and the power constraint,

all with equality, and also the constraint ,where is a positive parameter [36]. For the case ofsingle beamforming (i.e., ), the solution reduces to

[35]. For the single-carrier case(or multicarrier cooperative approach), the problem simplifiesto that considered in Section V-A1.

4) Maximization of the MIN-SINR:The objective functionto be minimized for the maximization of the minimum of theSINRs (MIN-SINR) is

SINR SINR (47)

This design criterion is equivalent to the minimization of themaximum MSE treated with detail in Section V-A5. In [14], thesame criterion was used, imposing a channel diagonal structure.

5) Maximization of the PROD-(1SINR): Consider for amoment the following maximization:

SINR (48)

Using the relation between the MSE and the SINR in (11), thismaximization can be equivalently expressed as the minimiza-tion of MSE as in (23) with , as the minimiza-tion of the determinant of the MSE matrix (Section V-A3), andas the maximization of the mutual information (Section V-A4)with the solution given by the capacity-achieving expression(29). This result is completely natural since maximizing the log-arithm of (48) is tantamount to maximizing the mutual informa-tion SINR .

C. BER-Based Criteria

Next, we consider that the minimization of the average BER(the minimization of the maximum of the BERs (MAX-BER)is equivalent to the maximization of the minimum of the SINRsand to the minimization of the maximum of the MSEs, providedthat the same constellations are used on all the substreams).

1) Minimization of the ARITH-BER:The minimizationof the average BER or of the arithmetic mean of the BER’s(ARITH-BER) can be considered as the best criterion (as-suming that after the linear processing at the receiver, eachsubstream is detected independently). In practice, multicarriercommunication systems use some type of coding over thecarriers and/or over different transmissions to reduce the BER(usually some orders of magnitude). The ultimate measure isthen the coded BER as opposed to the uncoded BER (obtainedwithout using any coding). However, the coded BER is stronglyrelated to the uncoded BER (in fact, for codes based on harddecisions, both quantities are strictly related). In such cases,it suffices to focus on the uncoded BER when designing theuncoded part of a communication system.

In [39], the minimization of the average BER (and also of theChernoff upper bound) is treated in detail when a diagonal struc-ture is imposed. This design criterion has been independentlyconsidered in [40] under a ZF constraint obtaining a nondiag-onal optimal MSE matrix (in agreement with our results). Theobjective function is

BER BER (49)

which can be expressed as a function of the MSEs using (11)and (12) and (13) as

MSE BER MSE (50)

Lemma 10: The function BER(assuming , for sufficiently small such thatBER , ) is a Schur-convex function.

Proof: See Appendix I.By Lemma 10, the objective function (50) is Schur-convex

on each carrier (assuming the same constellation/coding onall substreams of theth carrier), provided that BER

(interestingly, for BPSK and QPSK constellations, this istrue for any value of the BER). Therefore, by Theorem 1, theoptimal solution has a nondiagonal MSE matrix with equal

Page 13: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2393

diagonal elements given by (18), which have to be minimized.The scalarized problem in convex form is12

s.t.

(51)Note that we have explicitly included the upper boundonthe MSEs to guarantee the convexity of the BER function and,therefore, of the whole problem. For a general case withand , problem (51) does not have a simple closed-formsolution, and one has to resort to general-purpose iterativemethods such as interior-point methods (see Section II-A). Forthe single-carrier case (or multicarrier cooperative approach),the problem simplifies to the ARITH-MSE criterion consideredin Section V-A1 plus the rotation matrix to make the diagonalelements of the MSE matrix equal.

D. Remarks

Some observations are in order.

• Most of the presented methods under the framework ofconvex optimization theory have nice closed-form solu-tions that can be easily implemented in practice (see [36]for simple practical algorithms derived from the KKT op-timality conditions to implement the water-filling solu-tions).

• Method ARITH-BER is clearly the best in terms of av-erage BER and is therefore considered as a benchmark.For the carrier-noncooperative approach, it does not havea closed-form solution, and an iterative approach is nec-essary such as an interior-point method (see [21] for prac-tical implementation details). Interestingly, for the single-carrier and multicarrier-cooperative approaches, the solu-tion can be obtained in closed form, as mentioned below.

• Methods ARITH-MSE, HARM-SINR, and MAX-MSEhave very simple solutions and, as will be observed in thesimulations, perform really close to the benchmark givenby ARITH-BER. These methods should therefore be con-sidered for practical purposes.

• Two novel multilevel water-filling solutions have been ob-tained for the MAX-MSE and the HARM-SINR criteria(see [36] for practical implementation algorithms).

• Cooperation among carriers improves performancewithout significant increase on the complexity (eachcarrier can be diagonalized independently, and then, thelargest eigenmodes are selected).

• A striking result (as mentioned in Section IV) is thatfor single-carrier and multicarrier-cooperative systems,all criteria with Schur-convex objective functions (e.g.,

12We have implicitly assumed for each carrier the same constellation and codeon all the spatial eigenmodes.

MAX-MSE, HARM-SINR, MIN-SINR, ARITH-BER,13

and MAX-BER ) have the same optimal solution.Hence, the best performance (given by the ARITH-BERcriterion with a carrier-cooperative approach) has aclosed-form solution that can be obtained in practice withlow complexity using the simple water-filling solutionof the ARITH-MSE criterion in (22) plus the rotationmatrix.

• It is very common in the literature of equalization to in-clude a ZF constraint in the design. Such a constraint canbe easily introduced in the unified framework (see [36] fordetails), although the performance degrades due to the ad-ditional constraint.

VI. I NTRODUCINGADDITIONAL CONSTRAINTS

As explained in Section II, one of the nice properties of ex-pressing a problem in convex form is that additional constraintscan be added, as long as they are convex without affecting thesolvability of the problem. Of course, with the additional con-straints, the closed-form solutions previously obtained are notvalid any more.

A. Dynamic Range of Power Amplifier

We can easily add constraints on the dynamic range of thepower amplifier at each transmit antenna element, as was donein [22]. Consider a Schur-concave objective function and as-sume for simplicity . From the optimal structure in(14) , the total average transmitted power(in units of energy per symbol period) by theth antenna is

(52)

which is linear in the variables . (For the carrier-coop-erative scheme, .)Therefore, the following constraints are linear:

where and are the lower and upper bounds for theth an-tenna. Similarly, it is straightforward to set limits on the relativedynamic range of a single element in comparison with the totalpower for the whole array [22]:

where and are the relative bounds, andis the total power that is also linear in .

B. Peak-to-AverageRatio (PAR)

One of the main practical problems that OFDM systems faceis the PAR. Indeed, multicarrier signals exhibit Gaussian-like

13Recall that with carrier cooperation, the ARITH-BER and MAX-BERmethods require all spatial/carrier substreams to use the same constella-tion/coding scheme in order to be Schur convex.

Page 14: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2394 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

time-domain waveforms with relatively high PAR, i.e., they ex-hibit large amplitude spikes when several frequency compo-nents add in-phase. These spikes may have a serious impacton the design complexity and feasibility of the transceiver’sanalog front-end (i.e., high resolution of D/A-A/D convertersand power amplifiers with a linear behavior over a large dynam-ical range). In practice, the transmitted signal has to be clippedwhen it exceeds a certain threshold, and it has detrimental ef-fects on the BER. A variety of techniques have been devised todeal with the PAR [41], [42]. In this section, we show how thePAR can be taken into account into the design of the beamvec-tors using a convex optimization framework. Note that the al-ready-existing techniques to cope with the PAR and this ap-proach are not exclusive and can be simultaneously used.

The PAR is defined as

PAR (53)

where is the symbol period, is the zero-mean trans-mitted signal, and . Since the number of car-riers is usually large ( ), can be accurately modeledas a Gaussian random process (central-limit theorem) with zeromean and variance [41]. Using this assumption, the proba-bility that the PAR exceeds certain threshold or, equivalently,the probability that the instantaneous amplitude exceeds a clip-ping value is

Pr (54)

The clipping probability of an OFDM symbol is then [41]

(55)

In other words, in order to have a clipping probability lower thanwith respect to the maximum instantaneous amplitude ,

the average signal power must satisfy

(56)

When using multiple antennas for transmission, the previousequation has to be satisfied for all transmit antennas. Those con-straints can be easily incorporated in any of the convex designsderived in Section V with a Schur-concave objective function.Using (52), the constraint is

(57)

which is linear in the optimization variables . Such aconstraint has two effects in the solution: i) The power distri-bution over the carriers changes with respect to the distributionwithout the constraint, and ii) the total transmitted power dropsas necessary.

Fig. 5. (a) Power delay profile type C for HIPERLAN/2. (b) Envelopecorrelation matrices at the base station (BS) and at the mobile station (MS) intheNokiaenvironment.

VII. SIMULATION RESULTS

For the numerical results, we have chosen the European stan-dard HIPERLAN/2 for WLAN [9]. It is based on the multicar-rier modulation OFDM (64 carriers are used in the simulations).We consider multiple antennas at both the transmitter and the re-ceiver, obtaining, therefore, the multicarrier MIMO model usedthroughout the paper. Perfect CSI is assumed at both sides ofthe communication link.14

The frequency selectivity of the channel is modeled using thepower delay profile type C for HIPERLAN/2 as specified in [43][see Fig. 5(a)], which corresponds to a typical large open spaceindoor environment for non line of sight (NLOS) conditionswith 150 ns average r.m.s. delay spread and 1050 ns maximumdelay (the sampling period is 50 ns) [9]. The spatial correlationof the MIMO channel is modeled according to theNokiamodeldefined in [44] (which corresponds to a reception hall) speci-fied by the correlation matrices of the envelope of the channelfading at the transmit and receive side given in Fig. 5(b), wherethe base station is the receiver (uplink) (see [44] for details ofthe model). It provides a large open indoor environment withtwo floors, which could easily illustrate a conference hall or ashopping galleria scenario. The matrix channel generated wasnormalized so that . The SNR is definedas the transmitted power normalized with the noise variance.

For the numerical simulations, the following design cri-teria have been considered: ARITH-MSE, GEOM-MSE,MAX-MSE (equivalently, MIN-SINR or MAX-BER),GEOM-SINR, HARM-SINR, and ARITH-BER (benchmark).The utilization of the Chernoff upper bound instead of the exactBER function gives indistinguishable results and is thereforenot presented in the simulation results. Unless otherwise spec-ified, carrier-noncooperative approaches are considered. Theperformance is given in terms of outage BER (averaged over

14In practice, channel estimation errors exist, and it is therefore necessary toquantify the loss for each of the methods. Alternatively, it is possible to takechannel estimation errors into account in the design either from a worst-case orfrom a Bayesian perspective (c.f. [36]).

Page 15: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2395

Fig. 6. BER (at an outage probability of 5%) versus SNR when using QPSKin a 2� 2 MIMO channel withL = 1 for the GEOM-MSE, GEOM-SINR,ARITH-MSE, HARM-SINR, MAX-MSE, and ARITH-BER criteria (withoutcarrier cooperation).

the channel substreams), i.e., the BER that can be guaranteedwith some probability or, equivalently, the BER that is notachieved with some small outage probability. In particular, weconsider the BER with an outage probability of 5%. Note thatfor typical systems with delay constraints, the outage BER isa more realistic measure than the commonly used mean BERthat only makes sense when the transmission coding block islong enough to reveal the long-term ergodic properties of thefading process (no delay constraints).

Single Beamforming:First, we show some results whenusing a single channel spatial substream ( ). InFig. 6, the BER is plotted versus the SNR for a 22 MIMOchannel using QPSK constellations. Clearly, the ARITH-BERcriterion has the lowest BER because it was designed forthat. The MAX-MSE criterion performs really close to theARITH-BER and can be considered the second best criterion.The HARM-SINR, as well as the ARITH-MSE, performreasonably well (in fact, for values of the BER higher than10 , they outperform the MAX-MSE). The GEOM-MSE andthe GEOM-SINR criteria perform really badly in terms of BERand should not be used. In Fig. 7, the same results are shownfor a 4 2 MIMO channel using 16-QAM constellations, andthe same observations hold.

Therefore, the best criteria are (in order) ARITH-BER,MAX-MSE, HARM-SINR, and ARITH-MSE.

Including PAR Constraints:We now consider the introduc-tion of PAR constraints, as described in Section VI. We pa-rameterize the clipping amplitude with respect toas

to make the results independent of the total trans-mitted power . In Fig. 8, the probability of clipping alongwith the BER (when using QPSK constellations) is shown forthe ARITH-MSE criterion in a 2 2 MIMO channel both withPAR constraints ( ) and without them. In Fig. 8(a),the results are shown as a function of. It can be observedhow the design with the PAR constraints always has a clipping

Fig. 7. BER (at an outage probability of 5%) versus SNR when using 16-QAMin a 4� 2 MIMO channel (two transmit and four receive antennas) withL = 1

for the GEOM-MSE, GEOM-SINR, ARITH-MSE, HARM-SINR, MAX-MSE,and ARITH-BER criteria (without carrier cooperation).

probability no greater than the prespecified value 10, as ex-pected. The BER, however, can be severely affected if a very lowclipping probability is imposed due to power backoffs. FromFig. 8(a), a choice of seems reasonable. In Fig. 8(b), theresults are shown as a function of the SNR for . For thedesign with PAR constraints, the BER is slightly higher due tothe additional constraint. However, the system is guaranteed tohave a clipping probability of at most 10, unlike in the uncon-strained case, where nothing can be guaranteed. Recall that in apractical system, the final BER increases due to the clipping.

Multiple Beamforming:We now consider the simultaneoustransmission of more than one symbol per carrier, i.e., multiplebeamforming (we consider ).

In Fig. 9, the BER is plotted versus the SNR for a 44MIMO channel with using QPSK constellations. In gen-eral, similar observations hold, as for the single beamformingcase. However, it is worth pointing out that in this case, theHARM-SINR method performs much closer to the benchmarkthan the ARITH-MSE method. For higher values of the ratio

, the superiority of Schur-convex criteria withrespect to the Schur-concave methods (which have a channel-di-agonalizing structure) becomes more clear (c.f. [36]).

Carrier Cooperation: We now analyze the improvement inperformance when using cooperation among carriers for thebest methods: ARITH-MSE, HARM-SINR, MAX-MSE, andARITH-BER. Recall that with carrier cooperation, the HARM-SINR, MAX-MSE, and ARITH-BER criteria provide the samesolution since they are all Schur-convex functions (see Sec-tion V-D).

In Fig. 10, the BER is plotted versus the SNR with andwithout carrier cooperation for a 2 2 MIMO channel with

using QPSK constellations. In this case, carriercooperation gives an improvement of 0.5–2 dB. For highervalues of the ratio , the improvement is evenmore significant (c.f. [36]).

Page 16: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2396 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

Fig. 8. Probability of clipping and BER (at an outage probability of 5%) when using QPSK in a 2� 2 MIMO channel withL = 1 for the ARITH-MSE criterionwith and without PAR constraints (without carrier cooperation). (a) As a function of� (for SNR= 8 dB andP � 10 ). (b) As a function of the SNR (for� = 4 andP � 10 ).

Fig. 9. BER (at an outage probability of 5%) versus SNR when using QPSKin a 4� 4 MIMO channel withL = 2 for the GEOM-MSE, GEOM-SINR,ARITH-MSE, HARM-SINR, MAX-MSE, and ARITH-BER criteria (withoutcarrier cooperation).

VIII. C ONCLUSIONS

In this paper, we have formulated and solved the joint de-sign of transmit and receive multiple beamvectors or beam-ma-trices (also known as linear precoders and equalizers) for mul-ticarrier MIMO systems under a variety of design criteria. In-stead of considering each design criterion in a separate way, wehave developed a unifying framework that generalizes the ex-isting results by considering two families of objective functions

Fig. 10. BER (at an outage probability of 5%) versus SNR when using QPSKin a 2� 2 MIMO channel withL = 1 for the ARITH-MSE, HARM-SINR,MAX-MSE, and ARITH-BER criteria with (coop) and without (noncoop)carrier cooperation.

that embrace most reasonable criteria to design a communica-tion system: Schur-concave and Schur-convex functions. ForSchur-concave objective functions, the channel-diagonalizingstructure is always optimal, whereas for Schur-convex func-tions, an optimal solution diagonalizes the channel only aftera very specific rotation of the transmitted symbols.

Once the optimal structure of the communication process hasbeen obtained, the design problem has been formulated withinthe powerful framework of convex optimization theory, in which

Page 17: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2397

a great number of interesting design criteria can be easily ac-commodated and efficiently solved, even though closed-formexpressions may not exist. From this perspective, a variety ofdesign criteria have been analyzed, and in particular, optimalbeamvectors have been derived in the sense of having minimumaverage BER. It has been shown how to include additional con-straints on the design to control the dynamic range of the trans-mitted signal and the PAR. We have also considered the moregeneral scheme in which cooperation among different carriersis allowed to improve performance. We have obtained two mul-tilevel water-filling practical solutions to optimize the spatialsubstreams at all carriers (one minimizes the maximum MSE,and the other maximizes the harmonic mean of the SINRs) thatperform very close to the optimal solution in terms of averageBER with a low implementation complexity. Interestingly, forcarrier-cooperative schemes, it turns out that exact optimal so-lution in terms of minimizing the average BER can be obtainedin closed form.

APPENDIX APROOF OFTHEOREM 1

We first present a couple of lemmas and then proceed to theproof of Theorem 1.

Lemma 11 [17, 9.H.1.h]:If and are positivesemidefinite Hermitian matrices, then

Tr

where and are the eigenvalues of and , respec-tively, in decreasing order.

Lemma 12: Given a matrix and a positivesemidefinite Hermitian matrix such that

is a diagonal matrix with diagonal elements inincreasing order (possibly with some zero diagonal elements),it is always possible to find another matrix of the form

of at most rank rank thatsatisfies with Tr Tr ,where has as columns the eigenvectors ofcorresponding to the largest eigenvalues in increasing order,and diag has zeroelements, except along the right-most main diagonal (whichcan be assumed real w.l.o.g.).

Proof: Although the basic idea follows easily from the ap-plication of Lemma 11, the formal proof for arbitrary valuesof , , and rank becomes notationally involved. Since

is diagonal with diagonal elements in in-creasing order, we can write

where is a square diagonal matrix (with real diagonalelements) of dimension rank . Using thesingular value decomposition (SVD) [31], we can then write

, where is a unitary matrixwhose columns are the left singular vectors, the right singularvectors (eigenvectors of ) are the canonical vectors, and

matrix is composed of zero elements and containsin its top-right block so that . In particular,

if , then and , and if ,then (note that the position of matrixwithin is different from the classical definition of the SVD[31] because the diagonal elements of are assumedin increasing order). Assuming that matrix is nonsingularwith eigendecomposition given by , we canwrite

(58)

In case that is singular, clearly, must be orthogonal tothe null space of ; otherwise, this component could be madezero without changing the value of and decreasingTr . Knowing that must be orthogonal to the null spaceof , (58) is still valid using the pseudo-inverse of insteadof the inverse.

The idea now is to find another matrix by changing theunitary matrix in (58) with the lowest possible value ofTr (note that any matrix obtained from (58) satisfiesby definition the desired constraint ). UsingLemma 11, Tr can be lower bounded as follows:

Tr Tr

where , is the th diagonal element of in in-creasing order, and are the largest eigenvaluesof in increasing order. If the s are different, the lowerbound is achieved by matrix being a permutation matrix.For subsets of equal ’s, the corresponding subblock in canbe any rotation matrix. Therefore, if is chosen as a permu-tation matrix that selects the largest eigenvalues ofin the same ordering as the’s, the lower bound is achieved.From (58), we obtain that the optimal [in the sense of min-imizing the value of Tr ] has at most rank and is ofthe form , where

has as columns the eigenvectors of correspondingto the largest eigenvalues in increasing order, and

diag has zero elements, except along theright-most main diagonal (which can be assumed real w.l.o.g.).

Proof of Theorem 1:The constrained optimizationproblem to be solved is

s.t. Tr

where is the vector of diagonal elements of the MSE ma-trix . It is mathematically conve-nient to assume the diagonal elements of in decreasingorder, i.e., . Interestingly, this is without lossof generality. In fact, most reasonable objective functions (inparticular, all the objective functions considered in Section V)have a fixed preferred ordering of the arguments, i.e., the valueof the function is minimized with a very specific ordering of the

Page 18: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2398 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

arguments. In such cases, it suffices to relabel the argumentsso that the preferred ordering is decreasing. In a more generalcase, however, a function may not have a fixed preferred or-dering since it may depend on the specific value of the argu-ments. In such a case, since we are interested in minimizingthe objective function, we can define and use instead the func-tion , where is a permutation ma-trix, and is the set of the different permutation matrices.The original minimization of without the ordering constraintis equivalent to the minimization of with the ordering con-straint. Therefore, we can always assume that the function tobe minimized has been properly defined so that the orderingconstraint can be included without loss of generality (c.f. Sec-tion V).

If is Schur-concave, it follows by Lemma 1 that, where is the vector of eigen-

values of in decreasing order. The lower boundis achieved if matrix is diagonalwith diagonal elements in decreasing order or, equivalently,if is diagonal with diagonal elements in increasingorder. Furthermore, for any given, one can always find arotation matrix so that becomes diagonaland use instead the transmit matrix , improving theperformance (by this rotation, the utilized power remains thesame). This implies that for Schur-concave functions, there isan optimal with a structure such that is diagonalwith diagonal elements in increasing order.

If is Schur-convex, the opposite happens. By Lemma 2,it follows that is minimized when has equal diag-onal elements. Furthermore, for any given, one can alwaysfind a rotation matrix so that hasidentical diagonal elements by Lemma 3 (the sum of diagonalelements of remains the same regardless of) and use in-stead the transmit matrix , improving the performance(the transmit power remains the same). Therefore, for an op-timal , we have that Tr

. Interestingly, regardless of the specific function, the optimal can be found by first minimizing Tr

(without imposing the constraint that the diagonalelements be equal) and then including the rotation to make thediagonal elements identical. The rotation can be found using thealgorithm given in [34, Sect. IV-A] or with any rotation matrix

that satisfies , , such as the DFT matrixor the Hadamard matrix (when the dimensions are appropriatesuch as a power of two [33, p. 66]). Regarding the minimizationof Tr , since it is a Schur-concave function ofthe diagonal elements , the previous resultcan be applied to show that there is an optimal(excluding forthe moment the rotation) such that is diagonal withdiagonal elements in increasing order.

Given that is diagonal, it follows from Lemma12 that has at most rank rank andcan be written as , wherehas as columns the eigenvectors of correspondingto the largest eigenvalues in increasing order, and

diag has zero elements,except along the right-most main diagonal.

Hence, we can write an optimal as

for Schur-concave

for Schur-convex

where and are defined as before, and isthe rotation to make the diagonal elements ofidentical.

APPENDIX BPROOF OFLEMMA 4

Given a set of ’s in decreasing order , the func-tion is minimized with the weights in in-creasing order . To show this, assume for a momentthat for ( ), the weights are such that . Wenow show that the term can be minimized bysimply swapping the weights:

To prove that is Schur-concave, define, where . Function is Schur-convex

because whenever [17, 3.H.2]. There-fore, is Schur-concave (see Definition 3).

APPENDIX CPROOF OFLEMMA 5

Given a set of strictly positive ’s in decreasing order, the function is minimized with the

weights in increasing order . To show this, assumefor a moment that for ( ), the weights are suchthat . We now show that the term can beminimized by simply swapping the weights:

To prove that is Schur-concave, define, where . Function

is Schur-convex because whenever[17, 3.H.2]. Since and function is de-creasing in , is Schur-concave [17, 3.B.1].

APPENDIX DPROOF OFLEMMA 6

From Definition 1, it follows that .If , it must be that (from Definition 2), andtherefore, . This means that is Schur-convexby Definition 3.

APPENDIX EPROOF OFLEMMA 7

Since the ’s are strictly positive and in decreasing order, the function

Page 19: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2399

is minimized with the weights in increasing order(this can be similarly proved as was done in the proof of Lemma4).

To show that is Schur-concave, define, where . Since

whenever , it follows that is Schur-convex [17, 3.H.2],and therefore, is Schur-concave (see Definition 3).

APPENDIX FPROOF OFLEMMA 8

Given a set of strictly positive ’s in decreasing order, the function is minimized

with the weights in increasing order (this can besimilarly proved, as was done in the proof of Lemma 4).

To show that is Schur-concave provided that ,define , where

. Function is Schur-convex becausewhenever 15 [17, 3.H.2]. Sinceand function is decreasing in , is Schur-con-

cave [17, 3.B.1].

APPENDIX GPROOF OFLEMMA 9

To prove that the function is Schur-convex, rewrite it as where

. Since is convex, it follows that is Schur-convex [17,3.H.2].

APPENDIX HANALYSIS OF THE CONVEXITY OF THE BER

In this Appendix, we show that the BER and the cor-responding Chernoff upper bound are convex decreasingfunctions of the SINR and convex increasing functions of theMSE (for sufficiently small values of the MSE).

BER as a Function of the SINR:To prove that the BER isconvex decreasing in the SINR, it suffices to show that the firstand second derivatives of are negative and positive,respectively (note that a positive scaling factor preserves mono-tonicity and convexity):

The same can be done for the Chernoff upper bound :

15Function(1� x)x is increasing inx for 0 � x � 0:5.

BER as a Function of the MSE:To prove that the BER isconvex increasing in the MSE, it suffices to show that the firstand second derivatives are both positive:

where the zeros are

and

(it has been tacitly assumed that ). It is remarkable thatfor , both zeros coincide, which means that the BER func-tion is convex for the whole range of MSE values. To be morespecific, BPSK and QPSK constellations satisfy this condition,and consequently, their corresponding BER function is alwaysconvex in the MSE.

The same can be done for the Chernoff upper bound:

Therefore, the Chernoff upper bound is convex increasing in theMSE for MSE .

Concluding, as a rule of thumb, the BER and the Chernoffupper bound are convex increasing in the MSE for a BER

(see [36] for more details).

APPENDIX IPROOF OFLEMMA 10 BER

To prove that the function BER isSchur-convex for (for sufficiently small such thatBER ), write , where

BER . Since function is convex within therange (see Appendix H), it follows that is Schur-convex[17, 3.H.2].

ACKNOWLEDGMENT

The authors wish to thank M. Bengtsson for reading the paperand for his helpful comments. They would also like to thank theanonymous reviewers for their comments on the paper.

Page 20: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

2400 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 9, SEPTEMBER 2003

REFERENCES

[1] M. L. Honig, K. Steiglitz, and B. Gopinath, “Multichannel signal pro-cessing for data communications in the presence of crosstalk,”IEEETrans. Commun., vol. 38, pp. 551–558, Apr. 1990.

[2] G. G. Raleigh and J. M. Cioffi, “Spatio-temporal coding for wirelesscommunication,”IEEE Trans. Commun., vol. 46, pp. 357–366, Mar.1998.

[3] A. Scaglione, G. B. Giannakis, and S. Barbarossa, “Redundant filterbankprecoders and equalizers Part I: Unification and optimal designs,”IEEETrans. Signal Processing, vol. 47, pp. 1988–2006, July 1999.

[4] I. E. Telatar, “Capacity of multi-antenna Gaussian channels,”Eur. Trans.Telecommun., vol. 10, no. 6, pp. 585–595, Nov.–Dec. 1999. See also aprevious version of the paper inAT&T Bell Labs Internal Tech. Memo,June 1995.

[5] G. Foschini and M. Gans, “On limits of wireless communications ina fading environment when using multiple antennas,”Wireless Pers.Commun., vol. 6, pp. 311–335, 1998.

[6] T. M. Cover and J. A. Thomas,Elements of Information Theory. NewYork: Wiley, 1991.

[7] A. Scaglione, S. Barbarossa, and G. B. Giannakis, “Filterbank trans-ceivers optimizing information rate in block transmissions over disper-sive channels,”IEEE Trans. Inform. Theory, vol. 45, pp. 1019–1032,Apr. 1999.

[8] Z. Wang and G. B. Giannakis, “Wireless multicarrier communications:Where Fourier meets Shannon,”IEEE Signal Processing Mag., vol. 17,pp. 29–48, May 2000.

[9] ETSI, “Broadband radio access networks (BRAN); HIPERLAN type 2;physical (PHY) layer,”, ETSI TS 101 475 V1.2.2, pp 1–41, Feb. 2001.

[10] Part 11: Wireless LAN medium access control (MAC) and physical layer(PHY), IEEE Std 802.11a, Dec. 1999.

[11] K. H. Lee and D. P. Petersen, “Optimal linear coding for vector chan-nels,”IEEE Trans. Commun., vol. COM-24, pp. 1283–1290, Dec. 1976.

[12] J. Salz, “Digital transmission over cross-coupled linear channels,”At&TTech. J., vol. 64, no. 6, pp. 1147–1159, July–Aug. 1985.

[13] J. Yang and S. Roy, “On joint transmitter and receiver optimization formultiple-input-multiple-output (MIMO) transmission systems,”IEEETrans. Commun., vol. 42, pp. 3221–3231, Dec. 1994.

[14] H. Sampath, P. Stoica, and A. Paulraj, “Generalized linear precoder anddecoder design for MIMO channels using the weighted MMSE crite-rion,” IEEE Trans. Commun., vol. 49, pp. 2198–2206, Dec. 2001.

[15] J. Yang and S. Roy, “Joint transmitter-receiver optimization for multi-input multi-output systems with decision feedback,”IEEE Trans. In-form. Theory, vol. 40, pp. 1334–1347, Sept. 1994.

[16] A. Scaglione, P. Stoica, S. Barbarossa, G. B. Giannakis, and H. Sampath,“Optimal designs for space-time linear precoders and decoders,”IEEETrans. Signal Processing, vol. 50, pp. 1051–1064, May 2002.

[17] A. W. Marshall and I. Olkin,Inequalities: Theory of Majorization andIts Applications. New York: Academic, 1979.

[18] D. P. Palomar, J. M. Cioffi, M. A. Lagunas, and A. P. Iserte, “Convexoptimization theory applied to joint beamforming design in multicarrierMIMO channels,” inProc. IEEE Int. Conf. Commun., Anchorage, AK,May 11–15, 2003.

[19] D. G. Luenberger,Optimization by Vector Space Methods. New York:Wiley, 1969.

[20] R. T. Rockafellar,Convex Analysis, 2nd ed. Princeton, NJ: PrincetonUniv. Press, 1970.

[21] S. Boyd and L. Vandenberghe. (2000) Introduction to convex optimiza-tion with engineering applications. Stanford Univ., Stanford, CA, CourseNotes. [Online]. Available: http://www.stanford.edu/class/ee364.

[22] M. Bengtsson and B. Ottersten, “Optimal and suboptimal transmit beam-forming,” in Handbook of Antennas in Wireless Communications, L. C.Godara, Ed. Boca Raton, FL: CRC, 2001.

[23] S.-P. Wu, S. Boyd, and L. Vandenberghe, “FIR filter design via semidef-inite programming and spectral factorization,” inProc. IEEE Conf. De-cision Contr., Dec. 1996, pp. 271–276.

[24] H. Lebret and S. Boyd, “Antenna array pattern synthesis via convex op-timization,” IEEE Trans. Signal Processing, vol. 45, pp. 526–532, Mar.1997.

[25] S. Kandukuri and S. Boyd, “Optimal power control in interference-lim-ited fading wireless channels with outage-probability specifications,”IEEE Trans. Wireless Commun., vol. 1, pp. 46–55, Jan. 2002.

[26] Y. Nesterov and A. Nemirovsky, “Interior-point polynomial methods forconvex programming,”SIAM Studies Appl. Math., vol. 13, 1994.

[27] G. J. Foschini, “Layered space-time architecture for wireless commu-nication in a fading environment when using multi-element antennas,”Bell Labs Tech. J., vol. 1, no. 2, pp. 41–59, Autumn 1996.

[28] H. Bölcskei and A. J. Paulraj, “Multiple-input multiple-output (MIMO)wireless systems,” inThe Communications Handbook, 2nd ed, J.Gibson, Ed. Boca Raton, FL: CRC, 2002, pp. 90.1–90.14.

[29] J. M. Cioffi and G. D. Forney, “Generalized decision-feedback equaliza-tion for packet transmission with ISI and Gaussian noise,” inCommu-nications, Computation, Control and Signal Processing, A. Paulraj, V.Roychowdhury, and C. D. Schaper, Eds. Boston, MA: Kluwer, 1997,ch. 4.

[30] S. M. Kay,Fundamentals of Statistical Signal Processing: EstimationTheory. Englewood Cliffs, NJ: Prentice-Hall, 1993.

[31] R. A. Horn and C. R. Johnson,Matrix Analysis. New York: CambridgeUniv. Press, 1985.

[32] J. G. Proakis,Digital Communications, 3rd ed. New York: McGraw-Hill, 1995.

[33] S. Verdú,Multiuser Detection. New York: Cambridge Univ. Press,1998.

[34] P. Viswanath and V. Anantharam, “Optimal sequences and sum capacityof synchronous CDMA systems,”IEEE Trans. Inform. Theory, vol. 45,pp. 1984–1991, Sept. 1999.

[35] A. P. Iserte, A. I. Perez-Neira, and M. A. Lagunas, “Joint beamformingstrategies in OFDM-MIMO systems,” inProc. IEEE Int. Conf. Acoust.,Speech, Signal Process., Orlando, FL, May 2002.

[36] D. P. Palomar, “A unified framework for communications throughMIMO channels,” Ph.D. dissertation, Technical Univ. Catalonia (UPC),Barcelona, Spain, May. 2003.

[37] D. P. Palomar, M. A. Lagunas, and J. M. Cioffi, “On the optimal structureof transmit-receive linear processing for MIMO systems,” inProc. 40thAnnu. Allerton Conf. Commun., Contr., Comput.. Monticello, IL, Oct.2–4, 2002.

[38] K. K. Wong, R. S. K. Cheng, K. B. Letaief, and R. D. Murch, “Adaptiveantennas at the mobile and base stations in an OFDM/TDMA system,”IEEE Trans. Commun., vol. 49, pp. 195–206, Jan. 2001.

[39] E. N. Onggosanusi, B. D. Van Veen, and A. M. Sayeed, “Efficientsignaling schemes for wideband space-time wireless channels usingchannel state information,”IEEE Trans. Veh. Technol., vol. 52, pp.1–13, Jan. 2003.

[40] Y. Ding, T. N. Davidson, J.-K. Zhang, Z.-Q. Luo, and K. M. Wong, “Min-imum BER precoders for zero-forcing equalization,” inProc. IEEE Int.Conf. Acoust., Speech, Signal Process., Orlando, FL, May 2002.

[41] D. J. G. Mestdagh and P. M. P. Spruyt, “A method to reduce the prob-ability of clipping in DMT-based transceivers,”IEEE Trans. Commun.,vol. 44, pp. 1234–1238, Oct. 1996.

[42] J. Tellado and J. M. Cioffi, “Efficient algorithms for reducing PAR inmulticarrier systems,” inProc. IEEE Int. Symp. Inform. Theory, Cam-bridge, MA, Aug. 1998.

[43] ETSI, “Channel models for HIPERLAN/2 in different indoor sce-narios,”, ETSI EP BRAN 3ERI085B, pp. 1–8, Mar. 1998.

[44] L. Schumacher, J. P. Kermoal, F. Frederiksen, K. I. Pedersen, A. Al-gans, and P. E. Mogensen. (2001, Feb.) MIMO channel characterization.[Online]. Available: http://www.ist-metra.org, Deliverable D2 V1.1 ofIST-1999-11 729 METRA project, pp. 1–57.

Daniel Pérez Palomar (S’99) was born inBarcelona, Spain, in 1975. He received the electricalengineering degree from the Technical Universityof Catalonia (UPC), Barcelona, in 1998. In 1999,he was recipient of a Ministry of Education grant tocomplete the Ph.D. thesis and joined the Departmentof Signal Theory and Communications at UPC as aResearch Assistant, where he is currently pursuingthe Ph.D. degree in electrical engineering.

During 1998, he stayed with the Department ofElectronic Engineering, King’s College London

(KCL), London, U.K. From 1999 to 2000, he participated in the Europeanprojects ACTS-SUNBEAM and IST-METRA. From April to November2001, he held a visiting research appointment at the Department of ElectricalEngineering, Stanford University, Stanford, CA. From January to December2002, he was a visiting researcher with the Telecommunications TechnologicalCenter of Catalonia (CTTC), Barcelona. His primary research interests includestatistical array signal processing, information-theoretic and communicationaspects of wireless MIMO channels, convex optimization theory, majoriza-tion theory, DMT transmission, multiuser detection, and spread spectrumtechniques.

Page 21: Joint tx-rx beamforming design for multicarrier mimo channels: a ...€¦ · M. A. Lagunas is with the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08034 Barcelona,

PÉREZ PALOMARet al.: JOINT TX-RX BEAMFORMING DESIGN 2401

John M. Cioffi (S’77–M’78–SM’90–F’96) receivedthe B.S.E.E. degree in 1978 from the University ofIllinois, Urbana-Champaign, and the Ph.D.E.E. de-gree in 1984 from Stanford University, Stanford, CA.

He was with Bell Laboratories, Holmdel, NJ, from1978 to 1984; IBM Research, San Jose, CA, from1984 to 1986; and has been a Professor of electricalengineering at Stanford since 1986. He foundedAmati Communications Corp. (which was purchasedby Texas Instrumnets in 1997), in 1991 and wasofficer/director from 1991 to 1997. He is currently

on the Board of Directors of Marvell, IteX, Teknovus, Clariphy, and Teranetics.He is on the advisory boards of Actelis, Ikanos, Charter Ventures, and PortviewVentures and is a member of the U.S. National Research Council’s CSTB. Hisspecific interests are in the area of high-performance digital transmission. Hehas published over 200 papers and holds over 40 patents.

Dr. Cioffi has held the Hitachi America Professorship in Electrical Engi-neering since 2002 and has been a Member of the National Academy of En-gineering since 2001. He received the IEEE Kobayashi Medal in 2001, theIEEE Millennium Medal in 2000, the IEE J. J. Tomson Medal in 2000, wasthe 1999 University of Illinois Outstanding Alumnus, received the 1991 IEEECOMMUNICATIONS MAGAZINE best paper award, received the ANSI T1 Out-standing Achievement Award in 1995, and was an NSF Presidential Investigatorfrom 1987 to 1992.

Miguel Angel Lagunas (S’73–M’78–SM’89–F’97)was born in Madrid, Spain, in 1951. He receivedthe Telecommunications Engineer degree fromthe Polytechnic University of Madrid (UPM) in1973 and the Ph.D. degree in telecommunicationsfrom the Technical University of Catalonia (UPC),Barcelona, Spain.

From 1971 to 1973, he was a Research Assistant,from 1973 to 1979, a Teacher Assistant in NetworkSynthesis and Semiconductor Electronics, from 1979to 1982, an Associate Professor of digital signal pro-

cessing, and since 1983, Full Professor, teaching courses in signal processing,array processing, and digital commincations, all at the Semiconductor Lab.,ETSIT, Madrid. He was Project Leader of high-speed SCMA from 1987 to 1989and ATM cable networks from 1994 to 1995. He was co-director of the firstprojects for the European Spatial Agency and the European Union, providingengineering demonstration models on smart antennas for satellite communica-tions using DS and FH systems (in 1986) and mobile communications GSM(Tsunami, in 1994). His research is devoted to spectral estimation, adaptive sys-tems, and array processing. His technical activities are in advanced front-endsfor digital communications combining spatial with frequency-time and codingdiversity. He was Vice President for Research at UPC from 1986 to 1989 andVice Secretary General for Research at CICYT from 1995 to 1996. Currently,he is Director of the Telecommunications Technological Center of Catalonia(CTTC), Barcelona.

Dr. Lagunas is a member-at-large of Eurasip. He is an elected member ofthe Academy of Engineers of Spain and of the Academy of Science and Art ofBarcelona. He was a Fullbright scholar at the University of Colorado, Boulder.


Recommended