Date post: | 16-Nov-2023 |
Category: |
Documents |
Upload: | independent |
View: | 0 times |
Download: | 0 times |
IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009 3339
On the Average Rate Performance ofHybrid-ARQ in Quasi-Static Fading Channels
Cong Shen, Student Member, IEEE, Tie Liu, Member, IEEE, and Michael P. Fitz, Senior Member, IEEE
AbstractβThe problem of efficient communication over ascalar quasi-static fading channel is considered. The single-layertransmission (SLT) and multi-layer transmission (MLT) schemesdo not require any knowledge of the channel state information(CSI) at the transmitter, but their performance is also limited.It is shown that using Hybrid-ARQ (HARQ) can significantlyimprove the average rate performance, provided that the rateassignment between different ARQ rounds is carefully chosen.The average rate performance of several HARQ schemes isoptimized and compared. In addition, optimal power allocationamong retransmissions is derived and shown to further increasethe average rate. This power allocation gain is remarkable at lowsignal-to-noise ratio (SNR), but becomes negligible at high SNR.Comparison of two different types of limited feedback, sequentialfeedback (ARQ) and one-shot feedback (quantized CSI), is madefrom several perspectives. Although the optimization problem isformed with respect to the average rate, simulation results givea comprehensive comparison under different metrics, includingaverage rate, outage probability, and the combination of both.Substantial performance improvement is observed with even oneARQ retransmission in all simulations. More importantly, thisgain appears to be robust with respect to the fading distributions.
Index TermsβHybrid-ARQ (HARQ), incremental redundancy,fading channels, throughput, channel state information (CSI).
I. INTRODUCTION
THE problem studied in this paper is how to efficientlytransmit information over a quasi-static wireless fading
channel, where the channel gain is constant during one co-herence block and changes independently from one block toanother. It is assumed that the receiver can perfectly trackthe fading process. Depending on whether the transmitterknows about the instantaneous channel realization, differentperformance measures have been studied. If the transmitterhas no knowledge of the channel realization other than thestatistical characterization, the Shannon capacity is zero as
Paper approved by L. Rasmussen, the Editor for Iterative Detection, De-coding, and ARQ of the IEEE Communications Society. Manuscript receivedFebruary 19, 2008; revised October 22, 2008 and April 27, 2009.
C. Shen is with the Department of Electrical Engineering, University ofCalifornia, Los Angeles (UCLA), Los Angeles, CA 90095, USA (e-mail:[email protected]).
T. Liu is with the Department of Electrical and Computer Engineer-ing, Texas A&M University, College Station, TX 77843, USA (e-mail:[email protected]).
M. P. Fitz was with the Department of Electrical Engineering, Universityof California, Los Angeles (UCLA), Los Angeles, CA 90095, USA. He isnow with Northrop Grumman Space Technology, Redondo Beach, CA 90278,USA (e-mail: [email protected]).
The work of Cong Shen and Michael P. Fitz is supported by NSF grantCCF-0431196, and by ST Microelectronics with a matching grant from theUniversity of California Discovery Program. Part of this work was done whileCong Shen was visiting Texas A&M University.
Digital Object Identifier 10.1109/TCOMM.2009.11.080072
there is always a nonzero probability that the channel is indeep fade. A useful and well-accepted performance metric isthe outage capacity [1]. In this formulation, a fixed-rate chan-nel code is used, and the information is reliably transmittedif the instantaneous channel gain supports the predeterminedtransmission rate. Otherwise, an outage is declared, and noinformation can be recovered at the receiver. Quasi-staticfading with channel state information (CSI) available only atthe receiver (CSIR) is a prime example of the detrimentaleffect of fading.
Further improving the performance requires an opportunis-tic view of fading: a well-designed system should be ableto adapt to the channel variations, i.e., it sends some in-formation across the channel when the channel is not-so-good and a lot of information when the channel is verygood. By exploiting the βgoodβ channel realizations, the long-term throughput can be substantially improved. However,adapting to channel fading without transmitterβs knowledgeof CSI faces some conceptual difficulties. The breakthroughwas made in [2], where the author observed the similaritybetween communication over quasi-static fading channels andbroadcasting to multiple users. This venue was later pursuedin [3], [4], and [5], leading to the development of a multi-layer transmission (MLT) strategy which utilizes broadcastsuperposition coding. By organizing information into layers,the MLT strategy allows rate adaptation to channel fading atthe expense of creating self interference during the decoding ofthe earlier-decoded layers in the stack. Furthermore, dependingon the actual channel realization, it may occur that only partof the information can be decoded at the receiver, which maylead to some extra complications in the upper layers in thenetwork hierarchy.
This work follows the same line of examining the long-termthroughput performance of communication over quasi-staticwireless fading channels [3]β[5]. Both single-layer transmis-sion (SLT) and MLT are first briefly discussed. It is then shownthat with the use of Hybrid-ARQ (HARQ), the average rateperformance can be dramatically improved, provided that therate assignment of the HARQ protocol is optimally designed.This scheme is also referred as Rate-Optimized HARQ (RO-HARQ). The basic idea is to exploit the existence of ARQin the data link layer to increase the average rate. Roughlyspeaking, the initial transmission is set to be very aggressive(high rate). If the channel does not support this high rate,an ARQ will help by indicating the transmitter to reduce therate. The average rate maximization problem of RO-HARQ isformulated, and numerical results demonstrate the remarkablegain over both SLT and MLT strategies. Moreover, this gain
0090-6778/09$25.00 cβ 2009 IEEE
3340 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009
appears to be robust to the fading distributions. Further averagerate increase is possible, especially in the low signal-to-noise ratio (SNR) regime, if power allocation among differentretransmissions is performed. To comprehensively comparethese schemes, numerical optimization and simulations areperformed with respect to different performance measures,including average rate, outage probability, and the combinationof both.
The idea of using ARQ to improve communication perfor-mance is not new. In fact, HARQ techniques are widely usedin most of the contemporary digital communication systems. Agood summary of the progress of HARQ schemes is presentedin [6]. Early work regarding the HARQ system focuses on theusage of algebraic error-correction and error-detection codes[7]. Recent interests of HARQ mainly originate from the rapidprogress of wireless communications, where high-rate reliabletransmission faces the challenge of severe channel fluctuations.Throughput and scheduling optimization of downlink packetdata systems are investigated in [8]. An information-theoreticthroughput and delay analysis of several HARQ schemes inthe Gaussian collision channel is reported in [9]. Throughputanalysis of incremental redundancy HARQ in the block-fadingadditive white Gaussian noise (AWGN) channel is carriedout in [10]. A general framework of diversity-multiplexing-delay tradeoff is proposed in [11] to study multiple-inputmultiple-output (MIMO) ARQ block fading channels. Later[12] extends this framework to incorporate discrete inputdistributions. From the practical implementation point of view,research interests have shifted from traditional algebraic linearblock codes to the more powerful capacity-approaching mod-ern codes. For example, the problem of designing low-densityparity-check (LDPC) codes for the HARQ protocol has beenaddressed in [10], [13]β[17].
The rest of this paper is organized as follows. Section II de-fines the system model. Section III briefly discusses the aver-age rate performance of SLT and MLT. Section IV presents thetheoretical analysis of RO-HARQ. Average rate maximizationis discussed in Section IV-C and IV-D, followed by the optimalpower allocation in Section IV-E. Numerical comparison withseveral different performance metrics is reported in SectionV. Section VI discusses the difference between sequentialfeedback and one-shot CSI feedback. Finally, Section VIIconcludes the paper and points out possible directions forfuture work.
II. SYSTEM MODEL
We consider a scalar quasi-static fading channel where therandom channel gain β remains constant for a duration of ππsymbol times and then changes independently to another valueaccording to the fading distribution. The value ππ is generallydetermined by the channel coherence time. The signal modelcan be written as
π¦[π] = βπ₯[π] + π§[π], π = 1, β β β , πΏ (1)
where {π₯[π],π = 1, β β β , πΏ} is a length-πΏ codeword con-taining πΎ information nats1, and π§[π] is independent and
1The information unit is nat throughout the paper, except for the numericalresults.
identically distributed (i.i.d.) complex Gaussian noise withzero mean and variance π0 (denoted as ππ© (0, π0)). Weassume ππ β« πΏ such that the transmission of πΎ informationnats only experiences one fading state. This is the worst casesince no time diversity can be exploited. There is a short-termaverage power constraint of π on {π₯[π]}, which prohibitspower allocation across different fading states2. Since eachtransmission experiences an AWGN channel, our analysis inthis paper is restricted to the Gaussian input distribution.The channel gain and noise power are normalized to beπΌ[β£ββ£2] = 1 and π0 = 1 respectively, so the average
received signal-to-noise ratio is SNR.= ππΌ
[β£ββ£2] /π0 = π .The random channel power π
.= β£ββ£2 β₯ 0 is assumed to be a
continuous variable with the cumulative distribution function(CDF) πΉ (π) and the probability density function (PDF) π(π).One example is the frequently-encountered Rayleigh fadingwith β βΌ ππ© (0, 1). We also assume that the transmitterhas a very large pool of information nats such that once thetransmission of the current πΎ nats ends, the transmitter startsto send the next πΎ nats immediately.
The quasi-static fading channel is a good model for usersthat are stationary, or moving slowly relative to the rateof communication. Due to the slowly varying nature of thechannel, channel estimation at the receiver can be performedwith high accuracy. Thus, it is reasonable to assume perfectCSI at the receiver. This assumption will be made throughoutthis paper.
The focus of this work is on high-rate, delay-insensitiveapplications such as data traffic in wireless LAN. For suchapplications the use of capacity-approaching channel codeswith long block length can be justified, and information-theoretic results are good approximations of real-world per-formance. This motivates us to take an information-theoreticview and consider the capacity related measures on the systemperformance in this paper.
III. SINGLE-LAYER AND MULTI-LAYER TRANSMISSIONS
A. Figure of Merit
Our main figure of merit is the long-term throughput. Letus use π to count the number of time slots, ππ for the channeluses of the π-th slot, π (π) =
βππ=1 ππ for the total number of
channel uses at the end of the π-th slot, and π(π) for the totalnumber of information nats successfully decoded up to slot π.Then the long-term throughput (in nats per channel use, npcu)is defined as [9]
π.= limπββ
π(π)
π (π)=
πΌ{π}πΌ{π― } (2)
where π is the number of successfully decoded informationnats at the end of each transmission, and π― is the number ofchannel uses.
Both SLT and MLT belong to the general category of fixed-length coding schemes, in which the code length is constantregardless of the fading state. For such fixed-length codes, the
2When discussing power allocation for RO-HARQ in Section IV-E, unequalpower allocation between the primary transmission and ARQ retransmissionsis allowed (still within one channel state).
SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3341
denominator of (2) is constant, and the long-term throughputdegenerates to the average rate:
οΏ½ΜοΏ½.= πΌπ [π (π)] (3)
where π is the channel state, and π (π) is the successfultransmission rate when channel state is π. See [18] for a formaldefinition of average rate. It should be noted that average rateis different from the ergodic capacity; here, coding acrossdifferent fading states is prohibited, c.f. channel model (1).
B. SLT and MLT schemes
Assuming that the transmitter does not know the channelstate π, a simple and well-adopted scheme for communicatingover a slow fading channel is to send the data at a fixed rateπ . If the instantaneous channel realization supports the rateπ , the receiver gets a successful transmission; otherwise itdeclares an outage.
The average rate of SLT can be calculated as [19, Equation(7)]
οΏ½ΜοΏ½ππΏπ = π β {log (1 + ππ ) β₯ π }= π
(1β πΉ
(ππ β 1
π
)). (4)
In order to maximize the average rate, one needs to solve thefollowing optimization problem:
maximize π (1β πΉ
(ππ β1π
))subject to π β₯ 0.
(5)
This optimization problem has been studied in [19, SectionIII], where the Karush-Kuhn-Tucker (KKT) condition is de-rived. Here, we mention that with Rayleigh fading the KKTcondition simplifies to
π ππ = π ββ π βππΏπ,π ππ¦ππππβ = β(π ) (6)
where β(β ) is the Lambert π function [20] which is definedas the solution to π¦ππ¦ = π₯, and
οΏ½ΜοΏ½βππΏπ,π ππ¦ππππβ = β(π ) exp
(βπβ(π ) β 1
π
). (7)
The problem with SLT is that it evaluates the transmissionin an βon-offβ fashion: the transmission is either entirely suc-cessful or totally failed. This scheme suffers from both over-utilizing and under-utilizing the channel since it uses a non-adaptive transmission strategy. The MLT scheme is developedto overcome the disadvantages of the SLT scheme withouttransmitterβs knowledge of CSI. The MLT scheme adopts themulti-user broadcast superposition code for a single-user slowfading channel. It is proposed in [4] for infinitely-many layers,and in [5] for finite layers. As an example, let us consider thecase with π = 2 layers. In this case, the transmitter splitsthe message π€ into two sub-messages π€1 and π€2. They areseparately encoded into {π₯1[π]} and {π₯2[π]} respectively,assuming Gaussian signaling, and finally superposed into thetransmit signal π₯[π] = π₯1[π] + π₯2[π], where {π₯1[π]} has apower πΌπ and rate π 1 npcu, {π₯2[π]} has a power (1βπΌ)πand rate π 2 npcu, and the scalar πΌ β [0, 1] determines thepower allocation between these two layers. At the receiver,the decoding is carried out based on the channel realization β.
The first step is trying to decode {π₯1[π]}, treating {π₯2[π]}as adding to the noise floor. If {π₯1[π]} is not successfullydecoded, the receiver will give up and declare an error.On the other hand, if {π₯1[π]} is successfully decoded, thedecoding procedure continues to the second step: subtractingthe successfully decoded {π₯1[π]} and then decoding {π₯2[π]}.After some manipulation [5], the two-layer MLT average ratemaximization problem reduces to:
maximize π 1 (1β πΉ (π 1)) +π 2 (1β πΉ (π 2))subject to π 1 β₯ 0
π 2 β₯ 0ππ 1+π 2βππ 2
ππ 1+π 2β1β€ πΌ β€ 1
(8)
where π 1.= ππ 1β1π (1β(1βπΌ)ππ 1 )
and π 2.= ππ 2β1π (1βπΌ) .
The average rate of MLT can be further boosted by increas-ing the number of layers in the superposition code, althoughsuch gain has been shown to be insignificant in Rayleighfading [5]. In [4], Shamai and Steiner derived the maximumaverage rate of a superposition code with infinitely-manylayers for the Rayleigh fading:
οΏ½ΜοΏ½βππΏπ,π ππ¦ππππβ = 2πΈπ(π0)β 2πΈπ(1)β
(πβπ0 β πβ1
)(9)
where π0 = 2/(1 +
β1 + 4π
)and πΈπ(π₯) =
β«βπ₯
πβπ‘
π‘ dπ‘, π₯ β₯0 is the exponential integral function.
One property of the layering strategy is that it creates selfinterference to the decoding. Specifically, when the receiverdecodes the π-th layer, it treats all the not-yet-decoded layersπ+ 1, β β β , π as interference and thus decreases the effectivereceive SNR for the π-th layer to
SNRπ =πππ
1 + πβππ=π+1 ππ
.
As the number of layers increases, this self interference affectsmore layers in the decoding; in fact, only the last decodedlayer is interference-free. This self interference has beenthe main concern of using broadcast superposition code ina single-user slow fading channel [21]. Thus, although theMLT scheme improves the performance over SLT, furtherperformance gain can be expected if we are able to eliminatesuch self interference. This motivated the development ofa Rate-Optimized Hybrid-ARQ (RO-HARQ) which will bedescribed next.
IV. RO-HARQ: THEORETICAL ANALYSIS
A. HARQ Schemes
In this section, we study several HARQ schemes withoptimally designed rate and power for each transmission. Itwill be shown that optimized HARQ can eliminate all of theaforementioned problems. With the help of ARQ, the decodingstatus at the receiver will be reported back to the transmitter,which indicates the successful decoding of received signalby acknowledgement (ACK) and failed decoding by negativeacknowledgement (NACK). It is assumed that the ARQ feed-back channel is delay and error free. The maximum numberof retransmissions of HARQ is denoted by π , i.e., the totaltransmissions (including the initial one) cannot exceed π +1.In the ARQ literature, π + 1 is also called the maximum
3342 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009
allowable ARQ rounds. The resulting protocol is sometimesreferred to as ARQ with a deadline. The choice of π reflectsthe worse-case delay caused by the ARQ retransmissions,and is generally determined by the system delay requirement.For example, choosing small π models certain delay criticalsituation. In order to be consistent with channel model (1), themaximum ARQ rounds should limit the overall code lengthto be within πΏ. Note that the HARQ protocol is typicallyavailable in almost all existing wireless systems, and thusexploiting HARQ will not need additional designs.
We consider the following three HARQ protocols3 in thispaper.
1) ALO. The transmitter encodes the πΎ information nats atrate π npcu, and then keeps sending the same encodedpacket in every retransmission. The receiver only decodesthe most recently received packet. This loop continuesuntil the ACK is declared by the receiver, or the maxi-mum retransmissions are used.
2) RTD. The transmitter is the same as ALO. The receiverperforms a maximum ratio combining (MRC) of all thereceived packets. In the HARQ literature, this scheme isalso referred as Chase Combining [23].
3) INR. The transmitter encodes the πΎ information nats intoa codeword of length π (π+1). Then it serially puncturesthe length-πΏ codeword into π + 1 sub-codewords withstrictly decreasing rates πΎ/π (1) > πΎ/π (2) > β β β >πΎ/π (π+1). The lengths of sub-codewords are the designparameters, which are determined by the code rate opti-mization and will be addressed in Section IV-C. At the π-th transmission (π = 1, β β β , π ), the transmitter reducesthe total rate to4 π 1 + π 2 + β β β + π π+2βπ = πΎ/π (π)
by sending additional redundancy symbols. The receivertries to decode based on all the packets it receives up untilthis moment. At the last retransmission (round π + 1),rate π 1 is tried without any ARQ feedback. If decodingis still unsuccessful, a decoding failure is declared.
This works focuses on INR due to the following tworeasons.
1) ALO scheme does not work for slow fading channels.The reason is that there is no time diversity to exploit,as the channel gain remains constant over ARQ retrans-missions. Thus to keep sending the same packet and onlydecoding the most recently received one will not increasethe chance of successful decoding.
2) RTD has an average rate performance that is inferior toINR. This will be shown both analytically and numeri-cally. Note that although RTD has inferior performance, ithas some practical advantages which make it attractive insome applications. For example, the retransmitted packetin RTD is always the same, which is easier to implementthan INR, in which additional parity symbols have to begenerated and transmitted. As another example, having
3The acronyms are borrowed from [9], [22], although the schemes andapplications may be different. For example, [9], [22] studied a slottedsystem with equal length for each retransmission. They considered a multi-user TDMA system, where it is reasonable to assume each retransmissionexperiences a different fading state. Both are different from our assumptions.
4The reason of using this seemingly redundant expression for the data rateis that it simplifies equations (11), (12), (13), and the proof of Lemma 2.
the same size for the retransmitted packets in RTD makesthe packetization simple, while in INR the length ofthe retransmitted parity symbols may vary from oneretransmission to another.
The novelty of the proposed RO-HARQ scheme is theoptimal rate assignment5 of each transmission. Traditionally,HARQ is used in a passive manner in a wireless system. Thepurpose of HARQ is to indicate the decoding status to thetransmitter, such that the transmitter can protect the βbadβpacket by retransmitting. The existence of HARQ is typicallynot exploited by the transmitter; it is treated only as a binaryindicator of the decoding status, and other modules in thesystem (e.g., channel coding, modulation, etc.) are operatingas if HARQ were not available. Due to this reason, the rate ofeach HARQ transmission and the (equal) power allocation isusually predetermined. This work proposes the different viewthat HARQ can be exploited by the transmitter to providebetter average rate and outage performance, i.e., it can beutilized in an active manner. This new view suggests thatthe rate and power associated with each HARQ transmissioncan be optimized according to the channel fading statistics toachieve better performance. The main idea is that since thetransmitter is aware of the HARQ link in the system, it cantransmit very aggressively (at very high rate) even if it doesnot know the random channel state π before transmitting. Thenif the channel is not good, this high-rate transmission willfail, and the HARQ can save it by indicating this failure tothe transmitter so that it can adjust to a lower rate. On theother hand, if it is βluckyβ that the channel is strong enough,such βgamblingβ will bring high return: the strong channelrealization is (almost) fully utilized, i.e., there is (almost) nowaste of the good channel. This is the basic idea behind RO-HARQ.
It is now clear why RO-HARQ can eliminate the problemsof SLT and MLT. By transmitting aggressively, the βgoodβchannel realizations are fully used; with the ARQ feedback,the transmission rate can be reduced for the βbadβ channelrealizations. At the same time, as there is no superpositionor layering in the channel coding, self interference does notexist.
B. Average Rate for HARQ
The general expression (2) characterizes the long-termthroughput for both fixed and variable length coding schemes.How to evaluate and optimize the throughput performanceof variable-length coding schemes depends on the specificapplications and the assumptions on how to use the channel.If we assume that the channel remains constant during thetransmission of πΎ information nats, and changes indepen-dently when the transmission of current packet6 is doneand a new packet is ready to transmit, we need to evaluateboth the average number of successfully decoded informationnats (numerator of (2)) and the average delay (denominator
5Another novelty is the optimal power allocation among transmissionsto further improve the average rate performance. This is discussed in Sec-tion IV-E.
6The transmission of one packet includes several possible retransmissions,according to the HARQ protocol. Different packets contain different infor-mation.
SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3343
of (2)) with respect to the fading distribution. Throughputanalysis based on such assumptions is given in Appendix Aand numerical performance is reported in Section V. Thisassumption can be valid for burst communication, where thereis a long idle period in between transmissions of differentpackets such that the channel gain changes independently fromone transmission to another.
A different scenario is that the transmitter has a verylarge pool of information nats such that the communicationis βcontinuousβ. For this scenario, multiple packets (witheach packet having multiple ARQ rounds) will be transmittedwithin one coherent period. When the channel is βgoodβ, morepackets can be transmitted within one coherent period; whenthe channel is βbadβ, only a small number of packets canbe transmitted within the same coherent period. As a result,each fading state has the same length of channel uses, and theempirical channel distribution will match the true one. This isthe key difference from the burst communication scenario. Thelong-term throughput (2) for this scenario can be computedas follows. We use π (π) to denote the instantaneous rate ofthe HARQ scheme for a given channel realization π. Thetotal number of information nats that can be successfullytransmitted is π(π) = πππ (π) over the channel state π. Hencewe have
π =πΌ{π}πΌ{π― } =
πΌ{πππ (π))}ππ
= πΌ{π (π)} = οΏ½ΜοΏ½ (10)
which again degenerates to the definition of average rate. Thusin the following we shall focus on optimizing the average rateperformance of different HARQ schemes.
C. Average Rate Maximization of INR and RTD
The INR scheme is described in Section IV-A. With INR,the random variable π (π) is
π (π) =
β§β¨β©
βπ+1π=1 π π, if log (1 + ππ ) β₯βπ+1
π=1 π πβππ=1 π π, if
βπ+1π=1 π π > log (1 + ππ ) β₯βππ=1π π, βπ = 1, β β β , π
0, if π 1 > log (1 + ππ ).(11)
The average rate of INR can thus be computed as
οΏ½ΜοΏ½ππΌππ =
π+1βπ=1
π πβ
{log (1 + ππ ) β₯
πβπ=1
π π
}(12)
=π+1βπ=1
π π (1β πΉ (ππ)) (13)
where ππ.= π
βππ=1 π πβ1π .
The problem is to choose the nonnegative design parameters{π 1, β β β , π π+1} to optimize the average rate (13). Clearly,the non-negativity constraints on {π π} can be dropped, andthe INR average rate maximization problem becomes
maximizeβπ+1π=1 π π (1β πΉ (ππ)) . (14)
A direct evaluation of the KKT condition gives
1βπΉ (ππ)βπ ππβπ
π=1 π π
ππ (ππ)β
π+1βπ=π+1
π ππβπ
π=1 π π
ππ (ππ) = 0
(15)
for π = 1, β β β , π, and
1β πΉ (ππ+1)β π π+1πβπ+1
π=1 π π
ππ (ππ+1) = 0. (16)
In some simple cases (e.g., small π ), equations (15) and (16)can be numerically solved to give the optimal rate assignment.
The average rate of RTD can be similarly computed. Inthe π-th transmission of RTD, π = 1, β β β , π + 1, thereceiver performs maximum ratio combining of the π receivedpackets. This processing effectively increases the receive SNRto SNRπ = ππ , while reducing the data rate to π /π. Thusthe random variable π (π) is
π (π) =
β§β¨β©
π π , if
log (1 + πππ ) β₯ π >log (1 + (πβ 1)ππ ), βπ = 1, β β β , π + 1
0, if π > log (1 + (π + 1)ππ ).(17)
The average rate can be computed as
οΏ½ΜοΏ½ππ ππ· =
π+1βπ=1
π
πβ {ππβ1 > π β₯ ππ} (18)
=
π+1βπ=1
π
π(πΉ (ππβ1)β πΉ (ππ)) (19)
where ππ.= ππ β1
ππ for π = 1, β β β , π + 1, and π0.= β.
Similarly, the RTD average rate maximization problem can beformulated as
maximizeβπ+1π=1
π π (πΉ (ππβ1)β πΉ (ππ)) . (20)
Numerical evaluation and comparison of the optimal averagerate of (14) and (20) are given in Section V, where we willsee that RTD has a worse average rate performance than INR.This can also be analytically proved as follows.
Lemma 1: Denote the optimal solution to problem (14)as ({π β
1, β β β , π βπ+1}, οΏ½ΜοΏ½π
βπΌππ ), and the optimal solution to
problem (20) as (π β, οΏ½ΜοΏ½πβ
π ππ·). Then
οΏ½ΜοΏ½πβ
π ππ· β€ οΏ½ΜοΏ½πβ
πΌππ (21)
Proof: See Appendix B.In order to fully enjoy the benefits of RO-HARQ, the
corresponding channel code design for INR has to satisfy thefollowing two requirements.
1) A good mother code that can be serially punctured intoseveral different optimized code rates.
2) A single decoder that can handle all the rates such thatthe coding performance of each sub-code is capacity-approaching.
There are several existing code designs, developed in bothacademia (e.g., [10], [14], [15], [17], [24]) and industry (e.g.,[25]) that satisfy these requirements.
D. Asymptotic Average Rate of INR
It is of interest to ask what is the asymptotic performancelimit of the INR scheme (by allowing π to go to infinity,i.e., infinitely-many retransmissions). Such asymptotic perfor-mance gives the ultimate limit of the proposed scheme. Itis also a good performance indication for delay insensitiveapplications. The following lemma states that the average rate
3344 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009
of INR asymptotically converges to the ergodic capacity, eventhough coding across fading blocks is prohibited in INR.
Lemma 2: As π β β, the average rate of INR withequal power allocation among transmissions converges to theergodic capacity of the channel.
Proof: See Appendix C.There are two interesting observations of Lemma 2. First,
Lemma 2 applies to any fading distribution. This can alsobe seen from the proof. Secondly, in order to achieve thisasymptotic limit, there is no requirement of optimal rateallocation. This observation suggests that as π becomes large,the gain of optimal rate design diminishes.
In a slow fading environment, if one is not allowed toperform coding over different fading states, it is well acceptedthat the outage performance is a good measure. In INR, thecoding is still within one fading state. However, with the useof ARQ feedback, the performance measure switches fromoutage capacity to ergodic capacity, as the amount of ARQretransmissions increases to infinity. This result, however,is not surprising in the following two aspects. First, withinfinitely-many retransmissions allowed, in each fading blockthe transmission rate is gradually reduced (by NACK fromthe receiver) until it arrives at the exact rate that the currentchannel gain π can support: log (1 + ππ ). That is, eventuallythe transmission happens with a rate that perfectly matches tothe instantaneous channel realization. Thus all the informationnats can theoretically be decoded without any error, andthe average rate is the ergodic capacity. Secondly, allowingARQ feedback effectively informs the transmitter not onlythe decoding status, but also the partial CSI. As the numberof ARQ retransmissions goes to infinity, the informed CSIbecomes perfect eventually.
From the information-theoretic viewpoint, a receiver decod-ing failure is equivalent to a channel outage with capacity-achieving codes. Thus, an ACK/NACK feedback essentiallyinforms the transmitter a channel quantization, i.e., whetherthe instantaneous channel gain is greater than a given thresh-old. With multiple ARQ rounds, this process becomes a se-quential feedback scheme, where the entire channel state spaceis sequentially quantized with more and more ACK/NACKfeedback, and thus the transmitter gets a finer and finer knowl-edge of the channel. Another widely used limited feedbackscheme is to perform a global quantization of the channelstate space, and then indicate to the transmitter the intervalindex in which the channel realization falls. Such a schemehas been well studied in [19], [26]β[28]. We will call thisscheme one-shot feedback and a detailed comparison with theARQ sequential feedback is made in Section VI.
E. Power Allocation for INR
In the previous sections, the general idea of RO-HARQis discussed and the average rate performance is optimized,under the assumption that the transmit power is constantthroughout the entire ARQ process. Intuitively, allowing powerallocation among the π + 1 transmissions could furtherimprove the average rate. For example, boosting the powerof primary transmission will increase the probability that theprimary transmission succeeds. However this comes at the
PowerPower PowerTransmission 1 Transmission 2 ...... Transmission N+1
TN +1
P1 PN +1
T1 T2
P2
Fig. 1. Illustration of unequal power allocation for INR with π retransmis-sions.
price of decreasing the power of ARQ retransmissions andhence the probability for success, if the average transmit poweris kept constant. There is obviously a tradeoff in allocatingpower among these transmissions, and this idea is pursued inthe following. Figure 1 gives a graphical illustration of thisprocess. We shall first formulate the problem for general π ,and then focus on the simplest case of π = 1 both analyticallyand numerically.
Consider the INR scheme with πΎ information nats anda maximum π + 1 transmissions. We assume that the π-thtransmission takes place with power ππ, and use ππ to denotethe event of a successful decoding at the end of transmissionπ. We will approach this problem in two steps. First we derivethe average rate for a fixed power allocation, and then showhow to maintain a constant average power.
1) Average rate for a given power allocation(π1, β β β , ππ+1): Similar to (11), the rate of INR withgiven power allocation policy (π1, β β β , ππ+1) is a randomvariable:
π (π) =
β§β¨β©βπ+2βππ=1 π π, if
π1, β β β ,ππβ1,ππ;βπ = 1, β β β , π + 1
0, if π1, β β β ,ππ+1
(22)
where we define π0 as the empty set.In order to evaluate β {ππ}, we need to derive the achiev-
able rate of sending a long Gaussian code where differentportions of the code have different power. This is not obviousbut with a random coding and typical set decoding argument[29], this achievable rate can be shown to be a TDMA-typeone, and thus
β {ππ}= β
{βππ=1
ππβππ=1 ππ
log (1 + πππ) β₯βπ+2βππ=1 π π
}= β
{βππ=1
1βπ+2βππ=1
π πβππ=1
1βπ+2βππ=1
π π
log (1 + πππ) β₯βπ+2βππ=1 π π
}.
(23)Finally the average rate is given by
οΏ½ΜοΏ½ππΌππ =π+1βπ=1
(π+2βπβπ=1
π π
)β{π1, β β β ,ππβ1,ππ
}. (24)
2) Average power constraint: The power allocation(π1, β β β , ππ+1) is a design parameter. In order to make afair comparison to the scheme discussed in Section IV-C witha constant power π , it is reasonable to put a constraint on(π1, β β β , ππ+1) such that the average power does not exceedπ . The interesting observation of the HARQ scheme is thatwith unequal power allocation the actual power consumed inthe entire transmission is a random variable. The reason isthat the π-th transmission takes place only if transmissions1, β β β , πβ 1 all fail, which is a random event determined bythe random channel gain. Denote the actual consumed poweras π½(π), then this discrete random variable is given by
SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3345
π½(π) =
β§β¨β©
βππ=1 ππππβππ=1 ππ
, if π1, β β β ,ππβ1,ππ; βπ = 1, β β β , πβπ+1
π=1 ππππβπ+1π=1 ππ
, if π1, β β β ,ππ .
(25)Thus the average power constraint that (π1, β β β , ππ+1)should satisfy isβπ
π=1
βππ=1 ππππβππ=1 ππ
β{π1, β β β ,ππβ1,ππ
}+
βπ+1π=1 ππππβπ+1π=1 ππ
β{π1, β β β ,ππ
} β€ π.(26)
Notice that in the case of constant power allocation ππ =π, βπ, the randomness of π½(π) disappears: π½(π) = π withprobability 1.
3) Optimal power allocation: Finally, the average ratemaximization problem under optimal power allocation can beformulated as
maximizeβπ+1π=1
(βπ+2βππ=1 π π
)β{π1, β β β ,ππβ1,ππ
}subject to
βππ=1
βππ=1 ππππβππ=1 ππ
β{π1, β β β ,ππβ1,ππ
}+
βπ+1π=1 ππππβπ+1π=1 ππ
β{π1, β β β ,ππ
} β€ π.
(27)This optimization problem is difficult to solve for general π .For simplicity let us consider the simplest case of π = 1. Inthis case Equation (24) becomes
οΏ½ΜοΏ½1πΌππ = (π 1 +π 2)β {log (1 + ππ1) β₯ π 1 +π 2}
+π 1β
{log (1 + ππ1) < π 1 +π 2,
π 1π 1+π 2
log (1 + ππ1) +π 2
π 1+π 2log (1 + ππ2) β₯ π 1
}(28)
and the average power constraint (26) is
π β₯ π1β {log (1 + ππ1) β₯ π 1 +π 2}+(
π 1
π 1+π 2π1 +
π 2
π 1+π 2π2
)β {log (1 + ππ1) < π 1 +π 2} .
(29)Numerical results for π = 1 is reported in Section V.
Optimal average rate of problem (27) is better than thatof problem (14): choosing ππ = π, βπ makes (24) equal to(13). It is then interesting to ask how big the power allocationgain is. Numerical examples in Section V shows that forthe Rayleigh fading channel, this gain is remarkable in thelow SNR regime, while it is negligible for medium to highSNR. This is a reasonable result for most of the known powerallocation schemes, e.g., water-filling.
V. RO-HARQ: NUMERICAL RESULTS
The ARQ feedback link allows the transmitter to partiallyadapt to the channel conditions. Due to the lack of full channelstate information at the transmitter, such adaptation is notguaranteed to support reliable transmission all the time. Thusthe instantaneous successful transmission rate is a randomvariable, whose distribution is determined by the randomchannel fading and the rate assignment. As a performancemetric, the average rate characterizes the mean value of thisrandom variable. On the other hand, it is arguable that theoutage probability (πππ’π‘) serves as a worst-case performancemeasure for the RO-HARQ scheme, as it describes the proba-bility that the transmission fails after the maximum number ofARQ retransmissions are used. Thus, although the analytical
discussion of this paper is focused on the average rate maxi-mization, numerical results for different performance measuresare reported in this section. To be specific, three performancemetrics, average rate [3]β[5], [18], outage probability [1],[30], and average rate versus outage probability [4], are usedto numerically compare the proposed RO-HARQ with SLTand MLT. Discussion of the comparison to another quantizedfeedback scheme is deferred to Section VI.
A. Average Rate
Numerical optimizations are performed to maximize theaverage rate of the schemes analyzed in Section III and IV. Inthe case of slow Rayleigh fading channel with β βΌ ππ© (0, 1),Figure 2 reports the average rate comparison among SLT,MLT, INR and RTD, together with the optimal INR throughputfor bursty communications derived in Appendix A. Ergodiccapacity is also shown as the upper limit. It is clear thatallowing INR ARQ feedback increases the average rate sub-stantially. For example, even INR with π = 1 outperforms theinfinite-layer MLT by half a bit per channel use over a widerange of SNRs (15 to 35 dB), and is 1.5 bits better than theSLT scheme with optimized rate. Notice that this average rateadvantage of INR over MLT does not come with much highercomplexity. MLT requires complicated encoding and decodingprocesses to handle the multiple layers, and this complexityincreases with the number of layers. Although ARQ requiresfeedback and some overhead in the protocol design, typicallyits complexity is not as high as MLT. The average rate isboosted by another 0.5 bits if π increases to 2. As π furtherincreases, the average rate continues to increase up until theergodic capacity. At the same time, RTD type HARQ is shownto be inefficient in terms of the average rate performance. Itis better than SLT7, but in some configurations is even worsethan MLT. This also numerically confirms Lemma 1. For thissuboptimality, RTD is not considered in the remaining of thenumerical simulations. Another observation from Figure 2is a comparison of the INR throughput based on the twodifferent assumptions (burst communication vs continuouscommunication) in Section IV-B. It can be seen that theaverage rate is always better than the optimal throughput inAppendix A. This is due to the fact (see Section IV-B) thatmore packets are transmitted in βgoodβ channels in continuouscommunication than in burst communication.
Figure 3 compares the optimized average rate performanceof SLT, MLT and INR in a slow Ricean fading channel. Twodifferent πΎ factors are considered: Figure 3(a) for πΎ = 5and Figure 3(b) for πΎ = 10. Ergodic capacity is plotted asthe performance upper bound. As opposed to the Rayleighfading case, Ricean fading channel has a high-power line-of-sight (LOS) path and thus is βless randomβ. Numerical resultsshow that in the Ricean fading environment, MLT with 2 levelshas almost negligible gain over SLT, while INR still performsmuch better than both SLT and MLT. Combined with its highcomplexity, this result indicates the inefficiency of MLT inless random channel environment such as Ricean distributions.However, INR continues to perform very well even with π =
7Analytically, this can also be easily proved by looking at Equation (18).When π = 1 the component has the same form as SLT.
3346 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009
5 10 15 20 25 30 350
1
2
3
4
5
6
7
8
9
10
11
SNR [dB]
Ave
rage
Rat
e [b
pcu]
SLTMLT β 2 levelsMLT β inf levelsINR β N=1INR β N=3INR β N=7INR β N=1 (Appendix A)INR β N=3 (Appendix A)RTD β N=1RTD β N=3ergodic capacity
Fig. 2. Average rate (bpcu) versus receive SNR (dB) of SLT, MLT, INR, andRTD in a quasi-static Rayleigh fading channel. Equal power allocation amongtransmissions is performed. Optimal throughput derived in Appendix A is alsoplotted.
1, so it is more robust to the channel fading distribution thanMLT.
The transmission strategies studied in Section III and IVare for single-antenna systems. These strategies can be read-ily extended to single-input multiple-output (SIMO) fadingchannels. Consider a SIMO Rayleigh fading channel with πΏπreceive antennas. It is shown in [31], [32] that the PDF of thetotal channel gain is
ππΏπ (π) =1
Ξ (πΏπ)ππΏπβ1πβπ, π β₯ 0 (30)
where Ξ (πΏπ) =β«β0 π‘πΏπβ1πβπ‘dπ‘ is the Gamma function.
Figure 4 shows the average rate performance of SLT, MLT,INR, and ergodic capacity of a SIMO Rayleigh fading channelwith πΏπ = 2 and πΏπ = 4. Several interesting observationscan be made from these plots. First, similar to the previouscase, MLT with two levels has negligible gain over SLT: theperformance difference is almost indistinguishable. This againseems to suggest that the gain of MLT with two levels is notimportant in the βless randomβ fading distributions, especiallyconsidering that MLT is much more complicated than SLT.Secondly, the INR scheme performs extraordinary well: thegap between π = 1 INR with the ergodic capacity is only0.6 bits and 0.7 bits at the medium to high SNR regime withπΏπ = 2 and πΏπ = 4, respectively. This complies with ourprevious observation that INR is robust to the channel fadingdistribution.
Numerical results for optimal power allocation of INRwith π = 1 and the comparison to equal power allocationare reported in Figure 5(a) (medium-to-high SNR regime)and Figure 5(b) (low SNR regime) for the Rayleigh fadingdistribution. The advantage of optimal power allocation ismainly reflected in the low SNR regime. This gain diminishesas SNR increases, and becomes negligible in the medium-to-high SNR regime.
B. Outage Probability
The outage probability comparison is reported in Figure 6for both SLT and INR in a Rayleigh fading channel. Itshould be noted that the outage probability does not applyto MLT, where the requirement that all transmitted data mustbe decoded is dropped, and thus the concept of βoutageβ doesnot hold anymore. Both SLT and INR are still optimized interms of the average rate. Thus, the outage event of SLTis β
{log (1 + ππ ) < π β
ππΏπ,π ππ¦ππππβ
}where π β
ππΏπ,π ππ¦ππππβ
is given in (6), and the outage event of INR with π isβ {log (1 + ππ ) < π β
1} where π β1 is the solution of π 1 in
problem (14). The advantage of INR is now more obvious:not only does it increase the average rate, it also decreases theoutage probability simultaneously. In fact, with the argumentmade in Section IV-D, the asymptotic outage probability willbe zero, as every transmission will eventually match perfectlywith the instantaneous channel condition.
C. Average Rate versus Outage Probability
When transmitting over a slow fading channel, the success-fully transmitted data rate is a random variable. Its instanta-neous value depends on both the instantaneous channel gainand the communication scheme (e.g., SLT, MLT, or HARQ).Roughly speaking, the average rate describes the βmean valueβof the random performance, while the outage probabilitycharacterizes the βvarianceβ in the sense that it gives theworst-case performance. Thus, to have a comprehensive viewand comparison of several schemes, both average rate andoutage probability should be jointly considered. Average rateversus outage probability was proposed in [4] as a meaningfulassociation between average rate and outage probability. Thismetric requires examining the average rate when the channelgain π is known to exceed some threshold ππ‘β. This can beviewed as a conditional average rate where the distributionof channel gain πΉ (π) is replaced with the conditional CDFπΉππ‘β (π’)
.= β {π β€ π’β£π β₯ ππ‘β}. Such a conditional average rate
examines the average rate with a given outage probability, andthus effectively relates these two metrics. Figure 7 reports theaverage rate performance where the threshold ππ‘β is chosensuch that the outage probability is 1% and 30%, respectively.The use of INR still provides remarkable gain with respect tothis metric.
VI. HOW TO USE THE FEEDBACK CHANNEL: SEQUENTIAL
VERSUS ONE-SHOT
From the information-theoretic point of view, the proposedRO-HARQ is one form of utilizing the feedback link in awireless communication system. It falls into the general cate-gory of quasi-static fading channel with quantized feedback.The capacity (under different definitions, e.g., outage capacity,expected capacity, etc.) of this channel is still unknown ingeneral. Thus it is difficult to quantify how well RO-HARQperforms in the absolute sense. A reasonable approach wouldbe to compare RO-HARQ with other schemes that utilizethe quantized feedback to improve performance, which is thepurpose of this section.
SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3347
5 10 15 20 25 30 350
2
4
6
8
10
12
SNR [dB]
Ave
rage
Rat
e [b
pcu]
SLTMLT β 2 levelsINR β N=1INR β N=2INR β N=4ergodic capacity
(a) K=5
5 10 15 20 25 30 350
2
4
6
8
10
12
SNR [dB]
Ave
rage
Rat
e [b
pcu]
SLTMLT β 2 levelsINR β N=1INR β N=2INR β N=4ergodic capacity
(b) K=10
Fig. 3. Average rate (bpcu) versus receive SNR (dB) of SLT, MLT, and INR in a quasi-static Ricean fading channel with different K factors and equalpower allocation among transmissions.
5 10 15 20 25 30 350
2
4
6
8
10
12
SNR [dB]
Ave
rage
Rat
e [b
pcu]
SLTMLT β 2 levelsINR β N=1INR β N=2ergodic capacity
(a) 2 Receive Antennas
5 10 15 20 25 30 352
4
6
8
10
12
14
SNR [dB]
Ave
rage
Rat
e [b
pcu]
SLTMLT β 2 levelsINR β N=1INR β N=2ergodic capacity
(b) 4 Receive Antennas
Fig. 4. Average rate (bpcu) versus receive SNR (dB) of SLT, MLT, and INR in a quasi-static SIMO Rayleigh fading channel, πΏπ = 2 and πΏπ = 4. Equalpower allocation among transmissions is performed.
One well-known approach is the quantized CSI feedbackscheme [19], [26], [27]: the receiver sends an π -bit quan-tization of the channel state information to the transmitterbefore the transmission takes place, and the transmitter adjustsits rate and power according to this imperfect CSIT. To bespecific, the set of all possible channel gain πΊ = [0,β) is
divided into 2π nonoverlapping subsets πΊ =βͺ2π
π=1πΊπ, whereπΊπ = [ππβ1, ππ), π0 = 0, π2π = β. If the instantaneouschannel gain π β πΊπ, the receiver sends the index π tothe transmitter using the π -bit feedback channel, and thetransmitter selects a codeword with rate π π and power ππ. Dueto the imperfect CSI feedback, the transmission could fail ifthe data rate π π is not supported by the instantaneous channelrealization. The average rate performance can be optimized[19], [26] by adjusting {πΊπ, π π, ππ}2
π
π=1.
Compared with this one-shot feedback scheme, RO-HARQis in fact a sequential feedback scheme. It is then natural to
ask the following question: since sequential and one-shot aredifferent forms of utilizing feedback to inform quantized CSIto the transmitter, what are the advantages/disadvantages ofthem, and which one is preferred? A few perspectives arediscussed in this section.
A. System implementation
ARQ is a technique in the data link layer, which is im-plemented in many wireless protocols. Thus exploiting ARQfeedback does not require additional system implementations,other than that the physical layer channel coding shouldbe able to provide incremental redundancy, which has beenwell studied [10], [13]β[17], [25]. One-shot CSI feedback,on the other hand, belongs to the physical layer techniques.To exploit CSIT, typically the physical layer needs someadditional designs, especially at the transmitter, to adjustthe transmission based on the one-shot CSI feedback. For
3348 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009
5 6 7 8 9 10 11 12 13 141
1.2
1.4
1.6
1.8
2
2.2
2.4
2.6
2.8
3
SNR [dB]
thro
ughp
ut (
bits
per
cha
nnel
use
)Equal power allocationOptimal power allocation
(a) Medium-to-high SNR regime
β25 β20 β15 β10 β5 010
β3
10β2
10β1
100
SNR [dB]
thro
ughp
ut (
bits
per
cha
nnel
use
)
Equal power allocationOptimal power allocation
(b) Low SNR regime
Fig. 5. Average rate (bpcu) versus receive SNR (dB) of π = 1 INR with optimal power allocation in a quasi-static Rayleigh fading channel.
5 10 15 20 25 30 3510
β2
10β1
100
SNR [dB]
Pou
t
SLTINR β N=1INR β N=2INR β N=4
Fig. 6. Outage probability versus receive SNR (dB) of INR (π =1, 2,and 4) compared to SLT in a quasi-static Rayleigh fading channel. The twoschemes are optimized in terms of the average rate.
example, adaptive coding and modulation requires the data rateand transmit power to be frequently adjusted according to thechannel variation. As another example, matrix precoder designis needed to rotate the transmit signal in case of multipletransmit antennas [33]. This increases the system complexityand cost. From this perspective, utilizing ARQ seems to bemore favorable, since ARQ is already provided by the wirelessprotocol in the system and no additional closed-loop designis needed.
B. Position of feedback
In the RO-HARQ scheme, the ACK/NACK feedbacks arescattered into the entire transmission. There are several hand-shakings between the transmitter and receiver. Each suchcoordination adds additional overhead. The typical form ofthe one-shot CSI feedback, however, is to send the entireCSI bits before the data transmission takes place, such that
5 10 15 20 25 30 350
5
10
15
SNR [dB]
Ave
rage
Rat
e [b
pcu]
SLT, 1%MLT β 2 levels, 1%INR β N=1, 1%INR β N=2, 1%INR β N=4, 1%SLT, 30%MLT β 2 levels, 30%INR β N=1, 30%INR β N=2, 30%INR β N=4, 30%
Fig. 7. Average rate versus outage probability for SLT, MLT and INR ina quasi-static Rayleigh fading channel. Two thresholds corresponding to 1%and 30% outage probability, respectively, are simulated.
the transmitter adjusts the parameters to match the partially-known channel. From the practical point of view, the latteris more favorable, since all the feedback bits are sent in oneshot, which simplifies the protocol and reduces the processingdelay, system overhead, and the implementation complexity.
C. Amount of feedback and performance
To make a fair comparison of the amount of feedbackrequired for INR and one-shot CSI feedback, it is assumedthat both schemes are optimized for maximum average rates,and the maximum average rates are set to be equal. This isreasonable since the resulting comparison indicates that inorder to achieve the same optimal performance, how muchfeedback is needed for each scheme. The following lemmagives the relationship between π and π under this condition.
Lemma 3: For any given fading distribution, if the optimalaverage rates of
SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3349
(1) INR scheme with maximum π retransmissions,(2) one-shot CSI feedback scheme with an π -bit feedback
channel
are the same, i.e., οΏ½ΜοΏ½πβ
πΌππ = οΏ½ΜοΏ½πβ
πΆππΌ , then
π + 1 = 2π . (31)
Proof: See Appendix D.One direct consequence of Lemma 3 is that we can easily
make a numerical comparison of the average rate perfor-mance between the RO-HARQ scheme and the one-shot CSIfeedback scheme studied in [19], [26], [27]. For example, inFigure 2 we have reported the optimal INR performance withπ = 1, 3 and 7 in a Rayleigh fading channel. These directlycorrespond to the optimal performance of the one-shot CSIfeedback scheme with π = 1, 2 and 3 bits of feedback,respectively.
Another important benefit from Lemma 3 is that since theaverage rate optimization problems for both one-shot CSIfeedback and RO-HARQ can be directly related, existingnumerical solutions for the previous problem, e.g., [26], canbe used to help solve Problem (14) as long as the parametersare chosen according to (31). One should note that solutionsin [26] are efficient but sub-optimal. Also they cannot helpwith the dynamic power allocation problem (27).
From Lemma 3, it seems that INR requires more feedbackthan the one-shot CSI feedback scheme: with condition (31)satisfied, π = π if and only if π = π = 1, and π > πstrictly holds in all other cases. However, π > π does notnecessarily translate to the conclusion that INR requires morefeedback. First of all, the actual number of retransmissionsassociated with INR is a random variable, which is deter-mined by the instantaneous channel realization and the rateassignment. The numberπ only denotes the maximum numberof retransmissions, which is a worst-case constraint. On theother hand, π is a fixed number in the one-shot CSI feedbackscheme. It is not affected by the channel realization or the rateassignment. Secondly, π retransmissions is not necessarilyequivalent to π bits feedback, as the number of physicalbits required for π retransmissions is determined by manyconsiderations. Wicker [7, Chapter 15.2] discussed severalpractical issues that might require different number of bitsfor ACK/NACK. For example, NACK may be set as a defaultand only ACK is transmitted in the feedback channel. Thiscertainly reduces the feedback of INR. Another example isto use the recently developed rateless codes in HARQ [34],such as LT-codes [35] and Raptor codes [36]. With ratelesscodes, the receiver does not need to send any feedback tothe transmitter until it accumulates enough data for successfuldecoding [37]. In this case the actual amount of feedback isvery small.8
D. Asymptotic performance
It is seen in Lemma 2 that the ergodic capacity is achievedwith π β β. On the other hand, it is well known that if theone-shot CSI feedback is perfect (which requires π β β),
8The sequential feedback scheme fits nicely with the use of rateless code,while the one-shot feedback does not have this advantage.
ergodic capacity can also be achieved9. Thus, moving theinfinitely many bits of feedback from the beginning of trans-mission as one-shot CSI feedback to within the transmissionas ARQ feedback gives the same asymptotic performance.
VII. CONCLUSIONS
HARQ with optimized rate assignment is shown to sig-nificantly improve the performance over the single-layer andmulti-layer transmission schemes under several different per-formance metrics. The key idea behind this work is to exploitthe existence of HARQ protocol to adapt the transmission tothe instantaneous channel realization. Average rate is chosenas the performance metric, and the optimal rate assignment isstudied. Several aspects of the RO-HARQ scheme are studied,including power allocation, asymptotic performance limit, andcomparison to one-shot quantized CSI feedback. Simulationresults show that even one HARQ retransmission gives aremarkable gain over conventional schemes, and this gain isrobust in different fading distributions.
Possible future work includes the combination of HARQfeedback with the multi-layer scheme. With such combinationthe ARQ feedback will indicate which layers cannot bedecoded, and hence the transmitter can resend those layerstogether with possibly new information in the next round.Another direction is to study the combination of sequentialand one-shot feedback, in which the receiver will send backa coarse CSI before the data transmission with one-shotfeedback, and use ARQ feedback to refine the quantized CSIin the sequel.
APPENDIX ATHROUGHPUT ANALYSIS FOR BURSTY COMMUNICATIONS
A. INR
For each possible fading state π, we assume that πΎ in-formation nats are to be sent with a maximum of π re-transmissions, with the length of the π-th transmission ππ,βπ = 1, β β β , π + 1. The rate after transmission π is thenπ (π) = πΎ/π (π). We need to separately evaluate the numeratorand denominator of the throughput expression (2) with respectto the fading distribution. Notice that the overall code lengthπ (π+1) is a function of the fading state.
We begin with the average number of successfully trans-mitted information nats:
πΌ{π} = πΎβ
{log (1 + ππ ) β₯ π (π+1)
}. (32)
To evaluate the average length of transmission for πΎ infor-mation nats, we first define π (0) = 0 and thus π (0) = β.
9Note that there is a constraint of fixed transmit power; thus waterfillingpower allocation over different fading states is not permitted.
3350 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009
Then
πΌ{π― } = π (1)β
{log (1 + ππ ) β₯ π (1)
}+ π (2)
β{π (1) > log (1 + ππ ) β₯ π (2)
}+ β β β + π (π+1)
β{π (π) > log (1 + ππ )
}=
πβπ=1
π (π)β
{π (πβ1) > log (1 + ππ ) β₯ π (π)
}
+π (π+1)β
{π (π) > log (1 + ππ )
}. (33)
The throughput of INR can now be written as Equation (34) atthe top of the next page, and the INR throughput optimizationproblem is formulated as
maximize 1βπΉπΊ(ππ+1)βππ=1
(1
π (π+1)β 1
π (π)
)πΉπΊ(ππ)
subject to π (π) β₯ 0; βπ = 1, β β β , π + 1π (π) β₯ π (π+1); βπ = 0, β β β , π.
(35)
Numerical results of this optimization problem for π = 1 and3 are reported in Figure 2.
B. RTD
With the fact that Chase Combining effectively increasesthe receive SNR to πππ and reduces the overall rate to π /πafter the π-th transmission, we can similarly compute thethroughput of RTD.
Numerator:
πΌ{π} = πΎβ {log (1 + (π + 1)ππ ) β₯ π }= πΎ (1β πΉπΊ(ππ+1)) (36)
Denominator:
πΌ{π― } =βππ=1 ππβ{log (1 + πππ ) β₯ π >
log(1 + (πβ 1)ππ )}+ (π + 1)πβ {π > log (1 +πππ )}=βππ=1 ππβ {ππβ1 β₯ π > ππ}+ (π + 1)πβ {ππ β₯ π}
=βππ=1 ππ [πΉπΊ(ππβ1)β πΉπΊ(ππ)] + (π + 1)ππΉπΊ(ππ )
= πβππ=0 πΉπΊ(ππ)
(37)RTD throughput:
ππ+1π ππ· =
π (1β πΉπΊ(ππ+1))βππ=0 πΉπΊ(ππ)
=π (1β πΉπΊ(ππ+1))
1 +βππ=1 πΉπΊ(ππ)
(38)
RTD throughput optimization problem:
maximizeπ (1β πΉπΊ(ππ+1))
1 +βππ=1 πΉπΊ(ππ)
. (39)
APPENDIX BPROOF OF LEMMA 1
The proof is done by showing that the optimal averagerate of RTD can be achieved by INR. For the simplicityof discussion in this appendix, we re-denote the bound-ary points of each fading region for INR and RTD to be
ππ = πβπ+2βπ
π=1π πβ1
π , ππ = ππ β1ππ , βπ = 1, β β β , π + 1, and
π0 = π0.= β, respectively. Let us rewrite the average rate
expression for INR and RTD as:
οΏ½ΜοΏ½ππΌππ (π 1, β β β , π π+1) =
π+1βπ=1
(π+2βπβ
π=1
π π
)β {π β [ππ, ππβ1)} ,
(40)
οΏ½ΜοΏ½ππ ππ· (π ) =
π+1βπ=1
π
πβ {ππβ1 > π β₯ ππ} . (41)
Consider the optimal solution for RTD: π = π β,οΏ½ΜοΏ½ππ ππ· (π ) = οΏ½ΜοΏ½π
βπ ππ·. Denote the corresponding optimal
boundary points of the fading regions as
πβπ =ππ
β β 1
ππ(42)
for π = 1, β β β , π + 1 and πβ0 = π0. Then choose(π 1, β β β , π π+1) such that
ππ = πβπ, βπ = 1, β β β , π + 1. (43)
Condition (43) forces the event {ππβ1 > π β₯ ππ} to be equiv-alent to
{πβπβ1 > π β₯ πβπ
}for π = 1, β β β , π + 1. Thus, the
proof is set if we can show thatπ+2βπβπ=1
π π β₯ π β
π, βπ = 1, β β β , π + 1. (44)
Condition (43) can be equivalently written asπ+2βπβπ=1
π π = logππ
β+ πβ 1
π. (45)
We need the following simple lemma.Lemma 4: The function
π(π) = π(π
ππ β 1
), π β₯ 1. (46)
is monotonically decreasing, where π > 0 is a constant.This can be easily proved by checking that dπ(π)
dπ > 0 forpositive and finite π. With Lemma 4 and Equation (45), thefollowing inequalities can be obtained:
Lemma 4 =β ππ β β 1 β₯ π
(π
π βπ β 1
)=β log
ππ β+ πβ 1
πβ₯ π β
π(45)=β (44)
which concludes the proof.
APPENDIX CPROOF OF LEMMA 2
The average rate of the INR scheme with a fixed π is givenin Equation (12) as
οΏ½ΜοΏ½ππΌππ =π+1βπ=1
π πβ
{log (1 + ππ ) β₯
πβπ=1
π π
}.
As π β β, this average rate converges to
οΏ½ΜοΏ½βπΌππ =
β« β
0
β {log (1 + ππ ) β₯ π }dπ = πΌ [log (1 + ππ )] (47)
where Equation (47) is due to the fact that
πΌ [π ] =
β« β
0
(1β πΉπ(π₯)) dπ₯
for a nonnegative random variable π .
SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3351
ππ+1πΌππ =
πΎβ{log (1 + ππ ) β₯ π (π+1)
}βππ=1 π
(π)β{π (πβ1) > log (1 + ππ ) β₯ π (π)
}+ π (π+1)β
{π (π) > log (1 + ππ )
}=
β{log (1 + ππ ) β₯ π (π+1)
}βππ=1
1π (π)β
{π (πβ1) > log (1 + ππ ) β₯ π (π)
}+ 1π (π+1)β
{π (π) > log (1 + ππ )
}=
1β πΉπΊ(ππ+1)βππ=1
1π (π) [πΉπΊ(ππβ1)β πΉπΊ(ππ)] +
1π (π+1)πΉπΊ(ππ)
=1β πΉπΊ(ππ+1)βπ
π=1
(1
π (π+1) β 1π (π)
)πΉπΊ(ππ)
(34)
APPENDIX DPROOF OF LEMMA 3
The average rate of the INR scheme with maximum πretransmissions is given in Equation (12). By defining πβπ =βππ=1π
βπ for π = 1, β β β , π + 1, the maximum average rate
of INR can be rewritten as
οΏ½ΜοΏ½πβ
πΌππ =
π+1βπ=1
πβπβ{πβπ+1 > log (1 + ππ ) β₯ πβπ
}. (48)
For the CSI feedback scheme defined in Section VI, theaverage rate can be computed as
οΏ½ΜοΏ½ππΆππΌ =2πβπ=1
π πβ {log (1 + πππ ) > log (1 + ππ ) β₯ π π} .(49)
It is observed in [19] that in order to maximize the averagerate of the CSI feedback scheme, π π must satisfy
π βπ = log
(1 + πβπβ1π
). (50)
Substituting (50) into (49), the optimal average rate of the CSIfeedback scheme is
οΏ½ΜοΏ½πβ
πΆππΌ =2πβπ=1
π βπβ{π βπ+1 > log (1 + ππ ) β₯ π β
π
}. (51)
Compare (51) to (48), the result π + 1 = 2π followsimmediately.
REFERENCES
[1] L. H. Ozarow, S. Shamai, and A. D. Wyner, βInformation theoreticconsiderations for cellular mobile radio," IEEE Trans. Veh. Technol.,vol. 43, no. 2, pp. 359-378, May 1994.
[2] T. M. Cover, βComments on broadcast channels," IEEE Trans. Inf.Theory, vol. 44, no. 6, pp. 2524-2530, Oct. 1998.
[3] S. Shamai, βA broadcast strategy for the Gaussian slowly fading chan-nel," in Proc. International Symp. Inform. Theory, June-July 1997, p.150.
[4] S. Shamai and A. Steiner, βA broadcast approach for a single-user slowlyfading MIMO channel," IEEE Trans. Inf. Theory, vol. 49, no. 10, pp.2617-2635, Oct. 2003.
[5] Y. Liu, V. K. N. Lau, O. Y. Takeshita, and M. P. Fitz, βOptimal rateallocation for superposition coding in quasi-static fading channels," inProc. International Symp. Inform. Theory, June-July 2002, p. 111.
[6] D. J. Costello, J. Hagenauer, H. Imai, and S. B. Wicker, βApplicationsof error-control coding," IEEE Trans. Inf. Theory, vol. 44, no. 6, pp.2531-2560, Oct. 1998.
[7] S. Wicker, Error Control Systems for Digital Communication andStorage. Englewood Cliffs, NJ: Prentice-Hall, 1995.
[8] H. Zheng and H. Visvanathan, βOptimizing ARQ performance inhigh speed downlink systems with scheduling," IEEE Trans. WirelessCommun., vol. 4, no. 2, pp. 495-506, Mar. 2005.
[9] G. Caire and D. Tuninetti, βThe throughput of Hybrid-ARQ protocolsfor the Gaussian collision channel," IEEE Trans. Inf. Theory, vol. 47,no. 5, pp. 1971-1988, July 2001.
[10] S. Sesia, G. Caire, and G. Vivier, βIncremental redundancy hybridARQ schemes based on low-density parity-check codes," IEEE Trans.Commun., vol. 52, no. 8, pp. 1311-1321, Aug. 2004.
[11] H. El Gamal, G. Caire, and M. O. Damen, βThe MIMO ARQ Channel:diversity-multiplexing-delay tradeoff," IEEE Trans. Inf. Theory, vol. 52,no. 8, pp. 3601β3621, Aug. 2006.
[12] A. Chuang, A. G. i Fabregas, L. Rasmussen, and I. Collings, βOptimalthroughput-diversity-delay tradeoff in MIMO ARQ block-fading chan-nels," IEEE Trans. Inf. Theory, vol. 54, no. 9, pp. 3968-3986, Sept.2008.
[13] J. Li and K. Narayanan, βRate-compatible low density parity check(RC-LDPC) codes for capacity-approaching ARQ schemes in packetdata communications," in Proc. International Conf. Commun., InternetInform. Technol. (CIIT), US Virgin Islands, Nov. 2002, pp. 201-206.
[14] J. Ha, J. Kim, and S. W. McLaughlin, βRate-compatible puncturingof low-density parity-check codes," IEEE Trans. Inf. Theory, vol. 50,no. 11, pp. 2824-2836, Nov. 2004.
[15] J. Kim, W. Hur, A. Ramamoorthy, and S. W. McLaughlin, βDesignof rate-compatible irregular LDPC codes for incremental redundancyhybrid ARQ systems," in Proc. International Symp. Inform. Theory,July 2006, pp. 1139-1143.
[16] J. Kim, A. Ramamoorthy, and S. W. McLaughlin, βDesign of efficiently-encodable rate-compatible irregular LDPC codes," in Proc. IEEE Inter-national Conf. Commun., June 2006, pp. 1131-1136.
[17] J. Ha, J. Kim, D. Klinc, and S. W. McLaughlin, βRate-compatiblepunctured low-density parity-check codes with short block lengths,"IEEE Trans. Inf. Theory, vol. 52, no. 2, pp. 728-738, Feb. 2006.
[18] M. Effros, A. Goldsmith, and Y. Liang, βCapacity definitions of generalchannels with receiver side information," in Proc. International Symp.Inform. Theory, June 2007, pp. 921β925.
[19] T. Kim and M. Skoglund, βOn the expected rate of slowly fadingchannels with quantized side information," IEEE Trans. Commun.,vol. 55, no. 4, pp. 820-829, Apr. 2007.
[20] R. M. Corless, G. H. Gonnet, D. E. G. Hare, D. J. Jeffrey, andD. E. Knuth, βOn the Lambert W function," Advances ComputationalMathematics, vol. 5, pp. 329-359, 1996.
[21] X. Wang and T. Orchard, βDesign of superposition coded modulation forunequal error protection," in Proc. IEEE International Conf. Commun.,June 2001, pp. 412-416.
[22] D. Tuninetti, βTransmitter channel state information and repetitionprotocols in block fading channels," in Proc. IEEE Inform. TheoryWorkshop, Sep. 2007, pp. 505-510.
[23] D. Chase, βCode combiningβa maximum-likelihood decoding approachfor combining an arbitrary number of noisy packets," IEEE Trans.Commun., vol. 33, no. 5, pp. 385-393, May 1985.
[24] N. Varnica, E. Soljanin, and P. Whiting, βLDPC code ensembles forincremental redundancy hybrid ARQ," in Proc. International Symp.Inform. Theory, Sep. 2005, pp. 995-999.
[25] TrellisWare Technologies, White papers on FlexiCodes and F-LDPC.[Online]. Available: www.trellisware.com.
[26] L. Lin, R. D. Yates, and P. Spasojevic, βAdaptive transmission withdiscrete code rates and power levels," IEEE Trans. Commun., vol. 51,no. 12, pp. 2115-2125, Dec. 2003.
[27] ββ, βAdaptive transmission with finite code rates," IEEE Trans. Inf.Theory, vol. 52, no. 5, pp. 1847-1860, May 2006.
[28] T. Kim and M. Skoglund, βDiversity-multiplexing tradeoff in MIMO
3352 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009
channels with partial CSIT," IEEE Trans. Inf. Theory, vol. 53, no. 8,pp. 2743-2759, Aug. 2007.
[29] T. Cover and J. Thomas, Elements of Information Theory. New York:Wiley Interscience, 1992.
[30] E. Biglieri, J. Proakis, and S. Shamai, βFading channels: information-theoretic and communications aspects," IEEE Trans. Inf. Theory, vol. 44,no. 6, pp. 2619-2692, Oct. 1998.
[31] A. Edelman, βEigenvalues and condition numbers of random matrices,"Ph.D. dissertation, Massachusetts Institute of Technology, Cambridge,MA, 1989.
[32] E. Telatar, βCapacity of multi-antenna Gaussian channels," AT&TβBellLabs, Tech. Rep., June 1995.
[33] M. Vu, βExploiting transmit channel side information in MIMO wirelesssystems," Ph.D. dissertation, Stanford University, Stanford, CA, 2006.
[34] E. Soljanin, N. Varnica, and P. Whiting, βPunctured vs rateless codesfor hybrid ARQ," in Proc. IEEE Inform. Theory Workshop, Punta delEste, Uruguay, Mar. 2006.
[35] M. Luby, βLT codes," in Proc. 43rd Annual IEEE Symp. FoundationsComputer Science, 2002, pp. 271-282.
[36] A. Shokrollahi, βRaptor codes," IEEE Trans. Inf. Theory, vol. 52, no. 6,pp. 2551-2567, June 2006.
[37] Y. Liu, βA lossless layered coding strategy for MIMO systems," in Proc.International Symp. Inform. Theory, Sep. 2005, pp. 1947β1951.
Cong Shen (Sβ01) received the B.S. and M.S.degrees, in 2002 and 2004 respectively, from theDepartment of Electronic Engineering, TsinghuaUniversity, Beijing, China. He is currently workingtowards the Ph.D. degree in the Electrical Engi-neering Department, University of California, LosAngeles (UCLA). His research interest is on generalcommunication theory with emphasis on wirelesscommunications.
Tie Liu received his B.S. (1998) and M.S. (2000) degrees, both in ElectricalEngineering, from the Tsinghua University, Beijing, China and M.S. degree inMathematics (2004) and Ph.D. degree in Electrical and Computer Engineering(2006) from the University of Illinois at Urbana-Champaign. Since August2006, he has been with the Texas A&M University where he is currentlyan Assistant Professor in Electrical and Computer Engineering. His researchinterests are in the field of information theory, wireless communication, andstatistical signal processing. He is a recipient of the M. E. Van ValkenburgGraduate Research Award (2006) from the University of Illinois at Urbana-Champaign and the Best Paper Award (2008) from the Third InternationalConference on Cognitive Radio Oriented Wireless Networks and Communi-cations.
Michael P. Fitz (Sβ82-Mβ83-SMβ02) received theB.E.E. degree (summa cum laude) from the Univer-sity of Dayton, Dayton, OH, in 1983 and the M.S.and Ph.D. degrees in electrical engineering from theUniversity of Southern California, Los Angeles, in1984 and 1989, respectively.
From 1983 to 1989, he was a CommunicationSystems Engineer with Hughes Aircraft and TRWInc. Since 1989, he has been with the Faculty of Pur-due University, West Lafayette, IN, The Ohio StateUniversity (OSU), Columbus, and the University of
California, Los Angeles. He is currently with Northrop Grumman Corp. as aSenior Systems Engineer working on satellite communications. His researchis in the broad area of statistical communication theory and experimentation.He is the author of Fundamentals of Communications Systems (New York:McGraw Hill, 2007). His research group at UCLA currently is interested inthe theory of space-time modems and operates an experimental wireless wide-area network and a space-time coding testbed.
Prof. Fitz received the 2001 IEEE Communications Society Leonard G.Abraham Prize Paper Award in the field of communications systems.