+ All Categories
Home > Documents > On the average rate performance of hybrid-ARQ in quasi-static fading channels

On the average rate performance of hybrid-ARQ in quasi-static fading channels

Date post: 16-Nov-2023
Category:
Upload: independent
View: 0 times
Download: 0 times
Share this document with a friend
14
IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009 3339 On the Average Rate Performance of Hybrid-ARQ in Quasi-Static Fading Channels Cong Shen, Student Member, IEEE, Tie Liu, Member, IEEE, and Michael P. Fitz, Senior Member, IEEE Abstractβ€”The problem of efficient communication over a scalar quasi-static fading channel is considered. The single-layer transmission (SLT) and multi-layer transmission (MLT) schemes do not require any knowledge of the channel state information (CSI) at the transmitter, but their performance is also limited. It is shown that using Hybrid-ARQ (HARQ) can significantly improve the average rate performance, provided that the rate assignment between different ARQ rounds is carefully chosen. The average rate performance of several HARQ schemes is optimized and compared. In addition, optimal power allocation among retransmissions is derived and shown to further increase the average rate. This power allocation gain is remarkable at low signal-to-noise ratio (SNR), but becomes negligible at high SNR. Comparison of two different types of limited feedback, sequential feedback (ARQ) and one-shot feedback (quantized CSI), is made from several perspectives. Although the optimization problem is formed with respect to the average rate, simulation results give a comprehensive comparison under different metrics, including average rate, outage probability, and the combination of both. Substantial performance improvement is observed with even one ARQ retransmission in all simulations. More importantly, this gain appears to be robust with respect to the fading distributions. Index Termsβ€”Hybrid-ARQ (HARQ), incremental redundancy, fading channels, throughput, channel state information (CSI). I. I NTRODUCTION T HE problem studied in this paper is how to efficiently transmit information over a quasi-static wireless fading channel, where the channel gain is constant during one co- herence block and changes independently from one block to another. It is assumed that the receiver can perfectly track the fading process. Depending on whether the transmitter knows about the instantaneous channel realization, different performance measures have been studied. If the transmitter has no knowledge of the channel realization other than the statistical characterization, the Shannon capacity is zero as Paper approved by L. Rasmussen, the Editor for Iterative Detection, De- coding, and ARQ of the IEEE Communications Society. Manuscript received February 19, 2008; revised October 22, 2008 and April 27, 2009. C. Shen is with the Department of Electrical Engineering, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA (e-mail: [email protected]). T. Liu is with the Department of Electrical and Computer Engineer- ing, Texas A&M University, College Station, TX 77843, USA (e-mail: [email protected]). M. P. Fitz was with the Department of Electrical Engineering, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA. He is now with Northrop Grumman Space Technology, Redondo Beach, CA 90278, USA (e-mail: fi[email protected]). The work of Cong Shen and Michael P. Fitz is supported by NSF grant CCF-0431196, and by ST Microelectronics with a matching grant from the University of California Discovery Program. Part of this work was done while Cong Shen was visiting Texas A&M University. Digital Object Identifier 10.1109/TCOMM.2009.11.080072 there is always a nonzero probability that the channel is in deep fade. A useful and well-accepted performance metric is the outage capacity [1]. In this formulation, a fixed-rate chan- nel code is used, and the information is reliably transmitted if the instantaneous channel gain supports the predetermined transmission rate. Otherwise, an outage is declared, and no information can be recovered at the receiver. Quasi-static fading with channel state information (CSI) available only at the receiver (CSIR) is a prime example of the detrimental effect of fading. Further improving the performance requires an opportunis- tic view of fading: a well-designed system should be able to adapt to the channel variations, i.e., it sends some in- formation across the channel when the channel is not-so- good and a lot of information when the channel is very good. By exploiting the β€œgood” channel realizations, the long- term throughput can be substantially improved. However, adapting to channel fading without transmitter’s knowledge of CSI faces some conceptual difficulties. The breakthrough was made in [2], where the author observed the similarity between communication over quasi-static fading channels and broadcasting to multiple users. This venue was later pursued in [3], [4], and [5], leading to the development of a multi- layer transmission (MLT) strategy which utilizes broadcast superposition coding. By organizing information into layers, the MLT strategy allows rate adaptation to channel fading at the expense of creating self interference during the decoding of the earlier-decoded layers in the stack. Furthermore, depending on the actual channel realization, it may occur that only part of the information can be decoded at the receiver, which may lead to some extra complications in the upper layers in the network hierarchy. This work follows the same line of examining the long-term throughput performance of communication over quasi-static wireless fading channels [3]–[5]. Both single-layer transmis- sion (SLT) and MLT are first briefly discussed. It is then shown that with the use of Hybrid-ARQ (HARQ), the average rate performance can be dramatically improved, provided that the rate assignment of the HARQ protocol is optimally designed. This scheme is also referred as Rate-Optimized HARQ (RO- HARQ). The basic idea is to exploit the existence of ARQ in the data link layer to increase the average rate. Roughly speaking, the initial transmission is set to be very aggressive (high rate). If the channel does not support this high rate, an ARQ will help by indicating the transmitter to reduce the rate. The average rate maximization problem of RO-HARQ is formulated, and numerical results demonstrate the remarkable gain over both SLT and MLT strategies. Moreover, this gain 0090-6778/09$25.00 c ⃝ 2009 IEEE
Transcript

IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009 3339

On the Average Rate Performance ofHybrid-ARQ in Quasi-Static Fading Channels

Cong Shen, Student Member, IEEE, Tie Liu, Member, IEEE, and Michael P. Fitz, Senior Member, IEEE

Abstractβ€”The problem of efficient communication over ascalar quasi-static fading channel is considered. The single-layertransmission (SLT) and multi-layer transmission (MLT) schemesdo not require any knowledge of the channel state information(CSI) at the transmitter, but their performance is also limited.It is shown that using Hybrid-ARQ (HARQ) can significantlyimprove the average rate performance, provided that the rateassignment between different ARQ rounds is carefully chosen.The average rate performance of several HARQ schemes isoptimized and compared. In addition, optimal power allocationamong retransmissions is derived and shown to further increasethe average rate. This power allocation gain is remarkable at lowsignal-to-noise ratio (SNR), but becomes negligible at high SNR.Comparison of two different types of limited feedback, sequentialfeedback (ARQ) and one-shot feedback (quantized CSI), is madefrom several perspectives. Although the optimization problem isformed with respect to the average rate, simulation results givea comprehensive comparison under different metrics, includingaverage rate, outage probability, and the combination of both.Substantial performance improvement is observed with even oneARQ retransmission in all simulations. More importantly, thisgain appears to be robust with respect to the fading distributions.

Index Termsβ€”Hybrid-ARQ (HARQ), incremental redundancy,fading channels, throughput, channel state information (CSI).

I. INTRODUCTION

THE problem studied in this paper is how to efficientlytransmit information over a quasi-static wireless fading

channel, where the channel gain is constant during one co-herence block and changes independently from one block toanother. It is assumed that the receiver can perfectly trackthe fading process. Depending on whether the transmitterknows about the instantaneous channel realization, differentperformance measures have been studied. If the transmitterhas no knowledge of the channel realization other than thestatistical characterization, the Shannon capacity is zero as

Paper approved by L. Rasmussen, the Editor for Iterative Detection, De-coding, and ARQ of the IEEE Communications Society. Manuscript receivedFebruary 19, 2008; revised October 22, 2008 and April 27, 2009.

C. Shen is with the Department of Electrical Engineering, University ofCalifornia, Los Angeles (UCLA), Los Angeles, CA 90095, USA (e-mail:[email protected]).

T. Liu is with the Department of Electrical and Computer Engineer-ing, Texas A&M University, College Station, TX 77843, USA (e-mail:[email protected]).

M. P. Fitz was with the Department of Electrical Engineering, Universityof California, Los Angeles (UCLA), Los Angeles, CA 90095, USA. He isnow with Northrop Grumman Space Technology, Redondo Beach, CA 90278,USA (e-mail: [email protected]).

The work of Cong Shen and Michael P. Fitz is supported by NSF grantCCF-0431196, and by ST Microelectronics with a matching grant from theUniversity of California Discovery Program. Part of this work was done whileCong Shen was visiting Texas A&M University.

Digital Object Identifier 10.1109/TCOMM.2009.11.080072

there is always a nonzero probability that the channel is indeep fade. A useful and well-accepted performance metric isthe outage capacity [1]. In this formulation, a fixed-rate chan-nel code is used, and the information is reliably transmittedif the instantaneous channel gain supports the predeterminedtransmission rate. Otherwise, an outage is declared, and noinformation can be recovered at the receiver. Quasi-staticfading with channel state information (CSI) available only atthe receiver (CSIR) is a prime example of the detrimentaleffect of fading.

Further improving the performance requires an opportunis-tic view of fading: a well-designed system should be ableto adapt to the channel variations, i.e., it sends some in-formation across the channel when the channel is not-so-good and a lot of information when the channel is verygood. By exploiting the β€œgood” channel realizations, the long-term throughput can be substantially improved. However,adapting to channel fading without transmitter’s knowledgeof CSI faces some conceptual difficulties. The breakthroughwas made in [2], where the author observed the similaritybetween communication over quasi-static fading channels andbroadcasting to multiple users. This venue was later pursuedin [3], [4], and [5], leading to the development of a multi-layer transmission (MLT) strategy which utilizes broadcastsuperposition coding. By organizing information into layers,the MLT strategy allows rate adaptation to channel fading atthe expense of creating self interference during the decoding ofthe earlier-decoded layers in the stack. Furthermore, dependingon the actual channel realization, it may occur that only partof the information can be decoded at the receiver, which maylead to some extra complications in the upper layers in thenetwork hierarchy.

This work follows the same line of examining the long-termthroughput performance of communication over quasi-staticwireless fading channels [3]–[5]. Both single-layer transmis-sion (SLT) and MLT are first briefly discussed. It is then shownthat with the use of Hybrid-ARQ (HARQ), the average rateperformance can be dramatically improved, provided that therate assignment of the HARQ protocol is optimally designed.This scheme is also referred as Rate-Optimized HARQ (RO-HARQ). The basic idea is to exploit the existence of ARQin the data link layer to increase the average rate. Roughlyspeaking, the initial transmission is set to be very aggressive(high rate). If the channel does not support this high rate,an ARQ will help by indicating the transmitter to reduce therate. The average rate maximization problem of RO-HARQ isformulated, and numerical results demonstrate the remarkablegain over both SLT and MLT strategies. Moreover, this gain

0090-6778/09$25.00 c⃝ 2009 IEEE

3340 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009

appears to be robust to the fading distributions. Further averagerate increase is possible, especially in the low signal-to-noise ratio (SNR) regime, if power allocation among differentretransmissions is performed. To comprehensively comparethese schemes, numerical optimization and simulations areperformed with respect to different performance measures,including average rate, outage probability, and the combinationof both.

The idea of using ARQ to improve communication perfor-mance is not new. In fact, HARQ techniques are widely usedin most of the contemporary digital communication systems. Agood summary of the progress of HARQ schemes is presentedin [6]. Early work regarding the HARQ system focuses on theusage of algebraic error-correction and error-detection codes[7]. Recent interests of HARQ mainly originate from the rapidprogress of wireless communications, where high-rate reliabletransmission faces the challenge of severe channel fluctuations.Throughput and scheduling optimization of downlink packetdata systems are investigated in [8]. An information-theoreticthroughput and delay analysis of several HARQ schemes inthe Gaussian collision channel is reported in [9]. Throughputanalysis of incremental redundancy HARQ in the block-fadingadditive white Gaussian noise (AWGN) channel is carriedout in [10]. A general framework of diversity-multiplexing-delay tradeoff is proposed in [11] to study multiple-inputmultiple-output (MIMO) ARQ block fading channels. Later[12] extends this framework to incorporate discrete inputdistributions. From the practical implementation point of view,research interests have shifted from traditional algebraic linearblock codes to the more powerful capacity-approaching mod-ern codes. For example, the problem of designing low-densityparity-check (LDPC) codes for the HARQ protocol has beenaddressed in [10], [13]–[17].

The rest of this paper is organized as follows. Section II de-fines the system model. Section III briefly discusses the aver-age rate performance of SLT and MLT. Section IV presents thetheoretical analysis of RO-HARQ. Average rate maximizationis discussed in Section IV-C and IV-D, followed by the optimalpower allocation in Section IV-E. Numerical comparison withseveral different performance metrics is reported in SectionV. Section VI discusses the difference between sequentialfeedback and one-shot CSI feedback. Finally, Section VIIconcludes the paper and points out possible directions forfuture work.

II. SYSTEM MODEL

We consider a scalar quasi-static fading channel where therandom channel gain β„Ž remains constant for a duration of 𝑇𝑐symbol times and then changes independently to another valueaccording to the fading distribution. The value 𝑇𝑐 is generallydetermined by the channel coherence time. The signal modelcan be written as

𝑦[π‘š] = β„Žπ‘₯[π‘š] + 𝑧[π‘š], π‘š = 1, β‹… β‹… β‹… , 𝐿 (1)

where {π‘₯[π‘š],π‘š = 1, β‹… β‹… β‹… , 𝐿} is a length-𝐿 codeword con-taining 𝐾 information nats1, and 𝑧[π‘š] is independent and

1The information unit is nat throughout the paper, except for the numericalresults.

identically distributed (i.i.d.) complex Gaussian noise withzero mean and variance 𝑁0 (denoted as π’žπ’© (0, 𝑁0)). Weassume 𝑇𝑐 ≫ 𝐿 such that the transmission of 𝐾 informationnats only experiences one fading state. This is the worst casesince no time diversity can be exploited. There is a short-termaverage power constraint of 𝑃 on {π‘₯[π‘š]}, which prohibitspower allocation across different fading states2. Since eachtransmission experiences an AWGN channel, our analysis inthis paper is restricted to the Gaussian input distribution.The channel gain and noise power are normalized to be𝔼[βˆ£β„Žβˆ£2] = 1 and 𝑁0 = 1 respectively, so the average

received signal-to-noise ratio is SNR.= 𝑃𝔼

[βˆ£β„Žβˆ£2] /𝑁0 = 𝑃 .The random channel power 𝑔

.= βˆ£β„Žβˆ£2 β‰₯ 0 is assumed to be a

continuous variable with the cumulative distribution function(CDF) 𝐹 (𝑔) and the probability density function (PDF) 𝑓(𝑔).One example is the frequently-encountered Rayleigh fadingwith β„Ž ∼ π’žπ’© (0, 1). We also assume that the transmitterhas a very large pool of information nats such that once thetransmission of the current 𝐾 nats ends, the transmitter startsto send the next 𝐾 nats immediately.

The quasi-static fading channel is a good model for usersthat are stationary, or moving slowly relative to the rateof communication. Due to the slowly varying nature of thechannel, channel estimation at the receiver can be performedwith high accuracy. Thus, it is reasonable to assume perfectCSI at the receiver. This assumption will be made throughoutthis paper.

The focus of this work is on high-rate, delay-insensitiveapplications such as data traffic in wireless LAN. For suchapplications the use of capacity-approaching channel codeswith long block length can be justified, and information-theoretic results are good approximations of real-world per-formance. This motivates us to take an information-theoreticview and consider the capacity related measures on the systemperformance in this paper.

III. SINGLE-LAYER AND MULTI-LAYER TRANSMISSIONS

A. Figure of Merit

Our main figure of merit is the long-term throughput. Letus use 𝑛 to count the number of time slots, 𝑇𝑛 for the channeluses of the 𝑛-th slot, 𝑇 (𝑛) =

βˆ‘π‘›π‘–=1 𝑇𝑖 for the total number of

channel uses at the end of the 𝑛-th slot, and π‘˜(𝑛) for the totalnumber of information nats successfully decoded up to slot 𝑛.Then the long-term throughput (in nats per channel use, npcu)is defined as [9]

πœ‚.= limπ‘›β†’βˆž

π‘˜(𝑛)

𝑇 (𝑛)=

𝔼{π‘˜}𝔼{𝒯 } (2)

where π‘˜ is the number of successfully decoded informationnats at the end of each transmission, and 𝒯 is the number ofchannel uses.

Both SLT and MLT belong to the general category of fixed-length coding schemes, in which the code length is constantregardless of the fading state. For such fixed-length codes, the

2When discussing power allocation for RO-HARQ in Section IV-E, unequalpower allocation between the primary transmission and ARQ retransmissionsis allowed (still within one channel state).

SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3341

denominator of (2) is constant, and the long-term throughputdegenerates to the average rate:

οΏ½Μ„οΏ½.= 𝔼𝑔 [𝑅(𝑔)] (3)

where 𝑔 is the channel state, and 𝑅(𝑔) is the successfultransmission rate when channel state is 𝑔. See [18] for a formaldefinition of average rate. It should be noted that average rateis different from the ergodic capacity; here, coding acrossdifferent fading states is prohibited, c.f. channel model (1).

B. SLT and MLT schemes

Assuming that the transmitter does not know the channelstate 𝑔, a simple and well-adopted scheme for communicatingover a slow fading channel is to send the data at a fixed rate𝑅. If the instantaneous channel realization supports the rate𝑅, the receiver gets a successful transmission; otherwise itdeclares an outage.

The average rate of SLT can be calculated as [19, Equation(7)]

�̄�𝑆𝐿𝑇 = 𝑅 β„™ {log (1 + 𝑔𝑃 ) β‰₯ 𝑅}= 𝑅

(1βˆ’ 𝐹

(𝑒𝑅 βˆ’ 1

𝑃

)). (4)

In order to maximize the average rate, one needs to solve thefollowing optimization problem:

maximize 𝑅(1βˆ’ 𝐹

(π‘’π‘…βˆ’1𝑃

))subject to 𝑅 β‰₯ 0.

(5)

This optimization problem has been studied in [19, SectionIII], where the Karush-Kuhn-Tucker (KKT) condition is de-rived. Here, we mention that with Rayleigh fading the KKTcondition simplifies to

𝑅𝑒𝑅 = 𝑃 ⇐⇒ π‘…βˆ—π‘†πΏπ‘‡,π‘…π‘Žπ‘¦π‘™π‘’π‘–π‘”β„Ž = β„’(𝑃 ) (6)

where β„’(β‹…) is the Lambert π‘Š function [20] which is definedas the solution to 𝑦𝑒𝑦 = π‘₯, and

οΏ½Μ„οΏ½βˆ—π‘†πΏπ‘‡,π‘…π‘Žπ‘¦π‘™π‘’π‘–π‘”β„Ž = β„’(𝑃 ) exp

(βˆ’π‘’β„’(𝑃 ) βˆ’ 1

𝑃

). (7)

The problem with SLT is that it evaluates the transmissionin an β€œon-off” fashion: the transmission is either entirely suc-cessful or totally failed. This scheme suffers from both over-utilizing and under-utilizing the channel since it uses a non-adaptive transmission strategy. The MLT scheme is developedto overcome the disadvantages of the SLT scheme withouttransmitter’s knowledge of CSI. The MLT scheme adopts themulti-user broadcast superposition code for a single-user slowfading channel. It is proposed in [4] for infinitely-many layers,and in [5] for finite layers. As an example, let us consider thecase with 𝑁 = 2 layers. In this case, the transmitter splitsthe message 𝑀 into two sub-messages 𝑀1 and 𝑀2. They areseparately encoded into {π‘₯1[π‘š]} and {π‘₯2[π‘š]} respectively,assuming Gaussian signaling, and finally superposed into thetransmit signal π‘₯[π‘š] = π‘₯1[π‘š] + π‘₯2[π‘š], where {π‘₯1[π‘š]} has apower 𝛼𝑃 and rate 𝑅1 npcu, {π‘₯2[π‘š]} has a power (1βˆ’π›Ό)𝑃and rate 𝑅2 npcu, and the scalar 𝛼 ∈ [0, 1] determines thepower allocation between these two layers. At the receiver,the decoding is carried out based on the channel realization β„Ž.

The first step is trying to decode {π‘₯1[π‘š]}, treating {π‘₯2[π‘š]}as adding to the noise floor. If {π‘₯1[π‘š]} is not successfullydecoded, the receiver will give up and declare an error.On the other hand, if {π‘₯1[π‘š]} is successfully decoded, thedecoding procedure continues to the second step: subtractingthe successfully decoded {π‘₯1[π‘š]} and then decoding {π‘₯2[π‘š]}.After some manipulation [5], the two-layer MLT average ratemaximization problem reduces to:

maximize 𝑅1 (1βˆ’ 𝐹 (𝑠1)) +𝑅2 (1βˆ’ 𝐹 (𝑠2))subject to 𝑅1 β‰₯ 0

𝑅2 β‰₯ 0𝑒𝑅1+𝑅2βˆ’π‘’π‘…2

𝑒𝑅1+𝑅2βˆ’1≀ 𝛼 ≀ 1

(8)

where 𝑠1.= 𝑒𝑅1βˆ’1𝑃 (1βˆ’(1βˆ’π›Ό)𝑒𝑅1 )

and 𝑠2.= 𝑒𝑅2βˆ’1𝑃 (1βˆ’π›Ό) .

The average rate of MLT can be further boosted by increas-ing the number of layers in the superposition code, althoughsuch gain has been shown to be insignificant in Rayleighfading [5]. In [4], Shamai and Steiner derived the maximumaverage rate of a superposition code with infinitely-manylayers for the Rayleigh fading:

οΏ½Μ„οΏ½βˆžπ‘€πΏπ‘‡,π‘…π‘Žπ‘¦π‘™π‘’π‘–π‘”β„Ž = 2𝐸𝑖(𝑆0)βˆ’ 2𝐸𝑖(1)βˆ’

(π‘’βˆ’π‘†0 βˆ’ π‘’βˆ’1

)(9)

where 𝑆0 = 2/(1 +

√1 + 4𝑃

)and 𝐸𝑖(π‘₯) =

∫∞π‘₯

π‘’βˆ’π‘‘

𝑑 d𝑑, π‘₯ β‰₯0 is the exponential integral function.

One property of the layering strategy is that it creates selfinterference to the decoding. Specifically, when the receiverdecodes the 𝑛-th layer, it treats all the not-yet-decoded layers𝑛+ 1, β‹… β‹… β‹… , 𝑁 as interference and thus decreases the effectivereceive SNR for the 𝑛-th layer to

SNR𝑛 =𝑔𝑃𝑛

1 + π‘”βˆ‘π‘π‘–=𝑛+1 𝑃𝑖

.

As the number of layers increases, this self interference affectsmore layers in the decoding; in fact, only the last decodedlayer is interference-free. This self interference has beenthe main concern of using broadcast superposition code ina single-user slow fading channel [21]. Thus, although theMLT scheme improves the performance over SLT, furtherperformance gain can be expected if we are able to eliminatesuch self interference. This motivated the development ofa Rate-Optimized Hybrid-ARQ (RO-HARQ) which will bedescribed next.

IV. RO-HARQ: THEORETICAL ANALYSIS

A. HARQ Schemes

In this section, we study several HARQ schemes withoptimally designed rate and power for each transmission. Itwill be shown that optimized HARQ can eliminate all of theaforementioned problems. With the help of ARQ, the decodingstatus at the receiver will be reported back to the transmitter,which indicates the successful decoding of received signalby acknowledgement (ACK) and failed decoding by negativeacknowledgement (NACK). It is assumed that the ARQ feed-back channel is delay and error free. The maximum numberof retransmissions of HARQ is denoted by 𝑁 , i.e., the totaltransmissions (including the initial one) cannot exceed 𝑁 +1.In the ARQ literature, 𝑁 + 1 is also called the maximum

3342 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009

allowable ARQ rounds. The resulting protocol is sometimesreferred to as ARQ with a deadline. The choice of 𝑁 reflectsthe worse-case delay caused by the ARQ retransmissions,and is generally determined by the system delay requirement.For example, choosing small 𝑁 models certain delay criticalsituation. In order to be consistent with channel model (1), themaximum ARQ rounds should limit the overall code lengthto be within 𝐿. Note that the HARQ protocol is typicallyavailable in almost all existing wireless systems, and thusexploiting HARQ will not need additional designs.

We consider the following three HARQ protocols3 in thispaper.

1) ALO. The transmitter encodes the 𝐾 information nats atrate 𝑅 npcu, and then keeps sending the same encodedpacket in every retransmission. The receiver only decodesthe most recently received packet. This loop continuesuntil the ACK is declared by the receiver, or the maxi-mum retransmissions are used.

2) RTD. The transmitter is the same as ALO. The receiverperforms a maximum ratio combining (MRC) of all thereceived packets. In the HARQ literature, this scheme isalso referred as Chase Combining [23].

3) INR. The transmitter encodes the 𝐾 information nats intoa codeword of length 𝑇 (𝑁+1). Then it serially puncturesthe length-𝐿 codeword into 𝑁 + 1 sub-codewords withstrictly decreasing rates 𝐾/𝑇 (1) > 𝐾/𝑇 (2) > β‹… β‹… β‹… >𝐾/𝑇 (𝑁+1). The lengths of sub-codewords are the designparameters, which are determined by the code rate opti-mization and will be addressed in Section IV-C. At the 𝑛-th transmission (𝑛 = 1, β‹… β‹… β‹… , 𝑁 ), the transmitter reducesthe total rate to4 𝑅1 + 𝑅2 + β‹… β‹… β‹… + 𝑅𝑁+2βˆ’π‘› = 𝐾/𝑇 (𝑛)

by sending additional redundancy symbols. The receivertries to decode based on all the packets it receives up untilthis moment. At the last retransmission (round 𝑁 + 1),rate 𝑅1 is tried without any ARQ feedback. If decodingis still unsuccessful, a decoding failure is declared.

This works focuses on INR due to the following tworeasons.

1) ALO scheme does not work for slow fading channels.The reason is that there is no time diversity to exploit,as the channel gain remains constant over ARQ retrans-missions. Thus to keep sending the same packet and onlydecoding the most recently received one will not increasethe chance of successful decoding.

2) RTD has an average rate performance that is inferior toINR. This will be shown both analytically and numeri-cally. Note that although RTD has inferior performance, ithas some practical advantages which make it attractive insome applications. For example, the retransmitted packetin RTD is always the same, which is easier to implementthan INR, in which additional parity symbols have to begenerated and transmitted. As another example, having

3The acronyms are borrowed from [9], [22], although the schemes andapplications may be different. For example, [9], [22] studied a slottedsystem with equal length for each retransmission. They considered a multi-user TDMA system, where it is reasonable to assume each retransmissionexperiences a different fading state. Both are different from our assumptions.

4The reason of using this seemingly redundant expression for the data rateis that it simplifies equations (11), (12), (13), and the proof of Lemma 2.

the same size for the retransmitted packets in RTD makesthe packetization simple, while in INR the length ofthe retransmitted parity symbols may vary from oneretransmission to another.

The novelty of the proposed RO-HARQ scheme is theoptimal rate assignment5 of each transmission. Traditionally,HARQ is used in a passive manner in a wireless system. Thepurpose of HARQ is to indicate the decoding status to thetransmitter, such that the transmitter can protect the β€œbad”packet by retransmitting. The existence of HARQ is typicallynot exploited by the transmitter; it is treated only as a binaryindicator of the decoding status, and other modules in thesystem (e.g., channel coding, modulation, etc.) are operatingas if HARQ were not available. Due to this reason, the rate ofeach HARQ transmission and the (equal) power allocation isusually predetermined. This work proposes the different viewthat HARQ can be exploited by the transmitter to providebetter average rate and outage performance, i.e., it can beutilized in an active manner. This new view suggests thatthe rate and power associated with each HARQ transmissioncan be optimized according to the channel fading statistics toachieve better performance. The main idea is that since thetransmitter is aware of the HARQ link in the system, it cantransmit very aggressively (at very high rate) even if it doesnot know the random channel state 𝑔 before transmitting. Thenif the channel is not good, this high-rate transmission willfail, and the HARQ can save it by indicating this failure tothe transmitter so that it can adjust to a lower rate. On theother hand, if it is β€œlucky” that the channel is strong enough,such β€œgambling” will bring high return: the strong channelrealization is (almost) fully utilized, i.e., there is (almost) nowaste of the good channel. This is the basic idea behind RO-HARQ.

It is now clear why RO-HARQ can eliminate the problemsof SLT and MLT. By transmitting aggressively, the β€œgood”channel realizations are fully used; with the ARQ feedback,the transmission rate can be reduced for the β€œbad” channelrealizations. At the same time, as there is no superpositionor layering in the channel coding, self interference does notexist.

B. Average Rate for HARQ

The general expression (2) characterizes the long-termthroughput for both fixed and variable length coding schemes.How to evaluate and optimize the throughput performanceof variable-length coding schemes depends on the specificapplications and the assumptions on how to use the channel.If we assume that the channel remains constant during thetransmission of 𝐾 information nats, and changes indepen-dently when the transmission of current packet6 is doneand a new packet is ready to transmit, we need to evaluateboth the average number of successfully decoded informationnats (numerator of (2)) and the average delay (denominator

5Another novelty is the optimal power allocation among transmissionsto further improve the average rate performance. This is discussed in Sec-tion IV-E.

6The transmission of one packet includes several possible retransmissions,according to the HARQ protocol. Different packets contain different infor-mation.

SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3343

of (2)) with respect to the fading distribution. Throughputanalysis based on such assumptions is given in Appendix Aand numerical performance is reported in Section V. Thisassumption can be valid for burst communication, where thereis a long idle period in between transmissions of differentpackets such that the channel gain changes independently fromone transmission to another.

A different scenario is that the transmitter has a verylarge pool of information nats such that the communicationis β€œcontinuous”. For this scenario, multiple packets (witheach packet having multiple ARQ rounds) will be transmittedwithin one coherent period. When the channel is β€œgood”, morepackets can be transmitted within one coherent period; whenthe channel is β€œbad”, only a small number of packets canbe transmitted within the same coherent period. As a result,each fading state has the same length of channel uses, and theempirical channel distribution will match the true one. This isthe key difference from the burst communication scenario. Thelong-term throughput (2) for this scenario can be computedas follows. We use 𝑅(𝑔) to denote the instantaneous rate ofthe HARQ scheme for a given channel realization 𝑔. Thetotal number of information nats that can be successfullytransmitted is π‘˜(𝑔) = 𝑇𝑐𝑅(𝑔) over the channel state 𝑔. Hencewe have

πœ‚ =𝔼{π‘˜}𝔼{𝒯 } =

𝔼{𝑇𝑐𝑅(𝑔))}𝑇𝑐

= 𝔼{𝑅(𝑔)} = οΏ½Μ„οΏ½ (10)

which again degenerates to the definition of average rate. Thusin the following we shall focus on optimizing the average rateperformance of different HARQ schemes.

C. Average Rate Maximization of INR and RTD

The INR scheme is described in Section IV-A. With INR,the random variable 𝑅(𝑔) is

𝑅(𝑔) =

⎧⎨⎩

βˆ‘π‘+1𝑖=1 𝑅𝑖, if log (1 + 𝑔𝑃 ) β‰₯βˆ‘π‘+1

𝑖=1 π‘…π‘–βˆ‘π‘›π‘–=1 𝑅𝑖, if

βˆ‘π‘›+1𝑖=1 𝑅𝑖 > log (1 + 𝑔𝑃 ) β‰₯βˆ‘π‘›π‘–=1𝑅𝑖, βˆ€π‘› = 1, β‹… β‹… β‹… , 𝑁

0, if 𝑅1 > log (1 + 𝑔𝑃 ).(11)

The average rate of INR can thus be computed as

�̄�𝑁𝐼𝑁𝑅 =

𝑁+1βˆ‘π‘›=1

𝑅𝑛ℙ

{log (1 + 𝑔𝑃 ) β‰₯

π‘›βˆ‘π‘˜=1

π‘…π‘˜

}(12)

=𝑁+1βˆ‘π‘›=1

𝑅𝑛 (1βˆ’ 𝐹 (𝑔𝑛)) (13)

where 𝑔𝑛.= 𝑒

βˆ‘π‘›π‘˜=1 π‘…π‘˜βˆ’1𝑃 .

The problem is to choose the nonnegative design parameters{𝑅1, β‹… β‹… β‹… , 𝑅𝑁+1} to optimize the average rate (13). Clearly,the non-negativity constraints on {𝑅𝑛} can be dropped, andthe INR average rate maximization problem becomes

maximizeβˆ‘π‘+1𝑛=1 𝑅𝑛 (1βˆ’ 𝐹 (𝑔𝑛)) . (14)

A direct evaluation of the KKT condition gives

1βˆ’πΉ (𝑔𝑛)βˆ’π‘…π‘›π‘’βˆ‘π‘›

π‘˜=1 π‘…π‘˜

𝑃𝑓 (𝑔𝑛)βˆ’

𝑁+1βˆ‘π‘–=𝑛+1

π‘…π‘–π‘’βˆ‘π‘–

π‘˜=1 π‘…π‘˜

𝑃𝑓 (𝑔𝑖) = 0

(15)

for 𝑛 = 1, β‹… β‹… β‹… , 𝑁, and

1βˆ’ 𝐹 (𝑔𝑁+1)βˆ’ 𝑅𝑁+1π‘’βˆ‘π‘+1

π‘˜=1 π‘…π‘˜

𝑃𝑓 (𝑔𝑁+1) = 0. (16)

In some simple cases (e.g., small 𝑁 ), equations (15) and (16)can be numerically solved to give the optimal rate assignment.

The average rate of RTD can be similarly computed. Inthe 𝑛-th transmission of RTD, 𝑛 = 1, β‹… β‹… β‹… , 𝑁 + 1, thereceiver performs maximum ratio combining of the 𝑛 receivedpackets. This processing effectively increases the receive SNRto SNR𝑛 = 𝑛𝑃 , while reducing the data rate to 𝑅/𝑛. Thusthe random variable 𝑅(𝑔) is

𝑅(𝑔) =

⎧⎨⎩

𝑅𝑛 , if

log (1 + 𝑛𝑔𝑃 ) β‰₯ 𝑅 >log (1 + (π‘›βˆ’ 1)𝑔𝑃 ), βˆ€π‘› = 1, β‹… β‹… β‹… , 𝑁 + 1

0, if 𝑅 > log (1 + (𝑁 + 1)𝑔𝑃 ).(17)

The average rate can be computed as

�̄�𝑁𝑅𝑇𝐷 =

𝑁+1βˆ‘π‘›=1

𝑅

𝑛ℙ {π‘žπ‘›βˆ’1 > 𝑔 β‰₯ π‘žπ‘›} (18)

=

𝑁+1βˆ‘π‘›=1

𝑅

𝑛(𝐹 (π‘žπ‘›βˆ’1)βˆ’ 𝐹 (π‘žπ‘›)) (19)

where π‘žπ‘›.= π‘’π‘…βˆ’1

𝑛𝑃 for 𝑛 = 1, β‹… β‹… β‹… , 𝑁 + 1, and π‘ž0.= ∞.

Similarly, the RTD average rate maximization problem can beformulated as

maximizeβˆ‘π‘+1𝑛=1

𝑅𝑛 (𝐹 (π‘žπ‘›βˆ’1)βˆ’ 𝐹 (π‘žπ‘›)) . (20)

Numerical evaluation and comparison of the optimal averagerate of (14) and (20) are given in Section V, where we willsee that RTD has a worse average rate performance than INR.This can also be analytically proved as follows.

Lemma 1: Denote the optimal solution to problem (14)as ({π‘…βˆ—

1, β‹… β‹… β‹… , π‘…βˆ—π‘+1}, �̄�𝑁

βˆ—πΌπ‘π‘…), and the optimal solution to

problem (20) as (π‘…βˆ—, οΏ½Μ„οΏ½π‘βˆ—

𝑅𝑇𝐷). Then

οΏ½Μ„οΏ½π‘βˆ—

𝑅𝑇𝐷 ≀ οΏ½Μ„οΏ½π‘βˆ—

𝐼𝑁𝑅 (21)

Proof: See Appendix B.In order to fully enjoy the benefits of RO-HARQ, the

corresponding channel code design for INR has to satisfy thefollowing two requirements.

1) A good mother code that can be serially punctured intoseveral different optimized code rates.

2) A single decoder that can handle all the rates such thatthe coding performance of each sub-code is capacity-approaching.

There are several existing code designs, developed in bothacademia (e.g., [10], [14], [15], [17], [24]) and industry (e.g.,[25]) that satisfy these requirements.

D. Asymptotic Average Rate of INR

It is of interest to ask what is the asymptotic performancelimit of the INR scheme (by allowing 𝑁 to go to infinity,i.e., infinitely-many retransmissions). Such asymptotic perfor-mance gives the ultimate limit of the proposed scheme. Itis also a good performance indication for delay insensitiveapplications. The following lemma states that the average rate

3344 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009

of INR asymptotically converges to the ergodic capacity, eventhough coding across fading blocks is prohibited in INR.

Lemma 2: As 𝑁 β†’ ∞, the average rate of INR withequal power allocation among transmissions converges to theergodic capacity of the channel.

Proof: See Appendix C.There are two interesting observations of Lemma 2. First,

Lemma 2 applies to any fading distribution. This can alsobe seen from the proof. Secondly, in order to achieve thisasymptotic limit, there is no requirement of optimal rateallocation. This observation suggests that as 𝑁 becomes large,the gain of optimal rate design diminishes.

In a slow fading environment, if one is not allowed toperform coding over different fading states, it is well acceptedthat the outage performance is a good measure. In INR, thecoding is still within one fading state. However, with the useof ARQ feedback, the performance measure switches fromoutage capacity to ergodic capacity, as the amount of ARQretransmissions increases to infinity. This result, however,is not surprising in the following two aspects. First, withinfinitely-many retransmissions allowed, in each fading blockthe transmission rate is gradually reduced (by NACK fromthe receiver) until it arrives at the exact rate that the currentchannel gain 𝑔 can support: log (1 + 𝑔𝑃 ). That is, eventuallythe transmission happens with a rate that perfectly matches tothe instantaneous channel realization. Thus all the informationnats can theoretically be decoded without any error, andthe average rate is the ergodic capacity. Secondly, allowingARQ feedback effectively informs the transmitter not onlythe decoding status, but also the partial CSI. As the numberof ARQ retransmissions goes to infinity, the informed CSIbecomes perfect eventually.

From the information-theoretic viewpoint, a receiver decod-ing failure is equivalent to a channel outage with capacity-achieving codes. Thus, an ACK/NACK feedback essentiallyinforms the transmitter a channel quantization, i.e., whetherthe instantaneous channel gain is greater than a given thresh-old. With multiple ARQ rounds, this process becomes a se-quential feedback scheme, where the entire channel state spaceis sequentially quantized with more and more ACK/NACKfeedback, and thus the transmitter gets a finer and finer knowl-edge of the channel. Another widely used limited feedbackscheme is to perform a global quantization of the channelstate space, and then indicate to the transmitter the intervalindex in which the channel realization falls. Such a schemehas been well studied in [19], [26]–[28]. We will call thisscheme one-shot feedback and a detailed comparison with theARQ sequential feedback is made in Section VI.

E. Power Allocation for INR

In the previous sections, the general idea of RO-HARQis discussed and the average rate performance is optimized,under the assumption that the transmit power is constantthroughout the entire ARQ process. Intuitively, allowing powerallocation among the 𝑁 + 1 transmissions could furtherimprove the average rate. For example, boosting the powerof primary transmission will increase the probability that theprimary transmission succeeds. However this comes at the

PowerPower PowerTransmission 1 Transmission 2 ...... Transmission N+1

TN +1

P1 PN +1

T1 T2

P2

Fig. 1. Illustration of unequal power allocation for INR with 𝑁 retransmis-sions.

price of decreasing the power of ARQ retransmissions andhence the probability for success, if the average transmit poweris kept constant. There is obviously a tradeoff in allocatingpower among these transmissions, and this idea is pursued inthe following. Figure 1 gives a graphical illustration of thisprocess. We shall first formulate the problem for general 𝑁 ,and then focus on the simplest case of 𝑁 = 1 both analyticallyand numerically.

Consider the INR scheme with 𝐾 information nats anda maximum 𝑁 + 1 transmissions. We assume that the 𝑛-thtransmission takes place with power 𝑃𝑛, and use π’œπ‘› to denotethe event of a successful decoding at the end of transmission𝑛. We will approach this problem in two steps. First we derivethe average rate for a fixed power allocation, and then showhow to maintain a constant average power.

1) Average rate for a given power allocation(𝑃1, β‹… β‹… β‹… , 𝑃𝑁+1): Similar to (11), the rate of INR withgiven power allocation policy (𝑃1, β‹… β‹… β‹… , 𝑃𝑁+1) is a randomvariable:

𝑅(𝑔) =

βŽ§βŽ¨βŽ©βˆ‘π‘+2βˆ’π‘›π‘–=1 𝑅𝑖, if

π’œ1, β‹… β‹… β‹… ,π’œπ‘›βˆ’1,π’œπ‘›;βˆ€π‘› = 1, β‹… β‹… β‹… , 𝑁 + 1

0, if π’œ1, β‹… β‹… β‹… ,π’œπ‘+1

(22)

where we define π’œ0 as the empty set.In order to evaluate β„™ {π’œπ‘›}, we need to derive the achiev-

able rate of sending a long Gaussian code where differentportions of the code have different power. This is not obviousbut with a random coding and typical set decoding argument[29], this achievable rate can be shown to be a TDMA-typeone, and thus

β„™ {π’œπ‘›}= β„™

{βˆ‘π‘›π‘–=1

π‘‡π‘–βˆ‘π‘›π‘—=1 𝑇𝑗

log (1 + 𝑔𝑃𝑖) β‰₯βˆ‘π‘+2βˆ’π‘›π‘–=1 𝑅𝑖

}= β„™

{βˆ‘π‘›π‘–=1

1βˆ‘π‘+2βˆ’π‘–π‘™=1

π‘…π‘™βˆ‘π‘›π‘—=1

1βˆ‘π‘+2βˆ’π‘—π‘™=1

𝑅𝑙

log (1 + 𝑔𝑃𝑖) β‰₯βˆ‘π‘+2βˆ’π‘›π‘–=1 𝑅𝑖

}.

(23)Finally the average rate is given by

�̄�𝑁𝐼𝑁𝑅 =𝑁+1βˆ‘π‘›=1

(𝑁+2βˆ’π‘›βˆ‘π‘–=1

𝑅𝑖

)β„™{π’œ1, β‹… β‹… β‹… ,π’œπ‘›βˆ’1,π’œπ‘›

}. (24)

2) Average power constraint: The power allocation(𝑃1, β‹… β‹… β‹… , 𝑃𝑁+1) is a design parameter. In order to make afair comparison to the scheme discussed in Section IV-C witha constant power 𝑃 , it is reasonable to put a constraint on(𝑃1, β‹… β‹… β‹… , 𝑃𝑁+1) such that the average power does not exceed𝑃 . The interesting observation of the HARQ scheme is thatwith unequal power allocation the actual power consumed inthe entire transmission is a random variable. The reason isthat the 𝑛-th transmission takes place only if transmissions1, β‹… β‹… β‹… , π‘›βˆ’ 1 all fail, which is a random event determined bythe random channel gain. Denote the actual consumed poweras 𝛽(𝑔), then this discrete random variable is given by

SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3345

𝛽(𝑔) =

⎧⎨⎩

βˆ‘π‘›π‘–=1 π‘ƒπ‘–π‘‡π‘–βˆ‘π‘›π‘–=1 𝑇𝑖

, if π’œ1, β‹… β‹… β‹… ,π’œπ‘›βˆ’1,π’œπ‘›; βˆ€π‘› = 1, β‹… β‹… β‹… , π‘βˆ‘π‘+1

𝑖=1 π‘ƒπ‘–π‘‡π‘–βˆ‘π‘+1𝑖=1 𝑇𝑖

, if π’œ1, β‹… β‹… β‹… ,π’œπ‘ .

(25)Thus the average power constraint that (𝑃1, β‹… β‹… β‹… , 𝑃𝑁+1)should satisfy isβˆ‘π‘

𝑛=1

βˆ‘π‘›π‘–=1 π‘ƒπ‘–π‘‡π‘–βˆ‘π‘›π‘–=1 𝑇𝑖

β„™{π’œ1, β‹… β‹… β‹… ,π’œπ‘›βˆ’1,π’œπ‘›

}+

βˆ‘π‘+1𝑖=1 π‘ƒπ‘–π‘‡π‘–βˆ‘π‘+1𝑖=1 𝑇𝑖

β„™{π’œ1, β‹… β‹… β‹… ,π’œπ‘

} ≀ 𝑃.(26)

Notice that in the case of constant power allocation 𝑃𝑛 =𝑃, βˆ€π‘›, the randomness of 𝛽(𝑔) disappears: 𝛽(𝑔) = 𝑃 withprobability 1.

3) Optimal power allocation: Finally, the average ratemaximization problem under optimal power allocation can beformulated as

maximizeβˆ‘π‘+1𝑛=1

(βˆ‘π‘+2βˆ’π‘›π‘–=1 𝑅𝑖

)β„™{π’œ1, β‹… β‹… β‹… ,π’œπ‘›βˆ’1,π’œπ‘›

}subject to

βˆ‘π‘π‘›=1

βˆ‘π‘›π‘–=1 π‘ƒπ‘–π‘‡π‘–βˆ‘π‘›π‘–=1 𝑇𝑖

β„™{π’œ1, β‹… β‹… β‹… ,π’œπ‘›βˆ’1,π’œπ‘›

}+

βˆ‘π‘+1𝑖=1 π‘ƒπ‘–π‘‡π‘–βˆ‘π‘+1𝑖=1 𝑇𝑖

β„™{π’œ1, β‹… β‹… β‹… ,π’œπ‘

} ≀ 𝑃.

(27)This optimization problem is difficult to solve for general 𝑁 .For simplicity let us consider the simplest case of 𝑁 = 1. Inthis case Equation (24) becomes

οΏ½Μ„οΏ½1𝐼𝑁𝑅 = (𝑅1 +𝑅2)β„™ {log (1 + 𝑔𝑃1) β‰₯ 𝑅1 +𝑅2}

+𝑅1β„™

{log (1 + 𝑔𝑃1) < 𝑅1 +𝑅2,

𝑅1𝑅1+𝑅2

log (1 + 𝑔𝑃1) +𝑅2

𝑅1+𝑅2log (1 + 𝑔𝑃2) β‰₯ 𝑅1

}(28)

and the average power constraint (26) is

𝑃 β‰₯ 𝑃1β„™ {log (1 + 𝑔𝑃1) β‰₯ 𝑅1 +𝑅2}+(

𝑅1

𝑅1+𝑅2𝑃1 +

𝑅2

𝑅1+𝑅2𝑃2

)β„™ {log (1 + 𝑔𝑃1) < 𝑅1 +𝑅2} .

(29)Numerical results for 𝑁 = 1 is reported in Section V.

Optimal average rate of problem (27) is better than thatof problem (14): choosing 𝑃𝑛 = 𝑃, βˆ€π‘› makes (24) equal to(13). It is then interesting to ask how big the power allocationgain is. Numerical examples in Section V shows that forthe Rayleigh fading channel, this gain is remarkable in thelow SNR regime, while it is negligible for medium to highSNR. This is a reasonable result for most of the known powerallocation schemes, e.g., water-filling.

V. RO-HARQ: NUMERICAL RESULTS

The ARQ feedback link allows the transmitter to partiallyadapt to the channel conditions. Due to the lack of full channelstate information at the transmitter, such adaptation is notguaranteed to support reliable transmission all the time. Thusthe instantaneous successful transmission rate is a randomvariable, whose distribution is determined by the randomchannel fading and the rate assignment. As a performancemetric, the average rate characterizes the mean value of thisrandom variable. On the other hand, it is arguable that theoutage probability (π‘ƒπ‘œπ‘’π‘‘) serves as a worst-case performancemeasure for the RO-HARQ scheme, as it describes the proba-bility that the transmission fails after the maximum number ofARQ retransmissions are used. Thus, although the analytical

discussion of this paper is focused on the average rate maxi-mization, numerical results for different performance measuresare reported in this section. To be specific, three performancemetrics, average rate [3]–[5], [18], outage probability [1],[30], and average rate versus outage probability [4], are usedto numerically compare the proposed RO-HARQ with SLTand MLT. Discussion of the comparison to another quantizedfeedback scheme is deferred to Section VI.

A. Average Rate

Numerical optimizations are performed to maximize theaverage rate of the schemes analyzed in Section III and IV. Inthe case of slow Rayleigh fading channel with β„Ž ∼ π’žπ’© (0, 1),Figure 2 reports the average rate comparison among SLT,MLT, INR and RTD, together with the optimal INR throughputfor bursty communications derived in Appendix A. Ergodiccapacity is also shown as the upper limit. It is clear thatallowing INR ARQ feedback increases the average rate sub-stantially. For example, even INR with 𝑁 = 1 outperforms theinfinite-layer MLT by half a bit per channel use over a widerange of SNRs (15 to 35 dB), and is 1.5 bits better than theSLT scheme with optimized rate. Notice that this average rateadvantage of INR over MLT does not come with much highercomplexity. MLT requires complicated encoding and decodingprocesses to handle the multiple layers, and this complexityincreases with the number of layers. Although ARQ requiresfeedback and some overhead in the protocol design, typicallyits complexity is not as high as MLT. The average rate isboosted by another 0.5 bits if 𝑁 increases to 2. As 𝑁 furtherincreases, the average rate continues to increase up until theergodic capacity. At the same time, RTD type HARQ is shownto be inefficient in terms of the average rate performance. Itis better than SLT7, but in some configurations is even worsethan MLT. This also numerically confirms Lemma 1. For thissuboptimality, RTD is not considered in the remaining of thenumerical simulations. Another observation from Figure 2is a comparison of the INR throughput based on the twodifferent assumptions (burst communication vs continuouscommunication) in Section IV-B. It can be seen that theaverage rate is always better than the optimal throughput inAppendix A. This is due to the fact (see Section IV-B) thatmore packets are transmitted in β€œgood” channels in continuouscommunication than in burst communication.

Figure 3 compares the optimized average rate performanceof SLT, MLT and INR in a slow Ricean fading channel. Twodifferent 𝐾 factors are considered: Figure 3(a) for 𝐾 = 5and Figure 3(b) for 𝐾 = 10. Ergodic capacity is plotted asthe performance upper bound. As opposed to the Rayleighfading case, Ricean fading channel has a high-power line-of-sight (LOS) path and thus is β€œless random”. Numerical resultsshow that in the Ricean fading environment, MLT with 2 levelshas almost negligible gain over SLT, while INR still performsmuch better than both SLT and MLT. Combined with its highcomplexity, this result indicates the inefficiency of MLT inless random channel environment such as Ricean distributions.However, INR continues to perform very well even with 𝑁 =

7Analytically, this can also be easily proved by looking at Equation (18).When 𝑛 = 1 the component has the same form as SLT.

3346 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009

5 10 15 20 25 30 350

1

2

3

4

5

6

7

8

9

10

11

SNR [dB]

Ave

rage

Rat

e [b

pcu]

SLTMLT βˆ’ 2 levelsMLT βˆ’ inf levelsINR βˆ’ N=1INR βˆ’ N=3INR βˆ’ N=7INR βˆ’ N=1 (Appendix A)INR βˆ’ N=3 (Appendix A)RTD βˆ’ N=1RTD βˆ’ N=3ergodic capacity

Fig. 2. Average rate (bpcu) versus receive SNR (dB) of SLT, MLT, INR, andRTD in a quasi-static Rayleigh fading channel. Equal power allocation amongtransmissions is performed. Optimal throughput derived in Appendix A is alsoplotted.

1, so it is more robust to the channel fading distribution thanMLT.

The transmission strategies studied in Section III and IVare for single-antenna systems. These strategies can be read-ily extended to single-input multiple-output (SIMO) fadingchannels. Consider a SIMO Rayleigh fading channel with πΏπ‘Ÿreceive antennas. It is shown in [31], [32] that the PDF of thetotal channel gain is

π‘“πΏπ‘Ÿ (𝑔) =1

Ξ“ (πΏπ‘Ÿ)π‘”πΏπ‘Ÿβˆ’1π‘’βˆ’π‘”, 𝑔 β‰₯ 0 (30)

where Ξ“ (πΏπ‘Ÿ) =∫∞0 π‘‘πΏπ‘Ÿβˆ’1π‘’βˆ’π‘‘d𝑑 is the Gamma function.

Figure 4 shows the average rate performance of SLT, MLT,INR, and ergodic capacity of a SIMO Rayleigh fading channelwith πΏπ‘Ÿ = 2 and πΏπ‘Ÿ = 4. Several interesting observationscan be made from these plots. First, similar to the previouscase, MLT with two levels has negligible gain over SLT: theperformance difference is almost indistinguishable. This againseems to suggest that the gain of MLT with two levels is notimportant in the β€œless random” fading distributions, especiallyconsidering that MLT is much more complicated than SLT.Secondly, the INR scheme performs extraordinary well: thegap between 𝑁 = 1 INR with the ergodic capacity is only0.6 bits and 0.7 bits at the medium to high SNR regime withπΏπ‘Ÿ = 2 and πΏπ‘Ÿ = 4, respectively. This complies with ourprevious observation that INR is robust to the channel fadingdistribution.

Numerical results for optimal power allocation of INRwith 𝑁 = 1 and the comparison to equal power allocationare reported in Figure 5(a) (medium-to-high SNR regime)and Figure 5(b) (low SNR regime) for the Rayleigh fadingdistribution. The advantage of optimal power allocation ismainly reflected in the low SNR regime. This gain diminishesas SNR increases, and becomes negligible in the medium-to-high SNR regime.

B. Outage Probability

The outage probability comparison is reported in Figure 6for both SLT and INR in a Rayleigh fading channel. Itshould be noted that the outage probability does not applyto MLT, where the requirement that all transmitted data mustbe decoded is dropped, and thus the concept of β€œoutage” doesnot hold anymore. Both SLT and INR are still optimized interms of the average rate. Thus, the outage event of SLTis β„™

{log (1 + 𝑔𝑃 ) < π‘…βˆ—

𝑆𝐿𝑇,π‘…π‘Žπ‘¦π‘™π‘’π‘–π‘”β„Ž

}where π‘…βˆ—

𝑆𝐿𝑇,π‘…π‘Žπ‘¦π‘™π‘’π‘–π‘”β„Ž

is given in (6), and the outage event of INR with 𝑁 isβ„™ {log (1 + 𝑔𝑃 ) < π‘…βˆ—

1} where π‘…βˆ—1 is the solution of 𝑅1 in

problem (14). The advantage of INR is now more obvious:not only does it increase the average rate, it also decreases theoutage probability simultaneously. In fact, with the argumentmade in Section IV-D, the asymptotic outage probability willbe zero, as every transmission will eventually match perfectlywith the instantaneous channel condition.

C. Average Rate versus Outage Probability

When transmitting over a slow fading channel, the success-fully transmitted data rate is a random variable. Its instanta-neous value depends on both the instantaneous channel gainand the communication scheme (e.g., SLT, MLT, or HARQ).Roughly speaking, the average rate describes the β€œmean value”of the random performance, while the outage probabilitycharacterizes the β€œvariance” in the sense that it gives theworst-case performance. Thus, to have a comprehensive viewand comparison of several schemes, both average rate andoutage probability should be jointly considered. Average rateversus outage probability was proposed in [4] as a meaningfulassociation between average rate and outage probability. Thismetric requires examining the average rate when the channelgain 𝑔 is known to exceed some threshold π‘”π‘‘β„Ž. This can beviewed as a conditional average rate where the distributionof channel gain 𝐹 (𝑔) is replaced with the conditional CDFπΉπ‘”π‘‘β„Ž (𝑒)

.= β„™ {𝑔 ≀ π‘’βˆ£π‘” β‰₯ π‘”π‘‘β„Ž}. Such a conditional average rate

examines the average rate with a given outage probability, andthus effectively relates these two metrics. Figure 7 reports theaverage rate performance where the threshold π‘”π‘‘β„Ž is chosensuch that the outage probability is 1% and 30%, respectively.The use of INR still provides remarkable gain with respect tothis metric.

VI. HOW TO USE THE FEEDBACK CHANNEL: SEQUENTIAL

VERSUS ONE-SHOT

From the information-theoretic point of view, the proposedRO-HARQ is one form of utilizing the feedback link in awireless communication system. It falls into the general cate-gory of quasi-static fading channel with quantized feedback.The capacity (under different definitions, e.g., outage capacity,expected capacity, etc.) of this channel is still unknown ingeneral. Thus it is difficult to quantify how well RO-HARQperforms in the absolute sense. A reasonable approach wouldbe to compare RO-HARQ with other schemes that utilizethe quantized feedback to improve performance, which is thepurpose of this section.

SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3347

5 10 15 20 25 30 350

2

4

6

8

10

12

SNR [dB]

Ave

rage

Rat

e [b

pcu]

SLTMLT βˆ’ 2 levelsINR βˆ’ N=1INR βˆ’ N=2INR βˆ’ N=4ergodic capacity

(a) K=5

5 10 15 20 25 30 350

2

4

6

8

10

12

SNR [dB]

Ave

rage

Rat

e [b

pcu]

SLTMLT βˆ’ 2 levelsINR βˆ’ N=1INR βˆ’ N=2INR βˆ’ N=4ergodic capacity

(b) K=10

Fig. 3. Average rate (bpcu) versus receive SNR (dB) of SLT, MLT, and INR in a quasi-static Ricean fading channel with different K factors and equalpower allocation among transmissions.

5 10 15 20 25 30 350

2

4

6

8

10

12

SNR [dB]

Ave

rage

Rat

e [b

pcu]

SLTMLT βˆ’ 2 levelsINR βˆ’ N=1INR βˆ’ N=2ergodic capacity

(a) 2 Receive Antennas

5 10 15 20 25 30 352

4

6

8

10

12

14

SNR [dB]

Ave

rage

Rat

e [b

pcu]

SLTMLT βˆ’ 2 levelsINR βˆ’ N=1INR βˆ’ N=2ergodic capacity

(b) 4 Receive Antennas

Fig. 4. Average rate (bpcu) versus receive SNR (dB) of SLT, MLT, and INR in a quasi-static SIMO Rayleigh fading channel, πΏπ‘Ÿ = 2 and πΏπ‘Ÿ = 4. Equalpower allocation among transmissions is performed.

One well-known approach is the quantized CSI feedbackscheme [19], [26], [27]: the receiver sends an 𝑀 -bit quan-tization of the channel state information to the transmitterbefore the transmission takes place, and the transmitter adjustsits rate and power according to this imperfect CSIT. To bespecific, the set of all possible channel gain 𝐺 = [0,∞) is

divided into 2𝑀 nonoverlapping subsets 𝐺 =βˆͺ2𝑀

𝑖=1𝐺𝑖, where𝐺𝑖 = [π‘‘π‘–βˆ’1, 𝑑𝑖), 𝑑0 = 0, 𝑑2𝑀 = ∞. If the instantaneouschannel gain 𝑔 ∈ 𝐺𝑖, the receiver sends the index 𝑖 tothe transmitter using the 𝑀 -bit feedback channel, and thetransmitter selects a codeword with rate 𝑅𝑖 and power 𝑃𝑖. Dueto the imperfect CSI feedback, the transmission could fail ifthe data rate 𝑅𝑖 is not supported by the instantaneous channelrealization. The average rate performance can be optimized[19], [26] by adjusting {𝐺𝑖, 𝑅𝑖, 𝑃𝑖}2

𝑀

𝑖=1.

Compared with this one-shot feedback scheme, RO-HARQis in fact a sequential feedback scheme. It is then natural to

ask the following question: since sequential and one-shot aredifferent forms of utilizing feedback to inform quantized CSIto the transmitter, what are the advantages/disadvantages ofthem, and which one is preferred? A few perspectives arediscussed in this section.

A. System implementation

ARQ is a technique in the data link layer, which is im-plemented in many wireless protocols. Thus exploiting ARQfeedback does not require additional system implementations,other than that the physical layer channel coding shouldbe able to provide incremental redundancy, which has beenwell studied [10], [13]–[17], [25]. One-shot CSI feedback,on the other hand, belongs to the physical layer techniques.To exploit CSIT, typically the physical layer needs someadditional designs, especially at the transmitter, to adjustthe transmission based on the one-shot CSI feedback. For

3348 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009

5 6 7 8 9 10 11 12 13 141

1.2

1.4

1.6

1.8

2

2.2

2.4

2.6

2.8

3

SNR [dB]

thro

ughp

ut (

bits

per

cha

nnel

use

)Equal power allocationOptimal power allocation

(a) Medium-to-high SNR regime

βˆ’25 βˆ’20 βˆ’15 βˆ’10 βˆ’5 010

βˆ’3

10βˆ’2

10βˆ’1

100

SNR [dB]

thro

ughp

ut (

bits

per

cha

nnel

use

)

Equal power allocationOptimal power allocation

(b) Low SNR regime

Fig. 5. Average rate (bpcu) versus receive SNR (dB) of 𝑁 = 1 INR with optimal power allocation in a quasi-static Rayleigh fading channel.

5 10 15 20 25 30 3510

βˆ’2

10βˆ’1

100

SNR [dB]

Pou

t

SLTINR βˆ’ N=1INR βˆ’ N=2INR βˆ’ N=4

Fig. 6. Outage probability versus receive SNR (dB) of INR (𝑁 =1, 2,and 4) compared to SLT in a quasi-static Rayleigh fading channel. The twoschemes are optimized in terms of the average rate.

example, adaptive coding and modulation requires the data rateand transmit power to be frequently adjusted according to thechannel variation. As another example, matrix precoder designis needed to rotate the transmit signal in case of multipletransmit antennas [33]. This increases the system complexityand cost. From this perspective, utilizing ARQ seems to bemore favorable, since ARQ is already provided by the wirelessprotocol in the system and no additional closed-loop designis needed.

B. Position of feedback

In the RO-HARQ scheme, the ACK/NACK feedbacks arescattered into the entire transmission. There are several hand-shakings between the transmitter and receiver. Each suchcoordination adds additional overhead. The typical form ofthe one-shot CSI feedback, however, is to send the entireCSI bits before the data transmission takes place, such that

5 10 15 20 25 30 350

5

10

15

SNR [dB]

Ave

rage

Rat

e [b

pcu]

SLT, 1%MLT βˆ’ 2 levels, 1%INR βˆ’ N=1, 1%INR βˆ’ N=2, 1%INR βˆ’ N=4, 1%SLT, 30%MLT βˆ’ 2 levels, 30%INR βˆ’ N=1, 30%INR βˆ’ N=2, 30%INR βˆ’ N=4, 30%

Fig. 7. Average rate versus outage probability for SLT, MLT and INR ina quasi-static Rayleigh fading channel. Two thresholds corresponding to 1%and 30% outage probability, respectively, are simulated.

the transmitter adjusts the parameters to match the partially-known channel. From the practical point of view, the latteris more favorable, since all the feedback bits are sent in oneshot, which simplifies the protocol and reduces the processingdelay, system overhead, and the implementation complexity.

C. Amount of feedback and performance

To make a fair comparison of the amount of feedbackrequired for INR and one-shot CSI feedback, it is assumedthat both schemes are optimized for maximum average rates,and the maximum average rates are set to be equal. This isreasonable since the resulting comparison indicates that inorder to achieve the same optimal performance, how muchfeedback is needed for each scheme. The following lemmagives the relationship between 𝑁 and 𝑀 under this condition.

Lemma 3: For any given fading distribution, if the optimalaverage rates of

SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3349

(1) INR scheme with maximum 𝑁 retransmissions,(2) one-shot CSI feedback scheme with an 𝑀 -bit feedback

channel

are the same, i.e., οΏ½Μ„οΏ½π‘βˆ—

𝐼𝑁𝑅 = οΏ½Μ„οΏ½π‘€βˆ—

𝐢𝑆𝐼 , then

𝑁 + 1 = 2𝑀 . (31)

Proof: See Appendix D.One direct consequence of Lemma 3 is that we can easily

make a numerical comparison of the average rate perfor-mance between the RO-HARQ scheme and the one-shot CSIfeedback scheme studied in [19], [26], [27]. For example, inFigure 2 we have reported the optimal INR performance with𝑁 = 1, 3 and 7 in a Rayleigh fading channel. These directlycorrespond to the optimal performance of the one-shot CSIfeedback scheme with 𝑀 = 1, 2 and 3 bits of feedback,respectively.

Another important benefit from Lemma 3 is that since theaverage rate optimization problems for both one-shot CSIfeedback and RO-HARQ can be directly related, existingnumerical solutions for the previous problem, e.g., [26], canbe used to help solve Problem (14) as long as the parametersare chosen according to (31). One should note that solutionsin [26] are efficient but sub-optimal. Also they cannot helpwith the dynamic power allocation problem (27).

From Lemma 3, it seems that INR requires more feedbackthan the one-shot CSI feedback scheme: with condition (31)satisfied, 𝑀 = 𝑁 if and only if 𝑀 = 𝑁 = 1, and 𝑁 > 𝑀strictly holds in all other cases. However, 𝑁 > 𝑀 does notnecessarily translate to the conclusion that INR requires morefeedback. First of all, the actual number of retransmissionsassociated with INR is a random variable, which is deter-mined by the instantaneous channel realization and the rateassignment. The number𝑁 only denotes the maximum numberof retransmissions, which is a worst-case constraint. On theother hand, 𝑀 is a fixed number in the one-shot CSI feedbackscheme. It is not affected by the channel realization or the rateassignment. Secondly, 𝑁 retransmissions is not necessarilyequivalent to 𝑁 bits feedback, as the number of physicalbits required for 𝑁 retransmissions is determined by manyconsiderations. Wicker [7, Chapter 15.2] discussed severalpractical issues that might require different number of bitsfor ACK/NACK. For example, NACK may be set as a defaultand only ACK is transmitted in the feedback channel. Thiscertainly reduces the feedback of INR. Another example isto use the recently developed rateless codes in HARQ [34],such as LT-codes [35] and Raptor codes [36]. With ratelesscodes, the receiver does not need to send any feedback tothe transmitter until it accumulates enough data for successfuldecoding [37]. In this case the actual amount of feedback isvery small.8

D. Asymptotic performance

It is seen in Lemma 2 that the ergodic capacity is achievedwith 𝑁 β†’ ∞. On the other hand, it is well known that if theone-shot CSI feedback is perfect (which requires 𝑀 β†’ ∞),

8The sequential feedback scheme fits nicely with the use of rateless code,while the one-shot feedback does not have this advantage.

ergodic capacity can also be achieved9. Thus, moving theinfinitely many bits of feedback from the beginning of trans-mission as one-shot CSI feedback to within the transmissionas ARQ feedback gives the same asymptotic performance.

VII. CONCLUSIONS

HARQ with optimized rate assignment is shown to sig-nificantly improve the performance over the single-layer andmulti-layer transmission schemes under several different per-formance metrics. The key idea behind this work is to exploitthe existence of HARQ protocol to adapt the transmission tothe instantaneous channel realization. Average rate is chosenas the performance metric, and the optimal rate assignment isstudied. Several aspects of the RO-HARQ scheme are studied,including power allocation, asymptotic performance limit, andcomparison to one-shot quantized CSI feedback. Simulationresults show that even one HARQ retransmission gives aremarkable gain over conventional schemes, and this gain isrobust in different fading distributions.

Possible future work includes the combination of HARQfeedback with the multi-layer scheme. With such combinationthe ARQ feedback will indicate which layers cannot bedecoded, and hence the transmitter can resend those layerstogether with possibly new information in the next round.Another direction is to study the combination of sequentialand one-shot feedback, in which the receiver will send backa coarse CSI before the data transmission with one-shotfeedback, and use ARQ feedback to refine the quantized CSIin the sequel.

APPENDIX ATHROUGHPUT ANALYSIS FOR BURSTY COMMUNICATIONS

A. INR

For each possible fading state 𝑔, we assume that 𝐾 in-formation nats are to be sent with a maximum of 𝑁 re-transmissions, with the length of the 𝑛-th transmission 𝑇𝑛,βˆ€π‘› = 1, β‹… β‹… β‹… , 𝑁 + 1. The rate after transmission 𝑛 is then𝑅(𝑛) = 𝐾/𝑇 (𝑛). We need to separately evaluate the numeratorand denominator of the throughput expression (2) with respectto the fading distribution. Notice that the overall code length𝑇 (𝑁+1) is a function of the fading state.

We begin with the average number of successfully trans-mitted information nats:

𝔼{π‘˜} = 𝐾ℙ

{log (1 + 𝑔𝑃 ) β‰₯ 𝑅(𝑁+1)

}. (32)

To evaluate the average length of transmission for 𝐾 infor-mation nats, we first define 𝑇 (0) = 0 and thus 𝑅(0) = ∞.

9Note that there is a constraint of fixed transmit power; thus waterfillingpower allocation over different fading states is not permitted.

3350 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009

Then

𝔼{𝒯 } = 𝑇 (1)β„™

{log (1 + 𝑔𝑃 ) β‰₯ 𝑅(1)

}+ 𝑇 (2)

β„™{𝑅(1) > log (1 + 𝑔𝑃 ) β‰₯ 𝑅(2)

}+ β‹… β‹… β‹…+ 𝑇 (𝑁+1)

β„™{𝑅(𝑁) > log (1 + 𝑔𝑃 )

}=

π‘βˆ‘π‘›=1

𝑇 (𝑛)β„™

{𝑅(π‘›βˆ’1) > log (1 + 𝑔𝑃 ) β‰₯ 𝑅(𝑛)

}

+𝑇 (𝑁+1)β„™

{𝑅(𝑁) > log (1 + 𝑔𝑃 )

}. (33)

The throughput of INR can now be written as Equation (34) atthe top of the next page, and the INR throughput optimizationproblem is formulated as

maximize 1βˆ’πΉπΊ(𝑔𝑁+1)βˆ‘π‘π‘›=1

(1

𝑅(𝑛+1)βˆ’ 1

𝑅(𝑛)

)𝐹𝐺(𝑔𝑛)

subject to 𝑅(𝑛) β‰₯ 0; βˆ€π‘› = 1, β‹… β‹… β‹… , 𝑁 + 1𝑅(𝑛) β‰₯ 𝑅(𝑛+1); βˆ€π‘› = 0, β‹… β‹… β‹… , 𝑁.

(35)

Numerical results of this optimization problem for 𝑁 = 1 and3 are reported in Figure 2.

B. RTD

With the fact that Chase Combining effectively increasesthe receive SNR to 𝑛𝑔𝑃 and reduces the overall rate to 𝑅/𝑛after the 𝑛-th transmission, we can similarly compute thethroughput of RTD.

Numerator:

𝔼{π‘˜} = 𝐾ℙ {log (1 + (𝑁 + 1)𝑔𝑃 ) β‰₯ 𝑅}= 𝐾 (1βˆ’ 𝐹𝐺(π‘žπ‘+1)) (36)

Denominator:

𝔼{𝒯 } =βˆ‘π‘π‘›=1 𝑛𝑇ℙ{log (1 + 𝑛𝑔𝑃 ) β‰₯ 𝑅 >

log(1 + (π‘›βˆ’ 1)𝑔𝑃 )}+ (𝑁 + 1)𝑇ℙ {𝑅 > log (1 +𝑁𝑔𝑃 )}=βˆ‘π‘π‘›=1 𝑛𝑇ℙ {π‘žπ‘›βˆ’1 β‰₯ 𝑔 > π‘žπ‘›}+ (𝑁 + 1)𝑇ℙ {π‘žπ‘ β‰₯ 𝑔}

=βˆ‘π‘π‘›=1 𝑛𝑇 [𝐹𝐺(π‘žπ‘›βˆ’1)βˆ’ 𝐹𝐺(π‘žπ‘›)] + (𝑁 + 1)𝑇𝐹𝐺(π‘žπ‘ )

= π‘‡βˆ‘π‘π‘›=0 𝐹𝐺(π‘žπ‘›)

(37)RTD throughput:

πœ‚π‘+1𝑅𝑇𝐷 =

𝑅 (1βˆ’ 𝐹𝐺(π‘žπ‘+1))βˆ‘π‘π‘›=0 𝐹𝐺(π‘žπ‘›)

=𝑅 (1βˆ’ 𝐹𝐺(π‘žπ‘+1))

1 +βˆ‘π‘π‘›=1 𝐹𝐺(π‘žπ‘›)

(38)

RTD throughput optimization problem:

maximize𝑅 (1βˆ’ 𝐹𝐺(π‘žπ‘+1))

1 +βˆ‘π‘π‘›=1 𝐹𝐺(π‘žπ‘›)

. (39)

APPENDIX BPROOF OF LEMMA 1

The proof is done by showing that the optimal averagerate of RTD can be achieved by INR. For the simplicityof discussion in this appendix, we re-denote the bound-ary points of each fading region for INR and RTD to be

π‘Žπ‘› = π‘’βˆ‘π‘+2βˆ’π‘›

π‘˜=1π‘…π‘˜βˆ’1

𝑃 , 𝑏𝑛 = π‘’π‘…βˆ’1𝑛𝑃 , βˆ€π‘› = 1, β‹… β‹… β‹… , 𝑁 + 1, and

π‘Ž0 = 𝑏0.= ∞, respectively. Let us rewrite the average rate

expression for INR and RTD as:

�̄�𝑁𝐼𝑁𝑅 (𝑅1, β‹… β‹… β‹… , 𝑅𝑁+1) =

𝑁+1βˆ‘π‘›=1

(𝑁+2βˆ’π‘›βˆ‘

𝑖=1

𝑅𝑖

)β„™ {𝑔 ∈ [π‘Žπ‘›, π‘Žπ‘›βˆ’1)} ,

(40)

�̄�𝑁𝑅𝑇𝐷 (𝑅) =

𝑁+1βˆ‘π‘›=1

𝑅

𝑛ℙ {π‘π‘›βˆ’1 > 𝑔 β‰₯ 𝑏𝑛} . (41)

Consider the optimal solution for RTD: 𝑅 = π‘…βˆ—,�̄�𝑁𝑅𝑇𝐷 (𝑅) = �̄�𝑁

βˆ—π‘…π‘‡π·. Denote the corresponding optimal

boundary points of the fading regions as

π‘βˆ—π‘› =𝑒𝑅

βˆ— βˆ’ 1

𝑛𝑃(42)

for 𝑛 = 1, β‹… β‹… β‹… , 𝑁 + 1 and π‘βˆ—0 = 𝑏0. Then choose(𝑅1, β‹… β‹… β‹… , 𝑅𝑁+1) such that

π‘Žπ‘› = π‘βˆ—π‘›, βˆ€π‘› = 1, β‹… β‹… β‹… , 𝑁 + 1. (43)

Condition (43) forces the event {π‘Žπ‘›βˆ’1 > 𝑔 β‰₯ π‘Žπ‘›} to be equiv-alent to

{π‘βˆ—π‘›βˆ’1 > 𝑔 β‰₯ π‘βˆ—π‘›

}for 𝑛 = 1, β‹… β‹… β‹… , 𝑁 + 1. Thus, the

proof is set if we can show that𝑁+2βˆ’π‘›βˆ‘π‘–=1

𝑅𝑖 β‰₯ π‘…βˆ—

𝑛, βˆ€π‘› = 1, β‹… β‹… β‹… , 𝑁 + 1. (44)

Condition (43) can be equivalently written as𝑁+2βˆ’π‘›βˆ‘π‘–=1

𝑅𝑖 = log𝑒𝑅

βˆ—+ π‘›βˆ’ 1

𝑛. (45)

We need the following simple lemma.Lemma 4: The function

𝑓(𝑛) = 𝑛(𝑒

𝑐𝑛 βˆ’ 1

), 𝑛 β‰₯ 1. (46)

is monotonically decreasing, where 𝑐 > 0 is a constant.This can be easily proved by checking that d𝑓(𝑛)

d𝑛 > 0 forpositive and finite 𝑛. With Lemma 4 and Equation (45), thefollowing inequalities can be obtained:

Lemma 4 =β‡’ π‘’π‘…βˆ— βˆ’ 1 β‰₯ 𝑛

(𝑒

π‘…βˆ—π‘› βˆ’ 1

)=β‡’ log

π‘’π‘…βˆ—+ π‘›βˆ’ 1

𝑛β‰₯ π‘…βˆ—

𝑛(45)=β‡’ (44)

which concludes the proof.

APPENDIX CPROOF OF LEMMA 2

The average rate of the INR scheme with a fixed 𝑁 is givenin Equation (12) as

�̄�𝑁𝐼𝑁𝑅 =𝑁+1βˆ‘π‘›=1

𝑅𝑛ℙ

{log (1 + 𝑔𝑃 ) β‰₯

π‘›βˆ‘π‘˜=1

π‘…π‘˜

}.

As 𝑁 β†’ ∞, this average rate converges to

οΏ½Μ„οΏ½βˆžπΌπ‘π‘… =

∫ ∞

0

β„™ {log (1 + 𝑔𝑃 ) β‰₯ 𝑅}d𝑅= 𝔼 [log (1 + 𝑔𝑃 )] (47)

where Equation (47) is due to the fact that

𝔼 [𝑋 ] =

∫ ∞

0

(1βˆ’ 𝐹𝑋(π‘₯)) dπ‘₯

for a nonnegative random variable 𝑋 .

SHEN et al.: ON THE AVERAGE RATE PERFORMANCE OF HYBRID-ARQ IN QUASI-STATIC FADING CHANNELS 3351

πœ‚π‘+1𝐼𝑁𝑅 =

𝐾ℙ{log (1 + 𝑔𝑃 ) β‰₯ 𝑅(𝑁+1)

}βˆ‘π‘π‘›=1 𝑇

(𝑛)β„™{𝑅(π‘›βˆ’1) > log (1 + 𝑔𝑃 ) β‰₯ 𝑅(𝑛)

}+ 𝑇 (𝑁+1)β„™

{𝑅(𝑁) > log (1 + 𝑔𝑃 )

}=

β„™{log (1 + 𝑔𝑃 ) β‰₯ 𝑅(𝑁+1)

}βˆ‘π‘π‘›=1

1𝑅(𝑛)β„™

{𝑅(π‘›βˆ’1) > log (1 + 𝑔𝑃 ) β‰₯ 𝑅(𝑛)

}+ 1𝑅(𝑁+1)β„™

{𝑅(𝑁) > log (1 + 𝑔𝑃 )

}=

1βˆ’ 𝐹𝐺(𝑔𝑁+1)βˆ‘π‘π‘›=1

1𝑅(𝑛) [𝐹𝐺(π‘”π‘›βˆ’1)βˆ’ 𝐹𝐺(𝑔𝑛)] +

1𝑅(𝑁+1)𝐹𝐺(𝑔𝑁)

=1βˆ’ 𝐹𝐺(𝑔𝑁+1)βˆ‘π‘

𝑛=1

(1

𝑅(𝑛+1) βˆ’ 1𝑅(𝑛)

)𝐹𝐺(𝑔𝑛)

(34)

APPENDIX DPROOF OF LEMMA 3

The average rate of the INR scheme with maximum 𝑁retransmissions is given in Equation (12). By defining π‘Ÿβˆ—π‘› =βˆ‘π‘›π‘–=1𝑅

βˆ—π‘– for 𝑛 = 1, β‹… β‹… β‹… , 𝑁 + 1, the maximum average rate

of INR can be rewritten as

οΏ½Μ„οΏ½π‘βˆ—

𝐼𝑁𝑅 =

𝑁+1βˆ‘π‘›=1

π‘Ÿβˆ—π‘›β„™{π‘Ÿβˆ—π‘›+1 > log (1 + 𝑔𝑃 ) β‰₯ π‘Ÿβˆ—π‘›

}. (48)

For the CSI feedback scheme defined in Section VI, theaverage rate can be computed as

�̄�𝑀𝐢𝑆𝐼 =2π‘€βˆ‘π‘›=1

𝑅𝑛ℙ {log (1 + 𝑑𝑛𝑃 ) > log (1 + 𝑔𝑃 ) β‰₯ 𝑅𝑛} .(49)

It is observed in [19] that in order to maximize the averagerate of the CSI feedback scheme, 𝑅𝑛 must satisfy

π‘…βˆ—π‘› = log

(1 + π‘‘βˆ—π‘›βˆ’1𝑃

). (50)

Substituting (50) into (49), the optimal average rate of the CSIfeedback scheme is

οΏ½Μ„οΏ½π‘€βˆ—

𝐢𝑆𝐼 =2π‘€βˆ‘π‘›=1

π‘…βˆ—π‘›β„™{π‘…βˆ—π‘›+1 > log (1 + 𝑔𝑃 ) β‰₯ π‘…βˆ—

𝑛

}. (51)

Compare (51) to (48), the result 𝑁 + 1 = 2𝑀 followsimmediately.

REFERENCES

[1] L. H. Ozarow, S. Shamai, and A. D. Wyner, β€œInformation theoreticconsiderations for cellular mobile radio," IEEE Trans. Veh. Technol.,vol. 43, no. 2, pp. 359-378, May 1994.

[2] T. M. Cover, β€œComments on broadcast channels," IEEE Trans. Inf.Theory, vol. 44, no. 6, pp. 2524-2530, Oct. 1998.

[3] S. Shamai, β€œA broadcast strategy for the Gaussian slowly fading chan-nel," in Proc. International Symp. Inform. Theory, June-July 1997, p.150.

[4] S. Shamai and A. Steiner, β€œA broadcast approach for a single-user slowlyfading MIMO channel," IEEE Trans. Inf. Theory, vol. 49, no. 10, pp.2617-2635, Oct. 2003.

[5] Y. Liu, V. K. N. Lau, O. Y. Takeshita, and M. P. Fitz, β€œOptimal rateallocation for superposition coding in quasi-static fading channels," inProc. International Symp. Inform. Theory, June-July 2002, p. 111.

[6] D. J. Costello, J. Hagenauer, H. Imai, and S. B. Wicker, β€œApplicationsof error-control coding," IEEE Trans. Inf. Theory, vol. 44, no. 6, pp.2531-2560, Oct. 1998.

[7] S. Wicker, Error Control Systems for Digital Communication andStorage. Englewood Cliffs, NJ: Prentice-Hall, 1995.

[8] H. Zheng and H. Visvanathan, β€œOptimizing ARQ performance inhigh speed downlink systems with scheduling," IEEE Trans. WirelessCommun., vol. 4, no. 2, pp. 495-506, Mar. 2005.

[9] G. Caire and D. Tuninetti, β€œThe throughput of Hybrid-ARQ protocolsfor the Gaussian collision channel," IEEE Trans. Inf. Theory, vol. 47,no. 5, pp. 1971-1988, July 2001.

[10] S. Sesia, G. Caire, and G. Vivier, β€œIncremental redundancy hybridARQ schemes based on low-density parity-check codes," IEEE Trans.Commun., vol. 52, no. 8, pp. 1311-1321, Aug. 2004.

[11] H. El Gamal, G. Caire, and M. O. Damen, β€œThe MIMO ARQ Channel:diversity-multiplexing-delay tradeoff," IEEE Trans. Inf. Theory, vol. 52,no. 8, pp. 3601–3621, Aug. 2006.

[12] A. Chuang, A. G. i Fabregas, L. Rasmussen, and I. Collings, β€œOptimalthroughput-diversity-delay tradeoff in MIMO ARQ block-fading chan-nels," IEEE Trans. Inf. Theory, vol. 54, no. 9, pp. 3968-3986, Sept.2008.

[13] J. Li and K. Narayanan, β€œRate-compatible low density parity check(RC-LDPC) codes for capacity-approaching ARQ schemes in packetdata communications," in Proc. International Conf. Commun., InternetInform. Technol. (CIIT), US Virgin Islands, Nov. 2002, pp. 201-206.

[14] J. Ha, J. Kim, and S. W. McLaughlin, β€œRate-compatible puncturingof low-density parity-check codes," IEEE Trans. Inf. Theory, vol. 50,no. 11, pp. 2824-2836, Nov. 2004.

[15] J. Kim, W. Hur, A. Ramamoorthy, and S. W. McLaughlin, β€œDesignof rate-compatible irregular LDPC codes for incremental redundancyhybrid ARQ systems," in Proc. International Symp. Inform. Theory,July 2006, pp. 1139-1143.

[16] J. Kim, A. Ramamoorthy, and S. W. McLaughlin, β€œDesign of efficiently-encodable rate-compatible irregular LDPC codes," in Proc. IEEE Inter-national Conf. Commun., June 2006, pp. 1131-1136.

[17] J. Ha, J. Kim, D. Klinc, and S. W. McLaughlin, β€œRate-compatiblepunctured low-density parity-check codes with short block lengths,"IEEE Trans. Inf. Theory, vol. 52, no. 2, pp. 728-738, Feb. 2006.

[18] M. Effros, A. Goldsmith, and Y. Liang, β€œCapacity definitions of generalchannels with receiver side information," in Proc. International Symp.Inform. Theory, June 2007, pp. 921–925.

[19] T. Kim and M. Skoglund, β€œOn the expected rate of slowly fadingchannels with quantized side information," IEEE Trans. Commun.,vol. 55, no. 4, pp. 820-829, Apr. 2007.

[20] R. M. Corless, G. H. Gonnet, D. E. G. Hare, D. J. Jeffrey, andD. E. Knuth, β€œOn the Lambert W function," Advances ComputationalMathematics, vol. 5, pp. 329-359, 1996.

[21] X. Wang and T. Orchard, β€œDesign of superposition coded modulation forunequal error protection," in Proc. IEEE International Conf. Commun.,June 2001, pp. 412-416.

[22] D. Tuninetti, β€œTransmitter channel state information and repetitionprotocols in block fading channels," in Proc. IEEE Inform. TheoryWorkshop, Sep. 2007, pp. 505-510.

[23] D. Chase, β€œCode combining–a maximum-likelihood decoding approachfor combining an arbitrary number of noisy packets," IEEE Trans.Commun., vol. 33, no. 5, pp. 385-393, May 1985.

[24] N. Varnica, E. Soljanin, and P. Whiting, β€œLDPC code ensembles forincremental redundancy hybrid ARQ," in Proc. International Symp.Inform. Theory, Sep. 2005, pp. 995-999.

[25] TrellisWare Technologies, White papers on FlexiCodes and F-LDPC.[Online]. Available: www.trellisware.com.

[26] L. Lin, R. D. Yates, and P. Spasojevic, β€œAdaptive transmission withdiscrete code rates and power levels," IEEE Trans. Commun., vol. 51,no. 12, pp. 2115-2125, Dec. 2003.

[27] β€”β€”, β€œAdaptive transmission with finite code rates," IEEE Trans. Inf.Theory, vol. 52, no. 5, pp. 1847-1860, May 2006.

[28] T. Kim and M. Skoglund, β€œDiversity-multiplexing tradeoff in MIMO

3352 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 11, NOVEMBER 2009

channels with partial CSIT," IEEE Trans. Inf. Theory, vol. 53, no. 8,pp. 2743-2759, Aug. 2007.

[29] T. Cover and J. Thomas, Elements of Information Theory. New York:Wiley Interscience, 1992.

[30] E. Biglieri, J. Proakis, and S. Shamai, β€œFading channels: information-theoretic and communications aspects," IEEE Trans. Inf. Theory, vol. 44,no. 6, pp. 2619-2692, Oct. 1998.

[31] A. Edelman, β€œEigenvalues and condition numbers of random matrices,"Ph.D. dissertation, Massachusetts Institute of Technology, Cambridge,MA, 1989.

[32] E. Telatar, β€œCapacity of multi-antenna Gaussian channels," AT&T–BellLabs, Tech. Rep., June 1995.

[33] M. Vu, β€œExploiting transmit channel side information in MIMO wirelesssystems," Ph.D. dissertation, Stanford University, Stanford, CA, 2006.

[34] E. Soljanin, N. Varnica, and P. Whiting, β€œPunctured vs rateless codesfor hybrid ARQ," in Proc. IEEE Inform. Theory Workshop, Punta delEste, Uruguay, Mar. 2006.

[35] M. Luby, β€œLT codes," in Proc. 43rd Annual IEEE Symp. FoundationsComputer Science, 2002, pp. 271-282.

[36] A. Shokrollahi, β€œRaptor codes," IEEE Trans. Inf. Theory, vol. 52, no. 6,pp. 2551-2567, June 2006.

[37] Y. Liu, β€œA lossless layered coding strategy for MIMO systems," in Proc.International Symp. Inform. Theory, Sep. 2005, pp. 1947–1951.

Cong Shen (S’01) received the B.S. and M.S.degrees, in 2002 and 2004 respectively, from theDepartment of Electronic Engineering, TsinghuaUniversity, Beijing, China. He is currently workingtowards the Ph.D. degree in the Electrical Engi-neering Department, University of California, LosAngeles (UCLA). His research interest is on generalcommunication theory with emphasis on wirelesscommunications.

Tie Liu received his B.S. (1998) and M.S. (2000) degrees, both in ElectricalEngineering, from the Tsinghua University, Beijing, China and M.S. degree inMathematics (2004) and Ph.D. degree in Electrical and Computer Engineering(2006) from the University of Illinois at Urbana-Champaign. Since August2006, he has been with the Texas A&M University where he is currentlyan Assistant Professor in Electrical and Computer Engineering. His researchinterests are in the field of information theory, wireless communication, andstatistical signal processing. He is a recipient of the M. E. Van ValkenburgGraduate Research Award (2006) from the University of Illinois at Urbana-Champaign and the Best Paper Award (2008) from the Third InternationalConference on Cognitive Radio Oriented Wireless Networks and Communi-cations.

Michael P. Fitz (S’82-M’83-SM’02) received theB.E.E. degree (summa cum laude) from the Univer-sity of Dayton, Dayton, OH, in 1983 and the M.S.and Ph.D. degrees in electrical engineering from theUniversity of Southern California, Los Angeles, in1984 and 1989, respectively.

From 1983 to 1989, he was a CommunicationSystems Engineer with Hughes Aircraft and TRWInc. Since 1989, he has been with the Faculty of Pur-due University, West Lafayette, IN, The Ohio StateUniversity (OSU), Columbus, and the University of

California, Los Angeles. He is currently with Northrop Grumman Corp. as aSenior Systems Engineer working on satellite communications. His researchis in the broad area of statistical communication theory and experimentation.He is the author of Fundamentals of Communications Systems (New York:McGraw Hill, 2007). His research group at UCLA currently is interested inthe theory of space-time modems and operates an experimental wireless wide-area network and a space-time coding testbed.

Prof. Fitz received the 2001 IEEE Communications Society Leonard G.Abraham Prize Paper Award in the field of communications systems.


Recommended