+ All Categories
Home > Documents > 475_46ce89a7

475_46ce89a7

Date post: 02-Jun-2018
Category:
Upload: baburaokodavati
View: 215 times
Download: 0 times
Share this document with a friend

of 5

Transcript
  • 8/11/2019 475_46ce89a7

    1/5

    I EEE

    TRANSACTIONS ON SIGNAL PROCESSING. VOL. 41. NO. 3. MARCH

    transforms for improving performance

    of

    transform domain normal-

    ized LMS algorithm, Proc. Inst. Elec. Eng., pt. F, vol. 139, pp.

    327-335, Oct. 1992.

    [ 6 ] D. L. Duttweiler, Adaptive filter performance with nonlinearities in

    the correlation multipliers, IEEE

    Trans. Acoust.

    , Speech,

    Signal

    Processing, vol. ASSP-30, pp. 578-586, Aug. 1982.

    [7] E. Eweda, Analysis and design of a signed regressor LMS algorithm

    for stationary and nonstationary adaptive filtering with correlated

    Gaussian data,

    IEEE

    Trans. Circuits S y s t . , vol. 37, pp. 1367-1374,

    Nov. 1990.

    [8]

    R. Price, A useful theorem for nonlinear devices having Gaussian

    inputs, IRE

    Trans. Infarm.

    Theory, vol. IT-4, pp. 69-72, June 1958.

    [9] S . S . Reddi, A time-domain adaptive algorithm for rapid conver-

    gence,

    Proc.

    IEEE, vol. 72, pp. 533-535, Apr. 1984.

    [ lo] B. Farhang-Boroujeny, An efficient quasi-LMSiNewton algorithm:

    Analysis and simulation results, Tech. Rep. 01-10-91, Commun.

    Division, Dept. Elec. Eng., National Univ. Singapore, Oct. 1991.

    1111 D.

    F.

    Marshall and W . K . Jenkins, A fast quasi-Newton adaptive

    filtering algorithm, in

    Proc.

    1988

    ICASSP

    (New York, NY), Apr.

    11-14, pp. 1377-1380.

    [I21 D. F. Marshall and

    W. K.

    Jenkins, A fast quasi-Newton adaptive

    filtering algorithm, IEEE Trans. Signal Processing, vol. 40, pp.

    1652-1662, July 1992.

    A Unified Square-Root-Free Approach for

    QRD-

    Based Recursive Least Squares Estimation

    S .

    F .

    Hsieh , K . J .

    R.

    Liu, and K . Y a o

    Abstract-Givens rotation is the most commonly used method in per-

    forming the

    QR

    decomposition QRD) updating. The generic formula

    for these rotations requires explicit square-root sqrt) computations

    which constitute a computational bottleneck and are quite undesirable

    from the practical VLSI circuit design point of view. So far, there has

    been more than ten known sqrt-free algorithms. In this correspon-

    dence, we provide a unified systematic approach for the sqrt-free Giv-

    ens rotation. By properly choosing two parameters,

    p

    and

    v,

    all pre-

    viously known sqrt-free,

    as

    well as new methods, are included in our

    unified approach. This unified treatment is also extended to the QRD-

    based recursive least squares RLS) problem for optimum residual ac-

    quisition without sqrt operations.

    I. INTRODUCTION

    The Giv ens ro ta t ion , which requires a square- root ( sqrt ) opera-

    tion in the generic formulation, is a versatile method in performing

    many s ignal processing algor i thms involv ing matr ix computat ions ,

    such as the

    Q R

    decompos i t ion

    (QRD),

    the s ingular value decom-

    pos i t ion , and the e igendecompos i tion [7] . W hile many researchers

    have worked on reformulat ing a lgori thms su i tab le for paralle l com -

    put ing and V LSI archi tectures , current V LSI archi tectures s t i l l d is-

    approve if not prohibit sophisticated computations. A noticeable

    example is the sqr t operat ion , which may occupy much area in a

    VLSI chip or may also require many cycles to accomplish such

    Manuscript received July 10, 1991; revised March 31, 1992. This work

    was supported in pan by the National Science Council of

    t he

    Republic of

    China under Grant NSC80-E-SP-009-01A , the NSF Engineering Center

    Grant ECD-8803012 and Minta Martin Award of the University of Mary-

    land, the NSF Grant NCR-8814407, and a UC MICRO grant.

    S . F.

    Hsieh is with the Departm ent of Communication Engineering, Na-

    tional Chiao Tung University, Hsinchu, Taiwan 30039, R epublic of China.

    K . J . R . Liu is with the Department

    of

    Electrical Engineering, Systems

    Research Center, University

    of

    Maryland, College Park, MD 20742.

    K . Yao is with the Department of Electrical Engineering, University of

    California, Los Angeles, CA 90024- 1594.

    IEEE Log Number 9206004.

    1993 1405

    computat ion . In addi t ion , a recent s imulat ion s tudy presented in

    [

    141 by Proudler

    et

    al. showed that a finite-precision implementa-

    tion of a sqrt-free lattice algorithm achieved better numerical re-

    su l ts than that us ing the convent ional Giv ens ro ta t ion method.

    Thus , much ef for t has been spent

    on

    minimizing or everl elimi-

    nat ing the sqr t operat ion f rom these a lgor i thms . One wel l-known

    example is the sqr t - free Given s ro ta t ion fi r st proposed by Gentle-

    man [5]. Hammarling generalized his results briefly [9]. Later,

    o ther vers ions of the sqr t - f ree Given s ro ta t ions were a lso proposed

    [ l ] , [ 2 ] , [ 8 ] .

    All

    of the above algor i thms only focus

    on

    the sqrt-

    free Givens rotation itself andlor its applications in solving a least

    squares (LS) problem. M cWh ir ter (131 was the f i rs t to apply the

    sqr t- f ree Givens ro ta t ion to recurs ive LS (RLS) problems in com -

    put ing the des ired opt imum res idual without so lv ing expl ic i t ly for

    the LS coefficients. Closely related to the Givens rotation method

    is the modif ied Gram-Schmidt (MGS) or thogonal izat ion , which is

    another approach in per forming the QRD. Ling et al. [ l l ] , 1121

    and Kalson and Yao

    [lo]

    independent ly developed the sqr t - f ree

    MG S methods for the R LS f i l ter ing problems . A rank-one updat ing

    of Cholesky factor izat ion without sqr t s has a lso been repor ted in

    the l i tera ture 131. Recent ly , Chen and Yao [4] summarized the

    works done on the sqrt-free RLS filtering and proposed another

    more efficient sqrt-free method. So far , there has been more than

    ten known sqrt-free algorithms 111, [2], [4], 151, [ 8 ] - [ l l ] . Ho w-

    ever , a l l of the previous ly known der ivat ions were based

    on

    heu-

    r is tic approaches . Th ere is

    no

    known sys tematic way of generat ing

    the sqr t - f ree a lgor i thms . Motivated by these works , we wish to

    understand the fundamental re la t ionships among these sqr t - f ree a l-

    gor i thms .

    One

    of the contr ibut ions of th is correspondence is that

    these fundamental re la t ionships are character ized in s imp le man-

    ners through only two parameters .

    The pro to types of general ized sqr t- f ree a lgor ithms are g iven in

    Section 11, where all of the sqrt-free algorithms are found by the

    selection of two parameters . W e proceed in Sect ion I to seek a

    sqr t-f ree opt im um res idual of the RLS f i l ter ing problem. A brief

    conclus ions is g iven in Sect ion IV .

    11. THEpv F A M I L YF SQ U A R E-R O O T-FR EELGORITHMS

    A Givens ro ta t ion matr ix as g iven by

    is used to premult ip ly a two-row matr ix

    1

    Y1

    C Y 2 a p

    P

    02 . . 0

    a;

    f f ff;

    to zero out the e lement a t the

    (2,

    I ) locat ion such that i t becomes

    where

    c = a,/-, a n d s = PI/- (1)

    1053-587X/93 03.00 993 IEEE

  • 8/11/2019 475_46ce89a7

    2/5

    I406

    IEEE rRANSACTIONS ON SIGNAL PROCESSING, VOL. 41, NO.

    3.

    MARCH 1993

    In VLSI circuit design, sqrt operation is expensive, because i t

    takes up much area

    or

    i s s low (due to many i t e ra t ions) . Therefore ,

    it i s advantageous to avoid or minimize sqrt operations.

    By taking out a sca l ing fac tor f rom each row, the two row s under

    considera t ion before and a f te r the Givens or thogonal t ransforma-

    tions is denoted by

    s expressions in (18) are not explici t ly needed

    in

    the computa t ion

    of ( l3)- (17 ) . Th e use of the rota t ion parameter c in 18) (with one

    sqrt operation) will be further considered in Section

    I11

    when the

    optimum residual e i s des i red. F urthermore , Sec t ion 111 will show

    that it is possible to obtain

    e

    without any sqrt operation and the

    explici t computation of the rota t ion parameter c can be bypassed.

    To avoid repe t i t ive computa t ions and take the advantage of previ -

    ously com puted resul t s, (14) .

    16),

    and (18) use the newly updated

    k:

    of (13) . As s ta ted ear l i e r , we a re free to choose those two pa-

    rameters and

    U.

    Different choices of

    p

    and

    U

    will affect the num-

    ber of multiplications and divisions, as well as the numerical sta-

    bil i ty and parallel ism of these computations.

    It can be easily shown that this unified view can generate al l of

    the previously known sqrt-free algori thms via a proper choice of p

    and U . In fact , there has been more than ten sqrt-free algori thms

    known so fa r . Among them are Gent leman [ 5 ] , Hammarl ing [ 9 ] ,

    Bare iss [ I ] , Kalson and Yao

    [ I O ] ,

    Ling er al. [ I l l [ 1 2 ] , Barlow

    and Ispen [2], Chen and Yao 141, Gotze and Schwiege lsohn [ 8 ] .

    For example , i f we choose p =

    and

    U = 1 , i t becomes the sqrt-

    free algori thm proposed by Gent leman in [5] and can be upda ted

    as fol lows:

    k6 = k,,a:

    +

    k,bt

    19)

    ki,

    = k,,ki,/kA 20)

    a;

    =

    (21)

    a, = (k,a,a, +khb1b,)/kl,

    b,

    =

    -bla, a,b,.

    j

    = 2 ,

    . P

    (22)

    and

    where

    6)

    ,,

    k h , k6,

    and kA are the scaling factors result ing in sqrt-free

    - .

    operations, and

    a:

    and p: are the upda ted

    a,

    and 0 when

    PI

    is

    zeroed out.

    Now, our task is to find the expressions for, k:, k i , a; , {(a,

    b,), = 2. , p } , n te rms of k, , k h , { (a , ,b l ) , j =

    1,

    . , p } ,

    such that

    no

    sqrt operation is actually needed. The sqrt expressions

    of

    a,a,4.

    nd

    4

    n

    ( 5 )

    and

    6)

    are used for representa-

    t ional purposes only and are not actually performed.

    Replacing a =

    &a,, 0

    = a b , , ai = U,, 6

    =

    b, , j

    = 1, , p , in (1)-(4) l eads to

    a ; = J(k,at + kbbt)/kL

    4 k,a: kbb:

    a;

    =

    [koala,+ kbblb,]

    j = 2 ,

    . 3

    P .

    To check that these results are correct , we find that

    k u a l a ] + khblb ,

    a; =

    4 a ;

    =

    4

    ala, + PIP

    m

    c ,a / + SIP]

    and

    To avoid sqrt computa t ion, we need to de te rmine k: and ki such

    that ai, a,, and b, will not require sqrt operation. It is clear that if

    we choose

    k, ,

    and k i as

    ku kb

    k

    -

    v2(k,a: kbb:)

    .IO, -

    Pia,

    (24)

    -sa cp,

    which are consistent with the results in (3) and (4 ) . For the sys tol ic

    array implementation described in

    [13],

    we choose a , =

    1

    and de-

    fine the generalized rotat ional parameters

    where

    p

    and U are parameters that wil l be determined later to be

    any sqrt-free function of k,, kb,

    a , ,

    and b , , hen @ - I O ) can be

    computed wi thout sqr t opera t ion. We then have the fol lowing up-

    dating formulas without sqrt operation:

    k: = (koa: kbb:) /p2

    (13)

    C = k , /k : , and = k,b,/k, , . (25)

    Then we have

    a, = Cu, Sb,

    b, = b, bla,.

    (27)

    (28)

    b,

    =

    u[-b,a, a,b,]

    (17)

    c = ( a l / p ) a ,

    and

    = ( b , / p ) J k h / k : , . 18)

    Not ice tha t the sqrt opera t ions di sappear in our formulas of (13)-

    (17) , whi le they a re needed in the Givens rota t ions . Also, the

    c

    and

    These results are consistent with the works by Gentleman [ 5 ] and

    McWhirte r

    [

    131. The details of the systolic implementation can be

    found in [13].

    In Table I, we list various sqrt-free algori thms and the corre-

    sponding choices of p and

    U.

    Hence , thi s c lass of sqr t -free a lgo-

  • 8/11/2019 475_46ce89a7

    3/5

    I E E E T R A N S A C T I O N S O N S I G N A L P R O C E S S I N G , V O L .

    41,

    N O .

    3, MARCH 1993

    I407

    TABLE I

    SOME

    KNOWNQRT-FREEL G O R I T H M S

    F THE

    pv F A M I L Y

    Ir V

    Authors (Year) Remark

    1

    a , k ,b :

    1

    Gentleman (1973)

    a , =

    Hammarling (1974)

    Bareiss (1982)

    Ling (1989), KalsoniYao (1985)

    k , ,a ,

    =

    1

    CheniYao (1988) k ,a , =

    GotzeiSchwiegelshohn (1989)

    BarlowiIspen (1 987) Scaled

    New algorithm

    TABLE

    I1

    COMPARISONS

    O F

    COMPUTATIONAL COMPLEXITY

    OF

    SOME MEMBERSN + V F A M I L Y

    ~

    Square Root

    ultiplication Division Addition

    1

    2p - 1 1

    Givens Rotation

    4P

    p = l , u = l 4p + 3 2p - 0

    2p

    +

    6

    2

    2p

    - 1 0

    2p 6

    2

    2p - 0

    4p 4 2 2p

    - 1 0

    1

    a ,

    p = l , v = -

    4 p

    5

    2p - 2

    0

    koa: khb:

    u =

    1 4p + 6 2 2p - 0

    Cc=

    ko kh

    4p

    +

    4 1 2p - 1 0

    koa:

    +

    khb:

    p = koa: + k,b:,

    v

    =

    rithms is called the

    pv

    family of sqrt-free Givens rotation algo-

    1

    v

    r i thms .

    koa:

    kbb:

    not

    Only

    can

    we

    generate

    those

    known

    sqr t- f ree a lgo- then

    we

    can readily verify that this is a new sqrt-free algorithm. In

    f l t h m s , but we are

    choos ing

    new

    pairs Of

    ( p ,

    parameters .

    to f ind new sqr t- f ree

    by

    let us

    fact , the new sqf l- f ree a lgor i thm is among the bes t in the l is t of

    Table

    I

    in terms

    of

    number of d iv is ions , e .g . , i t on ly requires one

    n

    division and no square root. In principle, there are unlimited choice s

    of

    p

    and

    v

    for sqr t - f ree a lgor i thms . Table I shows compar isons

    of

    computat ional complexi ty of some algor i thms l is ted in Table I .

    choose

    p

    =

    koa: +

    kbb:

  • 8/11/2019 475_46ce89a7

    4/5

    1408

    IEEE

    TRANSACTIONS ON SIGNAL PROCESSING,

    VOL. 41, NO.

    3,

    M ARCH 1993

    111. SQRT-FREET R I A N G U L A RR R A Y P D A T I N GND O P T I M U M

    R E S I D U A L C Q U I S I T I O N

    Solving a full rank LS problem Aw = b ( A R m X n n

    without the sqrt operation can be easily achieved [5]. Let the QRD

    of

    A

    be

    QTA

    =

    R,

    here

    R

    s an upper triangular matrix, then

    and the optimum weight vector can be obtained by solving RO =

    U. Now, start ing with a

    ful l

    dense augmented mat rix

    &[A

    b] ,

    ser ies of sqr t -free rota t ions can be appl ied to zero ou t the subvec tor

    below the main diagonal of the underlying matrix to obtain

    [ R 0 ; 1

    where = diag 4,

    ' ,

    4)nd

    R

    =

    SF, U

    =

    U.

    Since the explici t comp utation of s not required, the optimum

    weight vector can be obtained without the sqrt operation by

    solving R O =

    U .

    In the following, we will apply the developed prototypes of sqrt-

    free rota t ions deve loped before to the Q RD-based RLS estimation

    problem where we are only interested in the optimum residu al . How

    to obtain the optimum residual by using the systolic array [131 has

    been well known. To be specific, we are interested

    in

    updat ing

    from

    R u

    R U

    L T

    to [o v 1

    (30)

    It has been shown [13] that the p X p upper t r i angular mat r ix R

    can be obta ined through a sequence of p Givens rota t ions , and the

    opt imum res idua l

    e for

    the newly appended da ta

    [x y ]

    is given

    by

    with c, represent ing the cosine va lue of the th rotat ion angle.

    agona l mat r ix l eads (30) to the form of

    Fac toring out the sca l ing constants into the premul t iplying di -

    Unl ike the previously deve loped formula , where we a re only in-

    terested

    in

    updat ing k , ,

    a,,

    , o k,'

    ,

    U, , and zeroing out al l the b , s ,

    this t ime we also need to know the cosine values explici t ly as re-

    quired in the opt imum res idual given in (31) .

    After the first rotat ion, b , will be zeroed out and we have

    (33)

    (34)

    with (p l , v I ) being the parameter pair which are st i l l free to be

    chosen later. Note the close analogy of (33)-(38) to those of (13)-

    (18) . Simi la r ly, a f te r the i th rota t ion (1