+ All Categories
Home > Documents > Norm approximation problems and norm statistics · 2011. 4. 26. · 3. Norm Statistics The...

Norm approximation problems and norm statistics · 2011. 4. 26. · 3. Norm Statistics The...

Date post: 17-Nov-2020
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
4
JO U R NA L OF RESEARCH of the National Bureau of Standards Vol ume 83 , No. ] , J anuary-February 1978 Norm Approximation Problems and Norm Statistics D. R. Shier and C. J. Witzgall Institute for Basic Standards, National Bureau of Standards, Washington, DC 20234 (September 15, 1977) Thi s paper e xp lores a relation be twee n variou s a pp roximati on problems (ar is in g fr OIl1 filling lin ea r mod e ls to da ta ) a nd con 'es po ndin g statis ti ca l meas ures (norm sta ti s ti cs) . It is es ta bli sh ed that for any optimal so lution to an approximation problem defined with res pec t to a norm, the res ultin g resi dual s have ze ro as their no rlll s tatisti c. ' rhi s res ult holds whenever the un de rl y in g des ign matrix has a co lu lllll of ones . An extension to the case of a rbitrary d es ig n IIlalri ces is al so tonside red. Key word s: App rox imation; curv e- fitt ing; Lp prob lems; I t:: ast squ ares; llIinimiza ti un; lI o rm ; res idu als; sta ti s ti c. 1 . Motivation In a pap e r! di scuss ing alt ernative crite ri a to l eas t s qua res for the fittin g of line ar model s to dat a, Appa and Sm ith [If derive ce rta in prop e rti es of solutions to L! approxi mation problems (i.e., curve-filling problem s in whi c h th e s um of abso.lu te deviations is minimized ). In particular, Prope rt y 2 of [lJ c har ac te ri zes m the sign pallern of the res iduals ei = Yi - 6 0 - 2: 6jXij co rr es pondin g to an optimal solution (b o, . . . , b m) j= ! to an LJ approximation probl em with independent variab l es x], .. . , Xm and dependent variabl e y. Th e res ult of Appa a nd Smith stat es th a t IN! - N21 :S In + 1, wh e re N! and N2 denote, res pec tively, the numb er of positive res idual s and the numb er of negative res idu als c orres pondin g to any optimal L! solution. This observ a tion a dmit s of a slight gene ra lization [4]: namely, IN ! - N21 :S Z, wh e re Z indi ca tes th e number of zero-va .lued res iduals in the given optimal solution. ( Th e a ss umption employed in r1J to e limin at e d ege neracy ins ur es that Z :S m + 1, and thus the res ult of Appa a nd Sm ith fo ll ows im medi a te ly from the above ine qu alit y.) It is s tr aightforward to show that IN! - N21 :S Z is e quiv alent to the statement that the res idual s in an optima l L! s olution have a median a/ zero. Rec all thal a me dian of some se t of obse rvat ions is an y value that e xceeds at most half the ob se rved numb ers, a nd is excee d ed by at most ha lf th e ob se rv ed numb e rs. From thi s defini tion it imm e diat ely follows that a me dian of th e numb e rs U! , ... , Un (not necessa rily di stinct) is any value such that (1) and (2) wh e re NM) = ca rd{i: Ui > N2W = ca rd{i: Ui < and card{i: Ui = Hence, zero tS a me dian of the res idual s e ], ... , en if and only if N! + Z 2: N2 a nd N2 + Z 2: N J• But the lall e r two in equ aliti es ar e clearly e quival e nt to IN! - N21 :S Z. The point to be e mpha sized here is that the sign patt ern result 3 IN! - N 21 :S Z is e qually a sta temen t about zero being a median of ce rtain res idu als. Such a result brings to mind a re lat ed statemen t about the res idual s for solution s to L2 (l eas t s quar es) approximation problems: namely , the mean of the residuals, derived from an o ptimal L2 soluti on, is zero. Like wi se for Lx> approximation problems (in which the objec t is AMS Subject Classification: 62JOS, 65010, 9OC50. I Th is paper is also co mmented upon in the short communication /3J ofCc nli e el al. Z Fi gures ill brackets indi cate the lit erature references al the end of th is pape r. 3 It is also easy 10 s how that whc ll n is odd , a s li gh tl y s tronger result obtains: - IV 21 :S Z - I. Indeed, sim;e N I + Nt + Z = " = odd. the par it y (even. odd) of N. + Nt. lind thus N 1 - Nz• is the same as the parity of Z - 1. Accordingly. - Nz i $, Z is equi vale nt to - Nz i ::5, z - 1. whell/! is odd. 71
Transcript
Page 1: Norm approximation problems and norm statistics · 2011. 4. 26. · 3. Norm Statistics The discussion in section 1 ind icated that certain statistics (namely, the median, mean and

JO URNA L OF RESEARCH of the National Bureau of Standards Vo lume 83, No. ] , J anuary-February 1978

Norm Approximation Problems and Norm Statistics

D. R. Shier and C. J. Witzgall

Institute for Basic Standards, National Bureau of Standards, Washington, DC 20234

(September 15, 1977) Thi s pape r e xplores a re lati on be tween various approx imation probl ems (aris ing frOIl1 fillin g linear mod e ls

to d a ta) and co n'es po nding s tati s ti ca l measures (norm s ta ti s ti cs) . It is establi shed that for a ny optima l solution to an approximation probl em d efin ed with res pec t to a norm , the resultin g residual s have ze ro as thei r norlll s tatisti c. 'rhi s result holds whe never the unde rl yin g des ign ma tri x has a co lu lllll of o nes . An ex tens ion to the case of a rbitrary des ign IIlalri ces is a lso tonside red.

Key words: App rox imati on; c urve- fitt ing; Lp prob lems; It:: as t squ a res; llIinimi zati un; lIorm ; res idua ls; s ta ti s ti c.

1 . Motivation

In a paper! di sc uss ing alte rn a tive c rite ri a to least squares for the fittin g of linear models to da ta, Appa

and Sm ith [If derive certa in prope rti es of so luti ons to L! approximation problems (i. e . , c urve-filling

problems in whi ch th e sum of a bso.lu te de viations is minimized ). In parti c ul a r, Propert y 2 of [lJ c harac te ri zes m

the sign pallern of the residuals ei = Yi - 60 - 2: 6jXij corresponding to an optimal solution (bo, . . . , bm) j = !

to an LJ approximation proble m with inde pe nd ent variables x], .. . , Xm and depe nd e nt variable y. The

result of Appa and Smith states th a t IN! - N21 :S In + 1, whe re N ! and N2 denote, respec tive ly, the numbe r

of pos itive res idual s and the numbe r of negative res iduals corresponding to any optimal L! solut ion.

Thi s observa tion admits of a slight ge neral ization [4]: na me ly, IN ! - N21 :S Z , whe re Z indicates th e

numbe r of ze ro-va.lued res idua ls in the g ive n optimal soluti on. (The ass umpti on e mpl oyed in r1J to e liminate degeneracy in sures tha t Z :S m + 1, and thu s th e res ult of Appa a nd Sm ith fo llows im medi a te ly from the

above inequality.)

It is straightforwa rd to show that IN! - N21 :S Z is equivalent to th e sta te me nt that the res iduals in an

optima l L! solution have a median a/ zero . Recall th a l a median of some set of obse rvat ions is an y valu e tha t

exceeds at mos t half the observed numbers, a nd is exceeded by a t mos t half th e obse rved numbe rs . From

thi s defini tion it immediately follows that a median of the numbers U! , ... , Un (not necessa rily di stinc t) is

any value ~ suc h that

(1)

and (2)

where NM) = card{i: U i > ~}, N2W = card{i: U i < ~}, and Z(~) card{i: Ui = ~}. Hence, zero t S a

median of th e res idual s e ], ... , e n if and only if N ! + Z 2: N2 a nd N2 + Z 2: N J • But the lalle r two

in equaliti es are c learly equival e nt to IN! - N21 :S Z . The point to be emphas ized here is that the s ign patte rn result 3 IN! - N 21 :S Z is equally a statement

about zero be ing a median of certain res iduals. Such a result brings to mind a related statemen t about the

res iduals for solutions to L2 (least squares) approximation problems: namely , the mean of the residuals,

derived from an optimal L2 solution, is zero. Like wi se for Lx> approximation problems (in which the objec t is

AMS Subject Classification: 62JOS, 65010, 9OC50. I This paper is a lso co mmented upon in the short communication /3J ofCc nli e el al. Z Figures ill bracket s indicate the lit erature refe rences al the end of th is pape r. 3 It is also easy 10 s how that whc ll n is odd , a s li gh tl y stronger result obtains: ~,y l - IV 21 :S Z - I. Indeed , sim;e N I + Nt + Z = " = odd. the parit y (even . odd) of N . +

Nt. lind thus N 1 - Nz• is the same as the parit y of Z - 1. Accordingl y. ~1 - Nz i $, Z is equi vale nt to ~Vl - Nz i ::5, z - 1. whell/! is odd.

71

Page 2: Norm approximation problems and norm statistics · 2011. 4. 26. · 3. Norm Statistics The discussion in section 1 ind icated that certain statistics (namely, the median, mean and

to mmllTlIZe the maximum absolute dev iation), it is known that the midrange [6] of the res iduals in an optimal Loo solution is zero . One wonde rs whether th ese facts might not be separate manifes tations of a

general relationship be twee n approximati on proble ms and corresponding s tatistical measures . Suc h a general

re la ti ons hip indeed ex is ts and will be explored in th e subseq uent sec ti ons. The proof of thi s relationship is

extremely s imple, s impler than the proofs for the spec ia l L, a nd L2 cases we have found in the lite rature. The results of this paper the re fore provide both s implifica ti on and unification.

2. Norm Approximation Problems

Suppose tha t n sets of observations are available on a single dependent variable y and m 2: ° indepe nd e nt var iablesxlo ... , x m. Such observations can be arra nged in a column vector y = 0' 10 .•• , Y n)T a nd an n X m matrix X = (xu), wh ere r i , Xi ]' ... , X im re prese nt observatio ns in the ith set. The n th e Lp approximation problem [2], l ::S p ::soo, is thatoffinding values bo, b lo ... , bmthatminim ize

(3)

over all bo, blo ... , b m' For th e case p = 1, the proble m is that of minimizing the sum of the absolute

values of th e deviations by c hoi ce of parame ters bo, b I, ... , b m' When p = 2 , the above formulation

present s the famili a r problem of c urve-fitting by least squares. In the case p = 00, the objective fun ction in

(3) becomes maxi IYi - bo - I J!I bfXiil , and we have the linear Che bys hev approximation proble m. Every

suc h Lp approximation problem can in fac t be formulated [2] as a mathe matical programming problem with a

co nvex objective fun ction and I inear cons traints .

A problem more gene ral than that described by the objec tive function (3) is the weighted Lp approximation problem, wh ere 1 ::s p < 00 . Given nonnegative we ights W 10 ••• , W n , thi s proble m concerns

findin g parameter values bo, blo . . . , bm to minimize

(4)

The in clus ion of weigh ts in the above may re fl ec t, for example, ide nti cal observations as well as differing

degrees of confidence (o r meas ures of importance) to be atlached to the observed data points.

An even more ge neral a pproximation problem can be formulated in the present context with respec t to

a ny norm. A norm N(x) is de fin ed on vec tors x and is ass umed to have the following properti es [5]:

N(x) > 0 unl ess x = 0 ,

N( Ax) = NV (x), for A 2: 0 ,

N(x + y) ::s N(x) + N(y).

Let b = (b I , .. . , b"J T and form the residuals e = y - bo 1 - X b , where 1

norm approximation problem is tha t of findin g (bo, b) to minimize

N(e) = N(y - bo 1 - Xb) .

The objective fun c tion (3) is a spec ial case of (5) with N(e) = N(elo . . , e ,.) also a spec ial case with N(e) = [I V=I wi lei IV]!/p.

(1, ... , If. The n the

(5)

It can readily be s hown that N(e ) is a convex fun ction of (b o, b), and thus the a pproximation problem

described by (5) is well be haved: any local minimum to this problem is also guaranteed to be a global

mllllmum.

72

Page 3: Norm approximation problems and norm statistics · 2011. 4. 26. · 3. Norm Statistics The discussion in section 1 ind icated that certain statistics (namely, the median, mean and

3. Norm Statistics

The disc uss ion in sect io n 1 ind icated that certain statistics (n amely, the med ian, mean and midrange)

were useful in desc ribing properti e s of certa in Lp approximation problems . Name ly, the residuals of an

optimal Ll solution ha ve a medi a n of zero , the res iduals of an L2 solution have a mea n of zero, and the

residuals of an L x> soluti on have a midrange of ze ro. Moreover, it is well known tha t these three sta ti s tics

themselves sol ve a ppropri a te one-dime nsional Lp approximation problems .

For exa mple , the median of a set of values Ul. .. . , Un is a value v tha t minimi zes 2: I~ 1 ~L ; - v i over

a ll possible v . That is, a medi an solves an Ll approximation problem with one pa ra mete r. S imila rl y, the

mea n of UI , . .. , Un minimizes 2: ~1 lUi - v1 2, a nd thus also [2: ['= 1 Iu; - vI2]1 12 . Accordingly, the mea n

solves a one-para mete r L2 proble m. Finally, the midrange minimi zes max ; lUi - v i, a n L"" a pprox ima ti on

problem, aga in with one paramete r. As sugges ted by the above exampl es, we d efin e a p-statistic of

UI , ..• , unto be a value v tha t minimizes

where 1 ~ p ~ 00 . Thi s definiti on fo ll ows tha t g ive n by Ri ce a nd White [7] , who re fer to s uc h a va lue as a n

"Lp estim ate ." In s imilar fa shion, a weiglaed p-statistic of Ub . . . , Un is defin ed to be a va lue v that

mlnlml zes

where th e nonnegati ve we ig hts Wi are give n a nd 1 ~ p < 00 . Suc h a concept o-e ne rali zes, for example, the

idea of a we ighted mean o r a we ighted med ian.

Fina ll y, le t N be a norm a s d e fin ed in secti on 2. Then a norm statistic , or a n N-statistic, fo r u = (U b ... , u ,J T is defin ed to be a value v tha t minimi zes N(o - v 1). Clea rl y, the conce pt of an N -s tati sti c

incl udes as spec ial cases bo th p-s ta ti s ti cs a nd we ighted p-s ta ti sti cs .

4. Norm Approximation Problems and N-Statistics

This sec tion conta ins the ma in result re la tin g N -s ta ti sti cs a nd norm approx imati on proble ms.

TH EO R EM: Let (bo, 6) be an. optimal solution to the norm approximation problem, (5) , and let e

- X h. Then zero is an N-stat istic lor the resid uals e.

PROOF: N(e - 0·1) = N(e ) =N(y - b o l-Xb) ~ N(y - [&0 + v] 1 - X b) = N (y - b 01 - X b - v I)

= N(e - v 1)

for all v

for all v

for all v.

The third line above holds beca use (60, b) minimi zes (5). The resulting inequality N(e - 0'1 ) ~ N(e - v

1 ), for all v, shows that 0 mini mizes N(e - v 1 ), a nd so 0 is an N -sta ti s ti c for e. Thi s comple tes the proof.

Noti ce th a t in the proof a bove, we did not a t all need the norm properti es of N . As a matte r of fac t, N coul d have bee n an a rbitrary fun c ti on; in thi s case, the theore m applies to a global solution (if it exis ts) to a

ve ry ge nera l approxima tion problem.

5. Arbitrary Design Matrices

A furthe r gene ra li za ti on of the a bove th eorem is possible for weighted Lp approximation problems . The

exte ns ion of inte rest a llows an arbitrary " des ign matrix ," where a column of l 's is not necessaril y imposed.

73

Page 4: Norm approximation problems and norm statistics · 2011. 4. 26. · 3. Norm Statistics The discussion in section 1 ind icated that certain statistics (namely, the median, mean and

In such a problem, the object is to find h = (60 , 6 m) such tha t

(6)

is minimized.

EXTENSION: Let h be an optimal solution to (6), and let e = y - X h. Then zero is a weighted p-statistic (1 ::s: p < oo)for the values {ei!xiO:xiO +- 0, i = 1, ... , n} with weights WdXiOIP.

n n

PROOF: 2: wilei - o· xiOlp = 2: Wi leil p ~) ~I

n m

= 2: wilYi - 2: bjXijlp i=1 j=O n m

= 2: wi lYi - boXiO - 2: bhlp i=l j=l n m

::s: 2: wilYi - [bo + V]XiO - 2: bjXijlp i=) j=1 n

= 2: wilei - vXiOlp· i=l

Thus, if we define T = {i: XiO '1= O}, the above inequality gives

or

Upon taking the pth root (1 ::s: p < (0) of both sides, we conclude that zero IS a weighted p-statistic for

{ei/xiO: XiO '1= o} with we ights Wi~iOlp. Notice that in the proof above, the choice of the first column , corresponding to the X iO'S, t S clearly

arbitrary. Any column of the design matrix can be used with similar result.

6. References

[I] Appa, G., and Smith, c. , On LI and Chebyshev estimation , Mathemati cal Programming 5 (1973), pp. 73-87. [2] Barrodale, I. , and Roberts, F. D. K., Applications of mathematical programming to L p approximation, in Nonlinear Programming,

J. B. Rosen, O. L. Mangasarian and K. Ritte r, Eds., (Academic Press, New York, 1970), Pl'. 447-464. [3] Gentl e , 1. E., Sposito, V. A., and Kennedy, W. J. , On some properties of L, est imators, Mathematical Programming 12 (1977),

pp . 139-140. [4] Sposi to, V.A., Kenn edy, W. J., and Gentle, J.E. , Useful generalized properties of LI-estimators, to appear in Mathematical

Programming.

[5] Householder, A. S. , The approx imate solution of matri x problems, J. Assoc . Compul. Mach. 5 (1958), pp. 205-243. [6] Kendall , M. G. , and Stuart, A., The Advanced Theory of Stati stics, Vol. 1 (Charles Griffin and Co., London, 1963). [7] Ri ce, J. R. , and White, 1. S., Norms for smoothing and estimation , SIAM Review 6 (1964), pp. 243-256.

74


Recommended