+ All Categories
Home > Documents > Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the...

Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the...

Date post: 24-Sep-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
33
Background Univariate location Sign Determinations MP and QC Summary Confidence Limits Philip B. Stark Department of Statistics University of California, Berkeley Progress on Statistical Issues in Searches SLAC National Accelerator Laboratory, Stanford, CA 4–6 June 2012
Transcript
Page 1: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Confidence Limits

Philip B. StarkDepartment of Statistics

University of California, Berkeley

Progress on Statistical Issues in SearchesSLAC National Accelerator Laboratory, Stanford, CA

4–6 June 2012

Page 2: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Confidence sets

• Datum X ∈ X drawn from IPµ, µ ∈ Θ.

• If the random set S(X) satisfies IPθ{S(X) 3 θ} ≥ 1− α for allθ ∈ Θ, S is a 1− α confidence set.

• Probability meaningful only before the datum is observed: IfX = x, either S(x) 3 θ or not.

• Connected confidence set for real parameter: confidenceinterval (CI).

Page 3: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

What are Confidence Intervals Good For?

• Express uncertainty in estimates of parameters

• Also allow inferences about signs of parameters: positive,indeterminate, negative. Tukey’s “three decisions.”

• Short intervals desirable to minimize uncertainty, but notnecessarily for sign determination: don’t maximize the numberof correct decisions

• Cf. 1-sided versus 2-sided hypothesis tests

Page 4: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Hypothesis testing

• Decision between two theories about the world: null andalternative hypotheses.

• Null: µ ∈ Θ0. Alternative: µ ∈ Θa.

• Datum X drawn from IPµ, µ ∈ Θ.

• If set AΘ0 ⊂ X satisfies IPθ{X /∈ AΘ0} ≤ α for all θ ∈ Θ0, AΘ0

is a (significance) level α test .

• The power of AΘ0 against the alternative µ ∈ Θa isinfθ∈Θa IPθ{X /∈ AΘ0}.

• Nearly always some tradeoff between level and power.

• Aθ is unbiased if ∀η ∈ Θ, IPθ{X ∈ Aθ} ≥ IPη{X ∈ Aθ}.

Page 5: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Duality between Tests and Confidence Sets

• Simple hypothesis completely specifies distribution of X.

• Suppose have family {Aθ}θ∈Θ of level-α tests, one for eachsimple hypothesis µ = θ ∈ Θ.

• Then S(X) ≡ {θ ∈ Θ : X ∈ Aθ} is a 1− α confidence set for µ:

IPθ{S(X) 3 θ} ≥ 1− α, ∀θ ∈ Θ.

Standard result, extremely powerful!

Page 6: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Univariate Location Model, Nonnegative Parameter

• Datum X.

• X− µ has cdf F .

• F has a symmetric, continuous, unimodal density f (x), strictlydecreasing for x ≥ 0 in the support of f .

• Want to learn about µ.

• Know a priori that µ ≥ 0. I.e., Θ = [0,∞).

Page 7: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Nonnegative Univariate Location: Conventional intervals

• Conventional approach: make acceptance regions as small aspossible for 2-sided, or as powerful as possible for 1-sided.

• E.g., take F ∼ N(0, 1); α = 0.05 (95% CL).

• 2-sided interval for µ is [X− 1.96,X + 1.96] ∩ [0,∞).

• 1-sided upper interval is [0,X + 1.64] ∩ [0,∞)(actually a 2-sided interval, but never “separates” from 0).

• 2-sided is empty if X < 1.96; one-sided is empty if X < 1.64.

Page 8: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Flip-flopping

Scientific goal may change, depending on what can be said basedon the data available.

Some practitioners make upper 1-sided CI if the results are “null”(i.e., consistent with zero) but a 2-sided CI if the results are“significant” (i.e., sufficiently larger than zero).

In other fields, common to make upper 1-sided CI if results are belowzero and lower 1-sided CI if the results are above zero.

If you make the decision based on the data but use 95% CI eitherway, the composite procedure can have much less than 95%coverage for some θ.

Page 9: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Feldman & Cousins (1998) Complaints

• Flip-flopping overstates the true coverage.

• If X is sufficiently small, both 1-sided and 2-sided traditional CIsare empty. What then?

• CIs combine goodness of fit testing with parameter estimation;Feldman & Cousins prefer to separate those functions.(Introduces problems I won’t discuss.)

Page 10: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Feldman-Cousins “Unified” Method• Construct acceptance region not to make the region as small as

possible, but to consist of points with highest likelihood ratio toconstrained MLE (cMLE).

• E.g.,

f (x||µcMLE) =

{(2π)−1/2, x ≥ 0(2π)−1/2 exp(−x2/2), x < 0.

• Ratio for null θ is

Rθ(x) =

{exp((x− θ)2/2), x ≥ 0exp(−xθ − θ2/2), x < 0.

• Calibrate to have right level for each θ; correct coverage thenguaranteed.

• Does not separate from zero until later than “flip-flop.”• Does not give empty CIs, even for large negative x.

Page 11: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Feldman Cousins (1998) Figure 10

assume that the experiment is to search for transformationsbetween muon type neutrinos, n

m

, and electron type neutri-nos, ne , and that the influence of other types of neutrinos canbe ignored. We hypothesize that the weak eigenstates un

m

&

and une& are linear superpositions of two mass eigenstatesun1& and un2& ,

une&5un1&cos u1un2&sin u ~5.1!

and

unm

&

5un2&cos u2un1&sin u , ~5.2!

and that the mass eigenvalues for un1& and un2& are m1 andm2 , respectively. Quantum mechanics dictates that the prob-ability of such a transformation is given by the formula@2,16#

P~

n

m

!ne!5sin2~

2u!sin2S 1.27Dm2LE D , ~5.3!

where P is the probability for a n

m

to transform into a ne , Lis the distance in km between the creation of the neutrinofrom meson decay and its interaction in the detector, E is theneutrino energy in GeV, and Dm25um1

22m22u in (eV/c2)2.

The result of such an experiment is typically representedas a two-dimensional confidence region in the plane of thetwo unknown physical parameters, u, the rotation angle be-tween the weak and mass eigenstates, and Dm2, the ~posi-tive! difference between the squares of the neutrino masses.Traditionally, sin2(2u) is plotted along the horizontal axisand Dm2 is plotted along the vertical axis. An example ofsuch a plot is shown in Fig. 11, based on a toy model that wedevelop below. In this example, no evidence for oscillationsis seen and the confidence region is set as the area to the leftof the curve in this figure.

B. Proposed technique for determining confidence regions

The problem of setting the confidence region for a neu-trino oscillation search experiment often shares all of thedifficulties discussed in the previous sections. The variable

sin2(2u) is clearly bounded by zero and one. Values outsidethis region can have no possible interpretation within thetheoretical framework that defines the unknown physical pa-rameters. Yet consider an experiment searching in a regionof Dm2 in which oscillations either do not exist or are wellbelow the sensitivity of the experiment. Such an experimentis typically searching for a small signal of excess ne interac-tions in a potentially large background of ne interactionsfrom conventional sources and misidentified n

m

interactions.Thus, it is equally likely to have a best fit to a negative valueof sin2(2u) as to a positive one, provided that the fit to Eq.~5.3! is unconstrained.Typically, the experimental measurement consists of

counting the number of events in an arbitrary number of bins@17# in the observed energy of the neutrino and possiblyother measured variables, such as the location of the interac-tion in the detector. Thus, the measured data consist of a setN[

$

ni%, together with an assumed known mean expectedbackground B[

$

bi% and a calculated expected oscillationcontribution T[

$

m iusin2(2u),Dm2%.To construct the confidence region, the experimenter must

choose an ordering principle to decide which of the largenumber of possible N sets should be included in the accep-tance region for each point on the sin2(2u)-Dm2 plane. Wesuggest an ordering principle identical to the one suggestedin Sec. IV, namely the ratio of the probabilities,

R5P

~

NuT !

P~

NuTbest!, ~5.4!

where Tbest„sin2(2u)best ,Dmbest2 … gives the highest probability

for P(NuT) for the physically allowed values of sin2(2u) andDm2.In the Gaussian regime, x

2522 ln(P), and so this ap-proach is equivalent to using the difference in x

2 between Tand Tbest , i.e.,

R8[Dx

25(

iF ~

ni2bi2m i!2

s i2 2

~

ni2bi2mbesti!2

s i2 G ,

~5.5!

FIG. 10. Plot of our 90% confidence intervals for the mean of aGaussian, constrained to be non-negative, described in the text.

FIG. 11. Calculation of the confidence region for an example ofthe toy model in which sin2(2u)50. The 90% confidence region isthe area to the left of the curve.

57 3885UNIFIED APPROACH TO THE CLASSICAL . . .

Page 12: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Complaint about “Unified”

• Using a principled approach is nice, but why that principle?

• Heuristic “maximize likelihood ratio compared to cMLE” is notdesigned to accomplish the real goal. Why not go straight forthat?

• Desirable properties of Unified Method are accidentalconsequences.

• Empty CIs are informative: Evidence that the model is wrong.Should not happen often if model is right.

• Unified Method never tells you the model is no good, no matterhow bad it is.E.g., upper confidence bound at 90% is 0.4 when X = −2.

Page 13: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Goals

Want to distinguish a parameter from zero, but also find a short CI forit.

One-sided tests and CIs can discriminate the parameter from zero fora smaller value of the observation than two-sided tests and CIs, but:

• must fix the sign you are looking for before looking at the data(c.f. flip-flopping)

• one-sided CIs are infinitely long—precision limited

Page 14: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Neyman’s 1935 three decision rule

Observe X = µ + Z , where Z ∼ N(0, 1). Neyman’s rule is

1. Decide µ > 0 if X ≥ zα.

2. Decide µ < 0 if X ≤ −zα.

3. Make no decision if −zα < X < zα.

This rule controls the misclassification probability at level α:

IPθ{misclassifying the sign of θ} ≤ α, ∀θ ∈ R. (1)

Neyman’s goal was a sharper rule for classifying sgn(µ) thanobtained by testing H0 : µ = 0 against a two-sided alternative andmaking a directional decision when |X| > zα/2.

Page 15: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Tukey (1991)Statisticians classically asked the wrong question—and were willing toanswer with a lie . . .. They asked: “Are the effects of A and B different?”and they were willing to answer “no.” All we know about the world teachesus that the effects of A and B are always different in some decimal place. . .. What we should be answering first is “can we tell the direction in whichthe effects of A differ from the effects of B?” . . . The follow-up question isabout how much—about what we are confident of concerning the numericaldifference:

effect of A MINUS effect of B

which we abbreviate as A-B. If the first question was answered “directionuncertain” then the larger part of the answer to follow-up question is howbig might A-B be... If the first question was answered “A-B positive” then thelarger part of the answer to the follow-up question answers, usually: “whatis the minimum size of A - B ?.” The smaller part, usually, answers: “What isthe maximum of A-B?”

Page 16: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Distillation

• When can’t classify sign, should give a short two-sided CI.

• When classify sign as positive, should have a lower endpointlarger than zero (“larger part” of the follow-up question)

• Upper endpoint should be finite (“smaller part” of the follow-upquestion)

Page 17: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Types of sign determinations

• “Sign exclusion” or “weak sign determination”: CI containsvalues of only one sign, and possibly zero.

• (Strict) sign determination: CI contains values of only one sign,and does not contain zero.A strict implies weak, but not vice versa.

• “Separates from zero”: CI contains values of only one sign, andits closure does not contain zero. If a CI separates from zero, itgives a strict sign determination.

Page 18: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Strategy

• Trade off some length for some observations; get signdetermination for smaller observed values than CS.

• Sign exclusion almost as early as Neyman’s three-decision rule.

• Exploit duality between tests and confidence sets to tailor CIs tohave special properties.

• Construction analogous to Feldman-Cousins, but based ondesired property of CI instead of likelihood ratio to cMLE.

Page 19: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Back to math

Recall X− µ ∼ F .

Let cp ≡ F−1(1− p).

Conventional symmetric (CS) interval: [X− cα/2,X + cα/2].

Length of CS is 2cα/2.

Allow CIs to be longer than this, to determine signs more often.

Deliberately use biased tests, to get other desirable properties.

Page 20: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

MP and QC: Benjamini, Hochberg, Stark (1998)Modified Pratt and Quasi-Conventional intervals95% confidence and 20% increase in max length vs. CS:

• MP makes weak sign determination when |x| ≥ 1.656(c.f. 1.645 for 1-sided)

• MP makes strict sign determination when |x| ≥ 1.96 (like CS)

• MP separates from zero when |x| > 3.048.

• QC makes weak sign determination when |x| > 1.675

• QC makes strict sign determination when |x| ≥ 1.96

• QC separates from zero when |x| > 2.744

• QC same length as CS when |x| > 4.419.

• CS makes weak and strict sign determinations when |x| ≥ 1.96

• separates from zero when |x| > 1.96.

MP & QC give weak sign determination for x ≈ 15% smaller thanCS, while at most 20% longer (not on average).

Page 21: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

The Modified Pratt (MP) Acceptance Region

Under the restriction that the density f of X− θ is unimodal andsymmetric, the acceptance region of the most powerful test ofIEX = θ against the alternative IEX = 0 is

AMP(θ) ≡{

(θ − c̃, θ + c̄), θ < 0(θ − c̄, θ + c̃), θ > 0,

(2)

where c̄ is the smaller root of

F(2rcα/2 − c) = 2− α− F(c), and (3)

c̃ ≡ 2rcα/2 − c̄. (4)

Define AMP(0) ≡ (−cα/2, cα/2) for symmetry.

Page 22: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

The Modified Pratt (MP) Interval

Inverting AMP gives

SMP(X) =

(X− c̄,X + c̄), 0 ≤ X < c̄[0,X + c̄), c̄ ≤ X < cα/2

(0,X + c̄), cα/2 ≤ X < c̃(X− c̃,X + c̄), X ≥ c̃,

(5)

with SMP(X) = −SMP(−X) for X < 0.

For r = 1, MP is CS; for r =∞, MP is unbounded.

Page 23: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

MP weakly determines the sign of θ for the largest possible set ofvalues of X, among CIs that are never longer than 2rcα/2.

MP is longer than CS (by as much as the fraction r − 1) when|X| > 2cα/2 − c̄.

Page 24: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Quasi-Conventional (QC) Confidence Intervals

CI that reverts to CS when |X| is large by penalizing the size of theacceptance region.

Earlier weak sign determinations than CS by penalizing the extent towhich the acceptance region crosses the origin

Leads us to seek for each θ

arg minA{λ|A|+ sup

x∈A:sgnx 6=sgnθ|x|} s.t. IPθ(X ∈ A) ≥ 1−α. (6)

1st term controls the length of the CI.

2nd term controls the range of values of X for which the CI includesparameter values with sign opposite to that of θ.

λ is a Lagrange multiplier for the constraint |A(θ)| ≤ C.

Page 25: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

With no penalty for the acceptance region crossing the origin,solution to the optimization problem is CS.

If choose λ so that |A| ≤ C, optimal acceptance regions are

AQC(θ) =

(−cα/2, cα/2), θ = 0(θ − c̄, θ + c̃), 0 < θ ≤ c̄(0, θ + F−1(2− α− θ)), c̄ < θ ≤ cα/2

(θ − cα/2, θ + cα/2), θ > cα/2,

(7)

with AQC(θ) = −AQC(−θ) for θ < 0,

c̃ = (2r − 1)cα/2, (8)

c̄ = F−1(2− α− F(c̃)). (9)

Page 26: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Inverting the QC tests

Inverting these acceptance regions and taking the convex hull yields

SQC(X) =

(−c̄, c̄), X = 0(X− c̄,X + cα/2), 0 < X ≤ c̄[0,X + cα/2), c̄ < X < cα/2

(0,X + cα/2), cα/2 ≤ X ≤ c̃(X− c̃,X + cα/2), c̃ < X ≤ c̄ + c̃(X− cα/2,X + cα/2), X > c̄ + c̃

(10)

for X ≥ 0; for X < 0, S(X) = −S(−X).

Maximum length is L(AQC) = c̃ + cα/2 = 2rcα/2.

Page 27: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Benjamini, Hochberg, Stark (1998) Figure 1���� -RXUQDO�RI�WKH�$PHULFDQ�6WDWLVWLFDO�$VVRFLDWLRQ��0DUFK������

���

����

���

���

���� �� ��� �� �� �� �� �� ��

��� ��

��&/����

���

��� �� DOSKD ����� U ����

��� ��� ��� ��� �� �� �� �� ��REVHUYDWLRQ�[�

)LJXUH���� &6� �� ���03�������� DQG�4&� �� ������&RQILGHQFH�,Q��WHUYDOV�IRU�D�1RUPDO�0HDQ�DV�D�)XQFWLRQ�RI�WKH�2EVHUYHG�9DOXH�;� �[��7KH�03�DQG�4&�LQWHUYDOV�DUH�FRQVWUDLQHG�WR�KDYH�D�OHQJWK�WKDW�GRHV�QRW�H[FHHG�����WLPHV�WKH�OHQJWK��]����RI�WKH�&6�LQWHUYD/�)RU�HDFK�DEVFLVVD�[��WKH�WZR�GRWWHG�RUGLQDWHV�DUH�WKH�HQGSRLQWV�RI�WKH�&6�LQWHUYDO��DQG�WKH�GDVKHG�DQG�VROLG�RUGLQDWHV�DUH�WKRVH�RI�WKH�03�DQG�4&�LQWHUYDOV��7KH�03�DQG�4&� LQWHUYDOV�DUH�RSHQ�DW���ZKHQ�O[O�!� =����EXW�UHDFK���IRU�VPDOOHU�YDOXHV�RI�[�WKDQ�WKH�&6� LQWHUYDO��FRPSDUHG�WR�WKH�&6� LQWHUYDO��WKH\�KDYH�HQKDQFHG�DELOLW\�WR�H[FOXGH�SDUDPHWHUV�RI�RQH�VLJQ��

�UF�����)RU�<� 1�2������WKH�H[SHFWHG�OHQJWK�RI�WKH�03�LQWHUYDO�LV�

(R�,603��;�� ��&����&��� ���>��&�������� ���&����� �� ��D�>� �D�� ��� �����D�� ��@�

���F���� �� ���F�� ��� �� ����� �� ���F� �� ���� �����

)LJXUH� ��SORWV�WKH�03� LQWHUYDO�DV� D�IXQFWLRQ�RI�;�� IRU�R]� �����DQG�U� � ����DQG�<� 1�2������:KHQ�WKH�REVHU��YDWLRQ�;� WDNHV�WKH�YDOXH�[�RQ�WKH�DEVFLVVD�� WKH�RUGLQDWHV�RI�WKH�WZR�GDVKHG�FXUYHV�DW�WKDW�YDOXH�RI�[�DUH�WKH�ORZHU�DQG�XSSHU�HQGSRLQWV�RI�WKH�03� FRQILGHQFH�LQWHUYDO�IRU����)RU�UHIHUHQFH��FRUUHVSRQGLQJ�FXUYHV�IRU�WKH�FRQYHQWLRQDO�LQWHUYDO�DUH�SORWWHG�ZLWK�GRWV��7KH�VROLG�OLQHV�DUH�WKH�TXDVL��FRQYHQWLRQDO�LQWHUYDOV��GLVFXVVHG�ODWHU��1RWH�WKDW�WKH�03�LQ��WHUYDO�LV�ORQJHU�WKDQ�WKH�FRQYHQWLRQDO�LQWHUYDO��E\�DV�PXFK�DV�WKH�IUDFWLRQ�U����� ZKHQ�,;,�!� �F���� F��)LJXUH���SORWV�DV�D�IXQFWLRQ�RI�����WKH�UDWLR�RI�WKH�H[SHFWHG�OHQJWK�RI�WKH�03�LQWHUYDO�WR�WKH�OHQJWK�RI�WKH�FRQYHQWLRQDO�V\PPHWULF�LQWHUYDO��GDVKHG�FXUYH���DJDLQ�IRU�<� 1�2������7DEOH� ��JLYHV�YDOXHV�RI�F�DQG�F�IRU�WKH�03�PHWKRG�ZKHQ�<� �� 1�2������IRU�D�YD��ULHW\�RI�FRQILGHQFH�OHYHOV�DQG�OHQJWK�UDWLRV�U��7KH�HQWULHV�LQ�WKH�WDEOHV�FDQ�EH�FDOFXODWHG�UHDGLO\�IURP�WKH�GHILQLQJ�HTXD��WLRQV����������� ZLWK�DQ\�VWDWLVWLFDO�VRIWZDUH�SDFNDJH� WKDW�LQFOXGHV�D�URRW�ILQGHU�DQG�WKH�FGI�DQG�LQYHUVH�FGI�RI�WKH�QRUPDO�GLVWULEXWLRQ��

��� 48$6,�&219(17,21$/�&21),'(1&(� ,17(59$/6�

:H�FDQ�REWDLQ�D�FRQILGHQFH�LQWHUYDO�WKDW�UHYHUWV�WR�WKH�FRQYHQWLRQDO�V\PPHWULF�LQWHUYDO�ZKHQ�,;� LV�ODUJH�E\�SH��QDOL]LQJ�WKH�VL]H�RI�WKH�DFFHSWDQFH�UHJLRQ��ZH�FDQ�REWDLQ�HDUOLHU�ZHDN�VLJQ�GHWHUPLQDWLRQ�WKDQ�WKH�FRQYHQWLRQDO�LQWHU��YDO�E\�SHQDOL]LQJ�WKH�H[WHQW�WR�ZKLFK�WKH�DFFHSWDQFH�UHJLRQ�

FURVVHV�WKH�RULJLQ��7KHVH�FRQVLGHUDWLRQV�OHDG�XV�WR�VHHN�IRU�HDFK���

$���� �DUJ�PLQ^$,$����,��� VXS� ,[�̀�$� [($�VJQ[�$VJQ2�

VXFK�WKDW�3U�;�(�$��!� ���� D�� �������

7KH�ILUVW�WHUP�LQGLUHFWO\�FRQWUROV�WKH�OHQJWK�RI�WKH�FRQIL��GHQFH�LQWHUYDO��DQG�WKH�VHFRQG�WHUP�LQGLUHFWO\�FRQWUROV�WKH�UDQJH�RI�YDOXHV�RI�;�IRU�ZKLFK�WKH�FRQILGHQFH�LQWHUYDO�LQ��FOXGHV�SDUDPHWHU�YDOXHV�ZLWK�VLJQ�RSSRVLWH�WR�WKDW�RI����7KH�SDUDPHWHU�$�LV�D�/DJUDQJH�PXOWLSOLHU�IRU�WKH�FRQVWUDLQW�,$���,���&��,I�WKHUH�ZHUH�QR�SHQDOW\�IRU�WKH�DFFHSWDQFH�UH��JLRQ�FURVVLQJ�WKH�RULJLQ��WKHQ�WKH�VROXWLRQ�RI�WKH�RSWLPL]D��WLRQ�SUREOHP�ZRXOG�EH�WKH�FRQYHQWLRQDO�LQWHUYDO��,I�WKHUH�ZHUH�QR�SHQDOW\�RQ�WKH�OHQJWK�RI�HDFK�DFFHSWDQFH�UHJLRQ��WKHQ�WKH�REMHFWLYH�IXQFWLRQDO�ZRXOG�EH�RQO\�ZHDNO\�FRQYH[��DQG�WKH�SUREOHP�ZRXOG�DGPLW�PRUH�WKDQ�RQH�VROXWLRQ��RQH�RI�ZKLFK�LV�WKH�RULJLQDO�3UDWW�LQWHUYDO������,I�$�LV�FKRVHQ�VR�WKDW�/�$�� �� &��WKHQ�WKH�DFFHSWDQFH�

UHJLRQV�WKDW�RSWLPL]H������DUH��DV�VKRZQ�LQ�WKH�$SSHQGL[��

��&2���� &D����� �� � R�

$4F���� � L� �2����)�����FD������ &��� ���� &5���

��� �� &D��� ���� &D����� �� !� &D����

ZLWK�$4&���� � �$4F����� IRU��������7KH�SDUDPHWHUV�F�DQG�F�DUH�JLYHQ�E\�

F ��U���FD��� �����

DQG�

F� �)����� ��D��)�F�� �����

�����

�� �������

������ ��

8� a� aaaaaaaaaaaa���

&�

�'� ��

������ ��

DW� ����

���������� �� ��

)LJXUH���� ([SHFWHG�/HQJWKV�RI�03���������DQG�4&��� ������&RQ��ILGHPQF� ,QWHUYDOV�� 5HOPDWLYH�WR� WKH� /HQJWK� RI�IKH� &Q�RQYHQLRQDO� 6\4PPHWULF�

,QHUDO� DVD)QWR�� IWH7X� DDHH� DX� �IU2VUDLQ�

:LKD� RPD� LVUEXLQ�7H� 3DQ� &� QHUDV� U� FQWULH�

WR��KDHP[PPOQW� R�[HGQ���WPVWHOQW� IWHFQ�YHWLQD� V\PWL�QHYO�

Page 28: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Benjamini, Hochberg, Stark (1998) Figure 2

���� -RXUQDO�RI�WKH�$PHULFDQ�6WDWLVWLFDO�$VVRFLDWLRQ��0DUFK������

���

����

���

���

���� �� ��� �� �� �� �� �� ��

��� ��

��&/����

���

��� �� DOSKD ����� U ����

��� ��� ��� ��� �� �� �� �� ��REVHUYDWLRQ�[�

)LJXUH���� &6� �� ���03�������� DQG�4&� �� ������&RQILGHQFH�,Q��WHUYDOV�IRU�D�1RUPDO�0HDQ�DV�D�)XQFWLRQ�RI�WKH�2EVHUYHG�9DOXH�;� �[��7KH�03�DQG�4&�LQWHUYDOV�DUH�FRQVWUDLQHG�WR�KDYH�D�OHQJWK�WKDW�GRHV�QRW�H[FHHG�����WLPHV�WKH�OHQJWK��]����RI�WKH�&6�LQWHUYD/�)RU�HDFK�DEVFLVVD�[��WKH�WZR�GRWWHG�RUGLQDWHV�DUH�WKH�HQGSRLQWV�RI�WKH�&6�LQWHUYDO��DQG�WKH�GDVKHG�DQG�VROLG�RUGLQDWHV�DUH�WKRVH�RI�WKH�03�DQG�4&�LQWHUYDOV��7KH�03�DQG�4&� LQWHUYDOV�DUH�RSHQ�DW���ZKHQ�O[O�!� =����EXW�UHDFK���IRU�VPDOOHU�YDOXHV�RI�[�WKDQ�WKH�&6� LQWHUYDO��FRPSDUHG�WR�WKH�&6� LQWHUYDO��WKH\�KDYH�HQKDQFHG�DELOLW\�WR�H[FOXGH�SDUDPHWHUV�RI�RQH�VLJQ��

�UF�����)RU�<� 1�2������WKH�H[SHFWHG�OHQJWK�RI�WKH�03�LQWHUYDO�LV�

(R�,603��;�� ��&����&��� ���>��&�������� ���&����� �� ��D�>� �D�� ��� �����D�� ��@�

���F���� �� ���F�� ��� �� ����� �� ���F� �� ���� �����

)LJXUH� ��SORWV�WKH�03� LQWHUYDO�DV� D�IXQFWLRQ�RI�;�� IRU�R]� �����DQG�U� � ����DQG�<� 1�2������:KHQ�WKH�REVHU��YDWLRQ�;� WDNHV�WKH�YDOXH�[�RQ�WKH�DEVFLVVD�� WKH�RUGLQDWHV�RI�WKH�WZR�GDVKHG�FXUYHV�DW�WKDW�YDOXH�RI�[�DUH�WKH�ORZHU�DQG�XSSHU�HQGSRLQWV�RI�WKH�03� FRQILGHQFH�LQWHUYDO�IRU����)RU�UHIHUHQFH��FRUUHVSRQGLQJ�FXUYHV�IRU�WKH�FRQYHQWLRQDO�LQWHUYDO�DUH�SORWWHG�ZLWK�GRWV��7KH�VROLG�OLQHV�DUH�WKH�TXDVL��FRQYHQWLRQDO�LQWHUYDOV��GLVFXVVHG�ODWHU��1RWH�WKDW�WKH�03�LQ��WHUYDO�LV�ORQJHU�WKDQ�WKH�FRQYHQWLRQDO�LQWHUYDO��E\�DV�PXFK�DV�WKH�IUDFWLRQ�U����� ZKHQ�,;,�!� �F���� F��)LJXUH���SORWV�DV�D�IXQFWLRQ�RI�����WKH�UDWLR�RI�WKH�H[SHFWHG�OHQJWK�RI�WKH�03�LQWHUYDO�WR�WKH�OHQJWK�RI�WKH�FRQYHQWLRQDO�V\PPHWULF�LQWHUYDO��GDVKHG�FXUYH���DJDLQ�IRU�<� 1�2������7DEOH� ��JLYHV�YDOXHV�RI�F�DQG�F�IRU�WKH�03�PHWKRG�ZKHQ�<� �� 1�2������IRU�D�YD��ULHW\�RI�FRQILGHQFH�OHYHOV�DQG�OHQJWK�UDWLRV�U��7KH�HQWULHV�LQ�WKH�WDEOHV�FDQ�EH�FDOFXODWHG�UHDGLO\�IURP�WKH�GHILQLQJ�HTXD��WLRQV����������� ZLWK�DQ\�VWDWLVWLFDO�VRIWZDUH�SDFNDJH� WKDW�LQFOXGHV�D�URRW�ILQGHU�DQG�WKH�FGI�DQG�LQYHUVH�FGI�RI�WKH�QRUPDO�GLVWULEXWLRQ��

��� 48$6,�&219(17,21$/�&21),'(1&(� ,17(59$/6�

:H�FDQ�REWDLQ�D�FRQILGHQFH�LQWHUYDO�WKDW�UHYHUWV�WR�WKH�FRQYHQWLRQDO�V\PPHWULF�LQWHUYDO�ZKHQ�,;� LV�ODUJH�E\�SH��QDOL]LQJ�WKH�VL]H�RI�WKH�DFFHSWDQFH�UHJLRQ��ZH�FDQ�REWDLQ�HDUOLHU�ZHDN�VLJQ�GHWHUPLQDWLRQ�WKDQ�WKH�FRQYHQWLRQDO�LQWHU��YDO�E\�SHQDOL]LQJ�WKH�H[WHQW�WR�ZKLFK�WKH�DFFHSWDQFH�UHJLRQ�

FURVVHV�WKH�RULJLQ��7KHVH�FRQVLGHUDWLRQV�OHDG�XV�WR�VHHN�IRU�HDFK���

$���� �DUJ�PLQ^$,$����,��� VXS� ,[�̀�$� [($�VJQ[�$VJQ2�

VXFK�WKDW�3U�;�(�$��!� ���� D�� �������

7KH�ILUVW�WHUP�LQGLUHFWO\�FRQWUROV�WKH�OHQJWK�RI�WKH�FRQIL��GHQFH�LQWHUYDO��DQG�WKH�VHFRQG�WHUP�LQGLUHFWO\�FRQWUROV�WKH�UDQJH�RI�YDOXHV�RI�;�IRU�ZKLFK�WKH�FRQILGHQFH�LQWHUYDO�LQ��FOXGHV�SDUDPHWHU�YDOXHV�ZLWK�VLJQ�RSSRVLWH�WR�WKDW�RI����7KH�SDUDPHWHU�$�LV�D�/DJUDQJH�PXOWLSOLHU�IRU�WKH�FRQVWUDLQW�,$���,���&��,I�WKHUH�ZHUH�QR�SHQDOW\�IRU�WKH�DFFHSWDQFH�UH��JLRQ�FURVVLQJ�WKH�RULJLQ��WKHQ�WKH�VROXWLRQ�RI�WKH�RSWLPL]D��WLRQ�SUREOHP�ZRXOG�EH�WKH�FRQYHQWLRQDO�LQWHUYDO��,I�WKHUH�ZHUH�QR�SHQDOW\�RQ�WKH�OHQJWK�RI�HDFK�DFFHSWDQFH�UHJLRQ��WKHQ�WKH�REMHFWLYH�IXQFWLRQDO�ZRXOG�EH�RQO\�ZHDNO\�FRQYH[��DQG�WKH�SUREOHP�ZRXOG�DGPLW�PRUH�WKDQ�RQH�VROXWLRQ��RQH�RI�ZKLFK�LV�WKH�RULJLQDO�3UDWW�LQWHUYDO������,I�$�LV�FKRVHQ�VR�WKDW�/�$�� �� &��WKHQ�WKH�DFFHSWDQFH�

UHJLRQV�WKDW�RSWLPL]H������DUH��DV�VKRZQ�LQ�WKH�$SSHQGL[��

��&2���� &D����� �� � R�

$4F���� � L� �2����)�����FD������ &��� ���� &5���

��� �� &D��� ���� &D����� �� !� &D����

ZLWK�$4&���� � �$4F����� IRU��������7KH�SDUDPHWHUV�F�DQG�F�DUH�JLYHQ�E\�

F ��U���FD��� �����

DQG�

F� �)����� ��D��)�F�� �����

�����

�� �������

������ ��

8� a� aaaaaaaaaaaa���

&�

�'� ��

������ ��

DW� ����

���������� �� ��

)LJXUH���� ([SHFWHG�/HQJWKV�RI�03���������DQG�4&��� ������&RQ��ILGHPQF� ,QWHUYDOV�� 5HOPDWLYH�WR� WKH� /HQJWK� RI�IKH� &Q�RQYHQLRQDO� 6\4PPHWULF�

,QHUDO� DVD)QWR�� IWH7X� DDHH� DX� �IU2VUDLQ�

:LKD� RPD� LVUEXLQ�7H� 3DQ� &� QHUDV� U� FQWULH�

WR��KDHP[PPOQW� R�[HGQ���WPVWHOQW� IWHFQ�YHWLQD� V\PWL�QHYO�

Page 29: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

MP and QC improve on CS for this problem

MP and QC make earlier sign exclusions than CS.

MP makes earliest sign exclusions, at the cost of being longer thanthe CS and QC on a set of infinite measure.

QC sacrifices some power against zero and some length when |X| issmall, but has a big length advantage over MP elsewhere.

The values of c̄ for MP and QC can be effectively indistinguishable,but still the values of c̃ for the two methods differ noticeably.

Then, QC separate from zero much sooner than MP, and areultimately much shorter. This results particularly when X− θ has thintails.

Page 30: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Most of the benefit from MP and QC comes with an increase in themaximum possible length over CS of 10%–20%.

With that increase, MP and QC make weak sign determinationsalmost as early as a one-sided test: for α = 0.05 and r = 1.2, onesign is excluded for |X| beyond about 1.01zα.

For r = 1.5 and reasonable confidence levels, MP and QC excludeone sign for essentially the same values of X as one-sided testswould, but give finite-length CIs.

Page 31: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Compared with CS, QC gives up length exactly where it buys anearlier sign exclusion, and in the region zα/2 ≤ |X| ≤ c̄ + c̃, whereQC still yields a strict sign determination, but one endpoint of QC isopen at zero.

In contrast, MP gives up length on an infinite set.

The “cost” of MP and QC in terms of expected CI length is even lessthan the cost in maximum length.

Page 32: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

Summary, further work

Can do the equivalent of MP or QC for parameters constrained to beof one sign; analogous to Feldman-Cousins: Simply intersect MP orQC with [0,∞).

Protects against flip-flopping but reveals when there is strongevidence that the model is wrong, and gives shorter expectedlengths (at least for many µ).

Claim: It more sense to optimize the desired criterion—make a signdetermination as soon as possible and keep length undercontrol—rather than to use an ad hoc principle such as likelihoodratio to cMLE.

Multivariate extension for simultaneous CIs: Benjamini, Madar, Stark(2012).

Page 33: Confidence Limitsstark/Seminars/slac12.pdfFeldman Cousins (1998) Figure 10 assume that the experiment is to search for transformations between muon type neutrinos,! ", and electron

Background Univariate location Sign Determinations MP and QC Summary

References

Benjamini, Y., Y. Hochberg, and P.B. Stark, 1998. ConfidenceIntervals with more Power to determine the Sign: Two Ends constrainthe Means, Journal of the American Statistical Association, 93,309–317.

Benjamini, Y., V. Madar, and P.B. Stark, 2011. SimultaneousConfidence Intervals with more Power to Determine Signs.Submitted to Biometrika. Preprint: http://statistics.berkeley.edu/~stark/Preprints/qc11.pdf

Feldman, G.J. and R.D. Cousins, 1998. Unified approach to theclassical statistical analysis of small signals, Phys. Rev. D, 57,3873–3889.


Recommended