Date post: | 01-Apr-2015 |
Category: |
Documents |
Upload: | megan-furman |
View: | 214 times |
Download: | 0 times |
From the Data at Hand to the World at LargeChapters 19, 23
Confidence Intervals
Estimation of population parameters:
•an unknown population proportion p
•an unknown population mean
Concepts of Estimation• The objective of estimation is to estimate
the unknown value of a population parameter, like the mean , on the basis of a sample statistic calculated from sample data.
e.g., NCSU housing office may want to estimate the mean distance from campus to hometown of all students
• There are two types of estimates– Point Estimate– Interval estimate
What do we frequently need to estimate?
• An unknown population proportion p
• An unknown population mean
?
p?
Point Estimates
• The sample mean is the best point estimate of the population mean
• p = , the sample proportion of x successes in a sample of size n, is the best point estimate of the population proportion p
x
^x
n
Example: Estimating an unknown population proportion p • Is Herb Sendek's departure good or bad for
State's men's basketball team? (Technician opinion poll; not scientifically valid!!)
• In a sample of 1000 students, 590 say that Sendek’s departure is good for the bb team.
• p = 590/1000 = .59 is the point estimate of the unknown population proportion p that think Sendek’s departure is good.
^
Example: Estimating an unknown mean
• In an effort to improve drive-through service, a Burger King records the drive-through service times of 52 randomly selected vehicles.
• The sample mean service time =181.3 seconds is the point estimate of the unknown mean service time
x
Shortcoming of Point Estimates
• = 181.3 seconds, best estimate of mean service time
• p = 590/1000 = .59, best estimate of population proportion p
BUT
How good are these best estimates?
No measure of reliability
^
x
Another type of estimate
A confidence interval is a range (or an interval) of values used to estimate the unknown value of a population parameter .
http://abcnews.go.com/US/PollVault/
Interval Estimator
95% Confidence Interval for p
n)p(1p
1.96p
written
)n
)p(1p1.96p,
n)p(1p
1.96p(
:p for
interval confidence 95% aconstruct tonx
p Use
ˆˆˆ
ˆˆˆ
ˆˆˆ
ˆ
Standard Normal
P(-1.96 z 1.96) =. 95
p1.96
pqp
n 1.96
pqp
n
.95
Confidence levelSampling distribution of
ˆ95% of the time p will be in this interval
Therefore, the interval
ˆ ˆ1.96 , 1.96
will "capture" 95% of the time
pq pqp p
n n
p
p̂
Standard Normal
P(-1.96 z 1.96) =. 95
Example (Gallup Polls)
)544.,496(.024.52.1600
)48)(.52(.96.152.
ˆˆ96.1ˆ
calculate wefor
interval confidence 95% a desire weifThen
.52.ˆ suppose voters;1600ely approximat
sample typicallypolls preferenceVoter
n
qpp
p
p
http://abcnews.go.com/US/PollVault/story?id=145373&page=1
Medication side effects (confidence interval for p)Arthritis is a painful, chronic inflammation of the joints.
An experiment on the side effects of pain relievers
examined arthritis patients to find the proportion of
patients who suffer side effects.
What are some side effects of ibuprofen?Serious side effects (seek medical attention immediately):
Allergic reaction (difficulty breathing, swelling, or hives),Muscle cramps, numbness, or tingling,Ulcers (open sores) in the mouth,Rapid weight gain (fluid retention),Seizures,Black, bloody, or tarry stools,Blood in your urine or vomit,Decreased hearing or ringing in the ears,Jaundice (yellowing of the skin or eyes), orAbdominal cramping, indigestion, or heartburn,
Less serious side effects (discuss with your doctor):Dizziness or headache,Nausea, gaseousness, diarrhea, or constipation,Depression,Fatigue or weakness,Dry mouth, orIrregular menstrual periods
Calculate a 90% confidence interval for the population proportion p of arthritis patients who suffer some “adverse symptoms.”
* ˆ ˆˆ
.052(1 .052).052 1.645
440.052 1.645(0.011)
.052 .018
pqp z
n
052.0440
23ˆ p
For a 90% confidence level, z* = 1.645.
We are 90% confident that the interval (.034, .070) contains the true
proportion of arthritis patients that experience some adverse symptoms when
taking ibuprofen.
90%CIfor :
0.052 0.018 (.034,.070)
p
p̂
440 subjects with chronic arthritis were given ibuprofen for pain relief; 23 subjects suffered from adverse side effects.
* ˆ ˆˆ
pqp z
n
What is the sample proportion ?
Tool for Constructing Confidence Intervals for : The Central Limit
Theorem• If a random sample of n observations is
selected from a population (any population), then when n is sufficiently large, the sampling distribution of x will be approximately normal.
(The larger the sample size, the better will be the normal approximation to the sampling distribution of x; we’ll use n 30)
Estimating the Population Mean when the Population
Standard Deviation is Known• How is an interval estimator produced from a
sampling distribution?– To estimate , a sample of size n is drawn from the
population, and its mean is calculated.
– Under certain conditions, is normally distributed (or approximately normally distributed by the CLT).
x
x
Confidence Interval for a population mean
A 95% confidence interval for
a population mean :
1.96 , 1.96
usually written
1.96
x xn n
xn
Standard Normal
P(-1.96 z 1.96) =. 95
EXAMPLE60, 30.4, 1.6
95% confidence interval for
1.630.4 1.96
60
30.4 .405
(29.995,30.805)
We are 95% confident that the interval
from 29.995 to 30.805 contains
the true but unknown value of
n x
n
96.1n
96.1
.95
Confidence levelSampling distribution of x
interval in this be willx time theof %95
time theof 95% capture"" will
96.1,96.1
interval theTherefore,
nn
xx
Standard Normal
98% Confidence Intervals
2.33 , 2.33
written
2.33
For
x xn n
xn
ˆ ˆ ˆ ˆ(1 ) (1 )ˆ ˆ2.33 , 2.33
written
ˆ ˆ(1 )ˆ 2.33
For
p p p pp p
n n
p pp
n
p
Four Commonly Used Confidence Levels
Confidence Level Multiplier
.90 1.645
.95 1.96
.98 2.33
.99 2.58
Example (cont.) 60, 30.4, 1.6;
95% : (29.995, 30.805)
90% : 1.645
1.630.4 1.645 30.4 .34 (30.06,30.74)
60
98% : 2.33
1.630.4 2.33 30.4 .481 (29.919,30.881)
60
n x
CI
CI multiplier
CI multiplier
Example (cont.)
99% CI: multiplier 2.58
1.630.4 2.58 30.4 .533 (29.867,30.933)
60
Example Summary• 90% (30.06, 30.74)• 95% (29.995, 30.805)• 98% (29.919, 30.881)• 99% (29.867, 30.933)• The higher the confidence level, the wider
the interval• Increasing the sample size n will make a
confidence interval with the same confidence level narrower (i.e., more precise)
Example (cont.)
60, 30.4, 1.6
95% : (29.995, 30.805)
100, 30.4, 1.6
1.695% : 30.4 1.96( ) 30.4 .314
100(30.086, 30.714) ( , )
n x
CI
n x
CI
narrower more precise
Example
• Find a 95% confidence interval for p, the proportion of small businesses in favor of a tax increase to decrease the national debt, if a random sample of 1000 found the number of businesses in favor of increased taxes was 50.
Example (solution)
50ˆ ˆ.05, .95and the confidence1000interval is
(.05)(.95).05 1.96 = .05 .014
1000(.036, .064)
p soq
Interpreting Confidence Intervals• Previous example: .05±.014(.036, .064)
• Correct: We are 95% confident that the interval from .036 to .064 actually does contain the true value of p. This means that if we were to select many different samples of size 1000 and construct a 95% CI from each sample, 95% of the resulting intervals would contain the value of the population proportion p. (.036, .064) is one such interval. (Note that 95% refers to the procedure we used to construct the interval; it does not refer to the population proportion p)
• Wrong: There is a 95% chance that the population proportion p falls between .036 and .064. (Note that p is not random, it is a fixed but unknown number)