+ All Categories
Home > Documents > Section 8.1

Section 8.1

Date post: 22-Feb-2016
Category:
Upload: enrico
View: 52 times
Download: 0 times
Share this document with a friend
Description:
Section 8.1. Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that the population standard deviation σ is known. Estimating  When  is Known. Assumptions: - PowerPoint PPT Presentation
24
Section 8.1 Estimating When is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that the population standard deviation σ is known. 8.1 / 1
Transcript
Page 1: Section 8.1

Section 8.1

Estimating When is Known

In this section, we develop techniques for estimating the population mean μ using sample data.

We assume that the population standard deviation σ is known.

8.1 / 1

Page 2: Section 8.1

Estimating When is Known

Assumptions:1.We have a simple random sample of size n

from a population of x values.2.The value of σ, the population standard

deviation of x, is known. 3.If the x distribution is normal, then our

methods work for any sample size n.4.If the distribution is unknown, a sample size of

at least 30 (sometimes even more) is required.

8.1 / 2

Page 3: Section 8.1

Point Estimate

• A point estimate is an estimate of a population parameter given by a single number.

• We use (the sample mean) as a point estimate for μ (the population mean)

• is the point estimate for μ.

x

x

8.1 / 3

Page 4: Section 8.1

Examples of Point Estimates

• x is used as a point estimate for .• s is used as a point estimate for .• Margin of Error• Is the magnitude of the difference between the

point estimate and the true parameter value.• The margin of error using x

as a point estimate for is

or x x 8.1 / 4

Page 5: Section 8.1

Confidence LevelA confidence level, c, is a measure of the degree of

assurance we have in our results.The value of c may be any number between zero

and one.Typical values for c include 0.90, 0.95, and 0.99.

Critical Value for a Confidence Level, cthe value zc such that the area under the standard normal curve falling between – zc and zc is equal to c.

8.1 / 5

Page 6: Section 8.1

Confidence Level

The area under the normal curve from is the probability that the standardized normal variable z lies in that interval. This means that

c cz to z

8.1 / 6( )c cP z z z c

Page 7: Section 8.1

Find a number z0.90 such that 90% of the area under the standard normal curve lies between z0.90 and z0.90

That is, we will find z0.90 such that

P(– z0.90 < z < z0.90 ) = 0.90

ExampleFind the critical value

SolutionWe know that to find the z value when we were given the area between -z and z. The first thing we do is to find the corresponding area to the left of –z. If A is the area between –z and z, then (1-A)/2 is the area to the left of z. In our case the area between –z and z is 0.90.The corresponding area in the left tail is

(1-0.90)/2=0.058.1 / 7

Page 8: Section 8.1

According to Appendix Table 3, 0.0500 lies exactly halfway between two values in the table ( .0505 and .0495).

• Averaging the z values associated with areas gives• z0.90 = 1.645 and z0.90 = 1.645.• z0.90 = 1.645 is the critical value for a confidence level of

c = 0.90.• We have • P(-1.645 < z < 1.645) = 0.90.

Example cont.

8.1 / 8

Page 9: Section 8.1

Some Common Levels of Confidence and Their Corresponding Critical Values

8.1 / 9

Page 10: Section 8.1

An estimate is not very valuable unless we have some kind of measure of how “good” it is. The probability can give us an idea of the size of the margin of error caused by using the sample mean as an estimate for the population mean.

Remember that is a random variable. Each time we draw a sample of size n from a population, we can get a different value for . According to the central limit theorem, if the sample size is large, then has a distribution that is approximately normal with mean

the population mean we are trying to estimate.The standard deviation isIf x has a normal distribution, these results are true for any

sample size.

8.1 / 10

x

x

x

xx

x

/ .x n

Page 11: Section 8.1

This information, together with our work on confidence levels, leads us to the probability statement

(1)Comment:To derive Equation (1), we start with the probability

statement Since we can use the central limit theorem and replace z

by Finally, we multiply all parts of the inequality by to obtain Equation (1).

8.1 / 11

c cP z x z cn n

( )c cP z z z c 30,n

( ) / ( / ).x n / n

Page 12: Section 8.1

Equation (1) uses the language of probability to give us an idea of the size of the margin of error for the corresponding confidence level c. In other words,

Equation (1) states that the probability is c that our point estimate is within a distance of the population mean μ.

The margin of error (or absolute error) using is a point estimate for μ is | - μ|. In most practical problems, μ is unknown, so the margin of error is also unknown. However, Equation (1) allows us to compute an error tolerance E, which serves as a bound on the margin of error.

8.1 / 12

x ( / )cz n

xx

cE zn

Page 13: Section 8.1

Using a c% level of confidence, we can say that the point estimate differs from the population mean μ by a maximal margin of error

(2)

Note: Formula (2) for E is based on the fact that the sampling distribution for is exactly normal, with mean μ and standard deviation

This occurs whenever the x distribution is normal with mean μ and standard deviation σ. If the x distribution is not normal, then according to the central limit theorem,

large samples produce an distribution that is approximately normal with mean μ and standard deviation

8.1 / 13

x

cE zn

x/ .n

( 30)n x/ .n

Page 14: Section 8.1

Using Equations (1) and (2), we conclude thatP(-E < - μ < E) = c (3) andP( - E < μ < + E) = c (4)Equation (4) states that there is a chance of c that the

interval from - E to + E contains the population mean μ. We call this interval a c confidence interval for μ.

8.1 / 14

x

x x

xx x

A c confidence interval for is an interval computed from sample data in such a way that c is the probability of generating an interval containing the actual value of .

Page 15: Section 8.1

A c confidence interval for is an interval computed from sample data in such a way that c is the probability of generating an interval containing the actual value of .

We may get a different confidence interval for each different sample that is taken.

Some intervals will contain the population mean μ and others will not. However, in the long run, the proportion of confidence intervals that contain μ is c.

8.1 / 15

Page 16: Section 8.1

How Find a Confidence Interval for When is Known:

Let x be a random variable appropriate to your application. Obtain a simple random sample (of size n) of x values for which you compute the sample mean, The value of σ is already known (perhaps from a previous study).

If you cannot assume x has a normal distribution, use a sample size of 30 or more.

Confidence Interval for When is Known

.x

ExEx

ondistributi normal standard the on based level confidencefor value criticalz

level confidencecn

zE

MeanSamplexwhere

c

c

Page 17: Section 8.1

Create a 95% confidence interval for the mean driving time between Philadelphia and Boston.

Assume that the mean driving time of 64 trips was =6.4 hours and that the standard deviation is σ = 0.9 hours.

SolutionThe interval from - E to + E will be a 95% confidence

interval for μ. In this case,

8.1 / 17

ExampleConfidence interval for μ with σ known

x

x

x x

= 6.4 hours = 0.9 hoursc = 95%, so zc = 1.96 and n = 64and or

0.91.96 .220564cE z

n

x

x

x E x E 6.4 – .2205 < μ < 6.4 + .22056.1795 < μ < 6.6205

We are 95% sure that the true time is between 6.18 and 6.62 hours.

Page 18: Section 8.1

Commentsof the term confidence interval

It is important to realize that the endpoints are really statistical variables the equation

States that we have a chance c of obtaining a sample such that the interval, once it is computed, will obtain the parameter μ. Of course, after the confidence interval, is numerically fixed, it either does or does not contain μ.

So the probability is 1 or 0 that the interval, when it is fixed, will contain μ. A nontrivial probability statement can be made only about variables, not constants.

Therefore, the above equation really states that if we repeat the experiment many times and get lots of confidence intervals (for the same sample size), then the proportion of all intervals that will turn out to contain the mean μ is c. 8.1 / 18

x E( )P x E x E c

Page 19: Section 8.1

We may get different confidence intervals for different samples.

8.1 / 19

Page 20: Section 8.1

Guided Exercise Walter jogs 3 miles. He knows that the standard

deviation is σ = 2.40 minutes for his jogging times. For a random sample of 90 jogging times, his mean time was = 22.50 minutes. Let μ be the mean jogging time for the entire distribution of Walter’s 3-mile jogging times over the past several years. Find the 0.99 confidence interval for μ.

(a) What is the value of (Tables)(b) Is the distribution approximately normal?

Yes, we know this from the central limit theorem.

8.1 / 20

x0.99 2.58z

0.99 ?z 0.99 2.58z

x

x

Page 21: Section 8.1

Guided Exercise cont.(c) What is the value of E?

(d) What are the end points for a 0.99 confidence interval for μ?

(e) How can we interpret the confidence interval?We are 99% certain that the interval from 21.85 to 23.15is an interval that contains the population mean time μ.

8.1 / 21

3.402.58( ) 0.6590cE z

n

22.50 0.65 21.8522.50 0.65 23.15

x Ex E

Page 22: Section 8.1

When estimating the mean, how large a sample must be used in order to assure a given level of confidence?Use the formula:

2

Ezn c

If n is not a whole number, increase n to the next higher whole number.

Note that n is the minimal sample size for a specified confidence level and maximal error of estimate E.

Note: Use the standard deviation, s, of a preliminary sample of size 30 or larger to estimate .

PROCEDUREHow to find the sample size n for

estimating μ when σ is known

Page 23: Section 8.1

Determine the sample size necessary to determine (with 99% confidence) the mean time it takes to drive from Philadelphia to Boston. We wish to be within 15 minutes of the true time. Assume that a preliminary sample of 45 trips had a standard deviation of 0.8 hours.Solution:z0.99 = 2.58 E = 15 minutes = 0.25 hours

Since the preliminary sample is large enough, we can assume that the population standard deviation is approximately equal to 0.8 hours.

8.1 / 23

Example

Minimum Sample Size is: 2 22.58(2.15) 769.20.20

cznE

n 68.16Round to the next higher whole number.To be 99% confident in our results, the minimum sample size = 69.

Page 24: Section 8.1

ExampleA study is designed to find the mean weight of salmon caught

by an Alaskan fishing company. A recent study of a random sample of 50 salmon showed s = 2.15 lb.

How large a sample should be taken to be 99% confident that the sample mean is within 0.20 lb of the true mean weight μ?

Solution:

WE conclude that a sample size of 770 or larger is enough to satisfy the specifications.

8.1 / 24

xx

0.99 2.58 0.20 2.15z E s 2 22.58(2.15) 769.2

0.20cznE

Assignment 19


Recommended