+ All Categories
Home > Documents > Probability Distributions

Probability Distributions

Date post: 30-Oct-2014
Category:
Upload: darlingjunior
View: 774 times
Download: 2 times
Share this document with a friend
Description:
 
33
QUANTITATIVE METHODS
Transcript
Page 1: Probability Distributions

QUANTITATIVE METHODS

Page 2: Probability Distributions

PROBABILITY DISTRIBUTIONS

The emphasis on the sampling process is on the way a sample is selected to represent the population to reflect it’s characteristics.

Researchers find it difficult to ascertain if the sample is an accurate representative of the population.

If the sample is drawn according to the laws of probability then the degree to which the sample mirrors the population can be calculated in probabilistic terms.

Data collected using probabilistic measures can be approximated using probability distributions and results of probability distributions can be further used for analyzing statistical data.

The Rationale for Using Probability Theory

Page 3: Probability Distributions

PROBABILITY DISTRIBUTIONS

Binomial Distribution

Normal Distribution, Standard Normal Distribution

Central Limit Theorem

Estimation

Confidence Intervals

Estimating Sample Size

In this session ….

Page 4: Probability Distributions

PROBABILITY

DISTRIBUTIONSDISCRETE CONTINUOUS

e.g Binomial Distribution tosses of a coin. success or failure of students in an aptitude test.

e.g Normal Distribution Frequency distribution of light bulbs measured on a continuous scale of hours.

Page 5: Probability Distributions

- the B-school

Binomial Distribution

Characteristics of Bernoulli Process

1.Each trial has two possible outcomes: success or failure.

2.The probability of outcome is fixed over time.

3.Trials are statistically independent: outcome of one trial does not affect or depend on another.

4.Examples: In the random experiment of the throws of a dice ‘getting 6’ is a success and ‘not getting 6’ is a failure.

Page 6: Probability Distributions

- the B-school

Binomial Formula

p = probability of success,q = 1- p = probability of failure,r = number of successes,n = total number of trials,µ = np = mean. = = standard deviation.

!( )

!( )!n r n r r n r

r

nP X r C p q p q

r n r

npq

Page 7: Probability Distributions

Binomial Formula – Exercise

The incharge of the electronics section of a large departmental store has observed that the probability that a customer who is just browsing will buy something is 0.3. Suppose that 15 customers browse in the electronics section each hour, what is the probability that

(a) exactly 4 browsing customers will buy something in the specified hour?

(b) at least one browsing customer will buy something in the specified hour?

Page 8: Probability Distributions

Binomial Formula – Exercise (solution)

15 4 114( ) ( 4) (0.3) (0.7)a P X C

15 0 150

( ) ( 1) 1 ( 0)

1 (0.3) (0.7)

b P X P X

C

Page 9: Probability Distributions

- the B-school

Binomial Distribution – A Graphical Exploration

Draw probability histograms of the binomial distribution for(a) n = 10 and p = 0.1, 0.3, 0.5, 0.7, 0.9(b) p = 0.4 and n = 5, 10, 30

Page 10: Probability Distributions

- the B-school

Binomial Distribution – A Graphical Exploration

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

1 2 3 4 5 6 7 8 9 1 11

Series1

0

0.05

0.1

0.15

0.2

0.25

0.3

1 2 3 4 5 6 7 8 9 1 11

Series1

0

0.05

0.1

0.15

0.2

0.25

0.3

1 2 3 4 5 6 7 8 9 1 11

Series1

0

0.05

0.1

0.15

0.2

0.25

0.3

1 2 3 4 5 6 7 8 9 10 11

Series1

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

1 2 3 4 5 6 7 8 9 1 11

Series1

The binomial probability histogram for n = 10 and p = 0.1,0.3,0.5,0.7,0.9

Page 11: Probability Distributions

- the B-school

Binomial Distribution – A Graphical Exploration

The binomial probability histogram for p = 0.4 and n = 5,10,30

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

1 2 3 4 5 6

Series1

0

0.05

0.1

0.15

0.2

0.25

0.3

1 2 3 4 5 6 7 8 9 10 11

Series1

0

0.02

0.04

0.06

0.08

0.1

0.12

0.14

0.16

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 2 21 2 2 2 2 2 2 2 2 3 31

Series1

Page 12: Probability Distributions

- the B-school

Normal Distribution

Characteristics1. Curve is bell shaped and unimodal (single peak).2. Mean = Median = Mode (at the centre)3. To define the normal prob. dist. we need the mean

(µ) and standard deviation ().4. Area under the normal probability curve = 1.5. Approx 68% of all values in a normally distributed

population lie within µ ± .6. Approx 95.5% of all values in a normally distributed

population lie within µ ± 2. 7. Approx 99.7% of all values in a normally distributed

population lie within µ ± 3.

Page 13: Probability Distributions

- the B-school

Standard Normal Probability Distribution

Characteristics1. µ = 0, = 1.

2. (Standardizing a normal variable)

3. The standard normal prob. dist. Table shows the area under the normal curve between the mean and positive values of z.

4. Normally distributed random variables take different units of measure: dollars, inches etc. z denotes standard units (i.e standard deviations)

XZ

Page 14: Probability Distributions

Standard Normal Probability Distribution Example 1

The life of electronic tubes of a certain type are assumed to be normally distributed with mean 155 hours and standard deviation of 19 hours. What is the probability that

(i) The life of a randomly chosen tube is less than 117 hours.

(ii)The life of a randomly chosen tube is between 136 and 174 hours.

Page 15: Probability Distributions

Standard Normal Probability Distribution Example1 (Solution)

= 155 hours, = 19 hours

(i) P(X < 117) = P(Z < -2) = P(Z > 2) = 0.5 – P(0 < Z < 2)

= 0.5 – 0.4772 = 0.0228

(ii) P(136 < X < 174) = P(-1 < Z < 1) = 2P(0 < Z < 1)

= 2 0.3413 = 0.6826

Page 16: Probability Distributions

- the B-school

Standard Normal Probability Distribution - Example

Jarrid Medical, is developing a compact kidney dialysis machine but is having trouble controlling the variability of the rate at which fluid moves through the device. Medical standards require that the hourly flow be 4 litres, plus or minus, 0.1 litre, 80% of the time. Testing the prototype has revealed that 68% of the time the hourly flow is within 0.08 litres of 4.02 litres. Does the prototype satisfy medical standards?

Page 17: Probability Distributions

- the B-school

Standard Normal Probability Distribution – Example (Solution)

Solution: = 4.02, = 0.08

(3.9 4.1) 0.80P X

3.9 4.02 4.10 4.02( )

0.08 0.08( 1.5 1)

(0 1.5) (0 1)

0.4332 .3413 0.7745

XP

P Z

P Z P Z

Thus the prototype does not satisfy the medical requirements.

(required rate of flow)

Page 18: Probability Distributions

- the B-school

Central Limit Theorem

Let be the means of N samples of size n taken from a population. The mean of these (referred to as the mean of the sampling distribution of the mean) is denoted by

, the population mean even if the population is not normal.

is the standard error of the mean. The sampling distribution of the mean approaches

normality as n the sample size increases. The histogram of the means of many samples should approach a bell shaped curve.

Significance: CLT enables us to use sample statistics to make inferences about population parameters without knowing anything about the shape of the frequency distribution of the population.

1 2, ,...... Nx x x

X

X

x n

Page 19: Probability Distributions

- the B-school

Central Limit Theorem

If the random variables x1, x2,…..xN are independent and identically distributed with mean µ and standard deviation , then

is normally distributed with mean and

standard deviation .

1 2 ...... Nx x xx

N

n

Page 20: Probability Distributions

- the B-school

Central Limit Theorem – Exercise

If the weights of individual packets of 2 minute noodles varies according to a normal distribution with a mean of 85.8 grams and a standard deviation of 1.9 gramsa. Describe the distribution that will describe the mean weight of a simple random sample of 5 such packets of noodles.Such a sample of 5 packets of noodles in a multi- pack is labeled “average weight of contents: 85 grams”b. Determine the proportion of such multi-packs with an average weight within 1 gram of the claimed average weight.c. Determine the proportion of such multi-packs with an average weight less than what is claimed.

Page 21: Probability Distributions

- the B-school

Central Limit Theorem – Exercise (solution)

a. The mean weight is normally distributed with a mean of

85.8 grams and a standard deviation of

b.

c.

1.90.85

5x approx

n

5X

5(84 86) ( 2.11 0.235)

(0 2.11) (0 0.235)

0.4826 0.091 0.5736

P X P Z

P Z P Z

84 85.8 1.8( 2.11)

0.85 0.85X

XZ

5( 85) ( 0.94) 0.5 (0 0.94)

0.5 0.3264 0.1736

P X P Z P Z

(use NORMSDIST)

Page 22: Probability Distributions

ESTIMATION

POINT ESTIMATE

INTERVAL ESTIMATE

A single number is used to estimate an unknown population parameter.

e.g our current data of MBA enrolments indicates that 20% of the students who will enrol next session will be women.

Range of values used to estimate a population parameter.

e.g our current data of MBA enrolments indicates that 15% to 24 % of the students who will enrol next session will be women.

Page 23: Probability Distributions

- the B-school

Point Estimates

The sample mean is used to estimate the population mean µ.

The sample proportion is used to estimate the population proportion p.

The sample standard deviation s is used to estimate the population standard deviation .

xp

Page 24: Probability Distributions

- the B-school

Point Estimates - Example

The National Bank of Lincon is trying to determine the number of tellers available during the lunch rush on Fridays. The bank has collected data on the number of people who entered the bank during the last 3 months on Friday from 11 A.M to 1 P.M. Using the data below, find the point estimates of the mean and standard deviation of the population from which the sample was drawn.

242 275 289 306 342 385 279 245 269 305 294 328

Page 25: Probability Distributions

- the B-school

Point Estimates

Solution:

355940.75

12ixX

n

Page 26: Probability Distributions

- the B-school

Interval Estimates and Confidence Intervals

The confidence level is the probability that we associate with an interval estimate (1 – α).

Higher probability means more confidence. Commonly used confidence levels are 90%, 95%, 99%

(i.e α = 0.1,0.05,0.01). The statement regarding a sample of car batteries

“we are 90% confident that the mean battery life of the population lies within 32 to 42 months” means if we select many random samples of the same size and calculate the confidence interval for each of these samples, then in 90% of the cases, the population mean will lie within the estimated interval.

Page 27: Probability Distributions

- the B-school

t Distribution (Student’s t distribution)

Characteristics

1. Curve is symmetrical like normal distribution but flatter at the mean and higher at the tails.

2. Used to estimate population mean for sample size ≤ 30 and when population standard deviation is not known.

3. For n ≥ 30, t distribution can be approximated by the normal distribution.

4. Different t distribution for each sample size n ≤ 30 ( or each degree of freedom).

5. A t distribution with sample size n has n -1 degrees of freedom. E.g if we are using a sample of size 15 for estimating the population mean we will use 14 degrees of freedom to select the appropriate t distribution (from t table). Degree of freedom refers to the number of values we can choose freely.

Page 28: Probability Distributions

- the B-school

Interval Estimates for Population Mean

Seven homemakers were randomly sampled and it was determined that the distances they walked in their housework had an average of 39.2 miles per week and a sample standard deviation of 3.2 miles per week. Construct a 95% confidence interval for the population mean.

Page 29: Probability Distributions

- the B-school

Interval Estimates for Population Mean

Solution:

Sample size = n = 7 (we use t distribution since n ≤ 30)Degrees of freedom = 6.Sample mean = = 39.2 miles.Sample sd = = 3.2 miles (estimate of population sd)Standard error = x = = 3.2/2.645 = 1.209t value (from t table under column 0.05, 6 df,use TINV) = 2.447

= 39.2 ± 2.447 * 1.209 = 39.2 ± 2.9596 = (36.240,42.160) miles

x

n

x t

n

Page 30: Probability Distributions

- the B-school

Interval Estimates for Population Proportion

A quality control inspector collected a random sample of 500 tubes of toothpaste from the production line and found that 41 of them had leaks from the tail end. Construct a 90% confidence interval for the percentage of all toothpaste tubes that had leakage.

Page 31: Probability Distributions

- the B-school

Interval Estimates for Population Proportion

Solution:

Sample size = 500Point estimate of population proportion = = 41/500 = 0.082Confidence level of p = 0.90 = 1 - α.α = 0.10Standard error =critical z value = Z/2 (use NORMSINV(0.05)) = 1.645

= 0.082 ± 1.645* 0.0122 = 0.082 ±0.0201

= (0.062,0.1022) tubesApprox 6.2% – 10.2% of the tubes will have leakages.

ˆ ˆ 0.082 0.918 / 500 0.0122pq n

ˆ ˆ ˆp z pq n

Page 32: Probability Distributions

- the B-school

Determining Sample Size

The university is considering raising tution to improve school facilities and they want to determine what percentage of students favour the increase. The university needs to be 90% confident that the percentage has been estimated to within 2% of the true value. How large a sample is needed to guarantee this accuracy regardless of the true percentage?

Page 33: Probability Distributions

- the B-school

Determining Sample Size

Solution:

z value for 90% confidence level is = 1.645 (NORMSINV(0.05))Standard error =

pq/n = 0.00014884n = pq/0.00014884 The largest value of n will be obtained when pq is largest i.e when p = q = 0.5n = 0.5*0.5/ 0.00014884 = 1680

0.02z pq n 0.02 /1.645 0.0122pq n


Recommended