+ All Categories
Home > Education > Lesson 002

Lesson 002

Date post: 27-Jan-2015
Category:
Upload: ning-ding
View: 157 times
Download: 1 times
Share this document with a friend
Description:
Statistical Techniques for business and Economics
Popular Tags:
61
IBS Statistics Year 1 Dr. Ning DING [email protected] I007, Friday & Monday
Transcript
Page 1: Lesson 002

IBS Statistics Year 1Dr. Ning DING

[email protected], Friday & Monday

Page 2: Lesson 002

Table of content• Review

• Learning Goals

• Chapter 3-A

• Summary

Page 3: Lesson 002

ReviewChapter 1: What is Statistics?

What is the level of measurement for these items related to the newspaper business?

a.The number of papers sold each Sunday during 2006.

b.The departments, such as editorial, advertising , sports, etc.

c.A summary of the number of papers sold by county.

d.The number of years with the paper for each employee.

RatioRatio

RatioRatio

NominalNominal

RatioRatio

P14. N.2 Ch.1

RatioRatioNominalNominal OrdinalOrdinal IntervalInterval

Page 4: Lesson 002

ReviewChapter 1: What is Statistics?

For the follow questions, would you collect information using a sample or a population?

a.Statistics 201 is a course taught at a university. Professor A has taught nearly 1,500 students in the course over the past 5 years. You would like to know the average grade for the course

b.You are looking forward to graduation project and your first job as a salesperson for one of five large corporations. Planning for your interviews, you will need to know about each company’s mission, profitability, products, and markets.

SampleSample

PopulationPopulationP16. N.8 Ch.1

SampleSample PopulationPopulation

Page 5: Lesson 002

ReviewChapter 2: Describing Data

Qualitative DataQualitative Data

Bar Chart

Bar Chart

Pie Chart

Pie Chart

Page 6: Lesson 002

ReviewChapter 2: Describing Data

Quantitative DataQuantitative Data

Histogram

Histogram Polygon

Polygon

Cumulative Frequency

Distribution

Cumulative Frequency

Distribution

Page 7: Lesson 002

ReviewChapter 2: Describing Data

Polygon

Polygon

Cumulative Frequency

Distribution

Cumulative Frequency

Distribution

Page 8: Lesson 002

ReviewChapter 2: Describing Data

P34. N.10 Ch.2

A set of data contains 53 observations. The lowest value is 43 and the largest is 129. The data are to be organized into a frequency distribution.

a. How many classes would you suggest?

25 = 32, 26 = 64, suggests 6 classes

i > ≈ 15130 - 42

6

Use interval of 15And start first class at 40

b. What would you suggest as class interval & the lower limit of the first class?

Page 9: Lesson 002

Learning Goals• Chapter 3:

– Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean

– Explain the characteristics, uses, advantages, and disadvantages of each measure of location

– Identify the position of the mean, median, and mode for both symmetric and skewed distributions

– Compute and interpret the range, mean deviation, variance, and standard deviation

– Understand the characteristics, uses, advantages, and disadvantages of each measure of dispersion

– Understand Chebyshev’s theorem and the Empirical Rule as they relate to a set of observations

Page 10: Lesson 002

1. Population MeanChapter 3: Describing Data

Parameter: a numerical characteristic of a population.

Example: The fraction of U. S. voters who support Sen. McCain for President is a parameter.

Statistic: A statistic is a numerical characteristic of a sample.Example: If we select a simple random sample of n = 1067 voters from the population of all U. S. voters, the fraction of people in the sample who support Sen. McCain is a statistic.

Page 11: Lesson 002

Summary: Parameter & StatisticsChapter 3: Describing Data

Page 12: Lesson 002

1. Population MeanChapter 3: Describing Data

N

X=μ

Population mean = Sum of all the values in the population

Number of values in the population

Page 13: Lesson 002

1. Population MeanChapter 3: Describing Data

N

X=μ

Example:

Page 14: Lesson 002

2. Sample MeanChapter 3: Describing Data

Sample mean = Sum of all the values in the sample

Number of values in the sample

nΣX

=X

Page 15: Lesson 002

2. Sample MeanChapter 3: Describing Data

nΣX

=X

Example:

Page 16: Lesson 002

2. Sample MeanChapter 3: Describing Data

nΣX

=X

Example: A sample of five executives received the following bonus last year ($000):14.0, 15.0, 17.0, 16.0, 15.0

15.4=577

=5

15.0+...+14.0=

nΣX

=X

Page 17: Lesson 002

3. Properties of the Arithmetic MeanChapter 3: Describing Data

1. Every set of interval- or ratio-level data has a mean2. All the values are included in computing the mean3. The mean is unique.4. The sum of the deviations of each value from the

mean is zero.Example: Consider the set of values: 3, 8, and 4. The mean is 5. Illustrating the fifth property:

0=5)(4+5)(8+5)(3=)X -Σ(X ---

Page 18: Lesson 002

4. The Weighted MeanChapter 3: Describing Data

Weighted Mean: a set of numbers X1, X2, ..., Xn, with corresponding weights w1, w2, ...,wn, is computed from the following formula:

)n21

nn2211w ...w+w+(w

)Xw+...+Xw+X(w=X

Page 19: Lesson 002

4. The Weighted MeanChapter 3: Describing Data

Example: During a one hour period on a hot Saturday afternoon, Julie served fifty lemon drinks. She sold five drinks for $0.50, fifteen for $0.75, fifteen for $0.90, and fifteen for $1.10. Compute the weighted mean of the price of the drinks.

$0.89=50

$44.50=

15+15+15+515($1.15)+15($0.90)+15($0.75)+5($0.50)

=Xw

Frequency counts

Page 20: Lesson 002

Exercise

P62. N.14 Ch.3

The Bookstall sold books via internet. Paperbacks are $1.00 each, and hardcover books are $3.50. Of the 50 books sold on last Tuesday, 40 were paperback and the rest were hardcover. What was the weighted mean price of a book?

Chapter 3: Describing Data

)n21

nn2211w ...w+w+(w

)Xw+...+Xw+X(w=X

50.1$50

$3.50)*10+$1.00*(40=Xw

40 paperback$1.00

40 paperback$1.00

10 hardcover$3.50

10 hardcover$3.50

Page 21: Lesson 002

6. The ModeChapter 3: Describing Data

Mode:There is one situation in which the mode is the only measure of central tendency that can be used – when we have categorical, or non-numeric data. In this situation, we cannot calculate a mean or a median. The mode is the most typical value of the categorical data.

Example: Suppose I have collected data on religious affiliation of citizens of the U.S. The modal, or most Typical value, is Roman Catholic, since The RomanCatholic Church is the largest religious organization in the U.S.

Page 22: Lesson 002

6. The ModeChapter 3: Describing Data

Mode:

The value of the observation that appears most frequently.

Page 23: Lesson 002

6. The ModeChapter 3: Describing Data

Mode:

The value of the observation that appears most frequently.

Example: The exam scores for ten students are: 81, 93, 84, 75, 68, 87, 81, 75, 81, 87.

Because the score of 81 occurs the most often, it is the mode.

Page 24: Lesson 002

7. The MedianChapter 3: Describing Data

Median: the midpoint of the values after they have been ordered from the smallest to the largest.

Example: The ages for a sample of five college students are:

21, 25, 19, 20, 22

Arranging the data in ascending order gives: 19, 20, 21, 22, 25. Thus the median is 21.

Page 25: Lesson 002

7. The MedianChapter 3: Describing Data

For an even set of values, the median will be the arithmetic average of the two middle numbers.

Example: The heights of four basketball players, in inches, are: 76, 73, 80, 75

Arranging the data in ascending order gives: 73, 75, 76, 80. Thus the median is 75.5

Page 26: Lesson 002

7. The MedianChapter 3: Describing Data

72 68 65 70 75 79 73

Example:

Finding the median

65 68 70 72 73 75 79

65 68 70 72 73 75 79 79

72.5

65 68 70 72 73 75 79 79,000

72.5

Page 27: Lesson 002

Exercise

P65. N.18 & 22 Ch.3

Determine the mean, median, mode41 15 39 54 31 15 33

Chapter 3: Describing Data

Mean= 32.57; Median=33; Mode=15

List below are the total automobile sales (in millions of dollars) for the last 14 years. What was the median number of automobiles sold? What is the mode?

41 15 39 54 31 15 33

Median = 9.2 (millions of dollars)Modes are 8.2, 8.5 and 10.3 (millions of dollars)

Page 28: Lesson 002

Exercise

P69. N.26 Ch.3

Chapter 3: Describing Data

Mean Mode MedianCity - - -Wind direction

- Southwest -

Temperature

91 o F 92 o F 92 o F

Pavement - Wet & Dry Trace

Page 29: Lesson 002

5-B The Arithmetic Mean of Grouped Data

Chapter 3: Describing Data

Example:

32

Recall in Chapter 2, we constructed a frequency distribution for the vehicle selling prices. The information is repeated below. Determine the arithmetic mean vehicle selling price.

The Arithmetic Mean of Grouped Data -Example

32

Recall in Chapter 2, we constructed a frequency distribution for the vehicle selling prices. The information is repeated below. Determine the arithmetic mean vehicle selling price.

The Arithmetic Mean of Grouped Data -Example

33

The Arithmetic Mean of Grouped Data -Example

33

The Arithmetic Mean of Grouped Data -Example

Page 30: Lesson 002

5-B The Arithmetic Mean of Grouped Data

Chapter 3: Describing Data

33

The Arithmetic Mean of Grouped Data -Example

31

The Arithmetic Mean of Grouped Data

Page 31: Lesson 002

Exercise 5-B

P87. N.58 Ch.3

Determine the mean of the following frequency distribution.

Chapter 3: Describing Data

31

The Arithmetic Mean of Grouped Data

31

The Arithmetic Mean of Grouped Data

X=380/30=12.67

Page 32: Lesson 002

6-B. The Mode for Grouped DataChapter 3: Describing Data

Example:

Finding the mode for grouped data

Step 1:

Modal class with the highest frequency

32

Recall in Chapter 2, we constructed a frequency distribution for the vehicle selling prices. The information is repeated below. Determine the arithmetic mean vehicle selling price.

The Arithmetic Mean of Grouped Data -Example

Step 2:

Midpoint of the modal class is the mode

19.519.5

Page 33: Lesson 002

7-B. The Median for Grouped DataChapter 3: Describing Data

Example:

Finding the mode for grouped data

Step 1:

Modal class with the highest frequency

32

Recall in Chapter 2, we constructed a frequency distribution for the vehicle selling prices. The information is repeated below. Determine the arithmetic mean vehicle selling price.

The Arithmetic Mean of Grouped Data -Example

Step 2:

Midpoint of the modal class is the mode

19.519.5

Page 34: Lesson 002

6-B. The Mode for Grouped DataChapter 3: Describing Data

Example:

Finding the mode for grouped data

Step 1:

Cumulative Frequency Distribution

Page 35: Lesson 002

6-B. The Mode for Grouped DataChapter 3: Describing Data

Step 1:

Cumulative Frequency Distribution

Step 2:

Determine the position of the median and the median class

Page 36: Lesson 002

6-B. The Mode for Grouped DataChapter 3: Describing Data

Step 1:

Cumulative Frequency Distribution

Step 2:

Determine the position of the median and the median class

Step 3:

Draw two lines (value & position)

Median – 100 150 - 100

300.5 – 201388 - 201=

AA

BB

Value: 100 Median 150

Position: 201 300.5 388

Median = 300.5 – 201388 - 201 * 50 + 100 = 126.60 (dollars)

Page 37: Lesson 002

SummaryChapter 3: Describing Data

• Chapter 3: – Calculate the arithmetic mean, weighted mean, median,

mode, and geometric mean– Explain the characteristics, uses, advantages, and

disadvantages of each measure of location– Identify the position of the mean, median, and mode for

both symmetric and skewed distributions– Compute and interpret the range, mean deviation,

variance, and standard deviation– Understand the characteristics, uses, advantages, and

disadvantages of each measure of dispersion– Understand Chebyshev’s theorem and the Empirical Rule

as they relate to a set of observations

Page 38: Lesson 002

8. The Relative Positions of the Mean, Median, and Mode

Chapter 3: Describing Data

skewed

Page 39: Lesson 002

8. The Relative Positions of the Mean, Median, and Mode

Chapter 3: Describing Data

Zero skewness

mode=median=mean

Zero skewness

mode=median=mean

Page 40: Lesson 002

7. The Relative Positions of the Mean, Median, and Mode

Chapter 3: Describing Data

positive skewness

Mode median mean

positive skewness

Mode median mean< <

Page 41: Lesson 002

8. The Relative Positions of the Mean, Median, and Mode

Chapter 3: Describing Data

negative skewness

Mode median mean

negative skewness

Mode median mean> >

Page 42: Lesson 002

8. The Relative Positions of the Mean, Median, and Mode

Chapter 3: Describing Data

Page 43: Lesson 002

9. The Geometric MeanChapter 3: Describing Data

Geometric mean (GM) :a set of n numbers is defined as the nth root of the product of the n

numbers.

The formula is:

The geometric mean is used to average percents, indexes, and relatives.

The geometric mean is not applicable when some numbers are negative.

n n321 ))...(X)(X)(X(X=GM

Page 44: Lesson 002

9. The Geometric MeanChapter 3: Describing Data

n n321 ))...(X)(X)(X(X=GM

Example: Suppose you receive a 5 percent increase in salary this year and a 15 percent increase next year. The average annual percent increase is 9.886, not 10.0. Why is this so? We begin by calculating the geometric mean.

098861151051 . ).)(.(GM

Not understand percentage?Click here

Page 45: Lesson 002

9. The Geometric MeanChapter 3: Describing Data

n n321 ))...(X)(X)(X(X=GM

Example: The return on investment earned by Atkins construction Company for four successive years was: 30 percent, 20 percent, -40 percent, and 200 percent. What is the geometric mean rate of return on investment?

..).)(.)(.)(.(GM 2941808203602131 44

Page 46: Lesson 002

9. The Geometric MeanChapter 3: Describing Data

1period) of beginningat (Value

period) of endat (Value=GM n -

Geometric mean (GM) :Another use of the geometric mean is to determine the percent increase in

sales, production or other business or economic series from one time period

to another.

Page 47: Lesson 002

9. The Geometric MeanChapter 3: Describing Data

1period) of beginningat (Value

period) of endat (Value=GM n -

Example: The total number of females enrolled in American colleges increased from

755,000 in 1992 to 835,000 in 2000.

.0127=1 - 755,000

835,000=GM 8

That is, the geometric mean rate of increase is 1.27%.

Page 48: Lesson 002

9. The Geometric MeanChapter 3: Describing Data

Example: A banker wants to get an annual return of 100% on its loan in credit card

business. What monthly interest rate should he charge?

.059=1 - 100

200=GM 12

A monthly interest rate of 5.9%.

Page 49: Lesson 002

9. The Geometric MeanChapter 3: Describing Data

Example: The Chinese government claimed in 1990 that their GDP will double in 20

years.

What must the annual GDP growth rate be for this dream to come true?

A annual GDP growth of 3.5%.

.035=1 - 100

200=GM 20

Page 50: Lesson 002

9. The Geometric MeanChapter 3: Describing Data

Example: The 2006 population size of Duval County was 837,964. The population grew by7.6% between 2000 and 2006. We want to project the size of the population in2030, assuming that the growth rate remains the same; i.e., 7.6% every 6 years.

The Projected population size in 2030 is (1.0764 X 837,964) = 1123245. Theaverage growth rate over the 24 years is found by calculating the geometric mean:

The average growth rate is just what we expect.

076.1076.1076.1076.1076.14 GM

Page 51: Lesson 002

Exercise

P71. N.32 Ch.3

In 1976 the nationwide average price of a gallon of unleaded gasoline at a self-serve pump was $0.605. By 2005 the average price had increased to $2.57. What was the geometric mean annual increase for the period?

Chapter 3: Describing Data

5.11% found by -1292.570.605

Page 52: Lesson 002

ReviewChapter 2: Describing Data

P27. N.4 Ch.2

Qualitative DataQualitative Data

Two thousand frequent mIdwestern business travelers are asked which Midwest city they prefer: Indianapolis, Saint Louis, Chicago, or Milwaukee.

The results were 100 liked Indianapolis best, 450 liked Saint Louis, 1,300 liked Chicago, and the remainder preferred Milwaukee. Develop a frequency table and a relative frequency table to summarize this information.

Page 53: Lesson 002

ReviewChapter 2: Describing Data

P34. N.12 Ch.2

The daily number of oil changes at the Oak Streek outlet in the past 20 days are:

The data are to be organzied into a frequency distribution. a. How many classes would you recommend?

24 = 16, 25 = 32, suggests 5 classes

i > ≈ 1099 - 51

5

Use interval of 10

b. What class interval would you suggest?

Page 54: Lesson 002

ReviewChapter 2: Describing Data

P34. N.12 Ch.2

The daily number of oil changes at the Oak Streek outlet in the past 20 days are:

The data are to be organzied into a frequency distribution.

c. What lower limit would you recommend for the first class?

start first class at 50

Page 55: Lesson 002

ReviewChapter 2: Describing Data

P34. N.12 Ch.2

The daily number of oil changes at the Oak Streek outlet in the past 20 days are:

d. Organize the number of oil changes into a frequency distribution.

Page 56: Lesson 002

ReviewChapter 2: Describing Data

P34. N.12 Ch.2

The daily number of oil changes at the Oak Streek outlet in the past 20 days are:

e. Comment on the shape of the frequency distribution. Also determine the relative frequency distribution.

The fewest number is about 50, the highest about 100.

The greatest concentration is in classes 60 up to 70 and 70 up to 80.

Page 57: Lesson 002

Exercise

P65. N.20 Ch.3

Chapter 3: Describing Data

Determine the mean, median, mode12 8 17 6 11 14 8 17 10 8

Mean=11.10; Median=10.50; Mode=8

Page 58: Lesson 002

Exercise

P60. N.2 Ch.3

a. Compute the mean of the following population values: 7, 5, 7, 3, 7, 4

Chapter 3: Describing Data

μ = 5.5 found by (7+5+7+3+7+4)/6

Page 59: Lesson 002

Exercise

P60. N.4 Ch.3

Compute the mean of the following sample values: 1.3 7.0 3.6 4.1 5.0

b. Show that Σ(X - X)=0

Chapter 3: Describing Data

(1.3-4.2)+(7.0-4.2)+(3.6-4.2)+(4.1-4.2)+(5.0-4.2)=0

X = 4.2 found by 21/5

Page 60: Lesson 002

More Information

Source: Keller, Statistics for Management and Economics, 2005

Page 61: Lesson 002

More InformationPercentage

We added memory to our computer system. We had 96 MB of main memory and now with our new addition, we have 256 MB of main memory. I would like to figure out what percent increase this represents. If you go from 100 MB of memory to 200 MB then you've increased it by 100 percent, because the amount of the increase (100 MB) is 100% of the original amount (100 MB).

That is... if you double your memory then you've increased it by 100 percent. If you add another 100 MB, you're adding another 100% of the original amount, so you have a 200% increase, from 100 MB to 300 MB.

In this case, you have gone from about 100 to about 250. Since 250 is halfway between 200 MB and 300 MB, you could guess that the answer is about 150 percent. Does this make sense? Now let's find the actual value.

I'm going to do a simple example first so you see how percentages work. If I go from 100 MB to 105 MB, what is the percent increase? In this case, the numbers are straightforward: the increase (5 MB) is 5 percent of the original amount (100 MB). But we can use a method that will work even when the numbers aren't this tidy: I ask: 100 times what number will give me 105? 100 * x = 105 x = 105 / 100 x = 1.05 Then I ask: What increase is that over 100%? x - 1 = 1.05 - 1 = 0.05 = 5/100 = 5% So I have an increase of 5%. Now let's do the same thing with your numbers: 1) 96 * x = 256 x = 256 / 96 x = 2.67 2) x - 1 = 2.67 - 1 = 1.67 = 167/100 = 167% which is pretty close to the original estimate of 150%. That gives us some confidence that we have the right answer.


Recommended