+ All Categories
Home > Documents > Introduction to Summary Statistics

Introduction to Summary Statistics

Date post: 31-Dec-2015
Category:
Upload: marsden-emerson
View: 73 times
Download: 3 times
Share this document with a friend
Description:
Introduction to Summary Statistics. Statistics. The collection, evaluation, and interpretation of data. Statistical analysis of measurements can help verify the quality of a design or process. Summary Statistics. Central Tendency “Center” of a distribution Mean, median, mode Variation - PowerPoint PPT Presentation
32
Introduction to Summary Statistics
Transcript
Page 1: Introduction to Summary Statistics

Introduction to Summary Statistics

Page 2: Introduction to Summary Statistics

Statistics• The collection, evaluation, and interpretation of

data

• Statistical analysis of measurements can help verify the quality of a design or process

Page 3: Introduction to Summary Statistics

Summary Statistics

Central Tendency• “Center” of a distribution

– Mean, median, mode

Variation• Spread of values around the center

– Range, standard deviation, interquartile range

Distribution• Summary of the frequency of values

– Frequency tables, histograms, normal distribution

Page 4: Introduction to Summary Statistics

• The mean is the sum of the values of a set of data divided by the number of values in that data set.

Mean Central Tendency

μ  = ∑ x iN

Page 5: Introduction to Summary Statistics

Mean Central Tendency

= mean value

xi = individual data value

= summation of all data values

N = # of data values in the data set

μ  = ∑ x iN

Page 6: Introduction to Summary Statistics

• Data Set

3 7 12 17 21 21 23 27 32 36 44• Sum of the values = 243• Number of values = 11

Mean = 24311

= 22.09=μ  = ∑ x iN

Mean Central Tendency

Page 7: Introduction to Summary Statistics

• General Rule: Don’t round until the final answer– If you are writing intermediate results you may

round values, but keep unrounded number in memory

• Mean – round to one more decimal place than the original data

• Standard Deviation: Round to one more decimal place than the original data

A Note about Rounding in Statistics

Page 8: Introduction to Summary Statistics

• Data Set

3 7 12 17 21 21 23 27 32 36 44• Sum of the values = 243• Number of values = 11

• Reported: Mean =

Mean =24311

==μ  = ∑ x iN

Mean – Rounding

22.1

22.09

Page 9: Introduction to Summary Statistics

Mode Central Tendency

• Measure of central tendency• The most frequently occurring value in a

set of data is the mode• Symbol is M

27 17 12 7 21 44 23 3 36 32 21

Data Set:

Page 10: Introduction to Summary Statistics

• The most frequently occurring value in a set of data is the mode

3 7 12 17 21 21 23 27 32 36 44

Data Set:

Mode = M = 21

Mode Central Tendency

Page 11: Introduction to Summary Statistics

• The most frequently occurring value in a set of data is the mode

• Bimodal Data Set: Two numbers of equal frequency stand out

• Multimodal Data Set: More than two numbers of equal frequency stand out

Mode Central Tendency

Page 12: Introduction to Summary Statistics

Determine the mode of

48, 63, 62, 49, 58, 2, 63, 5, 60, 59, 55Mode = 63

Determine the mode of

48, 63, 62, 59, 58, 2, 63, 5, 60, 59, 55Mode = 63 & 59 Bimodal

Determine the mode of

48, 63, 62, 59, 48, 2, 63, 5, 60, 59, 55Mode = 63, 59, & 48 Multimodal

Mode Central Tendency

Page 13: Introduction to Summary Statistics

• Measure of central tendency• The median is the value that occurs in the

middle of a set of data that has been arranged in numerical order

• Symbol is x, pronounced “x-tilde”~

Median Central Tendency

Page 14: Introduction to Summary Statistics

• The median is the value that occurs in the middle of a set of data that has been arranged in numerical order

Data Set:27 17 12 7 21 44 23 3 36 32 21

Median Central Tendency

3 7 12 17 21 21 23 27 32 36 44

Page 15: Introduction to Summary Statistics

• A data set that contains an odd number of values always has a Median

3 7 12 17 21 21 23 27 32 36 44

Data Set:

Median Central Tendency

Page 16: Introduction to Summary Statistics

• For a data set that contains an even number of values, the two middle values are averaged with the result being the Median

3 7 12 17 21 21 23 27 31 32 36 44

Data Set:

Median Central Tendency

Middle of data set

Page 17: Introduction to Summary Statistics

• Measure of data variation• The range is the difference between the

largest and smallest values that occur in a set of data

• Symbol is R

Range = R = maximum value – minimum value

3 7 12 17 21 21 23 27 32 36 44Data Set:

Range Variation

R = 44 – 3 = 41

Page 18: Introduction to Summary Statistics

• Measure of data variation• The standard deviation is a measure of

the spread of data values– A larger standard deviation indicates a wider

spread in data values

Standard Deviation Variation

Page 19: Introduction to Summary Statistics

Standard Deviation Variation

σ=√∑ (x i−   μ )2

N

σ = standard deviation

xi = individual data value ( x1, x2, x3, …)

μ = mean

N = size of population

Page 20: Introduction to Summary Statistics

Standard Deviation Variation

Procedure

1. Calculate the mean, μ

2. Subtract the mean from each value and then square each difference

3. Sum all squared differences

4. Divide the summation by the size of the population (number of data values), N

5. Calculate the square root of the result

σ=√∑ (x i−   μ )2

N

Page 21: Introduction to Summary Statistics

• General Rule: Don’t round until the final answer– If you are writing intermediate results you may

round values, but keep unrounded number in memory

• Standard Deviation: Round to one more decimal place than the original data

A Note about Rounding in Statistics, Again

Page 22: Introduction to Summary Statistics

Standard Deviation

2, 5, 48, 49, 55, 58, 59, 60, 62, 63, 63

Calculate the standard deviation for the data array

524

111. Calculate the mean

2. Subtract the mean from each data value and square each difference

(2 - )2 = 2082.6777 (5 - )2 = 1817.8595(48 - )2 = 0.1322(49 - )2 = 1.8595(55 - )2 = 54.2231(58 - )2 = 107.4050

(59 - )2 = 129.1322(60 - )2 = 152.8595(62 - )2 = 206.3140(63 - )2 = 236.0413(63 - )2 = 236.0413

(x i−   μ )2μ  = 

∑ x iN

σ=√∑ (x i−   μ )2

N

¿ 47.63

Page 23: Introduction to Summary Statistics

Standard Deviation Variation

3. Sum all squared differences 2082.6777 + 1817.8595 + 0.1322 + 1.8595 + 54.2231 + 107.4050 + 129.1322 + 152.8595 + 206.3140 + 236.0413 + 236.0413

= 5,024.5455

4. Divide the summation by the number of data values

5. Calculate the square root of the result

=

= = 456.7769

√∑ (x i   −   μ )2

N=√456.7769  = 21.4

Note that this is the sum of the unrounded squared differences.

Page 24: Introduction to Summary Statistics

• A histogram is a common data distribution chart that is used to show the frequency with which specific values, or values within ranges, occur in a set of data.

• An engineer might use a histogram to show the variation of a dimension that exists among a group of parts that are intended to be identical.

Histogram Distribution

0.74

50.

747

0.74

90.

751

0.75

30.

755

0.75

70.

759

0

2

4

Length (in.)

Fre

qu

en

cy

Page 25: Introduction to Summary Statistics

• Large sets of data are often divided into a limited number of groups. These groups are called class intervals.

-5 to 5

Class Intervals6 to 16-16 to -6

Histogram Distribution

Page 26: Introduction to Summary Statistics

• The number of data elements in each class interval is shown by the frequency, which is indicated along the Y-axis of the graph.

Fre

qu

ency

1

3

5

7

-5 to 5 6 to 16-16 to -6

Histogram Distribution

Page 27: Introduction to Summary Statistics

3

ExampleF

req

uen

cy

1

2

4

6 to 10 11 to 151 to 5

1, 7, 15, 4, 8, 8, 5, 12, 10

12,15 1, 4, 5, 7, 8, 8, 10,

Histogram Distribution

0.5 5.5 10.5 15.5

0.5 < x ≤ 5.5 5.5 < x ≤ 10.5 10.5 < x ≤ 15.5

Page 28: Introduction to Summary Statistics

• The height of each bar in the chart indicates the number of data elements, or frequency of occurrence, within each range.

Histogram Distribution

3

Fre

qu

ency

1

2

4

6 to 10 11 to 151 to 5

12,15 1, 4, 5, 7, 8, 8, 10,

Page 29: Introduction to Summary Statistics

0.745

0.746

0.747

0.748

0.749

0.750

0.751

0.752

0.753

0.754

0.755

0.756

0.757

0.758

0.759

0.760

0

1

2

3

4

5

Length (in.)

Fre

qu

en

cy

MINIMUM = 0.745 in.

MAXIMUM = 0.760 in.

Histogram Distribution

0.7495 < x ≤ 0.7505

Page 30: Introduction to Summary Statistics

0 1 2 3 4 5 6-1-2-3-4-5-6

0

3

-1

3

2

-1

-1

1

2

-3

0

1

0

1

-2

1

2

-4

-1

1

0

-2

0

0

Dot Plot Distribution

Page 31: Introduction to Summary Statistics

0 1 2 3 4 5 6-1-2-3-4-5-6

0

3

-1

3

2

-1

-1

1

2

-3

0

1

0

1

-2

1

2

-4

-1

1

0

-2

0

0

Fre

qu

ency

1

3

5

Dot Plot Distribution

Page 32: Introduction to Summary Statistics

Fre

qu

ency

Data Elements

0 1 2 3 4 5 6-1-2-3-4-5-6

Bell shaped curve

Normal Distribution Distribution


Recommended