Date post: | 26-Dec-2015 |
Category: |
Documents |
Upload: | andrew-mcdowell |
View: | 214 times |
Download: | 0 times |
2- 1
Chapter
Two
McGraw-Hill/Irwin
© 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Describing Data:Frequency Distribution & Graphic Presentation
2- 2
‘Organizing’ data – Remember COPAID?
A Frequency DistributionFrequency Distribution is a grouping of data into mutually exclusive
categories showing the number of observations in each class.
Each class has an interval of $3K
2- 3
Decide on the number of classes
5 Steps to constructing a frequency distribution
Determine the class interval
Set the individual class limits
Tally the number of items in each class
Count the number of items in each class
2- 4
Example
RAW DATA - 80 data points
Let us ‘organize’ this raw data into different price ranges.
2- 5
Example 1 continued
Step OneStep One:: Decide on the number of classes using the formula
22kk > n > n (2 to the k rule)(2 to the k rule)
where k=number of classes n=number of observations
There are 80 observations, so n=80. 266 = 64 & 277 = 128. 277 > 80 > 80Therefore, we should have at least 7 classes, i.e., k=7.
2- 6
where H=highest value, L=lowest value
35925 – 15546 7
= $2911
Step TwoStep Two: Determine the class interval or width using the formula
H – LH – L kk
i > =
Round up for an interval of $3000.Set the lower limit of the first class at $15000.Guideline: Make the lower limit of the first class a multiple of the class interval.
Example 1 continued
2- 7
EXAMPLE 1 continued
Step ThreeStep Three: Set the individual class limits
2-14
2- 8
Steps FourSteps Four: Tally the number of items in each class.
2- 9
Step FiveStep Five: (last step!)Count the number of items in each class.
The above table is called a Frequency Distribution(one way of ‘organizing’ initial raw data – again remember COPAID) .
You can also present in a graph – we will see that later.
2- 10
A Relative Frequency DistributionRelative Frequency Distribution shows the fraction of observations in each class. (easier to compare classes)
You can also express Relative Frequency in %ages. For the above data, it will be 10%, 28.75%, 21.25%, …
2- 11
Try out
problem #6
in page 31
2- 12
Visuals used in StatisticsVisuals used in Statistics
•HistogramsHistograms•Frequency PolygonsFrequency Polygons•Line & Bar graphsLine & Bar graphs•Pie chartsPie charts•Scatter diagramsScatter diagrams•Contingency tablesContingency tables•Pareto chartsPareto charts
2- 13
HistogramA graph with
X- axis: ClassesY- axis: Frequency
Histogram gives the frequency distribution of data
2- 14
Graphic Presentation of a Frequency Distribution
A Frequency PolygonFrequency Polygon
- a line graph connecting the points formed by the class midpoint and the class frequency.
2- 15
2- 16
Frequency polygons allow comparison of 2 more frequency distributions
Use %age frequencies if actual frequencies vary widely
2- 17
A Cumulative Frequency DistributionCumulative Frequency Distribution is used to determine how many or what proportion of the data values are below or above a certain value.
Cumulative Frequency DistributionCumulative Frequency Distribution
Find the price below which 25 vehicles were sold.Find the price below which half the cars were sold.Find %age of vehicles sold priced below $28500.
2- 18
Practice time!
(Problem #14 Page 41)
2- 19
Line graphs are typically used to show the change or trend of a variable over time.
U.S. median age by gender
25
30
35
40
Med
ian
Age
Males
Females
2- 20
Example 3 continued
Another example of Line Graph
2- 21
A Bar ChartBar Chart - useful to show data of any level of measurement.
Bar Chart
2- 22
Pie Chart
A Pie ChartPie Chart is useful for displaying a relative frequency distribution.
- a circle is divided proportionally to the relative frequency
Along with the % of each slice, you can also show the actual values
2- 23
Scatter diagram
Example The twelve days of stock prices and the overall market index on each day are given as follows:
Variables must be at least interval scaled.
Relationship can be positive (direct) or negative (inverse).
A graph showing relationship between two variables
2- 24
969291888685848382797869
PriceIndex(000s)
8.07.57.57.37.27.27.17.17.06.26.25.1
Relationship between Market Index and Stock Price
50
60
70
80
90
100
5 6 7 8 9 10
Index
Pri
ce
Scatter Diagram
2- 25
A contingency table is a cross tabulation of two variables.
Contingency tables are used when one or both variables are nominal or ordinal in scale.
Contingency table
2- 26
Male Female
Smokers 100 60
Non-
smokers
850 1075
Example Contingency Table
Good Bad Ugly
Dumbo 1 25 74
As Good as It Gets
63 22 15
Nominal
Ordinal
2- 27
•a type of histogram; arrange the bars from tallest to shortest
•used in a process improvement project
•Eg. If you record the reasons for a machine breaking down, you might find some problems to be more common than others.
•If you record the frequency of each of these reasons, you will notice that a small number of reasons will account for most of the breakdowns.
•Pareto Principle, or the 80-20 rule. In general about 80% of the problems will result from about 20% of the causes.
Pareto Chart
2- 28 Number of Defects
paper particle buildup 74
excessive temperature 38
worn roller 5
defective paper 10
guides misaligned 26
Pareto Chart
Problems of a
photocopier machine
2- 29
Number of Defects
Repair Cost ($) Total Cost
paper particle buildup
74 30 2220
excessive temperature
38 75 2850
worn roller 5 190 950
defective paper 10 20 200
guides misaligned 26 30 780
Pareto Chart