+ All Categories
Home > Documents > Frequency

Frequency

Date post: 16-Nov-2014
Category:
Upload: api-3850829
View: 879 times
Download: 2 times
Share this document with a friend
Popular Tags:
36
Frequency Distribution
Transcript
Page 1: Frequency

Frequency Distribution

Page 2: Frequency

FREQUENCY DISTRIBUTION

4 6 5 4 4 3 5 4 5 3 4 4 5 6 4 5 5 3 3 5 7 6 5 5 4

Table 1:- shows the daily number of car accident in a certain city recorded over the period of 30 days.

Simple series or ungrouped data: It is a series of observations recorded

without any systematic arrangement e.g. Table 1.The ungrouped data will be so numerous that their significance will not be readily comprehended. In such cases it will become necessary to summaries the data to an easily manageable form. We may summaries the raw data with the help of Frequency Distribution.

Page 3: Frequency

FREQUENCY DISTRIBUTION

Frequency Distribution is a statistical table which shows the values of the variable arranged in order of magnitude and also the corresponding frequencies side by side.

 Q) What is Frequency?

Number of times the particular value is repeated is known as frequency. In order to facilitate counting we may prepare a column of Tally. In another column, places all possible values of the variable from the lowest to highest and count the number of tally bar corresponding to each value of the variable and place it in the column of frequency, as below

Page 4: Frequency

Daily no of car accidents

Tally Bar Frequency(No of days)

34567

IIIIIIIIIIIIIIIIIIIIIIIIIIIIII

591141

Total 30

Table 2:Simple frequency distribution of car accidents.

Page 5: Frequency

Types of frequency distribution

There are two types of frequency distribution

1) Simple frequency Distribution-shows the value of the variable individually, e.g. Table 2

2) Grouped Frequency distribution-shows the value of the variable in groups. Generally when the number of variable is too large using grouped frequency distribution would be more advantageous than that of simple frequency distribution. For example: consider the following data showing the age (in years) of 200 unemployed persons who registered their names in an Employment Exchange on a certain day.

Page 6: Frequency

33 26 33 30 28 25 28 22 17 30

26 17 29 22 18 40 20 33 21 25

25 20 21 29 27 26 19 37 32 21

20 40 20 20 27 19 20 20 29 20

21 49 34 26 29 31 25 33 23 16

21 20 23 29 23 17 23 27 43 22

24 26 29 29 20 17 32 23 24 25

36 19 28 26 31 52 25 57 16 26

27 19 24 19 39 23 24 25 17 19

31 18 26 23 19 30 20 19 50 40

22 27 17 22 43 33 22 17 21 25

24 46 27 20 26 20 21 24 22 21

25 22 24 32 24 20 22 15 34 22

22 20 37 19 24 21 28 27 19 31

22 24 17 29 24 17 23 21 21 31

31 18 22 19 33 25 21 25 23 21

27 18 15 19 24 19 22 22 16 21

17 18 25 31 26 29 24 31 23 18

31 24 23 21 24 18 21 19 30 24

24 22 30 19 25 21 21 25 45 24

Page 7: Frequency

The first step towards summarization of data is to arrange the figure in the form of a simple frequency distribution as

follows:Ages(year) Frequency Ages(year) Frequency Ages(year) Frequency1516171819202122232425

23108 14151916112014

2627282930313233343637

107395836212

394043454649505257

132111111

Total 200

Page 8: Frequency

Although the table gives a somewhat clear picture under the simple frequency distribution but since the number of observations are too large it will be more representable if we group the observations(age) into number of intervals i.e. 15-19, 19-24 etc. and shows the frequency of occurrence (here the number of persons falling in the particular age group)separately for each interval. Such frequency table is known as Group Frequency Distribution.

Age in year Frequency(No of persons)

15-1920-2425-2930-3435-4445-49

3781432496

Total 200

Page 9: Frequency

Different component of Grouped frequency distribution

Page 10: Frequency

a) Class Interval

When the numbers of observations are too large i.e. varying in wide range, these are usually classified in several groups according to their size of values. Each of these groups defined as class interval or simply class.

In our example we have classified the ages into 6 different interval i.e. 15-19,…….., 45-49 since the ages are varying in wide range from 17 to 57.

Page 11: Frequency

Although there is no hard and fast rule regarding the number of class intervals, It is generally agreed that the number classes neither be too large then the basic purpose of classification (i.e. summarization of data to an easily manageable form.) will not be served nor be too small then the true nature of the distribution will be obscured.

As a working rule this number should lie between 5 and 15 depending upon the number of observations available.

Page 12: Frequency

One should have class intervals of either five or multiple of five like 10, 20, 25 etc.The reason is that the human mind is accustomed more to think in terms of certain multiples of 5,10 and the like.

Classes must be mutually exclusive means that each of the given values is included in one of the classes .

For Example:- 15-19 , 20-24 etc. (not like 15-20, 20-25 etc )

Page 13: Frequency

b) Class Frequency

The number of observations falling within a class is called class frequency. Sum of all class frequency is total frequency. In our example the class frequencies are 37,81 etc.

Page 14: Frequency

c) Class Limit

The two numbers used to specify the limits of a class interval are called Class Limits. The smaller of the pair is known as lower class limit and the larger as upper class limit. In our example, column 3 is the lower class limits and column 4 is the upper class limits as shown in the Table-3.

Page 15: Frequency

Class Interval

(1)

Class frequency

(2)

Class LimitLower

(3)Upper

(4)

15-1920-2425-2930-3435-4445-49

3781432496

152025303545

192429344459

Total 200Table-3

Page 16: Frequency

d) Class Boundaries

In our present example the variable is discrete since the age is expressed only in years.

But if we consider age as continuous variable, then it may be 14.5 years of 19.5 years, then the concept of class limit will not suffice.(In which class the observation 14.5 or19.5 will be included?). So in case of continuous variable, certain adjustment in the class interval is needed to obtain the continuity.

Page 17: Frequency

The extreme values which would ever be included in the class interval in order to obtain the correct class interval in case of continuous variable is known as Class Boundaries.

The lower extreme point is known as the lower class boundary and the upper extreme point is known as the upper class boundary with reference to the particular class. In our example, column (5) shows the lower class Boundaries and column (6) shows the upper class Boundaries (as shown in the Table-4)

Page 18: Frequency

= ----------------------------------------------------------------- 2

Lower Class Boundary= Lower Class Limit - dUpper Class Boundary= Upper Class Limit + d

(Lower limit of the second class - Upper limit of the first class)

Correction Factor (d)

Page 19: Frequency

Class Interval

(1)

Class frequency

(2)

Class Boundaries

Lower(5)

Upper(6)

15-1920-2425-2930-3435-4445-49

3781432496

14.519.524.529.534.544.5

19.524.529.534.544.559.5

Total 200Table-4

Page 20: Frequency

e) Class Mark

The value exactly at the middle of a class interval is called Class Mark. Class mark is the representative value of the class interval, for the calculation of mean, SD etc.In our example column (7) of table -5 shows the class marks.

Class Mark= (Lower Class Limit+ Upper Class Limit) ÷ 2

Or

(Lower Class Boundary+ Upper Class Boundary) ÷ 2

Page 21: Frequency

Class Interval

(1)

Class frequency

(2)

Class

Mark

(7)

15-19

20-24

25-29

30-34

35-44

45-59

37

81

43

24

9

6

15+19 = 34 ÷2= 17

20+24 = 44 ÷2= 22

25+29 =54 ÷2 = 27

30+34 = 64 ÷2 = 32

35+44 =79 ÷2 = 39.5

45+59 =104 ÷2= 52

Total 200

Page 22: Frequency

e) Width of class or Size of the class

1) Width of class (in case of continuous variable)

= Upper Class Boundary - Lower Class Boundary.

or

Upper Class limit - Lower Class limit.

For example:-

Class limits Width

15 – 20

20 - 25

5

5

Class boundaries Width

14.5 – 19.5

19.5 – 24.5

5

5

Page 23: Frequency

2) Width of class (in case of discrete variable)

= Upper Class limit of the second class - Lower Class limit of the first class.

For example:-

Class limits Width

15-19

20-24

25-29

30-34

35-44

45-59

20 -15 = 5

25 – 20 = 5

30 -25 = 5

35 – 30 = 5

45 – 35 = 10

60 – 45 = 15

Page 24: Frequency

How to choose the number of classes and class limits

Step I :- Remember the following points The number of classes should be

preferably between 5 and 15.The width of the classes should be

preferable be 5 or its multiple e.g., 15-19,20-24 etc.

One of the class limit being preferable a multiple of 5.

Page 25: Frequency

Step II :-Apply the Sturges formula for determining the approximate number of classes,i.e.

k = 1 + 3.322 log NWhere k = The approximate number of classes

N = Total number of observation

Page 26: Frequency

Step III :-The third step is to find the maximum and minimum of the observations and find the “range” i.e.

Range = Maximum - Minimum value

If we like to have classes of equal width, then the approximate width of the class may be obtained on dividing the range by the approximate number of classes i.e.

Page 27: Frequency

Range

Approximate width = -------------------------------

1 + 3.322 log N

Page 28: Frequency

For example:-

Problem 1:-

The profits (in lakh rupees) of 30 companies for the year 2008 is given below.

20,22,35,42,37,42,48,53,49,65,39,48,67,

18,16,23,37,35,49,63,65,55,45,58,57,69,25,29,58,65.

Classify the above data taking suitable class-interval

Page 29: Frequency

Solution:-Step I :- Find the approximate number of classes.

k = 1 + 3.322log N

Where N = 30

K = 1 + 3.322 log 30 = 1 + 3.322 x 1.4771 = 7 + 4.91 = 5.91 or 6

Page 30: Frequency

Step II :-Width of the classes Maximum - MinimumWidth of the classes = ----------------------------------------------------- 1 + 3.322log N

( 69 – 16) = ------------------------------- 5.91 53 = --------------- 5.91

= 8.97 or 9

But since width of the classes preferable be 5 or multiple of 5 i.e. 10,15We consider width as 10 instead of 9.

Page 31: Frequency

Profit (Rs. Lakh) No of companies

15 – 24 5

25 – 34 2

35 – 44 7

45 - 54 6

55 – 64 5

65 – 74 5

Total 30

Frequency distribution of the profits

Page 32: Frequency

The lower limit of the first class may not coincide with the minimum observation,since the width of the classes and the one of the class limit being preferable multiple of 5.

For example:- Here lowest value of the data is 16 and we have to take the classes of width 10 and one of the class limit as multiple of 5, then the lower limit of the first class may not coincide with the minimum observation i.e. it should be 15 – 24 not 16 – 25.

Page 33: Frequency

Problem 2:-

The following set of numbers presents the mutual fund prices at the end of the week for the selected 40 nationally sold funds

11,17,15,22,11,16,19,24,29,18

25,26,32,14,17,20,23,27,30,12

15,18,24,36,18,15,21,28,33,38

34,13,11,16,20,22,29,29,23,31

Classify the above data taking suitable class-interval

Page 34: Frequency

Solution

Ans) k = 1 + 3.322 log 40 = 1+ 3.322 x 1.602 = 1 + 5.32 = 6.32 or 6

(38 – 11) Width = ------------------ = 4.27 or 5 6.32

Page 35: Frequency

Class Interval Frequency

10 – 14 6

15 - 19 11

20 – 24 9

25 - 29 7

30 – 34 5

35 - 39 2

Total 40

Page 36: Frequency

Recommended