The gender gap in educational attainment in India: How much...

The gender gap in educational attainment in India: How much can be explained?

by

Geeta Gandhi Kingdon

Department of Economics

University of Oxford

August, 2001

Abstract

Differential treatment of sons and daughters by parents is a potential explanation of the gender gap in education in developing countries. This study empirically tests this explanation for India using household survey data collected in urban Uttar Pradesh in 1995. We estimate educational enrolment functions and selectivity-corrected educational attainment functions, conditional on enrolment. The gender difference in educational attainment is decomposed into the part that is explained by men and women’s differential characteristics and the part that is not so explained (the conventional ‘discrimination’ component). The analysis suggests that girls face significantly different treatment in the intra-household allocation of education – there is a large unexplained component in the gender gap in schooling attainment. A detailed decomposition exercise attempts to discover the individual factors most responsible for the differential treatment.

Keywords: Gender, educational attainment functions, Oaxaca decomposition, India JEL Classification: I21, J16, J23, J31, J71 Correspondence: Department of Economics, University of Oxford, Oxford OX1 3UQ email: [email protected] Acknowledgements: The research was supported by a Wellcome Trust grant No. 053660 and the data collection was funded by a McNamara fellowship of the World Bank awarded to the author in 1995. The paper has benefited from the comments of Jean Drèze and Haris Gazdar. The usual disclaimers apply.

1

1. Introduction

Recent research suggests that female schooling is more important than male schooling

for social outcomes such as fertili ty, child health, and infant mortality1. The literature also

suggests that the economic gains from women’s education are generally at least as high as

those from men’s education [Schultz 1993]. Thus, women’s educational backwardness is of

concern not only because it is inequitable but also because it is socially and economically

inefficient.

In the economics of education literature, there are two frequently cited explanations of

the gender gap in education. Firstly, that the gap is due to labour market discrimination

against women: if the labour market rewards women’s education less well than men’s (i.e. the

rate of return to women’s schooling is lower than to men’s), then girls will face poorer

economic incentives to invest in schooling than boys. A second major explanation for the gap

is that parents treat sons and daughters differentially. This differential treatment may arise

either because of son-preference, which causes parents to give a greater weight to the welfare

of sons, or it may arise because parents value only that part of the return to a child’s schooling

which accrues to them personally - and the returns to a daughter’s education are reaped largely

by her in-laws’ family. This is compounded by the fact that societal norms in some countries

require parents to accumulate a dowry for daughters but not for sons. Thus, girls may lose

out in the intra-household allocation of education because of a potentially strong asymmetry in

parental incentives to educate sons and daughters.

This paper examines the extent to which differential treatment of sons and daughters in

education can be explained in India, a country that suffers from a well-documented high level

of gender-inequality in education, as well as in a number of other welfare measures, such as girl

and boy survival chances, longevity, and anthropometric status [Shariff, 1999]. India, with a

Gender Development Index (GDI) of 0.410 ranks 103 among the 137 countries for which the

GDI has been constructed2 [UNDP 1996]. While gender inequality in education in rural India

2

has received the most attention, even in urban India, this gap is significant. For example, in the

urban data for the present study, women had significantly fewer years of education than men at

the 1 per cent level (see Table 1).

The differential treatment explanation is tested by examining the intra-household

allocation of educational enrolment and of years of schooling. While many studies in India

document the gender gap in school enrolment and educational attainment, only one recent

study [Pal, 2001] examines whether the gender gap persists when other household factors are

held constant in a multivariate framework3. Whether there is gender differentiated treatment

in the intra-household allocation of education in India is not a trivial question: although the

observed gender disparity in educational outcomes (as well as in other welfare measures such

as mortality) in India would suggest that there is likely to be strong parental discrimination

against girls, recent econometric studies of the intra-household allocation of educational

expenditure (as well as of consumption expenditure generally) have failed to unequivocally

confirm such discrimination [Subramanian and Deaton 1990; Subramanian 1995].

Section 2 describes the data and the methodology. Enrolment choice is modelled in

section 3 while section 4 examines the determinants of educational attainment for men and

women. Section 5 applies the Blinder-Oaxaca method to measure the extent of sex

discrimination in the intra-family allocation of education. The final section concludes.

2. Data and method

The data for this study came from a purpose-designed stratified sample survey of 1000

households in 1995 in the Urban Agglomeration of Lucknow district, Uttar Pradesh. The

sampling procedure and details of survey instruments and implementation are given in Kingdon

[1998b]. The household questionnaire based on the pattern of the World Bank’s LSMS

studies obtained information not only on personal characteristics and parental background but

3

also on detailed aspects of household members’ education, time allocation, and labour market

activities, if any. The survey yielded data on 4560 individuals aged 6 years old and over.

Educational enrolment

While modelli ng school enrolment choice is an interesting exercise in its own right, it is

also needed for the consistent estimation of educational attainment functions for reasons

detailed below. Individuals base their decision to participate in schooling upon their evaluation

of the net benefits of schooling, say N= (B - C) where B is total benefit and C is total cost.

Individuals will only enter school if the benefit (B) exceeds the cost (C). Thus, enrolled

individuals are those for whom B>C. For non-enrolled persons, B<=C.

Let I* be the net benefit of enrolli ng in school. That is,

I* = B - C (1)

I* is a function of a set of variables W which affect either the benefit of schooling or the cost of

schooling or both. This can be expressed as

I Wi i i* = +γ ε (2)

where γ is a vector of coefficients and ε a stochastic disturbance term. As I* is unobserved, we

define an indicator variable I such that I=1 when an individual is observed to be enrolled, and

I=0 when an individual is not enrolled. Thus, individuals are faced with a dichotomous choice:

Ii =1 if I Wi i i* > ⇒ + >0 0γ ε

Ii = 0 if I Wi i i* ≤ ⇒ + ≤0 0γ ε (3)

Thus, the sample selection rule (SSR) for school enrolment is that

I*>0

⇒ γ εWi i+ > 0

⇒ ε γi iW> − (4)

4

If it is assumed that ε is normally distributed with zero mean and unit variance, then the choice

between enrolment or not can be written as a probit model4 where the probability of enrolment

is given by

pr I pr I pr Wi i i i( ) ( ) ( )*= = > = + >1 0 0γ ε

= > −pr Wi i( )ε γ

= −Φ( )γWi (5)

where Φ(.) is the normal distribution function. This probability can be estimated using

maximum likelihood methods [Greene 1993]. Since the choice under consideration is

dichotomous - enrolment or not - a binary formulation of the probit is used.

Educational attainment

It is desired to fit educational attainment (years of education, or EDYRS) functions

separately for men and women in an unbiased fashion. This raises the issue of the censoring of

the dependent variable, EDYRS, at zero. For those who never enrolled, the value of EDYRS is

equal to zero. When the dependent variable is censored at zero for a signi ficant fraction of the

observations, conventional regression methods fail to account for the qualitative difference

between limit (zero) observations - that is, never-enrolled persons for whom EDYRS=0 - and

non-limit observations, that is, persons who enrolled and for whom EDYRS>0. The method

that uses only non-limit observations suffers from sample selection bias, since persons for

whom EDYRS takes positive values may not be a random draw from the population, but a self-

selected (or hierarchially selected) group. This is plausible if more highly ambitious or

motivated children are more likely to be enrolled in school than children with lower levels of

these unobserved qualities. With self-selected samples, the mean value of the error term in the

educational attainment equation may not equal zero, violating a basic assumption of the

classical OLS model. More seriously, the error term may be correlated with the included

variables, leading to biased estimates.

5

The tobit procedure has been used in the literature to model censored dependent

variables but it is a restrictive solution, modelli ng simultaneously the decision to receive any

education at all (choice of enrolment in school) and the decision to acquire extra years of

education. A more satisfactory approach is to treat the decision to enrol as essentially separate

from the decision to attain further years of schooling. The Heckman procedure, which unifies

least squares regression and discrete choice models, has now come to dominate the applied

literature where censored or truncated variables are to be modelled. We adopt the Heckman

procedure to model the determinants of the censored variable EDYRS and will estimate the

selectivity term lambda from the enrolment choice model of the next section.

For the purposes of analysing the determinants of educational attainment, one must

have a sample of persons who have completed education. We limit the sample to adults aged

23 and over who have completed education. Only 3.2 per cent of men and 2.0 per cent of

women aged >=23 years old were excluded as they were still enrolled in education. In other

words, most persons had completed their desired (or possible) spells of education by age 22

years.

The survey asked respondents information about the time when they were children of

age 14. It is clear that retrospective variables such as individual’s health as a child (ACHEAL),

assets-owned/ parental wealth (PAWEAL), and whether either parent read daily newspaper

(PANEWS) are likely to be measured with a progressively greater degree of error as age

increases because memory recall deteriorates over time. Given this data limitation, we use an

upper age-limit of 45 years old, that is, use the sub-sample of persons aged 23-45 years for our

analysis and discussion.

The data suggest that an important cause of females' educational disadvantage vis-a-vis

males in the sample district is their much inferior enrolment rate than males. Indeed, as table 1

shows, the disadvantage of females in school enrolment rate is much greater than their

disadvantage in years of education attained (conditional on enrolment), although the latter is

6

still statistically significant. While the average EDYRS of ever-enrolled women is 5.5 per cent

lower than that of males, enrolment of females is 16.6 per cent lower than that of males.

Consequently, we focus not only on the determinants of educational attainment but also

importantly on the determinants of enrolment. Note also in table 1 that the raw gender

differences in both enrolment and EDYRS have narrowed over time.

3. Enrolment choice

Since the enrolment outcome is a binary one, I use the discrete choice probit

framework. A pooled model with both male and female observations and a dummy for gender

shows that sex is a highly important variable in explaining enrolment, with a very large

coefficient and a t-value of 14.5 (Appendix 1). This model shows that even after controlling

for parental background, religion, and caste, girls lose out in the intra-household allocation of

schooling. Given that enrolment varies much by gender, and our interest in gender-based

comparisons, it was appropriate to estimate the discrete choice model of enrolment separately

for males and females rather than imposing the restriction that all coefficients were the same

for males and females in the enrolment function. The definitions of the variables are given in

table 2 and their means and standard deviations in tables 3a and 3b. Table 4 sets out the

estimated enrolment probit.

7

Discussion of results

The estimated parameters of the AGE variables suggest that there has been an

improvement in the enrolment rates of females over time but not in the enrolment rates of

males. The enrolment rate of women aged 31-38 is significantly higher (at the 10 per cent

level) than the enrolment rate of women aged 39-45, which is the base category for age. Age

23-30 is nearly significant at the 10 per cent level too, its lower coefficient perhaps reflecting

the fact that some (enrolled) women in this age range have not yet completed education and so

are excluded from our sub-sample. The negative sign on the age variables in the male equation

is surprising, though the effect is not statistically different from zero at conventional levels of

significance.

Being MUSLIM exerts a powerful negative influence on the probabili ty of enrolment in

both the male and the female models, as does belonging to the low and backward castes

(LOWCASTE). This may imply that even after controlli ng for parental education and wealth,

religious and caste factors are important taste shifters in the demand for education. However, a

more plausible explanation is that the well documented lower returns to education of lowcastes

due to wage and job discrimination lowers their motivation to acquire any schooling5.

Family's economic status when respondent was a child (PAWEAL) is an important

determinant of enrolment for both boys and girls and its effect is concave: higher parental

wealth increases the probabili ty of enrolment, but at a decreasing rate. Thus, even with the

availabili ty of very low-fee primary schooling in India6, enrolment is importantly related with

wealth. Persons from poor backgrounds may have a lower probabili ty of enrolli ng in school

because of the high opportunity cost of enrolli ng - if children are working to supplement family

income – and/or because of their inabili ty to afford other non-fee schooling expenses.

Household survey data on educational spending show that even so-called ‘ fee-free’ schooling

has substantial costs in India. For instance, the PROBE report [Probe Team, 1999, p16] found

that in rural north India in 1995-96, parents spent about Rupees 318 per year on each child

8

who attends government (i.e. tuition-free) school, so that a typical agricultural labourer with 3

such children would have to work for about 40 days in the year just to send them to primary

school.

The parental education variables (MAEDYRS, PAEDYRS, PAEDYRSQ) are all strongly

significant in the female regression but not so in the male enrolment equation, where only

MAEDYRS is significant at the 10 per cent level. The results suggest that while for males to be

enrolled in school, it is not so important that their parents be well-educated, for females,

parental schooling is highly important to their access to education. At the mean of PAEDYRS,

the effect of PAEDYRS is lower than the effect of MAEDYRS in both the male and female

regressions, suggesting that mother’s education matters more to child schooling than does

father’s education.

Whereas bad health as child (ACHEAL) is a significant deterrent to enrolment for boys,

it is apparently no so for girls. There is no mechanism to explain this gender asymmetry except

inferring that parents are more responsive to sons' ailments than to daughters' , something

which is not implausible in the context of India, given the relative neglect of female children' s

ill nesses in India which contributes to its well -known very male sex ratio.

Finally, ‘mother ever worked’ in the labour market (MAWORKED) exerts a negative

influence on enrolment probabili ties in both male and female sub-samples, though its effect is

statistically different from zero at the 10 per cent level only in the female equation. There are

two possible interpretations of this: One suggests that the educational ' cost' of a working

mother falls disproportionately on daughters. For example, if mother working implies that

household tasks must be delegated to children, they are given to girls more often than to boys

if parents care less about girls' schooling. Although we do not have data on time spent in

domestic activities by the respondents when they were children, we do have this data available

for those who are children presently in our dataset (aged 6 to 14 years old). Table 5 shows that

in this sub-sample, in households where mother works, girls' time on domestic chores increases

9

by 50 per cent but boys' only by 17 per cent. This suggests that the inferior enrolment rate of

girls may partly be explained by the extra household work that falls disproportionately on them

when mother is working.

If true, this means that women’s labour force participation is bad for girls’ schooling

and it has the unfortunate implication that policies to enhance women's labour force

participation will adversely affect girls' education. However, the effect may not be so

deleterious as that suggested by the coefficient on MAEDYRS: we observe that in the past 25

years or so, both female labour force participation and female education have been rising in

India, indicating that the negative effect of a working mother on girls' education is more than

outweighed by other factors, and suggesting grounds for longer-term optimism.

There is also a second possible interpretation of the effect of MAWORKED. The

variable MAWORKED does not refer specifically to mother's working when the individual was

a child of age 5 or 6 (the age at which school enrolment typically takes place). Thus, it is

possible that it reflects the respondent’s economic background, with those whose mothers

worked being the ones from poor families. This view appears to be supported by Murthi,

Guio, and Drèze [1997] who state that “female labour force participation in India is often a

reflection of economic hardship” . If this is the case then it is not so much the educational cost

of a working mother but, rather, the educational cost of poverty that falls disproportionately on

daughters.

10

4. Educational attainment

We turn next to the educational attainment equations. Years of education (EDYRS) is

modelled separately for men and women as pooling was strongly rejected by a chow test

(F=38.75 and F-critical=1.67). The pooled model of EDYRS (Appendix 2) shows that the

coefficient on the gender dummy is significant at the 1 per cent level, suggesting that girls have

significantly lower allocations than boys even after standardising for household characteristics.

Consistent with the discussion in the methodology section, it is desired to estimate a

model of educational attainment correcting for possible bias due to endogenous selection of

individuals into the sample of persons with positive years of schooling. However, as is often

the case in much of the applied literature, we have no plausible variables that belong in the first

stage ‘ever enrolled’ selection equation and that do not belong in the second stage ‘schooling

attainment’ equation.

Given the lack of credible exclusion restrictions, we followed two alternative

approaches to achieve identification of the selectivity term, lambda, though neither is ideal.

Firstly, identification through functional form and, secondly, using variables that are actually

insignificant in an OLS attainment equation (for each gender separately) but significant in the

enrolment equation. In the latter case, there is no a priori theoretical justification but rather an

empirical justification for the chosen exclusion restrictions. We estimate these two models

and compare them with the simple OLS equation of schooling attainment. The selectivity

corrected equations of years of education, conditional on enrolment, are presented in Table 6,

using both methods of identification of lambda. Both show that selectivity into schooling is

unimportant for women but is important for men, with a significant negative coefficient. The

negative sign is surprising as we had expected that those who are not likely to enrol but did

enrol (i.e. non-participant types) must be those with higher ambition and/or motivation -

qualities which will be positively related to EDYRS. There are two ways to explain this rather

counter-intuitive result. One is that ambition and motivation are not sufficient to sustain

11

disadvantaged people in education, i.e. well motivated males who live in poverty are forced to

relinquish education while the well-off males with both low and high motivation continue in

schooling. A second, more plausible explanation is that in the absence of a plausible

identification strategy, one cannot trust the sign and size of the coefficient on lambda.

Table 7 presents the OLS equation of educational attainment. It is clear from a

comparison of the coefficients in Tables 6 and 7 that the exclusion of lambda makes little

difference to the estimated parameters of the attainment function. Even a casual inspection

confirms little change in the estimated coefficients. In virtually no case has the coefficient

moved by more than 1 standard error between the selectivity-corrected and OLS

specifications. A series of wald tests confirm that not a single coefficient difference between

Tables 6 and 7 is statistically significant at even the 10 per cent significance level7. Given the

imperfect selectivity correction strategy and, more importantly, given that correction for

selectivity (such as was possible) does not significantly alter the estimated parameters of the

EDYRS model, we use the simple OLS equation of EDYRS in Table 7 as the preferred

specification in the rest of the paper, though we will compare the effect of using OLS versus

selectivity corrected specifications on the decomposition exercise in the next section.

The explanatory power of the female EDYRS model in Table 7 is substantially higher

than that of the male model. This suggests that for women, the combined role of personal

endowments, measured household characteristics, and variables representing parental 'taste for

education' is relatively more important in determining achievement than for males.

The base category for the AGE variables is Age 39-45 years. The negative coefficients

on the variable Age 23-30 in both male and female regressions appears to be the result of

sample selection since 8 per cent of males and 5 per cent of females in that age-group have not

finished their education and so are excluded from our analysis, which takes into account only

those 23-45 year olds who have completed their education. Thus, the age cohort of 23-30

12

year olds excludes persons with high values of EDYRS but includes those who stopped

studying early and have low values of EDYRS.

The coefficient on Age 31-38 shows a weak improvement in EDYRS for females over

time but, surprisingly, there is a weak deterioration in EDYRS for men and this effect is

significant at the 10 per cent level. We have not been able to find any plausible explanation for

this finding except the possibili ty that widespread graduate unemployment in India [Blaug,

Layard, and Woodhall 1969] and the increase in vocational courses has indeed lessened the

average years of general education acquired by males. Another possible explanation may be if

universities have become more selective in their student intake over time and reversed their

former 'open-door' admissions policy, thereby adversely affecting the access to higher

education among younger cohorts8.

Being MUSLIM has a strong negative effect on years of education acquired by men.

This phenomenon may reflect both lower taste for education among, and employer

discrimination against, Muslims in India. Muzammil [1994, p6] states that the perpetuation of

ancestral manual occupations in most Muslim families implies that little effort is made by

Muslims for the better education of their children (lower taste argument). He also points out

(p8) that Muslims are the subject of employment discrimination in both the government sector

and particularly severely, the private sector (discrimination argument). Such discrimination is

likely to lower the expected rate of return to education for Muslims and cause them to desire

fewer years of schooling.

The fact that Muslim females are not significantly behind non-Muslim females in

educational attainment (conditional on enrolment) suggests that most of the educational

discrimination against girls by Muslim parents occurs at the stage of enrolment (see enrolment

function in table 4); those Muslim girls whose parents are liberal enough to enrol them in

schooling have educational aspirations that are not significantly below those of their non-

Muslim counterparts.

13

Conditional on enrolment, low and backward caste (LOWCASTE) men attain the same

years of education as high caste men, ceteris paribus. However, low caste women lag behind

high caste women. This appears to suggest that while lowcaste men take advantage of the

'reservation' measures the government has put in place for improving the representation of low

castes at different levels of education (particularly higher education), low caste women do not.

Another explanation is that mandatory 'reservation' for low caste persons in highly-paid public

sector jobs since independence has given lowcaste men an economic incentive to enhance their

education but that for lowcaste women, this incentive has not been sufficient to induce them to

discard their traditional educational conservatism.

The effect of LOWCASTE on educational attainment for the two genders is consistent

with the finding in Kingdon [1998a] that there is greater wage discrimination against low

castes in the female labour market than in the male.

Parental wealth is a highly important determinant of educational attainment for both

sexes though the effect is weaker for males, both quantitatively and qualitatively. Different

measures of the home educational environment (BKHOME and PANEWS) are significant for

grade attainment of men and women.

From the point of view of analysing the intra-household allocation of education,

perhaps the most interesting result of the analysis is the effect of the parental education

variables. Mother's education is very important to girls' schooling but not to boys'; Father's

education is important to both boys' and girls’ schooling. This result has policy implications

for reducing gender inequality in education and we discuss this issue later.

Another parental variable which has a large impact on girls' education but not on boys'

is parents' opinion about the importance of girls' education (EDEQUAL)9. Girls whose parents

believed in gender equality in education attained very significantly more education than other

girls. This is similar to the findings in Drèze and Kingdon [2001] where a similar attitudinal

variable was employed. Interestingly, this variable has no effect on educational attainment for

14

boys in Table 6, unlike in Drèze and Kingdon’s paper where even sons’ education benefited

from having parents who believed in gender equali ty in education.

Father worked in a white collar occupation (PAWHITE) is of consequence to children' s

educational attainment in both male and female samples. However, surprisingly, a working

mother (MAWORKED) exerts a strongly negative influence only on boys' EDYRS. This is in

contrast to the effect of MAWORKED in the enrolment function where a "working mother"

significantly lowered the probabili ty of enrolment for girls (t=1.70) but not for boys. In other

words, most of the negative influence of "mother worked" on girls' education occurs at the

stage of enrolment and on boys' education at the post-enrolment stage.

The age at marriage variables are highly important in explaining years of education

acquired for both sexes, though the effect is particularly powerful for females, in both

quantitative and qualitative terms. The base category is ‘married at age 21 or above’. Those

who married very young (at <=17 years old) had, on average, about 2.3 years less education

than those who married late. Persons who married between ages 18 and 20 years old likewise

had significantly less education than those who married after 2010.

Observe that measures of abili ty (SRAVEN categorised into low, medium, and high

abili ty) are very significant determinants of EDYRS for males as well as females, although their

quantitative effect is bigger in the male equation. The base category for the abili ty variables is

the low abili ty category (ABILLOW, or SRAVEN score of <=10). Medium abili ty and high

abili ty individuals have decidedly higher education than individuals with low measured abili ty,

as might be expected a priori. After ' age at marriage' , abili ty is the most important determinant

of EDYRS in the female equation; in the male equation, it is the most important, with high

abili ty men gaining, on average, nearly 3 years' more schooling than low abili ty men, ceteris

paribus11.

The variable REPEAT also represents aspects of abili ty: the less able are more likely to

fail and repeat classes. REPEAT exerts a potent negative effect on EDYRS in the male

15

equation and a more modest, though still weakly significant, influence on EDYRS in the female

equation.

The quality of primary school attended (SCQUAL) affects educational attainment

positively and significantly for both the sexes12. This suggests that after controlli ng for

parental education, occupation and wealth, and for social, religious, and health-related

influences, persons who drop-out of education prematurely are the ones who faced poor

quality primary schooling (i.e. attended poorly resourced and equipped schools). This implies

that the phenomenon of early dropping out, which represents large-scale wastage in Indian

education, is not only a demand-side problem: it is also importantly a supply side issue, with

low-quality schools faili ng to retain students in education. It suggests that the level of

dropping-out can be mitigated by upgrading the quality of primary schools.

5. How much of the gender gap in educational attainment can be explained?

Are males and females with comparable characteristics equally likely to enrol and to

attain similar years of schooling? What factors contribute most to the male-female gap in

schooling? To answer these questions, we decompose the gross gender difference in

enrolment probabili ty and in mean years of education into the component 'explained' by

differences in characteristics between the two groups, and the 'unexplained' component. The

unexplained component is conventionally regarded in the literature as the extent of sex

discrimination [Oaxaca, 1973; Blinder, 1973]. However, to the extent that lower allocations

of education to daughters reflect poorer economic returns to girls’ education, it may be argued

that such lower allocations are not discriminatory but, rather, a rational economic response by

parents. Thus, we prefer to use the term ‘differential treatment’ rather than discrimination. We

use the Blinder-Oaxaca technique for measuring differential treatment when two groups differ

in their characteristics and differ in the structure relating these characteristics to educational

attainment.

16

Assume that the mean years of schooling of females (f) is S f and that of males (m) is

Sm . Mean years of schooling is determined by

S b Xi i i=�

i = f,m

where X is the vector of the mean values of characteristics and �

b is the vector of estimated

coefficients of the educational attainment function.

The mean years of schooling of men, if they were educated according to women's

educational attainment function, would be the dot product �

b Xf m . The total gender difference

(T) in mean years of schooling can be divided into the part explained (E) by the different

personal characteristics of men and women and the part unexplained (D), reflecting differences

in the structure of the educational attainment function, that is, differences in �

b for the two

sexes.

T S Sm f= −

T b X b Xm m f f= −� �

{ } { }T X b b b X Xm m f f m f= − + −(� �

)�

( )

T = D + E

This can be referred to as standardising by male means. Similarly, the estimation of the years

of schooling of women if they face men's educational attainment function permits the

decomposition into D + E as follows:

T S Sm f= −

T b X b Xm m f f= −� �

{ } { }T X b b b X Xf m f m m f= − + −(� �

)�

( )

T = D + E

This can be referred to as standardising by female means. Since the decomposition may

be sensitive to the choice of index (standardising according to male means or female means),

ideally both decompositions should be carried out.

17

Following the procedure just outlined, Table 8 presents the decomposition of the gross

average gender difference in EDYRS into 2 components, i.e. the component that is due to

gender differences in inputs (different means of characteristics for males and females) and the

component that is due to the difference in coefficients in the male and female EDYRS

production functions. We do this for each variable separately to examine the contribution of

each individual variable to the gender difference in schooling. The figures in Table 8 are those

obtained by standardising with female means. The standardisation with male means gives quite

a similar decomposition.

Columns 2 and 3 of Table 8 show that there are significant differences in the mean

characteristics of men and women. Most of the parental and household traits generally favour

females: for example, ever-enrolled females have significantly higher values of parental

education, occupation and wealth and lower values of lowcaste than ever-enrolled males.

These differences in mean parental attributes probably reflect the sample selection effects

mentioned in the discussion of the enrolment function: While boys from uneducated and poor

backgrounds may enrol in school, girls only from relatively better-off and more educated

backgrounds will enrol. However, personal variables such as age-at-marriage and abili ty

favour males: a significantly smaller proportion of men than women are in the early-age-

marriage category and in the low-abili ty category, though a significantly higher proportion of

males than females repeat classes. Thus, while parental and household variables generally

favour females, personal characteristics generally favour males.

Collectively, their higher values of household background, parental, and school quality

variables imply a 0.508 years of EDYRS advantage for females, while the remaining (personal)

variables imply a 0.680 years of EDYRS disadvantage for females. Thus, taken as a whole,

girls’ inferior characteristics account for 0.172 years of their total EDYRS disadvantage vis-a-

vis boys. The remaining female disadvantage in educational attainment is explained by the

different production functions of EDYRS for men and women. Thus, for example, females’

18

school attainment responds less to ‘ father in white collar occupation’ and to ‘having high

abili ty’ (ABILHIGH), than does males’ schooling attainment. The coefficients on the age-at-

marriage variables also favour males, reinforcing their advantage in mean values of age-at-

marriage13.

In summary, the decomposition analysis suggests that, of the total gender difference in

EDYRS (0.684 years), only 0.172 years or about 25 per cent is explained by girls’ inferior

attributes. The remaining difference in EDYRS of 0.512 years (or about 75 per cent of the

difference) is due to the different production function of EDYRS faced by girls and boys. A

decomposition exercise using coefficients of the sample selectivity corrected attainment

equations yields quite similar results14.

Clearly, the large unexplained portion of the gender gap in schooling is due to

unobserved factors which may include parental attitudes (such as son preference) and/or lower

expected economic returns to girls schooling than boys.

In principle, it should be possible to examine the effect of labour market returns of men

and women on the amount of education boys and girls are allocated within a family. One way

would be to examine the effect of male and female employment rates (proxies for labour

market returns) on the educational attainment of males and females. This would be similar to

the strategy in Rosenzweig and Schultz [1982] where the authors investigate the effect of male

and female labour market participation rates on gender differences in resource allocation and

child survival. In practice, there are difficulties in doing so convincingly with the data in

hand15. However, I examine this issue in another paper using an alternative strategy, namely

to estimate gender-specific rates of return to education [Kingdon, 1998a] using the same

dataset as the one used here. This shows that women have significantly lower returns to

education than men. If the labour market changes only slowly, as appears to be the case in

India16, then today’s gender difference in labour market returns is likely to be similar to that in

the previous generation, and it would have shaped the expectations of returns to education for

19

our sample when their parents were making schooling decisions for them. This provides

support for the explanation that parental investment motives underlie the lower allocation of

schooling attainment to girls, though it does not rule out the existence of a son-preference

motive as well .

6. Conclusions

Both the gender difference in enrolment rate and the gender difference in average years

of education attained are statistically significant in the urban Indian sample used here, even

after controlli ng for a range of pertinent characteristics. The most important factors influencing

educational attainment of women are parental background, wealth, and opinions, individual

abili ty, age-at-marriage, and the quality of primary school attended. The analysis suggests that

75 per cent of the gender disparity in EDYRS is unexplained, only 25 per cent being accounted

for by girls’ inferior education-enhancing characteristics.

Both low and backward caste men and women and Muslim men and women have

lower enrolment than others, ceteris paribus. Moreover, low and backward caste women and

Muslim men have significantly lower educational attainment (conditional on enrolment) than

their high caste and non-Muslim counterparts, ceteris paribus. While different preferences may

explain part of this, labour market discrimination against these social groups also appears to be

responsible, at least to some extent.

The results show that the educational ‘cost’ of a working mother, or of poverty - where

a working mother signifies poverty - falls disproportionately on daughters. This probably

reflects parental beliefs about the gender division of labour: if a daughter is envisaged to be a

housewife for most of her adult life, her enrolment in school can be sacrificed (if necessitated

by mother’s working or by poverty) without the same compunction as sacrificing a son’s

schooling. Poverty bears particularly heavily against girls’ education: PAWEAL is highly

20

important in the female educational attainment equation but not so in the male educational

attainment function.

Parents’ differential treatment of sons and daughters in education could be the result of

lower economic returns to girls’ education than to boys. Kingdon [1998a] and the studies

cited therein find this to be true for India. This finding is also supported by Behrman, Foster,

Rosenzweig and Vashishtha [2000] who report that in rural India in the early green revolution

period there were no direct economic returns to women’s schooling because women were not

involved in occupations where education was rewarded (though women’s schooling did have

benefits in terms of the home teaching of children). However, other possible explanations of

parents’ differential treatment of sons and daughters in education are that (i) differential

treatment partly reflects entrenched beliefs about the gender division of labour [Drèze and Sen

1995]; (ii) it reflects an asymmetry in parental incentives to educate girls and boys due to son

preference; (iii ) even if the economic returns to education were the same for boys and girls,

parents may value only that part of the return to a child’s education that accrues to them

personally (and the returns to a daughter’s education are often reaped by her in-laws); (iv) due

to the higher costs (opportunity costs and/or direct costs) of educating girls. These contrasts

in parental incentives have strong implications for public policy: parental motivation for male

education is high. For female education, however, it is important to address the conservatism

of social attitudes and parental inertia. It is also important to reduce wage and job sex-

discrimination in the labour market and boost women’s economic returns to education in order

to improve girls’ incentives to acquire schooling.

1 See, for example, evidence cited in King and Hill [1993], Subbarao and Raney [1995], and Drèze and Murthi [2001]. 2 The GDI attempts to capture achievement through the same set of basic capabiliti es included in the Human Development Index - li fe expectancy, educational attainment, and income -but adjusts the HDI for gender inequalit y. 3 I have come across only three studies that examine statisticall y the determinants of enrolment and schooling attainment for girls and boys in India [Duraisamy, 1992; Sipahimalani, 2001; Pal, 2001]. However, both Duraisamy and Sipahimalani papers fit enrolment and educational attainment functions separately for girls and boys (without decomposing the gender gaps), not a pooled model with a gender dummy and gender

21

interactions. Pal examines the gender gap in current enrolment (but not in educational at tainment) among 5-15 year olds in rural west Bengal. 4 Under alternative assumptions about the distribution of the error term in equation (2), the logit model can also be employed to predict probabiliti es of work force participation; however, we intend to use the probit model which is the discrete choice model most used in applications of the Heckman correction described in the next section. 5 Banerjee and Knight [1985], Chinnappan [1992], and Kingdon [1998a] all find that there is significant caste discrimination in the Indian labour market. However, using the present data, Kingdon [1998a] did not find statisticall y significant wage discrimination against Muslims, though the coeff icients on the MUSLIM dummies in her earnings functions were invariably negative. 6 Muzammil [1989, p 154-60] finds that in terms of 1970-71 constant prices, annual tuition fee in elementary education was meagre - Rs. 0.88 in 1950-51 and Rs. 1.48 in 1979-80. Tuition fees in primary schools was abolished for girls in 1967 and for boys in 1989. 7 For example, the 2χ value of the difference between the coeff icient on MUSLIM in tables 6 and 7 for men is

0.705. The criti cal value of 2χ (1) at the 5% (10%) level is 3.84 (2.71). Thus the null hypothesis that

selecols bb ˆˆ = cannot be rejected. This is not surprising since the change in the size of the coeff icient { 1.465 –

0.994=0.471) is only just over 1 standard error apart in the selectivity-corrected and OLS male equations. 8 This may not be a plausible explanation because the effect does not appear to apply to females. It may also not be plausible because those who fail to be selected for admission into a college or university do have the option of taking degree exams 'privately', that is, without being regular students in a college or university, though it is arguable that private exams are available only in certain subjects and not across all subjects. 9 The inclusion of attitudinal variables such as EDEQUAL is not standard in applied economic research, which typicall y includes only directly observable arguments. The main reason for being sceptical of attitudinal variables is because they may be jointly determined with the dependent variable (for example, what parents think about the importance of their daughters’ education compared with the education of their sons). However, the potential endogeniety of the parental attitude variable EDEQUAL was limited by asking a very general question, not directly related with the respondents’ circumstances: We asked what parents thought about the relative importance of education for males and females in general. Such attitudes may be determined earlier and by much wider considerations than peoples’ attitudes to their own sons’ and daughters’ education. 10 While 'age at marriage' variables may be endogenous to EDYRS, e.g. for some individuals, 'age at marriage' may be determined partly by EDYRS, evidence from the data collection suggests otherwise, at least for girls, many of whom reported 'I got married' as the reason for discontinuing education. However, this was rarely a reason given by boys for dropping out of education prematurely. Drèze and Sen [1995, p15] state that in India considerations involved in educational decisions are radicall y different for girls and boys: In the case of boys, economic reasons are strong. In the case of girls, they are more guided by (exogenous) marriage practices and by the gender division of labour. We were unable to deal with any potential endogeniety of the ‘age at marriage’ variables due to the lack of suitable instruments. Consequently, the results should be interpreted as correlational rather than causal, particularly in the male equation. Results without ‘age-at-marriage’ variables are given in Appendix 3 and they show that there is littl e change in the probit results. For example, between the ‘age-at-marriage’ inclusive and exclusive specifications, not a single coeff icient has changed by more than one standard error. So the endogeneity of these variables, if it exists, does not bias the other coeff icients in Table 7. 11 Scores on the Raven's test have been used as a measure of innate abilit y in a number of well -received empirical analyses of education [for example, Glewwe 1996; Boissiere, Knight, and Sabot 1985, inter alia]. Glewwe [1996] explains that while the Raven’s measure of abstract thinking ability may reflect other characteristics of the individual [see Raven et. al., 1984], it should also be well correlated with innate abilit y. 12 Kingdon [1996a,b] finds that schooling quality signif icantly affects another measure of educational success, namely student achievement. 13 Caution must be exercised in decomposing of the effect of ‘age-at-marriage’ on educational attainment. While we describe the lower mean ‘age-at-marriage’ of women then men as an inferior characteristic of women, in fact it is, in a sense, a manifestation of parental discrimination against girls. Thus, our calculation of the fraction of the gender gap in EDYRS that is discriminatory may be an underestimate. 14 Of the total gender difference in EDYRS (0.684), only 0.206 (or about 30%) is explained by girls’ inferior attributes. The remaining difference in EDYRS (0.478 or about 70%) is due to the different production function of EDYRS faced by girls and boys. Thus, compared to the decomposition using the OLS specification, the decomposition using the selectivity corrected specification yields a somewhat lower estimate of the unexplained component (70% instead of 75% of the total gender difference). Many studies note that sample

22

selection correction lowers the estimated amount of ‘discrimination’ by increasing the weight attached to the depreciation effect of female non-participation in the wage work force [for example, see Dolton and Makepeace 1986, p336; and Choudhury 1993, p337]. 15 Men and women’s expected labour market returns (as captured by male and female employment rates) are endogenous and must therefore be instrumented in the schooli ng attainment equation. Given that our d ata are from a single city and there was no community questionnaire, we have no good instruments for employment rates such as exogenous vill age or district labour market characteristics, unlike Rosenzweig and Schultz [ 1982]. 16 There is some support for thi s idea. Rosenzweig and Schultz assume that “ in a stable, slowly developing society such as in rural India, parents can reasonably expect that conditions which they face as adults will also condition in a similar way the behavior of their offspring” . While this may be thought to be true only for rural India, it seems true for urban India as well . For example, Kundu [ 1997, p. 442] reports that between 1977 and 1993, the labour force participation rate of rural and urban women remained reasonably stable at about 52-54% and 23-25% respectively.

1

References Banerjee, B and J. B. Knight, 1985, ‘Caste discrimination in the Indian labour market’ , Journal of

Development Economics, Vol. 17, pp.277-307. Behrman, J., A. Foster, M. Rosenzweig, and P. Vashishtha, 1999, ‘Women’s Schooling, Home Teaching, and

Economic Growth’ , Journal of Politi cal Economy, Vol. 7, No. 4, pp. 682-714. Blaug, M., R. Layard and M. Woodhall , 1969, The Causes of Graduate Unemployment in India, London:

Penguin Press. Blinder, Alan, 1973, ‘Wage Discrimination: Reduced Form and Structural Estimates’ , Journal of Human

Resources, Vol. 8, No. 4, Fall , pp. 436-55. Boissiere, Maurice, J. B. Knight, and R. H. Sabot, 1985, ‘Earnings, schooling, abili ty and cognitive skill s’ ,

American Economic Review, Vol. 75, pp.1016-1030. Chinnappan, G., 1992, ‘Significance of human capital approach to caste inequali ty’ , in Kothari, V. (ed.),

Issues in Human Capital Theory and Human Resource Development Policy , New Delhi: Himalaya Publishing House.

Choudhury, Sharmila, 1993, ‘Reassessing the Male-Female Wage Differential: A Fixed Effects Approach’ ,

Southern Economic Journal, Vol. 60, No. 2, pp.327-40. Dolton, P. and G. Makepeace, 1986, ‘Sample Selection and Male-Female Earnings Differentials in the

Graduate Labour Market’ , Oxford Economic Papers, Vol. 38, No.2, pp.317-41. Drèze, Jean and Geeta Kingdon, 2001, ‘Schooling Participation in Rural India’ , Review of Development

Economics, Vol. 5, No. 2, pp.1-26. Drèze, Jean and M. Murthi, 2001, ‘Fertili ty, Education, and Development: Evidence from India’ , Population

and Development Review, Vol. 27, No.1, pp.33-63. Drèze, Jean and A. Sen, 1995, ‘Basic education as a poli tical issue’ , chapter in India: Economic Development

and Social Opportunity, Delhi: Oxford University Press. Duraisamy, P., 1992, ‘Gender, Intrafamily allocation of resources and child schooling in India’ , Discussion

Paper No. 667, Economic Growth Center, Yale University, January. Glewwe, Paul, 1996, ‘The Relevance of Standard Estimates of Rates of Return to Schooling for Education

Policy: A Critical Assessment’ , Journal of Development Economics, Vol. 51, No. 2, pp. 267-90. Greene, W.H., 1993, Econometric Analysis, 2nd Edition, New York: Macmill an. Heckman, James, 1979, ‘Sample selection bias as a specification error’ , Econometrica, Vol. 47, pp.153-161. King, E. and M. Hill , 1993, Women’s education in developing countries, Washington D.C.: John Hopkins

Press for the World Bank. Kingdon, Geeta Gandhi, 1996a, ‘The quali ty and eff iciency of private and public education: A case-study of

urban India’ , Oxford Bulletin of Economics and Statistics, Vol. 58, No. 1, pp.57-82. Kingdon, Geeta Gandhi, 1996b, ‘Student Achievement and Teacher Pay’ , Development Economics Discussion

Paper No. 74, STICERD, London School of Economics.

2

Kingdon, Geeta Gandhi, 1998a, ‘Does the Labour Market Explain Lower Female Schooling in India?’ ,

Journal of Development Studies, Vol. 35, No. 1, pp. 39-65. Kingdon, Geeta Gandhi, 1998b, Education of Females in India: Determinants and Economic Consequences,

McNamara Fellowships Off ice of the World Bank, Washington D.C. Kundu, Amitabh, 1997, ‘Trends and Pattern of Female Employment in India: A Case of Organised

Informalisation’ , Indian Journal of Labour Economics; Vol. 40, No. 3, pp. 439-51. Maddala, G. S., 1989, Introduction to Econometrics, New York: Macmill an. Murthi, M., A. Guio, and J. Drèze, 1997, ‘Mortali ty, fertili ty, and gender bias in India: A district level

analysis’ , Population and Development Review, Vol. 21, pp.745-782. Muzammil, Mohd., 1989, Financing of Education, New Delhi: Ashish Publishing House. Muzammil, Mohd., 1994, ‘Education and employment: A study of Muslims in UP (India)’ , paper presented at

the South Asia Visiting Scholar’s Programme, Queen Elizabeth House, University of Oxford. Oaxaca, R., 1973, ‘Male-female differentials in urban labour markets’ , International Economic Review, Vol.

3, pp.603-709. Pal, Sarmistha, 2001, ‘How Much of the Gender Differences in Child School Enrolment Can be Explained:

Further Evidence from India’ , mimeo, Cardiff Business School. Probe Team, 1999, Public Report on Basic Education in India, New Delhi: Oxford University Press. Raven, J.C., J.H. Court, and J. Raven, 1984, Manual for Raven’s Progressive Matrices and Vocabulary

Scales, London: H.K Lewis and Co. Rosenzweig, M. and T. Paul Schultz, 1982, ‘Market Opportunities, Genetic Endowments, and Intrafamily

Resource Distribution: Child Survival in Rural India’ , American Economic Review, Vol. 72, No. 4, pp. 803-15.

Schultz, T. P., 1993, ‘Returns to women' s education’ chapter 2 in King, E and M Hill (eds.) Women’s

education in developing countries, Washington D.C.: Johns Hopkins Press for the World Bank. Shariff , A., 1999, India: Human Development Report, New Delhi: Oxford University Press. Sipahimalani, Vandana, 2001, ‘Education in the Rural Indian Household: The Impact of Household and

School Characteristics on Gender Differences’ , mimeo, Yale University. Subbarao, K. and L. Raney, 1995, ‘Social Gains from Female Education: A Cross-National Study’ , Economic

Development and Cultural Change, Vol. 44, No.1, pp.105-128. Subramaniam, Shankar, 1995, ‘Gender Discrimination in Intra-Household Allocation in India’ , mimeo.,

Cornell University and Indira Gandhi Institute of Development Research, Bombay. Subramaniam, S. and A. Deaton, 1991, ‘Gender effects in Indian consumption patterns’ , Sarvekshana, Vol.

14, No. 4, pp.1-12. UNDP, 1996, Human Development Report, United Nations Development Programme, New York.

3

Table 1

Raw gender difference in enrolment and average years of education attained

Persons aged 23-45 Persons aged >=23*

Enrolment Years of education Enrolment Years of education

(%) Including non-

enrolees

Conditional on

enrolment

(%) Including non-

enrolees

Conditional on

enrolment

Males (A) 90.6 11.24 12.29 88.7 10.86 12.18 Females (B) 75.6 8.86 11.61 69.4 7.79 11.22 Raw Gender Difference (C =A-B) 15.0 2.38 0.68 19.3 3.07 0.96 % Female disadvantage (C/A)*100

16.6

21.2

5.5

21.8

28.3

7.9

t-value for the difference 8.66 8.68 3.16 12.74 13.35 5.32

Note: * No upper age limit . This implies a sample of older persons. The sub-sample of persons aged 23-45 years old is the ‘younger’ sample. Non-enrolees are assigned zero years of education.

4

Table 2 Definitions of variables used in the enrolment and educational attainment functions

Variable Description

AGE 23-30 Dummy for persons between ages 23 and 30 years old



MUSLIM Religion Muslim? yes=1, no=0

LOWCASTE Belongs to the low or backward caste? yes=1, no=0

ACNSIB Number of siblings when a child

PAWEAL Index of parental wealth, based on assets owned by the respondent's family, when a child

PAWEALSQ Square of PAWEAL

BKHOME Index of number of books in parents' household, when a child. Takes values from 1 to 5,

with 1 representing less than 25 books and 5 representing more than 100 books.

PANEWS Whether either parent read a dail y newspaper when respondent was a child

MAEDYRS Mother's education in years

PAEDYRS Father's education in years

PAEDYRSQ Square of PAEDYRS

ACHEAL Index of respondent's health when a child. Takes values from 1 to 4, with 1 representing

very good health and 4 representing very bad health.

EDEQUAL Do parents think that girls' education is equally important as that of boys? yes=1, no=0

PAWHITE Father in a white collar occupation? yes=1, no=0

MAWORKED Did mother ever work in an income-generating activity? yes=1, no=0

NEVMARR Never married? yes=1, no=0

MARRAGE <=17 Age at marriage less than or equal to 17 years old

MARRAGE 18-20 Age at marriage between 18 and 20 years old, inclusive

MARRAGE >20 Age at marriage greater than 20 years old

SRAVEN* Score on the Raven's Progressive Matrices test of abilit y

ABILMISS Score on the Raven's test missing

ABILLOW Low abilit y, i.e. Raven's score less than or equal to 10

ABILMED Medium abilit y, i.e. Raven's score between 11 and 18, inclusive

ABILHIGH High abilit y, i.e. Raven's score greater than 18

REPEAT Ever repeated a class at school because of faili ng? yes=1, no=0

SCQUAL Index of qualit y of the primary school attended. Takes values between 0 and 10

depending on the number of resources that the respondent's primary school had.

Note: Variables ACNSIB, PAWEAL, BKHOME, and PANEWS are retrospective variables. That is, they represent information about the respondent when she/he was a child of 14 years old or younger. * Only sets A and C of the Raven's abilit y test were administered so that the minimum and maximum scores on the Raven's test in our data are 0 and 24 respectively.

5

Table 3a

Descriptive statistics for FEMALES aged 23-45 years old

Variable Non-Enrolled Enrolled All

Mean SD Mean SD Mean SD

ENROL* 0.000 0.00 1.000 0.00 0.754 0.43

EDYRS 0.000 0.00 11.611 4.27 8.759 6.23 AGE 35.581 7.10 33.247 6.49 33.820 6.71 AGE 23-30* 0.324 0.47 0.409 0.49 0.388 0.49

AGE 31-38* 0.262 0.44 0.340 0.47 0.321 0.47

AGE 39-45* 0.414 0.49 0.251 0.43 0.291 0.45

MUSLIM* 0.319 0.47 0.116 0.32 0.166 0.37

LOWCASTE* 0.533 0.50 0.198 0.40 0.281 0.45

ACNSIB 4.867 2.01 5.074 2.05 5.023 2.04 PAWEAL 3.359 2.48 10.067 6.82 8.425 6.70 PAWEALSQ 17.388 26.81 147.770 222.80 115.860 201.99 BKHOME 1.019 0.17 1.270 0.82 1.208 0.73 PANEWS* 0.062 0.24 0.595 0.49 0.464 0.50

MAEDYRS 0.086 0.73 4.317 4.86 3.276 4.61 PAEDYRS 1.361 3.06 9.413 5.07 7.442 5.81 PAEDYRSQ 11.188 30.60 114.310 92.74 89.070 93.22 ACHEAL 1.052 0.28 1.040 0.24 1.043 0.25 EDEQUAL* 0.376 0.48 0.670 0.47 0.598 0.49

PAWHITE* 0.038 0.19 0.473 0.50 0.366 0.48

MAWORKED* 0.152 0.36 0.076 0.27 0.095 0.29

NEVMARR* 0.014 0.12 0.082 0.27 0.065 0.25

AGEMARR 16.720 2.95 19.824 3.86 19.020 3.89 MARRAGE <=17* 0.700 0.46 0.254 0.44 0.364 0.48

MARRAGE 18-20* 0.195 0.40 0.318 0.47 0.288 0.45

MARRAGE >20* 0.090 0.29 0.346 0.48 0.283 0.45

SRAVEN 9.316 3.02 11.718 4.24 11.081 4.09 ABILMISS* 0.157 0.36 0.240 0.43 0.220 0.41

ABILLOW* 0.552 0.50 0.343 0.47 0.394 0.49

ABILMED* 0.286 0.45 0.360 0.48 0.342 0.47

ABILHIGH* 0.005 0.07 0.057 0.23 0.044 0.21

REPEAT* - - 0.122 0.33 - -

SCQUAL - - 6.113 1.83 - - N 210 645 855 Note: The variables with superscript * are 0/1 variables, so that their means represent the proportion of ones in the sample. Thus, the mean of the variable 'AGE 23-30' of 0.324 in the non-enrolled sub-sample signifies that 32.4 per cent of all non-enrolled women in the 23-45 year age group fall within ages 23 and 30 years.

6

Table 3b

Descriptive statistics for MALES aged 23-45 years old

Variable Non-Enrolled Enrolled All

Mean SD Mean SD Mean SD

ENROL* 0.000 0.00 1.000 0.00 0.905 0.29

EDYRS 0.000 0.00 12.290 4.04 11.125 5.27 AGE 34.341 7.15 33.536 6.69 33.613 6.74 AGE 23-30* 0.354 0.48 0.413 0.49 0.407 0.49

AGE 31-38* 0.317 0.47 0.313 0.46 0.313 0.46

AGE 39-45* 0.329 0.47 0.274 0.45 0.280 0.45

MUSLIM* 0.329 0.47 0.121 0.33 0.141 0.35

LOWCASTE* 0.695 0.46 0.266 0.44 0.306 0.46

ACNSIB 5.000 2.13 4.990 2.11 4.991 2.11 PAWEAL 3.061 3.05 9.454 7.41 8.847 7.36 PAWEALSQ 18.573 56.27 144.260 257.05 132.330 247.89 BKHOME 1.000 0.00 1.255 0.80 1.231 0.77 PANEWS* 0.073 0.26 0.514 0.50 0.472 0.50

MAEDYRS 0.122 0.78 3.729 4.76 3.385 4.66 PAEDYRS 0.841 2.30 7.958 5.61 7.279 5.77 PAEDYRSQ 5.939 18.87 94.708 93.35 86.244 92.72 ACHEAL 1.098 0.43 1.033 0.24 1.039 0.26 EDEQUAL* 0.537 0.50 0.686 0.46 0.672 0.47

PAWHITE* 0.037 0.19 0.388 0.49 0.355 0.48

MAWORKED* 0.268 0.45 0.112 0.32 0.127 0.33

NEVMARR* 0.122 0.33 0.190 0.39 0.184 0.39

AGEMARR 21.528 3.74 23.637 3.92 23.422 3.95 MARRAGE <=17* 0.085 0.28 0.033 0.18 0.038 0.19

MARRAGE 18-20* 0.341 0.48 0.162 0.37 0.179 0.38

MARRAGE >20* 0.451 0.50 0.614 0.49 0.599 0.49

SRAVEN 8.854 3.41 13.541 4.53 13.042 4.65 ABILMISS* 0.415 0.50 0.485 0.50 0.479 0.50

ABILLOW* 0.390 0.49 0.134 0.34 0.158 0.37

ABILMED* 0.195 0.40 0.305 0.46 0.295 0.46

ABILHIGH* 0.000 0.00 0.075 0.26 0.068 0.25

REPEAT* - - 0.262 0.44 - -

SCQUAL - - 5.898 1.80 - - LAMBDA - - 0.108 0.21 N 82 783 865 Note: The variables with superscript * are 0/1 variables, so that their means represent the proportion of ones in the sample.

7

Table 4 Binary probit model of enrolment

Variables Females Males

coefficient t-value coefficient t-value

Intercept - 1.465 - 4.21 *** 1.102 2.57 ***

AGE 23-30 0.271 1.59 - 0.296 - 1.38

AGE 31-38 0.308 1.81 * - 0.278 - 1.30

MUSLIM - 0.786 - 4.40 *** - 1.469 - 5.84 ***

LOWCASTE - 0.329 - 2.10 ** - 0.758 - 3.56 ***

ACNSIB 0.042 1.21 0.036 0.86

PAWEAL 0.256 5.81 *** 0.282 5.47 ***

PAWEALSQ - 0.005 - 2.42 *** - 0.007 - 5.06 ***

PANEWS 0.089 0.41 - 0.207 - 0.68

MAEDYRS 0.126 2.36 *** 0.146 1.66 *

PAEDYRS 0.237 5.29 *** 0.128 1.63

PAEDYRSQ - 0.011 - 2.89 *** - 0.001 - 0.15

ACHEAL - 0.029 - 0.14 - 0.524 - 2.01 **

EDEQUAL 0.216 1.49 - 0.030 - 0.17

PAWHITE 0.377 1.48 0.065 0.16

MAWORKED - 0.393 - 1.70 * - 0.120 - 0.51

N 848 850

Log li kelihood - 213.72 - 149.39

Restricted log li kelihood - 471.29 - 265.17

McFadden's Pseudo R2 0.547 0.437

% 0s correctly predicted 77.3% 30.0%

% 1s correctly predicted 92.5% 98.3%

Note: McFadden's psuedo R2 is calculated as 1 - (Ln L/Ln L0 ), where Ln L is the maximum of the log li kelihood function and Ln L0 is the restricted log li kelihood, that is when the model is estimated with just the constant term [Maddala 1989]. The base category for age is the age-group 39-45 year olds. *** , ** , and * represent signif icance at the 1, 5, and 10 per cent levels respect ively, in all Tables.

Table 5 Time spent in domestic chores, hours per day

(children aged 6-14 years old)

Mother works N Hours per day spent on domestic chores By daughters By sons Yes 195 2.08 0.54 No 852 1.38 0.46 % difference for children with working mothers

50%

17%

1

Tab

le 6

Se

lect

ivit

y C

orre

cted

Edu

cati

onal

att

ainm

ent

func

tion

s, b

y se

x

Id

enti

fica

tion

of

lam

bda

by f

unct

iona

l for

m

Id

enti

fica

tion

of

lam

bda

base

d on

em

piri

cally

jus

tifi

able

ex

clus

ion

rest

rict

ions

Fem

ales

M

ales

F

emal

es

Mal

es

C

oeff

T

Coe

ff

T

C

oeff

T

Coe

ff

T

In

terc

ept

6.07

2 6.

73

***

9.

550

10.5

9 **

*

6.23

6 7.

12

***

9.

027

12.4

9 **

*

AG

E 2

3-30

-0

.403

-1

.49

-1

.604

-4

.89

***

-1.5

89

-4.8

9 **

*

AG

E 3

1-38

0.

330

1.24

-0.5

26

-1.7

7 *

-0

.522

-1

.77

* M

USL

IM

-0.3

27

-0.9

7

-1.0

07

-2.4

8 **

*

-0

.994

-2

.53

***

LO

WC

AST

E

-0.4

80

-1.7

7 *

0.00

4 0.

02

-0

.481

-1

.79

*

AC

NSI

B

-0.0

44

-0.8

8

-0.0

22

-0.3

9

-0.0

47

-0.9

3

-0.0

20

-0.3

5

PA

WE

AL

0.18

3 3.

23

***

0.

079

1.46

0.15

4 2.

76

***

0.

078

1.46

PA

WE

ALS

Q

-0.0

04

-2.8

4 **

*

-0.0

01

-0.8

6

-0.0

04

-2.5

2 **

*

-0.0

01

-0.8

6

BK

HO

ME

0.

085

0.64

0.30

3 1.

99

**

0.14

4 1.

08

0.

309

2.04

**

P

AN

EW

S 0.

714

2.72

**

*

-0.0

88

-0.2

8

0.68

0 2.

59

***

-0

.109

-0

.35

M

AE

DYR

S 0.

103

3.69

**

*

0.05

1 1.

45

0.

100

3.64

**

*

0.05

2 1.

52

P

AE

DYR

S 0.

139

4.22

**

*

0.12

9 3.

55

***

0.

138

4.18

**

*

0.13

0 3.

59

***

A

CH

EA

L -0

.727

-1

.73

* -0

.456

-0

.96

-0

.714

-1

.69

*

ED

EQ

UA

L 1.

029

4.23

**

*

-0.1

30

-0.4

6

1.04

1 4.

25

***

-0

.148

-0

.53

P

AW

HIT

E

0.57

1 2.

27

**

0.82

1 2.

48

***

0.

551

2.18

**

0.

794

2.44

**

*

MA

WO

RK

ED

-0

.156

-0

.40

-1

.155

-3

.06

***

-1.1

49

-3.1

1 **

*

NE

VM

AR

R

2.43

6 5.

41

***

1.

783

2.63

**

*

2.20

6 4.

95

***

1.

763

2.55

**

*

MA

RR

AG

E <

=17

-2

.404

-8

.09

***

-2

.311

-3

.74

***

-2

.431

-8

.16

***

-2

.316

-3

.64

***

M

AR

RA

GE

18-

20

-1.4

69

-5.6

9 **

*

-0.6

93

-2.1

7 **

-1

.523

-5

.86

***

-0

.696

-2

.10

**

AB

ILM

ISS

0.78

2 2.

73

***

0.

776

2.04

**

0.

781

2.70

**

*

0.81

9 2.

12

**

AB

ILM

ED

1.

139

4.53

**

*

1.42

3 3.

63

***

1.

142

4.51

**

*

1.44

0 3.

60

***

A

BIL

HIG

H

1.85

1 4.

61

***

2.

982

5.98

**

*

1.89

0 4.

68

***

3.

014

6.00

**

*

RE

PE

AT

-0.6

12

-1.9

8 *

-1.2

76

-4.9

7 **

*

-0.6

22

-2.0

2 **

-1

.278

-4

.89

***

SC

QU

AL

0.43

0 6.

24

***

0.

299

3.81

**

*

0.43

0 6.

22

***

0.

306

3.82

**

*

LAM

BD

A

-0.2

04

-0.4

2

-2.1

74

-2.6

3 **

*

-0.4

52

-0.9

8

-2.2

29

-2.8

7 **

*

R2

0.

6482

0.43

69

0.

6446

0.43

77

N

641

769

641

769

Mea

n o

f de

p va

r

11.6

1 12

.29

11.6

1 12

.29

Not

e:

The

rep

orte

d t-

valu

es a

re b

ased

on

sta

ndar

d er

rors

that

are

cor

rect

ed f

or t

he p

arti

cula

r fo

rm o

f he

tero

sked

asti

city

int

rodu

ced

whe

n t

he la

mbd

a te

rm (

inve

rse

of t

he M

ill’s

ra

tio)

is in

clud

ed in

the

estim

atio

n. T

his

is a

chie

ved

by u

sin

g th

e SE

LE

CT

com

man

d in

LIM

DE

P.

For

the

seco

nd s

et o

f es

timat

es, t

he e

xclu

sion

res

tric

tion

s ar

e as

foll

ows.

For

men

: LO

WC

AST

E a

nd A

CH

EA

L;

For w

omen

: A

GE

, AG

ES

Q, M

USL

IM, a

nd M

AW

OR

KE

D.

1

Table 7 OLS Educational attainment functions, by sex

Variables Females Males

Coefficient t-value Coefficient t-value

Intercept 5.889 7.33 *** 9.015 10.17 ***

AGE 23-30 -0.388 -1.42 -1.694 -5.21 ***

AGE 31-38 0.348 1.30 -0.570 -1.93 *

MUSLIM -0.369 -1.13 -1.465 -4.02 ***

LOWCASTE -0.506 -1.88 * -0.249 -0.90

ACNSIB -0.041 -0.80 -0.007 -0.12

PAWEAL 0.192 3.58 *** 0.126 2.46 ***

PAWEALSQ -0.005 -3.12 *** -0.002 -1.78 *

BKHOME 0.076 0.56 0.269 1.77 *

PANEWS 0.724 2.72 *** -0.114 -0.37

MAEDYRS 0.103 3.62 *** 0.049 1.42

PAEDYRS 0.144 4.77 *** 0.164 4.85 ***

ACHEAL -0.728 -1.71 * -0.581 -1.24

EDEQUAL 1.037 4.20 *** -0.148 -0.53

PAWHITE 0.576 2.25 ** 0.773 2.36 ***

MAWORKED -0.176 -0.44 -1.263 -3.39 ***

NEVMARR 2.431 5.30 *** 1.783 2.56 ***

MARRAGE <=17 -2.399 -7.93 *** -2.258 -3.53 ***

MARRAGE 18-20 -1.465 -5.57 *** -0.748 -2.25 **

ABILMISS 0.790 2.71 *** 0.759 1.94 *

ABILMED 1.147 4.49 *** 1.454 3.61 ***

ABILHIGH 1.854 4.54 *** 2.922 5.78 ***

REPEAT -0.617 -1.96 ** -1.271 -4.83 ***

SCQUAL 0.431 6.15 *** 0.312 3.87 ***

R2

0.6487

0.4324

N 641 769

Mean of dependent variable

11.61 12.29

1

Tab

le 8

D

ecom

posi

tion

of

gend

er d

iffe

renc

es in

edu

cati

onal

att

ainm

ent,

us

ing

the

OL

S sp

ecif

icat

ion

of th

e at

tain

men

t fun

ctio

n

Mea

n ch

arac

teri

stic

s

t-te

st o

f th

e ge

nder

Coe

ffic

ient

s W

ald

(chi

-sq)

te

st o

f th

e ge

nder

D

iffe

renc

e in

ED

YR

S d

ue to

Var

iabl

es

Mal

es

Fem

ales

di

ffer

ence

in

char

acte

rist

ics

Mal

es

Fem

ales

di

ffer

ence

in

coe

ffic

ient

s C

oeff

icie

nts

(a)

Cha

ract

eris

tic

s (b)

Com

bine

d E

ffec

t (a

+b)

Inte

rcep

t

9.01

5 5.

889

***

3.12

5 0.

000

3.12

5 A

GE

23-

30

0.41

3 0.

409

-1

.694

-0

.388

**

* -0

.539

-

0.00

1 -

0.54

0 A

GE

31-

38

0.31

3 0.

340

-0

.570

0.

348

**

-0.2

87

- 0.

009

- 0.

296

MU

SLIM

0.

121

0.11

6

-1.4

65

-0.3

69

**

-0.1

33

- 0.

002

- 0.

135

LOW

CA

STE

0.

266

0.19

8 **

* -0

.249

-0

.506

0.06

8 -

0.03

4 0.

034

AC

NSI

B

4.99

0 5.

074

-0

.007

-0

.041

0.17

0 0.

003

0.17

4 P

AW

EA

L 9.

454

10.0

67

* 0.

126

0.19

2

-0.6

23

-0.1

18

-0.7

41

PA

WE

ALS

Q

144.

260

147.

770

-0

.002

-0

.005

0.32

7 0.

016

0.34

3 B

KH

OM

E

1.25

5 1.

270

0.

269

0.07

6

0.24

2 -

0.00

1 0.

241

PA

NE

WS

0.

514

0.59

5 **

* -0

.114

0.

724

**

-0.4

31

- 0.

058

- 0.

489

MA

ED

YRS

3.72

9 4.

317

**

0.04

9 0.

103

-0

.202

-

0.06

1 -

0.26

3 P

AE

DYR

S 7.

958

9.41

3 **

* 0.

164

0.14

4

0.15

6 -

0.21

0 -

0.05

4 A

CH

EA

L 1.

033

1.04

0

-0.5

81

-0.7

28

0.

152

0.00

5 0.

157

ED

EQ

UA

L 0.

686

0.67

0

-0.1

48

1.03

7 **

* -0

.812

0

.017

-

0.79

6 P

AW

HIT

E

0.38

8 0.

473

***

0.77

3 0.

576

0.

076

- 0.

049

0.02

8 M

AW

OR

KE

D

0.11

2 0.

076

**

-1.2

63

-0.1

76

**

-0.1

22

- 0.

006

- 0.

129

NE

VM

AR

R

0.19

0 0.

082

***

1.78

3 2.

431

-0

.123

0.

263

0.14

0 A

GE

MA

RR

<=

17

0.03

3 0.

254

***

-2.2

58

-2.3

99

0.

031

0.27

1 0.

302

AG

EM

AR

R18

-20

0.

162

0.31

8 **

* -0

.748

-1

.465

*

0.11

6 0.

228

0.34

4

AB

ILM

ISS

0.

485

0.24

0 **

* 0.

759

0.79

0

-0.0

15

0.19

4 0.

179

AB

ILM

ED

0.

305

0.36

0 **

1.

454

1.14

7

0.09

4 -

0.12

2 -

0.02

7 A

BIL

HIG

H

0.07

5 0.

057

2.

922

1.85

4

0.11

2 0.

025

0.13

6 R

EP

EA

T 0.

262

0.12

2 **

* -1

.271

-0

.617

-0.1

71

- 0.

086

- 0.

258

SCQ

UA

L 5.

898

6.11

3 **

0.

312

0.43

1

-0.6

99

- 0.

093

- 0.

792

Tota

l

0.51

2

0.17

2

0.68

4

2

1

Appendix 1

Pooled binary probit model of enrolment with gender dummy

Variable Coefficient t-ratio

Marginal effect

Intercept 0.2063 0.78 0.007

AGE 23-30 0.0582 0.45 0.002

AGE 31-38 0.0936 0.72 0.003

MUSLIM -1.0022 -7.10 *** -0.035

LOWCASTE -0.4577 -3.79 *** -0.016

ACNSIB 0.0327 1.25 0.001

PAWEAL 0.2631 9.04 *** 0.009

PAWEALSQ -0.0063 -6.80 *** -0.000

PANEWS -0.0105 -0.06 -0.000

MAEDYRS 0.1306 2.92 *** 0.005

PAEDYRS 0.1947 5.15 *** 0.007

PAEDYRSQ -0.0076 -2.23 ** -0.000

ACHEAL -0.2219 -1.35 -0.008

EDEQUAL 0.1165 1.07 0.004

PAWHITE 0.2881 1.35 0.010

MAWORKED -0.2943 -1.83 * -0.010

FEMALE -1.0797 -9.65 *** - 0.038

Log L

-373.61

Restricted Log L -771.46

Psuedo R2 0.5157

N 1698

2

Appendix 2

Pooled model of educational attainment with gender dummy

Variables Coefficient t-value

Intercept 8.3100 13.79 ***

AGE 23-30 -1.0783 -4.95 ***

AGE 31-38 -0.2033 -0.99

MUSLIM -0.9990 -3.96 ***

LOWCASTE -0.4009 -2.01 **

ACNSIB -0.0296 -0.76

PAWEAL 0.1390 3.70 ***

PAWEALSQ -0.0029 -2.97 ***

BKHOME 0.1749 1.68 *

PANEWS 0.2659 1.26

MAEDYRS 0.0752 3.29 ***

PAEDYRS 0.1419 6.11 ***

ACHEAL -0.6078 -1.86 *

EDEQUAL 0.3898 2.03 **

PAWHITE 0.6572 3.12 ***

MAWORKED -0.9020 -3.26 ***

NEVMARR -0.6225 -2.38 **

MARRAGE <=17 -2.8243 -10.26 ***

MARRAGE 18-20 -1.2445 -5.88 ***

ABILMISS 0.6676 2.79 ***

ABILMED 1.2971 5.62 ***

ABILHIGH 2.5067 7.68 ***

REPEAT -1.0707 -5.26 ***

SCQUAL 0.3588 6.53 ***

FEMALE -0.5572 -2.97 ***

R2

0.5157

N 1410


11.98

3

Appendix 3

Educational attainment function, without ‘age at marri age’ variables

Var iables Females Males

Coeff icient t-value coeff icient t-value

Intercept 3.367 4.31

*** 8.731 9.91

***

AGE 23-30 -0.305 -1.08

-1.952 -6.41

***

AGE 31-38 0.385 1.37

-0.620 -2.08

**

MUSLIM -0.543 -1.58

-1.473 -4.03

***

LOWCASTE -0.576 -2.04

** -0.297 -1.06

ACNSIB -0.039 -0.73

-0.016 -0.28

PAWEAL 0.246 4.41

*** 0.139 2.73

***

PAWEALSQ -0.006 -3.64

*** -0.003 -2.00

BKHOME 0.100 0.71

0.259 1.70

*

PANEWS 0.878 3.15

*** -0.009 -0.03

MAEDYRS 0.121 4.04

*** 0.050 1.45

PAEDYRS 0.135 4.23

*** 0.166 4.89

***

ACHEAL -0.609 -1.36

-0.636 -1.35

EDEQUAL 1.414 5.56

*** -0.110 -0.39

PAWHITE 0.916 3.45

*** 0.724 2.19

***

MAWORKED 0.089 0.21

-1.272 -3.39

***

ABILMISS 1.001 3.27

*** 0.774 1.96

**

ABILMED 1.258 4.69

*** 1.468 3.62

***

ABILHIGH 2.045 4.77

*** 2.980 5.85

***

REPEAT -0.565 -1.71

* -1.184 -4.48

***

SCQUAL 0.472 6.43

*** 0.320 3.94

***

R2

0.6110

0.4223

N 641 769


11.61 12.29

Date post:	20-Apr-2020
Category:	Documents
Upload:	others
View:	3 times
Download:	0 times

The gender gap in educational attainment in India: How much...

Documents