WPS6055 Policy Research Working Paper 6055 Gender, Geography and Generations Intergenerational Educational Mobility in Post-reform India M. Shahe Emran Forhad Shilpi The World Bank Development Research Group Agriculture and Rural Development Team May 2012 Policy Research Working Paper 6055 Abstract India experienced sustained economic growth for more geography. The evidence shows that family background than two decades following the economic liberalization plays a strong role; the estimated sibling correlation in in 1991. While economic growth reduced poverty India in 2006 is higher than the available estimates for significantly, it was associated with an increase in Latin American countries. There is a persistent gender inequality. Does this increase in inequality reflect deep- gap in rural and less-developed areas. The only group seated inequality of opportunity or efficient incentive that experienced substantial improvements is women structure in a market oriented economy? This paper in urban and developed areas, with the lower caste provides evidence on economic mobility in post-reform women benefiting the most. Almost 70 percent of the India by focusing on the educational attainment of variance in children’s education can be accounted for by children. It uses two related measures of immobility: parental education and geographic location. The authors sibling and intergenerational correlations. provide possible explanations for the apparently puzzling The paper analyzes the trends in and patterns of improvements for urban women in a country with strong educational mobility from 1992/93 to 2006, with a son preference. special emphasis on the roles played by gender and This paper is a product of the Agriculture and Rural Development Team, Development Research Group. It is part of a larger effort by the World Bank to provide open access to its research and make a contribution to development policy discussions around the world. Policy Research Working Papers are also posted on the Web at http://econ.worldbank.org. The author may be contacted at fshilpi@worldbank.org. The Policy Research Working Paper Series disseminates the findings of work in progress to encourage the exchange of ideas about development issues. An objective of the series is to get the findings out quickly, even if the presentations are less than fully polished. The papers carry the names of the authors and should be cited accordingly. The findings, interpretations, and conclusions expressed in this paper are entirely those of the authors. They do not necessarily represent the views of the International Bank for Reconstruction and Development/World Bank and its affiliated organizations, or those of the Executive Directors of the World Bank or the governments they represent. Produced by the Research Support Team Gender, Geography and Generations: Intergenerational Educational Mobility in Post-reform India M. Shahe Emran1 IPD, Columbia University Forhad Shilpi DECRG, World Bank JEL Codes: O12, J62 Key Words: Intergenerational Mobility, Education, Equality of Opportunity, Sibling Correlation, Intergenerational Correlation, Economic Liberalization, Rural-Urban Inequality, Gender Gap, India 1 We would like to thank Will Martin, Giovanna Prennushi and Debdulal Mallick for comments on an earlier draft. Emails for correspondence: shahe.emran@gmail.com (M. Shahe Emran) and fshilpi@worldbank.org (Forhad Shilpi). (1) Introduction The increasing inequality in income distribution at a time of considerable economic growth during the last couple of decades has rekindled interests in intergenerational mobility in both developed and developing countries.2 Following wide ranging economic liberalization in the early 1990s, India experienced sustained high economic growth; per capita GDP grew at a 4 percent rate over the two decades after liberalization. The evidence indicates that while growth led to a significant poverty reduction, it was also associated with a rise in inequality (World Bank (2011)). 3 There is increasing concern that the benefits of economic growth were not shared broadly, and remained especially concentrated in urban areas, thus widening the rural-urban gap (Bardhan (2007, 2010), Dreze and Sen (2011), Basu (2008), Prasad (2012)).4 The estimates of top incomes by Banerjee and Piketty (2005) show that the share of top 0.01, 0.1, and 1 percent in total income has increased substantially from a trough in the mid1980s, and this increase coincided with the move away from ‗Socialist‘ to more market oriented economic policies. According to their estimates, in 1999-2000, per capita income gap between the 99th and 99.5th percentiles was four times as large as the gap between the median and the 95th percentile. Another recent study finds that between 1996 and 2008, the wealth holding of the Indian billionaires increased from 0.8 percent of GDP to 23 percent, before declining to 14 percent in 2010 (Walton (2010)).5 2 Among the developing countries, China and India are two prominent examples where impressive economic growth has been accompanied by an increase in inequality. For a discussion on rising inequality in Asia, see Jushong and Kanbur (2012). The recent decline in intergenerational mobility in USA and UK has also attracted a lot of attention; see, for example, Deparle in New York Times (January 4, 2012) and Mazumder (2012) on USA, and Dearden et al. (1997) and Blanden et al. (2005) on UK. 3 For evidence on rising inequality in India after 1991, see Ravallion (2000), Deaton and Dreze (2002), Sen and Himanshu (2004). A recent survey of the available evidence shows that consumption inequality has increased slightly, but the income inequality in India is much higher than what is usually thought of (close to Brazil) (World Bank (2011)). It is now widely appreciated that the available estimates of consumption and income inequality may be significantly biased downward, because the household surveys fail to cover the top income households. 4 Dreze and Sen (2011) argue that Indian economic reform has been an ―unprecedented success‖ in terms of economic growth, but an ―extraordinary failure‖ in terms of improvements in the living standards of general people and social indicators. 5 The volatility in the wealth of billionaires reflects the volatility in the stock market. The common perception about a significant increase in inequality is reinforced by spectacular conspicuous consumption by the super-rich: Mukesh Ambani, the chairman of Reliance Industries in India owns and lives in the first billion dollar house in the world (Woolsey, M, Forbes.com, April 30, 2008), and in the mega wedding of two sons of Subrata Roy, the ‗chief 2 However, the relevant question is whether the observed increase in cross-sectional inequality is a natural outcome of efficient incentive structure in a liberalized and market oriented economy that rewards hard work and entrepreneurial risk taking, or it is primarily due to inequality of opportunity due to differential access, for example, to education and markets. The rise in cross-sectional inequality becomes a serious concern when it is primarily a result of inequality of opportunity, i.e., the inability of children born in poorer families and disadvantaged social groups to move beyond their parents‘ position in economic ladder by their own effort and choices.6 An immobile society may require policies, public investments and reforms to ensure both efficiency and equality of opportunity.7 Understanding the trends in and the levels and patterns of intergenerational mobility during the post liberalization period has thus become important for academics and policy makers (Bardhan (2010), Banerjee and Piketty (2005)).8 This paper provides evidence on intergenerational economic mobility in India during the post liberalization period by focusing on the educational attainment of children. Education is used as an indicator of economic status in the absence of suitable data on permanent income. 9 There is a broad consensus in the literature that education is among the most important avenues for poor to escape from poverty traps and climb up the economic ladder (for recent surveys, see Orazem and King (2008), Strauss and Thomas (1995)). The role of education may be especially important in post- reform India where growth has been concentrated in skill intensive sectors: the software industry and call centers being iconic examples (Kochhar et al. (2006), Bardhan (2010), Kotwal et al. (2011)).10 The goal of this paper is to analyze the trends in and levels and patterns of educational mobility over guardian‘ of Sahara Group, $ 250,000 was spent on candles alone (Srivastava, S, BBC online, February 11, 2004)! The popular perception that rural areas in India have been largely left out of the recent economic growth is, at least partly, shaped by the reports of farmers‘ suicides, among other things. 6 Higher inequality of opportunity is likely to lead to a higher cross-sectional inequality (Atkinson (1981)). Many observers believe that inequality in India reflects inequality in opportunity. For example, Basu (2008) comments that ―A certain amount of inequality may be essential to mitigate poverty….But the extent of Inequality in India seems to be well above that‖. 7 In an immobile society, many high ability children from poor families may not be able to go to school and thus fail to realize their productive potential. High inequality can also lead to political instability. 8 For a broader discussion on the importance of equity in economic opportunity for development, see Equity and Development (World Development Report (2006)). 9 Reliable data on children‘s and parents‘ income over the life cycle are not available in a developing country such as India. As emphasized in the recent literature, one needs good quality income data over a number of years at appropriate phase of the lifecycle to tackle the attenuation bias in the estimated intergenerational correlation in income (Solon (1999), Mazumder (2003)). The analysis of intergenerational persistence in income in India is also complicated by the fact that a majority of population especially during parent‘s generation were engaged in family farming as self-employed workers making it difficult to attribute income to individual members. 10 This is in contrast to the Chinese experience where growth has been dominated by agriculture and labor intensive manufacturing. Bardhan (2010) and Datt and Ravallion (2010) emphasize low and unequal human capital as an important constraint on poverty reduction in India. 3 a period of almost a decade and a half after the liberalization in 1991 (1993-2006), with a special focus on possible gender and spatial differences (rural vs. urban and developed vs. less-developed states). We use two related measures of educational immobility: (1) sibling correlation in educational attainment and (2) persistence in educational attainment across parents and children. The standard approach to the study of intergenerational educational mobility is to estimate the parent- offspring association in educational attainment.11 It has, however, been well appreciated in the literature that the influence of family background on children extends much beyond what is implied by parent‘s education (Corcoran et al. (1990), Mazumder (2008, 2011, 2012), Bjorklund, Lindahl and Lindquist (2010), Bjorklund and Salvanes (2010)). Sibling correlation is a much broader concept that provides a summary measure of all common family and community background factors that affect child outcomes but are not chosen by children themselves. 12 A significantly higher sibling correlation implies greater influence of family and community backgrounds on economic outcomes, which in turn indicates that the role one‘s own effort and choices can play is limited. To the best of our knowledge, there is no study in the literature that exploits estimates of both sibling correlations and intergenerational correlation to trace out the levels, trends in and patterns of intergenerational mobility in a developing country. There is now a large and mature literature on intergenerational economic mobility in developed countries, most of which focuses on intergenerational correlation between parents‘ and children‘s incomes (for reviews, see Solon (1999, 2002), Black et al. (2010)).13 However, economic analysis of intergenerational mobility in the context of developing and transitional countries remains a largely unexplored area of research (among the available contributions, see Jalan and Murgai (2008), Hnatkovska et al. (2011), Dahan and Gaviria (2001), Emran and Shilpi (2011), and Emran and Sun (2011)). Also, the existing economic literature on sibling correlation in education focuses primarily on a set of developed countries that include USA, UK, Norway and Sweden. The only exception known to us is Dahan and Gaviria (2001) who provide estimates of sibling correlations for 16 Latin American countries. They find that El Salvador, Mexico, Colombia and Ecuador are the least mobile countries, with sibling correlation explaining almost 60 percent of the variation in 11 For a survey of this literature, see Black and Devereux (2010), Bjorklund and Salvanes (2010) for developed countries, and Hertz et al. (2009) for developing countries. 12 It is, however, important to recognize that parent-children correlation is not a proper subset of the sibling correlation in representing the effects of family background on economic outcomes. The intergenerational link between the parents and a child captures genetic similarities that may not be shared by the siblings, except for the identical twins. 13 See, among others, Arrow et al. (2000), Dearden et al. (1997), Mulligan (1999), Solon (1999, 2002), Birdsall and Graham (1999), Fields et al. (2005), Bowles et al. (2005), Blanden et al. (2005), World Development Report (2006), Mazumder (2003), Hertz (2005), Bjorklund et al. (2006), and Lee and Solon (2009). 4 educational outcomes. The available evidence on developed countries shows that factors common to siblings explain from 40 to 65 percent of variation in educational outcomes (Bjorklund and Salvanes (2010)). In contrast, intergenerational correlation between parents and children– the traditional measure of intergenerational persistence -- explains from 9 to 21 percent of variations in children‘s educational outcome. An interesting finding from these studies is that gender or geographic location (as measured by neighborhood effect) does not exert any significant influence on the intergenerational persistence in children‘s educational outcomes. Are gender and geography also largely irrelevant for educational mobility faced by children in developing countries? One can argue that the role of gender and geography might be much more prominent in a developing country such as India, because gender bias against women is more common and stronger, geographic mobility is lower, and many areas (especially rural) are not integrated with the urban growth centers because of underdeveloped transport infrastructure.14 On the other hand, the high tide of rapid economic growth in Indian economy for more than two decades may have lifted all the boats, improving economic mobility across the income distribution, irrespective of gender and geographic location. The data used in this paper come from the1992/93 and 2006 rounds of the National Family Health Survey (NFHS) in India. The first period of our sample nearly overlaps with the period of economic liberalization (1991-1992), and the second period is about 15 years after liberalization. For both survey rounds, our analysis focuses on the same age cohort (16 to 27 year olds) who constitute the bulk of new entrants into the labor force.15 To examine the spatial aspects in detail, the empirical analysis is done separately for families residing in different areas such as rural vs. urban areas and relatively developed vs. less developed states. To discern any possible gender bias, we implement the empirical analysis separately for male and female samples. We use the mixed effects model to estimate the sibling correlation. An advantage of this approach is that both the family and community level covariates can be included in the analysis to examine their relative influence on sibling correlation (Mazumder (2008), Bjorklund et al. (2011)). We examine the influence of two sets of covariates on sibling and intergenerational correlations: the first set relates to caste and religion of the 14 There is evidence that geographic location may be important for economic opportunities faced by households in developing countries. For example, Jalan and Ravallion (1999) show that there are geographic poverty traps in China. Emran and Hou (forthcoming) find that better access to markets increases household consumption in rural China in a significant way. They also find that the effects of domestic market centers are much larger than that of international market access. 15 Our conclusions, however, do not depend on this particular age cohorts. In the robustness checks section, we provide evidence using an alternative age cohort sample. 5 household which are identified as important determinants of educational attainment in India, and the second relates to the geographic location as measured by neighborhood fixed effect.16 Our estimates of sibling and intergenerational correlations suggest no significant change in the intergenerational persistence in educational attainment for a large proportion of the population in India from 1992/93 to 2006. Sibling (and intergenerational) correlations in our full sample have declined only marginally from 0.64 (0.57) in 1992/93 to 0.62 (0.54) in 2006 respectively. 17 However, this aggregate picture hides important gender and spatial differences. While the evidence indicates that the sibling correlation among men (brothers) has remained effectively unchanged (it increased slightly from 0.614 in 1993 to 0.624 in 2006), it experienced a moderate decline for women, (sisters) from 0.780 to 0.696. Geographic location is important, both in 1992/93 and 2006; the neighborhood effect accounts for about 40 percent of the sibling correlation among women and a third among men. In terms of geographic pattern, we find that sibling correlation remained essentially unchanged in rural areas and declined marginally in urban areas. The sibling correlation also declined slightly in the developed states, but increased in the less-developed states. Perhaps the most interesting trends and patterns emerge when we partition the data using both gender and geography. The sibling correlations among men (brothers) in rural areas and less-developed states have increased a bit, but the correlation has in fact declined marginally in urban areas and remained virtually constant in developed states. In contrast, the sibling correlations among women (sisters) registered a decline irrespective of geographic partitioning of the data. However, geography matters for women also, only the women in urban areas and developed states experienced substantial decline in sibling correlations. As a result, the gender gap in sibling correlation has disappeared in urban areas though it remains virtually unchanged in rural areas. We also find that among the urban women, it is the lower caste women who experienced the largest decline in the sibling correlation. The evidence on improvements in educational mobility of women is similar to the available evidence on China and Malaysia (see Emran and Sun (2011) on China and Lillard and Willis (1994) on 16 The recent evidence using NSS data shows that the influence of caste and religious identity on the strength of intergenerational link between parents and children has become much weaker (Hnatkovska et al. (2011)). Our results also show a similar pattern. It is however, important to note that the direct effect of lower caste and Muslim dummy remains significantly negative on children‘s educational attainment. 17 Note that a formal test of equality of the estimates in 1993 and 2006 rejects the null because of very small standard errors due to the large sample sizes (number of observations is 34000 in 1993 and 38000 in 2006). However, statistical precision is largely irrelevant here, because the difference in the numerical magnitude of the estimates between 1992/93 and 2006 is very small in most of the cases, suggesting the lack of any substantial change in intergenerational mobility over a period of almost a decade and a half of impressive economic growth. 6 Malaysia).18 The broad trends in and patterns of educational persistence discussed above are also observed in the estimates of intergenerational correlations in education between parents and children. In contrast to the evidence from developed countries, majority of the variations in sibling correlations in India can be explained by two factors: parental education and geographic location as measured by neighborhood effect. The estimates indicate that a decade and a half after the economic liberalization in 1991, the absolute magnitudes of sibling and intergenerational correlations in India in 2006 are still very large, larger than the available estimates for the Latin American countries (for sibling correlations) and Asian countries (for intergenerational correlations). The influence of family and community backgrounds is especially dominant for rural women: about 70 percent of variations in sisters‘ schooling levels can be explained by common family and community factors shared, but not chosen, by them. After more than two decades of impressive economic growth, a large proportion of Indian population experienced no significant change in their educational opportunity; place of residence and gender still play a large role in a child‘s educational attainment and thus his/her economic fortunes. The absence of a positive effect of economic growth on educational mobility, especially for men, is, however, not peculiar to the Indian experience following liberalization. Recent evidence shows that educational mobility of men in rural China has in fact worsened during the high growth post reform period (1988-2002) (Emran and Sun (2011)). The rest of the paper is organized as follows. The conceptual framework underpinning empirical work is described in section 2. Data and empirical strategy are elaborated in section 3. Section 4 organized in different subsections presents the main empirical results, and section 5 reports as set of robustness checks. Some preliminary conjectures for explaining the observed trends in and patterns of educational mobility in post-reform India are offered in section 6. The paper concludes with a summary of the findings. (2) Conceptual Framework Sibling Correlation (SC) For the estimation and interpretation of sibling correlations, we adopt a conceptual framework that has been utilized widely in the empirical literature on sibling correlations (see, Solon et al (1991), Bjorklund et al (2002), Bjorklund and Lindquist (2010), Bjorklund and Salvanes (2010), 18 The positive evidence on women may seem puzzling given the fact that son preference is prevalent in all three countries. We provide a set of explanations for the observed trend later in the paper. 7 Mazumder (2008) and (2011)). )). Let be the years of schooling of sibling j in family i. It can be expressed as: (1) where is a family component which is common to all siblings in family i and is the individual specific component for sibling j which captures j‘s deviation from the family component. Assuming these two components are independent, the variance of can be expressed as the sum of variances of the family and individual components as: (2) The sibling correlation in education then can be expressed as: (3) The sibling correlation depicts the share of variance of years of education that can be attributed to common family background effects. Thus sibling correlation can be thought of as a summary statistic measuring the importance of common family and community effects which includes anything and everything shared by the siblings. It is useful to distinguish among different types of family and community factors that are commonly experienced by siblings. The family level variables include observable factors such as parental education and occupation as well as unobserved factors such as common genetic traits, parental aspirations, child rearing ability and style, cultural inheritances and interaction among siblings. The community effects include factors such as school availability and quality as well as peer effects within the neighborhood. Though sibling correlation captures most of the family background influences, it does not capture all of them. For instance, genetic traits not shared by siblings, differential treatment of siblings and time dependent changes in family and neighborhood factors will show up in the individual component of outcome variance, though they might be part of family background. As a result, the estimate of sibling correlations can be taken as a lower bound estimate of the total influence of the common family background on children‘s education outcome (for a discussion on this point, see Bjorklund and Salvanes (2010)). Intergenerational Correlation (IGC) It is instructive to look at the difference between sibling correlations and intergenerational correlations as measures of intergenerational persistence in economic outcomes. The standard 8 regression model to estimate intergenerational correlation between parents and children can be written as: (4) where is the parental year of schooling in family i, and is the intergenerational regression coefficient. Because individual component in equation (1) is orthogonal to the family component, one can express the family component as: (5) where denotes family factors that are orthogonal to parental education. It follows from equation (5) that: where is the intergenerational correlation in education. The above equation shows clearly that sibling correlation is a broader measure of the impact of family background than the squared intergenerational correlation. Also, the intergenerational correlation parameter ( ) is different from intergenerational regression coefficient ( ). Estimating Equations To estimate the sibling correlations, we extend the regression model in equation (1) and specify the following mixed effects model: (6) where is a vector of control variables. To estimate the intergenerational correlation in education, we augment equation (4) to estimate the following regression specification where the education variables of both generations are standardized: (7) Equations (6) and (7) can be estimated as soon as vector is specified. Following Bjorklund et al. (2010) and Mazumder (2008, 2011), we take a sequential approach in introducing variables to vector. All regressions in this paper include age and/or gender dummy, the latter is added whenever relevant. In addition, we introduce two sets of explanatory variables sequentially.19 The first set 19 The NFHS 2006 dataset has more detailed information about some of the family background variables such as mother‘s age at first marriage, mother‘s health, domestic violence faced by mothers as well as birth order of 9 includes dummies for caste and religion. Evidence from India suggests that educational outcomes vary systematically across different caste and religion groups. Next, we add a village/neighborhood level fixed effect as a part of to capture any common community level factors faced by the children growing up in the same locality. A comparison of sibling correlations estimated using alternative specifications can shed light on the importance of caste and religion as well as geographic location as captured by the neighborhood effect.20 As noted in earlier studies (summarized in Bjorklund and Salvanes (2010)), if households are sorted across neighborhoods according to their attributes (well-off families living in better neighborhoods), then the estimate of neighborhood effect is biased upward. So the comparison will provide an upper bound estimate of neighborhood effect. In contrast, the estimate of intergenerational correlation can be biased upward (due to correlation in genetic traits) or downward (due to measurement error). We compare the estimated sibling correlations with the estimates of intergenerational correlations and neighborhood effects. This allows us to deduce the extent of sibling correlations that can be accounted for by the parent-child link and the neighborhood effect. The part of sibling correlations that remains unaccounted for by these two factors is mainly due to common family environment such as family structure (e.g. divorced/separated parents) and parental skills and patience in child rearing etc. Note that if the strong sibling correlation observed in the data is due mainly to intergenerational correlations in education and common neighborhood effects, then it indicates higher inequality in opportunities than if it were due to parents‘ child rearing skills. 21 Estimation Approaches The intergenerational correlation can be estimated by first using OLS regression for equation (7) to estimate the intergenerational regression coefficient β and then using the formula for intergenerational correlation that adjusts for the change in the variance in education: . For children. Our analysis indicates that many of these variables are influenced significantly by mother and her spouse‘s education level, and when we add parent‘s education in the regression, these variables lose much of their explanatory power. In this paper, we thus focus on caste and religion related variables which are mostly exogenous to parent‘s education. 20 This approach follows Mazmuder (2008, 2011) and Bjorklund et al. (2010). The basic idea is that if the estimated sibling correlation is primarily driven by factors such as neighborhood effects, caste and religion, then the estimate would decline significantly once these factors are included in the regression. 21 Bjorklund, Lindahl and Lindquist (2010) find a sibling correlation of around 0.21 for Sweden. Almost 70 percent of sibling correlation can be explained by parental involvement in school work and mother‘s patience (willingness to postpone benefits into the future and propensity to plan ahead). Intergenerational correlations in education as well as neighborhood effects are found to have small influence on sibling correlations. Sweden however is characterized by nearly universal access to quality education, generous child care assistance and low income inequality. 10 the estimation of sibling correlation in equation (6), the family and individual components need to be estimated. The available literature on sibling correlations relies on two alternative estimation methods. Mazumder (2006, 2011) uses the Restricted Maximum Likelihood (REML) method which has better small sample properties under the normality assumption. Bjorklund et al. (2010) instead utilize a mixed effects model to estimate the family and individual components. The procedure in Bjorklund et al. (2010) can be implemented as a two-step procedure similar to the method employed also by Solon et al (1991) and Bjorklund et al (2002). The weakness of this procedure is that its small sample properties are not well understood. We implemented both procedures. Given the large size of the samples used in this paper (more than 34,000 in 1993 and 38,000 in 2006), both procedures produce nearly identical parameter estimates, and for the sake of brevity, we report estimates from the procedure suggested by Bjorklund et al. (2010). The estimates using Restricted Maximum Likelihood are available from the authors. The estimates of sibling correlations presented in this paper are from the mixed effects model using Stata GLLAMM procedure. As noted before, all standard errors are corrected for clustering at the family level. 22 (3) Data and Empirical Issues The data for our analysis come from the National Family Health Survey (NFHS), 1992/93 and 2006. The NFHS is a large-scale and nationally representative survey of nearly all of Indian states.23 The main target group for this survey is women in their reproductive years. While both surveys followed similar sampling methodology, the surveys differ somewhat in terms of sample size and questionnaires. The NFHS 2006 used three separate questionnaires to interview 109,041 households, 124,385 unmarried and ever married women between 15 to 49 years of age and 74,369 unmarried and ever married men in the age group 15-54 years. The NFHS 1992/93 on the other hand collected information from 88,562 households and 89,777 ever-married women in the age group 13- 49 years. Data for our analysis are drawn from the household and women‘s questionnaires which are common to both surveys. While sample sizes of the NFHS are comparable to that of National Sample Surveys (NSS) in India, the data from NFHS offer two distinct advantages for our analysis. First, all children up to 17 years of age in the NFHS are matched to their co-resident parents (not just to the household head). This is particularly important in the Indian context where joint families are still common. This 22 For details of the estimation method, please see Rabe-Hesketh et al (2002). 23 The NFHS 1992/93 covered 24 states and Delhi (the Capital city) whereas 2006 survey covered all of the 29 states. The sample sizes in the NFHS are comparable to more widely used National Sample Surveys. 11 matching of children to parents allows us to estimate both sibling and intergenerational correlations for the same sample of children. The second advantage of NFHS is that education data are more detailed. Instead of reporting education level in discrete intervals as is done in the case of NSS, NFHS collected information on years of schooling for both parents and children, which facilitates more precise estimates of sibling and intergenerational correlations. To define the estimation sample, we follow the literature and restrict our sample to closely spaced young adult siblings between the age of 16 and 27 years. The argument for estimating sibling correlations from closely spaced siblings rests on the fact that there may be important changes in the family structure as well as shocks to family life over a longer time horizon diluting the already conservative estimate of family background on children‘s outcome. To check the sensitivity of our results, we report the estimates of sibling and intergenerational correlations for other age groups also. While the sample for estimation of the sibling correlations should ideally focus on co- resident children, it may bias the estimation of intergenerational link in education between parents and children. If, for example, among older children, the best educated ones tend to leave household earlier than less educated children, it may bias intergenerational regression coefficient β downward, but may not necessarily bias the estimate of intergenerational correlation. Because such exit of better educated children from the household would also reduce the variance in children‘s education, thus offsetting the decline in the intergenerational regression coefficients. The problem of not observing all of the children as co-residents in the household is more prevalent in the case of older age cohorts, particularly for women who usually leave their natal household upon marriage. In the case of women, if educated women delay marriage and we have better probability of observing them as co-resident children, then estimate of intergenerational correlations from our sample may be biased upward. On the other hand, if marriage timing follows birth order and there is a substantial birth order effect as reported in Black, Deverux and Salvanes (2005) and Booth and Kee (2009), then estimates from our sample will be more on the conservative side. We address the issue of non-coresident children in two ways. We keep all singleton households in the sample. This is likely to reduce the bias in the estimate of intergenerational correlations by allowing the two opposing factors discussed above to offset each other. This also improves the precision of the estimate of individual component in the case of sibling correlation. Second, we check robustness of our results by estimating both sibling and intergenerational correlations from a sample of younger age cohort (16-20 year) where possibility of having non-coresident children is lower. Note also that for older age cohorts, the co-residency pattern changes, as it is the parents who co- reside with children at old age. If parents tend to co-reside with better educated and well off children 12 as is the usual custom in developing countries, then intergenerational regression coefficients for older cohorts will be biased upward. Tracking the same younger age cohort [16-27] between years has the added advantage that our estimates are comparable and are not un-duly influenced by changes in co- residency pattern over the life cycle. While we are not aware of any paper that provides direct estimates of sibling and intergenerational correlations in education for India, some indirect evidence on intergenerational persistence in education can be found in two recent studies. Our empirical approach, however, differs in some important ways from that of the existing studies. Jalan and Murgai (2008) use the NFHS 1998/99 data to estimate intergenerational regression coefficients for different age cohorts.24 Hnatkovska, Lahiri and Paul (2011) examine the probability of children having a different level of education compared with their parents among the socially dis-advantaged Scheduled Caste/Tribes relative to rest of the population using different rounds of NSS data. 25 In contrast to Jalan and Murgai (2008), we track influences of family background and parental education directly for the same age cohort between 1992/93 (year immediately following economic liberalization) and 2006 (15 years after liberalization). For the reasons mentioned above, we restrict our sample to younger age cohort (16-27 years), whereas the sample used in Hnatkovska et al. (2011) consists of 13 to 65 year olds. As noted before, our main sample consists of all children in the age group 16-27 who are co-resident with the mother.26 Estimation was carried out for all children and separately for brothers and sisters. Since an important objective of our study is to uncover spatial differences in intergenerational mobility, we also estimate the sibling and intergenerational correlations for sub- samples defined on the basis of geographical location such as rural and urban areas, and developed and less developed regions/states. The number of observations for different sub-samples is reported 24 The estimated regression coefficients are in general different from the intergenerational correlations that take into account the changes in the variance of the children‘s education. Trying to uncover trends in intergenerational correlations on the basis of estimates from different age cohorts is problematic when co-residency pattern of children and parents changes over life cycle. As noted above, the coefficients tend to be underestimated for younger age-cohorts in the presence of birth effect in education and tend to be over-estimated when parents co-reside with better educated children. Thus intergenerational regression coefficients may suggest a spurious decrease in intergenerational persistence across age cohorts simply due to changes in co-residency pattern over the life cycle. 25 Hnatkovska et al. (2011) do not estimate intergenerational correlations directly. Instead they regress the probability of education switching (defined as children having different education level than parents) on scheduled caste and scheduled tribe (SC/ST) dummies for various rounds of NSS data between 1983 and 2005. The magnitudes of coefficients of SC/ST dummy are then compared to find the trend in intergenerational persistence among SC/ST compared with non-SC/ST population. 26 For NFHS 2006, there is also a sub-sample of children who are co-resident with father, but information on their mother is not available. We repeated our estimation procedure for this extended sample. Main conclusions are similar to those from our main sample. NFHS 1992/93 did not administer the male questionnaire, and so does not have this additional sub-sample. For comparability of results, we restrict both 2006 and 1992/92 samples to those who are co-resident with mother. 13 in Table 1. The samples for all children consist of 34,585 observations in 1992/93 and 39,562 observations in 2006. The average numbers of children per family are 2.35 in 1992/93 and 1.98 in 2006. The shares of singleton families in our sample of 16 to 27 years olds are 25 percent in 1992/93 and 36 percent in 2006. More than a third of the families have two children in both survey years. About 63 percent and 59 percent of our total samples are brothers in 1992/93 and 2006 respectively. As reported in Table 1, sample sizes for different sub-samples are considerable, the smallest sample size being 2208 for sisters in less-developed states in 1992/93. The large sample sizes ensure precision of our estimates of sibling and intergenerational correlations for both survey years. Summary statistics from our main samples are presented in Table 2. The education levels of both boys and girls improved between 1992/93 and 2006. Average education of boys increased from 7.63 years in 1992/93 and 8.76 years in 2006. The gains in girls‘ education were more dramatic: it increased from 6.9 years in 1992/92 to 8.67 in 2006. As a result, the gap between boys and girls has narrowed considerably between these two survey years.27 A similar trend can be detected in mother and father‘s education as well though the gender gap in the parent‘s generation remained substantial. Average education of father increased from 5.33 years to 6.43 years between the two survey years, while that of mother increased from 2.63 years to 3.75 years. The improvements in years of education were associated with a decline in the standard deviation of education levels between the survey years. Consistent with international evidence in Hertz et al. (2009), the variances of education levels are higher in parent‘s generation compared with the kids in both the survey years. This decline in variance implies that relying on intergenerational regression coefficient to understand intergenerational mobility may be misleading. The summary statistics for the rural sample are also reported in Table 2. As expected, average education levels are lower in rural areas compared with our full sample. Consistent with national trends, average years of schooling have increased for both boys and girls in rural areas. The gender gap in education has also narrowed though the gap is still larger in rural areas compared with our full sample. Summary statistics for other sub-samples also confirm improvements in education attainment of children during this period. The trends in education levels reported here are consistent with those reported in other studies (ASER reports, World Bank (2011)). In addition to education levels, Table 2 provides summary statistics for age and caste and religion composition of our sample. Overall, the samples from two years appear to be comparable to each other in terms of age and caste-religion composition. 27 Similar convergence in educational attainment between boys and girls over the reform period is observed in China (see, for example, Behrman et al. (2008)). 14 (4) Empirical Results Equations (6) and (7) form the basis of empirical estimation of sibling and intergenerational correlations respectively. To estimate the individual and family components of equation (6), we followed the two-step procedure suggested by Bjorklund et al. (2011).28 Unless otherwise noted, all standard errors are clustered at the family level. All sibling pairs are given equal weights in all estimation results presented in this paper. (4.1) Results from the Full Sample Table 3 reports the results for the full sample. The sibling and intergenerational correlations estimated from our simplest specification of equations (6) and (7) are reported in panel A. In this simplest specification, age dummies are introduced to control for children‘s age, and in the ‗all children sample‘, a female dummy to account for gender difference in education level. The sibling correlation is estimated to be 0.642 in 1993 which declines slightly to 0.616 in 2006. Both of these parameters are estimated with great precision (t-statistics greater than 95). The estimates imply that the influence of the factors common to siblings on their educational attainment is very high (more than 60 percent) and has remained remarkably stable over more than a decade. Interpreting it from a different angle, the estimates of sibling correlations suggest that individual effort and other idiosyncratic factors account for less than 40 percent of variations in schooling years, both in 1992/93 and 2006. The absolute magnitude of the sibling correlation in 2006 is quite high, higher than the available estimates for Latin American countries including Brazil and El Salvador.29 The third row in Table 3 reports the estimates of the intergenerational correlations between children and parents in education. We define the parent‘s education variable as the maximum of 28 Equation (6) can be estimated directly (without the two step procedure) using Stata GLLAMM procedure when the set of control variables is small. However, it becomes unmanageable in the case where we introduce neighborhood fixed effects. For the sake of comparability, we report results from the two step procedure in this paper. The results from single step estimation do not differ from that of two step procedure when applied to specifications that does not include neighborhood level fixed effects. 29 The highest estimate is 0.60 among 16 Latin American countries, for El Salvador (Dahan and Gaviria (2001). Among developed countries, sibling correlations are found to be highest in USA. The estimates range between 0.6 (Mazumder (2008) for biological siblings in the same household for age cohort born during 1957-1969 and 0.63 (Conley and Glauber(2008) for siblings with same biological mother for age cohort 1958-76). The average estimate for Nordic and European countries is around 0.4 (see Bjorklund and Salvanes (2010)). 15 father‘s and mother‘s years of schooling. We, however, note that the results and conclusions in this paper are not sensitive to alternative definitions of parental education such as average of mother‘s and father‘s years of schooling. The intergenerational correlations reported in panel A are estimated from a simple specification that controls only for age and gender. The estimates for all children show a slight decline in intergenerational correlations between two survey years: it declined from 0.574 in 1992/93 to 0.540 in 2006. The absolute magnitude of intergenerational correlation for India is, however, much larger than the average for other Asian countries reported by Hertz et al (2009) (average=0.39).30 Among 10 Asian countries covered by Hertz et al. (2009), only Indonesia has intergenerational correlation in education (0.55) which is comparable to that for India. 31 (4.1.1) Gender and Intergenerational Mobility in Education To understand any possible gender bias in the intergenerational educational mobility, we report estimates of sibling correlations for brothers and sisters separately in columns 3 to 6 of Table 3. The estimates show that while the sibling correlation among men (brothers) did not change perceptibly between 1992/93 and 2006, it experienced a moderate decline in the case of women (sisters). The estimated sibling correlations are: 0.614 (1992/93) and 0.624 (2006) for men and 0.780 (1992/93) and 0.696 (2006) for women. Compared with men, the magnitude of sibling correlation among women is thus significantly higher in both survey years. This is in contrast with evidence from developed countries where there is no significant gender differences in sibling correlations (Bjorklund and Salvanes (2010)). Despite the moderate decline from 1992/93, the estimate for women in 2006 (0.696) is well above the upper bound estimates for sibling correlations among women found in developed and Latin American countries.32 We also analyze the trend in intergenerational correlations between parents and children across gender (columns 3-6, Table 3). The intergenerational correlations for men remained stable (0.541 in 1992/93 and 0.523 in 2006), but for women, it declined moderately from 0.622 to 0.559 between the two survey years, 1993 and 2006. Consistent with our findings regarding sibling 30 While intergenerational correlations for India are estimated for 16-27 age cohorts, the estimates in Hertz et al are for adults in age range 20-69 years. As noted by Hertz et al (2009), with increase in the level of education for younger cohorts, the intergenerational correlations for younger cohorts have either become smaller or not change at all. In that sense, our estimates for the intergenerational correlations for India are likely to be on the conservative side. 31 The intergenerational correlations in Latin American countries are higher than that of India. The average for 7 Latin American countries in Hertz et al (2009) is 0.60. 32 The estimates of sibling correlation among sisters for developed countries fall within the range [0.46-0.6]. Only one study reported a significant difference in sibling correlations between brothers and sisters for USA (Conley and Glauber (2008)). 16 correlations, intergenerational correlations are stronger for women than men in both the years. The higher intergenerational persistence observed for women compared with men in India is consistent with the recent findings on intergenerational economic mobility from other developing countries. For instance, Emran and Shilpi (2011) find occupational immobility (as measured by correlations between parents and children) to be much higher for women in Nepal and Vietnam. As discussed in the conceptual framework above, the square of intergenerational correlation provides an estimate of the share of total variance in schooling that can be explained by parent‘s education alone. The estimates (5th row in Table 3) show that parent‘s education alone can explain between 27 to 29 percent of variations in years of education for men (brothers) and 31 to 39 percent variations for women (sisters). In contrast, for developed countries, parental education explains only 10 to 20 percent of total variations in schooling years (see Bjorklund and Salvanes (2010)). (4.1.2) Role of Caste and Religion A potentially important determinant of educational attainment in India is the caste and religious identity of a household. Studies on education in India show that the average level of education is much lower among children from socially disadvantaged scheduled caste (SC) and scheduled tribes (ST) (Jalan and Murgai (2008), Kajima and Lanjouw (2006), Aslam et al. (2011)). In the next specification of our regressions, we include dummies for SC, ST and other backward castes. We also include a dummy for households whose head is a Muslim, as Muslim are among the most economically lagging groups in India (Sachar Committee (2006), World Bank (2011)). The effects of the caste and religion dummies on the estimated sibling correlation is minimal; the estimates in panel B of Table 3 are only slightly smaller compared with those reported in panel A. The inclusion of caste and religion dummies also does not affect the magnitudes of intergenerational correlations in any significant way. The results thus suggest that sibling and intergenerational correlations do not vary across caste groups in any significant way in both of the survey years, 1993 and 2006. This is consistent with the findings in Hantskovska et al. (2011) which reported a convergence of intergenerational persistence in education across castes during last three decades in India. The conclusion that the sibling and intergenerational correlations do not depend in any significant way on caste or religious identity is also supported by the estimates from the sub-samples based on caste and religion. The only exception is the urban women, where lower caste women experienced significantly higher mobility compared to the upper caste women (see section 4.2.3 below). 17 (4.1.3) Role of Geographic Location: Neighborhood Effect As noted in the introduction, a focus of this study is to analyze the potential spatial aspects of intergenerational educational mobility in India, and whether the role of geography has changed over time in the post-reform period. A simple but powerful way to gauge the importance of geographic location is to include neighborhood fixed effects in the estimating equations, and then compare the estimates of sibling correlations and intergenerational correlations with and without the neighborhood fixed effect. Note that the fixed effect captures all the factors shared by the children growing up in a neighborhood which include peer effects and school availability and quality, among other things. Panel C of Table 3 presents the estimates that include neighborhood fixed effects where neighborhood is defined as the sample cluster (PSU). Our full samples include 3799 and 3400 such clusters (PSUs) in 2006 and 1993 respectively. The results show that geographic location as measured by PSU level fixed effect matters a lot for intergenerational mobility in education. The estimates for sibling correlations become substantially smaller when neighborhood fixed effects are taken into account: the sibling correlations in the full sample decline from 0.642 to 0.395 in 1992/93 and from 0.616 to 0.385 in 2006. The implied neighborhood correlations (after netting out caste and gender effects) are 0.23 in 1993 and 0.20 in 2006 for the full sample, 0.22 (1993) and 0.19 (2006) for men, and 0.30 (1993) and 0.29 (2006) for women. These estimates of neighborhood correlations are substantially larger than those found for developed countries (Bjorklund and Salvanes (2010)). 33 The neighborhood correlations account for nearly a third of sibling correlations among men and 40 percent of that among women. This can be interpreted as strong evidence in favor of geographic location as a first order mediating factor for the influence of family background on education of children.34 The estimates of sibling correlations in panel C of Table 3 can be considered to be lower bound estimates of family background‘s influence on children‘s educational outcomes (net of caste and religion, and neighborhood effects), because the estimates of neighborhood effects are biased upward due to sorting of similar families in a neighborhood. These lower bound estimates imply that about 40 percent of variations in children‘s education can be explained by family background (net of neighborhood, caste and religion) alone. For women, the net influence of family background declined 33 The largest estimate for neighborhood correlation is 0.15 for USA (Solon et al. (2000)). 34 The relatively larger role of location for women probably reflects lower geographic mobility among them. 18 from 0.46 to 0.39 between 1992/93 and 2006, whereas it increased slightly for men from 0.38 to 0.41 over the same period. The inclusion of neighborhood fixed effects reduces the estimates of intergenerational correlations between parents and children also. But compared with sibling correlations, the magnitudes of the reductions are much smaller. For instance, estimates for men (brothers) declined from 0.52 to 0.48 in 1992/93 and 0.49 to 0.45 in 2006. After controlling for the neighborhood fixed effects, the estimates of intergenerational correlations indicate that more than 20 percent of variations in total schooling of children and nearly a third of sibling correlations in education can be explained by parent‘s education alone. To provide a sense of relative importance of different factors in explaining the variations in children‘s education, we use estimates from Table 3 and plot them in Figure 1. Figure 1 show that in 2006, individual effort and other idiosyncratic factors explain about 40 percent of variations in education outcomes of men, and about 30 percent for women, the rest are due to common factors experienced by siblings. The sibling correlation can be decomposed into three components: (i) parental education, (ii) geographic location (i.e., neighborhood effect), and a residual family environment shared by siblings which presumably capture the parental child rearing skills, among other things. The neighborhood effect and parent‘s education together can explain more than 70 percent of sibling correlations. The common neighborhood factors and parental education are particularly important for sisters (women): their share in sibling correlations is large -- 0.81 in 1992/93 and 0.78 in 2006. For men, contribution of these two factors to sibling correlations decreased from 0.78 in 1992/93 to 0.69 in 2006 due mainly to decrease in the neighborhood correlations (Figure1). For women, intergenerational persistence has declined but neighborhood correlations remain nearly unchanged. (4.2) Geography of Educational Mobility: Evidence from Alternative Partitioning of the Data The evidence on strong neighborhood effects in sibling and intergenerational correlations discussed above brings the focus on geographic location as an important factor in understanding educational mobility in post-reform India. This raises the question whether the levels, time trends and gender patterns of sibling and intergenerational correlations differ significantly across different geographic areas; for example, are there any significant differences between rural and urban areas, between less developed and more developed states? The recent academic literature and reports in popular press in India give a strong impression that the rural areas and certain lagging states such as 19 Bihar and Uttar Pradesh (UP) have been largely bypassed by the positive effects of economic liberalization and strong economic growth that followed (World Bank (2011)). In this subsection, we provide additional analysis of the role of geographic location in intergenerational educational mobility. (4.2.1) Rural vs. Urban Areas This section presents results from estimation of sibling and intergenerational correlations for families living in rural and urban areas separately. The estimates of sibling and intergenerational correlations for rural and urban areas are reported in Tables 4 and 5 respectively. Consistent with the format of Table 3, we represent estimates from three different specifications of equations (6) and (7) in three panels (A, B and C) of Table 4 and 5. These specifications correspond exactly to the specifications in Table 3 and are not discussed here again for the sake of brevity. The sibling and intergenerational correlations for all children and for men are larger in magnitudes in urban areas compared with rural areas (Tables 4 and 5). For instance, for all children (men), the sibling correlation is 0.579 (0.556) in rural areas compared with 0.664 (0.652) in urban areas in 1992/93. The corresponding intergenerational correlations are 0.482 (0.465) and 0.572 (0.553) in rural and urban areas respectively. For women in 1992/93, there is practically no difference in sibling correlations between urban and rural areas (both approximately=0.74), though intergenerational correlation is higher in urban areas (0.593 vs. 0.523). Between 1992/93 and 2006, intergenerational correlations remained nearly unchanged for both men and women in rural areas. There was a marginal decline in the sibling correlation for rural women in the same period, but the sibling correlation among rural men increased slightly. For men in urban areas, both sibling and intergenerational correlations remained effectively unchanged between 1992/93 and 2006. In contrast, sibling and intergenerational correlations have decreased significantly for urban women during the same period. As a result, gender difference in sibling and intergenerational correlations effectively vanished in urban areas, though it is still significant in rural areas. Using estimates from Tables 4 and 5, we decompose the total variance in children‘s education into individual and common ‗family background‘ components. The family background component is further decomposed into three separate parts accounted for by parental education, common neighborhood environment and other common family factors. The relative contributions of these different factors to total variance of children‘s education are plotted in Figures 2 and 3. Consistent with results from full sample, influences of parent‘s education and common neighborhood 20 factors are important in both rural and urban areas, accounting for more than 60 percent of sibling correlations for men and 65 percent for women. The contribution of common neighborhood factors to variance in education is larger for rural women whereas parental education is relatively more important for both men and women in urban areas. Overall, common family backgrounds are perhaps most important factor for rural women for whom less than 30 percent of variations in education can be explained by individual effort, choices and other unobserved idiosyncratic factors. To summarize, though the influences of parental education and common family background are smaller in magnitude in rural areas, there has been little or no progress in mobility in education. The largest improvement in educational mobility has been experienced by urban women while men experienced effectively no improvement regardless of their location. We find common neighborhood environment and parental education as the most important source of sibling correlations in both urban and rural areas. The influences of common neighborhood factors are particularly important for rural women. (4.2.2) Less Developed vs. Developed Regions Living standards in India vary widely across states. The incidence of poverty among poorer states in India is amongst the highest among developing countries. On the other side of the spectrum, many states such as Punjab have low poverty rates that are comparable to richer countries (e.g. Turkey) (World Bank (2011)). The NFHS 1992/93 identifies the ―backward‖ districts.35 The NFHS 2006 does not identify any district because of confidentiality considerations with respect to AID/HIV testing results. The states where the most backward districts are located can be matched between the two surveys. The backward districts in 1992/93 are located in five states: Bihar, Madhya Pradesh, Rajasthan, Uttar Pradesh and West Bengal. These states are among the poorest in terms of income in 1993/94 (four of them belong to the so-called BIMARU), and also suffer from poor educational attainment and infrastructure indicators (Kingdon (2007), Deaton and Dreze (2002)).36 We take the 35 The Ministry of Health and Family Welfare, Government of India, has defined backward districts as those having a crude birth rate of 39 per 1,000 population or higher, estimated on the basis of data from the 1981 Population Census. 36 Note that there may not be a one to one correspondence between backward districts in our sample and backward states according to income and education indicators, because there are backward districts even in developed states. The estimates of Head Count Ratio by Deaton and Dreze (2002) show that Rajasthan has a lower incidence of poverty, but the evidence reported in Kingdon (2007) shows that it lags in terms of schooling indicators, even though it has made substantial strides in primary schooling. West Bengal, on the other hand, suffers from higher poverty, but has better schooling indicators. As noted by Deaton and Dreze (2002), West Bengal belongs to the same ―slow growth‖ group with the BIMARU states 21 districts in these five states as less developed regions with rest being lumped under the developed category. The estimation results for these samples are reported in Tables 6 and 7. The results in Tables 6 and 7 show that the sibling correlations in 1992/93 are similar across developed and less-developed states for both men and women. Intergenerational correlations in education are, on the other hand, higher in the less-developed states, especially for women. The trends in sibling and intergenerational correlations for less developed states are also different from the national trends discussed in earlier sub-sections. For men in less-developed states, the trend is unambiguously a worsening one; both sibling and intergenerational correlations have increased between 1992/93 and 2006. In relatively developed states, both correlations remained stable for men over the same period. For women, both sibling and intergenerational correlations decreased in both regions, but the decline in more developed region was much larger in magnitude (sibling correlations declined from 0.764 to 0.660 in developed states compared with 0.781 to 0.741 in less-developed states). We also computed the relative contributions of parental education and common neighborhood factors to variance of children‘s education. Both of these factors are important, accounting for much of the sibling correlations in both regions. Parental education has greater influence on children‘s educational outcomes in less developed region compared with relatively developed states. The results are available from the authors. (4.2.3) Caste and Educational Mobility: Does Geography Matter? Our main results presented in Tables 3-7 suggest no substantial differences in sibling and intergenerational correlations across caste groups. Readers may be curious if this conclusion holds true across geographical areas and gender groups. Table 8 reports sibling correlations for different caste status for men, women and all children across rural and urban areas and developed and less developed states. The estimates of sibling correlations are slightly smaller in magnitudes for lower caste men and all children samples in all of the four regions. This is mostly true for women as well with the exception of urban areas in 1992/93. Consistent with our earlier results, the sibling correlations for men and all children remained stable between 1992/93 and 2006 in all four regions. Though sibling correlations among women of both upper and lower castes declined in both urban and rural areas, the decline is substantial only in urban areas. The decline is particularly large for low caste women (from 0.77 to 0.56) compared with upper caste women (from 0.72 to 0.66). There are also interesting differences in the trends for women in developed and less developed states. In less 22 developed states, sibling correlations declined slightly for upper caste women but remained stable for lower caste women. But in developed states, women from both caste groups experienced substantial decrease in sibling correlations. (5) Robustness Checks We check the sensitivity of our estimates in two ways. As mentioned before, as children become old, they tend to leave the parental household, because of marriage (especially for girls), jobs and higher education. If children who exit household are better educated, then it may bias the estimates of intergenerational correlations. For instance, if there is substantial and negative birth order effect on children‘s education, and marriage timing follows birth order, then our estimate of intergenerational correlations may be underestimated. On the other hand, if better education among women delays marriage, the bias would be in the opposite direction. To check out sensitivity of our estimates, we repeat our entire analysis for younger age cohorts [16 to 20 years]. The possibility of having children exiting household at this age cohort is much smaller than that for 16-27 year age cohort. Table 9 reports the results for the full sample. For intergenerational correlations, we find no significant differences in the estimates for any of the sub-samples reported in Table 9 from those reported in Table 3 for any of the survey years. The changes in the sibling and intergenerational correlations between 1992/93 and 2006 implied by Table 9 are similar to those implied by Table 3. Consistent with Table 3, the estimates from Table 9 suggest large gender differences in sibling and intergenerational correlations. It also highlights the importance of parent‘s education and common neighborhood environment in explaining the sibling correlations. We omit the results from regional analysis for this younger age cohort to save space, but overall conclusions from our analysis based on 16-27 year age cohort hold true for 16-20 year age cohort too. The estimates of sibling correlations for the age cohort [16 to 20 years] are larger in magnitudes both for men and women (see Table 9). This is consistent with the evidence in literature which finds higher sibling correlations among closely spaced children compared with widely spaced children (Bjorklund and Salvanes (2010)). Such higher correlations arise from the fact that for more widely spaced children, family background may change substantially over time. The results presented above took the maximum of father and mother‘s education as the relevant metric of parental educational attainment. A reader might wonder if the conclusions reached earlier depend on this specific definition of parental education. To allay such concerns, we use the 23 average of father and mother‘s education as an indicator of parental education and re-estimate all of the regressions. The results in Table 10 show that if anything, the magnitudes of estimates are slightly larger in this new formulation. Thus our estimates of intergenerational correlations presented in the previous tables can be taken as conservative estimates of the effects of family background. The gender and geographic patterns and trends also remain unchanged with this alternative definition of parental education. Our main empirical results indicate that sibling and intergenerational correlations in education in India remained largely unchanged over a period of almost a decade and a half (1993- 2006) after the economic liberalization in 1991. The only group that experienced significant decline in the sibling and intergenerational correlations are women in urban areas. These results seem to contradict the evidence presented by Jalan and Murgai (2008) who find substantial improvements in educational mobility in India over time. They use 1998-99 NFHS data and find that the magnitude of the intergenerational regression coefficient declines substantially for younger age cohorts. We discussed earlier the pitfalls in relying on cohort based analysis when data consists of only the co- resident children. In fact, Jalan and Murgai (2008) are well aware of the limitations of the cohort analysis and discuss many of the same points we raised earlier. Our results thus can differ on two grounds: (i) intergenerational correlations take into account the declining variance in education in children‘s generation, and (ii) we compare the same age cohorts (16-27 years) across two surveys, instead of relying on the different age cohorts in a single survey round. It is, however, important to check if we get estimates similar to those of Jalan and Murgai (2008) from a cohort based analysis. We replicate the analysis in Jalan and Murgai (2008) and report the estimated intergenerational regression coefficients for both rural and urban areas for three age cohorts [15-19, 20-24 and 25-29 year] in Table 11. It is interesting that the estimates show a declining effect of parental education for the younger age cohorts for both the survey years which is consistent with the estimates in Jalan and Murgai (2008). This seems to justify the worry that the estimates for younger cohorts may be biased downward. As noted before, the estimated intergenerational regression coefficients are likely to be smaller for the younger cohorts simply because of the fact that some of the children have not completed their education. Also, since geographic mobility has increased over time, and better educated children usually migrate first, the downward bias in the estimate would be more pronounced in younger cohorts. On the other hand, estimates tend to be biased upward for older age cohort when parents co-reside with better educated children. This highlights the need for using separate survey rounds and multiple measures to understand the trends in intergenerational persistence given the data constraints in developing countries. 24 (6) Toward an Understanding of the Trends in and Pattern of Educational Persistence The objective of this study is to provide robust evidence on the trends in and pattern of educational mobility in post-reform India with special emphasis on the roles played by gender and geography. In other words, the goal here is to help establish the ―facts‖ about educational mobility in India over a period of a decade and a half after extensive economic liberalization. In this section, we attempt a first pass at understanding the observed trends in and pattern of educational mobility in post-reform India delineated in earlier sections. We, however, hasten to add that our discussion is only a small step in a major research program that needs to be undertaken to understand the nature of educational mobility in post-reform India. We also emphasize a caveat widely understood in the literature that although the estimates of sibling and intergenerational correlations are important for tracing out the changes in educational mobility over time, they do not imply causality. Among other things, the literature has emphasized the difficulties in causal interpretations because of correlations in genetic endowment (ability) and preference among the siblings and also between the parents and children (see, for example, Bjorklund and Salvanes (2010)).37 We, however, note that the changes observed over time are not likely to be driven primarily by changes in genetic correlations among siblings and between parents and children, as a decade and a half is a short span of time for any significant changes in genetic correlations. Thus when one observes large changes over a relatively short period of time, as we do in the case of women in urban areas, for example, it is more likely that they reflect changes in the ‗environmental factors‘ in the household and the community. The evidence in this paper can be helpful in narrowing down the search for potential causal factors. It thus constitutes an essential first step to policy relevant economic analysis of educational mobility. Our results indicate that the focus of a causal analysis of the observed educational persistence should primarily be on the geographic location, parental education and their correlates. The large impact of geography including the neighborhood effect points to the importance differences in school availability and quality and access to urban markets (returns to education). The importance of 37 It is, however, important not to push the distinction between ―nature‖ and ―nurture‖ too far, because there are important interactions between the two, a point emphasized in the Behavioral Genetics literature (see, for example, Plomin et al. (2001)). For interesting discussions on the limitations of the nature vs. nurture debate, see Goldberger (1979) and Manski (2011). 25 parental education, on the other hand, suggests credit constraints and role model effects as potential causal channels.38 The recent literature has underscored the importance of schooling expansion and returns to education as major factors in determining trends in educational mobility (and more generally economic mobility including income mobility).39 The evidence indicates that educational mobility improves when government invests heavily in educational infrastructure to ensure access at low costs. The lack of a significant improvement in educational mobility in post reform India thus immediately raises the issue of access to schools and its quality. After the independence, India began its journey with a low level of literacy and limited schooling infrastructure. In 1951 only 9 percent of women and 27 percent of men were literate. The number of primary schools grew from 215036 in 1951 to 641615 in 2001. By 2002, 87 percent villages had a primary school within 1 km. The effects of the recent expansion in primary schooling on educational mobility will probably become evident in the coming years. The number of secondary schools is, however, much smaller; there was only 1 secondary school for every 5 primary schools in 2002. Private schools have become increasingly important in India, especially in the urban areas (World Bank (2006)). The evidence also indicates that many public schools are plagued with teacher absence and fail to offer quality education and thus the learning outcomes are very poor (ASER Report (2006), Das and Zajonc (2010)). The growth in private schooling has taken place more in those places where public school quality is poor. While the recent expansion of primary schooling has been successful in achieving near universal enrollment, improvements beyond primary schooling remain limited. The returns to secondary and tertiary education have experienced the most increase, but inequality in access to secondary schooling remains high. 40 The increasing role of private schools and private tutoring has raised concerns about inequality in educational opportunity (see, for example, Kingdon (2007)).41 A private market for education can be especially inequalizing in a developing country such as India where the credit market is underdeveloped in general, and the student loan market is almost non-existent. 38 For evidence that parent‘s may be important role models, especially for women in Nepal, see Emran and Shilpi (2011). 39 See, for example, the discussion on the role of inequality in access of higher education and increasing wage premium for higher education in explaining the observed decline in mobility in UK by Blanden et al. (2008). 40 Inequality of access measured as the difference between the top and bottom quintiles of the income distribution. See World Bank (2006). 41 Banerjee et al. (2007) show that private tutoring improves learning outcomes in India. For an analysis of the role of private schools in decreasing mobility in UK, see Green et al. (2010). 26 (6.1) Rural-Urban Gap: Understanding the Higher Correlations in Urban Areas The magnitude of sibling and intergenerational correlations are in general larger in urban areas. This is true in 1993 for both men and women, also for men in 2006. It seems puzzling, because the schooling infrastructure and financial sector are expected to be more developed in urban areas. However, there are a number of factors that may help explain the observed higher persistence in educational attainment in urban areas compared to the rural areas. Most of the schools, both primary and secondary, in rural areas are public schools and thus tuition free. The public primary schools also provide mid-day meals. The absence of tuition costs and provision of mid-day meals help the poorer households (parents with lower education) to send their children to schools. Moreover, the private market for supplementary tutoring is not developed in rural areas. A private market for quality tutoring could potentially give an advantage to the children of richer (and more educated) parents, creating inequality in educational opportunities. The above factors combined together weaken the link between parental income and children‘s educational attainment in the rural areas. In contrast, there has been dramatic growth in private schools and supplementary private tutoring in urban areas in India in last couple of decades (Kingdon (2007), World Bank (2009)). According to one estimate the share of enrollment in private secondary schools in urban India was about 30-40 percent in 2002. Thus parental income and access to credit have become increasingly important in urban India for children‘s education, creating more prominent role for parental education and family background. This raises the worry that the inequality in access to education may accelerate in the urban areas in the coming years. Another important factor is the differences in returns to education. The available estimates for 1993/94 shows that while returns to primary education were higher in rural areas, returns to higher education were higher in urban areas (Duraisamy (2002)). The recent estimates indicate that the rural-urban gap in returns to education has increased after the liberalization (Aslam et al. (2011)). The returns to one more year of schooling in 2007 for self-employed is estimated to be 9.8 percent in rural areas and 34 percent in urban areas. For wage employment for men, it is 6.3 percent in rural areas, but 32 percent in urban areas. For female wage employment, the corresponding returns are 8 percent (rural) and 44 percent (urban) (see Aslam et al. (2011)).42 As noted by many observers, the economic growth in India after economic liberalization has been both skill-biased and urban-biased, 42 The skill biased nature of economic growth can also be seen from growth of wages across different schooling levels. The average wage in 1999/2000 for someone with a college degree was 73 percent higher than someone with high school degree, and 67 percent higher for someone with high school degree compared to someone with middle school education (based on NSS data). 27 driven by service sector growth including information technology (Kotwal et al. (2011), Bardhan (2010)).43 The higher returns to education in urban areas make the investment in children‘s education more attractive for all parents. The potential positive effect of higher returns to education on children‘s educational mobility is, however, counteracted by an important dynamic interaction between parental education and higher returns following liberalization. After the liberalization, the more educated parents could take advantage of the emerging opportunities in the urban labor market and they experienced higher income growth. The higher income allowed them to invest in children‘s education to reap the benefits of increasing returns to education. But the poor (and relatively uneducated) parents were less successful in taking advantage of the skill intensive growth process. Thus while the children of relatively educated parents in urban India continued to receive more and better education, the children of less educated parents failed to move beyond their parent‘s ranks, resulting in persistence between children and parent‘s education in the urban areas. Our evidence suggests that this is especially true for men in urban areas, but the experience of urban women requires additional explanations as they had substantial improvements in educational opportunities in the face of the forces discussed above. We turn to possible resolution to this puzzle in the next section. (6.2)The Curious Case of Urban Women Although the factors discussed above are expected to tighten the link between parental education and children‘s education in urban areas, the evidence in this paper shows that women in urban areas experienced substantial improvements in educational mobility from 1992/93 to 2006. This comes across as especially counterintuitive in the context of a country where son preference is strong. However, note that even though the sibling correlation among sisters has gone down the most over the sample period, even in 2006 the magnitude of both sibling and intergenerational correlations remain significantly higher for women, indicating lower educational mobility compared to men. A related important finding is that the lower caste women experienced a larger decline in sibling correlations compared to the upper caste women. This may seem doubly puzzling as in addition to gender bias the lower caste women face significant disadvantages both in social and market interactions. The apparently puzzling improvements in the educational mobility of women in urban areas during last two decades can be explained in terms of relevant economic and social forces. The urban 43 A substantial part of the readymade garments industry is located in large cities including Delhi and Bangalore. 28 parents in general experienced higher income growth after liberalization compared to the rural parents, thus they could afford to invest in the education of daughters to reap the benefits of high returns.44 Interestingly, contrary to the conventional view, son preference in education in fact implies that as incomes grow (and/or credit market access improves), parents find it acceptable to invest in daughters‘ education. To see this, note that given the high perceived returns to a son‘s education (family lineage, old age support, dowry, social prestige etc.), the parents try to invest in son‘s education even if they face poverty. They start to invest in lower return assets such as a daughters‘ education only when they have more income and/or face lower credit constraints. It is thus only natural that the urban parents began to invest more in girls‘ education when their income grew following the liberalization. As noted above, the returns to education for women in urban areas have increased substantially over the reform period which makes it more attractive to invest in daughters‘ education. The age at marriage for girls is also higher in urban areas, implying that the parents might be able to recoup some of the financial investment in the form of income support from working daughters before they get married.45 Also, there are indications that the trade-off between dowry and investment in education of daughters has started to tilt in favor of education in urban India, and thus parents might find investing in education as better option than accumulating savings for dowry (Mishra (2011)). Also, the force of the social norm against women‘s labor market participation is much weaker in urban areas, partly because parents are better educated and the life expectations of young women are influenced by peer effects and access to better information. The finding that lower caste urban women experienced more mobility compared to the upper caste women may in part reflect lower social constraints on their labor market participation. The caste difference can also be understood in terms of the insightful analysis of Munshi and Rosenzweig (2006) who find that the lower caste women were able to adapt better to the new occupations, as they are not expected to follow in their father‘s footprint. This ‗freedom by neglect‘ helps the daughters achieve better occupational mobility which is likely to feed into higher educational attainment. Using survey data from Mumbai, Munshi and Rosezweig (2006) find that the sons in lower caste families 44 As noted before, this is due to urban and skill biased nature of economic growth in post-reform India. 45 In the context of Malaysia, one explanation for higher educational mobility of daughters during the New Economic Policy is that parents invest in an older daughter‘s education in expectation that she in turn will finance the education of younger brothers by sending in remittances from high paying urban jobs. In an environment of low financial deepening and negative real interest rates, this strategy may make a good deal of economic sense, especially when the returns to women‘s education is high in urban areas, as is the case in India (see Lillard and Willis (1994) on Malaysia). 29 were channeled into local language schools with established parental network and entered into traditional parental occupations. The daughters, on the other hand, enrolled in English medium schools and were better prepared to take advantage of non-traditional jobs, especially those with a premium for English proficiency. A straightforward implication of the above discussion is that the same set of factors is likely to be responsible for lack of improvements in educational mobility for women in rural areas. Low parental income, low returns to education, lack of skilled jobs, and stronger social norms against women‘s participation in the labor market, all combined together can result in little or no improvements in educational mobility for rural women. Also, note that among the few socially- coveted jobs in rural areas for educated women are public sector jobs, for example, in schools and health clinics. However, hiring in public schools and health clinics was frozen or curtailed after 1991 liberalization as part of the fiscal reform, but there was no compensating private sector growth. This might have reinforced the disincentives for investing in daughter‘s education in rural areas. Conclusions The Indian economy grew at a robust pace since its economic liberalization in 1991and achieved significant reduction in poverty. At the same time, the evidence indicates an increase in inequality (World Bank (2011), Deaton and Dreze (2002), Datt and Ravallion (2010)). This paper examines the trends in and patterns of intergenerational mobility in education among new entrants in the labor force (16-27 year olds) between 1992/93 – a year immediately following the economic liberalization – and 2006 – nearly 15 years after liberalization. To the best of our knowledge, this is the first paper in the literature to employ both intergenerational and sibling correlations to study the evolution of educational mobility in a developing country. The empirical results indicate that educational mobility remained largely unchanged for a large proportion of Indian children after a decade and a half of high economic growth. Between 1992/93 and 2006, the only group that experienced significant improvements in educational mobility is women in urban areas and more developed states. We find that estimates of sibling and intergenerational correlations among men stayed almost the same over the reform period in urban areas and developed states, but may have increased slightly in rural and less developed regions. In contrast, the sibling and intergenerational correlations among women have declined irrespective of geographic location, but only women in urban areas and developed states have experienced a substantial decline. Interestingly, among the urban women, there are significant caste differences: 30 the lower caste women have experienced significantly better educational mobility compared to the upper caste women. Similar improvements in the educational mobility of women during a period of high growth and schooling expansion have been observed in other countries, for example, China and Malaysia. We discuss a number of conjectures for explaining the observed patterns in educational persistence in post-reform India which can motivate future analysis of intergenerational educational mobility in India. Although the trend in women‘s educational mobility, especially in urban areas and developed states, shows clear improvements, it is important to put the changes in perspective by looking at the magnitudes of the correlations. While the improvements have effectively eliminated gender gaps in urban areas, the magnitudes of the sibling correlations remain high in 2006 compared to other countries. For example, the sibling correlation for both men and women in 2006 is approximately 0.64 which is higher than the available estimates for Latin American countries including Brazil. In rural and less-developed regions, the gender gap remains substantial in 2006; for example, sibling correlation among women is 0.70 in rural areas and 0.74 in less developed region compared with 0.57 and 0.64 for men respectively. The high levels of educational persistence across generations are also evident in intergenerational correlations where the estimates for India are among the largest for Asian countries (Hertz. et al (2009)). In contrast to the evidence from developed countries, the majority of the variations in sibling correlations in India can be explained by two factors: parental education and geographic location as measured by the neighborhood effect. References Awakening Giants, Feet of Clay: Assessing the Economic Rise of China and India, Princeton University Press. 31 Bardhan, P (2007). ―Poverty and Inequality in China and India: Elusive Link with Globalization‖, Economic and Political Weekly, 42 (38). Basu, K (2008), ―The Enigma of India‘s Arrival: A review of Arvind Virmani‘s Propelling India: From Socialist Stagnation to Global Power‖, Journal of Economic Literature, 46 (2). Behrman, Jere R. and Paul Taubman (1986). ―Birth Order, Schooling and Earnings‖, Journal of Labor Economics, 4(3). Behrman, Jere, R. (1997). ―Mothers‘ Schooling and Child Education: A Survey‖. PIER Working Paper 97–025. Behrman, J., E. Hannum, M. Wang, and J. Liu (2008): Education in the Reform Era, in Brandt and T. Rawski Ed: China's Great Economic Transformation, Cambridge university Press. Birdsall, N and C. Graham (eds) (1999): New Markets, New Opportunities? Economic and Social Mobility in a Changing World, Brookings Institution. Bjorklund A and K. Salvanes. (2011). ―Education and Family Background: Mechanisms and Policies,‖ Handbook in the Economics of Education vol 3, EA Hanushek, S Machin and L Woessmann (es.), The Netherlands: North Holland, 2011, pp. 201-247. Bjorklund A, Lindahl L, Lindquist M. (2010). ―What More Than Parental Income, Education and Occupation? An Exploration of What Swedish Siblings Get from Their Parents.‖ B.E. Journal of Economic Analysis & Policy, 10 (1) (Contributions), Article 102. Bjorklund A, Jantti M, Lindquist M. (2009). ―Family background and income during the rise of the welfare state: trends in brother correlations for Swedish men born 1932-1968.‖ Journal of Public Economics, 93 (5-6), 671-680 Black, S. E. and P. Devereux (2010). ―Recent Developments in Intergenerational Mobility‖, Forthcoming Handbook of Labor Economics. Black, S. E., P. Devereux and K. Salvanes (2005). ―The More the Merrier? The Effect of Family Composition on Children‘s Outcomes‖. The Quarterly Journal of Economics, 120(2), 669– 700. Blanden, J, P. Gregg, S. Machin (2005). Intergenrational Mobility in Europe and North America, Center For Economic Performance Report, April, 2005. Blanden J., A. Goodman, P. Gregg and S. Machin (2004). Changes in Intergenerational Mobility in Britain, in M. Corak (ed.) Generational Income Mobility, Cambridge University Press. Blanden, J. P. Gregg and L. Macmillan (2008). Accounting for Intergenerational Persistence: Non- Cognitive Skills, Ability and Education, Economic Journal, 117, C43-C60. 32 Booth, Alison L. and Hiau Joo Kee ( 2009). ―Birth order matters: the effect of family size on educational attainment‖. Journal of Population Economics 22, 367-397. Conley, Dalton and Rebecca Glauber (2008). ―All In the Family? Family Composition, Resources, and Similarity in Socioeconomic Status‖. Research in Social Stratification and Mobility, 26, 297–306. Dahan, M and A. Gaviria (2001), ―Sibling Correlations and Intergenerational Mobility in Latin America‖, Economic Development and Cultural Change‖, Vol 49, no.3. Das, Jishnu and Zajonc, Tristan, 2010. " India Shining and Bharat Drowning: Comparing Two Indian States to the Worldwide Distribution in Mathematics Achievement,‖ Journal of Development Economics, Elsevier, vol. 92(2), pages 175-187, July. Datt, G and M. Ravallion (2010), ―Shining for the Poor, Too?‖ Economic and Political Weekly, February 13 2010. Dearden, L and S. Machin, and H. Reed (1997): Intergenerational Mobility in Britain, Economic Journal, January, pp.47-66. Deaton, A, and J. Dreze (2002). ―Poverty and Inequality in India: A Re-Examination‖, Economic and Political Weekly, Vol. 37, No. 36 (Sep. 7-13, 2002), pp. 3729-3748. Dreze, J and A. Sen (2011), ― Putting Growth In Its Place: It has to be but a means to development, not an end in itself‖, Otlook India.com, Nov 14, 2011. Duraisamy, P. (2002). ―Changes in the returns to education in India, 1983–94: by gender, age- cohort and location‖, Economics of Education Review, 21(6), 609–622. Emran, M. Shahe and F. Shilpi. 2011. ―Intergenerational Occupational Mobility in Rural Economy: Evidence from Nepal and Vietnam‖, Journal of Human Resources, 46(2), p.427-458. Emran. M. Shahe and Yan Sun (2011). ―Magical Transition? Intergenerational Educational and Occupational Mobility in Rural China: 1988-2002‖, Paper presented at American Economic Association Annual Conference, Denver, 2011. Fields, G, D. Grusky, and S. Morgan (Eds) (2005): Mobility and Inequality: Frontiers of Research from Sociology and Economics, Stanford University Press. Goldberger, A. (1989), "Economic and Mechanical Models of Intergenerational Transmission," American Economic Review, vol. 79, no.3, 504-513. Green, F, S. Machin, R. Murphy, and Y Zhu (2010), The Changing Economic Advantage From Private Schools, Working Paper. Hnatkovska, V., A. Lahiri and S. Paul. 2011. ―Breaking the Caste Barrier: Intergenerational Mobility in India,‖ Mimeo. 33 Hertz Tom, Tamara Jayasundera, Patrizio Piraino, Sibel Selcuk, Nicole Smith and Alina Veraschagina. 2007. ―The Inheritance of Educational Inequality: International Comparisons and Fifty-Year Trends‖. The B.E. Journal of Economic Analysis and Policy (Advances), 7(2), Article 10. Kotwal, A, B. Ramaswami and W. Wadhwa (2011). "Economic Liberalization and Indian Economic Growth: What's the Evidence?," Journal of Economic Literature, American Economic Association, vol. 49(4), pages 1152-99, December. Jalan, J. and R. Murgai. 2008. ―Intergenerational Mobility in Education in India‖, Manuscript, World Bank, Delhi. Jalan, J and M. Ravallion (1999), "China's Lagging Poor Areas," American Economic Review, vol. 89(2). Jushong, Z and R. Kanbur (2012), ―Confronting Rising Inequality in Asia‖, Chapter in Asian Development Outlook 2012, ADB. Kingdon, G (2007). "The progress of school education in India," Oxford Review of Economic Policy, vol. 23(2), pages 168-195, Summer. Kochhar, K., U. Kumar, R. Rajan, A. Subramanian and I. Tokatlidis. 2006. ―India‘s pattern of development: What happened, what follows?" Journal of Monetary Economics, 53 (5): 981– 1019. Lillard, L, and R. J. Willis, 1994. "Intergenerational Educational Mobility: Effects of Family and State in Malaysia," Journal of Human Resources, vol. 29(4), pages 1126-1166. Lindahl, L. 2008. ―A Comparison of Family and Neighborhood Effects on Grades, Test Scores, Educational Attainment and Income—Evidence from Sweden‖. Essay III in L. Lindahl, Family Background and Individual Achievement, dissertation no.75, Swedish Institute for Social Research. Lindahl L (forthcoming) ―Does the Childhood Environment Matter for School Performance, Education and Income? - Evidence from a Stockholm Cohort.‖ Journal of Economic Inequality. Manski, C (2011), ―Genes, Eyeglasses, and Social Policy‖, Journal of Economic Perspectives, Fall 2011. Mazmuder, B (2012), ―Is intergenerational economic mobility lower now than in the past?‖, Chicago Fed Letter, April 2012. 34 Mazumder, B. 2011. ―Family and Community Influences on Health and Socioeconomic Status: Sibling Correlations Over the Life Course,‖ The B.E. Journal of Economic Analysis & Policy: Vol. 11: Iss. 3 (Contributions), Article 1. Mazumder, B. 2008. ―Sibling Similarities and Economic Inequality in the US‖ Journal of Population Economics, 21, p. 685-701. Mishra, A (2011), INDIA: Degrees replace dowries for educated classes, University World News, November 27, 2011. Munshi, Kaivan and Mark Rosenzweig. 2006. "Traditional Institutions Meet the Modern World: Caste, Gender, and Schooling Choice in a Globalizing Economy," American Economic Review, vol. 96(4), pages 1225-1252, September. Orazem, P and E. King (2008). ―Schooling in Developing Countries: The Roles of Supply, Demand and Government Policy.‖ Chapter 55 in T. P. Schultz and John Strauss, eds. Handbook of development Economics, 2008. Amsterdam: North Holland. Pandey, P., S. Goyal and V. Sundararaman. 2008. ―Public Participation, Teacher Accountability and School Outcomes,‖ Policy Research Working Paper 4777, World Bank, Washington DC. Plomin, R, DeFries, J, P. McGuffin, G. McClearn, and (2001). Behavioral Genetics, Worth Publishers, New York, 4th Edition. Prasad, E (2012), ―A Clarion Call for Emerging Markets‖, Project Syndicate, February 14, 2012. Pratham. 2010. ―ASER 2009 – Annual Status of Education Report (Rural) 2009‖, New Delhi, Pratham Resource Center. Price, Joseph. 2008. ―Quality Time: The Effect of Birth Order‖ Journal of Human Resources, 43(1), 240-265. Raaum, Oddbjørn, Erik Ø. Sørensen and Kjell G. Salvanes. 2006. ―The Neighborhood Is Not What It Used To Be‖. The Economic Journal, 116(1), 278–300. Rabe-Hesketh, Sophia, Anders Skrondal and Andrew Pickles. 2002. ―Reliable Estimation of Generalized Linear Mixed Models Using Adaptive Quadrature,‖ Stata Journal, 2(1), 1-21. Ravallion, Martin (2000): ―Should Poverty Measures be Anchored to the National Accounts?‖, Economic and Political Weekly, 34 (35 and 36), 26 August, 3245-52. Sankar, D. 2008. ―What is the Progress in Elementary Education Participation in India during the Last Two decades? An Analysis of NSS Education Rounds,‖ South Asia Discussion Paper. 24,World Bank, Washington, DC. 35 Sen, A. and Himanshu (2004), ―Poverty and Inequality in India-II: Widening Disparities during the 1990s‖, Economic & Political Weekly, 39 Solon, Gary (1999). ―Intergenerational Mobility in the Labor Market,‖ in O. Ashenfelter and D. Card (eds.), Handbook of Labor Economics 3A, Elsevier, Amsterdam, North Holland. Solon, G, M. Page, and G. Duncan (2000). ―Correlations between Neighboring Children in Their Subsequent Educational Attainment,‖ Review of Economics and Statistics 82(3), 383-392. Solon, G, M. Corcoran, R. Gordon and D. Laren. 1991. ―A Longitudinal Study of Sibling Correlations in Economic Status,‖ Journal of Human Resources 26 (3), 509-534. Srivastava, S (2004), ―India mega-wedding justifies hype‖, BBC Online, February 11, 2004. Strauss, John and Thomas, Duncan, 1995. "Human resources: Empirical modeling of household and family decisions," Handbook of Development Economics, in: Hollis Chenery† & T.N. Srinivasan (ed.), Handbook of Development Economics, edition 1, volume 3, chapter 34, pages 1883-2023 Elsevier. Woolsey, M (2008), World's Most Expensive Homes: Inside the World's First Billion-Dollar Home‖, Forbes Com, April 30, 2008. Walton, M (2010). Inequality, Rents and the Long-run Transformation of India, Working Paper, KSG, Harvard University. World Bank (2011). Perspectives on Poverty in India: Stylized Facts from Survey Data, World Bank, Washington DC. 36 Table 1: Number of Observations for different samples All children Brothers (Men) Sisters (Women) 1993 2006 1993 2006 1993 2006 Full sample 34,585 39,562 21,895 23,625 12,690 15,937 Rural 22,308 20,191 14,510 12,247 7,798 7,944 Urban 12,277 19,371 7,385 11,378 4,892 7,993 Less developed Areas 7,136 11,055 4,928 6,917 2,208 4,138 Developed Areas 27,449 28,507 16,967 16,708 10,482 11,799 Table 2: Summary Statistics Full Sample Rural Sample 1992/93 2006 1992/93 2006 Mean Standard Mean Standard Mean Standard Mean Standard Deviation Deviation Deviation Deviation Children's Schooling Years All Children 7.36 4.51 8.72 3.92 6.28 4.44 7.71 3.94 Brothers 7.63 4.32 8.76 3.77 6.77 4.31 7.97 3.75 Sisters 6.90 4.77 8.67 4.13 5.36 4.54 7.32 4.18 Parent's Schooling Years Father 5.33 4.94 6.43 5.09 3.91 4.26 4.90 4.56 Mother 2.63 3.91 3.75 4.58 1.46 2.82 2.17 3.41 Parent's1 5.54 4.93 6.82 5.03 4.07 4.27 5.19 4.54 Children's Age (years) All Children 19.55 2.98 19.22 2.96 19.52 2.98 19.08 2.95 Brothers 19.91 3.07 19.67 3.12 19.87 3.07 19.55 3.12 Sisters 18.93 2.70 18.57 2.56 18.86 2.69 18.36 2.51 Caste and Religion Composition Proportion Scheduled Caste 0.11 0.31 0.17 0.38 0.12 0.33 0.18 0.38 Proportion Scheduled Tribe 0.12 0.33 0.11 0.31 0.16 0.36 0.16 0.37 Proportion Backward Caste - - 0.33 0.47 0.34 0.47 Proportion Muslim 0.17 0.37 0.16 0.36 0.19 0.39 0.12 0.33 Note: 1: Maximum of Father and Mother's schooling years 37 Table 3: Sibling and Intergenerational Correlations: Full Sample All children Brothers (Men) Sisters (Women) 1993 2006 1993 2006 1993 2006 PANEL A Sibling Correlation (SC) 0.642*** 0.616*** 0.614*** 0.624*** 0.780*** 0.696*** (110.901) (97.428) (71.203) (67.213) (89.719) (57.455) Intergenerational Correlation (IGC) 0.574 0.540 0.541 0.523 0.622 0.559 (107.42)*** (105.48)*** (84.35)*** (82.41)*** (82.91)*** (77.53)*** IGC squared 0.329 0.292 0.293 0.274 0.387 0.312 Proportion of SC explained by IGC 0.514 0.474 0.476 0.439 0.496 0.449 Controling for Caste and religion PANEL B Sibling Correlation (SC) 0.624*** 0.586*** 0.598*** 0.597*** 0.764*** 0.675*** (102.885) (87.493) (66.670) (61.089) (81.993) (51.828) Intergenerational Correlation (IGC) 0.551 0.505 0.521 0.491 0.594 0.522 (99.15)*** (94.70)*** (77.93)*** (74.12)*** (75.73)*** (69.17)*** IGC squared 0.304 0.255 0.271 0.241 0.353 0.272 Proportion of SC explained by IGC 0.487 0.435 0.454 0.404 0.462 0.404 Controling for Neighborhood FE PANEL C Sibling Correlation (SC) 0.395*** 0.385*** 0.375*** 0.406*** 0.460*** 0.389*** (45.036) (42.959) (29.361) (30.421) (21.848) (17.429) Intergenerational Correlation (IGC) 0.479 0.443 0.474 0.452 0.519 0.455 (97.66)*** (95.66)*** (76.56)*** (75.56)*** (70.65)*** (66.39)*** IGC squared 0.229 0.196 0.225 0.204 0.269 0.207 Proportion of SC explained by IGC 0.580 0.509 0.599 0.503 0.585 0.532 No. of observations 34,585 39,562 21,895 23,625 12682 15,937 Robust t statistics in parentheses. Standard errors corrected for clustering at family level * significant at 10%; ** significant at 5%; *** significant at 1% 38 Table 4: Sibling and Intergenerational Correlations: Rural Sample All children Brothers (Men) Sisters (Women) 1993 2006 1993 2006 1993 2006 PANEL A Sibling Correlation (SC) 0.579*** 0.569*** 0.556*** 0.571*** 0.746*** 0.698*** (72.583) (60.826) (47.598) (40.090) (57.813) (41.992) Intergenerational Correlation (IGC) 0.482 0.481 0.465 0.458 0.523 0.518 (69.33)*** (68.16)*** (57.70)*** (52.62)*** (48.72)*** (51.02)*** IGC squared 0.232 0.231 0.216 0.210 0.274 0.268 Proportion of SC explained by IGC 0.401 0.407 0.389 0.367 0.367 0.385 Controling for Caste and religion PANEL B Sibling Correlation (SC) 0.565*** 0.543*** 0.543*** 0.548*** 0.732*** 0.680*** (68.691) (55.757) (45.381) (37.074) (53.942) (38.511) Intergenerational Correlation (IGC) 0.464 0.450 0.449 0.428 0.500 0.484 (65.11)*** (62.26)*** (54.03)*** (48.13)*** (45.46)*** (46.44)*** IGC squared 0.215 0.203 0.202 0.183 0.250 0.234 Proportion of SC explained by IGC 0.381 0.373 0.371 0.335 0.341 0.345 Controling for Neighborhood FE PANEL C Sibling Correlation (SC) 0.379*** 0.327*** 0.361*** 0.355*** 0.480*** 0.396*** (35.604) (25.766) (23.351) (18.399) (18.073) (12.677) Intergenerational Correlation (IGC) 0.414 0.400 0.414 0.398 0.449 0.430 (69.31)*** (62.63)*** (54.39)*** (47.67)*** (48.21)*** (45.05)*** IG squared 0.171 0.160 0.171 0.158 0.202 0.185 Proportion of SC explained by IGC 0.452 0.490 0.474 0.446 0.420 0.466 No. of observations 22,308 20,191 14,510 12,247 7798 7,944 Robust t statistics in parentheses. Standard errors corrected for clustering at family level * significant at 10%; ** significant at 5%; *** significant at 1% 39 Table 5: Sibling and Intergenerational Correlations: Urban Sample All children Brothers (Men) Sisters (Women) 1993 2006 1993 2006 1993 2006 PANEL A Sibling Correlation (SC) 0.664*** 0.625*** 0.652*** 0.648*** 0.743*** 0.634*** (67.333) (68.340) (46.995) (51.552) (43.956) (30.248) Intergenerational Correlation (IGC) 0.572 0.526 0.553 0.534 0.593 0.508 (54.47)*** (65.49)*** (44.41)*** (54.60)*** (40.06)*** (45.63)*** IGC squared 0.327 0.277 0.306 0.285 0.352 0.258 Proportion of SC explained by IGC 0.493 0.443 0.469 0.440 0.474 0.407 Controling for Caste and religion PANEL B Sibling Correlation (SC) 0.649*** 0.588*** 0.638*** 0.614*** 0.730*** 0.600*** (62.871) (59.054) (44.492) (45.188) (41.263) (25.967) Intergenerational Correlation (IGC) 0.554 0.482 0.537 0.493 0.573 0.461 (51.19)*** (57.40)*** (41.89)*** (47.84)*** (37.64)*** (39.79)*** IGC squared 0.307 0.232 0.288 0.243 0.328 0.213 Proportion of SC explained by IGC 0.473 0.395 0.452 0.396 0.450 0.354 Controling for Neighborhood FE PANEL C Sibling Correlation (SC) 0.440*** 0.451*** 0.409*** 0.470*** 0.474*** 0.409*** (29.546) (36.931) (19.127) (26.581) (14.451) (13.173) Intergenerational Correlation (IGC) 0.488 0.441 0.483 0.461 0.531 0.431 (59.91)*** (68.37)*** (46.24)*** (55.09)*** (43.78)*** (44.44)*** IGC squared 0.238 0.194 0.233 0.213 0.282 0.186 Proportion of SC explained by IGC 0.541 0.431 0.570 0.452 0.595 0.454 No. of observations 12,277 19,371 7,385 11,378 4,892 7,993 Robust t statistics in parentheses. Standard errors corrected for clustering at family level * significant at 10%; ** significant at 5%; *** significant at 1% 40 Table 6: Sibling and Intergenerational Correlations: Less Developed Region Sample All children Brothers (Men) Sisters (Women) 1993 2006 1993 2006 1993 2006 PANEL A Sibling Correlation (SC) 0.609*** 0.627*** 0.607*** 0.637*** 0.781*** 0.741*** (47.152) (56.770) (34.284) (38.772) (34.933) (36.863) Intergenerational Correlation (IGC) 0.589 0.589 0.553 0.560 0.690 0.628 (54.94)*** (67.48)*** (42.86)*** (50.64)*** (42.49)*** (50.67)*** IGC squared 0.347 0.347 0.306 0.314 0.476 0.394 Proportion of SC explained by IGC 0.569 0.553 0.503 0.492 0.610 0.532 Controling for Caste and religion PANEL B Sibling Correlation (SC) 0.594*** 0.580*** 0.594*** 0.597*** 0.767*** 0.713*** (44.814) (47.923) (32.539) (33.837) (32.366) (31.123) Intergenerational Correlation (IGC) 0.571 0.533 0.536 0.512 0.665 0.564 (50.10)*** (55.29)*** (38.89)*** (42.73)*** (38.29)*** (39.22)*** IGC squared 0.326 0.284 0.287 0.262 0.442 0.318 Proportion of SC explained by IGC 0.548 0.490 0.484 0.439 0.577 0.446 Controling for Neighborhood FE PANEL C Sibling Correlation (SC) 0.362*** 0.407*** 0.377*** 0.433*** 0.472*** 0.496*** (19.365) (25.833) (14.565) (19.480) (8.581) (12.446) Intergenerational Correlation (IGC) 0.487 0.473 0.479 0.468 0.557 0.503 (46.74)*** (54.21)*** (36.91)*** (42.26)*** (33.56)*** (37.56)*** IGC squared 0.237 0.224 0.229 0.219 0.310 0.253 Proportion of SC explained by IGC 0.654 0.550 0.609 0.505 0.658 0.510 No. of observations 7,136 11,055 4,928 6,917 2,208 4,138 Robust t statistics in parentheses. Standard errors corrected for clustering at family level * significant at 10%; ** significant at 5%; *** significant at 1% 41 Table 7: Sibling and Intergenerational Correlations: Developed Region Sample All children Brothers (Men) Sisters (Women) 1993 2006 1993 2006 1993 2006 PANEL A Sibling Correlation (SC) 0.645*** 0.598*** 0.611*** 0.610*** 0.764*** 0.660*** (99.906) (75.443) (61.044) (51.623) (75.976) (43.135) Intergenerational Correlation (IGC) 0.565 0.521 0.533 0.506 0.609 0.537 (91.13)*** (83.66)*** (71.97)*** (65.58)*** (70.52)*** (61.79)*** IGC squared 0.319 0.271 0.284 0.256 0.371 0.288 Proportion of SC explained by IGC 0.495 0.454 0.465 0.420 0.486 0.437 Controling for Caste and religion PANEL B Sibling Correlation (SC) 0.626*** 0.572*** 0.594*** 0.587*** 0.747*** 0.639*** (91.841) (68.758) (56.851) (47.494) (68.971) (39.307) Intergenerational Correlation (IGC) 0.541 0.495 0.512 0.481 0.580 0.510 (84.59)*** (77.67)*** (67.07)*** (60.64)*** (64.75)*** (57.38)*** IGC squared 0.293 0.245 0.262 0.231 0.336 0.260 Proportion of SC explained by IGC 0.468 0.429 0.442 0.394 0.450 0.407 Controling for Neighborhood FE PANEL C Sibling Correlation (SC) 0.407*** 0.371*** 0.373*** 0.390*** 0.459*** 0.349*** (41.711) (34.240) (25.398) (23.250) (20.069) (13.474) Intergenerational Correlation (IGC) 0.478 0.436 0.471 0.448 0.522 0.448 (85.71)*** (79.98)*** (66.35)*** (62.96)*** (62.93)*** (55.79)*** IGC squared 0.228 0.190 0.222 0.201 0.272 0.201 Proportion of SC explained by IGC 0.562 0.512 0.595 0.515 0.594 0.575 No. of observations 27,449 28,507 16,967 16,708 10,482 11,799 Robust t statistics in parentheses. Standard errors corrected for clustering at family level * significant at 10%; ** significant at 5%; *** significant at 1% 42 Table 8: Sibling Correlations across Caste groups and Regions Upper Caste Lower Caste Upper Caste Lower Caste 1993 2006 1993 2006 1993 2006 1993 2006 Panel A Rural Urban All children 0.587*** 0.563*** 0.534*** 0.543*** 0.659*** 0.629*** 0.642*** 0.591*** (55.625) (48.229) (41.471) (32.664) (55.026) (59.619) (33.773) (30.884) Brothers 0.553*** 0.562*** 0.537*** 0.546*** 0.633*** 0.641*** 0.661*** 0.656*** (Men) (35.411) (31.366) (29.315) (22.032) (37.140) (43.496) (26.207) (27.689) Sisters 0.767*** 0.707*** 0.682*** 0.654*** 0.722*** 0.656*** 0.769*** 0.561*** (Women) (52.282) (35.779) (26.875) (20.919) (33.904) (27.628) (25.883) (12.762) Panel B Less Developed Developed All children 0.621*** 0.642*** 0.548*** 0.573*** 0.646*** 0.579*** 0.601*** 0.592*** (36.302) (52.480) (25.682) (24.866) (79.575) (58.267) (51.655) (42.165) Brothers 0.604*** 0.633*** 0.580*** 0.615*** 0.603*** 0.593*** 0.588*** 0.600*** (Men) (25.264) (34.683) (21.031) (20.156) (47.321) (40.425) (33.918) (28.148) Sisters 0.807*** 0.756*** 0.658*** 0.672*** 0.754*** 0.656*** 0.741*** 0.636*** (Women) (39.031) (34.100) (9.729) (14.554) (59.210) (35.342) (38.422) (22.358) Robust t statistics in parentheses. Standard errors corrected for clustering at family level * significant at 10%; ** significant at 5%; *** significant at 1% 43 Figure 1: Decomposition of Variance of Siblings' Years of Schooling: Full sample 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 1993 2006 1993 2006 1993 2006 All children Male Female Idiosyncractic Neighb'hood Corr Intergen Corr (Sqrd) Other Family Factors 46 Table 9: Sibling and Intergenerational Correlations: Full Sample (16-20 year age cohort) All children Brothers (Men) Sisters (Women) 1993 2006 1993 2006 1993 2006 PANEL A Sibling Correlations (SC) 0.688*** 0.638*** 0.697*** 0.644*** 0.856*** 0.724*** (49.188) (69.590) (31.073) (40.915) (47.494) (41.221) Intergenerational Correlations (IGC) 0.575 0.530 0.531 0.510 0.631 0.552 (78.28)*** (92.81)*** (55.90)*** (68.41)*** (59.55)*** (69.30)*** Proportion of SC explained by IGC 0.519 0.403 0.404 0.404 0.465 0.421 PANEL B Controling for Caste Sibling Correlations 0.672*** 0.607*** 0.685*** 0.615*** 0.845*** 0.703*** (45.672) (62.080) (29.305) (36.714) (43.284) (37.210) Intergenerational Correlations 0.552 0.493 0.512 0.472 0.601 0.514 (72.14)*** (82.88)*** (51.69)*** (61.17)*** (54.39)*** (61.54)*** Proportion of SC explained by IGC 0.453 0.400 0.383 0.362 0.427 0.376 Controling for Neighborhood FE PANEL C Sibling Correlations 0.404*** 0.391*** 0.458*** 0.412*** 0.537*** 0.410*** (15.817) (28.300) (11.375) (17.001) (8.153) (11.418) Intergenerational Correlations 0.508 0.438 0.490 0.442 0.554 0.460 (66.11)*** (79.35)*** (49.75)*** (59.15)*** (48.17)*** (59.27)*** Proportion of SC explained by IGC 0.639 0.491 0.524 0.475 0.571 0.516 13,140 27,966 8,098 15,274 12,683 No. of observations 5,042 Robust t statistics in parentheses. Standard errors corrected for clustering at family level * significant at 10%; ** significant at 5%; *** significant at 1% 44 Table 10: Intergenerational correlations: Full Sample (Parent‘s average education level) All India Rural Urban Less Dev. Developed All Children Region Region 1993 0.585 0.500 0.580 0.595 0.576 (116.49)*** (77.44)*** (58.47)*** (59.55)*** (98.87)*** 2006 0.549 0.493 0.539 0.599 0.528 (114.86)*** (74.55)*** (71.91)*** (71.88)*** (91.05)*** Brothers 1993 0.540 0.465 0.559 0.542 0.534 (89.23)*** (60.15)*** (47.52)*** (44.09)*** (76.42)*** 2006 0.528 0.458 0.547 0.563 0.511 (88.95)*** (55.69)*** (59.60)*** (53.49)*** (70.95)*** Sisters 1993 0.648 0.569 0.603 0.721 0.631 (90.82)*** (60.51)*** (42.57)*** (48.13)*** (77.03)*** 2006 0.573 0.545 0.522 0.645 0.547 (84.18)*** (57.98)*** (50.13)*** (53.71)*** (66.74)*** Robust t statistics in parentheses. Standard errors corrected for clustering at family level * significant at 10%; ** significant at 5%; *** significant at 1% Table 11: Intergenerational Regression Coefficients (β): By Age cohorts Rural urban 15-19 yr 20-24yr 25-29yr 15-19 yr 20-24yr 25-29yr 2006 Parent's education1 0.272 0.409 0.465 0.259 0.427 0.508 (39.73)*** (31.79)*** (20.10)*** (37.65)*** (36.50)*** (23.91)*** No. of observations 8963 4132 1632 7583 4284 1522 1993 Parent's education1 0.383 0.504 0.550 0.327 0.486 0.553 (46.63)*** (37.22)*** (23.35)*** (32.54)*** (30.88)*** (20.53)*** No. of observations 9709 5485 2124 4694 2889 1063 Note: 1: Maximum of Father and Mother's schooling years. Regressions include control for caste and religion, children‘s age dummies and state fixed effects. Robust t statistics in parentheses. Standard errors corrected for clustering at family level * significant at 10%; ** significant at 5%; *** significant at 1% 45 Figure 2: Variance Decomposition of Siblings Years of Schooling: Rural Sample 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 1993 2006 1993 2006 1993 2006 All children Male Female Idiosyncractic Neighb'hood Corr Intergen Corr (Sqrd) Other Family Factors Figure 3: Variance Decomposition of Sibling's Years of Schooling: Urban Sample 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 1993 2006 1993 2006 1993 2006 All children Male Female Idiosyncractic Neighb'hood Corr Intergen Corr (Sqrd) Other Family Factors 47