Methodologies to Evaluate Early Childhood Development Programs

December 2007

Acknowledgement

This paper was written by Jere R. Behrman,¹ Paul Glewwe,² and Edward Miguel.³ The authors would like to thank Harold Alderman and Markus Goldstein for comments on an earlier draft of this paper. This work was task managed by Markus Goldstein and financed by the Trust Fund for Environmentally and Socially Sustainable Development supported by Finland and Norway and by the Bank-Netherlands Partnership Program.

¹ Department of Economics and Population Studies Center, University of Pennsylvania: jbehrman@econ.upenn.edu
² Department of Applied Economics, University of Minnesota: pglewwe@apec.umn.edu
³ Department of Economics, University of California, Berkeley: emiguel@econ.berkeley.edu

The opinions reflected in this paper are the opinions of the authors and not opinions of their institutions.

TABLE OF CONTENTS

INTRODUCTION
  A. WHY EVALUATE EARLY CHILDHOOD DEVELOPMENT (ECD) INTERVENTION PROGRAMS?
  B. STEPS IN THE PROCESS OF ASSESSING ECD INTERVENTIONS
  C. THE CREDIBILITY OF AN INTERVENTION
I. ECD PROGRAM DIMENSIONS
  A. TYPES OF INTERVENTIONS
    Center-based ECD Programs
    ECD Education for Parents to Enhance Parenting and Child Stimulation
    Comprehensive ECD Programs
    Health Center Programs with Direct ECD Impacts
  B. POLICY MOTIVES AND BENEFICIARIES
  C. IMPACT INDICATORS
    Process and Program Implementation Indicators
    Short-Term Program Impact Indicators
    Medium- and Long-Term ECD Program Impact Indicators
  D. VALUING THE IMPACTS TO OBTAIN BENEFITS IN THE SAME METRIC AT THE SAME TIME
    Valuing the Impacts in a Common Metric
    Time - Program Age, Child Age, Duration of Exposure and Lags in Impacts
    Including All the Major Impacts but Avoiding Double-counting
  E. VALUING THE RESOURCE COSTS
    The Identification of Project Inputs
    Costing the Inputs for ECD Projects
    Other Project Costs
    Treatment of Some Specific Cost Items
II. FRAMEWORK FOR ESTIMATING IMPACTS OF ECD PROGRAMS
III. ESTIMATION ISSUES AND POSSIBLE RESOLUTIONS
    Sample Selection
    Instrumental Variable (IV) or Two-Stage-Least Squares (2SLS) Estimates
    Fixed Effects (FE) Estimates
    Propensity Score Matching (PSM) Estimates
    Construction of Standard Errors
IV. STRENGTHS AND LIMITATIONS OF ANALYSIS OF VARIOUS TYPES OF DATA
  A. SOME MAJOR CHARACTERISTICS PERTAINING TO DATA QUALITY
    Representativeness
    Power, Sample Size, and Sample Design
    Coverage of Relevant Variables
    Measurement Errors
    Human Subject Protection
  B. SOME MAJOR TYPES OF DATA FOR EVALUATING ECD INTERVENTIONS
    Cross-sectional Data
    Longitudinal or Panel Data
    Time Series of Cross-sectional Surveys
    Qualitative Data Sources
CONCLUSIONS
REFERENCES
APPENDIX TABLE A.1

Introduction

A. Why Evaluate Early Childhood Development (ECD) Intervention Programs?

Three recently published papers in a prominent 2007 series in Lancet summarize what is known about early childhood development in developing countries. Estimates are that over 200 million children in developing countries under five years of age fail to reach their developmental potential because of risk factors associated with poverty (Grantham-McGregor et al. 2007). These risk factors are characterized in particular as including stunting, inadequate cognitive stimulation, iodine deficiencies, and iron deficiency anemia; but it is claimed that the evidence is also sufficient "to warrant interventions for malaria, intrauterine growth restriction, maternal depression, exposure to violence, and exposure to heavy metals," (Walker et al. 2007, p. 145). Therefore, the third paper in this series concludes that "governments and civil society should consider expanding high quality, cost-effective early child development programmes," (Engle et al. 2007, p. 229).
Thus this series suggests that there potentially are considerable gains from expanded ECD interventions in developing countries.

As noted in the third of these papers, the interest in developing countries and in international development-oriented organizations in ECD programs has increased in recent years:

"Awareness of child development is increasing in developing countries. The health sector has advocated for early child development programmes for children with low birthweight, developmental delays, and from low-income disadvantaged environments. Child development information is often incorporated into growth monitoring charts. Government-supported preschool programmes for children are increasing; in the past 15 years, at least 13 developing countries have instituted compulsory preschool or pre-primary programmes. By 2005, the World Bank had financed loans to 52 developing countries for child development programmes, for a total of US$1680 million, at least 30 developing countries had policies on early child development, and UNICEF was assisting governments in supporting parenting programmes in 60 countries." (Engle et al. 2007, pp. 229-230)

But this summary of increased activity relating to ECD in developing countries concludes with a pessimistic evaluation of what is really known about the impacts of ECD programs in these contexts: "Despite this interest, there have been few systematic evaluations of early child development programmes in developing countries." (p. 230) The third paper reviews 19 evaluations of ECD interventions since 1990 that met six criteria: "(a) randomized controlled trial or matched comparison group; (b) intervention before age 6 years; (c) effectiveness or program evaluations (not efficacy trials); (d) child development assessed; (e) targeted disadvantaged children; and (f) developing country," (Engle et al. 2007, p. 232).
These interventions are summarized in Table 1.⁴ Only one of these ECD intervention evaluations included in this table is based on a national sample, and over a third are based on fewer than 10 communities. That this review found so few systematically evaluated ECD interventions in all the developing world over a decade and a half, and that many of these cases were based on very few communities, reinforces the point that there are likely to be high rates of return, in terms of knowledge and in terms of the foundation for policy formation, from expanded evaluation of ECD programs of different types in different developing country contexts.

In a nutshell, there are estimates that ECD problems are widespread in developing countries and increasing evidence that what happens in early childhood significantly affects options and productivities over the life cycle, but very little systematic evidence that the impacts of ECD programs are large or, more importantly, that the benefit-to-cost ratios of ECD interventions are high, particularly in light of the heterogeneous market, policy, and cultural contexts across developing countries that may limit the transferability of inferences from one context to another.
Therefore the returns to more systematic evaluations of the impacts of ECD interventions and of their benefit-to-cost ratios are potentially great, not only for those who already are persuaded that more resources should be devoted to ECD interventions in developing countries and who wish to make their case more persuasively, but also for those who are concerned more broadly about prioritizing resource allocations across what might seem to be a number of strong but difficult-to-compare alternatives, ranging from other human resource investments to physical infrastructure investments to policies that affect markets such as those for international goods and services and labor and capital flows.⁵ This chapter is devoted to discussions of how assessments of ECD interventions in developing countries can be improved and extended.

⁴ Appendix Table A.1 summarizes evaluations of a number of ECD programs in the United States.

⁵ The "Copenhagen Consensus" is a recent visible effort to establish priorities systematically among about 40 interventions in developing countries in ten broad topic areas (education, climate change, communicable diseases, conflicts, financial instability, governance and corruption, malnutrition and hunger, migration, sanitation and water, and subsidies and trade barriers), using benefit-cost ratios as guides, with an expert panel composed of eight prominent economists (half of whom have received the Nobel Prize in Economics) ranking the proposals (Lomborg 2004). ECD programs are not considered explicitly as a major topic area in this effort, though some interventions related to ECD are prominent components of some topic areas, particularly those related to hunger and malnutrition (Behrman, Alderman and Hoddinott 2004).

Table 1. Summary of 20 Available Systematic Evaluations of ECD Programs in Developing Countries Since 1990, Based on Engle et al. (2007, Tables 2 and 3)

No. | Country | Intervention | Age | Outcome Measures | Effect Size of Child Cognitive Measure | Sample Size for Evaluation | Scale**

I. Primarily Center-Based Programs
1 | Argentina | Increase in preschool places | 3-5 y | Third-grade mathematics and Spanish achievement tests | 0.23 | >125,000 | 3
2 | Bangladesh | Preschool run by NGO, feeding | 4.5-6.5 y | (1) Cognitive development from WPPSI-III; (2) School readiness; (3) Play Observation Scale | (1) 0.20-0.23; (2) 1.4; (3) 0.19-0.72 | 208 | 1
3 | Cape Verde | Formal preschool | 3-6 y | Cognitive development (Simplified Boehm Basic Concept Test) at 5 y | 0.29; 0.48* | 803 | 3
4 | Colombia | Day care center-based feeding and stimulation; 5 groups: food alone, and food + different time periods of stimulation, high SES control | 42-75 m | Stanford-Binet test initially | N/A | 333 children (170 at followup) | 1
5 | Guinea | Informal community-based early learning centers | 2-6 y | Cognitive development (Simplified Boehm Basic Concept Test) at 5 y | 0.33; 0.66* | 877 | 2
6 | Myanmar | Community-based ECD center and community support | 3-5 y | (1) Primary school pass rate; (2) Repetition rate for grade 1; (3) Test performance | N/A | (1) 3,484; (2) 1,880; (3) 268 | 2
7 | Nepal | Community-based ECD center (education and health) | 3-6 y | (1) Primary school pass rate; (2) Repetition rate for grade 1; (3) Annual drop out rate after 4 y | N/A | 935 | 2
8 | Vietnam | Center and home (education, parenting, nutrition) | 0-3 y for nutrition; 4-5 y for education | Raven's Colored Progressive Matrices at 6.5 to 8.5 y | 0.25 | 313 | 1

II. Parenting and Parent-Child Interaction Training
9 | Bangladesh | Parent groups that meet weekly for one year; mean attendance of 12 sessions (range 0-42); assessment 2 m after end of programme | 2-3 y | (1) Maternal knowledge; (2) HOME Scale and subscales; (3) Receptive vocabulary; (4) Weight/height; (5) Five preventative health behaviors; (6) Mother-child picture and puzzle task | (1) .31 mother knowledge; (2) .34 on HOME | 329 | 2
10 | Bolivia | Adult literacy programs and home visits (parenting, health, nutrition) | 24 m; some older | Psycho-social development (rating of 1 to 4; fine motor, gross motor, hearing and language, personal and social assessment) |  | 454 | 2
11 | Colombia | Nutritional supplement and/or a stimulation (home visit) program | Prenatal to 3 y, follow up at 6 y | Griffiths at 4, 6, 12, 18, 24, 36 m: locomotor, personal-social, speech & language, eye-hand coordination; Einstein scale applied through 18 m | N/A | 433 families | 1
12 | Jamaica | Home visits by roving caregivers (health, nutrition, parenting, income-generating) | 3-36 m | Griffiths Mental Developmental Scales | 0.5* | 163 | 1
13 | Jamaica | Home visits by health aides (parenting) | 9-30 m | (1) Griffiths Mental Developmental Scales; (2) Mothers' knowledge and practices of childrearing | 0.8* | 130 | 1
14 | Turkey | 3 (Center) x 2 (mother training) design; Center = educational, custodial, or none; Mother training = MT, NMT | 3-5 y | (1) School attainment; (2) School achievement; (3) WISC-R vocabulary test | (2) 0.45 | 217 families | 1

III. Comprehensive Programs for ECD
15 | Bolivia | Child care centers in home (feeding, health and nutrition monitoring, education) | 6-72 mo | Gross and fine motor skills, language and auditory skills, and psychosocial skills | 0.4-1.5 | 1,198 | 2
16 | India | Integrated childcare center; support for pregnant and lactating mothers, growth monitoring, feeding | 3-6 y | (1) Motor and mental development using WHO Milestones assessment; (2) Binet-Kamat IQ tests | N/A | 3,724 | 3
17 | Peru | Preschool and non-formal preschool | 3-5 y | Grades (A-C) in mathematics and language (Spanish) as assessed by the first grade teacher | N/A | 304 | 3
18 | Philippines | Home (family day care programs, home visits) | 0-4 y | ECD checklist of gross and fine motor skills, receptive and expressive language, socio-emotional skills, cognitive skills, and self-help skills | 0.5~1.8 | 6,693 | 2
19 | Uganda | Communication on ECD, child health days, village grants on nutrition, ECD centers | 0-6 y | (1) Ugandan version of the British Abilities Scale; (2) Parenting practices; (3) Nutritional status | N/A | 2,010 | 2

* Controlling for SES.
** Scale: 1 = coverage < 10 communities; 2 = coverage > 10 communities or district, but not national; 3 = national coverage.

Sources: 1. Berlinski, Galiani, and Gertler (2006); 2. Aboud (2006); 3. Jaramillo and Tietjen (2002); 4. McKay et al. (1978) and Pollitt and Escamilla (2007); 5. Jaramillo and Tietjen (2002); 6. Save the Children (2004); 7. Save the Children (2003); 8. Watanabe et al. (2005); 9. Aboud (2006); 10. Morenza et al. (2005); 11. Weber et al. (1981) and Super et al. (1990); 12. Powell et al. (2004); 13. Powell (2004); 14. Kagitcibasi, Sunar and Bekman (2001); 15. Behrman, Cheng and Todd (2004); 16. Vazir and Kashinath (1999); 17. Cueto and Diaz (1999); 18. Armecin et al. (2006) and Ghuman et al. (2006b); 19. Alderman and Engle (2007).

B. Steps in the Process of Assessing ECD Interventions

The major steps in the process of assessing ECD interventions (or most other interventions) include:

· ascertaining what are the objectives of the interventions and what are good indicators of the impacts and costs associated with those objectives;
· determining the critical characteristics of datasets needed to evaluate the interventions: baseline with longitudinal data with sufficient periodicity to capture dynamic effects of intervention and sufficient duration to capture longer-run effects of intervention, sample representative of some relevant larger population, measurement of critical impacts and ways of valuing those impacts and of resource costs, sufficient power to be able to identify impacts of a desired magnitude, establishment of comparable control as well as treatment group, sufficient information with which to be able to locate entities in subsequent survey rounds, sufficient human subject protection;
· determining and testing relevant survey instruments for households/individuals, for relevant communities and for service providers to be able to include good indicators of attainment of relevant objectives, possible other impacts, private and social resource costs and controls for confounding factors;
· undertaking data collection for households/individuals and for relevant service providers and communities with procedures that permit high response rates, low attrition, quick availability of data for analysis, internal consistency and validity checks;
· analyzing the data in a timely fashion beginning with careful examination of the data and then careful systematic analysis of impacts, the evaluation of those impacts to obtain measures of benefits in a common metric, and resource costs, all based on comparisons over time between treatment and comparable control groups within frameworks such as suggested below and using estimation techniques to deal with the possible estimation issues noted below and ideally with the distinction between private and social benefits and costs;
· communicating with relevant stakeholders the preliminary results of such analyses and obtaining feedback in order to improve the analyses; encouraging others to undertake independent analyses of the data;
· making available the revised results of the analyses and their interpretation for policies with sufficient information and data so that others can test to see to what extent the results are replicable and to explore how robust they are to different estimation strategies.

These steps are fairly straightforward and seem to reflect common sense. But for such analyses, the "devil is in the details." How useful the resulting evaluations of ECD programs are depends very much on the details of the research strategy and its implementation. The sections below address a number of the critical aspects of the systematic evaluation of ECD programs in greater detail.

C. The Credibility of an Intervention

The credibility of an intervention is likely to depend on a number of factors. But very important among them are the perceived quality of the evaluation of impacts of the intervention and the extent to which these impacts are positive and substantial relative to the resource costs of the intervention. If an ECD intervention is perceived to have been well-evaluated and to have substantial impacts relative to resource costs, then not only is the program likely to have stronger backing for maintained or increased allocations from national and international technocrats, but it also is more likely to fare well in the larger political economy arenas that ultimately shape resource allocations.

An informative example is the experience of the well-known Mexican anti-poverty human resource program PROGRESA/Oportunidades, which includes some ECD components as well as components focused on schooling and other human resources later in the life cycle.
Prior to the introduction of PROGRESA in 1997, evaluation of human-resource-related interventions in Mexico tended to be in-house and not very systematic (e.g., often without baseline data, with small and idiosyncratic samples, and without persuasive controls or even without controls at all), so they did not contribute much to assessments of the credibility of the interventions. This meant that the programs often were abandoned or left to wither when every six years a new president assumed office and wanted to introduce his own programs, even though for over six decades the presidents were always selected by their predecessors from leading politicians in the same political party (PRI, Partido Revolucionario Institucional). PROGRESA, in contrast, undertook evaluation with longitudinal data starting with a pre-program baseline with fairly substantial power (about 25,000 households) and with random assignment of initial treatment among the 506 communities in the evaluation sample. The controls were included in the program after two years, though that was not public knowledge, in an attempt to minimize "announcement" or "anticipation" effects. Numerous publicly available written reports on the evaluation were produced by a relatively "arms-length" international group of experts that solicited and incorporated responses to critiques by others, and the data were made available for analysis by others. This resulted in a program that was judged generally to be good in attaining its objectives, though the evaluations suggested some changes to improve the program that were implemented (e.g., increasing the coverage of the program through upper secondary school). The credibility of the program perhaps was most vividly reflected in that it survived not only the election in 2000 of a new president but also the defeat of the PRI after over six decades.
The program survived with minor modifications and under a new name, Oportunidades, according to informed sources in no small part because it had credibility due to the relatively systematic and transparent evaluation that had been undertaken.

Good empirical analysis of ECD programs is challenging, as noted above. This chapter clarifies this challenge and possible contributions by considering: (1) what are the dimensions of ECD programs, (2) what estimated relations would be informative for improving understanding within a life-cycle behavioral framework with important unobserved variables (e.g. genetic endowments), (3) possible resolutions to estimation problems, and (4) different types of data. Through careful examination of existing data, keeping in mind considerations in this chapter, much can be learned about the impact of ECD interventions conditional on assumptions that are necessary for causal interpretations. But it is also important to be alert to opportunities for improving data and for encouraging collection of new and better data and undertaking new and better analyses.

I. ECD Program Dimensions

A. Types of Interventions

The types of ECD interventions vary considerably in developing countries. Following Engle et al. (2007) and the organization in Table 1, it is useful to consider three broad types and then note variants through health centers, while noting that such categorization is not perfect since some programs overlap across these categories.

Center-based ECD Programs

Eight examples of ECD center-based programs are given in the first panel of Table 1. In these programs young children typically spend considerable periods of time (half days or whole days) in centers that are focused on providing ECD-related services.
These center-based ECD programs can be further divided into (1) medium and large ECD integrated centers in educational, religious, and stand-alone public, private, and NGO institutions and (2) small public or private ECD integrated centers in neighborhood homes. All eight of the ECD center-based evaluations that are summarized in Panel 1 of Table 1 report significant effects on children's cognitive development, either through preschools (Bangladesh, Cape Verde, Colombia, Guinea, Myanmar, Nepal, Vietnam) or treatment centers for malnourished children (Bangladesh). All of these programs primarily have a physical center outside of homes, though the Vietnam program has a home component. There also are a few evaluations of home-based public programs that are not included in the first panel of Table 1: the Bolivian Proyecto Integral de Desarrollo Infantil (PIDI) program (Behrman, Cheng, and Todd 2004), which is evaluated using matching methods (included in the third panel), and the Colombian Hogares Comunitarios program, which is evaluated using instrumental variable techniques (with distance to the program as the instrument) to attempt to identify program impacts (Attanasio and Vera-Hernández 2004). All of these programs that have the necessary data find significantly positive effects on child cognitive skills, in many cases with fairly substantial effect sizes on the order of 0.20 (though information is not available with which to calculate effect sizes in all cases). The evaluations of many of these programs also report non-cognitive gains such as social skills, self-confidence, willingness to talk to adults, and motivation. The subset of these evaluations that followed children into school (Colombia, Myanmar, Nepal) report improvements in the proportion of children entering school, age of entry into school, retention in school, and performance in school.
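The effect sizes quoted above are standardized differences in outcomes between program and comparison children. As a minimal sketch, assuming the common convention of dividing the difference in group means by the pooled standard deviation (the evaluations in Table 1 may use other conventions), the calculation could look like the following. The function name and the test scores are hypothetical and invented purely for illustration; they are not taken from any study cited here.

```python
from math import sqrt
from statistics import mean, variance


def standardized_effect_size(treated, control):
    """Difference in group means divided by the pooled standard deviation."""
    n_t, n_c = len(treated), len(control)
    # Pooled variance weights each group's sample variance by its degrees of freedom.
    pooled_var = ((n_t - 1) * variance(treated) + (n_c - 1) * variance(control)) / (n_t + n_c - 2)
    return (mean(treated) - mean(control)) / sqrt(pooled_var)


# Hypothetical test scores, invented for illustration (not from any study in Table 1).
treated_scores = [52, 55, 49, 58, 61, 54]
control_scores = [50, 53, 47, 56, 59, 52]

effect = standardized_effect_size(treated_scores, control_scores)
print(f"standardized effect size: {effect:.2f}")
```

On this scale, an effect size of 0.20 means that program children score on average one-fifth of a standard deviation higher than comparison children. Such a raw comparison has a causal interpretation only if the two groups are otherwise comparable.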
ECD Education for Parents to Enhance Parenting and Child Stimulation

Five examples of ECD programs directed towards improving parenting and other care-giving are given in the second panel of Table 1 (and at least one of the programs in the first panel, the one in Vietnam, and at least one in the third group, the one in the Philippines, also have this characteristic). Five of these programs used home visiting; the evaluations of all of these programs report positive effects on child development. For example, in Jamaica, parenting practices improved when children and parents were actively involved in a home-visiting program (though not if the parental component was limited to information sharing). Two programs used group sessions with mothers. In Turkey, for example, where mothers practiced skills to play with their children, there were short- and long-term effects on ECD. However, in Bangladesh, where the sessions included providing information but no activities, mothers' knowledge increased, but there was no impact on ECD, leading Engle et al. (2007, p. 233) to suggest that "effective parenting programs should have skill-based activities involving children."

Comprehensive ECD Programs

Six examples of evaluations of what Engle et al. (2007) characterize as "comprehensive" programs because of their efforts at broader multi-dimensional interventions are summarized in the third panel of Table 1 (and the Colombian Hogares Comunitarios program noted in Section I.D also probably could be included in this group). The more recent of these programs are integrated into existing community-based systems and include families more effectively than earlier models. All but the Ugandan ECD program in this group report beneficial effects. Engle et al. (2007) suggest that the Uganda program may illustrate that low-intensity ECD programs that do not direct services toward children may have limited impact on child outcomes.
Health Center Programs with Direct ECD Impacts

There are a number of efforts to improve aspects of ECD through health centers, ranging from vaccination programs for infectious diseases to growth monitoring to nutritional supplementation to informing mothers of good breastfeeding and complementary food practices. A recent example is the component of the Mexican conditional cash transfer program PROGRESA, mentioned in Section C of the Introduction above, directed towards infant nutrition through providing micronutrient supplements as well as maternal training regarding infant and child nutrition. Studies using experimental data with control for non-random program participation and matching methods report a significant and fairly substantial effect on child growth and then on early school performance (Behrman and Hoddinott 2005; Behrman, Parker, and Todd 2006, 2007).

B. Policy Motives and Beneficiaries

The standard motives for using public resources to support ECD, as for other economic activities, relate to efficiency and distribution.

The efficiency motive pertains to differences between the social and the private rates of return to an activity. There may be such a difference, or a distortion, because of externalities or spillovers from one individual to another that are not transferred through markets, a type of "market failure." An example would be the cure of an infectious disease for a child, which benefits not only the child directly affected but also others who might have been exposed to the infectious disease through that child if s/he had not been cured. Or there may be a difference because of policies that result in markets giving incentives for using resources that are different from the social marginal benefit of those resources, or "policy failures."
Examples include policies that place effective ceilings or floors on prices (e.g., minimum wages for child-care workers), overly subsidize goods and services (i.e., beyond what may be appropriate due to market failures), or preclude or discourage entry into an activity (e.g., limiting the provision of child care services, or subsidies for the provision of such services, to public providers). Removing such inefficiencies or distortions has the attractive potential of making everyone better off or of making some people better off without making others worse off, and thus probably increasing welfare. In the real world there appear to be many inefficiencies or distortions due to both market and policy failures. Reducing any one distortion between private and social marginal incentives is likely to improve the productivity and welfare of an economy.⁶

The distributional motive could relate to any aspect of distribution, but usually most emphasis is given in the development literature to the poorest, those in the left tail of distributions of income or wealth. Whatever the distributional motive, the question naturally arises of how much policies benefit the desired beneficiaries, or how successfully they are targeted. In a short-run sense, policies that succeed in benefiting the targeted group (e.g., the poor) more are more successful. In the background there may be a political economy question, however, of how sustainable various policies are. For this reason, for example, it may be desirable in certain political systems that policies aimed at the poor also benefit some of the middle class to assure ongoing political support (if, for example, the median voter is particularly important in assuring sustainability).
Of course, many policies might be possible to improve attainment of either the efficiency or the distributional goals, but the alternatives are likely to have different economic (resource, not budgetary) costs, which should include private costs and distortionary costs, not just public-sector resource costs. To attain a particular efficiency (or, likewise, distributional) goal, therefore, one can conceive of a policy hierarchy in terms of such costs. Generally, policy changes that are more directly addressed to the efficiency (or distributional) situation of concern, and price (rather than quantitative) policies, tend to be higher in such a policy hierarchy, particularly given information imperfections in a rapidly changing world.

6 "Likely" is used here because, as demonstrated in the literature on "the second best," it is possible that one distortion is offset by another so that removing only one of these offsetting distortions reduces overall efficiency and welfare. But in the absence of any explicitly offsetting distortion, removing any one distortion is likely to improve efficiency and welfare.

Some policies are "win-win" in that they may promote attainment of both efficiency and distributional goals. If the distributional goal is to improve options for those living in poverty and those living in poverty are most affected by capital, insurance, and informational market imperfections, for example, policies directed towards improving the functioning of these markets may be "win-win." For other policy choices there may be tradeoffs between the efficiency and distributional goals.
Conceptually, to select among policy alternatives, social benefit-cost ratios or social internal rates of return provide guidance.7 Such estimates are difficult to make for many aspects of ECD programs (as well as other policy areas), in part because of the multiple impacts over the life cycle, the question of how to value some of the impacts such as averted mortality, the difficulties in estimating social effects beyond private effects, and the difficulties in estimating some of the resource costs.8

Many of the issues about assessing the policy rationale for public support for ECD programs relate to possible efficiency and distributional impacts of ECD. There are many claims, for instance, that there are strong positive externalities of ECD, though there is very little systematic evidence to support such claims because such externalities often are challenging to identify empirically.9

These considerations about policy motives relate to substantial lacunae in the literature on evaluation of ECD programs in developing countries to date. While that literature pays some attention to the distributional motive, particularly with regard to the poor, it pays no (or virtually no) attention to the efficiency motive, and little attention to the notion of a policy hierarchy or to benefit-cost ratios, even for distributional targets. Future evaluations of ECD interventions in developing countries would be much more useful if they were sensitive to these concerns about policy motives and incorporated them into their analyses.

7 A third alternative is to maximize the net present value. Criteria based on net present values, cost-benefit ratios, and internal rates of return are not equivalent in all cases. In the case of mutually exclusive alternatives (e.g., which bridge to build across the gorge when there is the possibility of building only one bridge), for example, the advantage of net present value is that it is able to reflect the absolute size of the potential benefits (see Belli et al. 1998 for further discussion). But for most (perhaps all) ECD projects, these approaches are equivalent.

8 The "Copenhagen Consensus," as noted above, attempted to rank a number of possible interventions in ten broad areas using benefit-cost ratios to guide the ranking. Some of the studies underlying the benefit-cost ratios used present illustrations of how sensitive such estimates are to critical assumptions, such as the value of averted mortality and the appropriate discount rate (e.g., Behrman, Alderman, and Hoddinott 2004).

9 An exception for primary schooling, though not for ECD programs, is Foster and Rosenzweig (1995), which estimates positive spillovers on technological adoption decisions of neighbors. There also is a substantial literature that interprets the associations between adult schooling (particularly women's schooling) and human resource outcomes of other family members as reflecting spillovers. But, as noted below in Section III, the few studies that have treated parental schooling as behaviorally determined within such a context report that the causal effects are much different (generally much smaller) than these associations. Also, within models such as in Section II, investments in other household members by parents may be motivated in part by private gains due to altruism or expected reciprocal transfers later in life; to the extent that this is the case, there is not a difference between the private and the social gains from investments in the parents' education.

C. Impact Indicators

Process and Program Implementation Indicators

Process and program implementation indicators include time series of measures such as the extent of participation of children in ECD programs, the inputs used in such programs, and the expenditures made in such programs.
Such measures are likely to be useful in monitoring how programs are developing over time in some important dimensions. However, on their own they do not provide much insight into the impact of the program, to say nothing of whether the benefits relative to the costs merit maintaining the program or where the program might stand in the policy hierarchy, except for the obvious point that the program presumably needs to be functioning reasonably if it is to have an impact.

Short-Term Program Impact Indicators

Most of the available studies of the impact of ECD programs in developing countries focus on fairly short-term indicators of ECD measured while the children are of pre-school age:
· cognitive skills (e.g., Simplified Boehm Basic Concept Test; Receptive Vocabulary, Stanford-Binet Test, Griffiths Mental Development Scales),
· ability (e.g., Raven's Colored Progressive Matrices, Binet-Kamat IQ tests),
· school readiness,
· language and auditory skills,
· biochemical and clinical indicators of ECD (iron deficiencies, iodine deficiencies),
· anthropometry, particularly related to stunting,
· gross and fine motor skills,
· social skills,
· morbidity,
· mortality,
· play observation scale.

Table 2 provides summary information on a wide variety of impact indicators, along with references for most of them. Further information is also provided in Fernald, Raikes, and Dean (2006). Needless to say, the decision on which indicators to use is a complicated one that depends on country circumstances, budget, and the availability of trained personnel. A detailed discussion of how to make this choice is beyond the scope of this paper (and outside the authors' areas of expertise).

Table 2: Indicators of Early Childhood Development

Indicator Name | Type of Indicator | Age Range | Time to Administer (per child) | Can Be Administered to Groups? | Amount of Training Needed

1. Primarily Mental Development or Cognition
Boehm Test of Basic Concepts [g] | School readiness | 5-7 years | | Yes |
Stanford-Binet Test [a, d] | Intelligence test | 2-23 years | 45-90 minutes | Yes(?) | Extensive
Griffiths Mental Development Scales | Mental development | 0-8 years | | |
Raven's Coloured Progressive Matrices (book form) [h] | Test of mental development | 7 years and older | 45-90 minutes | Yes, for up to 8-9 children | 1-2 days
Binet-Kamat Test of Intelligence | General mental ability | 3-22 years | | |
Wechsler Intelligence Scale for Children (WISC) [b, d, g] | Intelligence scales | 5-15 years | 50-75 minutes | No |
Wechsler Preschool and Primary Scale of Intelligence (WPPSI) [c, d, g] | Intelligence scales | 2.5-7 years | 30-60 minutes | No | Extensive
Kaufman Assessment Battery for Children (KABC) [b] | Intelligence and achievement test | 2.5-12.5 years | | | Extensive
Goodenough-Harris Drawing Test (Draw a Man) [d] | Nonverbal ability | 5-17 years | 10-15 minutes | Yes |

2. Primarily Physical Development
Hemoglobin Test [h] | Blood test for iron deficiency | 0-18 years | 5 minutes | No | 1 day
Height-for-Age [e] | Chronic malnutrition | 0-18 years | 5 minutes | Only to a small extent | 2-4 days
Weight-for-Age [e] | Current malnutrition | 0-18 years | 5 minutes | Only to a small extent | 2-4 days
Weight-for-Height [e] | Chronic and current malnutrition | Boys: 0-138 months; Girls: 0-120 months | 10 minutes | Only to a small extent | 2-4 days
Upper-arm circumference [e] | Current nutritional status | 6-60 months | 5 minutes | Only to a small extent | 1 day

3. Motor Development
Bayley Scales of Infant and Toddler Development [g] | Cognition, language, motor, social-emotional | 1-42 months | 30-90 minutes | No | Extensive

4. Social and Emotional Responses and Development and Social Skills
Brazelton Neonatal Behavioral Assessment Scale [f] | Newborn mental and physical development | First 30 days of life | About 1 hour | No |

Sources:
[a] http://www.answers.com/topic/stanford-binet-test?cat=health
[b] http://www.answers.com/topic/intelligence?cat=health
[c] http://harcourtassessment.com/haiweb/Cultures/en-US/dotCom/WPPSI-III.com.htm
[d] http://www.unu.edu/Unupress/unupbooks/80473e/80473E0h.htm#Description%20of%20methods%20in%20published%20studies (summarizes a number of these measures)
[e] http://whqlibdoc.who.int/trs/WHO_TRS_854.pdf
[f] http://jpepsy.oxfordjournals.org/cgi/content/full/25/8/577
[g] http://www.harcourt-uk.com/OccupationFocus.aspx?n=1316&s=1319
[h] http://www.hemocue.com/index.php?page=2999

Because of the perceived importance of the home environment, parenting, and stimulation, measures of these characteristics also are common, even though presumably these are measures of possible facilitators of ECD rather than of ECD itself:
· home environment (e.g., HOME scale),
· maternal knowledge regarding ECD.

Table 1 gives examples of the uses of many of these indicators in developing countries. While some have been used across countries and other contexts, which facilitates comparisons (e.g., anthropometric measures, Griffiths Mental Development Scales, Simplified Boehm Basic Concept Test), in many cases tests have been developed for local contexts, in part due to language differences. There is an obvious tradeoff between using standard measures that are used elsewhere, to facilitate comparability, and using measures that may better capture aspects of local conditions.
Medium- and Long-Term ECD Program Impact Indicators

While some of the short-term impacts mentioned in Section I.C may be of interest in themselves because they are direct indicators of pre-school age child welfare, in many cases they are of interest because they are thought to have medium-term (e.g., during the school years)10 or long-term (e.g., during post-schooling adult life) impacts. A number of direct measures of the medium- and long-term impacts on children as they age have also been used, such as:
· schooling success and cognitive skills (age of school entry, grade repetition, early drop out, school test performance, school enrollment, school attendance, school progression rates, school attainment conditional on age, or completed schooling attainment, cognitive test performance);
· productivity in economic activities (labor force participation, unemployment, occupation, hours worked, wage rates, entrepreneurial activities);
· productivity in non-economic activities (health and nutritional status of family members, fertility control, marital status, migration);
· health, morbidity, mortality and anthropometry;
· risk taking behaviors, particularly those with externalities (smoking, other drug consumption, early and risky sex, crime).

Table 1 (and Appendix Table A.1) give some examples from the literature. Of course, longitudinal data, perhaps for many years, may be necessary to measure these indicators. For analysis of ECD in developing countries, there are a number of studies with medium-term school-related indicators but relatively few with longitudinal data from early childhood substantially into adulthood. Therefore, in any particular impact assessment it may be necessary to piece together evidence from various sources to obtain estimates of the longer-run effects (e.g., Alderman and Behrman 2006, Behrman, Alderman, and Hoddinott 2004).

10 For example, Glewwe and Miguel (2007) discuss the impact of early childhood nutritional indicators on indicators of schooling success.

D. Valuing the Impacts to Obtain Benefits in the Same Metric at the Same Time

To obtain the overall impact of an ECD program (or the "benefits" for a benefit-cost calculation), the various short-, medium-, and long-term impacts that are discussed in Section I.C must somehow be combined. This poses several challenges.

Valuing the Impacts in a Common Metric

To aggregate across the impacts, it is necessary to translate them into some common metric, such as monetary units. Some of the impacts may naturally translate into resources gained or saved through better ECD that can relatively easily be translated into common monetary values. For example, the resources saved through lowered infant morbidity (e.g., less parental time in care, less direct curative health care) can relatively easily be given a monetary value, as can any increase in wages that might result from better ECD. The market prices and wages that people pay or receive indicate the impact on their income of such changes due to better ECD. However, from a social point of view, what is relevant is the true marginal resource values (or shadow prices) of these changes, which may differ from the market prices because, for example, of subsidies to the health system. So the values of the private and the social impacts may well differ.

For some impacts, however, there is no easy direct way to translate them into monetary terms. Perhaps the best example is the question of how to value in resource terms any averted mortality due to better ECD. There are some who argue that the value of lifetime earnings or productivity should be used for this evaluation. But, at least from the point of view of the survivors, this would seem to be an overestimate because a child who does not die because of an ECD intervention not only generates income but also consumes resources over the lifetime.
In fact, some recent estimates for Indonesia by Grimm (2006) suggest that for child mortality the net effects on consumption of other family members are basically zero because the reduction in consumption (what the child would have consumed) is about the same as the reduction in income (what the child would have contributed). An alternative approach is to value averted mortality by the value of resources used for the best (in the sense of the least resource cost) available alternative for averting mortality. Summers (1992, 1994), for example, uses the resource cost of averting child mortality through inoculations to value the impact of increased female schooling on child mortality in Pakistan (also see Behrman, Alderman, and Hoddinott 2004; Knowles and Behrman 2005). Some recent studies for the developed countries estimate the value of extending life as being much greater under the assumption that the marginal utility of non-health consumption within a period such as a year is sharply diminishing and the interperiod substitutability of utility is limited, so that there are large utility gains from extending the periods of life with their initial within-period relatively high marginal utilities of consumption (e.g., Hall and Jones 2007). The bottom line of this discussion is that there is controversy about how best to value averted mortality and that different methods alter considerably the estimated value of averting mortality (some examples are given in Behrman, Alderman, and Hoddinott 2004). Therefore, if one impact of an ECD intervention is to alter mortality risks, it is important to assess how sensitive the estimates of the overall impacts (benefits) are to alternative ways of valuing averted mortality.

For most probable impacts of ECD interventions the uncertainties regarding the valuation of the impacts are not as large as for averting mortality.
But for the impacts whose effects are a relatively large share of the overall benefits, it generally still is desirable to explore how sensitive the estimates are to alternative ways of valuing the impact(s).

Time: Program Age, Child Age, Duration of Exposure and Lags in Impacts

There are a number of respects in which attention needs to be paid to dimensions of time in evaluating impacts of ECD interventions:

First, program implementation is not always coincident with announced schedules, and usually there are start-up costs and learning. If attention is not paid to these dimensions of programs, the result is likely to be an underestimate of the impacts of programs once they are established and functioning.11

Second, the impact of an ECD program is likely to depend on the age of the children exposed to the program, with the literature suggesting generally greater impacts on children past weaning ages but younger than 24-36 months (e.g., Engle et al. 2007).

Third, the impact on children is likely to depend on duration of exposure to the program, with no discernible impact from very brief program exposure and then increasing impact over a range, with eventual diminishing marginal effects (e.g., Armecin et al. 2006; Behrman, Cheng, and Todd 2004).

Fourth, some of the impacts may occur with considerable lags, such as the adult productivity, risk behaviors, and morbidity impacts referred to in Section I.C. In order to compare these impacts with costs or with impacts from other interventions with different time patterns of impacts, it is necessary to translate these impacts into present discounted values (PDVs). The basic point is that an impact of a certain magnitude is more valuable if it is received sooner rather than later because, if it is received sooner, the returns from the intervention can be reinvested. But there is no agreement on what discount rate is appropriate.
Many social sector programs use discount rates of the order of magnitude of five percent, but Belli et al. (1998) in a Handbook on Economic Analysis of Investment Operations state that the World Bank generally uses a discount rate of 10 to 12 percent in its economic analyses.12 The Asian Development Bank (ADB) also recommends the use of a discount rate of 10 to 12 percent for the economic evaluation of its loan projects (Adhikari, Gertler, and Lagman 1999). Some other World Bank references on economic evaluation claim that in most countries the opportunity cost of capital is correctly reflected by the real rate of interest for low-risk securities, which is generally lower (usually less than five percent), and that the real rate of interest on low-risk securities such as World Bank Bonds and United States Treasuries has remained at about three percent for many decades (Barnum 1995; Phillips and Sanghvi 1996). The World Bank and WHO have also used a three percent discount rate in calculating the number of disability-adjusted life years (DALYs) gained from alternative health interventions or in Global Burden of Disease estimates (http://www.who.int/healthinfo/statistics).

11 On the other hand, those involved in the early stages of a program may devote extra efforts to trying to get it established and to make it work that are not sustained in the longer run.

The problem is that, for impacts with considerable lags, the choice of discount rate can make a substantial difference. Table 3 illustrates the impact of discounting and the sensitivity of the PDVs to the discount rate used for impacts that occur with considerable lags. An impact of $1000 in 30 years (e.g., when an ECD intervention might affect adult productivity), for example, has a PDV of $412 with a three percent discount rate, $231 with a five percent discount rate, $57 with a 10 percent discount rate, and $4 with a 20 percent discount rate.
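The 30-year figures quoted above can be reproduced with a short calculation. The sketch below assumes the standard annual-compounding formula, PDV = amount / (1 + r)^t, which is consistent with the values reported in Table 3; the function name and the choice of rates to loop over are illustrative only and are not taken from the paper.

```python
def present_discounted_value(amount, annual_rate, years):
    """PDV of `amount` received `years` from now.

    Assumes annual compounding: amount / (1 + annual_rate) ** years.
    """
    return amount / (1.0 + annual_rate) ** years


# Discount rates discussed in the text (3%, 5%, 10%, 20%), applied to
# an impact of $1000 realized 30 years in the future.
rates = [0.03, 0.05, 0.10, 0.20]
pdv_by_rate = {r: present_discounted_value(1000.0, r, 30) for r in rates}

for rate, pdv in pdv_by_rate.items():
    print(f"discount rate {rate:.0%}: PDV = ${pdv:,.2f}")
```

Looping over several rates in this way is one simple means of reporting how sensitive a PDV estimate is to the discount rate chosen.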
Therefore, it is important that estimates of the impact of ECD programs show the sensitivity of estimates of the PDV of impacts to different discount rates.

Table 3. Present Discounted Value (PDV) of $1000 Gained at Different Years in the Future with Different Discount Rates

Years in Future | Annual Discount Rate: 1% | 2% | 3% | 5% | 10% | 20%
5 | $951.47 | $905.73 | $862.61 | $783.53 | $620.92 | $401.88
10 | $905.29 | $820.35 | $744.09 | $613.91 | $385.54 | $161.51
20 | $819.54 | $672.97 | $553.68 | $376.89 | $148.64 | $26.08
30 | $741.92 | $552.07 | $411.99 | $231.38 | $57.31 | $4.21
40 | $671.65 | $452.89 | $306.56 | $142.05 | $22.09 | $0.68
50 | $608.04 | $371.53 | $228.11 | $87.20 | $8.52 | $0.11
60 | $550.45 | $304.78 | $169.73 | $53.54 | $3.28 | $0.02

Including All the Major Impacts but Avoiding Double-counting

Section I.C indicates a number of possible short-, medium-, and long-run impacts of ECD programs. Of course, it is important to include all the major impacts in order not to underestimate the total impact (benefit) of an ECD program. It should be noted that a key aspect of measuring project impact is to assess the possible displacement of existing services (including privately-provided services) by project-provided services. As noted above, an important and often challenging aspect of estimating the social impacts is to estimate the "spillover" benefits on others' labor productivity, health, or learning.

12 This Handbook explains that this rate is used as a "rationing device" and that it does not necessarily reflect the true opportunity cost of capital in borrowing countries. It invites analysts to use another rate if it can be justified. However, this Handbook cautions that use of a rate lower than 10% might be difficult to justify in light of research suggesting that this is the low end of estimates of the opportunity cost of capital in developing countries.

But on the other hand, analysts need to be careful not to double-count impacts.
For example, one important channel through which an ECD intervention might work to increase economic productivity is through increasing schooling success. To the extent that the interest in schooling is because it increases productivity, it would be misleading, and would overstate the program impact, to add the gain from increased schooling to the gain from increased productivity. Only the part of the gain from increased schooling that is not related to increased productivity should be added to the gain from increased productivity. For this reason it is important to clarify what the ultimate impacts of interest are and what the channels are through which, at least in part, ECD programs may affect those ultimate impacts.

E. Valuing the Resource Costs

As noted in the discussion above about policy motives (Section I.B), evaluations of ECD programs are most useful if they not only have good estimates of impacts and of the aggregate PDV of those impacts (Section I.D) but also have good estimates of the PDV of public and private resources used by the programs. A key feature of economic analysis, in contrast to financial or budgetary analysis, is that the costs and benefits are based on opportunity costs to society, not financial flows. For example, in evaluating the cost of delivering ECD services to children in public ECD facilities, the opportunity cost of inputs such as labor (including volunteer labor), food, books, toys, drugs and medical supplies, buildings (including land), and equipment should be used, instead of the financial costs that may be reflected in governmental or project accounts. Taxes and subsidies should not be included in the resource costs and benefits (even though their values are likely to affect both the project's fiscal impact and its distributional impact).13

The Identification of Project Inputs

ECD projects typically involve several distinct activities.
The first step in estimating project costs is to identify the distinct activities that are included in the project and to identify their inputs, outputs and outcomes. It is critically important to identify all relevant inputs, regardless of whether they are provided by the project and regardless of whether they involve any financial expenditure (inputs not provided by the project will often be missing from the project documents).

Table 4 lists some of the inputs that are frequently used in ECD projects. These inputs are grouped into broad categories: (1) capital inputs (i.e., inputs that are not completely consumed during a given year) and (2) recurrent inputs (all other inputs). It is necessary to separate capital inputs from recurrent costs not only because they should be handled differently in calculations but also because the distinction is an important one in analyzing the issue of sustainability, which is largely a function of recurrent costs.

13 However, activities that are financed by tax revenue may need further adjustments to reflect the distortionary costs of collecting additional taxes or of governmental expenditures, as discussed below.

Project inputs should be limited to the additional inputs required to perform various project activities. For example, in the case of personnel, project inputs should be limited to the additional time required by personnel to perform project-related tasks; therefore, if an ECD-related task is added to the responsibilities of health clinic workers, the time needed to perform that additional task is what should be considered. If the ECD project uses capital inputs (e.g., existing equipment or buildings), the project's inputs should be limited to the portion of the input's total use that is project-related, for example, the space occupied by the project in an existing building or the time during which an existing vehicle is used by the project.
This principle should apply equally to both existing capital inputs and capital inputs purchased by the project.

Table 4. Frequently Used Inputs in ECD Projects

Capital inputs:
· Buildings
· Land
· Vehicles
· Other equipment
· Renovations/major repairs
· Basic training (including training of trainers and the cost of developing training materials)

Recurrent inputs:
· Personnel
· Client time
· Client transportation and related costs
· Materials and supplies (e.g., medicine, food, micronutrients, books, toys, training materials)
· Utilities (water, electricity)
· Telephone/communications
· Petrol
· Maintenance and repair
· Services (for example, legal, accounting)
· Refresher training

Care must be taken to look closely at the implications of "piggybacking" additional activities onto already existing activities. If, in order to be involved in the proposed ECD project, it is necessary to divert time or space from some other activity, the inputs so used should be included in the cost calculations. Only if those resources are otherwise idle or unoccupied and expected to remain so for the life of the project and beyond should they not be considered inputs used by the project.

The principles of cost measurement are the same for integrated ECD projects or ECD project components included in broader (non-ECD) projects, although their application may be more difficult the more complex the project. It is important to consider for such joint programs the possibility that the costs are not simply the sum of the costs of the individual components. It is likely that there will be some economies from sharing costs, particularly capital costs and management overheads. In fact, the perception that there are such economies of scope is one of the major reasons for having integrated programs or projects (the other being possible synergies in effectiveness/benefits). For integrated programs, it may be very difficult to estimate the cost in isolation of any one component.
In such cases, project evaluation may have to focus on the program as a whole rather than on its separate components.

Special attention should be given to identifying the inputs correctly for ECD project activities that are scaled up from initial "pilots." The cost of providing a good or service for ECD is likely to change with the scale of operation. Often the cost per unit of output will initially fall, due to economies of scale or increased rates of utilization, as the level of output increases. On the other hand, costs may rise with considerable expansion because the conditions under which the initial ECD "pilot" operation was conducted may have been more favorable than conditions that more generally prevail, and the dedication and enthusiasm of innovators may be hard to replicate on larger scales. It is important to include as inputs the management and other inputs that pilots may have received from the organizations that supported them. There is often a tendency to focus only on the direct costs of services without considering the various management costs that may be involved. Costs also are likely to rise with extensive expansion if, for example, children who are less well situated (e.g., because of location, family background, innate characteristics) increasingly are served. Since one of the important decisions to make concerning project design may be the choice of scale, it is useful to explore how costs are likely to vary as output increases.

The level of inputs required by an ECD project may decline over time, at least initially, because there is important learning about how to procure, organize, distribute and manage the inputs and outcomes and how to produce the desired outcomes more effectively. These learning effects are conceptually distinct from scale effects, and it is useful to keep them distinct in undertaking analysis. Learning can occur, for example, even if there is no change in scale.
Since learning may be important for many projects, it is useful to explore how the needed inputs are likely to vary as the result of such learning (especially if sensitivity analysis indicates that alternative assumptions in this regard have a significant effect on the conclusions).

Costing the Inputs for ECD Projects

Once the ECD project's inputs have been correctly identified, the next step is to assign an economic cost to each input. The economic cost of an input is sometimes referred to as its "opportunity cost," i.e., the value of the input in its next best use. As previously indicated, the opportunity cost of an input is not necessarily the same as its financial cost (for example, the public expenditure cost of the input, or what the project expects to pay for the input). There are at least two important ways in which opportunity costs differ from financial costs:

First, the prices actually paid for inputs may not correctly reflect their opportunity cost. An important example is inputs that are contributed by individuals or by the community (for example, volunteer labor). Although the project does not have to pay for these inputs (or may pay less than market prices for some of them), they do have an economic cost. In all such cases, the full market value of contributed inputs should be estimated and included in the project costs. Governmental personnel are often paid a salary that is considerably lower than the market wage, even when the market value of all allowances and benefits is included. In such cases, the economic cost of the personnel is not what they are actually paid, but is rather the full market value of the time they spend performing their jobs. The use of some project inputs may impose external costs, for example, the effect of project vehicles in generating additional air pollution and congestion. Ideally, the cost of project inputs should reflect any such externalities.
In the case of imported inputs, the local price paid may not reflect the opportunity cost of the input (for example, if the exchange rate is significantly over- or under-valued). However, this type of distortion is less important now than it was in past decades.14 Lastly, all taxes and subsidies should be removed from the prices paid when costing project inputs (for example, subsidies for petrol or electricity). Second, some project expenditures or financial outlays do not represent real resource costs of the project but are instead transfers that represent changes in the command over existing resources. The distinction between economic costs and transfers is one of the least-well understood aspects of economic project evaluation. It is particularly important in the context of ECD projects because many ECD projects include in-kind benefits. The case of cash transfers generally is easiest. If for example the government taxes adult workers 100 pesos and gives those pesos to mothers of small children, the cash transfer generally does not represent a project cost. The cash transfer represents only a change in the command over existing resources (e.g., from tax payers to mothers of small children). The actual costs related to such a cash transfer include the cost of administering the cash transfer program and related distortionary costs. The latter would include any reduction in resources available to the economy from any effect the cash transfer may have on work effort on the part of mothers receiving the cash transfers (for example, by reducing their job search effort or by discouraging them from accepting a relatively low-paying job) and from the additional cost of collecting the governmental revenue needed to finance both the cash transfers and the program's administrative costs (this includes both the administrative costs of collecting the additional tax revenue and the distortionary costs that result from increases in the collection of most types of taxes). 
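The distinction between the financial outlay and the economic cost of a cash transfer can be sketched numerically. The example below is only an illustration under stated assumptions: the class name, field names, and all figures are hypothetical, and the cost of raising revenue is simplified to a single constant rate per unit of revenue (covering both the administrative and the distortionary costs of taxation), which the paper does not specify.

```python
from dataclasses import dataclass


@dataclass
class CashTransferProgram:
    """Hypothetical inputs for one period; all names are illustrative."""
    transfers_paid: float             # cash handed to recipients (a pure transfer)
    admin_cost: float                 # resources used to run the program
    lost_output_from_work_disincentives: float  # value of reduced work effort
    revenue_cost_rate: float          # assumed cost per unit of tax revenue raised


def financial_cost(p: CashTransferProgram) -> float:
    # Budgetary outlay: what appears in government or project accounts.
    return p.transfers_paid + p.admin_cost


def economic_cost(p: CashTransferProgram) -> float:
    # The transfer itself is excluded: it only shifts command over
    # existing resources. Revenue must be raised to finance both the
    # transfers and the administrative costs, and raising it is costly.
    revenue_needed = p.transfers_paid + p.admin_cost
    cost_of_raising_revenue = p.revenue_cost_rate * revenue_needed
    return (p.admin_cost
            + p.lost_output_from_work_disincentives
            + cost_of_raising_revenue)


# Purely illustrative numbers, not estimates from any study.
program = CashTransferProgram(
    transfers_paid=100.0,
    admin_cost=10.0,
    lost_output_from_work_disincentives=5.0,
    revenue_cost_rate=0.2,
)
print("financial cost:", financial_cost(program))
print("economic cost:", economic_cost(program))
```

With these made-up inputs the economic cost is well below the financial outlay, because the 100 units transferred are not a resource cost; with a high enough cost of raising revenue or strong work disincentives, the gap would narrow.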
Conditional cash transfers (i.e., transfers given to an individual who fulfills one or more conditions, such as enrolling in and regularly attending a pre-school program) are also generally primarily or exclusively transfers.15 In this case, project costs include (in addition to those discussed above for any cash grant) the additional cost of the activity that is encouraged by the condition attached to the transfer. For example, in the case of a conditional cash transfer provided to young children who enroll in and attend a pre-school program, these costs include the cost of additional pre-school inputs required to accommodate any increased enrollment (including any additional costs incurred to offset any negative impact on children previously attending the program, i.e., the costs of "crowding"),16 increases in household out-of-pocket expenditure on pre-school-related items (for example, uniforms, books, transportation, but not fees to pre-schools, as that would be double-counting), and the opportunity cost of the additional time that grant recipients and their families spend in pre-school-related activities, including travel and volunteering for ECD-related activities.

14 A decade ago, for example, in this regard Devarajan, Squire, and Suthiwart-Narueput (1997) stated: "Reforms of trade policy and exchange rate systems have reduced the distortions of most concern. In these circumstances, paying only modest attention to shadow prices may be a sensible allocation of the time of the economists."

15 Some qualification is made here because conditional transfers, like in-kind transfers, may have real resource costs if, as summarized at the end of this section, (1) they lead to a significant input into the activity being supported and (2) the project results in a net increase in the utilization of this input. For example, consider scholarships (transfers conditional on attending school) for poor, malnourished young children to attend ECD programs. Empirical studies suggest that the increased monetary income provided by such scholarships will lead to increased nutrient consumption by the children (among other effects) that will increase their learning (e.g., Behrman, Alderman, and Hoddinott 2004, Maluccio et al. 2007). To the extent that this occurs, part of the conditional transfer is not a pure transfer but a resource cost.

The distinction between project costs and in-kind transfers is a bit more subtle. The first relevant question is whether the item or items transferred in connection with the project are a significant (i.e., quantitatively important) input into the project-supported activity. Consider, for example, the case where food is given to a family in lieu of a cash grant in the above example. If the food is not a significant input into the educational activity that the project is attempting to encourage (for example, if the children in a pre-school program are already adequately nourished), the value of the food should be regarded as a transfer, and benefits and costs are the same as the case of a conditional cash transfer. If the food is an important input into the educational activity (for example, food provided to malnourished children whose malnourishment precludes them from learning to their potential), some or all of the food provided through the program might be regarded as a cost of the program. To be considered a cost of the project, the food provided through the program has to increase the total daily nutrient intake of the children participating in the program.
If it simply substitutes for food previously provided to the same children by their households, it would not represent an increase in the level of this input into the education of school children and the value of the food provided should in this case be treated as a transfer. Even if the feeding program does not result in any increase in children's nutrient intake, the project might still yield benefits by encouraging children to attend the ECD program. In this case, however, benefits and costs would be similar to those with a conditional cash grant, as discussed above.

Summarizing, the key questions in determining whether a given project expenditure is a project cost or an in-kind transfer are: (1) is the item a significant input into the activity supported by the project? (2) does the project result in a net increase in the utilization of this input in the project-supported activity, or does it simply affect how an unchanged level of the input is financed?

16 Even if additional resources have been used in an effort to offset any negative effects of an intervention on others (non-beneficiaries), the estimation of project impacts should still include the possibility that some negative (or possibly positive) effects have been experienced by others. It is conventional to include the monetary value of any negative external effects as part of a project's costs, while the monetary value of any positive external effects is included among the project's benefits.

Other Project Costs

In addition to the costs of direct ECD project inputs, ECD projects often entail costs borne by the private sector. For ECD programs, the most important private cost is likely to be the opportunity cost of time of the mother or other caregiver, often in terms of transporting children to programs and in some cases in participating directly in programs (whether they are paid or provide volunteer labor) or related program activities (e.g., parental oversight committees).
Another important private cost is likely to be the distortionary cost of raising the necessary additional tax revenue to finance project-related governmental expenditure (including transfers) (Devarajan, Squire, and Suthiwart-Narueput 1997). In addition to the cost of collecting additional revenue, all taxes other than a lump sum tax lead to efforts to avoid the payment of the tax that involve real resource costs. Society has less product because of such distortions. These distortionary costs may be considerable. For example, it has been estimated that the distortionary cost (often called the "deadweight loss") of raising a dollar of tax revenue in the United States ranges from $0.17 to $0.56, depending on the type of tax used (e.g., Ballard, Shoven, and Whalley 1985; Feldstein 1995). Estimates for some other countries range from $0.18 to $0.85, depending on the tax (van der Gaag and Tan 1997). Harberger (1997) suggests using a shadow price of $1.20-1.25 for all fiscal flows on a project. The existence of substantial distortionary costs related to the collection of taxes provides one important justification for careful consideration of cost recovery possibilities in projects (Hammer 1996).17

In addition to distortionary costs related to the collection of additional tax revenue, some project activities may be responsible for modifying other types of private behavior that result in real resource costs. For example, conditional cash grants provided to a family to encourage enrollment in pre-school programs may affect the work effort of other family members.

Treatment of Some Specific Cost Items

To clarify the treatment of some key components of project costs, this section briefly discusses some specific cost items:

Start-up Costs (including costs of technical assistance): Start-up costs for new ECD interventions involve real resource costs, so they should be included in cost calculations.
They also by definition occur early in the project life, so they are not discounted to the same extent as recurrent costs and capital service costs that are distributed over time.

Monitoring, Evaluation, and Analysis Costs: These are important activities for any ECD project, particularly new projects. Good projects should include such activities. Of course, these activities have resource costs in terms of personnel and other recurrent expenses (e.g., materials and supplies) and capital services. But within the range of resource uses for monitoring, evaluation, and analysis that usually are considered, it would appear that the expected gains in terms of program modification or possible abandonment exceed the real resource costs.

17 Another important reason for using user fees that will result in some cost recovery is to encourage more efficient use of the goods and services provided by making the private incentives for use closer to the social marginal resource costs of the program (see Section 2.2).

Capital Costs: Many capital costs will typically involve investments made by the project. In this case, it is important to consider whether the capital input is consumed completely during the period in which costs and benefits are analyzed. For example, a building financed by a project will probably have a useful life beyond, say, a 10-year period used to evaluate the project's benefits and costs, whereas project-purchased vehicles will probably not. The easiest way to handle the former case is to subtract the present value of the estimated residual value of the capital item at the end of the project from its initial cost. Some projects may use all or part of an existing capital item as a project input (for example, space in an existing religious institution or NGO or household for center-based ECD programs such as those in the first panel of Table 1). In such cases, a capital cost should be imputed for the input.
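The residual-value adjustment for capital items that outlast the evaluation period can be sketched as follows. The function names, the 5 percent discount rate, and the cost figures are hypothetical, and the sketch assumes that the initial cost is incurred at the start of the project and that the residual value is realized at the end of the evaluation horizon.

```python
def present_value(amount, years_ahead, discount_rate):
    """Present value today of an amount received years_ahead from now."""
    return amount / (1.0 + discount_rate) ** years_ahead


def capital_cost_net_of_residual(initial_cost, residual_value,
                                 horizon_years, discount_rate):
    """Initial cost minus the present value of what the item is still
    worth when the evaluation period ends (assumed timing: cost at year 0,
    residual value at year horizon_years)."""
    return initial_cost - present_value(residual_value, horizon_years,
                                        discount_rate)


# Hypothetical 10-year evaluation at a 5 percent real discount rate.
# A building that keeps some value at the end of the horizon:
building_cost = capital_cost_net_of_residual(100_000, 40_000, 10, 0.05)
# A vehicle assumed to be fully used up within the horizon:
vehicle_cost = capital_cost_net_of_residual(20_000, 0, 10, 0.05)
print(round(building_cost, 2), vehicle_cost)
```

Because the residual value is discounted before it is subtracted, the charge to the project is larger than the simple difference between purchase price and residual value whenever the discount rate is positive.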
Capital costs include changes in the market value of the item during the year, such as may occur from the item's depreciation due to use, and the opportunity cost of the capital invested in the item during the year.18 Maintenance costs may reduce actual depreciation costs, so it is necessary to avoid double counting these two items. Although it is conventional to regard capital costs as a "fixed" cost, some capital costs increase with output levels (for example, the depreciation of vehicles used to provide outreach services). If estimates of building costs are included, it is important if possible to treat building costs differently from land costs. Buildings depreciate (unless maintained impeccably), but land does not. The appropriate capital cost for land is an estimate of its market rental value (with the estimated cost of the building removed). If such an estimate is unavailable, an estimate of rental cost can be obtained by multiplying the approximate market value of the land by the real interest rate (i.e., the opportunity cost of the capital tied up in the land).

Training Costs: Training of trainers, such as those for child-care workers, is a capital cost, as is the cost of developing training materials. The cost of periodic retraining is a recurrent cost. This distinction is most important in the context of sustainability analysis.

Time Matters: As for benefits, the timing of costs matters. Typically the costs tend to be more concentrated early in an ECD project than the benefits due to initial capital and start-up costs. But typically there also are significant ongoing recurrent costs, so the PDV of costs should be used, which again raises the question of how sensitive the estimates are to the discount rate used.

18 Capital cost estimates often include an estimate of depreciation but not of the opportunity cost of capital.

II.
Framework for Estimating Impacts of ECD Programs

To illustrate more concretely the general issues involved, consider the following more formal stylized model of the possible impact of ECD programs on children over their life cycles. Part of what is of interest regarding the impact of ECD programs is whether they increase the resources that children will have as adults: will they make them more productive, healthier, or less likely to engage in risky behaviors? A major pathway through which ECD programs might have such an effect is through affecting the children's intellectual functioning when they become adults. While there are other pathways that may be important as well, the basic points can be illustrated more simply by focusing on this particular pathway and on the impact of adult intellectual functioning on the resources that the child will have when s/he becomes an adult.

When a child becomes an adult (indicated by a subscript a), s/he will have resources for her/his use ($Y_a$) that depend primarily on her/his income-generating capacities, the income-generating capacities of her/his spouse (if any) and other family members, and sharing rules for determining the distribution of resources within the household, all embedded within a specific market, kin, public services, and social network context.19 These resources will depend basically on that individual's capabilities ($K_a$), including intellectual and physical functioning; that individual's physical and financial assets ($A_a$); that individual's preferences regarding matters such as use of time and desires to have children ($P_a$); that individual's endowments ($E_0$: given factors such as genetic abilities and innate health, gender, ethnicity, race, tribe, where the subscript 0 indicates that these are given factors) that may affect the nature of local labor income earnings and other resource options; that individual's bargaining power for intrahousehold allocations ($B_a$); and local community, market, and other contextual factors ($C_a$); as well as on stochastic terms ($U_a$) for chance events:

$$Y_a = Y(K_a, A_a, P_a, E_0, B_a, C_a, U_a). \tag{1}$$

Relation (1) is written as a general functional form, which includes the possibility of interactions among the arguments (e.g. differential returns to capabilities depending on gender and on markets) and other nonlinearities of the included variables (e.g. diminishing marginal returns to various capabilities). All of the variables in relation (1) in general are vectors with multiple components (e.g. as noted, capabilities are likely to include intellectual and physical dimensions as well as interpersonal skills).

19 It may be desirable for some purposes to utilize a more-disaggregated representation of these resource sources, such as a labor income earnings function, a return to assets function, and a sharing rule for household resources.

The impacts of ECD programs on the resources for use by this individual are through (a) affecting the human and physical assets that the individual has as an adult by altering the nature of his/her early life experiences and thereby investments in this individual in previous life-cycle stages and (b) changing the options that the individual has as an adult for investments in his/her children. To estimate the impacts of type (a), however, is a challenge. Identifying the causal effects of early life ECD interventions on the capabilities of adult children (as well as on other variables in relation (1)) is likely to be difficult because such investments are made within a life-cycle framework in the presence of unobservables (such as ability and health endowments) in previous life-cycle stages and because of limitations in most available data (e.g. limited representation of capabilities, data generally not available from conception to adulthood). To illustrate, consider adult intellectual functioning ($K_{ia}$).
The standard assumption is that adult intellectual functioning depends importantly on schooling, and the interest in this chapter is in evaluating ECD programs prior to schooling, so consider three life-cycle stages:20

Life-Cycle Stage 1: pre-schooling (from conception through to about age five or six), during which children may be exposed directly to ECD programs and those programs may have important relatively short-run impacts.

Life-Cycle Stage 2: schooling ages, during which children no longer are directly exposed to ECD programs but there may be important medium-term impacts of exposure in the pre-school stage.

Life-Cycle Stage 3: adulthood, during which the individuals no longer are directly exposed to ECD programs (though their children may be) but there may be important long-term impacts of exposure in the pre-school stage, perhaps with effects through pathways in the previous life-cycle stage, in particular those related to education.

Adult intellectual functioning ($K_{ia}$), then, can be considered to be determined by a production function in which the inputs are all previous experiences ($E_i$, i = 1, 2, 3, for the three life-cycle stages defined above; note that the subscript for life-cycle stage 3 is equivalent to the subscript a used in relation (1)); genetic (and other) unobserved endowments ($E_0$); and stochastic terms ($U_{3i}$) to reflect all other idiosyncratic, and assumed exogenous, learning experiences:

$$K_{ia} = K^p(E_1, E_2, E_3, E_0, U_{3i}), \tag{2}$$

where the first subscript for the right-side variables refers to the life-cycle stage, the second subscript if present refers to intellectual capabilities (i), and the right-side superscript p refers to the function being a production function. There may be important interactions and nonlinearities in this production function (and in other relevant production functions).
For example, individuals with better pre-school nutrition may learn more from their school-age experiences (so that the cross-derivative of relation (2) with respect to the first two variables is positive). This production function also may reflect that some processes are not likely to be reversible at reasonable costs. For example, nutrition early in the life cycle may establish basic patterns of neural development and of other aspects of development, and it may be quite costly or impossible to offset these later in life (e.g. Barker 1992; Engle et al. 2007), which implies that $E_2$ and $E_3$ can only substitute imperfectly and to a limited extent for some components of $E_1$.

20 The exact delineation of these life-cycle stages in terms of ages, of course, varies across contexts, with schooling, for example, tending to be of less duration in areas of greater poverty. The major transitions to adulthood also vary considerably in their timing (e.g. NRC/IOM 2005). For any particular study, moreover, it may be desirable to consider other life-cycle stages. The use of these three life-cycle stages here, nevertheless, serves to make the basic points relevant for this chapter.

If one had good estimates of relation (2) and of parallel relations for the other right-side variables that enter into relation (1) and of relation (1) itself, then one could trace well the pathways from the effects of ECD programs experienced during the pre-school life-cycle stage through the school-age life-cycle stage on the resources available for use by this individual as an adult and thus the extent of this aspect of the impact of ECD interventions for this individual.
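As a hedged illustration of what tracing these pathways involves, suppose, purely for exposition, that all variables are scalars, that the functions in relations (1) and (2) are differentiable, and that an ECD program variable c (one component of the stage-1 community characteristics) affects adult resources only through adult intellectual functioning. The symbol c and the scalar simplification are assumptions introduced here, not part of the model as stated. Under those assumptions the chain rule gives:

```latex
\frac{dY_a}{dc}
  = \frac{\partial Y}{\partial K_{ia}}
    \left(
        \frac{\partial K^p}{\partial E_1}\,\frac{dE_1}{dc}
      + \frac{\partial K^p}{\partial E_2}\,\frac{dE_2}{dc}
      + \frac{\partial K^p}{\partial E_3}\,\frac{dE_3}{dc}
    \right)
```

Each term $dE_j/dc$ is the total response of the stage-j experience to the program, including any behavioral responses by the family. The complementarity example above corresponds to $\partial^2 K^p / \partial E_1 \partial E_2 > 0$, so that the marginal product of pre-school experience is larger when school-age experience is higher.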
Estimation of relations such as (1) and (2), however, is challenging because at least in some cases the indicators of the right-side variables in relation (1) (the dependent variables in relations such as (2)) are quite imperfect and because the experiences for the three life-cycle stages on the right side of these relations all reflect previous behavioral choices. For the latter reason, for example, ordinary least squares (OLS) estimates of relation (2) are likely to be inconsistent due to the endogeneity of the life-cycle stage experiences. To motivate the assumptions underlying the exploration of how ECD programs may affect the three life-cycle stage experiences and to elucidate some of the estimation issues (e.g., the possible impact of the endowments on estimates that do not control for them), assume a very stylized model in which the "dynasty" (first the parents through intrahousehold bargaining between themselves and perhaps other relatives, then the children themselves increasingly as they age into youth though with intrahousehold bargaining with their parents and other relatives, and into adulthood usually with a spouse that involves further bargaining) makes decisions so as to maximize a welfare function W that includes Ya for each individual. 
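To see why OLS applied to relation (2) can be inconsistent, consider a minimal simulation sketch. Everything in it is an illustrative assumption rather than part of this paper's model: a linear, one-input version of relation (2), a single unobserved endowment that raises both the pre-school experience and adult intellectual functioning, and the particular parameter values.

```python
import random


def ols_slope(x, y):
    """Slope from a one-regressor OLS fit of y on x (with intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    return cov_xy / var_x


def residuals(x, y):
    """Residuals of y after an OLS fit on x (with intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    b = ols_slope(x, y)
    return [(yi - mean_y) - b * (xi - mean_x) for xi, yi in zip(x, y)]


def simulate(n=20000, beta=1.0, gamma=1.0, seed=0):
    """Return (naive, controlled) estimates of beta, the true effect of E1."""
    rng = random.Random(seed)
    # Unobserved endowment (hypothetical stand-in for E0).
    e0 = [rng.gauss(0, 1) for _ in range(n)]
    # Pre-school experience chosen by the family; it rises with the endowment.
    e1 = [a + rng.gauss(0, 1) for a in e0]
    # Adult intellectual functioning depends on both E1 and the endowment.
    k = [beta * b + gamma * a + rng.gauss(0, 1) for a, b in zip(e0, e1)]
    # Naive OLS of K on E1 alone: the endowment is left in the error term.
    naive = ols_slope(e1, k)
    # Benchmark that controls for the endowment by partialling it out of
    # both variables (infeasible in practice, since E0 is unobserved).
    controlled = ols_slope(residuals(e0, e1), residuals(e0, k))
    return naive, controlled


naive, controlled = simulate()
print(round(naive, 2), round(controlled, 2))
```

With these hypothetical parameters the probability limit of the naive slope is beta + gamma * Var(E0) / Var(E1) = 1.5 rather than the true value of 1.0, and more data do not remove the gap. The controlled regression recovers the true value only because the simulation can observe the endowment, which real data generally cannot.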
This welfare function is maximized sequentially subject to the constraints at each life-cycle stage related to relevant current and expected production functions, resources allocated to this individual, community characteristics including community services and markets (among which are ECD-related options) that affect household decisions, and stochastic factors:

Life-Cycle Stage 1 (pre-schooling): The parents (perhaps implicitly) bargain between themselves (and possibly with others, such as the grandparents) to decide how to allocate resources to obtain the optimal $E_1$ for the child, given the child endowments, nutrients and other inputs into the $E_1$ production function that are allocated by the parents, the current community-determined options (e.g. availability of ECD programs), expected future community characteristics (e.g. expected schooling options in life-cycle stage 2, expected labor market options in life-cycle stage 3), and the expected relation between $E_1$ and $Y_a$ (via capabilities and the other right-side variables in relation (1)). The $E_1$ production function is:

$$E_1 = E_1^p(N_1, C_1^p, E_0, E_0^f, U_{1E}), \tag{3}$$

where $N_1$ is a vector of family-determined inputs into the production of $E_1$ (e.g. family-provided nutrients), $C_1^p$ is a vector of community inputs into the production of $E_1$ (e.g. community-provided ECD programs, community disease environment, community learning environment), $E_0$ is the child endowment that directly enters into the production of $E_1$ (e.g. innate robustness), $E_0^f$ is parental endowments that directly affect early childhood development (e.g. innate ability in raising children), and $U_{1E}$ is a stochastic disturbance term that directly affects the production of $E_1$ (e.g. random fluctuations in the infectious disease environment).
The parents choose the inputs into this production function $N_1$ (and perhaps some components of $C_1^p$ that reflect parental choice, which may include, for example, the child's participation in ECD programs), and therefore the expected value of $E_1$, in order to maximize the expected welfare W given: a vector of parental family characteristics such as parental schooling, parental preferences such as for child quality versus quantity or work versus leisure, and parental assets, in which the ownership of resources may matter because it may affect intrahousehold bargaining ($F_1$); all relevant community characteristics for this life-cycle stage $C_1$ (which includes the community characteristics, such as the availability of ECD programs, that directly affect the production of $E_1$ through $C_1^p$, but also other community characteristics that affect the household through other channels); all of the child endowments $E_0$; all the stochastic terms that affect outcomes in the first life-cycle stage of the child $U_1$ (which includes $U_{1E}$ but also other stochastic factors that affect the family during the first life-cycle stage for this child since, for example, stochastic factors affecting the health of other siblings may affect the inputs devoted to this child); plus the expected values of these variables in the next two life-cycle stages ($F_{12}^e$, $F_{13}^e$, $C_{12}^e$, $C_{13}^e$, $U_{12}^e$, $U_{13}^e$, where the first subscript refers to the life-cycle stage at which the expectations are held, the second subscript refers to the stage for which the expectations are held, and the superscript e refers to expectations), because the optimal decision for investing in $E_1$ to maximize W depends in part on expectations regarding these variables over the next two life-cycle stages:

$$N_1 = N_1^d(F_1, C_1, E_0, E_0^f, U_1, F_{12}^e, F_{13}^e, C_{12}^e, C_{13}^e, U_{12}^e, U_{13}^e) \tag{4a}$$

and

$$E_1 = E_1^d(F_1, C_1, E_0, E_0^f, U_1, F_{12}^e, F_{13}^e, C_{12}^e, C_{13}^e, U_{12}^e, U_{13}^e), \tag{4b}$$

where the superscript d refers to reduced-form demand relations.
As noted, $E_1$ is a vector with a number of different components that are measured by indicators such as are given in Section I.C (short-term). Good estimates of relation (4b), with ECD interventions among the components of the vector for $C_1$, would be informative about the impact of the ECD interventions on these relatively short-run (at least from a life-cycle perspective) outcomes of interest. Although the availability of ECD options from the point of view of parents may be exogenous (though there still may be some estimation issues depending on what determines the placement of ECD programs, as emphasized in the more general discussion of endogenous program placement in Rosenzweig and Wolpin 1986), if child participation in an ECD program is a parental choice then that participation is not on the right side of relations (4a) and (4b), but there is another relation parallel to (4a) that gives the reduced-form demand relation for the child's participation in ECD programs.

Life-Cycle Stage 2 (school-age): The dynasty (initially the parents but increasingly the child) decides on the components of $E_2$ (such as schooling attainment21) of the child/youth conditional on (a) the outcome of stage 1, $E_1$, that is assumed to summarize all the family and community factors that determine pre-school investments (including ECD programs in life-cycle stage 1),22 (b) life-cycle stage 2 family, community, and stochastic factors, and (c) the expected values of those factors for life-cycle stage 3:

$$E_2 = E_2^c(E_1, E_0, E_0^f, F_2, C_2, F_{23}^e, C_{23}^e, U_2, U_{23}^e), \tag{5}$$

where the superscript c refers to the conditional demand function. Relation (4b) can be used to substitute for the life-cycle stage 1 experience $E_1$ in relation (5) to obtain the reduced-form demand relation for $E_2$:

$$E_2 = E_2^d(F_1, C_1, E_0, E_0^f, F_{12}^e, F_{13}^e, C_{12}^e, C_{13}^e, F_2, C_2, F_{23}^e, C_{23}^e, U_1, U_2, U_{12}^e, U_{13}^e, U_{23}^e). \tag{6}$$

As noted, $E_2$ is a vector with a number of different components that are measured by medium-term indicators such as are given in Section I.C. Good estimates of relation (6), with ECD interventions among the components of the vector for $C_1$, would be informative about the impact of the ECD interventions (but not the parental responses to those interventions) on these medium-term (at least from a life-cycle perspective) outcomes of interest.

Life-Cycle Stage 3 (adulthood): The dynasty (primarily the post-school youth/young adult but perhaps with some input from the parents and in part in interaction with a spouse and the spouse's family) decides on the post-schooling experience $E_3$ of the individual conditional on (a) the outcome of stage 1, $E_1$, that is assumed to be a sufficient statistic for the family and community factors (including ECD programs) that determine pre-school investments, (b) the outcome of stage 2, $E_2$, that is assumed to be a sufficient statistic for the family and community factors (including indirect effects of ECD programs through $E_1$) that determine schooling and other elements of $E_2$,23 and (c) life-cycle stage 3 family, community (including ECD options for his/her children that may affect his/her welfare as an adult), and stochastic factors:

$$E_3 = E_3^c(E_1, E_2, E_0, E_0^f, F_3, C_3, U_3). \tag{7}$$

21 There are a number of other important transitions during this life-cycle stage that also condition options in adulthood considerably. Leading examples include transitions into work, into sexual activity, into marriages or other forms of unions, into parenthood, and away from the parental household and perhaps the parental community.
22 This is not a necessary assumption for estimating the adult capabilities production functions as in relation (2), but it is consistent with the exclusion of at least some of the first life-cycle stage determinants from directly appearing in relation (5) so that the impact of $E_1$ in that relation can be identified.

23 Again, (a) and (b) are not necessary assumptions for estimating the adult cognitive achievement production functions in relation (2) but are consistent with the exclusion of at least some of the first and second life-cycle stage determinants from directly appearing in relation (7) so that the impacts of $E_1$ and $E_2$ in that relation can be identified.

Relation (4b) can be used to substitute for the life-cycle stage 1 experience $E_1$ and relation (6) can be used to substitute for the life-cycle stage 2 experience $E_2$ in relation (7) to obtain the reduced-form demand relation for $E_3$:

$$E_3 = E_3^d(F_1, C_1, E_0, E_0^f, F_{12}^e, F_{13}^e, C_{12}^e, C_{13}^e, F_2, C_2, F_{23}^e, C_{23}^e, F_3, C_3, U_1, U_2, U_3, U_{12}^e, U_{13}^e, U_{23}^e). \tag{8}$$

Reduced-form Relations for Child's Adult Resource Access (and other adult variables): Through the sequential life-cycle stage processes the adult capabilities in relation (2), and the other right-side variables in relations parallel to (2) for each of them, are determined as well.
This implies, of course, that the critical (for this chapter) adult access to resources (or other indicators of possible long-run impacts of ECD such as indicated in Section I.C) can also be written as a reduced-form demand relation (by substituting relations [4b], [6], and [8] into relations such as [2] and then substituting those into relation [1]) as:

$$Y_a = Y_a^d(F_1, C_1, E_0, E_0^f, F_{12}^e, F_{13}^e, C_{12}^e, C_{13}^e, F_2, C_2, F_{23}^e, C_{23}^e, F_3, C_3, U_1, U_2, U_3, U_{12}^e, U_{13}^e, U_{23}^e). \tag{9}$$

Good estimates of the coefficients of the ECD program variables (either affecting the individual in life-cycle stage 1 and therefore in $C_1$ or affecting the options for the individual's children and therefore in $C_3$) in relation (9) permit ascertaining the direct causal impacts of ECD program variables on the child's adult resource access and other relevant adult outcomes.24

Good estimates of relation (9) permit answering a number of important questions about the impact of ECD interventions on adult outcomes. Most directly, how important are the effects on adult outcomes of interest? But also, to what extent do parental family characteristics or other community services substitute for, or complement, the impact of ECD in the pre-school life-cycle stage? And are there important differences in all of these relations by ethnicity or other demographic characteristics? By income or poverty level?

While good estimates of relation (9) are valuable in assessing the impact of ECD interventions, they are not the only estimates that would be illuminating regarding the impact of ECD interventions. Indeed, good estimates of any of the relations in this section (and of parallel relations for other pathways) would be illuminating for aspects of the impact of ECD interventions. For example: Just how important are various components of ECD programs in determining schooling?
Just how important are intellectual capabilities (or schooling, one input into intellectual capabilities) in the determination of adult resource access? Are intellectual capabilities more or less important than physical capabilities? Does the importance of such factors depend on individual characteristics such as gender or on community characteristics such as the nature of labor or capital markets?

24 Note again that by the ECD program variables are meant the exogenous-to-the-parents ECD program options, not variables that reflect earlier parental decisions such as child participation in ECD programs.

Summary of Implications of this Section for Estimation of the Impact of ECD Programs: The set of relations discussed in this section has a number of implications for the estimation of the impact of ECD programs that relate to the discussion above.

First, they imply that obtaining the total impact of ECD interventions over the life cycle directly requires following individuals exposed to ECD programs when they are young for many decades into mature adulthood. Such data are rare, though they do exist in some cases (e.g., Behrman et al. 2006 and Maluccio et al. 2007 use data on adults 25-42 years of age as well as data on the same individuals when they were 0-7 years of age).

Second, they point to a strategy for obtaining estimates of the long-run impacts even in the absence of longitudinal data over many decades if estimates can be made of the various linkages across life-cycle stages.
For example, if an ECD intervention affects preschool cognitive skills and stunting and these cognitive skills and stunting, through affecting schooling attainment and adult cognitive skills, affect adult outcomes of interest such as earnings, then it might be possible to link estimates of the short-run impact of ECD program availability on early life cognitive skills and stunting with separate estimates of the impacts of schooling attainment and adult cognitive skills on the outcomes of interest. An illustration of such a strategy is provided in the study of the Bolivian PIDI ECD program by Behrman, Cheng, and Todd (2004) in which study estimates of ECD program impact on short-run child outcomes based on pre-school data are linked to estimates from other studies about the medium-term impacts on schooling and the long-run impacts on adult earnings to obtain the PDV of long-run benefits (albeit with a number of strong assumptions). Third, the set of relations point to the problem of possible double counting (Section I.D). For example, relation (6) may indicate that an ECD program has impact on schooling attainment, and relations (8) and (9) may indicate that ECD has impact on adult earnings at least in part through affecting schooling attainment. To get the total impact of ECD on adult outcomes, therefore, it would be wrong to sum the effect on schooling attainment plus the effect on earnings (plus other possible effects). What should be included is the effect on earnings plus any effects on schooling attainment after netting out the part of schooling attainment that relates to the effect of schooling attainment on earnings. Fourth, though the emphasis typically is on the effect of ECD programs on the children who are exposed directly to the ECD program, there also are likely to be effects on their parents (basically through the C3 component that refers to ECD programs when the individual is an adult with children in relations [9] and [10]). 
These impacts and costs associated with them have to be incorporated into the analysis to obtain the full impacts of ECD programs, though many studies do not incorporate such effects. Fifth, these relations point to the fact that impact evaluations of ECD programs are context specific because the nature of community services, markets, culture, and other factors enter into each life-cycle stage and may affect interactively the impact of ECD programs. In different contexts, other (non-ECD) programs and family investments may compensate differentially for limitations in pre-school child development (e.g., the special education programs that are common in the developed world are not common in poor areas of developing countries). Of course, the extent to which the currently available estimates are generalizable is in important part an empirical question. But there are very few available systematic studies of ECD programs in developing countries (Table 1). Only by undertaking more empirical studies can we learn how generalizable to other locations the currently available estimates are. Sixth, making inferences about the impact of ECD programs requires good estimates of relations such as those discussed in this section. But obtaining good estimates may be a challenge for reasons to which the next section turns.

III. Estimation Issues and Possible Resolutions

Data limitations, no matter how good the data, lead to possible estimation problems.
In all of the right-side relations in Section II there are vectors of variables, and a number of the components of those vectors are likely to be unobserved or poorly measured.25 For the production function relations and the conditional demand relations, moreover, some of the right-side variables are determined endogenously within the life-cycle framework.26 Indeed, if the empirical measure of exposure to an ECD intervention used in a study reflects behavioral choices of parents, caregivers, or program personnel rather than just the exogenous-to-the-parents ECD program availability, what are referred to as "reduced-form" demand relations parallel to those in Section II really are conditional reduced-form relations with the conditionality being on the behavioral choice that determined the exposure to the ECD program. In such a case the ECD program experience is endogenous.

25 Even if the life-cycle experiences are treated in the estimation as behaviorally-determined, if the true specification in relation (1) includes all the variables indicated above and (2) includes all three life-cycle experiences but a specification is used that excludes one or more of the relevant variables (e.g., only schooling is included), omitted variable bias is likely to result. This is likely to be the case because on the right side of each of the three reduced-form demand relations for the three life-cycle stage experiences (relations [4b], [6], and [8]) are the endowments and the actual or expected values of the family, community, and stochastic factors for all three life-cycle stages, which means that the three life-cycle experiences are likely to be correlated, and thus the right-side variables in relation (1) are also correlated. Of course, this is hardly surprising. A priori, a child with better parental family background or who lives in a better community in terms of health and educational services and job options is likely not only to have more schooling but also better pre- and post-schooling experiences. Behrman et al. (2006) provide estimates of adult knowledge production functions for rural Guatemala of the form of relation (1) and report that pre-schooling experiences are quite important but that schooling appears much more important than it is if pre- (and post-) schooling experiences are dropped from the specification.

26 Direct estimates of relations such as (1) and (2) without controlling for the behavioral determinants of the three life-cycle experiences are likely to be biased because (as indicated in the reduced-form demand relations [4b], [6], and [8]) each of the three life-cycle experiences depends on all the endowments. These biases could be in either direction. For instance, the "ability bias" on which the schooling literature has focused is consistent with E2 (schooling) being correlated positively with both E0 with the result that the coefficient of schooling is likely to be upward-biased in OLS estimates of relations (2) and (3). On the other hand, if the summary measure of pre-school experience is some variable such as child stunting, and if ability and physical endowments are negatively correlated as suggested by Behrman and Rosenzweig (2004) and Behrman et al. (2006), then OLS estimates of relations (1) and/or (2) may lead to biases towards zero in the coefficient estimate for this variable.

As a result of these estimation issues (unobserved variables, measurement errors, and endogeneity), the disturbance terms in the relations to be estimated are likely to include not only the stochastic terms (U's) but also components that are correlated with the right-side variables in the relations.
For example, the disturbance term in relation (9) is likely to include unobserved parental abilities, parental innate health, parental preferences, and family connections; unobserved individual abilities and innate health; and unobserved community characteristics such as the disease environment that may be related to program placement. These unobserved characteristics are likely to be correlated with the observed ones; for instance, if the ECD measure used reflects parental choices (e.g., enrollment in ECD programs), it is likely to be correlated with parental and child innate abilities, preferences, and family connections. Indeed, there may be such correlations even if the measure for ECD programs used is just the availability and the quality of local ECD services given endogenous program placement decisions. As a result, the OLS estimation of relations such as (9) is likely to lead to biased estimates of the key parameters of interest because in the estimation, for example, early-life ECD program experiences or ECD program placement proxies in part for correlated unobserved child and parental abilities, preferences, or family connections. Better data always help deal with such problems. Section IV addresses different types of data that may be used for the investigation of the ECD interventions, and the better the data, the less likely there will be such problems. But for given data, there exist standard methodologies for dealing at least in part with these problems. 
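The direction of the bias described above can be made concrete with the standard omitted-variable formula. The following is a simplified sketch rather than the paper's own derivation: it assumes a linear version of relation (9) with a single ECD measure $C$ and a single unobserved factor $A$ (for example, parental ability). The coefficients $\alpha$, $\beta$, and $\gamma$ are notation introduced here for illustration only.

```latex
% Assumed simplified model (illustrative notation, not from the paper):
% A is unobserved; U is uncorrelated with both C and A.
\begin{align*}
Y &= \alpha + \beta C + \gamma A + U, \\
\operatorname{plim}\, \hat{\beta}_{OLS}
  &= \frac{\operatorname{Cov}(C, Y)}{\operatorname{Var}(C)}
   = \beta + \gamma \, \frac{\operatorname{Cov}(C, A)}{\operatorname{Var}(C)} .
\end{align*}
```

Under these assumptions, if more able parents are more likely to enroll their children ($\operatorname{Cov}(C, A) > 0$) and ability raises the outcome ($\gamma > 0$), OLS overstates $\beta$. If instead programs are placed where unobserved conditions are worse, the covariance is negative and the bias runs the other way.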
Some examples of these problems and how estimation methodologies may help follow:

Sample Selection

Selection may take many forms: only having data on test scores and other developmental measures in early childhood for those participating in ECD programs, only having information on health status or on health impacts of an intervention for those children who attend health clinics, only having data on the impact of an early childhood program for those who survived infancy and earlier childhood, only having data on those who do not attrite in longitudinal data, only having data on those who stay in the respective control and treatment groups in experimental data. The general problem is that those who are selected are not likely to be a random subsample of the relevant population. A general solution is to model the selection rule and to use it to correct for selection in the estimates, such as in the well-known Heckman (1974, 1979) two-step procedure or other methods such as maximum likelihood estimates.

Because sample attrition is a major concern for longitudinal data (a major type of data that is discussed in Section IV), some elaboration on this type of selection is provided here (similar points hold for other types of selection). Sample attrition has the potential to invalidate inferences that can be drawn from longitudinal data if the attrition is non-random with respect to the behavior being studied. Consider the following canonical selection model:

$$L_t^{*} = b_2 + b_3 X_t + b_4 Z_t + U_t^{*} \qquad (10)$$

and

$$Y_t = b_0 + b_1 X_t + U_t^{**} \quad (Y_t \text{ observed only if } L_t^{*} < 0). \qquad (11)$$

Relation (11) is the model of interest (e.g. a simplification of relation [9]). The outcome variable $Y_t$ is observed only for a subset of the entire sample, those for whom the latent index variable $L_t^{*}$ is less than zero. Relation (10) is a selection function depending (possibly) on the same independent variables in (11) as well as on additional factors.
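A minimal simulation sketch of relations (10) and (11) may help show what is at stake when the two disturbance terms are correlated. All parameter values (the coefficients, the correlation `rho`, the sample size) are illustrative assumptions, not estimates from any study, and the helper names are hypothetical.

```python
import random


def ols_slope(x, y):
    """Slope from a bivariate OLS regression of y on x (with intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    return sxy / sxx


def simulate_selection(n=20000, rho=0.8, seed=0):
    """Simulate relations (10)-(11); every parameter value is an assumption."""
    rng = random.Random(seed)
    b0, b1 = 0.0, 1.0            # relation (11): outcome equation
    b2, b3, b4 = 0.0, 1.0, 0.5   # relation (10): selection equation
    x_all, y_all, x_obs, y_obs = [], [], [], []
    for _ in range(n):
        x = rng.gauss(0, 1)
        z = rng.gauss(0, 1)
        u_sel = rng.gauss(0, 1)  # U*_t
        # U**_t, constructed to have correlation rho with U*_t
        u_out = rho * u_sel + (1 - rho ** 2) ** 0.5 * rng.gauss(0, 1)
        latent = b2 + b3 * x + b4 * z + u_sel  # L*_t
        y = b0 + b1 * x + u_out
        x_all.append(x)
        y_all.append(y)
        if latent < 0:           # Y_t is observed only if L*_t < 0
            x_obs.append(x)
            y_obs.append(y)
    return ols_slope(x_all, y_all), ols_slope(x_obs, y_obs), len(x_obs)


slope_full, slope_selected, n_selected = simulate_selection()
print(f"true b1:               1.000")
print(f"full-sample slope:     {slope_full:.3f}")
print(f"selected-sample slope: {slope_selected:.3f} (n = {n_selected})")
```

In this setup the slope estimated on the selected subsample falls well below the true value of `b1`, while the full-sample slope stays close to it. Rerunning with `rho=0` should largely remove the gap, which is the sense in which selection on observables alone need not bias the estimates.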
In practice, it is known only whether an observation is observed or not, i.e. $L_t = 1$ ($L_t^{*} < 0$) if observed and $L_t = 0$ ($L_t^{*} \geq 0$) if not. If the error terms $U_t^{**}$ and $U_t^{*}$ are correlated, estimation of (11) on the observed sample, ignoring (10), may lead to inconsistent parameter estimates and thus incorrect inferences. Often attrition appears to be selective in the sense that mean values differ between those who attrite and those who do not (e.g. with respect to schooling attainment in the baseline). However, what is of concern is not the level of attrition or such mean differences but whether, and to what extent, attrition (or other forms of selection) invalidates the inferences that can be made for the broader population using the data from the subsample. It is desirable to attempt to address sample attrition, even if such efforts must be limited to considering attrition on observable variables. Some options include: (1) Test with baseline data whether the coefficients in multivariate relations differ significantly for those who subsequently attrite and those who do not. Simple tests using data from both developing and developed countries often find no evidence of significant differences even if mean characteristics do differ significantly (e.g. Alderman et al. 2001 for Bolivia, Kenya, and South Africa; Moffitt 1998 for developed countries); (2) Include in the specification of relation (11) all the plausible covariates, some of which may be associated with attrition.
Conditional on the maintained assumptions about the functional form, attrition selection on observed right-side variables does not lead to attrition bias (Fitzgerald, Gottschalk, and Moffitt 1998a, b); (3) Implement correction procedures for attrition on observed variables that might relate to attrition even if they are not directly in the model, such as interviewer characteristics and whether other family members remain in the original sample unit (Fitzgerald, Gottschalk, and Moffitt 1998a, b); (4) Explore the bounds on the estimates given extreme assumptions on the key variables among those who attrited (e.g. Lee 2002). Recent studies for developing countries find that most key results are not influenced by sample attrition on observed variables (Behrman et al. 2006, Maluccio et al. 2007). Given the potential importance of attrition in confounding the results, nevertheless, it is desirable for studies of the impact of ECD interventions to test to the extent possible for attrition biases and, in new data collection, to try to limit the extent of attrition as much as possible (the Indonesian Family Life Survey, available on the web, provides an excellent model).

Instrumental Variable (IV) or Two-Stage Least Squares (2SLS) Estimates

One basic problem noted above in relations in Section II in which ECD program interventions are among the right-side variables is that the exposure of a particular child to the ECD intervention may reflect behavioral choices of parents, other caregivers, program administrators, or policy implementers who decide on program placement and program quality. As a result, the empirical representation of the ECD intervention is likely to be correlated with various unobserved variables, and the coefficient estimate for the ECD measure is likely to include not only the effect of the ECD program but also the correlated impact of these unobservables.
To break the correlation between the observed right-side variables and the compound disturbance terms that include unobserved determinants in addition to stochastic terms in estimates of the impact of ECD programs over the life cycle, one estimation strategy is to use instrumental variables (IV) or two-stage least squares (2SLS). In IV estimates, the endogenous right-side variables are replaced by their predicted values that depend on "instruments" that do not appear directly in the relation of interest. Good instruments must (1) predict well the variable being instrumented and (2) not be correlated with the disturbance term in the second-stage relation of basic interest. The model should be suggestive of the set of potential instruments.27 Note that potential good instruments include experiments28 and so-called "natural experiments" in the form of natural events and policy changes.29 The IV (or 2SLS) procedure basically consists of making first-stage estimates in which endogenous right-side variables in the relation of interest are regressed on the instrument set and then making second-stage estimates of the relation of interest that uses the predicted values of the endogenous right-side variables instead of the actual values.30 If the instruments are good in the two senses defined above, the predicted values of the right-side endogenous variables represent well the variation in the right-side variable (the first characteristic of good instruments) but are not correlated with the disturbance term in the second stage (the second characteristic of good instruments). The second-stage estimates then are good estimates of the local average treatment effects of the first-stage instruments. Good IV estimates, thus, can eliminate problems due to omitted (unobserved) variables, endogeneity, and random measurement error.

27 The three reduced-form demand relations for the three life-cycle stage experiences in relations (4b), (6) and (8), for example, give the potential instruments to be used to identify the three life-cycle experiences in the adult intellectual capabilities production function in relation (2). Note that on the right-side of each of these three reduced-form demand relations are the same endowments and the actual or expected values of the family, community, and stochastic factors for all three life-cycle stages. That means that, though there may be instruments that seem a priori to have first-order effects on particular life-cycle experiences (e.g. pre-school programs or nutrition on E1, school characteristics on E2, labor market characteristics on E3), it would not be correct to assert a priori that a particular instrument identifies a particular life-cycle experience. Instead, there is a potential set of instruments that hopefully identifies the set of life-cycle experiences. This also means that it would not be a test of the plausibility of the instruments to see if subsequent life-cycle stage family or community variables are significant (e.g. if schooling characteristics or post-schooling labor market characteristics significantly determine pre-school experience E1) because the expected value of those variables should be included. Instead it would be a test of to what extent expectations are rational in the sense that the expected values for subsequent stages are equal to the realized values.

28 For example, receiving the Mexican PROGRESA treatment with random assignment by rural communities as examined in Behrman and Hoddinott (2005); Behrman, Sengupta, and Todd (2005); Schultz (2004); the random assignment by communities of different nutritional supplements in the INCAP Guatemalan data as examined in Maluccio et al. (2007) and Martorell et al. (2005); the random assignment of treatment of worms among Kenyan school children as examined in Miguel and Kremer (2004); the random assignment of flip charts among Kenyan schools as examined in Glewwe et al. (2004).

29 For example, natural events (e.g. weather fluctuations that occurred when the individual was a child that are used to identify schooling and health impacts on access to resources in Indonesia by Maccini and Yang [2006]), policy changes (e.g. the Indonesian school-building program investigated by Duflo [2001]), or other behaviors that changed exposure to the programs of interest but that were not motivated by such programs (e.g., changing school quality due to migration of Jews from Ethiopia to Israel as used by Gould, Lavy, and Paserman [2004]), though there is some debate about to what extent some of these events are independent of the first-stage compound disturbance term.

Finding good instruments, however, is often not easy. Not all of the potential instruments that are suggested by the model structure, for example, are likely to be independent of the second-stage disturbance term. For the estimation of the adult intellectual capabilities production function in relation (2), for example, the reduced-form relations (4b), (6), and (8) suggest that family background characteristics are potential instruments. But if unobserved genetic ability endowments affect adult intellectual capabilities as posited in relation (2), if unobserved parental ability endowments affect their schooling attainment and income and if there are significant correlations between parental and child ability endowments, then parental schooling attainment and income may not satisfy the second condition for good instruments (and indeed do not in recent estimates of such a relation for Guatemala in Behrman et al. [2006]).
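The two-stage procedure described above can be sketched on simulated data. This is a toy illustration under stated assumptions, not a reproduction of any study cited here: the instrument is a randomly assigned program-availability dummy, "ability" is an unobserved endowment that raises both ECD exposure and the outcome, and every coefficient and function name is hypothetical.

```python
import random


def ols(x, y):
    """Return (intercept, slope) from a bivariate OLS regression of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return my - slope * mx, slope


def simulate_2sls(n=20000, true_effect=2.0, seed=1):
    """Compare OLS and 2SLS when exposure is correlated with unobserved ability."""
    rng = random.Random(seed)
    z, d, y = [], [], []
    for _ in range(n):
        ability = rng.gauss(0, 1)                  # unobserved endowment
        z_i = 1.0 if rng.random() < 0.5 else 0.0   # instrument: random availability
        # Endogenous exposure: depends on the instrument AND on ability
        d_i = 1.0 * z_i + 0.5 * ability + rng.gauss(0, 0.5)
        # Outcome: ability enters the disturbance term
        y_i = true_effect * d_i + ability + rng.gauss(0, 1)
        z.append(z_i)
        d.append(d_i)
        y.append(y_i)

    _, beta_ols = ols(d, y)                        # ignores endogeneity

    a1, p1 = ols(z, d)                             # first stage: exposure on instrument
    d_hat = [a1 + p1 * z_i for z_i in z]           # predicted exposure
    _, beta_2sls = ols(d_hat, y)                   # second stage: outcome on prediction
    return beta_ols, beta_2sls


beta_ols, beta_2sls = simulate_2sls()
print(f"true effect: 2.000  OLS: {beta_ols:.3f}  2SLS: {beta_2sls:.3f}")
```

In this setup OLS is biased upward because exposure proxies in part for ability, while the 2SLS coefficient stays close to the assumed true effect. One caveat on the design: running the second stage by hand gives the right coefficient but not valid standard errors, so dedicated IV routines are preferable for inference.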
It may also be difficult to find instruments that predict sufficiently well the second-stage right-side variables.31

30 Fixed effects estimates to control for fixed unobserved factors, such as are discussed below, are sometimes used together with IV estimates.

31 The econometric literature has been evolving recently in the development of diagnostic tests for good instruments (e.g. Stock and Yogo 2002 on the use of the Cragg-Donald statistic for the extent of bias due to "weak instruments" that do not satisfy the first condition for good instruments as well as would be desired). Recent standard software packages (e.g. ivreg2 in Stata 9) provide fairly up-to-date diagnostics for IV estimates (e.g. Cragg-Donald statistics for weak instruments for the first condition for good instruments, Hansen J overidentification statistics for the second condition for good instruments).

Fixed Effects (FE) Estimates

Some of the unobserved variables that are likely to cause problems if they are not controlled in the estimates may be fixed across observations in the data. From a longitudinal perspective (i.e. fixed over time) these include variables such as individual and parental genetic ability and innate health endowments and some aspects of community culture and environment. From a cross-sectional perspective (i.e. fixed across observations in some group such as members of the same family or the same community) these include the family and community environments and endowments shared by siblings and other members of the same family, the pre-school environment shared by children in the same pre-school program and the community environment shared by residents of the same community. Such factors that are fixed across observations can be controlled so that they do not bias estimates of observed variables through using dummy variables for each group of observations for which the control is desired (i.e.
individuals or families over time, siblings or community members at a point of time). Such methods have been used extensively to investigate aspects of the framework in Section II (e.g. adult sister sibling estimates to control for shared childhood background among adult sisters in the estimation of the impact of mother's schooling attainment on child health, nutrition, and schooling in Nicaragua in Behrman and Wolfe [1984], [1987a, b]; individual fixed effects to control for unobserved malnutrition that determined which children received nutritional supplements in the Mexican PROGRESA program in Behrman and Hoddinott [2005] or which children were admitted to pre-school programs in the Bolivian pre-school PIDI program in Behrman, Cheng, and Todd [2004]). They have the advantage of controlling for unobserved fixed characteristics that otherwise might bias the estimates and numerous studies suggest that controlling for fixed effects changes the estimates substantially.32 FE estimates, of course, have limitations. First, they do not control for unobserved varying characteristics (e.g. time-varying prices in longitudinal estimates that may affect endogenous behaviors), for which reason in some studies they are combined with IV estimates (e.g., the investigation of the impact of nutrition on labor allocation in Bangladesh in Pitt, Rosenzweig, and Hassan [1990] and in Pakistan in Behrman, Foster, and Rosenzweig [1997]). Second, they tend to increase the importance of noise relative to the signal, which tends to cause a bias towards zero. For this reason, FE-IV estimates have been used in some studies (e.g. using other respondents' reports for schooling attainment in the United States in Ashenfelter and Krueger [1994] and Behrman, Rosenzweig, and Taubman [1994]). 
Third, they do not permit estimates of the first-order impact of observed fixed variables but only of variables that vary across the observations for which the fixed effects are used (though these may include interactions between fixed variables and variables that vary across the observations for which the former variables are fixed). Therefore, for example, family FE do not permit estimating the impact of parental schooling on child ECD program exposure unless parental schooling varies over the time period in which women are having young children (as in the Rosenzweig and Wolpin [1995] study of the impact of young mothers' schooling on early childhood development in the United States).

32 For example, Behrman and Rosenzweig (2002, 2005) present a dramatic example regarding intergenerational schooling effects for the United States. Controlling for fixed characteristics including genetic endowments at conception between adult identical twins changes the estimated impact of maternal schooling on child schooling from significantly positive in OLS estimates to negative in FE estimates, apparently because, controlling for endowments such as innate abilities, women in that society who receive more schooling tend to spend more time in the labor market and less time caring for their children (there are not parallel changes in the estimated impact of paternal schooling, which is consistent with fathers not changing their time spent in child care much if they have more schooling). Other recent studies for European countries also report that OLS estimates of intergenerational schooling effects may be quite misleading. Plug (2004) uses data on adoptees to lessen problems of intergenerationally-correlated endowments and Black, Devereux, and Salvanes (2005) use instruments based on changes in mandatory schooling.

Propensity Score Matching (PSM) Estimates

Recently there has been increasing development and use by economists (e.g.
Heckman, Ichimura, and Todd [1998]) of propensity score matching methods that were developed originally in the statistical literature (e.g. Rosenbaum and Rubin 1983). These methods have been developed primarily in the context of the program evaluation literature. They are used to try to find the best comparison for someone exposed to the program ("treatment") among those not treated in terms of pre-program observed variables. The procedure is (1) to estimate a logit for whether one was exposed to treatment or not as a function of predetermined variables (i.e. variables not affected by the treatment), (2) to use the estimates to predict the latent propensity for treatment for everyone, and (3) to compare each treated individual with an individual or group of individuals who were not treated but who are very similar in terms of the predicted latent propensity for being treated. This permits comparisons between very similar individuals who have received and who have not received treatment, where similarity is defined in terms of the weighted average of observed characteristics used to predict the propensity to be treated.33 An increasing number of studies have used these methods to estimate program impacts in developing countries in ways that are consistent with the general life-cycle framework presented in Section II (e.g. the impact of early childhood development programs in Bolivia in Behrman, Cheng, and Todd [2004]; in Mexico in Behrman, Parker, and Todd [2006, 2007]; and, in the Philippines, in Armecin et al. [2006] and Ghuman et al. [2006a]). Recent standard statistical programs include matching estimators (nnmatch in Stata 9).
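The three-step procedure above can be sketched as follows. This is a deliberately small illustration on simulated data, not the estimator used in the studies cited: it assumes two observed covariates that drive both treatment and outcome, fits the logit by plain gradient ascent, and uses one-to-one nearest-neighbor matching with replacement. All function names and parameter values are hypothetical.

```python
import bisect
import math
import random


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def fit_logit(rows, treated, steps=200, lr=1.0):
    """Step (1): fit a logit by gradient ascent on the mean log-likelihood."""
    n, k = len(rows), len(rows[0]) + 1   # +1 for the intercept
    coef = [0.0] * k
    for _ in range(steps):
        grad = [0.0] * k
        for xs, t in zip(rows, treated):
            feats = [1.0] + xs
            resid = t - sigmoid(sum(c * f for c, f in zip(coef, feats)))
            for j in range(k):
                grad[j] += resid * feats[j]
        coef = [c + lr * g / n for c, g in zip(coef, grad)]
    return coef


def propensity(coef, xs):
    """Step (2): predicted propensity for treatment."""
    return sigmoid(sum(c * f for c, f in zip(coef, [1.0] + xs)))


def att_by_matching(pscore, treated, outcome):
    """Step (3): match each treated unit to the nearest untreated unit on the score."""
    controls = sorted((p, y) for p, t, y in zip(pscore, treated, outcome) if t == 0)
    control_p = [p for p, _ in controls]
    diffs = []
    for p, t, y in zip(pscore, treated, outcome):
        if t == 1:
            i = bisect.bisect_left(control_p, p)
            # Nearest neighbor is on one side of the insertion point
            candidates = [j for j in (i - 1, i) if 0 <= j < len(controls)]
            j = min(candidates, key=lambda c: abs(control_p[c] - p))
            diffs.append(y - controls[j][1])
    return sum(diffs) / len(diffs)


def simulate(n=2000, true_effect=1.0, seed=2):
    """Simulated data: covariates raise both treatment probability and outcome."""
    rng = random.Random(seed)
    rows, treated, outcome = [], [], []
    for _ in range(n):
        x1, x2 = rng.gauss(0, 1), rng.gauss(0, 1)
        t = 1 if rng.random() < sigmoid(0.8 * x1 + 0.8 * x2) else 0
        y = true_effect * t + 1.5 * x1 + 1.0 * x2 + rng.gauss(0, 1)
        rows.append([x1, x2])
        treated.append(t)
        outcome.append(y)
    return rows, treated, outcome


rows, treated, outcome = simulate()
coef = fit_logit(rows, treated)
pscore = [propensity(coef, xs) for xs in rows]
att_matched = att_by_matching(pscore, treated, outcome)

n_t = sum(treated)
naive = (sum(y for y, t in zip(outcome, treated) if t == 1) / n_t
         - sum(y for y, t in zip(outcome, treated) if t == 0) / (len(treated) - n_t))
print(f"true effect: 1.000  naive difference: {naive:.3f}  matched ATT: {att_matched:.3f}")
```

Here the naive treated-minus-untreated difference is far from the assumed true effect, while the matched estimate is much closer. The sketch works only because selection into treatment depends solely on observed covariates; it offers no protection against unobserved determinants of treatment.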
PSM estimates have two main limitations. First, it is not clear what should be the set of variables on which the matching is done, beyond that these variables should not be affected by the treatment or expectations of treatment and that they should lead to overlapping propensities for treatment ("common support") for the treated and the control groups, and the results at times seem sensitive to the exact choices that are made. Second, they do not control for unobserved variables (except when combined with FE, in which case they still do not control for time-varying unobserved variables).

33 Unobserved fixed factors, such as those discussed above, are also controlled in some matching estimates (e.g. the study of the impact of pre-school programs on early childhood development in Bolivia in Behrman, Cheng, and Todd [2004] and in the Philippines in Ghuman et al. [2006a]).

Construction of Standard Errors

Most household sample surveys with information relevant for assessing ECD programs collect data from clusters (e.g. census tracts, villages, neighborhoods) - or perhaps samples within clusters - because the fixed costs of data collection in a locale mean that a cluster design is much cheaper than would be, for example, a random sample of households in the overall population. The cluster design means that there are likely to be correlations across observations in the stochastic terms that, if not accounted for in the estimation of standard errors, might bias test statistics towards inferring greater significance to the results than is warranted. Estimation strategies that utilize within-family estimates may be further subject to this problem. Moulton (1990), for example, notes, "[i]t is reasonable to expect that units sharing an observable characteristic ... also share unobservable characteristics that would lead the regression disturbances to be correlated." These correlations, if positive, may cause the estimated standard errors to be biased downwards.
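The size of this downward bias can be approximated with the variance inflation factor commonly associated with Moulton (1990). The expression below is a sketch for the special case of equal-sized clusters. The symbols $m$ (observations per cluster), $\rho_x$ (intracluster correlation of the regressor), and $\rho_u$ (intracluster correlation of the disturbance) are notation introduced here, and the numbers in the worked example are purely illustrative.

```latex
\[
\frac{\operatorname{Var}_{\text{true}}(\hat{b})}{\operatorname{Var}_{\text{OLS}}(\hat{b})}
\;\approx\; 1 + (m - 1)\,\rho_x\,\rho_u
\]
% Worked example (illustrative numbers): m = 25 households per cluster,
% rho_x = 1 (a regressor that is constant within clusters, such as
% community-level program availability), rho_u = 0.05:
%   1 + 24 * 1 * 0.05 = 2.2,
% so conventional standard errors would be too small by a factor of
% about sqrt(2.2), roughly 1.48.
```

Even a modest intracluster correlation in the disturbances therefore matters a great deal when the regressor of interest, such as program availability, varies only at the cluster level.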
Therefore it is important to assess the sensitivity of the results to the construction of the standard errors. The starting point is to test for heteroscedasticity and correct, where appropriate, standard errors using established methods (e.g. Huber 1967; White 1980) that are readily available in standard estimation software. Most standard estimation software also has options to control for clustering among siblings or among members of the same sample cluster. Recent studies by Angrist and Lavy (2002) and Wooldridge (2003), however, suggest that these corrections for clustering are valid only when the number of units or groups or clusters of observations is large, say on the order of magnitude of 70 or greater. For many data sources this does not pose a problem, but for some it may because, for example, the data are from a relatively small number of communities (as in many of the studies in Table 1, though most of these studies do not correct the standard errors for such effects). In such cases alternative standard error estimators can be constructed as indicated in Bertrand, Duflo, and Mullainathan (2004) by block bootstrapping the t statistics. Another approach is to aggregate all covariates up to their group means and carry out estimation on the average data (Wooldridge 2003) at the cost of a considerable loss in degrees of freedom as the sample size drops from the number of households to the number of clusters. Explorations of such alternatives in a recent study using 16 birth-year cohorts from four villages in Guatemala to estimate the impact of early-life nutritional interventions on education over the life cycle suggest that at least in this case these methods do not change substantially the inferences from the estimates (Maluccio et al. 2007).

IV. Strengths and Limitations of Analysis of Various Types of Data

The previous two sections point to considerable challenges in undertaking empirical estimates of causal relations pertaining to the impact of ECD interventions. Better data lessen such challenges. The ideal would be representative panel data with substantial detail updated frequently on every member of the family over several generations, substantial detail on the context (markets, public services, environment, kin and social networks) also updated frequently over the same time period, and a series of experimental and quasi-experimental shocks over the same time period that would permit identification of the short- and long-run causal effects. Such data are not available for any society, and the data that are available generally tend to be less satisfactory (though not always) for developing than for developed economies. While it would always be desirable to obtain better data, it is also desirable to gain as much understanding as possible from existing data. Most data permit at least some examination of how robust the estimates of the impact of ECD interventions are to some major assumptions regarding possible data limitations. This section considers various types of data and related analytical techniques in turn and how they can be informative about particular causal mechanisms related to ECD programs in developing countries, with some references to studies using data sets of various types in case the readers want to obtain further information about the details of the data or how they have been used.

A. Some Major Characteristics Pertaining to Data Quality

Before turning to different major types of data, it is useful to note five critical aspects of data quality that are common at a general level across different data options:

Representativeness

How representative are the data for the population of interest?
Can inferences be made for some population of interest beyond the sample, perhaps through weighting the observations appropriately? Some potentially very interesting data, such as individual and family histories (e.g. Watkin's [2004] use of journals kept by four individuals on HIV/AIDS in Malawi), pre-school- or clinic-based data and much (though not all) qualitative data may raise interesting questions and conjectures for more systematic study but be difficult to interpret with regard to their implications for broader populations.

Power, Sample Size, and Sample Design

Power refers to whether the sample is large enough to identify the effect of interest at a given significance level. Power calculations indicate how large the sample size needs to be to identify such an effect with a specified degree of confidence (e.g. at the 5% level); standard software packages such as Stata can facilitate power calculations (e.g. Behrman and Todd 1999b). For example, suppose that the question of interest is whether spending the third year of life in a particular comprehensive ECD program increases adult children's access to resources by at least 3% at the 5% significance level. The sample size in terms of households necessary to have any particular level of statistical power, of course, varies depending on what question is being asked. For instance, a larger number of households is required the more fine-tuned the question is with respect to demographic groups, so many more households will be needed to investigate the possibility of a given impact with given significance between ECD programs and cognitive skills among three-year-old girls than to investigate the possibility of the same percentage impact with the same significance between ECD programs and schooling attendance for all 6-12 year-old children (even with correction of the standard errors for clustering at the family level).
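A back-of-the-envelope version of such a power calculation can be sketched in a few lines. The function below is an illustrative approximation, not the procedure used in the studies cited: it assumes a two-sided comparison of means between equal-sized treatment and comparison groups, uses the normal approximation, and optionally inflates the sample by a design effect for clustered samples. The function name, its parameters, and the example effect size are all hypothetical.

```python
import math
from statistics import NormalDist


def households_per_arm(effect, sd, alpha=0.05, power=0.80, cluster_size=1, icc=0.0):
    """Approximate sample size per arm for detecting a difference in means.

    Normal approximation:
        n = 2 * (z_{1 - alpha/2} + z_{power})^2 * (sd / effect)^2
    inflated by the design effect 1 + (cluster_size - 1) * icc, where icc is
    the assumed intracluster correlation of the outcome.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_power = NormalDist().inv_cdf(power)
    n_simple = 2 * (z_alpha + z_power) ** 2 * (sd / effect) ** 2
    design_effect = 1 + (cluster_size - 1) * icc
    return math.ceil(n_simple * design_effect)


# Illustrative inputs: an effect of 0.2 standard deviations, 5% significance, 80% power
n_srs = households_per_arm(effect=0.2, sd=1.0)
n_clustered = households_per_arm(effect=0.2, sd=1.0, cluster_size=20, icc=0.05)
print(f"simple random sample: {n_srs} per arm")
print(f"20 per cluster, icc = 0.05: {n_clustered} per arm")
```

Because the required sample scales with the inverse square of the effect size, halving the detectable effect roughly quadruples the sample, and restricting attention to a narrow demographic subgroup means that this number must be reached within the subgroup rather than in the overall sample.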
If the sample design involves clustering, the number of clusters and the intracluster correlations are important in addition to the number of households (see discussion on standard errors in Section III). It is sensible for researchers to ask questions about power when they initiate analysis rather than bemoan that the sample size is too small after they have invested a lot of resources in the research project. Data that in other respects might appear very promising for the analysis of ECD interventions may not warrant analysis if the power is too low.

Coverage of Relevant Variables

To state the obvious, data are of value for the analysis of ECD interventions only if they include some information both on the ECD intervention and on its possible impacts on the targeted children (and their households), capturing at least some critical elements of the links across the life-cycle stages that are discussed in Section II. Many data sets, for example, have information on individuals' income and schooling and their co-resident children's schooling (and less frequently, pre-schooling) to date (e.g. most labor force surveys designed to capture the current conditions in the labor market). Such data often can illuminate some part of the chain implicit in going from the right-side of relation (9) to the adult child's resource access (the dependent variable in relation [9]), such as the relation between adults' completed schooling and their income or the relation between parental household income and pre-school or school progression of co-resident children. It might in some cases be possible to link together different components of the linkage as estimated from various data sets as noted above, and thus make use of estimates of such links to develop the overall picture of the impact of ECD interventions over children's life cycles.
But such data do not permit direct estimation of relation (9) nor of many of the links between adult children's income and their early life ECD program experience.

Measurement Errors

Data typically are imperfect representations of the underlying constructs of interest. Even for data such as self-reported completed schooling in developed countries, the noise-to-signal ratio34 has been estimated to be on the order of magnitude of 10% (e.g. Behrman, Rosenzweig, and Taubman 1994). Random measurement error in right-side variables tends to cause biases in the estimated coefficients towards zero: intuitively, the noise masks part of the effect of the signal, so the absolute magnitude of the coefficient is underestimated. This effect tends to be exacerbated in fixed effects (FE) estimates because controlling for fixed effects tends to increase the noise-to-signal ratio. The bias from random measurement error can be eliminated if there are multiple reports on a variable and the measurement errors across the reports are not correlated (e.g. schooling attainment as reported not only by the individual but by others, as in Ashenfelter and Krueger 1994 and Behrman, Rosenzweig and Taubman 1994). Instrumental variable (IV) estimates, as noted in Section III, may also eliminate this bias towards zero due to random measurement error.

However, measurement error is not only random; it might also have systematic components. For example, parental reports on child morbidity and on aspects of child development may depend very much on what their expectations are for "normal" child development, which may be systematically related to the parents' socioeconomic status and education. Such systematic errors may make accurate inferences about relations between ECD programs and such measures of child development difficult.

34 The "noise-to-signal" ratio refers to the fact that most concepts are not measured perfectly, particularly in self-reported data, but have some random measurement error (leaving aside for the moment systematic measurement error). This measurement error is referred to as "noise" (since it disguises or hides the systematic part or "signal" in the data). The variance in the measured variable therefore can be decomposed into the variance due to noise and the variance due to the signal, with a higher "noise-to-signal" ratio indicating more contamination due to random measurement error.

Human Subject Protection

Somewhat different in flavor than the four general aspects of data noted above, but also of importance, is the question of human subject protection. For existing data, the primary concern generally is protecting confidentiality of sample members. For the collection of new data, in addition to the question of confidentiality, the major questions seem to be what the burden on the respondents is, what benefits the respondents are expected to receive, what compensation is provided for the time that the respondents devote to the process, and what the risks and protections are for more invasive procedures such as the collection of biomedical samples. Considerations regarding the possibility of collecting new data that will inform us better regarding ECD programs should weigh the expected gains in terms of scientific knowledge against the expected costs, including such costs for the participants in the study.

B. Some Major Types of Data for Evaluating ECD Interventions

Cross-sectional Data

Cross-sectional surveys and censuses are the most common type of available data. Cross-sectional household surveys tend to have some information on some of the links that are discussed in Section II, though not necessarily the ones from early-life ECD program exposure to intermediate outcomes or pathways of interest, and generally not on both early-life ECD experiences and adult outcomes as would be necessary to estimate directly relations such as (9).
There are many cross-sectional surveys that are representative, often with a stratified cluster sample design, of populations of interest for this chapter. Censuses, of course, are also by definition representative of the populations covered except for possible undercounting (particularly of more marginal groups). There are also many cross-sectional surveys that are not representative but based instead on behaviors such as attending pre-school programs or health clinics. These non-representative data sources may have rich information, but interpretation of the implications of analysis for broader populations of interest may be difficult unless it is possible to control for the selection decision into the sample. In some cases it may be possible to control for such selectivity into the sample by using other representative or census data to estimate the selection rules on a set of variables common to the selected and the representative data sources.

Cross-sectional data sources vary considerably in their sample sizes and statistical power, and, as noted above in Section III, the required sample size for a given level of power and significance depends on the extent to which the question being asked is focused on a narrow or broader demographic group. Typically, as noted, cross-sectional data do not include information on the variables necessary to estimate directly reduced-form relations between early-life ECD and adult resource access as in relation (9), though some cross-sectional data sets may have some key information for examining the impact of ECD program options for small children on behaviors such as the time use of other household members (mothers, older sisters, other caregivers).
Cross-sectional data can most commonly be used to estimate reduced-form relations in the spirit of relations (4b) and (6), that is, the relations between ECD exposure and indicators of child development during preschool (based on recall data) and perhaps the school years for children. There are studies in the literature, for example, of associations between ECD program exposure and indicators of child development (e.g., Ghuman et al. [2005] for the Philippines).

The typical cross-sectional data permit some, but limited, control for the estimation problems that are discussed in Sections II and III. For instance, the cluster structure of many cross-sectional data sets permits the control for unobserved cluster (e.g. community) effects that might be correlated with family background characteristics and cause biases in the estimated impact of family background characteristics and ECD program characteristics if not controlled.35 The availability of information on several children per household also permits the investigation of time-varying changes that affect siblings differentially; this may be useful for evaluating ECD program effects if the programs change over time (e.g. see Parker, Todd, and Wolpin [2006] on the impact of the Mexican Oportunidades program on schooling of children too old to have been affected by the program versus those children of age to have been affected by the program).

Many cross-sectional data sets can be enriched by linking them with time series administrative data on public services (particularly related to health and education, possibly including ECD program characteristics), communication and transportation, and weather conditions.
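The control for unobserved cluster effects mentioned above can be sketched with a within-cluster (fixed-effects) estimator: subtracting cluster means from every variable removes anything that is constant within a cluster. The example below is only an illustration under stated assumptions. The data are hypothetical and noise-free, the function names `ols_slope` and `within_cluster_slope` are introduced here, and a real application would involve several regressors and cluster-robust standard errors.

```python
from collections import defaultdict


def ols_slope(x, y):
    """Slope from a one-regressor least-squares fit of y on x (with intercept)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def within_cluster_slope(cluster, x, y):
    """Slope after subtracting cluster means, which removes cluster fixed effects."""
    groups = defaultdict(list)
    for c, a, b in zip(cluster, x, y):
        groups[c].append((a, b))
    x_dev, y_dev = [], []
    for obs in groups.values():
        mean_x = sum(a for a, _ in obs) / len(obs)
        mean_y = sum(b for _, b in obs) / len(obs)
        x_dev.extend(a - mean_x for a, _ in obs)
        y_dev.extend(b - mean_y for _, b in obs)
    return ols_slope(x_dev, y_dev)


# Hypothetical data: 4 communities with 5 children each. The true effect of
# x on y is 2, but each community adds an unobserved effect (5 * its index)
# that is positively correlated with x.
cluster, x, y = [], [], []
for c in range(4):
    for j in range(5):
        cluster.append(c)
        x.append(c + j)
        y.append(2 * (c + j) + 5 * c)

pooled = ols_slope(x, y)                      # mixes in the community effects
within = within_cluster_slope(cluster, x, y)  # uses within-community variation only
print(round(pooled, 2), round(within, 2))
```

In this constructed example the pooled slope is well above the true value of 2, while the within-cluster slope recovers it. The within-cluster estimate is identified only from variation inside each cluster, which is the limitation that footnote 35 raises for ECD program characteristics.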
Such linked data can help in several ways, for example: (1) even if the basic household data being used are cross-sectional, time series on available services may be informative for time periods earlier in the children's lives; (2) such data may make possible within-sibling estimates if different siblings faced different community services related to ECD during critical periods such as early childhood; (3) such data may provide instruments that arguably are independent of the unobserved factors on the right-side of the relations in Section II but that predict sufficiently well the right-side ECD options so that good IV estimates can be obtained (e.g. levels and variations in rainfall may provide good instruments for parental income in agricultural areas, as used for a different purpose in India by Wolpin [1982] and in Thailand by Paxson [1992], and for purposes much more directly related to this chapter in the study of health, schooling, and socioeconomic consequences in Indonesia by Maccini and Yang [2006]).

35 This would work for ECD program characteristics only if there is variation in such characteristics among the programs in each sample cluster.

Longitudinal or Panel Data

These data follow individuals and/or households over time. They generally provide a more satisfactory means of identifying the impact of ECD interventions than do cross-sectional data because: (1) the prospective data gathered in earlier rounds are likely to be less contaminated with measurement error and more complete than recall data from cross-sectional data sources; (2) the multiple observations over time in some cases permit the control for unobserved individual fixed effects such as innate ability and health; (3) the multiple observations over time permit the exploration of the dynamics of effects such as whether they tend to diminish over time or are enhanced over time, perhaps in part in interaction with dimensions of the environment in which the individual is developing (e.g.
do early-life nutritional shocks have only short-run or also long-run effects, and to what extent does this depend on whether subsequently the school system or other institutions can in part or in whole compensate for them); and (4) the multiple observations over time permit exploring the impacts of possibly changing contextual factors, depending in part on how rich the contextual information is. On the other hand, longitudinal data are more expensive to collect than a time series of cross sections of equal size because of the costs and problems in following up with the same individuals, are subject to attrition because of factors such as migration36 (see discussion in Section III), and are less likely to be representative of the current overall population (though not necessarily of particular birth cohorts) than a time series of cross sections even if there is no attrition.

There currently exist relatively few longitudinal household data sets from developing countries with panels over several decades, as needed to see how conditions related to ECD measured prospectively early in the life cycle affect adult outcomes. But there are a few. Examples include: the INCAP Guatemalan data on children 0-7 years old in 1969-1977 with follow-up rounds in 1988-9 and 2002-4, at which time the children were 25-42 years of age (Martorell et al.
2005); the Cebu (Philippines) Longitudinal Health and Nutrition data of births in 1983 with the last follow-up in 2005 when the children were up to 20-22 years old and their mothers were from 35 to 69 years old (Cebu Study Team 1991, 1992; Glewwe, Jacoby and King 2000; Glewwe and King 2001; Daniels and Adair 2004); the Pelotas Brazilian data on the birth cohort of 1982 with the last follow-up in 2004-5 when the children were up to 25 years of age (Victora, Victora and Barros 1990; Victora and Barros 2005); the NCAER rural Indian data starting in 1969-71 with follow-up until 2002 (Foster and Rosenzweig 1995, 2004); the Bangladeshi nutritional data with follow-up after over two decades (Pitt, Rosenzweig, and Hassan 1990, 2006); the ICRISAT village-level study (VLS) data starting in 1975 with follow-up ongoing to the present (Behrman 1988a,b; Behrman and Deolalikar 1987).

36 Though not because of mortality, which implies a selection problem for a time series of cross sections following the same birth cohort just as for longitudinal data following that cohort.

There are many more longitudinal data sets that cover shorter, but important, segments of the life-cycle stages noted above. A few examples include: the Mexican PROGRESA data for 1997-2003 (Behrman, Parker, and Todd 2006, 2007); a number of the Demographic and Health Survey (DHS) data sets; the Vietnam Living Standard Measurement Survey (LSMS, Agarwal, Dollar, and Glewwe 2004); the Chilean Encuesta de Protección Social survey from 2002-2006 (Bravo et al. 2006); the Bolivian PIDI evaluation data (Behrman, Cheng, and Todd 2004); the Malawian Diffusion and Ideation Change Project Data for 1998-2006 (Watkins et al. 2003; Kohler, Behrman, and Watkins 2007); the Kenyan school-based sample (Glewwe et al. 2004, Miguel and Kremer 2004); the Colombian Familias en Acción sample for 2002-6 (Attanasio et al. 2004); the Philippines Early Childhood Development Survey for 2001-6 (Armecin et al. 2006; Ghuman et al.
2005, 2006a,b); the Mexican Family Life Survey (Rubalcava and Teruel 2004); the Indonesian Family Life Survey (Thomas et al. 2003).

Longitudinal data can be, and in some cases are, enriched in ways that are parallel to cross-sectional data: inclusion of questions for previous generations or other people not currently in the households, and linkage to administrative data. In addition, some longitudinal data have built into their design controlled experiments with random assignment between treatment and control groups.37 Some prominent examples include: the Mexican rural PROGRESA program with random assignment of initial treatment versus controls for 506 communities including provision of micronutrients for infants and young children (Behrman and Hoddinott 2005; Behrman, Parker, and Todd 2006, 2007); the Kenyan random assignment of various treatments (including deworming, flip charts) among 75 schools (Glewwe et al. 2004, Miguel and Kremer 2004); the Guatemalan INCAP data with random assignment of nutritional supplements among the four participant communities (Martorell et al. 2005, Behrman et al. 2006, Maluccio et al. 2007); experimental assignment of fees and distances to VCT clinics in the Malawian Diffusion and Ideation Change Project (MDICP, Watkins et al. 2003; Kohler, Behrman, and Watkins 2007); random assignment of iron supplements in Indonesia (Thomas et al. 2003).

Such experiments provide (a) capacity for identifying the causal effect of treatment and (b) the possibility of identifying the impact of one behavioral choice affected by the treatment on another by using the experimental assignment as an instrument for IV estimates. These are considerable strengths. Good experiments rightly are thought to be the "gold standard" for obtaining reduced-form impact estimates, including for ECD programs.
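Point (b) can be illustrated with the simplest IV estimator, the Wald ratio, in which the effect of random assignment on the outcome is divided by the effect of random assignment on the behavioral choice. The sketch below uses hypothetical, noise-free data with a constant treatment effect. The variable names (`z` for assignment, `d` for the behavior, `u` for an unobserved household factor, `y` for the outcome) and the function names are assumptions introduced for illustration, not taken from any of the studies cited.

```python
def mean(values):
    """Arithmetic mean of a non-empty list."""
    return sum(values) / len(values)


def wald_iv(z, d, y):
    """Wald estimate: (effect of assignment z on y) / (effect of z on behavior d)."""
    y1 = mean([yi for zi, yi in zip(z, y) if zi == 1])
    y0 = mean([yi for zi, yi in zip(z, y) if zi == 0])
    d1 = mean([di for zi, di in zip(z, d) if zi == 1])
    d0 = mean([di for zi, di in zip(z, d) if zi == 0])
    return (y1 - y0) / (d1 - d0)


def naive_difference(d, y):
    """Difference in mean outcomes by behavior, ignoring why d differs."""
    y1 = mean([yi for di, yi in zip(d, y) if di == 1])
    y0 = mean([yi for di, yi in zip(d, y) if di == 0])
    return y1 - y0


# Hypothetical population: assignment z is balanced across the unobserved
# factor u, as random assignment would deliver in expectation.
z, d, y = [], [], []
for zi in (0, 1):
    for ui in (0, 1):
        for _ in range(25):
            di = 1 if (zi == 1 or ui == 1) else 0  # u also drives the behavior
            z.append(zi)
            d.append(di)
            y.append(1.5 * di + 2.0 * ui)          # true effect of d on y is 1.5

print(round(naive_difference(d, y), 2), round(wald_iv(z, d, y), 2))
```

Because the unobserved factor raises both the behavior and the outcome, the naive comparison overstates the effect, whereas the Wald ratio recovers the assumed value of 1.5. With heterogeneous effects the ratio would instead estimate an average for those whose behavior responds to assignment.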
But there also are limitations of experiments: some experiments may be viewed as unethical or politically unwise; selective attrition between the treatment and control areas may introduce selectivity biases; and even very good experiments only provide "black box" estimates of the impact of the specific intervention used and not of alternative counterfactuals, including the impacts in different market, policy and cultural contexts and even the longer-run impacts in the location in which the experiment was conducted.38 Therefore, while good experiments to improve the evaluation of ECD programs should be strongly encouraged, non-experimental data also can provide useful insights into key relations relating to the impacts of ECD programs over the life cycle if they are analyzed with sensitivity to the estimation problems raised in Sections II and III and the estimates interpreted with sensitivity to the limitations of the data and the approaches in light of the life-cycle framework outlined in Section II.

37 Since such experiments almost always have baseline and post-intervention data rounds, they are longitudinal and not cross-sectional. In principle, if the treatment and control groups are randomly selected then only looking at the cross-sectional post-treatment data should be informative. But it would not be possible in such a case to test whether or not the assignment really was random (as is done, for example, in Behrman and Todd [1999a] for the Mexican PROGRESA data).

38 At the cost of the assumptions necessary to estimate structural models of the behaviors such as are outlined in Section III, evaluations of counterfactual policies can be made (e.g. different treatments, impacts for longer time periods than observed in the data). Todd and Wolpin (2007) provide an example using the Mexican PROGRESA data. They estimate a structural model using baseline data, test the model's predictions against the experimental results (and find that the model predicts fairly well), and then use the model to conduct counterfactual experiments (e.g. with different scholarship schedules for different grades, with the program running many years even though the experimental data were only for the baseline plus two years of follow-up).

Time Series of Cross-sectional Surveys

A time series of cross-sectional surveys provides a means of tracing cross-sectional associations over time as cohorts age and possibly permits controlling for cohort-specific unobserved factors. This has the advantage of using more readily available data than longitudinal data, as well as data that are representative for each cross section.39 Deaton and Paxson (1994) give an example in which they trace the persistence of earnings shocks experienced early in the adult life cycle as cohorts age in Taiwan and the United States. The possibilities for using such an approach to investigate the impact of ECD interventions seem limited, but perhaps underexplored.

Qualitative Data Sources

Most other possible data sources for investigating ECD interventions can be considered to fit within the categories of being either cross-sectional or longitudinal (particularly since cross-sectional and longitudinal data may be either quantitative or qualitative). The same general questions of data quality (Section III) apply for such data sources. That is, the questions of representativeness, power, variable coverage, measurement errors and human subject protection hold for qualitative as well as for quantitative data. Extensive family or individual histories or focus groups may provide useful insights into hypotheses regarding ECD interventions whether or not they are representative or have sufficient power or whatever the nature of the measurement errors.
But if inferences are to be drawn from such data sources about aspects of ECD interventions for some population larger than the sample itself, it is necessary to know how the sample relates to the larger population and to assure that power is sufficient and to understand possible problems with measurement errors. Likewise, it is necessary to recognize that associations do not imply causality with qualitative data any more than with quantitative data. Indeed it may be equally important to attempt to control for unobserved factors in qualitative analyses as in quantitative analyses. With regard to representativeness of qualitative data, there is a strong attraction to drawing the sample in the same way that one would draw the sample for quantitative data. There are possibilities, which in a few cases have been exploited, of combining qualitative and quantitative data, not only with qualitative data collected first to inform the questionnaire design for quantitative data, but with the subsample for the qualitative data drawn randomly from that for the quantitative data so that not only are the sample characteristics for the qualitative data known but it is also possible in the analysis to combine the quantitative and the qualitative data (e.g. such strategies have been followed for the MDICP project described in Watkins et al. [2003] and Kohler, Behrman and Watkins [2007]).

39 But as noted above, such data are not representative of birth cohorts if there is selective mortality, including in early life.

Conclusions

There are a small number of reasonably good evaluations of ECD programs in developing countries that suggest that the impacts of these programs may be substantial, in some cases over the medium and long run in addition to the fairly short-run effects. In a few cases analysis also suggests that the benefit-cost ratios or rates of return to resources used in these programs are high.
But most of the existing systematic analyses of ECD programs consider only a subset of the possible impacts over a fairly short time horizon with little or no attention to issues related to the timing of these impacts, the resource costs of the programs, the sensitivity of the programs to market, policy or cultural contexts, or the implications of the program for the efficiency motive for policies. Given the widespread perception that ECD programs in developing countries are important and the widespread advocacy for expanding them, therefore, the potential gains from expanding systematic evaluations of ECD programs are considerable.

Undertaking good empirical analysis of many aspects of ECD interventions, however, is challenging given data limitations. These challenges are clarified in this chapter by: (1) considering the types of relations that might be estimated to be informative for understanding better aspects of the links that connect ECD programs and adult resource access and other relevant outcomes within an intergenerational life-cycle framework in which there may be important unobserved variables such as genetic ability endowments; (2) considering possible resolutions to some of the estimation problems that such a framework implies; (3) considering different types of data that are available that might permit advances in knowledge of the impact of ECD interventions in developing countries in light of such a framework of analysis and the related estimation alternatives; and (4) discussing the importance of investigating not only the impacts but how to value those impacts to obtain total benefits and comparing those benefits with total resource costs, with sensitivity analysis for critical parameters such as discount rates and the value of averting early mortality. Throughout, efforts are made to give illustrations of related studies, primarily on developing countries but also in some cases for developed economies.
Despite these challenges, a number of interesting options are available for estimating relations pertaining to aspects of the ECD interventions and for exploring the robustness of estimates related to ECD interventions to alternative strategies for dealing with the estimation problems. In some cases, moreover, researchers can and should provide their understanding of the probable directions and, if possible, magnitudes of biases due to the estimation problems that they cannot deal with directly in a particular study of ECD interventions.

Of course, it is desirable to focus on where the expected gains are greatest. Relevant considerations in deciding where the gains are likely to be greatest would seem to include:

· Are there aspects of the links between ECD programs and outcomes over the life cycle that are described in Section II for which improved knowledge is particularly important because the effects are thought to be particularly large or because there is great uncertainty about the probable magnitudes of the effects?40
· What is the nature of data quality with regard to representativeness, power, coverage of important concepts in the linkage between ECD programs and outcomes over the life cycle and therefore the impact of ECD interventions, and measurement error?
· What special features of the data might permit better exploration of the impact of ECD interventions? Can the data be linked to time series records on a range of contextual changes? Can the robustness of the estimates to at least some of the estimation problems be tested, for instance by exploiting information on siblings, members of the same sample cluster, experiments, and/or longitudinal data?
· How best can good estimates of the resource costs for ECD programs be obtained?
· How can better estimates of the social versus the private rates of return to ECD programs be obtained so that we would be better informed about the efficiency motive for policies?
· Might incentives or other mechanisms be used to make ECD programs more efficient and therefore have higher benefit-cost ratios?

Through careful examination of existing data, keeping in mind the considerations that are discussed in this chapter, much can be learned about ECD interventions in developing countries. But at the same time, in order to create a better informational basis for such studies in the future, it is important to be alert to opportunities for improving data collection and encouraging the collection of new and better data with: better indicators of costs and benefits of ECD programs; more randomization to facilitate more confident identification of causal effects of ECD programs; randomization with regard not only to program availability but also to information about programs and key characteristics that may influence the quality of programs; longer time horizons for data collection so that the medium- and long-run effects can be explored directly; and greater variance of market, policy and cultural context so that the robustness of the benefits/costs of ECD programs to the great contextual variation in developing countries can be ascertained. Careful systematic analyses of such data will importantly enhance our knowledge of ECD programs in developing countries.

40 While such a question seems obvious to ask, it is not clear that it always is raised in determining the portfolio of social science research. Recent analysis suggests, for example, that research on health and development in economics in particular, and in the social sciences more broadly, has emphasized HIV/AIDS and injuries relative to non-communicable diseases much more than current or projected future distributions of these health/disease conditions would warrant (Behrman, Behrman, and Perez 2006a,b).

References41

Aboud FE. 2006. "Evaluation of an early childhood preschool programme in rural Bangladesh."
Early Childhood Res Quarterly 21 (1):46-60.

41 This reference list includes the references that are cited in this chapter, plus some others that are not cited explicitly but may be of interest to analysts working in this area.

Adair LS. 1999. "Filipino children exhibit catch-up growth from age 2 to 12 years." J Nutr 129 (6): 1140-8.
Adair, L. and B. Popkin. 1988. "Birth Weight, Maturity and Proportionality in Philippine Infants," Human Biology 60, 319-339.
Adhikari, Ramesh, Paul Gertler and Anneli Lagman. 1999. "Economic Analysis of Health Sector Projects--A Review of Issues, Methods, and Approaches," Manila: Asian Development Bank.
Agarwal, DK, Upadhyay SK, Tripathi AM, and Agarwal KN. 1987. "Nutritional status, physical work capacity and mental function in school children." Nutritional Foundation of India. Scientific Report No. 6.
Agarwal, Nisha, David Dollar and Paul Glewwe, eds. 2004. Economic Welfare, Poverty and Economic Growth in Vietnam, Washington, DC: World Bank.
Alderman, Harold and Jere R. Behrman. 2006. "Reducing the Incidence of Low Birth Weight In Low-Income Countries has Substantial Economic Benefits," World Bank Research Observer 21:1, 25-48.
Alderman, Harold, Jere Behrman, Victor Lavy and Rekha Menon. 2001. "Child health and School Enrollment: A Longitudinal Analysis." Journal of Human Resources 36(1): 185-205.
Alderman, H., J. R. Behrman, H.-P. Kohler, J. A. Maluccio, and S. Cotts Watkins. 2001a. "Attrition in longitudinal household survey data: Some tests for three developing country samples." Demographic Research [Online] 5 (4): 79-124. Available at http://www.demographic-research.org.
Alderman H, Engle P. The synergy of nutrition and ECD interventions in Africa. Africa: World Bank, In press, 2007.
Alderman, H., J. Hentschel, and R. Sabates. 2003. "With the help of one's neighbors: externalities in the production of nutrition in Peru." Social Science & Medicine. 56(10):2019-2031.
Alderman, Harold, John Hoddinott and Bill Kinsey. 2006. "Long Term Consequences of Early Childhood Malnutrition," Oxford Economic Papers, 58: 450-474.
Almond, Douglas, Kenneth Y. Chay and David S. Lee. 2005. "The Costs of Low Birth Weight," Quarterly Journal of Economics 120:3 (August), 1031-1083.
Als H, Duffy FH, McAnulty GB, et al. "Early experience alters brain function and structure." Pediatr 2004; 113 (4):846-857.
Angrist, Joshua D., Eric Bettinger, Erik Bloom, Elizabeth King and Michael Kremer. 2002. "Vouchers for Private Schooling in Colombia: Evidence from a Randomized Natural Experiment," American Economic Review 92:5 (December), 1535-59.
Angrist JD, Lavy V. 2002. "The effect of high school matriculation awards: Evidence from randomized trials." Cambridge, MA: NBER Working Paper 9389.
Araya R, Rojas G, Fritsch R, Gaete J, Rojas M, Simon G. 2003. "Treating depression in primary care in low-income women in Santiago, Chile: a randomised controlled trial." Lancet 361 (9362):995-1000.
Armecin, Graeme, Jere R. Behrman, Paulita Duazo, Sharon Ghuman, Socorro Gultiano, Elizabeth M. King, Nanette Lee, and The Office of Population Studies, University of San Carlos ECD Team. 2006. "Early Childhood Development through Integrated Programs: Evidence from the Philippines," Cebu City, New York, Philadelphia and Washington: Universities of Pennsylvania and San Carlos, Population Council and World Bank.
Armstrong KL, Fraser JA, Dadds MR, Morris J. 1999. "A randomized, controlled trial of nurse home visiting to vulnerable families with newborns." J Pediatr Child Health 35 (3):237-244.
Armstrong KL, Fraser JA, Dadds MR, Morris J. 2000. "Promoting secure attachment, maternal mood and child health in a vulnerable population: a randomized controlled trial." J Pediatr Child Health 36 (6):555-562.
Ashenfelter, Orley, Angus Deaton, and Gary Solon. 1986. "Collecting Panel Data in Developing Countries: Does it Make Sense?" LSMS Working Paper 23, Washington, D.C.: The World Bank.
Ashenfelter, Orley and Alan Krueger. 1994. "Estimates of the Economic Return to Schooling from a New Sample of Twins," American Economic Review 84:5 (December), 1157-74.
Attanasio, Orazio, Luis Carlos Gomez, Ana Gomez Rojas and Marcos Vera-Hernández. 2004. "Child health in rural Colombia: determinants and policy interventions," Economics & Human Biology 2:3 (December), 411-438.
Attanasio, Orazio and Marcos Vera-Hernández. 2004. "Medium and Long Run Effects of Nutrition and Child Care: Evaluation of a Community Nursery Programme in Rural Colombia," London: University College London.
Baker, Amy J. L., Chaya S. Piotrkowski and Jeanne Brooks-Gunn. 1998. "Effects of the Home Instruction Program for Preschool Youngsters (HIPPY) on Children's School Performance at the End of the Program and One Year Later," Early Childhood Research Quarterly 13:4, 571-588.
Baker-Henningham H, Powell C, Walker S, Grantham-McGregor S. 2003. "Mothers of undernourished Jamaican children have poorer psychosocial functioning and this is associated with stimulation provided in the home." Eur J Clin Nutr 57 (6): 786-92.
Baker-Henningham H, Powell C, Walker S, Grantham-McGregor S. 2005. "The effect of early stimulation on maternal depression: a cluster randomised controlled trial." Arch Dis Child 90 (12):1230-1234.
Baker-Henningham H, Walker S, and Chang-Lopez S. 2006. "Special education needs study: end of phase 1 report." Kingston, Jamaica: Department of Educational Studies/Office of the Principal, University of the West Indies.
Ballard, Charles, John Shoven, and John Whalley. 1985. "General Equilibrium Computations of the Marginal Welfare Costs of Taxes in the United States," American Economic Review 75(1):128-38.
Barker, D. J. P. (ed.). 1992. "Fetal and infant origins of adult disease: papers written by the Medical Research Council Environmental Epidemiology Unit, University of Southampton," British Medical Journal 1998: Mothers, babies and health in later life, 2nd edn., Churchill Livingstone, Edinburgh, London and New York.
Barnum H. 1995. "Economic Evaluation of Health Projects," Washington, DC: World Bank.
Behrman, Jere R. 1988a. "Nutrition, Health, Birth Order and Seasonality: Intrahousehold Allocation in Rural India," Journal of Development Economics 28:1 (February), 43-63.
Behrman, Jere R. 1988b. "Intrahousehold Allocation of Nutrients in Rural India: Are Boys Favored? Do Parents Exhibit Inequality Aversion?" Oxford Economic Papers 40:1 (March), 32-54.
Behrman J. 1996. The impact of health and nutrition on education. World Bank Research Observer 11(1):25-37.
Behrman, Jere R. 1997. "Women's Schooling and Child Education: A Survey," Philadelphia, PA: University of Pennsylvania, mimeo.
Behrman, Jere R., Harold Alderman and John Hoddinott. 2004. "Hunger and Malnutrition" in ed. Bjorn Lomborg, Global Crises, Global Solutions, Cambridge, UK: Cambridge University Press, 363-420.
Behrman, Jere R., Julia A. Behrman and Nykia Perez. 2006a. "Where Should UNU-WIDER Place Their Bets? Some Considerations for Prioritizing Economic Research on Health and Development," for Invited Keynote Speech, United Nations University (UNU)-World Institute for Development Economics Research (WIDER) Conference on Health Deprivation and Inequality, 29-30 September 2006, Helsinki, Finland.
Behrman, Jere R., Julia A. Behrman and Nykia Perez. 2006b. "Out of Sync? Social Science Research on Health and Development and Health Conditions in Developing Countries," Philadelphia: University of Pennsylvania, mimeo.
Behrman, Jere R., Yingmei Cheng, and Petra Todd. 2004. "Evaluating Preschool Programs when Length of Exposure to the Program Varies: A Nonparametric Approach," Review of Economics and Statistics 86:1 (February 2004), 108-132.
Behrman, Jere R. and Anil B. Deolalikar. 1987. "Will Developing Country Nutrition Improve with Income? A Case Study for Rural South India," Journal of Political Economy 95:3 (June), 108-138.
Behrman, Jere R., Andrew Foster, and Mark R. Rosenzweig. 1997. "The Dynamics of Agricultural Production and the Calorie-Income Relationship: Evidence from Pakistan," Journal of Econometrics 77:1 (March), 187-207.
Behrman, Jere R. and John Hoddinott. 2005. "Program Evaluation with Unobserved Heterogeneity and Selective Implementation: The Mexican Progresa Impact on Child Nutrition," Oxford Bulletin of Economics and Statistics 67:4, 547-569.
Behrman, Jere R., John Hoddinott, John A. Maluccio, Erica Soler-Hampejsek, Emily L. Behrman, Reynaldo Martorell, Manuel Ramirez and Aryeh D. Stein. 2006. "What Determines Adult Skills? Impacts of Pre-School, School-Years and Post-School Experiences in Guatemala," Philadelphia, PA: University of Pennsylvania, mimeo.
Behrman, Jere R., Susan W. Parker, and Petra E. Todd. 2006. "Medium-Term Effects of the Oportunidades Program Package on Young Children," Philadelphia, PA: University of Pennsylvania and Mexico City: CIDE, mimeo.
Behrman, Jere R., Susan W. Parker and Petra E. Todd. 2007. "Medium-Term Impacts of the Oportunidades Conditional Cash Transfer Program on Rural Youth in Mexico," in Stephan Klasen and Felicitas Nowak-Lehmann, eds., Poverty, Inequality, and Policy in Latin America, Cambridge, MA: MIT Press.
Behrman, Jere R. and Mark R. Rosenzweig. 2002. "Does Increasing Women's Schooling Raise the Schooling of the Next Generation?" American Economic Review 92:1 (March), 323-334.
Behrman, Jere R. and Mark R. Rosenzweig. 2004. "Returns to Birthweight," Review of Economics and Statistics 86:2 (May), 586-601.
Behrman, Jere R. and Mark R. Rosenzweig. 2005.
"Does Increasing Women's Schooling Raise the Schooling of the Next Generation? ­ Reply" American Economic Review 95:5 (December), 1745-1751. Behrman, Jere R., Mark R. Rosenzweig, and Paul Taubman. 1994. "Endowments and the Allocation of Schooling in the Family and in the Marriage Market: The Twins Experiment," Journal of Political Economy 102:6 (December), 1131-1174. Behrman, Jere R., Piyali Sengupta and Petra Todd. 2005. "Progressing through PROGRESA: An Impact Assessment of Mexico's School Subsidy Experiment," Economic Development and Cultural Change 54:1 (October 2005) 237-275. Behrman, Jere R. and Petra E. Todd. 1999a. "Randomness in the Experimental Samples of PROGRESA (Education, Health and Nutrition Program)," Washington, DC: International Food Policy Research Institute. Behrman, Jere R. and Petra E. Todd. 1999b. "An Evaluation of the Sample Sizes for Evaluation of PROGRESA (Education, Health and Nutrition Program) of Mexico," Washington, DC: International Food Policy Research Institute. Behrman, Jere R. and Barbara L. Wolfe. 1984. "The Socioeconomic Impact of Schooling in a Developing Country," Review of Economics and Statistics 66:2 (May), 296- 303. Behrman, Jere R. and Barbara L. Wolfe. 1987a. "How Does Mother's Schooling Affect the Family's Health, Nutrition, Medical Care Usage, and Household Sanitation?" Journal of Econometrics 36, 185-204. Behrman, Jere R. and Barbara L. Wolfe. 1987b. "Investments in Schooling in Two Generations in Pre- Revolutionary Nicaragua: The Roles of Family Background and School Supply," Journal of Development Economics 27:1-2 (October), 395-420 (reprinted in International Trade, Investment, Macro Policies and History: Essays in Memory of Carlos F. Diaz-Alejandro (eds., Pranab Bardhan, Jere R. Behrman, and Albert Fishlow), Amsterdam: North-Holland, 1987), 395-420. Belli, Pedro, J. Anderson, H. Barnum, J. Dixon and J.P. Tan. 1998. 
Handbook on Economic Analysis of Investment Operations, Washington, DC: The World Bank 53 Berkman DS, Lescano AG, Gilman RH, Lopez SL, Black MM. 2002. "Effects of stunting, diarrhoeal disease, and parasitic infection during infancy on cognition in late childhood: a follow-up study." Lancet 359 (9306): 564-71. Berlinski, Samuel, Sebastian Galiani and Paul Gertler. 2006. "The Effect of Pre-Primary Education on Primary School Performance," London- University College London and Institute for Fiscal Studies, Buenos Aires -Universidad de San Andres and Washington-World Bank. Bertrand M, Duflo E and Mullainathan S. 2004. "How much should we trust differences- in-differences estimates?" Quarterly Journal of Economics 119 (1): 249­275. Bhandari N, Bahl R, Taneja S, de Onis M, Bhan MK. 2002. "Growth performance of affluent Indian children is similar to that in developed countries." Bull World Health Org 80: 189-95. Black J, Jones T, Nelson C, Greenough W. 1998. "Neuronal plasticity and the developing brain." Handbook of child and adolescent psychiatry, Vol. 1.New York: Wiley. 31- 53. Black, Sandra E., Paul J. Devereux, and Kjell G. Salvanes. 2005. "Why the Apple Doesn't Fall Far: Understanding Intergenerational Transmission of Human Capital," American Economic Review 95:1 (March), 437-449. Bobonis G, Miguel E, Sharma CP. 2004. "Iron deficiency, anemia and school participation." Berkeley, Calif.: Dept of Economics, University of California. Bolton P, Bass J, Neugebauer R, et al. 2003. « Group interpersonal psychotherapy for depression in rural Uganda: a randomized controlled trial." JAMA 289 (23):3117- 3124. Bradley R, Corwyn R. 2002. "Socioeconomic status and child development." Ann Rev Psychol 53: 371-99. Bravo, David, Jere R. Behrman, Olivia Mitchell and Petra E. Todd. 2006. "Encuesta de Protección Social 2004: Presentación General y Resultados Principales," Santiago, Chile: Centro de Microdatos, Universidad de Chile. Bredy T, Humpartzoomian R, Cain D, Meaney M. 2003. 
"Partial reversal of the effect of maternal care on cognitive function through environmental enrichment." Neurosci 118: 571-6. Brito GNO, de Onis M. 2004. "Growth status, behavior and neuropsychological performance." Arquivos Neuro-Psiquiatr 62 (4): 949-54. 54 Brooker S, Hall A, Bundy DAP, Adjei S, Allen E, Amuah E et al. 1999. "Short stature and the age of enrolment in primary school: studies in two African countries." Soc Sci Med 48 (5): 675-82. Brooks-Gunn J, Duncan GJ. 1997. "The effects of poverty on children." The Future of Children 7: 55-71. Brooks-Gunn J, Fuligni A, Berlin L. 2003. "Early child development in the 21st Century: Profiles of current research initiatives." In: College T, ed. http://www.unicef.org/earlychildhood/index_resources.html, New York: Columbia University. Bross D, Miyoshi T, Miyoshi P, Krugman R. 2000. World perspectives on child abuse: The fourth international resource book. Oxford, UK: Elsevier Science Ltd. Bundy, D, Guyatt J. "Schools for Health: Focus on Health, Education, and the School- Age Child." Parasitology Today 12:1-16. Bundy, D.A.P. 1988. "Population Ecology of Intestinal Helminth Infections in Human Communities." Philosophical Transactions of the Royal Society of London. Series B. 321 (1207), 405-420. Bundy, D.A.P., Chan, M-S., Medley, G.F., Jamison, D., and Savioli, L. 1998. "Intestinal Nematode Infections." In Health Priorities and Burden of Disease Analysis: Methods and Applications from Global, National and Sub-national Studies. Harvard University Press for the World Health Organization and the World Bank (in press). Card, D. 1999. `The Causal Effect of Education on Earnings', in O. Ashenfelter and D. Card, editors, Handbook of Labor Economics, Elsevier Science B.V. Case, Anne and Angus Deaton. 1999. "School Inputs and Educational Outcomes in South Africa," Quarterly Journal of Economics 114:3 (August), 1047-1084. Cebu Study Team (John Akin, David Guilkey, Barry Popkin, Wilhelm Flieger, John Briscoe, Robert E. 
Black, and Linda Adair). 1992. "A Child Health Production Function Estimated from Longitudinal Data," Journal of Development Economics 38:2 (April), 323-522. Cebu Study Team. 1991. "Underlying and Proximate Determinants of Child Health: The Cebu Longitudinal Health and Nutrition Study," American Journal of Epidemiology 133:2, 185-201. Checkovich, Tennille and Steven Stern. 2002. "Shared Caregiving Responsibilities of Adult Siblings with Elderly Parents," Journal of Human Resources, 37(3):441-478. 55 Chen C, Tseng Y, Chou F, Wang S. 2000. "Effects of support group intervention in postnatally distressed women. A controlled study in Taiwan." J Psychosom Res 49 (6):395-399. Cheung Y. "Growth and cognitive function of Indonesian children: Zero-inflated proportion models." Stat Med 2006; In press. Cheung YB, Yip PSF, Karlberg JPE. 2001. "Fetal growth, early postnatal growth and motor development in Pakistani infants." Int J Epidemiol 30 (1): 66-72. Chopra M. 2001. Assessment of participants on the care for development. IMCI training course. University of the West Cape, South Africa. Clarke N, Grantham-McGregor S, Powell C. 1990. "Health and nutrition predictors of school failure in Kingston, Jamaica." Ecol Food Nutr 26: 1-11. Colletta N, Satoto, Sockaling-Ham S, Zeitlan M. 1993. "The child development milestone chart: An approach to low cost programming in Indonesia." Early Child Dev and Care 96 161-171. Committee on Integrating the Science of Child Development. 2000. From neurons to neighborhoods: the science of child development. Washington D. C.: National Academy Press. Conley, Dalton and Neil G. Bennett. 2000. "Is Biology Destiny? Birth weight and Life Chances." American Sociological Review 65 (June): 458-467. Cooper P, Tomlinson M, Swartz L, Woolgar M, Murray L. 1999. "Post-Partum depression and the mother-infant relationship in a South African peri-urban settlement." Br J Psychiatry (175):554-558. Cueto S. 2005. 
"Height, weight, and education achievement in rural Peru." Food Nutr Bull 26 (2 Suppl 2): S251-S260 Cueto S, Diaz J. 1999. "Impacto de la educacion inicial en el rendimiento en primer grado de primaria en escuelas publicas urbanas de Lima." Revista de Psicologia 17 (1):74-91. Cunha, Flavio. 2006. "A Time to Plant and a Time to Reap," Chicago: University of Chicago. Cunha, Flavio and James J. Heckman. 2006. " Formulating, Identifying and Estimating the Technology of Cognitive and Noncognitive Skill Formation," Chicago: The University of Chicago. Cunha, F., J. J. Heckman, L. Lochner, and D. V. Masterov. 2005. "Interpreting the Evidence on Life Cycle Skill Formation," In Handbook of the Economics of Education, eds. E. Hanushek and F. Welch. Amsterdam: North Holland. 56 Currie, Janet and Duncan Thomas. 1995. "Does Head Start Make a Difference?" American Economic Review 85:3 (June), 341-364. Currie, Janet and Duncan Thomas. 1996. "Does Head Start Help Hispanic Children?" Cambridge, MA: National Bureau of Economic Research. Currie, Janet, and Duncan Thomas. 1999. "Early Test Scores, Socioeconomic Status and Future Outcomes," Cambridge, MA: National Bureau of Economic Research working paper no. 6943. Currie J, Thomas D. 2001. "Early test scores, socioeconomic status, school quality and future outcomes." Res Labor Econ 20 103-132. Currie J. 2001. "Early childhood intervention programs: what do we know?" J Econ Perspec 15 (2):213-238. Daniels, Melissa C. and Linda S. Adair. 2004. "Growth in Young Filipino Children Predicts Schooling Trajectories through High School," Journal of Nutrition 134, 1439-1446. Deaton A. 1997. The Analysis of Household Surveys: A Microeconometric Approach to Development Policy. Baltimore and London: The Johns Hopkins University Press for the World Bank. Deaton, Angus and Christina Paxson. 1994. "Intertemporal Choice and Inequality," Journal of Political Economy 102:3 (June), 437-467. Devarajan, S., L. Squire and S. Suthiwart-Narueput. 1997. 
"Beyond Rate of Return: Reorienting Project Appraisal, The World Bank Research Observer 12(1):35-46. Dewey K, Cohen R, Berown K, Rivera L. 2001. "Effects of exclusive breastfeeding for four versus six months on maternal nutritional status and infant motor development: results of two randomized trials in Honduras." J Nutr 131 (2):262-267. Dewey KG, Domellof M, Cohen RJ, Landa RL, Hernell O, Lonnerdal B. 2002. "Iron supplementation affects growth and morbidity of breast-fed infants: results of a randomized trial in Sweden and Honduras." J Nutr 132 (11):3249-3255. Dickson R, Awasthi S, Williamson P, Demellweek C, Garner P. 2000. "Effect of Treatment for Intestinal Helminth Infection on Growth and Cognitive Performance in Children: Systematic Review of Randomized Trials." British Medical Journal; 320, 1697-1701. Doryan EA, Guatam KC, Foege WH. 2002. "The Political Challenge: commitment and cooperation." In: Young M, ed. From early child development to human development. Washington, DC: The World Bank: 375-391. 57 Dos Santos I, Gonclaves H, Halpern R, Victora C. 1999. Pilot test of the child development of the IMCI "counsel of the mother" module. Study results and recommendations. Pelotas, Brazil. Drewett R, Wolke D, Asefa M, Kaba M, Tessema F. 2001. "Malnutrition and mental development: is there a sensitive period? A nested case-control study." J Child Psychol Psychiatr 42 (2): 181-7. Duflo, Esther. 2001. "Schooling and Labor Market Consequences of School Construction in Indonesia: Evidence from an Unusual Policy Experiment," American Economic Review 91:4 (September), 795-813. Dunn A. 2005. What about very young children? The Hague, Nederlands: Bernard Van Leer Foundation. Engle PL. 2001. "Early child development: Should we have global indicators?" In: Kolsteren P, Hoeree T, Perez-Cueto A, eds. Promoting growth and development of under fives. Proceedings of the International Colloquium. ITG Press., Antwerp: 23- 39. Engle, P.L., Bentley, M., Pelto, G. 2000. 
The role of care in nutrition programmes: current research and a research agenda. Proceedings of the Nutrition Society 59: 25- 35. Engle, Patrice L., Maureen M. Black, Jere R. Behrman, Meena Cabral de Mello, Paul J. Gertler, Lydia Kapiriri, Reynaldo Martorell, Mary Eming Young, and the International Child Development Steering Committee. 2007. "Strategies to Avoid the Loss of Potential Among 240 Million Children in the Developing World," Lancet 369 (January 20,2007), 229-242. Ertem I, Atay G, Bingoler E, Dogan DG, Bayhan A, Sarica D. 2006. "Promoting child development at sick child visits: A controlled trial to test the effect of the intervention on the home environment of young children." Pediatrics; 118 (1):e124- 131. Espy KA, Molfese VJ, DiLalla LF. 2001. "Effects of environmental measures on intelligence in young children: growth curve modeling of longitudinal data." Merrill-Palmer Quart 47 (1): 42-73. Faber M, Kvalsvig JD, Lombard CJ, Benade AJ. 2005. "Effect of a fortified maize-meal porridge on anemia, micronutrient status, and motor development of infants." Am J Clin Nutr 82 (5):1032-1039. Feinstein L. 2003. "Inequality in the early cognitive development of children in the 1970 cohort." Economica 70 (277): 73-97. 58 Feldstein, Martin. 1995. "Tax Avoidance and the Deadweight Loss of the Income Tax," Cambridge, MA: National Bureau of Economic Research Working Paper No. 5055. Fernald, Lia, Abbie Raikes and Robin Dean. 2006. "Summary of Child Development Assessments and Application to Evaluations in the Developing World." Report prepared for the World Bank Development Group. The World Bank. Washington, D.C. Fitzgerald J, Gottschalk P, and Moffitt R. 1998a "An analysis of sample attrition in panel data," Journal of Human Resources 33(2): 251­299. Fitzgerald J, Gottschalk P, and Moffitt R. 1998b "The impact of attrition in the PSID on intergenerational analysis," Journal of Human Resources 33 (2): 300­344. Florencio C. 1988. 
Nutrition, health and other determinants of academic achievement and school-related behavior of grades one to six pupils. Quezan City, Philippines: University of the Philippines. Foster, Andrew and Mark R. Rosenzweig. 1995. "Learning by Doing and Learning from Others: Human Capital and Technical Change in Agriculture," Journal of Political Economy 103:6 (December),1176-1209. Foster, Andrew and Mark R. Rosenzweig. 2004. "Household Division and Rural Economic Growth," Review of Economic Studies (October). Foster, Andrew and Mark R. Rosenzweig. 2004. "Technological Change and the Distribution of Schooling: Evidence from Green-Revolution India," Journal of Development Economics, June 2004. Galiani, Sebastian, Paul Gertler and Ernesto Schargrodsky. 2005. "School Decentralization: Helping the Good Get Better, but Leaving the Rest Behind," Buenos Aires, Argentina: Universidad de San Andres, processed: Galiani, Sebastian, Paul Gertler and Ernesto Schargrodsky. 2005. "Water for Life: The Impact of the Privatization of Water Services on Child Mortality," Journal of Political Economy 113:1, 83-120 Garber, H. L. 1988. The Milwaukee Project: Preventing Mental Retardation in Children at Risk, Washington, DC: American Association on Mental Retardation. Gertler P, Fernald L. 2004. The medium term impact of oportunidades on child growth and development in rural areas of Mexico. Cuernavaca, Mexico: Instituto Nacional de Salud Publica. Ghuman, Sharon, Jere Behrman, Judith B. Borja, Socorro Gultiano and Elizabeth M. King. 2005. "Family Background, Service Providers, and Early Childhood 59 Development in the Philippines: Proxies and Interactions," Economic Development and Cultural Change 54:1 (October) 129-164. Ghuman, Sharon, Jere R. Behrman, Socorro Gultiano, Graeme Armecin, Isabelita Bas, Paulita Duazo, Elizabeth M. King, Nanette Lee. 2006a. "Early Childhood Development: A Review of Findings from a Longitudinal Study in the Philippines," East Asian Economic Papers. 
Ghuman, Sharon, Jere R. Behrman, Socorro Gultiano, and Elizabeth M. King. 2006b. "Children's Nutrition, School Quality, and Primary School Enrollment in the Philippines," New York, NY: Population Council, processed. Gillespie S, Allen L. 2002. "What works and what really works? A review of the efficacy and effectiveness of nutrition interventions." Public Health Nutrition 5 513-514. Glewwe, Paul. 2005. "The Impact of Child Health and Nutrition on Education in Developing Countries: Theory, Econometric Issues, and Recent Empirical Evidence," Food and Nutrition Bulletin 26:2 (supplement 2), S235-S250. Glewwe, Paul, and Hanan Jacoby. 1995. "An Economic Analysis of Delayed Primary School Enrollment and Childhood Malnutrition in a Low Income Country." Review of Economics and Statistics 77(1): 156-169. Glewwe P, Jacoby H. 2000. "Panel data." In: Grosh M, Glewwe P, eds. Designing household survey questionnaires for developing countries: lessons from 15 years of the living standards measurement study. New York: Oxford University Press. Glewwe, Paul and Hanan Jacoby. 2004. "Economic Growth and the Demand for Education: Is there a Wealth Effect?" Journal of Development Economics 74:1 (June), 33-51. Glewwe, Paul, Hanan Jacoby, and Elizabeth King. 2000. "Early Childhood Nutrition and Academic Achievement: A Longitudinal Analysis." Journal of Public Economics 81(3): 345-368. Glewwe, Paul and Elizabeth King. 2001. "The Impact of Early Childhood Nutrition Status on Cognitive Achievement: Does the Timing of Malnutrition Matter?" World Bank Economic Review 15(1): 81-114. Glewwe, Paul, Michael Kremer, Sylvie Moulin and Eric Zitzewitz, 2004, "Retrospective vs. Prospective Analyses of School Inputs: The Case of Flip Charts in Kenya." Journal of Development Economics 74: 251-268. Glewwe, Paul and Edward Miguel. 2007. "The Impact of Child Health and Nutrition on Education in Less Developed Countries," Handbook of Development Economics, Vol. 4. Amsterdam: North-Holland Publishing Co. 
60 Gorman KS, Pollitt E. 1996. "Does schooling buffer the effects of early risk?" Child Dev 67 (2): 314-26. Gould, Eric, Victor Lavy, and Daniele Paserman. 2004. "Immigrating to Opportunity: The Effect of School Quality Using a Natural Experiment on Ethiopians in Israel," Quarterly Journal of Economics 119(2):489-526. Grantham-McGregor, Sally, Yin Bun Cheung, Santiago Cueto, Paul Glewwe, Linda Richter, Barbara Strupp, and the International Child Development Steering Group. 2007. "Developmental Potential in the First 5 Years for Children in Developing Countries," Lancet 369 (January 6, 2007), 60-70.. Grantham-McGregor S. 2002. "Linear growth retardation and cognition." Lancet 359 (9306):542. Grantham-McGregor SM, Powell CA, Walker SP, Himes JH. 1991. "Nutritional supplementation, psychosocial stimulation, and mental development of stunted children: the Jamaican Study," Lancet 338 (8758):1-5. Grantham-McGregor, S., C. Walker, S. Chang, and C. Powell. 1997. "Effects of Early Childhood Supplementation with and without Stimulation on Later Development in Stunted Jamaican Children." American Journal of Clinical Nutrition 66: 247-253. Grantham-McGregor, Sally, Lia Fernald and Kavita Sethuraman. 1999a. "Effects Of Health And Nutrition On Cognitive and Behavioural Development in Children in the First Three Years of Life: Part 1: Low Birthweight, Breastfeeding and Protein- Energy Malnutrition." Food and Nutrition Bulletin 20: 53-75 Grantham-McGregor, Sally, Lia Fernald and Kavita Sethuraman. 1999b. "Effects of Health and Nutrition on Cognitive and Behavioural Development in Children in the First Three Years of Life: Part 2: Infections and Micronutrient Deficiencies: Iodine, Iron and Zinc." Food and Nutrition Bulletin 2: 76-99. Griffiths R. 1970. The abilities of young children: a comprehensive system of mental measurement for the first eight years of life. London, United Kingdom: Child Development Research Centre. Grimm, Michae. 2006. 
"Mortality and Survivors' Consumption," Paris: DIAL DT/2006- 13. Gunnar M. (in press). Stress, nurture and the young brain. Concepts of care: twenty brief papers that point the way toward quality care for infants and toddlers. San Francisco: WestEd. Gunnar MR. Morison SJ, Chisholm K, Schuder M. 2001. "Salivary cortisol levels in children adopted from Romanian orphanages." Dev Psychopathol 13 (3):611-628. 61 Haas J, Beard J, Murray-Kolb L, del Mundo A, Felix A, Gregorio G. 2005. "Iron- biofortified rice improves the iron stores of nonanemic Filipino women." J Nutr 135 (12):2823-30. Hall D. 1996. Health for all children: Report of the third joint working party on child health surveillance. 3rd ed Oxford: Oxford University Press. Hall, Robert E. And Charles I. Jones. 2007. "The Value of Life and the Rise in Health Spending," Quarterly Journal of Economics 122:1 (February), 39-72. Hamadani J, Huda S, Khatun F, Grantham-McGregor S. 2005. "Effects of psychosocial stimulation on development and behavior of malnourished children in Bangladesh." Royal Soc Trop Med Hyg 99: 947. Hamadani JD, Fuchs GJ, Osendarp SJ, Huda SN, Grantham-McGregor SM. 2002. "Zinc supplementation during pregnancy and effects on mental development and behaviour of infants: a follow-up study." Lancet 360 (9329): 290-4. Hamadani JD, Fuchs GJ, Osendarp SJ, Khatun F, Huda SN, Grantham-McGregor SM. 2001. "Randomized controlled trial of the effect of zinc supplementation on the mental development of Bangladeshi infants." Am J Clin Nutr 74 (3): 381-6. Hammer, Jeffrey S. 1996. "Economic Analysis for Health Projects," Policy Research Working Paper No. 1611, Washington, DC: The World Bank (May). Harberger, Arnold C. 1997. "New Frontiers in Project Evaluation? A Comment on Devarajan, Squire and Suthiwart-Narueput," The World Bank Research Observer 12(1):73-79. Hatt LE, Waters HR. 2006. 
"Determinants of child morbidity in Latin America: A pooled analysis of interactions between parental education and economic status." Soc Sci Med 62 (2): 375-86. Heaver R. 2005. Strengthening country commitment to human development: Lessons from nutrition. Washington DC: The World Bank. Heckman, James. 1974. "Shadow Prices, Market Wages, and Labor Supply," Econometrica 42 (July), 679-694. Heckman, James J. 1979. "Sample Selection Bias as a Specification Error," Econometrica 47, 153-161. Heckman JJ. 1997.Instrumental Variables: A Study of Implicit Behavioral Assumptions Used in Making Program Evaluations. Journal of Human Resources 32(3): 441­ 462. Heckman JJ. 2006. "Skill formation and the economics of investing in disadvantaged children," Science 312 1900-1902. 62 Heckman, James H. Ichimura and Petra Todd. 1998. ``Matching as an Econometric Evaluation Estimator,'' Review of Economic Studies 65:2 (April). Hiscock H, Wake M. 2002. "Randomised controlled trial of behavioural infant sleep intervention to improve infant sleep and maternal mood." BMJ 324 (7345):1062- 1065. Hoddinott, J. and B. Kinsey. 2001. "Child Growth in the Time of Drought." Oxford Bulletin of Economics and Statistics 63(4): 409-436. Holding PA, Kitsao-Wekulo PK. 2004. "Describing the burden of malaria on child development: What should we be measuring and how should we be measuring it?" Am J Trop Med Hyg 71 (2):71-79. Huber PJ. 1967. "The behavior of maximum likelihood estimates under non-standard conditions." In: Proceedings of the Fifth Berkeley Symposium in Mathematical Statistics and Probability, Vol. 1: pp. 221­233, Hutchinson SE, Powell CA, Walker SP, Chang SM, Grantham-McGregor SM. 1997. "Nutrition, anaemia, geohelminth infection and school achievement in rural Jamaican primary school children." Eur J Clin Nutr 51 (11): 729-35. Ivanovic DM, Perez HT, Olivares MD, Diaz NS, Leyton BD, Ivanovic RM. 2004. 
"Scholastic achievement: A multivariate analysis of nutritional, intellectual, socioeconomic, sociocultural, familial, and demographic variables in Chilean cshool-aged children." Nutrition 20: 878-889. Jacoby H. 2002. "Is there an intrahousehold flypaper effect? Evidence from a school feeding program." Econ J 112(476):196­221. Jamison DT. 1986. "Child malnutrition and school performance in China." J Dev Econ 20 (2): 299-309. Jaramillo A, Mingat A. 2003. Early childhood care and education in Sub-Saharan Africa: What would it take to meet the millennium development goals? Africa Region: The World Bank. Jaramillo A, Tietjen K. 2001. Early childhood development in Africa: Can we do more or less? A look at the impact and implications of preschools in Cape Verde and Guinea. Washington, DC: World Bank. Johnston FE, Low SM, de Baessa Y, Macvean RB. 1987. "Interaction of nutritional and socioeconomic status as determinants of cognitive development in disadvantaged urban Guatemalan children." Am J Phys Anthropol 73 (4): 501-6. Jukes M. 2005. "The long-term impact of preschool health and nutrition on education." Food Nutr Bull. Jun 26 (2 Suppl 2):S193-201. 63 Kagitcibasi C, Sunar D, Bekman S. 2001. "Long-term effects of early intervention: Turkish low-income mothers and children." J Appl Dev Psychol 22 (4):333-361. Kariger PK, Stoltzfus RJ, Olney D, Sazawal S, Black R, Tielsch JM et al. 2005. "Iron deficiency and physical growth predict attainment of walking but not crawling in poorly nourished Zanzibari infants." J Nutr 135 (4): 814-9. Karoly L, Greenwood P, Everingham S, et al. 1998. Investing in our children: What we know and don't know about the costs and benefits of early childhood interventions. Santa Monica, CA: The Rand Corporation. Karoly L, Kilburn M, Canon. 2005. Early childhood interventions. Santa Monica, CA: Rand Corporation. Kielmann, Arnfried A., and Associates. 1983. 
Child and Maternal Health Services in Rural India: The Narangwal Experiment, Baltimore, MD: The Johns Hopkins University Press. Kirksey A. 1994. "Relation of maternal zinc nutrition to pregnancy outcome and early infant development in an Egyptian village." Am J Clin Nutr 60: 782-792. Knowles, James C. and Jere R. Behrman. 2005. "Economic Returns to Investing in Youth," in Jere R. Behrman, Barney Cohen, Cynthia Lloyd and Nelly Stromquist, eds,The Transition to Adulthood in Developing Countries: Selected Studies, Washington, DC: National Academy of Science-National Research Council, 424- 490. Kohler, Hans-Peter, Jere R. Behrman and Susan C. Watkins. 2007. "Social Networks and HIV/AIDS Risk Perceptions," Demography 44:1 (February), 1-33. Kohli T. 1990. "Impact of home-centre based training programme in reducing developmental deficiencies of disadvantaged children." Indian J Disab & Rehab 4 (2):65-74. Kremer, Michael. 2003. "Randomized Evaluations of Educational Programs in Developing Countries: Some Lessons", American Economic Review: Papers and Proceedings, 93 (2), 102-106. Kuklina EV, Ramakrishnan U, Stein AD, Barnhart HH, Martorell R. 2004. "Growth and diet quality are associated with the attainment of walking in rural Guatemalan infants." J Nutr 134 (12): 3296-300. Kvalsig JD, Cooppan RM, Connolly KJ. 1991. "The effects of parasite infections on cognitive processes in children." Ann Trop Med Parasitol 73:501­6. Landrigan P. 2004. "Children as a vulnerable population." Int J Occup Med Environ Health 17 (1):175-7. 64 Landry SH, Smith KE, Swank PR. 2006. "Responsive parenting: Establishing early foundations for social, communication, and independent problem-solving skills." Developmental Psychology 42 (4):627-642. Lanphear B, Hornung R, Khoury J, et al. 2005. "Low-level environmental lead exposure and children's intellectual function: an international pooled analysis." Environ Health Perspect 113 (7):894-9. 
Lasky RE, Klein RE, Yarbrough C, Engle PL, Lechtig A, Martorell R. 1981. "The relationship between physical growth and infant behavioral development in rural Guatemala." Child Dev 52 (1): 219-26.
Lee D. 2002. "Trimming for Bounds on Treatment Effects with Missing Outcomes," Working Paper #51, Center for Labor Economics, University of California, Berkeley.
Li H, Barnhart HX, Stein AD, Martorell R. 2003. "Effects of early childhood supplementation on the educational achievement of women." Pediatr 112 (5):1156-1162.
Liddell C, Rae G. 2001. "Predicting early grade retention: A longitudinal investigation of primary school progress in a sample of rural South African children." Br J Educ Psychol 71: 413-28.
Lien N, Meyer K, Winick M. 1977. "Early malnutrition and 'late' adoption: a study of their effects on the development of Korean orphans adopted into American families." Am J Clin Nutr 30:1734-1739.
Lima MC, Eickmann SH, Lima ACV, Guerra MQ, Lira PIC, Huttly SRA et al. 2004. "Determinants of mental and motor development at 12 months in a low income population: a cohort study in northeast Brazil." Acta Paediatr 93 (7): 969-75.
Liu D, Diorio J, Day J, Francis D, Meaney M. 2000. "Maternal care, hippocampal synaptogenesis and cognitive development in rats." Nat Neurosci 3: 799-806.
Lomborg, Bjorn, ed. 2004. Global Crises, Global Solutions, Cambridge, UK: Cambridge University Press.
Love JM, Kisker EE, Ross C, et al. 2005. "The effectiveness of early head start for 3-year-old children and their parents: lessons for policy and programs." Dev Psychol 41 (6):885-901.
Lozoff B, Jimenez E, Hagen J, Mollen E, Wolf AW. 2000. "Poorer behavioral and developmental outcome more than 10 years after treatment for iron deficiency in infancy." Pediatr 105 (4):E51.
Lozoff B, Teal SJL. 2004. "Longitudinal analysis of cognitive and motor effects of iron deficiency in infancy." Pediatr Res 55: 23A.
Maccini, Sharon and Dean Yang. 2006. "Under the Weather: Health, Schooling, and Socioeconomic Consequences of Early-Life Rainfall," Ann Arbor, MI: University of Michigan, processed.
MacMillan HL, Thomas BH, Jamieson E, et al. 2005. "Effectiveness of home visitation by public-health nurses in prevention of the recurrence of child physical abuse and neglect: a randomised controlled trial." Lancet 365 (9473):1786-1793.
Magnuson K, Waldfogel J. 2005. "Early childhood care and education: effects on ethnic and racial gaps in school readiness." Future Child 16 (1):170-196.
Majumdar I, Paul P, Talib V, Ranga S. 2003. "The effect of iron therapy on the growth of iron-replete and iron-deplete children." J Trop Pediatr 49 (2):84-88.
Malekpour M. 2004. "Low birth-weight infants and the importance of early intervention: enhancing mother-infant interactions. A literature review." Br J Dev Disab 50 (2):78-88.
Maluccio, John A., John Hoddinott, Jere R. Behrman, Agnes Quisumbing, Reynaldo Martorell and Aryeh D. Stein. 2007. "The Impact of Nutrition During Early Childhood on Education among Guatemalan Adults," Philadelphia-Washington-Atlanta: University of Pennsylvania, IFPRI, Emory, processed.
Manary MJ, Krebs NF, Gibson RS, Broadhead RL, Hambidge KM. 2002. "Community-based dietary phytate reduction and its effect on iron status in Malawian children." Ann Trop Paediatr 22 (2):133-136.
Manski C. 1995. Causal Inference in the Social Science. Harvard University Press, Cambridge.
Marshall N. 2004. "The quality of early child care and children's development." Curr Dir Psych Sci 13 (4):165-168.
Martorell, Reynaldo, Jere R. Behrman, Rafael Flores, and Aryeh D. Stein. 2005. "Rationale for a Follow-up Focusing on Economic Productivity," Food and Nutrition Bulletin 26:2 (Supplement 1), S5-S14.
Martorell R, Habicht JP, Rivera JA. 1995. "History and Design of the INCAP Longitudinal Study (1969-1977) and its Follow-up (1988-89)." Journal of Nutrition 125: 1027S-1041S.
Martorell R, Rivera J, Kaplowitz J, Pollitt E. 1992. "Long term consequences of growth retardation during early childhood." In: Hernandez M, Argenta J, editors. Human growth: basic and clinical aspects. Amsterdam: Elsevier; 143-9.
Martorell R, Schroeder DG, Rivera JA, Kaplowitz HJ. 1995. "Patterns of linear growth in rural Guatemalan adolescents and children." J Nutr 125 (4): S1060-S1067.
McCormick MC, Brooks-Gunn J, Buka SL, et al. 2006. "Early intervention in low birth weight premature infants: results at 18 years of age for the infant health and development program." Pediatr 117 (3):771-780.
McKay H, Sinisterra L, McKay A, Gomez H, Lloreda P. 1978. "Improving cognitive ability in chronically deprived children." Science 200 (4339):270-278.
Meaney M, Mitchell J, Aitken D, Bhatnagar S, Bodnoff S, Iny L et al. 1991. "The effects of neonatal handling on the development of the adrenocortical response to stress: implications for neuropathology and cognitive deficits in later life." Psychoneuroendocrinol 16: 85-103.
Meaney M. 2001. "Maternal care, gene expression, and the transmission of individual differences in stress reactivity across generations." Ann Rev Neurosci 24: 1161-92.
Mehansho H. 2006. "Iron fortification technology development: new approaches." J Nutr 136 (4):1059-1063.
Mendez, Michelle A. and Linda S. Adair. 1999. "Severity and Timing of Stunting in the First Two Years of Life Affect Performance on Cognitive Tests in Late Childhood," Journal of Nutrition 129, 1555-1562.
Miguel, Edward and Michael Kremer. 2004. "Worms: Identifying Impacts on Education and Health in the Presence of Treatment Externalities," Econometrica, 72 (1), 159-217.
MINED, UNICEF, CELEP. Educate your child: The Cuban experience in integrated early child care. http://www.unicef.org/earlychildhood/index_resources.html, Cuba: UNICEF.
Minkovitz CS, Hughart N, Strobino D, et al. 2003. "A practice-based intervention to enhance quality of care in the first 3 years of life: the healthy steps for young children program."
JAMA 290 (23):3081-3091.
Mistry RS, Biesanz JC, Taylor LC, Burchinal M, Cox MJ. 2004. "Family income and its relation to preschool children's adjustment for families in the NICHD study of early child care." Dev Psychol 40 (5): 727-45.
Moffitt R (ed.). 1998. Special Issue: "Attrition in Longitudinal Surveys." Journal of Human Resources 33(2).
Monckeberg F. 1972. "Malnutrition and mental capacity." In: Pan American Health Organization, editor. Nutrition, the nervous system and behaviour, Scientific Publication No 251. Washington D.C.: PAHO: 48-54.
Montie J. 2005. "The IEA Preprimary project age seven follow-up: Characteristics of early childhood education programs and children's development." Highscope ReSource 24 (1):11-12.
Morenza L, Arrazola O, Seleme I, Martinez F. 2005. Evaluacion Proyecto Kallpa Wawa. http://www.unicef.org/earlychildhood/index_resources.html, Santa Cruz, Bolivia: 1-130.
Morgan R, Garavan H, Smith E, Driscoll L, Levitsky D, Strupp B. 2001. "Early lead exposure produces lasting changes in sustained attention, response initiation, and reactivity to errors." Neurotoxicol Teratol 23: 519-31.
Morris SS, Olinto P, Flores R, Nilson EA, Figueiro AC. 2004. "Conditional cash transfers are associated with a small reduction in the rate of weight gain of preschool children in northeast Brazil." J Nutr (Sept) 134(9):2336-41.
Moulton, Brent. 1986. "Random Group Effects and the Precision of Regression Estimates," Journal of Econometrics 32: 385-97.
Moulton BR. 1990. "An illustration of a pitfall in estimating the effects of aggregate variables on micro units." Review of Economics and Statistics: 334-338.
Murray L, & Cooper P J. Effects of postnatal depression on infant development. Archives
Mustard J. 1999. The early years study: reversing the brain drain. Toronto, Ont, Canada: Ontario Children's Secretariat. 1-107.
Myers R. 1992. The twelve who survive. London and New York: Routledge.
Nair MK, Radhakrishnan SR. 2004. "Early childhood development in deprived urban settlements." Indian Pediatr 41 (3):227-237.
National Research Council and Institute of Medicine, Panel on Transitions to Adulthood in Developing Countries, Cynthia B. Lloyd (ed.). 2005. Growing Up Global: The Changing Transitions to Adulthood in Developing Countries, Washington, DC: National Academies Press.
NICHD Early Child Care Research Network. 2002. "Early child care and children's development prior to school entry: Results from the NICHD study of early child care." Am Ed Res J 39: 133-164.
NICHD Early Child Care Research Network. 2001. "Before Head Start: Income and ethnicity, family characteristics, child care experiences and child development." Early Educ Dev 12: 545-76.
NIPCCD. 1992. National evaluation of ICDS. New Delhi.
Noble KG, Tottenham N, Casey BJ. 2005. "Neuroscience perspectives on disparities in school readiness and cognitive achievement." Future Child 15 (1):71-89.
Nokes C, Grantham-McGregor S, Sawyer A, Cooper E, Bundy D. 1992. "Parasitic helminth infection and cognitive function in school children." Proceedings: Biological Sciences, 247(1319):77-81.
Nokes C, van den Bosch C, Bundy D. 1998. The Effects of Iron Deficiency and Anemia on Mental and Motor Performance, Educational Achievement, and Behavior in Children: A Report of the International Nutritional Anemia Consultative Group. USAID: Washington DC.
Olds D, Henderson CR, Jr., Cole R, et al. 1998. "Long-term effects of nurse home visitation on children's criminal and antisocial behavior: 15-year follow-up of a randomized controlled trial." JAMA 280 (14):1238-1244.
Ozmert EN, Yurdakok K, Soysal S, Kulak-Kayikci ME, Belgin E, Ozmert E et al. 2005. "Relationship between physical, environmental and sociodemographic factors and school performance in primary schoolchildren." J Trop Pediatr 51 (1): 25-32.
Panthuraamphorn C, Dookchitra D, Sanmaneechai M. 1998. "Environmental influences on human brain growth and development." J Prenat & Perinat Psychol & Health 12 (3):163-174.
Parker S, Nelson C. 2005. "An event-related potential study of the impact of institutional rearing on face recognition." Dev Psychopathol 17: 621-639.
Parker, Susan W., Jere R. Behrman and Petra E. Todd. 2005. "Medium-Term Effects on Education, Work, Marriage and Migration in Rural Areas," Philadelphia, PA (Technical Document Number 1 on the Evaluation of Oportunidades 2004 conducted by INSP).
Parker, Susan W., Petra E. Todd, and Kenneth I. Wolpin. 2006. "Within-Family Treatment Effect Estimators: the Impact of Oportunidades on Schooling in Mexico," Philadelphia: University of Pennsylvania.
Partnership for Child Development. 2001. "An association between chronic undernutrition and educational test scores in Vietnamese children." Eur J Clin Nutr 55 (9): 801-4.
Paxson, Christina H. 1992. "Using Weather Variability to Estimate the Response of Savings to Transitory Income in Thailand," American Economic Review 82:1 (March), 15-33.
Paxson C, Schady N. 2005. "Cognitive development among young children in Ecuador: the roles of wealth, health and parenting." World Bank Policy Research Working Paper 3605.
Paxson, Christina and Norbert R. Schady. 2007. "Cognitive Development Among Young Children in Ecuador: The Roles of Wealth, Health and Parenting," Journal of Human Resources 42:1 (Winter), 49-84.
Pelto G, Dickin K, Engle P. 1999. "A critical link: Interventions for physical growth and psychological development." Geneva: Dept of Child and Adolescent Health and Development, WHO.
Phatak P. 2000. "Early stimulation of infants with risks for development." Psychol Studies 45 (1-2):83-86.
Phillips, Margaret and Tina G. Sanghvi. 1996. "The Economic Analysis of Nutrition Projects: Guiding Principles and Examples," Washington, DC: The World Bank.
Pianta RC, McCoy SJ. 1997. "The first day of school: the predictive validity of early school screening." J Appl Dev Psychol 18 (1): 1-22.
Pitt, Mark M., Mark R. Rosenzweig, and Donna M. Gibbons. 1993. "The Determinants and Consequences of the Placement of Government Programs in Indonesia," The World Bank Economic Review 7:3 (September), 319-348.
Pitt, Mark M., Mark R. Rosenzweig, and M.N. Hassan. 1990. "Productivity, Health and Inequality in the Intrahousehold Distribution of Food in Low-Income Countries," American Economic Review 80:5 (December), 1139-1156.
Plug, Erik. 2004. "Estimating the Effect of Mother's Schooling on Children's Schooling Using a Sample of Adoptees," American Economic Review 94:1 (March), 358-368.
Pollitt E, Escamilla R. 1996. Developmental timing as moderator in the rehabilitation of malnourished children: The Cali Study. http://www.unicef.org/earlychildhood/index_resources.html.
Pollitt, E., K. S. Gorman, P. Engle, R. Martorell, and J. A. Rivera. 1993. "Early supplementary feeding and cognition: Effects over two decades." Monographs of the Society for Research in Child Development, Serial No. 235, 58 (7), 1-99.
Pollitt E, Hathirat P, Kotchabhakadi N, Missel L, Valyasevi A. 1989. "Iron deficiency and education achievement in Thailand." Am J Clin Nutr 50 (3):687-97.
Powell C. 2004. "An evaluation of the roving caregivers programme of the rural family support organization." http://www.unicef.org/earlychildhood/index_resources.html, May Pen, Clarendon, Jamaica: UNICEF.
Powell C, Baker-Henningham H, Walker S, Gernay J, Grantham-McGregor S. 2004. "Feasibility of integrating early stimulation into primary care for undernourished Jamaican children: cluster randomised controlled trial." BMJ 329 (7457):89.
Powell C, Grantham-McGregor S. 1989. "Home visiting of varying frequency and child development." Pediatr 84 (1):157-164.
Powell CA, Grantham-McGregor S. 1985. "The ecology of nutritional status and development in young children in Kingston, Jamaica." Am J Clin Nutr 41 (6): 1322-31.
Psacharopoulos G. 1989. "The determinants of early age human capital formation: evidence from Brazil." Econ Dev Cult Change 37 (4): 683-708.
Quisumbing, Agnes R, ed.
2003. Household Decisions, Gender and Development: A Synthesis of Recent Research, Washington, DC: International Food Policy Research Institute (Distributed by The Johns Hopkins University Press).
Rahman A, Iqbal Z, Harrington R. 2003. "Life events, social support and depression in childbirth: perspectives from a rural community in the developing world." Psychol Med (33):1161-1167.
Ramey, Craig T., Frances A. Campbell and Clancy Blair. 1998. "Enhancing the Life Course for High-Risk Children." Social Programs that Work, ed. Jonathon Crane, Russell Sage Foundation: New York, 184-199.
Rao N, Sharma A. 2002. "Early childhood services in India: Challenges and prospects for their development." In: Chan LKS, Mellor E, eds. International developments in early childhood services. New York: Peter Lang, 97-117.
Rao N, Sharma A. 2004. "The impact of the integrated child development services program on the development of young children in India." http://www.unicef.org/earlychildhood/index_resources.html.
Rao N. 2005. "Children's rights to survival, development, and early education in India: The critical role of the integrated child development services programme." International Journal of Early Childhood 37 (3):15-31.
Richter L, Griesel R, De Wet T. 1998. "Growing up and going to school: Birth to Ten children's cognitive and social competence at school entry." In: Richter L, editor. A view of school: preparation for and adjustment to school under rapidly changing social conditions. Johannesburg: The Goethe Institute.
Richter LM, Yach D, Cameron N, Griesel RD, De Wet T. 1995. "Enrolment into Birth to Ten (BTT): population and sample characteristics." Paediatr Perinat Epidemiol 9 (1): 109-20.
Rodier P. 2004. "Environmental causes of central nervous system maldevelopment." Pediatrics 113 (4): 1076-83.
Rogan WJ, Dietrich KN, Ware JH, et al. 2001. "The effect of chelation therapy with succimer on neuropsychological development in children exposed to lead."
N Engl J Med 344 (19):1421-1426.
Rosenbaum, Paul R. and Donald B. Rubin. 1983. "The central role of the propensity score in observational studies for causal effects." Biometrika 70:1, 41-55.
Rosenzweig, Mark R. 2003. "Payoffs from Panels in Low-Income Countries: Economic Development and Economic Mobility," American Economic Review, Papers and Proceedings, May.
Rosenzweig, Mark R. and Kenneth I. Wolpin. 1986. "Evaluating the Effects of Optimally Distributed Public Programs," American Economic Review 76:3 (June), 470-487.
Rosenzweig MR, Wolpin KI. 1994. "Are there increasing returns to the intergenerational production of human capital - maternal schooling and child intellectual achievement." J Human Res 29 (2): 670-93.
Rosenzweig, Mark R. and Kenneth I. Wolpin. 1995. "Sisters, Siblings and Mothers: The Effects of Teen-Age Childbearing on Birth Outcomes," Econometrica 63:2 (March), 303-326.
Rosenzweig, Mark R. and Kenneth I. Wolpin. 2000. "Natural 'Natural Experiments' in Economics," Journal of Economic Literature 38 (December), 827-874.
Rubalcava, Luis N. and Graciela M. Teruel. 2004. "The role of maternal cognitive ability on child health," Economics & Human Biology 2:3 (December), 439-455.
Rutter M, O'Connor T, English and Romanian Adoptees (ERA) Study Team. 2004. "Are there biological programming effects for psychological development? Findings from a study of Romanian adoptees." Dev Psychol 40 (1): 81-94.
Rutter M. 1989. "Pathways from childhood to adult life." J Child Psychol Psychiatr 30 (1): 23-51.
Save the Children. 2003. "What's the difference? The impact of early childhood development programmes. A study from Nepal of the effects for children, their families and communities." http://www.unicef.org/earlychildhood/index_resources.html, Kathmandu, Nepal.
Save the Children. 2004. "Early childhood care and development: A positive impact, Myanmar." http://www.unicef.org/earlychildhood/index_resources.html: Save the Children Myanmar Field Office.
Sazawal S, Black RE, Ramsan M, et al. 2006. "Effects of routine prophylactic supplementation with iron and folic acid on admission to hospital and mortality in preschool children in a high malaria transmission setting: community-based, randomised, placebo-controlled trial." Lancet 367 (9505):133-143.
Schluter, Christian and Jackline Wahba. 2005. "Are parents altruistic? Evidence from Mexico," University of Southampton.
Schroeder D, Martorell R, Rivera J, Ruel M, Habicht J. 1995. "Age differences in the impact of nutritional supplementation on growth." J Nutr 125 (4(Suppl)):1051S-1059S.
Schultz, T. Paul. 2004. "School Subsidies for the Poor: Evaluating the Mexican Progresa Poverty Program," Journal of Development Economics 74:2 (June), 199-250.
Schweinhart, Lawrence J. and David P. Weikart. 1998. "High/Scope Perry Preschool Program Effects at Age Twenty-Seven," Social Programs that Work, ed. Jonathon Crane, Russell Sage Foundation: New York, 148-162.
Seshadri S, Gopaldas T. 1989. "Impact of iron supplementation on cognitive functions in preschool and school-aged children: the Indian experience." Am J Clin Nutr 50(3): 675-86.
Shariff ZM, Bond JT, Johnson NE. 2000. "Nutrition and educational achievement of urban primary schoolchildren in Malaysia." Asia Pacific J Clin Nutr 9 (4): 264-73.
Shrimpton R, Victora CG, de Onis M, Lima RC, Blossner M, Clugston G. 2001. "Worldwide timing of growth faltering: Implications for nutritional interventions." Pediatrics 107 (5): e75.
Siegel EH, Stoltzfus RJ, Kariger PK, Katz J, Khatry SK, LeClerq SC et al. 2005. "Growth indices, anemia, and diet independently predict motor milestone acquisition of infants in South Central Nepal." J Nutr 135 (12): 2840-4.
Sigman M, McDonald MA, Neumann C, Bwibo N. 1991. "Prediction of cognitive competence in Kenyan children from toddler nutrition, family characteristics and abilities." J Child Psychol Psychiat 32 (2): 307-20.
Sigman M, Neumann C, Baksh M, Bwibo N, McDonald MA. 1989. "Relationship between nutrition and development in Kenyan toddlers." J Pediatr 115 (3): 357-64.
Sigman M, Neumann C, Jansen AA, Bwibo N. 1989. "Cognitive abilities of Kenyan children in relation to nutrition, family characteristics, and education." Child Dev 60 (6): 1463-74.
Soemantri AG, Pollitt E, Kim I. 1989. "Iron deficiency anemia and education achievement." Am J Clin Nutr 50(3):698-702.
Soewondo S, Husaini M, Pollitt E. 1989. "Effects of iron deficiency on attention and learning processes of preschool children: Bandung, Indonesia." Am J Clin Nutr 50(3):667-74.
Steegmann AT, Datar FA, Steegmann RM. 1992. "Physical size, school performance and visual-motor maturity in the Philippines." Am J Human Biol 4 (2): 247-52.
Stein, Aryeh D., Jere Behrman, Ann DiGirolamo, Ruben Grajeda, Reynaldo Martorell, Agnes Quisumbing, and Usha Ramakrishnan. 2005. "Schooling, Educational Achievement and Cognitive Functioning Among Young Guatemalan Adults," Food and Nutrition Bulletin 26:2 (Supplement 1), S46-S54.
Stock, James H. and Motohiro Yogo. 2002. "Testing for Weak Instruments in Linear IV Regression," Cambridge, MA: NBER Technical Working Paper 284.
Summers, Lawrence H. 1992. "Investing in All the People," Pakistan Development Review 31:4, 367-406.
Summers, Lawrence H. 1994. "Investing in All the People: Educating Women in Developing Countries," Washington, DC: World Bank, Economic Development Institute Seminar Paper No. 45.
Super CM, Herrera MG, Mora JO. 1990. "Long-term effects of food supplementation and psychosocial intervention on the physical growth of Colombian infants at risk of malnutrition." Child Dev 61 (1):29-49.
Sweet MA, Appelbaum MI. 2004. "Is home visiting an effective strategy? A meta-analytic review of home visiting programs for families with young children." Child Dev 75 (5):1435-1456.
Tchibindat F, Martin-Prevel Y, Kolsteren P, Maire B, Delpeuch F. 2004. "Bringing together viewpoints of mothers and health workers to enhance monitoring and promotion of growth and development of children: a case study from the Republic of Congo." J Health Popul Nutr 22 (1):59-67.
The Lancet. 2003. "The world's forgotten children." Lancet 361: 1.
Thomas, Duncan. 1990. "Intrahousehold Resource Allocation: An Inferential Approach," Journal of Human Resources 25:4 (Fall), 635-64.
Thomas, Duncan. 1993. "The Distribution of Income and Expenditure within the Household," Annales d'Economie et de Statistique 29, 109-136.
Thomas, Duncan. 1994. "Like Father, Like Son; Like Mother, Like Daughter: Parental Resources and Child Height," Journal of Human Resources 29:4 (Fall), 950-989.
Thomas, Duncan, Elizabeth Frankenberg, Jed Friedman, Jean-Pierre Habicht, Mohammed Hakimi, Jaswadi, Nathan Jones, Gretel Pelto, Bondan Sikoki, Teresa Seeman, James P. Smith, Cecep Sumantri, Wayan Suriastini, and Siswanto Wilopo. 2003. "Iron Deficiency and the Well-Being of Older Adults: Early Results from a Randomized Nutrition Intervention," Los Angeles, CA: UCLA.
Thomas D, Frankenberg E, Smith J. 2002. "Lost but Not Forgotten: Attrition and Follow-up in the Indonesia Family Life Survey." Journal of Human Resources 36: 556-592.
Thomas, Duncan, Victor Lavy, and John Strauss. 1996. "Public Policy and Anthropometric Outcomes in the Cote d'Ivoire," Journal of Public Economics 61, 155-192.
Thomas, Duncan, Robert F. Schoeni, and John Strauss. 1996. "Parental Investments in Schooling: The Roles of Gender and Resources in Urban Brazil," Santa Monica, CA: RAND, mimeo.
Thomas, Duncan and John Strauss. 1992. "Prices, Infrastructure, Household Characteristics and Child Height," Journal of Development Economics 39:2 (October), 301-332.
Thomas, Duncan, John Strauss, and Maria Helena Henriques. 1990. "Child Survival, Height for Age and Household Characteristics in Brazil," Journal of Development Economics 33:2 (October), 197-234.
Thomas, Duncan, John Strauss, and Maria Helena Henriques. 1991. "How Does Mother's Education Affect Child Height?" Journal of Human Resources 26:2 (Spring), 183-211.
Thompson RA, Nelson CA. 2001. "Developmental science and the media: early brain development." Am Psychol 56 (1): 5-15.
Thorndike RL, Hagen E, Sattler J. 1986. Stanford-Binet Intelligence Scale. 4th ed. Itasca: Riverside Publishing.
Todd, Petra E. and Kenneth I. Wolpin. 2003. "On the Specification and Estimation of the Production Function for Cognitive Achievement," Economic Journal 113 (February), F3-F33.
Todd, Petra E. and Kenneth I. Wolpin. 2004. "The Production of Cognitive Achievement in Children: Home, School and Racial Test Score Gaps," Philadelphia, PA: University of Pennsylvania, PIER Working Paper 04-019.
Todd, Petra and Kenneth I. Wolpin. 2007. "Using a Social Experiment to Validate a Dynamic Behavioral Model of Child Schooling and Fertility: Assessing the Impact of a School Subsidy Program in Mexico," American Economic Review (forthcoming).
UNICEF. 2006. 2005 Global Thematic Report: Integrated Early Child Development. http://www.unicef.org/earlychildhood/index_resources.html: UNICEF.
UNICEF, UNAIDS, USAID. 2004. "Children on the brink." Washington, DC: USAID.
UNICEF. 2005. State of the world's children 2006. New York: UNICEF.
Van der Gaag J. and J.P. Tan. 1997. "The benefits of early child development programs, an economic analysis," Human Development Network, Washington DC: The World Bank.
Vazir S, Kashinath K. 1999. "Influence of the ICDS on psychosocial development of rural children in Southern India." J Indian Acad Appl Psychol 25 (1-2):11-24.
Vazir S, Naidu AN, Vidyasagar P. 1998. "Nutritional status, psychosocial development and the home environment of Indian rural children." Indian Pediatr 35 (110): 959-66.
Victora CG, Barros FC, Lima RC, Behague DP, Goncalves H, Horta BL et al. 2003. "The Pelotas birth cohort study, Rio Grande do Sul, Brazil, 1982-2001."
Cad Saude Publica, Rio de Janeiro: 1241-1256.
Victora, Cesar G., Jean-Pierre Habicht, and Jennifer Bryce. 2004. "Evidence-Based Public Health: Moving Beyond Randomized Trials," American Journal of Public Health 94:3 (March), 400-405.
Victora CG, Wagstaff A, Schellenberg JA, Gwatkin D, Claeson M, Habicht JP. 2003. "Applying an equity lens to child health and mortality: more of the same is not enough." Lancet 362 (9379): 233-41.
Victora M, Victora C, Barros F. 1990. "Cross-cultural differences in development rates: A comparison between British and Brazilian children." Child: Care, Health and Development 16: 151-164.
Victora MD, Victora CG, Barros FC. 1990. "Cross-cultural differences in developmental rates: a comparison between British and Brazilian children." Child Care Health Dev 16: 151-64.
Victora, Cesar G. and Fernando C. Barros. 2005. "Cohort Profile: The 1982 Pelotas (Brazil) Birth Cohort Study," International Journal of Epidemiology.
Victora, Magda Daminani, Cesar Gomes Victora and Fernando C. Barros. 1990. "Cross-Cultural Differences in Development Rates: A Comparison between British and Brazilian Children," Child: Care, Health and Development 16, 151-164.
Waber DP, Vuori-Christiansen L, Ortiz N, et al. 1981. "Nutritional supplementation, maternal education, and cognitive development of infants at risk of malnutrition." Am J Clin Nutr 34 (Suppl 4):807-813.
Wachs TD. 2000. "Necessary but not sufficient: the respective roles of single and multiple influences on individual development." Washington D.C.: American Psychological Association.
Walker, Susan P., Theodore D Wachs, Julie Meeks Gardner, Betsy Lozoff, Gail A Wasserman, Ernesto Pollitt, Julie A. Carter and the International Child Development Steering Committee. 2007. "Child Development: Risk Factors for Adverse Outcomes in Developing Countries," Lancet 369 (January 13), 145-157.
Walker SP, Chang SM, Powell CA, Grantham-McGregor SM. 2005. "Effects of early childhood psychosocial stimulation and nutritional supplementation on cognition and education in growth-stunted Jamaican children: prospective cohort study." Lancet 366 (9499): 1804-7.
Walker SP, Grantham-McGregor SM. 1990. "Growth and development of West Indian children. Part 2: Development." West Indian Med J 39 (1): 12-9.
Watanabe K, Flores R, Fujiwara J, Tran LTH. 2005. "Early childhood development interventions and cognitive development of young children in rural Vietnam." J Nutr 135 (8):1918-1925.
Watkins, Susan Cotts. 2004. "Navigating the AIDS Epidemic in Rural Malawi," Population and Development Review 30:4 (December).
Watkins, Susan Cotts, Eliya Zulu, Hans-Peter Kohler and Jere R. Behrman. 2003. "Introduction to Social Interactions and HIV/AIDS in Rural Africa," in Susan Cotts Watkins, Eliya Zulu, Hans-Peter Kohler and Jere R. Behrman, eds., "Social Interactions and HIV/AIDS in Rural Africa," Special Collection of Articles, Demographic Research, September.
Webb KE, Horton NJ, Katz DL. 2005. "Parental IQ and cognitive development of malnourished Indonesian children." Euro J Clin Nutr 59 (4): 618-20.
Webb S, Monk C, Nelson C. 2001. "Mechanisms of postnatal neurobiological development: implications for human development." Dev Neuropsychol 19 (2): 147-71.
White H. 1980. "A heteroscedasticity-consistent covariance matrix and a direct test for heteroscedasticity." Econometrica 48 (4): 817-838.
WHO. 2006. Global database on child growth and malnutrition. World Health Organization. Available from: http://www.who.int/nutgrowthdb/database/en/
Wilson MA, Johnston MV, Goldstein GW, Blue ME. 2000. "Neonatal lead exposure impairs development of rodent barrel field cortex." PNAS 97 (10): 5540-5.
Winick M, Meyer K, Harris R. 1975. "Malnutrition and environmental enrichment by early adoption." Science 190 (4220):1173-1175.
Wolf A, De Andraca I, Lozoff B. 2002. "Maternal depression in three Latin American samples."
Soc Psych Psychiatr Epidemiol 37 (4):169-176.
Wolpin, Kenneth I. 1982. "A New Test of the Permanent Income Hypothesis: The Impact of Weather on the Income and Consumption of Farm Households in India," International Economic Review 23, 583-594.
Wooldridge JM. 2003. "Cluster-sample methods in applied econometrics." American Economic Review 93 (2): 133-138.
Wooldridge, Jeffrey M. 2002. Econometric Analysis of Cross Section and Panel Data, Cambridge, MA: MIT Press.
Young Lives. 2002. "Justification of the content of the 7.5-8.5 year old questionnaire. Young Lives: An International Study of Childhood Poverty," available from: http://www.younglives.org.uk/
Young M. 2002. "From early child development to human development." World Bank Publications.
Zlotkin S, Arthur P, Antwi K, Yeung G. 2002. "Treatment of anemia with microencapsulated ferrous fumarate plus ascorbic acid supplied as sprinkles to complementary (weaning) foods." Am J Clin Nutr 76 (3):691-693.
Zlotkin SH, Christofides AL, Hyder SM, Schauer CS, Tondeur MC, Sharieff W. 2004. "Controlling iron deficiency anemia through the use of home-fortified complementary foods." Indian J Pediatr 71 (11):1015-1019.

Appendix

Table A.1 Comparison of Characteristics and Evaluation Results of Some Major U.S. Child Care Intervention Programs

| Program characteristics | Perry Preschool Program (a) | Head Start (b) | Abecedarian Project (c) | Milwaukee Project (d) | HIPPY program (e) |
| --- | --- | --- | --- | --- | --- |
| Target group | 3-4 year olds | 3-5 year olds | Children aged 6 weeks-5 years; staggered ages of entry | Children aged 6 weeks-5 years; staggered ages of entry | 4-5 year olds |
| Nature of intervention | Half-day center-based preschool program with family visits. Highly trained and well-paid staff; pupil-teacher ratio = 5.7 | Center-based program; children attend Head Start preschool centers, receive preventative medical care, nutritional supplements, and cognitive stimulation. | Full-day, center-based program providing developmental and pediatric surveillance, nutritional supplements, and educational services. | Full-day, center-based day-care/preschool program | Home-based early childhood enrichment program; parents receive training in how to educate their children from paraprofessional trainers. |
| Program cost (1999 dollars) | $7252 per child per year (f) | $4491 per child per year (f) | $11,000 per child per year | N/A | N/A |
| Eligibility criteria | Children from families with low SES scores and with IQ scores at age 3 between 70 and 85 points | 90% of families in program below poverty line | Eligibility index based on parents' education, income, presence of father, welfare, parental IQ. | Children from mothers with low IQ test scores (<75) | Children from predominantly poor and immigrant families who also were already enrolled in a city preschool program |
| Type of evaluation | Two comparison groups first matched on observables, then one randomly assigned as treatment group and other as control group | Nonexperimental; within-sibling comparisons for the children of the NLSY data. | Randomized assignment to two experimental conditions: preschool education or not and Home Resource Intervention or not | Randomized assignment to treatment | Randomized assignment to treatment |
| Sample size | 123 children; 58 in treatment group, 65 in control group | 5000 observations (children with siblings who have mothers in the NLSY; 1/5 participated in Head Start) | 111 children; 55 in treated group, 54 in control group | N/A | 219; 98 in treatment group and 84 in control group (two cohorts separately analysed) |
| Short term assessments | Effects on educational performance found at age 4-7, but effects faded out and were not detected at older ages | Effects on PPVT test scores, on the probability of being immunized and on height for age found mainly for white (incl. Hispanic) children. | Immediate effects on IQ test scores of early preschool intervention programs, but no effect found of home resource program | Persistent effects on IQ test scores through age 14 | Significant effects of program found on eleven outcomes, incl. cognitive ability and classroom adaptation (but only for one of two cohorts.) |
| Long term assessments | At age 27, found reduced crime rates, increased educational attainment (for females only), decreased welfare and out of wedlock births | At age 9, lower probability of having repeated a grade for white children | Increased IQ scores to age 21, increased college att., decreased adolescent child-bearing, decreased special ed | N/A | N/A |

(a) Schweinhart and Weikart (1998); (b) Currie and Thomas (1995, 1996); (c) Ramey, Campbell and Blair (1998); (d) Ramey, Campbell and Blair (1998), Garber (1988); (e) Baker, Piotrkowski, and Brooks-Gunn (1998).