POLICY RESEARCH WORKING PAPER 2355

Greed and Grievance in Civil War

Paul Collier
Anke Hoeffler

Of the 27 major armed conflicts that occurred in 1999, all but two took place within national boundaries. As an impediment to development, internal rebellion especially hurts the world's poorest countries. What motivates civil wars? Greed or grievance?

The World Bank
Development Research Group

Summary findings

Collier and Hoeffler compare two contrasting motivations for rebellion: greed and grievance. Most rebellions are ostensibly in pursuit of a cause, supported by a narrative of grievance. But since grievance assuagement through rebellion is a public good that a government will not supply, economists predict such rebellions would be rare.

Empirically, many rebellions appear to be linked to the capture of resources (such as diamonds in Angola and Sierra Leone, drugs in Colombia, and timber in Cambodia). Collier and Hoeffler set up a simple rational choice model of greed-rebellion and contrast its predictions with those of a simple grievance model.

Some countries return to conflict repeatedly. Are they conflict-prone or is there a feedback effect whereby conflict generates grievance, which in turn generates further conflict? The authors show why such a feedback effect might be present in both greed-motivated and grievance rebellions.

The authors' results contrast with conventional beliefs about the causes of conflict. A stylized version of conventional beliefs would be that grievance begets conflict, which begets grievance, which begets further conflict. With such a model, the only point at which to intervene is to reduce the level of objective grievance.

Collier and Hoeffler's model suggests that what actually happens is that opportunities for predation (controlling primary commodity exports) cause conflict and the grievances this generates induce diasporas to finance further conflict. The point of policy intervention here is to reduce the absolute and relative attraction of primary commodity predation and to reduce the ability of diasporas to fund rebel movements.

This paper - a product of the Development Research Group - is part of a larger effort in the group to study civil war and criminal violence. For more on this effort, go to www.worldbank.org/research/conflict. Copies of the paper are available free from the World Bank, 1818 H Street NW, Washington, DC 20433. Please contact Audrey Kitson-Walters, room MC3-527, telephone 202-473-3712, fax 202-522-1150, email address akitsonwalters@worldbank.org. Policy Research Working Papers are also posted on the Web at www.worldbank.org/research/workingpapers. Paul Collier may be contacted at pcollier@worldbank.org. May 2000. (44 pages)

The Policy Research Working Paper Series disseminates the findings of work in progress to encourage the exchange of ideas about development issues. An objective of the series is to get the findings out quickly, even if the presentations are less than fully polished. The papers carry the names of the authors and should be cited accordingly. The findings, interpretations, and conclusions expressed in this paper are entirely those of the authors. They do not necessarily represent the view of the World Bank, its Executive Directors, or the countries they represent.

Produced by the Policy Research Dissemination Center

Greed and Grievance in Civil War

Paul Collier
The World Bank, CEPR and CSAE

Anke Hoeffler
CSAE, Oxford

1. Introduction¹

Civil war is both a tragedy and an impediment to development, affecting most of the world's poorest countries.
It is now far more common than international conflict: of the 27 major armed conflicts listed in the Stockholm International Peace Research Institute Yearbook for 1999 (SIPRI 1999), all but two were internal. In Section 2 we compare two contrasting motivations for rebellion: greed and grievance. Most rebellions are ostensibly in pursuit of a cause, supported by a narrative of grievance. However, since grievance-assuagement through rebellion is a public good which a government will not supply, economists would predict that such rebellions would be rare. Empirically, many rebellions appear to be linked to the capture of resources: diamonds in Angola and Sierra Leone, drugs in Colombia, and timber in Cambodia. We set up a simple rational choice model of greed-rebellion and contrast its predictions with those of a simple grievance model. A second empirical regularity is that some countries are prone to repeat conflict. This may be either because their underlying characteristics make them highly conflict-prone, or because of a feedback effect whereby conflict generates grievance which in turn generates further conflict. We show why such a feedback effect might be present in greed-motivated rebellions as well as in grievance-rebellions. Finally, we consider an integrated model in which the motivation for rebellion is both greed and grievance. In Section 3 we discuss the construction of a comprehensive data set of 161 countries for each of the eight, five-year periods between 1960 and 1999, giving a total of 1288 potential observations. For 73 of these observations the society was at peace at the start of the period but experienced civil war during it. In Section 4 we use logit regressions to explain these collapses into civil war in terms of characteristics at the start of the period. We perform non-nested tests on the greed and grievance models. 
Although the greed model substantially outperforms the grievance model, we cannot reject the hypothesis that the grievance model adds to explanatory power. An integrated model, which incorporates some features of grievance into the greed model, outperforms all other models. An economic calculus of the costs and opportunities for the control of primary commodity exports appears to be the main systematic initial impetus to rebellion, with an additional effect from fear of domination by an ethnic majority. After peace has been restored, the legacy of conflict-induced grievance enables rebel movements to restart conflict by drawing on the support of their diasporas. We show that the results are robust to the inclusion of a wide range of alternative variables, to tests for random and fixed effects, and to correction for rare events bias. The last section concludes.

¹ Previous versions of this paper have benefited from presentations at CERDI, NYU, LSE, Oxford, Princeton, the World Bank and a CEPR conference at Lisbon. We would like to thank participants for comments, and especially Todd Sandler for helpful written suggestions.

2. Theories of Rebellion

Civil war occurs as a result of rebellion. Hence, the phenomenon to be explained is the emergence of a rebel organization. A rebellion is somewhat analogous to three other types of organization: protest movements, armies, and organized crime.

In popular and political analysis, the most common analogue to rebellion is protest. The theory of protest movements (Kuran, 1989) focuses on the coordination problem. In effect, by joining each other on the streets, people can create an election. However, the analogy is not very close. Protest is not a sustained economic activity: the participants are not full-time employees of the protest organization, and they risk little.
Indeed, this is the essence of Kuran's 'prairie fire' model: protest only takes off if the risks fall fast enough (through increasing participation) to make it safe. By contrast, participants in a rebellion must be prepared to fight over a prolonged period against organized force which aims to kill them. In this, a rebel organization is more analogous to a regular army than to a protest movement. Like an army, it must solve immense problems of hierarchy and cohesion to get people to risk their own death in order to further the military objective of the group.

However, unlike both an army and a protest organization, a rebellion must generate revenue in order to feed and pay its workers. The payroll of an army is financed out of taxation which the army itself does not raise. A protest movement does not have a significant payroll. The rebel organization must generate income despite not being directly productive, and in this respect rebellion is like crime. Indeed the analogy with crime is standard in the present economic theory of rebellion. For example, Grossman (1999) states 'in such insurrections the insurgents are indistinguishable from bandits or pirates' (p.269). However, the very scale of rebellion makes it distinctive from other crime. Rebel organizations usually have between 500 and 5,000 workers, whereas most criminals are self-employed or work in small groups. Hence, within crime, the closest analogy is that with organized crime. Even organized crime is usually a rather small scale activity, but the largest groups are approximately comparable in size to the smaller rebel organizations. The recent theory of organized crime (e.g. Konrad and Skaperdas, 1998) explains its larger scale than other crime by scale economies in extortion, eventually counterbalanced by policing.

The analogies with the economic theories of organized crime and protest movements form the basis for the 'greed-rebellion' theory of Section 2.1.
In this model rebellion is a distinctive type of organized crime, although subject to constraints similar to those facing an incipient protest movement. In Section 2.2 we present a contrasting political model of rebellion in which we put aside the economic considerations and focus instead upon various forms of grievance. Section 2.3 compares the two models and considers various ways in which rebellion might combine both greed and grievance.

2.1. A Model of Greed-Rebellion

Consider a rebellion which, like organized crime, generates its income from extortion. The rebels menace legitimate economic activities and exact a tribute. Empirically, rebellion is nevertheless distinctive from other types of organized crime, if only in terms of scale and casualties. We are concerned only with those rebellions which are sufficiently large and sustained, and come into sufficient conflict with government forces, to generate at least 1,000 battle-related deaths. This is the conventional empirical definition of civil war, and this is the phenomenon which we will be seeking to explain. Hence, rebellion is distinctive from other crime in the scale of organized violence.

We suggest that rebellion is distinctive because the object of extortion is different. The typical object of criminal extortion is a street of shopkeepers. The criminal organization is small because the scale economies in menacing shopkeepers are modest, and because the rate of predation cannot get very high before businesses relocate. Rebellion is at the apex of organized crime because the object of extortion is not a street of shopkeepers but the export of primary produce. As a result, both the scale economies of menace and the sustainable rate of extortion are atypically high. Primary commodity exports are sustainable targets for predation because their production is intensive in irreversible and immobile assets, and because produce must be transported to a port.
The owners of irreversible and immobile assets such as land, trees and mines receive rents which can be expropriated without curtailing the activity, whereas similar predation of the incomes generated by mobile factors would simply produce relocation. As primary commodities are transported to a port they are exposed to predation at many geographic 'choke points'. The government itself typically levies taxes at the tightest choke point, namely the port, but rebels may attack at any point along the transport route.

We begin with a model which, for a country of given population, analyzes how the risk of such a predatory rebellion is affected by variations in the level of income and its structure. Corresponding to our subsequent empirical analysis, we will be seeking to explain only the initiation of conflict, rather than its duration.

We first specify a function for the revenue which a rebellion can generate from predation upon primary commodity exports, while being opposed by government forces which are protecting those exports. The gross revenue function has two components: the tribute which can be exacted conditional upon a successful threat of military force, and a military contest function. The tribute is assumed to be a function of the value of primary commodity exports: the tribute increases in this base for predation, but at a diminishing rate. It is convenient to specify primary commodity exports as the product of income, y, and the share of income constituted by such exports, n. For the military contest function, we follow the specification of Konrad and Skaperdas (1998), in which military outcomes reflect the balance of opposing forces, rebel, r, and government, g. Hence:

$R_r = (ny)^{\alpha}\,[r/(r+g)]$. (1)

We next introduce an insight from the economic theory of protest movements (Kuran, 1989). Government forces are not simply trying to impede rebels from predation, they are trying to kill them.
If the rebel force is too small, when it attacks the choke points, the government forces which it encounters will turn from defense to attack. In Kuran's model, there is a tipping point of participation above which the protest becomes viable. Here, we assume that there is a threshold size of rebel forces relative to government forces, below which predation of primary commodity exports is too dangerous to be viable. We introduce this as a survival constraint: rebel forces must exceed some fraction of government forces in order to engage in predation without suffering punishment. Thus,

$r \ge \beta g$. (2)

The rebel cost function, $C_r$, is its wage bill:

$C_r = r w_r$, (3)

where $w_r$ is the rebel wage rate. The rebel wage is assumed to be linear in the level of income, y:

$w_r = \phi y$. (4)

The rebel leader thus chooses the size of the rebel labor force so as to maximize net revenue, (1) minus (3), subject to (2). Rebellion will occur when net revenue is non-negative, which we refer to as the financial viability constraint:

$R_r - C_r \ge 0$. (5)

We next endogenize the initial government defense effort, g, which precedes the initiation of rebellion. The government cost function, $C_g$, mirrors that of the rebels:

$C_g = g w_g$, (6)

$w_g = \lambda y$. (7)

We assume that the government sets its precautionary military expenditure as a fraction of its revenue, $R_g$. Reflecting observed fiscal patterns, government revenue is assumed to be elastic in income, and in the share of income from primary commodity exports. As discussed, the same factors make natural resource exports atypically taxable and lootable.²

$R_g = \delta y^{\theta} + \gamma n y, \quad \theta > 1$. (8)

² Government military expenditure may also be increasing in the proportion of natural resource exports because the latter are recognized as being vulnerable to predation. This would simply reinforce the results below on the non-monotonic effect of natural resource exports on the risk of conflict.

Prior to rebellion, the government devotes a given share of its expenditure, v, to military expenditure.
Hence, prior to rebellion, government forces, g, are:

$g = v(\delta y^{\theta-1} + \gamma n)/\lambda$. (9)

The minimum size of a viable rebellion is that at which the survival constraint is binding. If the rebellion is financially viable at this size, the rebel leader may choose to expand it further. However, since there are diminishing marginal returns to rebel labor, the rebellion will only be worth expanding beyond the size imposed by the survival constraint if it is also financially viable at that size. Hence, financial viability at the size which just satisfies the survival constraint is the condition for the initiation of a rebellion. Substituting, and rearranging, the financial viability condition is:

$\lambda n^{\alpha}/(1+\beta) \ge v(\delta\phi y^{\theta-\alpha} + n\gamma\phi y^{1-\alpha})$. (10)

Equation (10) is thus the key condition for the initiation of greed-rebellion.

Now consider whether the rebellions so initiated are likely to become sufficiently large to be recorded as civil wars: that is, whether they generate at least 1,000 combat-related deaths. For this a key consideration is whether the incipient rebellion generates a phase of 'arms race' in which both government and rebel forces grow in response to each other. If there is such a phase, then we will assume that the rebellion is large enough to generate a civil war.

The sequence of rebellion is as follows. In the pre-history of the rebel movement it builds its forces until they reach the level which just satisfies the survival condition. At this stage rebel forces can start operations, and so combat deaths commence. We assume that the initial response of the government to rebellion is to increase its expenditure upon its military forces and focus on how rebel forces respond to this increase. If rebel forces respond by contracting, then it is possible that the rebellion gets snuffed out before combat-related mortalities reach the critical level of 1,000 at which the conflict is classified as a civil war.
By contrast, if the rebel group initially responds by increasing its own forces, then there is a phase of mutual escalation, although this phase may end with the bankruptcy of the rebellion. The Konrad-Skaperdas (KS) military combat function predicts how the optimal size of the rebel organization changes in response to an increase in government forces: rebel forces will increase if $r > g$, and decrease if $r < g$. [...] $\theta > 1 > \alpha$; the derivative of the RHS w.r.t. y is strictly positive:

$dRHS/dy = (\theta-\alpha)\,v\delta\phi\,y^{\theta-\alpha-1} + (1-\alpha)\,vn\gamma\phi\,y^{-\alpha} > 0$. (11)

Hence, the conditions under which rebellion is profitable become more restrictive the higher is income. Restated, higher income is predicted to reduce the risk of conflict.

Next, consider the structure of income. Differentiating (10) w.r.t. n yields:

$d(10)/dn = [\alpha\lambda n^{\alpha-1}/(1+\beta)] - v\gamma\phi\,y^{1-\alpha}$. (12)

In the neighborhood of n=0, (10) is negative and (12) is positive, so that as n increases from zero the risk of conflict increases. However, (12) is not monotonic in n. Setting (12) to zero and solving for n yields a critical value, n*, below which the conditions for rebellion become easier as n increases, and above which they become tighter. Hence, n* denotes that intermediate level of primary commodity dependence at which the risk of rebellion is at its peak:

$n^{*} = \left[\alpha\lambda / \left((1+\beta)\,v\gamma\phi\,y^{1-\alpha}\right)\right]^{1/(1-\alpha)}$. (13)

The above analysis applies to a country of given population. Now consider how the risk will be affected by variations in population size. If the risk in each of two identical territories is p, the risk that there will be at least one conflict in the two territories is p + p(1-p). The elasticity of risk with respect to population, controlling for n and y, will therefore be (1-p). Globally, over the period 1960-1999 the mean risk of conflict per period was around 0.06, so that the expected elasticity of conflict with respect to population is around 0.94.

Thus far, the model has four testable predictions.
The risk of conflict should be decreasing in per capita income, increasing in the share of natural resource exports in GDP at low levels, decreasing at high levels, and be slightly less than unit elastic in population. We now introduce three refinements which yield further testable propositions: relative military advantage, rebel costs of recruitment, and start-up finance.

³ Collier (2000) further analyzes this escalation.

Relative military advantage

The military combat function included in (1) is a convenient simple special case of the KS function. The more general form of the KS combat function is:

$r/(r+g+k)$, (14)

where k denotes the relative military advantage of the two forces. For k=0, the special case we have adopted above, neither force has technological superiority. For k>0 government forces have the advantage, and conversely for $k < 0$. [...] 0) > 0 (17)

A second presumed cause of grievance is political exclusion, j. The quantitative political science literature has already explored the relationship between conflict and the political rights of a society, q, as the latter range from dictatorial repression to full representative democracy. Econometric studies have found that other than when repression is very severe, it tends to increase the risk of conflict (Gleditsch and Hegre, 1997). However, even democracies may generate grievance if one voting bloc is able to forge a persistent majority and uses its power to disadvantage a minority. Normally, such persistent majorities are elusive because excluded groups are able to make offers which split the majority. However, one circumstance in which a stable winning coalition can form is where political allegiance is pre-determined by ethnic identity and one ethnic group constitutes a majority of the population. Whether such a group uses its power to extract transfers from the minority depends in part upon the size of the majority.
Conditional upon power, large majorities have less incentive to be exploitative than small majorities. For example, if there are fixed costs to inter-group transfers, there is a critical size of minority below which the minority is not worth exploiting. The circumstances in which one or more ethnic groups are permanently politically exploited by a dominant group thus depend upon the largest group having a sufficiently large share of the population to control the political process, but not being so large that exploitation is not worthwhile. The precise range of population shares over which the largest ethnic group exercises such dominance cannot be determined a priori, although simple models would bound the minimum at 50% and the maximum at strictly less than 100%. The grievance model postulates that a dummy variable for societies characterized by such ethnic dominance, D2, will be significant and positive, increasing the risk of conflict. A possible example of grievance-rebellion driven by an excluded ethnic minority might be the conflict in Sri Lanka, where the Tamil Tigers claim to defend the interests of the 12% of the population who are Tamil.

A second circumstance of political exclusion is where the poor are marginalized from the political process. As Hirshleifer (1991) shows, normally the poor will succeed in using the political contest to ameliorate their economic position. A high degree of economic inequality, i, is therefore some indication that the poor are atypically politically marginalized. The 'rage of the poor' at high inequality is indeed probably the single most popular explanation for conflict after that of inter-ethnic hatred, and may be exemplified by the Castro rebellion in Cuba.

A final circumstance of persistent political exclusion is where a rich minority is heavily taxed by the majority. The rich may contest the government or, following Buchanan and Faith (1987), they might attempt to secede from the fiscal jurisdiction of the state.
A possible example of the former might be the Contra rebellion in Nicaragua, and of the latter, the eventually successful secession of the Eritrean region from the much poorer state of Ethiopia. Since there is more incentive for the majority to tax the rich the larger is the share of income or wealth accruing to them, the 'rage of the rich' might also be expected to increase with economic inequality. Hence, grievance due to political repression is postulated to be a function of the general level of political rights, the ethnic composition of the society, and the degree of inequality: j =j(q, D2, i) (18) Finally, much of the case study literature dwells upon history: current rebellion is motivated by the desire to revenge atrocities committed during a previous conflict. Corresponding to the greed theory of induced conflict, we assume that the longer the period since a previous conflict, T, the less strong may be the demand for grievance assuagement for such atrocities. Thus, the intensity of grievance, G, is assumed to be a function of these three components: G =G(h, j, T). (19) Now consider the second stage in protest-rebellion, the failure of the initial protest to escalate into successful political revolution. In the Kuran model the protest movement succeeds in generating political change if, as people join the protest, the resulting reduction in the risk of punishment attracts enough new entrants further to reduce the risk, yielding a cascade. The key parameter is the elasticity of participation with respect to the risk of punishment. Kuran suggests that this elasticity will be greater the more homogenous the society. The participation cascade breaks if there are gaps in the distribution of preferences such as might occur if the society is fractionalized. Hence, social fractionalization enters twice in a model of rebellion as aborted protest. It is both an impetus for the initial cadre of protestors, and an impediment to political revolution. 
If rebellions are aborted protests they should therefore be significantly more common in fractionalized societies.

However, unlike a protest, a rebellion which generates a civil war is a sustained, full-time effort. Adherents must eat, and so, as with a greed-rebellion, the rebel organization is constrained to be financially viable. Here the grievance-rebellion faces a severe test. Grievance-assuagement is predominantly a public good and evidently it is one which will not be provided by the government. Its financing will therefore face acute difficulties of free-riding. The willingness of non-participants to support the rebel organization financially may be affected by their own level of grievance, so our grievance proxies might affect the risk of conflict both through generating hard core adherents, and by increasing the scope for funding. A further possible source of grievance-related funding is diasporas. There is some evidence that diasporas are more grievance-conscious than the populations from which they originated. Hence, it is possible that diasporas are willing to finance the initiation of rebellion even when local populations are not sufficiently concerned to do so themselves. We have already discussed how this proposition can be tested. However, the collective action problem is so severe in the temporary provision of public goods by non-governmental means, that their supply will usually be negligible. Thus, the principal prediction of an economic theory of grievance rebellion is that such a rebellion will not occur, regardless of the intensity of the grievance. We test this prediction using the grievance proxies in (19).

2.3 Greed-Grievance Interactions

We now compare the greed and grievance models of conflict and consider a synthesis.
The greed model postulates that the cause of initial conflict is an economic calculus of relative military advantage, the government's ability to finance defense expenditure, the scale of primary commodity exports, and the costs of rebel recruitment. If a conflict occurs, the rebel organization will generate subjective grievance. Post-conflict, until this gradually decays, it will increase the risk of subsequent conflict. The extent of this post-conflict risk depends upon the size of diasporas, since they are able to finance rebellion.

By contrast, the grievance model postulates that the cause of initial conflict is not an economic calculus but rather is a protest generated by objective grievances: ethnic or religious hatreds, inequality, oppression, or historical vengeance. Rebellions are protests which fail to cascade into non-violent revolution. The scope for rebels to find a refuge, proxied by the three geography variables, can reasonably be added to the grievance model without upsetting its essentially non-economic spirit.

Greed and grievance can co-habit. Where the conditions for greed-rebellion exist but those for grievance-rebellion do not, a group initially motivated by grievance may become dependent upon primary commodity predation for survival, thus transforming itself into a greed-rebellion. Conversely, greed-rebellions need to manufacture subjective grievance for military cohesion and may find an objective grievance an effective basis for generating it. Hence, the presence of primary commodity exports may sustain rebellions which are motivated by objective grievance, while the presence of objective grievance may sustain rebellions motivated by predation. Such interdependence may make case study evidence difficult to interpret.

The two models are evidently not nested, relying largely upon distinct variables, and in Section 4 we compare their predictive power using standard tests.
However, one variable, ethnic and religious fractionalization, is included in both models with an opposite predicted sign and so is of particular interest. The grievance model naturally sees ethnic and religious difference as a potential source of hatreds as compared with a homogenous society, and as an impediment to non-violent revolution. It thus predicts that the risk of conflict will be lower in homogenous societies. The greed model sees fractionalization as tightening the constraint on rebellion: the need for organizational cohesion imposes homogeneity and so if the society is diverse the costs of rebel recruitment are raised.

An integrated model of conflict would allow both greed and grievance to initiate conflict. Objective grievance could make recruits more willing to join, and so lower the costs of rebellion. Potentially, diasporas could provide funding for initial rebellions, motivated by objective grievance, rather than just financing subsequent conflicts motivated by induced grievance, as proposed by the greed model. The greed and grievance models thus potentially nest into an integrated model which combines both sets of variables as causes of initial, as well as subsequent conflicts. Testing these models is the agenda for the rest of the paper.

3. Data

Our empirical analysis incorporates several new data sets constructed for the study. In this data section we present a brief description of the war variable and the various proxies for greed and grievance. The data source for the war variable and all other variables is Hoeffler and Sambanis (2000) where a more detailed data description can be found.

3.1. War data

We analyze the risk of civil war using a panel data set for 161 countries. For each country we have potentially one observation for each of the eight sub-periods 1960-64, 1965-69, ..., 1995-99. For 73 of these observations the country was at peace at the beginning of the period but a civil war started during it.
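The panel layout described above can be sketched in a few lines. This is only an illustration under stated assumptions: the names `PERIOD_STARTS`, `period_label` and `is_war_start` are hypothetical and do not come from the authors' data set; only the 161 countries, the eight five-year sub-periods and the peace-at-start rule are taken from the text.

```python
# Hypothetical sketch of the country-period panel; names are illustrative only.

N_COUNTRIES = 161                            # from the text
PERIOD_STARTS = list(range(1960, 2000, 5))   # 1960, 1965, ..., 1995


def period_label(start: int) -> str:
    """Label a five-year sub-period in the paper's style, e.g. 1960 -> '1960-64'."""
    return f"{start}-{(start + 4) % 100:02d}"


def is_war_start(at_peace_at_start: bool, war_began_in_period: bool) -> bool:
    """An observation counts as a war start only if the country was at peace
    at the beginning of the sub-period and a civil war began during it."""
    return at_peace_at_start and war_began_in_period


potential_observations = N_COUNTRIES * len(PERIOD_STARTS)
```

With eight sub-periods and 161 countries, `potential_observations` reproduces the 1288 potential observations mentioned in the introduction.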
We predict only war-starts, not their continuation.⁴ Hence, once civil war has commenced, the possibility for a further initiation of conflict only occurs once peace has been re-established. In fact, most of the countries which had one conflict went on to have further conflicts. Indeed, 47 of the 73 wars were in countries which had already had at least one previous civil war since 1945. We define a civil war as an internal conflict in which at least 1000 battle related deaths (civilian and military) occurred per year. This definition has become standard in the literature following the seminal data collection exercise and analysis by Singer and Small (1982, 1994). Table 1 lists the 73 observations for which a war started during a sub-period. Our new war data set primarily updates that of Singer and Small from 1992 to the end of the decade.

⁴ In our earlier work (Collier and Hoeffler, 1998), we used a much smaller sample of wars and conflated war starts with war duration, using a tobit procedure. We now regard this approach as flawed since duration appears to be determined by rather different factors from starts.

3.2. Rebel Revenue Sources

The three main sources of rebel revenue are primary commodity exports, diasporas and, during the Cold War, foreign powers.

Primary commodity exports are measured as a ratio of GDP at the beginning of each sub-period. This data was obtained from various World Bank sources.

In order to proxy the size of the diaspora we used US immigration data. The US Bureau of the Census provides detailed figures on the size of the foreign born population.⁵ Inter-census figures were interpolated. To capture the relative size of the population we divided the US immigration figures by the total population of the country of origin.

The end of the Cold War was proxied by a dummy variable which took the value of unity for the periods 1990-94 and 1995-99.
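A minimal sketch of how the diaspora proxy and the Cold War dummy described above might be computed. The text does not say how inter-census figures were interpolated, so linear interpolation (with values held flat outside the census range) is an assumption here, and all function names and the example figures are hypothetical.

```python
from bisect import bisect_right


def interpolate_census(year: int, census: dict) -> float:
    """Linearly interpolate a figure between census years.

    ASSUMPTION: the paper only says figures were 'interpolated'; linear
    interpolation and flat extrapolation outside the range are illustrative.
    """
    years = sorted(census)
    if year <= years[0]:
        return float(census[years[0]])
    if year >= years[-1]:
        return float(census[years[-1]])
    i = bisect_right(years, year)
    y0, y1 = years[i - 1], years[i]
    weight = (year - y0) / (y1 - y0)
    return census[y0] + weight * (census[y1] - census[y0])


def diaspora_ratio(us_foreign_born: float, origin_population: float) -> float:
    """US-resident foreign-born population relative to the origin country's population."""
    return us_foreign_born / origin_population


def post_cold_war(period_start: int) -> int:
    """Dummy equal to 1 for the sub-periods 1990-94 and 1995-99, else 0."""
    return 1 if period_start >= 1990 else 0


# Made-up illustrative figures, not data from the paper.
example = interpolate_census(1965, {1960: 100, 1970: 200})
```

Holding values flat outside the first and last census years is simply the most conservative choice for a sketch; the authors may have handled the endpoints differently.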
3.3 Relative Military Advantage

Military advantage reflects geography, rebel cohesion and rebel motivation.

Using geographic data we calculated a Gini coefficient measuring the dispersion of the population in a country. Analogous to the income Gini coefficient, the Gini coefficient of population concentration will be high if a large proportion of the country's population is concentrated in a relatively small area of the country. The more evenly the population is dispersed across the country, the lower is the Gini coefficient of population concentration. For the calculation of this Gini coefficient we used population data per 400 km² cells.6 We also experimented with two further measures of population dispersion, population density and the proportion of the population living in urban areas, neither of which was significant. The data source for both series is WDI 1998.

Data on forest coverage was obtained from the Food and Agriculture Organization (FAO).7 For each period this provided an estimate of the percentage of a country's land area covered in forests and woods.

We could find no satisfactory existing data set on mountainous terrain. The study commissioned a new data set from Dr. Gerrard, a specialist on the subject. The measure allows for the ruggedness of the terrain, rather than simply relying upon altitude, and is available from the World Bank.8

We measure social fractionalization by ethno-linguistic and religious fractionalization. The ethno-linguistic fractionalization data was obtained directly from the Atlas Naradov Mira (1964), which lists the ethno-linguistic groups for each country. Based on the Atlas Naradov Mira data an ethno-linguistic fractionalization index can be calculated.9 For each country it measures the probability that two randomly selected people do not speak the same language. An index of 0 means that the entire population speaks the same language while a higher index indicates a higher degree of linguistic heterogeneity. The maximum value of the index is 100. Using the same concept we also constructed a religious fractionalization index which measures the probability that two randomly selected people do not share the same religious affiliation.10

5 Source: http://www.census.gov/population
6 We would like to thank Uwe Deichman (World Bank) for extracting the original data from the Geographic Information System (GIS).
7 Source: http://www.fao.org/forestry
8 We would like to thank John Gerrard, University of Birmingham, for the compilation of this variable.
9 We would like to thank Tomila Lankina, University of Oxford, for translating the data entries from the Atlas Naradov Mira. The first study to use this measure in the economics literature is Mauro (1995).

The ethno-linguistic data was measured at about 1960 and the religious data was measured in 1970 and 1980. Since these measures only change very slowly over time we use the linguistic data for all sub-periods and the 1970 data on religion for the observations 1960-70 and the 1980 data for 1980-99. For 1975 we use the average of the 1970 and 1980 data.

Social fractionalization can potentially be equal to, additive in, or multiplicative in religious and ethnic fractionalization. For example, if all the religious divisions coincide with some or all of the ethnic divisions, then the overall fractionalization of the society may be no greater than that measured by ethnic fractionalization. With such coincident divisions, social diversity would simply be the maximum of the underlying ethnic and religious diversity measures. A second possibility is that the religious divisions occur within ethnic groups (or conversely). In this case, social diversity would be approximated by the sum of the underlying measures. A third possibility is that cleavages are cross-cutting, so that ethnic groups are divided by religion and religious groups by ethnicity. In this case social diversity would be approximated by the product of the underlying measures.
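As an illustration of the fractionalization index and of the three candidate aggregations just listed, the following Python sketch may help. It assumes the usual construction of such an index, one minus the sum of squared group shares, scaled to 0-100; the paper describes the index only verbally, so that formula, the function names and the example figures are assumptions. The raw product is left unscaled because the text does not specify a normalization.

```python
def fractionalization_index(group_sizes):
    """Probability, on a 0-100 scale, that two randomly drawn people
    belong to different groups (assumed formula: 1 - sum of squared shares)."""
    total = float(sum(group_sizes))
    if total <= 0:
        raise ValueError("group sizes must sum to a positive number")
    same_group = sum((size / total) ** 2 for size in group_sizes)
    return 100.0 * (1.0 - same_group)


def aggregate_candidates(ethnic, religious):
    """Three candidate measures of social diversity from the two indices."""
    return {
        "coincident": max(ethnic, religious),  # divisions coincide
        "nested": ethnic + religious,          # divisions within groups
        "cross_cutting": ethnic * religious,   # cross-cutting cleavages (raw product)
    }


# Hypothetical society: two equally sized language groups and a 90/10 religious split.
ethnic = fractionalization_index([50, 50])
religious = fractionalization_index([90, 10])
print(ethnic, religious, aggregate_candidates(ethnic, religious))
```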
More precisely, since the index for each measure ranges 0-100, cross-cutting cleavages would be proxied by the product of the two indices plus the maximum of the two underlying indices. It is necessary to add the maximum of the underlying indices to the product, since otherwise a society which is (say) ethnically diverse but homogenous in religion would appear to be as homogenous as a society that was homogenous in both ethnicity and religion. In the analysis of Section 4 we use only this third measure of social fractionalization, demonstrating in Table 8 that it dominates both the other approaches to aggregating ethnic and religious diversity.

10 Most of the data was kindly made available by Robert Barro (Barro 1997) and for some countries we used the data from the original source (Barrett 1982).

The time since the previous civil conflict is measured in months. If a country experienced a civil war we measure the peace period as the number of months between the end of the civil war and the beginning of the sub-period, 1960, 1965, ..., 1995. If a country never experienced a civil war we measure the peace period since the end of World War II, i.e. in 1960 these countries have 172 months of peace and they accumulate 60 additional peace months in every sub-period.

3.4 Opportunity Cost

We use four measures of the opportunity cost of recruitment: secondary school enrolment rates for men, average income, population growth and income growth.

Male schooling is measured by the gross male secondary school enrollment ratio. This was obtained from the World Bank World Development Indicators (WDI) 1998. Gross male enrollment ratios are defined as the ratio of total male enrollment, regardless of age, to the male population of the age group that officially corresponds to the level of secondary schooling. Thus, the gross enrollment ratio can exceed 100 percent.

Income is measured as the per capita GDP at the beginning of each period.
In order to be able to compare income data over time and across countries we mainly use the real PPP adjusted figures as provided in the Penn World Tables Mark 5.6 (PWT). For the 1995 data we used the growth rates from the real PPP adjusted GDP per capita data from the WDI 1998 in order to update the PWT data, which only provide data up to 1992.

Population growth is measured as the average annual growth rate of the population in the previous five years. The main source of the population data is WDI 1998.

Income growth is measured as the average annual growth rate of real per capita GDP in the previous five years. The income data used to calculate the growth rates is as described above.

We combine population growth and per capita income growth into a single variable, which is the weighted difference between them. We experimented with weights, choosing that which gave the variable the highest t-statistic, this being when the population growth rate was given three times the weight of per capita income growth.

3.5 Sources of Grievances

Here we concentrate on two sources of grievances, political exclusion and inequality, since the variables which proxy sources of hatred (social fractionalization) and vengeance (previous conflicts) have already been discussed above.

Data on political exclusion is available in the Polity III data set (Jaggers and Gurr 1995). We concentrate on the democracy variable which characterizes the general openness of political institutions. The democracy score ranges from 0 to 10 where 10 denotes a highly open regime. In addition we tried the autocracy score from the Polity III data set and a measure of political openness published by Freedom House ("Gastil Index").11

We constructed a measure of ethnic dominance based on the ethno-linguistic data from the Atlas Naradov Mira. This dummy takes a value of one if the largest linguistic group makes up between 45 and 90 percent of the total population and zero otherwise.
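The ethnic dominance dummy defined above is simple enough to state as code. The Python sketch below is an illustration only: the function name and the example shares are hypothetical, and treating both ends of the 45-90 percent range as inclusive is an assumption, since the text does not say how boundary cases are handled. The bounds are left as parameters so that alternative ranges can be tried.

```python
def ethnic_dominance(largest_group_share, lower=45.0, upper=90.0):
    """Return 1 if the largest group's share of the population (in percent)
    lies between lower and upper, and 0 otherwise.

    Inclusive bounds are an assumption made for this sketch.
    """
    if not 0.0 <= largest_group_share <= 100.0:
        raise ValueError("share must be a percentage between 0 and 100")
    return 1 if lower <= largest_group_share <= upper else 0


# Hypothetical countries with the share of their largest linguistic group.
shares = {"A": 30.0, "B": 60.0, "C": 97.0}
dummies = {name: ethnic_dominance(share) for name, share in shares.items()}
print(dummies)
```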
We constructed a number of different dummies to proxy ethnic domination. As reported below, we found that when ethnic dominance was defined by this range the economic and statistical significance of the variable was at a maximum.

Income inequality is measured as in Deininger and Squire (1996), either as the Gini coefficient of income distribution or as the ratio of the top quintile's share of income to the bottom quintile's share. We proxy land inequality by a Gini coefficient of land distribution; for a more detailed description please refer to Deininger and Squire (1998).

11 Source: http://www.freedomhouse.org/ratings

In Table 2 we present some descriptive statistics for the main variables of interest, distinguishing between the peace observations, the war observations and the entire sample. The war observations are on average characterized by a higher proportion of previous civil wars, lower opportunity costs of war (lower secondary school enrolment rates, lower per capita income, lower economic growth and higher population growth), larger populations, higher ethnic fractionalization, more mountainous terrain, a lower democracy score and higher income inequality. However, war observations have a similar average share of primary commodity exports in GDP, similar religious fractionalization, a similar incidence of ethnic dominance and a smaller diaspora than peace observations. We now turn to the regression analysis to examine the relationship between these possible causes of conflict and the risk of civil war.

4. Empirical Results

Our empirical analysis attempts to predict the risk that a civil war will start during a five-year sub-period, through a logit regression in which the explanatory variables are characteristics at the start of the sub-period.

We start with the greed model (see Table 3). Because per capita income and enrolment in secondary schooling are highly correlated, they cannot be used in the same regression.
The first four columns include secondary schooling but not per capita income, which permits a sample of 688 episodes of which 43 are war observations. The final column replicates the core regression using per capita income instead of secondary schooling, which permits a sample of 747 episodes of which 47 are war observations.

The first column omits the variables which proxy the effect of previous civil war. The two opportunity cost proxies are significant with the expected signs. A higher gross secondary school enrolment rate for males reduces the risk of war. A larger difference between income growth per capita and population growth, both measured for the previous five year period, decreases the risk of war. As predicted in (12), the effect of primary commodity exports on the risk of war is significant but non-linear. The highest risk of war is at a share in GDP of about 25 percent. The positive linear effect on risk suggests that there is a predatory element in rebellion. The negative quadratic effect suggests that the increased tax revenue eventually augments the capacity of the government to defend itself sufficiently to offset the attraction of enhanced loot. The elasticity of the risk of conflict with respect to population is approximately unity, as predicted. A higher degree of fractionalization makes societies significantly safer: high cross-cutting ethno-linguistic and religious fractionalization decreases the risk of civil war. Two of the technology parameters, the dispersion of the population and mountainous terrain, are also significant. More mountainous countries are more likely to experience a civil war, while a higher concentration of the population decreases the risk of conflict.

We then introduce the effect of previous conflicts. We hypothesize that a previous civil war may increase the risk of experiencing a civil war. Initially, we test this hypothesis by including a dummy which equals one if the country experienced a civil war in the past.
Here we include any civil wars which occurred between the end of World War II and 1995. The coefficient on this previous war dummy is highly significant: previous wars increase the risk of new civil war. However, such a dummy could equally be picking up unobserved country-specific effects. We then introduce the length of the peace period measured in months since the end of the last civil war. In column 3 we add this peace duration variable to the model. The coefficient is negative and significant, i.e. the longer the peace period the lower the risk of conflict. The coefficient on the previous war dummy is now insignificant. Thus, the peace period is a more precise measure of the effect than is the previous war dummy. This indicates that the risk decays after conflict, as might grievance, rather than being a proxy for unobserved country-specific effects. In column 4 we therefore drop the insignificant previous war dummy, leaving a model in which greed causes initial conflict; conflict causes some effect such as grievance; and this grievance causes further conflict.12

In the last column we present an alternative specification in which we include the average income per capita as a measure of opportunity costs instead of schooling. The results are very similar, although the overall performance of the regression deteriorates.

In Table 4 we investigate whether the effect of post-conflict grievance on the risk of conflict is reinforced by the diaspora, as suggested by the analytic model. Since the data set on the size of diasporas in the USA reduces the number of countries on which we have data, the sample is much smaller than the 747 observations and 47 wars which form our maximum sample in the previous analysis.
In order to preserve sample size we therefore retreat to a more parsimonious version of the model, dropping four sample-constraining peripheral explanatory variables: ethnic and religious fractionalization, geographic concentration of the population, the extent to which the terrain is mountainous, and the rate of growth in the previous five year period. The remaining explanatory variables are thus per capita GDP, primary commodity exports, population, and the number of months since the previous conflict. Even with these data-restoring deletions, the sample size is reduced to 595 observations (containing 29 war observations). However, all the included explanatory variables remain significant.

12 We examined this time effect in more detail by including the natural logarithm of the peace variable or its square; however, a linear decay term provides a better fit. Note that the measure of peace since the end of the civil war is somewhat imprecise since we only measure it from the end of the war to the initial year of each sub-period. A duration model of post-war peace would allow a more detailed analysis of this peace effect; however, the duration model results in Collier, Hoeffler and Söderbom (1998) support the results presented in this paper.

On this sample, we then test for the effect of a diaspora. We measure the size of the diaspora relative to the resident population in the country of origin. To allow for a fading post-conflict effect, we interact this measure with the number of months since the previous conflict, dividing the former by the latter. This variable, dias/peace in Table 4, is added to the regression, the result being shown in the first column. The variable is positive and significant, indeed slightly more significant than the number of months of peace by itself, although the latter also remains significant. A large diaspora considerably increases the risk of further conflict.
If we compare the post-conflict society with the largest diaspora against that with the smallest, with other variables at their mean values, after five years of peace the risk of renewed conflict is around six times greater in the former.

However, while this result is consistent with the analytic model, it is also open to a more anodyne interpretation. Evidently, diasporas are to an extent endogenous to the intensity of conflict: when civil war occurs, many people leave and settle in the USA. Hence, the size of the diaspora might simply be proxying the intensity of conflict. The result may therefore simply show that more severe conflicts have a higher risk of renewed conflict. To test for this we decomposed observed diasporas into a component which is exogenous to the intensity of conflict and a residual component which is potentially endogenous to its intensity. For this decomposition we estimated a simple migration model, reported in the Appendix, Table A3. The size of the diaspora in a census year is predicted to be a function of its size in the previous census, time, per capita income in the country of origin, and whether there was a war in the intervening period. This model predicts the size of the diaspora with reasonable accuracy. We then replace the diaspora data used in the model with estimated diaspora size in all cases where the observed diaspora is for a year subsequent to a conflict. Thus, all post-conflict observations of diasporas are estimates which are purged of any effect from the intensity of conflict. The difference between these estimates and the actual figures is then used as an additional variable, measuring that part of the diaspora which is potentially endogenous to the intensity of conflict. Both of these measures are then introduced into the regression in place of the previous single measure of the diaspora. The results are reported in the second column of Table 4.
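The decomposition just described can be sketched in Python as follows. This is a minimal illustration, not the authors' code: the migration model itself is not reproduced, so the predicted value is simply passed in; the sign convention for the residual (observed minus predicted) and the choice of a zero residual for observations that are not post-conflict are assumptions; and all names and figures are hypothetical.

```python
def decompose_diaspora(observed, predicted, post_conflict):
    """Split a diaspora measure into a purged part and a residual part.

    For post-conflict observations the purged measure is the migration-model
    prediction and the residual is observed minus predicted (assumed sign).
    Otherwise the observed value is kept and the residual is set to zero
    (an assumption for this sketch).
    """
    if post_conflict:
        return predicted, observed - predicted
    return observed, 0.0


def diaspora_over_peace(diaspora_share, months_of_peace):
    """Interaction term: diaspora share divided by months since the last conflict."""
    if months_of_peace <= 0:
        raise ValueError("months of peace must be positive")
    return diaspora_share / months_of_peace


# Hypothetical post-conflict observation: observed share 2.0%, predicted 1.5%,
# five years (60 months) after the end of the war.
purged, residual = decompose_diaspora(observed=0.020, predicted=0.015, post_conflict=True)
print(diaspora_over_peace(purged, 60), diaspora_over_peace(residual, 60))
```

Entering the two components separately is what allows the coefficient on the purged part to be compared with that on the residual part.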
The purged measure of the diaspora remains significant, and the size of the coefficient is only slightly altered. Further, its coefficient is not significantly different from that on the endogenous diaspora measure. Had the effect of the diaspora been simply a proxy for the intensity of conflict, neither of these would have been the case. The purged variable would have been insignificant, and the coefficient on the endogenous measure would have been larger. This suggests that the substantial effect of the diaspora on the risk of conflict renewal is indeed due to its financial contribution to war start-up.

In Table 5 we turn to the examination of a rebellion which is motivated only by grievance. In the first column we examine the relationship between ethnic dominance, social fractionalization, democracy and the duration of peace. As in the greed model, we control for geographic military advantage by including the dispersion of the population and mountainous terrain. Since we are not including any lagged variables we can use 884 observations of which 57 observations experienced an outbreak of civil war. The results suggest that societies which are dominated by one large ethnic group (45-90 percent of the population belonging to the same ethnic group) are more likely to experience conflict. Greater openness of political institutions reduces the risk of conflict. In the second column we add the Gini coefficient of income inequality as an explanatory variable and in column 3 we add the Gini coefficient of land inequality as an alternative measure of inequality. Neither measure is significant. Note that the sample size is reduced when we include the income inequality measures. However, we are still analyzing a substantial number of war occurrences (40 in column 2 and 38 in column 3).13 All three grievance models have very low explanatory power: the regressions only have an R² of 0.14 or lower.
We now turn to the question of which model, greed or grievance, provides a better explanation of the risk of civil war. Since the two models are non-nested, i.e. one model is not a special case of the other, we use the J-test as suggested by Davidson and MacKinnon (1981). This non-nested test is based on the following artificial nesting procedure. First we explain the risk of civil war, p, in terms of the two different models, greed and grievance.

(1) $p = f(\text{greed})$
(2) $p = f(\text{grievance})$

Based on these logit regressions we calculate the predicted probabilities and add these predicted values, $p^{greed}$ and $p^{grievance}$, to our alternative models.

(1) $p = f(\text{greed}, p^{grievance})$
(2) $p = f(\text{grievance}, p^{greed})$

According to the J-test the significance of the coefficients of these added variables enables us to choose between the two different models. If $p^{grievance}$ is significant in the greed model we reject the greed model in favor of the grievance model. If $p^{greed}$ is significant in the grievance model we reject the grievance model in favor of the greed model. As can be seen in columns 2 and 3 of Table 6, $p^{grievance}$ is significant in the greed model and $p^{greed}$ is significant in the grievance model. Thus, we conclude that we cannot choose between the two models: grievance adds explanatory power to the greed model and vice versa.14

Since we find neither model dominates the other, we next investigate the combination of the two models as presented in column 3 of Table 6. In this combined model all variables apart from democracy are significant and the overall fit is reasonable (pseudo R² of about 0.3). Since both the grievance and greed models are nested in the combined model, we can use a likelihood ratio test to determine whether the combined model is superior. We can reject the validity of the restrictions proposed by both the greed and the grievance models.15

13 We also tried the ratio of the income shares of the top to the bottom quintiles. This was also insignificant.
14 When we tested an alternative specification of the greed model which includes income rather than schooling, we found that grievance did not seem to add explanatory power to the greed model, i.e. the coefficient on $p^{grievance}$ was insignificant.
15 Using the same sample as for the combined model (n=665) we obtain the following results: greed model versus combined model, 2 degrees of freedom, likelihood ratio test (LRT) statistic 6.34, p-value 0.042; grievance model versus combined model, 5 degrees of freedom, LRT statistic 41.76, p-value 0.000.

Although the combined model is superior to the greed and grievance models, one variable, democracy, is completely insignificant.16 Dropping it yields the model in column 4. Without the democracy variable the coefficients and standard errors of all of the other variables remain virtually unchanged. Based on a likelihood ratio test we cannot reject the hypothesis of a zero coefficient on the democracy variable.17 While all variables in the model of column 4 are significant, that for mountainous terrain is only significant at the ten percent level and in column 5 we investigate a more parsimonious model in which it is dropped. Since this model is nested within the two previously presented models, we again use likelihood ratio tests to compare it to those of columns 3 and 4. The tests narrowly favor this model over a model which includes mountainous terrain.18 No further reduction in the model is accepted, and no additions of variables included in our previous models are accepted. We refer to the model in column 5 as our baseline model.

Our baseline model allows us to calculate the change in the probability of war-starts for different values of the explanatory variables. We present these calculations in the Appendix, Table A5. At the median of all variables a country would have a 1.4 percent probability of experiencing a civil war, while at the mean of all variables the risk of a war-start is higher, about 14 percent.
Our model predicts that a country with the worst characteristics (lowest per capita income, lowest GDP growth and highest population growth, a primary commodity export share of 0.26, the largest population, the lowest fractionalization, ethnic dominance, a geography Gini coefficient of zero and only one month of peace) would have a near-certain risk of war while a country with the best characteristics would be a very safe society.

We now calculate how the change in one variable (while the others are assumed to take their mean values) affects the probability of experiencing a civil war. Whether a country is characterized by ethnic dominance has a considerable impact on the risk of experiencing a civil war. Countries without a dominant ethnic group have a risk of war of about 10 percent; the risk is doubled if countries have a dominant ethnic group. If the male secondary school enrollment rate is 10 percentage points higher than the average, the risk of war is reduced by about four percentage points (a decline in the risk from 14% to 10%). Higher income growth reduces the risk of conflict: an additional percentage point on the growth rate reduces the risk of war by one percentage point (a decline from 14% to 13%). A reduction in population growth has a larger effect on the risk of war: a one percentage point decrease implies a 2.5 percentage point reduction in the risk of conflict (from 14% to 11.5%). The effect of primary commodity exports is very considerable: at the risk maximizing value of the primary commodity export share (0.26), the risk of civil war is about 23 percent, while a country with no natural resource exports only has a probability of a war-start of 0.5 percent. A society characterized by the maximum social fractionalization has a very small risk of experiencing a civil war (about three percent); however, the risk is about eight times higher for a country which is socially homogenous. Directly after a civil war there is a high probability of a re-start: the risk is about 42 percent. This risk declines over time; however, sustained peace only slowly decreases the risk of experiencing a new conflict. Five years after the end of the civil war the risk is still about 37 percent. The elasticity of the risk of conflict with respect to population is 0.95, close to the analytic expectation of 0.94 discussed in Section 2. Lastly, we examine the impact of the dispersion of the population on the risk of conflict. Countries with a highly concentrated population are very safe from conflict while countries which are characterized by a homogenously dispersed population have a very high risk of civil war (about 60 percent).

16 We tried different specifications to test for the effect of political repression by including the autocracy score instead of the democracy score and by using the difference between the two variables as suggested by Londregan and Poole (1996). We also tried the Freedom House measure of political freedom, but none of these alternative political repression measures was found to be significant.
17 Using the same sample (n=665) we obtained a log likelihood of -111.25 for the restricted model. Thus, the LRT statistic is 2.01, 1 degree of freedom, p-value 0.16.
18 First, using the same sample (n=665) we test the combined model (column 3) versus the baseline model (column 5). We obtain a LRT statistic of 3.24, 2 degrees of freedom, p-value 0.198. Second, we test the reduced model (column 4) against the baseline model (column 5). We obtain a LRT statistic of 2.03, 1 degree of freedom, p-value 0.16.

In column 6 of Table 6 we run the baseline model with average per capita income instead of schooling. Recall that since we have more data on income than on schooling we are able to use a larger sample which includes four additional war observations.
The variables are all significant at conventional levels, although the overall explanatory power is slightly lower. Using this maximum sample we analyze the effect of external assistance as a cause of war. Adding a post cold-war dummy (which takes a value of one for the periods 1990-94 and 1995-99) in column 7 we find that the risk of civil war was significantly higher during the cold war period. This is consistent with the prediction that the super-powers eased the financing constraint on rebellion.19 Thus, we find some weak evidence that external assistance increased the risk of civil war during the cold war.

In Table 7 we investigate a number of different estimation issues. We concentrate on the analysis of random effects, fixed effects, time effects and a correction for rare events.

We re-estimated our models using random effects. For the baseline model, and indeed all those models which include a feedback effect from previous conflict, we accept the hypothesis that we can pool across the observations.20

The estimation of fixed effects logits was only possible on a very small sub-sample of the observations. The countries for which the dependent variable does not vary over time (the majority of countries experienced only peace) cannot be included in the analysis. Although the fixed effects test is very severe, the non-monotonic effect of primary commodity exports remains significant. Were the effect of primary commodity exports dependent only upon cross-section data, it might suggest that the variable was proxying some other characteristic such as geography. However, the fixed effects regression uses only changes in primary commodity dependence, and so reduces the scope for alternative interpretations.

19 This result only holds for the augmented sample which includes 47 civil wars. Due to data constraints the baseline model excludes four of these wars, all of which started during the cold war period. The wars we have to exclude are Chad (1980-84), China (1965-69), Jordan (1970-74) and Liberia (1985-89).
20 The null hypothesis (rho=0) is not rejected: the chi-squared statistic is 0 (p = 0.998). When the feedback effect from previous conflict is excluded, pooling is marginally rejected: the chi-squared statistic is 2.02 (p = 0.155).

We analyzed whether time effects matter by including time dummies in the model. Based on a log likelihood ratio test we cannot reject the hypothesis that the time dummies are zero.21 However, as discussed above, the dummies for 1990-94 and 1995-99 are jointly significant and support consolidation into a single dummy for the post Cold War period in the variant of the baseline model.

Finally, in the last column of Table 7 we use a recently developed correction method for rare events data (King and Zeng, 2000). Our data is characterized by a relatively small number of events (wars): only about six percent of the observations are characterized by a civil war outbreak. The results presented in King and Zeng (2000) suggest that standard logit estimation underestimates the probability of an event occurring when the events are rare. We used their correction procedure but found the differences between the standard logit results and the rare events corrected results to be negligible. Using the rare events logit procedure, all coefficients on the variables have the same signs and are significant at the same levels. The mean of the predicted probabilities obtained from the standard logit regression is 0.063 and the predicted probabilities from the rare events logit regression have a mean of 0.068. The standard deviations and the extreme values of the predicted probabilities obtained from the standard logit and the rare events logit were also similar. Thus, contrary to King and Zeng (2000), we do not find that the logit substantially underestimates the probability of the event occurring.

We examined a number of different model specifications.
We found that none of the following geographic and demographic characteristics were significant: forest coverage, population density, the proportion of the population living in urban areas and the proportion of young men aged 15 to 29.

In Tables 8 and 9 we present a more detailed analysis of ethnic fractionalization. In Table 8 we demonstrate that the measure of cross-cutting cleavages (the product of religious and ethnic fractionalization plus the maximum of the two underlying indices) dominates the other possible aggregation procedures for ethnic and religious diversity. When this measure of fractionalization is included with the ethnic and religious diversity indices either together or individually, it is significant whereas the underlying indices are not significant.

In Table 9 we investigate alternative definitions of ethnic dominance. In the first column we include the ratio of the largest ethnic group to the total population in the model and find its coefficient to be insignificant. We then construct a number of different ethnic dominance dummies, taking the value one if 45-90, 45-85, 45-95, 30-90, 40-90, or 50-90 percent of the total population belong to the same ethnic group. As can be seen, the relationship between ethnic dominance and the risk of conflict is most significant when the largest ethnic group makes up 45-90 percent of the total population.22

21 The LRT statistic is 2.1, 6 restrictions, p=0.91.
22 We also experimented with various other definitions of ethnic dominance, such as the dummy taking a value of one if the largest ethnic group made up 50-75, 50-80, 50-85, 50-90 percent of the population. None of these definitions provided a better fit than our preferred definition based on 45-90 percent.

5. Conclusion

We started from two simple models of rebellion. Greed-rebellion was motivated by predation of the rents from primary commodity exports, subject to an economic calculus of costs and a military survival constraint.
Grievance-rebellion was motivated by hatreds which might be intrinsic to ethnic and religious differences, or might reflect objective resentments such as domination by an ethnic majority, political repression, or economic inequality. Both of these models could allow for technological constraints upon rebellion such as differences in geography. We then introduced the possibility of a feedback effect, whereby conflict increased the risk of subsequent conflict through generating grievance. In grievance-rebellion this added a simple destabilizing loop: grievance causes conflict which then causes further grievance. In greed-rebellion the feedback was more complex because induced grievance only raised the risk of further conflict if it augmented the potential financial rewards to rebellion. We postulated that such a loop might arise if grievance enabled rebellions to tap into the resources of diasporas. Finally, we allowed for interdependence between greed and grievance. Greed-rebellions need to generate grievance for military cohesion; grievance-rebellions might be driven to predation to raise finance.

Using a comprehensive panel data set of conflict over the period 1960-99, we examine the risk of civil war using logit regressions. Starting with the distinct greed and grievance models, we find that most of the proxies for objective grievance are insignificant and that the best-performing grievance model has very low explanatory power. By contrast, the simple greed model performs well. The extent of primary commodity exports is the largest single influence on the risk of conflict. Strikingly, ethnic and religious fractionalization makes a society safer, as predicted by the greed model, rather than more dangerous as predicted by the grievance model. We find that a war substantially increases the risk of subsequent war, although this risk fades over time.
Thus, although objective grievance is not a powerful primary cause of conflict, conflict may generate grievances which become powerful additional risk factors. However, we can show that by far the strongest effect of war on the risk of subsequent war works through diasporas. After five years of post-conflict peace, the risk of renewed conflict is around six times higher in the societies with the largest diasporas in America than in those without American diasporas. Presumably this effect works through the financial contributions of diasporas to rebel organisations.

Finally, we test for an integrated greed-grievance model. We find that only one of the potential sources of objective grievance significantly adds to the explanatory power of the greed model, namely ethnic dominance. Societies in which the largest ethnic group has between 45% and 90% of the population have around double the risk of conflict of other societies. Presumably, this is because in such societies the largest group has both the power and the incentive to exploit minorities.

Our results thus contrast with conventional beliefs about the causes of conflict. A stylized version of these beliefs would be that grievance begets conflict which begets grievance which begets further conflict. With such an analysis, the only point of intervention is to reduce the level of objective grievance. Our model suggests that what is actually happening is that opportunities for primary commodity predation cause conflict, and that the grievances which this generates induce diasporas to finance further conflict. The policy intervention points here are reducing the absolute and relative attraction of primary commodity predation, and reducing the ability of diasporas to fund rebel movements.

References

Barro, R. J. 1997. Determinants of Economic Growth. Cambridge, MA: MIT Press.
Barrett, D. B., ed. 1982. World Christian Encyclopedia. Oxford: Oxford University Press.
Buchanan, J. M., and R. L. Faith. 1987. Secession and the Limits of Taxation: Towards a Theory of Internal Exit. American Economic Review 77:1023-31.
Collier, P. 2000. Rebellion as a Quasi-Criminal Activity. Journal of Conflict Resolution 44 (forthcoming).
Collier, P., and A. E. Hoeffler. 1998. On the Economic Causes of Civil War. Oxford Economic Papers 50:563-73.
Collier, P., A. E. Hoeffler, and M. Söderbom. 1998. On the Duration of Civil War and Postwar Peace. Centre for the Study of African Economies, Discussion Paper 98/8.
Davidson, R., and J. G. MacKinnon. 1981. Several Tests for Model Specification in the Presence of Alternative Hypotheses. Econometrica 49:781-793.
Deininger, K., and L. Squire. 1996. A New Data Set Measuring Income Inequality. World Bank Economic Review 10:565-91.
Deininger, K., and L. Squire. 1998. New Ways of Looking at Old Issues: Inequality and Growth. Journal of Development Economics 57:249-287.
Department of Geodesy and Cartography of the State Geological Committee of the USSR. 1964. Atlas Narodov Mira. Moscow.
Gerrard, A. J. W. 2000. What is a Mountain? Mimeo. DECRG, World Bank.
Gleditsch, Nils Petter, and Håvard Hegre. 1997. Peace and Democracy: Three Levels of Analysis. Journal of Conflict Resolution 41:283-310.
Grossman, H. I. 1999. Kleptocracy and Revolutions. Oxford Economic Papers 51:267-283.
Herbst, Jeffrey. 2000. States and Power in Africa. Princeton: Princeton University Press.
Hirshleifer, J. 1991. The Paradox of Power. Economics and Politics 3:171-200.
Hoeffler, A. E., and N. Sambanis. 2000. Data Notes - World Bank Project on 'The Economics of Civil Wars'. Mimeo.
Jaggers, Keith, and Ted Robert Gurr. 1995. Tracking Democracy's Third Wave with the Polity III Data. Journal of Peace Research 32:469-482.
Kaplan, Robert D. 2000. The Coming Anarchy. New York: Random House.
King, G., and L. Zeng. 2000. Logistic Regression in Rare Events Data. http://gking.harvard.edu/preprints.shtml
Konrad, Kai A., and Stergios Skaperdas. 1998. Extortion. Economica 65:461-77.
Kuran, T. 1989. Sparks and Prairie Fires: A Theory of Unanticipated Political Revolution. Public Choice 61:41-74.
Londregan, J. B., and K. T. Poole. 1996. Does High Income Promote Democracy? World Politics 49:1-30.
Mauro, P. 1995. Corruption and Growth. The Quarterly Journal of Economics 110:681-712.
Sachs, J. D., and A. M. Warner. 1995. Natural Resource Abundance and Economic Growth. National Bureau of Economic Research Working Paper 5398.
The Stockholm International Peace Research Institute. 1993. Yearbook of World Armaments and Disarmaments. Oxford: Oxford University Press.

Table 1: Outbreaks of War

| country | year | war started | prev. war |
|---|---|---|---|
| Afghanistan | 1975-79 | 1 | 0 |
| Afghanistan | 1990-94 | 1 | 1 |
| Algeria | 1960-64 | 1 | 1 |
| Algeria | 1990-94 | 1 | 1 |
| Angola | 1960-64 | 1 | 1 |
| Angola | 1975-79 | 1 | 1 |
| Azerbaijan | 1990-94 | 1 | 0 |
| Bosnia | 1990-94 | 1 | 0 |
| Burundi | 1970-74 | 1 | 0 |
| Burundi | 1985-89 | 1 | 1 |
| Burundi | 1990-94 | 1 | 1 |
| Cambodia | 1970-74 | 1 | 1 |
| Chad | 1980-84 | 1 | 0 |
| China | 1965-69 | 1 | 1 |
| Colombia | 1980-84 | 1 | 1 |
| Dominican Republic | 1965-69 | 1 | 0 |
| El Salvador | 1975-79 | 1 | 0 |
| Ethiopia | 1970-74 | 1 | 1 |
| Georgia | 1990-94 | 1 | 0 |
| Guatemala | 1965-69 | 1 | 1 |
| Guatemala | 1970-74 | 1 | 1 |
| Guatemala | 1975-79 | 1 | 1 |
| Guinea-Bissau | 1960-64 | 1 | 0 |
| India | 1980-84 | 1 | 1 |
| Indonesia | 1975-79 | 1 | 1 |
| Iran | 1970-74 | 1 | 1 |
| Iran | 1975-79 | 1 | 1 |
| Iran | 1980-84 | 1 | 1 |
| Iraq | 1970-74 | 1 | 1 |
| Iraq | 1985-89 | 1 | 1 |
| Iraq | 1990-94 | 1 | 1 |
| Jordan | 1970-74 | 1 | 0 |
| Laos | 1960-64 | 1 | 1 |
| Lebanon | 1975-79 | 1 | 1 |
| Liberia | 1985-89 | 1 | 0 |
| Liberia | 1990-94 | 1 | 1 |
| Morocco | 1975-79 | 1 | 1 |
| Mozambique | 1960-64 | 1 | 0 |
| Mozambique | 1975-79 | 1 | 1 |
| Myanmar/Burma | 1965-69 | 1 | 1 |
| Myanmar/Burma | 1980-84 | 1 | 1 |
| Nicaragua | 1975-79 | 1 | 0 |
| Nicaragua | 1980-84 | 1 | 1 |
| Nigeria | 1965-69 | 1 | 0 |
| Nigeria | 1980-84 | 1 | 1 |
| Pakistan | 1970-74 | 1 | 0 |
| Peru | 1980-84 | 1 | 0 |
| Philippines | 1970-74 | 1 | 1 |
| Romania | 1985-89 | 1 | 0 |
| Russia | 1990-94 | 1 | 0 |
| Russia | 1995-99 | 1 | 1 |
| Rwanda | 1960-64 | 1 | 1 |
| Rwanda | 1990-94 | 1 | 1 |
| Somalia | 1980-84 | 1 | 0 |
| Somalia | 1985-89 | 1 | 1 |
| Sri Lanka | 1970-74 | 1 | 0 |
| Sri Lanka | 1980-84 | 1 | 1 |
| Sudan | 1960-64 | 1 | 0 |
| Sudan | 1980-84 | 1 | 1 |
| Tajikistan | 1990-94 | 1 | 0 |
| Turkey | 1990-94 | 1 | 0 |
| Uganda | 1965-69 | 1 | 1 |
| Uganda | 1980-84 | 1 | 1 |
| Vietnam | 1960-64 | 1 | 1 |
| Yemen | 1990-94 | 1 | 0 |
| Yemen AR | 1960-64 | 1 | 1 |
| Yemen PR | 1985-89 | 1 | 0 |
| Yugoslavia | 1990-94 | 1 | 0 |
| Yugoslavia | 1995-99 | 1 | 1 |
| Zaire | 1960-64 | 1 | 1 |
| Zaire | 1990-94 | 1 | 1 |
| Zaire | 1995-99 | 1 | 1 |
| Zimbabwe | 1970-74 | 1 | 0 |

Table 2: Descriptive Statistics

| Variable | sample (n=1174): mean | st.dev. | min | max | n | no civil war (n=1101): mean | st.dev. | min | max | n | civil war (n=73): mean | st.dev. | min | max | n |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| war starts | 0.06 | | | | 1174 | 0 | | | | 1101 | 1 | | | | 73 |
| previous war | 0.34 | | | | 1174 | 0.32 | | | | 1101 | 0.64 | | | | 73 |
| male secondary schooling | 43.3 | 31 | 0.3 | 147 | 965 | 44.3 | 31 | 0.3 | 147 | 902 | 30.2 | 26.6 | 1 | 102 | 63 |
| GDP per capita (const. US$) | 4056 | 4317 | 222 | 33946 | 1014 | 4200 | 4388 | 257 | 33946 | 958 | 1644 | 1353 | 222 | 5832 | 56 |
| GDP per capita growth t-1 | 1.62 | 3.83 | -22.081 | 14.41 | 864 | 1.72 | 3.78 | -22.1 | 14.41 | 815 | -0.05 | 4.28 | -10.66 | 7.09 | 49 |
| population growth t-1 | 2.15 | 1.54 | -6.38 | 16.62 | 991 | 2.12 | 1.57 | -6.38 | 16.62 | 929 | 2.50 | 1.01 | -0.013 | 5.81 | 62 |
| primary commodity exports/GDP | 0.17 | 0.19 | 0 | 2.14 | 1090 | 0.17 | 0.19 | 0 | 2.14 | 1023 | 0.14 | 0.1 | 0.01 | 0.48 | 67 |
| population (millions) | 25.4 | 94.5 | 0.014 | 1200 | 1161 | 24.5 | 92.9 | 0.014 | 1200 | 1089 | 40.0 | 116.4 | 0.54 | 715 | 72 |
| ethnic fractionalization | 40 | 28 | 0 | 93 | 1107 | 39 | 28 | 0 | 93 | 1039 | 50 | 29 | 4 | 90 | 68 |
| religious fractionalization | 36 | 24 | 0 | 79 | 1147 | 36 | 24 | 0 | 79 | 1078 | 36 | 25 | 0 | 69 | 69 |
| geographic dispersion | 0.57 | 0.23 | 0 | 0.97 | 1028 | 0.57 | 0.23 | 0 | 0.97 | 961 | 0.60 | 0.15 | 0.24 | 0.92 | 67 |
| mountainous terrain | 15.8 | 20.73 | 0 | 94.3 | 1174 | 15.12 | 20.28 | 0 | 94.3 | 1101 | 26.73 | 24.2 | 0 | 81 | 73 |
| ethnic dominance (45-90%) | 0.47 | | 0 | 1 | 1107 | 0.47 | | | | 1039 | 0.47 | | | | 68 |
| democracy | 3.90 | 4.21 | 0 | 10 | 987 | 4.21 | 4.21 | 0 | 10 | 987 | 1.67 | 2.58 | 0 | 9 | 67 |
| diaspora | 0.02 | 0.03 | 0 | 0.19 | 654 | 0.02 | 0.03 | 0 | 0.19 | 654 | 0.004 | 0.005 | 0 | 0.16 | 37 |
| income inequality (Gini) | 41.35 | 10.27 | 17.83 | 63.2 | 721 | 40.65 | 10.27 | 17.83 | 63.2 | 721 | 41.97 | 10.30 | 23.38 | 62.3 | 44 |

Table 3: Greed Model

| | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| male secondary schooling | -0.036 (0.011)*** | -0.037 (0.011)*** | -0.028 (0.012)** | -0.027 (0.012)** | |
| ln GDP per capita | | | | | -0.822 (0.288)*** |
| (GDP growth - 3*population growth) t-1 | -0.080 (0.036)** | -0.074 (0.036)** | -0.084 (0.036)** | -0.086 (0.036)** | -0.101 (0.035)*** |
| primary commodity exports/GDP | 34.088 (8.186)*** | 34.109 (8.423)*** | 32.147 (8.497)*** | 32.058 (8.465)*** | 22.923 (6.903)*** |
| (primary commodity exports/GDP)^2 | -67.792 (18.683)*** | -67.172 (18.679)*** | -62.307 (18.954)*** | -62.091 (18.962)*** | -39.192 (14.710)*** |
| ln population | 1.026 (0.191)*** | 0.960 (0.194)*** | 0.832 (0.198)*** | 0.832 (0.197)*** | 0.574 (0.146)*** |
| social fractionalization | -0.0002 (0.0001)** | -0.0003 (0.0001)*** | -0.0003 (0.0001)*** | -0.0003 (0.0001)*** | -0.0003 (0.0001)*** |
| geographic dispersion | -3.517 (1.142)*** | -3.888 (1.181)*** | -3.384 (1.200)*** | -3.289 (1.180)*** | -1.568 (1.051) |
| mountainous terrain | 0.019 (0.009)** | 0.013 (0.009) | 0.014 (0.010) | 0.015 (0.010) | 0.012 (0.009) |
| previous war dummy | | 1.016 (0.405)*** | 0.252 (0.527) | | |
| peace duration | | | -0.004 (0.002)** | -0.004 (0.001)*** | -0.004 (0.001)*** |
| n | 691 | 691 | 688 | 688 | 747 |
| no of wars | 43 | 43 | 43 | 43 | 47 |
| pseudo R2 | 0.25 | 0.27 | 0.29 | 0.29 | 0.26 |
| log likelihood | -120.98 | -117.63 | -114.32 | -114.43 | -129.44 |

Notes: All regressions include a constant. Standard errors in parentheses. ***, **, * indicate significance at the 1, 5 and 10 percent level, respectively.

Table 4: Greed - Diaspora Model

| | 1 | 2 |
|---|---|---|
| ln GDP per capita | -1.032 (0.285)*** | -1.037 (0.286)*** |
| primary commodity exports/GDP | 23.98 (8.542)*** | 23.78 (8.56)*** |
| (primary commodity exports/GDP)^2 | -48.163 (22.1)*** | -47.75 (22.12)** |
| ln population | 0.318 (1.156)** | 0.319 (1.155)** |
| peace duration | -0.002 (0.001)*** | -0.002 (0.001)* |
| diaspora/peace duration | 648.77 (367.57)* | |
| predicted diaspora/peace duration | | 687.09 (393.12)* |
| (diaspora - predicted diaspora)/peace duration | | 768.9 (562.42) |
| n | 595 | 595 |
| no of wars | 29 | 29 |
| pseudo R2 | 0.23 | 0.23 |
| log likelihood | -89.05 | -89.01 |

Notes: All regressions include a constant. Standard errors in parentheses. ***, **, * indicate significance at the 1, 5 and 10 percent level, respectively.
Table 5: Grievance Model

| | 1 | 2 | 3 |
|---|---|---|---|
| social fractionalization | 0.00003 (0.00007) | 0.00000 (0.00009) | 0.00008 (0.00009) |
| ethnic dominance (45-90%) | 0.263 (0.290) | 0.534 (0.342) | 0.567 (0.374) |
| democracy | -0.117 (0.046)*** | -0.091 (0.051)** | -0.138 (0.052)*** |
| geographic dispersion | 0.095 (0.787) | -0.276 (0.958) | -0.152 (1.039) |
| mountainous terrain | 0.015 (0.006)** | 0.011 (0.008) | 0.015 (0.006)** |
| income inequality | | 0.014 (0.016) | |
| land inequality | | | -0.150 (1.159) |
| peace duration | -0.005 (0.001)*** | -0.004 (0.001)*** | -0.005 (0.001)*** |
| n | 884 | 614 | 620 |
| no of wars | 57 | 40 | 38 |
| pseudo R2 | 0.13 | 0.11 | 0.14 |
| log likelihood | -184.17 | -132.11 | -122.61 |

Notes: All regressions include a constant. Standard errors in parentheses. ***, **, * indicate significance at the 1, 5 and 10 percent level, respectively.

Table 6: Combined Greed and Grievance Model

| | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|
| male secondary schooling | -0.026 (0.012)** | | -0.030 (0.012)*** | -0.031 (0.012)*** | -0.036 (0.011)*** | | |
| ln GDP per capita | | | | | | -1.007 (0.281)*** | -1.006 (0.283)*** |
| (GDP growth - 3*population growth) t-1 | -0.079 (0.036)** | | -0.085 (0.037)** | -0.090 (0.037)** | -0.085 (0.036)** | -0.103 (0.035)*** | -0.113 (0.035)*** |
| primary commodity exports/GDP | 31.902 (8.333)*** | | 33.576 (8.514)*** | 34.220 (8.610)*** | 32.089 (8.375)*** | 22.983 (6.806)*** | 23.025 (6.795)*** |
| (primary commodity exports/GDP)^2 | -61.491 (18.489)*** | | -64.234 (18.873)*** | -65.676 (19.186)*** | -62.511 (18.964)*** | -39.293 (14.505)*** | -39.026 (14.394)*** |
| ln population | 0.840 (0.203)*** | | 0.912 (0.211)*** | 0.934 (0.210)*** | 0.946 (0.204)*** | 0.625 (0.148)*** | 0.678 (0.153)*** |
| social fractionalization | -0.0003 (0.0001)** | 0.00003 (0.0001) | -0.0003 (0.0001)** | -0.0003 (0.0001)** | -0.0003 (0.0001)*** | -0.0004 (0.0001)*** | -0.0003 (0.0001)*** |
| democracy | | -0.042 (0.058) | -0.043 (0.058) | | | | |
| ethnic dominance (45-90%) | | 0.895 (0.373)** | 0.909 (0.477)*** | 0.882 (0.374)*** | 0.847 (0.370)** | 0.623 (0.348)* | 0.676 (0.351)** |
| geographic dispersion | -2.443 (1.270)** | 0.023 (1.016) | -3.670 (1.208)*** | -3.600 (1.200)*** | -3.891 (1.172)*** | -1.851 (1.006)* | -2.020 (1.023)* |
| mountainous terrain | 0.003 (0.011) | 0.0004 (0.009) | 0.016 (0.010)* | 0.016 (0.010)* | | | |
| peace duration | 0.001 (0.003) | 0.0004 (0.0015) | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** |
| post-cold war dummy | | | | | | | -0.980 (0.499)** |
| grievance residual | 0.897 (0.481)* | | | | | | |
| greed residual | | 1.066 (0.200)*** | | | | | |
| n | 665 | 665 | 665 | 688 | 688 | 747 | 747 |
| no of wars | 43 | 43 | 43 | 43 | 43 | 47 | 47 |
| pseudo R2 | 0.30 | 0.30 | 0.30 | 0.31 | 0.30 | 0.27 | 0.28 |
| log likelihood | -112.27 | -111.02 | -110.97 | -111.61 | -113.03 | -128.71 | -126.47 |

Notes: All regressions include a constant. Standard errors in parentheses. ***, **, * indicate significance at the 1, 5 and 10 percent level, respectively.

Table 7: Robustness Tests of the Baseline Model

| | 1 Random Effects | 2 Fixed Effects | 3 Pooled Logit plus Time dummies | 4 Rare Events Logit |
|---|---|---|---|---|
| male secondary schooling | -0.036 (0.011)*** | 0.0001 (0.033) | -0.034 (0.012)*** | -0.034 (0.011)*** |
| (GDP growth - 3*population growth) t-1 | -0.085 (0.036)** | -0.063 (0.067) | -0.105 (0.041)*** | -0.080 (0.036)** |
| primary commodity exports/GDP | 32.089 (8.375)*** | 34.885 (14.945)** | 32.978 (8.648)*** | 29.399 (8.667)*** |
| (primary commodity exports/GDP)^2 | -62.511 (18.964)*** | -63.363 (28.824)** | -65.483 (19.822)*** | -55.982 (22.781)*** |
| ln population | 0.946 (0.204)*** | 0.373 (1.374) | 0.960 (0.210)*** | 0.891 (0.155)*** |
| social fractionalization | -0.0003 (0.0001)*** | -0.007 (0.005) | -0.0003 (0.0001)*** | -0.0003 (0.0001)*** |
| ethnic dominance (45-90%) | 0.847 (0.370)** | | 0.913 (0.381)** | 0.807 (0.350)** |
| geographic dispersion | -3.891 (1.172)*** | 78.555 (80.216) | -3.936 (1.196)*** | -3.722 (1.041)*** |
| peace duration | -0.004 (0.001)*** | 0.01 (0.003)*** | -0.004 (0.001)*** | -0.004 (0.001)*** |
| T70-74 | | | -0.960 (0.841) | |
| T75-79 | | | 0.145 (0.776) | |
| T80-84 | | | -0.002 (0.777) | |
| T85-89 | | | 0.477 (0.749) | |
| T90-94 | | | -0.820 (0.855) | |
| T95-99 | | | -0.229 (0.168) | |
| n | 688 | 135 | 668 | 688 |
| no of wars | 43 | 43 | 43 | 43 |
| pseudo R2 | | | 0.33 | |
| log likelihood | -113.03 | -112.27 | -107.44 | -113.03 |

Notes: All regressions include a constant. Standard errors in parentheses. ***, **, * indicate significance at the 1, 5 and 10 percent level, respectively.

Table 8: Social Fractionalization

Column specifications:
1: Baseline Model
2: Baseline Model plus ethnic fractionalization
3: ethnic fractionalization only
4: Baseline Model plus religious fractionalization
5: religious fractionalization only
6: ethnic and religious fractionalization
7: Baseline Model plus ethnic and religious fractionalization
8: social frac. = ethnic plus religious fractionalization
9: social frac. = ethnic minus religious fractionalization
10: social frac. = ethnic times religious fractionalization

| | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| male secondary schooling | -0.036 (0.011)*** | 0.037 (0.012)*** | -0.025 (0.001)*** | -0.037 (0.012)*** | -0.034 (0.011)*** | -0.036 (0.113)*** | -0.037 (0.116)*** | -0.034 (0.011)*** | -0.021 (0.010)** | -0.036 (0.011)*** |
| (GDP growth - 3*population growth) t-1 | -0.085 (0.036)** | -0.083 (0.037)** | -0.093 (0.035)*** | -0.084 (0.037)** | -0.078 (0.036)** | -0.081 (0.036)** | -0.084 (0.037)** | -0.086 (0.036)** | -0.082 (0.036)** | -0.085 (0.036)** |
| primary commodity exports/GDP | 32.089 (8.375)*** | 32.700 (8.425)*** | 23.442 (7.213)*** | 32.019 (8.390)*** | 28.368 (7.872)*** | 29.447 (8.060)*** | 32.862 (8.454)*** | 28.809 (8.003)*** | 21.120 (6.822)*** | 32.147 (8.381)*** |
| (primary commodity exports/GDP)^2 | -62.511 (18.964)*** | -64.283 (19.044)*** | -44.962 (16.501)*** | -62.791 (19.035)*** | -56.891 (18.260)*** | -58.464 (18.633)*** | -64.450 (19.020)*** | -56.049 (18.491)*** | -42.019 (15.923)*** | -62.621 (18.970)*** |
| ln population | 0.946 (0.204)*** | 0.925 (0.206)*** | 0.749 (0.177)*** | 0.941 (0.205)*** | 0.828 (0.186)*** | 0.892 (0.201)*** | 0.924 (0.206)*** | 0.904 (0.199)*** | 0.643 (0.164)*** | 0.946 (0.204)*** |
| social fractionalization | -0.0003 (0.0001)*** | -0.004 (0.001)*** | | -0.0003 (0.0002)* | | | -0.0005 (0.0003)* | -0.014 (0.005)*** | 0.004 (0.007) | -0.0003 (0.0001)*** |
| ethnic fractionalization | | 0.009 (0.010) | -0.0138 (0.007)** | | | -0.007 (0.007) | 0.011 (0.013) | | | |
| religious fractionalization | | | | -0.006 (0.014) | -0.026 (0.009)*** | -0.023 (0.009)** | 0.004 (0.018) | | | |
| ethnic dominance (45-90%) | 0.847 (0.370)** | 0.754 (0.381)** | 0.923 (0.369)*** | 0.816 (0.375)** | 0.773 (0.372)** | 0.833 (0.379)** | 0.756 (0.381)** | 0.940 (0.369)*** | 0.923 (0.373)*** | 0.847 (0.370)** |
| geographic dispersion | -3.891 (1.172)*** | -4.099 (1.201)*** | -2.530 (1.042)** | -4.070 (1.242)*** | -4.129 (1.222)*** | -4.129 (1.230)*** | -4.027 (1.245)*** | -3.711 (1.163)*** | -2.581 (1.053)*** | -3.894 (1.172)*** |
| peace duration | -0.004 (0.001)*** | -0.0004 (0.001)*** | -0.005 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** |
| N | 688 | 688 | 668 | 688 | 688 | 688 | 688 | 688 | 688 | 688 |
| no of wars | 43 | 43 | 43 | 43 | 43 | 43 | 43 | 43 | 43 | 43 |
| pseudo R2 | 0.30 | 0.30 | 0.27 | 0.30 | 0.29 | 0.29 | 0.30 | 0.29 | 0.26 | 0.30 |
| log likelihood | -113.03 | -112.60 | -118.56 | -112.93 | -114.42 | -113.96 | -112.57 | -114.62 | -118.98 | -113.00 |

Notes: All regressions include a constant. Standard errors in parentheses. ***, **, * indicate significance at the 1, 5 and 10 percent level, respectively.

Table 9: Ethnic Dominance

| | 1 Largest group as a proportion of the total population | 2 45-85% | 3 45-90% | 4 45-95% | 5 30-90% | 6 40-90% | 7 50-90% |
|---|---|---|---|---|---|---|---|
| male secondary schooling | -0.034 (0.011)*** | -0.036 (0.011)*** | -0.036 (0.011)*** | -0.032 (0.011)*** | -0.039 (0.011)*** | -0.038 (0.012)*** | -0.035 (0.011)*** |
| (GDP growth - 3*population growth) t-1 | -0.079 (0.036)** | -0.084 (0.036)** | -0.085 (0.036)** | -0.083 (0.036)** | -0.085 (0.036)** | -0.089 (0.037)** | -0.084 (0.036)** |
| primary commodity exports/GDP | 31.444 (8.422)*** | 31.351 (8.289)*** | 32.089 (8.375)*** | 30.546 (8.296)*** | 31.918 (8.285)*** | 30.802 (8.218)*** | 31.292 (8.265)*** |
| (primary commodity exports/GDP)^2 | -62.283 (19.037)*** | -61.182 (18.712)*** | -62.511 (18.964)*** | -59.257 (18.837)*** | -63.073 (18.656)*** | -60.491 (18.433)*** | -61.363 (18.776)*** |
| ln population | 0.842 (0.196)*** | 0.909 (0.201)*** | 0.946 (0.204)*** | 0.870 (0.196)*** | 0.951 (0.204)*** | 0.915 (0.200)*** | 0.907 (0.198)*** |
| social fractionalization | -0.0004 (0.0001)*** | -0.0003 (0.0001)*** | -0.0003 (0.0001)*** | -0.0003 (0.0001)*** | -0.0004 (0.0001)*** | -0.0004 (0.0001)*** | -0.0003 (0.0001)*** |
| ethnic dominance | -1.223 (1.076) | 0.695 (0.373)** | 0.847 (0.370)** | 0.371 (0.554) | 0.752 (0.385)** | 0.736 (0.374)** | 0.659 (0.373)* |
| geographic dispersion | -3.907 (1.186)*** | -4.042 (1.190)*** | -3.891 (1.172)*** | -3.891 (1.172)*** | -4.155 (1.182)*** | -3.931 (1.170)*** | -3.686 (1.150)*** |
| peace duration | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** | -0.004 (0.001)*** |
| N | 688 | 688 | 688 | 688 | 688 | 688 | 688 |
| no of wars | 43 | 43 | 43 | 43 | 43 | 43 | 43 |
| pseudo R2 | 0.28 | 0.29 | 0.30 | 0.28 | 0.29 | 0.29 | 0.29 |
| log likelihood | -115.05 | -113.98 | -113.03 | -115.45 | -113.69 | -113.67 | -114.14 |

Notes: All regressions include a constant. Standard errors in parentheses. ***, **, * indicate significance at the 1, 5 and 10 percent level, respectively.

Table 9A: Ethnic Dominance - Descriptive Statistics

| N=688 | mean | std. dev. | min | max |
|---|---|---|---|---|
| Largest group as a proportion of the total population | 0.70 | 0.25 | 0.18 | 1 |
| 45-85% | 0.33 | 0.47 | 0 | 1 |
| 45-90% | 0.44 | 0.50 | 0 | 1 |
| 45-95% | 0.77 | 0.42 | 0 | 1 |
| 30-90% | 0.60 | 0.49 | 0 | 1 |
| 40-90% | 0.55 | 0.50 | 0 | 1 |
| 50-90% | 0.39 | 0.49 | 0 | 1 |

| Country | year | war started | previous war |
|---|---|---|---|
| Dominican Rep. | 1965 | 1 | 0 |
| Nigeria | 1965 | 1 | 0 |
| Burundi | 1970 | 1 | 0 |
| Pakistan | 1970 | 1 | 0 |
| Sri Lanka | 1970 | 1 | 0 |
| Zimbabwe | 1970 | 1 | 0 |
| El Salvador | 1975 | 1 | 0 |
| Nicaragua | 1975 | 1 | 0 |
| Somalia | 1980 | 1 | 0 |
| Peru | 1980 | 1 | 0 |
| Romania | 1985 | 1 | 0 |
| Turkey | 1990 | 1 | 0 |
| Guatemala | 1965 | 1 | 1 |
| Myanmar/Burma | 1965 | 1 | 1 |
| Uganda | 1965 | 1 | 1 |
| Iraq | 1970 | 1 | 1 |
| Iran | 1970 | 1 | 1 |
| Guatemala | 1970 | 1 | 1 |
| Philippines | 1970 | 1 | 1 |
| Ethiopia | 1970 | 1 | 1 |
| Morocco | 1975 | 1 | 1 |
| Iran | 1975 | 1 | 1 |
| Guatemala | 1975 | 1 | 1 |
| Mozambique | 1975 | 1 | 1 |
| Angola | 1975 | 1 | 1 |
| Indonesia | 1975 | 1 | 1 |
| Nicaragua | 1980 | 1 | 1 |
| Colombia | 1980 | 1 | 1 |
| Iran | 1980 | 1 | 1 |
| Myanmar/Burma | 1980 | 1 | 1 |
| India | 1980 | 1 | 1 |
| Sri Lanka | 1980 | 1 | 1 |
| Sudan | 1980 | 1 | 1 |
| Uganda | 1980 | 1 | 1 |
| Nigeria | 1980 | 1 | 1 |
| Burundi | 1985 | 1 | 1 |
| Iraq | 1985 | 1 | 1 |
| Somalia | 1985 | 1 | 1 |
| Burundi | 1990 | 1 | 1 |
| Algeria | 1990 | 1 | 1 |
| Rwanda | 1990 | 1 | 1 |
| Zaire | 1990 | 1 | 1 |
| Zaire | 1995 | 1 | 1 |
| Total | | 43 | 31 |

Table A2: Descriptive Statistics

| Variable | sample (n=688): mean | st.dev. | min | max | no civil war (n=645): mean | st.dev. | min | max | civil war (n=43): mean | st.dev. | min | max |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| War starts | 0.06 | | | | | | | | | | | |
| Previous war | 0.36 | | | | 0.33 | | | | 0.72 | | | |
| Male secondary schooling | 44.5 | 31 | 1 | 147 | 45.6 | 31 | 1 | 147 | 27.3 | 21.7 | 3 | 87 |
| GDP per capita (const. US$) | 4025 | 4065 | 222 | 18993 | 4182 | 4137 | 290 | 18993 | 1673 | 1360 | 222 | 5832 |
| GDP per capita growth t-1 | 1.63 | 3.51 | -13.1 | 14.41 | 1.72 | 3.43 | -13.1 | 14.41 | 0.25 | 4.46 | -10.66 | 7.09 |
| Population growth t-1 | 2.09 | 1.19 | -6.38 | 6.61 | 2.05 | 1.2 | -6.38 | 6.61 | 2.73 | 0.64 | 0.47 | 4.09 |
| Primary commodity exports/GDP | 0.16 | 0.14 | 0 | 0.94 | 0.16 | 0.14 | 0 | 0.94 | 0.16 | 0.1 | 0.02 | 0.48 |
| Population (millions) | 30 | | | | 30 | | | | 40 | | | |
| Ethnic fractionalization | 41 | 29 | 0 | 93 | 40 | 29 | 0 | 93 | 50 | 30 | 4 | 90 |
| Religious fractionalization | 36 | 25 | 0 | 79 | 36 | 25 | 0 | 79 | 34 | 25 | 2 | 69 |
| Geographic dispersion | 0.6 | 0.21 | 0 | 0.97 | 0.6 | 0.21 | 0 | 0.97 | 0.57 | 0.15 | 0.24 | 0.91 |
| Mountainous terrain | 15.8 | 20.19 | 0 | 82.2 | 15.06 | 19.73 | 0 | 82.2 | 26.81 | 23.77 | 2.4 | 74.5 |
| Ethnic dominance (45-90%) | 0.28 | | | | 0.27 | | | | 0.4 | | | |
| Democracy | 4.26 | 4.31 | 0 | 10 | 4.43 | 4.34 | 0 | 10 | 1.92 | 2.9 | 0 | 9 |
| Diaspora | 0.01 | 0.02 | 0 | 0.16 | 0.01 | 0.02 | 0 | 0.16 | 0.003 | 0.02 | 0 | 0.16 |
| Income inequality (Gini) | 41.35 | 10.04 | 20.97 | 62.3 | 41.14 | 10.0 | 20.97 | 62.3 | 44.33 | 10.29 | 23.38 | 62.3 |

Table A3: A Simple Migration Model

Dependent variable: Diaspora t

| Variable | Coefficient |
|---|---|
| Diaspora t-1 | 1.163 (0.045)*** |
| ln GDP per capita t-1 | -0.002 (0.001)** |
| War dummy | 0.003 (0.003) |
| N | 216 |
| R2 | 0.76 |

The regression is based on the following model:

$dias_{it} = f(dias_{i,t-1}, \ln GDP_{i,t-1}, war_{i,t-1}, T_t)$

where dias denotes the diaspora, measured as the ratio of emigrants in the USA to the total population of the country of origin. The variable "war" is a war dummy measured at t-1: it takes a value of one if the country experienced a civil war in the previous period. The method of estimation is OLS. The data are measured at the beginning of each decade, i.e. 1960, 1970, 1980 and 1990. The regression includes time dummies, T, which are jointly significant. Based on this simple migration model we estimated the size of the diaspora at time t.
$\widehat{dias}_{it} = x_{it}\hat{\beta}$

For countries which experienced a previous civil war23 we used these estimated values to correct for a possible endogeneity problem. We replaced a total of 64 observations. For countries which did not experience a civil war we use the actual diaspora data. We took the averages of this corrected diaspora data measured in 1960 and 1970 (1970 and 1980, 1980 and 1990) in order to obtain values for 1965, 1975 and 1985. For 1995 we use the observations measured in 1990. In Table A4 we present some descriptive statistics of the original diaspora data, the corrected diaspora data and the difference between the two variables.

23 Here we only consider wars after 1960.

Table A4: Diaspora - Descriptive Statistics

| N=704 | mean | st.dev. | min | max |
|---|---|---|---|---|
| diaspora | 0.0161 | 0.0286 | 0.00003 | 0.1948 |
| predicted diaspora | 0.0162 | 0.0283 | 0.00003 | 0.1948 |
| diaspora - predicted diaspora | -0.0001 | 0.0043 | -0.0200 | 0.0625 |

Table A5: Marginal Probabilities

| Variable | Coeff. | Mean of X | median | mean | worst | best | no ethnic dom. | ethnic dom. | plus 10% education | plus 2% gy1 | less 2% popg1 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| education | -0.037 | 44.485 | -1.460 | -1.624 | -0.011 | -5.366 | -1.624 | -1.624 | -1.989 | -1.624 | -1.624 |
| gy1- | -0.085 | -4.696 | 0.433 | 0.397 | 2.804 | -2.857 | 0.397 | 0.397 | 0.397 | 0.226 | -0.114 |
| exports | 32.089 | 0.158 | 3.594 | 5.054 | 8.022 | 0.000 | 5.054 | 5.054 | 5.054 | 5.054 | 5.054 |
| exports2 | -62.511 | 0.025 | -0.784 | -1.551 | -3.907 | 0.000 | -1.551 | -1.551 | -1.551 | -1.551 | -1.551 |
| lnpop | 0.946 | 17.233 | 14.909 | 16.301 | 19.777 | 11.256 | 16.301 | 16.301 | 16.301 | 16.301 | 16.301 |
| fractionaliz. | -0.0003 | 1785.878 | -0.607 | -0.607 | -0.004 | -2.372 | -0.607 | -0.607 | -0.607 | -0.607 | -0.607 |
| ethnic dom. | 0.847 | 0.440 | 0.000 | 0.373 | 0.847 | 0.000 | 0.000 | 0.847 | 0.373 | 0.373 | 0.373 |
| geog dispersion | -3.891 | 0.575 | -2.366 | -2.238 | 0.000 | -3.778 | -2.238 | -2.238 | -2.238 | -2.238 | -2.238 |
| peace | -0.004 | 347.670 | -1.507 | -1.488 | -0.004 | -2.534 | -1.488 | -1.488 | -1.488 | -1.488 | -1.488 |
| constant | -16.443 | | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 |
| X*betahat | | | -4.231 | -1.826 | 11.081 | -22.093 | -2.199 | -1.351 | -2.191 | -1.996 | -2.337 |
| probability | | | 0.014 | 0.139 | 1.000 | 0.000 | 0.100 | 0.206 | 0.101 | 0.120 | 0.088 |

| Variable | Coeff. | Mean of X | Exports=0.26 | Exports=0 | min fraction. | max fraction. | min peace | 5 yrs peace | 10 yrs peace | geog dispersion=1 | geog dispersion=0 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| education | -0.037 | 44.485 | -1.624 | -1.624 | -1.624 | -1.624 | -1.624 | -1.624 | -1.624 | -1.624 | -1.624 |
| gy1- | -0.085 | -4.696 | 0.397 | 0.397 | 0.397 | 0.397 | 0.397 | 0.397 | 0.397 | 0.397 | 0.397 |
| exports | 32.089 | 0.158 | 8.343 | | 5.054 | 5.054 | 5.054 | 5.054 | 5.054 | 5.054 | 5.054 |
| exports2 | -62.511 | 0.025 | -4.226 | | -1.551 | -1.551 | -1.551 | -1.551 | -1.551 | -1.551 | -1.551 |
| lnpop | 0.946 | 17.233 | 16.301 | 16.301 | 16.301 | 16.301 | 16.301 | 16.301 | 16.301 | 16.301 | 16.301 |
| fractionaliz. | -0.0003 | 1785.878 | -0.607 | -0.607 | -0.004 | -2.372 | -0.607 | -0.607 | -0.607 | -0.607 | -0.607 |
| ethnic dom. | 0.847 | 0.440 | 0.373 | 0.373 | 0.373 | 0.373 | 0.373 | 0.373 | 0.373 | 0.373 | 0.373 |
| geog dispersion | -3.891 | 0.575 | -2.238 | -2.238 | -2.238 | -2.238 | -2.238 | -2.238 | -2.238 | 0.000 | -3.891 |
| peace | -0.004 | 347.670 | -1.488 | -1.488 | -1.488 | -1.488 | -0.004 | -0.257 | -0.514 | -1.488 | -1.488 |
| constant | -16.443 | | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 | -16.443 |
| X*betahat | | | -1.212 | -5.329 | -1.223 | -3.590 | -0.342 | -0.595 | -0.851 | 0.412 | -3.479 |
| probability | | | 0.229 | 0.005 | 0.227 | 0.027 | 0.415 | 0.356 | 0.299 | 0.602 | 0.030 |

The probability of a war-start was calculated in the following way:

$probability = \exp(X\hat{\beta}) / (1 + \exp(X\hat{\beta}))$
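The logistic transformation used for Table A5 can be illustrated with a few of the linear predictors reported there. The sketch below is only an arithmetic check under our own naming (`war_probability` and `scenarios` are hypothetical names, not part of the original analysis). It also computes the turning point of the quadratic in primary commodity exports from the baseline coefficients 32.089 and -62.511, which corresponds to the "Exports=0.26" scenario in the table.

```python
import math


def war_probability(x_beta: float) -> float:
    """Logistic transformation: exp(xb) / (1 + exp(xb))."""
    return math.exp(x_beta) / (1.0 + math.exp(x_beta))


# Linear predictors (X * betahat) as reported in Table A5.
scenarios = {
    "median": -4.231,
    "mean": -1.826,
    "no ethnic dominance": -2.199,
    "ethnic dominance": -1.351,
}
for name, x_beta in scenarios.items():
    print(f"{name}: {war_probability(x_beta):.3f}")
# median: 0.014
# mean: 0.139
# no ethnic dominance: 0.100
# ethnic dominance: 0.206

# Turning point of b1*x + b2*x^2, i.e. the exports/GDP share at which
# the estimated risk from primary commodity exports peaks.
b1, b2 = 32.089, -62.511
peak = -b1 / (2.0 * b2)
print(f"risk peaks at exports/GDP = {peak:.2f}")  # 0.26
```

The printed probabilities match the corresponding entries in the "probability" row of Table A5, and the ethnic dominance comparison (0.206 against 0.100) is the doubling of risk described in the conclusion.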