Datasets

We are happy to welcome you to the Compendium Project dataset page. Essays of special relevance to data issues may be accessed directly from this page, and we have a listing of web datasets graciously provided by one of our authors, Paul Hensel, that will connect you to datasets of various sorts.

In the near future we also hope to provide a list of datasets in international studies that seeks to be comprehensive, and that will include materials not quite as complete, or directly related to international relations, as those in the Hensel list.

If you have any feedback or questions about this or any other element in our project please contact Compendium General Editor Bob Denemark

Compendium articles concerning data issues

Links to datasets

Compiled by Paul R. Hensel

Notes:
The “Source” entry for each data set indicates the best place to obtain it. Where possible, this is the official site where the data set is maintained, which should always have the most up-to-date version. Also note that some data sets that are listed as being available by purchase only can be purchased by an entire institution through a site license; interested users may want to check with their school's library to see whether these resources are already available to them before paying the fee to access the data individually.

The “Typical Uses” entry for each data set indicates the most common uses for the data in recent years. Note that some data sets are very broad, and can be used for multiple purposes. Where relevant, data sets are listed under several different categories, as with data sets that are commonly used for studying both armed conflict and conflict management.

We have made every effort to make sure that this information is correct and complete at the time of publication, but there will inevitably be changes as data sets are moved to new hosts or new URLs. Do let us know if any corrections need to be made.

Lists of States / Country Codes

Most of these data sets use one of the following lists of country codes to identify observations.

COW interstate system list

Gleditsch/Ward state list

ISO 3166-1 country codes (as used by the UN and many international agencies)

Interstate Conflict Data

Armed Conflict data (from Uppsala Conflict Data Program and PRIO)

COW Interstate War data

COW Militarized Interstate Dispute (MID) data

DYMID (Maoz' dyadic MID data)

ICB Crisis data

Lacina/Gleditsch Battle Deaths data

Event Data

Conflict and Mediation Event Observations (CAMEO)

Conflict and Peace Databank (COPDAB)

Integrated Data for Events Analysis (IDEA)

Kansas Events Data System (KEDS)

King/Lowe "10 Million International Dyadic Events" data

Protocol for the Analysis of Nonviolent Direct Action (PANDA)

  • Source: This data set does not appear to be available any more, although its coding schemes live on in other work.
  • Typical uses:

World Events Interaction Survey (WEIS)

World Handbook of Political and Social Indicators

Interstate Rivalry

Hewitt's Crisis Density Rivalry data

Klein/Goertz/Diehl Rivalry data

Strategic Rivalry data (from William Thompson)

  • Source: [Available by emailing author]
  • Typical uses:

Contentious Issues

COW Territorial Change data

Huth/Allee Territorial Dispute data

ICOW Territorial Claim, River Claim, and Maritime Claim data

  • Source: http://www.icow.org
  • Typical uses: Armed conflict over territorial, river, and maritime issues

Intrastate, Civil, and Ethnic Conflict Data

Armed Conflict data (from Uppsala Conflict Data Program and PRIO)

COW Extrastate War data (formerly Extrasystemic War data)

COW Intrastate War data (formerly Civil War data)

Cross-National Time Series Archive data (Banks)

Doyle/Sambanis "International Peacebuilding" data (from 2000 APSR article)

Fearon/Laitin Ethnicity, Insurgency, and Civil War data

Lacina/Gleditsch Battle Deaths data

Minorities at Risk data

Sambanis "Partition as a Solution to Ethnic War" data (from 2000 WP article)

State Failure Project / Political Instability Task Force

World Handbook of Political and Social Indicators

Human Rights and Repression

CIRI (Cingranelli/Richards) Human Rights data

Political Terror Scale

Terrorism

Global Terrorism Database (from the National Consortium for the Study of Terrorism and Responses to Terrorism or START)

Institute for Counterterrorism (ICT) International Terrorism Database

  • Source: [This doesn't currently appear to be publicly available.]
  • Typical uses:

International Terrorism: Attributes of Terrorist Events (ITERATE)

Memorial Institute for the Prevention of Terrorism (MIPT) Terrorism Knowledge Base

  • Source: [This data set is apparently no longer publicly available.]
  • Typical uses:

Terrorism, Insurgencies, and Guerrillas in Education and Research (TIGER) Suicide Attacks Worldwide data

Terrorism, Insurgencies, and Guerrillas in Education and Research (TIGER) Terrorist Groups Worldwide data

U.S. State Department Country Reports on Terrorism (formerly Patterns of Global Terrorism)

Conflict Management Data

Bercovitch's International Conflict Management data set

  • Source: Obtain by emailing the author?
  • Typical uses:

Doyle/Sambanis "International Peacebuilding" data (from 2000 APSR article)

Huth/Allee Territorial Dispute data

ICB Crisis data

ICOW Territorial Claim, River Claim, and Maritime Claim data

  • Source: http://www.icow.org
  • Typical uses: Management of territorial, river, and maritime issues

Sambanis "Partition as a Solution to Ethnic War" data (from 2000 WP article)

Third Party Interventions in Intrastate Disputes (TPI-Intrastate Disputes) data (from Mullenbach and Dixon)

Economic Data

Cukierman et al. Central Bank data (from Cukierman/Webb/Neyapti 1992 WBER and Cukierman/Webb 1995 WBER articles)

Deininger/Squire "Measuring Income Inequality" data (from 1996 WBER article)

Gleditsch's Expanded GDP data

IMF Balance of Payments data

IMF International Financial Statistics

Penn World Tables

Sturm/de Haan "Inflation in Developing Countries" data (from 2001 Ifo Studien article)

  • Source: [Data summary in article; Full data available by emailing author]
  • Typical uses: Central bank independence

UNCTAD Handbook of Statistics

World Bank Global Development Finance data (formerly World Debt Tables)

World Bank World Development Indicators

  • Source: http://www.worldbank.org/data [available by purchase only]
  • Typical uses: Income, trade, growth, GDP, export dependence, export penetration, oil exports, exchange rates, government spending

Trade and Investment

COW Trade data

Gleditsch's Trade data

IMF Annual Exchange Arrangements and Restrictions data (formerly Annual Report on Exchange Restrictions)

IMF Direction of Trade data

UNCTAD FDI data

UNCTAD Trade Analysis and Information (TRAINS) data

Geographic Data

Contiguity and Distance

COW Contiguity data

Furlong/Gleditsch Length of International Boundaries data

Gleditsch's Minimum Distance and Distance between Capital Cities data

World Distance Tables (from Hengeveld)

Conflict Location Data

Armed Conflict Location and Events Data (ACLED)

COW MID Location data

Environmental and Resource Data

Collier/Hoeffler "Greed and Grievance" data

DIADATA Diamond data

EM-DAT Emergency Events Database

  • Source: http://www.emdat.be/index.html
  • Typical uses: Occurrence of natural disasters, scope of disaster (number of people affected, deaths, economic costs)

Fearon/Laitin Ethnicity, Insurgency, and Civil War data

ICOW River Claim data

  • Source: http://www.icow.org
  • Typical uses: Occurrence of disagreements over international rivers, salience of claimed river, conflictual and cooperative management of river claims

PETRODATA Petroleum data

PRIO Shared River Basin data (from Toset, Gleditsch, and Hegre)

PRIO Shared Rivers data (from Toset, Gleditsch, and Hegre)

Transboundary Freshwater Dispute Database (TFDD)

UNFAO State of the World's Forests data

UNFAO Terrastat data

U.S. Geological Survey's Centennial Earthquake Catalog

World Resource Institute's World Resources data

Political Data

ACLP Democracy and Development data (Alvarez/Cheibub/Limongi/Przeworski; from 2000 book Democracy and Development)

Arzaghi & Henderson decentralization data (from 2005 Journal of Public Economics)

Democratic Electoral Systems around the World (from Golder's 2003 CPS article)

Freedom House data (Freedom in the World, Freedom of the Press)

Institutions and Elections Project (IAEP, collected by Regan and Clark)

International Country Risk Guide

Logic of Political Survival data (by Bueno de Mesquita/Smith/Siverson/Morrow, from 2000 book)

Political Constraint Index/POLCON (introduced in Henisz' 2000 Economics & Politics article)

Polity IV (and earlier Polity data sets)

Polyarchy data (from Tatu Vanhanen)

World Bank Database of Political Institutions (introduced in Beck et al. 2001 WBER article; merged with Keefer checks & balances data)

World Bank Worldwide Governance Indicators

Individual Political Leaders

Archigos (from Geomans/Gleditsch/Chiozza)

Geddes' "Authoritarian Breakdown" data [see especially Appendix]

Logic of Political Survival data (by Bueno de Mesquita/Smith/Siverson/Morrow, from 2000 book)

Peceny/Beer/Sanchez-Terry "Dictatorial Peace" data (from 2002 APSR article)

Power and Capability Data

See also the measures of economic capability discussed above. Many studies measure power/capabilities by GNP or GDP, which is highly correlated with the CINC score, although it is generally not available for as long a time span.

COW National Material Capabilities data

  • Source: http://www.correlatesofwar.org
  • Typical uses: Composite index of national capability (CINC), military personnel, military expenditures, iron/steel production, energy consumption, total population, urban population

Social and Demographic Data

Many of the economic data sets discussed above are also used for sociodemographic data, although the following sources generally focus more on social rather than economic factors in development:

UN World Urbanization Prospects (formerly World Urbanization Report)

WHO World Health Report

Minorities/Diversity/Heterogeneity

Atlas Narodov Mira

  • Source: [Not available in electronic form]
  • Typical uses:

CIA World Factbook

COW Cultural data

  • Source: Not currently available; will eventually be released at http://www.correlatesofwar.org
  • Typical uses: Ethnic, religious, linguistic diversity

Fearon/Laitin Ethnicity, Insurgency, and Civil War data (see description in 2003 Journal of Economic Growth article)

Geo-Referencing of Ethnic Groups (GREG)

Heger/Salehyan's "Ruthless Rulers" data

Minorities at Risk data

Roeder's Ethnolinguistic Fractionalization (ELF) data

Vanhanen "Ethnic Conflict and Ethnic Nepotism" data (from 1999 book and JPR article)

Survey Data

Afrobarometer

Chicago Council of Foreign Relations Public Opinion Survey

Eurobarometer

Latinobarometer/Latinobarometro

World Values Survey

Treaty, Institution, and International Law Data

Alliance Treaty Obligations and Provisions (ATOP) data

  • Source: http://atop.rice.edu/
  • Typical uses: Shared alliance memberships, details of alliance obligations

Continent of International Law (Koremenos et al.)

COW Alliance data

COW International Governmental Organization (IGO) data

Expected Utility Generation and Data Management Program (EUGene)

Gartzke/Jo "Affinity of Nations" data

ICOW Multilateral Treaties of Pacific Settlement (MTOPS) data

Morrow/Jo Laws of War data

Transboundary Freshwater Dispute Database (TFDD)

Voeten's Documenting Votes in the UN General Assembly data

Prominent Replication Data Sets

These data sets are used frequently for replicating or extending previously published research. They offer the advantage of a ready-made set of dependent, independent, and control variables, allowing the user to change one or two elements of a published study to investigate a potential new relationship without having to collect and code the remaining variables.

Expected Utility Generation and Data Management Program (EUGene)

Fearon/Laitin Ethnicity, Insurgency, and Civil War data

Peceny/Beer/Sanchez-Terry "Dictatorial Peace" data (from 2002 APSR article)

Poe/Tate data (from 1994 Poe/Tate APSR article and 1999 Poe/Tate/Keith ISQ article)

Russett/Oneal "Triangulating Peace" data (from 2001 book)

State Failure Project / Political Instability Task Force data

Replication Data Archives

Many journals now require that replication data be made available upon publication of scholarly research. Some users host their replication data sets on their personal web sites, but many opt to use these official archives. The author of this chapter, Paul Hensel, also maintains his own International Relations Data Site with links to additional resources not included on this page.

ICPSR Publication-Related Archive:

International Studies Quarterly:

Journal of Conflict Resolution:

Journal of Peace Research:

International Studies Compendium Project

Link to the ISA Compendium Website

Site Footer

Copyright Wiley Blackwell 2010