Data

Data hosted at Institutions Hub

Clio Infra wishes to express its gratitude to those who allowed us to host their data. When you use these data do not forget to refer to the author’s articles/papers.  

Topic

Datasets

Political institutions and governance

Adjusted state antiquity dataset 

Extraction ratio

Government revenues relative to GDP

Latent democracy indicator, 1850-2000

 

Conflicts and wars

 

Conflict Catalog (Violent Conflicts 1400 A.D. to the Present  in Different Regions of the World)

Homicide dataset, 1800-2000

Other institutions

 

Foundation of universities dataset

 

 

Links to external data

These data are not hosted on our server, we only supply links to their respective homepages. Please always consult the original homepage regarding contents, definitions, coverage and the terms of use.

 

Topic

Datasets

Political institutions and governance

 

Database of Political Institutions 2010

World Governance Indicators

Comparative Political Datasets I

Comparative Political Datasets II

Comparative Political Datasets III

Polity IV

Democracy and Dictatorship

Vanhanen’s democracy (polyarchy) dataset

State Antiquity (Statehist)

IDEA (Institute for Democracy and Electoral Assistance) voters turnout database

Legal system

 

Judicial Checks and Balances

 

Conflicts and wars

 

UCDP/PRIO Armed Conflict Dataset
UCDP Non-State Conflict Dataset
Correlates of War, Inter-state wars dataset
Correlates of War, Intra-state wars dataset

Economic institutions

 

Economic Freedom of the World

Institutional Characteristics of Trade Unions, Wage Setting, State Intervention and Social Pacts 

Colonial institutions

 

Geodist

Colonial/Dependency Contiguity, 1816-2002 

Transatlantic slave trade

ethnicity, language and religion

Ethnic, Linguistic and Religious Fractionalization

 

Religion adherence data

Ethnographic data on societies

 

Ethnographic Atlas The Standard Cross-cultural Sample

 

 

Political institutions and governance:

  1. Adjusted state antiquity dataset 

Authors: original state antiquity data: Louis Putterman and Valerie Bockstette, adjustments: Jan Luiten van Zanden
Content: state antiquity scores 1801-1950, with adjustments for some countries.
excel format: Click here for State Antiquity.xls hosted on our server
data description: Click here for State Antiquity Dataset.doc hosted on our server

 

2. Extraction ratio

Authors: Jan Luiten van Zanden, Joerg Baten, Peter Foldvari, Bas van Leeuwen

Content: Income inequality, extraction ratio for every benchmark years between 1820-2000. The extraction ratio is defined as the ratio of the observed income inequality and the theoretical maximum of income inequality. The later is estimated under the assumption that the elite (assumed here to be 0.1 or 1% of the population) can and does expropriate all incomes above the subsistence level (assumed to be 400 G-K dollars in 1990 prices) from the non-elite. The discrepancy between the observed and the theoretical ceiling of income inequality is a measure of the power of the elite.   

excel format: Click here for Extraction ratio.xls hosted on our server

data description: the methodology of income inequality estimates is described here, the methodology of extraction ratio is described here.

If you use this dataset please cite: Zanden, J .L. van, Baten,. J., Foldvari, P. and Leeuwen, B. van (2011) The Changing Shape of Global Inequality 1820-2000: Exploring a new dataset, CGEH Working Paper No. 1

 

3. Government revenues relative to GDP

Content: total government revenue, including taxes and excises, as percentage of GDP or comparable measure of aggregate economic activity

OECD countries (1800-2007)
Collected by Pim de Zwart
excel format: click here to download the data in excel format (oecd.xls)
data description: a description of sources and definitions can be found here (notes.doc)

African (1950-2005) and Asian (1900 2005) countries
collected by Peter Foldvari
excel format: click here to download the data in excel format (asia,africa.xls)
data source: Mitchell, Brian R., International Historical Statistics, Africa, Asia and Oceania: 1750-2005 (London: Palgrave Macmillan, 2007).

 

4. Latent democracy indicator, 1850-2000

Content:  a latent democracy indicator, extracted from five components of the PolityIV projects dataset (XRCOMP, XROPEN, XCONST, PARREG, PARCOMP) and two components of the Index of Democracy by Vanhanen (participation and competition) by a measurement error model factor model. The number of available countries varies between 38 (1850) and 139 (2000).

excel format: click here to download the data in excel format (latentD.xls)

data description: a description of sources and definitions can be found in Foldvari, P.: A latent democracy measure 1850-2000, Utrecht University, Centre for Global Economic History, Working paper no. 59., June 2014. When using this dataset please cite above working paper.

 

Conflicts and wars

  1. Conflict Catalog (Violent Conflicts 1400 A.D. to the Present  in Different Regions of the World)

Authors: Peter Brecke
Contents: 3708 conflicts, data on parties, fatalities, date and duration.
Link to data in excel format: Conflict Catalog 18 vars.xls hosted on our server
Click here for data description.

 

2. Homicide dataset, 1800-2000

Content: The number of homicides per 100.000 inhabitants. The official definition of intentional homicide, “unlawful death deliberately inflicted on one person by another person” (OECD, 2011), is used. The dataset excludes civilian and military deaths  inflicted during inter-state wars and deaths caused by civil wars (OECD, 2014).

excel format: click here to download the data in excel format (homicide.xls)

data description: click here to download the description in doc format (descriptionhomicide.doc)

When using this dataset please cite: Baten, J, Bierman, W., Foldvari, P. and van Zanden, J. L.: Chapter 8 Personal security since 1820 In How Was Life? Global Well-being since 1820 (Jan Luiten van Zanden, Joerg Baten, Marco Mira d’Ercole, Auke Rijpma, Marcel Timmer eds,), OECD, Paris, 2014

 

Other institutions

1.       Foundation of universities dataset

Author: Peter Foldvari

Content: The number of universities founded in a year within the current borders of a particular country. Coverage:  1500-2013, 95 countries.

excel format: click here to download the data in excel format 

data description: click here to download the description in docx format 

 

 

Links to external data

These data are not hosted on our server, we only supply links to their respective homepages. Please always consult the original homepage regarding contents, definitions, coverage and the terms of use.

 

Political institutions and governance

1. Database of Political Institutions 2010

Authors: Thorsten Beck, George Clarke, Alberto Groff, Philip Keefer, and Patrick Walsh (hosted by World Bank)

Contents: A wide range of indicators on political institution for 180 countries, 1975-2010. Variables cover the executive power, the legislature and the election system.

Click here for link to data (2009) in excel format.

Click here for link to data (2010) in stata format.

Click here for link to data description.

If you use this dataset please cite: Thorsten Beck, George Clarke, Alberto Groff, Philip Keefer, and Patrick Walsh, 2001. "New tools in comparative political economy: The Database of Political Institutions." 15:1, 165-176 (September), World Bank Economic Review.

 

2. World Governance Indicators

Authors: Daniel Kaufmann, Aart Kraay and Massimo Mastruzzi (hosted by World Bank)

Contents: different governace indicators for 213 economies, 1996-2010

Click here for link to data in excel format.

Click here for link to data description. 

 

3. Comparative Political Datasets I

Authors: Klaus Armingeon, David Weisstanner, Sarah Engler, Panajotis Potolidis, Marlène Gerber, Philipp Leimgruber (Institut für Politikwissenschaft, University of Bern)

Contents: 23 OECD countries, 1960-2009

Click here for link to data in excel format.

Click here for link to data in stata format.

Click here for link to data in spss format.

Click here for link to data description.

 

4. Comparative Political Datasets II

Authors: Klaus Armingeon, David Weisstanner, Sarah Engler, Panajotis Potolidis, Marlène Gerber, Philipp Leimgruber (Institut für Politikwissenschaft, University of Bern)

Contents: 29 post-Communist countries, 1989-2007

Click here for link to data in excel format.

Click here for link to data in spss format.

Click here for link to data description.

 

5. Comparative Political Datasets III

Authors: Klaus Armingeon, David Weisstanner, Sarah Engler, Panajotis Potolidis, Marlène Gerber, and Philipp Leimgruber (Institut für Politikwissenschaft, University of Bern)

Contents: 35 OECD and EU countries, 1990-2009

Click here for link to data in excel format.

Click here for link to data in stata format.

Click here for link to data in spss format.

Click here for link to data description.

 

6. Polity IV

Authors: Monty G. Marshall, Keith Jaggers, and Ted Robert Gurr

Contents: 164 countries, 1800-2010, autocracy, democracy index, ranging from -10 to 10

Click here for link to data in excel format.

Click here for link to data in spss format.

Click here for link to data description.  

 

7. Democracy and Dictatorship

Authors: José Antonio Cheibub, Jennifer Gandhi and James Raymond Vreeland

Contents: 204 countries, 1946-2008, types of

Click here for link to dataset in excel format.

Click here for link to dataset in spss format.

Click here for link to data description.

If you use this dataset, please cite: Antonio Cheibub, Jennifer Gandhi and James Raymond Vreeland "Democracy and dictatorship revisited" Public Choice  Volume 143, Numbers 1-2 (2010), 67-101.

 

8. Vanhanen’s democracy (polyarchy) dataset

Authors: Tatu Vanhanen

Contents: 188 countires, 1810-2010, calcualted from election outcomes.

Click here for link to dataset in excel format.

Click here for link to dataset in stata format.

Click here for link to dataset in spss format:

Click here for link to data description.

 

9. State Antiquity (Statehist)

Authors: Louis Putterman and Valerie Bockstette

Contents: 149 countries, scores of the presence of super-tribal polity

Click here for link to data in excel format (version 3).

Click here for link to data description. 

 

10. IDEA (Institute for Democracy and Electoral Assistance) voters turnout database

Contents: data on voter turnout since 1945 form 170 countries

Click here for the online data

 

Legal system

  1. Judicial Checks and Balances

Authors: Rafael La Porta, Florencio López-de-Silanes, Cristian Pop-Eleches, and Andrei Shleifer

Contents: 71 countries, cross-section

Click here for link to data in excel format

Click here for link to data description.

If you use this data, plase cite: La Porta, Rafael, Florencio López-de-Silanes, Cristian Pop-Eleches and Andrei Shleifer. 2004. “Judicial checks and balances”. Journal of Political Economy 112 (April): 445-470.

 

Conflicts and wars

1. UCDP/PRIO Armed Conflict Dataset

Authors: Gleditsch, Nils Petter, Peter Wallensteen, Mikael Eriksson, Margareta Sollenberg, and Håvard Strand

Contents:  260 armed conflicts 1946-2010

Click here for link to data in excel format.

Click here for link to data description.

If you use this data, please cite: Gleditsch, Nils Petter, Peter Wallensteen, Mikael Eriksson, Margareta Sollenberg, and Håvard Strand. 2002. “Armed Conflict 1946-2001: A New Dataset.” Journal of Peace Research 39(5).

 

2. UCDP Non-State Conflict Dataset

Authors: Ralph Sundberg, Kristine Eck and Joakim Kreutz

Contents:  784 armed conflicts when none of the parties were government or state, 1989-2013

Click here for link to data in excel format.

Click here for link to data description/ codebook.

If you use this data, please cite:  Sundberg, Ralph, Kristine Eck and Joakim Kreutz "Introducing the UCDP Non-State Conflict Dataset", Journal of Peace Research, March 2012, 49:351-362 

 

3. Correlates of War, Inter-state wars dataset

Authors: Meredith Reid Sarkees and Frank Wayman Contents: 95 inter-state wars (among states and governments), 1816-2000

Click here for link to data in csv format.

Click here for link to data description/ codebook.

If you use this data, please cite:  Sarkees, Meredith Reid and Frank Wayman (2010). Resort to War: 1816 - 2007. CQ Press.

 

4. Correlates of War, Intra-state wars dataset

Authors: Meredith Reid Sarkees and Frank Wayman Contents: 95 intra-state wars (conflicts taking palce within the boundaries of a state), 1816-2000

Click here for link to data in csv format.

Click here for link to data description/ codebook.

If you use this data, please cite:  Sarkees, Meredith Reid and Frank Wayman (2010). Resort to War: 1816 - 2007. CQ Press.

 

Economic institutions

  1. Economic Freedom of the World

Authors: Fraser Institute

Contents: 141 countries, 1970, 1975, 1980, 1985 1990, 1995 and 2000-2009 annually.

Click here for link to data and data description: you are required to install a software (PC) that contains the data and data management tools. Also you can export the required data to excel.

 

2.  Institutional Characteristics of Trade Unions, Wage Setting, State Intervention and Social Pacts (version 4)

Authors: Jelle Visser

Contents: Data on labour unions, collective bargain, government intervention, minimum wages and strike regulations in 46 countries (OECD, EU, emerging economies), 1960-2011

Click here for link to data in excel format and data description.

 

Colonial institutions

  1. Geodist

Authors: CEPII

Contents: data on 225 countries, including their distance, official language and colonial past

Click here for link to data in excel format.

Click here for link to data in stata format.

 

2. Colonial/Dependency Contiguity, 1816-2002 

Authors: Paul Hensel

Contents: All contiguity relationships between states in the international system through their colonies or dependencies, 1816-2002  Click here for link to data in csv format.

Click here for link to data description/ codebook.

If you use this data, please cite: Correlates of War 2 Project. Colonial/Dependency Contiguity Data, 1816-2002. Version 3.0. Online: http://correlatesofwar.org.

 

3. Transatlantic slave trade

Authors: Voyages: The Trans-Atlantic Slave Trade Database by Emory Universit

Contents: data on more than 30000 voyages over the Atlantic countries 1514-1866, with data on the number of slaves, length of voyages, percetange of males and children

Click here for link the online data.

If you use this data please cite: David Eltis, “A Brief Overview of the Trans-Atlantic Slave Trade,” Voyages: The Trans-Atlantic Slave Trade Database http://www.slavevoyages.org/tast/assessment/essays-intro-01.faces (accessed April 27, 2008).

 

Data on ethnicity, language and religion:

  1. Ethnic, Linguistic and Religious Fractionalization

Authors: Alberto Alesina, Arnaud Devleeschauwer, William Easterly,Sergio Kurlat, and Romain Wacziarg

Contents: 190 countries, ethnic, linguistic and religious fractionalization, cross-section

Click here for link to data in excel format.

Click here for link to data description (the original paper).

If you use this data, please cite: Alberto Alesina, Arnaud Devleeschauwer, William Easterly,Sergio Kurlat, and Romain Wacziarg. "Fractionalization" Journal of Economic Growth, vol. 8, no. 2, June 2003, pp. 155-194.

 

2. Religion adherence data

Authors: Robert Barro, Rachel M. McCleary

Contents: 213 countries, 1900, 1970, data on share of religions in population, Herfindahl indices of religious concentration.

Click here for link to data in excel format.

 

Ethnographic data on societies

1. Ethnographic Atlas The Standard Cross-cultural Sample

Authors: George P. Murdock and Douglas R. White

Content: data on different cultural, ethnic and institutional aspect of 186 cultures. This data is corrected for the effects of regional diffusion effects and auto-correaltions.

Click here for the data is SPSS format.

Clisck here for codebook