Historical GDP/GDPPC/Population: Difference between revisions
No edit summary |
No edit summary |
||
| Line 31: | Line 31: | ||
|UsedInPreprocessor | |UsedInPreprocessor | ||
|- | |- | ||
|HistoricalGDP | |HistoricalGDP | ||
|SeriesHistoricalGDP | |SeriesHistoricalGDP | ||
|Mean of historical GDP values extended with a Latent Variable Modeling Framework. | |Mean of historical GDP values extended with a Latent Variable Modeling Framework. | ||
| | |2025/08/21 | ||
|0 | |0 | ||
|- | |- | ||
|HistoricalGDPPC | |||
|SeriesHistoricalGDPPC | |||
|HistoricalGDPPC | |||
|SeriesHistoricalGDPPC | |||
|Mean of historical GDP per capita values extended with a Latent Variable Modeling Framework. | |Mean of historical GDP per capita values extended with a Latent Variable Modeling Framework. | ||
| | |2025/08/21 | ||
|0 | |0 | ||
|- | |- | ||
|HistoricalPOP | |||
|SeriesHistoricalPOP | |||
|HistoricalPOP | |||
|SeriesHistoricalPOP | |||
|Mean of historical population values extended with a Latent Variable Modeling Framework. | |Mean of historical population values extended with a Latent Variable Modeling Framework. | ||
| | |2025/08/21 | ||
|0 | |0 | ||
|} | |} | ||
| Line 135: | Line 81: | ||
4. GWNO codes and corresponding country names were provided in the supplementary files of the original paper. However, the supplementary only provided corresponding codes for 217 countries/regions and the dataset had 225 countries/regions (missing: 89,99,327, 396,397, 563,564,711). For the additional gwno codes, use this link for corresponding country names: <nowiki>http://ksgleditsch.com/data-4.html</nowiki>. | 4. GWNO codes and corresponding country names were provided in the supplementary files of the original paper. However, the supplementary only provided corresponding codes for 217 countries/regions and the dataset had 225 countries/regions (missing: 89,99,327, 396,397, 563,564,711). For the additional gwno codes, use this link for corresponding country names: <nowiki>http://ksgleditsch.com/data-4.html</nowiki>. | ||
5. For Yeman, there are ''Yemen (Arab Republic of Yemen)'' and ''Yemen, People’s Republic of;'' and for Viet Nam, there are ''Vietnam (Annam/Cochin China/Tonkin)'' and ''Vietnam, Democratic Republic of'' in the original dataset: | |||
5.1 for the years that only one of them had value, we took the value from whichever that is available for population, GDP, and GDPPC | |||
5.2 for the years that both of them had value, we summed the values for population and GDP; and the values for GDPPC are then calculated from population and GDP | |||
Latest revision as of 03:08, 22 August 2025
Summary
The historical GDP, population, and GDPPC data is pulled from the article "New Estimates of Over 500 Years of Historic GDP and Population Data" by Fariss et al. (2022). The article proposed a dynamic latent variable model to address three major issues in historical GDP, population, and GDP per capita data. Namely 1) missing data, 2) measurement uncertainty, 3) systematic bias across sources.
The authors built a dynamic latent variable model that combined multiple historical and contemporary datasets. The work was supported by the Security and Political Economy (SPEC) Lab at the University of Southern California.
For population and GDP per capita, the latent trait model parameters were constructed based on following formula:
Tables in IFs
| Variable | Table | Definition | Last IFs Update | UsedInPreprocessor |
| HistoricalGDP | SeriesHistoricalGDP | Mean of historical GDP values extended with a Latent Variable Modeling Framework. | 2025/08/21 | 0 |
| HistoricalGDPPC | SeriesHistoricalGDPPC | Mean of historical GDP per capita values extended with a Latent Variable Modeling Framework. | 2025/08/21 | 0 |
| HistoricalPOP | SeriesHistoricalPOP | Mean of historical population values extended with a Latent Variable Modeling Framework. | 2025/08/21 | 0 |
Data Pulling Instructions
Step 1:
To pull data from this paper, go to:
https://journals.sagepub.com/doi/full/10.1177/00220027211054432
In the Acknowledgements section, there is a link to the predicted data:
https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/FALCGS
Step 2:
Download the most updated datafiles (in this example, in 2024).
Data Notes
1. The paper was published in 2022. In 2024, the author shared an updated version of the datafile.
2. The data files are in .rds format, if using R, open the file and it will automatically import; if using python, use the read_r function in the pyreadr package.
3. The datafile included predicted values as well as datasets used in the modeling, slice “latent_gdp”, “latent_gdppc”, “latent_pop” from the indicator column of corresponding datafiles for the predicted values.
4. GWNO codes and corresponding country names were provided in the supplementary files of the original paper. However, the supplementary only provided corresponding codes for 217 countries/regions and the dataset had 225 countries/regions (missing: 89,99,327, 396,397, 563,564,711). For the additional gwno codes, use this link for corresponding country names: http://ksgleditsch.com/data-4.html.
5. For Yeman, there are Yemen (Arab Republic of Yemen) and Yemen, People’s Republic of; and for Viet Nam, there are Vietnam (Annam/Cochin China/Tonkin) and Vietnam, Democratic Republic of in the original dataset:
5.1 for the years that only one of them had value, we took the value from whichever that is available for population, GDP, and GDPPC
5.2 for the years that both of them had value, we summed the values for population and GDP; and the values for GDPPC are then calculated from population and GDP
