Historical GDP/GDPPC/Population: Difference between revisions

From Pardee Wiki
Jump to navigation Jump to search
No edit summary
No edit summary
 
Line 31: Line 31:
|UsedInPreprocessor
|UsedInPreprocessor
|-
|-
|HistoricalGDP%mean
|HistoricalGDP
|SeriesHistoricalGDP%mean
|SeriesHistoricalGDP
|Mean of historical GDP values extended with a Latent Variable Modeling Framework.
|Mean of historical GDP values extended with a Latent Variable Modeling Framework.
|7/23/2025
|2025/08/21
|0
|0
|-
|-
|HistoricalGDP%meanLog10
|HistoricalGDPPC
|SeriesHistoricalGDP%meanLog10
|SeriesHistoricalGDPPC
|Mean of log10 of historical GDP values extended with a Latent Variable Modeling Framework.
|7/23/2025
|0
|-
|HistoricalGDP%sd
|SeriesHistoricalGDP%sd
|SD of historical GDP values extended with a Latent Variable Modeling Framework.
|7/23/2025
|0
|-
|HistoricalGDP%sdlog10
|SeriesHistoricalGDP%sdlog10
|SD of log10 of historical GDP values extended with a Latent Variable Modeling Framework.
|7/23/2025
|0
|-
|HistoricalGDPPC%mean
|SeriesHistoricalGDPPC%mean
|Mean of historical GDP per capita values extended with a Latent Variable Modeling Framework.
|Mean of historical GDP per capita values extended with a Latent Variable Modeling Framework.
|7/23/2025
|2025/08/21
|0
|0
|-
|-
|HistoricalGDPPC%meanlog10
|HistoricalPOP
|SeriesHistoricalGDPPC%meanlog10
|SeriesHistoricalPOP
|Mean of log10 of historical GDP per capita values extended with a Latent Variable Modeling Framework.
|7/23/2025
|0
|-
|HistoricalGDPPC%sd
|SeriesHistoricalGDPPC%sd
|SD of historical GDP per capita values extended with a Latent Variable Modeling Framework.
|7/23/2025
|0
|-
|HistoricalGDPPC%sdlog10
|SeriesHistoricalGDPPC%sdlog10
|SD of log10 of historical GDP per capita values extended with a Latent Variable Modeling Framework.
|7/23/2025
|0
|-
|HistoricalPOP%mean
|SeriesHistoricalPOP%mean
|Mean of historical population values extended with a Latent Variable Modeling Framework.
|Mean of historical population values extended with a Latent Variable Modeling Framework.
|7/23/2025
|2025/08/21
|0
|-
|HistoricalPOP%meanlog10
|SeriesHistoricalPOP%meanlog10
|Mean of log10 of historical population values extended with a Latent Variable Modeling Framework.
|7/23/2025
|0
|-
|Historicalpop%sd
|SeriesHistoricalpop%sd
|SD of historical population values extended with a Latent Variable Modeling Framework.
|7/23/2025
|0
|-
|HistoricalPOP%sdlog10
|SeriesHistoricalPOP%sdlog10
|SD of log10 of historical population values extended with a Latent Variable Modeling Framework.
|7/23/2025
|0
|0
|}
|}
Line 135: Line 81:


4.     GWNO codes and corresponding country names were provided in the supplementary files of the original paper. However, the supplementary only provided corresponding codes for 217 countries/regions and the dataset had 225 countries/regions (missing: 89,99,327, 396,397, 563,564,711). For the additional gwno codes, use this link for corresponding country names: <nowiki>http://ksgleditsch.com/data-4.html</nowiki>.
4.     GWNO codes and corresponding country names were provided in the supplementary files of the original paper. However, the supplementary only provided corresponding codes for 217 countries/regions and the dataset had 225 countries/regions (missing: 89,99,327, 396,397, 563,564,711). For the additional gwno codes, use this link for corresponding country names: <nowiki>http://ksgleditsch.com/data-4.html</nowiki>.
5.    For Yeman, there are ''Yemen (Arab Republic of Yemen)'' and ''Yemen, People’s Republic of;'' and for Viet Nam, there are ''Vietnam (Annam/Cochin China/Tonkin)'' and ''Vietnam, Democratic Republic of'' in the original dataset:
5.1 for the years that only one of them had value, we took the value from whichever that is available for population, GDP, and GDPPC
5.2 for the years that both of them had value, we summed the values for population and GDP; and the values for GDPPC are then calculated from population and GDP

Latest revision as of 03:08, 22 August 2025

Summary

The historical GDP, population, and GDPPC data is pulled from the article "New Estimates of Over 500 Years of Historic GDP and Population Data" by Fariss et al. (2022). The article proposed a dynamic latent variable model to address three major issues in historical GDP, population, and GDP per capita data. Namely 1) missing data, 2) measurement uncertainty, 3) systematic bias across sources.

The authors built a dynamic latent variable model that combined multiple historical and contemporary datasets. The work was supported by the Security and Political Economy (SPEC) Lab at the University of Southern California.

For population and GDP per capita, the latent trait model parameters were constructed based on following formula:

(Farris et al., 2022)









Tables in IFs

Variable Table Definition Last IFs Update UsedInPreprocessor
HistoricalGDP SeriesHistoricalGDP Mean of historical GDP values extended with a Latent Variable Modeling Framework. 2025/08/21 0
HistoricalGDPPC SeriesHistoricalGDPPC Mean of historical GDP per capita values extended with a Latent Variable Modeling Framework. 2025/08/21 0
HistoricalPOP SeriesHistoricalPOP Mean of historical population values extended with a Latent Variable Modeling Framework. 2025/08/21 0

Data Pulling Instructions

Step 1:

To pull data from this paper, go to:

https://journals.sagepub.com/doi/full/10.1177/00220027211054432

In the Acknowledgements section, there is a link to the predicted data:

https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/FALCGS

Data source.png





Step 2:

Download the most updated datafiles (in this example, in 2024).

Data Notes

1.     The paper was published in 2022. In 2024, the author shared an updated version of the datafile.

2.     The data files are in .rds format, if using R, open the file and it will automatically import; if using python, use the read_r function in the pyreadr package.

3.     The datafile included predicted values as well as datasets used in the modeling, slice “latent_gdp”, “latent_gdppc”, “latent_pop” from the indicator column of corresponding datafiles for the predicted values.

4.     GWNO codes and corresponding country names were provided in the supplementary files of the original paper. However, the supplementary only provided corresponding codes for 217 countries/regions and the dataset had 225 countries/regions (missing: 89,99,327, 396,397, 563,564,711). For the additional gwno codes, use this link for corresponding country names: http://ksgleditsch.com/data-4.html.

5. For Yeman, there are Yemen (Arab Republic of Yemen) and Yemen, People’s Republic of; and for Viet Nam, there are Vietnam (Annam/Cochin China/Tonkin) and Vietnam, Democratic Republic of in the original dataset:

5.1 for the years that only one of them had value, we took the value from whichever that is available for population, GDP, and GDPPC

5.2 for the years that both of them had value, we summed the values for population and GDP; and the values for GDPPC are then calculated from population and GDP