Population Data: Difference between revisions

From Pardee Wiki
Jump to navigation Jump to search
No edit summary
No edit summary
Line 12: Line 12:


China's Age-Sex Cohorts was data that was received in the University of Michigan's [[China_Data_Center|China Data Center]] database, but the data was originally collected and published in the 2010 Census. The cohorts for all Provinces were in five year cohorts up to 100+ or the 21st cohort, except cohort 1. The first cohort or population 0-4 years of age, was calculated by summing the infant population and the population 1 to 4 years of age.
China's Age-Sex Cohorts was data that was received in the University of Michigan's [[China_Data_Center|China Data Center]] database, but the data was originally collected and published in the 2010 Census. The cohorts for all Provinces were in five year cohorts up to 100+ or the 21st cohort, except cohort 1. The first cohort or population 0-4 years of age, was calculated by summing the infant population and the population 1 to 4 years of age.
= '''ForecastNetMigrationRateUNPD''' =
The current data series in use comes from a paper out of the University of Washington. The authors used a variety of data sources to estimate interprovincial migration. This data was compiled and published as five-year averages of net migration rates. Because this series is an annualized series, the annual net migration rates were estimated through a simple multi-step process. First the five-year averages were assigned as the middle year value for the annualized data. For instance, if the observation for Anhui province was -0.5 for 1995-2000 then -0.5 was assigned to 1998. The data was then interpolated between the observations to produce annualized data for a fifteen-year span.
This series should be treated as a placeholder, not as a legitimate series. Further research is required to reproduce the data from the paper using the paper's methodology. The paper's appendices need to be used to locate the original data sources for all the data and for understanding the methods used for estimation. This series should be replicated and hypothetically there may be an update available, which would give the model more recent data. 

Revision as of 15:07, 17 April 2017

Population

The provincial population series for China runs from 1995 through 2015. The original data lacked observations for Chongqing in 1995 and 1996 because Chongqing did not gain its status as a municipality until 1997. Chongqing was part of Sichuan province prior to 1997 and thus, Sichuan province's data included Chongqing in 1995 and 1996 and there was a substantial drop in Sichuan province in 1997. To remedy this issue, the ratio of population in Chongqing relative to the sum of Sichuan province and Chongqing in 1997 was used to estimate the population in Chongqing in 1995 and 1996. This population estimate for Chongqing was subtracted from Sichuan's population in 1995 and 1996. This provides a smoother and more accurate historical series. and data for Chongqing and Sichuan province were adjusted/estimated for 1995 and 1996.

Despite this estimation, there are still some unexplained spikes and drops in the population data around 2000. Guangdong has the most noticeable shift, where in 1999 population is 73.9 million and in 2000 population jumps up to 86.9. These shifts in the data are indeed in the data and not human errors (at least not on our end). The data source was contact regarding this and other potential data issues that were found while pulling data, they have yet to respond. 

RTENOTITLE

The provincial population series for China's subregional model came from the China_Statistical_Yearbooks from 1996-2016. These yearbooks are available online on the National Bureau of Statistics of China website. There are other sources available for provincial population data in China, which can be read about at Alternative Population Data_Sources. The population data is published in tens of thousands, rather than the millions that are used in the full 186 version of IFs. Rather than converting the population data into millions, the population series was imported as it was published in tens of thousands and ApplyMultAll is selected in the data dictionary. This normalizes the historical data to the population of China in the full 186 version of IFs. This choice to normalize rather than change units is meant to decrease the likelihood of human error, and simplify future data updates.

Age-Sex Cohorts

China's Age-Sex Cohorts was data that was received in the University of Michigan's China Data Center database, but the data was originally collected and published in the 2010 Census. The cohorts for all Provinces were in five year cohorts up to 100+ or the 21st cohort, except cohort 1. The first cohort or population 0-4 years of age, was calculated by summing the infant population and the population 1 to 4 years of age.

ForecastNetMigrationRateUNPD

The current data series in use comes from a paper out of the University of Washington. The authors used a variety of data sources to estimate interprovincial migration. This data was compiled and published as five-year averages of net migration rates. Because this series is an annualized series, the annual net migration rates were estimated through a simple multi-step process. First the five-year averages were assigned as the middle year value for the annualized data. For instance, if the observation for Anhui province was -0.5 for 1995-2000 then -0.5 was assigned to 1998. The data was then interpolated between the observations to produce annualized data for a fifteen-year span.

This series should be treated as a placeholder, not as a legitimate series. Further research is required to reproduce the data from the paper using the paper's methodology. The paper's appendices need to be used to locate the original data sources for all the data and for understanding the methods used for estimation. This series should be replicated and hypothetically there may be an update available, which would give the model more recent data.