China's Data Sources

From Pardee Wiki
Jump to navigation Jump to search

China Statistical Yearbooks

The China Statistical Yearbooks are the single most important data source for the China sub-regional model because it is where the majority of the data came from. The first version of the China sub-regional model used only the 2006 China Statistical Yearbook, whose data was only available on a disc at the time.These statistical yearbooks are published annually and can be now be accessed through the National Bureau of Statistics China website here: http://www.stats.gov.cn/english/Statisticaldata/AnnualData/

The chapter and table numbers change from year to year, but the chapter and table names remain the same. Within the metadata the table number recorded as the source locations is the series location within the 2016 China Statistical Yearbook. 

There are some notable problems with some of the data from this source that has caused legitimate concerns about the overall quality of data, despite it being an official national data source. For instance, the GDP data in 2005 yuan appears to have been improperly sorted and some regions have the incorrect data. This particular issue was reported to the National Bureau of Statistics, but there has been no response. This data should be used with care and future updates may require more detailed vetting than typical data updates. 

Below are all of the preprocessor series that were sourced from the China Statistical Yearbooks:

Variables
AgProdCereals
EnReserCoal
GovCon%GDP
Illiteracy%
LandCrop
PopulationUrban
VaddInd%
AgProdFruitsExclMelons
EnProdCoal
GovtCurRev%GDP
ImportsMerchandise
LandCrop%
PopulationYouthDep%
VaddMan%
AgProdGrain
EnProdElec
GovtEdPub%GDP
Labor
LandForest
R&D%GDPWDI
VaddSer%
AgProdMeat
EnProdOil
GovtHl%GDP
LaborEmploy%
LandGrazing
R&DGovt%GNP
XFDIInflows%GDP
EDPriPTR
ExportMerchandise
HouseCon%GDP
LaborSecInd%Tot
LandIrrigated
TaxCorp%Tot

EdPriTransition2Sec%Total
GDP2005
Households
LaborSecSer%Tot
LandIrAreaEquipFAO
TaxGoodSer%CurRev

EnConElec
GovtCalcExpendTot%GDP
ICTTelephoneLinesPer1000
LandAgri
LiteracyAdultTotal%
VaddAg%

China Data Center

The China Data Center Database is published by the University of Michigan and has historical data from 1949 through 2003. This database was received through a library grant. The China Data Center produces other data that have more recent data, but we do not have access to these products at this time. The longer historical series is interesting for historical data analysis, but it is too old for model initialization. Because the sub-regional model's purpose is forecasting, much of this database was ommitted from the model. However, this data was used in cases where there was not another source available or when the historical data blended well with more recent data. 

The preprocessor series that came from the China Data Center database are listed below:

Variables
AgProdCereals
CDR
EnProdOil
Labor
LandIrAreaEquipFAO
AgProdFruitsExclMelons
EDPriPTR
GovCon%GDP
LaborEmploy%
LiteracyAdultTotal%
AgProdGrain
EdPriTransition2Sec%Total
HouseCon%GDP
LaborSecInd%Tot
XFDIInflows%GDP
AgProdMeat
EnProdCoal
ICTTelephoneLinesPer1000
LaborSecSer%Tot

CBR
EnProdElec
Illiteracy%
LandCrop%

Census of China

The Chinese government conducts a national census every decade. The most recent census took place in 2010. The census data that is in the China sub-regional model came with the China Data Center database and included census data from the 1990, 2000, and 2010 censuses.

The only data that is in the current version of the sub-regional model of China that came from the Census of China is the age-sex cohort data which is in IFsChina.mdb.

Human Development Reports of China

Humand Development Reports are published by the UNDP and there are several on China and its sub-regions. These reports focus on human development based upon educational attainment, life expectancy, and gross national income per capita. These reports were used for the series listed below:

Variables
CBR
EdSecEnrollGross%Total
EdTerEnrollNetTotalPcnt
TFRMedUNPD
CDR
EdSecEnrollNet
LifExpectFemale

EdPriEnrollNetTotalPcnt
EdTerEnrollGross%Total
LifExpectMale

 

DataPlanet

The DataPlanet database is a database that the University of Denver Library had temporary access to during the fall of 2014. There are over 700 Excel tables that were downloaded from this database before access to it ceased. Some of these tables contain relevant data, while most do not. The relevance of each table and its contents have been catalogued and notated in the metadata. None of these tables of data are in the sub-regional model currently. 

Ministry of Education of the People's Republic of China

The Ministry of Education of the People's Republic of China has data available online here: http://en.moe.gov.cn/Resources/Statistics/

There is data from 2010 through 2015 available in html format on enrollment, intake, and graduation rates for China's regions. None of this data is currently in the sub-regional model, but it is noted in the metadata table.