Doing Business: Difference between revisions

From Pardee Wiki
Jump to navigation Jump to search
No edit summary
No edit summary
 
(6 intermediate revisions by 2 users not shown)
Line 1: Line 1:
The World Bank has an annual global doing business report available here <[http://www.doingbusiness.org/ http://www.doingbusiness.org/]>. The latest data has been pulled from the 2017&nbsp;report.
The World Bank has an annual global doing business report available here <[http://www.doingbusiness.org/ http://www.doingbusiness.org/]>. The latest data has been pulled from the&nbsp;report updated on February 15, 2019 which covers 189 economies from year 2004 to year 2019.


= Series pulled from the World Bank Doing Business report&nbsp; =
= Series pulled from the World Bank Doing Business report&nbsp; =


{| border="1" cellpadding="0" cellspacing="0" width="64"
{| cellpadding="0" cellspacing="0" border="1" width="64"
|-
|-
| height="20" width="64" | Table
| width="64" height="20" | Table
|-
|-
| height="20" | SeriesGovBetterBusinessIndex
| height="20" | SeriesGovBetterBusinessIndex
Line 15: Line 15:
| height="20" | SeriesGovWBDoingBusConstructionPermitProcedures
| height="20" | SeriesGovWBDoingBusConstructionPermitProcedures
|-
|-
| height="20" | SeriesGovWBDoingBusContainerExportCostUSD
| height="20" | SeriesGovWBDoingBusContainerExportCostUSD<br/>
|-
|-
| height="20" | SeriesGovWBDoingBusContainerImportCostUSD
| height="20" | SeriesGovWBDoingBusContainerImportCostUSD
Line 70: Line 70:
= Instructions on pulling doing business series&nbsp; =
= Instructions on pulling doing business series&nbsp; =


#First I downloaded the data from the World Bank’s Doing Business database page [http://data.worldbank.org/data-catalog/doing-business-database http://data.worldbank.org/data-catalog/doing-business-database]&nbsp;by clicking on the link entitled "Doing Business(Excel)"
#The data can be accessed from the World Bank’s Doing Business database page [http://data.worldbank.org/data-catalog/doing-business-database http://data.worldbank.org/data-catalog/doing-business-database]. By clicking on the tab entitled "Data & Resources", the raw data can be downloaded either through CSV format or Excel format.
#This World Bank file contains three sheets with the names Data, Series, and Footnote. The Doing Business data was pulled from the Data tab. Prior to pulling the data from the Doing Business file, the country names were concorded with IFs country names. This resulted in some of the territories in the original World Bank file being excluded from what was imported into IFs.
#Using Excel files as an example, the file contains three sheets with the names Data, Series, and Footnote. The Doing Business data was acquired from the Data tab. Prior to importing the data into the IFs, the raw data needs some pre-processing. The first step is to concord the country names with the IFs country names since the IFs does not have a specific country concording list for Doing Business data. Then, the next step is to separate the data into several individual&nbsp;files based on different series. After those two procedures, the data should be ready for import.
#Certain series we had in the system are not being measured in the Doing Business data. If the&nbsp;data was included in the original download spreadsheet, then it was nearly always marked as&nbsp;Old methodology. Series that were not included in this data were typically not updated last&nbsp;time this data was pulled as well.&nbsp;
#One thing to note in the country concording process is that there are some sub-areas in the data (e.g., China - Beijing). Those data should be ignored since we only care about the data in certain countries. However, data for areas like 'Hong Kong, China' would be different, we actually need the data for that. Thus, to differentiate from sub-areas and main areas, you should look for that dash symbol '-' (i.e., China - Beijing would be treated as a sub-area).
#Old series that were not updated are as follows: Rigidity of employment, difficulty of hiring,&nbsp;worker redundancy, redundancy index, average days to clear customs, and total tax rate (not a&nbsp;percentage of profit). The first four fall into the Labor Market regulation data, which the World Bank didn't include in&nbsp;their index of ease of doing business. There is separate data for this, however, the variables still&nbsp;don't seem to quite match up. There is no redundancy Index, there is no difficulty of hiring&nbsp;index, or rigidity of employment index. Average days to clear customs used old methodology. Total tax rate&nbsp;% profit was changed to Total tax rate&nbsp;% commercial profits the previous time this was pulled.&nbsp;
#Certain series we had in the system are not being measured in the Doing Business data. Thus, series that were not included in the original downloaded data were typically not updated. Since the organization constantly updates their data and methodology each year, some series will not be concordant through years. Those series will need to be handled carefully.
#Series that were not updated are as follows (11 series in total): BetterBusinessIndex, BusinessRegulationIndex, ConstructionPermitDays, ConstructionPermitProcedures, EmpRigiIndex, ExportDocumentationNeeded, ImportDocumentationNeeded,&nbsp;HiringDiffIndex, RedundancyCostperWeekSalary, RedundancyIndex, TotalTaxRate. Note that the Tax%Profit series was updated using&nbsp;the "Total tax rate (% of profit)" in the raw downloaded data.
#Series that have been updated but need to be put on hold due to inconsistency in values through years are:&nbsp;SeriesGovWBDoingBusContainerExportCostUSD, SeriesGovWBDoingBusContainerImportCostUSD, SeriesGovWBDoingBusCreditTransparency, SeriesGovWBDoingBusImportDays, SeriesGovWBDoingBusLegalRightsStrengthIndex. Those 5 series in the most recent updated data only cover the years 2014-2017 while data in IFs covers 2006-2014. Moreover, the values in the overlapping year 2014 differ so much that we might need to create them&nbsp;as new series.
 
'''2019 Update'''
 
For the 2019 update, the WB Doing Business data was pulled from the website using R. It was then cleaned and concorded with the corresponding variable names and the IFs country concordance list. The R script can be found in the Data Team Shared folder and can be used for future pulls to automate the process. In 2015, there were updates to the Doing Business methodology. Specific changes that affected our pulls were for the following series, and t<span>hese series are no longer reported (stopped in 2015) and are now disaggregated into two separate series for each one.</span>
 
'''<span>GovWBDoingBusImportDays;&nbsp;SeriesGovWBDoingBusContainerImportCostUSD;&nbsp;SeriesGovWBDoingBusContainerExportCostUSD.</span>'''
 
<span>The following new series start in 2015. While they are aggregated to create the new variable, they don not relate to the old variables we have in IFs due to the change in methodology from the source.&nbsp;</span>
 
<font style="background-color: rgb(255, 255, 255);">GovWBDoingBusContainExportCostUSDNew, calculated by adding the Documentary compliance and Border compliance costs for import together.</font>
 
<font style="background-color: rgb(255, 255, 255);">GovWBDoingBusContainImportCostUSDNew, calculated by&nbsp;</font><span style="display: inline !important; float: none; background-color: rgb(255, 255, 255); color: rgb(51, 51, 51); font-family: sans-serif,arial,verdana,&quot;trebuchet ms"; font-size: 13px; font-style: normal; font-variant: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-align: left; text-decoration: none; text-indent: 0px; text-transform: none; -webkit-text-stroke-width: 0px; white-space: normal; word-spacing: 0px;">adding the Documentary compliance and Border compliance costs for export together.</span>
 
<font style="background-color: rgb(255, 255, 255);">GovWBDoingBusImportDaysNew, calculated by adding&nbsp;</font><span style="display: inline !important; float: none; background-color: rgb(255, 255, 255); color: rgb(51, 51, 51); font-family: sans-serif,arial,verdana,&quot;trebuchet ms"; font-size: 13px; font-style: normal; font-variant: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-align: left; text-decoration: none; text-indent: 0px; text-transform: none; -webkit-text-stroke-width: 0px; white-space: normal; word-spacing: 0px;">the Documentary compliance and Border compliance time for import together.</span><font style="background-color: rgb(255, 255, 255);"></font>


= Model implications =
= Model implications =

Latest revision as of 19:09, 30 April 2019

The World Bank has an annual global doing business report available here <http://www.doingbusiness.org/>. The latest data has been pulled from the report updated on February 15, 2019 which covers 189 economies from year 2004 to year 2019.

Series pulled from the World Bank Doing Business report 

Table
SeriesGovBetterBusinessIndex
SeriesGovBusinessRegulationIndex
SeriesGovWBDoingBusConstructionPermitDays
SeriesGovWBDoingBusConstructionPermitProcedures
SeriesGovWBDoingBusContainerExportCostUSD
SeriesGovWBDoingBusContainerImportCostUSD
SeriesGovWBDoingBusContractEnforceDays
SeriesGovWBDoingBusCostClose
SeriesGovWBDoingBusCreditTransparency
SeriesGovWBDoingBusDirectorLiability
SeriesGovWBDoingBusDisclosureIndex
SeriesGovWBDoingBusEmpRigiIndex
SeriesGovWBDoingBusExportDocumentationNeeded
SeriesGovWBDoingBusHiringDiffIndex
SeriesGovWBDoingBusImportDays
SeriesGovWBDoingBusImportDocumentationNeeded
SeriesGovWBDoingBusinessProcedures
SeriesGovWBDoingBusinessPropRegDays
SeriesGovWBDoingBusinessRanking
SeriesGovWBDoingBusinessStartDays
SeriesGovWBDoingBusInvestorProtection
SeriesGovWBDoingBusLegalRightsStrengthIndex
SeriesGovWBDoingBusPropertyProceduresRequired
SeriesGovWBDoingBusRedundancyCostperWeekSalary
SeriesGovWBDoingBusRedundancyIndex
SeriesGovWBDoingBusShareholderSuits
SeriesGovWBDoingBusTax%Profit
SeriesGovWBDoingBusTaxHoursperPerson
SeriesGovWBDoingBusTimeClose
SeriesGovWBDoingBusTotalTaxRate

Instructions on pulling doing business series 

  1. The data can be accessed from the World Bank’s Doing Business database page http://data.worldbank.org/data-catalog/doing-business-database. By clicking on the tab entitled "Data & Resources", the raw data can be downloaded either through CSV format or Excel format.
  2. Using Excel files as an example, the file contains three sheets with the names Data, Series, and Footnote. The Doing Business data was acquired from the Data tab. Prior to importing the data into the IFs, the raw data needs some pre-processing. The first step is to concord the country names with the IFs country names since the IFs does not have a specific country concording list for Doing Business data. Then, the next step is to separate the data into several individual files based on different series. After those two procedures, the data should be ready for import.
  3. One thing to note in the country concording process is that there are some sub-areas in the data (e.g., China - Beijing). Those data should be ignored since we only care about the data in certain countries. However, data for areas like 'Hong Kong, China' would be different, we actually need the data for that. Thus, to differentiate from sub-areas and main areas, you should look for that dash symbol '-' (i.e., China - Beijing would be treated as a sub-area).
  4. Certain series we had in the system are not being measured in the Doing Business data. Thus, series that were not included in the original downloaded data were typically not updated. Since the organization constantly updates their data and methodology each year, some series will not be concordant through years. Those series will need to be handled carefully.
  5. Series that were not updated are as follows (11 series in total): BetterBusinessIndex, BusinessRegulationIndex, ConstructionPermitDays, ConstructionPermitProcedures, EmpRigiIndex, ExportDocumentationNeeded, ImportDocumentationNeeded, HiringDiffIndex, RedundancyCostperWeekSalary, RedundancyIndex, TotalTaxRate. Note that the Tax%Profit series was updated using the "Total tax rate (% of profit)" in the raw downloaded data.
  6. Series that have been updated but need to be put on hold due to inconsistency in values through years are: SeriesGovWBDoingBusContainerExportCostUSD, SeriesGovWBDoingBusContainerImportCostUSD, SeriesGovWBDoingBusCreditTransparency, SeriesGovWBDoingBusImportDays, SeriesGovWBDoingBusLegalRightsStrengthIndex. Those 5 series in the most recent updated data only cover the years 2014-2017 while data in IFs covers 2006-2014. Moreover, the values in the overlapping year 2014 differ so much that we might need to create them as new series.

2019 Update

For the 2019 update, the WB Doing Business data was pulled from the website using R. It was then cleaned and concorded with the corresponding variable names and the IFs country concordance list. The R script can be found in the Data Team Shared folder and can be used for future pulls to automate the process. In 2015, there were updates to the Doing Business methodology. Specific changes that affected our pulls were for the following series, and these series are no longer reported (stopped in 2015) and are now disaggregated into two separate series for each one.

GovWBDoingBusImportDays; SeriesGovWBDoingBusContainerImportCostUSD; SeriesGovWBDoingBusContainerExportCostUSD.

The following new series start in 2015. While they are aggregated to create the new variable, they don not relate to the old variables we have in IFs due to the change in methodology from the source. 

GovWBDoingBusContainExportCostUSDNew, calculated by adding the Documentary compliance and Border compliance costs for import together.

GovWBDoingBusContainImportCostUSDNew, calculated by adding the Documentary compliance and Border compliance costs for export together.

GovWBDoingBusImportDaysNew, calculated by adding the Documentary compliance and Border compliance time for import together.

Model implications

What are the biggest differences (in both relative and absolute terms) in 2015? 2050? 2100?

What are the effects of these changes on other variables in the model?

Any other major changes or anomalies?