ggdc research memorandum 151ggdc.net › publications › memorandum › gd151.pdfof using a single...
TRANSCRIPT
university ofgroningen
groningen growth anddevelopment centre
GGDC RESEARCH MEMORANDUM 151
Cross-country income levels over time: did the developing world suddenly
become much richer?
Robert Inklaar and D.S. Prasada Rao
December 2014
Cross-country income levels over time: did the developing world suddenly
become much richer?
Robert Inklaar
Groningen Growth and Development Centre University of Groningen, The Netherlands
D.S. Prasada Rao
School of Economics The University of Queensland, Australia
December 12, 2014
A previous version of this paper was circulated under the title “Cross-country income levels of time: can ICP 2005 and ICP 2011 be reconciled?” The authors would like to thank Angus Deaton, Arvind Subramanian, Marcel Timmer and seminar participants at the University of Graz and the University of Groningen for helpful comments and the World Bank for providing detailed researcher data underlying ICP 2005 and ICP 2011. This work was undertaken when Rao was a visiting researcher at the Groningen Growth and Development Centre in August, 2014 and at UNU-MERIT during October, 2014. Correspondence to: Robert Inklaar, University of Groningen, Faculty of Economics and Business, PO Box 800, 9700 AV Groningen, The Netherlands ([email protected]).
2
Abstract
The latest global survey on relative prices and income levels showed revisions to income
levels that were larger in lower-income countries, thereby shifting the world income
distribution. The aim of this paper is to establish whether changes in measurement
methodology and price sampling methods between the latest price survey for 2011 and the
previous survey for 2005 can explain these large differences. We construct a counterfactual
set of relative prices for 2005 that harmonizes measurement and we find that the systematic
differences in income levels are substantially reduced or even eliminated, implying that
international income inequality had been overstated.
Key words: purchasing power parities; International Comparison Program (ICP); extrapolation; price levels; price sampling bias; real incomes; inequality
JEL Codes: C43, E01, E31, C83, O57
1
1. Introduction
Our understanding of differences in living standards across countries, international income
inequality and global poverty relies heavily on the periodical global price comparisons of the
International Comparison Program (ICP).1 Without ICP relative prices, comparisons of
income levels across countries would have to rely on exchange rates, which tend to
systematically underestimate the purchasing power of consumers in low-income countries.2
But despite the conceptual appeal of ICP relative prices, they are often also a source of
controversy. For instance, Deaton (2010) showed how results from consecutive ICP rounds
led to upward revisions in income inequality across countries. Similarly, Ciccone and
Jarociński (2010) and Johnson et al. (2013) show how various results from the cross-country
growth literature change depending on the vintage of relative income data that is used.3 More
broadly, changing the methods and model used to measure relative prices can have a notable
effect on the resulting relative income estimates, as shown by Neary (2004), Almås (2012)
and Feenstra et al. (2013).
The most recent source of controversy is the publication of ICP 2011 (World Bank, 2014a,b).
Compared with earlier estimates based on (extrapolations from) ICP 2005 (World Bank,
2008), the average country saw its income and consumption levels increase by about 25
percent. Revisions for developing countries were particularly large, resulting in major
changes to the economic geography of the world and a downward adjustment to global
inequality. Given the size and systematic pattern of these revisions, they can have far-
reaching consequences, ranging from a different number and geographical distribution of the
world’s poor to a different view on income convergence or appropriate policy.4 It is thus no
surprise that the revisions following ICP 2011 have already been the topic of substantial
commentary and debate.5
1 See e.g. Anand and Segal (2008), Deaton (2010), Deaton and Heston (2010), Chen and Ravallion (2010a) and Milanovic (2012). See also Feenstra, Inklaar and Timmer (2013) on how the ICP results are a crucial input in the Penn World Table and World Bank (2013) for details on ICP. 2 The Balassa (1964)-Samuelson (1964) effect, whereby prices of non-traded products tend to be lower in lower-income countries. 3 Differences between data vintages in these studies reflect revisions to relative prices – the focus of our paper – and revisions to national GDP statistics – a source of differences we exclude. 4 See Chen and Ravallion (2010a) on the impact of ICP 2005 on poverty, (e.g.) Pritchett and Summers (2014) on growth and convergence and Aghion, Akcigit and Howitt (2014) on productivity levels and appropriate policy. 5 See Dykstra, Kenny and Sandefur (2014), Chandy and Kharas (2014), Deaton and Aten (2014) and Ravallion (2014).
2
Yet these early commentaries may be misleading because of the major changes to how
relative prices were measured in ICP 2011 compared with ICP 2005.6 A particular concern,
as identified by Deaton and Aten (2014), is the so-called linking of regions. In both ICP 2005
and 2011, prices are first collected and compared across the countries within a region and in
the second stage, the regions are linked to each other to allow for a global price comparison.
Some regions consist mostly of low- and middle-income countries (Africa, Asia-Pacific) and
some mostly of high-income countries (Eurostat/OECD), which means that changes to the
linking approach can shift the prices of lower-income countries relative to higher-income
countries. Since the linking approach was considerably improved and refined in 2011,7
Deaton and Aten (2014) conclude that the linking data and methods are a good starting point
for trying to understand why the results from ICP 2011 were so different from the ICP 2005
results.
The goal of this paper is to identify the impact of the 2011 methodological and price
sampling changes and provide better estimates of the revisions to relative prices and income
levels. To this end, we construct a counterfactual price comparison for 2005 using ICP 2011
methods and adjusting for biases in the sampling of the 2005 price data, in particular
regarding the linking of regions. We draw upon the early diagnostic work of Deaton and Aten
(2014), but provide a more comprehensive and quantitative assessment of the impact of the
measurement changes. Given the theoretical problems in measuring relative prices (Van
Veelen, 2002) and the practical challenges (Deaton and Heston, 2010), even the
counterfactual we construct will show some differences.8 But if those differences remain
systematically larger in lower-income countries, this calls into question the common practice
of using a single set of cross-country relative prices to estimate relative income levels across
many years,9 and the research results that depend on such data.
To construct our counterfactual price comparison for 2005, we draw upon the original data
and methods used in the compilation of ICP 2005 and ICP 2011, including detailed price and
expenditure data, and we go through the full compilation of PPPs as detailed in World Bank
(2008) and World Bank (2014b). In presenting our results, we focus on the implications for
6 See World Bank (2014b) for a list of major improvements in ICP 2011 methodology compared to ICP 2005. 7 See in particular World Bank (2013). 8 See also McCarthy (2013b) for a more practical discussion of why subsequent price comparisons may not be consistent with observed relative inflation patterns. 9 This is standard practice in the World Bank’s World Development Indicators, the Maddison Project Database (Bolt and van Zanden, 2013) and for parts of the Penn World Table (Feenstra et al., 2013).
3
GDP per capita levels, which are most relevant for research into cross-country economic
performance, and consumption per capita, which is used most often in the analysis of poverty
and inequality. We make two sets of adjustments to the original ICP 2005 comparison; in the
first set, we take all 2005 price data as given, but apply the ICP 2011 methods. In this
‘harmonized methodology’ counterfactual, the average upward revision to relative income
and consumption levels goes down from 23-24 percent to 17-18 percent, but especially for
consumption, revisions are still systematically larger in lower-income countries. In the
second set of adjustments, we correct for a number of biases in the price data collected in
2005. Most importantly, we find clear evidence that the price data used in linking the regions
in 2005 was based on a product list that was only representative in higher-income countries; a
possibility that, notably, Deaton (2010) called attention to. In the ‘sampling bias adjusted’
counterfactual, the average upward revision goes down to 9-12 percent; making both sets of
adjustments simultaneously reduces the average revision further, to 4-10 percent. For both
GDP and consumption, lower-income countries no longer show systematically larger upward
revisions. Furthermore, the root mean squared revision decreases from 34-35 percent to 17-
22 percent, which is similar to estimates of the standard error of relative prices.10
The results from this analysis have important implications. Most importantly, the difference
between the original ICP 2005 and ICP 2011 suggested that datasets based on extrapolating
GDP per capita levels from a benchmark comparison to earlier years were inherently biased.
This, in turn, could have had serious consequences for research based on such datasets, which
include the World Development Indicators of the World Bank, the Maddison database (Bolt
and van Zanden, 2014) and parts of the Penn World Table (Feenstra et al., 2013). Our success
in eliminating the systematic differences with our counterfactual price comparison suggests
that such a fundamental reconsideration is not necessary. Furthermore, one concrete outcome
of our analysis is two sets of price comparisons based on the same methods and comparable
data. This means they can be fruitfully used in combination, such as in the part of the Penn
World Table that presents relative price and income levels based on all available ICP
benchmarks. Finally, from a statistical point of view, we have demonstrated the usefulness of
backward revisions to relative prices, which is similar in spirit to the backwards revision of
National Accounts time series when introducing new methods.
10 See e.g. Deaton (2012) and Rao and Hajargasht (2014).
4
2. Basic framework
The basic aim of the ICP is to measure the overall price level P in a country j relative to any
other country k. Implicit in this aim is the idea that the law of one price does not hold,
because if it did hold we would only need to observe market exchange rates to put GDP or
consumption on a comparable basis. The presence of non-traded goods in an economy can be
one reason for the law of one price to fail, but also see Rogoff (1996) for other reasons. As a
result, a procedure is needed to estimate the economy-wide price level using data on prices p
and expenditures v for individual products i. One could take a welfare-theoretic approach and
aim to compare the cost-of-living, as in e.g. Neary (2004). However, as argued by Deaton
and Heston (2010), such an approach leads to questions that are extremely hard to provide
sensible answers on, such as whether tastes or identical or, if not, how best to envisage
comparing countries that do not only differ in price and income levels but also along
dimensions such as climate or religion. What is done in ICP can thus best be thought of as
comparing an index of the price level, not the cost of living.
If we choose a so-called superlative index for this purpose, we know from Diewert (1976)
that such an index will provide a second-order local approximation to arbitrary preferences.
Let us, for expositional purposes, assume the superlative Törnqvist index. The relative price
level of country j relative to k at time t can then be written as:
log Pjt P
kt 1
2 sijt s
ikt i log p
ijt p
ikt , (1)
where sij are the expenditure shares of product i, v
ijv
iji . An important feature of equation
(1) is that the relative price level between j and k depends on expenditure shares in both
countries.11 How will this relative price index evolve over time? The standard approach, used
in e.g. the World Bank’s World Development Indicators, is to assume the change in the
relative price level is equal to relative inflation. Let country inflation also be measured by a
Törnqvist index:
log Pjt log P
jt P
jt1 1
2 sijt s
ijt1 i log p
ijt p
ijkt1 (2)
11 In the more general, so-called multilateral indexes used in ICP, the price level between j and k also depends (to a lesser degree) on expenditure shares in all countries, see Diewert (2013).
5
Since the price level in equation (1) depends on both country’s expenditure shares, while
national inflation only depends on national expenditure shares, the change in the price level is
not exactly equal to relative inflation, as shown by Deaton and Aten (2014). If we assume
that expenditure shares do not change over time, we can write the change in the price level by
combining equations (1) and (2) as:
log Pjt P
kt log P
jt log P
kt 1
2 sij s
ik i log pijt log p
ikt . (3)
This equation shows that the expected change in PPPs is given by relative inflation – the first
term in brackets – and a term reflecting differences in expenditure patterns and (average)
relative price changes across products. If there are no differences in expenditure patterns, no
changes in relative prices or if differences in expenditure patterns are uncorrelated with
average changes in relative prices, this term will be zero. In the absence of detailed data
about product level inflation, log pijt , it is not feasible to implement equation (3) in full.12
To extrapolate PPPs from 2005 to 2011, we will therefore rely only on data about relative
inflation, which according to equation (3) should be a fair approximation on conceptual
grounds. On practical grounds, discrepancies may well occur if, for example, country-specific
products are used to estimate national inflation but (for lack of cross-country comparisons)
are not used in ICP. McCarthy (2013b) details a comprehensive set of practical
considerations, but for none of these would we expect a larger effect in lower-income or
higher-income countries.
Given this framework, we now turn to a comparison of ICP 2011 with extrapolations from
ICP 2005. For ICP 2011, we use the results as published in the Summary Report (World
Bank, 2014a) and specifically the purchasing power parities (PPPs) and expenditure levels
for GDP and final consumption expenditure (‘consumption’). For ICP 2005, we use the PPPs
as originally published in World Bank (2008).13 The combined data set is constrained mostly
by ICP 2005, which covered 146 countries. Argentina, Syria and Lebanon participated in ICP
2005 but not in ICP 2011 and Zimbabwe participated in ICP 2005 but the coinciding period
of hyperinflation led to unreliable results. This leaves us with a data set of 142 countries,
12 Experiments with inflation data for (at most) four products in the consumption bundle shows small relative price changes over the 2005-2011 period, which suggests that the second term in equation (3) will be small. However, more detailed data could in principle reveal larger changes, so definite statements are not possible. 13 Or, to be more precise, the ratio of the PPP over the exchange rate. This avoids having to explicitly control for changes in currency units, such as the adoption of the euro in a number of countries.
6
which is used in the remainder of the paper. To extrapolate GDP PPPs, we use GDP
deflators; to extrapolate consumption PPPs, we use the consumer price index (CPI).14
Let YjtP be the PPP for GDP (Y) in country j at time t based on a benchmark year and let
Yjtp be the GDP deflator in year t in country j. Whenever t the PPP is extrapolated from
the benchmark year as:
Yjt YjYjt Yj
Ybt Yb
p pP P
p p
(4)
where b refers to a base or reference country – usually the United States. We can then
compare GDP per capita (y) or household consumption per capita (c) based on the PPP
extrapolated from ICP 2005 with the corresponding numbers based on the new ICP 2011
benchmark. We use the following relative difference measure:
2011 20052011 2011 2011
2011 2015 20112011 2011 2011
1 1j Yj YjYj
j Yj Yj
y P Pd
y P P (5)
As all the subsequent comparisons are for the year 2011, we drop the subscript 2011. Figure
1 shows a plot of dY against the log of GDP per capita converted using exchange rate E,
ln ( )j jy E , and dC against the log of consumption per capita, ln ( )j jc E . The GDP and
consumption per capita numbers are those reported in World Bank (2014a).
The figure illustrates the very large upward revisions to income and consumption levels
resulting from the downward revisions to PPPs when moving from extrapolations from ICP
2005 to ICP 2011.15 As the range of the graphs shows, relative income and consumption
levels have increased by more than half in a large number of countries. Differences in GDP
per capita extrapolations are between 30 to 40 percent also for large countries like Brazil,
China and India and differences are quite large in oil-rich economies including Indonesia.
However, the relative differences exhibit a slightly different pattern for per capita
consumption. Differences for India are around 50 percent whereas the difference for
Indonesia is more moderate. More generally, there is a clear negative correlation with the
14 An alternative would be the implicit price deflator of household consumption expenditure from the National Accounts but the results are very similar. 15 The figure is qualitatively similar to Figure 1 in Deaton and Aten (2014, p. 31), though for OECD/Eurostat countries, their consumption levels are extrapolated from a more recent benchmark comparison than ICP 2005.
7
highest-income countries typically showing only small revisions. The lower-income countries
typically show larger revisions and this effect is stronger for consumption.16
Figure 1, Differences between extrapolating ICP 2005 and ICP 2011
To summarize the patterns in Figure 1 and compare these to alternative PPP series in the
remainder of this paper, we use three statistics based on the difference measure in (5).
Mean difference across the set of J countries: di
1
J
Pij2005
Pij2011
1
, for i Y ,C
j1
J
Root mean squared difference (RMSD): di2
1
J
Pij2005
Pij2011
1
2
j1
J
, for i Y ,C
Slope coefficient of the regression: dij a
i b
iln e
ijE
j uij, for j 1, , J and i Y ,C ,
where eij is either GDP or consumption per capita. These summary measures are best
understood by viewing the problem as a forecasting exercise. Until the release of the
benchmark ICP 2011 PPPs, the typical approach was to forecast PPPs for 2011 by
16 The difference between the GDP and consumption pattern suggests that revisions to the price levels of investment and government consumption are not related with income levels.
8
extrapolating 2005 PPPs using equation (4).17 Statistic is then a measure of the forecasting
bias, RMSD is a measure of forecasting uncertainty18 and an indication whether the bias
varies systematically with GDP or consumption per capita.19
Given the challenge of comparing prices of more than a thousand individual products across
almost 200 countries, it is not surprising that measurement error is present, which would lead
to RMSD being non-zero.20 Indeed, this can be a rationale for conducting regular benchmark
price comparisons: since the PPP of a given country in a given year will be measured with
error, it is good to have measurements in multiple years to avoid relying too heavily on a
single (possibly error-prone) observation.21 That said, RMSD is still a useful measure since a
method of extrapolating PPPs that would lead to a lower value of RMSD would be
preferable. The average bias id and the systematic variation in the bias measured by ib are
arguably the most worrisome measures as they imply a shift in the world income distribution.
This is most obvious for ib , as this measures whether lower-income countries show
systematically larger (or smaller) differences. But a non-zero is also worrisome because
PPPs are given relative to the United States and this thus implies an average shift in prices –
and thus expenditure levels – relative to US.
Table 1 shows summary statistics corresponding to the two panels in Figure 1. The average
country has expenditure levels in ICP that are about 25 percent higher than those based on
extrapolations from ICP 2005. Put differently, the average country is 24 percent richer
relative to the US than estimated previously, implying a sizeable decline in cross-country
income inequality. The root mean squared difference is also comparable between the GDP
17 An exception was Ravallion (2013, 2014), who has argued for a so-called ‘dynamic Penn effect’. However, Inklaar (2013) showed that the forecasting performance of the dynamic Penn effect was inferior to the standard inflation-based extrapolation. 18 In the terminology of the forecasting literature, this would be the root mean squared prediction error, RMSPE, see West (2006). 19 Note that the average difference and root mean squared difference are not independent of the numeraire country. However, since typical comparisons are almost exclusively made with the US as the numeraire this is not a major drawback. However, see Diewert (2009) for alternative measures. 20 See Deaton and Heston (2010) for a discussion of the conceptual and practical challenges of cross-country price comparisons. Note also that measurement errors may not just be a problem for measuring PPPs but also for tracking inflation at the national level. The errors in inflation measurement should typically be smaller since it is more likely that a particular product can be priced from one period to the next than in to (potentially) disparate) countries, though information on spending patterns are not regularly updated, measured inflation could well start to substantially deviate from true inflation. 21 See also e.g., Deaton (2012) and Rao and Hajargasht (2014) on the estimation of PPP standard errors from the variation of individual product prices around the overall GDP or consumption PPP.
id
ib
id
9
and consumption, at 33–35 percent. The main difference between the GDP and consumption
extrapolation errors is that the coefficient on expenditure per capita is much larger for
consumption that for GDP, though both are significant. As was apparent from Figure 1, the
differences tend to be smaller for non-oil-producing countries and the mean difference is
even larger for developing economies than for the sample as a whole.
Table 1, Differences between ICP 2011 and extrapolation from ICP 2005 –summary
statistics
All countries
Non-oil countries
Developing economies
GDP Mean difference 0.237*** 0.199*** 0.278*** Root mean squared difference 0.336 0.284 0.370
Coefficient on log(expenditure/capita) -0.020* -0.033*** 0.025 (0.012) (0.010) (0.019)
Consumption Mean difference 0.254*** 0.218*** 0.308*** Root mean squared difference 0.349 0.300 0.380
Coefficient on log(expenditure/capita) -0.063*** -0.066*** -0.027 (0.011) (0.009) (0.019)
Notes: summary statistics based on differences in Figure 1 for 142 countries. Countries where fuel exports
contribute more than 10 percent of real GDP are labeled as ‘oil countries’ (source: Penn World Table, version
8.0). Developing economies are the set of low-income and middle-income countries as defined by the World
Bank, so with a GNI/capita level less than $12,746. Robust standard errors of the regression coefficients are
shown in parentheses below the coefficients. * denotes a variable significantly different from zero at a 10%-
level, ** at 5%-level, *** at a 1%-level.
In the remainder of this paper, the results in 1 will serve as a baseline for comparison with
statistics based on counterfactual versions of ICP 2005 that harmonize methodology and
resolving sampling biases. Ideally, those adjustments will lead to a closer alignment between
the extrapolated (counterfactual) ICP 2005 results and ICP 2011 thus providing a clue as to
whether the developing world has indeed suddenly become richer. In the following section,
we describe the main methodological improvements introduced in 2011 ICP, drawing on
World Bank (2014) and explain how the counterfactual versions of ICP 2005 are constructed.
3. Methodological innovations in ICP 2011 and the counterfactuals for ICP 2005
The International Comparison Program (ICP) is a major international statistical initiative in
which prices and national accounts data are collected and processed to compile PPPs and
10
internationally comparable real expenditures. The early phases of the ICP, until 1985, were
essentially world comparisons where data from all participating countries were treated as a
single set. A major drawback of such a world comparison was that it proved hard to collect
price data for the same products across such a wide range of countries and the requirements
for global comparability stood in the way of comparability between the more similar
countries in each region. A start was made to move away from a single world comparison
with ICP 1993, but ICP 2005 was the first comparison with a fully-implemented regionalized
approach.
The regionalized approach involves compilation of PPPs in two stages. The first stage is at
the regional level, where countries would collect and compile data according to regional
product specifications, leading to more reliable relative prices within each region. In the
second stage, the regional comparisons are linked to complete the global comparison. The
methodology for linking regional comparisons was pioneered while data collection for ICP
2005 was in progress. So it was perhaps not surprising that after the release of ICP 2005, an
assessment of the methods used in 2005 for linking regional comparisons indicated several
deficiencies. These deficiencies are discussed in Rao (2013), on the methods for linking
prices for product categories – referred to as ‘basic headings’ in ICP – and in Diewert
(2013b), for linking at the aggregate level. To remedy the signaled deficiencies, several
methodological innovations have subsequently been implemented in ICP 2011.
Major changes in methodology raise the question how the results from ICP 2005 and ICP
2011 can be compared. A direct comparison of ICP 2011 results with an extrapolation from
ICP 2005 is like comparing apples and oranges in view of significant improvements in the
methods used in the ICP 2011 round. In this paper, we offer a solution to this problem by
constructing a counterfactual for ICP 2005 based on ICP 2011 methodology. We briefly
describe the major methodological innovations in the ICP 2011 – with further details
available in World Bank (2014b). We then construct counterfactual 2005 PPPs based on data
used in the ICP 2005 computations. The basic data used are the PPPs and national
expenditure data at the basic heading level for all the 146 participating countries. In addition,
productivity adjustment data for 2005, information on dwellings;22 the 2005 ring product list
22 We thank Alan Heston and Nada Hamadeh for providing the data and the worksheets used in the computation of linking factors for dwellings.
11
and prices; the 2011 global core list products and their prices in all the participating countries
have been obtained from the ICP Global Office at the World Bank.23
3.1 Harmonization of methodology
Aggregation at the regional level
In ICP 2005, the African region employed the transitive and additively consistent aggregation
method, the Iklé-Dikhanov-Balk (IDB) method, to compute aggregate PPPs within Africa,
while all other regions used the Gini-Elteto-Koves-Szulc (GEKS) method.24 Bringing all the
regions to line, the GEKS method was implemented in all the regions in ICP 2011. The first
step in the construction of the counterfactual is thus the replacement of the 2005 PPPs within
the African region based on IDB method with PPPs computed using the GEKS method.
Productivity adjustment to relative wages
Government consumption, including spending on public administration, health and
education, have always been considered ‘comparison-resistant’ in ICP since market prices for
the output of most of these services cannot be observed. So instead of relative output prices,
ICP compares government consumption by comparing input prices, specifically wages and
salaries. A drawback of using input prices is that relative productivity differences are not
taken into account, which means that in lower-income countries, a simple measure of relative
wages would understate relative output prices and overstate real expenditure under this
category.
In ICP 2005, the Asia-Pacific and African regions recognized the need to make a productivity
adjustment to relative wage data, but no other regions implemented any adjustments. An
implicit assumption in this limited use of productivity adjustment is that productivity of
government employees in all the remaining economies is the same as that in the United States
which is unlikely to be true. Consequently, a fully-fledged productivity adjustment for all the
economies participating was implemented in ICP 2011, see World Bank (2014b) for details.
Following the approach used in ICP 2011, we implement productivity adjustments based on
the contribution in 2005 of differences in (physical) capital per worker to GDP.25 We find a
marginal increase in the share of Africa, Asia-Pacific and Eurostat/OECD countries when a
23 These data are routinely provided to all researchers upon request to the ICP Global Office. 24 See Diewert (2013a) for detailed descriptions of the IDB, GEKS and other aggregation methods. 25 See World Bank (2014b) for details on the methodology and the Penn World Table, version 8.0 for data on comparative capital stocks and the share of labor income in GDP.
12
comprehensive productivity adjustment is implemented, see Appendix Table A1 for regional
shares before and after comprehensive productivity adjustment.
Linking dwellings using quantity indicator data26
To compare relative rents of dwellings across regions, a procedure was followed for ICP
2005 and ICP 2011 that is different from other types of consumption, see Heston (2013) and
World Bank (2014b). Ideally, information would be available on the actual (or imputed) rent
paid on different types of dwellings, but data on the number of dwellings and some quality
characteristics are more widely available.
In ICP 2005, the available rental price data was used to link the relative rents for dwellings
across regions, see Heston (2013). For ICP 2011, a similar approach was tried but the rental
data supplied was judged to be of insufficient quality. Instead a method based on quantity
indicators was applied. First, the number of dwellings per capita27 is used to construct a
regional quantity index for dwellings with OECD-Eurostat set at 100. The quantity index is
then adjusted for quality differences using a quality index computed as the arithmetic average
share of dwellings with electricity, inside water and private toilet.
To harmonize this aspect of the methodology, we apply the ICP 2011 methodology on
dwellings quantity and quality indicators data for 2005 supplied by the Global Office. We
combine the per capita volume index with a per capita value index, computed as the ICP
2005 expenditure in the ‘Actual and imputed rentals for housing’ basic heading category,
converted to US dollars using the current exchange rate and divided by population. This gives
a price level index for each region. The steps involved in implementing the 2011 linking
methodology for dwellings on data from 2005 are shown in Appendix Table A2. In Africa,
Asia-Pacific and Latin America, the prices according to the ICP 2011 methodology are lower
than under the ICP 2005 methodology, suggesting an upward revision to income levels in
countries from these regions.
26 We are grateful to Alan Heston who provided details of the approach used in 2005; Nada Hamadeh for providing data used in 2005 as well as in 2011; and to Paul Konijn for providing us with a copy of the document “Linking the regions: the case of housing” outlining the methodology used in linking dwellings data in 2011 ICP. The Konijn document describes the actual procedure implemented by the Computational Task Force for ICP 2011. 27 Though data on usable surface in sq. meters was collected, it was not of sufficient quality for use in the linking process. Therefore, linking was essentially based on number of dwellings per capita data.
13
Aggregation above basic heading level
In the implementation of the ICP, the very last step involves linking of regions at higher
levels of aggregation, such as household consumption and GDP. The methodology used for
linking at the GDP and component level used in 2005 was replaced by the country
aggregation and redistribution (CAR) method in 2011.
The starting point for linking above the basic heading level is data on national expenditure
and PPPs for each of the 155 basic headings, usually with the US=1. The problem in
aggregating these data for global comparisons of GDP is the additional requirement of fixity.
The fixity requirement stipulates that the relative levels of consumption or GDP of countries
within a region must be identical to the relative levels established at the regional level,
thereby ruling out the standard aggregation methods. Diewert (2013b) summarizes a range of
suggested methods for global comparisons that maintain fixity. The method used in ICP 2005
treats each region like a super-country28 and requires regional quantity and aggregate price
vectors. To construct these, first the implicit quantity for each basic heading in each country
is derived by dividing national expenditure by the basic heading PPP. These basic heading
quantities are then added across all the countries in each region, resulting in one quantity
vector for each of the six regions for each basic heading. In the second step, the price of the
basic heading for the region can be computed as total regional expenditure in the units of
currency of regional reference country divided by the quantity measure. This yields a price
and quantity vector for each of the regions, where prices are expressed in the currency units
of the reference countries in each region. The GEKS method is then used to get relative
regional prices at the aggregate level, which are combined with the within-region relative
prices to compare prices across regions.
As detailed in Diewert (2013b), though, it was later found that the method implemented in
ICP 2005 was not invariant to the choice of the reference country in each region. For ICP
2011, the use of the Country Aggregation with Redistribution (CAR) method was
recommended. This method starts with a global comparison of prices using the GEKS
method and basic-heading-level data for all individual countries. The resulting relative prices
are used to compute real, PPP-converted GDP for each country and these GDP levels are
added to arrive at total regional real GDP. In the second step, the shares of each country in
within-region GDP – an outcome of the regional comparison – is applied to total regional real
28 The term is from Deaton and Aten (2014).
14
GDP. In constructing the counterfactual comparisons for 2005, we implement the CAR
approach with the basic heading PPPs and national expenditures obtained after using the
linking procedures detailed above.
Effect of harmonized methodology on ICP 2005
The first step in our approach to construct a counterfactual for ICP 2005 prices is to
harmonize the methodology that used in ICP 2011. In Table 2, we show how the original ICP
2005 and the different counterfactuals compare to ICP 2011. Comparing the first and last
column, it is clear that harmonizing the methodology results in lower mean differences and a
lower RMSD for both GDP and consumption. Though the mean difference for the fully
harmonized counterfactual is still sizeable at 17 to 18 percent, this is 26 percent smaller for
GDP and 31 percent smaller for consumption than the original. Similarly, the RMSD has
gone down by 15 (GDP) to 18 (consumption) percent. Going through the various stages of
the harmonization, the most substantive change results from harmonizing the within-Africa
PPPs, while implementing the CAR method, productivity adjustment and dwellings
adjustment matter relatively less.
Table 2, Differences between ICP 2011 and extrapolations from ICP 2005 –
counterfactuals based on harmonized methodology
Original CAR method
+ Within-Africa harmonization
+ Productivity adjustment
+ Dwellings adjustment
GDP
Mean difference 0.237*** 0.216*** 0.175*** 0.192*** 0.184*** Root mean squared difference 0.336 0.349 0.280 0.290 0.285
Coefficient on log(expenditure/capita)
-0.020* -0.043*** -0.018* -0.016 -0.015
(0.012) (0.013) (0.010) (0.010) (0.010)
Consumption
Mean difference 0.254*** 0.254*** 0.187*** 0.187*** 0.174*** Root mean squared difference 0.349 0.410 0.296 0.296 0.286
Coefficient on log(expenditure/capita)
-0.063*** -0.102*** -0.066*** -0.066*** -0.063***
(0.011) (0.017) (0.010) (0.010) (0.010) Note: column labeled ‘Original’ is from Table 1. ‘CAR method’ uses the regional linking methodology of ICP
2011, ‘+ Within-Africa harmonization’ uses the GEKS method rather than the IDB method to compare prices
within the African region; ‘+ Productivity adjustment’ also implements the productivity adjustment for public
wages for the Eurostat/OECD region and Latin America and ‘+ Dwellings adjustment’ also implements the ICP
2011 method for comparing the (rental) price of dwellings across regions. This column corresponds to a set of
PPPs where the methodology has been harmonized to the extent possible. Robust standard error of the
regression coefficients shown in parentheses below the coefficients. * denotes a variable significantly different
from zero at a 10%-level, ** at 5%-level, *** at a 1%-level.
15
In addition to the methodological innovations in 2011 discussed here, ICP 2011 also
introduced a new method for measuring relative prices in construction. ICP 2005 used the
‘basket of construction components’ (BOCC) method was – see McCarthy (2013a) for details
– but it was replaced in ICP 2011 by a simpler method based on the prices of basic
construction materials, different types of labor, and the hire of machinery and equipment.
However, the information required to implement the ICP 2011 construction methodology was
not available from the 2005 data sources.
3.2 Accounting for price sampling biases due to ring country linking methodology in
ICP 2005
This section outlines the adjustments made to 2005 ICP in the process of constructing a
counterfactual that incorporates the main improvements to methodology introduced in ICP
2011. World Bank (2014b) emphasizes changes to linking methodology as a major
innovation in ICP 2011. Linking regional comparisons to arrive at the overall global
comparison is a critical step in ICP. ICP 2011 introduced a global core list (GCL) of products
and pricing of GCL products by all 177 participating countries in ICP29 in the process of
linking PPPs. In contrast, the linking procedure in ICP 2005 was based on data for only 18
countries, the so-called ring countries,30 which were deemed to be representative for the full
set of countries in their respective regions. The ring countries collected prices for items on
the global ring product list and those prices were then used to link PPPs from different
regions; see also Deaton and Heston (2010) and Vogel (2013).31
After the completion of ICP 2005, several issues were identified with the linking procedure
based on ring countries and the ring products. The first is whether the particular selection of
18 ring countries would induce a bias compared to the use of data for all the participating
countries. We term this bias the ring country selection bias. The second issue relates to the
items included in the ring product list. A close examination of the products included in the
ring list and the product specifications used for pricing purposes revealed a strong rich
country bias in the products. The list included items like Uncle Toby oats; a bottle of
Heineken beer, Bordeaux wine; and a Peugeot 408, which is suggestive of a rich country bias
29 Though ICP 2011 covered 199 countries, only 177 countries participated fully and the remaining are Pacific Island countries whose participation was limited to comparisons of household consumption. 30 The ring countries are Comoros, Egypt, Kenya, Senegal, South Africa and Zambia in Africa; Hong Kong, Malaysia, Philippines and Sri Lanka in Asia-Pacific; Brazil and Chile in Latin America; Jordan and Oman in Western Asia; and Estonia, Japan, Slovenia and the United Kingdom in Eurostat/OECD. 31 See Rao (2013) on the linking methodology at the basic heading level and Diewert (2013b) on linking at the aggregate level.
16
in the ring list. Deaton (2010) presented further suggestive evidence that the use of the ring
product list led to relatively high price levels (and thus lower real incomes) in low-income
ring countries. We refer to this as the ring product selection bias. In addition, we also
consider the widely discussed urban bias in prices of household consumption goods in China
in ICP 2005. In this section we describe how we quantify these biases and use them in
constructing a counterfactual for ICP 2005.
Urban bias in consumption prices from China
It is well documented that consumption prices provided by China to the Asia-Pacific regional
comparison in 2005 were collected from only 11 capital cities and their surrounding areas
(see ADB, 2007 and Word Bank, 2008).32 The general consensus from Deaton and Heston
(2010), Chen and Ravallion (2010a,b) and Feenstra, Ma, Neary and Rao (2013) is that the
Chinese consumption prices used in ICP 2005 were biased upwards relative to national
average prices by 20 percent or more.33 In contrast, the Chinese price survey for ICP 2011
was much more comprehensive, covering urban and rural areas in each of the country’s
provinces (see ADB, 2014). In constructing the counterfactual for 2005, we make a 20
percent upward adjustment to consumption prices for China.
Ring country selection bias
We cannot use data for 2005 to establish whether there was any bias from the specific
selection of ring countries. However, given the price data for all 177 countries in 2011, we
can examine if selecting the 18 ring countries from 2005 would have introduced any
systematic bias in the (2011) linking factors at the basic heading level. The assumption here
is that the ring country bias we find in 2011 would have been present to the same degree in
2005 as well.
To measure this bias, we exploit the basic structure of the country-product-dummy (CPD)
model used in the ICP.34 We specify a simple regression model that explains log prices P of
product i in country j located in region r using dummies for each product, Di, each region, Dr,
32 The Global Office of the ICP at the World Bank and the regional coordinating agency, Asian Development Bank, devised a mechanism to derive national average prices for China based on price data from the 11 cities. However, the method employed failed to account for rural-urban and regional price differentials. 33 PWT 7.1 and 8.0 implemented a downward adjustment in the preparation of their PPP and real expenditure extrapolations. A similar adjustment was used in Chen and Ravallion (2010a,b) in estimating absolute poverty in China. 34 Details of the CPD model and its use in the ICP can be found in Rao (2013).
17
a ‘ring country’ dummy R (which takes value 1 if the country is one of the 18 ring countries)
and interactions between the regional dummies and the ring country dummy:
logP
ijr
ii Di
rr DrR
rr DrR ijr (6)
Following the linking procedure used at the basic heading level (see Rao, 2013 for details),
prices are first converted into regional numeraire currencies using within-region basic
heading PPPs. This model, without the ring country dummy and interaction terms, is used in
ICP to derive the linking factors for each basic heading, see Vogel (2013). The r parameters
are of main interest since a significant coefficient would indicate that pricing in the ring
countries in that region are significantly different from pricing in the other countries in the
region.
Table 3, Estimating the difference in 2011 pricing of global core list products
in ring countries versus other countries
Region coefficient Region x ring country coefficient
Africa 1.8023*** Africa x ring country −0.0023 (0.0288) (0.0117) Asia-Pacific 1.8221*** Asia-Pacific x ring country −0.0695*** (0.0309) (0.0202) Latin America 0.6997*** Latin America x ring country 0.0158 (0.0288) (0.0209) Western Asia −1.3541*** Western Asia x ring country -0.0107 (0.0294) (0.0152) Ring country dummy 0.0033 (0.0079) Number of observations 57,415 Note: table shows regression results where the log of consumption item prices for 2011, converted to the
regional base country’s currency, are explained by dummies for each region, whether a country was a ring
country in ICP 2005, interactions between this ring country dummy and the regional dummies, and dummies for
each of the 1192 items (not shown), see also equation (6). Items are weighted by their importance, see main text
for discussion. The Eurostat/OECD region is taken as the base region and hence the coefficients should be
interpreted as price differences relative to the Eurostat/OECD. Robust standard errors of the regression
coefficients shown in parentheses below the coefficients. * denotes a variable significantly different from zero at
a 10%-level, ** at 5%-level, *** at a 1%-level.
r
r
18
Table 3 reports the results of a pooled regression across all basic headings, based on 57,415
price observations35 and using the ‘importance weights’ that are also used in ICP 2011.36 The
regional coefficients in the left panel provide the average regional linking factorsr. For
example, the table shows that 6.15 Hong Kong dollars (exponential of 1.8023, the coefficient
for the Asia-Pacific) for one US dollar is the linking factor when all the countries are used in
linking the regions. We also observe that the coefficient of ring country dummy is
numerically very small and statistically insignificant. This means that the selection of ring
countries on average has no influence on the linking factors.
Turning to the r parameters, we find insignificant coefficients for ring countries in all the
regions with the exception of the Asia-Pacific region. The significant interaction term for the
Asia-Pacific ring countries indicates that prices in the ring countries (Hong Kong, Malaysia,
Philippines and Sri Lanka) from this region are, on average, 6.95 percent lower than in other
Asia-Pacific countries. In our counterfactual ICP 2005, we divide prices by the exponent of
this coefficient to counteract this ring country selection bias.
Ring product selection bias
In ICP 2005, the 18 ring countries priced items from the ring product list and ideally, this list
should contain products that are part of a representative consumption bundle in every country
of their respective region. In practice, this is much harder, as discussed in more detail by
Deaton and Heston (2010). For instance, a bottle of Heineken beer was on the ring product
list, but if this is a premium brand in poor countries, while a standard choice of beer in rich
countries, beer prices in the poor country will be relatively higher than would be the case for
beer in general. Deaton (2010) presented suggestive evidence that points to a ‘rich country’
bias in the ring product list.
Given this potential for bias, a global core list (GCL) of products was developed for ICP
2011 which is representative of goods and services that are consumed in all regions of the
world. The ICP 2011 GCL product approach seems to have been fairly successful as the GCL
had significant overlap with the regional product lists. Of the 692 consumption products on
the GCL, 610 matched with the regional list in Africa; 412 with Asia; 394 with OECD-
35 The regressions includes data from 148 countries. If these had priced all the 692 GCL products, we would have 102,416 observations, so not all products were priced in all countries. 36 A product that is deemed important in a particular country gets a weight of 3 and other products get a weight of 1.
19
Eurostat; 489 with Latin America and the Caribbean and 606 with the list used in Western
Asia.
To explicitly test whether the product selection in drawing up the ring product list was a
source of bias, we exploit the fact that there were multiple ICP 2005 ring countries in each
region and that these ring countries participated in two sets of price comparisons: one based
on the regional product list as a part of the regional comparisons and the other based on the
ring product list for the purpose of computing linking factors. As the regional lists were
drawn up to be representative of products in each region, it seems reasonable to assume that
the results based on the regional price comparison are a more accurate measure of relative
prices between the ring countries within the region than the relative price differences implied
by the ring product list. Indeed, in the compilation of ICP 2005 the ring product list prices are
only used for linking prices between regions, not for price comparisons within the region. So
if we find that the relative prices within each region based on the ring product list prices
differ systematically from the relative prices implied by the regional product prices, we take
this as evidence of a bias from the particular selection of the products on the ring product list.
We use the following variant of the CPD model to test for the presence of product selection
bias:
logPijr
iDi
rBr
ijri (7)
where P represents item price (converted using the basic heading PPP), i refers to item, j
refers to country and r represents region. The new dummy variable Br, the base country of
region r. Importantly, ijrP measures the price of item i in country j which is part of region r
relative to the basic heading PPP for the product group of which item i is a part. The basic
heading PPPs are expressed relative to the base country of each region. The base countries in
different regions in ICP 2005 tended to be the highest-income countries of the set of ring
countries in the region: South Africa in Africa, Hong Kong in Asia-Pacific, United Kingdom
in Eurostat/OECD, and Oman in Western Asia. The exception was Latin America, where
Brazil was the base country but Chile had a higher income level.
If there were no ring product selection bias, we would expect the estimate of r not to differ
significantly from zero. However, based on the arguments of Deaton (2010) and Deaton and
Heston (2010) that the ring product list was a ‘rich-country’ product list, we would expect
20
negative estimates of r in Africa, Asia-Pacific and Western Asia. Since the regional base
countries are the highest-income (ring) countries in the region, the ring products would be
(more) representative of consumption patterns in the base country, while in the other ring
countries in the region the ring products would be unrepresentative and thus relatively
expensive.
Table 4, Estimating the difference in ring product prices in 2005 between regional base
country and other ring countries
Region (base country) Base country dummy Country Country dummy Observations Products Africa (South Africa) −0.0940*** Comoros 0.1708*** 2981 711
(0.0247) (0.0299) Egypt 0.0958***
(0.0313) Kenya 0.2127***
(0.0295) Senegal 0.0788***
(0.0287) Zambia −0.1334***
(0.0314) Asia-Pacific (Hong-Kong) −0.1556*** Sri Lanka 0.2781*** 2171 681
(0.0159) (0.0223) Malaysia 0.1309***
(0.0176) Philippines 0.0834***
(0.0185) Eurostat-OECD (UK) 0.0035 Estonia −0.0082 2004 636
(0.0143) (0.0166) Japan −0.0022
(0.0234) Slovenia 0.0012
(0.0160) Latin America (Brazil) 0.0103 Chile −0.0103 594 297
(0.0251) (0.0251) Western Asia (Oman) −0.0661** Jordan 0.0661** 710 355 (0.0322) (0.0323)
Notes: Left part of the table shows regression results where the log of consumption ring item prices for 2005,
converted to the regional base country’s currency using basic heading PPPs, are explained by a base country
dummy and dummies for each of the items (not shown), see also equation (7). The right part of the table reports
regression results where the base country dummy is replaced by individual country dummies. Robust standard
errors of the regression coefficients shown in parentheses below the coefficients. * denotes a variable
significantly different from zero at a 10%-level, ** at 5%-level, *** at a 1%-level.
Equation 7 is estimated separately for each region. The results from estimating equation (4)
for each region are reported in the left panel of Table 4 and the results conform to our prior
expectations. The ring product list prices are systematically lower in the regional base
countries in Africa, by 9.4%, in Asia-Pacific by 15.6% and in Western Asia by 6.6%. This
means that ring product prices for the reference country South Africa in Africa are on average
21
9.4 percent below the prices of observed in other ring countries in Africa (Egypt, Kenya,
Senegal and Zambia). The ring-country effect is most pronounced in the Asia-Pacific region
where Hong Kong ring prices are found to be 15.56 percent lower, on average, than the ring
countries of the region, viz., Malaysia, Philippines and Sri Lanka. These results are contrary
to the expectation based on Balassa-Samuelson and Penn effect that price levels in lower
income countries are generally lower. The insignificant coefficient in the Eurostat/OECD
region provides further indication that the products on the ring product list were
representative of rich-country consumption patterns.
Table 5, Estimating the difference in global core list product prices in 2011 between
regional base country and 2005 ring countries
Region (base country) Base country dummy Country Country dummy Observations Products
Africa (South Africa) 0.0034 Comoros -0.0005 2770 567
(0.0270) (0.0306)
Egypt 0.0260
(0.0361)
Kenya -0.0083
(0.0305)
Senegal -0.0195
(0.0334)
Zambia -0.0239
(0.0329)
Asia-Pacific (Hong-Kong) 0.0702*** Sri Lanka 0.0812** 1478 440
(0.0242) (0.0352)
Malaysia -0.1618***
(0.0283)
Philippines -0.0859***
(0.0284)
Eurostat-OECD (UK) 0.0242 Estonia -0.0170 1330 380
(0.0158) (0.0197)
Japan -0.0592***
(0.0225)
Slovenia -0.0087
(0.0189)
Latin America (Brazil) 0.0020 Chile -0.0020 508 254
(0.0313) (0.0387)
Western Asia (Oman) -0.0254 Jordan 0.0254 920 460
(0.0220) (0.0238) Notes: Left part of the table shows regression results where the log of global core list consumption item prices
for 2011, converted to the regional base country’s currency using basic heading PPPs, are explained by a base
country dummy and dummies for each of the items (not shown), see also equation (7). The right part of the table
reports regression results where the base country dummy is replaced by individual country dummies. In both
22
sets of regressions, robust standard errors of the regression coefficients shown in parentheses below the
coefficients. * denotes a variable significantly different from zero at a 10%-level, ** at 5%-level, *** at a 1%-
level.
To provide further support for the findings in the left panel of Table 4, we reformulate
equation (7) with dummy variables for each of the ring countries within each region except
for the regional numeraire and the right panel in Table 4 shows the results. The coefficient of,
for example, Kenya implies that ring prices (relative to the regional PPPs) are 21 percent
higher than those in South Africa. The coefficient of Zambia is somewhat puzzling since it
shows that prices of ring products in Zambia are 13 percent lower than implied by the
regional comparison, making it the only country with a sign counter to expectations.37 In the
Asia-Pacific region, the signs and magnitudes are as expected, with e.g. Sri Lankan ring
prices 27.81 percent higher than in Hong Kong. None of the coefficients are significant in the
OECD-Eurostat and Latin American regions, which suggests that that ring prices are aligned
with regional prices.
Though we would expect smaller or no product selection bias based on the 2011 GCL, we
run similar regressions as reported in Table 4 based on those data. Results of the regressions
based on 2011 GCL prices are presented in Table 5. The results for the most part confirm
with our expectations as there is less evidence of product selection bias. The only exception
is the Asia-Pacific region, where prices in Hong-Kong were 7.02 percent higher than in the
other (2005) ring countries.
Product selection bias from Deaton and Aten (2014)
Deaton and Aten (2014) were the first to examine the sources of systematic and major
discrepancies between 2011 ICP and extrapolations from the 2005 benchmark. Their
approach to identifying a bias from the product selection in 2005 differs from ours. They
estimated country-level (consumption) PPPs for the 2005 ring countries based on the (2005)
ring product list and based on the (2011) GCL. They then compared the change in these PPPs
between 2005 and 2011 to the relative inflation rate as in equation (4), taking the UK from
the Eurostat-OECD region as the numeraire. For the high-income ring countries, they find
that relative inflation rates approximate the change in PPPs quite well but for the low-income
countries, they find smaller changes in PPPs than implied by relative inflation (see Deaton
and Aten, 2014, Figure 5). If one assumes that GCL prices are unbiased (which is supported
37 Using robust regression makes little difference, suggesting Zambia’s result is not driven by price outliers for individual products, but by a general pattern.
23
for most regions in Table 5), then 2005 relative prices for the low-income ring countries were
biased upwards. They conclude that the ring product prices may have overstated price levels
in the low-income ring countries in Western Asia, Africa and the Asia-Pacific by between 20
to 30 percent.
This is a larger adjustment than implied by our analysis in Tables 4 and 5. Table 4 showed
that prices in Africa were biased upwards relative to the regional base country by 9.4 percent
and a 6.6 percent bias in Western Asia. In Asia-Pacific, prices in 2005 were biased
downwards by 15.6 percent while prices in 2011 were biased upwards by 7.0 percent for a
total discrepancy of 22.6 percent.38 However, there is reason to believe that our estimates are
conservative. The analysis reported in Tables 4 and 5 can only identify a bias relative to the
regional base country, not a bias relative to the global base region, Eurostat-OECD.
Especially in Oman (Western Asia) and South Africa (Africa), consumption levels are
notably lower than in the UK, the Eurostat-OECD base country. This could mean that the
ring list products may not have been representative in those countries either, implying a
larger bias than we can identify in our regressions.
Accordingly, our preferred counterfactual is constructed on the basis of the adjustments we
derive for the methodological harmonization and ring country selection bias but we replace
the product selection bias estimates with those recommended by Deaton and Aten (2014). We
label this as ICP 2005C. The rest of the paper focuses on the preferred ICP 2005C
counterfactual, though for completeness we also present results for a conservative
counterfactual in the Appendix based on the regression coefficients in Tables 4 and 5.
Counterfactual for ICP 2005C based purely on sampling bias corrections
Based on the discussion and results presented in the previous section, we are in a position to
construct the counterfactual that is purely based on sampling bias corrections. The sampling
bias corrections include: (i) a 20 percent downward adjustment of Chinese consumption
prices; (ii) a downward adjustment of 6.4 percent to Asia-Pacific consumption prices to
correct for the bias due to the use of Hong Kong, Malaysia, Philippines and Sri Lanka as ring
countries instead of the full set of countries in the region; and finally (iii) a downward
adjustment for ring product selection bias of 25 percent in low-income ring countries.
38 These results suggest that the 2011 results could be adjusted rather than letting all adjustments fall on the 2005 relative prices. For the purpose of analyzing the differences between the extrapolation from 2005 and 2011, this does not matter.
24
Table 6, Differences between ICP 2011 and ICP 2005C
Counterfactual with corrections for price sampling bias
Original – Urban bias China
– Ring country selection bias
– Ring product selection bias
GDP
Mean difference 0.237*** 0.198*** 0.207*** 0.118*** Root mean squared difference 0.336 0.306 0.318 0.235
Coefficient on log(expenditure/capita)
-0.020* -0.021* -0.021* 0.010
(0.012) (0.011) (0.012) (0.010)
Consumption
Mean difference 0.254*** 0.223*** 0.238*** 0.088*** Root mean squared difference 0.349 0.327 0.346 0.196
Coefficient on log(expenditure/capita)
-0.063*** -0.064*** -0.067*** 0.000
(0.011) (0.011) (0.011) (0.009) Note: columns labeled ‘Original’ are from Table 1. Robust standard error of the regression coefficients shown in
parentheses below the coefficients. * denotes a variable significantly different from zero at a 10%-level, ** at
5%-level, *** at a 1%-level. Ring country selection bias correction is from this study and ring product selection
bias is from Deaton and Aten (2014).
The columns in Table 6 progressively correct for the differences price sampling biases. For
example, at the GDP level, adjusting for urban bias in China prices has reduced the mean
difference from 23.7 percent to 19.8 percent representing a 16 percent reduction in average
differences. The last column shows the results for the counterfactual which incorporates
corrections for all types of price sampling bias. At the GDP level the mean difference has
fallen from 0.237 to 0.118 representing, a 51 percent reduction. What is interesting is the
slope coefficient of the regression, which is close to zero and statistically insignificant. This
means that the country-specific differences do not exhibit any pattern around the mean
difference. The bottom panel of Table 6 for consumption shows that, the average difference
between the extrapolations and 2011 ICP is reduced by 66 percent from 0.254 to 0.087 and
the RMSD decreases from 0.349 to 0.195. In the next section we present the IRDA
counterfactual for 2005 which incorporates methodological harmonization and the adjustment
for price sampling biases.
4. The counterfactual for the ICP 2005C benchmark international comparisons –
implications for global inequality
This section presents what we consider is the most appropriate counterfactual for 2005 that
could be meaningfully compared to ICP 2011. The counterfactual, ICP 2005C, accounts for
all the methodological changes adopted in 2011 with the exception of the new method used
25
for construction and also accounts for price sampling biases associated with 2005 due to the
use of ring countries and prices for ring list products collected from these countries.
The first question we ask is to what extent the systematic differences between extrapolations
from ICP 2005 with ICP 2011 are reduced or eliminated when moving to ICP 2005C. We
then discuss salient features of the ICP 2005C, including the impact on the size of the world
economy, on major economies such as China and India and the effect on international income
inequality. In the Appendix, we present the counterfactual for 2005 based on the more
conservative adjustment for product selection bias based on the regression coefficients in
Tables 4 and 5.
Differences between ICP 2011 and extrapolations from ICP 2005C
In Table 7 we provide an assessment of the differences between ICP 2011 and ICP 2005C.
Figure 2 then shows a scatter plot of the differences for different countries and for GDP and
consumption, akin to Figure 1. Extrapolating from ICP 2005C leads to substantially lower
differences with ICP 2011, with the mean difference for GDP reduced from 0.237 to 0.100
(58 percent) and a large reduction from 0.254 to 0.040 (an 84 percent reduction) for
consumption. In the last column of the table we present population weighted differences. At
the GDP level, the counterfactual implies a reduction in the mean difference of 67 percent
while at the consumption level the differences are virtually eliminated. The counterfactual
has brought the ICP price comparisons a lot closer to the ICP 2011 with the remaining
differences accounted for by the usual factors that explain discrepancies between
extrapolations and benchmarks.39
The slope coefficient of the regression of differences on per capital nominal income, at the
GDP level, showed a change in sign but a reduction in absolute value with a change from -
0.020 to 0.016. However for consumption, the slope coefficient is practically eliminated from
a significant negative slope. This means that for consumption, the differences are randomly
scattered around the mean difference of 0.040. We also find that the remaining differences
are largely driven by the presence of “oil countries”. Column 3 shows that the average
difference reduced from 0.237 to 0.061 (a reduction of 74 percent) at the GDP level and
reduced by 94 percent from 0.254 to 0.014 for consumption. The root mean square error
differences are also substantially reduced for the counterfactual.
39 See McCarthy (2013b) and Inklaar and Timmer (2013) for a discussion on the discrepancies between benchmarks and extrapolations.
26
Table 7, Differences between ICP 2011 and extrapolations from ICP 2005C
counterfactual with methodological and sampling adjustments
Original Counterfactual
All countries
All countries
Non-oil countries
Developing economies
All countries, population-weighted
GDP
Mean difference 0.237*** 0.100*** 0.061*** 0.106*** 0.081***
Root mean squared difference 0.336 0.218 0.165 0.235 0.218
Coefficient on log(expenditure/capita)
-0.020* 0.016* 0.007 0.065*** -0.019
(0.012) (0.009) (0.007) (0.017) (0.015)
Consumption
Mean difference 0.254*** 0.041*** 0.015 0.045** 0.006
Root mean squared difference 0.349 0.168 0.132 0.177 0.181
Coefficient on log(expenditure/capita)
-0.063*** 0.000 -0.001 0.031** -0.019
(0.011) (0.008) (0.007) (0.014) (0.022)
Note: The first column (ICP 2005) is from 1. ICP 2005C refers to the counterfactual ICP 2005 prices that
harmonize methodology and correct for price sampling biases. The Deaton/Aten adjustment is a 25 percent
downward adjustment to African and (West) Asian consumption prices. Countries where fuel exports contribute
more than 10 percent of real GDP are labeled as ‘oil countries’ (source: Penn World Table, version 8.0).
Developing economies are low-income and middle-income countries as defined by the World Bank, so with a
GNI/capita level less than $12746. Robust standard errors of the regression coefficients shown in parentheses
below the coefficients. * denotes a variable significantly different from zero at a 10%-level, ** at 5%-level, ***
at a 1%-level.
It is also helpful to emphasize the differential roles played by methodological adjustments
(cf. Table 2) and price sampling adjustments (cf. Table 6). Comparing those results suggests
that the adjustment for the ring product selection bias has a larger effect than that resulting
from methodological adjustments. Our analysis also suggests the robustness of the use of
prices for global core list of products and the use of all the participating countries in the
linking process. However, results in Tables 3 and 4 on the estimation of ring country
selection bias, significant coefficients for the Asia-Pacific region suggests that there is need
for further fine-tuning and integration of the GCL product list in the Asia-Pacific region in
the next round of ICP.
At the GDP level the average differences are still significant at 10.0 percent and the RMSD
for GDP and consumption imply substantial country-specific differences. There a number of
possible reasons for this. First, we have not been able to implement any adjustment for the
new methodology for construction used in 2011. Second, ICP does not survey export and
import prices but uses exchange rates instead. This means that differences in terms of trade
27
are not taken into account, despite their importance in cross-country comparisons, see
Feenstra et al. (2013). McCarthy (2013b) provides a more extensive set of conceptual and
measurement reasons why two subsequent benchmarks may not align.40 Finally, ICP is a
survey of prices so measurement error, reflected in e.g. the RMSD, will always be present.
Estimates of the standard error of PPPs are in the order of 20 percent – Deaton (2012) and
Rao and Hajargasht (2014) – so the RMSD of 14–20 percent in the second column of Table 7
looks fairly good from that perspective. In that sense, these results confirm the need to
conduct future rounds of ICP since any given round can only measure relative prices across
countries with error. This is no different when estimating relative prices using information on
income per capita (i.e. the Balassa–Samuelson effect); the root mean squared error (RMSE)
of such regressions is also in the order of 20 percent, despite an R-squared of around 70
percent and highly significant coefficients.
Figure 2, Differences between extrapolating ICP 2005 counterfactual and ICP 2011
Size of the world economy and selected countries
At the time of the release of results from ICP 2005, commentaries from the economists and
researchers focused on the big reduction in the size of the world economy and of the
economies of China and India compared to the extrapolations from 1993 benchmark
40 See also e.g. Dalgaard and Sørensen (2002).
28
available at that time from the World Development Indicators and the Penn World Table 6.3.
Deaton and Heston (2010) and Deaton (2010) draw attention to the big downward adjustment
implicit in ICP. So it is useful to revisit this and place it in the context of ICP 2005 to see the
magnitudes of the differences implied by the counterfactual ICP 2005C.
Table 8, GDP per capita in 2005 in selected countries, ICP 2005 versus ICP 2005C
GDP per capita (in US$)
ICP 2005C/ICP 2005 – 1 ICP 2005 ICP 2005C
China 4124 5678 38% India 2234 2742 23% Kenya 1340 1636 22% Ethiopia 614 687 12% Brazil 8501 8175 -4%
World GDP ($trillion) 56.8 61.8 9%
At the time of release of ICP 2005 (World Bank, 2007), the general commentary focused on
the downward adjustment to the size of Chinese and Indian economies of 40 and 36 percent,
respectively, compared to what was expected at that time based on the World Development
Indicators (WDI, 2007). In Table 8 we find the ICP 2005C proposed in this paper pegs back
the downward adjustment. In the case of China, the adjustment is 38 percent bridging the 40
percent gap identified at that time whereas the counterfactual provides an upward revision of
23 percent of per capita income in India which is slightly lower than the 36 percent identified
in 2007. We find similar upward adjustments to Kenya, Ethiopia and most of the African and
Western Asian countries. In the case of Brazil, the counterfactual implies a downward
revision of nearly 4 percent. For most of the rich countries the revisions are marginal. An
immediate consequence of these revisions is a downward revision of the estimate of global
inequality in 2005 based on ICP 2005C. In Table 9 we confront our counterfactual based
extrapolations to 2011 with ICP 2011 actual and with extrapolations from ICP 2005.
The last column shows a significant reduction in the difference between the extrapolations
and actual 2011 when the counterfactual is used. However, some differences are still large for
countries like Zambia and Jordan. For India the difference reduces to 8 percent whereas the
difference goes down to –13 percent from +21 percent for China. This change is expected as
the price sampling adjustment includes a 20 percent downward adjustment to account for the
urban bias in Chinese prices in 2005.
29
Table 9, GDP per capita in 2011 for selected countries in PPP converted US dollars
Extrapolations from Actual ICP 2005 and Counterfactual ICP 2005
GDP per capita in 2011
ICP 2011 Extrapolations from ICP 2005
Difference ICP 2005 and 2011
Actual Actual Counterfactual Actual CounterfactualKenya 2136 1729 2111 0.24 0.01Zambia 3155 1745 2235 0.81 0.41India 4735 3566 4377 0.33 0.08China 10057 8335 11476 0.21 -0.12Jordan 11169 5844 6856 0.91 0.63Brazil 14639 11785 11333 0.24 0.29United Kingdom 35091 36153 37447 -0.03 -0.06Germany 40990 37729 39199 0.09 0.05United States 49782 49782 49782 0.00 0.00
Note: the column ‘ICP 2011’ shows 2011 GDP per capita as reported in World Bank (2014, Table 6.1), based
on ICP 2011 PPPs. Columns ‘ICP 2005’ shows 2011 GDP per capita based on the actual World Bank (2008)
ICP 2005 PPPs or the counterfactual 2005 PPPs developed in this paper, extrapolated to 2011 using the change
in GDP deflator in each country relative to the change in the United States. The columns ‘Difference’ are the
percentage change of the ICP 2011 column relative to the ICP 2005 columns.
International inequality
The ICP 2011 results showed a significant increase in per capita real income of low-income
countries compared to extrapolations from 2005 while there was no appreciable difference
for the high-income countries. This suggests that between-country income inequality is lower
according to ICP 2011 than based on ICP 2005. We analyze this more formally by examining
the trend in the population-weighted Gini coefficient. This provides an indication of what
Milanovic (2012) refers to as type-II international income inequality. This measure stops
short of a full global inequality measure since it does not take into account inequality within
countries, but (in contrast to type-I international income inequality) it does give greater
weight to more populous countries.41
Figure 3 shows three Gini coefficient series between 1990 and 2012: one based on ICP 2005,
one based on ICP 2011 and one based on ICP 2005C42. The most striking feature of this
figure is the consistent difference between inequality measures based on actual ICP 2005 and
2011 and the nearly indistinguishable profile of inequality from ICP 2005C and ICP 2011.
41 The Gini coefficient is related to the Lorenz curve, which plots cumulative income shares against cumulative population shares when ranking countries by their income/capita level. Specifically, the Gini coefficient is the area between the diagonal and the Lorenz curve divided by the area below the diagonal. 42 The Gini coefficients presented in this figure are presented in appendix table A3.
30
For instance, in 2011 the extrapolated Gini for GDP per capita from ICP 2005 was 0.525, the
ICP 2011 Gini was 0.484 and the ICP 2005C Gini was 0.489. Gini coefficients for
consumption per capita are similarly close when comparing ICP 2011 and ICP 2005C. This
means that from the perspective of international income and consumption inequality, our
suggested methodological and price sampling adjustments basically eliminate the differences
between the 2005 and 2011 price comparisons.
Figure 3, International inequality in income and consumption per capita,
population-weighted Gini coefficient, 1990-2012
Note: GDP per capita in PPP-converted dollars is computed in the benchmark year (2005 or 2011) and these
levels are extrapolated using growth in GDP per capita. The full set of 142 economies that participated in ICP
2005 and 2011 can be covered since 1990 and these 142 economies cover approximately 95 percent of the world
population.
5. Conclusions
The publication of the ICP 2011 results, in April 2014, immediately sparked much discussion
given the large upward revisions to GDP per capita in PPP-converted terms. The changes to
economic geography, global inequality and poverty implied by ICP 2011 are significant and
warranted an explanation. Deaton and Aten (2014) were the first to research this issue and
their findings point towards the data and methods used for linking the regions in the ICP
Actual'ICP'2005
Actual'ICP'2011
Counterfactual'ICP'2005
.5.55
.6.65
1990 1995 2000 2005 2010
GDP
.5.55
.6.65
1990 1995 2000 2005 2010
Consumption
31
2005 round. In contrast, Ravallion (2014) argues, using dynamic Penn effect models, that it is
the 2011 round that could be an issue.
The starting point for our research is the recognition that there have been major innovations
in the methodology used in the ICP 2011. These are now well documented in the recently
released Final Report for ICP 2011 (See World Bank, 2014b). Of particular note is the use of
the global core list of products and the use of all the 177 participating countries in the linking
process. There are other significant innovations including the use of the country aggregation
and redistribution (CAR) method for aggregation above the basic heading level; productivity
adjustments in dealing with government compensation and a new linking procedure for
dwellings. This raised the question as to what extent a counterfactual version of ICP 2005,
adjusted for these methodological changes, would be close to the ICP 2011 results.
The main finding is that making the price and methodology adjustments brings the
extrapolations from ICP 2005 counterfactual and the actual ICP 2011 rounds a lot closer on
average, though country level differences do remain. At the GDP level this gap, on average,
is reduced by 58 percent to 10.0 percent. The average difference in the consumption
aggregate has been reduced by 44 percent to 4 percent. While these results are broadly
reassuring, we find that the main drivers of the original differences are not the
methodological innovations but the price sampling biases. In this paper we have been able to
identify the main sources of bias induced by the use of 18 selected ring countries and the use
of the ring product list in ICP 2005. Our analysis suggests that the use of the global core list
of products and incorporating data from all participating countries in the linking process has
largely eliminated the biases associated with the linking approach used for ICP 2005.
We find that our counterfactual results imply a much lower degree of international income
and consumption inequality than the original ICP 2005 results. From a statistical point of
view our research suggests that constructing counterfactual PPPs is a useful exercise
following major methodological changes. Such counterfactuals can reduce the size of the
revisions following a new PPP release and highlight how much of the revision is due to the
methodological changes and how much due to new measurements in a different year. We
would also argue that our results are a case for more frequent price comparisons. Since a
price comparison in any year will provide results that are measured with a sizeable degree of
error, more frequent measurement will prevent outliers in any one year from distorting
international comparisons of living standards for a substantial period of time. Furthermore,
32
the introduction of new methodologies can more easily be phased in gradually if there is an
ongoing statistical infrastructure for surveying prices, again leading to fewer major revisions.
From a researcher point of view, our results can be seen as broadly comforting. Many
datasets used for cross-country analysis of economic performance rely on a single set of PPPs
to construct relative income levels, which are then extrapolated to other years using national
growth rates. Major differences across PPP sets that are systematically larger (or smaller) in
low-income countries would argue strongly against this practice and would call into question
research based on such datasets, which include the World Development Indicators, the
Maddison data and (parts of) the new Penn World Table. Our counterfactual for 2005
suggests that such revisionism is not necessary. At the same time, the large (idiosyncratic)
differences between the two benchmarks for individual countries suggests that relying on a
single benchmark for country figures is hazardous. Our counterfactual for 2005 provides a
helpful and consistent second point of reference to the new data for 2011.
33
References
Aghion, Philippe, Ufuk Akcigit and Peter Howitt (2014), “What Do We Learn From
Schumpeterian Growth Theory?” in Philippe Aghion and Steven N. Durlauf
(eds.) Handbook of Economic Growth, volume 2, Elsevier: Amsterdam, 515-563.
Almås, Ingvild (2012), “International income inequality: measuring PPP bias by estimating
Engel curves for food” American Economic Review 102(1): 1093-1117.
Anand, Sudhir and Paul Segal (2008), “What do we know about global income inequality?”
Journal of Economic Literature 46(1): 57-94.
ADB (2007), 2005 International Comparison Program in Asia and the Pacific – Purchasing
Power Parities and Real Expenditures, Asian Development Bank, Manilla, Philippines.
ADB (2014), 2011 International Comparison Program in Asia and the Pacific – Purchasing
Power Parities and Real Expenditures, Asian Development Bank, Manilla, Philippines.
Balassa, Bela (1964), “The Purchasing-Power Parity Doctrine: A Reappraisal” Journal of
Political Economy 72: 584-596.
Bolt, Jutta and Jan-Luiten van Zanden (2014), “TheMaddisonProject:collaborative
researchonhistoricalnationalaccounts”EconomicHistoryReview67(3):627‐651.
Chandy, Lawrence, and Homi Kharas (2014), “What do new price data mean for the goal of
ending extreme poverty?” Brookings Institution, May 5,
http://www.brookings.edu/blogs/up-front/posts/2014/05/05-data-extreme-poverty-
chandy-kharas.
Chen, Shaohua and Martin Ravallion (2008), “The developing world is poorer than we
thought, but no less successful in the fight against poverty” World Bank Policy
Research Working Paper no. 4703.
Chen, Shaohua and Martin Ravallion (2010a), “The developing world is poorer than we
thought, but no less successful in the fight against poverty” Quarterly Journal of
Economics 125(4): 1577-1625.
Chen, Shaohua and Martin Ravallion (2010b), “China is poorer than we thought, but no less
successful in the fight against poverty” World Bank Policy Research Working Paper
no. 4621.
Ciccone, Antonio and Marek Jarociński (2010), “Determinants of Economic Growth: Will
Data Tell?” American Economic Journal: Macroeconomics 2(4): 222-246.
34
Dalgaard, Esben and Henrik Sørensen (2002), “Consistency between PPP Benchmarks and
National Price and Volume Indices,” Paper presented at the 27th General Conference of
the International Association for Research on Income and Wealth, Sweden.
Deaton, Angus (2010), “Price indexes, inequality, and the measurement of world poverty,”
American Economic Review, 100(1): 5-34.
Deaton, Angus (2012), “Calibrating measurement uncertainty in purchasing power parity
exchange rates”, Research Program in Development Studies, mimeographed, Princeton.
Deaton, Angus, and Alan Heston (2010), “Understanding PPPs and PPP-based national
accounts,” American Economic Journal: Macroeconomics, 2:4, 1-35.
Deaton, Angus, and Olivier Dupriez (2011), “Purchasing power parity exchange rates for the
global poor” American Economic Journal: Applied Economics 3: 137-166.
Deaton, Angus and Betina Aten (2014), “Trying to Understand the PPPs in ICP2011: Why
are the Results so Different?” NBER Working Papers, no. 20244.
Diewert, W. Erwin (1976), “Exact and superlative index numbers” Journal of
Econometrics4:114‐145.
Diewert, W. Erwin (2009), “Similarity Indexes and Criteria for Spatial Linking” in D.S.
Prasada Rao (ed.) Purchasing Power Parities of Currencies: Recent Advances in
Methods and Applications, 183-216, Edward Elgar, Cheltenham, UK.
Diewert, W. Erwin (2013a), “Methods of aggregation above the basic heading level within
regions” in World Bank (ed.), Measuring the Real Size of the World Economy, Chapter
5, World Bank, Washington DC.
Diewert, W. Erwin (2013b), “Methods of Aggregation above the basic heading level: linking
the regions” in World Bank (ed.), Measuring the Real Size of the World Economy,
Chapter 6, World Bank, Washington DC.
Dykstra, Sarah, Charles Kenny, and Justin Sandefur (2014), “Global absolute poverty fell by
almost half on Tuesday,” Center for Global Development, May 2,
http://www.cgdev.org/blog/global-absolute-poverty-fell-almost-half-tuesday.
Dykstra, Sarah, Benjamin Dykstra and Justin Sandefur (2014), “We just ran twenty-three
million queries of the World Bank’s website” Center for Global Development Working
Papers, no. 362.
Feenstra, Robert C., Robert Inklaar and Marcel P. Timmer (2013), “The next generation of
the Penn World Table” NBER Working Paper no. 19255.
35
Feenstra, Robert C., Hong Ma, J. Peter Neary and D.S. Prasada Rao (2013), “Who Shrunk
China? Puzzles in the Measurement of Real GDP” Economic Journal 123(573): 1100-
1129.
Heston, Alan (2013), “Government services: productivity adjustments” in World Bank (ed.),
Measuring the Real Size of the World Economy, Chapter 16, World Bank, Washington
DC.
Inklaar, Robert (2013), “Price levels and economic growth: making sense of the revisions to
data on real incomes: a comment” Review of Income and Wealth 59(4): 614-622.
Johnson, Simon, William Larson, Chris Papgeorgiou and Arvind Subramanian (2013), “Is
newer better? Penn World Table revisions and their impact on growth estimates”
Journal of Monetary Economics 60: 255-274.
McCarthy, Paul (2013a), “Construction” in World Bank (ed.), Measuring the Real Size of the
World Economy, Chapter 12, World Bank, Washington DC.
McCarthy, Paul (2013b), “Extrapolating PPPs and Comparing ICP Benchmark Results” in
World Bank (ed.), Measuring the Real Size of the World Economy, Chapter 18, World
Bank, Washington DC.
Milanovic, Branko (2012), “Global inequality recalculated and updated: the effect of new
PPP estimates on global inequality and 2005 estimates” Journal of Economic Inequality
10(1): 1-18.
Neary, J. Peter (2004), “Rationalizing the Penn World Table: True multilateral indices for
international comparisons of real income” American Economic Review 94(5): 1411-
1428.
Pritchett, Lant and Lawrence H. Summers (2014), “Asiaphoria meets regression to the mean”
NBER Working Paper no 20573.
Rao, D.S. Prasada (2013), “Computation of basic heading PPPs for comparisons within and
between regions” in World Bank (ed.), Measuring the Real Size of the World Economy,
Chapter 4, World Bank, Washington DC.
Rao, D.S. Prasada and Gholamreza Hajargasht (2014), “ Stochastic Approach to Computation
of Purchasing Power Parities in the International Comparison Program (ICP)”
mimeographed, University of Queensland.
Ravallion, Martin, Shaohua Chen and Prem Sangraula (2009), “Dollar a day revisited” World
Bank Economic Review 23(2): 169-184.
36
Ravallion, Martin (2013), “Price levels and economic growth: making sense of the revisions
to data on real incomes” Review of Income and Wealth 59(4): 593-613.
Ravallion, Martin (2014), “An exploration of the international comparison program’s new
global economic landscape” NBER Working Paper, no. 20338.
Rogoff, Kenneth (1996), “The Purchasing Power Parity Puzzle” Journal of Economic
Literature34(2):647‐668.
Samuelson, Paul A. (1964), “Theoretical Notes on Trade Problems” Review of Economics
and Statistics 46(2): 145‐154.
Van Veelen, Matthijs (2002), “An impossibility theorem concerning multilateral international
comparison of volumes” Econometrica 70(1): 369-375.
Vogel, Frederic A. (2013), “The ring comparison: linking the regions” in World Bank (ed.),
Measuring the Real Size of the World Economy, Chapter 9, World Bank, Washington
DC.
West, Kenneth D., “Forecast Evaluation,” in Graham Elliot, Clive W. J. Granger, and Allan
Timmermann (eds), Handbook of Economic Forecasting, Volume 1, 99-134, 2006.
World Bank (2008), Global Purchasing Power Parities and Real Expenditures. 2005
International Comparison Program, World Bank, Washington DC.
World Bank (2013), Measuring the Real Size of the World Economy, World Bank,
Washington, DC.
World Bank (2014a), Purchasing Power Parities and Real Expenditure of World Economies
– Summary of Results and Findings of the 2011 International Comparison Program,
World Bank, Washington DC.
World Bank (2014b), Purchasing Power Parities and the Real Size of World Economies – A
Comprehensive Report of the 2011 International Comparison Program, World Bank,
Washington DC.
37
Appendix Tables and Figures
Table A1, Share of regions in real GDP, with partial and comprehensive productivity
adjustment of public wages
Partial Comprehensive Africa 3.36 3.38 Asia-Pacific 24.30 24.34 Eurostat/OECD 64.82 65.05 Latin America 5.57 5.26 Western Asia 1.94 1.96
Note: under partial productivity adjustment, only the publicwages in Africa, Asia‐Pacific andWestern
AsiaareadjustedfordifferencesinthecontributiontoGDPofdifferencesinthecapitalperworkerratio.
Undercomprehensiveadjustment,publicwagesinEurostat/OECDandLatinAmericaarealsoadjusted.
Table A2, Harmonizing the treatment of dwellings for linking the regions.
Africa
Asia-Pacific
Eurostat/OECD
Latin America
Western Asia
Dwellings/capita (1) 0.53 0.66 1.00 0.64 0.44 Quality (2) 0.32 0.73 1.00 0.77 0.88 Volume/capita (3) = (1) x (2) 0.17 0.48 1.00 0.49 0.39 Expenditure/capita (4) 0.03 0.16 1.00 0.10 0.28 Price index (5) = (4)/(3) 0.19 0.33 1.00 0.21 0.72 ICP 2005 price (6) 0.21 0.43 1.00 0.34 0.68 Price adjustment (7)=(5)/(6) 0.91 0.77 1.00 0.63 1.06
Note: the elements in lines (1) and (2) are based on an arithmetic unweighted average of the number of
dwellings per capita (line (1)) and of the quality characteristics (the percentage of houses with electricity, water
and a toilet; line (2). The averages are divided by the Eurostat/OECD values to arrive at the figures in the table.
Expenditure per capita (line (4)) is from the basic heading ‘actual and imputed rents’, in exchange-rate
converted US dollars, per capita. The ICP 2005 price is from Heston (2013, p333).
38
Table A3, Population-weighted Gini coefficients for GDP and consumption per capita,
1990-2012
GDP Consumption ICP 2005 ICP 2005C ICP 2011 ICP 2005 ICP 2005C ICP 2011
1990 0.640 0.613 0.611 0.659 0.622 0.6251991 0.638 0.611 0.609 0.658 0.620 0.6241992 0.635 0.608 0.605 0.654 0.615 0.6201993 0.631 0.602 0.600 0.651 0.610 0.6161994 0.628 0.599 0.596 0.651 0.610 0.6161995 0.624 0.593 0.591 0.646 0.604 0.6091996 0.619 0.588 0.586 0.640 0.597 0.6031997 0.618 0.587 0.584 0.640 0.598 0.6021998 0.617 0.585 0.584 0.641 0.597 0.6021999 0.616 0.583 0.582 0.641 0.597 0.6012000 0.614 0.582 0.580 0.641 0.598 0.6012001 0.609 0.576 0.574 0.634 0.591 0.5942002 0.603 0.570 0.568 0.631 0.588 0.5902003 0.597 0.563 0.561 0.627 0.583 0.5852004 0.590 0.556 0.554 0.624 0.579 0.5822005 0.583 0.548 0.545 0.618 0.573 0.5762006 0.574 0.539 0.536 0.613 0.568 0.5702007 0.564 0.528 0.525 0.600 0.553 0.5572008 0.555 0.519 0.516 0.592 0.545 0.5482009 0.539 0.502 0.499 0.577 0.528 0.5322010 0.531 0.494 0.491 0.570 0.522 0.5252011 0.525 0.489 0.484 0.562 0.514 0.5162012 0.520 0.485 0.480 0.557 0.509 0.510
39
Appendix Figure A 1, Lorenz curves for ICP 2011, and extrapolations from ICP 2005
and ICP 2005C
40
APPENDIX: AN ALTERNATIVE COUNTERFACTUAL FOR ICP 2005
In the main text of the paper we presented a counterfactual for ICP 2005 which adjusts for the
methodological innovations in ICP 2011 and also address concerns on possible ring country
selection and ring product price biases arising out of the use of 18 selected ring countries for
the purpose of linking regional comparisons. This appendix describes an alternative which
may be described as a conservative counterfactual. As discussed in the main text, we use
detailed item-level prices to establish the presence of product selection bias in 2005 (for
Africa, Asia-Pacific and Western Asia) and in 2011 (for Asia). In our preferred
counterfactual, ICP 2005C, we used evidence from Deaton and Aten (2014) on changes in
relative prices for the 2005 ring countries compared with national inflation trends in making
adjustments for product selection bias in 2005. We believe this combination of cross-country
prices and national price trends allows for a more comprehensive estimate of the product
selection bias.
The alternative, which we explore here, would be to directly use the biases implied by the
regression coefficients reported in Tables 4 and 5. This approach leads to downward
adjustments of 23.1 percent in the Asia-Pacific; 9.4 percent in Africa and 6.6 percent in
Western Asia. In both the approach and results, these adjustments are more conservative than
the 25 percent adjustment in all three regions suggested in the Deaton-Aten analysis. As
before, these adjustments are implemented only for the low-income countries in the region.
We now consider the features of this alternative counterfactual (ICP 2005C2) compared to
our preferred counterfactual (ICP 2005C).
Table A4, Differences between ICP 2005 and the two counterfactuals and ICP 2011
ICP 2005 ICP 2005C ICP 2005C2 GDP Mean difference 0.237*** 0.100*** 0.149*** Root mean squared difference 0.336 0.218 0.256 Coefficient on log(expenditure/capita)
-0.020* 0.016* -0.003 (0.012) (0.009) (0.010)
Consumption Mean difference 0.254*** 0.040*** 0.116*** Root mean squared difference 0.349 0.168 0.231 Coefficient on log(expenditure/capita)
-0.063*** 0.000 -0.038*** (0.011) (0.008) (0.009)
Note: The first column (ICP 2005) is from Table 1, the second column (ICP 2005C) is from Table 9, and the third column (ICP 2005C2) is based on the more conservative counterfactual discussed in this appendix. Robust
41
standard errors of the regression coefficients shown in parentheses below the coefficients. * denotes a variable significantly different from zero at a 10%-level, ** at 5%-level, *** at a 1%-level. As shown in Table A4, the smaller adjustments for product selection bias in our more
conservative counterfactual translates to larger mean differences and root mean squared
differences compared to our preferred counterfactual. However, compared with the original
ICP 2005 differences, the reduction is still large. A similar result can be seen in Table A5,
where the population-weighted Gini coefficients for the year 2011 are compared. By this
measure, inequality according to ICP 2011 and our preferred counterfactual are very similar,
as discussed in the main text. Inequality according to the alternative counterfactual is clearly
higher than in these other two cases, but still notably closer to the ICP 2011 Gini coefficient
than to the original ICP 2005 Gini.
Table A5, Population-weighted Gini coefficients for 2011 based on alternative PPs
GDP ConsumptionICP 2005 0.525 0.562ICP 2005C 0.488 0.513ICP 2005C2 0.495 0.526ICP 2011 0.484 0.516