the rise of the machines: automation, horizontal ... olsen rise of the... · the rise of the...

The Rise of the Machines: Automation, Horizontal

Innovation and Income Inequality

David Hemous and Morten Olsen ∗

March 2018 (first draft: September 2013)

Abstract

We construct an endogenous growth model with automation and horizontal in-

novation in an economy with low- and high-skill workers. Automation enables the

replacement of low-skill workers with machines, increasing the skill premium and

possibly decreasing low-skill wages. Horizontal innovation increases both wages.

Higher low-skill wages increase incentives to automate so that automation plays a

bigger role as an economy develops. Our model is consistent with a permanently

increasing skill premium, a temporary drop in low-skill wages and a drop in the

labor share. We calibrate it and show that taxing automation innovation reduces

low-skill wages in the long run.

JEL: O41, O31, O33, E23, E25.

KEYWORDS: Endogenous growth, automation, horizontal innovation, directed tech-

nical change, income inequality.

∗David Hemous, University of Zurich and CEPR, [email protected], Morten Olsen, Uni-versity of Copenhagen, [email protected]. Morten Olsen gratefully acknowledges the financial supportof the European Commission under the Marie Curie Research Fellowship program (Grant AgreementPCIG11-GA-2012-321693) and the Spanish Ministry of Economy and Competitiveness (Project ref:ECO2012-38134). We thank Daron Acemoglu, Philippe Aghion, Ufuk Akcigit, Pol Antras, TobiasBroer, Steve Cicala, Per Krusell, Brent Neiman, Jennifer Page, Andrei Shleifer, Che-Lin Su, FabrizioZilibotti and Joachim Voth among others for helpful comments and suggestions. We also thank seminarand conference participants at IIES, University of Copenhagen, Warwick, UCSD, UCLA Anderson,USC Marshall, Barcelona GSE Summer Forum, the 6th Joint Macro Workshop at Banque de France,Chicago Harris, the 2014 SED meeting, the NBER Summer Institute, the 2014 EEA meeting, EcolePolytechnique, the University of Zurich, NUS, London School of Economics, the 2015 World Congressof the Econometric Society, ECARES, Columbia University, EIEF, Brown University, Boston Univer-sity, Yale University, College de France and Washington University. We thank Ria Ivandic and MartonVarga for excellent research assistance.

1 Introduction

How does the automation of production drive economic growth and affect the distri-

bution of income? Conversely, how do wages shape technological progress? Developed

economies have seen dramatic changes in the income distribution which are often at-

tributed to skill-biased technical change, notably automation. By allowing for the use of

machines in some tasks, automation increases economic output, but also reduces the de-

mand for labor in those tasks. As the range of tasks performed by machines is expanding,

the general public increasingly worries about the negative consequences of technological

progress. Yet, economists often argue that technological development also creates new

products and tasks, which boost the demand for labor: certainly many of today’s jobs

did not exist just a few decades ago.1 Surprisingly, the economics literature lacks a

dynamic framework to analyze the interaction between automation and the creation of

new products or tasks. This paper provides the first model to do so.

We build our model to be consistent with three stylized facts of the evolution of the

income distribution in the United States over the past 50 years (shown in Figure 1).

Since 1963, the college premium (a proxy for the skill premium) has increased by 31%

even though the relative skill supply increased substantially. Wages at the bottom of the

income distribution have stagnated with accumulated growth between 1963 and 2007 of

13% for non-college educated workers. Finally, over the same period the labor share

has declined. As a result, our model does not feature balanced growth: as the economy

develops, labor income inequality increases and the labor share declines.

Of course, a large literature exists that relates exogenous technical change to the

income distribution (e.g. Goldin and Katz, 2008, and Krusell, Ohanian, Rıos-Rull and

Violante, 2002). Previous attempts at endogenizing the direction of technical change

rely on factor-augmentation and exogenous shocks to the skill supply (Acemoglu, 1998)

and feature balanced growth. The novelty of our approach is that we present an en-

dogenous growth version of a task framework in the vein of Autor, Levy and Murnane

(2003) and Acemoglu and Autor (2011), in which the direction of innovation evolves en-

dogenously. Endogenizing technological change matters, not only because it is consistent

with empirical evidence, but also, as we show, because the response of an economy with

endogenous technological change to policy interventions (such as taxes on automation

or on machines) differs from that of an economy with exogenous technological progress.

1For instance, the introduction of the telephone led to the creation of new jobs. In 1970 there were421 000 switchboard operators in the United States. This occupation has largely been automated today.

1

1960 1970 1980 1990 2000 2010

Years

1

1.2

1.4

1.6

1.8

2

2.2

Ski

ll-P

rem

ium

(ra

tio)

Panel A - Composition-adjustedcollege/non-college weekly wage ratio

1960 1970 1980 1990 2000 2010

Years

50

55

60

65

70

75

80

Per

cen

t

Panel B - Labor share of GDP

1960 1970 1980 1990 2000 2010

Years

80

90

100

110

120

130

140

(inde

x 19

63=

100)

Panel C - Composition-adjustedreal weekly wage for non-college wages

Figure 1: The US skill-premium, labor share and real wage growth for low-skill workers.Panel A and C are taken from Acemoglu and Autor (2011), college educated work-ers correspond to those with a college degree and half of those with some college,non-college educated workers are the rest. Panel B is from Koh, Santaeulalia-Llopisand Zheng (2016). See further details in Section 4.

Formally, we consider an expanding variety growth model with low-skill and high-

skill workers. Horizontal innovation, modeled as in Romer (1990), increases the demand

for both low- and high-skill workers. Automation allows for the replacement of low-skill

workers with machines in production and takes the form of a secondary innovation in

existing product lines.2 Within a firm, automation increases the demand for high-skill

workers but reduces the demand for low-skill workers. “Non-automated” products only

use low-skill and high-skill labor. Once invented, a specific machine is produced with

the same technology as a consumption good.

We initially take technical progress as given and study the effect of technological

change on wages. An increase in the number of products increases all wages, while

an increase in automation both increases the overall productivity of the economy and

allows for the substitution away from low-skill workers, resulting in an ambiguous net

effect on low-skill wages. Yet, we show that for very general processes of horizontal and

automation innovation, the asymptotic growth rate of low-skill wages must be positive,

albeit strictly lower than that of high-skill wages. Therefore, the introduction of new

(non-automated) products is not sufficient to guarantee balanced growth.

We then endogenize innovation. Low-skill wages play a key role in determining

technological change: as they increase the cost advantage of an automated over a non-

automated firm increases and the incentive to automate is not constant. Instead, an

2Secondary innovations in a growth model were introduced by Aghion and Howitt (1996) who studythe interplay between applied and fundamental research.

2

economy with an initially low level of technology first goes through a phase where growth

is mostly generated by horizontal innovation and the skill premium and the labor share

are constant. Only when low-skill wages are sufficiently high, do firms invest in au-

tomation. During this second phase, the share of automated products increases, the

skill premium rises, the labor share drops and low-skill wages might temporarily de-

crease—replicating the stylized facts of Figure 1. Finally, the economy moves towards

its asymptotic steady state. The share of automated products stabilizes as the entry

of new, non-automated products compensates for the automation of existing ones. The

skill-premium keeps rising but more slowly. The economy will then have endogenously

shifted from a Cobb-Douglas aggregate production function to a nested CES.

Finally, we extend our baseline model to include a capital stock and calibrate it

to match the evolution of the skill premium, the labor share, productivity and the

equipment to GDP ratio the 1960s. As is common in the literature and only for this

exercise, we identify skill groups with education groups, such that high-skill workers

correspond to college-educated workers. Our model is able to reproduce the trends in the

data quantitatively. In particular, the impact of automation on low-skill wages and the

skill-premium decreases in the 90s and 2000s even though expenditures on automation

remain nearly constant; a response to a point of critique of the skill-biased technological

change hypothesis put forward by Card and DiNardo (2002). We use these parameters

to discipline our model for policy experiments. We find that a tax on machine has a

positive effect on low-skill wages even more so due to the endogeneity of technology. In

addition, a tax on automation innovation initially increases low-skill wages but ends up

having a negative impact after a few years.

Our modeling of automation as high-skill-biased is motivated by a large empiri-

cal literature. Autor, Katz and Krueger (1998) and Autor, Levy and Murnane (2003)

demonstrate that computerization is associated with relative shifts in demand for college-

educated workers. Bartel, Ichniowski and Shaw (2007) present similar evidence at the

firm level. Autor, Katz, and Kearney (2006, 2008) and Autor and Dorn (2013) show

that the more recent phenomenon of wage and job polarization, the relative decline of

wages and employment in the middle of the income distribution, can be explained by

the computer-driven automation of routine tasks often performed by middle-skill work-

ers (storing, processing and retrieving information).3 Graetz and Michaels (2018) and

Acemoglu and Restrepo (2017b) find that the introduction of industrial robots leads to

3See also Spitz-Oener (2006) and Goos, Manning and Salomons (2009) for job polarization in Europe,and Feng and Graetz (2016) for a model of why automation targets middle-skill workers.

3

a reduction in the demand for (mostly) low-skill workers. For our purpose, we will not

distinguish between low- and middle-skill workers since both have often performed tasks

which have later on been automated.4

The idea that high wages might incentivize technological progress in the form of au-

tomation dates back to Habakkuk (1962). The associated empirical literature is smaller,

but Lewis (2001) finds that low-skill immigration slows down the adoption of automation

technology and Hornbeck and Naidu (2014) find that the emigration of black workers

from the American South favored the adoption of modern agricultural production tech-

niques there. A large literature shows that the direction of innovation is endogenous in

other contexts (see Newell, Jaffe and Stavins, 1999, or Aghion, Dechezlepretre, Hemous,

Martin and Van Reenen, 2016).

There is a small theoretical literature on labor-replacing technology. In Zeira (1998),

exogenous increases in TFP raise wages and encourage the adoption of a capital-intensive

technology analogous to automation in this paper. Acemoglu (2010) shows that labor

scarcity induces labor-saving innovation. Neither paper analyzes labor-saving innovation

in a fully dynamic model nor focuses on income inequality. Peretto and Seater (2013)

build a dynamic model where innovation allows firms to replace labor with capital. Since

wages are constant over time, so is the incentive to automate. In addition, they do not

focus on income inequality. In subsequent work, Acemoglu and Restrepo (2017a) also

develop a growth model where technical change involves automation and the creation of

new tasks. Automation plays a similar role in both papers (although in their baseline

version, there is only one type of labor). Yet, while in our model all tasks are symmetric

(except for whether they are automated), in theirs, new tasks are exogenously born with

a higher labor productivity. As a result, their model features a balanced growth path

and their focus is on the self-correcting elements of the economy after a technological

shock. In contrast, our model does not feature a balanced growth path and we focus on

accounting for secular trends such as the rise in the skill premium.5

A large literature argues that skill-biased technical change can explain the increase in

the relative demand for skilled workers since the 1970’s. This literature can be divided

into three strands. The first emphasizes Nelson and Phelps (1966)’s hypothesis that

more skilled workers are better able to adapt to technological change (see Lloyd-Ellis,

4A previous version of our paper, Hemous and Olsen (2016) presented an extension of our modelseparating low- and middle- skill workers.

5Benzell, Kotlikoff, LaGarda and Sachs (2017), following Sachs and Kotlikoff (2012) build a modelwhere a code-capital stock can substitute for labor, and show that a technological shock which favorsthe accumulation of code-capital can lead to lower long-run GDP.

4

1999, Caselli, 1999, Galor and Moav, 2000,, Aghion, Howitt and Violante, 2002, and

Beaudry and Green, 2005).6 While such theories mostly explain transitory increases in

inequality, our model features permanent and widening inequality. Yet, we borrow the

idea of a shift in production technology spreading through the economy.

A second strand sees the complementarity between capital and skill as the source for

the increase in the skill premium. Krusell, Ohanian, Rıos-Rull and Violante (2000) find

that the observed increase in the stock of capital equipment can account for most of the

variation in the skill premium. Our model also features capital-skill complementarity but

our focus is different since we seek to explain why innovation has been directed towards

automation and analyze the interactions between automation and horizontal innovation.

Finally, a third branch of the literature, building on Katz and Murphy (1992), consid-

ers technology to be either high-skill or low-skill labor augmenting. Using this framework,

Goldin and Katz (2008) find that technical change has been skill-biased throughout the

20th century in the United States (Katz and Margo, 2014, argue that the relative demand

for white-collar workers has been increasing since 1820). Further, the directed technical

change literature (Acemoglu, 1998, 2002 and 2007) endogenizes the bias of technology.

Such models of factor-augmenting technical change deliver important insights about

inequality and technical change, but have no role for labor-replacing technology and

therefore cannot generate declines in low-skill wages (a point emphasized in Acemoglu

and Autor, 2011). Our model is also a directed technical change framework but de-

viates from the assumption of factor-augmenting technologies. It explicitly allows for

labor-replacing automation, generating the possibility for (temporary) absolute losses

for low-skill workers, and permanently increasing income inequality.

Section 2 describes the baseline model with exogenous technology and shows the

effects of technological change on wages. Section 3 endogenizes the path of technology

and describes the evolution of the economy. Section 4 calibrates an extended version

of the model and conducts policy exercises. Section 5 concludes. Appendix 6 presents

the main extensions of our model and discusses the identification of our parameters.

Appendix 7 presents proofs, other extensions and details the calibration exercise.

6Relatedly, Beaudry, Green and Sand (2016) model the IT revolution as an exogenous increase inthe demand for organizational capital, which is built by cognitive labor. Once the capital stock reachesits steady-state, the demand for cognitive tasks decreases. We reproduce a somewhat similar patternbut in the growth rate of the skill premium instead of the level of demand for cognitive tasks and withan endogenous increase in the benefits from automation.

5

2 A Baseline Model with Exogenous Innovation

This section presents a baseline model with exogenous technology to study the conse-

quences of automation and horizontal innovation on factor prices. Section 2.3 derives

comparative statics results and section 2.4 the asymptotic behavior of wage for general

paths of technology. We discuss some of our modeling assumptions in section 2.5.

2.1 Preferences and production

We consider a continuous time infinite-horizon economy populated by H high-skill and

L low-skill workers. Both types of workers supply labor inelastically and have identical

preferences over a single final good of:

Uk,t =

∫ ∞t

e−ρ(τ−t)C1−θk,τ

1− θdτ,

where ρ is the discount rate, θ ≥ 1 is the inverse elasticity of intertemporal substitution

and Ck,t is consumption of the final good at time t by group k ∈ H,L. The final good

is produced by a competitive industry combining a set of intermediate inputs, i ∈ [0, Nt]

using a CES aggregator:

Yt =

(∫ Nt

0

yt(i)σ−1σ di

) σσ−1

,

where σ > 1 is the elasticity of substitution between these inputs and yt(i) is the use

of intermediate input i at time t. As in Romer (1990), an increase in Nt represents

a source of technological progress. Throughout the paper, we use interchangeably the

terms “intermediate input” and “product”.

We normalize the price of Yt to 1 at all points in time and drop time subscripts when

there is no ambiguity. The demand for each intermediate input i is:

y(i) = p(i)−σY, (1)

where p(i) is the price of intermediate input i and the normalization implies that the

ideal price index, [∫ N

0p(i)1−σdi]1/(1−σ) equals 1.

Each intermediate input is produced by a monopolist who owns the perpetual rights

of production. She can produce the intermediate input by combining low-skill labor, l(i),

6

high-skill labor, h(i), and, possibly, type-i machines, x(i), using the production function:

y(i) =[l(i)

ε−1ε + α(i) (ϕx (i))

ε−1ε

] εβε−1

h(i)1−β, (2)

where α(i) ∈ 0, 1 is an indicator function for whether or not the monopolist has access

to an automation technology which allows for the use of machines. If the product is not

automated (α(i) = 0), production takes place using a Cobb-Douglas production function

with only low-skill and high-skill labor and a low-skill factor share of β. If the product

is automated (α(i) = 1) machines can be used in the production process. We allow

for perfect substitutability, in which case ε = ∞ and the production function is y(i) =

[l(i) + α(i)ϕx (i)]β h(i)1−β. The parameter ϕ is the relative productivity advantage of

machines over low-skill workers and G denotes the share of automated products.

Since each input is produced by a single firm, we identify each input with its firm

and refer to a firm which uses an automated production process as an automated firm.

We refer to the specific labor inputs provided by high-skill and low-skill workers in the

production of different inputs as “different tasks” performed by these workers, so that

each product comes with its own tasks. It is because α(i) is not fixed, but can change

over time, that our model captures the notion that machines can replace low-skill labor

in new tasks. A model with a fixed α(i) for each product would only allow for machines

to be used more intensively in production, but always for the same tasks. Although,

we will refer to x as “machines”, our interpretation also includes any form of computer

inputs, algorithms, the services of cloud-providers, etc.

For now, machines are an intermediate input. Once invented, machines of type i

are produced competitively one for one with the final good, such that the price of an

existing machine for an automated firm is always equal to 1 and technological progress

in machine production follows that in the rest of the economy. Yet, our model can

capture the notion of a decline in the real cost of equipment, as automation for firm i

can equivalently be interpreted as a decline of the price of machines i from infinity to 1.

2.2 Equilibrium wages

In this section we derive how wages are determined in equilibrium, taking as given the

technological levels N (the number of products), G (the share of automated products)

and the employment of high-skill workers in production, HP ≡∫ N

0h(i)di (we letHP ≤ H

to accommodate later sections where high-skill labor is used to innovate).

7

First, note that all automated firms are symmetric and therefore behave in the sameway. Similarly all non-automated firms are symmetric. This gives aggregate output of:

Y = N1

σ−1×

(1−G)1σ

((LNA)β(HP,NA)1−β︸︷︷︸

)T1

σ−1σ

+G1σ

([(LA)

ε−1ε + (ϕX)

ε−1ε ]

εβε−1 (HP,A)1−β︸︷︷︸

)σ−1σ

T2

σσ−1

,

(3)

where LA (respectively LNA) is the total mass of low-skill workers in automated (re-

spectively non-automated) firms, HP,A (respectively HP,NA) is the total mass of high-

skill workers hired in production in automated (respectively non-automated) firms and

X =∫ N

0x(i)di is total use of machines. The aggregate production function takes the

form of a nested CES between two sub-production functions. The first term T1 cap-

tures the classic case where production takes place with constant shares between factors

(low-skill and high-skill labor), while the second term T2 represents the factors used

within automated products and features the substitutability between low-skill labor and

machines. G is the share parameter of the “automated” products nest and therefore an

increase in G is T2-biased (as σ > 1). N1

σ−1 is a TFP parameter. Besides the functional

form the aggregate production function (3) differs from the often assumed aggregate

CES production function in two ways: First, instead of the often-used factor-augmenting

technical in an aggregate production function, we explicitly model automation of tasks

and derive the aggregate production function from the cost function of individual firms.

Second, with an endogenous G we will be able to capture effects that the usual focus on

an exogenous aggregate production function cannot.7

The unit cost of intermediate input i is given by:

c(wL, wH , α(i)) = β−β(1− β)−(1−β)(w1−εL + ϕα(i))

β1−εw1−β

H , (4)

where ϕ ≡ ϕε, wL denotes low-skill wages and wH high-skill wages. When ε < ∞,

c(·) is strictly increasing in both wL and wH and c(wL, wH , 1) < c(wL, wH , 0) for all

wL, wH > 0 (automation reduces costs). Price is set as a markup over costs: p(i) =

σ/(σ − 1) · c(wL, wH , α(i)). Using Shepard’s lemma and equations (1) and (4) delivers

7At the firm level, this model features an elasticity of substitution between high-skill labor and ma-chines equal to that between high-skill and low-skill labor. This, however, does not hold at the aggregatelevel, consistent with Krusell et al. (2000), who argue that the aggregate elasticity of substitution be-tween high-skill and low-skill labor is greater than that between high-skill labor and machines.

8

the demand for low-skill labor of a single firm.

l(wL, wH , α(i)) = βw−εL

w1−εL + ϕα(i)

(σ − 1

σ

)σc(wL, wH , α(i))1−σY, (5)

which decreases in wL and wH . The effect of automation on demand for low-skill labor in

a firm is generally ambiguous. This is due to the combination of a negative substitution

effect (automation allows for substitution between machines and low-skill workers) and a

positive scale effect (automation decreases costs, lowers prices and increases production).

As we focus on labor-substituting innovation, we impose the condition ε > 1 +β (σ − 1)

throughout the paper which is necessary and sufficient for the substitution effect to

dominate and ensures l(wL, wH , 1) < l(wL, wH , 0) for all wL, wH > 0.

Let x (wL, wH) denote the use of machines by an automated firm. The relative use

of machines and low-skill labor for such a firm is then:

x(wL, wH)/l(wL, wH , 1) = ϕwεL, (6)

which increases in wL as the wage is also the price of low-skill labor relative to machines.

The iso-elastic demand (1), coupled with constant mark-up σ/(σ − 1), implies that

revenues are given by R(wL, wH , α(i)) = ((σ − 1) /σ)σ−1 c(wL, wH , α(i))1−σY and profits

are a fixed share of revenue: π(wL, wH , α(i)) = R(wL, wH , α(i))/σ. We define µ ≡β(σ − 1)/(ε − 1) < 1 (by our assumption on ε). Using (4), the relative revenues (and

profits) of non-automated and automated firms are given by:

R(wL, wH , 0)/R(wL, wH , 1) = π(wL, wH , 0)/π(wL, wH , 1) = (1 + ϕwε−1L )−µ, (7)

which is a decreasing function of w. As non-automated firms rely more heavily on

low-skill labor, their relative market share drops with higher low-skill wages.

Since firms’ profits are a constant share of firms’ revenues, aggregate profits are a

constant share 1/σ of output Y . Similarly, the share of firms’ revenues accruing to high-

skill labor in production is the same for all firms and given by νh = (1 − β)(σ − 1)/σ.

Therefore payment to high-skill labor in production is a constant share of output:

wHH = (1− β)σ − 1

σN [GR(wL, wH , 1) + (1−G)R(wL, wH , 0)] = (1− β)

σ − 1

σY. (8)

Using factor demand functions, the share of revenues accruing to low-skill labor is given

by νl(wL, wH , α(i)) = σ−1σβ(1 + ϕwε−1

L α(i))−1

, and decreases with automation. Using

9

labor market clearing (∫ N

0l(i)di = L), we obtain total wages of low-skill workers as:

wLL = N [GR(wL, wH , 1)νl(wL, wH , 1) + (1−G)R(wL, wH , 0)νl(wL, wH , 0)] . (9)

Equations (7), (8) and (9) give the high-skill to low-skill labor share in production as:8

wHHP

wLL=

1− ββ

G+ (1−G)(1 + ϕwε−1L )−µ

G(1 + ϕwε−1

L

)−1+ (1−G)(1 + ϕwε−1

L )−µ. (10)

This expression gives the relative demand curve for high-skill and low-skill labor. We

represent this relationship for given technology levels and factor supply ratio L/HP by

a curve in the (wL, wH) space in Figure 2. For G = 0, the curve is a straight line, with

slope (1−β)L/(βHP ), reflecting the constant factor shares in a Cobb-Douglas economy.

For G > 0, the right-hand side of (10) increases in w, so that the relative demand

curve is non-homothetic and rotates counter-clockwise as wL grows. Intuitively, higher

low-skill wages both induce more substitution towards machines in automated firms (as

reflected by the term (1 + ϕwε−1L )−1 in equation (10)) and improve the cost-advantage

and therefore the market share of automated firms (term (1 + ϕwε−1L )−µ ) and increases

the ratio of high-skill to low-skill labor share in production. This immediately implies

that in the long-run low-skill wages and high-skill wages cannot grow at the same rate,

i.e. we cannot have positive balanced growth.

Figure 2: Relative demand curve and isocost curve for different values of N and G.

8When ε = ∞, the skill premium is given by wHwL

= 1−ββ

LHP

if wL < ϕ−1 such that no firm uses

machines, and wHwL

= 1−ββ

LHP

G+(1−G)(ϕwL)−1

(1−G)(ϕwL)−1 if wL > ϕ−1.

10

With constant mark-ups, the cost equation (4) and the price normalization give:

σ

σ − 1

N1

1−σ

ββ (1− β)1−β

(G(ϕ+ w1−ε

L

)µ+ (1−G)w

β(1−σ)L

) 11−σ

w1−βH = 1. (11)

This relationship defines the unit isocost curve in figure 2. It shows the positive re-

lationship between real wages and the level of technology given by N , the number of

intermediate inputs, and G the share of automated firms. Together (10) and (11) deter-

mine real wages uniquely as a function of N,G and HP .

Given the amount of resources devoted to production (L,HP ), the static equilibrium

is closed by the final good market clearing condition:

Y = C +X (12)

where C = CL + CH is total consumption. GDP includes the payment to labor and

aggregate profits so GDP and the total labor share become

GDP ≡ 1

σY + wLL+ wHH, LS = 1− 1

1 + (σ − 1) (1− β)(

wLLwHHP + H

HP

) , (13)

where the second equality uses (8). Therefore, the labor share decreases in the skill

premium for a given mass of high-skill workers in production HP .

2.3 Technological change and wages

The consequences of technological changes on the level of wages are most easily seen

with the help of Figure 2. An increase in the number of products, N , pushes out the

isocost curve and increases both low-skill and high-skill wages. When G = 0, both types

of wages grow at the same rate as the relative demand curve is a straight line, but for

G > 0, the demand curve is non-homothetic and the skill premium grows. Therefore,

an increase in N at constant G is high-skill biased.

An increase in the share of automated products G has a positive effect on high-skill

wages and the skill premium but an ambiguous effect on low-skill wages: Higher au-

tomation increases the productive capability of the economy and pushes out the isocost

curve (an aggregate productivity effect), which increases low-skill wages, but it also al-

lows for easier substitution away from low-skill labor which pivots the relative demand

curve counter-clockwise (an aggregate substitution effect), decreasing low-skill wages.

11

Therefore automation is always high-skill labor biased (wH/wL increases) but low-skill

labor saving (wL decreases) if and only if the aggregate substitution effect dominates

the aggregate productivity effect. Formally, one can show (proof in Appendix 7.1):9

Proposition 1. Consider the equilibrium (wL, wH) determined by equations (10) and

(11). Assume that ε <∞, it holds that

A) An increase in the number of products N (keeping G and HP constant) leads to

an increase in both high-skill (wH) and low-skill wages (wL). Provided that G > 0, an

increase in N also increases the skill premium wH/wL and decreases the labor share.

B) An increase in the share of automated products G (keeping N and HP constant)

increases the high-skill wages wH , the skill premium wH/wL and decreases the labor

share. Its impact on low-skill wages is generally ambiguous, but low-skill wages are

decreasing in G if i) 1 ≤ (σ − 1)(1− β) or if ii) N and G are high enough.

Automation is low-skill labor saving if (1− β) (σ − 1) ≥ 1. The aggregate substitu-

tion effect is larger than the scale effect when i) σ is large (as then the increased demand

for the newly automated product does not lead to a large increase in demand for other

products). ii) when the cost share of the low-skill labor-machines aggregate β is small (as

the cost-saving effect of automation is smaller in that case). iii) when G is high as most

of the aggregate productivity gains are realized for low G and for high G the automation

of one more firm hurts low-skill workers more, as there are fewer non-automated firms

left. iv) when N is large and the higher wages make the substitution effect stronger.

One could also consider the effect of an increase in the number of non-automated

products (that is an increase in N keeping GN constant), which corresponds to the

“horizontal innovation” introduced in section 3. Such technological change pushes out

the iso-cost curve but also makes the relative demand curve rotate clockwise. This

increases both low-skill and high-skill wages (see proof in Appendix 7.1). In addition,

for N large enough or ε < σ, the rotation of the relative demand curve is sufficiently

strong for this form of technical change to be low-skill labor biased.10

In section 3, when we specify the innovation process, we will show that as the number

of products N increases, the share of automated products endogenously increases from

9In the perfect substitute case, ε = ∞, wH increases in N and weakly increases in G, wH/wLweakly increases in N and G and wL weakly increases in N and weakly decreases in G provided that1/ (1− β) ≤ σ − 1 or G is large enough. When ε = ∞ and G = 1, the isocost curve has a horizontalarm and the relative demand curve a vertical one.

10In the perfect substitute case, an increase in the number of non-automated products increases wH ,weakly increases wL and, if G < 1 and N is large enough, decreases the skill premium.

12

an initial level close to 0. As a result, growth will progressively become unbalanced

with a rising skill premium (and accordingly a decline in the labor share), and for some

parameter values, low-skill wages will temporarily decline.

2.4 Asymptotics for general technological processes

We study the asymptotic behavior of the model for given paths of technologies and mass

of high-skill workers in production. For any variable at (such as Nt), we let gat ≡ at/at

denote its growth rate and ga∞ = limt→∞gat if it exists. In Appendix 7.2.1 we derive:

Proposition 2. Consider three processes [Nt]∞t=0, [Gt]

∞t=0 and [HP

t ]∞t=0 where (Nt, Gt, HPt ) ∈

(0,∞) × [0, 1] × (0, H] for all t. Assume that Gt, gNt and HP

t all admit limits G∞, gN∞

and HP∞ with gN∞ > 0 and HP

∞ > 0.

A) If G∞ ∈ (0, 1), the asymptotic growth of high-skill wages wHt and output Yt are:

gwH∞ = gY∞ = gN∞/ ((1− β)(σ − 1)) , (14)

and the asymptotic growth rate of wLt is given by

gwL∞ = gY∞/ (1 + β(σ − 1)) . (15)

B). If G∞ = 1, the asymptotic growth rates of wHt and Yt also obey (14). If Gt

converges sufficiently fast (so that limt→∞

(1−Gt)Nψ(1−µ) ε−1

εt exists and is finite) then :

-i) If ε <∞ the asymptotic growth of wLt is positive at :

gwL∞ = gY∞/ε. (16)

-ii) If low-skill workers and machines are perfect substitute then limt→∞

wLt is finite and

weakly greater than ϕ−1 (equal to ϕ−1 when limt→∞

(1−Gt)Nψt = 0).

C) If G∞ = 0 and Gt converges sufficiently fast (so that limt→∞

GtNβt exists and is

finite), then the asymptotic growth rates of wLt, wHt and Yt obey:

gwL∞ = gwH∞ = gY∞ = gN∞/(σ − 1). (17)

This proposition first relates the growth rate of output and high-skill wages to the

growth rate of the number of products. Without automation, that is if Gt converges to 0

sufficiently fast, Yt is proportional to N1/(σ−1)t as in a standard expanding-variety model.

13

Automation introduces machines as an additional reproducible input such that a higher

level of productivity leads to a higher supply of machines further increasing output when

G∞ > 0. This multiplier effect is increasing in the asymptotic share of machines, β.

Second, with positive growth in Nt, mild assumptions are sufficient for asymptotic

positive growth in low-skill wages. wLt only remains bounded when there is economy-

wide perfect substitution, i.e. low-skill workers and machines are perfect substitutes,

ε = ∞, and all products are automated asymptotically (Gt converges to 1 sufficiently

fast). Even then low-skill wages are bounded below by ϕ−1, as a lower wage would

imply that no firm would use machines. In general, the processes of Nt and Gt depend

on the rate at which new products are introduced, the extent to which they are initially

automated, and the rate of automation. As long as new non-automated products are

continuously introduced, and the intensity at which non-automated firms are automated

is bounded, the share of non-automated products is always positive, i.e. G∞ < 1 (see

proof in Appendix 7.2.2). This ensures that there is no economy-wide perfect substitu-

tion between low-skill workers and machines.

With aggregate imperfect substitution (because G∞ < 1 or ε <∞), a growing stock

of machines and a fixed supply of low-skill labor imply that the relative price of a worker

(wLt) to a machine (pxt ) must grow at a positive rate. Since machines are produced

with the same technology as the consumption good, pxt = pCt , where pCt is the price

of the consumption good (1 with our normalization), the real wage wLt = wLt/pCt =

(wLt/pxt )(p

xt /p

Ct ) must also grow at a positive rate.

Third, the proposition shows that as long as G∞ > 0, low-skill wages cannot grow

at the same rate as output. This is easily seen from the aggregate production function

(3), which asymptotically is a nested CES with constant share parameters and where

technological change in the form of an increase in Nt is not labor-augmenting (unless

G∞ = 0). Therefore, following Uzawa’s theorem, balanced growth is impossible. Within

our framework, this holds when automation intensity is bounded away from 0 which

ensures that the asymptotic share of automated products is positive and rules out case

C (see Appendix 7.2.2 for a proof).

If ε <∞ and G∞ = 1 (sufficiently fast), low-skill workers derive their income asymp-

totically from automated firms and the asymptotic growth rate depends on the elasticity

of substitution between machines and low-skill workers in automated firms, ε.

In contrast, when G∞ ∈ (0, 1), the demand for low-skill labor increasingly comes

from the non-automated firms (as automation is labor-saving at the firm level). With

14

growing wages, the relative market share of non-automated firms decreases in proportion

with (1 +ϕwε−1Lt )−µ ∼ ϕ−µw

−β(σ−1)Lt , while most of the demand for high-skill labor comes

from automated firms. Then, the growth rate of low-skill wages is a fraction of the

growth rate of high-skill wages given by (15). The ratio between high-skill and low-skill

wage growth rates increases with a higher importance of low-skill workers (a higher β) or

a higher substitutability between automated and non-automated products (a higher σ)

since both imply a faster loss of competitiveness of the non-automated firms. Yet, it is

independent of the elasticity of substitution between machines and low-skill workers, ε or

of the exact asymptotic share of automated products G∞. In this case, non-automated

products provide employment opportunities for low-skill workers which limits the relative

losses of low-skill workers compared to high-skill workers (their wages grow according to

(15) instead of (16) and ε > 1 + β(σ − 1)).

2.5 Discussion

Proposition 2 establishes general conditions under which low-skill wages asymptotically

grow but slower than high-skill wages. We now discuss the robustness of this result.

First, our assumption that machines are an intermediate input is innocuous: Section 4

relaxes this assumption and lets machines take the form of capital with no qualitative

change of result. Second, Appendix 6.4 relaxes the assumption of an exogenous stock

of labor and considers a Roy model where workers are heterogeneous in the quantity

of high-skill labor they can supply. Proposition 2 generalizes to this case, although the

relative growth rate of low-skill wages is higher and asymptotically all workers supply

high-skill labor. Third, Appendix 7.4 presents a model where the production technologies

for machines and the consumption good differ allowing for negative growth in pxt /pCt . In

this case, low-skill wages may decline asymptotically. Fourth, even if some of the tasks

(but not all) performed by high-skill workers are automatable, our results would remain

similar as long as high-skill workers remain essential in production.11

Appendix 6.1 breaks the assumption of symmetry and assumes that new products

have a higher productivity. Higher productivity comes in the form of TFP improvements

(which augments the productivity of all factors, including machines) or higher (low-skill

and high-skill) labor productivity. Productivity increases exponentially, so we assume

that Nt grows linearly to maintain non-explosive growth, and non-automated products

11If all labor tasks are automatable, infinite production is possible in finite time once N is largeenough. Factors such as natural resources or land are then likely to be the scarce factor.

15

still have a positive probability of becoming automated. We show that, if all productivity

improvements are labor-augmenting, horizontal innovation becomes more low-skill biased

and balanced growth is possible; a result similar to that of Acemoglu and Restrepo

(2017a). However, if machines also become partly more productive in new tasks (once

those are automated), balanced growth is impossible. Therefore, the invention of new

non-automated tasks is not enough to ensure balanced growth, what is required is that

labor but not potential machines are more productive in these new tasks.12

3 Endogenous innovation

We now model automation and horizontal innovation as a the result of investment. This

allows us first to look at the impact of wages on technological change (the reverse of

Proposition 1), second to study the transitional dynamics of the system, and third to

explore the interactions between the two innovation processes. Section 3.1-3.5 character-

izes the model and its solution, section 3.6 relates it to historical experience and section

3.7 provides additional results and comparative statics.

3.1 Modeling innovation

A non-automated firm can hire hAt (i) high-skill workers to perform automation re-

search which will result in the firm becoming automated as a Poisson process with

rate ηGκt

(Nth

At (i)

)κ. Once a firm is automated it remains so forever. η > 0 denotes

the productivity of the automation technology, κ ∈ (0, 1) measures the concavity of

the automation technology, Gκt , κ ∈ [0, κ], represents possible knowledge spillovers from

the share of automated products, and Nt represents knowledge spillovers from the total

number of intermediate inputs. The spillovers in Nt ensure that both automation and

horizontal innovation may take place in the long-run; they exactly compensate for the

mechanical reduction in the amount of resources for automation available for each prod-

uct (namely high-skill workers) when the number of product increases.13 Automation is

12We have also abstracted from the accumulation of low-skill human capital. “Traditional” humancapital would be equivalent to augmenting low-skill labor in (3), and, from Uzawa’s theorem, wouldbe insufficient to guarantee balanced growth when G∞ > 0 and Nt grows exponentially. Grossman etal. (2017) show how non-traditional human capital can lead to balanced growth when technologicalchange is not purely labor-augmenting. He and Liu (2008) show that the endogenous accumulation ofskill through a “schooling” technology fits the rise in the skill-ratio and the skill-premium well.

13These spillovers can be micro-funded as follows: let there be a fixed mass one of firms indexedby j each producing a continuum Nt of products indexed by i so that production is given by Yt =

16

undertaken by the incumbent firm, but we could accommodate automation by entrants

as long as the incumbent also automates with positive probability or captures a share of

the surplus created by the automation innovation.

New intermediate inputs are developed by high-skill workers in a standard manner

according to a linear technology with productivity γNt. With HDt high-skill workers

pursuing horizontal innovation, the mass of intermediate inputs evolves according to:

Nt = γNtHDt .

We assume that firms do not exist before their product is created and therefore cannot

invest in automation. As a result, new products are born non-automated, which means

that “horizontal innovation” corresponds to an increase in Nt keeping GtNt constant

and (following our discussion in section 3.7) is low-skill biased under certain conditions.

This is motivated by the idea that when a task is new and unfamiliar, the flexibility

and outside experiences of workers allow them to solve unforeseen problems. Only as

the task becomes routine and potentially codefiable a machine (or an algorithm) can

perform it (Autor, 2013). Our results carry through if only a share of the new products

are born non-automated as discussed in section 3.7.14

Therefore the rate and direction of innovation will depend on the equilibrium allo-

cation of high-skill workers between production, automation and horizontal innovation.

Defining the total mass of high-skill workers working in automation as HAt ≡

∫ Nt0hAt (i)di,

we get that high-skill labor market clearing leads to

HAt +HD

t +HPt = H. (18)

3.2 Innovation allocation

We denote by V At the value of an automated firm, by rt the economy-wide interest rate

and by πAt ≡ π(wLt, wHt, 1) the profits at time t of an automated firm. The asset pricing

(∫ 1

0

∫ Nt(j)0

yt(i, j)σ−1σ didj)

σσ−1 . When a firm hires HA

t (j) high-skill workers in automation each of its non-

automated products gets independently automated with a Poisson rate of ηGκt [HAt (j)/(1 − Gt(j)))]κ.

The aggregate economy would be identical to ours and have the same social planner allocation (thedecentralized equilibrium would qualitatively behave qualitatively similarly but the externality in theautomation technology from the number of products would be internalized).

14The model predicts that the ratio of high-skill to low-skill labor in production is higher for automatedthan non-automated firms, though not overall since non-automated firms also hire high-skill workersfor the purpose of automating. In particular, new firms do not always have a higher ratio of low tohigh-skill workers (and at the time of its birth a new firm only relies on high-skill workers).

17

equation for an automated firm is given by

rtVAt = πAt + V A

t . (19)

This equation states that the required return on holding an automated firm, V At , must

equal the instantaneous profits plus appreciation. An automated firm only maximizes

instantaneous profits and has no intertemporal investment decisions to make.

A non-automated firm invests in automation. Denoting by V Nt the value of a non-

automated firm and letting πNt ≡ π(wLt, wHt, 0), we get the asset pricing equation:

rtVNt = πNt + ηGκ

t

(Nth

At

)κ (V At − V N

t

)− wHthAt + V N

t , (20)

where hAt is the mass of high-skill workers in automation research hired by a single non-

automated firm. This equation is similar to equation (19), but profits are augmented

by the instantaneous expected gain from innovation ηGκt

(Nth

At

)κ (V At − V N

t

)net of

expenditure on automation research, wHthAt . This gives the first order condition:

κηGκtN

κt

(hAt)κ−1 (

V At − V N

t

)= wHt. (21)

hAt increases with the difference in value between automated and non-automated firms,

and thereby current and future low-skill wages—all else equal.

Since non-automated firms get automated at Poisson rate ηGκt

(Nth

At

)κ, and since

new firms are born non-automated, the share of automated firms obeys:15

Gt = ηGκt

(Nth

At

)κ(1−Gt)−Gtg

Nt . (22)

Free-entry in horizontal innovation guarantees that the value of creating a new firm

cannot be greater than its opportunity cost:

γNtVNt ≤ wHt, (23)

with equality whenever there is strictly positive horizontal innovation (Nt > 0).

The low-skill and high-skill representative households’ problems are standard and

15Using that by symmetry the total amount of high-skill workers hired in automation research isHAt = (1−Gt)NthAt , we can also write this expression as Gt = ηGκt

(HAt

)κ(1−Gt)1−κ −GtgNt .

18

lead to Euler equations which in combination give

Ct/Ct = (rt − ρ) /θ, (24)

with a transversality condition requiring that the present value of all time-t assets in the

economy (the aggregate value of all firms) is asymptotically zero:

limt→∞

(exp

(−∫ t

0

rsds

)Nt

((1−Gt)V

Nt +GtV

At

))= 0.

3.3 Equilibrium characterization

Following Proposition 2, asymptotically high-skill wages output and consumption grow

proportionately to Nψt where ψ ≡ ((1− β) (σ − 1))−1 when G∞ > 0 (hence ψ is the

asymptotic elasticity Yt with respect to Nt). Therefore to study the behavior of the

system we introduce the normalized variables vt ≡ wHtN−ψt and ct ≡ ctN

−ψt . As hAt

mechanically tends to 0 as the mass of non-automated firms grows we also introduce

hAt ≡ NthAt . We further define the auxiliary variable χt ≡ ct

θ/vt which allows us to

simplify the system (χt is related to the mass of high-skill workers in production and

therefore, given hAt , to HDt and the growth rate of Nt). Since the economy does not

feature balanced growth, we also need to keep track of the level of Nt, we do this by

introducing nt ≡ N−β/[(1−β)(1+β(σ−1))]t , which tends toward 0 as Nt tends toward infinity.

Finally, we define ωt ≡ (wLtN−ψ/(1+β(σ−1))t )β(1−σ) which asymptotes a finite positive

number. The equilibrium can then be characterized by a system of differential equations

with two state variables nt, Gt, two control variables, hAt , χt and an auxiliary equation

defining ωt (see equations (31), (32), (39), (40), (45) and (47)-(49) in Appendix 6.2).

This system admits a steady-state as stipulated below (proof in Appendix 7.5.1):

Proposition 3. Assume that

κ−κ (γ(1− κ)/ρ)κ−1 ρ/η + ρ/γ < ψH, (25)

then the system of differential equations admits a steady state (n∗, G∗, hA∗, χ∗) with n∗ =

0, 0 < G∗ < 1 and positive growth(gN)∗> 0.

We will refer to the steady state (n∗, G∗, hA∗, χ∗) as as an asymptotic steady state

for our original system of differential equations. In addition, the assumption that θ ≥

19

1 ensures that the transversality condition always holds.16 For the rest of the paper

we restrict attention to parameters such that there exists a unique saddle-path stable

steady state (n∗, G∗, hA∗, χ∗) with n∗ = 0, G∗ > 0. Then, for an initial pair (N0, G0) ∈(0,∞) × [0, 1] sufficiently close to the asymptotic steady state, the model features a

unique equilibrium converging towards it.17

The following section describes the equilibrium and explains how the economy reaches

this asymptotic steady-state even starting far from it. We focus on the evolution of the

automation incentives, which, we show, are crucially linked to the level of low-skill wages.

3.4 Innovation incentives along the transitional path

A distinctive feature of this economy is that the path of technological change itself will

be unbalanced through the transitional dynamics. In the following, we elaborate on this

by showing that the economy goes through three phases: a Phase 1 where the incentive

to automate is low and the economy behaves close to a Romer model, a Phase 2 where

automation increases the share of automated products Gt and a Phase 3 where the

economy approaches the steady state. This section helps guide the intuition, and formal

proofs of all the results mentioned are contained in Appendices 7.5.3, 7.5.4 and 7.5.5.

These are “quantitative phases” in that the incentive to automate in Phase 1 is small

but not exactly 0 and in Phase 3, Gt is only approximately constant.

Following (21), the mass of high-skill workers in automation (HAt = (1−Gt)Nth

At )

and therefore the automation intensity rate, given by ηGκt

(HAt / (1−Gt)

)κ, depends on

the ratio between the gain in firm value from automation V At −V N

t , and its effective cost

namely the high-skill wage divided by the number of products wHt/Nt, that is:

HAt = (1−Gt)

(κηGκ

t

V At − V N

t

wHt/Nt

)1/(1−κ)

. (26)

Crucially, as the number of products in the economy increases, the ratio(V At − V N

t

)/ (wHt/Nt)

changes value. To see this, we combine (19), (20) and (21), and integrate over the dif-

16To understand equation (25), let the efficiency of the automation technology η be arbitrarily largesuch that the model approaches a Romer model with only automated firms. Then equation (25) becomesρ/γ < ψH, which mirrors the condition for growth in a Romer model with linear innovation technology.With a smaller η the present value of a new product is reduced and the condition is more stringent.

17Multiple asymptotic steady states with G∗ > 0 are technically possible but are not likely for rea-sonable parameter values (see Appendix 7.5.2). In addition, with two state variables (nt and Gt) saddlepath stability requires exactly two eigenvalues with positive real parts. In our numerical investigation,for all parameter combinations which satisfy the previous restrictions, this condition was always met.

20

ference in value between an automated and a non-automated firm to get:

V At − V N

t =

∫ ∞t

exp

(−∫ τ

t

rudu

)(πAτ − πNτ −

1− κκ

wHτhAτ

)dτ, (27)

such that the difference in value between the two types of firms is given by the discounted

difference of the profit flows adjusted for the cost and probability of automation. Further,

by the assumptions of Cobb-Douglas and iso-elastic demand, high-skill wages (for given

HPt ) and aggregate profits are both proportional to aggregate output. Therefore, wHt/Nt

is proportional to average profits: wHt/Nt =[Gtπ

At + (1−Gt) π

Nt

]/[ψHP

t

]. As a result,

the mass of high-skill workers in automation essentially depends on the discounted flow

of profits of automated versus non-automated firms divided by the average profits made

by firms. Intuitively — from equation (27) — with a positive discount rate, as a first

approximation V At − V N

t will move like πAt − πNt , so that one gets

V At − V N

t

wHt/Nt

∝ πAt − πNtGtπAt + (1−Gt) πNt

=1−

(1 + ϕwε−1

Lt

)−µGt + (1−Gt)

(1 + ϕwε−1

Lt

)−µ , (28)

where we used πAt −πNt =[(

1 + ϕwε−1Lt

)µ − 1]πNt from (7) and where ∝ denotes“approxi-

mately proportional”. This highlights low-skill wages (relative to the inverse productivity

of machines ϕ−1) as the key determinant of automation innovations. Note, that when

wLt ≈ 0 the incentive for automation innovation is very low, whereas when wLt → ∞it approaches 1/Gt > 0. This price effect bears similarity with Zeira (1998), where the

adoption of a labor-saving technology also depends on the price of labor.18

First Phase. When the number of products, Nt, is sufficiently low that wLt is small

relative to ϕ−1, the difference in profits between automated and non-automated firms

is small relative to average profits. Following (26) and (28), the allocation of high-skill

labor to automation, HAt , is low and automation intensity is low. Consequently, growth

is driven by horizontal innovation and the behavior of the economy is close to that of a

Romer model with a Cobb-Douglas production function between low-skill and high-skill

labor, and both wages approximately grow at a rate gNt / (σ − 1). This corresponds to

what we label as Phase 1 of the economy (naturally, if Gt is not initially low, it must

depreciate during this period following equation (22)).

18Beyond a focus on different empirical phenomena (an increase in inequality vs. cross-countryproductivity differences), there are two important differences between our model and Zeira (1998).Zeira (1998) assumes exogenous technological progress (while we model endogenous innovation) andhere the innovation cost changes over time while Zeira (1998) has constant adoption cost.

21

Second Phase. As wLt grows relative to ϕ−1, the term (V At − V N

t )/(wHt/Nt) in-

creases, in fact, following (28) it grows like(1 + ϕwε−1

Lt

)µ−1 when Gt is low. This raises

the incentive to innovate in automation. Without the externality in the automation

technology (κ = 0), (26) directly implies that HAt must raise significantly above zero,

and with it the Poisson rate of automation, η(HAt / (1−Gt)

)κand thereby the share of

automated products, Gt. For κ > 0, the depreciation in the share of automated products

during Phase 1 might gradually make the automation technology less effective which can

delay or even potentially prevent the take-off of automation.19

In line with Proposition 1, both the increase in Gt and Nt lead to an increase in the

skill premium. As we show below, the low-skill wage may temporarily decline. We will

label this time period where the share of automated products in the economy increases

sharply the second phase (the transition between phases is smooth and therefore the

exact limits are arbitrary). Arguably, this time period is the one where our model differs

the most from the rest of the literature.

Third Phase. As the share of automated products Gt is no longer near zero, the

gain from automation V At − V N

t and its effective cost wHt/Nt grow at the same rate

(the right-hand side in (28) is now close to 1/Gt). As a result, the normalized mass of

high-skill workers in automation research (NthAt ) stays bounded (see (26)), and so does

the Poisson rate of automation, which implies that Gt converges to a constant below 1.

The economy then converges toward the asymptotic steady-state which features a

constant share of automated products strictly below zero and a constant growth rate in

the number of products. Following Proposition 1, part A, the skill premium grows, but

the continued existence of non-automated products ensures that low-skill wages grow at

a positive rate asymptotically given by gwL∞ = gwH∞ / (1 + β(σ − 1)) . In this phase, the

profits made by a non-automated firm are negligible relative to the horizontal innovation

cost, therefore it is the prospect of future automation which guarantees the entry of new

products. This plays an important role in the interaction between the two innovation

technologies. Besides, the growth rate in the number of products is always lower in Phase

3 than in Phase 1 in particular because automation reduces the share of output which

goes to profits of new firms (see Appendix 7.5.5 for more details). Though in Phase 3

the share parameters of the nested CES function are constant, the model continues to

differ from a generic capital deepening model in that long-run growth is endogenized

19Automation takes off if either G0 and N0 are not too low or, for any values of N0, G0 > 0 wheneverκ(1− β) + κ < 1—see Appendix 7.5.4. If we were to assume instead that the automation technology isgiven by max

ηGκt , η

(Nth

At

)κ, then automation would always take off.

22

and depends on its interaction with automation.

3.5 An illustration of the transitional dynamics

We now illustrate our previous result and further analyze the behavior of our economy

through the use of numerical simulations.20 Thereafter, we relate our theoretical results

to the historical experience of the US economy. Unless, otherwise specified, and in

line with the results of Sections 2.3 and 3.4, the broad patterns described below do

not depend on specific parameter choices and we simply choose “reasonable” parameters

(Table 1). Appendix 7.7.6 gives a systematic exploration of the parameter space and

section 4 calibrates a richer model to the U.S. data.

Table 1: Baseline Parameter Specification

σ ε β H L θ η κ ϕ ρ κ γ N0 G0

3 4 2/3 1/3 2/3 2 0.2 0.5 0.25 0.02 0 0.3 1 0.001

Baseline Parameters. Total stock of labor is 1 with L = 2/3 and β = 2/3 such

that absent automation and if all high-skill workers were in production the skill premium

would be 1. The initial mass of products is set low at N0 = 1 to ensure we begin in Phase

1. The initial share of automated products is low, G0 = 0.001, but would initially decline

had we chosen a higher level. We set σ = 3 to capture an initial labor share close to 2/3.

We set ϕ = 0.25 and ε = 4, so that at t = 0, the profits of automated firms relative to

non-automated firms are only 0.004%. The innovation parameters (γ, η, κ) are chosen

such that GDP growth is close to 2% both initially and asymptotically. There is no

externality from the share of automated products in the automation technology, κ = 0.

ρ and θ are chosen such that the interest rate is around 6% initially and asymptotically.

Figure 3 plots the evolution of the economy. Based on the behavior of the automation

expenditures (Panel C) we delimit Phase 1 as corresponding to the first 100 years and

Phase 2 as the period between year 100 and year 250.

Innovation and growth. Initially, low-skill wages and hence the incentive to auto-

mate—proportional to (V At − V N

t )/(wHt/Nt)—are low (Panel B) and so is the share of

automated firms Gt (Panel C). With growing low-skill wages, the incentive to automate

picks up a bit before year 100. Then the economy enters Phase 2 as automation expenses

sharply increase (up to 4% of GDP). This leads to an increase in the share of automated

20We employ the so-called “relaxation” algorithm for solving systems of discretized differential equa-tions (Trimborn, Koch and Steger, 2008). See Appendix 7.6 for details.

23

products Gt which eventually stabilizes at a level strictly below 1. There is no simple

one-to-one link between automation spending and rising inequality in our model: here,

automation spending is higher in Phase 3 than in Phase 2 (Panel C in figure 3), yet the

growth in the skill premium is slower.21

Panel A: Growth Rates of Wages and GDPPhase 1 Phase 2 Phase 3

0 100 200 300 400Year

0

1

2

3

4

Per

cen

t

gGDP

gwL

gwH

0 100 200 300 4000

1

2

3

4

5

automation incentive (left)

Panel B: Automation Incentive and Skill PremiumPhase 1 Phase 2 Phase 3

0 100 200 300 400Year

0

2

4

6

log

unit

skill premium (right)

0 100 200 300 4000

5

10

15

Per

cen

t

Autom./GDP (left)Hori./GDP (left)

Panel C: Research Expenditures and G

Phase 1 Phase 2 Phase 3

0 100 200 300 400Year

0

20

40

60

80

100

G (right)

Panel D: Factor SharesPhase 1 Phase 2 Phase 3

0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t

Total labor shareLow-skill labor share

Figure 3: Transitional Dynamics for baseline parameters. Panel A shows growth rates forGDP, low-skill wages (wL) and high-skill wages (wH), Panel B the incentive to au-tomate,

(V At − V Nt

)/ (wHt/Nt), and the skill premium, Panel C the total spending

on horizontal innovation and automation as well as the share of automated prod-ucts (G), and Panel D the wage share of GDP for total wages and low-skill wages.

As shown in Panel C, spending on horizontal innovation as a share of GDP declines

during Phase 2 and for any parameter values ends up being lower in Phase 3 than Phase

1. Despite this, the growth rate of GDP is roughly the same in Phases 1 and 3 because

the lower rate of horizontal innovation in Phase 3 is compensated by a higher elasticity of

GDP wrt. Nt (1/ [(σ − 1) (1− β)] instead of 1/ (σ − 1)). As a result, the phase of intense

automation—which also contributes to growth—is associated with a temporary boost

of growth. This is, however, specific to parameters (Appendix 7.7.2 gives a counter-

example). That intense automation need not be associated with a sharp increase in

growth is important because the lack of an acceleration in GDP growth in recent decades

has often been advanced in opposition to the hypothesis that a technological revolution

21Intuitively the elasticity of the skill-premium with respect to the skill-bias of technology is notconstant in our model, contrary to a CES framework with factor-augmenting technologies.

24

explains the recent increase in the skill-premium (Acemoglu, 2002a).

Wages. In the first phase, and referring to figure 2, the relative demand curve stays

close to the straight dotted line with slope 1−ββ

LHP associated with a Cobb-Douglas pro-

duction. Continuous horizontal innovation pushes the isocost curve towards the North

East so that both wages grow at around 2% (Panel A).

As rising low-skill wages trigger the second phase, the relative demand curve pivots

counter-clockwise and bends upwards increasing the growth rate of high-skill wages to

almost 4% and suppressing the growth rate of low-skill wages to around 1%. Though

our parameter values satisfy the conditions of Proposition 1 B.ii and any increase in

Gt has a negative impact on wLt, the growth in Nt is sufficient to ensure that low-skill

wages grow at a positive rate throughout. Section 3.7 demonstrates that this need not

be the case. It is precisely this movement of the relative demand curve that allows our

model to capture labor-saving innovation; a feature which a model with constant G or

a capital deepening model would not include. In the third phase, the relative demand

curve no longer moves, but the continuous push of the isocost curve keeps increasing the

skill-premium albeit more slowly than previously.22 Yet, at the same time, the decline

in horizontal innovation reduces it. Appendix 7.7.1 presents a growth decomposition

exercise where we compute the instantaneous contribution of horizontal innovation and

automation to low-skill and high-skill wages.23

Factor shares. Panel D of figure 3 plots the labor share and the low-skill labor

share. With machines as intermediate input, capital income corresponds to aggregate

profits, which are a constant share of output. High-skill labor in production also earns a

constant share of output. Both correspond to a rising share of GDP in Phase 2 as during

this time period, the ratio Y/GDP increases since machines expenditures are excluded

from GDP . Low-skill labor earns (close to) a constant share of output in Phase 1 but

its share declines with automation in Phase 2 and approaches 0 in Phase 3. As a result,

the labor share of GDP , constant in Phase 1, declines in Phase 2 and stabilizes at a

22Changes in the mass of high-skill workers in production, HPt , also affect the skill premium: increas-

ing Gt requires hiring more high-skill workers in automation innovation (in the same vein as the GeneralPurpose Technology literature; notably Beaudry et al. (2016)). Here, this effect is counteracted by adecrease in horizontal innovation and the net effect is small.

23At the aggregate level, our model boils down to a nested CES production function (see equation(3)), and Phase 2 corresponds to a period where the share parameter of the composite which featuressubstitutability between machines and low-skill labor, Gt, rises. This change in the share parametershould not be confused with an increase in the elasticity of substitution between machines and low-skilllabor (in fact, the Morishima’s elasticity of substitution between these two factors is symmetric anddeclines in Phase 2 from a value close to ε to a value close to 1 + β(σ − 1)).

25

lower level than before in Phase 3. Working contrary to this is the increase in innovation

as only high-skill workers work in innovation, but the net effect is an increase in the

capital share. For different parameter values, since HPt is not constant, the drop in the

labor share can be delayed relative to the rise in the skill premium (see Appendix 7.7.3).

The ratio of wealth to GDP also increases during Phase 2 and asymptotes a constant

in Phase 3 (see Appendix 7.7.1).

3.6 Comparison with the historical experience

Our model suggests that technological progress is overall biased against low-skill labor,

especially at later stages when low-skilled wages are sufficiently high. This bias manifests

as a process of automation which causes a widening skill premium and a decline in the

labor share. This is in line with the experience of the United States since the 1980s,

where the college premium (considered to be a good proxy for the skill premium over

that time period) has been steadily increasing (our stylized fact 1 in the introduction)

and the labor share has declined by around 5 percentage points (stylized fact 3, see

Karabarbounis and Neiman, 2013, or Elsby, Hobijn and Sahin, 2013). This contrasts

our paper with most of the growth literature which features a balanced growth path and

therefore does not have permanently increasing labor inequality.24

Extrapolating further in the past is more complicated, but it is worth discussing

it briefly since automation did not start in the 1980s. Goldin and Katz (2008) argue

that technological change has been skill-biased throughout the 20th century as periods

of decline in the skill premium (such as the 1970s) can be accounted for by exogenous

changes in the relative supply of skills. Even before, Katz and Margo (2014) argue that

the relative demand for highly skilled workers (in professional, technical and manage-

rial occupations) has increased steadily from perhaps as early as 1820 to the present.

This shift in relative demand may have been (partly) compensated for by changes in

relative skill supply (see Appendix 6.4) and human capital accumulation (see Section

2.5) making growth more balanced. However, automation did not always replace the

least skilled individuals: the mechanization of the 19th century replaced skilled artisans,

and the computerization of the last 30 years displaced more severely middle-skill work-

ers. Therefore our model offers a framework to shed light on the historical experience

24For instance, in Acemoglu (1998), low-skill and high-skill workers are imperfect substitutes inproduction. Yet, since the low-skill augmenting technology and the high-skill augmenting technologygrow at the same rate asymptotically, the relative stocks of effective units of low-skill and high-skilllabor is constant, leading to a constant relative wage.

26

provided that the definition of ‘high-skill’ is narrow.

The capital share has followed a U-curve in the 20th century (Piketty and Zucman,

2014 and Piketty, 2014), which our baseline model cannot account for. As the decline of

the labor share in manufacturing is more persistent (9 percentage points from 1960 to

2005; Alvarez-Cuadrado, Van Long and Poschke, 2018), one way to reconcile our model

with the historical pattern on factor shares would be to extend it to include multiple

sectors (agriculture, manufacturing and services) with an elasticity of substitution less

than 1, experiencing automation waves at different points in time. In such a model, as

one sector automates, spending on the other sectors would increase (as in Acemoglu and

Guerrieri, 2008) securing a higher labor share.

3.7 Additional results

We now exploit the endogenous nature of endogenous change in our model and examine

the interaction between horizontal innovation and automation

Declining low-skill wages. Empirical evidence suggests that low-skill wages have

been stagnating and perhaps even declining in recent periods (Stylized fact 2 ). Because

our model features a labor saving innovation, it can accommodate declining low-skill

wages in Phase 2, unlike a model with fixed G, a capital deepening model with perfectly

elastic capital or in the previous DTC literature. Here, it happens when the relative

demand curve pivots sufficiently fast counter-clockwise compared with the movement

of the isocost curve in Figure 2. We can ensure this, for instance, by introducing ex-

ternalities in the automation function and setting κ = 0.49. This delays the onset of

automation, but intensifies it once it takes off, so that the relative demand curve moves

quickly enough compared to the isocost curve for a decline in low-skill wages.25 Yet,

this drop must be temporary because lower low-skill wages discourage automation. See

Appendix 6.3 for details and another example with κ = 0.

Effect of innovation parameters. In Appendix 7.5.6, we establish:

Proposition 4. The asymptotic growth rates of GDP gGDP∞ and low-skill wages gwL∞

increase in the productivity of automation η and horizontal innovation γ.

Therefore, in the long-run, a better automation technology (a higher η) actually

benefits low-skill wages: the reason is that firms automate faster which encourages hor-

izontal innovation. During the transition, however, a higher η also means that Phase 2

25Horizontal innovation drops during this intense automation phase because its cost increases as thereis a high demand for high-skill workers in automation innovation.

27

starts sooner, leading to lower low-skill wages at that point and a higher skill premium

(see a numerical example in Appendix 7.7.4). These result preview those on the effect

of taxes on automation in section 4.2.

Further results and extensions in the Appendix. i) Our model features ele-

ments of self-correction in the presence of exogenous shocks (as Acemoglu and Restrepo,

2017). For instance, with no automation externality (κ = 0), a positive exogenous shock

to Gt will be followed by a period where automation is relatively less intense (as the skill

premium would decline), so that eventually the asymptotic share of automated prod-

ucts stays the same (see Appendix 7.7.5). ii) Appendix 7.8 studies the social planner’s

problem. The optimal allocation is qualitatively similar to the equilibrium we described,

which shows that our results are not driven by the market structure we imposed. iii) Ap-

pendix 7.9 presents a setting in which automation can only be undertaken before a firm

starts production. The transition of the economy is qualitatively identical, which shows

that the main results of the paper depend only on the feature that higher wLt creates in-

centives to automate and not on the assumption that firms are born non-automated, iv)

Appendix 6.4 presents the transitional dynamics with an endogenous supply response.

4 Quantitative Exercise and Policy Experiments

In this section, we conduct a quantitative exercise to compare empirical trends for the

United States with the predictions of our model. This allows us to dicipline the param-

eters of our model and subsequently conduct policy experiments.

4.1 Quantitative exercise

To match the data quantitatively, we need to modify the baseline model. First, since the

share of high-skill workers has dramatically increased, we let H and L vary over time and

here take as given their path from the data (as opposed to endogenizing labor supply as

in Appendix 6.4). Second, we assume that producers rent machines from a capital stock.

Capital can also be used used as structures in both automated and non-automated firms.

Third, we allow for the possibility that low-skill workers are replaced by a composite of

machines and high-skill workers. The production function (2) is then replaced by

y(i) = [l(i)ε−1ε + α(i)(ϕhe(i)

β4ke(i)1−β4)

ε−1ε ]

εβ1ε−1hs(i)

β2ks(i)β3 , (29)

28

where β1 +β2 +β3 = 1 and β4 ∈ [0, 1). The central difference between equations (2) and

(29) is the introduction of he(i) as high-skill labor which—along with machines—perform

the newly automated tasks (“e” for equipment). This feature is necessary to capture a

relatively low drop in the labor share. ks(i) is structures and ke(i) and ks(i) are both

rented from the same capital stock Kt. Kt increases with investment in final goods and

depreciates at a fixed rate ∆, so that (12) is replaced by:

Kt = Yt − Ct −∆Kt. (30)

The cost advantage of automated firms now depends on the ratio between low-skill

wage and the price of the high-skill labor capital aggregate namely wLt/(wβ4Htr

1−β4t ) where

rt = rt + ∆ is the gross rental rate of capital. The logic of the baseline model directly

extends to this case. Proposition 1 still holds and Proposition 2.A) holds with gwH∞ =

gY∞ = gN∞/ ((β2 + β1β4)(σ − 1)) and gwL∞ = gY∞ (1 + (σ − 1) β1β4) / (1 + β1(σ − 1)) . An

equivalent to Proposition 3 holds but the system of differential equations includes three

control variables and three state variables and Proposition 4 holds as well. The tran-

sitional dynamics are similar to that of the baseline model but automation innovation

now depends on wLt/(wβ4Htr

1−β4t ). It is low in a first phase, increases in a second phase

and stabilizes in the third phase as the economy approaches its asymptotic steady-state.

The capital share and the capital output ratio increase in Phase 2 as equipment replaces

low-skill labor in production. Details and proofs are provided in Appendix 7.11.

We match our extended model to the data (details on the data and the method in Ap-

pendix 7.12). Because of data availability and to make our exercise easily comparable to

the rest of the literature, we identify low-skill workers with non-college educated workers

and high-skill workers with college educated workers and focus on the years 1963-2007.

We match the skill-premium and take the empirical skill-ratio as given (and normal-

ize total population to 1). We also match the growth rate of real GDP/employment

and the labor share. We further associate the use of machines with private equipment

(excluding transport) and software. As pointed out by Gordon (1990) the NIPA price

indices for real equipment are likely to understate quality improvements in equipment

and therefore growth in the real stock of equipment. Hence, we use the adjusted price

index from Cummins and Violante (2002) for equipment, and build (private) equipment

29

and software to GDP ratios from 1963 to 2000.26

Our exercise has similarities with Goldin and Katz (2008) who attempt to explain

the skill-premium using a constant trend in the skill bias of technical change or Krusell

et al. (2000) who feed in the empirical path of equipment to explain the increase in the

skill-premium. But, it is more demanding: both the technology path and the equipment

stock must be endogenous.

Our model is not stochastic and cannot be directly estimated. Instead, we take a

parsimonious approach and choose parameters to minimize the weighted squared log-

difference between observed and predicted paths. We start the simulation 40 years before

1963 to force N1963 and G1963 to be consistent with the long-run behavior of our model.27

Since the model requires the skill-ratio before and beyond the time period we estimate,

we fit a generalized logistical function to the path of the log of the skill-ratio and use

the predicted values outside 1963-2007 (over that time period, the fit is excellent).

The model features a total of 14 parameters (ϕ can be normalized to 1 without loss

of generality). Instead of exogenously restricting parameters we will fit the model with

all parameters freely (other than the economically motivated boundaries imposed by the

model itself) and then assess whether these parameter estimates fit with other similar

estimates. Table 2 gives the resulting parameters. The elasticity of substitution across

products σ is estimated at 6.7, consistent with Christiano, Eichenbaum and Evans (2005)

who find that observed markups are consistent with a sigma of around 6. The elasticity

of substitution between machines and workers is estimated at almost 5. κ is estimated at

0.58 implying a substantial automation externality; a force which causes an accelerated

Phase 2. Finally, we find β1—the factor share of machines/low-skill workers—of 0.62

which implies sizable room for automation, though a β4 of 0.73 means that the share

of high-skill workers in the composite that replaces low-skill workers is of substantial

importance. The preference parameters are within standard estimates with a ρ of 3.7%

and the implied θ resulting in log-preferences. The only parameter that is estimated

outside a common range is the depreciation rate ∆ (but it is not precisely identified).

Appendix 6.5 discusses in details how the parameters are identified.

26The series on the skill premium, the skill ratio, GDP/employment and the labor share directlypredict a series for low-skill wages. Since this series differs substantially from the low-skill wages seriesof Acemoglu and Autor (2011) depicted in Figure 1 (particularly because of compositional effects), wedo not attempt to match the latter.

27Initial values for Gt have little impact on the state of the economy several years later (see e.g.Figure 22 in Appendix 7.7.6). And the same is true for initial values of Kt. Therefore by simulatingthe economy 40 years prior, we ensure that the simulated moments are nearly independent of the initialvalues for Gt and Kt. We fix K1923 at its steady-state value in a model with no automation.

30

Table 2: Parameters from quantitative exercise

Parameter σ ε β1 γ κ θ η κ ρ β2 ∆ β4 N1963 G1963

Value 6.7 4.97 0.62 0.64 0.58 1 0.41 0.58 0.037 0.18 0.014 0.73 21.6 0.02

1960 1970 1980 1990 2000 2010 2020Years

1

1.2

1.4

1.6

1.8

2

2.2

Ski

ll-P

rem

ium

(ra

tio)


Predicted ValuesEmpirical Values

1960 1970 1980 1990 2000 2010 2020Years

50

55

60

65

70

75

80

Per

cen

t


1960 1970 1980 1990 2000 2010 2020Years

50

100

150

200

250300350400

Inde

x (1

963

= 1

00)

Panel C - GDP / Employment

1960 1970 1980 1990 2000 2010 2020Years

0.25

0.5

1

2

3

Equ

ipm

ent /

GD

P

Panel D - Equipment / GDP

Figure 4: Predicted and empirical time paths

Figure 4 further shows the predicted path of the matched data series along with

their empirical counterparts. Panel A demonstrates that the model matches the rise in

the skill premium from the early 1980s and the flat skill premium in the period before

reasonably well. Though less pronounced than in the data, our model also includes the

more recent decline in the growth rate of the skill premium, which, computed over a

5 years moving window, peaks in 1984 at 1.3% and drops to 0.64% in the 2000s. The

decline in the labor share is matched from 1970 onward. The average growth rate of

the economy is matched completely as shown in Panel C. Although, the model largely

captures the average growth rate of capital equipment over GDP during the period, the

predicted path differs somewhat from its empirical counterpart as shown in Panel D on

log-scale. Whereas the empirical path is close to exponential, the predicted path tapers

off somewhat towards the end of the period.28 In Appendix 6.5.5, we reproduce the same

28 Interestingly, more recent data show a slow down in software investment (see Beaudry et al., 2016).Moreover, the ratio of equipment to GDP in the data is only a proxy for the ratio of machines to GDPin the model, since all equipment may not be used to replace low-skill workers.

31

1980 2000 2020 2040 2060Year

0

0.5

1

1.5

2

2.5

Per

cen

tPanel A: Growth Rates of Wages and GDP

gGDP

gwL

gwH

1980 2000 2020 2040 206050

55

60

65

70

75

Per

cen

t

labor share(left)

1980 2000 2020 2040 2060Year

0.3

0.4

0.5

0.6

0.7

0.8

0.9

log

unit

Panel B: Labour share and Skill Premium


1980 2000 2020 2040 2060Year

0

1

2

3

4

5

6

7

8

9

Per

cen

t


1980 2000 2020 2040 2060Year

0

20

40

60

80

100

Per

cen

t


G (right)

Figure 5: Transitional Dynamics with calibrated parameters (the growth rates are computedover a 5 year moving average).

exercise but only matching the first 30 years, the parameters are nearly identical and

the calibrated model matches well the rest of the sample period.

Figure 5 plots the transitional dynamics from 1963 to 2063. Panel A shows that

GDP growth slows down past 2007, Panel B that the skill premium keeps growing albeit

at a slower rate than in the 1980s, while the labor share smoothly declines toward its

steady-state value of 56%. Panel C shows that the share of automated products increases

sharply through the 1963-2007 time period: In 1963 the value is 0.02 (yet this value is

not precisely identified) but rises to 0.20 by the 1980s and 0.49 by 2007 to finally settle

at 0.88 asymptotically—in this case, the marginal effect of automation unambiguously

reduces low-skill wages on the entire path. Besides, the growth rate of the skill premium

decreases in the 1990s and 2000s even though automation expenditures are roughly

constant between 1980 and 2000. This constitutes a response to the critique of the

literature on skill-biased technical change put forward by Card and DiNardo (2002),

who argue that inequality rising the most in the early to mid 1980s and technological

change continuing in the 1990s, squares poorly with the predictions of a framework based

on skill-biased technological change.

4.2 Automation taxes

Among the many policy proposals to address rising income inequality, is a tax on the

use of automation technology or a “robot tax”. In the following we examine the conse-

quences of two taxes: either on the use of machines—in the form of a tax on the rental

rate of equipment—or the innovation of new machines—in the form of taxing high-skill

workers in automation innovation. In either cases we consider the permanent unexpected

32

introduction of a tax of 50% (See Appendix 7.11 for details).

First, consider a tax on the use of machines. To clarify the role of endogenous

technology we also simulate the economy holding technology, Nt andGt and thereforeHPt

at the baseline level. Figure 6 reports the results. The immediate effect is to discourage

the use of machines and consequently low-skill wages rise by 4% on impact (Panel B)

with a corresponding lower skill premium (Panel C). The endogeneity of technology

amplifies the effect of the tax over time (in panel B, the gap between the endogenous

and the exogenous cases widens). This results from two effects. First, the tax discourages

automation innovation leading to a lower G (Panel E). Second, since high-skill workers

and machines are complements, the tax reallocates high-skill workers toward horizontal

innovation, increasing N (Panel D). Consequently, the positive effect on low-skill wages

is eventually larger than the initial 4%. Even output will asymptotically be higher than

the baseline (extending Panel A).29

2007 2027 2047 2067 2087Year

-8

-6

-4

-2

0

2

Per

cen

t

Panel A: Output deviation

Machine tax(endogenous technology)

Machine tax(exogenous technology)

Automation Innovation tax

2007 2027 2047 2067 2087Year

-4

-2

0

2

4

6

Per

cen

t

Panel B: Low-skill wages deviation

2007 2027 2047 2067 2087Year

-12

-10

-8

-6

-4

-2

0

Per

cen

t

Panel C: Skill premium deviation

2007 2027 2047 2067 2087Year

-20

-10

0

10

20

30

Per

cen

t

Panel D: N deviation

2007 2027 2047 2067 2087Year

-12

-10

-8

-6

-4

-2

0

Per

cen

t

Panel E: G deviation

Figure 6: Effects of a machine tax and an automation innovation tax relative to baseline.

A tax on automation innovation has very different implications: First, high-skill

workers reallocate from innovation in automation toward production which, on impact,

boosts output and marginally low-skill wages. As the share of automated product G

29Asymptotically, a machine tax has no effect on G or on the growth rate of N : as using low-skillworkers instead of machines becomes prohibitively expensive, the allocation of high-skill workers remainsundistorted by the presence of a finite tax. See Proposition 12 in Appendix 7.11.

33

decreases, low-skill wages further increase though very modestly. However, discouraging

automation innovation also discourages horizontal innovation which eventually reduces

low-skill wages. The intuition is similar to that behind Proposition 4: a tax on automa-

tion innovation has similar effects to that of reducing the effectiveness of the automation

technology. The skill-premium is reduced as the economy grows at a slower rate.

This exercise highlights the importance of endogenous technology: Though both

forms of “robot” taxes increase low-skill wages on impact, the long-run effects depend

crucially on whether the tax is designed to encourage or discourage overall innovation.

Of course, this exercise is only a first pass and analyzing the welfare consequences of

these policies or others, say minimum wage legislation, is of interest for future research.

5 Conclusion

This paper introduces automation in a horizontal innovation growth model. In such

a framework, the economy undertakes a structural break. After an initial phase with

stable income inequality and stable factor shares, automation picks up. During this

second phase, the skill premium increases, low-skill wages stagnate and possibly decline,

the labor share drops—all consistent with the US experience in the past 50 years—and

growth starts relying increasingly on automation. In a third phase, the share of auto-

mated products stabilizes, but the economy still features a constant shift of low-skill

employment from recently automated tasks to as of yet non-automated tasks. Low-skill

wages grow in the long-run but slower than high-skill wages. A calibrated version of our

model shows that a tax on machine use increases low-skill wages more than in a model

with exogenous technology, while a tax on automation innovation eventually reduces

low-skill wages.

A lesson from our framework is that if tasks performed by a scarce factor (say labor)

can be automated but it is not presently profitable to do so, then, in a growing economy,

the return to this factor will eventually increase sufficiently to make it profitable. In

other words, there is a long-run tendency for technical progress to displace substitutable

labor (a point made by Ray, 2014,), but this only occurs if the relevant wages are large

relative to the price of machines. This in turn can only happen under three scenarios:

either automation itself increases the wages of these workers (the scale effect dominates

the substitution effect), or there is another source of technological progress (here, hori-

zontal innovation), or technological progress allows a reduction in the price of machines

34

relative to the consumption good (as in Appendix 7.4). Importantly, when machines

are produced with a technology similar to the consumption good, automation can only

reduce wages temporarily: a prolonged drop in wages would end the incentives to auto-

mate in the first place. Although, we focus on a general equilibrium model with low-skill

labor, these insights extend to subsectors of the economy and other scarce factors.

Fundamentally, the economy in our model undertakes an endogenous structural

change when low-skill wages become sufficiently high. This distinguishes our paper

from most of the literature, which seeks to explain changes in the distribution of income

inequality through exogenous changes: an exogenous increase in the stock of equipment

as per Krusell et al. (2000), a change in the relative supply of skills, as per Acemoglu

(1998), or the arrival of a general purpose technology as in the associated literature.

This makes our paper closer in spirit to the work of Buera and Kaboski (2012), who

argue that the increase in income inequality is linked to the increase in the demand for

high-skill intensive services, which results from non-homotheticity in consumption.

The present paper is only a step towards a better understanding of the links be-

tween automation, growth and income inequality. Given that automation has targeted

either low- or middle-skill workers and that artificial intelligence may now lead to the

automation of some high-skill tasks, a natural extension of our framework would include

more skill heterogeneity. Another natural next step would be to add firm heterogeneity

and embed our framework into a quantitative firm dynamics model. Our framework

could be used to study the recent phenomenon of “reshoring”, where US companies that

had offshored their low-skill intensive activities to China, now start repatriating their

production to the US after having further automated their production process.

References

Acemoglu, D. (1998). Why Do New Technologies Complement Skills? Directed Technical

Change and Wage Inequality. Quarterly Journal of Economics, 113(4):1055–1089.

Acemoglu, D. (2002a). Directed Technical Change. Review of Economic Studies,

69(4):871–809.

Acemoglu, D. (2002b). Technical Change, Inequality, and the Labor Market. Journal of

Economic Literature, 40(1):7–72.

Acemoglu, D. (2007). Equilibrium Bias of Technology. Econometrica, 75(5):1371–1409.

35

Acemoglu, D. (2010). When Does Labor Scarcity Encourage Innovation? Journal of

Political Economy, 118(6):1037–1078.

Acemoglu, D. and Autor, D. (2011). Skills, Tasks and Technologies: Implications for

Employment and Earnings. Handbook of Labor Economics, 4(B).

Acemoglu, D. and Guerrieri, V. (2008). Capital Deepening and Nonbalanced Economic

Growth. Journal of Political Economy, 116(3):467–498.

Acemoglu, D. and Restrepo, P. (2017a). The race between machine and man: Implica-

tions of technology for growth, factor shares and employment.

Acemoglu, D. and Restrepo, P. (2017b). Robots and jobs: Evidence from us labor

markets.

Aghion, P., Deschezlepretre, A., Hemous, D., Martin, R., and Van Reenen, J. (2016).

Carbon taxes, path dependency, and directed technical change: Evidence from the

auto industry. Journal of Political Economy, 214(1):1–51.

Aghion, P. and Howitt, P. (1996). Research and Development in the Growth Process.

Journal of Economic Growth, 1(1):49–73.

Aghion, P., Howitt, P., and Violante, G. (2002). General Purpose Technology and Wage

Inequality. Journal of Economic Growth, 7(4):315–345.

Alvarez-Cuadrado, F., Van Long, N., and Poschke, M. (2018). Capital-labor substitution,

structural change and the labor income share. Journal of Economic Dynamics and

Control, 87.

Autor, D. (2013). The “Task Approach” to Labor Markets: An Overview. Journal for

Labour Market Research, 46(3):185–199.

Autor, D. and Dorn, D. (2013). The Growth of Low-Skill Service Jobs and the Polar-

ization of the U.S. Labor Market. American Economic Review, 103(5):1553–1597.

Autor, D., Katz, L., and Kearney, M. (2006). The Polarization of the U.S. Labor Market.

American Economic Review, 96(2):189–194.

Autor, D., Katz, L., and Kearney, M. (2008). Trends in U.S. Wage Inequality: Revising

the Revisionists. Review of Economics and Statistics, 90(2):300–323.

Autor, D., Katz, L., and Krueger, A. (1998). Computing Inequality: Have Computers

Changed the Labor Market? Quarterly Journal of Economics, 113(4):1169–1213.

Autor, D., Levy, F., and Murnane, R. (2003). The Skill Content of Recent Technological

Change: An Empirical Exploration. Quarterly Journal of Economics, 118(4):1279–

1333.

Bartel, A., Ichniowski, C., and Shaw, K. (2007). How Does Information Technology

36

Really Affect Productivity? Plant-Level Comparisons of Product Innovation, Process

Improvement and Worker Skills. Quarterly Journal of Economics, 122(4):1721–1758.

Beaudry, P. and Green, D. (2005). Changes in US wages, 1976-2000: ongoing skill bias

or major technological change? Journal of Labor Economics, 23(3):609–648.

Beaudry, P., Green, D., and Sand, B. (2016). The great reversal in the demand for skill

and cognitive tasks. Journal of Labor Economics, 34(S1):S199–S247.

Benzell, S., Kotlikoff, L., LaGarda, G., and Sachs, J. (2017). Robots are us: some

economics of human replacement. Working Paper WP-785.

Buera, F. and Kaboski, J. (2012). The Rise of the Service Economy. American Economic

Review, 102(6):2540–2569.

Card, D. and DiNardo, J. (2002). Skill Biased Technological Change and Rising Wage

Inequality: Some Problems and Puzzles. Journal of Labor Economics, 20(4):733–783.

Caselli, F. (1999). Technological Revolutions. American Economic Review, 89(1):78–102.

Christiano, E. and Evans (2005). Nominal rigidities and the dynamic effects of a shock

to monetary policy. Journal of Political Economy, 113(1).

Cummins, J. and Violante, G. (2002). Investment-specific technical change in the united

states (1947-2000): Measurement and macroeconomic consequences. Review of Eco-

nomic Dynamics, 5(2):243–284.

Elsby, M., Hobijn, B., and Sahin, A. (2013). The Decline of the U.S. Labor Share.

Brookings Papers on Economic Activity, 47(2):1–63.

Feng, A. and Graetz, G. (2016). Rise of the machines: The effects of labor-saving

innovations on jobs and wages. Working Paper.

Galor, O. and Moav, O. (2000). Ability-Biased Technological Transition, Wage Inequal-

ity, and Economic Growth. Quarterly Journal of Economics, 115(2):469–497.

Goldin, C. and Katz, L. (2008). The Race between Education and Technology. Harvard

University Press.

Goos, M., Manning, A., and Salomons, A. (2009). Job Polarization in Europe. American

Economic Review, 99(2):58–63.

Gordon, R. J. (1990). The measurement of durable goods prices. NBER Monograph

Series.

Graetz, G. and Michaels, G. (2018). Robots at work. CEPDP 1330.

Grossman, G., Helpman, E., Oberfield, E., and Sampson, T. (2017). Balanced growth

despite uzawa. The American Economic Review, 107(4):1293–1312.

Habakkuk, J. (1962). American and British Technology in the Nineteenth Century.

37

Cambridge University Press.

He, H. and Liu, Z. (2008). Investment-specific technological change, skill accumulation

and wage inequality. Review of Economic Dynamics, 11.

Hemous, D. and Olsen, M. (2016). The rise of the machines: Automation, horizontal

innovation and income inequality. CEPR Discussion Paper No 10244.

Hornbeck, R. and Naidu, S. (2014). When the levee breaks: Black migration and eco-

nomic development in the american south. American Economic Review, 104(3):963–

990.

Karabarbounis, L. and Neiman, B. (2014). The Global Decline of the Labor Share.

Quarterly Journal of Economics, 129(1):61–103.

Katz, L. and Margo, R. (2014). Technical change and the relative demand for skilled

labor: The united states in historical perspective. In Boustan, L., Frydman, C., and

Margo, R., editors, Human Capital in History, pages 15–57. University of Chicago and

NBER.

Katz, L. and Murphy, K. (1992). Changes in Relative Wages, 1963-1987: Supply and

Demand Factors. Quarterly Journal of Economics, 107(1):35–78.

Koh, D., Santaeulalia-Llopis, R., and Zheng, Y. (2016). Labor share decline and intel-

lectual property products capital. Working Paper.

Krusell, P., Ohanian, L., Rıos-Rull, J.-V., and Violante, G. (2000). Capital-Skill Com-

plementarity and Inequality: A Macroeconomic Analysis. Econometrica, 68(5):1029–

1053.

Lewis, E. (2011). Immigration, skill mix and capital skill complementarity. Quarterly

Journal of Economics, 126 (2):1029–1069.

Lloyd-Ellis, H. (1999). Endogenous Technological Change and Wage Inequality. Ameri-

can Economic Review, 89(1):47–77.

Nelson, R. and Phelps, E. (1966). Investment in Humans, Technological Diffusion, and

Economic Growth. American Economic Review, 56(1/2):69–75.

Newell, R. G., Jaffe, A. B., and Stavins, R. N. (1999). The induced innovation hypoth-

esis and energy-saving technological change. The Quarterly Journal of Economics,

114(3):941–975.

Nordhaus, W. (2007). Two Centuries of Productivity Growth in Computing. Journal of

Economic History, 67(1):128–159.

Peretto, P. and Seater, J. (2013). Factor-Eliminating Technical Change. Journal of

Monetary Economics, 60(4):459–473.

38

Piketty, T. (2014). Capital in the Twenty-First Century. Harvard University Press.

Piketty, T. and Zucman, G. (2014). Capital is Back: Wealth-Income Ratios in Rich

Countries 1700-2010. Quarterly Journal of Economics, 129(3):1255–1310.

Ray, D. (2014). Nit-Piketty: A comment on Thomas Piketty’s Capital in the Twenty

First Century. Working paper, New York University.

Romer, P. (1990). Endogenous Technological Change. Journal of Political Economy,

98(5):71–S102.

Sachs, J. and Kotlikoff, L. (2012). Smart machines and long-term misery. NBER wp

18629.

Spitz-Oener, A. (2006). Technical Change, Job Tasks, and Rising Educational Demands:

Looking outside the Wage Structure. Journal of Labor Economics, 24(2):235–270.

Trimborn, T., Koch, K.-J., and Steger, T. (2008). Multi-Dimensional Transitional Dy-

namics: A Simple Numerical Procedure. Macroeconomic Dynamics, 12(3):301–319.

Zeira, J. (1998). Workers, Machines, and Economic Growth. Quarterly Journal of

Economics, 113(4):1091–1117.

39

6 Main Appendix (For Online Publication)

6.1 A combined HO-AR model

We now assume that intermediate i is produced according to

y (i) = ((b (i) l (i)) + α (i) b (i)ς x (i))β

(b (i)h (i))1−β

instead of (2). That is we restrict attention for simplicity to the case where low-skill

labor and machines are perfect substitute in automated firms and we normalize ϕ to 1.

We assume that b (i) = exp (Bi) for some B > 0 and ς ∈ [0, 1]. ς represents the share of

technological progress in new varieties which is TFP-augmenting while 1−α is the share

which is purely labor augmenting (for both forms of labor). We assume that Nt grows

exogenously and linearly, so that Nt = nt for some n > 0. Further, once invented a good

has an exogenous probability η of becoming automated, and all high-skill workers are

hired in production (HP = H).

We find that in this model low-skill and high-skill wages grow at the same rate if

and only if the form of technological progress which is associated from moving from

an intermediate i to an intermediate i′ with a higher index is purely labor augmenting

(ς = 0). As soon as newer intermediates also feature more productive machines if

automated (ς > 0) then growth will be asymptotically unbalanced. This result is in the

spirit of Uzawa’s theorem but differs in so far as it refers to technological progress from

one intermediate to another instead of aggregate technological progress (which here

features increasing the range of intermediate inputs and automating some of them).

Formally, we establish in Appendix 7.3:

Proposition 5. Low-skill and high-skill wages grow asymptotically at the same rate if

and only if ς = 0. Otherwise, high-skill wages grow asymptotically faster than low-skill

wages: gwH > gwL .

6.2 Formal description of the normalized system of differential

equations

We first define an equilibrium as follows:

Definition 1. A feasible allocation is defined by time paths of stock of prod-

ucts and share of those that are automated, [Nt, Gt]∞t=0, time paths of use of low-

40

skill labor, high-skill labor, and machines in the production of intermediate inputs

[lt(i), ht(i), xt(i)]∞i∈[0,Nt],t=0, a time path of intermediate inputs production [yt(i)]

∞i∈[0,Nt],t=0,

time paths of high-skill workers engaged in automation [hAt (i)]∞i∈[0,Nt],t=0, and in hori-

zontal innovation [HDt ]∞t=0, time paths of final good production and consumption levels

[Yt, Ct]∞t=0 such that factor markets clear ((18) holds) and good market clears ((12) holds).

Definition 2. An equilibrium is a feasible allocation, a time path of intermediate

input prices [pt(i)]∞i∈[0,Nt],t=0, a time path for low-skill wages, high-skill wages, the interest

rate and the value of non-automated and automated firms [wLt, wHt, rt, VNt , V

At ]∞t=0 such

that [yt(i)]∞i∈[0,Nt],t=0 maximizes final good producer profits, [pt(i), lt(i), ht(i), xt(i)]

∞i∈[0,Nt],t=0

maximize intermediate inputs producers’ profits, [hAt (i)]∞i∈[0,Nt],t=0 maximizes the value

of non-automated firms, [HDt ]∞t=0 is determined by free entry, [Ct]

∞t=0 is consistent with

consumer optimization and the transversality condition is satisfied.

We now derive the system of differential equations satisfied by the normalized vari-

ables (nt, Gt, ht, χt). The definition of nt immediately gives:

nt = − β

(1− β)(1 + β(σ − 1))gNt nt. (31)

Rewriting (22) with hAt gives:

Gt = ηGκt

(hAt

)κ(1−Gt)−Gtg

Nt . (32)

Defining normalized profits πAt ≡ N1−ψt πAt and πNt ≡ N1−ψ

t πNt and the normalized values

of firms V At ≡ N1−ψ

t V At and V N

t ≡ N1−ψt V N

t , then we can rewrite (19) and (20) as

(rt − (ψ − 1) gNt

)V At = πAt +

·

V At , (33)

(rt − (ψ − 1) gNt

)V Nt = πNt + ηGκ

t

(hAt

)κ (V At − V N

t

)− vthAt +

·

V Nt . (34)

Equation (21) can similarly be rewritten as:

κηGκt

(hAt

)κ−1 (V At − V N

t

)= vt. (35)

Equation (23) with equality implies that and V Nt = vt/γ, therefore using (35) into (34),

41

we get: (rt − (ψ − 1) gNt

)vt = γπNt + γ

1− κκ

vthAt +

·vt. (36)

Taking the difference between (33) and (34) and using (35)we obtain:

(rt − (ψ − 1) gNt

) (V At − V N

t

)= πAt − πNt −

1− κκ

vthAt +

( ·

V At −

·

V Nt

).

Using again (35) we get,

(rt − (ψ − 1) gNt

)= κηGκ

t

(hAt

)κ−1[πAt − πNt

vt− 1− κ

κhAt

]

+κηGκt

(hAt

)κ−1 d

dt

(hAt

)1−κ

κηGκt

+

·vtvt.

Using (36), we can rewrite this expression a

γ

(πNtvt

+1− κκ

hAt

)= κηGκ

t

(hAt


vt− 1− κ

κhAt

]

+κηGκt

(hAt

)κ−1 d

dt

(hAt

)1−κ

κηGκt

.

Using (32), this leads to:

γ

(πNtvt

+1− κκ

hAt

)= κηGκ

t

(hAt


vt− 1− κ

κhAt

](37)

+ (1− κ)

·

h

A

t

hAt− κ

Gt

(ηGκ

t

(hAt

)κ(1−Gt)−Gtg

Nt

).

From the definition of ωt and nt, we get that wβ(1−σ)Lt = ωtnt, so that

πNt = ωtnt

(ϕ+ (ωtnt)

1µ

)−µπAt (38)

42

We can then reorder terms in (37) and use (38) to obtain:

·

h

A

t =γhAt

1− κ

(ωtnt

(ϕ+ (ωtnt)

1µ

)−µ πAtvt

+1− κκ

hAt

)(39)

−κηGκ

t

(hAt

)κ1− κ

(1− ωtnt

(ϕ+ (ωtnt)

1µ

)−µ) πAtvt

+ηGκt

(hAt

)κ+1

+κhAt

1− κ

(ηGκ−1

t

(hAt

)κ(1−Gt)− gNt

)Rewriting (24) using the definition of ct, leads to

rt = ρ+ θ

·ctct

+ θψgNt .

Combining this equation with (36) and (38), and using the definitions of χt, vt and πAt

leads to

χt = χt

(γωtnt

(ϕ+ (ωtnt)

1µ

)−µ πAtvt

+ γ1− κκ

hAt − ρ− (θψ − ψ + 1) gNt

). (40)

Together equations (31), (32), (39) and (40) form a system of differential equations which

depends on ωt, πAt /vt and gNt . To determine πAt /vt, recall that (as proved in the text),

profits are given by

π (wL, wH , α (i)) =(σ − 1)σ−1

σσc (wL, wH , α (i))1−σ Y. (41)

Using (4) and the definition of ωt, one gets:

πAt =(σ − 1)σ−1

σσ

(ββ (1− β)1−β

)σ−1 (ϕ+ (ωtnt)

1µ

)µw−ψ

−1

Ht Yt. (42)

Rearranging terms in (11) gives

vt =

(σ − 1

σ

) 11−β

ββ

1−β (1− β)(G(ϕ+ (ωtnt)

1µ

)µ+ (1−G)ωtnt

)ψ. (43)

Using (8), one further gets:

Yt = σψvtHPt N

ψt . (44)

43

Therefore, rewriting (42) with (43) and (44), one gets:

πAtvt

=ψ(ϕ+ (ωtnt)

1µ

)µHPt

G(ϕ+ (ωtnt)

1µ

)µ+ (1−G)ωtnt

, (45)

which still requires finding HPt . Using (4), (5), (6) and aggregating over all automated

firms, one gets the following expression for the total demand of machines:

Xt = βGtNtϕ

(σ − 1

σ

)σ (ββ (1− β)(1−β)

)σ−1 (ϕ+ (ωtnt)

1µ

)µ−1

w−ψ−1

Ht Yt.

Using (43), this expression can be rewritten as:

Xt =σ − 1

σ

βGtϕ(ϕ+ (ωtnt)

1µ

)µ−1

G(ϕ+ (ωtnt)

1µ

)µ+ (1−G)ωtnt

Yt. (46)

This together with (44) implies that ct obeys

ct =

1− σ − 1

σ

βGtϕ(ϕ+ (ωtnt)

1µ

)µ−1

G(ϕ+ (ωtnt)

1µ

)µ+ (1−G)ωtnt

σψvtHPt .

Combining this equation with the definition of χt and (43), leads to

HPt =

(σ−1σ

) 11−β ( 1

θ−β)

×(1−β)

1θ β

β1−β ( 1

θ−1)χ

1θt

(Gt

(ϕ+(ωtnt)

1µ

)µ+(1−Gt)ωtnt

)ψ( 1θ−1)+1

Gt

((1−β σ−1

σ )ϕ+(ωtnt)1µ

)(ϕ+(ωtnt)

1µ

)µ−1

+(1−Gt)ωt

. (47)

Using the definition of HDt and hAt , one can rewrite (18) for high-skill workers as:

gNt = γ(H −HP

t − (1−Gt) hAt

). (48)

Together (45), (47) and (48) determine πAt /vt and gNt as a function of the original

variables nt, Gt, hAt , χt and of ωt, which still needs to be determined. To do so, combine

44

(10) and (11), and use the definitions of nt and ωt to obtain an implicit definition of ωt:

ωt =

(σ−1σβ) 1

1−β HPt

L

(Gt

(ϕ+ (ωtnt)

1µ

)µ−1

(ωtnt)1−µµ + (1−Gt)

)×(Gt

(ϕ+ (ωtnt)

1µ

)µ+ (1−Gt)ωtnt

)ψ−1

β(1−σ)

1+β(σ−1)

. (49)

Therefore eventually the system of differential equations satisfied by nt, Gt, hAt , χt is

defined by (31), (32), (39) and (40), with πAt /vt, HPt , gNt and ωt given by (45), (47), (48)

and (49).

6.3 Negative growth for low-skill wages

This section presents two examples with negative growth low-skill wages. Further results

on the transitional dynamics derived using simulations are presented in Appendix 7.7.

We ensure temporary negative growth in low-skill wages in figure 7 by setting κ =

0.49, thereby introducing the externality in automation.30 Initially Gt is small and the

automation technology is quite unproductive. Hence, Phase 2 starts later, even though

the ratio(V At − V N

t

)/ (wHt/Nt) has already significantly risen (Panel B). Yet, Phase 2 is

more intense once it gets started, partly because of the sharp increase in the productivity

of the automation technology (following the increase in Gt) and partly because low-skill

wages are higher. Intense automation puts downward pressure on low-skill wages. At

the same time, horizontal innovation drops considerably, both because new firms are less

competitive than their automated counterparts, and because the high demand for high-

skill workers in automation innovations increases the cost of inventing a new product.

This results in a short-lived decline in low-skill wages. Indeed, the decline in wLt (and

increase in high-skill wage wHt) lowers the incentive to automate (Panel B), which in

return reduces automation.

30We choose this value for κ instead of 0.5, because in that case there is no horizontal innovation forsome time periods (that is (23) holds with a strict inequality). This is not an issue in principle butsimulating this case would require a different numerical approach.

45

0 100 200 300 400Year

0

2

4

6

8

10

Per

cen

t

Panel A: Growth Rates of Wages and GDP

gGDP

gwL

gwH

0 100 200 300 4000

5

10automation incentive (left)

0 100 200 300 400Year0

2

4

6

log

unit

Panel B: Automation Incentive and Skill Premium


0 100 200 300 4000

5

10

15

Per

cen

t


0 100 200 300 400Year

0

20

40

60

80

100Panel C: Research Expenditures and G

G (right)

0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t

Panel D: Factor Shares


Figure 7: Transitional Dynamics with temporary decline in low-skill wages with an automa-tion externality. Note: same as for Figure 3 but with an automation externality ofκ = 0.49.

Importantly, low-skill wages can drop for κ = 0—albeit for a small parameter set—as

shown in Figure 8 where low-skill wages slightly decline for a short time period (our

numerical investigation suggests that larger declines in the absence of an automation

externality need to be associated with periods where horizontal innovation completely

ceases). The associated parameters are given in Table 3.

Table 3: Baseline Parameter Specification

σ ε β H L θ η κ ϕ ρ κ γ

3 73 0.72 0.35 0.65 2 0.2 0.97 0.25 0.022 0 0.28

The crucial parameter change is an increase in κ, such that the automation technology

is less concave. This delays Phase 2, which is then more intense and leads to a sharp

increase in high-skill wages, reducing considerably horizontal innovation (note that in the

period where low-skill wages decline the share of high-skill workers hired in production

increases slightly, which has a positive contribution on low-skill wages’ growth rates).

46

0 50 100 150 200 250 300 350 400Year

-1

0

1

2

3

4

5

6

7

Per

cen

t


gGDP

gwL

gwH

0 50 100 150 200 250 300 350 400Year

0

2

4

6

8

10

12

14

16

Per

cen

t


0 50 100 150 200 250 300 350 400Year

0

20

40

60

80

100

Per

cen

t

Panel B: Research Expenditures and G

G (right)

Figure 8: Transitional dynamics with temporary decline in low-skill wages without an au-tomation externality.

6.4 An endogenous supply response in the skill distribution

We present here an extension of the baseline model with an endogenous supply re-

sponse in the skill distribution. Specifically, let there be a unit mass of heterogeneous

individuals, indexed by j ∈ [0, 1] each endowed with lH units of low-skill labor and

Γ (j) = H (1+q)qj1/q units of high-skill labor (the important assumption here is the exis-

tence of a fat tail of individuals with low ability). The parameter q > 0 governs the shape

of the ability distribution with q → ∞ implying equal distribution of skills and q < ∞implying a ranking of increasing endowments of high-skill on [0, H(1+q)/q]. Proposition

3 can be extended to this case and in fact the steady state values (G∗, hA∗, gN∗, χ∗) are

the same as in the model with a fixed high-skill labor supply H. Proposition 2 also ap-

plies except that the asymptotic growth rate of low-skill wages is higher (see Appendix

7.10):

gwH∞ = gY∞ = ψgN∞ and gwL∞ =1 + q

1 + q + β(σ − 1)gY∞. (50)

At all points in time there exists an indifferent worker (jt) where wLt = (1+q)/q(jt)1/qwHt,

with all j ≤ jt working as low-skill workers and all j > jt working as high-skill work-

ers. This introduces an endogenous supply response as the diverging wages for low- and

high-skill workers encourage shifts from low-skill to high-skill jobs, which then dampens

the relative decline in low-skill wages. Hence, besides securing themselves a higher fu-

ture wage growth, low-skill workers who switch to a high-skill occupation also benefit

the remaining low-skill workers. Since all changes in the stock of labor are driven by

demand-side effects, wages and employment move in the same direction.

Figure 9 shows the transitional dynamics for this model when the common parameters

47

are the same as in Table 1, H = 1/3 (so that G∗, hA∗, gN∗, χ∗ are the same as in the

baseline model), l = 1 and q = 0.3. The figure looks similar to Figure 3, but the gap

in steady-state between the low-skill growth rate and the high-skill growth rate is a bit

smaller. In addition Panel B shows that the skill ratio increases from Phase 2 and Panel

A shows that the growth rate is lower in Phase 1 as the mass of high-skill workers is

lower then.

0 100 200 300 400Year

0

0.5

1

1.5

2

2.5

3

3.5

Per

cen

t


gGDP

gwL

gwH

0 100 200 300 400Year

-1

0

1

2

3

4

log

units

Panel B: Skill ratio and Skillpremium

skill ratioskillpremium

0 100 200 300 400Year

0

5

10

15

Per

cen

t


0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t

Panel C: Research expenditures and G

G (right)

0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t


Wage shareLow-skill labor share

Figure 9: Transitional Dynamics for model with endogenous skill supply. Panel A showsgrowth rates for GDP, low-skill wages (wL) and high-skill wages (wH), Panel Bthe skill ratio and the skill premium, Panel C the total spending on horizontalinnovation and automation as well as the share of automated products (G), andPanel D the wage share of GDP for total wages and low-skill wages.

6.5 Parameters identification

In this section we discuss how our parameters are identified, first by carrying a back-

of-the-envelope calibration, second by computing the elasticities of the initial and final

values of the series we match with respect to the parameters, and third by computing how

precisely each parameter is identified. We then discuss specifically how κ is determined

and finally carry an out-of-sample prediction exercise, where we only use the first 30

years of the data to calibrate our parameters.

48

6.5.1 Back-of-the-envelope calibration

We first study how the production parameters σ, β1, β2, β4 and ∆ would be identified

under a naive back-of-the-envelope calibration, where we assume that in 1963 the U.S.

economy was in the first phase while in 2007, it was in the third phase. Since both

assumptions are actually not met in our estimation, this naive calibration gives param-

eters that are still far from those which we actually estimate. Nevertheless, the exercise

is informative to understand how these production parameters are related to moments

in the data.

Assuming that the economy in 1963 is close to the first phase, and using (159), we

get that the skill premium must obey:

wH1963

wL1963

≈ β2

β1

L1963

H1963 − 1γgN1963

.

Further, using that most high-skill workers work in production, such that 1γgN1963 is small

relative to H, we obtainβ2

β1

≈ wH1963H1963

wL1963L1963

, (51)

so that the ratio β2/β1 is determined by the ratio between the high-skill wage bill and

the low-skill wage bill. Because the economy is in fact not in the first phase in 1963

(with an equipment stock to GDP ratio which is not 0), this approximation is likely to

overstate the ratio β2/β1. Similarly, using (159), (160), and (165), we get that the labor

share in 1963 should obey

ls1963 ≈β2

HH− 1

γgN1963

+ β1

σσ−1

+ β2

1γgN1963

H− 1γgN1963

,

which simplifies into

ls1963 ≈σ − 1

σ(β2 + β1) , (52)

if most high-skill workers are in production. Therefore, given σ, the initial labor share

determines β3, the ‘external’ capital share. We can then combine (51) and (52) to obtain

β1 ≈1

wH1963H1963

wL1963L1963+ 1

ls1963

1− 1σ

, (53)

49

so that β1 which the Cobb-Douglas share for low-skill workers in phase 1 is given by the

labor share in 1963 and the ratio between the high-skill wage bill and the low-skill wage

bill, and σ which determines mark-ups.

Combining (168) and (168), we get that if the economy is close to its asymptotic

steady-state in 2007, the growth rate of the skill premium is given by

gsp2007 ≈β1 (σ − 1) (1− β4)

1 + β1 (σ − 1)gGDP2007 . (54)

Using (159), (160) (165), the labor share now obeys:

ls2007 ≈ H

[σH

(σ − 1) (β2 + β1β4)−(

σ

(σ − 1) (β2 + β1β4)− 1

)(1

γgN1963 +HA

1963

)]−1

,

which under the assumption that most high-skill workers are in production would sim-

plify again into

ls2007 ≈σ − 1

σ(β2 + β1β4) . (55)

Combining (51), (52) and (55) we obtain:

β4 ≈ 1−(

1− ls2007

ls1963

)(wH1963H1963

wL1963L1963

+ 1

). (56)

Therefore, in this approximation, β4 is identified through the decline in the labor share

and the initial wage bill ratio between high-skill and low-skill workers. In the data the

labor share does not monotonically decline. To understand how the parameters are

identified, we replace ls2007 by the lowest value over 1983-2007 (which is 62.2%) and

ls1963 by the highest value (68.8%). With wH1963H1963

wL1963L1963= 0.562, we then obtain β4 ≈ 0.85.

This is higher than the value we actually end up finding (β4 = 0.73), mostly because

the economy is still far from its steady-state in 2007 (so that ls2007 is higher than the

asymptotic value of the labor share).

Using (53), (54) and (56) we obtain:

σ ≈ 1

ls1963

[gGDP2007

gsp2007

(1− ls2007

ls1963

)− 1

wH1963H1963wL1963L1963

+1

] .that is given the initial wage bill ratio and the labor shares in 1963 and 1964, which

informs us about β1, β2 and β4, σ is determined by the ratio between the growth rate

50

of GDP and that of the skill-premium in the third phase. The larger is σ, the more

automated firms gain over non-automated ones and therefore the more the skill premium

rises relative to GDP: hence a lowergGDP2007

gsp2007is associated with a larger σ. When using the

last 5 years to determinegGDP2007

gsp2007, we find that σ ≈ 5.53, while our estimation procedure

leads to σ = 6.7.

Given σ, one can then find β1 using (53), we find β1 ≈ 0.54, below but not too far

from the estimated value of 0.62 (this approximation is not too sensitive on σ provided

that σ is large enough). Using (51), we then obtain β2 ≈ 0.3 which is higher than the

estimated value of 0.18, in line with the fact that (51) gives an overestimate of β2/β1.

To get a proxy for ∆, we look at the steady-state value for the equipment to GDP

ratio. Using (184), (185) and the definition of GDP, we obtain that

K∗

GDP∗ =

1

r∗β3 + β1 (1− β4)

σσ−1

+ (β2 + β1β4)(

HHP∗ − 1

) .Denote by Keq the stock of capital used as equipment, we get

K∗eq =β1 (1− β4)

β3 + β1 (1− β4)K∗,

since in steady-state the economy is Cobb-Douglas with a total physical capital share of

β3 + β1 (1− β4) and an equipment of share β1 (1− β4). Using (183), we obtain

K∗eq

GDP∗ =

1

ρ+ ∆ + θgGDP∗β1 (1− β4)

σσ−1

+ (β2 + β1β4)(

HHP∗ − 1

) .Therefore, assuming that in 2000 (the last year for which we have data on the equipment

to GDP ratio), we are close to the steady-state, and that most high-skill workers are in

production, we get

ρ+ ∆ + θgGDP2000 =

(σ − 1

σ

)β1 (1− β4)

Keq,2000GDP2000

≈ 0.0443 (57)

using the values computed above. It is therefore not surprising that we find a low ∆

in the estimation. This is due in particular to the high-level of Keq,2000GDP2000

= 1.5 (with the

actual estimated values for σ, β1 and β4 we would still find that ρ+∆+θgGDP2000 ≈ 0.094).

Yet, as explained in footnote 28, this level is somewhat arbitrary.

51

6.5.2 Parameters’ effect on empirical moments

In this section we illustrate the role that parameters have on the empirical moments

which allows us to identify what features of the data pin down the parameters. Taking

as our starting point the parameter estimates of Section 4, we iteratively change each

on the parameters by 2 per cent and illustrate the resulting effect on the initial (1963)

and the final (2007) value of each of the four empirical paths. Table 4 gives the resulting

elasticities (note that β3 is completely determined by β1 and β2).

The initial skill premium is most strongly affected by the production function pa-

rameters β1, β2 and β4: A higher share of high-skill workers in production, β2, directly

increases the skill-premium. A higher value of β4 makes automation more expensive,

which increases the demand for low-skill workers and reduces the skill premium. A

higher β1 implies a lower β3 which reduces the role of structural capital, reduces the

rental rate of capital, which increases the use of capital and thereby increases the skill-

premium. β2 has the opposite effect on the skill premium in 2007. A higher β2 reduces

the multiplier of Nt on output, Yt which reduces the growth rate of the economy. The

automation technology parameters, κ, κ, η also have a large effect on the skill premium

in 2007.

The initial labor share depends on β1, β2 and β4, the latter having a much larger

effect in 2007 since the share of automated products is much larger.

GDP/labor is mechanically affected negatively by higher σ since we keep the stock of

products in 1963 constant. Both β1 and β2 reduce the importance of structural capital

and thereby have a negative effect on GDP/labor in 1963 as the stock of capital is

sufficiently large. In 2007 σ, β1,β2, β4 all reduce the multiplier of Nt on Yt and therefore

GDP/labor. The innovation parameters γ, η lead to higher growth and therefore higher

GDP/labor in 2007, though naturally not in 1963.

Capital equipment / GDP in 1963 depends positively on β1 and negatively on β4

because the initial capital stock is fixed. For 2007, a higher β4 increases the cost of

automation and thereby reduces the stock of Keq/GDP . The horizontal innovation pro-

ductivity, γ, encourages more innovation. This drives up the wage of high-skill workers

in 1963, makes automation more expensive and reduces Keq/GDP . It further increases

the growth rate of the economy and reduces G2007 such that Keq/GDP in 2007 is lower

as well.

52

Parameters Skill premium Labor share GDP/labor Keq/GDP1963 2007 1963 2007 1963 2007 1963 2007

σεβ1

γκθηκρβ2

∆β4

N1963

G1963

−0.1 0.1−0.1 −0.2

0.4 0.00.1 −0.2−0.2 −0.4

0.0 −0.10.1 0.4−0.1 −0.4−0.1 −0.1

0.5 −0.20.0 −0.1−0.7 −1.9

0.0 0.10.1 0.0

0.1 0.00.0 0.00.7 0.60.0 0.10.0 0.10.0 0.00.0 0.00.0 0.00.0 0.00.2 0.30.0 0.00.1 0.60.0 0.00.0 0.0

−0.6 −1.90.0 −0.1−0.8 −2.3

0.0 0.60.0 −0.20.0 −0.20.0 0.20.0 −0.30.0 −0.4−0.5 −1.4

0.0 −0.1−0.1 −1.3

0.2 0.20.0 0.0

2.1 1.1−0.2 −0.1

8.3 1.8−0.2 −0.5

0.3 −0.40.1 −0.2−0.1 0.2

0.3 −0.20.2 −0.31.1 −0.10.0 −0.2−5.1 −3.7−0.2 0.0

0.5 0.0

Table 4: The effect of parameters on the four empirical paths (numbers refer to elasticitiesof empirical value wrt. parameter)

6.5.3 The precision of parameters

In the following we calculate the effect the parameters have on the aggregate final mo-

ment. We do this allowing for all the other parameters to adjust, illustrating how

precisely each of the parameters are determined. Since deviations from the minimum

parameter values are naturally second order we do not compute elasticities. Instead for

a given parameter θi consider

V (θi, θ−i(θi)),

where θ−i(θi) are the parameters that minimize V for any given θi and θi = argminθiV (θi, θ−i(θi))

is the minimizing value of θi. Consequently, a Taylor expansion around θi yields:

V (θi, θ−i(θi))− V (θi, θ−i(θi))

V (θi, θ−i(θi))/

(θi − θiθi

)2

≈ 1

2

1

V (θi, θ−i(θi))

d2V (θi, θ−i(θi))

dθ2i

θ2i .

We compute the expression on the left. The results are in Table 5 for a 5% shock on

the parameter of interest. It shows that the parameters that govern the production

function: (σ, β1, β2, β4) are the hardest to vary and consequently the ones most precisely

identified. The exception is ε, the elasticity between low-skill labor and machines, which

as Proposition 2 makes clear, does not govern the asymptotic growth of income inequality.

53

Parameters σ ε β1 γ κ θ η κ ρ β2 ∆ β4 N1963 G1963

Curvature 82.0 2.7 219.3 3.7 1.7 4.2 3.0 47.1 4.8 12.8 2.4 2142.1 8.3 2.0

Table 5: The “curvature” of deviating from the optimal parameter

Parameters σ ε β1 γ κ θ η κ ρ β2 ∆ β4 N1963 G1963

Curvature 6.7 4.95 0.62 0.64 0.57 1.03 0.40 0.57 0.038 0.17 0.015 0.75 20.4 0.02

Table 6: Parameters (only matching the first 30 years)

ρ, θ, η, γ all govern the growth rate of the economy and are weakly identified individually.

∆ is also not well identified because it mostly affects the growth rate of the capital stock

which also depends on ρ and θ (equation 57). Given that this parameter is the one

estimated outside a common range this is a reassuring finding.

6.5.4 The role of the automation externality

To analyze more specifically the role played by the automation externality, we recali-

brate our model but without the automation externality (i.e., we impose that κ = 0).

Figure 10 reports the results. The model still reproduces the paths for the labor share,

GDP/employment and equipment/GDP. Yet, it does not capture the evolution of the

skill premium. Indeed, the fast rise in the skill premium in the 1980s and 1990s require

an “accelerated Phase 2” which, given the moderate decline in the labor share and the

stable economic growth, can only be brought about by a positive automation externality.

The data clearly favor a positive automation externality (even though the exact value

of κ is not precisely estimated, see Table 5).

6.5.5 Out-of-sample prediction

Finally, we reproduce our calibration exercise but only trying to match the first 30 years

of data. Figure 11 reports the results and Table 6 gives the new parameters. The model

behaves quite well out-of-sample. The parameter estimates are very similar and the

predicted path are close. The largest difference is in the skill premium, where the model

calibrated over the first 30 years captures the upward trend but underestimates the pace.

In addition, the parameters are very similar to those in the full sample calibration of

Table 2.

54

1960 1970 1980 1990 2000 2010 2020Years

1

1.2

1.4

1.6

1.8

2

2.2

Ski

ll-P

rem

ium

(ra

tio)



1960 1970 1980 1990 2000 2010 2020Years

50

55

60

65

70

75

80

Per

cen

t


1960 1970 1980 1990 2000 2010 2020Years

50

100

150

200

250300350400

Inde

x (1

963

= 1

00)


1960 1970 1980 1990 2000 2010 2020Years

0.25

0.5

1

2

3

Equ

ipm

ent /

GD

P


Figure 10: Predicted and empirical time paths for κ = 0

1960 1970 1980 1990 2000 2010 2020Years

1

1.5

2

Ski

ll-P

rem

ium

(ra

tio)



1960 1970 1980 1990 2000 2010 2020Years

50

60

70

80

Per

cen

t


1960 1970 1980 1990 2000 2010 2020Years

100

200

300

400

Inde

x (1

963

= 1

00)


1960 1970 1980 1990 2000 2010 2020Years

0.25

0.5

1

2

3

Equ

ipm

ent /

GD

P


Figure 11: Predicted and empirical time paths only matching the first 30 years

55

7 Secondary Appendix (For Online Publication)

7.1 Relationship between wages and N and G

7.1.1 Imperfect substitute case: ε <∞.

We first focus on the imperfect substitute case. Rewrite (10) as

wHwL

=1− ββ

L

HP

G(1 + ϕwε−1

L

)µ+ 1−G

G(1 + ϕwε−1

L

)µ−1+ 1−G

. (58)

Since 0 < µ < 1, (58) establishes wH as a function of G,HP and wL (but notN) such that

wH is increasing in wL and G and decreasing in HP , with wH/wL > (1− β) /β ×L/HP

for G > 0. (11) similarly establishes wH as a function of N,G and wL (but not HP ), wH

is decreasing in wL and increasing in N and G. It is then immediate that wH , wL are

jointly uniquely determined by (58) and (11) for given N,G and HP , both increase in

N , and wH increases in G (in addition, (11) traces an iso-cost curve in the input prices

plan which is convex).

In addition, (58) shows that wH/wL increases with wL. Since wL increases in N ,

then wH/wL increases in N as well (and following (13) the labor share decreases in N).

Assume that wL decreases in G, then since wH increases in G, we immediately get that

wH/wL increases in G. Assumes now that wL increases in G, then the right-hand side of

(58) increases with G both directly and because wL increases, this ensures that wH/wL

increases in G (and following (13) the labor share decreases in G). Therefore both an

increase in N and an increase in G are skill-biased.

Comparative statics of wL with respect to G. We now analyze how wL changes

with G (for given N and HP ). To do so, we combine both equations to get:

wL =σ−1σβ(HP

L

)(1−β)

N1

σ−1

(G(1 + ϕwε−1

L

)µ−1+ (1−G)

)1−β

×(G(1 + ϕwε−1

L

)µ+ (1−G)

) 1σ−1−(1−β)

, (59)

56

Log differentiating with respect to G one obtains:

wL =

1

σ − 1

(1 + ϕwε−1

L

)µ − 1

G(1 + ϕwε−1

L

)µ+ (1−G)︸︷︷︸

scale effect

− (1− β)

(1−

(1 + ϕwε−1

L

)µ−1

G(1 + ϕwε−1

L

)µ−1+ (1−G)

+

(1 + ϕwε−1

L

)µ − 1

G(1 + ϕwε−1

L

)µ+ (1−G)

)︸︷︷︸

substitution effect

GG

Den

(60)

where

Den ≡ 1− βϕwε−1L(

1 + ϕwε−1L

) G(1 + ϕwε−1

L

)µG(1 + ϕwε−1

L

)µ+ (1−G)

+ϕwε−1

L (ε− 1) (1− β)(1 + ϕwε−1

L

) (µG(1 + ϕwε−1

L

)µG(1 + ϕwε−1

L

)µ+ 1−G

+(1− µ)G

(1 + ϕwε−1

L

)µ−1

G(1 + ϕwε−1

L

)µ−1+ 1−G

).

Den > 0 as ε > 1, µ ∈ (0, 1) andβϕwε−1

L

1+ϕwε−1L

G(1+ϕwε−1L )

µ

G(1+ϕwε−1L )

µ+(1−G)

< 1. In (60) the scale effect

term is positive as(1 + ϕwε−1

L

)µ − 1 > 0. This term comes from the differentiation of

(11) with respect to G at constant wH (hence it represents the shift right of the isocost

curve). The substitution effect term is negative because 1 −(1 + ϕwε−1

L

)µ−1> 0 since

µ < 1, it comes from the differentiation of (10) with respect to G.

First note that if 1σ−1≤ 1−β, the scale effect is always dominated by the substitution

effect. Hence wL is decreasing in G.

If on, the other hand 1σ−1

> 1−β, then the scale effect is dominated by the substitu-

tion effect provided that1−(1+ϕwε−1

L )µ−1

G(1+ϕwε−1L )

µ−1+1−G

/(1+ϕwε−1

L )µ−1

G(1+ϕwε−1L )

µ+1−G

is large enough. From (59)

we get:

wL =σ − 1

σβ

(HP

L

)(1−β)

N1

σ−1

1−G(

1−(1 + ϕwε−1

L

)µ−1)

G((

1 + ϕwε−1L

)µ − 1)

+ 1

1−β (G((

1 + ϕwε−1L

)µ − 1)

+ 1) 1σ−1

>σ − 1

σβ

(HP

L

)(1−β)

N1

σ−1

((1 + ϕwε−1

L

)−1)1−β

,

57

where the last line uses that G ∈ [0, 1]. We then obtain that

wL(1 + ϕwε−1

L

)1−β>σ − 1

σβ

(HP

L

)(1−β)

N1

σ−1 ,

which ensures that limN→∞

wL =∞ uniformly with respect to G (i.e. for any wL > 0, there

exist N such that for any N > N and any G, w > wL). Since

limwL→∞,G→1

1−(1+ϕwε−1L )

µ−1

G(1+ϕwε−1L )

µ−1+1−G

(1+ϕwε−1L )

µ−1

G(1+ϕwε−1L )

µ+1−G

=∞,

we get that

limN→∞,G→1

1−(1+ϕwε−1L )

µ−1

G(1+ϕwε−1L )

µ−1+(1−G)

(1+ϕwε−1L )

µ−1

G(1+ϕwε−1L )

µ+(1−G)

=∞.

Therefore for N and G large enough the substitution effect dominates. This achieves

the proof of Proposition 1.

Increase in the number of non-automated products. To study the effect of an

increase in the number of non-automated products only, we log differentiate (59) with

respect to both N and G and obtain:

wL =

( 1σ−1−(1−β))((1+ϕwε−1

L )µ−1)

G(1+ϕwε−1L )

µ+(1−G)

−(1−β)

(1−(1+ϕwε−1

L )µ−1

)G(1+ϕwε−1

L )µ−1

+(1−G)

GG+1

σ − 1N

1

Den

In that case NG is a constant, so that G = −N , therefore denoting wNTL (NT for “new

tasks”), the change in wL, we get:

wNTL =

(1−β)G((1+ϕwε−1L )

µ−1)

G(1+ϕwε−1L )

µ+(1−G)

+(1−β)G

(1−(1+ϕwε−1

L )µ−1

)G(1+ϕwε−1

L )µ−1

+(1−G)

+ (σ−1)−1

G(1+ϕwε−1L )

µ+(1−G)

N

Den. (61)

Hence low-skill wages always increase with the arrival of non-automated products. Log-

58

differentiating (10), one gets:

wH − wL =

(µG(1 + ϕwε−1

L

)µG(1 + ϕwε−1

L

)µ+ 1−G

+(1− µ)G

(1 + ϕwε−1

L

)µ−1

G(1 + ϕwε−1

L

)µ−1+ 1−G

)(ε− 1)ϕwε−1

L

1 + ϕwε−1L

wL

+

G((

1 + ϕwε−1L

)µ − 1)

G(1 + ϕwε−1

L

)µ+ 1−G

+G(

1−(1 + ϕwε−1

L

)µ−1)

G(1 + ϕwε−1

L

)µ−1+ 1−G

G.

Using (61) and that G = −N , then we get that following a change in the mass of

non-automated products (keeping the mass of automated products constant):

wNTH =

((+

(1− µ)G(1 + ϕwε−1

L

)µ−1

G(1 + ϕwε−1

L

)µ−1+ (1−G)

)(ε− 1)ϕwε−1

L

1 + ϕwε−1L

+ 1

)

×

(1−β)G((1+ϕwε−1L )

µ−1)

G(1+ϕwε−1L )

µ+(1−G)

+(1−β)G

(1−(1+ϕwε−1

L )µ−1

)G(1+ϕwε−1

L )µ−1

+(1−G)

+ (σ−1)−1

G(1+ϕwε−1L )

µ+(1−G)

N

Den

−

G((

1 + ϕwε−1L

)µ − 1)

G(1 + ϕwε−1

L

)µ+ (1−G)

+G(

1−(1 + ϕwε−1

L

)µ−1)

G(1 + ϕwε−1

L

)µ−1+ (1−G)

N .

We then obtain:

wNTH =

((

µG(1+ϕwε−1L )

µ

G(1+ϕwε−1L )

µ+1−G

+(1−µ)G(1+ϕwε−1

L )µ−1

G(1+ϕwε−1L )

µ−1+1−G

)(ε−1)ϕwε−1

L

1+ϕwε−1L

+ 1

)(σ−1)−1

G(1+ϕwε−1L )

µ+1−G

+β

(G((1+ϕwε−1

L )µ−1)

G(1+ϕwε−1L )

µ+1−G

+G(

1−(1+ϕwε−1L )

µ−1)

G(1+ϕwε−1L )

µ−1+1−G

)[G(1+ϕwε−1

L )µ

G(1+ϕwε−1L )

µ+1−G

ϕwε−1L

(1+ϕwε−1L )− 1

] N

Den

=(σ − 1)−1

G(1 + ϕwε−1

L

)µ+ 1−G

(1 +

G (1− µ)(1 + ϕwε−1

L

)µ−1

G(1 + ϕwε−1

L

)µ−1+ 1−G

(ε− 1)ϕwε−1L

1 + ϕwε−1L

)N

Den

Therefore an increase in the mass of non-automated products leads to higher high-skill

wages. Finally we obtain

wNTH − wNTL =

(σ−1)−1

G(1+ϕwε−1L )

µ+1−G

((1−µ)(1+ϕwε−1

L )µ−1

G(1+ϕwε−1L )

µ−1+1−G

(ε−1)ϕwε−1L

1+ϕwε−1L

)− (1− β)

[(1+ϕwε−1

L )µ−1

G(1+ϕwε−1L )

µ+1−G

+1−(1+ϕwε−1

L )µ−1

G(1+ϕwε−1L )

µ−1+1−G

] GN

Den

59

wNTH − wNTL =

[(ε−1σ−1− β

)1

1+ϕwε−1L

− (1− β)] (

1 + ϕwε−1L

)µ−1ϕwε−1

L(G(1 + ϕwε−1

L

)µ+ 1−G

) (G(1 + ϕwε−1

L

)µ−1+ 1−G

) GNDen

.

Therefore an increase in the mass of non-automated products reduces the skill premium

(and increases the labor share) if and only if 1− β >(ε−1σ−1− β

)1

1+ϕwε−1L

. This in turn is

true for wL sufficiently large (that is N large enough) or for ε < σ.

7.1.2 Perfect substitute case:ε =∞

In the perfect substitute case, there are three possibilities. Case i) wL < ϕ−1: automated

firms only use low-skill workers and low-skill wages are given by

wL =σ − 1

σβ

(HP

L

)1−β

N1

σ−1 , (62)

with a skill premium obeying wHwL

= 1−ββ

LHP .

Case ii) wL = ϕ−1: automated firms use machines but also possibly workers, in which

case high-skill wages can be obtained from (11) which is now written as:

σ

σ − 1

N1

1−σ

ββ (1− β)1−β ϕ−βw1−β

H = 1.

Case iii) wL > ϕ−1 and all automated firms use machines only, in that case, we get

that (59) is replaced by

wL =σ − 1

σβ

(HP

L

)(1−β)

N1

σ−1 (1−G)1−β(G (ϕwL)β(σ−1) + 1−G

) 1σ−1−(1−β)

, (63)

and the skill premium obeys:

wHwL

=1− ββ

L

HP

G (wLϕ)β(σ−1) + 1−G1−G

. (64)

60

One can rewrite (63) as

w1−βL =

σ − 1

σβ

(HP

L

)(1−β) N1

σ−1 (1−G)1−β(G+ (1−G) (ϕwL)−β(σ−1)

) 1σ−1

ϕβ(G (ϕwL)β(σ−1) + 1−G

)1−β .

The left-hand side increases in wL and the right-hand side decreases in wL, hence this

expression defines wL uniquely, in addition, the solution is greater than ϕ−1 if and only

if

N1

σ−1 (1−G)1−β >σ

(σ − 1) βϕ

(L

HP

)(1−β)

.

Hence wL and wH are defined uniquely as functions of N,G and HP . If N1

σ−1 <σ

(σ−1)βϕ

(LHP

)(1−β), we are in case i), if N

1σ−1 (1−G)1−β ≤ σ

(σ−1)βϕ

(LHP

)(1−β) ≤ N1

σ−1

then we are in case ii) and if N1

σ−1 (1−G)1−β > σ(σ−1)βϕ

(LHP

)(1−β), we are in case iii).

It is then direct to show that wH increases in N and weakly increases in G, that

wH/wL is weakly increasing in N and G (weakly because of case i)), and that wL is

weakly increasing in N (weakly because of case ii)).

Comparative statics of wL with respect to G. Furthermore, (63) shows that

wL is decreasing in G in case iii) if 1σ−1≤ 1 − β. Therefore wL is weakly decreasing in

G if 1σ−1≤ 1− β.

Assume now that 1σ−1

> 1− β. Log-differentiating (63), one gets:

wL =

( 1

σ − 1− (1− β)

) ((ϕwL)β(σ−1) − 1

)G

G (ϕwL)β(σ−1) + (1−G)− (1− β)

G

1−G

G

Den(65)

where

Den ≡ 1− β G (ϕwL)β(σ−1)

G (ϕwL)β(σ−1) + 1−G+

(1− β) β (σ − 1)G (ϕwL)β(σ−1)

G (ϕwL)β(σ−1) + 1−G.

We have

(1

σ − 1− (1− β)

) ((ϕwL)β(σ−1) − 1

)G

G (ϕwL)β(σ−1) + 1−G− (1− β)

G

1−G

<1

σ − 1− 1− β

1−G,

61

which is negative for G large enough. Hence we obtain that for G high enough, wL is

weakly decreasing in G (strictly in case iii)).

Increase in the number of non-automated products. In case i) an increase in

the mass of non-automated products leads to an increase in wH and wL while wH/wL is

constant. In case ii), wH increases, wH/wL increases and wL is constant.

Log-differentiating (63) with respect to both N and G, one gets:

wL =

( 1

σ − 1− (1− β)

) ((ϕwL)β(σ−1) − 1

)G

G (ϕwL)β(σ−1) + 1−G− (1− β)

G

1−G

G+1

σ − 1N

1

Den.

For an increase in the mass of non-automated products, G = −N , so that the change in

wL in that case is given by:

wNTL =

(1− β)

(

(ϕwL)β(σ−1) − 1)G

G (ϕwL)β(σ−1) + 1−G+

G

1−G

+1

σ − 1

1−

((ϕwL)β(σ−1) − 1

)G

G (ϕwL)β(σ−1) + 1−G

N

Den.

Therefore wL increases.

Log-differentiating (64), we get:

wH =G(

(ϕwL)β(σ−1) − 1)

G (ϕwL)β(σ−1) + 1−GG+

G

1−GG+

(1 +

β (σ − 1)G (ϕwL)β(σ−1)

G (ϕwL)β(σ−1) + 1−G

)wL.

Therefore, for an increase in the mass of non-automated products, one gets:

wNTH =1

σ − 1

1−

((ϕwL)β(σ−1) − 1

)G

G (ϕwL)β(σ−1) + 1−G

N

Den,

which ensures that wH increases with the mass of non-automated products. Finally,

wNTH − wNTL = − (1− β)G (ϕwL)β(σ−1)(G (ϕwL)β(σ−1) + 1−G

)(1−G)

N

Den.

Therefore an increase in the mass of non-automated products decreases the skill premium

in case iii).

Overall we get that an increase in the mass of non-automated products weakly in-

62

creases wL, increases wH and decreases wH/wL if N is large enough but G 6= 1 (so that

we are in case iii)).

7.2 Proofs of the asymptotic results

7.2.1 Proof of Proposition 2

Case with G∞ > 0 (Parts A and B). To see that wLt is bounded from below, assume

that liminfwLt=0. Then using that HPt and Gt admit positive limits, (10) implies that

liminfwHt = 0. Plugging this further in (11) gives liminfNt = 0, which is impossible

since gNt admits a positive limit. Therefore, wLt must be bounded below, so that (11)

gives gwH∞ = ψgN∞. Further, using that HPt admits a limit and (8) gives the growth rate

of Yt. We now derive the asymptotic growth rate of wLt. To do so we consider in turn

the case where ε <∞, and the case where ε =∞.

Subcase with ε < ∞. We use equation (59) which gives wLt as a function of

Nt, Gt and HPt . Note that assuming that wLt is bounded above leads to a contradiction,

therefore limwLt =∞.

Assume first that G∞ < 1, then, since limwLt =∞, (59) implies

wLt ∼

((σ − 1

σβ

) 11−β

(1−G∞)HP∞L

(G∞ϕµ)ψ−1

) 11+β(σ−1)

Nψ

1+β(σ−1)

t ,

where for xt and yt (possibly with no limits), xt ∼ yt signifies xt/yt → 1. This delivers

Part A).

Consider now the case where G∞ = 1. Note that (59) gives:

wLt ∼

((σ − 1

σβ

) 11−β HP

∞Lϕµ(ψ−1)

) 1ε

Nψεt

(ϕµ−1 + (1−Gt)w

(ε−1)(1−µ)Lt

) 1ε. (66)

Following the assumption of Part B in Proposition 2, we assume that lim (1−Gt)Nψε

(ε−1)(1−µ)t

exists and is finite. Suppose first that lim sup (1−Gt)w(ε−1)(1−µ)Lt =∞, then there must

exist a sequence of t’s, denoted tn for which:

wLtn ∼

((σ − 1

σβ

) 11−β HP

∞L

(ϕµ)ψ−1

) 11+β(σ−1) (

(1−Gtn)Nψtn

) 11+β(σ−1)

.

63

Yet, this implies

(1−Gtn)w(ε−1)(1−µ)Ltn

∼

((σ−1σβ) 1

1−β HP∞L

(ϕµ)ψ−1) (ε−1)(1−µ)

1+β(σ−1)

×(

(1−Gtn)Nψ(ε−1)(1−µ)

εtn

) ε1+β(σ−1)

,

the left-hand side is assumed to be unbounded, while the right-hand side is bounded:

there is a contradiction. Therefore, lim sup (1−Gt)w(ε−1)(1−µ)Lt <∞.

Consider now the possibility that lim (1−Gt)w(ε−1)(1−µ)Lt = 0, then (66) implies

wLt ∼

((σ − 1

σβ

) 11−β HP

∞Lϕµψ−1

) 1ε

Nψεt .

Therefore we get that gwL∞ = ψεgN∞ = 1

εgY∞.

31

Alternatively, lim sup (1−Gt)w(ε−1)(1−µ)Lt is finite but strictly positive (given by λ1).

In this case, there exists a sequence of t′s, denoted tm such that

wLtm ∼

((σ − 1

σβ

) 11−β HP

∞Lϕµ(ψ−1)

(ϕµ−1 + λ1

)) 1ε

Nψεtm . (67)

This leads to

λ1 ∼

((σ − 1

σβ

) 11−β HP

∞Lϕµ(ψ−1)

(ϕµ−1 + λ1

)) (ε−1)(1−µ)ε

(1−Gtm)Nψε

(ε−1)(1−µ)tm ,

which is only possible if lim (1−Gt)Nψε

(ε−1)(1−µ)t > 0. We denote such a limit by λ.

Then (66) leads to

(wεLtN

−ψt

)∼(σ − 1

σβ

) 11−β HP

∞Lϕµ(ψ−1)

(ϕµ−1 + λ

(N−ψt wεLt

) (ε−1)(1−µ)ε

),

which defines uniquely the limit of wεLtN−ψt . We then obtain that gwL∞ = ψ

εgN∞. This

completes the poof of part B).

Subcase with ε = ∞. Low skill wages are now defined as described in Appendix

31Expressions regarding the asymptotic growth rates (here and below) assume existence of the limitsbut expressions on equivalence (∼) or orders of magnitude (O) do not.

64

7.1.2. First consider the case where G∞ < 1, then Part A) immediately follows. Assume

now that G∞ = 1 and that lim (1−Gt)Nψt exists and is finite. Note first that (62)

implies that wLt must be bounded weakly above ϕ in the long-run. As a result, (63)

leads to

wLt ∼

((σ − 1

σβϕβ(1−ψ−1)

) 11−β HP

∞L

) 11+β(σ−1) (

(1−Gt)Nψt

) 11+β(σ−1)

if wLt > ϕ.

Since lim (1−Gt)Nψt exists and is finite, wLt also admits a finite limit. In particular, if

lim (1−Gt)Nψt = 0, then wL∞ = ϕ.

Case where G∞ = 0 (Part C). If limGt = 0 then (59) implies that for ε <∞:

wLt ∼σ − 1

σβ

(HP∞L

)(1−β)

N1

σ−1

t

(Gt

(1 + ϕwε−1

Lt

)µ+ 1) 1σ−1−(1−β)

.

This expression directly implies that limwLt = ∞ (otherwise there is a subsequence

where the left-hand side is bounded while the right-hand side is unbounded). Therefore

we actually get:

wLt ∼σ − 1

σβ

(HP∞L

)(1−β)

N1

σ−1

t

(Gtw

β(σ−1)Lt ϕµ + 1

) 1σ−1−(1−β)

. (68)

Note that if ε = ∞, then we must be in case iii) when G∞ = 0 and (63) also directly

implies (68) (as ϕµ = ϕβ(σ−1) in that case).

Assume that limt→∞

GtNβt = λ exists and is finite. Then (68) implies:

wLtN− 1σ−1

t ∼ σ − 1

σβ

(HP∞L

)(1−β)(λ

(wLtN

− 1σ−1

t

)β(σ−1)

ϕµ + 1

) 1σ−1−(1−β)

,

which implies that limt→∞

wLtN− 1σ−1

t exists and is finite as well. Therefore one gets that

gwL∞ = gN∞/(σ − 1). Using (58) then immediately implies (17).

7.2.2 Sufficient conditions for Part A of Proposition 2.

We prove the following Lemma:

Lemma 1. Consider processes [Nt]∞t=0, [Gt]

∞t=0 and [HP

t ]∞t=0 , such that gNt and HPt admit

65

strictly positive limits. If i) the probability that a new product starts out non-automated

is bounded below away from zero and ii) the intensity at which non-automated firms are

automated is bounded above and below away from zero, then any limit of Gt must have

0 < G∞ < 1.

Note that GtNt is the mass of automated firms and let ν1,t > 0 be the intensity at

which non-automated firms are automated at time t and 0 ≤ ν2,t < 1 be the fraction

of new products introduced at time t that are initially automated. Then ˙(GtNt) =

ν1,t(1 − Gt)Nt + ν2,tNt such that Gt = ν1,t(1 − Gt) − (Gt − ν2,t)gNt . First assume that

G∞ = 1, then if ν1,t < ν1 < ∞ and ν2,t < ν2 < 1, we get that Gt must be negative for

sufficiently large t, which contradicts the assumption that G∞ = 1. Similarly if G∞ = 0,

then having ν1,t > ν for all t, gives that Gt must be positive for sufficiently large t, which

also implies a contradiction. Hence a limit must have 0 < G∞ < 1.

7.3 Proof of Proposition 5

We denote by g (i, t) the share of products indexed by i which are automated by time t.

The Poisson process for automation implies that

g (i, t) = 1− exp

(−η(t− i

n

))for t ≥ i

n, (69)

since a product i must be born at time t = i/n (and it is born non-automated).

The unit cost function of intermediate i can be written as:

ci (wL, wH , α (i)) =

(min

(wL

b (i)1−ς ,1

α (i)

))βw1−βH

b (i)1−β(1−ς) ββ (1− β)1−β .

Therefore an automated firm (α (i) = 1) will use machines instead of low-skill labor if

wL > b (i)1−α. This implies that two cases must be considered, if wLt > exp (B (1− ς)nt),then all automated firms will use machines. Otherwise, there exists a It ∈ [0, Nt) such

that wLt = exp (B (1− ς) It) and automated firms with an index i < It use machines

while those with an index i > It use low-skill workers instead. We fix It = Nt if

wLt ≥ exp(B (1− ς)nt.The resolution of the system then follows that in the baseline case, firms charge a

mark-up σ/ (σ − 1) and revenues of firm i are given by

Ri (wL, wH , α (i)) = ((σ − 1) /σ)σ−1 ci (wL, wH , α (i))1−σ Y.

66

A share (1− β) (σ − 1) /σ of firms’ revenues accrue to high-skill workers, while low-skill

workers obtain a share β (σ − 1) /σ if the firm is non-automated or has an index i > It

and 0 otherwise. Therefore

wHtH = (1− β)σ − 1

σ

∫ Nt

0

(1− g (i, t))Ri (wLt, wHt, 0) + g (i, t)Ri (wLt, wHt, 1) di,

wLtL = βσ − 1

σ

∫ Nt

0

(1− g (i, t))Ri (wL, wH , 0) + 1i>Itg (i, t)Ri (wLt, wHt, 1) di,

where 1i>It denotes the index function for i > It. Taking the ratio between these two

expressions, we obtain:

wHtH

wLtL=

1− ββ

[1 + w

β(σ−1)Lt

∫ Nt0

1i<Itg (i, t) b (i)(1−β(1−ς))(σ−1) di∫ Nt0

(1− 1i<Itg (i, t)) b (i)(σ−1) di

], (70)

which traces the relative demand curve (and replaces (10)).

Similarly, using the price normalization, we obtain:

σ

σ − 1

w1−βHt

ββ (1− β)1−β

(∫ Nt

0

((1− 1i<Ig (i, t))w

β(1−σ)Lt b (i)(σ−1)

+1i<Ig (i, t) b (i)(1−β(1−ς))(σ−1)

)di

) 11−σ

= 1, (71)

which replaces (11). We must then consider two cases in turn: It = Nt and It < Nt:

Case 1: It = Nt. Using (69) and the expression for b (i) we can compute the integral

in (70) and obtain:

wHtH

wLtL=

1− ββ

1 + wβ(σ−1)Lt

exp(B(1−β(1−ς))(σ−1)nt)−1B(1−β(1−ς))(σ−1)

− exp(B(1−β(1−ς))(σ−1)nt)−exp(−ηt)(B(1−β(1−ς))(σ−1)+ η

n)exp(B(σ−1)nt)−exp(−ηt)

(B(σ−1)+ ηn)

.For t large enough (and since (1− β (1− ς)) (σ − 1) > 0), we get:

wHtH

wLtL∼t→∞

1− ββ

[1 +

(B (σ − 1) + η

n

)ηn

(wL exp (−B (1− ς)nt))β(σ−1)

(B (1− β (1− ς)) (σ − 1))((B (1− β (1− ς)) (σ − 1) + η

n

))] .(72)

67

Similarly (71) gives

σ

σ − 1

w1−βHt

ββ (1− β)1−β

=

wβ(1−σ)Lt

exp(B(σ−1)nt)−exp(−ηt)B(σ−1)+ η

n+ exp(B(1−β(1−ς))(σ−1)nt)−1

B(1−β(1−ς))(σ−1)

− exp(B(1−β(1−ς))(σ−1)nt)−exp(−ηt)(B(1−β(1−ς))(σ−1)+ η

n)

1σ−1

.

From this, we obtain:

wHt ∼t→∞

exp

(B (1− β (1− ς))nt

1− β

)(1− β)

(σ − 1

σββ) 1

1−β

(73)(

exp(B(1−ς)nt)wLt

)β(σ−1)

B (σ − 1) + ηn

+ηn

B (1− β (1− ς)) (σ − 1)(B (1− β (1− ς)) (σ − 1) + η

n

)ψ

.

with ψ ≡ 1/ [(σ − 1) (1− β)] as before.

Since It = Nt, then we must have wLt ≥ exp (B (1− ς)nt). Therefore (73) implies

that

wHt = O

(exp

(B (1− β (1− ς))n

1− βt

)).

Further (72) implies that

wHtwLt

= O(

(wL exp (−B (1− ς)nt))β(σ−1)),

from which we get that

wLt = O

(exp

((1− β (1− ς) + (1− β) β (σ − 1) (1− ς)

(1 + β (σ − 1)) (1− β)

)Bnt

)).

We need to verify that wLt ≥ exp (B (1− ς)nt). Note that(1− β (1− ς) + (1− β) β (σ − 1) (1− ς)

(1 + β (σ − 1)) (1− β)

)Bn ≥ B (1− ς)n⇐⇒

ς ≥ 0.

Therefore, if ς > 0, then wLt > exp (B (1− ς)nt) is verified for sure for large t, and we

get that gwH∞ and gwL∞ exist with gwH∞ = (1−β(1−ς))Bn1−β and gwL∞ = 1−β(1−ς)+(1−β)β(σ−1)(1−ς)

(1+β(σ−1))(1−β)Bn

68

and we can verify that gwH∞ > gwL∞ .

In contrast if ς = 0, then we have gwH∞ = gwL∞ = Bn but this case only applies if there

exist constant wH and wL such that

wH = (1− β)

(σ − 1

σββ) 1

1−β(

wβ(1−σ)L

B (σ − 1) + ηn

+ηn

B (1− β) (σ − 1)(B (1− β) (σ − 1) + η

n

))ψ

,

(74)

wHH

wLL=

1− ββ

[1 + w

β(σ−1)L

(B (σ − 1) + η

n

)ηn

B (1− β) (σ − 1)((B (1− β) (σ − 1) + η

n

))] , (75)

with wL ≥ 1, in which case wHt ∼t→∞

wH exp (Bnt) and wLt ∼t→∞

wL exp (Bnt) and

It = Nt is a possibility.

Case 2: It < Nt. Then (70) implies

wHtH

wLtL=

1− ββ

1 + wβ(σ−1)Lt

exp(B(1−β(1−ς))(σ−1)It)−1B(1−β(1−ς))(σ−1)

− exp (−ηt) exp((B(1−β(1−ς))(σ−1)+ ηn)It)−1

(B(1−β(1−ς))(σ−1)+ ηn)

exp(B(σ−1)nt)−exp(B(σ−1)It)B(σ−1)

+ exp (−ηt) exp((B(σ−1)+ ηn)It)−1

(B(σ−1)+ ηn)

.Note that we must have lim It =∞, otherwise, there would be periods where wLt remain

bounded even though an arbitrarily large number of products use low-skill workers.

Therefore, using wLt = exp ((1− ς)BIt), the previous equation leads to:

wHtH

wLtL∼t→∞

1− ββ

1 +

1B(1−β(1−ς))(σ−1)

− exp(− ηn

(nt− It))

1B(1−β(1−ς))(σ−1)+ η

n

exp(B(σ−1)(nt−It))−1B(σ−1)

+ exp(− ηn

(nt− It))

1B(σ−1)+ η

n

,Since nt ≥ It, then we must have that wHt = O (wLt) = O (exp ((1− ς)BIt)). Similarly

(71) now implies:

σw1−βHt

(σ − 1) ββ (1− β)1−β =

wβ(1−σ)Lt

exp(B(σ−1)nt)−exp(B(σ−1)It)B(σ−1)

+ wβ(1−σ)Lt exp (−ηt) exp((B(σ−1)+ η

n)It)−1

B(σ−1)+ ηn

+ exp(B(1−β(1−ς))(σ−1)It)−1B(1−β(1−ς))(σ−1)

− exp (−ηt) exp((B(1−β(1−ς))(σ−1)+ ηn)It)−1

(B(1−β(1−ς))(σ−1)+ ηn)

1

σ−1

.

Using that It →∞ and that wLt = exp ((1− ς)BIt), we then get

σw1−βHt

(σ − 1) ββ (1− β)1−β ∼t→∞ exp ((1− (1− ς) β)BIt)

exp(B(σ−1)(nt−It))−1B(σ−1)

+exp(− ηn (nt−It))B(σ−1)+ η

n

+ 1B(1−β(1−ς))(σ−1)

− exp(− ηn (nt−It))B(1−β(1−ς))(σ−1)+ η

n

1

σ−1

.

69

The left-hand side is of order exp ((1− ς − β (1− ς))BIt) while the the right-hand side

is of order at least exp ((1− (1− ς) β)BIt), therefore this situation is only possible if

ς = 0. Moreover, if nt − It is unbounded then the ratio of the right-hand side to the

left-hand side is also unbounded, which is a contradiction. Therefore it must be that

nt− It remains bounded. In that case, we must then have that wLt and wHt are of the

same order as exp (BIt).

We then obtain an equilibrium with It < Nt if there exist wH , wL < 1 (which implies

limnt− It = − logwLB

) such that:

wHH

wLL=

1− ββ

1 +

1B(1−β)(σ−1)

− wηBnL

1B(1−β)(σ−1)+ η

n

w(1−σ)L −1

B(σ−1)+ w

ηBnL

1B(σ−1)+ η

n

, (76)

wH = (1− β)

(σ − 1

σββ) 1

1−β

wL

w(1−σ)L −1

B(σ−1)− w

ηBnL Bβ(σ−1)

(B(σ−1)+ ηn)(B(1−β)(σ−1)+ η

n)

+ 1B(1−β)(σ−1)

ψ

. (77)

(76) can be rewritten as wH = 1−ββ

LHs1 (wL) with

s1 (wL) ≡ wL

1 +

1B(1−β)(σ−1)

− wηBnL

1B(1−β)(σ−1)+ η

n

w(1−σ)L −1

B(σ−1)+ w

ηBnL

1B(σ−1)+ η

n

.We obtain (after a lot of algebra):

s′1 (wL) =

(ηn( η

Bn+1)

(B(σ−1)+ ηn)(B(1−β)(σ−1)+ η

n)+ 1

B(1−β)(σ−1)+

β(σ−1)1−β +1+ β

B(1−β)ηn

B(σ−1)+ ηn

)w

ηBnL

w1−σL −1

B(σ−1)

+

(w1−σL −1

B(σ−1)+

1−wηBnL

B(1−β)(σ−1)

)w1−σL −1

B(σ−1)+

1−wηBnL

B(1−β)(σ−1)

w1−σL

B+ 1+β(σ−1)

B(1−β)(σ−1)

wηBnL

B(σ−1)+ ηn

+ηnw

ηBnL

(B(σ−1)+ ηn)(B(1−β)(σ−1)+ η

n)

(w1−σL

B+

wηBnL

B(σ−1)+ ηn

)

(w1−σL −1

B(σ−1)+

wηBnL

B(σ−1)+ ηn

)2 ,

which is positive when wL ≤ 1 (as then w1−σL − 1 ≥ 0 and 1−w

ηBnL ≥ 0), so that s1 is in-

creasing in wL for wL ≤ 1. Similarly (77) can be rewritten as wH = (1− β)(σ−1σββ) 1

1−β s2 (wL)ψ,

70

with

s2 (wL) ≡ w(σ−1)(1−β)L

(w

(1−σ)L − 1

B (σ − 1)− w

ηBnL Bβ (σ − 1)(

B (σ − 1) + ηn

) (B (1− β) (σ − 1) + η

n

) +1

B (1− β) (σ − 1)

).

We get

s′2 (wL) = −w(σ−1)(1−β)−1L

[(B (σ − 1) β

B (σ − 1) + ηn

)w

ηBnL

B+ β

w1−σL − 1

B

],

which is negative for wL ≤ 1. Therefore (75) and (76) together trace a positive relation-

ship between wH and wL (it is easy to show continuity between the two expressions) and

similarly (74) and (77) trace a positive relationship (continuity is also easily verified),

checking the limits when wL → 0,∞, we obtain that there exist a single solution. This

ensures that asymptotically when ς = 0, the asymptotic equilibrium described above

exists, and depending on other parameters it is in case 1 or case 2.

7.4 Alternative production technology for machines

The assumption of identical production technologies for consumption and machines im-

poses a constant real price of machines once they are introduced. As shown in Nordhaus

(2007) the price of computing power has dropped dramatically over the past 50 years

and the declining real price of computers/capital is central to the theories of Autor and

Dorn (2013) and Karabarbounis and Neiman (2013). As explained in section 2, it is

possible to interpret automation as a decline of the price of a specific equipment from

infinity (the machine does not exist) to 1. Yet, our assumption that once a machine is

invented, its price is constant, is crucial for deriving the general conditions under which

the real wages of low-skill workers must increase asymptotically in Proposition 2. We

generalize this in what follows.

Let there be two final good sectors, both perfectly competitive employing CES pro-

duction technology with identical elasticity of substitution, σ. The output of sector 1,

Y , is used for consumption. The output of sector 2, X, is used for machines. The two

final good sectors use distinct versions of the same set of intermediate inputs, where we

denote the use of intermediate inputs as y1(i) and y2(i), respectively, with i ∈ [0, N ].

The two versions of intermediate input i are produced by the same intermediate input

71

supplier using production technologies that differ only in the weight on high-skill labor:

yk(i) =[lk(i)

ε−1ε + α(i)(ϕxk(i))

ε−1ε

] εβkε−1

hk(i)1−βk ,

where a subscript, k = 1, 2, refers to the sector where the input is used. Importantly, we

assume β2 ≥ β1, such that the production of machines relies more heavily on machines

as inputs than the production of the consumption good. Continuing to normalize the

price of final good Y to 1, such that the real price of machines is pxt , and allowing

for the natural extensions of market clearing conditions, we derive below the following

generalization of Proposition 2 (where ψk = (σ − 1)−1(1− βk)−1).

Proposition 6. Consider three processes [Nt]∞t=0, [Gt]

∞t=0 and [HP

t ]∞t=0 where (Nt, Gt, HPt ) ∈

(0,∞)× [0, 1]× (0, H] for all t. Assume that Gt, gNt and HP

t all admit strictly positive

limits, then:

gpx

∞ = −ψ2 (β2 − β1) gN∞

gGDP∞ =

[ψ1 + ψ1

β1 (β2 − β1)

1− β2

]gN∞, (78)

and if G∞ < 1 then the asymptotic growth rate of wLt is32

gwL∞ =1

1 + β1(σ − 1)

1− β2 + β1 (β2 − β1)(1− ψ−1

1

)1− β2 + β1 (β2 − β1)

gGDP∞ . (79)

Proposition 6 naturally reduces to Proposition 2 for the special case of β2 = β1.

When β2 > β1, the productivity of machine production increases faster than that of the

production of Y , implying a gradual decline in the real price of machines. For given

gN∞, a faster growth in the supply of machines increases the (positive) growth in the

relative price of low-skill workers compared with machines, wL/px, but simultaneously,

it reduces the real price of machines, px. The combination of these two effects always

implies that low-skill workers capture a lower fraction of the growth in Y . Low-skill wages

are more likely to fall asymptotically for higher values of the elasticity of substitution

between products, σ, as this implies a more rapid substitution away from non-automated

products.

32If Gt tends towards 1 sufficiently fast such that limt→∞(1−Gt)Nψ2(1−µ1)

ε−1ε

t is finite, then gwL∞ =1ε

(1− (β2−β1)(ε−1)

(1−β2+β1)

)gGDP∞ ≥ gp

x

∞ whether ε is finite or not. It is clear that there always exists an ε

sufficiently high for the real wage of low-skill workers to decline asymptotically.

72

Proof. The analysis follows similar steps as in the baseline model. The cost function (4)

now becomes

ck (α (i)) = β−βκk (1− βk)−(1−βk) (w1−εL + ϕ (px)1−ε α (i)

) βk1−ε w1−βk

H , (80)

for k ∈ 1, 2 indexing, respectively, the production of final good and machines. As

before aggregating (80) and the price normalization gives a “productivity” condition,

which replaces (11).

(G(w1−εL + ϕ (px)1−ε)µ1 + (1−G)w

β1(1−σ)L

) 11−σ

w1−β1H =

σ − 1

σββ11 (1− β1)1−β1N

1σ−1 ,

(81)

where we generalize the definition of µ: µk ≡ βk(σ−1)ε−1

. Following the same methodology

for the production of machines, we get

(G(w1−εL + ϕ (px)1−ε)µ2 + (1−G)w

β2(1−σ)L

) 11−σ

w1−β2H =

σ − 1

σββ22 (1− β2)1−β2N

1σ−1px.

(82)

Taking the ratio between these two expressions, we get

(G

((wLpx

)1−ε+ ϕ

)µ2+ (1−G)

(wLpx

)β2(1−σ)) 1

1−σ

wβ1−β2H(G

((wLpx

)1−ε+ ϕ

)µ1+ (1−G)

(wLpx

)β1(1−σ)) 1

1−σ=ββ22 (1− β2)1−β2 (px)1−β2+β1

ββ11 (1− β1)1−β1.

(83)

The share of revenues accruing to machines in the production of intermediate input i for

the usage-k (i.e for use in the final sector or the machines sector) is given by

νk,x (α (i)) =σ − 1

σα (i) βk

ϕ (px)1−ε

w1−εL + ϕ (px)1−ε , (84)

aggregating over all intermediates inputs and denoting Rk (α (i)) the revenues generated

through usage k by a firm of type α (i), we get that the total expenses in machines are

given by

pxX = NG (R1(1)ν1,x(1) +R2(1)ν2,x(1)) . (85)

The zero profit condition in the machines sector gives

pxX = N (GR2 (1) + (1−G)R2 (0)) . (86)

73

Revenues themselves are given by

R1 (α (i)) =

(σ − 1

σ

)σ−1

c1 (α (i))1−σ Y and R2 (α (i)) =

(σ − 1

σ

)σ−1

c2 (α (i))1−σ pxX,

(87)

so that (7) still holds but separately for revenues occurring from each activity and with

µk replacing µ. Combining (7), (84), (85) and (86), we get(G(

1− σ−1σβ2

ϕ(px)1−ε

w1−εL +ϕ(px)1−ε

)+ (1−G)

(1 + ϕ

(wLpx

)ε−1)−µ2)

R2(1)R1(1)

= Gσ−1σβ1

ϕ(px)1−ε

w1−εL +ϕ(px)1−ε

, (88)

which determines the revenues ratio as a function of input prices solely.

To derive low-skill wages, we compute the share of revenues accruing to low-skill

labor in the production of intermediate input i for the usage-k as:

νk,l (α (i)) =σ − 1

σβk

(1 + α (i)ϕ

(wLpx

)ε−1)−1

,

so that total low-skill income can be written as:

wLL = N (GR1(1)ν1,l(1) + (1−G)R1(0)ν1,l(0) +GR2(1)ν2,l(1) + (1−G)R2(0)ν2,l(0)) .

(89)

The share of revenues going to high-skill workers is given by νk,h = σ−1σ

(1− βk) both in

automated and non-automated firms. As a result

wHHP = N (ν1,h (GR1(1) + (1−G)R1(0)) + ν2,h (GR2(1) + (1−G)R2(0))) , (90)

74

Take the ratio between (89) and (90), and use (7) to obtain:

wLL

wHHP=

β1

(G

(1 + ϕ

(wLpx

)ε−1)−1

+ (1−G)

(1 + ϕ

(wLpx

)ε−1)−µ1)

+β2R2(1)R1(1)

(G

(1 + ϕ

(wLpx

)ε−1)−1

+ (1−G)

(1 + ϕ

(wLpx

)ε−1)−µ2)

(1− β1)

(G+ (1−G)

(1 + ϕ

(wLpx

)ε−1)−µ1)

+ (1− β2) R2(1)R1(1)

(G+ (1−G)

(1 + ϕ

(wLpx

)ε−1)−µ2)

(91)

Together (81), (83), (88) and (91) determine wL, wH , px and R (2) /R (1) given N,G and

HP .

Asymptotic behavior for ε < 1. As the supply of machines is going up and there

is imperfect substitutability in production between machines and low-skill labor, any

equilibrium must feature wL∞/px∞ =∞ even if wL∞ <∞. Applying this to (83), we get

(pxt )1−β2+β1 ∼ ββ11 (1− β1)1−β1

ββ22 (1− β2)1−β2ϕµ2−µ11−σ wβ1−β2Ht . (92)

Further plugging this last relationship in (81), we get:

wHt ∼(σ−1σ

)1+β1

1−β2 ϕψ2µ1ββ11 (1− β1)1−β1

(β

β21−β22 (1− β2)

)β1Gψ1

(1+

β1(β2−β1)(1−β2)

)t N

ψ1

(1+

β1(β2−β1)(1−β2)

)t

, (93)

Hence

gwH∞ = ψ1

(1 +

β1 (β2 − β1)

(1− β2)

)gN∞. (94)

Through (88), the revenues of the machines sector and the final good sector are of the

same order, which implies that Y , pxX and wH grow at the same rate. Therefore

gGDP∞ = gY∞ = gwH∞ = ψ1

(1 +

β1 (β2 − β1)

(1− β2)

)gN∞.

In fact (88) givesR2,t (1)

R1,t(1)∼

σ−1σβ1

1− σ−1σβ2

. (95)

75

Using (92) and (93), one further gets:

pxt ∼ββ11 (1− β1)1−β1(

ββ22 (1− β2)1−β2) 1−β1

1−β2

ϕψ2µ1

(β1−β2)β1

(σ

σ − 1

)β2−β11−β2

G−ψ2(β2−β1)t N

−ψ2(β2−β1)t ,

therefore

gpx∞ = −ψ2 (β2 − β1) gN∞ < 0, (96)

since β2 > β1. Using that wL∞/px∞ =∞ and (95) in (91) leads to:

wLt

(wLtpxt

)ε−1

∼

wHtHPt

β1

(Gt + (1−Gt)

(ϕ(wLtpxt

)ε−1)1−µ1

)

+β2

σ−1σβ1

1−σ−1σβ2

(Gt + (1−Gt)

(ϕ(wLtpxt

)ε−1)1−µ2

)

ϕGtL(

1− β1 + (1− β2)σ−1σβ1

1−σ−1σβ2

) . (97)

Since β2 > β1, then (1−Gt)(wLtpxt

)(ε−1)(1−µ1)

dominates (1−Gt)(wLtpxt

)(ε−1)(1−µ2)

asymp-

totically regardless of the value ofG∞ (in other words, we can always ignore (1−Gt)(wLtpxt

)(ε−1)(1−µ2)

in our analysis).

The reasoning then follows that of Appendix 7.2.1. If G∞ < 1, then (97) implies

w1+β1(σ−1)Lt ∼ (pxt )

(σ−1)β1 wHtHPβ1 (1−Gt)

ϕµ1GtL(

1− β1 + (1− β2)σ−1σβ1

1−σ−1σβ2

) , (98)

which, together with (94) and (96) gives (79).

Alternatively assume that G∞ = 1 and that lim (1−Gt)Nψ2(1−µ1) ε−1

εt exists and is

finite. Suppose first that lim sup (1−Gt)(wLtpxt

)(ε−1)(1−µ1)

= ∞, then there must be a

sub-sequence where (98) is satisfied, which with (94) and (96) leads to a contradiction

with the assumption that lim (1−Gt)Nψ2(1−µ) ε−1

εt exists and is finite.

If lim (1−G)(wLtpx

)(ε−1)(1−µ1)

= 0, then (97) gives

wεLt ∼(pxt )

ε−1wHtHPt

(β1 + β2

σ−1σβ1

1−σ−1σβ2

)ϕL(

1− β1 + (1− β2)σ−1σβ1

1−σ−1σβ2

) ,

76

which implies with (94) and (96) that:

gwL∞ =1

ε

(1− (β2 − β1) (ε− 1)

(1− β2 + β1)

)gGDP∞ . (99)

Finally, if lim sup (1−Gt)w(ε−1)(1−µ)Lt is finite but strictly positive, then as in Ap-

pendix 7.2.1, one can show that this requires that lim (1−Gt)Nψ2ε

(ε−1)(1−µ1)t > 0, from

which we can derive that (99) also holds in that case. This proves Proposition 6 and the

associated footnote in the imperfect substitutes case.

Perfect substitutes case. In the perfect substitutes case, (81) becomes:

(Gϕβ1(σ−1) (px)β1(1−σ) + (1−G)w

β1(1−σ)L

) 11−σ

w1−β1H

= σ−1σββ11 (1− β1)1−β1N

1σ−1 for wL > px/ϕ

, (100)

wβ1L w1−β1H =

σ − 1

σββ11 (1− β1)1−β1N

1σ−1 for wL < px; (101)

(83) becomes

(Gϕβ2(σ−1) (px)β2(1−σ) + (1−G)w

β2(1−σ)L

) 11−σ

wβ1−β2H(Gϕβ1(σ−1) (px)β1(1−σ) + (1−G)w

β1(1−σ)L

) 11−σ

=ββ22 (1− β2)1−β2px

ββ11 (1− β1)1−β1for wL > px/ϕ,

(102)

px =ββ11 (1− β1)1−β1

ββ22 (1− β2)1−β2wβ2−β1L wβ1−β2H for wL < px/ϕ; (103)

(88) becomes(G

(1− σ − 1

σβ2

)+ (1−G) ϕβ2(1−σ)

(wLpx

)β2(1−σ))R2 (1)

R1(1)= G

σ − 1

σβ1 for wL > px/ϕ,

(104)

with R2 (1) = 0 for wL < px/ϕ; and (91) becomes

wLL

wHHP= (1−G)

β1

(ϕwLpx

)β1(1−σ)

+ β2R2(1)R1(1)

(ϕwLpx

)β2(1−σ)(1− β1)

(G+ (1−G)

(ϕwLpx

)β1(1−σ))

+ (1− β2) R2(1)R1(1)

(G+ (1−G)

(ϕwLpx

)β2(1−σ))

for wL > px/ϕ,

(105)

77

wLL

wHHP=

β1

1− β1

for wL < px/ϕ. (106)

Together (101), (103) and (106) show that we must have wLt ≥ pxtϕ

for t large enough,

which delivers (94) and (96).

Assume that G∞ < 1, then (105) gives (98) from which we get that (79) is satisfied.

Now consider the case where G∞ = 1 and lim (1−Gt)Nψ2t exists and is finite. Then

(105) and (104) imply

wLt ∼ (1−Gt)wHt

(ϕwLtpxt

)β1(1−σ)

(β1 + β2

σ−1σβ1

1−σ−1σβ2

(ϕwLt

pxt

)−(β2−β1)(σ−1))

1− β1 + (1− β2)σ−1σβ1

1−σ−1σβ2

HPt

Lfor wL > px/ϕ.

We can then derive that ϕwLtpxt

must have a finite (and positive) limit, so that

gwL∞ = gpx

∞ = − β2 − β1

1− β2 + β1

gGDP∞ .

This proves Proposition 6 and its associated footnote in the perfect substitutes case.

7.5 Proofs and analytical results for the baseline dynamic model

7.5.1 Proof of Proposition 3

We look for a steady state with positive long-run growth for the system defined by

(31), (32), (39) and (40) and we denote such a (potential) steady state n∗, G∗, hA∗, χ∗

(more generally we denote all variables at steady state with a ∗).33 Following (31), we

immediately get that n∗ = 0. Using (32), we get that G∗ obeys:

G∗ =η (G∗)κ

(hA∗)κ

η (G∗)κ(hA∗)κ

+ gN∗. (107)

We focus on a solution with G∗ > 0 (when κ > 0, G∗ = 0 is also a solution), this

equation implies that with(gN)∗> 0, G∗ < 1 . Then, recalling that µ ∈ (0, 1), (49),

implies that:

33Note that if gNt = 0, the economy does not obey this system of equations but that it is also impossibleto achieve positive long-run growth, as production is bounded by the production of an economy whichhas Gt = 1.

78

ω∗ =[ (

σ−1σβ) 1

1−β HP∗

L(1−G∗) (G∗ϕµ)ψ−1

] β(1−σ)1+β(σ−1)

.

Using (45), (40) implies that in steady state,

hA∗ =κ

γ (1− κ)

(ρ+ ((θ − 1)ψ + 1) gN∗

)(108)

which uniquely defines defines hA∗ as a linear and increasing function of gN∗ (recall that

θ ≥ 1). Note that if gN∗ > 0, then hA∗ > 0.

Then, for G∗ > 0, (107) combined with (108), defines G∗ uniquely as an increasing

function of gN∗. (48) also uniquely defines HP∗ as a function of gN∗:

HP∗ = H − gN∗

γ− (1−G∗) hA∗. (109)

(45) and (107) allows to rewrite (39) in steady state as:

ηκ (G∗)κ−1(hA∗)κ

1− κψHP∗ =

γ

κ

(hA∗)2

+ ηGκt

(hA∗)κ+1

. (110)

Since G∗, hA∗ and HP∗ are functions of gN∗, one can rewrite (110) as an equation

determining gN∗. A steady state with positive growth-rate is a solution to

f(gN∗)≡ 1− κ

κ

γG∗hA∗

ψHP∗

(1

κη (G∗)κ

(hA∗)1−κ

+1

γ

)= 1, (111)

with gN∗ > 0. Indeed, (47) simply determines χ∗ as:

χ∗ =

(σ

σ − 1

) 11−β (1−θβ)

(1− β σ−1

σ

)θ (HP∗)θ

(1− β) ββ

1−β (1−θ) (G∗ϕµ)ψ(1−θ), (112)

which achieves the characterization of a steady state for the system of differential equa-

tions defined by (31), (32), (39) and (40).

In order to establish the sufficiency of equation (25) for positive growth. Note that

as gN∗ → 0, then equations (108), (107) and (109) imply that

f (0) =ρ

ψH

(1

ηκκ (1− κ)1−κ

(ρ

γ

)1−κ

+1

γ

).

79

In addition, gN∗

γ+(1−G∗) hA∗ is always greater than gN∗

γ, therefore for a sufficiently large

gN∗ (smaller than γH), HP∗ is arbitrarily small, while for the same value G∗ and hA∗

are bounded below and above. This establishes that for gN∗ large enough, f(gN∗)> 1.

Therefore a sufficient condition for the existence of at least one steady state with positive

growth and positive G∗ is that f (0) < 1 (such that f(gN∗)

= 1 has a solution), which

is equivalent to condition (25).

7.5.2 Uniqueness of the steady state

Generally the steady state is not unique. Nonetheless, consider the special case in which

κ = 0. Then f can be rewritten as

f(gN∗)

=1− κκ

γG∗hA∗

ψHP∗

(1

κη

(hA∗)1−κ

+1

γ

), (113)

note that HP∗ is decreasing in gN∗ and hA∗ is increasing in gN∗, so a sufficient condition

for f to be increasing in gN∗ is that G∗hA∗ is also increasing in gN∗. With κ = 0, using

(108), (107), we get:

G∗hA∗ =η(

κγ(1−κ)

)κ+1 (ρ+ ((θ − 1)ψ + 1) gN∗

)κ+1

η(

κγ(1−κ)

)κ(ρ+ ((θ − 1)ψ + 1) gN∗)κ + gN∗

.

Therefore

d(G∗hA∗

)dgN∗

=

η( κγ(1−κ))

κ+1(ρ+((θ−1)ψ+1)gN∗)

κ

(η( κγ(1−κ))

κ(ρ+((θ−1)ψ+1)gN∗)κ+gN∗)

2

×

(η(

κγ(1−κ)

)κ((θ − 1)ψ + 1)

(ρ+ ((θ − 1)ψ + 1) gN∗

)κ−ρ+ gN∗κ ((θ − 1)ψ + 1)

) .

Since gN∗ > 0, we get thatd(G∗hA∗)dgN∗

> 0 (so that the steady state is unique) if(1−κ)κγκ

ηκκρ1−κ < (θ − 1)ψ + 1. This condition is likely to be met for reasonable pa-

rameter values as long as the automation technology is not too concave: ρ is a small

number, θ ≥ 1 and γ and η being innovation productivity parameters should be of the

same order (it is indeed met for our baseline parameters).

80

7.5.3 Transitional dynamics and the first phase

Combining (27) and (26), we can write:

NthAt =

(κηGκ

t

(∫ ∞t

exp

(−∫ τ

t

rudu

)(Nt

wHt

(πAτ − πNτ

)dτ − 1− κ

κ

Nt

Nτ

wHτwHt

(Nτh

Aτ

))dτ

)) 11−κ

.

Using (8) and that aggregate profits Πt = Nt

(Gtπ

At + (1−Gt) π

Nt

)are a share 1/σ of

output, we can rewrite this equation as:

hAt =

(κηGκ

t

(∫ ∞t

exp

(−∫ τ

t

rudu

)(ψHP

t

πAτ − πNτGtπAt + (1−Gt) πNt

dτ − 1− κκ

Nt

Nτ

vτvthAτ

)dτ

)) 11−κ

.

(114)

Recalling (7), we can write:

hAt =

κηGκt

∫ ∞t

ψHPt

(1+ϕwε−1Lτ )

µ−1

Gt(1+ϕwε−1Lt )

µ+1

exp(∫ τ

t

(gπ

N

u − ru)du)

−1−κκ

exp(∫ τ

t

(gvu − gNu − ru

)du)hAτ

dτ

1

1−κ

.

Consider a fixed t > 0. Then for an arbitrarily large T , if wL0 is sufficiently small

relative to ϕ−1, we will have that wLt is small relative to ϕ−1 over(0, t+ T

). For any

τ ∈(0, t+ T

), we have that

(1+ϕwε−1Lτ )

µ−1

Gt(1+ϕwε−1Lt )

µ+1

= µϕwε−1Lτ + o

(ϕwε−1

Lτ

). The notation o (z)

denotes negligible relative to z (that is f (z) = o (z), if limz→0

f (z) /z = 0) and O (z) will

denote of the same order or negligible in front of z (f (z) = O (z) if lim supz→0

|f (z) /z| <

∞). Then for any t ∈(0, t)

(hAt

)1−κ≤ κηGκ

t

∫ t+Tt

ψHPt

(µϕwε−1

Lτ + o(ϕwε−1

Lτ

))exp

(∫ τt

(gπ

N

u − ru)du)dτ

+∫∞t+T

ψHPt

(1+ϕwε−1Lτ )

µ−1

Gt(1+ϕwε−1Lt )

µ+1

exp(∫ τ

t

(gπ

N

u − ru)du)dτ

.

Further, we know that ru = ρ + θgCu with θ ≥ 1. In addition Cu = Yu − Xu, with Xu

the aggregate spending on machines (initially negligible and later on a share of output

bounded away from 1) and πNu initially grows like Yu/Nu (and from then on will grow

81

slower), therefore we have that ru − gπN

u > ρ. Hence one can write:(hAt

)1−κ

≤ κηGκt

(∫ t+T

t

µψHPt ϕw

ε−1Lτ exp

(∫ τ

t

(gπ

N

u − ru)du

)dτ + o

(ϕwε−1

Lt+T

)+ o

(e−ρ(T+t−t)

))

Since ru − gπN

u > ρ, there exists a φ > 0, such that

∫ t+T

t

exp

(∫ τ

t

(gπ

N

u − ru)du

)dτ ≤

∫ t+T

t

e−(ρ+φ)(τ−t)dτ

≤ 1

ρ+ φ

(1− e−(ρ+φ)(t+T−t)

).

This allows us to rewrite:

(hAt

)1−κ≤ κηGκ

t

(µψHP

t ϕwε−1

Lt+T

ρ+ o

(ϕwε−1

Lt+T

)+ o

(e−ρT

))

Therefore, since T is large and ϕwε−1

Lt+Tis small, then hAt must be small too. In fact, we

get that hAt = O

((ϕwε−1

Lt+T

) 11−κ)

+ o(e−ρT

).

For any t ∈(0, t), we can then rewrite (40) as

χtχt

= γψHP − ρ− (θψ − ψ + 1) gNt +O

((ϕwε−1

Lt+T

) 11−κ)

+ o(e−ρT

). (115)

Using (46) we obtain

Ct = Yt −Xt =(1 +O

(Gtϕw

ε−1Lt

))Yt.

Next (5) and the corresponding equation for high-skill labor demand in production imply:

LNA

LA=

(1−Gt)(1 + ϕwε−1

Lt

)−µ−1

Gt

andHP,NA

HP,A=

(1−Gt)(1 + ϕwε−1

Lt

)−µGt

.

Using (3), we can then write

Yt = N1

σ−1Lβ(HPt

)1−β ×

82

Gt

[1 +O

(ϕwε−1

Lt

)+

(O(ϕwε−1

Lt

)ϕ

1εYtL

) ε−1ε

] εβε−1

σ−1σ (

1 +O(ϕwε−1

Lt

))+ 1−G+O

(ϕwε−1

Lt

)σσ−1

Note that we have wLt = O (Yt/L) therefore ϕ1εYt/L = O

(ϕ

1εwLt

). Therefore

Yt =(1 +O

(ϕwε−1

Lt

))N

1σ−1Lβ

(HPt

)1−β.

From this, using (8), one obtains that high-skill wages obey:

wHt =(1 +O

(ϕwε−1

Lt

)) σ − 1

σ(1− β)N

1σ−1

t Lβ(HPt

)−β,

while for low-skill wages, we get

wLt =(1 +O

(ϕwε−1

Lt

)) σ − 1

σβN

1σ−1

t Lβ−1(HPt

)1−β. (116)

Therefore using the definition of χt, we obtain that

χt =(1 +O

(ϕwε−1

Lt

))σψLβ(θ−1)

(HPt

)(1−β)θ+βN

(1−θ)β(σ−1)(1−β)t .

Differentiating and plugging into (115) and using (48), we get (recalling (116) so that

d ln(1 +O

(ϕwε−1

Lt

))/dt will be of order O

(ϕwε−1

Lt

)as well).

((1− β) θ + β)

·HPt

HPt

= γψHPt −ρ−

(θ − 1

σ − 1+ 1

)γ(H −HP

t

)+O

((ϕwε−1

Lt+T

) 11−κ)

+o(e−ρT

),

(117)

we dropped terms in ϕwε−1Lt since there will negligible in front of

(ϕwε−1

Lt+T

) 11−κ

. The

exact counterpart of this system admits a BGP with HPt constant, and as in the Romer

(1990), there is no transitional dynamics. Therefore, here, we must have over the interval(0, t)

HPt =

(θ−1σ−1

+ 1)H + ρ

γ

ψ + θ−1σ−1

+ 1+O

((ϕwε−1

Lt+T

) 11−κ)

+ o(e−ρT

)and gNt =

γHψ − ρψ + θ−1

σ−1+ 1

+O

((ϕwε−1

Lt+T

) 11−κ)

+ o(e−ρT

)(118)

which is positive under assumption (25). We then have that for Nt low, (22) can be solved

83

as Gt = G0 exp

(− γHψ−ρψ+ θ−1

σ−1+1t

)+ O

((ϕwε−1

Lt+T

) 11−κ)

+ o(e−ρT

). This characterizes the

solution during Phase 1. We can summarize our discussion in the following Proposition.

Proposition 7. For N0 low enough that wL0 is small relative to ϕ1/(ε−1), then there

is an interval(0, t)

during which the automation rate ηGκt

(hAt

)κis small, the share of

automated products Gt depreciates and the economy behaves similarly to that of a Romer

model where automation is impossible.

7.5.4 Transition from the first to the second phase

We prove the following Proposition:

Proposition 8. If κ(1− β) + κ < 1, then Gt cannot go toward 0.

If κ = 0, Phase 1 cannot last forever as at some point, Nt and therefore wLt will

become large. Since, the Poisson rate is η(hAt

)κ= O

(ϕwε−1

Lt

). This implies that Gt

must start growing at a positive rate and that we enter the second phase.

When κ > 0 (and G0 6= 0, otherwise automation is impossible), however, whether the

Poisson rate of automation becomes negligible or not depends on a horse race between the

drop in the share of automated products (and therefore the efficiency of the automation

technology) and the rise in the low-skill wages (which, through horizontal innovation can

become arbitrarily large). We look for a sufficient condition under which the Poisson

rate will take off.

First assume that Gtwβ(σ−1)Lt does not tend towards 0. Then from (114) we obtain

that:

hAt = hAt(Gκ−1t

) 11−κ =⇒ ηGκ

t

(hAt

)κ= O

(G

κ−κ1−κt

)(119)

Since κ ≤ κ, we obtain that the Poisson rate of automation diverges: a contradiction.

Assume now thatGtwβ(σ−1)Lt does tend towards 0. This ensures that wLt = O

(N

1σ−1

t

).

Moreover,πAt −πNt

GtπAt +(1−Gt)πNt= O

(wβ(σ−1)Lt

). Then using this in (114), we obtain

hAt = O(Gκtw

β(σ−1)Lt

) 11−κ

.

Note that hAt must remain bounded otherwise high-skill labor market clearing is violated.

Therefore, we must haveGκtw

β(σ−1)Lt bounded (which implies thatGtw

β(σ−1)Lt tends towards

84

0). Therefore the Poisson rate obeys:

ηGκt

(hAt

)κ= O

(G

κ1−κt N

βκ1−κt

)Plugging this in (32) we get:

·Gt = O

(G

κ1−κt N

βκ1−κt

)− gNt Gt

To obtain that the share Gt is going towards 0, it must first be that Gκ

1−κt N

βκ1−κt declines

at the same rate or faster than Gt.

Consider first the case where, Gκ

1−κt N

βκ1−κt and Gt are of the same order. In that case,

we must have:

Gt = O

(N

βκ1−κ−κt

)This cannot go towards 0 if 1−κ− κ > 0. In addition, recall that this reasoning assumed

that Gκtw

β(σ−1)Lt remains bounded. We have

Gκtw

β(σ−1)Lt = O

(N

β(1−κ)(1−κ)1−κ−κ

t

),

which is indeed declining if 1 − κ − κ < 0. Furthermore, in that case we must have·Gt ≥ −gNt Gt, that is Gtshould not decline at a rate faster than N−1

t . This implies that

we must have βκκ+κ−1

≤ 1⇐⇒ κ(1− β) + κ ≥ 1.

Alternatively, if Gκ

1−κt N

βκ1−κt goes towards 0 faster than Gt then Gt will be declining

at the rate gNt , so that we have Gt = O(N−1t

). This then implies

Gκ

1−κt N

βκ1−κt /Gt = O

(N

βκ+1−κ−κ1−κ

t

).

As soon as κ(1− β) + κ < 1 then this cannot go towards 0.

Therefore κ(1 − β) + κ < 1 is a sufficient condition which ensures that the Poisson

rate of automation must take off.

7.5.5 Transitional dynamics in the third phase

In this appendix, we prove two results:

85

Lemma 2. If Gt is bounded above 0 then hAt is bounded.

Proposition 9. If Gt and gNt admit positive limits then the economy converges toward

a steady-state as described in Proposition 3,

We then provide details on the behavior of the economy close to the steady-state.

Proof of Lemma 2. Assume that Gt is bounded above 0 (in fact the analysis of

Phase 2 in Appendix 7.5.4 shows that as long as κ (1− β) + κ < 1, it is impossible

to have G∞ = 0). Note that HPt must be bounded below otherwise there would be

arbitrarily large welfare gains from increasing consumption at time t and reducing it at

later time periods. As HPt is also bounded above (by H), then we must have (following

a reasoning similar to that in Appendix 2.4), that wHt = Θ(Nψt

), Ct = Θ

(Nψt

)and

wLt is bounded below, so that vt and ct are bounded above and below and ωt must be

bounded above.

Integrating (19), using the transversality condition and dividing by wHt/Nt, we get:

V At

wHt/Nt

=

∫ ∞t

exp

(−∫ s

t

r (u) du

)πAs

wHt/Nt

ds,

using the Euler equation (24), this leads to:

V At

wHt/Nt

=

∫ ∞t

exp (−ρ (t− s))(C (s)

C (t)

)−θπAs N

ψ−1s

vsNψ−1t

ds.

Rewriting this expression with the normalized variables and using (45), we get:

V At

wHt/Nt

=

∫ ∞t

exp (−ρ (s− t))(Ns

Nt

)−(1+(θ−1)ψ)c (s)θ

c (t)θ

ψ(ϕ+ (ωsns)

1µ

)µHPs

Gs

(ϕs + (ωsns)

1µ

)µ+ (1−Gs)ωsns

vsvtds.

Note that c(s)c(t)

,ψ

(ϕ+(ωsns)

1µ

)µHPs

G

(ϕs+(ωsns)

1µ

)µ+(1−Gs)ωsns

and vsvt

are all bounded and that Ns is weakly

increasing, therefore we get that

V At

wHt/Nt

≤∫ ∞t

exp (−ρ (s− t))Mds,

for some constant M . This ensures thatV At −V NtwHt/Nt

must remain bounded, and following

(21), hAt must be bounded as well.

86

Proof of Proposition 9. Assume now that Gt has a limit G∞ and that gNt also

has a positive limit gN∞. Then (32) implies that G∞ < 1. Following Proposition (2), we

then get that wLt = O

(N

ψ1+β(σ−1)

t

)or ωt = O (1). Therefore, we can rewrite the system

as (32),

˙hAt = γ

1

κ

(hAt

)2

−ηκGκ

t

(hAt

)κ1− κ

πAtvt

+ ηGκt

(hAt

)κ+1

+κ

1− κ

(ηGκ−1

t

(hAt

)κ+1

(1−Gt)− gNt hAt)

+O (nt) ,

χt = χt

(γ

1− κκ

hAt − ρ− (θψ − ψ + 1) gNt

)+O (nt) .

Knowing that

HPt =

(σ − 1

σ

) 11−β ( 1

θ−β) (1− β)

1θ β

β1−β ( 1

θ−1)χ

1θt (Gtϕ

µ)ψ( 1

θ−1)

Gtϕµ(1− β σ−1

σ

) +O

(n

1µ

t

), (120)

πAtvt

=ψHP

t

Gt

+O

(n

1µ

t

), (121)

and (48). Using that gNt and Gt have limits in (32) implies that hAt must also have

a limit. Using (48), this implies that HPt must also have a limit and therefore using

(120) that χt must have a limit. In other words, the equilibrium path tends toward the

steady-state(hA∗, G∗, χ∗

)with nt → 0 defined in Proposition 3.

Behavior close to the steady-state. In this steady-state, using (108) we get

gN∗ =1

(θ − 1)ψ + 1

(γ (1− κ)

κhA∗ − ρ

).

Therefore in the third phase, we obtain that Nt grows at rate gNt = gN∗+ o (1), that the

share of automated product obeys Gt = G∗ + o (1), with the mass of high-skill workers

in automation given by HAt = (1−G∗)hA∗ + o (1) and the mass of high-skill workers in

production given by HPt = HP∗+ o (1), with HP∗ given by (109). Using (43), we obtain

87

that wages obey:

wHt = (1− β)

(σ − 1

σββ) 1

1−β

(G∗ϕµ)ψNψt + o

(Nψt

),

and

wLt = (ω∗)1

β(1−σ) Nψ

1+β(σ−1)

t + o

(N

ψ1+β(σ−1)

t

).

Using (121), the profit made by an automated firm are given by

πAt =1

σ

((σ − 1) βϕ

εε−1

σ

) β1−β

HP∗ (G∗)ψ−1Nψ−1t + o

(Nψ−1t

)while the profits made by a non-automated firm πNt are negligible in front of πAt . Using

(33), the value of an automated firm is then simply given by:

V At =

πAtr∗ − (ψ − 1) gN∗

+ o(Nψ−1t

), (122)

where

r∗ = ρ+ θψgN∗ (123)

is the steady-state interest rate. Following (34) and (35), the normalized value of a

non-automated firm obeys:

(rt − (ψ − 1) gNt

)V Nt = πNt + (1− κ) ηGκ

t hAt

(V At − V N

t

)+

·

V Nt .

Therefore, one gets that for large Nt,

V Nt =

(1− κ) ηG∗κhA∗

r∗ − (ψ − 1) gN∗ + (1− κ) ηG∗κhA∗V At + o

(Nψ−1t

)(124)

so that asymptotically all the value of a new firm comes from the profits it makes once

automated (obviously we still get that in equilibrium V Nt = wHt/Nt).

Comparing the growth rate in the number of products in Phase 1 and

Phase 3. Use (122), (124) and (123) to get that in steady-state:

V N∗ = fV A∗

88

with

f =(1− κ) ηG∗κhA∗

ρ+ (ψ (θ − 1) + 1) g∗N + (1− κ) ηG∗κhA∗, (125)

V A∗ =πAt

ρ+ (ψ (θ − 1) + 1) g∗N.

Using V N∗ = v∗/γ and (121), we then obtain:

f1

ρ+ (ψ (θ − 1) + 1) g∗N

ψHP∗

G∗=

1

γ.

Rearranging terms, this leads to

gN∗ =

fG∗γψ(H − (1−G∗) hA∗

)− ρ

fG∗ψ + θ−1

(σ−1)(1−β)+ 1

, (126)

while from (118) the growth rate in the first period is approximately given by

gN1 =γHψ − ρψ + θ−1

σ−1+ 1

. (127)

The two expressions differ by three terms: In the numerator, H − (1−G∗) hA∗ in (126)

replaces H in (127), since some high-skill workers are hired to automate in Phase 3, the

pool of high-skill workers available for horizontal innovation or production is smaller,

and this force pushes toward gN∗ < gN1. In the denominator, θ−1(σ−1)(1−β)

in (126) replacesθ−1σ−1

in (127), this reflects the fact that the growth rate in the number of products has a

larger impact on the economy growth rate in phase 3 than in phase 1, which in return

reduces the present value of an automated firm (it increases the effective interest rate).

This also pushes toward gN∗ < gN1. Finally the term f/G∗ in (126) does not exist

in (127). Note that ∂gN∗/ (∂f/G∗) > 0, so that this term reflects two different forces:

on one hand in phase 3, the value of a new firm is a fraction f < 1 of the value of an

automated firm and this reduces the growth rate in the number of products, on the other

hand a lower G∗ reduces the demand for high-skill workers and therefore high-skill wages

compared to the profits an automated firm, which increases the asymptotic growth rate

89

in the number of products. Combining (32), (107) and (125), we get

f

G∗< 1⇔ 1

G(1− κ) ηG∗κhA∗ < ρ+ (ψ (θ − 1) + 1) g∗N + (1− κ) ηG∗κhA∗

⇔ 1−GG

(1− κ) ηG∗κhA∗ < ρ+ (ψ (θ − 1) + 1) g∗N ,

⇔ (1− κ) gN∗ < ρ+ (ψ (θ − 1) + 1) g∗N .

Since θ ≥ 1 and κ < 1, this equation necessarily holds. Therefore f < G∗ and it is

always the case that gN∗ > gN1.

7.5.6 Comparative statics

In this section, we prove Proposition 4. The proposition is established when the steady

state is unique but it extends to the case of the steady states with the highest and lowest

growth rates when there is multiplicity. Recall that the steady state is characterized as

the solution to an equation f(gN∗)

= 1 through (111), where G∗, hA∗ and HP∗ can all

be written as functions of gN∗ and parameters. Moreover, when there is a single steady

state (as well as for the steady states with the highest and the lowest growth rates in

case of multiplicity), f must be increasing in the neighborhood of gN∗.

Comparative static with respect to γ. (108) implies that hA∗ is inversely pro-

portional to γ (for given gN∗). Formally, we have:

∂hA∗

∂γ= − h

A∗

γ. (128)

Differentiating (107) and using (128) leads to:

∂G∗

∂γ=

−κgN∗G∗

γ(η (G∗)κ

(hA∗)κ

+ (1− κ) gN∗) , (129)

so that for a given gN∗, G∗ is also decreasing in γ. Using (109), (128) and (129), we get:

∂HP∗

∂γ=

1

γ

gN∗

γ+ (1−G∗) (1− κ) hA∗

+(1−κ)κhA∗G∗(gN∗)

2

(η(G∗)κ(hA∗)κ)(η(G∗)κ(hA∗)

κ+(1−κ)gN∗)

> 0

90

so that HP∗ is increasing in γ. Note that f , defined in (111), can be rewritten as

f(gN∗)

=1− κκ

1

ψHP∗

(G∗)1−κ(hA∗)1−κ (

γhA∗)

κη+G∗hA∗

,

which shows that f is decreasing in γ for a given gN∗ (HP∗ is increasing, G∗ and hA∗ are

decreasing, and γhA∗ is constant). Since f is increasing in gN∗ at the equilibrium value,

(111) implies that gN∗ increases in γ.

Comparative static with respect to η. For given gN∗, (108) implies that hA∗

does not depend on η. Differentiating (107), we get:

∂ lnG∗

∂ ln η=

gN∗


+ (1− κ) gN∗, (130)

so for given gN∗, G∗ increases in η. (109) implies then that

∂ lnHP∗

∂ ln η=G∗hA∗

HP∗gN∗


+ (1− κ) gN∗.

Using this equation together with (130) and (111), we obtain:

∂ ln f

∂ ln η=

gN∗

η(G∗)κ(hA∗)κ

+(1−κ)gN∗

(1− G∗hA∗

HP∗ − κ1

κη(G∗)κ (hA∗)1−κ

1


+ 1γ

)−

1


1


+ 1γ

.

Using (108), we can rewrite this as:

∂ ln f

∂ ln η=

− 1

η(G∗)κ(hA∗)κ

+(1−κ)gN∗

×

gN∗G∗hA∗

HP∗ + ρ+((θ−1)ψ+κ)gN∗

γ(1−κ)

(1

ηLκGκL

(hA∗)1−κ

+ 1γ

) ,

so that f is decreasing in η. This implies that gN∗ must be increasing in η. Since hA∗

only depends on η through gN∗, we also get that hA∗ increases in η.

91

7.6 Simulation technique

In the following we describe the simulation techniques employed for the baseline model

presented in 2. The approach for the extensions follow straightforwardly. Let xt ≡(nt, Gt, h

At , χt, ωt) and note that equation (49) defines ω implicitly. We can therefore

write equations (31), (32), (39) and (40) as a system of autonomous differential equations

(nt, Gt,˙hAt , χt) = F (xt) with initial conditions on state variables as (n0, G0) and an

auxiliary equation of ωt = ϑ(xt).

For the numerical solution, we specify a (small) time period of dt > 0 and a (large)

number of time periods T . Using this we approximate the four differential equations by

(T − 1)× 4 errors as:

st = (nt+1 − nt

dt,Gt+1 −Gt

dt,hAt+1 − hAt

dt,χt+1 − χt

dt)− F ((xt + xt+1)/2), t = 1, ...T − 1

with T corresponding errors for ωt :

sωt = ωt − ϑ(xt), t = 1, ..., T.

As shown in Appendix 7.5.1 for a set of parameter values, the system admits an

asymptotic steady state. We assume that the system has reached this asymptotic steady

state by time T and restrict hAT and χT accordingly. Together with the initial conditions

(n1 = nstart and G1 = Gstart) this leads to a vector of errors:

sT ≡ (n1 − nstart, G1 −Gstart, hAT − hA∗, χT − χ∗)′.

Letting x = xtTt=1, we then stack errors to get a vector, S(x), of length 5T and solve

the following problem:

x = argminxS(x)′WS(x),

for a 5T × 5T diagonal weighting matrix, W , and the 5T vector x. For dt → 0 and

T → ∞ S(x)′WS(x) → 0. For the simulations we set dt = 2 and T = 2000. We

accept the solution when S(x)′WS(x) < 10(−7), but the value is typically less than

10(−20). The choice of weighting matrix matters somewhat for the speed of convergence,

but is inconsequential for the final result. With the solution xt, ωtTt=1 in hand, it is

straightforward to find remaining predicted values.

92

7.7 Complements on simulation

7.7.1 Additional results on the baseline simulation

0 100 200 300 4000

1

2

3

4

Year

Per

cen

t

Panel A: Growth Rate of Consumption

gC

0 100 200 300 400

0

5

10

Year

units

Panel B: Log Consumption per Capita

Low−skill consumptionHigh−skill consumption

0 100 200 300 40040

60

80

100

120

140Panel C: Share of Assets Held by Low−Skill Workers

Year

Per

cen

t

% Share of Assets held by L

0 100 200 300 4002

4

6

8

10

Year

units

Panel D: Wealth Ratio

Wealth ratio

Figure 12: Consumption and wealth for baseline parameters. Panel A shows yearly growthrates for consumption, Panel B log consumption of high-skill workers and low-skillworkers (per capita), Panel C the share of assets held by low-skill workers andPanel D the wealth to GDP ratio.

Wealth and consumption. Figure 12 shows the evolution of wealth and consumption

for the baseline parameters both in the aggregate and for each skill group. Panel A

shows that consumption growth follows a pattern very similar to that of GDP growth

(displayed in Figure 3.A), which is in line with a stable ratio of total R&D expenses over

GDP across the three phases (Figure 3.D). In the absence of any financial constraints,

low-skill and high-skill consumption must grow at the same rate, with high-skill workers

consuming more since they have a higher income (Panel B). Since low-skill labor income

becomes a negligible share of GDP , while the high-skill labor share increases, a con-

stant consumption ratio can only be achieved if high-skill workers borrow from low-skill

workers in the long-run. This is illustrated in Panel C, which shows the share of assets

held by low-skill workers, under the assumption that initially assets holdings per capita

are identical for low-skill and high-skill workers (so that low-skill workers hold 2/3 of

the assets in year 0, since with these parameters H/L = 1/2). Initially, low-skill and

93

high-skill income grow at a constant rate so that the share of assets held by low-skill

workers is stable; but, in anticipation of a lower growth rate for low-skill wages than for

high-skill wages, low-skill workers start saving more and more, and the share of assets

they hold increases. This share eventually reaches more than 100%, meaning that the

high-skill workers net worth becomes negative. As claimed in the text, Panel D shows

that since profits become a higher share of GDP (an effect which dominates a temporary

increase in the interest rate in Phase 2), the wealth to GDP ratio increases in phase 2,

such that its steady state value is nearly 3 times higher than its original, which still need

to be automated (foll value.

The accumulation of asset holdings by low-skill workers predicted by the model seems

counter-factual, it results from our assumptions of infinitely lived agents with identical

discount rates and no financial constraints. Reversing these unrealistic assumptions

would change the evolution of the consumption side of the economy but should not alter

the main results which are about the production side.

0 100 200 300 400Year

-3

-2

-1

0

1

2

3

4

Per

cen

t

Panel A: Low-skill wages growth decomposition

Total gwL

Horiz. innov. contributionAutomation contribution

0 100 200 300 400Year

-3

-2

-1

0

1

2

3

4

Per

cen

t

Panel B: High-skill wages growth decomposition

Total gwH

Horiz. innov. contributionAutomation contribution

Figure 13: Growth decomposition. Panel A: The growth rate of low-skill wages and theinstantaneous contribution from horizontal innovation and automation, respec-tively. Panel B is analogous for high-skill wages. See text for details.

Growth decomposition. Figure 13 performs a growth decomposition exercise for

low-skill and high-skill wages by separately computing the instantaneous contribution

of each type of innovation. We do so by performing the following thought experiment:

at a given instant t, for given allocation of factors, suppose that all innovations of a

given type fail. By how much would the growth rates of wL and wH change?34 In Phase

34More specifically we can write wLt = f(Nt, Gt, HPt ), using equations (10) and (11). Differentiating

with respect to time and using equation (32) gives:

gwLt =

(NtwLt

∂f

∂N− GtwLt

∂f

∂G

)γHD

t +1

wLt

∂f

∂GηGκt (1−Gt)(hAt )κ +

1

wLt

∂f

∂HPHPt .

94

1, there is little automation, so wage growth for both skill-groups is driven almost en-

tirely by horizontal innovation. In Phase 2, automation sets in. Low-skill labor is then

continuously reallocated from existing products which get automated, to new, not yet

automated, products. The immediate impact of automation on low-skill wages is neg-

ative, while horizontal innovation has a positive impact, as it both increases the range

of available products and decreases the share of automated products. The figure also

shows that automation plays an increasing role in explaining the instantaneous growth

rate of high-skill wages, while the contribution of horizontal innovation declines. This is

because new products capture a smaller and smaller share of the market and therefore

do not contribute much to the demand for high-skill labor. Consequently, automation is

skill-biased while horizontal innovation is unskilled-biased. We stress that this growth

decomposition captures the immediate effect of automation or horizontal innovation.

This should not be interpreted as “automation being harmful” to low-skill workers in

general. In fact, as we demonstrate in Section 3.7, an increase in the effectiveness of the

automation technology, η, will have positive long-term consequences. A decomposition

of gGDPt would look similar to the decomposition of gwHt : while instantaneous growth is

initially almost entirely driven by horizontal innovation, automation becomes increas-

ingly important in explaining it (long-run growth, however, is ultimately determined by

the endogenous rate of horizontal innovation).

7.7.2 Decreasing growth rate

Figure 14 shows a case where growth is substantially lower in Phase 3 than in Phase

1 and in fact continuously decreases from the middle of Phase 2. The parameters are

identical to the baseline case except for: σ = 2.5, β = 0.55, η = 0.1, γ = 0.23 and

N0 = 344.25, these parameters lower the growth rate of the economy particularly in

the asymptotic steady state because automation consumes more resources and is less

effective as high-skill workers have a larger factor share in production (N0 is higher so

as to shorten Phase 1 in the graph). As argued in the text, this case illustrates that our

model does not necessarily predict that intense automation needs to be associated with a

boost in economic growth. Conversely, making the automation technology more effective

(say by reducing the cost share of high-skill workers, β) could create the opposite pattern

of a low initial growth rate followed by a higher eventual growth rate.

Figure 13 plots the first two terms as the growth impact of expenses in horizontal innovation andautomation, respectively. The third term ends up being negligible. We perform a similar decompositionfor wHt.

95

0 100 200 300 400Year

0

1

2

3

4P

er c

ent


gGDP

gwL

gwH

0 100 200 300 4000

5

10

15

Per

cen

t Autom./GDP (left)Hori./GDP (left)

0 100 200 300 400Year

0

20

40

60

80

100Panel B: Research Expenditures and G

G (right)

Figure 14: Transitional dynamics with a low growth rate in Phase 2.

7.7.3 A delayed decline in the labor share

Empirically, the drop in the labor share is a more recent phenomenon than the increase

in the skill premium. In Figure 15, we choose parameters such that this happens.

The automation technology is more productive η = 0.4; the automation technology

is less concave κ = 0.9 (so that a higher incentive to automate is required to get a

significant share of high-skill workers in automation innovation); and all other parameters

are identical to the baseline case. In this case, more high-skill workers get allocated to

automation during Phase 2: as shown in Panel C automation expenditures represent a

much larger share of GDP during Phase 2 than they do in the baseline case. The mass of

high-skill workers engaged in production declines during Phase 2. This results first in a

sharper increase in the skill premium (the skill premium condition moves further to the

left). In addition, the drop in the labor share is delayed since innovation spending are

part of GDP while capital income is a constant share of output Y . The growth rates of

low-skill and high-skill wages start diverging significantly from around year 135 and by

year 150, the high-skill wage growth rate is 2pp higher than the low-skill wage growth

rate, while the total labor share only start declining from around year 150 and in fact

increases slightly before.

96

0 100 200 300 400Year

0

1

2

3

4

5

Per

cen

t


gGDP

gwL

gwH

0 100 200 300 400Year

0

1

2

3 automation incentive (left)

0 100 200 300 400Year

0

2

4

6

8

log

unit

Panel B: Automation Incentive and Skill Premium


0 100 200 300 400Year

0

5

10

15

Per

cen

t


0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t


G (right)

0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t



Figure 15: Transitional dynamics with a delayed drop in capital share

7.7.4 The effect of the innovation parameters

0 200 400

0

50

100

Year

Per

cen

t

Panel A: GDP deviation

High γHigh η

0 200 400

0

50

100

Year

Per

cen

t

Panel B: Low−skill wages deviation

High γHigh η

0 200 400

0

50

100

Year

Per

cen

t

Panel C: Skill premium deviation

High γHigh η

Figure 16: Deviations from baseline model for more productive horizontal innovation tech-nology (γ) and more productive automation technology (η).

Figure 16 shows the impact (relative to the baseline case) of increasing productivity in

the automation technology to η = 0.4 (from 0.2) and the productivity in the horizontal

innovation technology to γ = 0.32 (instead of 0.3). A higher η initially has no impact

during Phase 1, but it moves Phase 2 forward as investing in automation technology is

profitable for lower level of low-skill wages. Since automation occurs sooner, the absolute

level of low-skill wages drops relative to the baseline case (Panel B), which leads to a fast

increase in the skill premium. However, as a higher η means that new firms automate

97

100 150 200 250Year

-20

-10

0

10

20

Per

cen

tPanel A: High-skill workers

in automation relative to baseline

-50% shock on G150

+50% shock on G150

100 150 200 250Year

-50

-40

-30

-20

-10

0

10

20

30

40

50

Per

cen

t

Panel B: Share of automated products relative to baseline

-50% shock on G150

+50% shock on G150

100 150 200 250Year

-10

-5

0

5

10

Per

cen

t

Panel C: skill premium relative to baseline

-50% shock on G150

+50% shock on G150

Figure 17: Technological shocks on Gt in year 150.

faster, it encourages further horizontal innovation. A faster rate of horizontal innovation

implies that the skill premium keeps increasing relative to the baseline, but also that

low-skill wages are eventually larger than in the baseline case. A higher productivity for

horizontal innovation, γ, implies that GDP and low-skill wages initially grow faster than

in the baseline. Therefore Phase 2 starts sooner, which explains why the skill premium

jumps relative to the baseline case before increasing smoothly.

7.7.5 Technological shocks

Figure 17 shows the effect of a technological shock in the form of an exogenous change

in Gt in year 150 (for the same parameters as in the baseline case). Panel A shows that

after an increase in G, the mass of high-skill workers in automation innovation declines,

this guarantees that the share of automated products goes back to its initial path (panel

B). The skill premium is reduced by the shock but thereafter increases relative to the

baseline level (panel C). It does not converge back to exactly the same level as before,

however, because the shock to Gt and the following decrease in HAt implies that for some

time more high-skill workers undertake horizontal innovation which increases a bit the

number of automated products relative to the baseline.

7.7.6 Systematic comparative statics

In this section we carry a systematic comparative exercise with respect to the parameters

of the model, namely σ, ε, β, ρ, θ, ϕ, η, κ, κ,γ,H/L (we keep H + L = 1), N0, G0. We

show the evolution of the growth rate of high-skill and low-skill wages and the share of

98

automated products for the baseline parameters and two other values for one parameter,

keeping all the other ones fixed. In all cases, the broad structure of the transitional

dynamics in three phases is maintained.

0 100 200 300 400

2

4

6

Year

Per

cen

t

Panel A: Growth rate of high−skill wages

σ=3.5baseline: σ=3σ=2.5

0 100 200 300 4000

2

4

Year

Per

cen

t

Panel B: Growth rate of low−skill wages


0 100 200 300 4000

50

100

Year

Per

cen

t

Panel C: Share of automated products


0 100 200 300 400

2

4

6

Year

Per

cen

t

Panel D: Growth rate of high−skill wages

ε=6baseline: ε=4ε=3

0 100 200 300 4000

2

4

Year

Per

cen

tPanel E: Growth rate of low−skill wages


0 100 200 300 4000

50

100

Year

Per

cen

t

Panel F: Share of automated products


0 100 200 300 400

2

4

6

Year

Per

cen

t

Panel G: Growth rate of high−skill wages

β=3/4baseline: β=2/3β=1/2

0 100 200 300 4000

2

4

Year

Per

cen

t

Panel H: Growth rate of low−skill wages


0 100 200 300 4000

50

100

YearP

er c

ent

Panel I: Share of automated products


Figure 18: Comparative statics with respect to the elasticity of substitution acrossproducts (σ), the elasticity of substitution between machines and low-skillworkers in automated firms (ε) and the factor share of low-skill workersand machines in production (β).

Figures 18.A,B,C show that a higher elasticity of substitution across products σ

reduces the growth rate of the economy (the elasticity of output with respect to the

number of products is lower), which leads to a delayed transition. The asymptotic

growth rate of low-skill wages is a smaller fraction of that of high-skill wages (following

Proposition 2), since automated products are a better substitute for non-automated

ones. Figures 18.D,E,F show that the elasticity of substitution between machines and

low-skill workers in automated firms, ε, plays a limited role (as long as the assumption

µ < 1 is kept), a higher elasticity reduces the growth of low-skill wages and increases

that of high-skill wages during Phase 2. Figures 18.G,H,I show that a lower factor share

in production for high-skill workers (a higherβ) increases the growth rate of the economy

(high-skill wages are lower which favors innovation). As a result Phase 2 occurs sooner.

In addition, following Proposition 2, the asymptotic growth rate of low-skill wages is a

99

lower fraction of that of high-skill wages (the cost advantage of automated firms being

larger).

0 100 200 300 400

2

4

6

Year

Per

cen

t


ρ=0.03baseline: ρ=0.02ρ=0.01

0 100 200 300 4000

2

4

Year

Per

cen

t


ρ=0.03baseline: ρ=0.02ρ=0.01

0 100 200 300 4000

50

100

Year

Per

cen

t


ρ=0.03baseline: ρ=0.02ρ=0.01

0 100 200 300 400

2

4

6

Year

Per

cen

t


θ=2.5baseline: θ=2θ=1.5

0 100 200 300 4000

2

4

Year

Per

cen

t

Panel E: Growth rate of low−skill wages


0 100 200 300 4000

50

100

Year

Per

cen

t



0 100 200 300 400

2

4

6

Year

Per

cen

t


ϕ=0.35

baseline: ϕ=0.25

ϕ=0.15

0 100 200 300 4000

2

4

Year

Per

cen

t

Panel H: Growth rate of low−skill wages

ϕ=0.35

baseline: ϕ=0.25

ϕ=0.15

0 100 200 300 4000

50

100

Year

Per

cen

t


ϕ=0.35

baseline: ϕ=0.25

ϕ=0.15

Figure 19: Comparative statics with respect to the discount rate (ρ), the inverse elasticityof intertemporal substitution (θ) and the productivity of machines (ϕ)

Figures 19.A,B,C show that a higher discount rate ρ reduces the growth rate of the

economy, which slightly postpones Phase 2. At the time of Phase 2, the growth rate of

low-skill wages is not affected much by the discount rate: on one hand, since low-skill

wages are lower Phase 2 is postponed, which favor low-skill wages’ growth, but on the

other hand, horizontal innovation is lower which negatively affects low-skill wages. A

lower elasticity of intertemporal substitution (a higher θ) has a similar effect on the

economy’s growth rate (Figures 19.D,E,F). Figures 19.G,H,I show that the productivity

of machines (ϕ) only affects the timing of Phase 2 (Phase 2 occurs sooner when machines

are more productive).

100

0 100 200 300 400

2

4

6

Year

Per

cen

tPanel A: Growth rate of high−skill wages

η=0.4baseline: η=0.2η=0.1

0 100 200 300 4000

2

4

Year

Per

cen

t



0 100 200 300 4000

50

100

Year

Per

cen

t



0 100 200 300 400

2

4

6

Year

Per

cen

t


κ=0.75baseline: κ=0.5κ=0.25

0 100 200 300 4000

2

4

YearP

er c

ent



0 100 200 300 4000

50

100

Year

Per

cen

t



0 100 200 300 4002

4

6

8

Year

Per

cen

t


κ=0.49κ=0.25baseline κ=0

0 100 200 300 400−1

0

1

2

3

Year

Per

cen

tPanel H: Growth rate of low−skill wages


0 100 200 300 4000

50

100

Year

Per

cen

t



Figure 20: Comparative statics with respect to the automation productivity (η), the concav-ity of the automation technology (κ) and the automation externality (κ)

The comparative statics with respect to the automation technology shown in Figures

20.A,B,C follow the pattern described in the text. A less concave automation technology

(higherκ) delays Phase 2 and reduces the economy’s growth rate. It particularly affects

the growth rate of low-skill wages in Phase 2 (as the increase in automation expenses

comes more at the expense of horizontal innovation)—see Figures 20.D,E,F. The role

of the automation externality has already been discussed in the text, Figures 20.G,H,I

reveal that for a mid-level of the automation externality (κ = 0.25), the economy looks

closer to the economy without the automation externality than to the economy with a

large automation externality.

101

0 100 200 300 400

2

4

6

Year

Per

cen

t


γ=0.35baseline: γ=0.3γ=0.25

0 100 200 300 4000

2

4

Year

Per

cen

t



0 100 200 300 4000

50

100

Year

Per

cen

t



0 100 200 300 400

2

4

6

8

Year

Per

cen

t


H=1/2 L=1/2baseline: H=1/3 L=2/3H=1/4 L=3/4

0 100 200 300 4000

2

4

YearP

er c

ent


H=1/2 L=1/2baseline: H=1/3 L=2/3H=1/4 L=3/4

0 100 200 300 4000

50

100

Year

Per

cen

t

Panel F: Share of automated products G

H=1/2 L=1/2baseline: H=1/3 L=2/3H=1/4 L=3/4

Figure 21: Comparative statics with respect to the horizontal innovation productivity (η)and the skill ratio (H/L)

Figures 21.A,B,C show the impact of the horizontal innovation parameter γ, which

was already discussed in the text. Figures 21.D,E,F show that a higher ratio H/L nat-

urally leads to a higher growth rate, which implies that Phase 2 occurs sooner. Figures

22.A,B,C show that a higher initial number of products simply advance the entire evo-

lution of the economy. Figures 22.D,E,F show that a higher initial value for the share

of automated products (even as high as the steady-state value G∗) barely affects the

evolution of the economy, the share of automated products initially drops quickly as

there is little automation to start with.

7.8 Social planner problem

This section presents the solution to the social planner problem. After having set-up the

problem, we derive the optimal allocation, emphasizing in particular the different inef-

ficiencies in our competitive equilibrium. Then, we show the optimal allocation for our

baseline parameters. Finally, we derive how the optimal allocation can be decentralized.

7.8.1 Characterizing the optimal allocation

We introduce the following notations: NAt (respectively NN

t ) denotes the mass of auto-

mated (respectively non-automated) firms, LAt (respectively LNt ) is the mass of low-skill

workers hired in automated (respectively non-automated) firms, and HP,At (respectively

HP,Nt ) is the mass of high-skill workers hired in production in automated (respectively

non-automated) firms. The social planner problem can then be written as (we write the

102

0 100 200 300 400Year

1

2

3

4

5P

er c

ent

Panel A: Growth rate of high-skill wages

N(1)=2baseline: N(1)=1N(1)=0.5

0 100 200 300 400Year

0

1

2

3

Per

cen

t

Panel B: Growth rate of low-skill wages

N(1)=2baseline: N(1)=1N(1)=0.5

0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t


N(1)=2baseline: N(1)=1N(1)=0.5

0 100 200 300 400Year

1

2

3

4

5

Per

cen

t

Panel D: Growth rate of high-skill wages

G(1)=G ss

G(1)=0.3baseline: G(1)=0.001

0 100 200 300 400Year

0

1

2

3

Per

cen

t

Panel E: Growth rate of low-skill wages

G(1)=G ss

G(1)=0.3baseline: G(1)=0.001

0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t

Panel F: Share of automated products G

G(1)=G ss

G(1)=0.3baseline: G(1)=0.001

Figure 22: Comparative statics with respect to the initial number of products N0 and theinitial share of automated products G0

Lagrange multipliers next to each constraint):

max

∫ ∞0

e−ρtC1−θt

1− θ

such that

λt : Ct +Xt = F(LAt , H

P,At , Xt, L

Nt , H

P,Nt , NA

t , NNt

),

with

F ≡

(NAt

) 1σ

((ϕX

ε−1ε

t +(LAt) ε−1

ε

) εε−1

β (HP,At

)1−β)σ−1

σ

+(NNt

) 1σ

((LNt)β (

HP,Nt

)1−β)σ−1

σ

σσ−1

,

wt : LAt + LNt = L,

vt : HP,At +HP,N

t +HAt +HD

t = H,

ζt :·NN

t = γ(NAt +NN

t

)HDt − η

(NAt

)κ (NNt +NA

t

)κ−κ (HAt

)κ (NNt

)1−κ,

ξt :·NA

t = η(NAt

)κ (NNt +NA

t

)κ−κ (HAt

)κ (NNt

)1−κ,

103

HDt ≥ 0.

The first order condition with respect to Ct gives

C−θt = λt

To denote the ratio of the Lagrange parameter of each constraint with respect to λt

(that is the shadow value expressed in units of final good at time t), we remove the tilde

(hence wLt ≡ wLt/λt is the shadow wage of low-skill workers).

The first order conditions with respect to Xt implies that

∂F

∂Xt

= 1, (131)

so that the shadow price of a machine must be equal to 1. First order conditions with

respect to LAt , LNt , HP,At , HP,N

t lead to

wLt =∂F

∂LAt=

∂F

∂LNtand wHt =

∂F

∂HP,At

=∂F

∂HP,Nt

, (132)

so that labor inputs are paid their marginal product in aggregate production. This is

not the case in the competitive equilibrium, where labor inputs are paid their marginal

product in the production of intermediates, while intermediates themselves are priced

with a mark-up as they are provided by a monopolist. It is easy to show that for a given

HPt , the optimal provision of machines and allocation of high-skill and low-skill workers

across firms can be obtained if the purchase of all intermediate inputs is subsidized by

at rate 1/σ (a lump-sum tax finances the subsidy).

The first-order conditions with respect to NNt and NA

t are given by:

ρζt −·

ζt =λt

∂F∂NN

t+ ζtγH

Dt +

(ξt − ζt

)η(HAt

)κ (NNt

)−κ×(NAt

)κ ((1− κ)NN

t + (1− κ)NAt

) (NNt +NA

t

)κ−κ−1, (133)

ρξt −·

ξt =λt

∂F∂NA

t+ ζtγH

Dt +

(ξt − ζt

)η(HAt

)κ×(NNt

)1−κ (NAt

)κ−1 (κNN

t + κNAt

) (NNt +NA

t

)κ−κ−1. (134)

Interestingly, ∂Ft∂NN

tand ∂F

∂NAt

correspond to the profits realized by a non-automated and

an automated firm respectively in the equilibrium once the subsidy to the use of inter-

104

mediates is implemented. Therefore we denote

πNt =∂Ft∂NN

t

and πAt =∂Ft∂NA

t

Further the (shadow) interest rate is given by rt = ρ + θ·CtCt

= ρ −·λtλt

. Using that

HAt = (1−Gt)Nth

At , we can rewrite (133) and (134) as:

rtζt = πNt + ζtgNt + (ξt − ζt) η (Gt)

κNκt

(hAt)κ

((1− κ) (1−Gt) + (1− κ)Gt) +·ζt, (135)

rtξt = πAt + ζtgNt + (ξt − ζt) η (Gt)

κNκt

(hAt)κ

(1−Gt)

(κ

1−Gt

Gt

+ κ

)+·ξt. (136)

These expressions parallel equations (19) and (20) in the paper. The rental social value

of a non-automated firm (rtζt) consists of the current value of one intermediate (which

equals the profits when the optimal subsidy to the use of intermediates inputs is in

place), its positive impact on the horizontal innovation technology (the productivity of

which is γNt), its positive impact on the automation technology (which results from

the direct externality embedded in the automation technology from the number of firms

diminished by the additional externality coming from the share of automated products),

the expected increase in its value if it becomes automated minus the cost of the resources

required (the difference between these two terms is positive since the automation technol-

ogy is concave) and the change in its value. The rental social value of an automated firms

(rtξt) is the sum of the profits, its impact on horizontal innovation (through the same

externality as non-automated firm), its impact on the automation technology (which

results from two externalities as both the number of firms and the share of automated

products improve the automation technology), and the change in its value.

The first order condition with respect to HDt gives (together with the condition that

HDt ≥ 0):

wHt ≥ ζtγNt, (137)

with equality when HDt > 0. This equation is the counterpart of (23) in the equilibrium

case, it stipulates that when horizontal innovation takes place the social value of a non-

automated intermediate equals the cost of creating one. The first-order condition with

respect to HAt gives:

wHt = (ξt − ζt)κη (Gt)κNκ

t

(hAt)κ−1

. (138)

105

This equation is the counterpart of (21) in the equilibrium case. Everything else given,

ξt − ζt increases with πAt − πNt , which increases with wLt, therefore this equation shows

that automation increases with low-skill wages (everything else given), just as in the

equilibrium case.

7.8.2 System of differential equations and steady state

After having introduced the same variables as in the equilibrium case, one can follow

the same steps and derive a system of differential equation in(nt, Gt, h

At , χt

)which

characterizes the solution (when there is positive growth). Equations (31) and (32) still

hold, while equations (39) and (40) are replaced with

·

h

A

t =

γhAt1−κ

(ωtnt

(ϕ+ (ωtnt)

1µ

)−µπAtvt

+ 1−κ+(κ−κ)(1−Gt)κ

hAt

)−ηκGκt

1−κ

(htA)κ(

1− ωtnt(ϕ+ (ωtnt)

1µ

)−µ)πAtvt

+ ηGκt

(htA)κ+1

+ 1−κ1−κg

Nt h

At

,

(139)·χt = χt

(γωtnt

(ϕ+ (ωtnt)

1µ

)−µ πAtvt

+ γ1− κ+ (κ− κ) (1−Gt)

κhAt − ρ− (θ − 1)ψgNt

).

gNt is still given by (48),πAtvt

, HPt and ωt are now given by

πAtvt

=ψ(ϕ+ (ωtnt)

1µ

)µHPt

Gt

(ϕ+ (ωtnt)

1µ

)µ+ (1−Gt)ωtnt

,

HPt =

(1− β)1θ β

β1−β ( 1

θ−1)χ

1θt

(Gt

(ϕ+ (ωtnt)

1µ

)µ+ (1−Gt)ωtnt

)ψ( 1θ−1)+1

Gt

((1− β)ϕ+ (ωtnt)

1µ

)(ϕ+ (ωtnt)

1µ

)µ−1

+ (1−Gt)ωtnt

,

ωt =

β1

1−β HPt

L

(Gt

(ϕ+ (ωtnt)

1µ

)µ−1

(ωtnt)1−µµ + (1−Gt)

)×(Gt

(ϕ+ (ωtnt)

1µ

)µ+ (1−Gt)ωtnt

)ψ−1

β(1−σ)

1+β(σ−1)

,

which replace (45), (47) and (49).

One can then solve for a steady state of this system with G∗ > 0 (and(gN)∗> 0 so

106

that n∗ = 0). (107) and (109) still apply, but (108) is replaced with

hA∗ =κ

γ

ρ+ (θ − 1)ψgN∗

1− κ+ (1−G∗) (κ− κ), (140)

and (111) with

f sp(gN∗)≡ ρ+ (θ − 1)ψgN∗

ψHP∗

(hA∗)1−κ

ηκ (G∗)κ−1+

1

γ

,

which is obtained by fixing·

h

A

t = 0 in (139) using (107) and (140). For gN∗ large enough

(but finite—and, in particular smaller than γH), HP∗ is arbitrarily small, while for the

same value G∗ and hA∗ are bounded below and above. As before, this establishes that

for gN∗ large enough f sp(gN∗)> 1. Furthermore f sp (0) = f (0), therefore condition 25

is also a sufficient condition for the existence of a steady state with positive growth and

G∗ > 0 for the system of differential equations.

7.8.3 Decentralizing the optimal allocation

We have already seen that the “static” optimal allocation given HPt is identical to the

equilibrium allocation once a subsidy to the use of intermediates 1/σ is in place. The

“dynamic” part of the problem consists of the allocation of high-skill workers across the

two types of innovation and production. Therefore, we postulate that a social planner

can decentralize the optimal allocation using the subsidy to the use of intermediate

inputs and subsidies (or taxes) for high-skill workers hired in automation(sAt)

and in

horizontal innovation(sHt). Let us consider such an equilibrium and introduce the

notations ΩAt ≡ 1− sAt and ΩH

t similarly defined. In this situation, the law of motion for

the private value of an automated firm, V At , is still given by (19), for a non-automated

firm it obeys:

rtVNt = πNt − ΩA

t wHtht + η (Gt)κNκ

t

(hAt)κ (

V At − V N

t

)+·VN

t , (141)

instead of (20), the first-order condition for automation is given by:

κη (Gt)κNκ

t

(hAt)κ−1 (

V At − V L

t

)= ΩA

t wHt, (142)

107

instead of (21), while the free entry condition, when gNt > 0, is given by

γNtVNt = ΩH

t wHt, (143)

instead of (23). For ΩAt and ΩH

t to decentralize the optimal allocation it must be that

these 4 equations hold together with (135), (136), (137) and (138).

Using (137) and (143), we then get that ΩHt must satisfy

ΩHt ζt = V N

t , (144)

similarly, using (138) and (142), we get

ΩAt (ξt − ζt) = V A

t − V Lt . (145)

Plugging (144) and (145) in (141), we get that

rtζt =πNtΩHt

− ΩAt

ΩHt

wHtht + η (Gt)κNκ

t

(hAt)κ ΩA

t

ΩHt

(ξt − ζt) +

·ΩH

t

ΩHt

ζt +·ζt. (146)

Similarly, using (145) and the difference between (19) and (141) gives:

rt (ξt − ζt) =πAt − πNt

ΩAt

+wHtht−η (Gt)κNκ

t

(hAt)κ

(ξt − ζt)+

·ΩA

t

ΩAt

(ξt − ζt)+·ξt−

·ζt. (147)

Combining (146) with (135), using (138) and (137) and the definition of ΩAt and ΩH

t , we

get:

·sH

t =γπNtvtsHt −

(1− sHt

)gNt

+γhAtκ

((1− sAt

)(1− κ) +

(1− sHt

)(κ (1−Gt) + κGt − 1)

) . (148)

Similarly combining (147) with the difference between (136) and (135) and using

(137) gives:

·sA

t

(hAt

)1−κ

ηGκt

= κπAt − πNt

vtsAt − κ

(1− sAt

)hAt

1−Gt

Gt

. (149)

108

Therefore, in steady state, we have

sA∞ =κhA∞ (1−G∞)

κψHP∞ + κhA∞ (1−G∞)

≥ 0.

Note from (149) that the share of automated products, sAt , must always be non-negative,

otherwise it cannot converge to a positive value, therefore sAt ≥ 0 everywhere (and in

fact > 0 if κ 6= 0). Furthermore, if κ = 0, sAt = 0 everywhere, the only externality

in automation comes from the total number of products, therefore the equilibrium fea-

tures the optimal amount of automation investment (when the monopoly distortion is

corrected and the optimal subsidy to horizontal innovation is implemented).

(148) gives the steady state value of the subsidy to horizontal innovation as:

sH∞ = 1−γhA∞ (1− κ)

(1− sA∞

)κgN∞ + γhA∞ (1− κ (1−G∞)− κG∞)

.

In addition, knowing that sAt > 0, imposes that sHt > 0—as sHt < 0 would lead to·sH

t < 0.

7.8.4 Transitional dynamics for the social planner case

Figure 23 plots the transitional dynamics for the optimal allocation in our baseline case

(which features κ = 0) and in the case where κ = 0 analyzed in Figure 7. As shown

in Panel A and C, the economy also goes through three phases as a higher (shadow)

low-skill wage leads to more automation over time and a transition from a small share

to a high share of automated products. Relative to Figure 3.A and Figure 7.A, the

overall dynamics look quite similar but the growth rates are higher in the social planner

case, and the transition to phase 2 now happens roughly at the same time with and

without the automation externality, while in the equilibrium it is considerably delayed

in the presence of the externality (as, effectively, the productivity of the automation

technology is initially very low). In both cases, the social planner maintains a positive

subsidy to horizontal innovation. When κ = 0 (without the automation externality), the

subsidy to automation is 0, while when κ > 0 there is a positive subsidy to automation,

which is the largest in Phase 1. This subsidy explains why Phase 2 now starts at around

the same time.

109

0 100 200 300 400Year

0

2

4

6

8

Per

cen

t

Without the Automation ExternalityPanel A: Growth Rates of Wages and GDP

gGDP

gwL

gwH

0 100 200 300 400Year

0

2

4

6

8

Per

cen

t

With the Automation ExternalityPanel B: Growth Rates of Wages and GDP

gGDP

gwL

gwH

0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t

Panel C: Research Subsidies

automationhorizontal innovation

0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t

Panel D: Research Subsidies

automationhorizontal innovation

Figure 23: Transitional Dynamics in the Social Planner Case. Panel A and B, baseline case.Panel C and D, with κ = 0.5

7.9 Alternative model with automation at the entry-stage

To highlight that the evolution of the economy through three phases does not depend on

our assumption that new products are born non-automated, we present in this section a

model where, instead, we assume that automation can only take place at the entry stage.

That is, when a new firm is born, it can hire hAt workers to automate it, in which case

it is successful with probability min(η(Nth

At

)κ, 1)

(we abstract from the automation

externality for simplicity). Ex-ante a firm does not know whether it will succeed or not,

therefore, the free-entry condition can now be written as

wHt ≥ γNtVt,

where

Vt = min(η(Nth

At

)κ, 1)V At +

(1−min

(η(Nth

At

)κ, 1))V Nt − wHthAt .

is the expected value of a new firm. Since we used similar functional forms we have that

hAt obeys (21) unless κη1κNt

(V At − V L

t

)> wHt, in which case Nth

At = η−

1κ . Afterward a

110

firm never becomes automated so that the law of motion for the value of an automated

and a non-automated firms both follow (19). In addition, the law of motion for Gt is

now given by·Gt = gNt

(η(Nth

At

)κ −Gt

).

The resolution of the model follows the same steps as in the baseline case, and under

the appropriate condition on the discount rate, there exists an asymptotic steady state

with gNt > 0.

An important difference is that G∗ may be equal to 1 since all new products may

choose to be automated in steady state. In fact, one can derive that hA∗ = min(η−

1κ , κ

1−κ1γ

).

Therefore G∗ < 1, if and only if η(

κ1−κ

1γ

)κ< 1. When G∗ < 1, we will have that

G∞ = G∗ < 1, so that, following Proposition 2,

gwL∞ =1

1 + β (σ − 1)gwH∞ .

On the contrary, if G∗ = 1, then G∞ = 1, and following Proposition 2, we get that

gwL∞ = gwH∞ /ε.

Figure 24 draws the transitional dynamics for the same parameters as in the baseline

case (even though the automation technology parameters have a different meaning here).

These parameters satisfy η(

κ1−κ

1γ

)κ< 1, and the figure shows that the economy goes

through three phases as in our baseline model.

7.10 Supply response in the skill distribution: details

The supply of low-skill and high-skill labor are now endogenous. This does not affect

(11) which still holds. (10) also holds with Lt replacing L and knowing that HPt obeys

(18) but with Ht instead of H in the right-hand side. Because workers are ordered such

that a worker with a higher index j supplies relatively more high-skill labor, then at all

point in times there exists a threshold jt such that workers j ∈(0, jt

)supply low-skill

labor and workers j ∈(jt, 1

)supply high-skill labor. As a result, we get that the total

mass of low-skill labor is:

Lt = lHjt, (150)

111

0 100 200 300 400Year

0

0.5

1

1.5

2

2.5

3

3.5

4

Per

cen

tGrowth Rates of Wages and GDP

gGDP

gwL

gwH

0 100 200 300 400Year

0

2

4

6

8

10

12

14

16

18

Per

cen

t


0 100 200 300 400Year

0

20

40

60

80

100

Per

cen

t

Research Expenditures and G

G (right)

Figure 24: Transitional Dynamics for an alternative model where automation only happensat entry. Baseline parameters.

and the mass of high-skill labor is

Ht = H

(1− j

1+qq

t

)≤ H. (151)

The cut-off jt obeys lHwLt = Γ(jt)wHt, that is

jt =

(q

1 + q

lwLtwHt

)q. (152)

jt decreases as the skill premium increases and q measures the elasticity of jt with respect

to the skill premium.

7.10.1 Asymptotic growth rates

We consider processes(Nt, Gt, H

Pt

)such that gNt , Gt and HP

t admit strictly positive

limits. Plugging (152) and (150) in (10), we get:

wHtwLt

= l

(1− ββ

H

HPt

(q

1 + q

)q Gt + (1−Gt)(1 + ϕwε−1

Lt

)−µGt

(1 + ϕwε−1

Lt

)−1+ (1−Gt)

(1 + ϕwε−1

Lt

)−µ) 1

1+q

,

(153)

112

which together with (11) determines wHt and wLt for given(Nt, Gt, H

Pt

). From then on

the reasoning follows that of Appendix 7.2.1. First, we derive that wL∞ > 0, such that

gwH∞ = gGDP∞ = ψgN∞, and that we must have gwL∞ < gwH∞ , such that j∞ = 0. Second, we

study the asymptotic behavior of wLt both when ε <∞ and when ε =∞.Case with ε <∞. Plugging (153) in (11) gives wLt in function of Nt, Gt and HP

t :

wLt =

σ−1σβ

1+βq1+q

((1− β)1+q

q

) (1−β)q1+q 1

l1−β

(HPt

H

) 1−β1+q

N1

σ−1

×(Gt(1+ϕwε−1

Lt )µ−1

+(1−Gt)) 1−β

1+q

(Gt(ϕwε−1Lt +1)

µ+1−Gt)

11−σ+

1−β1+q

, (154)

which replaces (59). It is direct that when G∞ < 1, we obtain (50). In this case, we

further have

gj∞ = q (gwL∞ − gwH∞ ) = − qβ (σ − 1)

1 + q + β (σ − 1)gGDP∞ . (155)

Case with ε =∞. In this case, (154) becomes

wLt =

σ−1σβ(

1+qq

)(1−β)q (HPt

lH

)1−βN

1σ−1 (1−Gt)

1−β1+q

×(Gt (ϕwLt)

β(σ−1) + (1−Gt)) 1σ−1− 1−β

1+q, if wLt > ϕ−1,

wLt =σ − 1

σβ

(1 + q

q

)(1−β)q (HPt

lH

)1−β

N1

σ−1 , if wLt < ϕ−1.

Once again, following the steps of Appendix 7.2.1, we get that if G∞ < 1, (50) applies

(and accordingly we also get (155)).

7.10.2 Dynamic system

It is convenient to redefine nt ≡ N−β

(1−β)(1+q)

1+q+β(σ−1)

t , we can then write the entire dynamic

system as a system of differential equations in(nt, Gt, h

At , χt

)with two auxiliary vari-

ables ωt and jt ≡ jtn− q

1+q

t . Equations (31) is now given by

·nt = − β

1− β1 + q

1 + q + β (σ − 1)gNt nt,

113

(32), (39), (40), (45), (47) still apply and equation (48) as well provided that H is

replaced by Ht given by (151). ωt is implicitly defined by:

ωt =

(σ−1σ

) 1+q1−β β

1+βq1−β

((1− β)1+q

q

)qHPt

l1+qH

(Gt

(1 + ϕ (ωtnt)

− 1µ

)µ−1

+ (1−Gt)

)×(Gt

(ϕ+ (ωtnt)

1µ

)µ+ (1−Gt)ωtnt

)ψ(1+q)−1

β(1−σ)

1+q+β(σ−1)

,

which replaces (49) and is a rewriting of (154) and jt is given by

jt =

ωt q

1 + q

β

1− β

Gt

(1 + ϕ (ωtnt)

− 1µ

)µ−1

+ 1−Gt

Gt

(ϕ+ (ωtnt)

1µ

)µ+ (1−Gt)ωtnt

HPt

H

q

1+q

which is derived using (152) and (153).

The steady state for this system involves n∗ = 0 and therefore ω∗and j∗ are positive

constant (so that j∗

= 0: in steady-state all workers are high-skill). As a result H∗ = H,

so that the steady state values of(gN∗, G∗, hA∗, χ∗

)are identical to the baseline case

with H replacing H.

7.11 Machines as a capital stock

7.11.1 Set-up

To avoid repetitions, we already include the taxes of section 4.2, namely, we assume

that there is a tax τm on the rental rate of equipment and a tax τa on high-skill workers

in automation innovations. The solution follows similar steps to the baseline case. We

denote by rt the gross rental rate of machines and by ∆ their depreciation rate, such

that:

rt = rt + ∆. (156)

The Euler equation (24) still applies and the capital accumulation equation is given by

(30). The unit cost of intermediate input i is now given by

c (wL, wH , r, α (i)) =

(w1−εL + α(i)ϕ

(r1−β4wβ4H

)1−ε) β1

1−ε

wβ2H rβ3

ββ11 ββ22 β

β33

(157)

114

instead of (4) where ϕ ≡ ϕε(ββ44 (1− β4)1−β4

)ε−1

. Define µ ≡ β1 (σ − 1) / (ε− 1), we

can then derive the isocost curve as:

N1

1−σσ

σ − 1

wβ2H rβ3

ββ11 ββ22 β

β33

(G

(ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ+ (1−G)w

β1(1−σ)L

) 11−σ

= 1.

(158)

The same steps as before allows us to obtain the relative demand for high-skill versus

low-skill workers as:

wHHP

wLL(159)

=

G

(β2 +

β1β4ϕ(

((1+τm)r)1−β4wβ4H

)1−εw1−εL +ϕ

(((1+τm)r)1−β4w

β4H

)1−ε)(

ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ+ β2 (1−G)w

β1(1−σ)L

β1

(Gw1−ε

L

(ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ−1

+ (1−G)wβ1(1−σ)L

) .

Similarly, taking the ratio of income going to high-skill workers in production over income

going to machines owners, we obtain a relationship linking the gross rental rate of capital

and high-skill wages:

rK

wHHP(160)

=

G

(β3 +

β1(1−β4)ϕ(


)1−ε(1+τm)

(w1−εL +ϕ

(((1+τm)r)1−β4w

β4H

)1−ε))(

ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ+β3 (1−G)w

β1(1−σ)L

G

(β2 +

β1β4ϕ(


)1−εw1−εL +ϕ

(((1+τm)r)1−β4w

β4H

)1−ε)(

ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ+ β2 (1−G)w

β1(1−σ)L

.

7.11.2 Effect of technology on wages

First note that one can rewrite (159)

wHHP

wLL=G(β2 + β1β4

Φ1+Φ

)(Φ + 1)µ + β2 (1−G)

β1

(G (Φ + 1)µ−1 + (1−G)

) , (161)

115

where we defined

Φ ≡ ϕ

(wL

((1 + τm) r)1−β4 wβ4H

)ε−1

= ϕ

((wL

(1 + τm) r

)1−β4 (wLwH

)β4)ε−1

.

In (161), the RHS is increasing in wL and decreasing in wH/wL for given G, r. Therefore,

this equation defines the relative demand curve in the wL, wH space as rotating counter-

clockwise (when G > 0) when wL increases. Plugging (161) in (158) then defines wL

uniquely as a function of N , G, r and HP . We can then derive the effect of changes in

G and N for given HP and r (i.e. when K is perfectly elastically supplied) on wages,

the skill premium and the labor share as follows:

Proposition 10. Consider the equilibrium (wL, wH) determined by equations (161) and

(158). Assume that ε <∞, it holds that

A) An increase in the number of products N (keeping G and HP constant) leads to

an increase in both high-skill (wH) and low-skill wages (wL). Provided that G > 0, an

increase in N also increases the skill premium wH/wL and decreases the labor share for

H ≈ HP .

B) An increase in the share of automated products G (keeping N and HP constant)

increases the high-skill wages wH , the skill premium wH/wL and decreases the labor

share for H ≈ HP . Its impact on low-skill wages is ambiguous.

Proof. One can rewrite (158) as:

N1

1−σσ

σ − 1

rβ3

ββ11 ββ22 β

β33

(wHwL

)β2wβ2L

G

(ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ+ (1−G)w

β1(1−σ)L

1

1−σ

= 1.

(162)

Using that (161) establishes wHwL

as an increasing function of wL otherwise independent

of N , we get that (162) implies that wL and therefore wH/wL (when G > 0) and wH

itself must increase in N .

(161) also establishes that wHwL

increases in G for a given wL. Therefore if wL is

increasing in G, then it is direct that wHwL

and wH both also increase in G. Assume on

the contrary that wL decreases in G, then in (158) the direct effect of an increase in G

is to decrease the LHS (because

(ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ> w

β1(1−σ)L ), in

addition an increase in G would reduce wL which further reduces the LHS. To maintain

116

the inequality, it must be that wH increases. Therefore in this case too, wH increases in

G and so does wH/wL.

This model is isomorphic to the previous one when β4 = β3 = 0 (with ϕ ((1 + τm) r)1−ε

replacing ϕ), therefore the impact of a change of G on wL is also ambiguous.

The labor share is now given by

LS =wLL+ wHH

Y + (1 + τa)wH (H −HP ).

As before profits are a share 1σ

of output so that

Y =σ

σ − 1

(wLL+ wHH

P + rK + Tm), (163)

where Tm denotes the tax proceeds from the tax on equipment. We have

TmwHHP

(164)

=

Gτmβ1(1−β4)ϕ

(((1+τm)r)1−β4w

β4H

)1−ε(1+τm)

(w1−εL +ϕ

(((1+τm)r)1−β4w

β4H

)1−ε)(ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µG

(β2 +

β1β4ϕ(


)1−εw1−εL +ϕ

(((1+τm)r)1−β4w

β4H

)1−ε)(

ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ+ β2 (1−G)w

β1(1−σ)L

.

Then, we obtain:

LS =wLL+ wHH

σσ−1

(wLL+ wHHP + rK + Tm) + (1 + τa)wH (H −HP ). (165)

Assume that H = HP , then we get that

LS =σ − 1

σ

(1 +

rK + TmwLL+ wHH

)−1

.

117

Using (159), (160), (164) and the definition of Φ, we obtain:

rK + TmwLL+ wHH

(166)

=G(β3 + β1 (1− β4) Φ

1+Φ

)(Φ + 1)µ + β3 (1−G)

G(β2 + β1β4 + β1

(1−β4)Φ+1

)(Φ + 1)µ + (β1 + β2) (1−G)

=β3

β2 + β1

+1

β2 + β1

G (1− β4) β1

G(β2 + β1β4 + β1

(1−β4)Φ+1

)+ (β1 + β2) (1−G) (Φ + 1)−µ

Φ

1 + Φ.

This expression is increasing in Φ. From (161), Φ moves like wH/wL, therefore the labor

share decreases in N (the opposite of wH/wL) when H ≈ HP (this result may not extend

if HP is far from H when β4 is close to 1).

Further, we can rewrite (161) as:

wHHP

wLL=β2

β1

+(β2 + β1β4)

β1

GΦ (Φ + 1)µ−1

G (Φ + 1)µ−1 + (1−G).

We have already derived that an increase inG increases wH/wL, therefore, this expression

shows that it will increase GΦ(Φ+1)µ−1

G(Φ+1)µ−1+(1−G). We can then rearrange terms in (166) and

write:

rK + TmwLL+ wHH

=β3

β2 + β1

+(1− β4) β1

β2 + β1

(β1β4 + β2 + (β2 + β1)

G (Φ + 1)µ−1 + (1−G)

GΦ (1 + Φ)µ−1

)−1

.

The right hand side is an increasing function of GΦ(Φ+1)µ−1

G(Φ+1)µ−1+(1−G), which ensures that the

labor share decreases in G when H ≈ HP .

7.11.3 Asymptotic behavior

The asymptotic behavior is in line with Proposition 2 but the fact that automation

now replaces low-skill workers with a Cobb-Douglas aggregate of capital and high-skill

workers limit the ratio between the growth rate of high-skill and low-skill wages. In

addition, we here need to consider the long-run behavior of the gross rental rate r. Since

r is determined by the Euler equation, then on a path where consumption growth is

asymptotically constant, then r is also asymptotically constant (see (156)). We focus on

118

the case where G∞ ∈ (0, 1) (although results analogous to those in Proposition 2 could

be derived when G∞ ∈ 0, 1) and prove:

Proposition 11. Consider four processes [Nt]∞t=0, [Gt]

∞t=0,

[HPt

]∞t=0

and [rt]∞t=0 where(

Nt, Gt, HPt , rt

)∈ (0,∞) × [0, 1] × (0, H] × (0,∞) for all t. Assume that Gt, g

Nt , HP

t

and rt all admit positive and finite limits with G∞ ∈ (0, 1). Then the asymptotic growth

rate of high-skill wages wHt and output Yt are

gwH∞ = gY∞ = gN∞/ [(σ − 1) (β2 + β1β4)] , (167)

and the asymptotic growth rate of low-skill wages is

gwL∞ =1 + (σ − 1) β1β4

1 + (σ − 1) β1

gwH∞ . (168)

Proof. For simplicity we assume that the limits gwH∞ , gwL∞ and gY∞ exist (although we could

show that formally as we did in Appendix 7.2.1). Suppose that gwL∞ ≤ β4gwH∞ . Then

Φt must either tend toward a positive constant or toward 0, in either case (161) implies

that gwL∞ = gwH∞ , which is a contradiction as β4 < 1. Hence it must be that gwL∞ > β4gwH∞ ,

which ensures that Φt →∞. Using this in (158), we obtain:

wβ2+β1β4Ht ∼

t→∞

σ − 1

σ

ββ11 ββ22 β

β33 (G∞ϕ)

1σ−1

(1 + τm)(1−β4)β1 rβ3+(1−β4)β1∞

N1

σ−1

t .

This establishes gwH∞ = gN∞/ [(σ − 1) (β2 + β1β4)], from which we can obtain that gY∞ =

gwH∞ = gN∞/ [(σ − 1) (β2 + β1β4)] (using that HPt admits a positive limit).

Moreover (161) now implies

wHtHP∞

wLtL∼t→∞

G∞ (β2 + β1β4) Φµt

β1 (1−G∞),

=⇒ w1+β1(σ−1)Lt ∼

t→∞

β1 (1−G∞) (1 + τm)(1−β4)β1(σ−1) r(1−β4)β1(σ−1)∞ HP

∞G∞ (β2 + β1β4)ϕµL

w1+β4β1(σ−1)Ht ,

which implies (168). Since 1+(σ−1)β1β41+(σ−1)β1

> β4, we verify that gwL∞ > β4gwH∞ .

7.11.4 Dynamic equilibrium

We can solve for the dynamic equilibrium as in the baseline model. The long-run

elasticity of output with respect to the number of products is now given by ψ ≡

119

1/ [(σ − 1) (β2 + β1β4)]. We then introduce the same normalized variables as in the

baseline model: V At , V N

t , πAt , πNt , hAt , ct and vt. We also introduce Yt ≡ YtN−ψ and

Kt ≡ KtN−ψ. Finally we now define

nt ≡ N− 1−β4

1+β1(σ−1)β1

β2+β1β4t

and

ωt ≡

(N

1−β4(σ−1)(β2+β1β4)(1+β1(σ−1))

t

r1−β4t wβ4HtwLt

)β1(σ−1)

,

so that (wL

r1−β4t wβ4H

)β1(1−σ)

= ωtnt.

The transitional dynamics can then be expressed as a system of differential equations

in xt ≡(nt, Gt, Kt, h

At , vt, ct

)where the first three variables are state variables and the

last three control variables.

Equation (41) still applies, therefore, we get using (157) that

πAt =(σ − 1)σ−1

σσ

(ϕ(

((1 + τm) rt)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ(wβ2H r

β3

ββ11 ββ22 β

β33

)1−σ

Yt.

We can rewrite this as

πAt =(σ − 1)σ−1

σσ

(ββ11 β

β22 β

β33

)σ−1 (r

(1−β4)β1+β3t vβ2+β4β1

t

)1−σ (ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)µYt.

(169)

We can derive πNt similarly and we find

πNt = ωtnt

(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)−µπAt (170)

(31) is now replaced by

·nt = − 1− β4

1 + β1 (σ − 1)

β1

β2 + β1β4

gNt nt. (171)

(32) still applies and so does (33). Because of the automation tax (34) is replaced by

(rt − (ψ − 1) gNt

)V Nt = πNt + ηGκ

t

(hAt

)κ (V At − V N

t

)− (1 + τa) vtht +

·

V Nt (172)

120

and (35) by

κηGκt

(hAt

)κ−1 (V At − V N

t

)= (1 + τa) vt. (173)

Combining (170), (172), (173) and (23) with equality, we now obtain:

·vt = vt

(rt −∆− (ψ − 1) gNt − γωtnt

(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)−µ πAtvt− γ (1 + τa)

1− κκ

hAt

).

(174)

Following the same steps as those used to derive (39), we now obtain:

·

hAt (175)

= γhAt

1− κ

(ωtnt

(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)−µ πAtvt

+ (1 + τa)1− κκ

hAt

)

−κηGκ

t

(hAt

)κ(1− κ) (1 + τa)

(1− ωtnt

(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)−µ) πAtvt

+ ηGκt

(hAt

)κ+1

+κ

1− κ

(η(hAt

)κ+1

Gκ−1t (1−Gt)− gNt hAt

)Further, (24) still applies and we can rewrite it as:

·ct =

ctθ

(rt −

(ρ+ ∆ + θψgNt

)). (176)

Finally, we can rewrite (30) as

·

Kt = Yt − ct −(∆ + ψgNt

)Kt (177)

Equations (171), (32), (174), (175), (176) and (177) form a system of differential

equations which depend on Yt, πAt , rt and gNt .

(158) implies

σ

σ − 1

vβ2+β4β1 rβ3+β1(1−β4)

ββ11 ββ22 β

β33

(G(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)µ+ (1−G)ωtnt

) 11−σ

= 1,

121

so that

r =

[σ − 1

σ

ββ11 ββ22 β

β33

vβ2+β4β1

(G(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)µ+ (1−G)ωtnt

) 1σ−1

] 1β3+β1(1−β4)

,

(178)

which defines r as a function of xt and ωt. (160) can be written as:

HPt =

rtKt

vt

Gt

(β2 + β1β4ϕ(1+τm)(1−β4)(1−ε)

(ωtnt)1µ+(1+τm)(1−β4)(1−ε)

)(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)µ+ β2 (1−Gt)ωtnt

Gt

(β3 + β1(1−β4)ϕ(1+τm)(1−β4)(1−ε)−1

(ωtnt)1µ+ϕ(1+τm)(1−β4)(1−ε)

)(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)µ+ β3 (1−Gt)ωtnt

,

(179)

which gives, together with (178), HPt as a function of xt and ωt. gNt still obeys (48),

which then defines it as a function of xt and ωt.

Combine (163), (159), (160) and (164) to obtain:

Y

wHHP

=

σσ−1

(G

(ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ+ (1−G)w

β1(1−σ)L

)G

(β2 +

β1β4ϕ(


)1−εw1−εL +ϕ

(((1+τm)r)1−β4w

β4H

)1−ε)(

ϕ(

((1 + τm) r)1−β4 wβ4H

)1−ε+ w1−ε

L

)µ+ β2 (1−G)w

β1(1−σ)L

,

which we can rewrite as

Yt =σ

σ − 1

[Gt

(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)µ+ (1−G)ωtnt

]vtH

Pt

G

(β2 + β1β4ϕ(1+τm)(1−β4)(1−ε)

ϕ(1+τm)(1−β4)(1−ε)+(ωtnt)1µ

)(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)µ+ β2 (1−G)ωtnt

.

(180)

This expression, with the previous equations, gives Yt as a function of xt and ωt. (169)

then ensures that πAt is defined as a function xt and ωt.

Finally, from (159) we obtain:

ωt =

(vtrt

)1−β4 HPt

Lβ1

(G(ϕ (1 + τm)(1−β4)(1−ε) (ωtnt)

− 1µ + 1

)µ−1

+ (1−G)

)G

(β2 + β1β4ϕ(1+τm)(1−β4)(1−ε)

(ωtnt)1µ+ϕ(1+τm)(1−β4)(1−ε)

)(ϕ (1 + τm)(1−β4)(1−ε) + (ωtnt)

1µ

)µ+ β2 (1−G)ωtnt

β1(1−σ)

1+β1(σ−1)

,

(181)

122

which implicitly defines ωt as a function of xt. Hence, together with (178), (179), (48),

(180), (169) and (181), the system formed by (171), (32), (174), (175), (176) and (177)

describes the dynamic equilibrium. We then obtain

Proposition 12. Assume that

ρ

((1 + τa)

κ

κκ (1− κ)1−κ η

(ρ

γ

)1−κ

+1

γ

)< ψH (182)

is satisfied, then the economy admits a steady-state(n∗, G∗, K∗, hA∗, v∗, c∗

)with n∗ = 0,

G∗ ∈ (0, 1) and gN∗ > 0. gN∗, G∗ and hA∗ are independent of τm.

Proof. As before, we directly get that in a steady-state with gN∗ > 0, we must have

n∗ = 0. (181) then implies that ω∗ is a constant defined by

ω∗ =

[(v∗

r∗

)1−β4 HP∗

L

β1 (1−G∗) (1 + τm)(1−β4)(σ−1)

G∗ (β2 + β1β4)ϕµ

] β1(1−σ)1+β1(σ−1)

.

This guarantees that in such a steady-state, wLt ∼ ω∗ 1β1(1−σ) r∗1−β4 v∗β4N

1+β4β1(σ−1)1+β1(σ−1)

1(σ−1)(β2+β1β4)

t

such that gwL∞ = 1+β4β1(σ−1)1+β1(σ−1)

gwH∞ as stipulated in Proposition 11.

In addition, (176) implies that in steady-state,

r∗ = ρ+ ∆ + θψgN∗. (183)

(179) implies that

HP∗ =r∗K∗

v∗(β2 + β1β4)

(β3 + β1 (1− β4)). (184)

Then (180) implies that

Y ∗ =σ

(σ − 1) (β2 + β1β4)v∗HP∗. (185)

We then get that (169) implies that

πA∗

v∗=

(σ − 1)σ−2

σσ−1

(ββ11 β

β22 β

β33

)σ−1

(β2 + β1β4)

(r∗(1−β4)β1+β3 v∗β2+β4β1 (1 + τm)(1−β4)β1

)1−σϕµHP∗.

(186)

123

(178) gives

r∗ =

[σ − 1

σ

ββ11 ββ22 β

β33

v∗β2+β4β1

(G∗ϕµ)1

σ−1

(1 + τm)(1−β4)β1

] 1β3+β1(1−β4)

. (187)

Therefore (186) simplifies intoπA∗

v∗= ψ

HP∗

G∗, (188)

just as in the baseline model. Then (174) and (183) together imply that

hA∗ =κ

γ (1 + τa) (1− κ)

(ρ+ ((θ − 1)ψ + 1) gN∗

). (189)

This defines hA∗ as an increasing function of gN∗. Further, in steady-state G∗ still obeys

(107) and HP∗ obeys (109), which imply that G∗ and HP∗ also be defined as function

of gN∗.

(188), (175), (107), (189) then lead to

1− κκ

γG∗ (1 + τa) hA∗

ψHP∗

(1 + τaκηGκ

t

(hA∗)1−κ

+1

γ

)= 1, (190)

which up to the term 1+τa is the same as (111) in the baseline case. Therefore following

the same reasoning, there exists a steady-state with gN∗ > 0 and G∗ ∈ (0, 1) as long as

(182) is satisfied. As (190), (107), (109) and (189) are independent of τm, so are gN∗,

hA∗ (now given by (189)), G∗ (given by (107)) and HP∗ (given by (109)).

We further obtain r∗ through (183), which must be independent of τm as well. We

then get v∗ through (187) as

v∗ =

[σ − 1

σ

ββ11 ββ22 β

β33

r∗β3+β1(1−β4)

(G∗ϕµ)1

σ−1

(1 + τm)(1−β4)β1

] 1β2+β4β1

.

We then get K∗ through (184) and c∗ from (177) which, using (185), implies:

c∗ =σ

(σ − 1) (β2 + β1β4)v∗HP∗ −

(∆ + ψgN

∗)K∗.

Further if τa = τm = 0, gN∗, G∗, hA∗ are determined by the same equations are in

the baseline model except that the definition of ψ has changed. It is then direct that

124

Proposition 4 extends to this case.

7.12 Quantitative Exercise

We choose parameters to minimize the log-deviation of predicted and observed variables

for the four time paths of the skill-premium, the labor-share of GDP, stock of equipment

over GDP and an index of GDP per hours worked. That is, for a given set of parameters

b the model produces predicted output of Yi =Yi,t

Tit=1

for each of these four paths from

1963 and until 2007 for the labor share, skill-premium, and GDP per hour, and 2000

for equipment over GDP (due to data limitations from Cummins and Violante, 2002).

We let Y (b) = Yi(b)4i=1 as the combined vector of these paths and make explicit the

dependency on the parameters b. Y is the corresponding vector of actual values. We

then solve :

minb(log(Y (b))− log(Y ))′W (log(Y (b))− log(Y )),

where W is a diagonal matrix of weights. In a previous version of the paper (Hemous

and Olsen, 2016) we articulated a stochastic version of our model by introducing auto-

correlated measurement errors. Here we choose a much simpler approach and simply

choose “reasonable” weights based on how easily the model matches the path. In partic-

ular, the diagonal elements are 4 for the skill-premium, though 10 for the first 5 years,

10 for the labor share, 1 for GDP/hours and 2 for equipment over GDP. For a given

starting value of b we then run 12 estimations based on“nearby”randomly chosen param-

eters. We choose the best fit of these 13 (12 plus the original starting point), take that

value as the next starting value and repeat the step. We continue this process until 100

steps (1200 nearby simulations) have not improved the fit. We do this for 10 (substan-

tially) different starting points. They all give the same result. There is little substantial

difference between the Bayesian approach taken previously and the one pursued here.

7.12.1 Data

We do not seek to match the skill-ratio H/L but take it as exogenously given. We

normalize H + L = 1, throughout. The skill-ratio is taken from Acemoglu and Autor

(2012). However, since our estimation requires a skill-ratio both before and after the pe-

riod 1963-2007 we match the observed path of the log of the skill-ratio to a “generalized”

125

logistical function of the form:

α

1 + exp(µ−ts

) + β,

where (α, β, µ, s) are parameters to be estimated. We use the observed skill-ratio in the

period 1963− 2007 and the predicted values outside of this time interval. Yet, the fit is

so good that there is no visual difference in the match of the four time periods between

this approach and using the predicted value in the interval 1963− 2007.

The skill-premium is taken directly from Acemoglu and Autor (2012) specifically the

data underlying their Figures 1 and 2 from David Autor’s website. The labor share is

taken from Koh, Santaeulalia-Llopis and Zheng (2016). We take GDP per hours worked

from the series on non-farm business from the BLS (series PRS85006092).

Capital equipment is calculated as follows. We follow Krusell et al. (2000) and use

quality-adjusted price indices of equipment from Cummins and Violante (2002) who

update the series from Gordon (1990). We combine two different series. First, we

use NIPA data on private investment in equipment excluding transportation equipment

(Tables 1.5 and 5.3.5. from NIPA). We iteratively construct an index for the stock

of private real capital equipment by assuming a depreciation rate of 12.5 per cent (as

Krusell et al., 2000) and using the price index for private equipment from Cummins

and Violante (2002). We start this approach in 1947 but only use the stock from 1963

onwards. We combine this with the growth rate of real private GDP to get an index

for equipment over GDP. We match this index to the NIPA private equipment capital

stock (excluding transportation) over (private) GDP number for 1963 to get a series in

absolute value. To this, we add software, but following the suggestion of Cummins and

Violante (2002), we use the NIPA data on the stock of software over GDP (table 2.1 from

NIPA). We add these two values to get our combined stock of equipment (+software)

over GDP.

126

the rise of the machines: automation, horizontal ... olsen rise of the... · the rise of the...

Documents