es„ma„on of dp models

33
Esmaon of DP Models ScPo CompEcon April , /

Upload: others

Post on 28-Apr-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Es„ma„on of DP Models

Es�ma�on of DP ModelsScPo CompEcon

April 20, 2017

1 / 32

Page 2: Es„ma„on of DP Models

Table of contents

Rust Bus Replacement

Berry, Levinsohn and Pakes (BLP) as MPEC

2 / 32

Page 3: Es„ma„on of DP Models

Rust Bus Replacement

Berry, Levinsohn and Pakes (BLP) as MPEC

3 / 32

Page 4: Es„ma„on of DP Models

Es�ma�on of Dynamic Programming Models

• Now that we know how to solve them, how do we es�mate DPmodels?• Examples

• Rust (1987)• Berry et al. (1995)• There are many different methods. We will introduce just a few.Look at the survey Aguirregabiria and Mira (2010) for moredetails.

4 / 32

Page 5: Es„ma„on of DP Models

Rust (1987)• Each Bus comes in once a month for repair

• Harold Zurcher decides a�er observing mileage xt since lastengine replacment and some other unobserved variable εwhether to replace or not:u(x + t, dt, θc, RC) =

{−c(xt, θc) if dt = 0−(RC + c(0, θc) if dt = 1

• He solves the DPVθ(xt) = sup

dt

E

{∞

∑j=t

βj−tu(xj, dj, θ) + εt(dt)|xt

}

• Parameters to be es�mated: θ = (θc, RC, θp)

• This formula�on results a�er making a set of simplifyingassump�ons.5 / 32

Page 6: Es„ma„on of DP Models

Rust (1987)

• To simplify, the odometer progress is assumed to be a randomprocess.• that is, xt evolves stochas�cally.• The assump�on is that xt+1 ∈ {s, s + 1, s + 2, s + 3} where s isthe state of xt, i.e. the bin it lies in.

6 / 32

Page 7: Es„ma„on of DP Models

Model and Data• Data: a �me series {xt, dt}T

t=1

• Likelihood func�on isL(θ) = ΠT

t=2P(dt|xt, θc, RC)p(xt|xt−1, dt−1, θp)

where the condi�onal choice probabili�es are given byP(dt|xt, θc, RC) =

exp[u(x, d, θc, RC) + βEVθ(x, d)]∑d′∈{0,1} exp[u(x, d′, θc, RC) + βEVθ(x′, d′)]

and, importantly, EV is the solu�on to

EVθ(x, d) =Tθ(EVθ)(x, d)

≡∫ ∞

x′=0log

(∑

d′∈0,1exp[u(x, d′, θc, RC) + βEVθ(x′, d′)]

)

7 / 32

Page 8: Es„ma„on of DP Models

Nested Fixed Point Algorithm (NXFP)

1 Outer Loop: Solve the Likelihood func�onmaxθ>0L(θ) = ΠT

t=2P(dt|xt, θc, RC)p(xt|xt−1, dt−1, θp)

2 Inner Loop: Compute Expected value func�on EVθ for a givenguess θEVθ = Tθ(EVθ)(x, d)

8 / 32

Page 9: Es„ma„on of DP Models

Poten�al Issues with NXFP

• We need a stopping rule for the likelihood func�on.• We need one for the inner loop as well.• Errors will propagate from the inner loop to the outer loop.• Given that the search direc�on on L(θ) depends on it’sgradient, errors will ma�er a lot.• the tolerance on the inner loop needs to be �ght, like 1.0e−13

9 / 32

Page 10: Es„ma„on of DP Models

MPECMathema�cal Programming with Equality constraints

• We can turn the problem around.• Instead of asking Whats the EV compa�ble with my guess θ?, wecould directly a�ack the likelihood:• Maximize L(θ) subject to the constraint, that behavior isop�mal according to the model.

• in other words, augment the likelihood:

L(θ, EV; X) = ΠTt=2P(dt|xt, θc, RC)p(xt|xt−1, dt−1, θp)

P(dt|xt, θc, RC) =exp[u(x, d, θc, RC) + βEV(x, d)]

∑d′∈{0,1} exp[u(x, d′, θc, RC) + βEV(x′, d′)]

10 / 32

Page 11: Es„ma„on of DP Models

Different Op�miza�on problems

NXFP solves the unconstrained op�miza�on problem:max

θL(θ, EVθ)

MPEC solves the constrained op�miza�on problem:maxθ,EVL(θ, EV; X)

subject toEV = T(EV, θ)

11 / 32

Page 12: Es„ma„on of DP Models

Su and Judd (2012)• Su and Judd (2012) perform MPEC on the bus model.• the key difference to note is that EV now becomes a choicevariable.• In fact, the op�mizer will be fed a vector

x = [RC, θc, EV]

where EV is an approxima�on to EV. In Su and Judd (2012), thisis just going to beEV ≡ [EV(x1), EV(x2), . . . , EV(xn)]

i.e. the approxima�on needs to hold pointwise.

12 / 32

Page 13: Es„ma„on of DP Models

Su and Judd (2012)

13 / 32

Page 14: Es„ma„on of DP Models

Performance• In general, NXFP is a computa�onally expensive opera�on.

• you have to solve a DP for many many many �mes in order tofind your θ.• However, there is much to qualify about this statement. Thedetails ma�er here.

• For example, Su and Judd (2012) are very cri�cal about NXFP inthe Bus Model. They compare it to the performance of MPEC.• But Iskhakov et al. (2016) redo the exercise with Rust’s originalmethod to solve EV and show that NXFP is s�ll a very strongcontender in this example.

14 / 32

Page 15: Es„ma„on of DP Models

Rust Bus Replacement

Berry, Levinsohn and Pakes (BLP) as MPEC

15 / 32

Page 16: Es„ma„on of DP Models

BLP a�er Dube et al. (2012)• Berry et al. (1995) is a model for automobile sales.• It has become a very widely applied model and es�ma�ontechnique, short: BLP.• The original paper performs demand es�ma�on with a largenumber of differen�ated products:

• characteris�cs approach• useful when only aggregate data are available• allows for flexible subs���on pa�erns• controls for price endogeneity• The computa�onal algorithm derives moment condi�ons from anon-linear model• The method is also known as Random Coefficients LogitDemand

16 / 32

Page 17: Es„ma„on of DP Models

Random Coefficients Logit Demand• The U�lity of i from purchasing product j in market t is

uijt = β0i + xjtβ

xi − β

pi pjt + ξjt + εijt (1)

• with product characteris�cs xjt, pjt, ξjt

• xjt, pjt: ovserved with cov(pjt, ξjt) 6= 0• ξjt: unovserved to econometrician.• βi ≡ [β0

i , βxi , β

pi ]: random coefficients or individual specifictastes to be es�mated.

• We posit a distribu�on: βi ∼ Fβ(β, θ)

• Goal of BLP : es�mate θ in the above parametric distribu�on.• errors are assumed type 1 EV• Consumer picks product j if uijt ≥ uij′t

17 / 32

Page 18: Es„ma„on of DP Models

The Model: Market Shares

• The model predicts market shares :sj(xt, pt, xit; θ) =

∫{βi,εi|uijt≥uij′t,∀j′ 6=j}

dFβ(β, θ)dFε(ε) (2)

• with type 1 EV shocks ε, there is an analy�cal solu�on to one ofthose integrals:sj(xt, pt, xit; θ) =

∫β

exp(β0 + xjtβx − βppjt + ξjt)

1 + ∑Jk=1 exp(β0 + xktβx − βppkt + ξkt)

dFβ(β, θ)

(3)

18 / 32

Page 19: Es„ma„on of DP Models

The Model: Market Shares

They use numerical integra�on:sj(xt, pt, xit; θ) =

1ns

ns

∑r=1

exp(β0r + xjtβxr − βprpjt + ξjt)

1 + ∑Jk=1 exp(β0r + xktβxr − βprpkt + ξkt)

dFβ(β, θ)

(4)to arrive at the market share (moment) condi�ons:sj(xt, pt, ξit; θ) = Sjt, ∀j ∈ J, t ∈ T (5)

where Sjt is data.

19 / 32

Page 20: Es„ma„on of DP Models

GMM Es�mator• If firms can observe demand shocks ξt, they will set pricesaccordingly.• There will be correla�on between pt and ξt⇒ Endogeneity Bias!• BLP solve endogeneity of prices with a vector zjt of IVs, whichare excluded from the demand equa�on (1)• they propose a moment condi�on E[ξjt|zjt, xjt] = 0

• zjt: e.g. product-specific cost shi�ers, or K non-pricecharacteris�cs in xj,t (assumed mean independent of ξt)• We o�en form E[ξjt · h(zjt, xjt)] = 0 for some known func�on h.

20 / 32

Page 21: Es„ma„on of DP Models

Ge�ng moment equa�ons• To get the sample analog of E[ξjt|zjt, xjt] = 0, we need to find

ξjt corresponding to θ

• System (5) defines a mapping ξjt and St

• Berry proved that s has an inverse, hence any observed St canbe explained by a unique ξjt(θ) = s−1(St, θ)

• Sample analog of E[ξjt|zjt, xjt] = 0 is thusg(θ) =

1TJ ∑

t,jξjt(θ)

′zjt

21 / 32

Page 22: Es„ma„on of DP Models

GMM Es�mator

• Data are {(xjt, pjt, Sjt, zjt)j∈J,t∈T}

• We want to minimize the GMM objec�veQ(θ) = g(θ)′Wg(θ)

• There is no analy�c form for ξjt(θ), see previous slide

22 / 32

Page 23: Es„ma„on of DP Models

Berry et al. (1995) Es�ma�on Algorithm - NFXP• Outer Loop: minθ g(θ)′Wg(θ)

1 Guess vector θ to get g(θ) = 1TJ ∑t ∑j ξjt(θ)

′zjt

2 Stop whenever ||∇θ(g(θ)′Wg(θ))|| ≤ εout

• Inner loop: compute ξjt(θ) given θ

1 Solve system sj(xt, pt, xit; θ) = S·t by Berry constrac�on:ξh+1

t = ξht + log St − log sj(xt, pt, ξh

t ; θ)

2 Stop whenever ||ξh+1·t − ξh

·t|| ≤ εin3 Call resul�ng demand shock ξjt(θ, εin)• Clearly, need to choose both stopping rules for inner and outerloop.

23 / 32

Page 24: Es„ma„on of DP Models

Berry et al. (1995) Es�ma�on Algorithm - NFXP• Outer Loop: minθ g(θ)′Wg(θ)

1 Guess vector θ to get g(θ) = 1TJ ∑t ∑j ξjt(θ)

′zjt

2 Stop whenever ||∇θ(g(θ)′Wg(θ))|| ≤ εout• Inner loop: compute ξjt(θ) given θ

1 Solve system sj(xt, pt, xit; θ) = S·t by Berry constrac�on:ξh+1

t = ξht + log St − log sj(xt, pt, ξh

t ; θ)

2 Stop whenever ||ξh+1·t − ξh

·t|| ≤ εin3 Call resul�ng demand shock ξjt(θ, εin)• Clearly, need to choose both stopping rules for inner and outerloop.

23 / 32

Page 25: Es„ma„on of DP Models

Kni�el and Metaxoglou (2014)• They perform an extensive inves�ga�on into BLP on two widelyused datasets: cars and cereals.• They use 10 free solvers and 50 star�ng points for each.• Find: convergence occurs at several local extrema, saddles, andin regions where the FOC is not sa�sfied.• Resul�ng parameter es�mates of economic variables (marketshares, price elas�citeis) exhibit huge varia�on depending onsolver/star�ng point.• All in all, they found 400 local solu�ons.

24 / 32

Page 26: Es„ma„on of DP Models

Dube et al. (2012)’s concerns

1 Too much computa�on• need to know ξ(θ) only at true θ.• NFXP solves for ξ(θ) at each stage.

2 Stopping criteria• inner loop can be slow to converge• it’s temp�ng to loosen εin (o�en see εin = 1e−6 or higher!)• outer loop may not converge with loose inner criterion

3 Inner loop error propagates to outer loop.

25 / 32

Page 27: Es„ma„on of DP Models

Errors from loose stopping

θ∗ = arg maxθ

Q(ξ(θ, 0))

θ = arg maxθ

Q(ξ(θ, εin))

• Dube et al. (2012) derive bounds on the order of es�ma�n erroras a func�on of εin• Consider Kni�el and Metaxoglou (2014) for numericalexperiments.

26 / 32

Page 28: Es„ma„on of DP Models

BLP as an MPEC

• Dube et al. (2012) cast this as an MPEC:min

θ,ξξTZWZTξ

subject to s(ξ, θ) = S

• Advantages:1 No need to set up 2 tolerances2 no inner errors propagated3 easy to code in AMPL4 fewer itera�ons, given that AMPL provides analy�cgradients/hessian

27 / 32

Page 29: Es„ma„on of DP Models

Exploring Sparsity in BLP• The way this is formulated now, the Hessian of objec�ve isdense. :-(• They add an addi�onal variable r and associated constraint

ZTξ = r

minθ,ξ,r

rTWr

subject to s(ξ, θ) = S

and ZTξ = r

• advantages:1 Hessian of objec�ve func�on is now sparse2 Very big saving in memory requirements.

28 / 32

Page 30: Es„ma„on of DP Models

Convergence and Loose vs Tight

29 / 32

Page 31: Es„ma„on of DP Models

Speed

30 / 32

Page 32: Es„ma„on of DP Models

References IJohn Rust. Op�mal replacement of gmc bus engines: An empiricalmodel of harold zurcher. Econometrica, 55(5):999–1033, 1987. ISSN00129682. URL http://www.jstor.org/stable/1911259.S. Berry, J. Levinsohn, and A. Pakes. Automobile prices in marketequilibrium. Econometrica: Journal of the Econometric Society,pages 841–890, 1995.Victor Aguirregabiria and Pedro Mira. Dynamic discrete choicestructural models: A survey. Journal of Econometrics, 156(1):38 –67, 2010. ISSN 0304-4076. doi:

DOI:10.1016/j.jeconom.2009.09.007. URLhttp://www.sciencedirect.com/science/article/

B6VC0-4X7R85S-3/2/b78d013f2b1f3dbce3f2325f0c41d4e4.Structural Models of Op�miza�on Behavior in Labor, Aging, andHealth.31 / 32

Page 33: Es„ma„on of DP Models

References IIChe-Lin Su and Kenneth L. Judd. Constrained op�miza�onapproaches to es�ma�on of structural models. Econometrica, 80(5):2213–2230, 2012. ISSN 1468-0262. doi: 10.3982/ECTA7925.URL http://dx.doi.org/10.3982/ECTA7925.Fedor Iskhakov, Jinhyuk Lee, John Rust, Bertel Schjerning, andKyoungwon Seo. Comment on “constrained op�miza�onapproaches to es�ma�on of structural models”. Econometrica, 84(1):365–370, 2016.Jean-Pierre Dube, Jeremy T Fox, and Che-Lin Su. Improving thenumerical performance of sta�c and dynamic aggregate discretechoice random coefficients demand es�ma�on. Econometrica, 80(5):2231–2267, 2012.Christopher R Kni�el and Konstan�nos Metaxoglou. Es�ma�on ofrandom-coefficient demand models: Two empiricists’ perspec�ve.Review of Economics and Sta�s�cs, 96(1):34–59, 2014.

32 / 32