the optimal early-withdrawal penalty on tax-deferred ... · this paper explores the optimal early...

The Optimal Early-Withdrawal Penalty on Tax-deferred

Saving Accounts∗

Anson T. Y. Ho†

Abstract

Tax-deferred savings accounts (TDA) are commonly used to encourage retirement

savings. While TDA have similar institutional settings in many countries, the penalty

rates on assets withdrawal before retirement differ significantly. Through an overlapping-

generations model calibrated to the U.S. economy, this paper shows that the optimal

early withdrawal penalty rate is 50.4%, which is substantially higher than the 10%

in the current regulation. Adopting the optimal penalty rate will obtain a welfare

gain equivalent to a 2.16% increase in per period consumption. A reform with income-

dependent early withdrawal penalty will improve welfare equivalent to a 2.90% increase

in per period consumption. Experiment results indicate that households facing negative

income shocks over withdraw TDA assets for consumption. Sensitivity test show that

the welfare gain from an income-dependent penalty is more robust to the assumption

on household’s time inconsistent preferences, and it provides four times more welfare

gain than a constant penalty rate if households are naıvete.

JEL classification: D14, D91, E21, H24

Keywords : Retirement, savings, tax-deferred savings accounts, hyperbolic discounting

∗I am grateful for comments from Dean Corbae, Kim P. Huynh, Jie Zhou, and participants at 2017

Asian Meeting of the Econometric Society. All errors are mine. I thank Johan Brannlund for his excellent

assistance with the Bank of Canada EDITH High Performance Cluster, where all the computations were

performed. The views expressed in this paper are mine. No responsibility for them should be attributed to

the Bank of Canada.†Bank of Canada, 234 Wellington Street, Ottawa, Ontario, K1A 0G9, Canada.

Email: [email protected].

1

mailto:[email protected]

1 Introduction

Tax-deferred saving accounts (hereafter TDA) are systematically used in many countries

to increase households’ preparedness for retirement. The most common types of TDA in

the U.S. are 401(k) and Individual Retirement Accounts (IRA).1 TDA provide significant

tax benefits to account holders for higher returns on savings – labor earnings contributed

to TDA are income-tax deductible and capital income within these accounts are untaxed.

Only when account holders withdraw assets from TDA, the whole amount taken out will be

taxed as ordinary income. However, savings in TDA also come with liquidity risk as early

withdrawal before before retirement age is subject to penalty payment in addition to the

income tax incurred. Thus, households’ use of TDA is a tradeoff between tax benefits and

assets liquidity, in addition to the intertemporal tradeoff on consumption. Although early

withdrawal penalty is a common regulation in place for TDA, Beshears, Choi, Hurwitz,

Laibson, and Madrian (2015) point out that TDA in the U.S. has one of the most liquid

arrangements.

Early withdrawals, also known as leakage, from TDA is an important consideration in

TDA institutional settings. Illiquidity, as a commitment to save for retirement, can also be a

risk. Using tax returns data, Amromin and Smith (2003) find that early withdrawals mainly

reflects consumption smoothing behavior by liquidity-constrained households, and Argento,

Bryant, and Sabelhaus (2015) show that early withdrawals increased during the great reces-

sion. Indeed, liquidity constraint also enables on “semi-liquid” TDA to serve as commitment

devices for retirement savings. Existing literature largely focuses on the theoretical context.

Amador, Werning, and Angeletos (2006) and Ambrus and Egorov (2013) theoretically an-

alyze the trade-off between commitment and flexibility. Beshears, Choi, Clayton, Harris,

Laibson, and Madrian (2015) explores the optimal penalty on early withdrawal in an ab-

stracted 3-period model. On the extensive margin, Diamond and Koszegi (2003) show that

households may use insufficient savings as a strategy to commit their future selves for later

retirement.

In a quantitative framework, Laibson (1997) argues that financial innovations lowered

saving rates in the U.S. as they increased the liquidity of savings. Laibson, Repetto, Tobac-

man, Hall, Gale, and Akerlof (1998) show that TDA is an effective commitment device for

household to save for retirement. Imrohoroglu, Imrohoroglu, and Joines (2003) point out

that the benefits of social security, as a mandatory defined-benefits retirement savings pro-

1Other less common types of TDA are 403(b) for nonprofit sector, 457 plan for public sector, and Keogh

accounts. They all have similar settings as 401(k).

2

gram for time-inconsistent household, is not substantial enough to overcome the distortions

created via payroll taxes. Love (2006) argues that the generosity of unemployment insur-

ance crowds out 401(k) contributions made by younger workers as they primarily save for

for precautionary reasons. Love and Phelan (2015) show that, with Epstein-Zen preferences,

the elasticity of inter-temporal substitution significantly increase the impact of hyperbolic

discounting on households’ wealth accumulation.

This paper explores the optimal early withdrawal penalty mechanism for TDA through

an overlapping-generations model, in which households exhibit time inconsistent preferences.

In the model, households are subject to idiosyncratic shocks on labor earnings and unemploy-

ment. They have assess to TDA that mimics 401(k) in the U.S., with certain contribution

limit and an employer match provision. Early assets withdrawal from TDA is subject to

10% penalty. Households’ income, after deducting TDA contributions, are taxed through a

progressive tax system. Social security and unemployment benefits are also included in the

model.

Two types of early withdrawal reforms are considered in this paper. Within the current

TDA regulatory framework, raising the early withdrawal penalty to about 50% will result in

welfare gains equivalent to a 4% per-period consumption. Although imposing a higher early

withdrawal penalty raises households’ liquidity risks and motivates them to substitute TA

savings for TDA savings, it also allows them to build precautionary savings in TA such that

they are less prone to withdraw from TDA in case of adverse income shocks. An income-

dependent penalty scheme for early withdrawal will further improve welfare equivalent to

a 0.5% increase in per-period consumption. Our findings shed light on the importance

of households’ liquidity concerns in accumulating wealth in semi-illiquid TDA. This paper

shows that simple reforms to the TDA early withdrawal policy can results in significant

welfare gains. It relates to the long-standing debate on the adequacy of households’ savings

for retirement and their participation in TDA.

The rest of the paper is organized as follow: Section 3 describes the model used for our

analysis, Section 4 explains the model calibration strategy, Section 5 conducts experiments

on the optimal early withdrawal penalty, Section 6 reports the results with naivete quasi-

hyperbolic discounting households, and Section 7 concludes.

3

2 TDA and Early Withdrawal Penalty

In general, distributions from conventional IRAs and 401(k) prior to age 59.5 are known as

early withdrawals, which are subject to a 10% penalty in addition to the income tax incurred.

Penalty-free early withdrawal is only allowed under specific circumstances for: (1) first-time

home purchase, (2) qualified education expenses, (3) death or disability, (4) unreimbursed

medical expenses, or (5) health insurance if the account holder is unemployed. According

to Vanguard (2016), about 1.5% of the aggregate assets are leaked out from 401(k) per year

from 0.79% of account holders. The most significant factors contributing to the leakage are

in-service withdrawals and cashouts.

Loan provisions for TDA differ based on account types. Conventional IRAs do not offer

loans under IRS regulations. On the other hand, Vanguard (2016) reports that 78% of 401(k)

plans offer loan options in 2015, covering 89% of plan participants. If account holders leave

their current jobs, they are typically required to repay the entire outstanding balance of

their 401(k) loans within 60 days. Any amount not repaid is considered an early withdrawal.

Non-hardship loans against 401(k) must be repaid within 5 years if an account holder stays

with her employer. Statutory regulation limits 401(k) participants to borrow at most 50%

of their 401(k) balance or up to $50,000, whichever is less. Generally, loans on 401(k) is not

common among US households. In 2015, only about 2% of the aggregate plan assets are in

loans.

3 Model

The life-cycle model used for analyzing households’ consumption-savings decisions is specified

in this section. Key features of the model includes households time inconsistent preferences,

uninsurable idiosyncratic risks on labor efficiency and unemployment, savings in tax-deferred

savings accounts (TDA) and taxable accounts (TA), and a progressive income tax system.

3.1 Demographics and Preferences

There is a large number of households born every period and the population grows at a

constant rate of g. Households have stochastic lifetime, entering the economy at age 25 and

live at most up to age 95 (J = 71 periods). Their conditional survival probability from

period j to period j + 1 is denoted by sj . Let µj be the fraction of population at age j,

the relative size of age j and age j + 1 households is given by µj+1 =sj1+g

µj and the total

4

population is∑J

j=1 µj = 1.

Preferences over lifetime consumption for a household of age j∗ exhibit time inconsis-

tency via hyperbolic discounting.2 Adopted from Laibson (1997), household’s preferences is

represented by

Uj∗ =

j∗−1∑

j=1

δj−j∗

b u(cj) + u(cj∗) + βEj∗

J∑

j=j∗+1

δj−j∗

f

j−1∏

t=j∗

stu(cj), (1)

where δb is the backward-looking discounting factor, δf is the forward-looking discounting

factor, and β ≤ 1 controls for households’ patience. The per-period utility function is given

by u(cj) =c1−γj

1−γ, where γ is the coefficient of relative risk aversion.

3.2 Income Process

Households are endowed with 1 unit of time each period and supply it inelastically to work

from age 25 to 64 (R = 40 periods). They enter the economy with different education levels

and are subject to idiosyncratic labor efficiency shocks and unemployment shocks.

The level of labor efficiency for a household at age j with education level i is given by

ej = exp(εj + εj), (2)

where εj is an age-specific deterministic efficiency, and εj is an uninsurable idiosyncratic

shock. The labor efficiency shock is persistent and follows the AR(1) process

εj = ρεεj−1 + ξj, (3)

where ξj is i.i.d. random variables normally distributed with mean zero and variance σ2ε .

The employment status of a household at age j is denoted by ηj ∈ {0, 1}, where ηj = 0

and ηj = 1 indicates that the household is being unemployed and employed, respectively.

The probability of unemployment, Pη(j), is conditional on a household’s age j. Unemployed

household receive unemployment insurance benefits with replacement rate λue of their em-

ployed labor earnings. Households retire after R periods and receive social security benefits,

which is a constant fraction, λss, of last working period’s labor income from employment.

This specification simplifies the solution for the model, as it retains heterogeneity in retire-

ment income without keeping track of households’ entire income histories. Let w be the

2Alternatively, self-control issues can be modeled via a class of utility function specified by Gul and

Pesendorfer (2004), with which Kumru and Thanopoulos (2008) and Kumru and Thanopoulos (2011) study

the U.S. social security system.

5

effective wage rate, the labor earnings, yj, for an age-j household is calculated as

yj =

{[(1− ηj)λue + ηj ]wej if j ≤ R

λssweR if j > R. (4)

3.3 Tax-deferred Accounts and Taxable Accounts

All households have access to tax-deferred savings accounts (TDA) and taxable savings

accounts (TA). Households can contribute their pre-tax labor earnings to TDA in each

employed period of their working lives. Contributions to TDA is limited to fraction q of

their current income or q amount in dollars, whichever is lower. After retirement, households

cannot contribute to TDA, and there is a minimum required distribution after period R+6.3

The amount of assets in TDA at the beginning of period j is denoted by aDj . Let qj denote

household’s period j contributions to TDA, with qj < 0 implies that assets are withdrawn

from TDA. Thus,

qj ∈

[−aDj ,min(q, qwej)

]if j ≤ R and ηj = 1

[−aDj , 0

]if j ≥ R + 1 and j ≤ R + 6[

−aDj ,−1

J−j+1aDj

]if j > R + 6

(5)

where 1J−j+1

is the minimum withdrawal rate.

Employed households also receive employers’ match of their TDA contributions, which is

q fraction of employee’s contributions up to 6% of the employee’s labor income. Therefore,

the amount of employer’s match, denoted by qEj , is

qEj =

{min(q ∗ qj, q ∗ 0.06yj) if qj > 0 and ηj = 1

0 otherwise. (6)

Assets in both TDA and TA earn a constant rate of return r.4 The amount of assets in

TDA grows as

aDj+1 = (1 + r)aDj + qj + qEj . (7)

3According to the current regulations in the United States, individuals must begin to take withdrawals

by age 70 1

2.

4To keep the model tractable, households’ portfolio choice and asset location problems are abstract

from this model. There is a strand of literature studying the portfolio choice in the presence of TDA.

See Amromin (2003), Dammon, Spatt, and Zhang (2004), Garlappi and Huang (2006), Huang (2008), and

Gomes, Michaelides, and Polkovnichenko (2009), among others.

6

Let aTj be the assets in households’ TA at the beginning of period j. Both TDA and TA are

subject to zero borrowing constraints

aTj ≥ 0 and aDj ≥ 0 for all j. (8)

Assets of deceased households are treated as accidental bequest (TR) and distributed evenly

across all households as lump sum transfers.

TDA assets withdrawn prior to age 59.5 (model period j < R − 4) is subject to early

withdrawal penalty at rate τpen ∈ (0, 1), in addition to the ordinary income tax incurred.

The early withdrawal penalty is specified as

pen(qj , j)

{= τpen × qj if qj < 0and j < R − 4

= 0 otherwise(9)

3.4 Taxes

Household’s labor earnings is taxed through a piece-wise linear progressive income tax sys-

tem, T (·), with contributions to TDA being tax deductible and Γ amount of income tax

exemption in the U.S. tax code. Taxable income in period j, denoted by AGIj , is defined as

the sum of labor earnings, interest income in TA, and the assets withdrawn from TDA, net

of TDA contributions and income tax exemption, such that

AGIj = we(εij, εj) + raTj − qj − Γ. (10)

Marginal income tax rates are conditional on household’s taxable income above the tax

exemption. Let IC = {IC1, IC2, IC3, IC4} be the cutoff points of the tax brackets and

{τ1, τ2, τ3, τ4, τ5} denote the corresponding marginal tax rates. For household with taxable

income AGIj ∈ (IC3, IC4], income tax payment is

T (AGIj) = τ1 (IC2 − IC1) + τ2 (IC3 − IC2) + τ3 (AGIj − IC3) . (11)

Households also pay payroll taxes and unemployment insurance tax, denoted by τss and

τui respectively, for balancing the social security and the unemployment insurance budgets.

Households’ capital income is subject to capital income tax at rate τk, included in the model

to mimic the corporate profits tax, and consumption is also taxed at rate τc.

7

3.5 Household Problem

The budget constraint for a household at age j is

cj + aTj+1 + qj

= yj + [1 + (1− τk)r]aTj + TR

−T (AGIj)− (τss + τui) yj − τccj if j ≤ R

= yj + [1 + (1− τk)r]aTj + TR

−T (AGIj)− τccj if j > R

. (12)

Households’ state variables are the current employment status, labor efficiency, TA and

TDA assets. Let xj be the set of state variables for a household in period j, with xj =(ηj, εj, a

Tj , a

Dj

)∈ 1η×R×R×R. They make decisions on consumption (cj), TDA contribution

(qj), and next period TA assets (aTj+1) based on their states, such that cj = c(xj), qj = q(xj),

and aTj+1 = aT (xj). Households are assumed to be sophisticated, who fully aware of their

time inconsistent preferences. They assume that their future selves will also engage in quasi-

hyperbolic discounting, and they account for that in their decision making in the current

period.

The decision problem for an age-j household in recursive form is written as

V (j, xj) = maxcj ,qj ,a

Tj+1

{c1−γj

1− γ+ βδfsj+1Ej

[V (j + 1, xj+1)

]}, (13)

where

V (j + 1, xj+1) =c1−γj+1

1− γ+ δfsj+2Ej+1

[V (j + 2, xj+2)

], (14)

subject to income processes (2) to (4) and constraints (5) to (12), in addition to the non-

negativity constraint on consumption.

3.6 Firm Problem

Firms have access to a Cobb-Douglas production function that uses capital and labor to

produce output. Factor markets are competitive. In addition to paying wages for labor input,

firms also match a fraction of their employees’ contributions to TDA. The representative

firm’s profit maximization problem is written as

π = zKαN1−α − (r + δ)K − wN −Qe (15)

where z is the total factor productivity (TFP), K is the aggregate capital, N is the aggregate

effective labor, and Qe is the total amount of employer’s match of employees contribution to

8

TDA specified in equation (6). The employer’s match can also be expressed as qe percentage

of total wages such that

Qe = qewN. (16)

The firm’s profit maximizing levels of capital and effective labor are characterized by the

first-order conditions

r = αzkα−1 − δ (17)

(1 + qe)w = (1− α)zkα. (18)

3.7 Stationary Equilibrium

To describe the heterogeneity among households of age j is described by A probability

measure ϕj that defines the subset of households state space (X). Let the probability space

be (X,B(X), ϕ), where B(X) is the Borel σ-algebra on X . For a household of age j > 1,

the probability measure is given by the recursion

ϕj+1(B) =

∫

X

Pr(x, j, B)dϕj, (19)

where

Pr(x, j, B) =

{ ∑i P

ηij

∫Pr(ε′, ε)dε if (η′, ε′, aD(x, j), aT (x, j)) ∈ B

0 otherwise. (20)

A stationary equilibrium consists of factor prices {r, w, qe}, household decision rules

{c(x, j), q(x, j), aT (x, j)}, employer’s match q, tax payments Γj, accidental bequests TR,

social security benefits bss, unemployment insurance benefits bue, aggregate capital K, aggre-

gate efficient labor N , government consumption G, payroll tax τss, unemployment insurance

tax τui, consumption tax τc, income tax regime T (·), and distributions of households {ϕj}Tj=1

such that

1. c(x, j), q(x, j), aT (x, j) are the optimal decision rules on consumption, TDA contribu-

tion, and TA assets, respectively.

2. Factor prices are determined competitively according to (17) and (18).

3. Markets clear

(a) Capital market clears

(1 + g)K =

J∑

j=1

µj

∫

X

aD(x, j) + aT (x, j)dϕj

9

(b) Labor market clears

N =

R∑

j=1

µj

∫

X

[1− Pη(j)] ejdϕj

(c) Goods market clears

C =J∑

j=1

µj

∫

X

c(x, j)dϕj

Y = C + (g + δ)K +G

4. Social Security budget balance

J∑

j=R+1

µj

∫

X

λsswejdϕj =R∑

j=1

µj

∫

X

ηjτsswejdϕj

5. Unemployment Insurance budget balance

R∑

j=1

µj

∫

X

λuewejdϕj =

R∑

j=1

µj

∫

X

ηjτuiwejdϕj

6. Government maintain a balanced budget

G =

J∑

j=1

µj

∫

X

T (AGI(x, j)) + τkr(aT (x, j) + aD(x, j)

)+ τcc(x, j)dϕj

7. Accidental bequest is equal to transfers

TR =J∑

j=1

(1− sj)µj

∫

X

(1 + r)[aD(x, j) + aT (x, j)]dϕj

4 Calibration

The model is calibrated to the long-run U.S. economy. Nominal values are adjusted to 2000

US dollars using the Consumer Price Index. Parameter values are summarized in Table 1.

Each model period is equal to 1 year. Households enter the economy at age 25 and live at

most to 95 years old. Household’s age-dependent survival probabilities, {sj}Jj=1, are taken

from year 2000 life table of the National Center for Health Statistics. The population growth

rate (g) is 1.2%, matching the long-run growth from 1960 to 2010 in the U.S.

Production and preference parameters are taken from Imrohoroglu, Imrohoroglu, and

Joines (2003). The capital share of output (α) is 0.31 and the depreciation rate of capital

10

(δk) is 0.044. The total factor productivity (z) is set at 0.986, so that the average labor

income of high-school graduates at age 25 ($29,343.28) is normalized to 1. For preference

parameters,household’s impatience parameter (β) is 0.85. Discount factors δf and δb are set

at 1.013 such that the benchmark capital-output ratio is 2.52. The coefficient of relative risk

aversion (γ) is 2, which is commonly used in the macroeconomic literature.

The earnings processes and the probabilities of unemployment are taken from Love (2006,

2007), in which estimates are constructed using data from the Panel Study of Income Dy-

namics and the Current Population Survey, respectively. The probabilities of unemployment

is reproduced in Table 2. The fraction of households with college degree is 37.5%.

Regulations on TDA follow the Employee Retirement Income Security Act of 1974

(ERISA). The benchmark penalty rate on early TDA assets withdrawal, τpen, is 10%. The

contribution limit in year 2000 was $10,500 in dollars (q) or 25% of household’s labor earnings

(q).

Consumption tax rate is 5.5%. Capital income tax rate is 10.7%. Payroll tax rate

(τss) estimated as the average percentage of social security payment (OASDI) to the total

compensation to employees and the labor share of propriety income from 1987 to 2000.

Correspondingly, the social security replacement rate (λss) is set at 51.2% to maintain the

social security budget balanced. Similarly, the unemployment insurance benefits replacement

rate (λui) is 25% and the unemployment insurance tax (τui) is 1.2%.5 The US income tax

code in year 2000. Tax brackets and the marginal tax rates are described in Table 3. To be

consistent with the income process, households are assumed to take the standard deduction

and personal exemption of $7,200 for their income tax exemptions.

5 Optimal Early Withdrawal Penalty

The policy reforms on early withdrawal penalty are considered in this section. The straight-

forward reform entails changing the penalty rate to one that maximizes households’ welfare.

In a more complete reform, the constant penalty rate is replaced by a scheme that includes

a certain level of exemption based on households’ on labor earnings.

For these policy experiments, I evaluate the welfare impacts as a consumption supplement

in the benchmark economy, such that the welfare of a household in the benchmark economy

5In reality, households are entitled to 26 weeks of unemployment benefits at 50% of their employed labor

income. There is also a dollar limit on the benefits, which varies by state with a median of 500 per week.

Extended benefits are also available in the times of recession. Given the dollar limit, the model unemployment

benefits are more generous than that in the actual system.

11

equals to the welfare of a household into the reformed economy. The consumption supplement

is computed as a fixed percentage increase in consumption at each age. This approach is used

in Imrohoroglu, Imrohoroglu, and Joines (2003), and Caliendo and Findley (2015) further

show that consumption-saving plan formulated by an unborn household Pareto dominates

the actual allocation from the view of all future selves. To illustrate the relative importance

of welfare gain from the optimal penalty reforms, results are compared with the impacts of

removing TDA from the economy.

5.1 Constant Rate

Simulation results, reported in the second column of Table 4, show that the optimal early

withdrawal penalty rate is 50.4%, which is substantially higher than the current rate of 10%.

The fraction of households who withdraw TDA assets before age 60 (j < 36) decreased from

30.3% to 5.5%, and the aggregate leakage of TDA assets due to early withdrawal decreases

from 3.26% to 0.43%. Households also contribute less to TDA due to the heightened liquidity

risk, and the aggregate TDA contribution decreases to 5.74% of labor earnings. The net effect

of imposing the optimal penalty rate is a 3.2% increase in net worth, with TDA assets reduce

by 5.5% and TA assets increase by 1.8 times. The weighted welfare gain from raising the

early withdrawal penalty is equivalent to 2.16% increase in per period consumption.

5.2 Income-dependent Penalty Rate

As TDA are semi-illiquid in the sense that withdrawals before retirement will incur penalties,

young and low-income households are generally more reluctant to contribute to TDA, due

to the liquidity risk they face with the lack of precautionary savings. I investigate a TDA

policy reform in which early withdrawal penalty is income dependent. This reform entails

an early withdrawal penalty and a penalty-free withdrawal limit conditional on households’

labor income. Specifically, the early withdrawal penalty (τpen) follows an exponential decay

structure such that

τpen = τ incpen ∗min{q − φp0 exp(−φ

p1inc), 0}, (21)

where φp0 is the maximum early withdrawal exemption provided that a household has no labor

earnings and φp1 governs the decrease in exemption level for each dollar of labor earnings.

TDA asset withdrawals above the final exemption levels will be taxed at rate τ incpen. The

optimal values for {τ incpen, φp0, φ

p1} are estimated within the model to maximize the welfare

gain from the policy reform.

12

Simulation results are reported in the last column of Table 4. The optimal φp0 and φ

p1

estimated from the model are -1.62 and -5.4, respectively. It implies that the maximum

amount of exemption is about $48,600 per year, and households lose $5.4 of exemption for

each dollar of labor earnings they receive, including benefits from unemployment insurance.

TDA assets withdrawn beyond the exemption level is taxed at a rate of 86%.

Since low-income households, mainly due to unemployment, can withdraw TDA assets

without penalty, the fraction of households withdrawing TDA assets before age 60 increases

to 33.2%. The aggregate percentage of TDA leakage is 0.49%, which is substantially lower

than the benchmark but sightly higher than the leakage with constant penalty rate. Due

to the very high penalty rate for early withdrawal beyond the allowance, the aggregate

TDA contribution rate further decreases to 5.68% of total labor income. Despite the lower

contribution and higher leakage rate, households’ average net worth increases more than that

with the constant penalty policy, showing a rise of 3.9% from the benchmark. Households’

TA assets increase by 3.09 time and their TDA assets reduce by 3.5%. The weighted welfare

gain from this policy reform is equivalent to a 2.9% increase in per period consumption.

6 Sensitivity

In this section, the robustness of the policy reforms is evaluated under different assumptions

on households’ time inconsistent preferences. Specifically, I assess the optimal policy on

TDA early withdrawals and its outcome under different views of their future-selves.

6.1 Naıvete Households

I explore the impacts of TDA early withdrawal penalty with an alternative assumption that

households are naıvete, who believe that their future selves will follow the optimal decision

rules that they formulate in the current period, despite repeated violations in the past.

Households’ perception of their future selves can have important implications on their use

of TDA. Since naıvete households expect their future selves to follow the decision rules that

they formulate in the current period, assets in TDA have higher growth potentials (i.e. less

leakage) and they have less incentives to make early withdrawals from TDA. On the other

hand, naıvete households do not expect their future selves to be nonresistant to temptations

and do not see the needs to commit to retirement savings by contributing to TDA, effectively

putting a liquidity barrier for future selves to consume savings prematurely.

13

Household’s dynamic programming problem in equation (13) is written as

V (j, xj) = maxcj ,qj

{c1−γj

1− γ+ βδfEj

[sj+1V (j + 1, xj+1) + (1− sj+1)

W1−γj+1

1− γ

]}, (22)

where

V (j + 1, xj+1) = max

{c1−γj+1

1− γ+ δfEj+1

[sj+2V (j + 2, xj+2) + (1− sj+2)

W1−γj+2

1− γ

]}, (23)

subject to income processes (2) to (??) and constraints (5) to (12), in addition to the non-

negativity constraint on consumption. The model with naıvete households is re-calibrated to

match the target capital-output ratio specified in Section 4. The calibrated discount factor

(δ) is 0.944 and total factor productivity (z) is 0.985. Experiments in Section 5 are repeated

and simulation results are reported in Table 5.

Under the reform with a constant penalty rate and no withdrawal allowance (second

column), the optimal penalty rate is 32.3%, with which the fraction of households who

withdraw TDA assets early decreases to 21.7%. The average leakage of TDA assets reduces

to 1.7%. As TDA are more illiquid, the average contribution rate is lowered to 6.2%. The net

effect of the higher penalty rate is a 2.5% increase in households’ net worth, with households’

accumulating twice as much assets in the liquid TA. This reform is self-financing, and the

welfare gains from it is equivalent to 0.5% increase in per period consumption.

When an income-dependent withdrawal allowance is introduced (third column), the op-

timal penalty is 52.3% with an allowance of $54,720 (φP0 = −1.82). The allowance is reduced

by $2.56 for each dollar of household labor earnings (φP1 = −2.56). While this penalty sys-

tem increases the percentage of households withdrawing TDA assets early to 34.9%, the high

penalty charged on withdrawals beyond households’ penalty-free allowances also decreases

leakage of TDA assets to 1.8% on average. Overall, households’ net worth increase by 4.6%

as the withdrawal allowance reduces household’s liquidity risk. When compared to the case

without withdrawal allowance, households contribute more to TDA and save less in their

liquid TA, so that their assets in TDA and TA increase by 2.5% and 53.8% respectively.

The program is also self-financing, and the welfare gains are almost 4 times of that with-

out withdrawal allowance, because it does not penalize households who are most liquidity

constrained.

6.2 Comparison with Sophisticated Households

Experiment results indicate that the welfare gains from TDA penalty reforms for naıvete

households are smaller than that for sophisticated households, especially when a constant

14

penalty rate is considered. Since naıvete households believe their future selves will value

the growth potentials of TDA assets and resist the temptation of tapping into TDA for

consumption, the commitment value of TDA is lowered. As a result, raising the early

withdrawal penalty has a smaller effect on improving the value of TDA as commitment

devices while it also increases households’ liquidity risk. It is because the optimal penalty

for TDA early withdrawal are more lenient for naıvete households. Therefore, the optimal

penalty for TDA early withdrawal are more lenient for naıvete households, with lower penalty

rates and more generous penalty-free withdrawal allowance under the income-dependent

scheme.

The early withdrawal penalty, ex-post, becomes sub-optimally lenient for households’

current selves when they decide to use TDA assets for consumption smoothing in face of

negative shocks on labor earnings. As such, the penalty scheme becomes less effective in

preventing households who exhibits present bias from over-withdrawing their TDA assets

for current consumption.

7 Conclusion

This paper investigates the optimal early withdrawal penalty on TDA, through the lens of

an OLG model with households exhibiting time-inconsistent preferences. Policy experiments

indicate that the current penalty rate in the U.S. TDA system is sub-optimally low, inducing

households to become over reliant on their retirement savings to smooth their negative

labor earnings shocks. While raising early withdrawal penalty to the optimal level reduces

households’ desire to save in TDA, it also motivates households to accumulate precautionary

savings in TA and restrain themselves from prematurely consuming their TDA assets that

provide substantial tax benefits.

Reforming the early withdrawal penalty to an income-dependent penalty system provides

substantial welfare gains. Quantitatively, it increases the welfare gains from introducing the

TDA system by 30%. The welfare impacts are robust to the assumptions on households’

perceptions of their time-inconsistent preferences (sophisticated vs. naıvete), and they are

consistent with the multiself Pareto criterion in the sense that households at every vintage

point find that income-dependent penalty system welfare improving. This reform is easy to

implement and it is self-financing, as it stimulates aggregate economic activities such that

the government raise extra tax revenues from capital income tax and consumption tax to

compensate the loss of that from TDA early withdrawals. On the other hand, the impacts

15

of raising early withdrawal penalty within the current regulatory framework are sensitive to

assumptions on whether households are sophisticated or naıvete. It implies that the optimal

penalty rate hinges on households’ behavioral expectation of their future selves.

Findings in this paper highlights the importance of understanding TDA early withdrawal

penalty as a two-edged sword on promoting households’ retirement security and heightening

their liquidity risks. TDA reforms considered also contribute to policy concerns on reducing

households’ vulnerability to adverse labor earnings shocks.

16

Table 1: Summary of Calibrated Parameter Values

Parameter Description Value Target / Data Source

Demographics

J Lifespan 71 Age 25 to 95

R Number of working periods 40 Work until age 64

{sj}Jj=1 Survival probability Life table in year 2000

g Population growth 0.012 Avg. population growth 1960-2010

Preferences

γ Relative risk aversion 2 Imrohoroglu, Imrohoroglu, and Joines (2003)

β Impatience 0.850 Imrohoroglu, Imrohoroglu, and Joines (2003)

δf Discount factor 1.013 Capital-output ratio is 2.52

Income

{fj} Age earnings profile Love (2006, 2007)

ρCOLǫ Persistence of income shock 0.887 Love (2006, 2007)

σCOLε Std. dev. of income shock 0.253 Love (2006, 2007)

ρHSε Persistence of income shock 0.780 Love (2006, 2007)

σHSε Std. dev. of income shock 0.239 Love (2006, 2007)

λ Social security replacement rate 0.512 Social Security budget balance

b Unemployment benefits 0.250

TDA Savings

τpen Early withdrawal penalty rate 10% ERISA of 1974

q Contribution limit (% of income) 0.25 ERISA of 1974

q Contribution limit in dollars 0.358 Contribution limit of $10,500

q Employer’s match 50%

Production

z Total factor productivity 0.986 Avg. age-1 labor income of HS = 1

α Capital share of output 0.310 Imrohoroglu, Imrohoroglu, and Joines (2003)

δk Depreciation rate 0.044 Imrohoroglu, Imrohoroglu, and Joines (2003)

Tax

τc Consumption tax 5.5%

τk Capital income tax 10.7% Estimated from OMB data

τss Payroll tax rate 9.8% Estimated from OMB data

τui Unemployment insurance tax 1.0%

17

Table 2: Probability of household unemployment risk

Age High school graduate College graduate

25-34 0.052 0.035

35-44 0.043 0.030

45-54 0.039 0.028

55-64 0.039 0.027

Note: Age and education dependent probably of unemployment is taken from Love (2006).

Table 3: Income Tax Scheme

Labor income Dollars Model unit Marginal tax rate

(0, I1] ($0− $26, 250] (0.0− 0.895] τ1 = 0.15

(I1, I2] ($26, 250− $6, 3550] (0.895− 2.166] τ2 = 0.28

(I2, I3] ($63, 550− $132, 600] (2.166− 4.519] τ3 = 0.31

(I3, I4] ($132, 600− $288, 350] (4.519− 9.827] τ4 = 0.36

> I4 > $288, 350 > 9.827 τ5 = 0.396

Note: The average labor income of high school graduate in age 1 ($29,343.28) is normalized

to 1 in the model. Households in the model is assumed to take standard deduction of $4,400

and personal exemption of $2,800, totaling $7,200.

18

Table 4: Estimated optimal early withdrawal penalty (sophisticated households)

Benchmark Constant penalty Income-dependent

Penalty rate 0.100 0.504 0.860

% households with early withdrawal 30.32% 5.51% 33.20%

Avg % of TDA contribution 8.20% 5.74% 5.68%

Avg % of TDA leakage -3.26% -0.43% -0.49%

Net worth 1.000 1.032 1.039

TDA 1.000 0.945 0.965

TA 1.000 2.764 4.090

r 0.071 0.068 0.068

w 1.000 1.015 1.016

τadj · -0.003 0.000

Weighted welfare gain · 2.16% 2.90%

Note: Experiment results are based on the assumption that households are sophisticated

towards their time-inconsistent preferences. For income-dependent early withdrawal penalty,

the optimal penalty-free early withdrawal parameters are φP0 = −1.622 and φP

1 = −5.398.

19

Table 5: Estimated optimal early withdrawal penalty (naıvete households)

Benchmark Constant penalty Income-dependent

Penalty rate 0.100 0.323 0.523

% households with early withdrawal 32.55% 21.69% 34.92%

Avg % of TDA contribution 8.69% 6.18% 7.80%

Avg % of TDA leakage -3.36% -1.68% -1.81%

Net worth 1.000 1.025 1.046

TDA 1.000 0.982 1.025

TA 1.000 2.048 1.538

r 0.071 0.069 0.067

w 1.000 1.008 1.016

τadj · -0.009 -0.003

Weighted welfare gain · 0.51% 2.01%

Note: Experiment results are based on the assumption that households are naıvete towards

their time-inconsistent preferences. Model parameters are re-calibrated to match the capital-

output ratio. Re-calibrated values for the discount factor (δ) and the total factor productivity

(z) are 0.944 and 0.985, respectively. For income-dependent early withdrawal penalty, the

optimal penalty-free early withdrawal parameters are φP0 = −1.824 and φP

1 = −2.556.

20

References

Amador, M., I. Werning, and G.-M. Angeletos (2006): “Commitment vs. Flexibil-

ity,” Econometrica, 74(2), pp. 365–396.

Ambrus, A., and G. Egorov (2013): “Comment on “Commitment vs. Flexibility”,”

Econometrica, 81(5), 2113–2124.

Amromin, G. (2003): “Household Portfolio Choices in Taxable and Tax-Deferred Accounts:

Another Puzzle?,” European Finance Review, 7(3), 547–582.

Amromin, G., and P. Smith (2003): “What Explains Early Withdrawals from Retirement

Accounts? Evidence from a Panel of Taxpayers,” National Tax Journal, 56(3), pp. 595–

612.

Argento, R., V. L. Bryant, and J. Sabelhaus (2015): “Early Withdrawals from

Retirement Accounts During the Great Recession,” Contemporary Economic Policy, 33(1),

1–16.

Beshears, J., J. J. Choi, C. Clayton, C. Harris, D. Laibson, and B. C. Madrian

(2015): “Optimal Illiquidity,” Manuscript.

Beshears, J., J. J. Choi, J. Hurwitz, D. Laibson, and B. C. Madrian (2015):

“Liquidity in Retirement Savings Systems: An International Comparison,” American Eco-

nomic Review, 105(5), 420–25.

Caliendo, F., and T. S. Findley (2015): “Commitment and Welfare,” .

Dammon, R., C. Spatt, and H. Zhang (2004): “Optimal Asset Location and Allocation

with Taxable and Tax-Deferred Investing,” Journal of Finance, 59(3), 999–1037.

Diamond, P., and B. Koszegi (2003): “Quasi-hyperbolic discounting and retirement,”

Journal of Public Economics.

Garlappi, L., and J. Huang (2006): “Are Stocks Desirable in Tax-deferred Accounts?,”

Journal of Public Economics, 90(12), 2257–2283.

Gomes, F., A. Michaelides, and V. Polkovnichenko (2009): “Optimal Savings with

Taxable and Tax-Deferred Accounts,” Review of Economic Dynamics, 12(4), 718–735.

21

Gul, F., and W. Pesendorfer (2004): “Self-Control and the Theory of Consumption,”

Econometrica, 72(1), 119–158.

Huang, J. (2008): “Taxable and Tax-Deferred Investing: A Tax-Arbitrage Approach,”

Review of Financial Studies, 21(5), 2173–2207.

Imrohoroglu, A., S. Imrohoroglu, and D. H. Joines (2003): “Time-Inconsistent

Preferences And Social Security,” The Quarterly Journal of Economics, 118(2), 745–784.

Kumru, C. S., and A. C. Thanopoulos (2008): “Social security and self control prefer-

ences,” Journal of Economic Dynamics and Control, 32(3), 757 – 778.

(2011): “Social security reform with self-control preferences,” Journal of Public

Economics, 95(78), 886 – 899.

Laibson, D. (1997): “Golden Eggs and Hyperbolic Discounting,” The Quarterly Journal

of Economics, 112(2), 443–478.

Laibson, D. I., A. Repetto, J. Tobacman, R. E. Hall, W. G. Gale, and G. A.

Akerlof (1998): “Self-Control and Saving for Retirement,” Brookings Papers on Eco-

nomic Activity, 1998(1), pp. 91–196.

Love, D. (2006): “Buffer stock saving in retirement accounts,” Journal of Monetary Eco-

nomics, 53(7), 1473 – 1492.

Love, D., and G. Phelan (2015): “Hyperbolic discounting and life-cycle portfolio choice,”

Journal of Pension Economics and Finance, 14(04), 492–524.

Love, D. A. (2007): “What can the life-cycle model tell us about 401(k) contributions and

participation?,” Journal of Pension Economics and Finance, 6(02), 147–185.

Vanguard (2016): “How America Saves 2016: A report on Vanguard 2015 defined contri-

bution plan data,” .

22

the optimal early-withdrawal penalty on tax-deferred ... · this paper explores the optimal early...

Documents