joseph l breeden, phd · 2017-10-04 · drivers of performance origination: consumer & loan...

35
Joseph L Breeden, PhD Prescient Modeling © 2012 1

Upload: others

Post on 31-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Joseph L Breeden, PhD

Prescient Modeling © 2012 1

Page 2: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

What are Scores for?

This is the assumed hierarchy under which scores are created.

9/17/2013 Prescient Models LLC © 2012 2

Improving profitability

by

by

by

Determining the risk of incomplete repayment or reduced revenue

Estimating the risk of default or prepayment

Ranking risk

Computing a score

by

Page 3: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Profit Model

The connection between cut-off score and PD is tenuous.

We can be much more accurate in pricing if we start with a real PD forecast.

9/17/2013 Prescient Models LLC © 2012 3

Cut-off Score 620 Max Loss

Max PD % Exp. Revenue

Loan Amt Expenses

EAD Min Margin

LGD

Page 4: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Using Scores: Setting Pricing

9/17/2013 Prescient Models LLC © 2012 4

Score Cut-off Score

Accepts

Rejects

Page 5: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Using Scores: Setting Pricing

9/17/2013 Prescient Models LLC © 2012 5

Score

Log-O

dds

Cut-off Score

Max o

dds

of

“bad”

Accepts

Rejects

The score distribution shifts with the economy.

The score-odds relationship shifts with the economic cycle and credit cycle.

Cut-off score is usually adjusted intuitively.

Page 6: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Standard Bank Practice

Credit Risk has a group that builds scores that rank-order.

Model Audit validates the scores rank-ordering ability.

Credit Policy sets the cut-off score to control risk (odds of default).

Finance creates a profit model that assumes a certain odds of default.

9/17/2013 Prescient Models LLC © 2012 6

This structure fails by design.

What we needed from the start was a probability model. Any score provides that in-sample, but how do we produce it out of sample?

Page 7: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Prescient Models LLC © 2012 7 9/17/2013

Page 8: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Drivers of Performance

Origination: Consumer & loan information.

Vintages: Biases for specific cohorts.

Lifecycle: Changes with age of the loan.

Behavior: As the loan matures, observed loan performance can refine the predictions.

Account Management: Changes made to individual loans.

Environment: Changes in the economy or portfolio management.

Loan Idiosyncratic Effects: Every loan is unique.

Prescient Models LLC © 2012 8 9/17/2013

Page 9: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Age-Vintage-Time (A-V-T) Dependency

Because , measurements from one information source will be contaminated by other sources: Behavior is dependent on Lifecycle and Environment.

The linear components of any vintage, age, or time dependencies cannot be uniquely measure.

Prescient Models LLC © 2012 9

Information Type …is a function of

Originations Recent history or loan (i) up to origination date (v)

Vintage Vintage origination date (v)

Lifecycle Age of the loan (a)

Behavior All time since origination (v, v+1, … , t=v+a)

Account Management Time since event (te)

Environment Recent time (t-n, … t-1, t)

Loan Idiosyntricities Loan (i)

vta

9/17/2013

Page 10: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Modeling Implications

From the Age-Period-Cohort literature, we know that linear effects in age, vintage, and time cannot be uniquely determined regardless of the model used.

For short data sets (such as a typical 2-year scoring dataset), the macroeconomic environment by time and credit risk by vintage are mostly linear trends.

Coefficients estimated against the linear components of macroeconomic variables and scoring factors will be unstable, i.e. adding macroeconomic factors to scoring models rarely works.

Prescient Modeling © 2012 10

Page 11: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

GLM-AVT Model

1. Use the Age-Vintage-Time structure found in APC, but in a loan-level model.

Make an explicit choice about trend allocation between f(a), g(v), and h(t) to obtain a unique solution.

1. (Optional) Fit h(t) to macroeconomic data. Retrend lifecycle and credit risk trends to compensate.

2. Introduce either origination or behavioral scoring factors to estimate score.

11 Prescient Models LLC © 2012 9/17/2013

logpi a,v, t( )1- pi a,v, t( )

æ

èçç

ö

ø÷÷ = f (a)+ g(v)+h(t), a = t - v

Page 12: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Origination Bads Goods

Excluded from Traditional Analysis

History Forecast

Data for Loan-level Modeling

A standard approach to defining Good and Bad accounts for logistic-style regression without inclusion of temporal information.

Prescient Models LLC © 2012 12

Origination

(5) Bads

(2) Goods

(3)

x

x

x x

x x

x

x x

x

x x

x x

9/17/2013

x

Page 13: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

In-sample Bads

History Forecast

Data for V-A-T Decomposition Methods

All in-sample loans are included in analysis with emphasis on time of Event occurrence.

Goods are simply accounts that were not observed to go bad. They are included as censored accounts in the estimation process.

Prescient Models LLC © 2012 13

Censored Accounts (Goods)

x x

x x

x x

x

x x

x

x x

x x

9/17/2013

x

Page 14: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

1. Create Loan-level Age-Vintage-Time Model

Put an APC structure into a loan-level GLMM framework.

1 constant term, 2 linear terms, and nonlinear terms in a, v, and t. No explicit cross-terms.

are orthogonal basis functions.

, , and are the number of observations in a, v, and t.

Assumes environmental trend = 0.

aN vN tN

14 Prescient Models LLC © 2012

iO

9/17/2013

logpi a,v, t( )1- pi a,v, t( )

æ

èçç

ö

ø÷÷ =a0 +a1a+ a jOj a( )

j=2

Na-1

å +b1v+ b jOj v( )j=2

Nv-1

å + g jOj t( )j=2

Nt-1

å

Page 15: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Create Age-Vintage-Time Functions

Recast as functions as

1

2

1

2

1

1

2

10

)(

)(

)(

t

v

a

N

j

jj

N

j

jj

N

j

jj

tOtH

vOvvG

aOaaF

)(),(),( tHvGaF

15 Prescient Models LLC © 2012 9/17/2013

Page 16: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Example

Regional US auto loan portfolio, ~$400 mln receivables.

We modeled PD, EAD, LGD, and attrition to predict losses.

Prescient Modeling © 2012 16

0 12 24 36 48 60 72 84 96 108

0.0

00

0.0

01

0.0

02

0.0

03

0.0

04

Small Auto Loan Portfolio

Age

Lifecycle

Fu

nction

PD

Page 17: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Credit Risk Function Example

Prescient Modeling © 2012 17

2004 2006 2008 2010 2012

−2

−1

01

2

Small Auto Loan Portfolio

Vintage

Vin

tag

e F

un

ctio

n

PD

Page 18: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Environment Function Example

Prescient Modeling © 2012 18

2004 2006 2008 2010 2012

−2

−1

01

2

Small Auto Loan Portfolio

Time

Environ

men

tal F

unction

PD

Page 19: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

2. Fit to Macroeconomic Data

Create OLS fit of H(t) to macroeconomic factors Ei(t) with

an explicit trend term. Possibly use a constrained fit so the long-run trend = 0.

We have T0 observations of function H(t).

We have Th measurements of n economic factors Ei(t) prior to the start of the performance data.

Total length of economic data is T = T0 + Th.

- Th -1( ) 0 To

19 Prescient Models LLC © 2012 9/17/2013

)(tH

H (t)= c0t + ciEi t( )i=1

N

å +e

Page 20: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Create a macroeconomic model

Model just the environmental function with macroeconomic factors. (Macroeconomic adverse selection impacts on vintage quality is a separate effect.)

Use the correct transformations: log-ratio, not % change; log interest rates; etc.

Allow for a linear component.

Prescient Modeling © 2012 20

Estimate Std.Error tvalue Pr(>|t|)(Intercept) 5.908513 1.250051 4.727 3.31E-06t -0.009547 0.002535 -3.765 0.000195Unemp.lwMovingAvg.Lm6.W3 2.050558 0.535645 3.828 0.000153HPI.lwLogRatio.L5.W24 -2.262367 1.039257 -2.177 0.030153Housing.Starts.lwLogRatio.L8.W15 -0.306789 0.047639 -6.44 3.94E-10

Residual standard error: 0.434 on 351 degrees of freedom

(12 observations deleted due to missingness)

Multiple R-squared: 0.3918,Adjusted R-squared: 0.3848

F-statistic: 56.52 on 4 and 351 DF, p-value: < 2.2e-16

Page 21: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Retrend the environmental function

Prescient Modeling © 2012 21

1990 1992 1994 1996 1998 2000 2002 2004 2006 2008 2010 2012

01

23

Multivariate Forecast of Environmental Function for All Products

Date

Pre

dic

ted

En

vir

on

me

nta

l F

un

ctio

nMultivariate Economic Forecast

Detrended

Backward extrapolation In-sample

Page 22: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Readjust Age-Vintage-Time Functions

Compensate for the trend in H(t).

The linear specification problem is NOT unique to APC models. In fact, it appears to be the cause of scores constantly being recalibrated.

22 Prescient Models LLC © 2012 9/17/2013

F '(a) = F(a)+ c0t

G '(v) =G(v)+ c0t

H '(t)= H (t)- c0t = ciEi t( )i=1

N

å +e

Page 23: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Counter-Adjustments to Lifecycle and Credit Risk

0 12 24 36 48 60 72 84

0.0

00

0.0

01

0.0

02

0.0

03

0.0

04

0.0

05

0.0

06

auto.term.72.84

Age

Lifecycle

Fu

nctio

n

Prescient Modeling © 2012 23

Page 24: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

3. Fit the Score

Using GLM and monthly performance data, use F(a) + H(t) as a fixed offset.

The score is estimated as

glm(Bad ~ 1 + offset(F+H) +

factor1 + factor2 + …)

At this point, the factors can be either origination or behavior variables without destabilizing the probability model.

Prescient Modeling © 2012 24

Page 25: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Score Example

“Offset” captures the population odds. The rest of the model is the idiosyncratic variation – a score.

Prescient Modeling © 2012 25

glm(formula = Bad ~ 1 + offset(offset) + Bureau.Score + log(LTV) +

Subcategory + Log.Dep.Bal + Term, family = binomial, …)

Deviance Residuals:

Min 1Q Median 3Q Max

-0.2726 -0.0276 -0.0133 -0.0042 4.7835

Coefficients:

Estimate Std. Error z value Pr(>|z|)

(Intercept) 10.306961 1.367256 7.538 4.76e-14 ***

Bureau.Score -0.012886 0.001830 -7.043 1.88e-12 ***

log(LTV) 1.203598 0.546832 2.201 0.0277 *

SubcategoryNew 0

SubcategoryUsed 0.133842 0.312062 0.429 0.6680

Log.Dep.Bal -0.918796 0.187975 -4.888 1.02e-06 ***

Term -0.017266 0.008915 -1.937 0.0528 .

---

Page 26: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Outputs – The Score

The glm-avt result is still a “score” if you exclude the offset term.

It can be scaled to match scores currently in use.

It will be more robust to changes in the economy – recalibrate much less often.

It can incorporate measured adverse selection for existing loans.

It can be built across both older and more recent data history to capture more consumer dynamics.

26 Prescient Models LLC © 2012 9/17/2013

Page 27: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Outputs – The Probability

pi is the complete probability including the offset term to provide population odds over time.

It is conditional on age of the loan, “score”, and scenario for the environment.

It is available for each month, arbitrarily far into the future.

It can be aggregated over any desired time window to predict total probability.

27 Prescient Models LLC © 2012 9/17/2013

Page 28: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Scoring with and without Offset

The scoring offset = F(a) + H(t)

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

False positive rate or (1 − Specificity)

Tru

e p

ositiv

e r

ate

or

Sensitiv

ity

ROC curve for Bad ~ 1 + Open.Score + Subcategory + Source + Log.Deposit.Bal + log(LTV) + offset(offset)

Auto

Auto

Model with offset. KS = 0.49. Gini = 0.62

Model without offset. KS = 0.42. Gini = 0.56

Prescient Modeling © 2012 28

Page 29: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Out-of-Sample Validation

Prescient Modeling © 2012 29

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

False positive rate or (1 − Specificity)

Tru

e p

ositiv

e r

ate

or

Se

nsitiv

ity

ROC curve for Bad ~ 1 + Open.Score + Subcategory + Source + Log.Deposit.Bal + log(LTV) + offset(offset)

Auto

Auto

Out−of−Sample. KS = 0.49. Gini = 0.62

In−Sample. KS = 0.5. Gini = 0.63

Page 30: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Out-of-Sample Validation, with and without Offset

Prescient Modeling © 2012 30

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

False positive rate or (1 − Specificity)

Tru

e p

ositiv

e r

ate

or

Se

nsitiv

ity

ROC curve for Bad ~ 1 + Open.Score + Subcategory + Source + Log.Deposit.Bal + log(LTV) + offset(offset)

Auto

Auto

Out−of−Sample with offset. KS = 0.49. Gini = 0.62

Out−of−Sample without offset. KS = 0.4. Gini = 0.53

Page 31: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Out-of-Time Model Validation

Prescient Modeling © 2012 31

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

False positive rate or (1 − Specificity)

Tru

e p

ositiv

e r

ate

or

Se

nsitiv

ity

ROC curve for Bad ~ 1 + Open.Score + Subcategory + Source + Log.Deposit.Bal + log(LTV) + offset(offset)

Auto

Auto

Out−of−Time. KS = 0.47. Gini = 0.6

In−Time. KS = 0.48. Gini = 0.61

Page 32: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Out-of-Time Validation, with and without Offset

Prescient Modeling © 2012 32

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

False positive rate or (1 − Specificity)

Tru

e p

ositiv

e r

ate

or

Se

nsitiv

ity

ROC curve for Bad ~ 1 + Open.Score + Subcategory + Source + Log.Deposit.Bal + log(LTV) + offset(offset)

Auto

Auto

Out−of−Time with offset. KS = 0.47. Gini = 0.6

Out−of−Time without offset. KS = 0.37. Gini = 0.47

Page 33: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Forecasting

Use lifecycle and an environmental scenario to create the offset.

The model predicts loan-level, monthly probabilities.

Integrate over the product lifetime for pricing or a fixed window for account management.

Also works for behavior scoring.

Prescient Modeling © 2012 33

Page 34: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Stress Testing & Capital

No additional modeling is required for stress testing.

Insert an extreme scenario for the future H(t)

and aggregate the probabilities over the desired forecast horizon.

Through the cycle PDs can be obtained the same way, by setting H(t) = HTTC.

No difference between stress testing and capital.

Prescient Modeling © 2012 34

Page 35: Joseph L Breeden, PhD · 2017-10-04 · Drivers of Performance Origination: Consumer & loan information. Vintages: Biases for specific cohorts. Lifecycle: Changes with age of the

Contact Us

For additional information, contact:

Joseph L. Breeden, CEO

Prescient Models LLC

300 Catron St., Suite B

Santa Fe, NM 87501

+1-505-670-7670

[email protected]

9/17/2013 Prescient Models LLC © 2012 35