agency mbs prepayment model using neural networks
TRANSCRIPT
Machine Learning in Finance Workshop 2020
AGENCY MBS PREPAYMENT MODEL USING NEURAL
NETWORKS Jiawei “David” Zhang
MSCI
-
AGENCY MBS PREPAYMENT MODEL USING NEURAL NETWORKS
Columbia Bloomberg Machine Learning in Finance 2020
David Zhang
MSCI Securitized Products Research
© 2016 MSCI Inc. All rights reserved. Please refer to the disclaimer at the end of this document.
— GENERAL —
Zhang, PhD
ewYork
MANAGING DIRECTOR, HEAD OF
SECURITIZED PRODUCTS RESEARCH
212. 981. 7464
MSCII
Speaker
David Zhang is a Managing Director and Head of Securitized Products Research at MSCI. His
team is responsible for developing models and analytics to support investment analysis, risk
management, and regulatory compliance. Before joining MSCI, Dr Zhang was Managing
Director and head of Securitized Products modeling at Credit Suisse for more than a decade. At
Credit Suisse he was responsible for supporting risk, regulatory and client analytics as well as
sales/trading quantitative strategies. Dr Zhang’s group developed one of the most widely used
MBS models by fixed income institutional investors. Their work was consistently awarded top
ranking by various industry and client surveys, including Institutional Investor All-America
Research Team ranking in Agency prepayment. They also won the award for best paper by the
American Real Estate Society for research on effectiveness of government mortgage programs
The regulatory projects Dr Zhang lead at Credit Suisse included developing models for CCAR
and PPNR (Pre-Provision Net revenue), Dodd-Frank IHC (Intermediate Holding Company) and
related VaR, RWA and RBPL modeling, and FRTB (Fundamental Review of Trading
Book). Prior to Credit Suisse, Dr Zhang worked at FreddieMac, CIBC Oppenheimer, and
University of Chicago. He holds leadership positions at PRMIA (Professional Risk Management
International Association) and GCREC (Global Chinese Real Estate Congress). He is a frequent
speaker at industry and academic conferences, and his research on risk, financial modeling and
real estate has been published in many academic journals. Dr Zhang has a Ph.D. from Princeton
University.
3
SUMMARY: NEURAL NETWORKS AGENCY MBS PREPAYMENT MODEL
Why a machine learning model for Agency MBS? • Prepayment is a highly complex and non-linear process with many idiosyncratic risk factors, among the most
complex financial models
• Recent development in computational software and hardware enable us to make significant advancement in AI prepayment models
• Machine learning models have excelled in many areas, such as image recognition, natural language processing, fraud detection, etc.
What is the model and what have we learned?
• Deep neural network model applied to pool level agency MBS prepayment data, compared with MSCI1 (the
human model)
• Results show the deep learning model is able to capture very complex prepayment patterns and signals with
extremely high computational efficiency
— GENERAL — 4
MACHINE LEARNING IN FINANCE
• Consumer credit risk models via Machine-Learning Algorithms (Dr. Andrew Lo, 2010)
Using machine-learning model for consumer credit default and delinquency
Generalized classification and regression trees
Accurately forecasted credit events 3 to 12 months in advance
• Risk and risk management in credit card industry (Dr. Andrew Lo, 2016)
Analyzed very large dataset consisting of credit card data from six large banks.
Decision trees and random forests model perform better than logistic regression at short time horizon
• Deep learning for mortgage risk (Dr. Kay Giesechke, 2015-2018)
Using deep neural network to model mortgage prepayment, delinquency and foreclosure
Loan level data
Compared NNM with a logit model
— GENERAL — 5
MSCI •-
US Bond Market 2018 (42.4$Tn) Federal Agency
Securities Money Markets 1.05
Mortgage Related 9.66
US Bond Market 2007 (29.5$Tn}
Money Markets 1.79
AgencyMBS 5.80
US BOND MARKET
— GENERAL — 6
MODELING OBJECTIVE
Forecast prepayment rate for agency RMBS pools
SMM : Single Monthly Mortality Rate
CPR: Conditional Prepayment Rate
Agencies report last month’s prepayment speed on the 4th business day of each month.
Prepayment types:
─ Rate refinance
─ House turnover
─ Cash-out
─ Curtailment
─ Buyout
— GENERAL — 7
MORTGAGE PREPAYMENT MODELING: SCIENCE AND CRAFTSMANSHIP
Difficulties with mortgage prepayment modeling
• Large data sets: ~20-2000 G data, Agency MBS covers ~400,000 pools/100+mm loans performance over 20-30 years, pool/loan variables ~30-100
• Multiple, highly non-linear and interactive risk drivers (“layered risk”)
• Loan size vs. prepayment is function of moneyness
• Age vs. prepayment is function of past moneyness history
• Loan purpose (refi vs purchase) vs. prepayment is function of origination year
• ….
• Regime changes
• Mortgage credit and borrower risk appetite cycles, and business practice affect absolute level and risk drivers for prepayment/default
— GENERAL — 8
•
MSCII
• • • • • • •1•• . ,, ...
• • • .. ·- ..
I • • •
• • •
Agency MBS prepayment
• Complex behaviors
– 30–100 risk factors: rates, loan size, GEO, purpose, property, HPA….
– “layered risk”- non-linear interaction (e.g., loan size vs moneyness, purpose vs. origination year, ..)
– Regime changes: behavior, policy
– Statistical noises
• Large data set to model
– 400,000 pools/100m loans, 30yr
• Modelers as craftsman?
– Idiosyncratic modeler risk
Po
ol c
ou
nt
Mo
de
l C
PR
50
40
30
20
10
0
0 10 20 30 40 50 60 70 80
Actual CPR
Model CPR distribution
1,000
800
600
400
200
0
CPR range
90-100bps ITM FNCL 4s pool speeds
9
Architecture* Hidden
Input layer
r1 r .. :!'.1 = mox(A 1K I B,.,0) :!'.'... - mlll:'.(Ani' ,-i + Bn, 0)
Vilhtes ot hirJden lilyers ilrP. generilterJ hy Mtiviltion functions with previous layers as inputs, and weights and
biases as parameters
•Example of a Neural Network archite.cture Q Node
MSCII
Model training 1 Select a set of training data:
inputs togP.thP.r with correct/actual output
2 Pass training Al Model data through the network to obtain predicled output
T wea~ weiqttls tmcl biases using gradient function
Actual Output
3 reedforward pass: compare ilr.t ual Oll tp11t With pred i r.tP.rl output ilnd determinP. cost function based on the error rnlc::u I aterJ
Predicted MeP.ts Output Ctiteria? --~o•Y•••I
4 Backward propagation: oplirniLe weiqhls am.I I.Jiases (p,:H c1meler s) until some stopping condition has been met by passing the error si nal throu h the network usin radient function
AI agency MBS prepayment model
Deep neural network model
• Feed forward neural network – Applied successfully in many other fields
– Layers and nodes, hyper-parameters
– Ensemble techniques, bagging and boosting
• Competing vs. “human” /MSCI production model – Forecast accuracy
– New signals, new discoveries
– x100-1000 Efficiency gains: 3hrs vs. weeks/months
10
MSCII
CP
R
40
20
AI vs. “human” models: higher accuracy
8080
6060
0 0
CP
R
40
20
Au
g-0
3
Ma
r-0
4
Oc
t-0
4
Ma
y-0
5
De
c-0
5
Ju
l-0
6
Fe
b-0
7
Se
p-0
7
Ap
r-0
8
No
v-0
8
Ju
n-0
9
Ja
n-1
0
Au
g-1
0
Ma
r-1
1
Oc
t-1
1
Ma
y-1
2
De
c-1
2
Ju
l-1
3
Fe
b-1
4
Se
p-1
4
Ap
r-1
5
No
v-1
5
Au
g-0
3
Se
p-0
4
Oc
t-0
5
No
v-0
6
De
c-0
7
Ja
n-0
9
Fe
b-1
0
Ma
r-1
1
Ap
r-1
2
Ma
y-1
3
Ju
n-1
4
Ju
l-1
5
Au
g-1
6
Se
p-1
7
De
c-0
3
Ja
n-0
5
Fe
b-0
6
Ma
r-0
7
Ap
r-0
8
Ma
y-0
9
Ju
n-1
0
Ju
l-1
1
Au
g-1
2
Se
p-1
3
Oc
t-1
4
No
v-1
5
De
c-1
6
Ja
n-1
8
CA NY
Actual NNM CPR NNM Hmodel
• Higher modeling accuracy – Across cohorts and multiple dimensions of risk factors
– Highly adaptive to high dimensionality and non-linearity
11
-~-----------------------...
MSCII
...... ..... ......... _ ------
AI vs. “human” models: higher accuracy
FICO SATO
30 Actual
720 740 760 780 720 740 760 780
Pre-2008 Post-2008
0
10
20
30
40
50
60
CP
R
Actual
NNM
-1.5 -1 -0.5 0 0.5 1
CLTV
25 NNM
20
15
10
5
0
CP
R
Current loan size
CP
R
35
30
25
20
30 Actual Actual
25NNM NNM
20
CP
R15
10
5
0
15
10
5
0
50,000 100,000 150,000 200,000 250,000 300,000 350,000 55 65 75 85 95 105
12
-
MSCII
,, , ,,
---------------------_,
,, ,, ,,
,,,'
I , , , , ,, ,
AI vs. “human” models: higher accuracy
Ranking based sample error tracking for FNCL 4s
4545
4040
3535
3030
CP
R
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
CP
R 25
20
25
20
15
10
15
10
5 5
0 0
Actual NNM Actual Hmodel
• Higher modeling accuracy – Across cohorts and multiple dimensions of risk factors
– Highly adaptive to high dimensionality and non-linearity
13
MSCII
CP
R
AI vs. “human” models: higher accuracy
25 • “Media effect”
– When rates hit 20 historical low, new & lower coupons ramp
15 up faster
– Highly non-linear behavior and depends 10 on multiple risk factors
5
0
Ju
l-1
2
Se
p-1
2
No
v-1
2
Ja
n-1
3
Ma
r-1
3
Ma
y-1
3
Ju
l-1
3
Se
p-1
3
No
v-1
3
Au
g-1
2
Oc
t-1
2
De
c-1
2
Fe
b-1
3
Ap
r-1
3
Ju
n-1
3
Au
g-1
3
Oc
t-1
3
De
c-1
3
Ju
l-1
2
Se
p-1
2
No
v-1
2
Ja
n-1
3
Ma
r-1
3
Ma
y-1
3
Ju
l-1
3
Se
p-1
3
No
v-1
3
3.5 4 4.5
Actual NNM Hmodel
14
MSCII
AI vs. “human” models: new signals
60 1280
70 50 10
60
40 8 50
30
-10
0
-60
-40
-20
-10 0
10
20
30
40
30 6
0
20 4 8
0
20
10
0
10 2 10
12
0 0 0 0
14
0
16
0
20
0
24
0
CP
R
CP
R
CP
R
40
75k 100k 150k 200k 250k 300k 400k 500k
Incentive (bps)
75k 100k 150k 200k 300k 400k 500k 60 bps(LHS) -60 bps(RHS)
• Accuracy vs overfitting: loan size example – Understand sensitivities of risk drivers and economic rationale
– Apply regularization to penalize overfitting
6
15
MSCII
AI vs. “human” models: new signals
Model speeds for OTM 60bps vs loan size (in thousands) Housing turnover vs loan size and year*
12 0.80%
0.70% 10
0.60% 8
0.50%
6 0.40%
0.30% 4
0.20% 2
0.10%
0 0.00% 50 100 150 200 250 300 350 400 450 500 550
2005Yr 2012Yr <= 75k 76k-150k 151k-250k 251k-350k 351k+
• Is low loan balance still safe investment for extension risk?
– Sensitivity tests for the AI model indicate relationship between loan size and housing turnover has flipped after the recession
– This is verified by Black Knight’s proprietary data
* Source: Black Knight, used with permission 16
CONCLUSION
NN model vs. “Human Model”
• Accurate forecasts and successfully flag prepayment anomalies over the study period
• Accurate model large numbers of risk factors • Accurate model highly non-linear and interactive risk factors • Highly efficient modeling process - hundreds times of
increases in modeling efficiency • Was able to find/flag prepayment signals that eluded human
models
— GENERAL — 17
ACKNOWLEDGEMENT
Joy Zhang:
Executive Director, Securitized Product Research, MSCI
Jan Zhao:
Principal, Advanced Analytics, Ernst & Young
Fei Teng:
Senior Quantitative Analysts, Quantitative Advisory Services , Ernst & Young
Siyu Lin:
Senior Quantitative Analysts, Quantitative Advisory Services , Ernst & Young
Henry Li:
Executive Director, in Quantitative Advisory Services practice, Ernst & Young
— GENERAL — 18
4: Agency MBS prepayment regimes since 2003
100
2003: historical l'ow
mortgage rates and
, refinancing wave ----FN 30 CPR/ Rate Incentive Bracket (Age= 10-30)
Mid 2004-,Mid 2006: Strong home price Mid 2007-2009: Severe
appreciation, (HPA), fast housing turnovet", high cash - home price depreciation
out refinancing and fast discount prepayment speed (HPD), low housing turnover,
2009: historical low mortgage
rates, but muted refinancing
_response.High premium coupons
prepay ever slower than, lower
premium coupons, because of 80 -----~--------------st------------ no cash out refinancing and
60 ct: Q. 0 40
20
0
Jan-03
MSCI •-
Aug-03 Mar-04
slow rate tet"m refinancing
Nov-04 Jun-05 Jan-06 Sep-06 Apr-07 Nov-07
Month
-(-50,-25) -(75,100) -(150,175)
- much tighter lending standard for
Jul-08 feb-09 Oct-09
MORTGAGE PREPAYMENT MODELING: SCIENCE AND CRAFTSMANSHIP
— GENERAL — 20
Exhibit 5: HARP CL TV curve history and long term model assumptions The CL TV curve represents the ratio of refinance speeds across CL TV spectrum, using sub-50 CL TV cohort as benchmark, with all other pool variables (for example, loan size, moneyness, FICO, etc.) holding constant
1.80
1.60
1.40
1.20
1.00
0.80
0.60
0.40
0.20
0.00 60
MSCI •-
70 80 90
Current LTV
-model(noHARP)
-model(Long Term)
--HARP1
- -201202
--201203
-201212
---•201204
-201206
~--------- - -201503
100 110 120
MORTGAGE PREPAYMENT MODELING: SCIENCE AND CRAFTSMANSHIP
The HARP program caused temporary inversion of the CLTV Prepayment Curve
HARP: Home Affordable Refinance Program
NNM 21 — GENERAL —
Q. u
100
80
60
40
20
0
40
30
!f 20 u
10
0
MSCI
V2002 Major Premium Coupons At 2003 Refi Wave In 2003 refi wave, nigher coupons prepay raster.
But in 2009 refi wave, higher coupons prepay lower e weake ccediLQi!J:Ual!!!!!J. f,.OQ!f..hd}!li ~rh~e:r..---===t~--'---------.....;::.,.. _________________ _
coupons was not much an issue in 2003's refi, but
rt become a big refi hurdle in 2009's refi.
Jan-03 Feb-03 Mar-03 Apr-03 May-03 Jun-03 Jul-03 Aug-03 Sep-03 Oct-03 Nov-03 Dec-03
Month
-5.5-6-6.5
V2008 Major Premium Coupons At 2009 Refi Wave
In 2003 refi wave,.,bigber cou oos rel'!!',,y.,fa.,,st,,e.,_c..,Be,,ut"-'-"inc,..--2.e.0,.,0..,9,_r._,e.:.,fi..,w"a"v_,.,e ___________________________ _
higher coupons prepay slower. The weaker credrt quality of higher coupons was not much an issue in 2003"s refi. but it become a big refi
hurdle in 2009's refi.
Jan-09 Feb-09 Mar-09 Apr-09 May-09 Jun-09 Jul-09 Aug-09 Sep-09 Oct-09
Month
-5.5-6-6.5
MORTGAGE PREPAYMENT MODELING: SCIENCE AND CRAFTSMANSHIP
— GENERAL — 22
35
30
25
.. C. 20 -u
15
1 0
5
0 "' "' ~ .. 9 0 0 0 a
.!.. I ' I
C: >, a. > Ill Ill (\) :::I
(l) 0 ..., :2i: ~ ...,
(/) z
FN30 6.0 V2004 CPR / Fico, (owner occ clsz=1 G0K-150 K oltv < 80)
■ Seasoned loan's FICO lending environment
■ :FICO beoomes less important after a few years' seasoning
IO I,() co <O <O r-- r-- r-- r-- r-- r- a:> co a 0 0 0 0 ~ 0 0 q 0 a a 0 0 0 0 0 ~ I .!.. I I I I
....!. > I .'.. I I I I ' ,c: >- a. > >- a. C: >- a. > C: (ti Ill Ill :::I (l) 0 (ti Ill :::I (l) 0 (ti (ti Ill :::I
(II 0 (\) (ti ..., :2i: ~ ...,
(/) z :2i: ~ ...,
(/) z ..., :a :!? ...,
(/J z ..., :a Ill 01 ,ti 1
■ Higher FICO prepays faster for new mortgages l--+-680-120 I ----> 720
MSCI •-
OJ a 0 0 0 9 a I I I ..!. ...!. >- a. > >, (ti QI 0 (\) <'O :,
:!? (/J z :i ~ ...,
MORTGAGE PREPAYMENT MODELING: SCIENCE AND CRAFTSMANSHIP
— GENERAL — 23
MORTGAGE PREPAYMENT MODELING: SCIENCE AND CRAFTSMANSHIP
Example of modeling: Assume ppm (pool, time) = f(X1, X2, X3,…. Xn) …
start by assuming separable risk factor: ppm= f1(x1)*f2(x2)…. Until (often) proven incorrect…
estimating f1(x1) by “building cohort”, by bucketing loans/pools for groups of x1, but similar x2, x3….
(this further assumes quasi linear property of x2, x3…. Average(f2(x2) f3(x3)…)= f2(ave(x2))* f3(ave(x3))….
….. Checking overall fit after all Xn are fitted, adding extra variables to deal with non-linear and interactive variables… this often does not lead to convergence …
• Time consuming and non-standard approaches • Experience and step-by-step / regime-by-regime progress are valued
• Can new techniques of AI modeling provide the much needed disruption?
— GENERAL — 24
neural network model
Architecture*
Input layer
!:1 !n !'.1 = mllll(tl.1;! I 8 1,0) :!:'.,, - m111<(A11r·,-1 + 8 11, 0)
Values ot hidden layP.rs are ~enerated hy ar.tivation futtctiu11s with previous layers as inputs, and weights and
biases as parameters
•Exampfe of a Neural Network architecture Q Node
MSCI •-
Mode~ training 1 Select a set of training data:
inputs tog~thP.r with correct/actual output
2 Pass trail'linq Al Model data through the network to obtain predidet.l output
t Tweak weiqt1Ls i:mu Ltiases using gradient function
Actual Output
Predicted Output
F eedforwa rd pass: compare ac:tual outp11t with predicted out rut and dP.termine cost function based on the error calqtlnted
MeP.ts Criteria?
~Ye• ·I 4 Backward propagation: oµlfrnize weiqhls am.I uiases (par c1meter s)
until some stopping condition has been met by passing the error signal through the network using gradien function
FEED FORWARD NEURAL NETWORK
Network architecture:
Layers and nodes
Hyper-parameters Batch size, number of nodes, learning rate, max-norm constraint, dropout rate
Ensemble techniques: Bagging: minimum MSE of different realizations and neural networks Boosting: Fine tune a neural network via changing a few hyper-parameters
— GENERAL —
MSCI
~ C: ro E .g (I) a...
Data Size
Large NN
Med um NN
Sma
Traditional ML algorith Statisbcal Leaming ms I
GENERAL
TRADITIONAL VS.. DEEP MACHINE LEARNING
Identify problems
Set benchmarks
Train models Compare
performance Choose model
Optimize model
Deploy
Traditional learning algorithm Deep Learning
Pros Cons Pros Cons
Works better on smaller data
Hard to scale state-of-the-art for certain domains, such as computer vision and speech recognition.
require large amount of data.
Financially and computationally cheap
Lack of variability Perform very well on image, audio, and textual data, Easily updated with new data
Not suitable for classical machine learning problems.
Algorithms are easier to interpret, have more theories to back them up
Labor intensive model maintenance
Versatile architecture and low overhead maintenance
Computationally intensive to train, and they require much more expertise to tune
— —
X1 W1 y(t) y(l) y(2) y(3)
X2
X3 out w =
wi Wo x(t) x(l) x(2) x(3)
Xo ~ Xi
MSCI •-
NEURAL NETWORKS MODEL
Feed forward neural network (FNN)
the information moves in only forward direction from the input nodes to the output nodes. There are no cycles or loops in the network.;
Deep FNN consists of tens of layers and thousands of nodes; the simplest kind of FNN is logistic model
Feedforward Neural Network Recurrent Neural Network Logistic Model
Recurrent Neural Network (RNN)
A class of neural networks exploit the historical input sequences. Such inputs could be text, speech, time series, and anything else where the occurrence of an input in the sequence is dependent on the inputs that appeared before it
Motivation: Not all problems can be converted into one with fixed length inputs and outputs, such as text translation, speech recognition or time-series; predictions require a system to store and use context information
The input at time t include both the attributes at t and the intermediate values containing history at t-1.
— GENERAL —
BUILDING NEURAL NETWORK MODEL
EDA Feature selection
Build model
Performance Evaluation
1, sanity check 1,information value 1,error tracking 1. link weights 2. data cleansing 2.correlation matrix 2.sensitivity 2. hyper-parameters 3. data transformation 3.domain knowledge 3. transparency tool (deeplift)
Deep neural network fitting
2003-2018 30yr agency MBS data (~25G data)
30+ input variables: pool attributes, macro-economic variables
To reduce complexity, we added incentive, 1 regime indicators, and 1 policy indicator (HARP)
Cost function of RMS error of pool level prepayment
1 round of fitting can be completed in ~ 3 hours on a GPU machine
— GENERAL —
MODEL DRIVERS
WALA Weighted Average Loan Age
WAC Weighted Average Coupon
CLNSZ Current Average Loan Size
OLTV Original Loan to Value
Refi% Percentage of Refinanced Loans by UPB
SecHome% Percentage of Second Home Loans by UPB
MultiFamily% Percentage of Muti Family Loans by UPB
Investor% Percentage of Investor Loans by UPB
TPO% Percentage of Third party origination by UPB
AOL Original Average Loan Size
LNSZ_Q4 Max original loan size
LNSZ_Q3 Max original Loan Size - 3rd Quartile
LNSZ_Q1 Max original Loan Size - 1st Quartile
Geo_CA% Percentage of California Loans by UPB
Geo_FL% Percentage of Florida Loans by UPB
Geo_TX% Percentage of Taxas Loans by UPB
Geo_NY% Percentage of New York Loans by UPB
Geo_NE% Percentage of New England Region Loans by UPB
Geo_NO% Percentage of North Region Loans by UPB
Geo_SO% Percentage of South region Loans by UPB
Geo_PC% Percentage of Pacific region Loans by UPB
Geo_AT% Percentage of Atlantic region Loans by UPB
Geo_NONUS% Percentage of non-US region Loans by UPB
Seasonality Calendar month
Independent variables
Incentive WAC - Mortgage Rate(t)
Rolling Incentive Average Incentive ( 20month)
Loan size dispersion (LNSZ_Q3-LNSZ_Q1)/AOL
SATO Spread-at_origination = WAC - Mortgage Rate(0)
HPA House Price Appreciation ( HPI(t)/HPI(0)-1 )
HARP-able
1: IssueMonth <= Jun. 2009 and factor date between Mar. 2009
and Dec. 2011
2: IssueMonth <= Jun. 2009 and factor date > Dec. 2011
HARP-ed Refi% = 100 and OLTV > 80 and issueMonth > Jun. 2009
Underwritting standard 0: before 2008, 1: after 2008
cBal Current Balance
Prepayment speed Prepayment speed in SMM
Derived Variables
Weight
Dependent Variable
— GENERAL — 29
~ tn n -
• l> ri ,-+ C ~
z z ~
I-' 0 0
Aug-03
Apr-04
Dec-04
Aug-05
Apr-06
Dec-06
Aug-07
Apr-08
Dec-08
Aug-09
Apr-10
Dec-10
Aug-11
Apr-12
Dec-12
Aug-13
Apr-14
Dec-14
Aug-15
CPR I',.) w .i:,. V, 0-, -...J 0 0 0 0 0 0
AG
ENC
Y 30
YR U
NIV
ERSE SP
EEDS ER
RO
R TR
AC
KIN
G
• Train
ing D
ata: 20
03
– 20
15
Dec. R
and
om
samp
le 10
% p
oo
ls. •
Error trackin
g is generated
usin
g ou
t-of-th
e-samp
le po
ols.
— G
ENER
AL —
3
0
MSCI •-
er:: c.. u
---- Actual --NNM ---Hmodel
OUT-OF-SAMPLE FORECASTS
• True out-of-time and out-of-sample test. • Overall fitting is good in out-of-sample test • Missed the refi wave in second half of 2016
— GENERAL — 31
-----------
MSCI •-
MODEL RISK FACTORS C
PR
30
25
20
15
10
5
0
0
10
20
30
40
50
60
CP
R
-1.5 -1 -0.5 0 0.5 1
720 740 760 780 720 740 760 780
Pre-2008 Post-2008
FICO SATO Actual NNM
Actual NNM
30
35 25
30 20
25 C
PR
20
15
15
CP
R
10
5
10
5
0
50000 100000 150000 200000 250000 300000 350000
Current Loan Size
0
55 65 75
CLTV
85 95 105
Actual NNM Actual NNM
— GENERAL — 32
30
25
o::: 20 c.. u 15
10
5
0 0.4
MSCI •-
-~~-~ ... ...... ...... ---~"" ........
0.6 0.8
...... ......
1
...... ...... ..... ..... .... .... ....
-----Actual
--NNM
1.2 1.4
Rolling Incentive
1.6
MODEL BURNOUT
NNM and actual prepayment speeds against average incentive in prior 20 months
— GENERAL — 33
~ tn n -
n )>
n ""Cl ;;o
z z s
I I 3 0 a. (1)
z -<
Jun-09
Apr-10
Feb-11
Dec-11
Oct-12
Aug-13
Jun-14
Apr-15
Feb-16
Dec-16
Oct-17
Oct-0
Aug-0
Jun-OS
Apr-06
Feb-07
Dec-07
Oct-08
Aug-09
Jun-10
Apr-11
Feb-12
Dec-12
Oct-13
Aug-14
Jun-15
Apr-16
Feb-17
Dec-17
CPR ~ N W ~ ~ m ~ 00
0 0 0 0 0 0 0 0 0
MO
DEL P
OO
L VA
RIA
BLES V
S “HU
MA
N”M
OD
EL
NN
M an
d H
mo
del Erro
r Tracking again
st State Variab
les
NN
M accu
rately captu
red state
-level prep
aymen
t beh
aviors
— G
ENER
AL —
3
4
45
40
35
30
a:: 25 Cl. u 20
15
10
5
---- ------,,' _,
, , , ,
I I
I I
I ,
0-+-----~~-~----~~-~----~~-1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
----Actual -NNM
MSCI •-
45
40
35
30
a:: 25 Cl. u 20
15
10
5
0
,, , ,, ----
--..... --------,
------, , , ,
I , I
I I
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
----Actual -Hmodel
MODEL POOL VARIABLES VS HUMAN MODEL
Ranking-Based Sample Error Tracking for Coupon 4s
• Ranking based error tracking methodology provides a comprehensive measure of model accuracy across all pool variables
• NNM performed better than Hmodel
— GENERAL — 35
MSCI •-
50
40
50
40
o::: 30 Q..
u 20
----Actual -NNM
1 4 7 1013 3 6 9 1215 2 5 8 1114 1 4 7 1013 3 6 9 1215 2 5 8 1114
3.5 4 3.5 4 3.5 4
Jun-13 Jun-16 Mar-18
----Actual -Hmodel
1 4 7 1013 3 6 9 1215 2 5 8 1114 1 4 7 1013 3 6 9 1215 2 5 8 1114
3.5 4 3.5 4 3.5 4
Jun-13 Jun-16 Mar-18
MODEL POOL VARIABLES VS HUMAN MODEL
Sample ranking-based error tracking at different time point
— GENERAL — 36
60
55 ' ' 50 ' '" ... -, 45 ';x:::J ~ ~ c.:: -Q. 40 ,
' u 35
30
25
20
60 80 9010 12 60 80 9010 12 60 80 9010 12
2012Q1 2012Q2 2012Q3 2012Q4
CLTV
----Actual NNM
MSCI •-
MODEL HARP EFFECTIVENESS
Error Tracking against HARP effectiveness across CLTV Cohorts
NNM is able to pick up the general trend of HARP effectiveness but missed the complexity of its revolution
Back 37
— GENERAL —
80
70
60
so f 40 u
30
20
10 l ...... ,.~~~~,,,t.,..... 0 ---t---.----.----.---.--.----.----.~~~~~~~--.----.---~
MSCI •-
-100 -40 -10 10 30 60 100 140 200
Incentive (bps)
~7Sk -100k-a-1SOk~200k
-300k-+-400k-SOOk
60 12
so 10
40 8
f 30 6 a: Cl.
u u
20 4
10 2
0 ---t-----.-~~~~--.---~--.-----+ 0
~ _c-.'4.-~'4.- r;:f- ~'4.- f::)'4.-_c-.'4.-r;:f-"\ ""\)-~ '"1-o '"I-<-, ,,_,o ~\)-~o
-60 bps(LHS} -.--60 bps(RHS}
MODEL SENSITIVITY
Model prepayment sensitivity to loan sizes and refinance Incentives
NNM captured the prepayment behavior for loan size
— GENERAL — 38
~ tn n -
w 0 -< :-, "Tl
X m a.. ~ 0 -, r-t-
C]Q QJ
C]Q m :::0 QJ r-t-m
Jan-10
Jun-10
Nov-10
Apr-11
Sep-11
Feb-12
Jul-12
Dec-12
May-13
Oct-13
Mar-14
Aug-14
Jan-15
Jun-15
Nov-15
Apr-16
Sep-16
Feb-17
Jul-17
Dec-17
May-18
w w .i:,. .i:,. V, V,
0 V, 0 V, 0 V,
w 0 < :"I 'Tl ~-c.. s 0 ""'II ,...
OQ Q)
OQ t1)
::a Q) ,... t1)
“MED
IA EFFEC
T”
Co
hort
Ob
servation R
angeC
PR
WA
LA
SA
TO
CL
TV
CL
NS
ZIncentive
FIC
OA
vg.UP
B(b
n)
FH
3.5
20
11
Jul.12
- Dec. 1
21
6.1
13
-577
212258
52
770
2.9
1
FH
4 2
010
No
v. 11
- Feb
. 12
13.9
15
378
201901
45
767
6.2
6
FH
3.5
20
11
Jul.20
12
- Dec. 1
22
1.9
12
-376
235847
50
770
4.0
4
FH
4 2
010
No
v. 11
- Feb
. 12
16.4
16
378
224734
45
765
8.6
6
FH
3.5
20
11
Jul.12
- Dec. 1
22
9.2
12
-266
216270
54
771
7.3
1
FH
4 2
010
No
v. 11
- Feb
. 12
15.3
15
11
70
208962
52
766
30.8
9
FH
3.5
20
11
Jul.12
- Dec. 1
24
6.1
12
-864
269298
46
773
9.5
8
FH
4 2
010
No
v. 11
- Feb
. 12
26.2
15
269
245496
44
767
23.0
2
Refi/T
PO
FH
20
11
3.5
vs 20
10
4 co
mp
arisons, acro
ss TP
O/R
etail and R
efi/Purchase co
mb
inations
Purchase/R
etail
Purchase/T
PO
Refi/R
etail
20
11
3.5
s and
20
10
4s p
repaym
ent sp
eeds
are com
pared
across lo
an attrib
utes, lo
an
pu
rpo
se and
origin
ation
chan
nel
3.5
s is mu
ch faster th
an 4
s given sim
ilar lo
an attrib
utes an
d in
centive
— G
ENER
AL —
3
9
~ tn n CPR
f-'> f-'> N N - 0 V, 0 V, 0 V,
~h Jul-12
Oct-12
Jan-13
' VJ Apr-13
)> V, ("') Jul-13 -C (l)
Oct-13
Sep-12
z Dec-12 z Mar-13 ~ .i:,.
Jun-13
Sep-13
Dec-13 I
Aug-12 3 0
Nov-12 Q. ~
.i:,. Feb-13 V,
May-13 I ""> "'~ Aug-13
Nov-13
CPR f-'> f-'> N N VJ
0 V, 0 V, 0 V, 0
--1
May-16 Sep-16
VJ Jan-17 ' V, )> May-17
("') - Sep-17 C (l)
Jan-18
Apr-16
z Aug-16
z Dec-16 ~ .i:,. Apr-17
Aug-17 Dec-17 Ai;ir-18
I Mar-16 3 0 Jul-16 Q. ~-, ~ Nov-16 s.
.i:,. V, Mar-17
Jul-17 ~i-Nov-17
Mar-18 ~, -..
MO
DEL “M
EDIA
EFFECT
”
20
12
Vin
tage in 2
01
2 refin
ance w
ave 2
01
5 V
intage in
20
16
refinan
ce wave
— G
ENER
AL —
4
0
~ tn n -
..., ...,.. N Nm ..... rvw.::..v,~" a,
! """" 0 • 0 • 0 • H I ::::f I ff I 11 I H 20160201 3 "' 20040401 3 "
20160301 §: ~ 20040801 §: r;'
20160401 ~ 8 20041201 0 ~ " 20050401 ,l ~
20160501 20050801
20160601 20051201
20160701 20060401 20060801
20160801 20061201
20160901 20070401
20161001 2oo7os01 20071201
20161101 20080401
20161201 20080801
20170101 20081201 20090401
20170201 20090801
20170301 20091201
20170401 20100401 20100801
20170501 20101201
20170601 20110401
20170701 20110801 20111201
20170801 20120401
20170901 20120801
20171001 20121201 20130401
20171101 20130801
20171201 20131201
20180101 20140401
20180201 20140801 20141201
20180301 201S0401
20180401 201S0801 20151201
I ! I I I i ~ ~ s: ► Q. i; Q ~ ;!; ;:; it &. -,:, -o r, n :::0 ;Q "0 "0
:,:, :,:,
MO
DEL ER
RO
R TR
AC
KIN
G
In-tim
e O
ut-o
f-samp
le (1
/20
03
-12
/20
15
)
1.
All attrib
utes statistics are very
close o
n Ju
ly and
Au
gust 2
01
6
except C
PR
. 2
. R
isk driver is m
issing, i.e., m
edia
effect or regim
e chan
ge
Ou
t-of-tim
e O
ut-o
f-samp
le (1
/20
16
-4/2
01
8)
— G
ENER
AL —
4
1
~ tn n -
;'~O ONAO'!OO
" "'6' -----,v 0,._,,AC"+COO
!!. .. ✓• ~ rl q, ,. o✓•
6''Y, rl q, o.,,_.o
rl :;,~ o.,,..,,
orl rl ~ o.,,
->o.,,. rl ~ o.,,
:..>4. rl ~ o.,,
->a ~ ~ o.,,..,,
rl~ o.,,_.o rl :>~ o.,,..,, ~
rl ~ o.,, .>~
rl v.,, o.,,:.>
rl ,l~ o.,, ~
rl ,l~ o.,,:.>
'Y, rl ~ o✓, d'o ~ :;,~ o✓•
d'orl rl ~ o✓,
d'o.,,. ~ ~ o✓•
d'q,. ~
)
/ vj ~v
j~
\ I
'I
~
f
I I ' ;:: ► ll
~ [ ,.., ,.., .,, .,, :,0 :,0
V /
)
J
~ I IL <
i f I . j f
! oo~~~~E~>
s. 20030801 ~ I I I I I "I I i M 20040101
• 20040601
20041101
20050401
20050901
20060201
20060701
20061201
20070501
20071001
20080301
20080801
20090101
20090601
20091101
20100401
20100901
20110201
20110701
20111201
20120501
20121001
20130301
20130801
20140101
20140601
20141101
20150401
20150901
20160201
20160701
I I f .:: >
i [ ,.., ,.., .,, .,, :,0 :,0
MO
DEL ER
RO
R TR
AC
KIN
G
In-tim
e O
ut-o
f-samp
le (1
/20
03
-10
/20
16
)
Ou
t-of-tim
e O
ut-o
f-samp
le (1
1/2
01
6-4
/20
18
)
— G
ENER
AL —
4
2
~ tn n -
I :::o, °" .. • • " " • • • H I ::::f j . . . j j i H "o,., ~ 20040601 ~ •
"0.t_,0 !i ~ 20041101 1! I~ .,0 "o,., i '§ 20050401 ~ 1'§
..-_,od 20050901 .>0 .,,., 20060201
"'>o.,,0 20060101
<'o " 20061201 .,_, 40 20070501 ~... ~~
d 0-£>,., 20080301 .,,.,_, 20080801
.>0 %,., 20090101
">o_,. 20090601
<'o ".t 20091101 ;,_, ~ 20100401
"o,.,_, " 20100901 d' rn~ ~ ~~
d 'b✓ 20111201 0 .,.,,., 20120501
~~ ~~ "'>,.,.,o 20130301
<'o " 20130801 "e. 0.to 20140101 d ;, ".tc90 20140601
d ?o✓ 20141101
".tc90 20150401 ::,i
~ ~ 2M50901
"d'4; 20160201 0,., 20160701
0 1 • -1,(
I If I I l ;:: l> f l> 2 ll ll &" & & ;:;r, nn -i::i ""Cl "Cl ~ :,:i,::, :,::,;o
MO
DEL ER
RO
R TR
AC
KIN
G
In-tim
e O
ut-o
f-samp
le (1
/20
03
-10
/20
16
)
Wh
en In
crease weigh
ts on
8
/20
16
– 10
/20
16
by 4
0 tim
es in
trainin
g: 1
. B
etter in th
e early stage o
f ou
t-of-tim
e test 2
. Sacrifice o
ther p
eriod
.
Ou
t-of-tim
e O
ut-o
f-samp
le (1
1/2
01
6-4
/20
18
)
— G
ENER
AL —
4
3
MSCI
Budcet Value • Coupon •
Aclllal CPR Model CPR
70 ..-------------------------------------
60 +-------------------------------------50-tt-------------------------------------
40 ------------------------------------------
30 +1-~1----------------,..----':'"'ll,,-r-----------
20 t--'V--bl5H~t::----------l~Ht-r-#--¼-----,1i------¼----,,.----k--
10 -l------¥~'1¥-!!ilic::,l-~+---------...:!l..,.~l!-....::l"'---\,,ey.
0 tt,Tmmmm"lffl'ITTTTTmTTmmmm....,'TfflTTTTTTffl"IT'mmmm'fflTTTTTTTmTTmmmm"lffl'ITTTTTfflTTfflTTmm....,TTTTTTTTTTmmmmmm'fflTTfflffmmm"f
Fac!O<Oale •
Budcet Value • Coupon •
Aclllal CPR Model CPR
12
8
6+-------------------------------------4
0+--------------~-----~------~------~------~ 20171101
Fac!O<Oale •
20171201 20180101 20180201 20180301 20180401
Values
--Actual CPR
-Model CPR
Values
--Actual CPR
-Model CPR
MODEL ERROR TRACKING
In-time Out-of-sample (1/2003-10/2017)
Out-of-time Out-of-sample (10/2017-4/2018)
— GENERAL — 44
MSCII
About MSCI
MSCI is a leading provider of critical decision support tools and services for the
global investment community. With over 45 years of expertise in research, data
and technology, we power better investment decisions by enabling clients to
understand and analyze key drivers of risk and return and confidently build
more effective portfolios. We create industry-leading research-enhanced
solutions that clients use to gain insight into and improve transparency across
the investment process. To learn more, please visit www.msci.com.
45
MSCII
Contact us
AMERICAS EUROPE, MIDDLE EAST ASIA PACIFIC & AFRICA
Americas +1 888 588 4567 * Cape Town + 27 21 673 0100 China North 10800 852 1032 *
Atlanta + 1 404 551 3212 Frankfurt + 49 69 133 859 00 China South 10800 152 1032 *
Boston + 1 617 532 0920 Geneva + 41 22 817 9777 Hong Kong + 852 2844 9333
Chicago + 1 312 675 0545 London + 44 20 7618 2222 Mumbai + 91 22 6784 9160
Monterrey + 52 81 1253 4020 Milan + 39 02 5849 0415 Seoul 00798 8521 3392 *
New York + 1 212 804 3901 Paris 0800 91 59 17 * Singapore 800 852 3749 *
San Francisco + 1 415 836 8800 Sydney + 61 2 9033 9333
São Paulo + 55 11 3706 1360 Taipei 008 0112 7513 *
Toronto + 1 416 628 1007 Thailand 0018 0015 6207 7181 *
Tokyo +81 3 5290 1555
• = toll free
msci.com [email protected]
46
MSCII
Notice & disclaimer
This document and all of the information contained in it, including without limitation all text, data, graphs, charts (collectively, the “Information”) is the property of MSCI Inc. or its subsidiaries (collectively, “MSCI”), or MSCI’s licensors, direct or indirect suppliers or any third party involved in making or compiling any Information (collectively, with MSCI, the “Information Providers”) and is provided for informational purposes only. The Information may not be modified, reverse-engineered, reproduced or redisseminated in whole or in part without prior written permission from MSCI.
The Information may not be used to create derivative works or to verify or correct other data or information. For example (but without limitation), the Information may not be used to create indexes, databases, risk models, analytics, software, or in connection with the issuing, offering, sponsoring, managing or marketing of any securities, portfolios, financial products or other investment vehicles utilizing or based on, linked to, tracking or otherwise derived from the Information or any other MSCI data, information, products or services.
The user of the Information assumes the entire risk of any use it may make or permit to be made of the Information. NONE OF THE INFORMATION PROVIDERS MAKES ANY EXPRESS OR IMPLIED WARRANTIES OR REPRESENTATIONS WITH RESPECT TO THE INFORMATION (OR THE RESULTS TO BE OBTAINED BY THE USE THEREOF), AND TO THE MAXIMUM EXTENT PERMITTED BY APPLICABLE LAW, EACH INFORMATION PROVIDER EXPRESSLY DISCLAIMS ALL IMPLIED WARRANTIES (INCLUDING, WITHOUT LIMITATION, ANY IMPLIED WARRANTIES OF ORIGINALITY, ACCURACY, TIMELINESS, NON-INFRINGEMENT, COMPLETENESS, MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE) WITH RESPECT TO ANY OF THE INFORMATION.
Without limiting any of the foregoing and to the maximum extent permitted by applicable law, in no event shall any Information Provider have any liability regarding any of the Information for any direct, indirect, special, punitive, consequential (including lost profits) or any other damages even if notified of the possibility of such damages. The foregoing shall not exclude or limit any liability that may not by applicable law be excluded or limited, including without limitation (as applicable), any liability for death or personal injury to the extent that such injury results from the negligence or willful default of itself, its servants, agents or sub-contractors.
Information containing any historical information, data or analysis should not be taken as an indication or guarantee of any future performance, analysis, forecast or prediction. Past performance does not guarantee future results.
The Information should not be relied on and is not a substitute for the skill, judgment and experience of the user, its management, employees, advisors and/or clients when making investment and other business decisions. All Information is impersonal and not tailored to the needs of any person, entity or group of persons.
None of the Information constitutes an offer to sell (or a solicitation of an offer to buy), any security, financial product or other investment vehicle or any trading strategy.
It is not possible to invest directly in an index. Exposure to an asset class or trading strategy or other category represented by an index is only available through third party investable instruments (if any) based on that index. MSCI does not issue, sponsor, endorse, market, offer, review or otherwise express any opinion regarding any fund, ETF, derivative or other security, investment, financial product or trading strategy that is based on, linked to or seeks to provide an investment return related to the performance of any MSCI index (collectively, “Index Linked Investments”). MSCI makes no assurance that any Index Linked Investments will accurately track index performance or provide positive investment returns. MSCI Inc. is not an investment adviser or fiduciary and MSCI makes no representation regarding the advisability of investing in any Index Linked Investments.
Index returns do not represent the results of actual trading of investible assets/securities. MSCI maintains and calculates indexes, but does not manage actual assets. Index returns do not reflect payment of any sales charges or fees an investor may pay to purchase the securities underlying the index or Index Linked Investments. The imposition of these fees and charges would cause the performance of an Index Linked Investment to be different than the MSCI index performance.
The Information may contain back tested data. Back-tested performance is not actual performance, but is hypothetical. There are frequently material differences between back tested performance results and actual results subsequently achieved by any investment strategy.
Constituents of MSCI equity indexes are listed companies, which are included in or excluded from the indexes according to the application of the relevant index methodologies. Accordingly, constituents in MSCI equity indexes may include MSCI Inc., clients of MSCI or suppliers to MSCI. Inclusion of a security within an MSCI index is not a recommendation by MSCI to buy, sell, or hold such security, nor is it considered to be investment advice.
Data and information produced by various affiliates of MSCI Inc., including MSCI ESG Research LLC and Barra LLC, may be used in calculating certain MSCI indexes. More information can be found in the relevant index methodologies on www.msci.com.
MSCI receives compensation in connection with licensing its indexes to third parties. MSCI Inc.’s revenue includes fees based on assets in Index Linked Investments. Information can be found in MSCI Inc.’s company filings on the Investor Relations section of www.msci.com.
MSCI ESG Research LLC is a Registered Investment Adviser under the Investment Advisers Act of 1940 and a subsidiary of MSCI Inc. Except with respect to any applicable products or services from MSCI ESG Research, neither MSCI nor any of its products or services recommends, endorses, approves or otherwise expresses any opinion regarding any issuer, securities, financial products or instruments or trading strategies and MSCI’s products or services are not intended to constitute investment advice or a recommendation to make (or refrain from making) any kind of investment decision and may not be relied on as such. Issuers mentioned or included in any MSCI ESG Research materials may include MSCI Inc., clients of MSCI or suppliers to MSCI, and may also purchase research or other products or services from MSCI ESG Research. MSCI ESG Research materials, including materials utilized in any MSCI ESG Indexes or other products, have not been submitted to, nor received approval from, the United States Securities and Exchange Commission or any other regulatory body.
Any use of or access to products, services or information of MSCI requires a license from MSCI. MSCI, Barra, RiskMetrics, IPD and other MSCI brands and product names are the trademarks, service marks, or registered trademarks of MSCI or its subsidiaries in the United States and other jurisdictions. The Global Industry Classification Standard (GICS) was developed by and is the exclusive property of MSCI and Standard & Poor’s. “Global Industry Classification Standard (GICS)” is a service mark of MSCI and Standard & Poor’s.
47