supervised topic models - computer sciencejbg/teaching/csci_7000/06c.pdfsupervised topic models...

71
Supervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 1 of 29

Upload: others

Post on 31-May-2020

11 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Supervised Topic Models

Advanced Machine Learning for NLPJordan Boyd-GraberHIERARCHIES

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 1 of 29

Page 2: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Motivation: Representing Elected Officials with Ideal Points

An essential tool in political science: distinguish trends and characterizesubgroups

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 2 of 29

Page 3: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Evaluation: Tea Party in the House

The Tea Party• American political movement for freedom, small government, lower tax• Disrupting Republican Party and recent elections• Organizations:

◦ Institutional: Tea Party Caucus◦ Other: Tea Party Express, Tea Party Patriots, Freedom Works

• “Conventional views of ideology as a single–dimensional, leftÂU-rightspectrum experience great difficulty in understanding or explaining theTea Party.”

Goal• Explain Tea Partiers in terms of issues and votes• Identify Tea Partiers from their rhetoric

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 3 of 29

Page 4: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Not everyone has a voting record

• Ideal points estimated based onvoting record

• Not all candidates have a votingrecord

◦ Governors◦ Entertainers◦ CEOs

• But all politicians—bydefinition—talk

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 4 of 29

Page 5: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Not everyone has a voting record

• Ideal points estimated based onvoting record

• Not all candidates have a votingrecord

◦ Governors◦ Entertainers◦ CEOs

• But all politicians—bydefinition—talk

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 4 of 29

Page 6: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Let’s use whatever data we have

A single model that uses:

• Bill text

• Votes

• Commentary

to map political actors to thesame continuous space.

Thiswork: congressional floorspeeches

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 5 of 29

Page 7: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Let’s use whatever data we have

A single model that uses:

• Bill text

• Votes

• Commentary

to map political actors to thesame continuous space. Thiswork: congressional floorspeeches

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 5 of 29

Page 8: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

Outline

1 Ideal Point Review

2 Hierarchical Ideal Point Topic Model

3 Predicting Membership

4 How They Vote

5 How They Talk

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 6 of 29

Page 9: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

One-dimensional Ideal Point using Votes

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 7 of 29

Page 10: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

One-dimensional Ideal Point using Votes

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

p(va,b = Yea) = �(uaxb + yb)

Legislator a votes 'Yea' on bill b with probability

�(↵) =exp(↵)

exp(↵) + 1

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 7 of 29

Page 11: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

One-dimensional Ideal Point using Votes

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

p(va,b = Yea) = �(uaxb + yb)

Legislator a votes 'Yea' on bill b with probability

One-dimensional ideal point of legislator a

Polarity of bill b Popularity of bill b

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 7 of 29

Page 12: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

One-dimensional Ideal Point using Votes

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

p(va,b = Yea) = �(uaxb + yb)

Legislator a votes 'Yea' on bill b with probability

One-dimensional ideal point of legislator a

Polarity of bill b Popularity of bill b

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 7 of 29

Page 13: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

One-dimensional Ideal Point using Votes

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

p(va,b = Yea) = �(uaxb + yb)

Legislator a votes 'Yea' on bill b with probability

One-dimensional ideal point of legislator a

Polarity of bill b Popularity of bill b

LIBERAL CONSERVATIVE

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 7 of 29

Page 14: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

Multi-dimensional Ideal Point using Votes

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Legislator a votes 'Yea' on bill b with probability

p(va,b = Yea) = �

KX

k=1

ua,kxb,k + yb

!

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 8 of 29

Page 15: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

Multi-dimensional Ideal Point using Votes

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Legislator a votes 'Yea' on bill b with probability

p(va,b = Yea) = �

KX

k=1

ua,kxb,k + yb

!

Multi-dimensional ideal point of legislator a

K dimensions

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 8 of 29

Page 16: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

Multi-dimensional Ideal Point using Votes

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Legislator a votes 'Yea' on bill b with probability

p(va,b = Yea) = �

KX

k=1

ua,kxb,k + yb

!

Multi-dimensional ideal point of legislator a

K dimensions

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 8 of 29

Page 17: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

Multi-dimensional Ideal Point using Votes

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Legislator a votes 'Yea' on bill b with probability

p(va,b = Yea) = �

KX

k=1

ua,kxb,k + yb

!

Multi-dimensional ideal point of legislator a

K dimensions

??

?Dimensions are difficult to interpret

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 8 of 29

Page 18: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

Multi-dimensional Ideal Point using Votes & Text

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Bill Text

??

?Dimensions are difficult to interpret

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 9 of 29

Page 19: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

Multi-dimensional Ideal Point using Votes & Text

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Legislator a votes 'Yea' on bill b with probability

p(va,b = Yea) = �

xb

KX

k=1

ua,k#b,k + yb

!

??

?Dimensions are difficult to interpret

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 9 of 29

Page 20: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

Multi-dimensional Ideal Point using Votes & Text

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Legislator a votes 'Yea' on bill b with probability

p(va,b = Yea) = �

xb

KX

k=1

ua,k#b,k + yb

!

Multi-dimensional ideal point of legislator a

Topic proportion of bill b estimated from its text

??

?Dimensions are difficult to interpret

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 9 of 29

Page 21: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Ideal Point Review

Multi-dimensional Ideal Point using Votes & Text

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Legislator a votes 'Yea' on bill b with probability

p(va,b = Yea) = �

xb

KX

k=1

ua,k#b,k + yb

!

Multi-dimensional ideal point of legislator a

Topic proportion of bill b estimated from its text

obamacare, patient, doctor, insurance, affordable_care, hospital

balanc_budget, debt_ceil, cap, cut_spend, rais_tax debt_limit, nation_debt,

employ, hire, job_creator, union, nlrb, boe, labor, business_owner

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 9 of 29

Page 22: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Outline

1 Ideal Point Review

2 Hierarchical Ideal Point Topic Model

3 Predicting Membership

4 How They Vote

5 How They Talk

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 10 of 29

Page 23: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Intuition

What are your thoughts on the issue of immigration?

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 11 of 29

Page 24: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Overview

Using both votes and text to learn• Two-level topic hierarchy:

• Ideal points in multiple interpretable dimensions

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 12 of 29

Page 25: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Overview

Using both votes and text to learn• Two-level topic hierarchy:

◦ First-level nodes map to agenda issues◦ Second-level nodes map to issue-specific frames

• Ideal points in multiple interpretable dimensions

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3First-level

nodes ~

Agenda issues

Second-level nodes

~ Issue-specific

frames

TopicHierarchy

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 12 of 29

Page 26: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Overview

Using both votes and text to learn• Two-level topic hierarchy: Use existing labeled data to learn priors for

interpretable issues

• Ideal points in multiple interpretable dimensions

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

Use prior to learn interpretable issue topics

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 12 of 29

Page 27: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Overview

Using both votes and text to learn• Two-level topic hierarchy: Ideal points for frames for predictions using

text only

• Ideal points in multiple interpretable dimensions

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

Learn ideal point for each frame

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 12 of 29

Page 28: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Overview

Using both votes and text to learn• Two-level topic hierarchy:

• Ideal points in multiple interpretable dimensions

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

Multi-dimensional Ideal Points

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 12 of 29

Page 29: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Overview

Hierarchical Ideal Point Topic Model: Inputs

• A collection of votes {va ,b }• A collection of D speeches {w d }, each of which is given by legislator

ad

• A collection of B bill text {w ′b }

YEA

NAY

YEA

NAY

YEA

NAY

YEA

YEA

YEA

NAY

Votes

Bill text

Speeches

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 12 of 29

Page 30: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model

Modeling bill text

• Each bill text b is a mixture over K issues ϑb

• Each bill token generated from topic at first-level issue node

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

balanc_budget, debt_ceil, cap,

cut_spend, debt_limit, spend_cut, fiscal_hous,

grandchildren, guarante, default, august, obama,

deficit_spend, rein, feder_budget

white_hous, shut, continu_resolut,

mess, hous_republican,

novemb, govern_shutdown,

senat_reid, harri_reid, vision, shutdown, liber,

arriv, republican_parti,

blame

borrow, nation_debt,

rais_tax, entitl, prosper, chart, grandchildren, spend_monei,

size, gdp, tax_increas, cent,

govern_spend, social_secur

balanc_budget, borrow, debt_ceil, cap, cut_spend, nation_debt, grandchildren, rais_tax, entitl,

white_hous, debt_limit, prosper

Macroeconomics

Frame M1 Frame M2 Frame M3

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 13 of 29

Page 31: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Generative Process

• Each speech d also has a distribution θd over K issues

• Each issue k , each speech d has distribution over frames ψd ,k

• Each speech token from topic at second-level frame node

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

balanc_budget, debt_ceil, cap,

cut_spend, debt_limit, spend_cut, fiscal_hous,

grandchildren, guarante, default, august, obama,

deficit_spend, rein, feder_budget

white_hous, shut, continu_resolut,

mess, hous_republican,

novemb, govern_shutdown,

senat_reid, harri_reid, vision, shutdown, liber,

arriv, republican_parti,

blame

borrow, nation_debt,

rais_tax, entitl, prosper, chart, grandchildren, spend_monei,

size, gdp, tax_increas, cent,

govern_spend, social_secur

balanc_budget, borrow, debt_ceil, cap, cut_spend, nation_debt, grandchildren, rais_tax, entitl,

white_hous, debt_limit, prosper

Macroeconomics

Frame M1 Frame M2 Frame M3

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 13 of 29

Page 32: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Modeling votes

• Legislator a votes ‘Yea’ on bill b with probabilityp (va ,b = Yea) =Φ(xb

∑Kk=1ϑb ,k ua ,k + yb )

• Ideal point ua ,k ∼N (∑Jk

j=1ηk , jψa ,k , j ,ρ)

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

balanc_budget, debt_ceil, cap,

cut_spend, debt_limit, spend_cut, fiscal_hous,

grandchildren, guarante, default, august, obama,

deficit_spend, rein, feder_budget

white_hous, shut, continu_resolut,

mess, hous_republican,

novemb, govern_shutdown,

senat_reid, harri_reid, vision, shutdown, liber,

arriv, republican_parti,

blame

borrow, nation_debt,

rais_tax, entitl, prosper, chart, grandchildren, spend_monei,

size, gdp, tax_increas, cent,

govern_spend, social_secur

balanc_budget, borrow, debt_ceil, cap, cut_spend, nation_debt, grandchildren, rais_tax, entitl,

white_hous, debt_limit, prosper

Macroeconomics

Frame M1 Frame M2 Frame M3

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 13 of 29

Page 33: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Modeling votes

• Legislator a votes ‘Yea’ on bill b with probabilityp (va ,b = Yea) =Φ(xb

∑Kk=1ϑb ,k ua ,k + yb )

• Ideal point ua ,k ∼N (∑Jk

j=1ηk , jψa ,k , j ,ρ)

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

balanc_budget, debt_ceil, cap,

cut_spend, debt_limit, spend_cut, fiscal_hous,

grandchildren, guarante, default, august, obama,

deficit_spend, rein, feder_budget

white_hous, shut, continu_resolut,

mess, hous_republican,

novemb, govern_shutdown,

senat_reid, harri_reid, vision, shutdown, liber,

arriv, republican_parti,

blame

borrow, nation_debt,

rais_tax, entitl, prosper, chart, grandchildren, spend_monei,

size, gdp, tax_increas, cent,

govern_spend, social_secur

balanc_budget, borrow, debt_ceil, cap, cut_spend, nation_debt, grandchildren, rais_tax, entitl,

white_hous, debt_limit, prosper

Macroeconomics

Frame M1 Frame M2 Frame M3

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 13 of 29

Page 34: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Modeling votes

• Legislator a votes ‘Yea’ on bill b with probabilityp (va ,b = Yea) =Φ(xb

∑Kk=1ϑb ,k ua ,k + yb )

• Ideal point ua ,k ∼N (∑Jk

j=1ηk , jψa ,k , j ,ρ)

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

balanc_budget, debt_ceil, cap,

cut_spend, debt_limit, spend_cut, fiscal_hous,

grandchildren, guarante, default, august, obama,

deficit_spend, rein, feder_budget

white_hous, shut, continu_resolut,

mess, hous_republican,

novemb, govern_shutdown,

senat_reid, harri_reid, vision, shutdown, liber,

arriv, republican_parti,

blame

borrow, nation_debt,

rais_tax, entitl, prosper, chart, grandchildren, spend_monei,

size, gdp, tax_increas, cent,

govern_spend, social_secur

balanc_budget, borrow, debt_ceil, cap, cut_spend, nation_debt, grandchildren, rais_tax, entitl,

white_hous, debt_limit, prosper

Macroeconomics

Frame M1 Frame M2 Frame M3

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 13 of 29

Page 35: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Modeling votes

• Legislator a votes ‘Yea’ on bill b with probabilityp (va ,b = Yea) =Φ(xb

∑Kk=1ϑb ,k ua ,k + yb )

• Ideal point ua ,k ∼N (∑Jk

j=1ηk , jψa ,k , j ,ρ)

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

balanc_budget, debt_ceil, cap,

cut_spend, debt_limit, spend_cut, fiscal_hous,

grandchildren, guarante, default, august, obama,

deficit_spend, rein, feder_budget

white_hous, shut, continu_resolut,

mess, hous_republican,

novemb, govern_shutdown,

senat_reid, harri_reid, vision, shutdown, liber,

arriv, republican_parti,

blame

borrow, nation_debt,

rais_tax, entitl, prosper, chart, grandchildren, spend_monei,

size, gdp, tax_increas, cent,

govern_spend, social_secur

balanc_budget, borrow, debt_ceil, cap, cut_spend, nation_debt, grandchildren, rais_tax, entitl,

white_hous, debt_limit, prosper

Macroeconomics

Frame M1 Frame M2 Frame M3

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 13 of 29

Page 36: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model

Hierarchical Ideal Point Topic Model: Modeling votes

• Legislator a votes ‘Yea’ on bill b with probabilityp (va ,b = Yea) =Φ(xb

∑Kk=1ϑb ,k ua ,k + yb )

• Ideal point ua ,k ∼N (∑Jk

j=1ηk , jψa ,k , j ,ρ)

patient, doctor, physician, hospit, medicaid, board,

georgia, save_medicar, nurs, tennesse,

page, bureaucrat, advisori_board,

medicin, independ_payment

afford_care, exchang,

patient_protect, human_servic, public_health,

slush_fund, ppaca, mandatori,

mandatori_spend, governor, hospit, health_center,

flexibl, teach_health,

obamacar, replac, mandat, insur, health_insur,

coverag, social_secur,

premium, repeal_obamacar,

entitl, govern_takeov,

purchas, unconstitut, preexist_condit

obamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur,

coverag, medicaid, patient_protect, board

Health

Frame H1 Frame H2 Frame H3

balanc_budget, debt_ceil, cap,

cut_spend, debt_limit, spend_cut, fiscal_hous,

grandchildren, guarante, default, august, obama,

deficit_spend, rein, feder_budget

white_hous, shut, continu_resolut,

mess, hous_republican,

novemb, govern_shutdown,

senat_reid, harri_reid, vision, shutdown, liber,

arriv, republican_parti,

blame

borrow, nation_debt,

rais_tax, entitl, prosper, chart, grandchildren, spend_monei,

size, gdp, tax_increas, cent,

govern_spend, social_secur

balanc_budget, borrow, debt_ceil, cap, cut_spend, nation_debt, grandchildren, rais_tax, entitl,

white_hous, debt_limit, prosper

Macroeconomics

Frame M1 Frame M2 Frame M3

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 13 of 29

Page 37: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Nested Chinese Restaurant Process

• Start at a CRP, choose a table

• That table has not just a dish (distribution over words) but also businesscard

• That card tells you which restaurant to go to next

• You do this L times

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 14 of 29

Page 38: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Topic Hierarchies

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 15 of 29

Page 39: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Generative Process

φ1,1

Start at the root node

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 16 of 29

Page 40: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Generative Process

φ1,1

φ2,1 φ2,2 φ2,3

Need to choose which table to sit at

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 16 of 29

Page 41: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Generative Process

φ1,1

φ2,1 φ2,2 φ2,3 φ2*

This is a CRP! (Can create new table too.)

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 16 of 29

Page 42: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Generative Process

φ1,1

φ2,1 φ2,2

φ7,1 φ7,2 φ7,3

φ2,3 φ2*

Repeat

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 16 of 29

Page 43: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Generative Process

φ1,1

φ2,1 φ2,2

φ7,1 φ7,2 φ7,3

φ2,3 φ2*L

Your path then becomes the set of topics you use for this document

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 16 of 29

Page 44: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Generative Process

φ1,1

φ2,1 φ2,2

φ7,1 φ7,2 φ7,3

φ2,3 φ2*L

Warning: Probably don’t want to only use one path per document (butuseful explanation)

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 16 of 29

Page 45: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Evaluation: Tea Party in the House

The Tea Party• American political movement for freedom, small government, lower tax• Disrupting Republican Party and recent elections

Data• 240 Republican Representatives in the 112th U.S. House• 60 are members of the Tea Party Caucus (self-identified)• 60 key votes selected by Freedom Works (2011-2012)• Speeches, bill text and voting records from the Library of Congress

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 17 of 29

Page 46: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Hierarchical Ideal Point Topic Model

Evaluation: Tea Party in the House

The Tea Party• American political movement for freedom, small government, lower tax• Disrupting Republican Party and recent elections

Data• 240 Republican Representatives in the 112th U.S. House• 60 are members of the Tea Party Caucus (self-identified)• 60 key votes selected by Freedom Works (2011-2012)• Speeches, bill text and voting records from the Library of Congress

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 17 of 29

Page 47: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Outline

1 Ideal Point Review

2 Hierarchical Ideal Point Topic Model

3 Predicting Membership

4 How They Vote

5 How They Talk

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 18 of 29

Page 48: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction

Experiment setup• Task: Binary classification of whether a legislator is a member of the Tea Party

Caucus

• Evaluation metric: AUC-ROC

• Classifier: SVMl i g h t

• Five-fold stratified cross-validation

Features• Text-based features: normalized term frequency (TF) and TF-IDF

• Vote: binary features

• HIPTM: features extracted from our model including

◦ K -dim ideal point ua ,k estimated from both votes and text◦ K -dim ideal point estimated from text only ηT

k ψa ,k

◦ B probabilities estimating a ’s votes Φ(xb

∑Kk=1ϑb ,k ua ,k + yb )

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 19 of 29

Page 49: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction

Experiment setup• Task: Binary classification of whether a legislator is a member of the Tea Party

Caucus

• Evaluation metric: AUC-ROC

• Classifier: SVMl i g h t

• Five-fold stratified cross-validation

Features• Text-based features: normalized term frequency (TF) and TF-IDF

• Vote: binary features

• HIPTM: features extracted from our model including

◦ K -dim ideal point ua ,k estimated from both votes and text◦ K -dim ideal point estimated from text only ηT

k ψa ,k

◦ B probabilities estimating a ’s votes Φ(xb

∑Kk=1ϑb ,k ua ,k + yb )

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 19 of 29

Page 50: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction: Votes & Text

AUCROC

0.60

0.65

0.70

0.75

TF TFIDF Vote HIPTM Vote-TF Vote-TF-IDF Vote-HIPTM All

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 20 of 29

Page 51: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction: Votes & Text

AUCROC

0.60

0.65

0.70

0.75

TF TFIDF Vote HIPTM Vote-TF Vote-TF-IDF Vote-HIPTM All

Text-based Features

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 20 of 29

Page 52: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction: Votes & Text

AUCROC

0.60

0.65

0.70

0.75

TF TFIDF Vote HIPTM Vote-TF Vote-TF-IDF Vote-HIPTM All

Text-based Features

VoteFeatures

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 20 of 29

Page 53: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction: Votes & Text

AUCROC

0.60

0.65

0.70

0.75

TF TFIDF Vote HIPTM Vote-TF Vote-TF-IDF Vote-HIPTM All

Text-based Features

VoteFeatures

Our Features

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 20 of 29

Page 54: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction: Votes & Text

AUCROC

0.60

0.65

0.70

0.75

TF TFIDF Vote HIPTM Vote-TF Vote-TF-IDF Vote-HIPTM All

Text-based Features

VoteFeatures

Our Features

Combining Vote with TF/TF-IDF

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 20 of 29

Page 55: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction: Votes & Text

AUCROC

0.60

0.65

0.70

0.75

TF TFIDF Vote HIPTM Vote-TF Vote-TF-IDF Vote-HIPTM All

Text-based Features

VoteFeatures

Our Features

Combining Vote with TF/TF-IDF

Combining Vote with Our Features

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 20 of 29

Page 56: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction: Text Only

AUCROC

0.61

0.63

0.65

TF TF-IDF HIPTM

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 21 of 29

Page 57: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

Predicting Membership

Tea Party Caucus Membership Prediction: Text Only

AUCROC

0.61

0.63

0.65

TF TF-IDF HIPTM

Training: text onlyTest: text only

Training: text and votesTest: text only

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 21 of 29

Page 58: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Vote

Outline

1 Ideal Point Review

2 Hierarchical Ideal Point Topic Model

3 Predicting Membership

4 How They Vote

5 How They Talk

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 22 of 29

Page 59: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Vote

One-dimensional Ideal Points

Mario DiazBalart

Dave G. Reichert

Ileana RosLehtinen

Jon Runyan

Steve Stivers

Bob Dold

Rodney M. Alexander

Patrick Meehan

Peter T. King

Harold Rogers

Christopher H. Smith

Michael Grimm

Jim Gerlach

Ander Crenshaw

Judy Biggert

-1.6 -1.5 -1.4 -1.3 -1.2Ideal Point

Tea Party Caucus a aMember Nonmember

Trent Franks

Trey Gowdy

Tom McClintock

David Schweikert

Doug Lamborn

Scott Garrett

Joe Walsh

Raúl Labrador

Jim Jordan

Tom Graves

Paul C. Broun

Mick Mulvaney

Justin Amash

Jeff Flake

Jeff Duncan

1.4 1.6 1.8Ideal Point

Tea Party Caucus a aMember Nonmember

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 23 of 29

Page 60: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Vote

One-dimensional Ideal Points

Mario DiazBalart

Dave G. Reichert

Ileana RosLehtinen

Jon Runyan

Steve Stivers

Bob Dold

Rodney M. Alexander

Patrick Meehan

Peter T. King

Harold Rogers

Christopher H. Smith

Michael Grimm

Jim Gerlach

Ander Crenshaw

Judy Biggert

-1.6 -1.5 -1.4 -1.3 -1.2Ideal Point

Tea Party Caucus a aMember Nonmember

Trent Franks

Trey Gowdy

Tom McClintock

David Schweikert

Doug Lamborn

Scott Garrett

Joe Walsh

Raúl Labrador

Jim Jordan

Tom Graves

Paul C. Broun

Mick Mulvaney

Justin Amash

Jeff Flake

Jeff Duncan

1.4 1.6 1.8Ideal Point

Tea Party Caucus a aMember Nonmember

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 23 of 29

Page 61: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Vote

One-dimensional Ideal Points

Mario DiazBalart

Dave G. Reichert

Ileana RosLehtinen

Jon Runyan

Steve Stivers

Bob Dold

Rodney M. Alexander

Patrick Meehan

Peter T. King

Harold Rogers

Christopher H. Smith

Michael Grimm

Jim Gerlach

Ander Crenshaw

Judy Biggert

-1.6 -1.5 -1.4 -1.3 -1.2Ideal Point

Tea Party Caucus a aMember Nonmember

• Alexander and Crenshaw’s votesonly agree with Freedom Works 48%and 50% respectively

• Both voted for raising the debt ceilingand are listed as “traitor”

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 23 of 29

Page 62: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Vote

One-dimensional Ideal Points

• Flake and Amash didn’t self-identifyas members of the Tea PartyCaucus but have been endorsed byother Tea Party organizations

“Some 46 House members and six senatorshad been [Tea Party] . . . In addition, therewere about 18 other House members likeTrey Gowdy, Mark Meadows, and JustinAmash, and several senators, including JeffFlake and Pat Toomey, who owed theirelection to support from the Tea Party and itsWashington allies.” Trent Franks

Trey Gowdy

Tom McClintock

David Schweikert

Doug Lamborn

Scott Garrett

Joe Walsh

Raúl Labrador

Jim Jordan

Tom Graves

Paul C. Broun

Mick Mulvaney

Justin Amash

Jeff Flake

Jeff Duncan

1.4 1.6 1.8Ideal Point

Tea Party Caucus a aMember Nonmember

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 23 of 29

Page 63: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Vote

Multi-dimensional Ideal Points

l ll ll ll

l

l

l l

ll

l

Banking..Finance..and.Domestic.Commerce

Foreign.Trade

Labor..Employment..and.Immigration

Transportation

Macroeconomics

Government.Operations

6 3 0 3Ideal Points

Tea Party Caucus Member Nonmember

Freedom Works’ key votes on most highly polarized dimensions are aboutgovernment spending

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 24 of 29

Page 64: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Talk

Outline

1 Ideal Point Review

2 Hierarchical Ideal Point Topic Model

3 Predicting Membership

4 How They Vote

5 How They Talk

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 25 of 29

Page 65: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Talk

Framing Healthcare

Healthobamacar, patient, doctor, physician, afford_care, hospit, insur, replac, mandat, exchang, health_insur, coverag, medicaid, patient_protect, board

Frame H1afford_care, exchang, patient_protect, human_servic, public_health, slush_fund, ppaca, mandatori, mandatori_spend, governor, hospit, health_center, flexibl, teach_health, unlimit

Frame H2

patient, doctor, physician, hospit, medicaid, board, georgia, save_medicar, nurs, tennesse, page, bureaucrat, advisori_board, medicin, independ_payment

Frame H2

obamacar, replac, mandat, insur, health_insur, coverag, social_secur, premium, repeal_obamacar, entitl, govern_takeov, purchas, unconstitut, preexist_condit, employ

-.13 .04 .56

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 26 of 29

Page 66: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Talk

Framing Macroeconomics

Macroeconomicsbalanc_budget, borrow, debt_ceil, cap, cut_spend,

nation_debt, grandchildren, rais_tax, entitl

Frame M1

white_hous, shut, continu_resolut, mess,

hous_republican, novemb, govern_shutdown

Frame M2

balanc_budget, debt_ceil, cap, cut_spend, debt_limit,

spend_cut, fiscal_hous, grandchildren, guarante

Frame M2

borrow, nation_debt, rais_tax, entitl, prosper,

chart, grandchildren, spend_monei, size, gdp

-.57 -.24 .56

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 27 of 29

Page 67: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Talk

Polarization

Ideal PointDistributions

Dist

ribu

tion

of

Issu

e Fr

ames

PolarizedNot

Pola

rized

Not Civil Rights, Minority

Issues, Civil LibertiesBanking and Finance;

Transportation

Health; Public Lands and Water Management

Macroeconomics; Government Operations

YES NO

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 28 of 29

Page 68: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Talk

Polarization

Ideal PointDistributions

Dist

ribu

tion

of

Issu

e Fr

ames

PolarizedNot

Pola

rized

Not Civil Rights, Minority

Issues, Civil LibertiesBanking and Finance

Transportation

Health; Public Lands and Water Management

Macroeconomics; Government Operations

YES NO

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 28 of 29

Page 69: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Talk

Polarization

Ideal PointDistributions

Dist

ribu

tion

of

Issu

e Fr

ames

PolarizedNot

Pola

rized

Not Civil Rights, Minority

Issues, Civil LibertiesBanking and Finance

Transportation

Health; Public Landsand Water Management

Macroeconomics; Government Operations

YES NO

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 28 of 29

Page 70: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Talk

Polarization

Ideal PointDistributions

Dist

ribu

tion

of

Issu

e Fr

ames

PolarizedNot

Pola

rized

Not Civil Rights, Minority

Issues, Civil LibertiesBanking and Finance

Transportation

Health; Public Lands and Water Management

Macroeconomics; Government Operations

YES NO

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 28 of 29

Page 71: Supervised Topic Models - Computer Sciencejbg/teaching/CSCI_7000/06c.pdfSupervised Topic Models Advanced Machine Learning for NLP Jordan Boyd-Graber HIERARCHIES ... Overview Using

How They Talk

Hierarchies are Cool

• For a sweep of single parameter, BNP not that useful

• Complex structures are more realistic applications

• Combining with supervised objective

• Unsolved problem: good prediction with interpretable structure

Advanced Machine Learning for NLP | Boyd-Graber Supervised Topic Models | 29 of 29