equal opportunity in online classification with …...equal opportunity in online classification...

38
Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019 Katrina Ligett Hebrew University Aaron Roth University of Pennsylvania Bo Waggoner University of Colorado, Boulder Steven Wu University of Minnesota

Upload: others

Post on 26-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Equal Opportunity in Online Classification with Partial Feedback

Yahav Bechavod, Hebrew University

Simons Institute, July 2019

Katrina LigettHebrew University

Aaron RothUniversity of Pennsylvania

Bo WaggonerUniversity of

Colorado, Boulder

Steven WuUniversity of Minnesota

Page 2: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Online Classification with Full Feedback

Time 2

Page 3: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

3

Page 4: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Online Classification with Full Feedback

Time 4

Full Feedback

Page 5: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

The Majority of settings where fairness is a primary concern are Partial Feedback.

• Lending

• Hiring

• College Admissions

• Recidivism prediction

• Online advertising

• Predictive policing

• Medical treatments

5

Page 6: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Online Classification with Partial Feedback

Time 6

Partial Feedback

Page 7: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Standard techniques on gathered data may lead to feedback loops. Risk being highly unfair.

Decisions not only affect how accurate we are, but also the amount and type of data we collect.

7

Page 8: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

“The key point of our PredPol study is that virtually all predictive policing models use crime data from police departments. But police department data is not representative of all crimes committed; police aren’t notified about all crimes that happen, and they don’t document all crimes they respond to. Police-recorded crime data is a combination of policing strategy, police-community relations, and criminality. If a police department has a history of over-policing some communities over others, predictive policing will merely reproduce these patterns in subsequent predictions.”

– Kristian Lum, Human Rights Data Analysis Group

8

Page 9: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Problem Setting: Online Classification with One-Sided Feedback

9Time

Possible loan policies:Loan

policyor

Feedback

For :Learner selects policy .

Environment draws ; learner observes . Learner predicts .

If , learner observes .

Page 10: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

“How well did I do compared to best available policy?” day1 day2day3 day4 day5

Selected:

10

Loanpolicy

or

Feedback

Best:

Possible loan policies:

Problem Setting: Online Classification with One-Sided Feedback

Page 11: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Learner’s Goal – Minimize Regret

Optimal policy:

Learner’s (pseudo) regret:

11

Page 12: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

12

Talk Outline

1. Low regret with one-sided feedback.

2. What about fairness?

3. Fairness + one-sided feedback.

- Algorithm

- Lower bound

Page 13: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

13

Talk Outline

1. Low regret with one-sided feedback.

2. What about fairness?

3. Fairness + one-sided feedback.

- Algorithm

- Lower bound

Page 14: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Warmup Question: Can we guarantee low regret despite only having one-sided feedback?

14

Page 15: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

From One-Sided Feedback to Contextual Bandits

Approve Approve

DenyDeny

Repays RepaysDefaults Defaults

0

01

1 0

1

2

1

1

1

15

Add to second column

Page 16: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

16

From One-Sided Feedback to Contextual Bandits

Time

Possible loan policies:Loan

policyor

Feedback

Approve

Deny

Repays Defaults

0

1

2

1

Feedback

Contextual Bandits

2 actions: Approve, DenyContexts: IndividualsPolicy class: Mappings from contexts to actions

Page 17: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

From One-Sided Feedback to Contextual Bandits

Loss matrix transformation is Regret-Preserving.

Time

Approve

Deny

Repays Defaults

0

1

2

1

17

Page 18: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Conclusion: Given a contextual bandit algorithm that guarantees w.h.p., we can translate it to an One-Sided Feedback algorithm that guarantees w.h.p.

From One-Sided Feedback to Contextual Bandits

18

Page 19: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

19

Talk Outline

1. Low regret with one-sided feedback.

2. What about fairness?

3. Fairness + one-sided feedback.

- Algorithm

- Lower bound

Page 20: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Online Learning setting with Partial Feedback

20Time

Possible loan policies:Loan

policyor

Feedback

For :Learner selects policy .

Environment draws ; learner observes . Learner predicts .

If , learner observes .

Fairness?

Page 21: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Fairness

Definition. False positive rate:

Definition. We say an algorithm is -fair if:

Question: Is the policy we deploy at every round fair?

21

Randomization. We allow deploying .

Page 22: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Example

22

Optimal -fair policy is always of support size at most 2.

Page 23: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Partial Feedback + Fairness

Question: What is the tradeoff between fairness and regret in the partial feedback setting?

More specifically: Regret guarantee if algorithm has to be -fair on every round?

23

Page 24: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

24

Talk Outline

1. Low regret with one-sided feedback.

2. What about fairness?

3. Fairness + one-sided feedback.

- Algorithm

- Lower bound

Page 25: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

25

Main Result: Oracle-efficient fair online learning algorithm

There exists an algorithm that runs in polynomial time given access to optimization oracle over and guarantees:

1. Fairness: -fair on every round.2. Regret: to best -fair policy in .

Page 26: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Oracle-efficient algorithm

Agarwal et al. 2014 – “Mini-Monster”Oracle-efficient algorithmHigh probability guarantees for contextual bandits

26

Page 27: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Algorithm

Time

27

1. Label first arrivals as , observe labels.

2. Instantiate mini-monster with policy class:

3. Label remaining arrivals according to the instantiated algorithm. Use loss matrix transformation for feedback.

Mini-Monster

Oracle

Page 28: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Cost-Sensitive Classification (CSC) Oracle

Step 1: CSC Oracle

Given: Compute:

28

Equivalent to weighted binary classification

Page 29: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

CSC Oracle -> Fair CSC Oracle

Step 2: Fair CSC Oracle

Theorem: Let . There exists an oracle-efficient algorithm that calls at most times, and outputs a -fair such that:

29

Adapted from Agarwal et al. 2018

Page 30: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Algorithm

Time

30

1. Label first arrivals as , observe labels.

2. Instantiate mini-monster with policy class:

3. Label remaining arrivals according to the instantiated algorithm. Use loss matrix transformation for feedback.

Mini-Monster

FairCSC(H)

Page 31: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

31

In the paper

1. Adapting the CSC(H)->Fair CSC(H) construction to the case where the fairness constraint is only defined on a subset of the points considered in the cost objective.

2. Regret analysis for a fair version of Mini-Monster, also taking into account additional approximation error induced by fairness constraints.

Page 32: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

32

Talk Outline

1. Low regret with one-sided feedback.

2. What about fairness?

3. Fairness + one-sided feedback.

- Algorithm

- Lower bound

Page 33: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Lower bound

33

Claim (simplified): There exists a hypothesis class such that any algorithm that is -fair must have expected regret

.

Page 34: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

34

Proof Idea

1. Two similar distributions .

2. Until it is able to distinguish , algorithm has to actconservatively, otherwise risks being unfair.

3. Acting conservatively in the first rounds forces high regret on each of these rounds.

Page 35: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Proof Idea

Two similar distributions:

D1 D2

Hypothesis class:

35

Page 36: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

D1D2

Proof Idea

36Linear regret for rounds.

Alg. Is -fair:

For rounds:

Page 37: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

37

Open problems

1. Both equal FP, FN constraints.

2. Other definitions of fairness in the one-sided feedback setting.

Page 38: Equal Opportunity in Online Classification with …...Equal Opportunity in Online Classification with Partial Feedback Yahav Bechavod, Hebrew University Simons Institute, July 2019

Yahav Bechavod, Hebrew University

[email protected]

Katrina LigettHebrew University

Aaron RothUniversity of Pennsylvania

Bo WaggonerUniversity of

Colorado, Boulder

Steven WuUniversity of Minnesota

Thank you!