david stern ralf herbrich thore graepel microsoft research cambridge, uk

16
David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia Luca Pulina Armando Tacchella Universita di Genova Genova, Italy Collaborative Expert Portfolio Management

Upload: lana

Post on 24-Feb-2016

61 views

Category:

Documents


0 download

DESCRIPTION

Collaborative Expert Portfolio Management. Luca Pulina Armando Tacchella Universita di Genova Genova , Italy. David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK. Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

David SternRalf Herbrich

Thore Graepel

Microsoft ResearchCambridge, UK

Horst Samulowitz

National ICT AustraliaUniversity of Melbourne

Melbourne, Australia

Luca PulinaArmando Tacchella

Universita di GenovaGenova, Italy

Collaborative Expert Portfolio Management

Page 2: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

Expert PortfoliosStream of Problems Solve Problem

using recommended Expert

Update Model

Expert Portfolio Experts: Expert 1 Expert 2 ... Expert n

Submit Problem Characterization

(e.g., Feature Vector)

Recommend Expert

Query Model

Report UtilityExpert changes

Applications:

e.g., SATZilla[Xu et al., 07]

e.g., AQME [Pulina et al., 08], CPHydra [O’Mahony et al., 08]

Page 3: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

Adaptive Expert Portfolios • Requirements:

- Model must be trained online so it can immediately take account of each outcome to improve future decisions.

- Computation cost should not depend on the number of previously seen problems [Pulina, 2008].

- The system should select a specific scheduling strategy for each task (based on task features) [Streeter and Smith, 2008].

- Model should adapt continuously over time, tracking domain and changing expert characteristics.

- Support different forms of feedback (to support different problem domains)

Cannot be addressed by previously presented approach

Model based on Collaborative Filtering fulfills all requirements.

Page 4: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

4

Map Features To ‘Trait’ Space234566

456457

13456

654777

User ID

Male

FemaleGender

CountryUK

USA

34

345

64

5474

Item ID

Horror

Movie Genre

Drama

Documentary

Comedy

Page 5: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

5

Learning Feature Contributions234566

456457

13456

654777

User ID

Male

FemaleGender

CountryUK

USA

34

345

64

5474

Item ID

Horror

Movie Genre

Drama

Documentary

Comedy

Page 6: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

User/Item Trait Space

-1.5 -1 -0.5 0 0.5 1 1.5

-1.5

-1

-0.5

0

0.5

1

1.5

UsersMoviesA Cinderella Story

AI: Artificial Intelligence

24: Season 3Adaptation

A Clockwork Orange

A Knights Tale

24: Season 2

‘Preference Cone’ for user 145035

Page 7: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

TaskFeatures

AlgorithmFeatures

FeedbackModel P(t)

Time to complete task(or other objective)

u(t) E(u)

Utility Function

u

t

Adaptive Algorithm Expert Portfolios

P(r)

Trait Space

InnerProduct

AlgorithmPerformance

U

V

Page 8: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

8

Test Data

• QBF Solvers Competition Data– 11 State-of-the-art solvers.– Run times (600 sec time-out).– 5000 tasks.

• Microsoft Solver Foundation Performance Data– Linear Programming Daily test runs.– 6 Simplex Solvers.– 7 Interior Point Method (IPM) Solvers.– Run times.

Page 9: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

9

Task Features Allow Generalisation

• QBF Features

– 103 Basic Features: #Clauses, #Variables, etc. 69 – Combined Features: Ratio Universal/Existential, ...

• LP Model Features

– Number Variables.– Number Rows.– Number Zeros.

• Goal: to predict solver performance on unseen tasks

Page 10: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

Threshold Feedback Model

a b

> <

r

q

Time-Out Slow Fast

Page 11: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

1 Formulae

Solvers

QuBE 6.12clsQ

Nenfex

QMRes

Quantor 3

QuBE 3.0

sKizzo

ssolve A

ssolve B

yQuaffle

ssolve C

QuBE 6.12clsQ

Nenfex

QMRes

Quantor 3

QuBE 3.0

sKizzo

ssolve A

ssolve B

yQuaffle

ssolve C

QuBE 6.12clsQ

Nenfex

QMRes

Quantor 3

QuBE 3.0

sKizzo

ssolve A

ssolve B

yQuaffle

ssolve C

QuBE 6.12clsQ

Nenfex

QMRes

Quantor 3

QuBE 3.0

sKizzo

ssolve A

ssolve B

yQuaffle

ssolve C

QBF Time Trait Space

Properties

Page 12: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

User-Defined Algorithm Utility

Example:

Page 13: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

Basic Combo All1200

1300

1400

1500

1600

1700

1800

1900

2000Total Utility

K=1K=2K=3QeBE6.1

QBF Portfolio Performance

Basic Combo All1700

1750

1800

1850

1900

1950

2000

2050

2100

2150Number Solved

K=1K=2K=3QeBE6.1

Features Features

Page 14: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

14

Comparison to other Approachesfor QBF

Approach Problems Solved

Average Time used per problem

(in seconds)

AQME [Pulina, Tacchella, 2009](Adaptive Portfolio that retrains offline + other limitations) 2155 18.0

Collaborative Expert Portfolio Manager 2169 16.6

Oracle 2240 12.8

Page 15: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

Inte

rior P

oint

Met

hod

Sim

plex

Met

hod

Dua

l

Prim

al

Page 16: David Stern Ralf  Herbrich Thore Graepel Microsoft Research Cambridge, UK

16

Conclusions

– Presented adaptive portfolio manager based on ‘Collaborative Filtering’

– Approach supports:• Online adaption of portfolio at a negligible cost• Tracking of domain as well as expert changes• User-Defined feedback model

– Can be applied in other domains as well:• e.g., Yahoo Question-Answer