david stern ralf herbrich thore graepel microsoft research cambridge, uk horst samulowitz national...

16
David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia Luca Pulina Armando Tacchella Universita di Genova Genova, Italy Collaborative Expert Portfolio Management

Upload: esmond-joseph

Post on 12-Jan-2016

216 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

David SternRalf Herbrich

Thore Graepel

Microsoft ResearchCambridge, UK

Horst Samulowitz

National ICT AustraliaUniversity of Melbourne

Melbourne, Australia

Luca PulinaArmando Tacchella

Universita di GenovaGenova, Italy

Collaborative Expert Portfolio Management

Page 2: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

Expert Portfolios

Stream of Problems Solve Problem

using recommended Expert

Update Model

Expert Portfolio

Experts: Expert 1 Expert 2 ... Expert n

Submit Problem Characterization

(e.g., Feature Vector)

Recommend Expert

Query Model

Report UtilityExpert changes

Applications:

e.g., SATZilla[Xu et al., 07]

e.g., AQME [Pulina et al., 08],

CPHydra [O’Mahony et al., 08]

Page 3: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

Adaptive Expert Portfolios • Requirements:

- Model must be trained online so it can immediately take account of each outcome to improve future decisions.

- Computation cost should not depend on the number of previously seen problems [Pulina, 2008].

- The system should select a specific scheduling strategy for each task (based on task features) [Streeter and Smith, 2008].

- Model should adapt continuously over time, tracking domain and changing expert characteristics.

- Support different forms of feedback (to support different problem domains)

Cannot be addressed by previously presented approach

Model based on Collaborative Filtering fulfills all requirements.

Page 4: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

4

Map Features To ‘Trait’ Space234566

456457

13456

654777

User ID

Male

FemaleGender

CountryUK

USA

34

345

64

5474

Item ID

Horror

Movie Genre

Drama

Documentary

Comedy

Page 5: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

5

Learning Feature Contributions234566

456457

13456

654777

User ID

Male

FemaleGender

CountryUK

USA

34

345

64

5474

Item ID

Horror

Movie Genre

Drama

Documentary

Comedy

Page 6: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

User/Item Trait Space

-1.5 -1 -0.5 0 0.5 1 1.5

-1.5

-1

-0.5

0

0.5

1

1.5

UsersMoviesA Cinderella Story

AI: Artificial Intelligence

24: Season 3Adaptation

A Clockwork Orange

A Knights Tale

24: Season 2

‘Preference Cone’ for user 145035

Page 7: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

TaskFeatures

AlgorithmFeatures

FeedbackModel

P(t)

Time to complete task(or other objective)

u(t) E(u)

Utility Function

u

t

Adaptive Algorithm Expert Portfolios

P(r)

Trait Space

InnerProduct

AlgorithmPerformance

U

V

Page 8: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

8

Test Data

• QBF Solvers Competition Data– 11 State-of-the-art solvers.– Run times (600 sec time-out).– 5000 tasks.

• Microsoft Solver Foundation Performance Data– Linear Programming Daily test runs.– 6 Simplex Solvers.– 7 Interior Point Method (IPM) Solvers.– Run times.

Page 9: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

9

Task Features Allow Generalisation

• QBF Features

– 103 Basic Features: #Clauses, #Variables, etc. 69 – Combined Features: Ratio Universal/Existential, ...

• LP Model Features

– Number Variables.– Number Rows.– Number Zeros.

• Goal: to predict solver performance on unseen tasks

Page 10: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

Threshold Feedback Model

a b

> <

r

q

Time-Out Slow Fast

Page 11: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

1 Formulae

Solvers

QuBE 6.1

2clsQ

Nenfex

QMRes

Quantor 3

QuBE 3.0

sKizzo

ssolve A

ssolve B

yQuaffle

ssolve C

QuBE 6.1

2clsQ

Nenfex

QMRes

Quantor 3

QuBE 3.0

sKizzo

ssolve A

ssolve B

yQuaffle

ssolve C

QuBE 6.1

2clsQ

Nenfex

QMRes

Quantor 3

QuBE 3.0

sKizzo

ssolve A

ssolve B

yQuaffle

ssolve C

QuBE 6.1

2clsQ

Nenfex

QMRes

Quantor 3

QuBE 3.0

sKizzo

ssolve A

ssolve B

yQuaffle

ssolve C

QBF Time Trait Space

Properties

Page 12: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

User-Defined Algorithm Utility

Example:

Page 13: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

Basic Combo All1200

1300

1400

1500

1600

1700

1800

1900

2000Total Utility

K=1K=2K=3QeBE6.1

QBF Portfolio Performance

Basic Combo All1700

1750

1800

1850

1900

1950

2000

2050

2100

2150Number Solved

K=1K=2K=3QeBE6.1

Features Features

Page 14: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

14

Comparison to other Approachesfor QBF

Approach Problems Solved

Average Time used per problem

(in seconds)

AQME [Pulina, Tacchella, 2009](Adaptive Portfolio that retrains offline + other limitations)

2155 18.0

Collaborative Expert Portfolio Manager 2169 16.6

Oracle 2240 12.8

Page 15: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

Inte

rio

r P

oin

t M

eth

od

Sim

ple

x M

eth

od

Du

al

Pri

mal

Page 16: David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia

16

Conclusions

– Presented adaptive portfolio manager based on ‘Collaborative Filtering’

– Approach supports:• Online adaption of portfolio at a negligible cost• Tracking of domain as well as expert changes• User-Defined feedback model

– Can be applied in other domains as well:• e.g., Yahoo Question-Answer