[cars2012@recsys]optimal feature selection for context-aware recommendation using differential...

20
Yong Zheng Robin Burke Bamshad Mobasher Proceedings of the 4th International Workshop on Context-Aware Recommender Systems, RecSys 2012, Dublin, Ireland; 09/09/2012 Optimal Feature Selection for Context-aware Recommendation using Differential Relaxation Style: Jazz

Upload: yong-zheng

Post on 11-May-2015

600 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

Yong ZhengRobin BurkeBamshad Mobasher

Proceedings of the 4th International Workshop on Context-Aware Recommender Systems, RecSys 2012, Dublin, Ireland; 09/09/2012

Optimal Feature Selection for Context-aware Recommendation using Differential Relaxation

Style: Jazz

Page 2: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

CONTEXT-AWARE RECOMMENDER SYSTEM (CARS)

1

R: Users × Items × Contexts Ratings

Assumptions:

1. Contexts – Characterize the situation/condition users like the items;

2. Even the same user, may have different preferences for the same item BUT under different contexts;

Page 3: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

RESEARCH IN CARS

2

Detecting the useful and relevant features -- Q1.which should be used? contexts only or other features?

Which contextual variables are influential ones? -- Q2.which should be used? feature selection!

Incorporating contextual information into recommendation process -- Q3.how to use contexts?

Our proposed approach: differential context relaxation (DCR)

First proposed in EC-WEB 2012:“Differential Context Relaxation for Context-aware Travel Recommendation”

Page 4: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

DCR —— “RELAXATION”

User-based collaborative filtering: Predict (user, item, contexts)

Neighbor selection select neighbors who rated the item under the same “contexts”; Use the exactly full contexts? —— may be very few or even no matches

At CinemaWeekend

With Girlfriend

Contexts = [Cinema, Weekend, Girlfriend]

Black areas: matched users.

Solution: a set of relaxed dimensionsSuch as [Cinema, Girlfriend]

Optimal feature selection:balance between accuracy & coverage

3

Take seeing a movie for example:

Introducing contexts into recommendation? Sparsity Problem!!

Page 5: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

DCR —— “DIFFERENTIAL”User-based collaborative filtering: Predict (user, item, contexts)

Differential aspect: Decompose algorithms into functional componentsand apply appropriate different aspect of contexts to each component!

Goal: to maximize the functional contribution of each component in the prediction function

Neighbor Selection Neighbor contribution

User baseline 4

Page 6: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

DCR MODEL – A GENERAL MODEL

Apply it to user-based collaborative filtering: Predict (user, item, contexts)

Choose appropriate relaxations for each algorithm component (feature selection) as contextual constraints, and then perform regular recommendation.

C = Full contextual situationsC1, C2, C3 = relaxed context dimensions

Ci can be modeled as a binary selection vector.<1, 0, 1> denotes we select the 1st and 3rd contextual dimension for Ci 5

Page 7: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

DCR MODEL

Q2. Which contextual variables should be used?– Optimal feature selection in shape of context relaxations

Q3. How to use contexts? – Apply optimal constraints to each component, differentially

Remaining Question:Q1.Which variables are relevant/useful/should be used?

6

Page 8: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

Q1.WHICH VARIABLES ARE RELEVANT?

: influential features linked to contextsWhich kinds of users Contexts Which kinds of items

Jim

Nadia

Romantic Movie

Action Movie

Comedy Movie

Alone

Alone

User’s preferences on “Genre” are linked to the context “Companion”7

Page 9: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

DCR MODEL — OPTIMIZATION

How to find optimal feature selection for each algorithm component?Recall that the selection is modeled by binary vectors.

Search Space Reduction [Contexts + Context-linked Features]

Neighbor Selection(No item features)

Neighbor contribution(No user profiles)

User baseline(No user profiles) 8

Page 10: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

DCR MODEL — OPTIMIZATION

Two approaches to find the optimal context relaxations:

1. Exhaustive Search

Try all combinations of binary vectorsAssume there are two dimensions, then it could be 4 possibilities for eachcomponent: <0, 0>; <0, 1>; <1, 0>; <1, 1>

Not efficient, because it increases computational costs significantly!

More practical and efficient optimization requires for:1).Larger dataset;2).Several more contextual dimensions;

Other optimization techniques, such as Hill climbing and Gradient descent may not work well.

9

Page 11: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

DCR MODEL — OPTIMIZATION

2. Binary Particle Swarm Optimization (Binary PSO)

PSO is derived from swarm intelligence. Binary PSO is a discrete version of PSO. Let’ see how PSO works.

Fish Birds Bees 10

Page 12: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

DCR MODEL — OPTIMIZATION

2. Binary Particle Swarm Optimization (Binary PSO)

Example: Birds are looking for the pizza

Swarm = a group of birdsParticle = each birdGoal = the location of pizza

So, how to find goal by swam?1.Each bird is looking for the pizza

A machine can tell the distance to pizza2.Each iteration is an attempt or move3.Cognitive learning from particle itself

Am I closer to the pizza comparing with my “best ”locations in previous history?

4.Social Learning from the swarmHey, my distance is 1 mile.It is the closest ever! Follow me!!

The moving direction is a hybrid function of cognitive and social learning!

11

Page 13: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

DCR MODEL — OPTIMIZATION

2. Binary Particle Swarm Optimization (Binary PSO)

Birds Example DCR ModelSwarm a group of birds a group of objects or agentsParticle each bird each object or agent

Goal location of pizza minimal prediction error (RMSE)Location bird's position vector the binary selection vectorLearning adjust each bit of position vector adjust each bit of the binary vector

Binary PSO is a discrete version, where the bit value in position vectoris binary value instead of real number – switching between 0 and 1.

Disadvantages: 1). Converge slowly; 2). Local optimum

There are several improvements on PSO, but few on Binary PSO.We use an improved Binary PSO introduced by Mojtaba et al,It is demonstrated to be able to converge quickly.More details about it, please refer to our paper.

12

Page 14: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

EXPERIMENTS

Dataset: AIST Context-aware Food Preference Data (thanks to Hideki Asoh!)

Contextual dimensions:1).Contexts: real hunger, virtual hunger (hungry/normal/full)2).Possible Context-linked features

User Profile: genderItem feature:

food genre (Chinese/Japan/Western)food stuff (vegetable, pork, beef, fish, etc)food style = the style of food preparation

This is a dataset with dense context information:212 users, 6,360 ratings;Each user rated 5 out of 20 items;Once two users rated one same item, they rated it in 6 same situations!

We run exhaustive search – to get performance baseline;Then we run improved BPSO – to see whether it can help find optimum! 13

Page 15: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

EXPERIMENT DESIGN

Comparison:1).ModelsStandard user-based CF vs. Contextual Pre-filtering vs. DCR Model

2).Contextual dimensionsContexts (CO) vs. Context-linked feature (CL) vs. Hybrid of CO+CL

14

Page 16: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

EXPERIMENTAL RESULTS BY EXHAUSTIVE SEARCH

Experimental Results

15

Page 17: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

EXPERIMENTAL RESULTS BY EXHAUSTIVE SEARCH

1.Best relaxation2.Effects of contexts3.Effects of context-linked features

16

Page 18: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

EXPERIMENTAL RESULTS BY BINARY PSO

1.More particles, more efficient (less iterations); but it requires a balance.2.Data set is larger, may be more complicated – more particles are required.

Exhaustive search requires 8,192 iterations;1-BPSO found optimum at 18th iteration; 5-BPSO founds it at 12th iteration.

17

Page 19: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

LIMITATION AND FUTURE RESEARCH

1. The 4th component – introduce contexts to user-user similarity?2. Optimal model selection – multi-objective function (RMSE, coverage, etc)3. Optimal feature weighting other than feature selection4. Contextual dimensions do NOT match – may also share similarities5. Integrate DCR model with latent factor models, such as MF, etc6. Expand DCR to more recommendation algorithms

Solutions may help alleviate sparsity problem: #3, #4, #5

18

Limitation of DCR model: sparse contexts!!

Page 20: [CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation using Differential Relaxation

Proceedings of the 4th International Workshop on Context-Aware Recommender Systems, RecSys 2012, Dublin, Ireland; 09/09/2012

Thanks!

Style: Jazz