the effect of dimensionality reduction in recommendation systems juntae kim department of computer...

24
The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Upload: alexia-burns

Post on 04-Jan-2016

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

The Effect of Dimensionality Reduction in Recommendation Systems

Juntae Kim

Department of Computer EngineeringDongguk University

Page 2: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Contents Introduction

Collaborative Recommendation

Data Sparseness Problem

Dimensionality Reduction by using SVD

An Example

Experiments

Conclusion

Page 3: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Introduction e-CRM

Provides personalized service Enhance sales by

Product recommendation, target advertisement, etc.

Recommendation System

Demographic features

Item features

Sales history

Purchase historyCustomer

Recommend items

Page 4: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Introduction Use item-to-item similarity – content-based

Use item-to-item similarity – association

A

C

B

like

similarcontents

Recommend

A

C

B

like

highcorrelation

Recommend

Page 5: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Introduction Use people-to-people similarity – demographic

Use people-to-people similarity – collaborative

A

C

Bsimilarfeature

like

Recommend

A

C

B

A B

highcorrelation

like

like

Recommend

Page 6: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Collaborative Method Advantages

No needs of contents analysis Items that are difficult to analyze contents can be

recommended Ex> Movie, music, …

No needs of user information High precision

Method1. Find out similar users

2. Predict preferences based on similar users preferences

Page 7: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Collaborative Method Computing similarity

Pearson correlation coefficient ( [-1, 1] )

: Rating of user a to item i

Example User a: (1, 8, 9) (-5, +2, +3) User b: (2, 9, 7) (-4, +3, +1) User a is similar to b User c: (9, 3, 3) (+4, -2, -2)

Page 8: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Collaborative Method Prediction of preferences

Weighted sum of similar users’ preferences

: 사용자 a 와 u 의 유사도

Example Average rating of user a: 5 Preferences of user a User b: (2, 8, 8), wa,b = 0.5 = (5, 5, 5) + (-4, 2, 2)*0.5

User c: (4, 4, 7), wa,c = 0.1 + (-1, -1, 2)*0.1

= (2.9, 5.9, 6.2)

Page 9: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Data Sparseness Problem Example data

10000

01000

11000

00001

00110

01101

6

5

4

3

2

1.

user

user

user

user

user

userScreamHolloweenPocahontasKingLionCoMonster

A

Page 10: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Data Sparseness Problem Explicit ratings are not usually available

Available data purchase, click, etc.

0 or 1 Computing correlation is not appropriate

(no negative preference information)

use cosine similarity

ua

uaua rr

rrw

,

Page 11: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Data Sparseness Problem Available data are usually very sparse

Buy 2~3 items among thousands of items Cosine similarity can not be computed

Reduce dimension

10000

01000

11000

00001

00110

01101

A

?

A

Page 12: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Dimensionality Reduction Using category information

Represent user preference vector with item categories Monster Co., Lion King, Pocahontas animation Holloween, Scream horror

10

10

10

01

01

11

A

10000

01000

11000

00001

00110

01101

A

Page 13: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Dimensionality Reduction Singular Value Decomposition (SVD)

Decompose the user-item matrix Amn

Amn = Umm Smn (Vnn)T

S : Diagonal matrix that contains the singular values of A in descending order

U, V : Orthogonal matrices

Rotating the axes of the n-dimensional space 1st axis runs along the direction of largest variation

Page 14: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Dimensionality Reduction SVD example

22.058.033.041.012.0

41.058.012.022.033.0

19.000.020.063.045.0

63.058.045.019.020.0

29.000.075.053.028.0

53.000.028.029.075.0

U

39.000.000.000.000.0

00.000.100.000.000.0

00.000.028.100.000.0

00.000.000.059.100.0

00.000.000.000.016.2

S

09.058.041.065.026.0

16.058.015.035.070.0

61.000.037.051.048.0

73.000.059.033.013.0

25.058.057.030.044.0

TV

Page 15: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Dimensionality Reduction Approximation of A

Select largest k singular values

A’mn = Umk Skk (Vnk)T

Computing user similarity AAT = USVT(USVT)T

= USVTVSTUT

= (US)(US)T

Projection of A into k dimensionA’mn Vnk = Umk Skk

Page 16: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

An Example User-item matrix

10000

01000

11000

00001

00110

01101

6

5

4

3

2

1.

user

user

user

user

user

userScreamHolloweenPocahontasKingLionCoMonster

A

Page 17: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

An Example Reduction, k = 2

65.026.0

35.071.0

00.197.0

30.004.0

84.060.0

46.062.0

2USVA

10000

01000

11000

00001

00110

01101

A

Page 18: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

An Example User-user similarity

00.1

74.000.1

93.094.000.1

87.032.062.000.1

54.016.018.088.000.1

10.074.047.040.078.000.1

))(( TUSUS

Page 19: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

An Example User vectors in 2-D space

u6

u4

u5

u3

u2

u1

Page 20: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Experiments Dataset – MovieLens

943 users, 1628 movies, 1~5 rating, 6.4% rated Change ratings to 0/1 3.6% rated

Experiments Compare performance of plain collaborative(CF) and reduce

d dimension(SVD) recommendation CF: 60 neighbor SVD: rank 20

Change sparseness to 2.0%, 1.0%, 0.5%

Page 21: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Experiments Metric

Hit ratio Remove 1 rating from each user test data Recommend 10 items for each user If the test data is in the recommended item hit

Total # of hit

Total # of test data

Result Sparseness 3.6% SVD improves hit ratio by x % Sparseness 0.5% SVD improves hit ratio by x %

Hit ratio =

Page 22: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Experiments Results

0

0.05

0.1

0.15

0.2

0.25

3.6% 2.0% 1.0% 0.5% 0.1%

Sparseness

recall Avg

CF 60NN

SVD Rank20

Page 23: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

Conclusion Solve data sparseness problem

Reduce dimension – heuristics Reduce dimension – SVD

Experimental results SVD shows more performance improvement in sparser data

Future research Statistical analysis Combined methods

Page 24: The Effect of Dimensionality Reduction in Recommendation Systems Juntae Kim Department of Computer Engineering Dongguk University

References Basu, C, Hirsh, H., Cohen, W., “Recommeder Systems. Recommedation As Classification: Using Social And C

onent-Based Information,” Proceedings of the Workshop on Recommendation system. AAAI Press, Menlo Park California, 1998.

Billsus, D., Pazzani, M. j., “Learning Collaborative Information Filters,” Proceedings of workshop on recommender system, 1998.

Berry, M. W., Dumais, S. T., and O’Brain, G. W. “Using Linear Algebra for Intelligent Information Retrieval,” SIAM Review, 37(4), pp. 573-595, 1995.

Breese, J. S., Heckerman, D., and Kadie, C., “Empirical Analysis of Predictive Algorithm for Collaborative Filtering,”Proceeding of the Fourteenth Conference UAI, July 1998.

Goldberg, k., Roeder, T., Gupta, D., and Perkins, C., “Eigentaste: A Constant Time Collaborative Filtering Algorithm,” Technical Report M00/41. Electronics Research Laborotary, University of California, Berkeley, 2000.

Herlocker, J., Konstan, J., Borchers, A., Riedl, J., “An Algorithmic Framework for Performing Collaborative Filtering,”Proceedings of the 1999 Conference on Research and Development in Information Retrieval, Aug. 1999.

Sarwar, B. M. “Sparsity, Scalability, and Distribution in Recommender Systems,” Ph.D. Thesis, Computer Science Dept., University of Minnesota, 2001.

Sarwar, B. M., Karypis, G., Konstan, J. A., and Riedl, J., “Application of Dimensionality Reduction in Recommender System-A Case Study,”WebKDD   00-Web-mining for E-Commerce Workshop, 2000.

Schafer, J. B., Konstan, J., and Riedl, J., “Recommender Systems in E-Commerce ,” Proceedings of the ACM Conference on Electronic Commerce, November 1999.

Shardanand, U., "Social information filtering for music recommendation," Technical Report MA95, MIT Media Laboratory, 1995.