+ All Categories
Home > Documents > On the Use of Lanczos Vectors for Efficient Latent Factor...

On the Use of Lanczos Vectors for Efficient Latent Factor...

Date post: 14-Jun-2020
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
23
Introduction & Motivation Our Contribution Experimental Evaluation Conclusions On the Use of Lanczos Vectors for Efficient Latent Factor-Based Top-N Recommendation Athanasios N. Nikolakopoulos Maria Kalantzi John D. Garofalakis Computer Engineering and Informatics Department, University of Patras Computer Technology Institute & Press “Diophantus” 4th WIMS Conference, 2014 A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender
Transcript
Page 1: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

On the Use of Lanczos Vectors for Efficient LatentFactor-Based Top-N Recommendation

Athanasios N. Nikolakopoulos Maria KalantziJohn D. Garofalakis

Computer Engineering and Informatics Department,University of Patras

Computer Technology Institute & Press “Diophantus”

4th WIMS Conference, 2014

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 2: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Outline

1 Introduction & MotivationRecommender Systems - Collaborative FilteringChallenges of Modern CF AlgorithmsLatent Factor and Graph-Based models

2 Our ContributionMotivationLanczos Latent Factor RecommenderLLFR Computational Aspects

3 Experimental EvaluationEvaluation MethodologyCompared Algorithms and MetricsQuality of RecommendationsCold Start Recommendation

4 ConclusionsLLFR and Top-N Recommendation

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 3: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Recommender Systems - Collaborative FilteringChallenges of Modern CF AlgorithmsLatent Factor and Graph-Based models

Recommender System Algorithms

..RECOMMENDERSYSTEM

.

USERS

.ITEMS .

RATINGS

.

...

.

RecommendationList

.Rating

Predictions

.....

Collaborative Filtering Recommendation Algorithms

Wide deployment in Commercial EnviromentsSignificant Research Efforts

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 4: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Recommender Systems - Collaborative FilteringChallenges of Modern CF AlgorithmsLatent Factor and Graph-Based models

Recommender System Algorithms

..RECOMMENDERSYSTEM

.

USERS

.ITEMS .

RATINGS

.

...

.

RecommendationList

.Rating

Predictions

.....

Collaborative Filtering Recommendation Algorithms

Wide deployment in Commercial EnviromentsSignificant Research Efforts

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 5: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Recommender Systems - Collaborative FilteringChallenges of Modern CF AlgorithmsLatent Factor and Graph-Based models

Challenges of Modern CF Algorithms

SparsityIntrinsic RS Characteristic

Cold start Problem

Traditional CF techniques, such as neighborhood models, are verysusceptible to sparsity

Among the most promising approaches in alleviating sparsity relatedproblems are Latent Factor and Graph-Based models

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 6: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Recommender Systems - Collaborative FilteringChallenges of Modern CF AlgorithmsLatent Factor and Graph-Based models

Ranking - Based Algorithms

Graph-Based modelsFouss et al.

Random walks on a graph model

Gori and Pucci

ItemRank based on PageRank

Latent factor modelsCremonesi et al.

PureSVDUses the truncated singular value decomposition to approximate theuser-item rating matrix in order to produce recommendation vectorsfor the users.Produces better top-N recommendations compared to sophisticatedlatent factor methods and other popular CF techniques.

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 7: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

MotivationLanczos Latent Factor RecommenderLLFR Computational Aspects

Motivation

While promising in dealing with sparsity related problems, all the previousmethods are computationally expensive.

The graph-based models are required to handle a graph of n+mnodes.

PureSVD involves the computation of a truncated singular valuedecomposition of the rating matrix.

In our approach, we follow the latent factor paradigm.

We are interested in ranking-based recommendations ⇒ notcaring about the exact recommendation scores.

Is there a cheaper way to reduce the dimensionality of themodel?

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 8: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

MotivationLanczos Latent Factor RecommenderLLFR Computational Aspects

Motivation

While promising in dealing with sparsity related problems, all the previousmethods are computationally expensive.

The graph-based models are required to handle a graph of n+mnodes.

PureSVD involves the computation of a truncated singular valuedecomposition of the rating matrix.

In our approach, we follow the latent factor paradigm.

We are interested in ranking-based recommendations ⇒ notcaring about the exact recommendation scores.

Is there a cheaper way to reduce the dimensionality of themodel?

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 9: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

MotivationLanczos Latent Factor RecommenderLLFR Computational Aspects

Our Approach

We approach the problem as follows:Build a symmetric m ×m inter-item Correlation Matrix A.

Reduce the dimensionality of the model by computing the Lanczosvectors forming the basis of the Krylov subspace that correspondsto the inter-item correlation matrix A.

Build a Lower Dimensional Model which can be readily used toproduce recommendation vectors for the users.

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 10: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

MotivationLanczos Latent Factor RecommenderLLFR Computational Aspects

Related Work

The Lanczos Method:has primarily been used in the context of numerical linear algebra [6]

was found to achieve high quality results in applications fromInformation Retrieval as well as Face Recognition [7, 8]

this is the first work to suggest using Lanczos vectors for top-Nrecommendation.

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 11: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

MotivationLanczos Latent Factor RecommenderLLFR Computational Aspects

Lanczos Latent Factor Recommender (LLFR)

The Algorithm:Lanczos Latent Factor Recommender (LLFR):Input: The inter-item Correlation Matrix A ∈Rm×m, the Rating Matrix R ∈ Rn×m, a ran-dom unit vector q1 ∈ Rm, and the number oflatent factors f .Output: Matrix Π ∈ Rn×m whose rows are therecommendation vectors for every user.

1: q0 ← 02: β1 ← 03: for i ← 1, ..., f do4: w← Aqi − βiqi−1

5: αi ← wᵀqi

6: w← w − αiqi

7: βi+1 ← ‖w‖2

8: qi+1 ← w/βi+1

9: end for10: return Π← RQQᵀ

Computational Aspects:

O((nnz + m)f ) time for sparse matrices

where nnz is the number of nonzeroelements of A

Computational Tests:

0 100 200 300 400 500 6000

500

1,000

Latent FactorsTim

e(sec) LLFR

PureSVD

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 12: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Evaluation MethodologyCompared Algorithms and MetricsQuality of RecommendationsCold Start Recommendation

Experimental Evaluation

MethodologyWe use the Yahoo!Music dataset.

We have adopted the methodology used by Cremonesi et al:

Randomly sample 1.4% of the ratings of the dataset ⇒ probe set PUse each item vj , rated with 5 stars by user ui in P ⇒ test set TRandomly select another 1000 unrated items of the same user foreach item in TForm ranked lists by ordering all the 1001 items according to therecommendation scores produced by each method

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 13: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Evaluation MethodologyCompared Algorithms and MetricsQuality of RecommendationsCold Start Recommendation

Recommendation Methods

We compare LLFR against:PureSVD

average Commute Time (CT)

Pseudo-Inverse of the user-item graph Laplacian (L†)Matrix Forest Algorithm (MFA)

ItemRank (IR)

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 14: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Evaluation MethodologyCompared Algorithms and MetricsQuality of RecommendationsCold Start Recommendation

Accuracy Metrics

Recall

Precision

R-Score

R-Score(α) =∑q

max(yπq − d , 0)

2q−1α−1

Normalized Distance-based Performance Measure

DCG@k(y,π) =k∑

q=1

2yπq − 1

log2(2 + q)

Mean Reciprocal Rank

RR =1

minq{q : yπq > 0}

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 15: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Evaluation MethodologyCompared Algorithms and MetricsQuality of RecommendationsCold Start Recommendation

Recommendation Quality

Evaluate the performance of the algorithms on low density datausing the Yahoo!Music dataset.

5 10 15 200

0.2

0.4

0.6

N

Recall(N

)

Yahoo1 (density = 1.63%)

5 10 15 200

0.1

0.2

0.3

0.4

N

Recall(N

)

Yahoo2 (density = 0.55%)

0 0.2 0.4 0.6 0.8 1

.05

0.1

Recall

Precision

0 0.2 0.4 0.6 0.8 1

.01

.02

.03

Recall

Precision

5 10 15 20

0.2

0.4

N

NDCG

5 10 15 20

.05

0.1

0.15

0.2

N

NDCG

LLFR PureSVD L† MFA Commute Time ItemRank

Figure: Evaluation of top-N recommendation performance.

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 16: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Evaluation MethodologyCompared Algorithms and MetricsQuality of RecommendationsCold Start Recommendation

The Cold Start Problem

Difficulty of making reliable recommendations due to an initial lackof ratings

In beginning stages, when there is not sufficient number of ratingsfor the collaborative filtering algorithms to uncover similarities ⇒New Community Problem

Introduction of new users to an existing system where they have notrated many items ⇒ New Users Problem

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 17: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Evaluation MethodologyCompared Algorithms and MetricsQuality of RecommendationsCold Start Recommendation

New Community problem

Methodology:

Randomly select to include 10%, 20%, and 30% of the overallratings on three new artificially sparsified versions of the dataset.

Create test sets from the new community datasets.

Table 1: Ranking Performance for the New Commu-nity Problem

LLFR PureSVD L† MFA CT IR

10%

MRR 0.1184 0.1075 0.0106 0.0571 0.0197 0.0870R-Score 0.1474 0.1296 0.0085 0.0563 0.0089 0.1028

20%

MRR 0.0874 0.0722 0.0257 0.0271 0.0459 0.0630R-Score 0.1238 0.1180 0.0309 0.0331 0.0728 0.0905

30%

MRR 0.0930 0.0924 0.0316 0.0348 0.0646 0.0741R-Score 0.1352 0.1289 0.0396 0.0454 0.1047 0.1117

Figure: Ranking Performance for the New Community Problem

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 18: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

Evaluation MethodologyCompared Algorithms and MetricsQuality of RecommendationsCold Start Recommendation

New Users problem

Methodology:

Randomly select 50 users having rated at least 100 items andrandomly delete 95% of each users’ ratings.

Create the test set.

5 10 15 200

0.1

0.2

0.3

N

Recall(N

)

0 0.2 0.4 0.6 0.8 1

.02

.04

.06

Recall

Precision

5 10 15 20

0.1

0.2

N

NDCG

LLFR PureSVD L† MFA Commute Time ItemRank

Figure: Performance evaluation of top-N recommendation for New Usersproblem.

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 19: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Introduction & MotivationOur Contribution

Experimental EvaluationConclusions

LLFR and Top-N Recommendation

Conclusions

LLFRPerforms in a computationally efficient way

Reduces the dimensionality of the problem by constructing theLanczos basis of the Krylov subspace defined by a scaled inter-itemcorrelation matrix

Produces recommendations of high quality

Deals particularly well with the Cold-start Problem

New Community ProblemNew Users Problem

A promising candidate for large-scale recommendationscenarios

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 20: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Appendix References

References

Nikolakopoulos, A. N., Kouneli, M. and Garofalakis, J.:

A Novel Hierarchical Approach to Ranking BasedCollaborative Filtering.In: EANN 2013 (MHDW), Springer Verlag 2013.

Balakrishnan, S., Chopra, S.:

Collaborative ranking.In: ACM WSDM 2012, New York, USA, ACM (2012)143–152

Cremonesi, P., Koren, Y., Turrin, R.:

Performance of recommender algorithms on top-nrecommendation tasks.In: ACM RECSYS 2010, 39–46

Fouss, F., Pirotte, A., Renders, J., Saerens, M.:

Random-walk computation of similarities between nodesof a graph with application to collaborativerecommendation.IEEE TKDE on 19(3) (2007) 355–369

Gori, M., Pucci, A.:

Itemrank: a random-walk based scoring algorithm forrecommender engines.In: IJCAI 2007, San Francisco, CA, Morgan KaufmannPublishers Inc. (2007) 2766–2771

G. H. Golub and C. F. Van Loan.

Matrix computations, volume 3.JHU Press, 2012.

K. Blom and A. Ruhe.

A krylov subspace method for information retrieval.SIAM Journal on Matrix Analysis and Applications,26(2):566–582, 2004.

J. Chen and Y. Saad.

Lanczos vectors versus singular vectors for effectivedimension reduction.Knowledge and Data Engineering, IEEE Transactions on,21(8):1091–1103, 2009.

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 21: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Appendix References

Thanks!Q&A

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 22: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Appendix References

Lanczos Latent Factor Recommender Back

Inter-item Correlation Matrix A ∈ Rm×m

Captures the similarities between the elements of the item space.

ij th element is given by:

Ak` , ‖rk‖‖r`‖|Uk`|,

‖rj‖ is the euclidean length of the column that corresponds to itemvj in the rating matrix,Uk` ⊆ U denotes the set of users who rated both items vk and v`, i.e.

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender

Page 23: On the Use of Lanczos Vectors for Efficient Latent Factor ...wims14.csd.auth.gr/wp-content/uploads/presentations/Kalantzi.pdf · vectors forming the basis of the Krylov subspace that

Appendix References

Lanczos Latent Factor Recommender

Production of the recommendation listsFor each user ui we define a personalized recommendation vector:

πᵀi , rᵀi QQᵀ

rᵀi the ratings of user uiQ ∈ Rm×f is the matrix that contains the Lanczos vectors formingthe basis of the Krylov subspace Kf that corresponds to theinter-item correlation matrix A

A. N. Nikolakopoulos, M. Kalantzi, J. D. Garofalakis Lanczos Latent Factor Recommender


Recommended