+ All Categories
Home > Documents > Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA)...

Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA)...

Date post: 19-Aug-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
40
Collaborative Topic Modeling for Recommending Scientific Articles Chong Wang and David M. Blei Best student paper award at KDD 2011 Computer Science Department, Princeton University Presented by Tian Cao 1 / 51
Transcript
Page 1: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Collaborative Topic Modeling for RecommendingScientific Articles

Chong Wang and David M. BleiBest student paper award at KDD 2011

Computer Science Department, Princeton University

Presented by Tian Cao

1 / 51

Page 2: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Outline

• Overview for Recommender Systems

• Methods• Collabarative Filtering• Topic Modeling• Collaborative topic models

• Results

• Conclusions

2 / 51

Page 3: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Overview for Recommender Systems

• The most widely used Recommender System

3 / 51

Page 4: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Overview for Recommender Systems

• The most widely used Recommender System

4 / 51

Page 5: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Overview for Recommender Systems

• Type “Digital Camera” in Amazon

• Too many choices to choose from

5 / 51

Page 6: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

What would you do?

• Read every description yourself

• What do other people say

6 / 51

Page 7: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

What would you do?

• Sorted by Avg. Customer Review

7 / 51

Page 8: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

More recommender systems

• I am a graduate student and I also do research ...

From Chong Wang’s slides

8 / 51

Page 9: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

This paper focus on Recommending Scientific artilces

• A search of “Data Mining” in Google Scholar gives 2,010,000 results.

• If I have read article A, B and C, what should I read next?

From Chong Wang’s slides

9 / 51

Page 10: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The problem of finding relevant articles

• Finding relevant articles is an important task for researcher

- learn about the general idea in an area- keep up to the state of art of an area

• Two popular exsting approaches

- following article references: easily missing relevant citations- using keyword search

- difficult to form queries- only good for directed exploration

• The author develop recommendation algorithms given onlinecommunities sharing referene libraries. (www.citeulike.org)

From Chong Wang’s slides

10 / 51

Page 11: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The problem of finding relevant articles

• Finding relevant articles is an important task for researcher

- learn about the general idea in an area- keep up to the state of art of an area

• Two popular exsting approaches

- following article references: easily missing relevant citations- using keyword search

- difficult to form queries- only good for directed exploration

• The author develop recommendation algorithms given onlinecommunities sharing referene libraries. (www.citeulike.org)

From Chong Wang’s slides

11 / 51

Page 12: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The problem of finding relevant articles

• Finding relevant articles is an important task for researcher

- learn about the general idea in an area- keep up to the state of art of an area

• Two popular exsting approaches

- following article references: easily missing relevant citations- using keyword search

- difficult to form queries- only good for directed exploration

• The author develop recommendation algorithms given onlinecommunities sharing referene libraries. (www.citeulike.org)

From Chong Wang’s slides

12 / 51

Page 13: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The problem of finding relevant articles

• Finding relevant articles is an important task for researcher

- learn about the general idea in an area- keep up to the state of art of an area

• Two popular exsting approaches

- following article references: easily missing relevant citations- using keyword search

- difficult to form queries- only good for directed exploration

• The author develop recommendation algorithms given onlinecommunities sharing referene libraries. (www.citeulike.org)

From Chong Wang’s slides

13 / 51

Page 14: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The problem of finding relevant articles

• Finding relevant articles is an important task for researcher

- learn about the general idea in an area- keep up to the state of art of an area

• Two popular exsting approaches

- following article references: easily missing relevant citations- using keyword search

- difficult to form queries- only good for directed exploration

• The author develop recommendation algorithms given onlinecommunities sharing referene libraries. (www.citeulike.org)

From Chong Wang’s slides

14 / 51

Page 15: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Two traditional approaches for recommendation

• Collaborative filtering (CF)

• Topic Modeling

• Combing of the two models

15 / 51

Page 16: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Collaborative Filtering

Three important elements

• users

• items: article

• ratings: a user likes/dislikes some of the articles

Popular solutions: collaborative filtering (CF)

• matrix factorization: one of the most popular algorithms forrecommender system

The user-item matrix

16 / 51

Page 17: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Matrix factorization

• Users and items are represented in a shared but unknown latent space(lantent factor model)

• user i − ui ∈ Rk

• item j − vj ∈ Rk

• Each dimension of the latent space is assumed to represent some kindof unknown factors

• The rating of item j by user i is achieved by the dot product,

rij = uTi vj ,

where rij = 1 indicates like and 0 dislike. In the matrix form,

R = UTV .

17 / 51

Page 18: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Learning and Prediction

• Learning the latent vectors for users and items

minU,V

∑i ,j

(rij − uTi vj)2 + λu‖ui‖2 + λv‖vj‖2,

where λu and λv are regularization parameters.

• Prediction for user i on item j (not rated by user i before),

rij ≈ uTi vj .

How do we understand these latent vectors for users and items?

18 / 51

Page 19: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Disadvantages for matrix factorization

Two main disadvantages to matrix factorization for recommendation

• learnt latent space is not easy to interpret

• only uses information from the users-cannot to geralize to completelyunrated items

19 / 51

Page 20: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The author’s criteria for an article recommender system

It should be able to

• recommend old articles (already rated, easy)

• recommend new articles (not rated before, not that easy, but doable)

• provide the interpretability - not just a list of items (challenging)

The goal is not only to improve the performance, but also theinterpretability.

20 / 51

Page 21: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Topic modeling

• Each topic is a distribution over words

• Each document is a mixture of topics

• Each word is drawn from one of those topics

From Chong Wang’s slides

21 / 51

Page 22: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Latent Dirichlet allcation

Latent Dirichlet allocation (LDA) is a popular topic model. It assumes

• There are K topics

• For each article, topic proportions θ ∼ Dirichlet(α)

Note that θ can explain the topics that article talks about!

From Chong Wang’s slides

22 / 51

Page 23: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The graphical model

• Vertices denote random variables

• Edges denote dependence between random variables

• Shading denotes observed variables

• Plates denote replicated variables

From Chong Wang’s slides

23 / 51

Page 24: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Running a topic model

• Data: article titles + abstracts from CiteUlike• 16,980 articles• 1.6M words• 8K unique terms

• Model:200-topic LDA model with variational inference

24 / 51

Page 25: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

25 / 51

Page 26: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Inferred topic propostions for article

26 / 51

Page 27: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Comparison of the article representation

27 / 51

Page 28: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Collabrative topic models: motivations

• In matrix factorization, an article has a latent representation v insome unknown latent space

• In topic modeling, an article has topic proportions θ in the learnedtopic space

From Chong Wang’s slides

28 / 51

Page 29: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Collabrative topic models: motivations

If we simply fix v = θ, we seem to find a way to explain the unknownspace using the topic space.

From Chong Wang’s slides

29 / 51

Page 30: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Collabrative topic models: motivations

The author proposed an approach to fill the gap.

From Chong Wang’s slides

30 / 51

Page 31: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The basic idea

• What the users think of an article might be different from what thearticle is actually about, but unlikely entirely irreleant

• We assume the item latent vector v is close to topic propotions θ, butcould diverge from θ if it has to

For an article,

• When there are few ratings, vj is unlikely to be far from θj

• When there are lots of ratings, vj is likely to diverge from θj . Itactually generates or removes some topics to cater the users

31 / 51

Page 32: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The proposed model

For each user i ,

• Draw user latent vector ui ∼ N(0, λ−1u Ik).

For each article j ,

• Draw topic proportions θi ∼ Dirichlet(α).

• Draw item latent offset εj ∼ N(0, λ−1v Ik) and set the item latent

vector as vj = θj + εj .

• Everything else is the same, the rating becomes,

E [rij ] = uTi vj = uTi (θj + εj).

This model is called Collaborative Topic Regression (CTR).

• Offset εj corrects θj for the popularity

• Precision parameter λv penalizes how much vj could diverge from θj .

32 / 51

Page 33: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

The graphical model

From Chong Wang’s slides

33 / 51

Page 34: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Learning and Prediction• Learning: use a standard EM algorithm to learn the maximum a

posteriori (MAP) estimates.• Prediction: consider two scenarios,

• In-matrix prediction: items have been rated before

r?ij ≈ (u?i )T (θ?j + ε?j ).

• Out-of-matrix prediction: items have never been rated

r?ij ≈ (u?i )T θ?j .

34 / 51

Page 35: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Experimental settings

• Data from CiteUlike:• 5,551 users, 16,980 articles, and 204,986 bibliography entries.

(Sparsity=99.8 %)• For each article, concatenate its title and abstract as its content.• These articles were added to CiteUlike between 2004 and 2010

• Evaluation: five-fold cross-validation with recall,

recall@M =number of articles the user likes in top M

total number of article the user likes

• Comparison: matrix factorization for collaborative filter (CF),text-based method (LDA).

35 / 51

Page 36: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Results

• In-matrix prediction: CTR improves more when number ofrecommendations gets larger.

• Out-of-matrix prediction: about the same as LDA.

36 / 51

Page 37: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

When precision parameter λv variesRecall λv penalizes how v could diverge from θ,

• When λv is small, CTR behaves more like CF.

• When λv increases, CTR brings in both ratings and content.

• When λv is large, CTR behaves more like LDA.

37 / 51

Page 38: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Interpretation: example user profile I

38 / 51

Page 39: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Interpretation: example user profile II

39 / 51

Page 40: Chong Wang and David M. Blei Best student paper award at ... · Latent Dirichlet allocation (LDA) is a popular topic model. It assumes There are K topics For each article, topic proportions

Conclusions

• develop an algorithm to recommend scientific articles to users of anonline community

• combines the merits of traditional collaborative filtering andprobabilistic topic modeling

• provides an interpretable latent structure for users and items

• can form recommendation about both existing and newly publishedarticles

40 / 51


Recommended