+ All Categories
Home > Documents > L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC,...

L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC,...

Date post: 04-Jan-2016
Category:
Upload: bruno-booker
View: 214 times
Download: 0 times
Share this document with a friend
Popular Tags:
19
LEARNING TO MODEL RELATEDNESS FOR NEWS RECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011
Transcript
Page 1: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

LEARNING TO MODEL RELATEDNESS FOR NEWS RECOMMENDATION

Author: Yuanhua Lv and et al. UIUC, Yahoo! labs

Presenter: Robbie

1

WWW 2011

Page 2: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

OUTLINE

Introduction and Motivation Model Relatedness Experiment Conclusion

2

Page 3: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

INTRODUCTION Post-Click news Recommendation

Seed news

Candidate news

3

Page 4: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

MOTIVATION

Promote users’ navigation on the visited website

Yahoo!, Google focus on initial clicks, post-click news recommendation largely under-explored

Mainly depend on editors’ manual effort

No existing method proposed to model relatedness directly

4

Page 5: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

MODEL RELATEDNESS

Four aspects•Relevance•Novelty•Connection clarity •Transition smoothness

5

Page 6: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

RELEVANCE AND NOVELTY

Similar but not duplicate Novelty often in contrast to relevance Use same set of features to measure

them cosine similarity BM25 language models with Dirichlet prior smoothing language models with Jelinek-Mercer smoothing

6

Page 7: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

CONNECTION CLARITY

Relevance and novelty can only model word overlap between two articles s and d

Example: s: White House: Obamas earn $5.5 million in 2009

d: Obama’s oil spill bill seeks $118 million, oil company

s and d must be topically cohesive Connection clarity defines topical

cohesion of two news

7

Page 8: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

CONNECTION CLARITY

Rr Vw

dswcrwPrwPdswPdswP,

, ),(, )|()|(),()|(

Vw

dswcrwP,

, ),(, )|(

8

Page 9: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

TRANSITION SMOOTHNESS

Example: s: Toyota dismisses account of runaway Prius

d1: What to do if your car suddenly accelerates

d2: Toyota to build Prius at 3rd Japan plant: report

Definition: Measures how well a user’s reading interests can transit from s to d

Transition smoothness from s-d to d-s, i.e. from “known” to “novel”

9

Smooth(s, d1)>smooth(s, d2)

Page 10: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

TRANSITION SMOOTHNESS

10

Page 11: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

LEARNING A RELATEDNESS FUNCTION

11

Page 12: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

CONSTRUCTING TEST COLLECTION

Yahoo! News articles from March 1st to June 30th 2010

Each run, randomly generate 549 seed news from June 10th to June 20th with at least 2000 visits

Perform redundancy detection

12

Page 13: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

EDITORIAL JUDGMENTS

A group of professional news editors from a commercial online news website

4 point relatedness scale “very related”, “somewhat related”,

“redundant”, “unrelated”, any document with two different judgments, select a judgment with a higher ratio

High agreement in relative relatedness (80.8%),inspire to learn relatedness functions from pair-wise preference information 13

Page 14: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

EXPERIMENTS: COMPARING INDIVIDUAL RETRIEVAL MODELS

14

• “Body” is the best ---- title and abstract may lose information

• Cosine similarity as well as or even better than language models in some cases, but NDCG1 is worst • Effective for redundancy detection, which brings

redundant documents to the top

Page 15: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

EXPERIMENTS: COMPARING MACHINE-LEARNED RELATEDNESS MODELS

15

Page 16: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

EXPERIMENTS: ANALYZING THE UNIFIED RELATEDNESS MODEL

16

• Cosine similarity significantly worse than BM25 as individual relatedness function, but the most important in the unified model

• Connection clarity and transition smoothness contribute 7/15 together

Page 17: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

CONCLUSIONS

First attempt at post-click news recommendation

Propose 4 aspects to characterize news relatedness

Future work Incorporate into the unified relatedness function

non-content features Document and user adaptive measures will be

more accurate

17

Page 18: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

RELEVANCE AND NOVELTY

Problem: Top ranked documents may be redundant and unrelated articles

Solution : Passage retrieval

18

Page 19: L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

EXPERIMENTS: PASSAGE RETRIEVAL EVALUATION

19

o Fixed-length(250 empirically) arbitrary passage retrievalo Passage retrieval doesn’t help in most caseso Improve NDCG1 clearly. --- Probably relaxes the concern

of ranking redundant documents on top


Recommended