+ All Categories
Home > Documents > Improving Users’ Demographic Prediction via the Videos ...

Improving Users’ Demographic Prediction via the Videos ...

Date post: 29-Dec-2021
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
20
Improving Users’ Demographic Prediction via the Videos They Talk about Yuan Wang, Yang Xiao, Chao Ma, and Zhen Xiao Peking University, China EMNLP2016, 3 November 2016
Transcript

Improving Users’ Demographic Prediction via the Videos They Talk about

Yuan Wang, Yang Xiao, Chao Ma, and Zhen Xiao

Peking University, China

EMNLP2016, 3 November 2016

Table of Contents

• Introduction

• Data

• Indirect Relationships between Users and Videos

• Evaluation

• Conclusion

Introduction

Introduction

Love, pretty,

work, …

Sports, Financial,

…, Car

Female

Male

Introduction

• Mean Girls, Pretty Woman, The Devil Wears Prada

• House of Cards, Mission Impossible, NBA

video

video

video

video

Introduction

“Will the Big Bang Theory last into the next century?”

“Sheldon is so cool, I love him!”

“Jim Parsons was nominated for another Emmy Award”

Introduction

Introduction

User 1

User 1

Video name Actor name Keyword

With the help of

User 1

Video name Actor name Keyword

With the help of ???

Data

Normal User

Verified User

User followed by Verified User

Data

1 2 3 4 … n

1 2 3 4 m

Video

User

unobvious direct indirect

Discover Indirect Relationships

• Unobvious relationship• N*M pairs

• Direct relationship• User 2, “Will the Big Bang Theory last into the next century?”

• Indirect relationship• User 3 posts , “Sheldon is so cool, I love him!”

Video name Actor name Keyword

Discover Indirect Relationships

𝑃 𝑣𝑛 =𝑛𝑢𝑚 user𝑠 𝑤𝑎𝑡𝑐ℎ𝑒𝑑 𝑡ℎ𝑒 𝑛𝑡ℎ 𝑣𝑖𝑑𝑒𝑜

𝑛𝑢𝑚(𝑢𝑠𝑒𝑟𝑠)

𝑃 𝑤𝑛𝑖|𝑣𝑛 =𝑛𝑢𝑚 user𝑠 𝑤𝑎𝑡𝑐ℎ𝑒𝑑 𝑡ℎ𝑒 𝑛𝑡ℎ 𝑣𝑖𝑑𝑒𝑜 𝑎𝑛𝑑 𝑚𝑒𝑛𝑡𝑖𝑜𝑛𝑒𝑑 𝑡ℎ𝑒 𝑛𝑖𝑡ℎ 𝑘𝑒𝑦𝑤𝑜𝑟𝑑

𝑛𝑢𝑚(user𝑠 𝑤𝑎𝑡𝑐ℎ𝑒𝑑 𝑡ℎ𝑒 𝑛𝑡ℎ 𝑣𝑖𝑑𝑒𝑜)

𝑃 𝑎𝑛𝑗|𝑣𝑛 =𝑛𝑢𝑚 user𝑠 𝑤𝑎𝑡𝑐ℎ𝑒𝑑 𝑡ℎ𝑒 𝑛𝑡ℎ 𝑣𝑖𝑑𝑒𝑜 𝑎𝑛𝑑 𝑚𝑒𝑛𝑡𝑖𝑜𝑛𝑒𝑑 𝑡ℎ𝑒 𝑛𝑗𝑡ℎ 𝑎𝑐𝑡𝑜𝑟

𝑛𝑢𝑚(user𝑠 𝑤𝑎𝑡𝑐ℎ𝑒𝑑 𝑡ℎ𝑒 𝑛𝑡ℎ 𝑣𝑖𝑑𝑒𝑜)

Step 1

Step 2

Discover Indirect RelationshipsVideo name Actor name Keyword

Direct

Direct + Indirect(more denser)

Video name Actor name Keyword

Two Baseline Model

Two Indirect RelationshipBased Model

Discriminant Model• Matrix Factorization1, K=20

• LR2, SVM2, GBDT3

1 libFFM2 liblinear3 XGBoost

Generative Model

• Calculate video demographic tendency

• Calculate user demographic attribute

• Smooth the result

Evaluation

1

2

3

4

Evaluation

Evaluation

Conclusion• Our motivation is that user's video related behavior is usually

under-utilized on demographic prediction tasks.

• With the help of third-party video sites, we detect the direct and indirect relationships between users and video describing words, and demonstrate this effort can improve the accuracy of users' demographic predictions.

• To our knowledge, this is the first work which explores demographic prediction by fully using users' video describing words.

• This framework has good scalability and can be applied on other concrete features, such as user's book reading behaviors and music listening behaviors.

Thanks!


Recommended