Date post: 20-Jan-2016
Page 1: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Question Identification on Twitter

Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang

04/21/23 1


Agenda

• Background
• Two-phase Classification
• Experiments
• Conclusion


Background


Two Challenges

• 140 characters

• Special features


Two-phase Classification

• Interrogative Tweet Detection
  – Tweets which contain question sentences
• Qweet Extraction
  – Interrogative tweets which require some information or help and thus need to be answered

[Diagram: Tweets → Interrogative Tweet Detection → Interrogative Tweets → Qweet Extraction → Qweets]
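The two-phase flow on this slide can be sketched as a simple filter chain. This is a minimal illustration, not the authors' implementation: the function `identify_qweets` and the two toy predicates are hypothetical names introduced here, and the toy rules stand in for whatever detector and classifier are actually used in each phase.

```python
from typing import Callable, Iterable, List


def identify_qweets(
    tweets: Iterable[str],
    is_interrogative: Callable[[str], bool],
    is_qweet: Callable[[str], bool],
) -> List[str]:
    """Run phase 1, then phase 2, and return the tweets that pass both."""
    # Phase 1: keep only tweets that contain a question sentence.
    interrogative = [t for t in tweets if is_interrogative(t)]
    # Phase 2: keep only interrogative tweets that actually seek an answer.
    return [t for t in interrogative if is_qweet(t)]


# Toy stand-ins for the two phases (hypothetical, for illustration only).
def toy_is_interrogative(tweet: str) -> bool:
    return "?" in tweet


def toy_is_qweet(tweet: str) -> bool:
    # Pretend that tweets addressed to "anyone" are information-seeking.
    return "anyone" in tweet.lower()


sample = [
    "Does anyone know a good sushi place in Boston?",
    "Why do I even bother? lol",
    "Just landed in Hong Kong",
]
print(identify_qweets(sample, toy_is_interrogative, toy_is_qweet))
```

Passing the two phases in as callables keeps the pipeline independent of how each phase is built, so a rule-based or a learned detector can be swapped in without touching the chaining logic.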


Interrogative Tweet Detection

• Rule-based Approach
  – Question marks
  – 5W1H words and Refined 5W1H words
  – Heuristic Rules (Efron and Winget, 2010)
• Learning-based Approach
  – Frequent question patterns mining (Pei et al., 2001) + One-class SVM (Schölkopf et al., 2001)
  – Over 850,000 QA pairs in community question answering (CQA) portals were used
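The two simplest rule-based signals listed above, question marks and 5W1H words, might look like the following sketch. The function names are hypothetical, and combining the two rules with a logical OR is an assumption made for illustration; the slide does not say how the rules are combined or evaluated.

```python
import re

# The six 5W1H words: who, what, when, where, why, how.
FIVE_W_ONE_H = {"what", "when", "where", "who", "why", "how"}


def has_question_mark(tweet: str) -> bool:
    """Rule 1: the tweet contains a question mark anywhere."""
    return "?" in tweet


def has_5w1h_word(tweet: str) -> bool:
    """Rule 2: the tweet contains a 5W1H word as a whole token."""
    tokens = re.findall(r"[a-z']+", tweet.lower())
    return any(tok in FIVE_W_ONE_H for tok in tokens)


def is_interrogative_rule_based(tweet: str) -> bool:
    """Assumed combination: flag the tweet if either rule fires."""
    return has_question_mark(tweet) or has_5w1h_word(tweet)


for text in [
    "Is it raining in Seattle?",
    "how do I reset my router",
    "I know what you mean",
    "Just had a great lunch",
]:
    print(text, "->", is_interrogative_rule_based(text))
```

Note that "I know what you mean" is flagged even though it is not a question. False positives of this kind are one reason a plain 5W1H match is usually tightened, as the "Refined 5W1H words" item on the slide suggests.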


Qweet Extraction

• Types of Interrogative Tweets


Qweet Extraction

• Feature Extraction


Experiments

• Data Set


Results: Interrogative Tweet Detection

• Heuristics for refining 5W1H words
  – H1: The 5W1H word must appear at the beginning of a sentence
  – H2: Add auxiliary words to the original 5W1H words
    • “what” -> “what is” and “what are”
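The two heuristics can be sketched together as follows. This is an illustration under stated assumptions: only "what is" and "what are" come from the slide, while the rest of the `AUXILIARIES` list, the sentence splitter, and all function names are hypothetical choices made here.

```python
import re

FIVE_W_ONE_H = ("what", "when", "where", "who", "why", "how")

# "is" and "are" come from the slide's example; the others are assumed.
AUXILIARIES = ("is", "are", "was", "were", "do", "does", "did",
               "can", "should", "would")

# H2: expand each 5W1H word into "<word> <auxiliary>" phrases.
REFINED_PHRASES = tuple(f"{w} {a}" for w in FIVE_W_ONE_H for a in AUXILIARIES)


def split_sentences(tweet: str) -> list:
    """Naive splitter: break on ., ! and ?, then normalise whitespace."""
    parts = re.split(r"[.!?]+", tweet.lower())
    return [" ".join(p.split()) for p in parts if p.strip()]


def matches_refined_5w1h(tweet: str) -> bool:
    """H1 + H2: some sentence must begin with a refined 5W1H phrase."""
    for sentence in split_sentences(tweet):
        for phrase in REFINED_PHRASES:
            if sentence == phrase or sentence.startswith(phrase + " "):
                return True
    return False


for text in [
    "What is the best phone under $300",
    "So tired. How do I stay awake",
    "I know what is wrong",
    "What a day",
]:
    print(text, "->", matches_refined_5w1h(text))
```

The last two examples are rejected: "I know what is wrong" fails H1 because the phrase is not sentence-initial, and "What a day" fails H2 because no auxiliary follows "what". A tweet that opens with an @mention before the question would also be missed by this naive splitter.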


Results: Qweet Extraction

• Context features are of great importance in distinguishing qweets from non-qweets

• Tweet-specific features also help in qweet identification
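The transcript does not list which tweet-specific features were used, so the sketch below is only a guess at what such features could look like. Every feature name here (mentions, hashtags, URLs, retweet marker, length) is an assumption chosen for illustration, not a feature set taken from the slides.

```python
import re


def tweet_specific_features(tweet: str) -> dict:
    """Extract a few hypothetical Twitter-specific signals from one tweet."""
    text = tweet.strip()
    return {
        # "@user" not preceded by a word character (so e-mail addresses are skipped).
        "has_mention": re.search(r"(?<!\w)@\w+", text) is not None,
        # "#topic" not preceded by a word character.
        "has_hashtag": re.search(r"(?<!\w)#\w+", text) is not None,
        # Any http:// or https:// link.
        "has_url": re.search(r"https?://\S+", text) is not None,
        # Classic retweet convention: the tweet starts with "RT @".
        "is_retweet": text.startswith("RT @"),
        # Length in characters (tweets were capped at 140 at the time).
        "length": len(text),
    }


example = "RT @alice: anyone know a good #python tutorial? http://example.com/x"
print(tweet_specific_features(example))
```

A dictionary like this could be fed to any standard classifier alongside other features; which classifier and which features the authors used is not recoverable from this transcript.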


Conclusion

• First attempt at discovering questions in tweets automatically
• Two-phase classification
  – Interrogative Tweet Detection
  – Qweet Extraction
• Limitations and future work
  – Tweets containing rhetorical questions and complicated self-ask-self-answer sentences
  – Real-time clustering (Ahmed et al., 2011)
  – Question analysis and classification


Thank You!

Q&A
