+ All Categories
Home > Documents > Answer Type Prediction for KB-QA - Max Planck...

Answer Type Prediction for KB-QA - Max Planck...

Date post: 27-Jul-2019
Category:
Upload: ngodieu
View: 215 times
Download: 0 times
Share this document with a friend
41
TIPI: Answer Type Prediction for Question Answering Rishiraj Saha Roy Post-doctoral Researcher Max Planck Institute for Informatics With: David Ziegler Abdalghani Abujabal Gerhard Weikum
Transcript
Page 1: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

TIPI: Answer Type Prediction for Question Answering

Rishiraj Saha RoyPost-doctoral Researcher

Max Planck Institute for Informatics

With:

David Ziegler

Abdalghani Abujabal

Gerhard Weikum

Page 2: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

TIPI: Answer Type Prediction for Question Answering

Rishiraj Saha RoyPost-doctoral Researcher

Max Planck Institute for Informatics

Page 3: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

MPI for Informatics

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 3

Page 4: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

DBIS@MPI-Inf

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 4

Gerhard Weikum

29 PhD Students, 5 Post-docs from

21 countries!!

Klaus

Berberich

Pauli

Miettinen

Jilles

Vreeken

Page 5: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Research areas

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 5

Named entity recognition, disambiguation and typing

Question-answering over knowledge bases

Credibility analysis of text

Information extraction from Web tables

User privacy in online forums

Page 6: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Question-answering

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 6

Abdalghani Abujabal

David Ziegler

Page 7: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

User Privacy

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 7

Joanna Asia Biega

Sedigheh Eslami

Page 8: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Stunning Saarbruecken

Page 9: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Intuition

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 9

Page 10: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Intuition

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 10

KB-QA System

Lionel

MessiKB entitiesDan

Carter

Rachel

Bloom

Page 11: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Intuition

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 11

KB-QA System

Lionel

MessiKB entitiesDan

Carter

Rachel

Bloom

Soccer

player

Actor

Rugby

player

Page 12: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Intuition

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 12

KB-QA System

who won the golden ball award in wc2014?

Lionel

MessiKB entitiesDan

Carter

Rachel

Bloom

Soccer

player

Actor

Rugby

player

Page 13: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Intuition

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 13

KB-QA System

who won the golden ball award in wc2014?TIPI

Lionel

MessiKB entitiesDan

Carter

Rachel

Bloom

Soccer

player

Actor

Rugby

player

Page 14: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Intuition

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 14

KB-QA System

who won the golden ball award in wc2014?TIPI

Lionel

MessiKB entitiesDan

Carter

Rachel

Bloom

Soccer

player

Actor

Rugby

player

Page 15: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 15

Background: Knowledge Bases

Fact knowledge bases

Collection of subject-predicate-object (SPO) triples (millions!)

Use type ontologies

Useful for knowledge aggregation, browsing, search, …

Lionel_Messi plays_for Barcelona_FC

Lionel_Messi born_in Rosario

Rosario is_located_in Argentina

Lionel_Messi type soccer_player

Page 16: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 16

Background: Question Answering

Expressed in natural language

Entity-seeking factoid questions

Semantic parsing formulates formal query for NL question

Question: Where was Lionel Messi born?

Query: Lionel_Messi born_in x?

Query is executed over KB to fetch answer

Page 17: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Utility in KB-QA

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 17

Entity

identification

Query

formulation

Query

ranking

Query

execution

Fader et al. 2014

Yih et al. 2015

Yin et al. 2015

Bast and Haussmann 2015

And many more…

Page 18: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Utility in KB-QA

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 18

Entity

identification

Query

formulation

Query

ranking

Query

execution

TIPI

TIPI

Fader et al. 2014

Yih et al. 2015

Yin et al. 2015

Bast and Haussmann 2015

And many more…

Page 19: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 19

Levels of implicitness

Which soccer player won the Golden Ball award in the Soccer World Cup

of 2014?

Which footballer won the Golden Ball award in the World Cup of 2014?

Who won the Golden Ball award in the World Cup of 2014?

who got the 2014 golden ball?

last golden ball

Page 20: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 20

Ideas to build on…

Question type classification (Li and Roth '02, Blunsom et al ‘06)

Type coercion by Watson (Murdock et al. '12)

Named entity typing (HYENA, FINET)

Plug-in module (Yavuz et al. EMNLP '16)

Manual coarse-grained types (Bast and Haussmann '15)

Lexical types (Berant and Liang '15)

Page 21: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

What comes when..

Use cases

System overview

Type system

Approach

Data collection

Evaluation

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 21

Page 22: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 22

Use Cases

Query modification

Compatibility of decompositions

Query re-ranking

Answer ranking

Page 23: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 23

Use Cases

Query modification

Compatibility of decompositions

Query re-ranking

Answer ranking

Page 24: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Query modification

Which footballer won the Nobel prize?

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 24

x? won Nobel_prize . x? type footballer

footballerPredict type

Instantiate main queryAdd type

constraint

www.freebase.com/award/

award_presenting_organization/

awards_presented

base.associationfootball.

soccer_playertype.object.type

Page 25: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 25

Compatibility of decompositions

Who plays for Barcelona?

Decompose

+

rewrite

Which Barcelona player won the Golden Ball?

Who won the Golden Ball?

soccer_player 0.65

hockey_player 0.23

violin_player 0.12

soccer_player 0.46

rugby_player 0.35

movie_actor 0.19

Page 26: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 26

Compatibility of decompositions

Who plays for Barcelona?

Decompose

+

rewrite

Which Barcelona player won the Golden Ball?

Who won the Golden Ball?

soccer_player 0.65

hockey_player 0.23

violin_player 0.12

soccer_player sportsperson 0.46

rugby_player 0.35

movie_actor 0.19

Page 27: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 27

System Overview: Training

TIPI

Type system Training data with

distant supervision

which * … is …

what * of * …

Pattern base

Page 28: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 28

System Overview: Runtime

TIPI

Which footballer

won the 2014

Golden Ball?

Candidate

typesFinal types!!

Pattern-

based

extraction

Hierarchical

classification

Soccer player

Sportsperson

Rugby player

Actor

Person

Person: 0.91

Actor: 0.14

Soccer

player: 0.68

Sportsperson: 0.87

Rugby

player: 0.35

Page 29: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 29

Type System

Freebase types (Bollacker et al. '08)

WordNet subsets

POLY (Grycner and Weikum '16)

FINET (Del Corro et al. '15)

HYENA (Yosef et al. '12)

Page 30: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 30

Type System

Wikipedia categories

YAGO types (Wikipedia + WordNet) (Suchanek et al. '07)

Page 31: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 31

Approach: Pattern-based Extractor

Leverage context-aware lexical types

Run fewer classifiers per question

Use lexicons to map to “valid” types

… which/what NN VB* … Which footballer won the Golden Ball?

… who/what/which <be> DT+ NN … Who is the governor of New York?

… what <be> NN’s NN VB* … What is Germany’s currency called?

… which/what NN of NN … What kind of government does UK have?

Page 32: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 32

Approach: Pattern-based Extractor

Use relation type signatures

Extract and disambiguate relational phrases in question

Find the type signature of the relation

Use arguments to identify candidate types

<film_actor> played_in <movie>

<sportsman> won_award <sports_award>

<person> born_in <city>

Page 33: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 33

Approach: Hierarchical Classification

Procure training data for each type and learn a classifier

Features

n-gram patterns

n-grams from DP

n-grams with entity types (NET)

Expand with paraphrases + word2Vec

Person: 0.91

Actor: 0.14

Soccer

player: 0.68

Sportsperson: 0.87

Rugby

player: 0.35

Page 34: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 34

Approach: Hierarchical Classification

Global threshold for stopping classifiers

Sibling policy for sampling negative examples

Person: 0.91

Actor: 0.14

Soccer

player: 0.68

Sportsperson: 0.87

Rugby

player: 0.35

Page 35: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 35

Data collection: Distant supervision

No direct human annotated answer types for questions

For each type, acquire training data

Question-answer pairs

Type descriptions

Look up answer types from KB

with respect to type system

Page 36: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

Data collection: Distant supervision

Question: Where was F. Scott Fitzgerald educated?

Targets: St. Paul Academy Princeton UniversityNardin Academy

endowed

organisation

non-profit

organisation

business

operation

...art owneraward winnerfilm locationorganization

school university

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 36

Page 37: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 37

Evaluation

To what degree typing module helps KB-QA system

Final F-score with typing module

Complex questions (to do)

How often wrong decompositions were ruled out

How often are we able to identify

“valid” types by aggregation

over decompositions

Page 38: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 38

Evaluation

WebQuestions benchmark with

5k questions

Use query with type constraints

in a learning-to-rank framework

Significant improvement over

baseline with typing

51.0*

50.5

QUINT+TIPI QUINT

F-SCORE

Page 39: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 39

Representative examples

Who played Bruce Wayne in Dark Knight? Film_actor

Who did Rupert Grint play on Harry potter? Fictional

character

Who led the attack on Normandy? Military_commander

Where is Abraham Lincoln buried? Place_of_interment

Who is in charge of Libya now? Charge

Page 40: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 40

In summary…

Proposed method for predicting answer type based on

question features

Using type information significantly improves quality of

answers on a popular benchmark

Must plug-in with other KB-QA systems!!

Components used may be applicable to more use cases

Page 41: Answer Type Prediction for KB-QA - Max Planck Societypeople.mpi-inf.mpg.de/~rsaharo/adobe-talk-rishi-dec2016.pdf · TIPI: Answer Type Prediction for Question Answering ... (SPO) triples

16-Dec-16 Answer Typing for KB-QA: Adobe TechTalk 41

Open questions

More ways to use types?

Better pattern-based extractors?

New features for hierarchical classifiers?

Cleaner method for distant supervision?

Thank you !!


Recommended