+ All Categories
Home > Documents > CPSC 503 Computational Linguistics

CPSC 503 Computational Linguistics

Date post: 18-Jan-2016
Category:
Upload: danno
View: 32 times
Download: 0 times
Share this document with a friend
Description:
CPSC 503 Computational Linguistics. Lecture 11 Giuseppe Carenini. Knowledge-Formalisms Map (including probabilistic formalisms). State Machines (and prob. versions) (Finite State Automata,Finite State Transducers, Markov Models ). Morphology. Logical formalisms (First-Order Logics). - PowerPoint PPT Presentation
46
06/18/22 CPSC503 Winter 2009 1 CPSC 503 Computational Linguistics Lecture 11 Giuseppe Carenini
Transcript
Page 1: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 1

CPSC 503Computational Linguistics

Lecture 11Giuseppe Carenini

Page 2: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 2

Knowledge-Formalisms Map(including probabilistic formalisms)

Logical formalisms (First-Order Logics)

Rule systems (and prob. versions)(e.g., (Prob.) Context-Free

Grammars)

State Machines (and prob. versions)

(Finite State Automata,Finite State Transducers, Markov Models)

Morphology

Syntax

PragmaticsDiscourse

and Dialogue

Semantics

AI planner(MDP Markov Decision Processes)

Page 3: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 3

Next three classes• What meaning is and how to represent it• Semantic Analysis: How to map

sentences into their meaning– Complete mapping still impractical– “Shallow” version: Semantic Role Labeling

• Meaning of individual words (lexical semantics)

• Computational Lexical Semantics Tasks– Word sense disambiguation– Word Similarity

Page 4: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 4

Today 16/10

• Semantics / Meaning /Meaning Representations

• Linguistically relevant Concepts in FOPC/FOL

• Semantic Analysis

Page 5: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 5

SemanticsDef. Semantics: The study of the meaning of

words, intermediate constituents and sentences

Def1. Meaning: a representation that expresses the linguistic input in terms of objects, actions, events, time, space… beliefs, attitudes...relationships

Def2. Meaning: a representation that links the linguistic input to knowledge of the world

Language independent ?

Page 6: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 6

Semantic Relations involving Sentences

Paraphrase: have the same meaning• I gave the apple to John vs. I gave John the apple• I bought a car from you vs. you sold a car to me• The thief was chased by the police vs. ……

Same truth conditions

Entailment: “implication”• The park rangers killed the bear vs. The bear is dead• Nemo is a fish vs. Nemo is an animal

Contradiction: I am in Vancouver vs. I am in India

Page 7: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 7

Meaning Structure of Language

• How does language convey meaning?– Grammaticization

– Display a basic predicate-argument structure (e.g., verb complements)

– Display a partially compositional semantics

– Words

Page 8: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 8

GrammaticizationConcept Affix

• Past• More than one• Again• Negation

• -ed• -s• re-• in-, un-, de-

Words from Nonlexical categories• Obligation

• Possibility• Definite, Specific• Indefinite, Non-specific• Disjunction• Negation• Conjunction

• must• may• the• a• or• not• and

Page 9: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 9

Common Meaning Representations

FOL

Semantic Nets

Frames

I have a car

Common foundation: structures composed of symbols that correspond to objects and relationships

Page 10: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 10

Requirements for Meaning Representations

e.g, • Does Maharani serve vegetarian food? -> Yes• What restaurants are close to the ocean?-> C and Monks

• Sample NLP Task: giving advice about restaurants– Accept queries in NL– Generate appropriate responses by

consulting a Knowledge Base

Page 11: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 11

Verifiability (in the world?)

• Example: Does LeDog serve vegetarian food?• Knowledge base (KB) expressing our world

model (in a formal language)

• Convert question to KB language and verify its truth value against the KB content

Yes / No / I do not know

Page 12: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 12

Non Yes/No Questions

• Example: I'd like to find a restaurant where I can get vegetarian food.

• Indefinite reference <-> variable serve(x,VegetarianFood)

• Matching succeeds only if variable x can be replaced by known object in KB.

What restaurants are close to the ocean?-> C and Monks

Page 13: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 13

Canonical Form

Paraphrases should be mapped into the same representation.

• Does LeDog have vegetarian dishes?• Do they have vegetarian food at LeDog?• Are vegetarian dishes served at LeDog?• Does LeDog serve vegetarian fare?• ……………- Words with overlapping

meanings- Syntactic constructions are

systematically related

Page 14: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 14

Inference

• Consider a more complex request– Can vegetarians eat at Maharani?

• KB contains

• Def. System’s ability to draw valid conclusions based on the meaning representations of inputs and its KB

serve(Maharani,VegetarianFood)

serve( x , VegetarianFood) => CanEat(Vegetarians,At( x ))

Page 15: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 15

Meaning Structure of Language

• How does language convey meaning?– Grammaticization

– Display a basic predicate-argument structure (e.g., verb complements)

– Display a partially compositional semantics

– Words

Page 16: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 16

Predicate-Argument Structure

• Subcategorization frames specify number, position, and syntactic category of arguments

• Examples: give NP2 NP1, find NP, sneeze []

• Represent relationships among concepts

• Some words act like arguments and some words act like predicates:– Nouns as concepts or arguments:

red(ball)– Adj, Adv, Verbs as predicates: red(ball)

Page 17: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 17

Semantic (Thematic) Roles

• Semantic Roles: Participants in an event– Agent: George hit Bill. Bill was hit by George– Theme: George hit Bill. Bill was hit by George

Source, Goal, Instrument, Force…

This can be extended to the realm of semantics

• Verb subcategorization: Allows linking arguments in surface structure with their semantic roles

• Mary gave/sent/read a book to Ming Agent Theme Goal• Mary gave/sent/read Ming a book Agent Goal Theme

Page 18: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 18

First Order Predicate Calculus (FOPC)

• FOPC provides sound computational basis for verifiability, inference, expressiveness…– Supports determination of truth– Supports Canonical Form– Supports question-answering (via variables)– Supports inference– Argument-Predicate structure– Supports compositionality of meaning

Page 19: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 19

Today 16/10

• Semantics / Meaning /Meaning Representations

• Linguistically relevant Concepts in FOPC/FOL

• Semantic Analysis

Page 20: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 20

Linguistically Relevant Concepts in FOPC

• Categories & Events (Reification)• Representing Time• Beliefs (optional, read if relevant to your project)

• Aspects (optional, read if relevant to your project)

• Description Logics (optional, read if relevant to your project)

Page 21: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 21

Categories & Events

• Events: can be described in NL with different numbers of arguments… – I ate– I ate a turkey sandwich– I ate a turkey sandwich at my desk– I ate at my desk– I ate lunch– I ate a turkey sandwich for lunch– I ate a turkey sandwich for lunch at my desk

• Categories:– VegetarianRestaurant (Joe’s) - relation vs.

object– MostPopular(Joe’s,VegetarianRestaurant)

Reification

– ISA (Joe’s,VegetarianRestaurant)– AKO (VegetarianRestaurant,Restaurant)

Page 22: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 22

MUC-4 Example

INCIDENT: DATE 30 OCT 89 INCIDENT: LOCATION EL SALVADOR INCIDENT: TYPE ATTACK INCIDENT: STAGE OF EXECUTION ACCOMPLISHED INCIDENT: INSTRUMENT ID INCIDENT: INSTRUMENT TYPEPERP: INCIDENT CATEGORY TERRORIST ACT PERP: INDIVIDUAL ID "TERRORIST" PERP: ORGANIZATION ID "THE FMLN" PERP: ORG. CONFIDENCE REPORTED: "THE FMLN" PHYS TGT: ID PHYS TGT: TYPEPHYS TGT: NUMBERPHYS TGT: FOREIGN NATIONPHYS TGT: EFFECT OF INCIDENTPHYS TGT: TOTAL NUMBERHUM TGT: NAMEHUM TGT: DESCRIPTION "1 CIVILIAN"HUM TGT: TYPE CIVILIAN: "1 CIVILIAN"HUM TGT: NUMBER 1: "1 CIVILIAN"HUM TGT: FOREIGN NATIONHUM TGT: EFFECT OF INCIDENT DEATH: "1 CIVILIAN"HUM TGT: TOTAL NUMBER

On October 30, 1989, one civilian was killed in a reported FMLN attack in El Salvador.

Page 23: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 23

Reification Again

• Reification Advantages:– No need to specify fixed number of

arguments to represent a given sentence– You can easily specify inference rules

involving the arguments

“I ate a turkey sandwich for lunch” w: Isa(w,Eating) Eater(w,Speaker)

Eaten(w,TurkeySandwich) MealEaten(w,Lunch)

Page 24: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 24

Representing Time• Events are associated with points or

intervals in time.• We can impose an ordering on distinct

events using the notion of precedes.

• Temporal logic notation: (w,x,t) Arrive(w,x,t)

• Constraints on variable tI arrived in New York(t) Arrive(I,NewYork,t) precedes(t,Now)

Page 25: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 25

Interval Events• Need tstart and tend

“She was driving to New York until now”

tstart,tend ,e, i ISA(e,Drive) Driver(e, She)

Dest(e, NewYork) IntervalOf(e,i)Endpoint(i, tend) Startpoint(i, tend)

Precedes(tstart,Now) Equals(tend,Now)

Page 26: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 26

Relation Between Tenses and Time• Relation between simple verb

tenses and points in time is not straightforward

• Present tense used like future:– We fly from Baltimore to Boston at 10

• Complex tenses:– Flight 1902 arrived late– Flight 1902 had arrived late

Representing them in the same way seems wrong….

Page 27: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 27

Reference Point• Reichenbach (1947) introduced notion

of Reference point (R), separated out from Utterance time (U) and Event time (E)

• Example:– When Mary's flight departed, I ate lunch– When Mary's flight departed, I had eaten

lunch

• Departure event specifies reference point.

Page 28: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 28

Today 15/10

• Semantics / Meaning /Meaning Representations

• Linguistically relevant Concepts in FOPC / FOL

• Semantic Analysis

Page 29: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 29

Semantic Analysis

Syntax-drivenSemantic Analysis

Sentence

Literal Meanin

g

Discourse

Structure

Meanings of

words

Meanings of grammatical structures

Context

Common-SenseDomain

knowledge

Intended meaning

FurtherAnalysis

INFERENCE

Page 30: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 30

Compositional Analysis

• Principle of Compositionality– The meaning of a whole is derived from

the meanings of the parts

• What parts?– The constituents of the syntactic parse

of the input

Page 31: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 31

Compositional Analysis: Example

• AyCaramba serves meat

),()^,()^( MeateServedAyCarambaeServereServinge

Page 32: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 32

Augmented Rules

• Augment each syntactic CFG rule with a semantic formation rule

• The class of actions performed by f will be quite restricted.

)}.,....({... 11 semsemfA nn • Abstractly

• i.e., The semantics of A can be computed from some function applied to the semantics of its parts.

Page 33: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 33

Simple Extension of FOL: Lambda Forms

– Lambda-reduction: variables are bound by treating the lambda form as a function with formal arguments

)(),( yCountryyxyInx

)())((

SallyPSallyxxP

)(),())((),(

yCountryyBCyInBCyCountryyxyInx

)(),( yCountryyBCyIn

)(),())((),(

CANADACountryCANADABCInCANADAyCountryyBCyIn

)(xxP– A FOL sentence with variables in it that are to be bound.

Page 34: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 34

Augmented Rules: Example

– PropNoun -> AyCaramba– MassNoun -> meat

• Attachments{AyCaramba}{MEAT}

assigning FOL constants

copying from daughters up to mothers.

– NP -> PropNoun– NP -> MassNoun

• Attachments{PropNoun.sem}{MassNoun.sem

}

• Simple non-terminals

• Concrete entities

Page 35: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 35

Augmented Rules: Example

• Verb -> serves

• {VP.sem(NP.sem)}• {Verb.sem(NP.sem)

),(^),(^)(

xeServedyeServereServingeyx

Semantics attached to one daughter is applied to semantics of the other

daughter(s).• S -> NP VP• VP -> Verb NP

lambda-form

Page 36: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 36

Example

• S -> NP VP• VP -> Verb NP• Verb -> serves• NP -> PropNoun• NP -> MassNoun• PropNoun -> AyCaramba• MassNoun -> meat

• {VP.sem(NP.sem)}• {Verb.sem(NP.sem)

• {PropNoun.sem}• {MassNoun.sem}• {AC}• {MEAT}

),()^,()^( xeServedyeServereServingeyx

MEAT

MEAT

…….

y y

AC

AC

Page 37: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 37

Next Time

• Read Chp. 19 (Lexical Semantics)

Page 38: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 38

Non-Compositionality

• Unfortunately, there are lots of examples where the meaning of a constituent can’t be derived from the meanings of the parts

- metaphor, (e.g., corporation as person)– metonymy, (??)– idioms, – irony, – sarcasm, – indirect requests, etc

Page 39: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 39

English Idioms

• “buy the farm” • “bite the bullet” • “bury the hatchet” • etc…

• Lots of these… constructions where the meaning of the whole is either – Totally unrelated to the meanings of the

parts (“kick the bucket”)– Related in some opaque way (“run the

show”)

Page 40: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 40

The Tip of the Iceberg– “Enron is the tip of the iceberg.”NP -> “the tip of the iceberg” {….}

– “the tip of an old iceberg”– “the tip of a 1000-page iceberg”– “the merest tip of the iceberg”

NP -> TipNP of IcebergNP {…}TipNP: NP with tip as its head IcebergNP NP with iceberg as its

head

Page 41: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 41

Handling Idioms

– Mixing lexical items and grammatical constituents

– Introduction of idiom-specific constituents

– Permit semantic attachments that introduce predicates unrelated with constituents

NP -> TipNP of IcebergNP {small-part(), beginning()….}

TipNP: NP with tip as its head IcebergNP NP with iceberg as its head

Page 42: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 42

Attachments for a fragment of English (Sect. 18.5)

old edition• Sentences• Noun-phrases• Verb-phrases• Prepositional-phrases

Based on “The core Language Engine” 1992

Page 43: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 43

Full story more complex• To deal properly with quantifiers

– Permit lambda-variables to range over predicates. E.g.,

)(. xPxP

)yMenuyHad(e,

)xRestaurantxeHaver

eeHaving

)(

)(,(

)(

– Introduce complex terms to remain agnostic about final scoping

Page 44: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 44

• Similarly to PP attachment, number of possible interpretations exponential in the number of complex terms

Solution: Quantifier Scope Ambiguity

• likelihood of different orderings• Mirror surface ordering• Domain specific knowledge

• Weak methods to prefer one interpretation over another:

Page 45: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 45

Integration with a Parser• Assume you’re using a dynamic-

programming style parser (Earley or CKY).

• Two basic approaches– Integrate semantic analysis into the

parser (assign meaning representations as constituents are completed)– Pipeline… assign meaning representations to complete trees only after they’re completed

Page 46: CPSC 503 Computational Linguistics

04/21/23 CPSC503 Winter 2009 46

Pros and Cons• Integration

– use semantic constraints to cut off parses that make no sense

– assign meaning representations to constituents that don’t take part in any correct parse

• Pipeline– assign meaning representations only to

constituents that take part in a correct parse

– parser needs to generate all correct parses


Recommended