Date post: | 21-Dec-2015 |
Category: |
Documents |
View: | 217 times |
Download: | 2 times |
Outline
• NLG basics– What is NLG?– Example NLG systems– Applications of NLG– Architecture for NLG
• Three subtasks of NLG– Document planning– Microplanning– Surface realization
What is NLG?
• Definition (McDonald 1992): the process of deliberately constructing a NL text in order to meet specified communicative goals.
• Input: non-linguistic representation of info
• Output: text, hypertext, speech
NLG system #1: FoG
• FoG: Forecast Generator
• Input: weather map
• Output: textual weather report in English and French
• Developer: CoGen Tex
• Status: in operational use since 1992
NLG system #2: SumTime-Mousam
• FoG: Forecast Generator
• Input: weather data
• Output: textual weather report in English
• Developer: University of Aberdeen
• Status: Used by one company to generate weather forecasts for offshore oil rigs.
• Demo
NLG System #3: STOP
• Input: Questionnaire about smoking attitudes, history, beliefs
• Output: a personalized smoking-cessation leaflet
• Developer: University of Aberdeen
• Demo: non-tailored letter, questionnaire, a tailored letter.
• Status: undergoing clinical evaluation
Applications of NLG
• Automated document production:– Weather forecasts, letters, ….
• Dialogue systems: e.g., ELIZA
• Text summarization
• Machine translation
• Question Answering
• ….
ElIZA system• H: My head hurts• E: Why do you say your head hurts?
• H: My mother hates me.• E: Who else in your family hates you?
• H: *******.• E: Tell me more about it.
• H: What should I do?• E: That’s a tough decision.
• H: ****?• E: Does that question interest you?
ELIZA system (cont)
• The famous 1966 computer system by Joseph Weizenbaum, which acted as a therapist
• Named after “Eliza Doolittle” in in Shaw's Pygmalion (“My fair lady”).
• Use pattern matching rules
• Very successful: prompt him to write a book to explain the limits of computers
Challenges of NLG
• Decide what to say and how to say it.
• The output has to be well-formed text.– Orthography, morphology, syntax– Reference, word choice, pragmatics
• Example: weather report
Component tasks in NLG
• Content determination: what content to express• Document structuring: how to structure the info
to make a coherent text.• Aggregation: combine units in a document plan
tree.• Lexicalization: what words to use?• Referring expression generation: NP or
pronoun?• Structure realization: add markups • Linguistic realization: tense, aspect, voice, …
Tasks and modules in NLG
Document
Planning
Content determination
Document structuring
Microplanning Aggregation
Lexicalization
Referring expression generation
Surface realisation
Linguistic realization
Structure Realization
A pipelined architecture
Document Planning
Microplanning
Surface realization
Document plan
Text Specification
Text
Communicative goalKnowledge baseGrammar ….
Another architecture
Communicative Goal Knowledge base
Discourse Planner
Surface Realizer
NL output
Discourse specification
Outline
• NLG basics– What is NLG?– Example NLG systems– Applications of NLG– Architecture for NLG
• Three subtasks of NLG– Document planning– Microplanning– Surface realization
Document planning
• Content determination: – What is important?– a domain-dependent expert-system-like task.– Ex: Weather summary:
• This Nov was very dry.• The temperature was lower.
• Document structuring: – Use RST or other discourse theory
Document plan tree
Msg1
NucleusSatellite(contrast)
Msg2
NucleusSatellite(Elaboration)
NucleusNucleus(sequence)
Msg3
…
Aggregation
• Combinations can be on the basis of – Information content– Possible forms of realization
• Some possibilities:– Conjunction– Ellipsis– Embedding– …
Aggregation via conjunction
• Without aggregation:– Light rain fell on the 6th.– Light rain fell on the 8th.
• With Aggregation:– Light rain fell on the 6th and light rain fell on
the 8th. (conjunction)– Light rain fell on the 6th and the 8th.
Aggregation via embedding
• Without aggregation: – November had a rainfall of 20mm.– November normally is the wettest month.
• With aggregation:– November, which normally is the wettest
month, had a rainfall of 20mm this year.– Although November is the wettest month, this
November had a rainfall of only 20mm.
Aggregation strategies
• Conform to genre conventions and rules, and take account of pragmatic goals:– Ex: making sentences shorter for poor
readers
• Observe structural properties: – Ex: aggregate only messages that are siblings
in the document plan tree.
Lexicalization
• The process of choosing words to communicate the info in messages
• When several lexicalizations are possible, consider:– User knowledge and preference– Consistency with previous usage: sometimes,
it is better to vary lexemes– Pragmatics
Examples
• Light rain• A small amount of rain
• It is encouraging that you have many reasons to stop.
• It’s good that you have a lot of reasons to stop.
Referring expression generation
• How do we identify specific objects and entities?
• Two cases:– Initial introduction of an object– Subsequence references to an already
mentioned object.
Initial reference
A few options:
• Use a full name: John Smith
• Relate to an entity that is already salient– One of Dr. Klein’s patients– The person who came to the clinic yesterday
Subsequence reference
Some possibilities:
• Pronouns: He is very determined.
• Definite NPs: This person is very determined.
• Proper names: John is very talented.
Choosing referring expressions
• Some suggestions from the literature:– Use a pronoun if it refers to an entity
mentioned in the previous clause, and there is no other entity in the previous clause that this pronoun could refer to.
– Otherwise, use a name (a short one if possible)
– Otherwise, use a definite NP.
Choosing referring expressions (cont)
• Considering genre conventions and the context
• Ex: – Nov 2005– November– This month
Surface realization
• Goal: to convert text specification into actual text.
• Structural realization: e.g., add html markup
• Linguistics realization:– Insert function words– Choose correct inflection– Order words within a sentence– Add punctuation
Linguistics realization
• Systemic Grammar • Functional unification grammar
• Ex: – Input: (:action rain :tense past :time November :degree little)– Output: it rained little in November
Summary
• NLG basics– What is NLG?– Example NLG systems– Applications of NLG– Architecture for NLG
• Three subtasks of NLG– Document planning– Microplanning– Surface realization
Beyond text generation
• Flat text
• Structured text: itemized lists, section, chapter, ….
• Text and graphics: e.g., picture with caption
• Speech
• ….
Resources
• SIGGEN (ACL special interest group for Generation): www.siggen.org
• Book: “Building NLG systems” by Reiter et. al., Cambridge University Press, 2000.
• List of NLG systems:
• Companies:– CoGenTex: www.cogentex.com– ERLI: www.erli.com