+ All Categories
Home > Documents > Dialogue Management

Dialogue Management

Date post: 23-Feb-2016
Category:
Upload: bly
View: 37 times
Download: 0 times
Share this document with a friend
Description:
Dialogue Management. Ling575 Discourse and Dialogue May 18, 2011. Dialog Management Types. Finite-State Dialog Management Frame-based Dialog Management Initiative VoiceXML Design and evaluation Information State Management Dialogue Acts Recognition & generation - PowerPoint PPT Presentation
Popular Tags:
123
Dialogue Management Ling575 Discourse and Dialogue May 18, 2011
Transcript
Page 1: Dialogue Management

Dialogue Management

Ling575Discourse and Dialogue

May 18, 2011

Page 2: Dialogue Management

Dialog Management TypesFinite-State Dialog ManagementFrame-based Dialog Management

InitiativeVoiceXMLDesign and evaluation

Information State ManagementDialogue Acts

Recognition & generation

Statistical Dialogue Managemant (POMDPs)

Page 3: Dialogue Management

Finite-State Management

Page 4: Dialogue Management

Pros and ConsAdvantages

Straightforward to encodeClear mapping of interaction to modelWell-suited to simple information accessSystem initiative

DisadvantagesLimited flexibility of interaction

Constrained input – single itemFully system controlledRestrictive dialogue structure, order

Ill-suited to complex problem-solving

Page 5: Dialogue Management

Frame-based DialogueManagement

Finite-state too limited, stilted, irritatingMore flexible dialogue

Page 6: Dialogue Management

Frame-based Dialogue Management

Essentially form-fillingUser can include any/all of the pieces of

formSystem must determine which entered,

remain

Page 7: Dialogue Management

Frame-based Dialogue Management

Essentially form-fillingUser can include any/all of the pieces of

formSystem must determine which entered,

remain

Page 8: Dialogue Management

Frame-based Dialogue Management

Essentially form-fillingUser can include any/all of the pieces of formSystem must determine which entered, remain

System may have multiple framesE.g. flights vs restrictions vs car vs hotelRules determine next action, question, information

presentation

Page 9: Dialogue Management

Frames and InitiativeMixed initiative systems:

A) User/System can shift control arbitrarily, any timeDifficult to achieve

Page 10: Dialogue Management

Frames and InitiativeMixed initiative systems:

A) User/System can shift control arbitrarily, any timeDifficult to achieve

B) Mix of control based on prompt typePrompts:

Page 11: Dialogue Management

Frames and InitiativeMixed initiative systems:

A) User/System can shift control arbitrarily, any timeDifficult to achieve

B) Mix of control based on prompt typePrompts:

Open prompt:

Page 12: Dialogue Management

Frames and InitiativeMixed initiative systems:

A) User/System can shift control arbitrarily, any timeDifficult to achieve

B) Mix of control based on prompt typePrompts:

Open prompt: ‘How may I help you?’Open-ended, user can respond in any way

Directive prompt:

Page 13: Dialogue Management

Frames and InitiativeMixed initiative systems:

A) User/System can shift control arbitrarily, any timeDifficult to achieve

B) Mix of control based on prompt typePrompts:

Open prompt: ‘How may I help you?’Open-ended, user can respond in any way

Directive prompt: ‘Say yes to accept call, or no o.w.’Stipulates user response type, form

Page 14: Dialogue Management

Initiative, Prompts, Grammar

Prompt type tied to active grammarSystem must recognize suitable input

Restrictive vs open-ended

Page 15: Dialogue Management

Initiative, Prompts, Grammar

Prompt type tied to active grammarSystem must recognize suitable input

Restrictive vs open-ended

Shift from restrictive to openTune to user: Novice vs Expert

Page 16: Dialogue Management

Initiative, Prompts, Grammar

Prompt type tied to active grammarSystem must recognize suitable input

Restrictive vs open-ended

Shift from restrictive to openTune to user: Novice vs Expert

Page 17: Dialogue Management

Dialogue Management:Confirmation

Miscommunication common in SDS“Error spirals” of sequential errors

Highly problematicRecognition, recovery crucial

Confirmation strategies can detect, mitigateExplicit confirmation:

Page 18: Dialogue Management

Dialogue Management:Confirmation

Miscommunication common in SDS“Error spirals” of sequential errors

Highly problematicRecognition, recovery crucial

Confirmation strategies can detect, mitigateExplicit confirmation:

Ask for verification of each input Implicit confirmation:

Page 19: Dialogue Management

Dialogue Management:Confirmation

Miscommunication common in SDS“Error spirals” of sequential errors

Highly problematicRecognition, recovery crucial

Confirmation strategies can detect, mitigateExplicit confirmation:

Ask for verification of each input Implicit confirmation:

Include input information in subsequent prompt

Page 20: Dialogue Management

Confirmation StrategiesExplicit:

Page 21: Dialogue Management

Confirmation Strategy Implicit:

Page 22: Dialogue Management

Pros and ConsGrounding of user input

Weakest grounding I.e. continued att’n, next relevant contibution

Page 23: Dialogue Management

Pros and ConsGrounding of user input

Weakest grounding insufficient I.e. continued att’n, next relevant contibution

Explicit:

Page 24: Dialogue Management

Pros and ConsGrounding of user input

Weakest grounding insufficient I.e. continued att’n, next relevant contibution

Explicit: highest: repetition Implicit:

Page 25: Dialogue Management

Pros and ConsGrounding of user input

Weakest grounding insufficient I.e. continued att’n, next relevant contibution

Explicit: highest: repetition Implicit: demonstration, display

Explicit;

Page 26: Dialogue Management

Pros and ConsGrounding of user input

Weakest grounding insufficient I.e. continued att’n, next relevant contibution

Explicit: highest: repetition Implicit: demonstration, display

Explicit;Pro: easier to correct; Con: verbose, awkward, non-

humanImplicit:

Page 27: Dialogue Management

Pros and ConsGrounding of user input

Weakest grounding insufficient I.e. continued att’n, next relevant contibution

Explicit: highest: repetition Implicit: demonstration, display

Explicit;Pro: easier to correct; Con: verbose, awkward, non-

humanImplicit:

Pro: more natural, efficient; Con: less easy to correct

Page 28: Dialogue Management

RejectionSystem recognition confidence is too lowSystem needs to reprompt

Often repeatedly

Page 29: Dialogue Management

RejectionSystem recognition confidence is too lowSystem needs to reprompt

Often repeatedlyOut-of-vocabulary, out-of-grammar inputs

Strategies: Progressive prompting

Page 30: Dialogue Management

RejectionSystem recognition confidence is too lowSystem needs to reprompt

Often repeatedlyOut-of-vocabulary, out-of-grammar inputs

Strategies: Progressive prompting Initially: ‘rapid reprompting’: ‘What?’, ‘Sorry?’

Page 31: Dialogue Management

RejectionSystem recognition confidence is too lowSystem needs to reprompt

Often repeatedlyOut-of-vocabulary, out-of-grammar inputs

Strategies: Progressive prompting Initially: ‘rapid reprompting’: ‘What?’, ‘Sorry?’Later: increasing detail

Page 32: Dialogue Management

Progressive prompting

Page 33: Dialogue Management

VoiceXMLW3C standard for simple frame-based dialogues

Fairly common in commercial settingsConstruct forms, menus

Forms get field dataUsing attached promptsWith specified grammar (CFG)With simple semantic attachments

Page 34: Dialogue Management

Simple VoiceXML Example

Page 35: Dialogue Management

Frame-based Systems:Pros and Cons

Advantages

Page 36: Dialogue Management

Frame-based Systems:Pros and Cons

AdvantagesRelatively flexible input – multiple inputs, ordersWell-suited to complex information access (air)Supports different types of initiative

Disadvantages

Page 37: Dialogue Management

Frame-based Systems:Pros and Cons

AdvantagesRelatively flexible input – multiple inputs, ordersWell-suited to complex information access (air)Supports different types of initiative

Disadvantages Ill-suited to more complex problem-solving

Form-filling applications

Page 38: Dialogue Management

Dialogue Manager Tradeoffs

Flexibility vs Simplicity/PredictabilitySystem vs User vs Mixed Initiative

Order of dialogue interaction

Conversational “naturalness” vs Accuracy

Cost of model construction, generalization, learning, etc

Page 39: Dialogue Management

Dialog Systems DesignUser-centered design approach:

Study user and task:Interview users; record human-human interactions;

systems

Page 40: Dialogue Management

Dialog Systems DesignUser-centered design approach:

Study user and task:Interview users; record human-human interactions;

systems

Build simulations and prototypes:Wizard-of-Oz systems (WOZ): Human replaces system

Can assess issues in partial system; simulate errors, etc

Page 41: Dialogue Management

Dialog Systems DesignUser-centered design approach:

Study user and task:Interview users; record human-human interactions; systems

Build simulations and prototypes:Wizard-of-Oz systems (WOZ): Human replaces system

Can assess issues in partial system; simulate errors, etc

Iteratively test on users: Redesign prompts (email subdialog)Identify need for barge-in

Page 42: Dialogue Management

SDS EvaluationGoal: Determine overall user satisfaction

Highlight systems problems; help tuneClassically: Conduct user surveys

Page 43: Dialogue Management

SDS EvaluationUser evaluation issues:

Page 44: Dialogue Management

SDS EvaluationUser evaluation issues:

Expensive; often unrealistic; hard to get real user to doCreate model correlated with human satisfactionCriteria:

Page 45: Dialogue Management

SDS EvaluationUser evaluation issues:

Expensive; often unrealistic; hard to get real user to doCreate model correlated with human satisfactionCriteria:

Maximize task successMeasure task completion: % subgoals; Kappa of frame

values

Page 46: Dialogue Management

SDS EvaluationUser evaluation issues:

Expensive; often unrealistic; hard to get real user to doCreate model correlated with human satisfactionCriteria:

Maximize task successMeasure task completion: % subgoals; Kappa of frame

valuesMinimize task costs

Efficiency costs: time elapsed; # turns; # error correction turns

Quality costs: # rejections; # barge-in; concept error rate

Page 47: Dialogue Management

PARADISE Model

Page 48: Dialogue Management

PARADISE ModelCompute user satisfaction with questionnaires

Page 49: Dialogue Management

PARADISE ModelCompute user satisfaction with questionnairesExtract task success and costs measures from

corresponding dialogsAutomatically or manually

Page 50: Dialogue Management

PARADISE ModelCompute user satisfaction with questionnairesExtract task success and costs measures from

corresponding dialogsAutomatically or manually

Perform multiple regression:Assign weights to all factors of contribution to UsatTask success, Concept accuracy key

Page 51: Dialogue Management

PARADISE ModelCompute user satisfaction with questionnairesExtract task success and costs measures from

corresponding dialogsAutomatically or manually

Perform multiple regression:Assign weights to all factors of contribution to UsatTask success, Concept accuracy key

Allows prediction of accuracy on new dialog w/Q&A

Page 52: Dialogue Management

Information State Dialogue Management

Problem: Not every task is equivalent to form-filling

Real tasks require:

Page 53: Dialogue Management

Information State Dialogue Management

Problem: Not every task is equivalent to form-filling

Real tasks require:Proposing ideas, refinement, rejection, grounding,

clarification, elaboration, etc

Page 54: Dialogue Management

Information State Dialogue Management

Problem: Not every task is equivalent to form-fillingReal tasks require:

Proposing ideas, refinement, rejection, grounding, clarification, elaboration, etc

Information state models include: Information state Dialogue act interpreterDialogue act generatorUpdate rulesControl structure

Page 55: Dialogue Management

Information State SystemsInformation state :

Discourse context, grounding state, intentions, plans.

Page 56: Dialogue Management

Information State SystemsInformation state :

Discourse context, grounding state, intentions, plans.

Dialogue acts:Extension of speech acts, to include grounding

actsRequest-inform; Confirmation

Page 57: Dialogue Management

Information State SystemsInformation state :

Discourse context, grounding state, intentions, plans.

Dialogue acts:Extension of speech acts, to include grounding

actsRequest-inform; Confirmation

Update rulesModify information state based on DAs

Page 58: Dialogue Management

Information State SystemsInformation state :

Discourse context, grounding state, intentions, plans.

Dialogue acts:Extension of speech acts, to include grounding acts

Request-inform; Confirmation

Update rulesModify information state based on DAs

When a question is asked

Page 59: Dialogue Management

Information State SystemsInformation state :

Discourse context, grounding state, intentions, plans.Dialogue acts:

Extension of speech acts, to include grounding actsRequest-inform; Confirmation

Update rulesModify information state based on DAs

When a question is asked, answer itWhen an assertion is made,

Page 60: Dialogue Management

Information State SystemsInformation state :

Discourse context, grounding state, intentions, plans.Dialogue acts:

Extension of speech acts, to include grounding actsRequest-inform; Confirmation

Update rulesModify information state based on DAs

When a question is asked, answer itWhen an assertion is made,

Add information to context, grounding state

Page 61: Dialogue Management

Information State Architecture

Simple ideas, complex execution

Page 62: Dialogue Management

Dialogue ActsExtension of speech acts

Adds structure related to conversational phenomenaGrounding, adjacency pairs, etc

Page 63: Dialogue Management

Dialogue ActsExtension of speech acts

Adds structure related to conversational phenomenaGrounding, adjacency pairs, etc

Many proposed tagsetsVerbmobil: acts specific to meeting sched domain

Page 64: Dialogue Management

Dialogue ActsExtension of speech acts

Adds structure related to conversational phenomenaGrounding, adjacency pairs, etc

Many proposed tagsetsVerbmobil: acts specific to meeting sched domainDAMSL: Dialogue Act Markup in Several Layers

Forward looking functions: speech actsBackward looking function: grounding, answering

Page 65: Dialogue Management

Dialogue ActsExtension of speech acts

Adds structure related to conversational phenomenaGrounding, adjacency pairs, etc

Many proposed tagsetsVerbmobil: acts specific to meeting sched domainDAMSL: Dialogue Act Markup in Several Layers

Forward looking functions: speech actsBackward looking function: grounding, answering

Conversation acts:Add turn-taking and argumentation relations

Page 66: Dialogue Management

Verbmobil DA18 high level tags

Page 67: Dialogue Management

Dialogue Act Interpretation

Automatically tag utterances in dialogueSome simple cases:

Will breakfast be served on USAir 1557?

Page 68: Dialogue Management

Dialogue Act Interpretation

Automatically tag utterances in dialogueSome simple cases:

YES-NO-Q: Will breakfast be served on USAir 1557?

I don’t care about lunch.

Page 69: Dialogue Management

Dialogue Act Interpretation

Automatically tag utterances in dialogueSome simple cases:

YES-NO-Q: Will breakfast be served on USAir 1557?

Statement: I don’t care about lunch.Show be flights from L.A. to Orlando

Page 70: Dialogue Management

Dialogue Act Interpretation

Automatically tag utterances in dialogueSome simple cases:

YES-NO-Q: Will breakfast be served on USAir 1557?

Statement: I don’t care about lunch.Command: Show be flights from L.A. to Orlando

Is it always that easy?Can you give me the flights from Atlanta to

Boston?

Page 71: Dialogue Management

Dialogue Act Interpretation

Automatically tag utterances in dialogueSome simple cases:

YES-NO-Q: Will breakfast be served on USAir 1557?Statement: I don’t care about lunch.Command: Show be flights from L.A. to Orlando

Is it always that easy?Can you give me the flights from Atlanta to Boston?

Syntactic form: question; Act: request/commandYeah.

Page 72: Dialogue Management

Dialogue Act Interpretation

Automatically tag utterances in dialogueSome simple cases:

YES-NO-Q: Will breakfast be served on USAir 1557?Statement: I don’t care about lunch.Command: Show be flights from L.A. to Orlando

Is it always that easy?Can you give me the flights from Atlanta to Boston?Yeah.

Depends on context: Y/N answer; agreement; back-channel

Page 73: Dialogue Management

Dialogue Act AmbiguityIndirect speech acts

Page 74: Dialogue Management

Dialogue Act AmbiguityIndirect speech acts

Page 75: Dialogue Management

Dialogue Act AmbiguityIndirect speech acts

Page 76: Dialogue Management

Dialogue Act AmbiguityIndirect speech acts

Page 77: Dialogue Management

Dialogue Act AmbiguityIndirect speech acts

Page 78: Dialogue Management

Dialogue Act RecognitionHow can we classify dialogue acts?Sources of information:

Page 79: Dialogue Management

Dialogue Act RecognitionHow can we classify dialogue acts?Sources of information:

Word information: Please, would you: request; are you: yes-no question

Page 80: Dialogue Management

Dialogue Act RecognitionHow can we classify dialogue acts?Sources of information:

Word information: Please, would you: request; are you: yes-no questionN-gram grammars

Prosody:

Page 81: Dialogue Management

Dialogue Act RecognitionHow can we classify dialogue acts?Sources of information:

Word information: Please, would you: request; are you: yes-no questionN-gram grammars

Prosody:Final rising pitch: question; final lowering: statementReduced intensity: Yeah: agreement vs backchannel

Page 82: Dialogue Management

Dialogue Act RecognitionHow can we classify dialogue acts?Sources of information:

Word information: Please, would you: request; are you: yes-no questionN-gram grammars

Prosody:Final rising pitch: question; final lowering: statementReduced intensity: Yeah: agreement vs backchannel

Adjacency pairs:

Page 83: Dialogue Management

Dialogue Act RecognitionHow can we classify dialogue acts?Sources of information:

Word information: Please, would you: request; are you: yes-no questionN-gram grammars

Prosody:Final rising pitch: question; final lowering: statementReduced intensity: Yeah: agreement vs backchannel

Adjacency pairs:Y/N question, agreement vs Y/N question, backchannelDA bi-grams

Page 84: Dialogue Management

Task & CorpusGoal:

Identify dialogue acts in conversational speech

Page 85: Dialogue Management

Task & CorpusGoal:

Identify dialogue acts in conversational speechSpoken corpus: Switchboard

Telephone conversations between strangersNot task oriented; topics suggested1000s of conversations

recorded, transcribed, segmented

Page 86: Dialogue Management

Dialogue Act TagsetCover general conversational dialogue acts

No particular task/domain constraints

Page 87: Dialogue Management

Dialogue Act TagsetCover general conversational dialogue acts

No particular task/domain constraintsOriginal set: ~50 tags

Augmented with flags for task, conv mgmt220 tags in labeling: some rare

Page 88: Dialogue Management

Dialogue Act TagsetCover general conversational dialogue acts

No particular task/domain constraintsOriginal set: ~50 tags

Augmented with flags for task, conv mgmt220 tags in labeling: some rare

Final set: 42 tags, mutually exclusiveSWBD-DAMSLAgreement: K=0.80 (high)

Page 89: Dialogue Management

Dialogue Act TagsetCover general conversational dialogue acts

No particular task/domain constraintsOriginal set: ~50 tags

Augmented with flags for task, conv mgmt220 tags in labeling: some rare

Final set: 42 tags, mutually exclusiveSWBD-DAMSLAgreement: K=0.80 (high)

1,155 conv labeled: split into train/test

Page 90: Dialogue Management

Common TagsStatement & Opinion: declarative +/- opQuestion: Yes/No&Declarative: form, forceBackchannel: Continuers like uh-huh, yeahTurn Exit/Adandon: break off, +/- passAnswer : Yes/No, follow questionsAgreement: Accept/Reject/Maybe

Page 91: Dialogue Management

Probabilistic Dialogue Models

HMM dialogue models

Page 92: Dialogue Management

Probabilistic Dialogue Models

HMM dialogue modelsStates = Dialogue acts; Observations: Utterances

Assume decomposable by utteranceEvidence from true words, ASR words, prosody

Page 93: Dialogue Management

Probabilistic Dialogue Models

HMM dialogue modelsStates = Dialogue acts; Observations: Utterances

Assume decomposable by utteranceEvidence from true words, ASR words, prosody

Page 94: Dialogue Management

Probabilistic Dialogue Models

HMM dialogue modelsStates = Dialogue acts; Observations: Utterances

Assume decomposable by utteranceEvidence from true words, ASR words, prosody

Page 95: Dialogue Management

Probabilistic Dialogue Models

HMM dialogue modelsStates = Dialogue acts; Observations: Utterances

Assume decomposable by utteranceEvidence from true words, ASR words, prosody

Page 96: Dialogue Management

DA Classification - ProsodyFeatures:

Duration, pause, pitch, energy, rate, genderPitch accent, tone

Results:Decision trees: 5 common classes

45.4% - baseline=16.6%

Page 97: Dialogue Management

Prosodic Decision Tree

Page 98: Dialogue Management

DA Classification -WordsWords

Combines notion of discourse markers and collocations: e.g. uh-huh=Backchannel

Contrast: true words, ASR 1-best, ASR n-bestResults:

Best: 71%- true words, 65% ASR 1-best

Page 99: Dialogue Management

DA Classification - AllCombine word and prosodic information

Consider case with ASR words and acoustics

Page 100: Dialogue Management

DA Classification - AllCombine word and prosodic information

Consider case with ASR words and acousticsProsody classified by decision trees

Incorporate decision tree posteriors in model for P(f|d)

Page 101: Dialogue Management

DA Classification - AllCombine word and prosodic information

Consider case with ASR words and acousticsProsody classified by decision trees

Incorporate decision tree posteriors in model for P(f|d)

Slightly better than raw ASR

Page 102: Dialogue Management

Integrated ClassificationFocused analysis

Prosodically disambiguated classesStatement/Question-Y/N and Agreement/BackchannelProsodic decision trees for agreement vs backchannel

Disambiguated by duration and loudness

Page 103: Dialogue Management

Integrated ClassificationFocused analysis

Prosodically disambiguated classesStatement/Question-Y/N and Agreement/BackchannelProsodic decision trees for agreement vs backchannel

Disambiguated by duration and loudness

Substantial improvement for prosody+wordsTrue words: S/Q: 85.9%-> 87.6; A/B: 81.0%->84.7

Page 104: Dialogue Management

Integrated ClassificationFocused analysis

Prosodically disambiguated classesStatement/Question-Y/N and Agreement/BackchannelProsodic decision trees for agreement vs backchannel

Disambiguated by duration and loudness

Substantial improvement for prosody+wordsTrue words: S/Q: 85.9%-> 87.6; A/B: 81.0%->84.7ASR words: S/Q: 75.4%->79.8; A/B: 78.2%->81.7

More useful when recognition is iffy

Page 105: Dialogue Management

Many VariantsMaptask: (13 classes)

Serafin & DiEugenio 2004Latent Semantic analysis on utterance vectorsText onlyGame information; No improvement for DA history

Page 106: Dialogue Management

Many VariantsMaptask: (13 classes)

Serafin & DiEugenio 2004Latent Semantic analysis on utterance vectorsText onlyGame information; No improvement for DA history

Surendran & Levow 2006SVMs on term n-grams, prosodyPosteriors incorporated in HMMs

Prosody, sequence modeling improves

Page 107: Dialogue Management

Many VariantsMaptask: (13 classes)

Serafin & DiEugenio 2004Latent Semantic analysis on utterance vectorsText onlyGame information; No improvement for DA history

Surendran & Levow 2006SVMs on term n-grams, prosodyPosteriors incorporated in HMMs

Prosody, sequence modeling improves

MRDA: Meeting tagging: 5 broad classes

Page 108: Dialogue Management

ObservationsDA classification can work on open domain

Exploits word model, DA context, prosodyBest results for prosody+wordsWords are quite effective alone – even ASR

Questions:

Page 109: Dialogue Management

ObservationsDA classification can work on open domain

Exploits word model, DA context, prosodyBest results for prosody+wordsWords are quite effective alone – even ASR

Questions: Whole utterance models? – more fine-grainedLonger structure, long term features

Page 110: Dialogue Management

Detecting Correction ActsMiscommunication is common in SDS

Utterances after errors misrecognized >2x as oftenFrequently repetition or paraphrase of original input

Page 111: Dialogue Management

Detecting Correction ActsMiscommunication is common in SDS

Utterances after errors misrecognized >2x as oftenFrequently repetition or paraphrase of original input

Systems need to detect, correct

Page 112: Dialogue Management

Detecting Correction ActsMiscommunication is common in SDS

Utterances after errors misrecognized >2x as oftenFrequently repetition or paraphrase of original input

Systems need to detect, correctCorrections are spoken differently:

Hyperarticulated (slower, clearer) -> lower ASR conf.

Page 113: Dialogue Management

Detecting Correction ActsMiscommunication is common in SDS

Utterances after errors misrecognized >2x as oftenFrequently repetition or paraphrase of original input

Systems need to detect, correctCorrections are spoken differently:

Hyperarticulated (slower, clearer) -> lower ASR conf.

Some word cues: ‘No’,’ I meant’, swearing..

Page 114: Dialogue Management

Detecting Correction ActsMiscommunication is common in SDS

Utterances after errors misrecognized >2x as oftenFrequently repetition or paraphrase of original input

Systems need to detect, correctCorrections are spoken differently:

Hyperarticulated (slower, clearer) -> lower ASR conf.

Some word cues: ‘No’,’ I meant’, swearing..Can train classifiers to recognize with good acc.

Page 115: Dialogue Management

Generating Dialogue ActsGeneration neglected relative to generation

Page 116: Dialogue Management

Generating Dialogue ActsGeneration neglected relative to generationStent (2002) model: Conversation acts, Belief

modelDevelops update rules for content planning, e.g.

If user releases turn, system can do ‘TAKE-TURN’ actIf system needs to summarize, use ASSERT act

Page 117: Dialogue Management

Generating Dialogue ActsGeneration neglected relative to generationStent (2002) model: Conversation acts, Belief

modelDevelops update rules for content planning, i.e.

If user releases turn, system can do ‘TAKE-TURN’ actIf system needs to summarize, use ASSERT act

Identifies turn-taking as key aspect of dialogue gen.

Page 118: Dialogue Management

Generating ConfirmationSimple systems use fixed confirmation strategy

Implicit or explicit

Page 119: Dialogue Management

Generating ConfirmationSimple systems use fixed confirmation strategy

Implicit or explicitMore complex systems can select dynamically

Use information state and features to decide

Page 120: Dialogue Management

Generating ConfirmationSimple systems use fixed confirmation strategy

Implicit or explicitMore complex systems can select dynamically

Use information state and features to decideLikelihood of error:

Low ASR confidence score If very low, can reject

Page 121: Dialogue Management

Generating ConfirmationSimple systems use fixed confirmation strategy

Implicit or explicitMore complex systems can select dynamically

Use information state and features to decideLikelihood of error:

Low ASR confidence score If very low, can reject

Sentence/prosodic features: longer, initial pause, pitch range

Page 122: Dialogue Management

Generating ConfirmationSimple systems use fixed confirmation strategy

Implicit or explicitMore complex systems can select dynamically

Use information state and features to decideLikelihood of error:

Low ASR confidence score If very low, can reject

Sentence/prosodic features: longer, initial pause, pitch range

Cost of error:

Page 123: Dialogue Management

Generating ConfirmationSimple systems use fixed confirmation strategy

Implicit or explicitMore complex systems can select dynamically

Use information state and features to decideLikelihood of error:

Low ASR confidence score If very low, can reject

Sentence/prosodic features: longer, initial pause, pitch rangeCost of error:

Book a flight vs looking up information

Markov Decision Process models more detailed


Recommended