+ All Categories
Home > Documents > Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of...

Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of...

Date post: 15-Jan-2016
Category:
View: 215 times
Download: 1 times
Share this document with a friend
Popular Tags:
20
Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009
Transcript
Page 1: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Progress Report

Reihaneh Rabbany

Presented for NLP GroupComputing Science Department

University of AlbertaApril 2009

Page 2: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Agenda

• Project Proposal for Guiding Agent by Speech• Many to Many Alignment by Bayesian

Networks– Letter to Phoneme Alignment– Evaluation of phylogenetic trees

2

Page 3: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Quick RL overview• An agent interacting with environment– perceives state – performs actions – receive rewards

• Agent– Computes the value of each action in each state

• long term reward obtainable from this state by performing this action

– Performs action selection by choosing the best action or sometimes a random action• exploration-exploitation

3

10],,|...[),( 22

1 asrrrEasQ tttt

),(maxarg' asQa a

Page 4: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Project Proposal for Guiding Agent by Speech

• Accelerate learning using speech– The emotion in speech signal has considerable

amount of side information – Happiness or anger of a speech signal can provide

a shaping reinforcement signal

• Developing tools and methods to extract emotion from speech and designing a methodology to use it as a shaping signal

4

Page 5: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Approaches to use speech signal as a guide for learning

• Extracting prosodic features from speech• Associating meaning to these features– Supervised learning-based approach• data-set of (prosodic features, emotion) pairs

– excited, happy, upset, sad, bored• Assigns a reward to the recognized emotion

– Pure RL approach • inspired by the learning process of the parent-infant

– The infant gradually learns to associate value to perceived speech and how to use it to guide her exploration of the world

5

Page 6: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

RL Approach

• Two ways for developing this idea– Augmenting the observation space to include the

prosodic features• Emotion will become state-dependent

– Learns a separate instructor module • Estimates the value of prosodic features• Instructions (learnt instructor values) would affect the

agent's action selection

6

Page 7: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Instructions

• Different ways that these instructions (learnt instructor values) could affect the agent's action selection– Balancing the exploration-exploitation

• When the speaker is not happy with what the agent is doing and it should explore other actions

– Use it directly in action selection by some weights• Motivates the agent to keep its previous action if the instructor is

satisfied with its current action

– Use it as a shaping reward to define a new reward function by adding it to the actual reward received from the environment

7

),,()1(),(maxarg' 1 ta aaeIasQa

)(eIrr

Page 8: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Agenda

• Project Proposal for Guiding Agent by Speech• Many to Many Alignment by Bayesian

Networks– Letter to Phoneme Alignment– Evaluation of phylogenetic trees

8

Page 9: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Many to Many Alignment by Bayesian Networks

• Finding Alignment between two sequences – Assuming the order is preserved

• I’ve applied it into two applications– Letter to phoneme alignment • Aligning for a given dictionary

– Evaluating Phylogenetic trees• Shows how compatible the tree is with the given

taxonomy

9

Page 10: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Agenda

• Project Proposal for Guiding Agent by Speech• Many to Many Alignment by Bayesian

Networks– Letter to Phoneme Alignment– Phylogenetic trees evaluation

10

Page 11: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Model• Word: – sequence of letters

• Pronunciation: – sequence of phonemes

• Alignment: – sequence of subalignments

• Problem: Finding the most probable alignment

• Assumption: sub alignments are independent

mpppP ...21

iii PLa ,

nlllL ...21

),|(maxarg PLAPA Abest

2|||,| ii PL

11

kaaaA ...21

k

ii PLaPPLAP

1

),|(),|(

Page 12: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Many-to-Many EM

1. Initialize prob(SubAlignmnets)// Expectation Step2. For each word in training_set

2.1. Produce all possible alignments 2.2. Choose the most probable alignment// Maximization Step3. For all subalignments

3.1. Compute new_p(SubAlignmnets)

][

],[)(

i

iii lM

plMaP

12

Page 13: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Dynamic Bayesian Network

• Model

• Subaligments : hidden variables• Learn DBN by EM

li Pi

ai

k

iiii PLaPAP

1

),|()(

],[

][)(

ii

ii lpM

aMaP

13

),|(),|( iiii PLaPPLaP

Page 14: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Context Dependent DBN

• Context independency assumption– Makes the model simpler– It is not always a correct assumption– Example: P(<h,h>) in Chat and Hat

• Modelli Pi

aiai-1

k

iiiii PLaaPAP

11 ),,|()(

],,[

][)(

1 iii

ii lpaM

aMaP

14

),,|(),|( 1 iiiii PLaaPPLaP

Page 15: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Agenda

• Project Proposal for Guiding Agent by Speech• Many to Many Alignment by Bayesian

Networks– Letter to Phoneme Alignment– Evaluation of phylogenetic trees

15

Page 16: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Evaluation of Phylogenetic Trees• Phylogenetic Trees– Show the evolution of species

• Taxonomy– Caninae; True dogs; Canis; Coyote – …– Caninae; True foxes; Vulpes; Kit Fox– Caninae; True foxes; Vulpes; Fennec Fox– …– Caninae; Basal Caninae; Otocyon ; Bat-eared Fox – ...

16

Page 17: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Tree Evaluation

• Labeling the inner nodes in the tree• For each species – A path in the tree • sequence of inner node labels

– A taxonomy description• taxonomy sequence

– There should be a many to many alignment between these two sequences

17

Page 18: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Tree Evaluation (Cont.)

• Finding alignment between these sequences for all the species– Finding the most probable alignments

• Measuring the mean probability of these alignment – How probable is this tree given this taxonomy

18

Page 19: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

• Taxonomy and Trees

• Aligned result

19

Page 20: Progress Report Reihaneh Rabbany Presented for NLP Group Computing Science Department University of Alberta April 2009.

Discussion

20


Recommended