+ All Categories
Home > Technology > Edinburgh MT lecture 7: phrase-based MT

Edinburgh MT lecture 7: phrase-based MT

Date post: 25-Jul-2015
Category:
Upload: alopezfoo
View: 301 times
Download: 10 times
Share this document with a friend
Popular Tags:
112
Phrase-Based Translation
Transcript
Page 1: Edinburgh MT lecture 7: phrase-based MT

Phrase-Based Translation

Page 2: Edinburgh MT lecture 7: phrase-based MT

The IBM Models

Page 3: Edinburgh MT lecture 7: phrase-based MT

The IBM Models

•Fertility probabilities.

Page 4: Edinburgh MT lecture 7: phrase-based MT

The IBM Models

•Fertility probabilities.

•Word translation probabilities.

Page 5: Edinburgh MT lecture 7: phrase-based MT

The IBM Models

•Fertility probabilities.

•Word translation probabilities.

•Distortion probabilities.

Page 6: Edinburgh MT lecture 7: phrase-based MT

The IBM Models

•Fertility probabilities.

•Word translation probabilities.

•Distortion probabilities.

•Some problems:

•Weak reordering model -- output is not fluent.

•Many decisions -- many things can go wrong.

Page 7: Edinburgh MT lecture 7: phrase-based MT

The IBM Models

•Fertility probabilities.

•Word translation probabilities.

•Distortion probabilities.

•Some problems:

•Weak reordering model -- output is not fluent.

•Many decisions -- many things can go wrong.

Page 8: Edinburgh MT lecture 7: phrase-based MT

The IBM Models

•Fertility probabilities.

•Word translation probabilities.

•Distortion probabilities.

•Some problems:

•Weak reordering model -- output is not fluent.

•Many decisions -- many things can go wrong.

Page 9: Edinburgh MT lecture 7: phrase-based MT

Although north wind howls , but sky still very clear .虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

虽然

� �北 风 呼啸 , 天空 天空 依然 清澈 。

north wind strong , the sky remained clear . under theHowever

IBM Model 4

Page 10: Edinburgh MT lecture 7: phrase-based MT

Tradeoffs: Modeling v. Learning

IBM Model 1 ✔ ✘ ✘ ✔ ✔

HMM ✔ ✔ ✘ ✘ ✔

IBM Model 4 ✔ ✔ ✔ ✘ ✘

Lexical

Tran

slatio

n

Local orderi

ng depen

dency

Fertilit

y

Convex

Tracta

ble Exa

ct

Inferen

ce

Page 11: Edinburgh MT lecture 7: phrase-based MT

Tradeoffs: Modeling v. Learning

IBM Model 1 ✔ ✘ ✘ ✔ ✔

HMM ✔ ✔ ✘ ✘ ✔

IBM Model 4 ✔ ✔ ✔ ✘ ✘

Lexical

Tran

slatio

n

Local orderi

ng depen

dency

Fertilit

y

Convex

Tracta

ble Exa

ct

Inferen

ce

Lesson:Trade exactnessfor expressivity

Page 12: Edinburgh MT lecture 7: phrase-based MT

Although north wind howls , but sky still very clear .虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

虽然

� �北 风 呼啸 , 天空 天空 依然 清澈 。

north wind strong , the sky remained clear . under theHowever

IBM Model 4

What are some things this model doesn’t account for?

Page 13: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However , the sky remained clear under the strong north wind .

虽然 北 风 呼啸 , 天空 天空 依然 清澈 。

north wind strong , the sky remained clear . under theHowever

What are some things this model doesn’t account for?

Page 14: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

Phrase-based Models

Page 15: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

Phrase-based Models

Page 16: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However

Phrase-based Models

Page 17: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

Phrase-based Models

Page 18: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

However

Phrase-based Models

Page 19: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

However ,

Phrase-based Models

Page 20: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

However , the sky remained clear under the strong north wind .

Phrase-based Models

Page 21: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

However , the sky remained clear under the strong north wind .

p(English, alignment|Chinese) =p(segmentation) · p(translations) · p(reorderings)

Phrase-based Models

Page 22: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

However , the sky remained clear under the strong north wind .

p(English, alignment|Chinese) =p(segmentation) · p(translations) · p(reorderings)

Phrase-based Models

Page 23: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

However , the sky remained clear under the strong north wind .

p(English, alignment|Chinese) =p(segmentation) · p(translations) · p(reorderings)

Phrase-based Models

distortion = 6

Page 24: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

However , the sky remained clear under the strong north wind .

p(English, alignment|Chinese) =p(segmentation) · p(translations) · p(reorderings)

Phrase-based Models

distortion = 6

Page 25: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

However , the sky remained clear under the strong north wind .

p(English, alignment|Chinese) =p(segmentation) · p(translations) · p(reorderings)

Phrase-based Models

distortion = 6

Page 26: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However the strong north wind , the sky remained clear under .

However , the sky remained clear under the strong north wind .

p(English, alignment|Chinese) =p(segmentation) · p(translations) · p(reorderings)

Phrase-based Models

distortion = 6

Page 27: Edinburgh MT lecture 7: phrase-based MT

Phrase-based Models

Page 28: Edinburgh MT lecture 7: phrase-based MT

Phrase-based Models

•Segmentation probabilities.

Page 29: Edinburgh MT lecture 7: phrase-based MT

Phrase-based Models

•Segmentation probabilities.

•Phrase translation probabilities.

Page 30: Edinburgh MT lecture 7: phrase-based MT

Phrase-based Models

•Segmentation probabilities.

•Phrase translation probabilities.

•Distortion probabilities.

Page 31: Edinburgh MT lecture 7: phrase-based MT

Phrase-based Models

•Segmentation probabilities.

•Phrase translation probabilities.

•Distortion probabilities.

•Some problems:

•Weak reordering model -- output is not fluent.

•Many decisions -- many things can go wrong.

Page 32: Edinburgh MT lecture 7: phrase-based MT

Phrase-based Models

•Segmentation probabilities.

•Phrase translation probabilities.

•Distortion probabilities.

•Some problems:

•Weak reordering model -- output is not fluent.

•Many decisions -- many things can go wrong.

Page 33: Edinburgh MT lecture 7: phrase-based MT

Phrase-based Models

•Segmentation probabilities.

•Phrase translation probabilities.

•Distortion probabilities.

•Some problems:

•Weak reordering model -- output is not fluent.

•Many decisions -- many things can go wrong.

Page 34: Edinburgh MT lecture 7: phrase-based MT

Phrase-based Models

•Segmentation probabilities: fixed (uniform)

•Phrase translation probabilities.

•Distortion probabilities: fixed (decaying)

Page 35: Edinburgh MT lecture 7: phrase-based MT

Learning p(Chinese|English)

•Reminder: (nearly) every problem comes down to computing either:

•Sums: MLE or EM (learning)

•Maximum: most probable (decoding)

Page 36: Edinburgh MT lecture 7: phrase-based MT

Recap: Expectation Maximization

•Arbitrarily select a set of parameters (say, uniform).

•Calculate expected counts of the unseen events.

•Choose new parameters to maximize likelihood, using expected counts as proxy for observed counts.

•Iterate.

•Guaranteed that likelihood is monotonically nondecreasing.

Page 37: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

p(

p(

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

p( )

) +

) +

Marginalize: sum all alignments containing the link

Page 38: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

p(

p(

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

p(

) +

) +

)

Divide by sum of all possible alignments

Page 39: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

p(

p(

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

However , the sky remained clear under the strong north wind .

p(

) +

) +

)

Divide by sum of all possible alignments

We have to sum over exponentially many alignments!

Page 40: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

probability of an alignment.

p(F,A|E) = p(I|J)Y

ai

p(ai = j)p(fi|ej)

Page 41: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

probability of an alignment.

observed uniform

p(F,A|E) = p(I|J)Y

ai

p(ai = j)p(fi|ej)

Page 42: Edinburgh MT lecture 7: phrase-based MT

factors across words.

EM for Model 1

probability of an alignment.

observed uniform

p(F,A|E) = p(I|J)Y

ai

p(ai = j)p(fi|ej)

Page 43: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

p(ai = j|F,E) =p(ai = j, F |E)

p(F,E)=

Page 44: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

北北

.�

a⇥A: �north

p(north| ) · p(rest of a)

p(ai = j|F,E) =p(ai = j, F |E)

p(F,E)=

Page 45: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

北北

.�

a⇥A: �north

p(north| ) · p(rest of a)

marginal probability of alignments containing link

p(ai = j|F,E) =p(ai = j, F |E)

p(F,E)=

Page 46: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

p(north| ).�

a⇥A: �north

p(rest of a)北北

marginal probability of alignments containing link

Page 47: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

p(north| ).�

a⇥A: �north

p(rest of a)北北

marginal probability of alignments containing link

c⇥Chinese words

p(north|c).�

a⇥A: �north

p(rest of a)

marginal probability of all alignments

Page 48: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

p(north| ).�

a⇥A: �north

p(rest of a)北北

marginal probability of alignments containing link

c⇥Chinese words

p(north|c).�

a⇥A: �north

p(rest of a)

marginal probability of all alignments

c

Page 49: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

p(north| ).�

a⇥A: �north

p(rest of a)北北

marginal probability of alignments containing link

c⇥Chinese words

p(north|c).�

a⇥A: �north

p(rest of a)

marginal probability of all alignments

c

identical!

Page 50: Edinburgh MT lecture 7: phrase-based MT

EM for Model 1

北p(north| ).�

c�Chinese words p(north|c)

Page 51: Edinburgh MT lecture 7: phrase-based MT

EM for Phrase-Based

•Model parameters: p(E phrase|F phrase)

•All we need to do is compute expectations:

p(ai = j|F,E) =p(ai,i0 = hj, j0i, F |E)

p(F,E)

Page 52: Edinburgh MT lecture 7: phrase-based MT

EM for Phrase-Based

•Model parameters: p(E phrase|F phrase)

•All we need to do is compute expectations:

p(F,E) sums over all possible phrase alignments

p(ai = j|F,E) =p(ai,i0 = hj, j0i, F |E)

p(F,E)

Page 53: Edinburgh MT lecture 7: phrase-based MT

EM for Phrase-Based

•Model parameters: p(E phrase|F phrase)

•All we need to do is compute expectations:

p(F,E) sums over all possible phrase alignments...which are one-to-one by definition.

p(ai = j|F,E) =p(ai,i0 = hj, j0i, F |E)

p(F,E)

Page 54: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However

EM for Phrase-Based

p(ai = j|F,E) =p(ai,i0 = hj, j0i, F |E)

p(F,E)

, the sky remained clear under the strong north wind .

Page 55: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However

EM for Phrase-Based

p(ai = j|F,E) =p(ai,i0 = hj, j0i, F |E)

p(F,E)

Can we compute this quantity?

, the sky remained clear under the strong north wind .

Page 56: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。Although north wind howls , but sky still very clear .

However

EM for Phrase-Based

p(ai = j|F,E) =p(ai,i0 = hj, j0i, F |E)

p(F,E)

Can we compute this quantity?

, the sky remained clear under the strong north wind .

How many 1-to-1 alignments are there ofthe remaing 8 Chinese and 8 English words?

Page 57: Edinburgh MT lecture 7: phrase-based MT

Recap: Expectation Maximization

•Arbitrarily select a set of parameters (say, uniform).

•Calculate expected counts of the unseen events.

•Choose new parameters to maximize likelihood, using expected counts as proxy for observed counts.

•Iterate.

•Guaranteed that likelihood is monotonically nondecreasing.

Page 58: Edinburgh MT lecture 7: phrase-based MT

Recap: Expectation Maximization

•Arbitrarily select a set of parameters (say, uniform).

•Calculate expected counts of the unseen events.

•Choose new parameters to maximize likelihood, using expected counts as proxy for observed counts.

•Iterate.

•Guaranteed that likelihood is monotonically nondecreasing.

Computing expectations from a phrase-based model, given a sentence pair, is #P-Complete(by reduction to counting perfect matchings;

DeNero & Klein, 2008)

Page 59: Edinburgh MT lecture 7: phrase-based MT

argmaxa p(a|f,e) is also hard

Page 60: Edinburgh MT lecture 7: phrase-based MT

argmaxa p(a|f,e) is also hard

Page 61: Edinburgh MT lecture 7: phrase-based MT

argmaxa p(a|f,e) is also hard

Page 62: Edinburgh MT lecture 7: phrase-based MT

Now What?

•Option #1: approximate expectations

•Restrict computation to some tractable subset of the alignment space (arbitrarily biased).

•Markov chain Monte Carlo (slow).

Page 63: Edinburgh MT lecture 7: phrase-based MT

Now What?•Change the problem definition

•We already know how to learn word-to-word translation models efficiently.

•Idea: learn word-to-word alignments, extract most probable alignment, then treat it as observed.

•Learn phrase translations consistent with word alignments.

•Decouples alignment from model learning -- is this a good thing?

Page 64: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

Page 65: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

akemasu / open

Page 66: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

watashi wa / I

Page 67: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

watashi / I

Page 68: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

watashi / I ✘

Page 69: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

hako wo / box

Page 70: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

hako wo / the box

Page 71: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

hako wo / open the box

Page 72: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

hako wo / open the box✘

Page 73: Edinburgh MT lecture 7: phrase-based MT

Phrase Extraction

I open the box

watashi

wa

hako

wo

akemasu

hako wo akemasu / open the box

Page 74: Edinburgh MT lecture 7: phrase-based MT

Phrasal Translation Estimation

Page 75: Edinburgh MT lecture 7: phrase-based MT

Phrasal Translation Estimation

•Option #1 (EM over restricted space)

•Align with a word-based model.

•Compute expectations only over alignments consistent with the alignment grid.

Page 76: Edinburgh MT lecture 7: phrase-based MT

Phrasal Translation Estimation

•Option #1 (EM over restricted space)

•Align with a word-based model.

•Compute expectations only over alignments consistent with the alignment grid.

•Option #2 (Non-global estimation)

•View phrase pairs as observed, irrespective of context or overlap.

Page 77: Edinburgh MT lecture 7: phrase-based MT

Decoding

We want to solve this problem:

e⇤ = arg max

ep(e|f)

Page 78: Edinburgh MT lecture 7: phrase-based MT

Decoding

We want to solve this problem:

e⇤ = arg max

ep(e|f)

Q: how many English sentences are there?

Page 79: Edinburgh MT lecture 7: phrase-based MT

北 风 呼啸 。

Page 80: Edinburgh MT lecture 7: phrase-based MT

北 风 呼啸 。

segmentationssubstitutionspermutations

Page 81: Edinburgh MT lecture 7: phrase-based MT

北 风 呼啸 。

O(2n)segmentationssubstitutionspermutations

Page 82: Edinburgh MT lecture 7: phrase-based MT

北 风 呼啸 。

O(2n)O(5n)

segmentationssubstitutionspermutations

Page 83: Edinburgh MT lecture 7: phrase-based MT

北 风 呼啸 。

O(2n)O(5n)O(n!)

segmentationssubstitutionspermutations

Page 84: Edinburgh MT lecture 7: phrase-based MT

Key Idea

Page 85: Edinburgh MT lecture 7: phrase-based MT

Key Idea

Page 86: Edinburgh MT lecture 7: phrase-based MT

Key Idea

Page 87: Edinburgh MT lecture 7: phrase-based MT

Key Idea

Page 88: Edinburgh MT lecture 7: phrase-based MT

Key Idea

Page 89: Edinburgh MT lecture 7: phrase-based MT

Key Idea

Dynamic Programming

Page 90: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

Page 91: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

Page 92: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

START Although

crystal clear

START However

Page 93: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

START Although

crystal clear

START However

Page 94: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

wind shrieked

wind screamed

north wind

Page 95: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

wind shrieked

wind screamed

north wind

Page 96: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

the sky

shrieked ,

, yet

Page 97: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

the sky

shrieked ,

, yet

Page 98: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

sky ,

Page 99: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

sky ,

Page 100: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。clear .

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。still quite

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。blue .

Page 101: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。clear .

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。still quite

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。blue .

Page 102: Edinburgh MT lecture 7: phrase-based MT

虽然 北 风 呼啸 , 但 天空 依然 十分 清澈 。

Although the northern wind shrieked across the sky, but was still very clear.

Page 103: Edinburgh MT lecture 7: phrase-based MT

Approximation: Pruning

Page 104: Edinburgh MT lecture 7: phrase-based MT

Approximation: Pruning

Idea: prune states by accumulated path length

Page 105: Edinburgh MT lecture 7: phrase-based MT

Approximation: Pruning

Page 106: Edinburgh MT lecture 7: phrase-based MT

Approximation: Pruning

Solution: Group states by number of covered words.

Page 107: Edinburgh MT lecture 7: phrase-based MT
Page 108: Edinburgh MT lecture 7: phrase-based MT

•Some (not all) key ingredients in Google Translate:

Page 109: Edinburgh MT lecture 7: phrase-based MT

•Some (not all) key ingredients in Google Translate:

•Phrase-based translation models

Page 110: Edinburgh MT lecture 7: phrase-based MT

•Some (not all) key ingredients in Google Translate:

•Phrase-based translation models

•... Learned heuristically from word alignments

Page 111: Edinburgh MT lecture 7: phrase-based MT

•Some (not all) key ingredients in Google Translate:

•Phrase-based translation models

•... Learned heuristically from word alignments

•... Coupled with a huge language model

Page 112: Edinburgh MT lecture 7: phrase-based MT

•Some (not all) key ingredients in Google Translate:

•Phrase-based translation models

•... Learned heuristically from word alignments

•... Coupled with a huge language model

•... And decoding w/ severe pruning heuristics


Recommended