Deep Grammar Error Detection and Automated Lexical Acquisition



Deep Grammar Error Detection and Automated Lexical Acquisition

Steps towards Wide-Coverage Open-Text Processing

Yi Zhang <yzhang@coli.uni-sb.de>

Department of Computational Linguistics, Saarland University

IGK Colloquium, 17th Nov. 2005


Outline

1 Background and Motivation
   Deep Processing: State-of-the-Art
   Coverage of Deep Processing

2 Grammar Error Detection
   Previous Work on Grammar Error Detection
   Error Mining

3 Automated Lexical Acquisition
   Previous Work on Lexical Acquisition
   Statistical Lexical Type Predictor



What is deep processing?

Deep processing means to maximally exploit grammatical knowledge for language processing.
Focus on linguistic precision and semantic modelling
Grammar-centric approach
The opposite of deep is not statistical but shallow.


Why do we need deep processing?

Explicit model of grammaticality
Ability to capture subtle linguistic interactions
Semantics


Problems with deep processing

Efficiency
Detailed language modelling creates a large search space.
Alleviated by efficient parsing algorithms and better hardware

Specificity
Linguistically sound vs. application-interesting results
Ranking of the results is necessary.

Robustness/Coverage
Strict grammaticality metric
Insufficient coverage of the grammar
Dynamic nature of language



Robustness and specificity

Robustness and specificity are a pair of dual problems.

Grammar Engineering
Overgeneration → specificity
Undergeneration → robustness

Application
Ranked output
High coverage over noisy inputs

For deep grammars, a balance point should be achieved to maximize linguistic accuracy.
Robustness and specificity should come with extra mechanisms.




Coverage problem of deep processing

Road-testing the ERG over the BNC [Baldwin et al., 2004]
Tested on 20,000 strings from the BNC
Full lexical span for only 32%
Among these:

57% are parsed (overall coverage 57% × 32% ≈ 18%)
83% of the parses are correct
40% of parsing failures are caused by missing lexical entries
39% of parsing failures are caused by missing constructions


The focus of this talk

Deep grammar error detection
Lexical coverage is a major problem for deep processing.
Automated deep lexical acquisition



Symbolic approach

Inductive Logic Programming
Background ∧ Hypothesis ⊨ Evidence

ILP-based grammar extension [Cussens and Pulman, 2000]
After a failed parse, abduction is used to find needed edges, which, if they existed, would allow a complete parse of the sentence. Linguistic constraints are applied to restrict the generation of implausible edges.

Problems
The generated rules are too general or too specific.




Error Mining

[van Noord, 2004]
Large hand-crafted grammars are error-prone.
Manual detection of errors is time-consuming.
Validations based on small test suites are not reliable.
Parsing failures are a good indication of (under-generating) errors.


Parsability

Definition

R(w_i ... w_j) = C(w_i ... w_j | OK) / C(w_i ... w_j)

where C(w_i ... w_j) counts all occurrences of the word sequence in the corpus, and C(w_i ... w_j | OK) counts its occurrences in successfully parsed sentences.

If the parsability of a particular word sequence is (much) lower, it indicates that something is wrong.
Parsabilities can be calculated efficiently for large corpora with suffix arrays and perfect hashing.
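A minimal sketch of this computation in Python (plain dictionary counting rather than the suffix arrays and perfect hashing mentioned above; all names are illustrative):

```python
from collections import Counter

def parsability_scores(sentences, parsed_ok, max_n=3, min_count=20):
    """Compute R(w_i..w_j) = C(ngram | OK) / C(ngram) for all n-grams.

    sentences: list of tokenized sentences (lists of words)
    parsed_ok: parallel list of booleans, True if the sentence parsed
    """
    total, ok = Counter(), Counter()
    for sent, success in zip(sentences, parsed_ok):
        for n in range(1, max_n + 1):
            for i in range(len(sent) - n + 1):
                gram = tuple(sent[i:i + n])
                total[gram] += 1
                if success:
                    ok[gram] += 1
    # Ignore rare n-grams; low parsability there is not informative.
    return {g: ok[g] / c for g, c in total.items() if c >= min_count}

# Usage: flag suspicious n-grams, e.g. R < 0.1 as in the experiment below.
# suspects = {g: r for g, r in parsability_scores(sents, flags).items() if r < 0.1}
```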


Error mining experiment of ERG with BNC

1.8M sentences (21.2M words) with only ASCII characters and no more than 20 words each
Running best-only parsing with PET took less than 2 days on elf

Status               Num. of Sentences   Percentage
Parsed                         301,503       16.74%
No lexical span              1,260,404       69.97%
No parse found                 239,272       13.28%
Edge limit reached                  96        0.01%



Error analysis

            Number   Percentage
uni-gram     2,336       10.52%
bi-gram     15,183       68.36%
tri-gram     4,349       19.58%

Table: N-grams with R < 0.1

[Pie chart: share of flagged unigrams, bigrams, trigrams, and other n-grams]

N-gram          Count
weed               59
the poor           49
a fight           113
in connection      85
as always          84
peered at          28
the World Cup      57

Table: Examples


Pin down the errors

1.8M sentences → 541K sentences with full lexical span → 22K low-parsability N-grams → bi-/tri-grams → lexical errors / construction errors


Detecting lexical error

Missing lexical span
Low-parsability unigrams
Language-dependent heuristics, e.g. low-parsability bigrams starting with a determiner, like “the poor” or “a fight” (see the sketch below)
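A sketch of how such heuristics might be applied to the mined parsability scores (the determiner list and the score format are assumptions for illustration, not the talk's actual implementation):

```python
# Hypothetical determiner list; extend per language as needed.
DETERMINERS = {"the", "a", "an", "this", "these", "that", "those"}

def lexical_error_candidates(scores, threshold=0.1):
    """Collect words likely missing from the lexicon.

    scores: {ngram tuple: parsability R}, e.g. from the parsability sketch above.
    """
    candidates = set()
    for gram, r in scores.items():
        if r >= threshold:
            continue
        if len(gram) == 1:                        # low-parsability unigram
            candidates.add(gram[0])
        elif len(gram) == 2 and gram[0] in DETERMINERS:
            candidates.add(gram[1])               # "the poor" -> suspect "poor"
    return candidates
```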



Unification-based approach

[Erbach, 1990, Barg and Walther, 1998, Fouvry, 2003]
Use underspecified lexical entries to parse the whole sentence
Generate lexical entries afterwards by collecting information from the full parse
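A toy illustration of the idea, using plain Python dicts in place of typed feature structures (the representation and all names are simplifications):

```python
def unify(fs1, fs2):
    """Recursively unify two feature structures (dicts); None on clash."""
    if fs1 is None or fs2 is None:
        return None
    if not isinstance(fs1, dict) or not isinstance(fs2, dict):
        return fs1 if fs1 == fs2 else None        # atomic values must match
    result = dict(fs1)
    for feat, val in fs2.items():
        if feat in result:
            sub = unify(result[feat], val)
            if sub is None:
                return None                       # feature clash: unification fails
            result[feat] = sub
        else:
            result[feat] = val
    return result

# Underspecified entry for the unknown word "kangaroo": only STEM is known.
unknown = {"STEM": "kangaroo"}
# Constraints imposed on it by a full parse of "the kangaroo jumps":
from_parse = {"HEAD": {"CAT": "noun", "PERSON": "3rd", "NUM": "sg"}}
print(unify(unknown, from_parse))
# -> {'STEM': 'kangaroo', 'HEAD': {'CAT': 'noun', 'PERSON': '3rd', 'NUM': 'sg'}}
```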


An example of how the unification-based approach works

the kangaroo jumps

Lexical entries (the entry for the unknown word "kangaroo" is underspecified):

  [ STEM <"THE">, HEAD det ]

  [ STEM <"KANGAROO"> ]

  [ STEM <"JUMPS">,
    HEAD verb [ PERSON [2] 3rd, NUM [3] sg ],
    SUBJ < [ HEAD noun [ PERSON [2], NUM [3] ] ] > ]

Phrase rules:

  [ HEAD [5], SPR <>,
    HEAD-DTR [ HEAD [5], SPR <[1]> ],
    NON-HEAD-DTR [1] ]

  [ SUBJ <>,
    HEAD-DTR [ SUBJ <[4]> ],
    NON-HEAD-DTR [4] ]

Unifying the underspecified entry with the constraints contributed by the rules and by "jumps" instantiates "kangaroo" as a 3rd-person singular noun.


Problems with unification-based approaches

Generated lexical entries might be:
too general: overgeneration
too specific: undergeneration

Computational complexity increases significantly with underspecified lexical entries, especially when two unknowns occur next to each other.



Statistical approach

[Baldwin, 2005]
Based on a set of lexical types
Treat lexical acquisition as a classification task
Generalize the acquisition model over various secondary language resources:

POS tagger
Chunker
Treebank
Dependency parser
Lexical ontology


Importing lexicon from a semantic lexical ontology

Assumption
There is a strong correlation between the semantic and syntactic similarity of words. [Levin, 1993]

Fact
Over 90% of the synsets in WordNet (2.0) share at least one lexical type among all included words.



Importing lexicon from WordNet

[Baldwin, 2005]
Construct semantic neighbours (all synonyms, direct hyponyms, direct hypernyms)
Take a majority vote across the lexical types of the semantic neighbours

Improvement
Voting is weighted and must exceed a threshold.
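A rough sketch of the unweighted neighbour-voting step using NLTK's WordNet interface; the mapping from known words to their lexical types is an assumed input, not part of WordNet:

```python
from collections import Counter
from nltk.corpus import wordnet as wn

def predict_lexical_type(word, lexicon, pos=wn.NOUN):
    """Majority vote over the lexical types of a word's semantic neighbours.

    lexicon: dict mapping known words to a list of lexical types (assumed).
    """
    neighbours = set()
    for synset in wn.synsets(word, pos=pos):
        # Semantic neighbours: synonyms, direct hyponyms, direct hypernyms.
        for s in [synset] + synset.hyponyms() + synset.hypernyms():
            neighbours.update(lemma.lower() for lemma in s.lemma_names())
    votes = Counter()
    for n in neighbours:
        for lextype in lexicon.get(n, []):
            votes[lextype] += 1
    return votes.most_common(1)[0][0] if votes else None
```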



Importing lexicon from WordNet

Results

[Bar chart: F-scores by category (Noun, Verb, Adj, Adv, Overall) for importing the lexicon from WordNet; scores fall roughly between 0.25 and 0.75]

The sparse ERG lexicon (as compared to WordNet) makes the voting less reliable.


Maximum entropy model based lexical type predictor

p(t | c) = exp(Σ_i θ_i f_i(t, c)) / Σ_{t′ ∈ T} exp(Σ_i θ_i f_i(t′, c))

A statistical classifier that predicts a lexical type for each occurrence of an unknown word or missing lexical entry
Input: features from the context
Output: atomic lexical types
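The formula above can be computed directly; a small NumPy sketch (the feature-matrix layout is an assumption for illustration):

```python
import numpy as np

def maxent_probs(theta, features):
    """p(t | c) = exp(sum_i theta_i f_i(t, c)) / Z(c).

    theta:    (num_features,) weight vector
    features: (num_types, num_features) matrix; row t holds the feature
              values f_i(t, c) for one fixed context c.
    """
    scores = features @ theta        # sum_i theta_i f_i(t, c), one score per type
    scores -= scores.max()           # subtract the max for numerical stability
    exp_scores = np.exp(scores)
    return exp_scores / exp_scores.sum()   # normalize over all t' in T
```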


Atomic lexical types

Lexical information is encoded in atomic lexical types.
Attribute-value structures can be decomposed into atomic lexical types:

  t [ F a | b ]   decomposes into the atomic types   t_a  and  t_b
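A sketch of such a decomposition, flattening a nested attribute-value structure (represented here as plain dicts) into atomic path=value types:

```python
def atomic_types(avm, path=()):
    """Flatten a nested attribute-value structure into atomic path=value types."""
    atoms = []
    for feat, val in avm.items():
        if isinstance(val, dict):
            atoms.extend(atomic_types(val, path + (feat,)))   # recurse into sub-AVM
        else:
            atoms.append(".".join(path + (feat,)) + "=" + str(val))
    return atoms

# Example: a verb type with person/number constraints
print(atomic_types({"HEAD": {"CAT": "verb", "PERSON": "3rd", "NUM": "sg"}}))
# -> ['HEAD.CAT=verb', 'HEAD.PERSON=3rd', 'HEAD.NUM=sg']
```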


Baseline models

Select the majority lexical type for each POS

POS    Majority Lexical Type
noun   n_intr_le
verb   v_np_trans_le
adj.   adj_intrans_le
adv.   adv_int_vp_le

General-purpose POS taggers trained with lexical types: TnT, MXPOST
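A sketch of the majority-type baseline; the training-data format of (word, POS, lexical type) triples is an assumption:

```python
from collections import Counter, defaultdict

def train_majority_baseline(triples):
    """Map each POS to its most frequent lexical type in the training data."""
    counts = defaultdict(Counter)
    for _word, pos, lextype in triples:
        counts[pos][lextype] += 1
    return {pos: c.most_common(1)[0][0] for pos, c in counts.items()}

# baseline = train_majority_baseline(train_triples)
# baseline["noun"]  ->  'n_intr_le'
```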


Basic features

Prefix/suffix of the word
Context words and their lexical types

Model      Precision
Baseline       30.7%
TnT            40.4%
MXPOST         40.2%
ME basic       50.0%
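A sketch of what such a feature extractor might look like (window size, affix lengths, feature naming, and the use of only left-context types are illustrative choices, not the evaluated configuration):

```python
def basic_features(tokens, i, types, max_affix=3, window=2):
    """Features for token i: affixes plus surrounding words and their types.

    tokens: list of words; types: lexical types assigned so far (left context).
    """
    word = tokens[i].lower()
    feats = []
    for n in range(1, max_affix + 1):           # prefix/suffix of the word
        feats.append(f"prefix{n}={word[:n]}")
        feats.append(f"suffix{n}={word[-n:]}")
    for d in range(-window, window + 1):        # context words and their types
        if d == 0 or not 0 <= i + d < len(tokens):
            continue
        feats.append(f"word{d:+d}={tokens[i + d].lower()}")
        if d < 0:                               # types only known to the left
            feats.append(f"type{d:+d}={types[i + d]}")
    return feats
```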


Partial parsing results

[Figure: chart of partial-parse edges a, b, c, d over string positions 0–4]

Model      Precision
Baseline       30.7%
TnT            40.4%
MXPOST         40.2%
ME basic       50.0%
ME +pp         50.5%


Turning to the disambiguation model

Generate top-n candidate entries for the unknown word
Parse the sentence with the candidate entries
Use the disambiguation model to select the best parse
Pick the corresponding entry
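A sketch of this loop, with the predictor, parser, and parse ranker as assumed callables:

```python
def acquire_entry(sentence, unknown_word, predictor, parser, ranker, n=5):
    """Pick the lexical entry whose parse the disambiguation model prefers.

    predictor(sentence, word) -> ranked candidate lexical entries (assumed)
    parser(sentence, entry)   -> parses of the sentence using the candidate
    ranker(parse)             -> disambiguation-model score, higher is better
    """
    best_entry, best_score = None, float("-inf")
    for entry in predictor(sentence, unknown_word)[:n]:
        for parse in parser(sentence, entry):
            score = ranker(parse)
            if score > best_score:
                best_entry, best_score = entry, score
    return best_entry
```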



Experiment

Results

[Bar chart: precision of DLA with the LinGO ERG for the baseline, tagger, ME(−pp), ME(+pp), and +disambiguation models; precision ranges roughly from 30% to 65%]


What has been done?

Error mining based lexical error detection
The experiment with the ERG and the BNC shows that a major part of parsing failures is due to missing lexical entries.
Some rules are used to discover missing lexical entries.

Statistical lexical acquisition
A maximum entropy based lexical type prediction model is designed and evaluated with various feature templates.
A lexical-ontology-based acquisition method is tried.
A disambiguation model is incorporated to enhance robustness.


Baldwin, T. (2005). Bootstrapping deep lexical resources: Resources for courses. In Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition, pages 67–76, Ann Arbor, Michigan. Association for Computational Linguistics.

Baldwin, T., Bender, E. M., Flickinger, D., Kim, A., and Oepen, S. (2004). Road-testing the English Resource Grammar over the British National Corpus. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, Portugal.

Barg, P. and Walther, M. (1998). Processing unknown words in HPSG. In Proceedings of the 36th Conference of the ACL and the 17th International Conference on Computational Linguistics, Montreal, Quebec, Canada.

Cussens, J. and Pulman, S. (2000). Incorporating linguistic constraints into Inductive Logic Programming. In Proceedings of the Fourth Conference on Computational Natural Language Learning and of the Second Learning Language in Logic Workshop.

Erbach, G. (1990). Syntactic processing of unknown words. IWBS Report 131, IBM, Stuttgart.

Fouvry, F. (2003). Lexicon acquisition with a large-coverage unification-based grammar. In Companion Volume to the 10th Conference of the EACL, pages 87–90, Budapest, Hungary.

Levin, B. (1993). English Verb Classes and Alternations. University of Chicago Press, Chicago, USA.

van Noord, G. (2004). Error mining for wide-coverage grammar engineering. In Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume, pages 446–453, Barcelona, Spain.