Using weighted ﬁnite state morphology with VISL CG-3—Some ... · Using weighted ﬁnite state...

Using weighted finite state morphology with VISL CG-3—Someexperiments with free open source Finnish resources

Tommi A PirinenOllscoil Chathair Bhaile Átha Cliath

CNGL—School of ComputingDublin City University, Dublin 9

[email protected]

Abstract

Traditionally, the coupling of finitestate morphology and constraint gram-mar has been strictly rule-based, mak-ing binary distinctions between allowedand disallowed readings, however, inthe recent years much of the researchin the finite state morphologies hasadapted the contemporary paradigm ofstatistically weighted analysis. This isreflected in current versions of free andopen source morphology of Finnish,omorfi, in the finite state morphol-ogy part. In this paper we exam-ine two strategies of making use ofthe weights as a part of VISL CG-3pipeline. We evaluate the results in-trinsically on small sample of analyseswe have disambiguated by hand our-selves, and extrinsically on the effect ithas on the rule-based machine transla-tion of that text using the freely avail-able open source translator, apertium-fin-eng.

1 IntroductionIn the recent years, use of statistical in-formation in computational linguistics hasgained much interest, with systems like hun-pos (Halácsy et al., 2007), moses (Koehn etal., 2007) etc. being the main points of inter-est of most research in the field. In finite statemorphology as well as constraint grammars,extensions to handle probabilities are recentand scarcely documented (Lindén and Pirinen,2009; Bick, 2009). In this paper we exper-iment with an existing weighted finite statemorphology of Finnish (Pirinen, 2011)1 withVISL CG-3. For CG we have adapted Fred

1https://github.com/flammie/omorfi/

Karlsson’s Finnish CG rules to omorfi’s tagset, however, the rules were written for com-pletely different analyser, which results in rel-atively low quality and high level of ambigu-ity at the current level. We estimate that sal-vaging these rules for the current version ofanalysis would require a substantial re-writingeffort. In the meanwhile, there are a lot ofeasy targets that correctly trained statisticalanalyser can already deal with without extraeffort. E.g., one large difference we assume be-tween the analyser these CG rules were writ-ten for and omorfi’s are that omorfi containsa huge number of proper nouns, dialectal andsub-standard forms, and rare language, ani-mal etc. names, that are left ambiguous. It isobvious for a human reader that these wordsare very unlikely and given most corpora weexpect them to be highly penalised as well.

The main goal of this experiment is to cre-ate a functional pipeline out of weighted finite-state analysis and current version of the con-straint grammar. There are obvious con-flicts between the statistically driven rankedhypotheses approach and the strictly delet-ing approach of the current constraint gram-mar, which may limit usefulness of our cur-rent method of combining these two informa-tion sources.

The rest of the paper is structured as fol-lows: In section 2 we explain our starting pointand current pipelines for morphological anal-ysis, disambiguation and machine translation.In section 3 we explain various approaches wetried to include and combine weight data fromthe weighted finite-state analysers into VISLCG-3 and finally into machine translation. Insection 4 we describe our experiments and howwe measured the workability of our approach.In section 5 we show the results of the exper-iment. In section 6 we perform error analy-

Proceedings of the Workshop on “Constraint Grammar - methods, tools and applications” at NODALIDA 2015, May 11-13, Vilnius, Lithuania

29

sis, compare our work with other existing ap-proaches and lay out the future work. Finallyin section 7 we summarise the conclusion ofthe experiments.

2 Background

Our starting point for this experiment is suchthat we had a modern, weighted finite-statemorphology (Beesley and Karttunen, 2003;?) implementation of Finnish morphology inomorfi (?). This morphology has rudimen-tary support for probabilistic weighting of sur-face forms or analyses using corpora-based un-igram training approach. However, with thelack of high quality free and open source cor-pora compatible with omorfi analyses meansthat it is distributed with very basic linguist-written weights on the analysis side. Forthe main purpose of this experimentation wedeemed this sufficient, to get the weights work-ing through the pipeline at all.

On the other hand we had a free and opensource, mature and large CG grammar by FredKarlsson, that needed conversion to omorficompatible tagging format, as well as somechanges from CG-1 syntax to VISL CG-3.2

The fact that the CG rules from Karlssonwere built using very different analyser thanours also played a large role in our decisionto combine the weighted approach to withpure constraint grammar approach: the rule-writers of the original grammar had not seenlarge portion of the ambiguities introduced bylarger, more varied lexicon of omorfi, includingthings like dialects, large inventories of propernouns and unlikely but attested readings likeplural cases of singular personal pronouns. Forexample, in the story we use for reference inour translation experiments, the sentence ini-tial common words like “Mutta” (but) and“Koira” (dog) are also proper nouns, but alsoproper nouns like “Mari” have been added acommon noun reading (slang for marihuana).Obviously these are not dealt with in the orig-inal ruleset as they have not appeared as am-biguities to the writers of th rules.

2even though CG-1 and VISL CG-3 are possiblyare mostly compatible, we found that some things mayhave started working better when changing to moreconventional VISL CG-3 constructions

3 Methods

To first convert the original CG-1 ruleset toomorfi format analyses, we went through therules by hand from beginning to end. Thisresulted in a ruleset where only a subset ofrules matched to any constructs in the anal-ysed texts. To further improve the quality andfix a lot of conversion errors we made use ofthe new VISL CG-3 features no-inline-sets.With help of this feature we got most of theambiguous word-forms at least to match someof the rules, which hopefully means conversionhas not too many tag formatting mismatch er-rors at the very least. The resulting rulesetwith weight-based rule integrated is availablefrom omorfi git repository.

To feed omorfi analyses into VISL CG-3 we have extended the python interface ofomorfi to output CG stream format analy-ses, with omor style [FEATURE=VALUE] tagsmapped into more conventional CG style tags,mostly of form VALUE. There are number ofdeviations to this of course, most notable be-ing the WEIGHT= feature, which is turned intoVISL CG-3 numeric tag. Other special con-versions include things like usage, dialect andsuch lexical information, which are all includedin angle bracket tags following VISL CG-3 con-ventions. Omorfi python interface also per-forms some case mangling (uppercasing, low-ercasing, title-casing and removing title case)that seems to be similar as CG-1 rules seem toexpect to appear in some angle-bracketed tags,so we have tried to map these to the readingsin the original ruleset, with limited success.

The probabilities in omorfi are provided bythe underlying HFST (Lindén et al., 2011) sys-tem as a floating point number based on thefinite-state implementation of a tropical semir-ing. This weight can be based on negativelogarithms of probabilities of the word-forms,lemmas, analyses etc., as well as linguist-defined arbitrary values. For the purposesof this experiment we only used the linguist-defined values that are neatly in range of 0.1—32. This simplifies the scaling of the weightsintroduced by VISL CG-3 processing as weonly have to scale against known range in-stead of e.g. combinations of negative loga-rithms’ maxima. As noted earlier in section 2,we use the default setting which is based on


30

linguist-approximated tag likelihoods. SinceVISL CG-3 does not support floating pointnumbers, e.g. 0.1, we output weight in a nu-meric tag multiplied by a 100 before round-ing them down and turning into a tag of theform <W=weight>, where weight is the multi-plied weight. This is sufficient for the coarseweights that default analyser produces, and inline what e.g. cg-conv does when it treatsstream formats containing decimal data to beconverted into numeric tags.

The basic support for numeric tag proces-sign in VISL CG-3 is done by the SELECT(<W=MIN>) statement. If applied as a sole ruleto result of omorfi to VISL CG-3 conversionit exactly like traditional weighted finite statemorphology producing 1-best analysis. Whencombined into existing ruleset, we add thisinto a last, separate SECTION, in order to inte-grate some weight handling to CG iterations.

One long-term goal of this experimentationwas to use VISL CG-3 also as a part of mor-phological analysis pipeline that produces n-best lists in same manner as weighted finite-state analyser does. To make this work, wetake the output of VISL CG-3’s cg-proc intrace mode before converting it back to an n-best list. There are multiple possible strategiesto use readigns for deleted analyses as weightsagain. With this experiment, we have sim-ply gone with adding the line number of therule, this reflects the fact that later rules in thefile are more risky and less ambiguous. Ide-ally however, we would like to annotate therules using rule name labels, such as “usually”,“dangerous” to denote e.g. multipliers for suchrules. Furthermore, it is likely that it is not ex-actly the line number, but rather the sectionnumber, that is relevant for the rule likelihood,due to way linguists and rulewriters will or-ganise rules within sections into blocks of re-lated rules where ordering within and betweenblocks may not be important.

4 Experimental Setup

For analysis we use the python API to omorfiversion 20150326, to turn the analyses into theformat understood by VISL CG 3. We use aversion Fred Karlsson’s Finnish CG found in

apertium’s repository.3, with the tag set man-ually converted to match omorfi’s,4 however,given the amount of ambiguous names of tagsand sets and lists in the grammar, there maybe some conversion errors left. The systemis tested with VISL CG-3 version 0.9.9.10730,compiled from Gentoo packaging.5

To test the functionality of our combinationof weighted finite-state analyser and VISL CG-3, we analyse a short text that we have man-ually disambiguated and measure the qualityof analyses. The source of the text is foundin the apertium’s SVN repository.6 For thepurpose of this experiment, we have manuallytokenised the text before processing it withomorfi. In addition to analysis we use the re-sults of analyses in apertium’s Finnish-Englishmachine translator, and measure the transla-tion quality. This way we can ensure that thegold annotation has not been selected to bestfit our results but is actually the semanticallymost fitting one. The gold annotations canalso be found in the omorfi git repository.

To perform evaluations we used simplepython script that performs string compar-isons of the gold file lines between the linesstarting with "< ignoring empty lines andthe ADDed CLB tags. The machine trans-lation analysis was performed against cur-rent apertium-fin-eng ruleset and the refer-ence translation in their svn, with standardmachine translation metrics as measured byNIST’s mteval-13a.pl, which is the standardBLEU metric of machine translation (?).

5 Evaluation

We first evaluated the analysers against thegold standard in table 1. We use simplemetrics of Recall and Precision, defined asRecall = Correct

Gold , where Correct is number ofcorrect readings and Gold is number of goldreadings, and Precision = Correct

All , where All isnumber of all readings given by the disam-

3http://sourceforge.net/p/apertium/svn/HEAD/tree/nursery/apertium-fin-eng/apertium-fin-eng.fin-eng.rlx

4https://github.com/flammie/omorfi/tree/master/src/vislcg3

5https://github.com/flammie/flammie-overlay/tree/master/sci-misc/vislcg3

6http://sourceforge.net/p/apertium/svn/HEAD/tree/nursery/apertium-fin-eng/texts/tarina.fin.text


31

Rules Precision RecallWeights 60 99Rules 78 91Combination 80 90

Table 1: Precision and recall of different com-binations of weighted morphology and rules.

Rules Precision RecallWeights 60 99Rules 78 91Combination 80 90

Table 2: Precision and recall of different com-binations of weighted morphology and rules.

biguation scheme. The row Weights stands forCG with only select weighted best applied, therow Rules stands for only converted CG rule-set applied, and row Combination uses both.

The resulting analyses is then convertedto format expected by apertium for machinetranslation and evaluated for machine transla-tion quality in table 2.

6 Discussion

First of all, we note that the quality differenceswith adding weights has diminished from theversion prior to conference and current ver-sion. This is largely due to newer versionreleased in the workshop containing featuresthat greatly improved the tag matching of theconverted ruleset. Following this result we cansay that the weights are most useful when therules are not as high coverage, i.e. early stagesof development or, as in this case, conversionprocess.

Nevertheless, the overall effect of combin-ing weights has still improvements to exactlythe shortcomings noted in the introduction asproblems of the mismatching morphologies. Inerror evaluation, the cases that are affected byrules are mostly in derivation and productivecompounding, but also some marginal casesthat are not covered by rules.

For future work we are aiming to use then-best list version of the result in a real-worldapplication pipeline.

7 Conclusion

We have implemented a VISL CG-3 output ontop of existing weighted finite-state analysis ofFinnish language and tested that it works com-bined with VISL CG-3. We have successfullyincluded this combination as a part of aper-tium machine translation pipeline. We notethat weighted finite-state analysis can be eas-ily combined with VISL CG 3 and results inan increased accuracy.

Acknowledgments

The research leading to these results has re-ceived funding from the European Union Sev-enth Framework Programme FP7/2007-2013under grant agreement PIAP-GA-2012-324414(Abu-MaTran).

ReferencesKenneth R Beesley and Lauri Karttunen. 2003.

Finite-state morphology: Xerox tools and tech-niques. CSLI, Stanford.

Eckhard Bick. 2009. Introducing probabilistic in-formation in constraint grammar parsing. InProceedings of Corpus Linguistics 2009.

Péter Halácsy, András Kornai, and Csaba Oravecz.2007. Hunpos: an open source trigram tagger.In Proceedings of the 45th annual meeting of theACL on interactive poster and demonstrationsessions, pages 209–212. Association for Com-putational Linguistics.

Philipp Koehn, Hieu Hoang, Alexandra Birch,Chris Callison-Burch, Marcello Federico, NicolaBertoldi, Brooke Cowan, Wade Shen, ChristineMoran, Richard Zens, et al. 2007. Moses: Opensource toolkit for statistical machine translation.In Proceedings of the 45th Annual Meeting of theACL on Interactive Poster and DemonstrationSessions, pages 177–180. Association for Com-putational Linguistics.

Krister Lindén and Tommi Pirinen. 2009. Weight-ing finite-state morphological analyzers usingHFST tools. In Bruce Watson, Derrick Courie,Loek Cleophas, and Pierre Rautenbach, editors,FSMNLP 2009, July.

Krister Lindén, Erik Axelson, Sam Hardwick,Tommi A Pirinen, and Miikka Silfverberg. 2011.Hfst—framework for compiling and applyingmorphologies. Systems and Frameworks forComputational Morphology, pages 67–85.


32

Tommi A Pirinen. 2011. Modularisationof Finnish finite-state language description—towards wide collaboration in open source de-velopment of morphological analyser. In Pro-ceedings of Nodalida, volume 18 of NEALT pro-ceedings.


33

Date post:	25-Apr-2020
Category:	Documents
Upload:	others
View:	9 times
Download:	0 times

Using weighted ﬁnite state morphology with VISL CG-3—Some ... · Using weighted ﬁnite state...

Documents