Date post: | 20-Mar-2017 |
Category: |
Internet |
Upload: | andrzej-zydron-mbcs |
View: | 196 times |
Download: | 0 times |
Multilingual Value Chain Solution for the Digital Single Market
Federated Active Linguistic data CuratiON
EU FP7 Project
The FALCON project combines the power of open data on the web with data-driven language technologies to construct the Localization Web.
Partners:
Wholesalers
Digital Single Market
Decoupage.ie
Wholesalers
ecommerce SaaS
ePayment Service
Customer
• Digital Single Market works well downstream– English as lingua franca– Export-focused Medium-SME wholesalers and service providers– MNCs with establish multilingual offering
• Language Barrier firmly in place upstream• Challenge to Customer Engagement Ecosystem
– Must become systematically multilingual– Must serve micro-domains that allow SMEs to add value
WholesalersWholesalers
Customer Engagement Ecosystem
Niche Value Add
Social Media
Online Communities
Search & SEO
Content Analytics
Trade Guilds/Associations
Events Knowledge &training resources
Translation
Translation workflowThe company has also reduced its production capacity by ceasing manufacture of chest freezers and freestanding microwave ovens
Extraction & Segmentation
production capacity
capacité de production
✔
✔ Annotation with Existing Terms
chest freezer
microwave oven
réfrigérateur
four à micro-onde
?
??
?
Auto suggestion from Babelfy/Babelnet
D'autre part, la société a réduit sa capacité de production en arrêtant la production de réfrigérateur et de fours micro-onde pose-libre
Machine Translate with Term Translations
MT Vendor?
D'autre part, la société a réduit sa capacité de production en arrêtant la production de congélateurs coffres et de fours micro-ondes pose-libre
✗
congélateurs coffres
fours micro-ondes
✔
Postedit and capture terms in context
✔✔
✔
✔
✔
✔
PE
PE
PE
PE
PE
PE
PE
✗
PE
✔
• Protect & pool niche knowledge
• Interlink corpora and lexical-conceptual resources
• Measuring ROI at each point in value chain
• Manage ownership, rights and rewards
• Privacy by Design for Social Media data resources
• Open data for NLP shared task
Integrated Content/Data Value Chain
Public Data
Content publisher
Support Service Provider
LanguageTechnology Provider
• Better in-context postediting:
– XTM-Easyling
• Feeding term suggestions from posteditor to Terminology Management
– XTM-Interverbum
• Dynamic Retraining
– XTM-DCU
• Bilingual Dictionary SMT improvements
– XTM-DCU
• NER, terminology enforcements, forced decoding
– XTM-Interverbum-DCU
• Postediting prioritisation and term flagging
– TCD-DCU-XTM
• Publishing interlinks of parallel text, lexically rich term bases
– TCD: DG-T TM, EurVoc, Snomed-CT, LEMON, BabelNet
FALCON Innovation
Terminology Management
Website in context translation
• THANK YOU!