Date post: | 19-Jan-2017 |
Category: |
Technology |
Upload: | taus-enabling-better-translation |
View: | 311 times |
Download: | 0 times |
Slide 1
Speech Translation: Past, Present, and Future
Mark Seligman Spoken Translation, [email protected]
Past
In the beginning
NEC demo, ITU Telecom World (1983 )Chat: Uni-verse, Amikai, Compuserve, Global-Link; (later) OrtsboC-STAR(-2) consortium (1991?)ATR trilateral demo (1993)English, French, Japanese, Italian, Korean, German, Chinese First unrestricted SLT (1997, 1998)GETA group (Seligman)/Compuserve Information Transcript Project (1995, 1996)MIT Biennale dArt Contemporain, LyonIBM Voice TypeGlobal-Link translation software Mac TEXT-TO-SPEECHCU-SeeMe, video, audioRobert PalmquistMobile device (1993)English-Spanish, Office of Naval Research (1997)SpeechGear launch (2001)Mobile device for JapaneseEnglish (2003)Interpreter mobile phone (2004)Compadre: Interact 4.0, English35 other languages (2009)Talk&Translate
(Still) In the beginning Verbmobil (Wolfgang Walster)German Federal Ministry of Research and Technology (1993-2000)Linguatec (>Lingenio)First boxed product (1998)More JapaneseNEC mobile device with Japanese-English (2006)ATR-Trek, Shabete Honyaku, mobile phone (2007)AppsGoogle Translate (2006-2007)Jibbigo, first offline (2009)Healthcare system pilotsSpoken Translation, Inc., Converser for Healthcare 3.0 (2011)Sehda (>Fluential) (2009?)Phone-basedSpeechTrans (multi-lingual conferences) (2014)Lexifone (2014)
4
Converser for Healthcare 3.0: Kaiser Permanente Pilot
Three departments at San Francisco Medical Center Pharmacy:Consulting or Drop-off use caseShortcuts: Consultation: Typhoid VaccinePickup use caseGreeter use caseInpatient NursingShortcuts: IV, External Catheter, Pain AssessmentEye CareShortcuts: Informed Consent for Cataract Surgery
Present
Google, Skype, Waibel
Alex Waibels Simultaneous Translation
7
Google Translates Dynamic OCR Feature
8
Future
More PlatformsWatchesApple WatchSpeechTrans GlassesAugmented realityIn 3 years, not 300!Issues:Cloud vs. on-deviceMust be everyday deviceMust be cool!Watch Apple!
10
The Way Forward
Improve statistical MTUser feedback + machine learningMore, better data Knowledge source integration DiscourseDomainProsody The Return of Semantics Interlingua/ontologiesTrue (grounded) semanticsNew tech Deep neural networksOther AI
Improve Statistical MT
User feedback (over) + machine learningMore, better dataParsing > hybrid MT
User FeedbackVerification and correction of translation Reliable RetranslationMeaning Cues
13
Knowledge Source Integration
Think: IBMs Watson or Verbmobil (till 2000)Nine Issues in Speech TranslationDiscourseSpeech actsTopic trackingDomainProsody PausesPitch, stressTranslation mismatchesSystem architecture, data structures
_car_obj_driving_do_personSyntactic structureNPVPSemantic structure
PPVNNPVNV
drive person person car
modagtobjThe Return of Semantics: Interlingua/Ontologies
English Real-time TextTexte Franais en Temps Vrais
English Voice
Voix Franais
Real-time Voice/TextDo you come here often?
Est-ce que tu viens souvent ici?Machine Translation
Google Knowledge Graph
17
IBM Watsons Knowledge Graph
18
The Return of Semantics: Grounded Semantics
19
New TechMT via deep neural networksGoogle DeepmindKyunghyun Cho (NYU)Other AIKurzweils brain projectMicrosoft research Issue: Is black box sufficient?
20
Contact
Mark Seligman [email protected]
Sendoff