Turning NMT research into commercial products Dragos Munteanu and Adrià de Gispert Proceedings of AMTA 2018, vol. 2: MT Users' Track Boston, March 17 - 21, 2018 | Page 166


Page 1

Turning NMT research into commercial products

Dragos Munteanu and Adrià de Gispert

Page 2

• Founded in 1992
• 3800+ employees
• 56 offices
• 38 countries
• 400 partners
• 1500 enterprise customers

helping big brands go global

marketing campaigns, eCommerce, documentation, web, social media, analytics

78 of the top 100 global companies work with SDL

+10 BILLION words translated monthly

Page 3

SDL Research – a long history in MT

• Research labs in Los Angeles (USA) and Cambridge (UK)
• Team members have published +100 on SMT and related tech
  – Bill Byrne, Abdessamad Echihabi, Dragos Munteanu, Gonzalo Iglesias, Eva Hasler, Adrià de Gispert, Steve DeNeefe, Jonathan Graehl, Wes Feely, Ling Tsou...
• Formerly Language Weaver
  – 15 years of leading expertise in SMT
  – major contributions (papers/patents) in phrase-based and string-to-tree MT, automata-based hierarchical MT, quality estimation, tuning, evaluation...
• Strong links with academia (University of Cambridge)
• Summer internships, industrial post-docs

Page 4

Our mission: Bring MT research results to products

We strive to provide our customers:

• Approaches that work for many language pairs
• Connectors, plug-ins…
• Translation speed
• Privacy! Top-quality MT on premise and in private cloud
• High translation quality
• Customization / Personalization
• Respect file formats and tags
• Controllable memory and disk footprint
• Robustness to mis-spellings
• Ability to learn over time (Adaptive MT)
• Terminology and dictionaries
• Consistency

Page 5

SDL Secure Enterprise Translation Server

✓ 45 NMT engines currently available

Data Security
- On premises / private cloud
- Used by gov’t for 15 years

Quality / Customization
- Neural MT
- Custom MT out-of-the-box

Cost-effective scalability
- Elastic, optimized footprint
- Commodity hardware

Ease of Use / Integration
- Deploys in hours
- MS plug-in & REST API

Page 6

Neural Machine Translation

Page 7

A paradigm shift

SMT
• Symbolic models
• Independence assumption (separate sub-problems)
• Maximum-likelihood estimation
• CPU-oriented training
• Source-side-guided decoding
• Large databases

Neural MT
• Continuous-space models
• Single end-to-end model
• Discriminative training
• Reliance on GPUs
• Target-side-guided decoding
• Smaller compact models

Page 8

Better translation models

[Zhou et al. ’16]

[Gehring et al. ’17] [Vaswani et al. ’17]

[Sutskever et al. ’14] [Bahdanau et al. ’15]

Page 9

Better BLEU scores

Timeline: Rules-Based (1970), Statistical MT (2002), Neural MT (2016)

WAT     Jpn-Eng    Eng-Jpn
2014    23.8       35.0
2015    25.4       35.8
2016    27.6       36.2
2017    28.4       41.5
        +4.4 !!    +6.5 !!

Page 10

COMPARING OUTPUTS

[Side-by-side panels: SMT, NMT]

Page 11

Observable quality improvement

国連難民高等弁務官事務所(UNHCR)は、内戦状態にあるシリアから逃れた難民の数が5百万人を超えたと発表した。

Office of the United Nations High Commissioner for Refugees (UNHCR) is in a state of civil war when the number of refugees who have escaped from Syria have exceeded 5 million people.

The United Nations High Commissioner for Refugees (UNHCR) announced that the number of refugees escaped from Syria in the civil war was over five million people.

✓ 30% improvement over SMT across all our productized engines

Page 12

But… is it ALL that good?

There are situations in which NMT fails

• When it fails, it fails spectacularly
  – unrelated fluent text
  – repetitions, neurobabble…
• MT user/customer expectations
  – “MT is not supposed to do this”!!!?!
  – “Can it support the features I need”???

Page 13

Over-generation and ‘neurobabble’

There was no clear correlation between the measured mass density and the measured mass density, and neither experiment A or B.

The company will pay approximately EUR 600 million in fines, and the U.S. Department of Justice (SEC) to pay for approximately EUR 600 million, and the U.S. Department of Justice and the Justice Department of Justice (SEC) to reduce the amount of internal control of the board of directors of the board of directors of the board of directors…
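Runaway repetition like the second example can be flagged automatically by measuring how many of a hypothesis's n-grams repeat an earlier one. The sketch below is only an illustration of that idea, not a method described in the talk; the function names, the trigram order and the 0.2 threshold are all assumptions.

```python
from collections import Counter


def repeated_ngram_ratio(text: str, n: int = 3) -> float:
    """Fraction of n-gram occurrences that repeat an earlier n-gram."""
    tokens = text.lower().split()
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    if not ngrams:
        return 0.0
    counts = Counter(ngrams)
    # Every occurrence beyond the first counts as a repeat.
    repeats = sum(c - 1 for c in counts.values())
    return repeats / len(ngrams)


def looks_like_babble(text: str, n: int = 3, threshold: float = 0.2) -> bool:
    """Hypothetical check: flag a hypothesis whose repeat ratio exceeds the threshold."""
    return repeated_ngram_ratio(text, n) > threshold


babble = "the board of directors of the board of directors of the board of directors"
print(looks_like_babble(babble))
```

A check of this kind only catches looping output. Fluent but unrelated text needs a different signal, such as comparing against the source.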


Page 15

Data is EVEN MORE important

• New NMT models are better learners
  – A better fit to the training data
  – Relevant training data is key
  – Avoid babble and get huge gains!
• Domain adaptation / data selection [Freitag and Al-Onaizan ’16] [Chen et al. ’17] [Britz et al. ’17] [Farajian et al. ’17] [Van der Wees et al. ’17] [Wang et al. ’17]…

Page 16

Adapting neural models

Major improvements!

Challenge:
§ Adapt to customer domain/data with minimal re-training
§ Maintain high quality across domains

Jpn-Eng corpus    # words
Generic           >300M
Automotive        <1M

Page 17

Lexical selection

• NMT models have freedom to produce any target word
  – Guided by source, not constrained
• SMT engines were good at lexical selection – can we leverage?
  – T-table, n-gram and phrase probabilities, memory-augmented models/search

[Arthur et al. EMNLP’16] [Stahlberg et al. EACL’17] [Wang et al.; Dahlmann et al.; Feng et al. EMNLP’17] [Zhang et al. IJCNLP’17]…

Figures: [Arthur et al. EMNLP’16], [Wang et al. EMNLP’17]

Page 18

NMT can use N-gram posterior probabilities

Stahlberg et al. (EACL’17): “Neural Machine Translation by Minimising the Bayes-risk with Respect to Syntactic Translation Lattices”

Page 19

But… are there guarantees?

• Control is a must for commercial success
• One very bad sentence can put off a customer
  – Back-off if needed
• Customers/Users expect certain ‘features’
  – Decoding speed, dictionary support, formatting constraints, Adaptive MT, …

Page 20

Dictionary support

• Translation output must translate dictionary matches exactly – constrained search
• Easy for SMT decoders
• NMT beam decoder does not keep an alignment between source and target words

“Zimra Games continues to innovate with the release next month of Coke Assault 3, which will satisfy the most demanding gamers.”

English           German
Zimra Games       Zimra Games GmbH
Coke Assault 3    Coke Assault III
…                 …

[Anderson et al. EMNLP’17] [Hokamp & Liu ACL’17] [Chatterjee et al. WMT’17]

Page 21

Dictionary support

Constrained search
• Build a finite-state acceptor with the target-side constraints
• Keep one separate stack per acceptor state
• Output only hypotheses from the final acceptor state
➢ Constraints can be words or phrases

[Anderson et al. EMNLP’17]
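The three steps on this slide can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the production decoder or the exact algorithm of Anderson et al.: `toy_log_probs` is a made-up stand-in for an NMT model, the acceptor state is simplified to "set of satisfied constraints plus position inside the phrase currently being matched", and a partial match is simply dropped on a mismatch (a real acceptor would use proper failure transitions).

```python
import math

# Hypothetical toy "model": after each word it strongly prefers one continuation.
VOCAB = ["Zimra", "Games", "GmbH", "veröffentlicht", "Coke", "Assault", "3", "III", "</s>"]
PREFERRED = {"<s>": "Zimra", "Zimra": "Games", "Games": "veröffentlicht",
             "GmbH": "veröffentlicht", "veröffentlicht": "Coke", "Coke": "Assault",
             "Assault": "3", "3": "</s>", "III": "</s>"}


def toy_log_probs(prefix):
    prev = prefix[-1] if prefix else "<s>"
    return {w: math.log(0.6 if w == PREFERRED[prev] else 0.05) for w in VOCAB}


def advance(state, token, constraints):
    """Acceptor transition. State = (satisfied constraint ids, (id, pos) being matched or None)."""
    done, active = state
    if active is not None:
        idx, pos = active
        if constraints[idx][pos] == token:
            if pos + 1 == len(constraints[idx]):
                return (done | {idx}, None)      # phrase completed
            return (done, (idx, pos + 1))        # still inside the phrase
    # No phrase in progress (or mismatch): the token may start an unsatisfied constraint.
    for idx, phrase in enumerate(constraints):
        if idx not in done and phrase[0] == token:
            return (done | {idx}, None) if len(phrase) == 1 else (done, (idx, 1))
    return (done, None)


def constrained_beam_search(log_probs, constraints, beam_size=2, max_len=12):
    start = (frozenset(), None)
    final = (frozenset(range(len(constraints))), None)
    stacks = {start: [(0.0, [])]}                # one stack (beam) per acceptor state
    finished = []
    for _ in range(max_len):
        candidates = {}
        for state, hyps in stacks.items():
            for score, tokens in hyps:
                for word, lp in log_probs(tokens).items():
                    hyp = (score + lp, tokens + [word])
                    if word == "</s>":
                        if state == final:       # output only from the final state
                            finished.append(hyp)
                        continue
                    new_state = advance(state, word, constraints)
                    candidates.setdefault(new_state, []).append(hyp)
        stacks = {s: sorted(h, key=lambda x: x[0], reverse=True)[:beam_size]
                  for s, h in candidates.items()}
    return max(finished, key=lambda x: x[0])[1] if finished else None


constraints = [("Zimra", "Games", "GmbH"), ("Coke", "Assault", "III")]
print(" ".join(constrained_beam_search(toy_log_probs, constraints)))
```

Left to itself the toy model prefers "3" after "Assault" and never produces "GmbH". Because each acceptor state keeps its own stack, low-scoring hypotheses that carry the required phrases are not pruned by higher-scoring ones that ignore them.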

Page 22

Dictionary support

Challenges
• Computational complexity grows exponentially with the number of constraints
  – order is unknown
• Nothing prevents repeated decoding:

“Zimra Games GmbH setzt mit dem Veröffentlichung auf Coke Assault III im nächsten Monat der Angriff …
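The first challenge above comes down to counting acceptor states. When the order of the constraints is unknown, the acceptor has to remember which subset has been satisfied so far, so the number of states (and stacks) doubles with every added constraint. The sketch below enumerates those states for single-word constraints; the function names are hypothetical and the fixed-order case is included only for contrast.

```python
from itertools import combinations


def unordered_states(num_constraints):
    """One acceptor state per subset of satisfied constraints (order unknown)."""
    ids = range(num_constraints)
    return [frozenset(c)
            for k in range(num_constraints + 1)
            for c in combinations(ids, k)]


def ordered_states(num_constraints):
    """If the order were fixed, a state would only record how many constraints are done."""
    return list(range(num_constraints + 1))


for n in (1, 3, 10):
    print(n, len(unordered_states(n)), len(ordered_states(n)))
```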

Page 23

Entity constraints

• Decoder must also respect meta-tags
  – Key to support file formats used by MT users
• NMT model should not break sequential history
• Solutions require model specialization and/or decoding restrictions

“<B>Zimra Games</B> continues to innovate with the release <I>next month</I> of <B>Coke Assault <c=red>3</c></B>, which will satisfy the most demanding gamers.”
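One cheap safeguard, independent of how the decoder itself is restricted, is to verify after decoding that the output carries the same tags as the source and that they are properly nested, and to back off otherwise. The sketch below is an assumption-laden illustration: the function names are hypothetical, the tag syntax is inferred from the example sentence above, and attribute values such as `red` are ignored in the comparison.

```python
import re
from collections import Counter

# Matches <B>, </B>, <c=red>, </c>: optional slash, tag name, optional =attribute.
TAG = re.compile(r"<(/?)([A-Za-z]+)(?:=[^>]*)?>")


def tag_sequence(text):
    """List of (tag name, is_closing) pairs in order of appearance."""
    return [(m.group(2), m.group(1) == "/") for m in TAG.finditer(text)]


def tags_well_nested(text):
    """True if every closing tag matches the most recently opened tag."""
    stack = []
    for name, closing in tag_sequence(text):
        if not closing:
            stack.append(name)
        elif not stack or stack.pop() != name:
            return False
    return not stack


def respects_source_tags(source, target):
    """Hypothetical acceptance test: same multiset of tags, correctly nested in the target."""
    return (tags_well_nested(target)
            and Counter(tag_sequence(source)) == Counter(tag_sequence(target)))


source = ("<B>Zimra Games</B> continues to innovate with the release <I>next month</I> "
          "of <B>Coke Assault <c=red>3</c></B>")
good = "<B>Zimra Games GmbH</B> ... <I>im nächsten Monat</I> ... <B>Coke Assault <c=red>III</c></B>"
bad = "<B>Zimra Games GmbH ... <I>im nächsten Monat</B></I> ... <B>Coke Assault <c=red>III</c></B>"
print(respects_source_tags(source, good), respects_source_tags(source, bad))
```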

Page 24

Decoding speed

• MT users expect certain translation speeds
  – Target speed varies, but well above research engines
• Goal is to provide best quality at desired speed
  – Speed vs quality trade-off
• NMT deployment scenarios
  – CPU only – hand-held devices, …
  – GPU
• NMT training speed also relevant

Page 25

Decoding speed vs quality trade-off (1)

• Model architecture
  – recurrent, convolutional, attentional…
  – number of parameters, layer precomputations…
  – Unfolding and shrinking ensembles [Stahlberg and Byrne, EMNLP’17]

Stahlberg and Byrne (EMNLP’17): “Unfolding and Shrinking Neural Machine Translation Ensembles”

Page 26

Decoding speed vs quality trade-off (2)

• Hardware and Linear Algebra library
  – Type of GPU card
  – CPU-GPU communication
  – GPU usage
• Batching
  – standard in training

Page 27

Decoding speed vs quality trade-off (3)

• Decoding parameters
  – beam size, early stopping…
• Reduced vocabulary softmax (CPU)
• Weight clipping in training
  – Low-precision inference [Wu et al. ’16] [Devlin, EMNLP’17]…
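The link between weight clipping and low-precision inference can be seen in a small sketch: once weights are known to lie in a fixed range, they can be mapped to 8-bit integers with a single shared scale, and the rounding error is bounded by half a quantization step. The code below is a generic illustration in plain Python, not the scheme of either cited paper; the clipping limit of 1.0 and the function names are assumptions.

```python
def clip(weights, limit):
    """Clamp every weight into [-limit, limit], the range clipping in training would enforce."""
    return [max(-limit, min(limit, w)) for w in weights]


def quantize_int8(weights, limit):
    """Map clipped weights to integers in [-127, 127] using one shared scale."""
    scale = limit / 127.0
    return [round(w / scale) for w in clip(weights, limit)], scale


def dequantize(quantized, scale):
    """Recover approximate float weights from the integer codes."""
    return [q * scale for q in quantized]


weights = [0.5, -1.7, 0.03, 2.4]          # two values fall outside the clipping range
quantized, scale = quantize_int8(weights, limit=1.0)
restored = dequantize(quantized, scale)
for original, approx in zip(clip(weights, 1.0), restored):
    print(f"{original:+.4f} -> {approx:+.4f}")
```

Without clipping, a single outlier weight would stretch the scale and waste most of the 8-bit range on values that never occur.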

Page 28

Copyright © 2008-2017 SDL plc. All rights reserved. All company names, brand names, trademarks, service marks, images and logos are the property of their respective owners. This presentation and its content are SDL confidential unless otherwise specified, and may not be copied, used or distributed except as authorised by SDL.

Software and Services for Human Understanding