neural machine translation: a report from the front line
TRANSCRIPT
Neural MT
Separating the hype from reality
a report from the front line
TAUS Webinar / Industry Leaders Forum / LocWorld 34
The Neural Narrative
Great, thanks. Let me use it now…
Level of D
ifficulty
Korean
Japanese
French IT
Marketing
Patent
Gisting
PEMT
Perfect
German
Use caseIndustryLanguage
Use cases for machine translation
Impact of Neural MT
Use cases covered by generic MT
Use cases needing custom MTThe
Bar
Impact of Neural MT
Use cases covered by generic MT
Use cases needing custom MTThe
Bar
Neural
MT has
raised the
bar
Of course.
We’re a team of MT experts. This is a big part of the value that
we bring to the table. We’re not just taking open-source tools off
the shelf. We’re innovating, researching, developing new
processes. Same for Neural MT.
e.g. lexically constrained decoding
Neural MT @ Iconic
“Do you ‘do’ Neural MT?”
It’s one of the ways.
MT is not a one-size-fits-all technology. What constitutes the best
approach depends on the language pair, domain, use case, and
various other factors. In some cases, the best approach will be
Neural MT, but not yet all the time.
Neural MT @ Iconic
“Is this the way you do MT now?”
When it gives the best output!
When you’re customising MT, there are so many things you can
do – different processors, parameters, ways of combining data
and tuning. We try multiple approaches and allow our systems to
use the best one.
Let’s look at some case studies, but first…
Neural MT @ Iconic
“When do you use it then?”
The Iconic Ensemble Architecture™
Patents Case Study
Average length: 7 words
Average length: 30 words
3 languages Same data sets Client evaluation
Ranking
1. Unusable
2. Poor
3. Adequate
4. Good
5. Excellent
Criteria
90% Adequate or above
0% Unusable
Linguist Review
both pass title criteria
only Iconic MT passed
abstracts
Patents Case Study – Chinese to English
31
3938
19
0
5
10
15
20
25
30
35
40
45
Titles Abstracts
Iconic MT Iconic Neural MT
Outcome
Iconic MT deployed in
production
Linguist Review
both passed on titles
only NMT passed abstracts
Patents Case Study – Japanese to English
52
33
0
10
20
30
40
50
60
Titles Abstracts
Iconic MT Iconic Neural MT
Outcome
Iconic MT deployed for titles
Neural MT deployed for
abstracts
4
54
4
Patents Case Study – Korean to English
40
25
0
5
10
15
20
25
30
35
40
45
Titles Abstracts
Iconic MT Iconic Neural MT
Linguist Review
Iconic MT below criteria
Neural MT significantly better
Outcome
Under review!
2
6
4
3
Neural MT raises the bar for general purpose MT
but the bar still needs to be tested.
Customisation Case Study
English to French English to Hindi
BLEU 1-TER BLEU 1-TER
Iconic MT 43.0 (+10.4) 55.2 (+7.7) 46.75 (+12.96) 56.4 (+5.5)
GNMT 32.6 47.5 33.79 50.9
Iconic NMT 39.2 50.5 - -
2 languages 1.5M training segments IT content
The Iconic Ensemble Architecture™
Neural MT is another
powerful tool in our
arsenal that helps us
deliver best-in-class
machine translation
output
“I don’t give a damn
what you do!”
MT @ Iconic – what we don’t do
MT MT MT MT
MT
MTMT
MTMTMTMTMT
MTMTMTMTMTMT
The ability to build your own MT
engines with Moses, Phrasal,
OpenNMT, Nematus, Fairseq.
Provide off-the-shelf general
industry engines. There are
some very adequate solutions for
that!
MT
Customised expert-built MT, using
the most appropriate tool for the job,
MT or otherwise.
MT @ Iconic – what we do do!
Develop products and solutions that
incorporate machine translation –
not just access to an API.
Engage our expert team on an Neural MT project to see if it works for your content
Neural MT – Early Adopter Program
To date
Custom-developments with
some of our closest partners
Now
Inviting early adopters to
expand the range of casesEarly Adopter Program