taking on new challenges in multi-word unit processing for machine translation

Post on 11-Jan-2016

22 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

Taking on new challenges in multi-word unit processing for machine translation. Johanna MONTI , Anabela BARREIRO, Annibale ELIA , Federica MARANO, Antonella NAPOLI. Outline. Multi-word units in the Lexicon-Grammar: definition. - PowerPoint PPT Presentation

TRANSCRIPT

Johanna MONTI, Anabela BARREIRO, Annibale ELIA, Federica MARANO, Antonella NAPOLI

Taking on new challenges in multi-word unit processing for machine

translation

Outline

2Taking on new challenges in Multi-word Unit processing for machine translation – FreeBMT 2011

Multi-word units in the Lexicon-Grammar: definition

3Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Multi-word units in the Lexicon-Grammar

4Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Multi-word units in the Lexicon-Grammar : part of a continuum

5Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Multi-word units in the Lexicon-Grammar: lemmatization

6Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Multi-word units in the Lexicon-Grammar: lemmatization criteria

7Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

The corpus-linguistic approach

8Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Multi-word units in Machine Translation

9Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Multi-word units in Machine Translation: main problems

10Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Multi-word units in Machine Translation: different approaches

Taking on new challenges in Multi-word Unit processing for machine translation – FreeBMT 2011

Lexical ambiguities handled by different systems

• Corpus: non-specialized texts approx. 300 sentences (10,000 words) multi-word units extracted from the Web

Webcorp LSE, Web as a Corpus

• MT systems : Google TranslateOpenLogos

12Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Typical ambiguities: examples

13Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Typical ambiguities: examples

14Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Integration of Semantico-Syntactic knowledge

15Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Integration of Semantico-Syntactic knowledge

16Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Integration of Semantico-Syntactic knowledge: mix up

17Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Semantic table (SEMTAB ) rule

Italian Transfer

MIX UP(VT) IN MESCOLARE IN

MIX UP(VT) N IN MESCOLARE N IN

MIX UP(VT) N WITH CONFONDERE N CON

MIX UP(VT) N(HUMAN) IN CONFONDERE N IN

MIX UP(VT) N(INGREDIENT) MESCOLARE N

MIX UP(VT) N(MEDICINE) PREPARARE N

MIX UP(VT) WITH CONFONDERE CON

MIX UP(VT) N(HUMAN,INFO) WITH CONFONDERE N CON

SemTab rules comment lines for the verb mix up

Taking on new challenges in Multi-word Unit processing for machine translation – FreeBMT 2011

Qualitative MT Evaluation metrics

19Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Qualitative MT Evaluation metrics

20Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Qualitative MT Evaluation metrics

21Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Qualitative MT Evaluation metrics

22Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Qualitative MT Evaluation metrics: the «ideal» evaluation tool

23Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Conclusions

24Taking on new challenges in Multi-word Unit processing for machine translation –

FreeBMT 2011

Johanna MONTI, Anabela BARREIROAnnibale ELIA, Federica MARANO, Antonella NAPOLI

Thank you for your attention !

top related