changes of word class during the translation process · changes of word class during the...

21
Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking data Tatiana Serbina, Sven Hintzen, Adjan Hansen-Ampah, Paula Niemietz, Stella Neumann Translation in Transition, Germersheim, 29.-30.01.2015 A HumTec Boost Fund Project funded by the Excellence Initiative of the German State and Federal Governments

Upload: others

Post on 30-Aug-2019

11 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Changes of word class during the

translation process

Insights from a combined analysis of keystroke logging

and eye-tracking data

Tatiana Serbina, Sven Hintzen, Adjan Hansen-Ampah, Paula Niemietz,

Stella Neumann

Translation in Transition, Germersheim, 29.-30.01.2015

A HumTec Boost Fund Project funded by the Excellence Initiative of the German

State and Federal Governments

Page 2: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Overview

Translation shifts

Grammatical complexity

Aims of the study

Methodology

Product-based analysis: word class changes ST vs. TT

Process-based analyses: eye-tracking analysis, word class

changes in intermediate versions of translations

2

Page 3: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Empirical translation studies

Product-based studies

Method: corpus analyses

Typical research questions: translation shifts or translation

properties

Process-based studies

Method: translation experiments (frequently keystroke

logging and eye-tracking)

Typical research questions: translators’ styles, levels of

expertise and their effect on the translation process

Our research

Treating keystroke logs as a corpus (cf. e.g. Alves &

Magalhães 2004, Alves & Vale 2009, 2011)

A combination of product-based and process-based

perspectives3

Page 4: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

GTrans: Ein Blatt Papier zusammen zu knüllen, erscheint einfach

und erfordert wenig Anstrengung; [die [Verhaltensweise]Noun des

Papierknäuels]NP zu erklären, ist dagegen eine völlig andere Sache.

(KLTC PROBRAL GT7)

Translation shifts

Translation shifts: differences between source and target texts,

e.g. part of speech change or change of semantic perspective

(Čulo et al. 2008, Cyrus 2009, Halverson 2007)

Changes of word class – transpositions (Vinay and Dalbernet

1958/1995, 36)

4

EO: Crumpling a sheet of paper seems simple and doesn't require

much effort, but explaining [why the crumpled ball [behaves]Verb the

way it does]Clause is another matter entirely.

Page 5: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Word classes – contrastive

difference

German: nominal word classes - 40.21%

verbal classes - 22.53% ratio of 1.784

English: nominal word classes - 41.39%

verbal word classes - 25.47% ratio of 1.625

More pronouns in German (8.45%) than in English (5.46%)

German appears to be more nominal than English (Hansen-

Schirra, Neumann, and Steiner 2012, 77-78)

In the translations into German: more shifts from verbs to nouns

and fewer shifts from nouns to verbs than in the opposite

translation direction (Čulo et al. 2008, 50)

5

Page 6: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Grammatical complexity

Association with different levels of grammatical complexity

Verbs – possible indicator that the process is realized canonically

through a clause

Nominalizations – may result in a more condensed and thus

grammatically more complex version (Halliday and Matthiessen

2014, 715).

Translation: understanding of the more complex units in the ST

could involve their paraphrase with grammatically more simple

structures in the TT

(Steiner 2001, Hansen-Schirra, Neumann, and Steiner 2012, 257-261)6

EO: [After the [crumpling]Noun of a sheet of thin aluminized Mylar]PP, the

researchers placed it inside a cylinder.

GTrans: [Nachdem sie ein dünnes Blatt aluminiumbeschichtetes Mylar

[verknittert]Verb hatten]Clause, gaben sie es in einen Zylinder. (KLTC

PROBRAL GT3)

Page 7: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Aims of the study

Analysis of POS distribution and shifts between main word

classes

This study concentrates on nouns & verbs and investigates the

cognitive effort during the translation process depending on

the word class in the original

type of shift in the translation

Analysis of intermediate versions in the keystroke logging data

Assumptions:

due to the contrastive difference, the translation direction English-

German may be characterized through shifts from verbs to nouns

due to the process of understanding related to the grammatically

dense noun phrases in ST, translations into German may be

characterized through shifts from nouns to verbs (in the

intermediate or final versions)

7

Page 8: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Our translation process data

Translation experiment (Neumann et al. 2010)

Translation direction: English-German

Subjects

8 professional translators

8 physicists

Material

Two versions of an authentic text with ten integrated stimuli

Abridged version of a popular-scientific text published in the

journal Scientific American Online

Apparatus

Tobii 2150 remote eyetracker, software Tobii Studio 1.5

Keystroke logging software Translog

8

Page 9: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Keystroke Logged Translation

Corpus

The corpus consists of:

2 versions of the original (source texts)

16 translations (target texts)

16 log files (process texts)

Corpus size (comprising STs and final TTs):

approx. 3,650 words

Corpus register: Popular Scientific writings

(Serbina, Niemietz, and Neumann, forthcoming)

Automatic POS annotation of ST and TT using TreeTagger

(Schmid 1994)

Manual alignment between ST and TT words using the

alignment tool (Hansen-Ampah 2014) based on the alignment

guidelines (Samuelsson et al. 2010)9

Page 10: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Methodology

Manual extraction of ST words belonging to main word classes

and the aligned TT words

Translation pairs selected for further analysis:

Shifts between nominal and verbal variants

Random samples of verbs and nouns that do not contain a shift in

the final translation

Keystroke data: identification of intermediate versions for the

selected ST words

Eye-tracking data: calculation of total fixation duration as a

concrete indicator of cognitive effort for the selected ST words

10

Page 11: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

POS distribution in ST and TT

English STs German TTs

Nouns 32,75% (113/345) 27% (882/3267)

Verbs 17,39% (60/345) 15,81% (511/3267)

Adjectives 11,30% (39/345) 9,77% (319/3267)

Adverbs 4,35% (15/345) 5,17% (169/3267)

11

More nouns and verbs in English originals than in German

translations

Technical problem: compound nouns counted as several

nouns in English but as one in German (Čulo et al. 2008,

49)

Page 12: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Types of word class shifts

12

Absolute numbers % of all shifts

VERB → NOUN 37 49,95%

ADJ → NOUN 23 17,04%

NOUN → VERB 16 11,85%

VERB → ADJ 14 10,37%

ADV → PP 14 10,37%

NOUN → ADJ 10 7,41%

VERB → ADV 9 6,70%

ADV → ADJ 6 4,44%

NOUN → ADV 4 2,96%

ADJ → ADV 2 1,48%

Page 13: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Types of word class shifts II

13

English

ST verb

English

ST noun

No shift 350 776

Shift 60 30

The translation direction

English-German is

characterized through shifts

from verbs to other word

classes, in particular to nouns

Page 14: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Cognitive effort I

Does the translation of nouns require more cognitive effort than

the translation of verbs?

Cognitive effort is measured using log-transformed values for

total fixation duration normalized per character

14

Means: noun -1.9

verb -1.78

t = -0.7, df = 94.85, p-value = 0.48

Page 15: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

GTrans: Statt zu einer endgültigen festen Größe zusammenzufallen, nahm

die Höhe des zusammengeknüllten Papierballs weiter ab, und zwar auch

noch drei Wochen, [nachdem das Gewicht [angewendet]Verb wurde]Clause.

(KLTC PROBRAL GT5)

Cognitive effort II

15

Means: n-v -2.05

v-n -1.65

t = -1.57, df = 33.1, p-value = 0.13

Slightly lower mean for the total

fixation duration associated with shifts

from nouns to verbs could be

potentially explained through reduction

of grammatical complexity

EO: Instead of collapsing to a final fixed size, the height of the crushed ball

continued to decrease, even three weeks [after the [application]Noun of

weight]NP.

Page 16: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Intermediate versions I

Verb to Noun shifts: Verb Verb Noun (3x)

Verb Noun Noun (1x)

16

EO: Crumpling a sheet of paper seems simple and doesn't require much

effort, but explaining [why the crumpled ball [behaves]Verb the way it

does]Clause is another matter entirely.

GTrans: Ein Blatt Papier zusammen zu knüllen, erscheint einfach und

erfordert wenig Anstrengung; [die [Verhaltensweise]Noun des

Papierknäuels]NP zu erklären, ist dagegen eine völlig andere Sache. (KLTC

PROBRAL GT7

GTrans_i: Ein Blatt Papier zusammen zu knüllen, erscheint einfach und

erfordert wenig Anstrengung, jedoch zu erklären, [warum der

zeras Papierknäuel sich so [verhält]Verb, wie es das

tut]Clause, ist eine völlig andere Sache.

Page 17: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

GTrans: [Nachdem sie ein dünnes Blatt aluminiumbeschichtetes Mylar

[verknittert]Verb hatten]Clause, gaben sie es in einen Zylinder. (KLTC

PROBRAL GT3)

Intermediate versions II

Noun to Verb shifts: Noun Noun Verb (1x)

Noun Verb (Verb) Verb (2x)

17

EO: [After the [crumpling]Noun of a sheet of thin aluminized Mylar]PP, the

researchers placed it inside a cylinder.

GTrans: [Nachdem sie ein dünnes Blatt aluminiumbeschichtetes Mylar

[verkrumpelt]Verb hatten]Clause, gaben sie es in einen Zylinder.

GTrans: [Nachdem sie ein dünnes Blatt aluminiumbeschichtetes Mylar

[verknäuelt]Verb hatten]Clause, gaben sie es in einen Zylinder.

Page 18: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Conclusion & Outlook

An application of a Keystroke Logged Translation Corpus to

triangulate product and process data

Shifts from verbs in the ST to nouns in the final TT

the most pervasive type of shifts in the translation direction

English-German

Verbs shifted to nouns are fixated slightly longer than nouns

shifted to verbs

Both types of shifts can involve intermediate stages (either ST

POS or TT POS, i.e. a synonym of the item in the final TT)

Determining cognitive effort based not only on the shift in the

final but also intermediate translation versions more data

points required

Taking into account further indicators of cognitive effort (further

combining eye-tracking and keystroke logging data streams)

18

Page 19: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

e-cosmos platform

Creating a web-based platform for different scenarios of

multimodal data integration and analysis

Translation data

Integration: text, keystroke logging and eye-tracking data

Identification of word tokens in the intermediate translation

versions and their linguistic annotation

Query tool for quantitative analyses of product and process data

(cf. Carl & Jakobsen 2006)

19

e-cosmos platform

Linguistics Computer Science Information Management

Page 20: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

Thank you for your attention!

Tatiana Serbina

[email protected]

RWTH Aachen University

Templergraben 55

52056 Aachen

www.rwth-aachen.de

Page 21: Changes of word class during the translation process · Changes of word class during the translation process Insights from a combined analysis of keystroke logging and eye-tracking

References

Alves, Fabio, and Célia Magalhaes. 2004. “Using Small Corpora to Tap and Map the Process-product Interface in Translation.” TradTerm

10: 179–211.

Alves, Fabio, and Daniel Couto Vale. 2009. “Probing the unit of translation in time: Aspects of the design and development of a web

application for storing, annotating, and querying translation process data.” Across Languages and Cultures 10 (2): 251–73.

Alves, Fabio, and Daniel Couto Vale. 2011. “On drafting and revision in translation: On drafting and revision in translation: A corpus

linguistics oriented analysis of translation process data.” Translation: Computation, Corpora, Cognition 1: 105–22.

Carl, Michael, and Arnt LykkeJakobsen. 2009. Objectives for a query language for user-activity data. In 6th International Natural

Language Processing and Cognitive Science Workshop, Milano, Italy.

Čulo, Oliver, Silvia Hansen-Schirra, Stella Neumann, and Mihaela Vela. 2008. “Empirical studies on language contrast using the English-

German comparable and parallel CroCo corpus.” In Proceedings of the LREC 2008 Workshop “Building and Using Comparable Corpora”,

47–51. Marrakesh, Morrocco. http://www.dfki.de/lt/publication_show.php?id=3991.

Cyrus, Lea. 2009. “Old concepts, new ideas: Approaches to translation shifts.” MonTI. Monografías de Traducción e Interpretación 1: 87–

106.

Halliday, Michael A. K., and Christian M. I. M. Matthiessen. 2013. An introduction to functional grammar. London: Arnold.

Halverson, Sandra L. 2007. “A cognitive linguistic approach to translation shifts.” In The study of language and translation, edited by Willy

Vandeweghe, Sonia Vandepitte, and Marc van de Velde, 105–21. Amsterdam: Benjamins.

Hansen-Schirra, Silvia, Stella Neumann, and Erich Steiner. 2012. Cross linguistic corpora for the study of translations: Insights from the

language pair English-German. Berlin: de Gruyter.

Hansen-Schirra, Silvia, Stella Neumann, and Mihaela Vela. 2006. Multi-dimensional annotation and alignment in an English-German

translation corpus. In Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-Dimensional Markup in Natural

Language Processing, EACL 2006, (pp. 35–42). Trento, Italy.

Neumann, Stella, Adriana Pagano, Fabio Alves, Piritta Pyykkönen, Igor da Silva. 2010. Targeting (de-) metaphorization: Process-based

insights. The 22nd European Systemic Functional Linguistics Conferece and Workshop. 9th–11th July 2010. Koper, Slovenija.

Serbina, Tatiana, Paula Niemietz, and Stella Neumann. Forthcoming. Development of a keystroke logged translation corpus. In: Claudio

Fantinuoli und Federico Zanettin (eds.): Parallel corpora for translation studies: Language Science Press.

Steiner, Erich. 2001. Translations English-German. Investigating the relative importance of systemic contrasts and the text-type

"translation". In: SPRIKreports Reports of the Project Languages in Contrast 7, S. 1–49.

Vinay, Jean-Paul, and Jean Darbelnet. 1995 (1958). Comparative stylistics of French and English: A methodology for translation.

Amsterdam: Benjamins. Translated and edited by Juan C. Sager and M.-J. Hamel.

21