An annotated Spanish corpus for Corpus-based CALL in professional contexts

María Sánchez-Tornel, Pascual Pérez-Paredes, José M. Alcaraz Calero

Authenticating Language Learning: Web Collaboration Meets Pedagogic Corpora. February 17-19, 2011. University of Tübingen


Page 1: An annotated Spanish corpus for Corpus-based CALL in professional contexts

An annotated Spanish corpus for Corpus-based CALL in professional contexts

María Sánchez-Tornel

Pascual Pérez-Paredes

José M. Alcaraz Calero

Authenticating Language Learning: Web Collaboration Meets Pedagogic Corpora February 17-19, 2011. University of Tübingen

Page 2: An annotated Spanish corpus for Corpus-based CALL in professional contexts

OUTLINE:

1. Corpora in FLT
• Background
• Our proposal

2. The Backbone project

3. The Spanish subcorpus
• Features
• Our approach
• Corpus compilation
• Pedagogic enrichment
• Corpus exploitation

4. Conclusion

Page 3: An annotated Spanish corpus for Corpus-based CALL in professional contexts

1. Corpora in FLT

Page 4: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Background

Page 5: An annotated Spanish corpus for Corpus-based CALL in professional contexts

1. Scant scholarly attention

Bar chart: number of empirical studies (1991-2010) by context. Tertiary education: 67. Non-tertiary education: 9.

Source: Boulton (2010)

Page 6: An annotated Spanish corpus for Corpus-based CALL in professional contexts

CAUTION!

Page 7: An annotated Spanish corpus for Corpus-based CALL in professional contexts

2. Different contexts: different students with different objectives, abilities and needs

Page 8: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Advanced corpus users

Research purposes

Translation

L2 learning at university

Page 9: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Novice corpus users

L2 learning at high school

CLIL

Page 11: An annotated Spanish corpus for Corpus-based CALL in professional contexts

CLIL – Content and Language Integrated Learning

CLIL

Using the language to learn

Learning to use the language

Coyle (2007)

Page 12: An annotated Spanish corpus for Corpus-based CALL in professional contexts

CLIL – Content and Language Integrated Learning

Eurydice (2006:51)

OBSTACLES

Legislation

Qualified staff

Financial restrictions

Materials

Content

Language

Page 13: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Our proposal

Page 14: An annotated Spanish corpus for Corpus-based CALL in professional contexts

1. Scant attention

2. Learners’ profile

3. CLIL obstacles (materials)

PEDAGOGIC CORPORA

Page 15: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Why pedagogical?

Page 16: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Large, general corpora

Novice users

Photo: Kordite, Flickr

Page 17: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Homogeneous and systematic

Thematic relevance

Recontextualisation

Authentication

Easy to use query tools and search options

Braun (2006)

Page 18: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Past initiatives

ELISA

English Language Interview Corpus as a Second-Language Application

• 2003 – 2004

• 25 video interviews in English

• 5-15 minutes per interview

• 60,000 words

• Search interface

• Learning materials

• 2005 – 2008

• Video interviews in 7 EU languages - Teen talk

• Corpus compilation and exploitation tools

• Learning materials

• Corpora + tools freely available

Page 19: An annotated Spanish corpus for Corpus-based CALL in professional contexts

2. The Backbone project

Page 20: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Lesser taught languages

DIY approach – small spoken corpora

Non-standard regional varieties

Non-native varieties of ELF

Page 21: An annotated Spanish corpus for Corpus-based CALL in professional contexts

CLIL settings: vocational training, secondary education, university.

BLENDED LEARNING PRINCIPLES

Learner centeredness

Relevant topics

Connection to CEF progression

ICT implementation

Free online corpora + materials

Moodle integration

Page 22: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Language authentication

Small homogeneous corpora

Pedagogically relevant topics

Multimodality

Full texts – sections – concordances

Page 23: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Data-driven learning but…

… pedagogically selected, annotated and enriched data

Page 24: An annotated Spanish corpus for Corpus-based CALL in professional contexts

3. The Spanish subcorpus

Page 25: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Features

Page 26: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Size

25 interviews

53,000 words

300 minutes of video recordings

Page 27: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Speakers from 9 different provinces

Regional varieties

Northern and southern accents

Page 28: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Cultural issues

Topics

Science and research

The environment

World of work

Social issues

Economic issues

Healthcare and social security

Government and Politics

Education

Urban and rural life

Page 29: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Our approach

Page 30: An annotated Spanish corpus for Corpus-based CALL in professional contexts

COLLABORATIVE

ANNOTATION

TRANSCRIPTION

MATERIALS DEVELOPMENT

Page 31: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Corpus compilation

Page 32: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Speaker selection

Age range: 18 - 83

9 provinces

Diverse professional fields

doctor

teacher

archaeologist

sportswoman

ex-lawyer

confectioner

bio-farmer

top researcher

entrepreneur

Page 33: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Transcription

Orthographic

TEI-compliant markup

<trunc> </trunc>

<unclear> </unclear>

<break/>

<foreign> </foreign>

<alternative> </alternative>
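
As a rough illustration of how transcripts marked up with the tags above might be processed, the following Python sketch strips the markup to obtain plain reading text and counts the annotation elements. The element names are taken from the slide; the wrapping `<u>` element, the Spanish wording and the function names are assumptions made only for this example and do not describe the Backbone tools themselves.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical transcript fragment. The tag names come from the slide;
# the <u> wrapper and the sentence itself are invented for illustration.
SAMPLE = (
    "<u>bueno <trunc>traba</trunc> trabajo en un <unclear>laboratorio</unclear>"
    "<break/> y hacemos <foreign>research</foreign> cada día</u>"
)


def plain_text(xml_fragment: str) -> str:
    """Return the transcript text with markup removed and spaces normalised."""
    root = ET.fromstring(xml_fragment)
    return " ".join("".join(root.itertext()).split())


def tag_counts(xml_fragment: str) -> Counter:
    """Count how often each annotation element occurs below the root element."""
    root = ET.fromstring(xml_fragment)
    return Counter(el.tag for el in root.iter() if el is not root)


if __name__ == "__main__":
    print(plain_text(SAMPLE))
    print(tag_counts(SAMPLE))
```

Keeping the markup in the stored transcript and stripping it only at display time means the same file can serve both a learner-facing reading view and searches over truncations, unclear passages or foreign words.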

Page 34: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Backbone Transcriptor

Transcribing and sectioning

Supports metadata information

Video formats: DIVX, XVID, AVI, MPEG, QuickTime, RM

Audio formats: MP3, WAV, ASF

Timestamping audio-text

Page 35: An annotated Spanish corpus for Corpus-based CALL in professional contexts
Page 36: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Pedagogic enrichment

Page 37: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Step 1: Annotation

Photo: J_O_I_D

Page 38: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Pedagogic annotation

Unit-bound, not text-bound

Pérez-Paredes (2010)

Page 39: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Pedagogic annotation

Teacher-driven & learner-oriented

Pérez-Paredes (2010)

Page 40: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Pedagogic annotation

Page 41: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Backbone Annotator

TEI-compliant XML

Drag & drop

Edit options in XML

Manages several corpora

Integrated with Transcriptor and Search Tool

Page 42: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Backbone Annotator

Corpus Management Tool

Collaborative annotation

Page 43: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Corpus Management Tool

CMT

Annotator 1: Bob

Annotator 2: David

Annotator 3: Helen

Annotator 4: Hugh

Annotator 5: Jane

Sánchez-Tornel et al. (Forthcoming)

CORPORA OUTPUT

Backbone Search Tool

Page 44: An annotated Spanish corpus for Corpus-based CALL in professional contexts
Page 45: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Step 2: Materials development

Photo: the waving cat

Page 46: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Thematic relevance

Two types of materials:

- Learning modules

- Corpus-based communicative and exploratory activities

The Virtual Resource Pool

Integration in Search Tool

Page 47: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Learning modules

19 modules – 107 activities

Telos Language Partner

1 section – 1 module – several activities

Comprehension & focus on form

Page 48: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Sample module: Science and society

Learning modules

Page 49: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Comprehension activities

Learning modules

Page 50: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Multiple choice: comprehension

Learning modules

Page 51: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Fill in the gaps

Learning modules

Page 52: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Multiple choice: vocabulary

Learning modules

Page 53: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Matching: idiomatic expressions

Learning modules

Page 54: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Production: word order

Learning modules

Page 55: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Exploratory and communicative activities

10 packages – 93 activities

Corpus exploration:

- lexis-grammar

- communication

Page 56: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Lexis and grammar

Page 57: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Communication

Page 58: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Online integration of learning materials

Learning modules and C&E packages linked to interview sections

The Virtual Resource Pool

Page 59: An annotated Spanish corpus for Corpus-based CALL in professional contexts

The Virtual Resource Pool

Page 60: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Corpus exploitation

Page 61: An annotated Spanish corpus for Corpus-based CALL in professional contexts

The learning space

Access to corpora + learning materials

Four search modes:

- Browse

- Section search

- Concordances

- Co-occurrences

Wordlists

The Search Tool

Page 62: An annotated Spanish corpus for Corpus-based CALL in professional contexts

The browse mode

Page 63: An annotated Spanish corpus for Corpus-based CALL in professional contexts

The section search mode

Page 64: An annotated Spanish corpus for Corpus-based CALL in professional contexts

The section search mode

Page 65: An annotated Spanish corpus for Corpus-based CALL in professional contexts

The co-occurrences search mode
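
To make the idea of a co-occurrence search concrete, here is a minimal Python sketch that finds places where two words appear within a few tokens of each other. It is not the Search Tool's implementation: the function names, the window size and the sample sentence are all assumptions chosen for illustration.

```python
import re


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def cooccurrences(text: str, word_a: str, word_b: str, window: int = 5):
    """Return (i, j) token positions where word_b lies within `window` tokens of word_a."""
    tokens = tokenize(text)
    hits = []
    for i, tok in enumerate(tokens):
        if tok != word_a:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i and tokens[j] == word_b:
                hits.append((i, j))
    return hits


if __name__ == "__main__":
    # Invented sample sentence, not taken from the corpus.
    sample = "El trabajo en el laboratorio es duro, pero el trabajo de campo es más duro."
    print(cooccurrences(sample, "trabajo", "duro", window=5))
```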

Page 66: An annotated Spanish corpus for Corpus-based CALL in professional contexts

The concordances search mode
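
A concordance lists every occurrence of a search word together with its immediate context (keyword in context, KWIC). The Python sketch below shows the general technique under stated assumptions: the function name, the context width and the sample sentence are hypothetical and are not drawn from the Backbone Search Tool.

```python
import re


def concordance(text: str, keyword: str, width: int = 3):
    """Return (left context, keyword, right context) for each hit, `width` tokens per side."""
    tokens = re.findall(r"\w+", text)
    lines = []
    for i, tok in enumerate(tokens):
        if tok.lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, tok, right))
    return lines


if __name__ == "__main__":
    # Invented sample sentence, not taken from the corpus.
    sample = "Me gusta mi trabajo porque el trabajo con niños es muy variado"
    for left, key, right in concordance(sample, "trabajo"):
        # Right-align the left context so the keywords line up in a column.
        print(f"{left:>25} [{key}] {right}")
```

Aligning the keyword in a central column is what lets learners scan vertically for recurring patterns on either side of the word.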

Page 67: An annotated Spanish corpus for Corpus-based CALL in professional contexts

The word lists view

Page 68: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Moodle integration

Page 69: An annotated Spanish corpus for Corpus-based CALL in professional contexts

4. Conclusion

Page 70: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Previously on...

Scant scholarly attention in non-tertiary education settings

CL methods and tools

Pedagogic mediation!

Page 71: An annotated Spanish corpus for Corpus-based CALL in professional contexts
Page 72: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Where we are now…

From the possibilities scenario to the feasibility scenario

Page 73: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Corpora can be exploited in the language classroom in different ways

FLT

Language research-oriented paradigm

CL Methods

Sampling

Representativeness

Morphological tagging

The possibilities scenario

Alcaraz & Pérez-Paredes (2008)

Page 74: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Corpora are devised to be exploited in the language classroom by language learners in different ways

FLT

Language learning-oriented paradigm

Mediation role

Theory-informed tagging

Parametric framework-aware

Learner-oriented tagging

Representative of the world of the learner

The feasibility scenario

Alcaraz & Pérez-Paredes (2008)

Page 75: An annotated Spanish corpus for Corpus-based CALL in professional contexts

References

• Alcaraz, J.M. & Pérez-Paredes, P. (2008). What do annotators annotate? An analysis of language teachers' corpus pedagogical annotation. In A. Frankenburg-Garcia (Ed.), Proceedings of the 8th Teaching and Language Corpora Conference (pp. 27-37). Lisbon, Portugal: Associação de Estudos e de Investigação Científica do ISLA-Lisboa.

• Boulton, A. (2010). Learning outcomes from corpus consultation. In M. Moreno Jaén, F. Serrano Valverde & M. Calzada Pérez (Eds.), Exploring New Paths in Language Pedagogy: Lexis and corpus-based language teaching. London: Equinox. Expanded web supplement available at http://arche.univ-nancy2.fr/file.php/967/DDL_empirical_list.pdf

• Braun, S. (2006). ELISA – a pedagogically enriched corpus for language learning purposes. In S. Braun, K. Kohn & J. Mukherjee (Eds.), Corpus Technology and Language Pedagogy. New Resources, New Tools, New Methods (pp. 25-47). Frankfurt/M.: Peter Lang.

• Coyle, D. (2007). Content and Language Integrated Learning: Towards a connected research agenda for CLIL pedagogies. The International Journal of Bilingual Education and Bilingualism, 10(5), 543-562.

• Eurydice Report. (2006). Content and Language Integrated Learning (CLIL) at School in Europe. Retrieved February 2011 from http://eacea.ec.europa.eu/eurydice/ressources/eurydice/pdf/0_integral/071EN.pdf

• Pérez-Paredes, P. (2010). Appropriation and integration issues in corpus methods and mainstream language education. In T. Harris & C. Pérez Basanta (Eds.), Corpus Linguistics and Language Teaching. Linguistic Insights Series. Berlin: Peter Lang.

• Sánchez-Tornel, M., Alcaraz Calero, J.M. & Pérez-Paredes, P. (in press). Collaborative annotation in implementing corpora for content and language integrated learning web services. Proceedings of the 2009 Eurocall Conference: New trends in CALL – Working together. Madrid: Macmillan ELT.

Page 76: An annotated Spanish corpus for Corpus-based CALL in professional contexts

Acknowledgements

The authors gratefully acknowledge the funding provided by the European Commission. This publication reflects the views only of the authors, and the Commission cannot be held responsible for any use which may be made of the information contained therein. BACKBONE 143502-2008-LLP-DE-KA2-KA2MP

María Sánchez-Tornel gratefully acknowledges the support provided by the University of Murcia under its PhD Scholarships Programme (R.-549/2009).

Page 77: An annotated Spanish corpus for Corpus-based CALL in professional contexts

María Sánchez-Tornel [email protected]

Pascual Pérez-Paredes [email protected]

José M. Alcaraz Calero [email protected]

Authenticating Language Learning: Web Collaboration Meets Pedagogic Corpora February 17-19, 2011. University of Tübingen

Thank you!

www.um.es/backbone

http://u-002-segsv001.uni-tuebingen.de/backbone/moodle/