margaret cargill, michelle picard and cally guerin 25 november 2011
DESCRIPTION
Corpus-based approaches for research students: Using student-made corpora to promote autonomous learning. Margaret Cargill, Michelle Picard and Cally Guerin 25 November 2011. Corpus linguistics approaches. Corpus: a body of text selected for analysis using appropriate software - PowerPoint PPT PresentationTRANSCRIPT
School of Agriculture, Food & Wine/Researcher Education & Development
AALL Conference 2011: Forging New Directions
Life Impact | The University of Adelaide
Corpus-based approaches for research students:
Using student-made corpora to promote
autonomous learning Margaret Cargill, Michelle Picard and
Cally Guerin25 November 2011
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Corpus linguistics approaches
• Corpus: a body of text selected for analysis using appropriate software
• The software: a concordancer• Available as web-based applications
– e.g. Springer Exemplar www.springerexemplar.com
• or stand-alone programs, e.g.– www.adelaide.edu.au/red/adtat/
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Concordancing• A concordancer is software that
searches a group of texts (a corpus) for all examples of a particular item.
• It displays the results as lines of text across the screen for easy comparison
• Results can be sorted according to what is on the left or the right of the search item
• This can provide data to ‘drive’ language learning and improve written texts
An example of concordancing outputto utilise existing available soil water, unlike the perennial gres (4 g oven dry wt basis) of soil were weighed into 40 ml polyprrequired 9 kg P/ha, whereas a soil with a high P sorption capacitconcentration by 1 mg/kg on a soil with a low P sorption capacity00, it was expected that this soil would have consistently been t capacity (PBC), which is the soil's capacity to moderate changesand buffering capacity of the soil-an attempt to test Schofield'snisms that are present in the soil-plant microcosm environment. Tetermined in a growth-chamber soil-plant microcosm study. Nodding84) Lime and phosphate in the soil-plant system. Advances in Agroa where crops rely heavily on soil-stored water accrued in summerns Between Herbicides and the Soil. Academic Press, London, pp. 2fertility on these particular soils. Although this aberration hasover in a range of allophanic soils amended with 14Clabelled glucalues for 9 different pasture soils, 6 and 12 months after P fert
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Data-driven language learning
• Very relevant for discipline-specific English and usage conventions
• Empowers users to research their own language issues
• Appeals particularly to research students
• Hypothesised to contribute to autonomous learning approaches
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Making your own corpus • In order to make this tool
optimally effective for writing/self-editing, you need to (find or) make a corpus that is specific to the task at hand
• Texts need to be in.txt files• ‘Save as’ .txt from .doc, .html or
.pdf files– ‘Cleanup’ may be needed after
saving
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Using a self-made corpus?• A simple concordancer called AdTAT
(Adelaide Text Analysis Tool) is available at http://www.adelaide.edu.au/red/adtat/
• Recently developed at the University of Adelaide and made available freely for use
• Designed for authors focused on writing texts, not for linguistic researchers
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Roundtable structure• Margaret: Self-made and online
corpora in an EFL ESP context• Michelle: Use with Turn-it-in to
investigate intertextuality• Cally:
– Discussion and questions– Uptake options within ALL
Slide 8
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Self-made and online corpora in an EFL ESP context• China Academy of Engineering Physics,
Mianyang, Sichuan• Consecutive 5-day workshops, 40+
participants (5 to date)• Mixed disciplines within engineering
physics• Working researchers, some without
higher degrees, English level variable• Strong and increasing pressure to
publish in EnglishSlide 9
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Resulting workshop design features• Emphasis required on listening,
speaking, reading and writing, but all integrated with publication focus
• Day 1, afternoon: ‘Developing discipline-specific English writing skills’– Re-using language vs plagiarism (sentence
templates)– Noun phrases and articles– Identifying vocabulary for learning →
concordancingSlide 10
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
“Selecting noun phrases to learn
• Extending vocabulary is an ongoing need for EAL scientist authors
• One effective way to select vocabulary to learn is to use a word frequency list from your own discipline.
• Such a word list can be created by using concordancing software to search a collection of discipline-specific texts such as research articles.
• These text collections are called corpora (sing. =corpus)”
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
To make a Frequency list• Open AdTAT• Load a corpus• From top menu bar select
Corpus, Word frequency• Resulting screen lists all words
in the corpus in order of frequency of use
Slide 12
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Needed: a discipline-specific corpus• To demonstrate effectiveness, I
organise preparation of a CAEP corpus
• Participants are asked to each prepare a single file for homework
Slide 13
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
“Homework: Building a CAEP corpus• Each participant will prepare
one research article for the corpus tonight
• Prepared files should be sent as an email attachment to 刘希 <[email protected]>
• Include full bibliographic details of the article (the full reference) in the body of the email”
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
“Preparing text for a corpus• Select articles written by native-speakers of English wherever possible.
• Texts in a corpus for concordancing must be stored as plain text files (.txt).
• Remove un-needed parts before saving: biodata, keywords, tables, figures, reference list, acknowledgements.
• An easy way to do this is to download your selected articles as .html files, remove unwanted parts and ‘save as’ .txt
• If you can only get .pdf versions, use ‘Save as text’ option – OR copy desired text one page/ column at a time into a word document, correct spacing, and save as .txt
• Label the file with name of journal, first author and year of paper”
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Issues re student preparation of texts for corpora• Following directions!• Selection (especially author language
status)• Cleaning of text (headers, footers,
page numbers/labels, figures/tables, author affiliations, etc.
• Provision of reference data for record keeping purposes
• Consistent file labellingSlide 16
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
My response• Use files as received• Conduct a search every time an
issue arises in student drafts that can be addressed this way
• When anomalies occur in search outcomes, point out the link to inappropriate corpus preparation
• Use Exemplar as a comparisonSlide 17
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Demonstration/discussion• I will load into AdTAT the 3
corpora made by CAEP participants in 2010 and 2011
• Together we will run some searches to see – What questions we can answer– What anomalies we may find in the
corpusSlide 18
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Types of searches: Collocations
• Sort left to find verbs or adjectives that go with a search noun (e.g. reason) or adverbs that go with a search verb (vary)
• Sort right to find prepositions that go with a search noun (role) or verb (compare)
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Types of searches: Usage conventions
• Search for we to see if it is used in the genre of interest
• Search for also to see if it appears at the start of a sentence (i.e. with a capital letter – AdTAT is case-sensitive)
• bioinformatical or bioinformatic?– No examples of -al in self-made corpus –
but strong supervisor preference– Check Exemplar site
Slide 21Can give writers agency to counter supervisor idiosyncrasy
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
“Springer Exemplar• Go to www.springerexemplar.com • Choose to search a field or a journal• It is web-based, so no software download• BUT, you cannot
– sort output to answer your own questions– see more context than a few words– know if the English is native-speaker or not
• These problems are solved with AdTAT”
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Uses of the Exemplar site• When your discipline-specific corpus
has no or too few examples of a term• When you do not have an appropriate
corpus to search• When you want to compare usage
more widely than your own small corpus allows but still specific to the discipline
• e.g. evolvement (geology); bioinformatical (plant science)
Slide 23
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Other web concordancers: Uses
• See Virtual Language Centre Web Concordancer at http://www.edict.biz/concordance/WWWConcappE.htm • Allows choice of corpora to search, including
the Brown Corpus of US general English • To demonstrate differences between
discipline-specific and general English usage– e.g. And at start of sentence, or use of
contractions ending in n’t
Slide 28
But a similar AdTAT search of a corpus from the New Phytologist journal (plant science, impact factor over 5) finds 0 examples …
US general English
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
To summarise…• High potential usefulness of both self-
made and online corpora, especially where ‘native-speaker’ models are lacking
• Labour of constructing a corpus can be seen as a disincentive to use of self-made corpora
• Hands-on demonstration to address students’ actual errors can – help overcome this perception, and– provide needed training in search construction
Slide 32
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Corpora and Concordancing
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
The research reversal?
You can use the same text as long
as you cite
We all use the same
words
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
‘Obligatory intertextuality’• Document structure intertextuality
• Engaging with the literature
• Co-authored texts
• Discipline-specific language
(Eira, 2007)
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Unacceptable intertextuality
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Explicit instruction• Take focussed notes • Separate the “English” from the
“science/ content” • Assemble notes related to topics • “Story” each paragraph in dot-points • Identifying acceptably recyclable text
Researcher Education & DevelopmentAdelaide Graduate Centre
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Intelligently reading reports
Researcher Education & DevelopmentAdelaide Graduate Centre
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Categorising matches Category Action required Too close the text is too similar to the
source and needs to be paraphrased or rewritten
Not relevant some other text has been highlighted, e.g., a formula or a bibliographic reference
Discipline-specific phrase this is the way that concept must be expressed in this context
Unsure anything else the student does not know how to categorise
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Too original: HamdaFor students who are producing unidiomatic awkward, non-standard usage) and/or non-academic English
• Step 1 Concordancer • (focussing on collocations, standard
phrases) • Step 2 Text-matching to check for originality- Unrelated matches- Standard strings- Sentence templates- Some discipline-specific language
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Not original: Weimin For students who are patchwriting with little understanding of referencing and citation conventions
• Step 1 Training in the use of bibliographic software
• Step 2 Instruction in note-taking & organising writing
• Step 3 Text-matching to check for originality• Step 4 Concordancer (focussing on general
usage and discipline-specific language)
- Phrases commonly used in general academic English- Discipline-specific language - Unacceptable recycling
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Refining authorial voice: Liang• For students who have a good grasp of both
citation conventions and reasonably high-level English expression.
• Step 1 Text-matching to check for originality• Step 2 Concordancer (focussing on general
usage and discipline-specific language)
• Step 3 General Google Scholar search- Matches with unrelated student writing- Discipline-specific language- Springer Exemplar
Researcher Education & DevelopmentAdelaide Graduate Centre
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Examples of intertexuality
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Corpus output
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Concrete Outcomes • Increased participants’ understanding of
acceptable and unacceptable intertextuality
• Enhanced participants’ knowledge of disciplinary language
• Developed participants’ autonomy through stages of a reflective process, enabling them to “critically review [their] suppositions of subject discipline and existing knowledge” (Chan et al. 2002:515)
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Questions?
School of Agriculture, Food & Wine/Researcher Education & Development
Life Impact | The University of Adelaide
Your thoughts• How much instruction is required
from ALL staff? Will this process promote autonomy?
• Difficulties in implementation in your own situation?
• Other uses for the software?
Slide 47