itinera nova in the world(s) of crowdsourcing and tei

39
Itinera Nova in the World(s) of Crowdsourcing and TEI Ben Brumfield International Colloquium Itinera Nova

Upload: itinera-nova

Post on 02-Jul-2015

90 views

Category:

Business


2 download

DESCRIPTION

By Ben Brumfield Colloquium Itinera Nova. Tools, people & history (City Archives Leuven) April 25th, 2013

TRANSCRIPT

Page 1: Itinera nova in the World(s) of Crowdsourcing and TEI

Itinera Nova in the World(s) of Crowdsourcing and TEI

Ben Brumfield

International Colloquium Itinera Nova

Page 2: Itinera nova in the World(s) of Crowdsourcing and TEI

Crowdsourced Transcription

Offline projects from the 1990s

Van Papier Naar Digitaal (NL)

FreeBMD/FreeREG/FreeCEN (GB)

Demogen (BE)

Arkivalieronline (DK)

Western Michigan Genealogy Society (US)

Page 3: Itinera nova in the World(s) of Crowdsourcing and TEI

Crowdsourced Transcription

Online tools developed from 2005

Diverse projects released from 2006

2006 FamilySearch Indexing

2008 FromThePage

2008 Wikisource (+ ProofreadPage)

2009 North American Bird Phenology Program

Page 4: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 5: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 6: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 7: Itinera nova in the World(s) of Crowdsourcing and TEI

Tension

Volunteer transcribers vs. Professional editors

Easy tools vs. Powerful tools

Page 8: Itinera nova in the World(s) of Crowdsourcing and TEI

Easy Tools, Hard Mark-up

Amateurs + Mark-up = ???

So get rid of the mark-up, right?

Page 9: Itinera nova in the World(s) of Crowdsourcing and TEI

Power vs Usability

• Power can enable users.

• Lack of power frustrates users.

• For transcription, mark-up is power.

Page 10: Itinera nova in the World(s) of Crowdsourcing and TEI

Power vs Usability

• A little story about scrambled eggs...

Page 11: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 12: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 13: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 14: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 15: Itinera nova in the World(s) of Crowdsourcing and TEI

TEI

Ultimate in mark-up

Standard since 1990

Ubiquitous in scholarly editing

Usually hand-edited XML in offline tools“TEI? That's just for data entry.”

Page 16: Itinera nova in the World(s) of Crowdsourcing and TEI

TEI

Strengths– Powerful data model

– Tools for presentation and analysis– Active community

Genetic Edition Module– Represents changes to texts– Still in development

Page 17: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 18: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 19: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 20: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 21: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 22: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 23: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 24: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 25: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 26: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 27: Itinera nova in the World(s) of Crowdsourcing and TEI

TEI

But how was that encoded?

Page 28: Itinera nova in the World(s) of Crowdsourcing and TEI
Page 29: Itinera nova in the World(s) of Crowdsourcing and TEI

TEI

Amateurs + TEI = ????

Rarely attempted– 29 projects in crowdsourced transcription tool

directory– Only 7 claim to “support TEI”

Page 30: Itinera nova in the World(s) of Crowdsourcing and TEI

TEI + Amateurs

• Tag Buttons– T-PEN/CCL

– TEI Toolbar (TB)

• Tag Menus– VdU– Papyrological Editor

Page 31: Itinera nova in the World(s) of Crowdsourcing and TEI

Buttons: Transcribe Bentham

Page 32: Itinera nova in the World(s) of Crowdsourcing and TEI

Menus: MOM-CA

Page 33: Itinera nova in the World(s) of Crowdsourcing and TEI

Button Limitations

• Users outgrow buttons– “I believe one or two transcribers now add

tags manually rather than use the toolbar, which says something about the improvement in their IT skills.” –Tim Causer (TB)

• Users ignore buttons– “One editor for exampled prefered to put || for

<lb> as he was used from the preparation of a printed edition.” –Georg Vogeler (VdU)

Page 34: Itinera nova in the World(s) of Crowdsourcing and TEI

Button Limitations

“...there were something like 67 necessary buttons, and it was maddening to fish around for the desired button. And the research assistants, who had been encoding in oXygen, just typed in angle brackets and memorized tags, instead of using the buttons. .” – Abigail Firey (CCL)

Page 35: Itinera nova in the World(s) of Crowdsourcing and TEI

A New Way

Page 36: Itinera nova in the World(s) of Crowdsourcing and TEI

A New Way

Page 37: Itinera nova in the World(s) of Crowdsourcing and TEI

A New Way

Page 38: Itinera nova in the World(s) of Crowdsourcing and TEI

A New Way

For data entry, consider alternatives to TEI– Existing print notations (e.g. Leiden+)

– Robust data entry tools

Use TEI for data models and presentation– Papyrological Editor

Opportunities to combine crowdsourcing tools with TEI– Skylark project at UMD MITH

• Zooniverse transcription components

• Genetic edition TEI module

Page 39: Itinera nova in the World(s) of Crowdsourcing and TEI

Questions

Ben Brumfield

@benwbrum

[email protected]

FromThePage.comSlides and transcript at

http://manuscripttranscription.blogspot.com/