wikisource
TRANSCRIPT
![Page 1: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/1.jpg)
Wikisource
Where we areWhere we want to go
Andrea ZanniWikimedia Italia
Wikimania 2012
![Page 2: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/2.jpg)
The library of Babel
“The universe (which others call the library)...”
J. L. Borgeshttp://en.wikipedia.org/wiki/The_Library_of_Babel
![Page 3: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/3.jpg)
What are digital libraries?
Nobody really knows, but many agree on requirements:
1. Collection2. Metadata3. Services4. People
![Page 4: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/4.jpg)
Collection
● Reliability● Readability ● Curation ● Quality
![Page 5: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/5.jpg)
It's a kind of magic..Metadata are used for ● cataloging ● indexing● retrieving ● archiving ● communicating ● preserving information.
If we want to deal with information, we need metadata.
![Page 6: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/6.jpg)
It's a kind of magic..Metadata are used for ● cataloging ● indexing● retrieving ● archiving ● communicating ● preserving information.
If we want to deal with information, we need metadata.
![Page 7: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/7.jpg)
Metadata
On Wikisource metadata contains information about books and authors
● in simple text● human-readable ● no standard● not interoperable
… no magic :-(
![Page 8: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/8.jpg)
Services
Everything that is beyond books:
● reference● (all kind of) categories ● lists● links● context● disambiguation ● redirects
![Page 9: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/9.jpg)
People
Librarians (and users) form the community (we are not Google books!)
● curation → books, project, policies ● empowerment → from users to librarians
![Page 10: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/10.jpg)
5th law of Library Science
“The library is a growing organism”
S. R. Ranganathanhttp://en.wikipedia.org/wiki/Five_laws_of_library_science
![Page 11: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/11.jpg)
Hyperlibrary: Xanadu 0.1
“Can you imagine that they used to have libraries where the books didn't talk to each other?”
Marvin Minsky[citation needed]
![Page 12: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/12.jpg)
“Collaboratory”
read write
laboratory
● Tools → framework → other tools ● MediaWiki, js, templates, python, bot, API,
toolserver, ...
library
![Page 13: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/13.jpg)
The future
![Page 14: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/14.jpg)
Interoperability
● Bibliographic data from OCLC, Open Library, catalogs.
● Disseminate metadata and full text (OAI-PMH)
● Wikisource API
![Page 15: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/15.jpg)
ePub
Fresh generated ePub on the rocks (via ePub converter)
● outreach● eReader apps
![Page 16: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/16.jpg)
Classification
Potential of MediaWiki categories:● Colon classification● subjects from National Libraries● thesauri and ontologies
![Page 17: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/17.jpg)
Microcontribution
“the more simple and small task is, the wider audience you get”
● Citizen science (Galaxy Zoo, Ancient Lives)● from page unit to word unit
More: WikiCaptcha (next presentation same room!)
![Page 18: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/18.jpg)
New architecture on djvu
● in-line transcription● high granularity● save text directly on djvu● multiple layers
![Page 19: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/19.jpg)
![Page 20: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/20.jpg)
Xanadu 0.2
Systematic use of transclusion● Interwiki● Wikiquote● Wikipedia● Blogs, websites, etc.
![Page 21: Wikisource](https://reader034.vdocuments.mx/reader034/viewer/2022042818/55c13fb0bb61ebcb478b4661/html5/thumbnails/21.jpg)
Born-digital documents processNo specific process: must pass through the whole process for digitized files
1) Djvu2) OCR3) Commons4) Transcription
Collaboration with repositories and digital libraries (scientific articles, thesis, free documents).