Europeana Newspapers (project details and aggregation workflow)
Search and Browse Europe’s Historical Newspapers: The Europeana Newspapers Content Browser
Connecting knowledge
Project Details Newspapers Content Browser
• Allowing the search and browsing of historic newspapers and putting them within everyone’s reach
• A three-year project from February 2012 to January 2015
• Funded under the European Commission’s CIP 2007 – 2013 Programme
• Aggregating 18 million historic newspaper pages for Europeana and The European Library
• Converting 10 million newspaper pages to full-text, helping users quickly search for specific articles, people and destinations mentioned within the newspaper
• Building a special content viewer to improve online newspaper browsing
• Building tools for professionals, which will better assess the quality of newspaper digitization in relation to levels of detail, speed and costs.
Our Newspapers Content Browser will launch in early 2014. For the latest information, check our websites (http://www.europeana-newspapers.eu and http://www.theeuropeanlibrary.org) and follow us on Twitter (@eurnews).
Newspaper Main Page:
• Explore historical newspapers using various filters, for example title, date, provider, language, popularity and country.
[Diagram: aggregation workflow. Components shown: hard discs from UIBK/CCS, partners' servers, metadata repository, databases (DB), enrichment and format normalization, TEL IIP image server, external image server, full-text repository, full-text index. Steps shown: harvest metadata, store metadata, XSLT transformations, copy/transform METS/ALTO, enrich, index.]
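The workflow names METS/ALTO as the full-text format and ends in a full-text index. As a rough illustration of that last step, the sketch below reads the text lines out of one ALTO page and builds a tiny in-memory inverted index. It is not the project's implementation: the sample page, the page identifier `page-1`, and the helper names `alto_lines` and `build_index` are assumptions made for this example, and a production system would use a dedicated search engine rather than a Python dictionary.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical, heavily shortened ALTO page (assumption for illustration).
# Real ALTO files carry more structure and a version-specific namespace.
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v2#">
  <Layout><Page ID="P1"><PrintSpace><TextBlock ID="TB1">
    <TextLine>
      <String CONTENT="Berliner" HPOS="100" VPOS="200" WIDTH="90" HEIGHT="20"/>
      <SP/>
      <String CONTENT="Zeitung" HPOS="200" VPOS="200" WIDTH="95" HEIGHT="20"/>
    </TextLine>
    <TextLine>
      <String CONTENT="Nachrichten" HPOS="100" VPOS="230" WIDTH="130" HEIGHT="20"/>
      <SP/>
      <String CONTENT="aus" HPOS="240" VPOS="230" WIDTH="40" HEIGHT="20"/>
      <SP/>
      <String CONTENT="Wien." HPOS="290" VPOS="230" WIDTH="60" HEIGHT="20"/>
    </TextLine>
  </TextBlock></PrintSpace></Page></Layout>
</alto>"""


def local_name(tag):
    """Drop the '{namespace}' prefix so any ALTO version can be read."""
    return tag.rsplit("}", 1)[-1]


def alto_lines(xml_text):
    """Return one string per TextLine, joining the CONTENT of its String children."""
    root = ET.fromstring(xml_text)
    lines = []
    for elem in root.iter():
        if local_name(elem.tag) == "TextLine":
            words = [child.get("CONTENT", "")
                     for child in elem if local_name(child.tag) == "String"]
            lines.append(" ".join(words))
    return lines


def build_index(pages):
    """Map each lower-cased token to the set of page ids that contain it."""
    index = defaultdict(set)
    for page_id, xml_text in pages.items():
        for line in alto_lines(xml_text):
            for raw in line.lower().split():
                token = raw.strip(".,;:!?")
                if token:
                    index[token].add(page_id)
    return index


index = build_index({"page-1": SAMPLE_ALTO})
print(alto_lines(SAMPLE_ALTO))
print(sorted(index))
```

Matching on the local tag name instead of a fixed namespace is a deliberate choice here, since content from different providers may use different ALTO versions.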
Newspaper Gallery:
• Navigate to the newspaper record page to search for available issue dates and other publication details
Newspaper Results Page:
• Search by newspaper title or within newspaper content
• Refine your search, filter results, obtain search suggestions etc.
Newspaper Issue Page:
• Search within full-text panel or image viewer
• Find a specific word/article and view its corresponding image
• Search within the newspaper image and navigate to the highlighted full-text section.
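Linking a word in the full text to its place on the page image depends on per-word coordinates. ALTO stores these on each `String` element as the attributes `HPOS`, `VPOS`, `WIDTH` and `HEIGHT`. The sketch below shows one way a viewer could turn a search term into rectangles to highlight. The sample page and the function name `find_word_boxes` are assumptions made for this example and do not describe the browser's actual code. Coordinates are interpreted in whatever measurement unit the ALTO file declares.

```python
import xml.etree.ElementTree as ET

# Hypothetical, shortened ALTO page (assumption for illustration).
SAMPLE_PAGE = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v2#">
  <Layout><Page ID="P1"><PrintSpace><TextBlock ID="TB1">
    <TextLine>
      <String CONTENT="Berlin" HPOS="100" VPOS="200" WIDTH="80" HEIGHT="20"/>
      <String CONTENT="heute" HPOS="190" VPOS="200" WIDTH="70" HEIGHT="20"/>
    </TextLine>
    <TextLine>
      <String CONTENT="aus" HPOS="100" VPOS="230" WIDTH="40" HEIGHT="20"/>
      <String CONTENT="Berlin." HPOS="150" VPOS="230" WIDTH="85" HEIGHT="20"/>
    </TextLine>
  </TextBlock></PrintSpace></Page></Layout>
</alto>"""


def find_word_boxes(xml_text, query):
    """Return (x, y, width, height) for every word matching the query.

    Matching ignores case and trailing or leading punctuation.
    """
    wanted = query.casefold()
    boxes = []
    for elem in ET.fromstring(xml_text).iter():
        # Compare on the local tag name so the ALTO namespace version does not matter.
        if elem.tag.rsplit("}", 1)[-1] != "String":
            continue
        word = elem.get("CONTENT", "").casefold().strip(".,;:!?")
        if word == wanted:
            boxes.append(tuple(
                int(float(elem.get(name, "0")))
                for name in ("HPOS", "VPOS", "WIDTH", "HEIGHT")
            ))
    return boxes


print(find_word_boxes(SAMPLE_PAGE, "berlin"))
```

A viewer could draw one overlay rectangle per returned tuple on the page image, after scaling the values to the current zoom level.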
Navigate to provider’s image server
Explore other search results within a particular issue
Search within full-text and refine your search
Search newspaper titles
Navigate to newspaper issue by date
Newspaper Viewer:
• Specific browsing tools available, for example, zooming, navigating etc.
• Leaf through newspapers by page number and issue date
Search full-text
Search by newspaper
Search within full-text and image viewer
Newspaper page links to provider
Newspapers Aggregation Workflow
Project Innovations
• Dynamic image retrieval from partner libraries
• Named Entity extraction
• Searching across named national boundaries
• What was published on a single day: an international perspective
Alena Fedasenka, Markus Muhr, Elizabeth Joss, Anastasia Gasia, Alastair Dunning
This project runs from February 2012 to February 2015. It is led by the Staatsbibliothek zu Berlin and co-funded by the European Commission under the Competitiveness and Innovation Framework Programme. http://ec.europa.eu/ict_psp
Partners providing digitized content
Learn More