Primo at the University of Amsterdam - Technology vs. Real Life
TRANSCRIPT
University library
Primo at the University of Amsterdam
Technology vs Real Life
Lukas Koster - Library Systems Coordinator - Library of the University of Amsterdam
@lukask – [email protected] 2012, Trondheim, October 1-3, 2012
Agenda
Primo at the University of Amsterdam - EMTACL12
1-Discovery tools
1a-Technology
2-Content
3-Indexing
4-User interface
http://lib.uva.nl
Technology vs Real Life
http://www.flickr.com/photos/35669523@N04/3310187686
http://www.flickr.com/photos/brewbooks/3318600273
Discovery tools
A one-stop single-search-box web based solution for searching, browsing, discovery and delivery of print and digital publications and objects available from library collections, institutional resources and academic publishers, using a unified index
User interface
• One stop
• Single search box
• Web based
• Searching
• Browsing
• Discovery
• Delivery
Content
• Print publications
• Digital publications
• Objects
• Library collections
• Institutional resources
• Academic publishers
Indexing
• Unified index
Discovery tools
What I would like:
Gateways to all information a library has access to
Discovery tools: the environment
Academic libraries: Teaching, Research
Access: Subscriptions, Freely available
Information: Traditional publications, All other types
http://commons.wikimedia.org/wiki/File:Graduation_hat.svg
Aside
Everything I say applies to all discovery tools, not only Primo
Technology
http://www.flickr.com/photos/quiltsalad/5991773081/
Harvesting & Indexing
http://www.flickr.com/photos/manchesterlibrary/2034771121
Technology
[Diagram: Discovery frontend (UI) on top of a Central Metadata Index. Sources shown: external databases, repository, local database, image database, e-journals, ILS, several with their own UI. Layers: Content, Harvesting, Indexing, User interface.]
Technology
[Diagram: Discovery frontend (User interface) on top of the Central Metadata Index (Harvested and Indexed Content).]
Content
http://www.flickr.com/photos/mollyblock/7941237158
Content
Theoretically (technically) we can harvest everything we have access to
Content
[Diagram: Discovery frontend (UI) on top of a Central Metadata Index. Sources shown: external databases, repository, local database, image database, e-journals, ILS, several with their own UI. Layers: Content, Harvesting, Indexing, User interface.]
Content
[Diagram: Discovery frontend on top of two indexes, a Shared Metadata Index and a Local Metadata Index. Sources shown: external databases, repository, local database, image database, e-journals, ILS. Layers: Content, Harvesting, Indexing, User interface.]
Content
[Diagram: Discovery frontend (User interface) on top of the Shared Metadata Index and the Local Metadata Index (Harvested and Indexed Content).]
Content
[Diagram: Shared Metadata Index and Local Metadata Index. Other label: Local.]
Content
[Diagram: Shared Metadata Index and Local Metadata Index with their sources. Labels: Local, Repositories, SFX, ILS, external databases, e-journals.]
Content
[Diagram: Shared Metadata Index and Local Metadata Index with their sources. Labels: Local, Repositories, SFX, ILS, external databases, e-journals, Content provider (three times), System vendor, Student Theses.]
Content
Alternative coverage
Content
Content types
Mostly: Traditional publications
• Books
• Articles
Also other types
• Datasets
• Maps
• etc.
Content
In reality we can’t harvest everything we have access to
http://www.flickr.com/photos/adactio/2144119569
Indexing
Indexing
Theoretically (technically) we can index everything unambiguously
Indexing
[Diagram: Shared Metadata Index and Local Metadata Index. Other label: Local.]
Two separate indexes
Indexing
[Diagram: Data source, Harvesting, Source records, Normalising, PNX records.]
PNX
<search>
  <author>
  <title>
</search>
<display>
  <author>
  <title>
</display>
<facets>
  <author>
  <type>
  <date>
</facets>
<links></links>
<delivery></delivery>
Or similar, etc.
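The normalising step above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the section names (search, display, facets, links, delivery) come from the slide, but the function name, the "record" wrapper element and the flat source record with author/title/type/date keys are hypothetical, and a real PNX record has many more fields.

```python
import xml.etree.ElementTree as ET

def normalise_to_pnx(source: dict) -> ET.Element:
    """Copy fields of a flat source record into PNX-like sections (hypothetical helper)."""
    record = ET.Element("record")  # wrapper element name is an assumption
    search = ET.SubElement(record, "search")
    display = ET.SubElement(record, "display")
    facets = ET.SubElement(record, "facets")
    ET.SubElement(record, "links")      # left empty in this sketch
    ET.SubElement(record, "delivery")   # left empty in this sketch

    # The same source value is repeated in every section that needs it.
    for section in (search, display, facets):
        ET.SubElement(section, "author").text = source["author"]
    for section in (search, display):
        ET.SubElement(section, "title").text = source["title"]
    ET.SubElement(facets, "type").text = source["type"]
    ET.SubElement(facets, "date").text = source["date"]
    return record

# Made-up source record, for illustration only
pnx = normalise_to_pnx(
    {"author": "Beckett, Samuel", "title": "Example title", "type": "book", "date": "1954"}
)
print(ET.tostring(pnx, encoding="unicode"))
```

The point of the sketch is that one source value ends up in several sections, each serving a different purpose (searching, display, faceting).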
Indexing
[Diagram: Shared Metadata Index and Local Metadata Index. Other label: Local.]
No deduplication across indexes
No FRBRisation across indexes
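Why separate indexes prevent deduplication can be shown with a small Python sketch. It assumes a very simple, hypothetical match key (normalised title plus year); the actual matching rules of any discovery tool are more elaborate, and the records are made up.

```python
from collections import defaultdict

def match_key(record):
    """Hypothetical match key: lower-cased title without punctuation, plus year."""
    title = "".join(ch for ch in record["title"].lower() if ch.isalnum() or ch == " ")
    return (" ".join(title.split()), record["year"])

def deduplicate(records):
    """Group records that share a match key; each group is one merged result."""
    groups = defaultdict(list)
    for record in records:
        groups[match_key(record)].append(record)
    return list(groups.values())

# Made-up records, for illustration only
local_index = [
    {"title": "Waiting for Godot", "year": 1954, "source": "ILS"},
    {"title": "Waiting for Godot.", "year": 1954, "source": "Repository"},
]
shared_index = [
    {"title": "Waiting for Godot", "year": 1954, "source": "Content provider"},
]

# Deduplicating each index on its own still leaves one result per index.
per_index = deduplicate(local_index) + deduplicate(shared_index)
# Only a consolidated index could merge all three records into one result.
consolidated = deduplicate(local_index + shared_index)
print(len(per_index), len(consolidated))
```

With two indexes the user sees the same work twice; the consolidated case is what deduplication across indexes would require.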
Indexing
[Diagram: Shared Metadata Index and Local Metadata Index. Other label: Local.]
Consolidate both indexes
Adapt local indexing to shared indexing
Indexing
Author names
Works reasonably well
Multiple search variants
But: strings, no unique identifiers
<creatorcontrib>Beckett, Samuel</creatorcontrib>
<creatorcontrib>Samuel Beckett 1906-1989</creatorcontrib>
<creatorcontrib>Beckett, S</creatorcontrib>
<creatorcontrib>Samuel Beckett</creatorcontrib>
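A minimal Python sketch of what "strings, no unique identifiers" means in practice, using the four name variants from the slide. The normalisation rule is a hypothetical heuristic written for this example, not something Primo is known to do.

```python
import re

def name_key(name: str) -> str:
    """Hypothetical heuristic: drop life dates and rewrite the name as 'last, first'."""
    name = re.sub(r"\d{4}-\d{4}", "", name).strip()
    if "," in name:
        last, first = [part.strip() for part in name.split(",", 1)]
    else:
        parts = name.split()
        first, last = " ".join(parts[:-1]), parts[-1]
    return f"{last}, {first}".lower()

variants = [
    "Beckett, Samuel",
    "Samuel Beckett 1906-1989",
    "Beckett, S",
    "Samuel Beckett",
]
keys = {name_key(variant) for variant in variants}
print(sorted(keys))
```

String clean-up collapses three of the four variants, but "Beckett, S" stays separate, and nothing in the string says whether it is the same person. Only a unique identifier per author would settle that.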
Indexing
Topics
Again: strings, no unique identifiers
Subjects/topics/keywords etc. are taken from each data source ‘as is’
Indexing
Resource types
Match Resource Type codes across indexes
One Resource Type per record
Indexing
Resource types
Interesting example from institutional repository
MODS/DIDL
<typeOfResource>text</typeOfResource>
<genre>info:eu-repo/semantics/doctoralThesis</genre>
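Reducing such a record to one Resource Type could look roughly like the Python sketch below. The two source values are the ones from the example; the target codes ("dissertation", "text_resource", "other"), the lookup tables and the rule that the more specific genre wins are assumptions made for illustration.

```python
# Hypothetical lookup tables; the target codes are made up for this sketch.
GENRE_TO_TYPE = {
    "info:eu-repo/semantics/doctoralThesis": "dissertation",
}
TYPE_OF_RESOURCE_TO_TYPE = {
    "text": "text_resource",
}

def resource_type(type_of_resource, genre=None):
    """Pick exactly one resource type per record, preferring the more specific genre."""
    if genre in GENRE_TO_TYPE:
        return GENRE_TO_TYPE[genre]
    return TYPE_OF_RESOURCE_TO_TYPE.get(type_of_resource, "other")

print(resource_type("text", "info:eu-repo/semantics/doctoralThesis"))
print(resource_type("text"))
```

If only typeOfResource were mapped, the thesis would end up as a generic text resource, which is why the choice of source field matters when matching codes across indexes.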
Aside: Primo “hackable”
Ex Libris Open APIs
Customisable, plugins, add-ons
Linked Open Data Special Interest Working Group
http://igelu.org/special-interests/lod
Indexing
In reality we can’t index everything unambiguously
http://www.flickr.com/photos/maveric2003/3822708724/
http://www.flickr.com/photos/profzucker/3754015526/
User interface
http://www.flickr.com/photos/mafleen/125422650
User interface
Theoretically (technically) we can find all we need with one search
User interface
Broad
Scoped
Discipline
Known item
User interface
Setting context
Before search: Advanced search, etc.
• Subject
• Discipline
• Scope
• Type
• Date
• …
After search: Refine results/facets
• Subject
• Discipline
• Source
• Type
• Date
• …
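Setting context after the search, by refining on facets, can be sketched in Python as follows. The result set, the field names and both helper functions are made up for illustration; this is not how any particular discovery tool implements facets.

```python
from collections import Counter

# Made-up result set, for illustration only
results = [
    {"title": "Result 1", "type": "book", "date": "2010"},
    {"title": "Result 2", "type": "article", "date": "2011"},
    {"title": "Result 3", "type": "article", "date": "2010"},
]

def facet_counts(records, field):
    """Count how often each value of a facet field occurs in the result set."""
    return Counter(record[field] for record in records)

def refine(records, field, value):
    """Keep only the results that have the chosen facet value."""
    return [record for record in records if record[field] == value]

type_facet = facet_counts(results, "type")
articles = refine(results, "type", "article")
print(type_facet, [record["title"] for record in articles])
```

Both steps only work as well as the underlying field values: if types, subjects or dates are not indexed uniformly, the facet counts and the refined set are unreliable.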
User interface
Broad
Refine
Facets
http://www.flickr.com/photos/eirasi/2084477067/
User interface
Scoped
User interface
Discipline
Requires uniform classification by subject of each item, in Local and Shared index
At the moment only available for relevance ranking in ScholarRank, on journal level
User interface
Known item
Search on Title, Title + Author
No discovery desired!
User interface
Different audiences, context
Depending on context, different search interfaces may be appropriate
http://www.flickr.com/photos/rrrrred/3923807023
User interface
In reality we can’t find all we need with one search
http://www.flickr.com/photos/yoursecretadmiral/4052368212/
Technology vs Real Life
We can’t harvest everything
We can’t index unambiguously
We can’t find all we need with one search
YET!
YET!
NEXT?
http://www.flickr.com/photos/katerha/7071545621