semantic technologies for cultural heritage ongoing projects at ontotext mariana damova, phd...

24
Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

Upload: silas-edwards

Post on 28-Dec-2015

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

Semantic Technologies for Cultural Heritage

Ongoing Projects at Ontotext

Mariana Damova, PhDSeptember, 2011

Page 2: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#2September 2011Museum Reason-able View

Introduction

• A reason-able View of Linked Data for Cultural Heritage (Molto, FP-7)

http://museum.ontotext.com

• Semantic publishing of Bulgarian Cultural Heritage in Europeana (with FMI)

• Research Space (Btirish Museum)

Page 3: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

A Reason-able View of Linked Data for Cultural Heritage

Page 4: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#4September 2011Museum Reason-able View

Introduction

Linked Open Data

combining facts and knowledge from different datasets is the ultimate goal of the Semantic Web

Need for convincing real life use cases demanstrating the benefits of these technologies

MacManus, the Founder and Editor-in-Chief of ReadWriteWeb defined an exemplary test for the Semantic Web

cities around the world which have Modigliani art works

Page 5: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#5September 2011Museum Reason-able View

FactForge of Ontotext solves the Modigliani query

The cultural heritage domain can become a useful usecase for the application of semantic technologies.

Page 6: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#6September 2011Museum Reason-able View

Linked Open Data – the Vision

Tim Berners-Lee graphs published on the web and explorable across servers in a manner similar

to the way the HTML web is navigated

Design principles of Linked Open Data– Use URIs to identify things.– Use HTTP URIs so that these things can be referred to and looked up ("dereferenced") by people and user agents.– Provide useful information about the thing when its URI is dereferenced, using standard formats such as RDF/XML.– Include links to other, related URIs in the exposed data to improve discovery of other related information on the Web.

Page 7: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#7September 2011Museum Reason-able View

Linked Open Data Cloud

203 datasets as of september 2011

Page 8: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#8September 2011Museum Reason-able View

Reason-able View – Linked Open Data Management

Using linked data for data management is considered to have great potential for the transformation of the web of data into a giant global graph (Heath, & Bizer, 2011). Still, there are several challenges that have to be overcome to make this possible, namely:

• LOD are hard to comprehend;• Diversity comes at a price;• LOD is unreliable;• Dealing with data distributed on the web is slow;• No consistency is guaranteed.

Using reason-able views (Kiryakov et al., 2009a) – a solution to LOD management.

Page 9: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#9September 2011Museum Reason-able View

Reason-able View – Linked Open Data Management

• An approach for reasoning with and managing linked data

- an assembly of independent datasets, which can be used as a single body of knowledge with respect to reasoning and query evaluation

- lowering the cost and the risks of using specific linked datasets for specific purposes

• The linkage between the data- at the schema level- at the instance level

• Accessible via- SPARQL endpoint- keywords

• Queries with predicates from different datasets• “Federated” results from different datasets

Page 10: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#10September 2011Museum Reason-able View

Museum Reason-able View – Data

• Requirements:

- the ability to handle generic knowledge, such as people, institutions, and locations

- the ability to handle specific subject domains, such as the cultural heritage and museums

• Datasets covering the Generic Knowledge of the Museum Reason-able View.

- DBpedia - the RDF-ized version of Wikipedia, describing more than 3.5 million things and covers 97 languages.

- Geonames - a geographic database that covers 6 million of the most significant geographical features on Earth.

- PROTON - an upper-level ontology, 542 entity classes and 183 properties.

Page 11: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#11September 2011Museum Reason-able View

Museum Reason-able View – Museum Data Models

• CIDOC – CRM developed by the International Council of Museum’s Committee for

Documentation (ICOM-CIDOC)

an upper-level ontology for cultural and natural history for museum professionals to perform their work well 90 classes and 148 properties

- Entity, Temporal Entity, Time Span, Place, Dimension, - Production, Creation, Dissolution, Acquisition, Curation

Page 12: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#12September 2011Museum Reason-able View

Museum Reason-able View – Museum Data Models

K-samsök, the Swedish Open Cultural Heritage (SOCH)

• a Web service for applications to retrieve data from cultural heritage institutions or associations with Cultural Heritage information.• includes features which are divided in the following categories: 

- Identification of the item in the collection- Internet address, and thumbnail address- Description of the item- Description of the presentation of the item, including a thumbnail- Geographic location coordinates- Museum information about the item- Context, when was it created, to which style it belongs, etc.- Item specification, e.g. size, and type of the item – painting, sculpture and the like

Page 13: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#13September 2011Museum Reason-able View

Museum Reason-able View – Museum Data Models

MAO (Museum Artefact Ontology)link between K-samsök, CIDOC-CRM and real museum data

MuseumObject - ForemalsnumberURI:http://molto.ontotext.com/GSM_NrURI:http://molto.ontotext.com/GIM_Nr

ptop:Objectcrm:E22_Man_Made-Object

“GSM”/”GIM”

“painting”“portret”

“sculpture”

rdfs:label

rdf:type

mao:from

Collection“kontor” - “office”

“livsmedelsindustri” - “food”“grafisk industri” - “graphics industry”

mao:relatedCategory

crm:E54_Dimension

mao:itemMeasurement

mao:Quantity(Antal)

mao:Length(Langd)

mao:Bredth(Bredd)

mao:Height(Hojd)

mao:Diam(Diam)

mao:Weight(Vikt)

mao:hasLength (Langd)

mao:hasBredth (Bredd)

mao:hasHeight (Hojd)

mao:hasDiam (Diam)

mao:hasWeight (Vikt)

ptop:Personcrm:E21_Person

pext:author“firstname lastname”

rdfs:label ptop:TimeIntervalcrm:E52_Time-Span

“time”

mao:fromPeriod

rdfs:label

crm:E58_Measurement_Unitcrm:P91_has_unitmao:hasQuantity (Antal)

“value”“value”“value”“value”“value”“value”

mao:valuemao:valuemao:valuemao:valuemao:value mao:value

crm:E57_Material

“material”

rdfs:label

mao:hasM

aterial ptop:Locationcrm:E53_Place

ptop:locatedIn

pext:Address

pext:hasAddress

“address”

rdfs:label

Page 14: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#14September 2011Museum Reason-able View

Museum Reason-able View – Gothenburg City Museum Data

8900 museum objects in two museum collections – GSM and GIM

39 properties describe each museum object

The Gothenburg city mueum data is integrated by using predicates from CIDOC-CRM, PROTON, MAO, and linkages to DBpedia

Page 15: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#15September 2011Museum Reason-able View

Museum Reason-able View – Architecture

Process of triplification and localisation of GCM data in English.

Page 16: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#16September 2011Museum Reason-able View

Museum Reason-able View – Environment

• OWLIM

• Ontologies and data loaded with full materialization– Dbpedia 3.6, Geonames 2.2.1, PROTON 3.0, CIDOC-CRM 1.0, MAO, GCM data

• 20% more retrievable statements than loaded explicit statements– 257,774,678 (explicit)– 305,313,536 (retrievable)

• SPARQL endpoint– Museum artefacts preserved in the museum since 2005– Paintings from the GSM collection– Inventory numbers of the paintings from the GSM collection – Location of the objects created by Anders Hafrin– Paintings with length less than 1 m– etc.

http://museum.ontotext.com

Page 17: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#17September 2011Museum Reason-able View

Related Work

MAO – Finland http://www.seco.tkk.fi/projects/finnonto/

Europeana – EU http://www.europeana.eu/portal/

VUA – Amsterdam Museum with semantic technologieswithin Europeana connect

British Museum – Research Spacejust won tender funded by Melon foundation

Contribution of the paper:First to link real museum data to LODFirst to use schema-level mapping to data integration in a specific domain like cultural heritage

Page 18: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#18September 2011Museum Reason-able View

Related Work

MAO – Finland http://www.seco.tkk.fi/projects/finnonto/

Europeana – EU http://www.europeana.eu/portal/

VUA – Amsterdam Museum with semantic technologieswithin Europeana connect

British Museum – Research Spacejust won tender funded by Melon foundation

Contribution of the paper:First to link real museum data to LODFirst to use schema-level mapping to data integration in a specific domain like cultural heritage

Page 19: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#19September 2011Museum Reason-able View

ConclusionKnowledge Representation InfrastructureMethod of managing linked dataImplementation of Museum Reason-able view

extendable with information from other Swedish museumslinks to general knowledge from LOD datasets

Future workExperiments with the Museum Reason-able viewQuerying and NavigationExtension of the data models, for example with a painting ontologyQuerying structured information in natural languageRepresenting structured results in natural language

MOLTO, FP7-ICT-247914

Page 20: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

Semantic publishing of Bulgarian Cultural Heritage in

Europeana

Page 21: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#21September 2011Museum Reason-able View

Scope of the Project• Digitalization of selected Bulgarian Cultural Heritage

• Conversion in semantic format

• Publishing in Europeana

• Selected Bulgarian Cultural Heritage– Cyrilo-Methodius manuscripts and psaltirs– Thracian golden treasures, thumbs, temples

• Additioanal objectives:– Establishing of a process for publishing, easy to adopt by other cultural institutions– Establishing of a group (Special Interest Group, SIG) for publisihing in Europeana, coordination with potential partners– Establishing of a technical aggredator for Bulgaria (www.bulgariana.eu)

Page 22: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

Research Space

Page 23: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#23September 2011Museum Reason-able View

Scope of the Project• To create an RDF infrastructure based on the CIDOC-CRM ontology that harmonises data contributed by different cultural heritage and arts organisations.

• To provide collaboration environments and tools to enable effective online scholarly communication with direct access to supporting data. The main issue with current online scholarly collaboration is that the data is not, ‘on hand’. The ability to draw upon source materials during research is a major enhancement to this type of environment.

• To provide research tools that harness the benefits of semantic data for the discovery of new information and knowledge and support a range of research methods and techniques for analysing data, information and multimedia resources.

• Allow publication of data and documentation in web applications, and as linked data, which would be available to the scholarly community working outside the ResearchSpace environment.

• Be accessible to a wide range of humanities organisations, large or small and regardless of internal resources.

• Provide a full web browser interface, and open technology architecture to encourage community development.

Page 24: Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011

#24September 2011Museum Reason-able View

Thank you for your attention!

Questions

[email protected]