fedora overview

29
Challenge the future Delft University of Technology 1 Overview of Fedora-Commons Repository Fedora-Commons Repository •A brief overview Etienne Posthumus, DPO, TU Delft Library 08-05-22

Upload: eposthumus

Post on 13-Dec-2014

1.973 views

Category:

Education


0 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Fedora Overview

Challenge the future

DelftUniversity ofTechnology

1Overview of Fedora-Commons Repository

Fedora-Commons Repository•A brief overview

Etienne Posthumus, DPO, TU Delft Library

10-04-23

Page 2: Fedora Overview

Challenge the future

DelftUniversity ofTechnology

2Overview of Fedora-Commons Repository

What is it?

Fedora is a general-purpose, open-source digital object repository system.

Page 3: Fedora Overview

Challenge the future

DelftUniversity ofTechnology

3Overview of Fedora-Commons Repository

What is it?

Fedora is a general-purpose, open-source digital object repository system.

"Flexible Extensible Digital Object Repository Architecture"

NOT the Linux distribution

Page 4: Fedora Overview

Challenge the future

DelftUniversity ofTechnology

4Overview of Fedora-Commons Repository

What is it?

Fedora is a general-purpose, open-source digital object repository system.

…a persistence and service delivery platform

Page 5: Fedora Overview

Challenge the future

DelftUniversity ofTechnology

5Overview of Fedora-Commons Repository

What is it?

Fedora is a general-purpose, open-source digital object repository system.

…a persistence and service delivery platform

…implemented as a set of Java services (Tomcat)

Page 6: Fedora Overview

Challenge the future

DelftUniversity ofTechnology

6Overview of Fedora-Commons Repository

What is it?

Fedora is a general-purpose, open-source digital object repository system.

A well-funded active community with a long history.

Page 7: Fedora Overview

7Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Conforms to the OAIS reference model

Open Archival Information System - an ISO model defined by Consultative Committee for Space Data Systems widely adopted in Archives community

Built on sub-systems

Open Source, Open Standards

Page 8: Fedora Overview

8Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Persistent digital archive ‘files on disk’ bootstrappeable in the face of disaster XML containers asset management

Built on sub-systems

Open Source, Open Standards

Page 9: Fedora Overview

9Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Persistent digital archive ‘files on disk’ bootstrappeable in the face of disaster XML containers asset managementScaleable triple-store Mulgara

Built on sub-systems

Open Source, Open Standards

Page 10: Fedora Overview

10Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Persistent digital archive ‘files on disk’ bootstrappeable in the face of disaster XML containers asset managementScaleable triple-store Mulgara Service delivery XSLT via Saxon built-in others pluggable

Built on sub-systems

Open Source, Open Standards

Page 11: Fedora Overview

11Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Advantages?

• Scaling• Integrated Asset management• Extensive Versioning• Audit trails• Large(ish) community, esp. mindshare• General ‘fit and polish’

Goal is support for > 10 million objects

Page 12: Fedora Overview

12Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Downsides?

• It’s a toolbox not a complete solution (front-end support)• Java code base (for some this is an advantage ;-)• Moving target in terms of innovation (also debatable)• Complexity of some concepts, steep learning curve

A lot of DIY is needed

Page 13: Fedora Overview

13Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Digital Object Model

• Compound objects, aggregate items per object

Page 14: Fedora Overview

14Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Digital Object Model

• Compound objects, aggregate items per object• Some ‘reserved’ item IDs

Page 15: Fedora Overview

15Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Digital Object Model

• Compound objects, aggregate items per object• Some ‘reserved’ item IDs• Essentially an XML container

Page 16: Fedora Overview

16Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Digital Object Model<foxml:digitalObject PID="demo:31" VERSION="1.1" xmlns:foxml="info:fedora/fedora-system:def/foxml#" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="info:fedora/fedora-system:def/foxml# http://www.fedora.info/definitions/1/0/foxml1-1.xsd"> <foxml:objectProperties> <foxml:property NAME="info:fedora/fedora-system:def/model#state" VALUE="Active"/><foxml:property NAME="info:fedora/fedora-system:def/model#ownerId" VALUE="fedoraAdmin"/> <foxml:property NAME="info:fedora/fedora-system:def/model#createdDate" VALUE="2008-07-02T05:09:42.015Z"/> <foxml:property NAME="info:fedora/fedora-system:def/view#lastModifiedDate" VALUE="2008-07-02T05:09:42.015Z"/> </foxml:objectProperties> <foxml:datastream CONTROL_GROUP="X" ID="DC" STATE="A" VERSIONABLE="true"> <foxml:datastreamVersion CREATED="2008-07-02T05:09:43.328Z" FORMAT_URI="http://www.openarchives.org/OAI/2.0/oai_dc/" ID="DC1.0" LABEL="Dublin Core Record for this object" MIMETYPE="text/xml"> <foxml:xmlContent> <oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"> <dc:title>Apache Batik 1.5 Demo Directory</dc:title> <dc:creator>Chris Wilper</dc:creator> <dc:creator>The Apache Group</dc:creator> <dc:subject>svg</dc:subject><dc:publisher>The Apache Group</dc:publisher> <dc:format>image/svg+xml</dc:format> <dc:identifier>demo:31</dc:identifier> <dc:relation>http://xml.apache.org/batik/</dc:relation> <dc:rights>The Apache License</dc:rights> </oai_dc:dc> </foxml:xmlContent> </foxml:datastreamVersion> </foxml:datastream> <foxml:datastream CONTROL_GROUP="E" ID="DS1" STATE="A" VERSIONABLE="true"> <foxml:datastreamVersion CREATED="2008-07-02T05:09:43.328Z" ID="DS1.0" LABEL="Local copy of batikBatik.svg" MIMETYPE="image/svg+xml"> <foxml:contentLocation REF="http://www.fedora-commons.org/demo/misc/batikBatik.svg" TYPE="URL"/> </foxml:datastreamVersion> </foxml:datastream>

An Example

Page 17: Fedora Overview

17Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Content Model Architecture

“an integrated structure for persisting and delivering the essential characteristics of digital objects in Fedora”

Modelling behaviours and structure of objects.

Complicated, fairly new, not widely implemented yet, in flux.

…can also be safely ignored when starting out.(which we did not realise)

Page 18: Fedora Overview

18Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Fedora at TU Delft Library

• Used to create a repository Infrastructure

Page 19: Fedora Overview

19Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Fedora at TU Delft Library

• Used to create a Repository Infrastructure• Multiple projects:

• Institutional Repository (academic output)• Special Collections (Tresor, bouwkunde dias)• Expert Search (finding People)• 3TU Datacentrum

Page 20: Fedora Overview

20Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Implementation

• Front-end is Django application

Page 21: Fedora Overview

21Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Implementation

• Front-end is Django application• We use the REST API for communication

Page 22: Fedora Overview

22Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Implementation

• Front-end is Django application• We use the REST API for communication• Middleware as Python code providing a web API

Page 23: Fedora Overview

23Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Implementation

• Front-end is Django application• We use the REST API for communication• Middleware as Python code providing a web API• MOAI flexible OAI-PMH service

Page 24: Fedora Overview

24Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Implementation

• Front-end is Django application• We use the REST API for communication• Middleware as Python code providing a web API• MOAI flexible OAI-PMH service• Indexing via SOLR subsequently linked to DISCOVER

Page 25: Fedora Overview

25Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Page 26: Fedora Overview

26Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Conversion of existing repository

• Legacy Stellent CMS stored in Oracle• Circa 12000 items• Mostly PDF plus metadata• Conversion scripts in Python• Bulk ingest

Page 27: Fedora Overview

27Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Community Participation

• NL Fedora Usergroup• Contact with key Fedora participants at conferences• Presentations and workshop• Developer days planned• Possible collaboration on development• Sponsorship of open-source components

Page 28: Fedora Overview

28Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Plans

• Convert existing repository with existing functionality• Incrementally build in new features on ongoing basis• Extend APIs and services• Linked Data repository• More innovative repository applications

Page 29: Fedora Overview

29Overview of Fedora-Commons Repository

Challenge the future

DelftUniversity ofTechnology

Questions?

[email protected]

http://www.fedora-commons.org/