The HAPPI Toolkit: from model to implementation
“Digital Preservation in Earth-Science Missions” Workshop Rome, 26th November 2014 - INGV
virtualgix R&D LAB
SCIDIP-ES Workshop @ INGV
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
Outline
• What HAPPI is
• What HAPPI does
• How HAPPI works
• HAPPI Technology
• How to integrate HAPPI
• Current Status
• Next Steps
What HAPPI is
• HAPPI is one of the 12 software solutions composing the SCIDIP-ES e-Infrastructure
• HAPPI stands for Handling Authenticity, Provenance and Persistent Identifiers
[Figure: SCIDIP-ES e-Infrastructure components shown: Packaging, Notification, HAPPI, Finding Aid, Registry, Gap Ident.]
What HAPPI is
• The HAPPI Toolkit is a software component that manages part of the preservation metadata defined in ISO 14721:2012, i.e. the OAIS Preservation Description Information (PDI)
• The HAPPI Toolkit manages evidence of the transformations that may occur on digital objects during their life cycle.
[Figure: OAIS:PDI components (Provenance, Reference, Context, Fixity, Rights) and EH]
What HAPPI does
• HAPPI supports archive managers in documenting transformations of digital objects
• Transformations are documented using the EvidenceRecord and EvidenceHistory model, together with OAIS, OPM and PREMIS
[Figure: use-case diagram with the labels Archive Manager and Store HAPPI, and the use cases:
• Register Intellectual Entity
• Capture Evidence Record
• Import/Export Evidence History
• Search & Browse Intellectual Entity, Evidence Records]
How HAPPI works
Evidence items and their attributes:
• Report: info, Fixity, SignificantProperties
• Agent: ID+info, Type
• Representation: ID+info, Format, Type
• Transformation: ID+info, Software, Type
How HAPPI works
Step 1: Register the Intellectual Entity
• title
• creation date
• reference
• annotation
Step 2: Gather information into Evidence Records
• transformation
• who controls the transformation
• result of transformation
• report with annotation and specific properties
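The two steps above can be sketched as a small data model. This is only an illustration, not the HAPPI API: every class and field name below is an assumption derived from the slide labels (Agent, Transformation, Representation, Report), and the reference is simplified to a plain string.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Date;
import java.util.List;

// Hypothetical data model; names follow the slide labels, not the HAPPI source.
public class HappiModelSketch {

    /** Who controls the transformation. */
    static final class Agent {
        final String id, type;
        Agent(String id, String type) { this.id = id; this.type = type; }
    }

    /** The operation applied to the digital object. */
    static final class Transformation {
        final String id, software, type;
        Transformation(String id, String software, String type) {
            this.id = id; this.software = software; this.type = type;
        }
    }

    /** The result of the transformation. */
    static final class Representation {
        final String id, format, type;
        Representation(String id, String format, String type) {
            this.id = id; this.format = format; this.type = type;
        }
    }

    /** Annotation, fixity value and significant properties. */
    static final class Report {
        final String info, fixity;
        final List<String> significantProperties;
        Report(String info, String fixity, List<String> significantProperties) {
            this.info = info; this.fixity = fixity;
            this.significantProperties = new ArrayList<>(significantProperties);
        }
    }

    /** Step 2: one record per transformation. */
    static final class EvidenceRecord {
        final Agent agent;
        final Transformation transformation;
        final Representation representation;
        final Report report;
        EvidenceRecord(Agent agent, Transformation transformation,
                       Representation representation, Report report) {
            this.agent = agent; this.transformation = transformation;
            this.representation = representation; this.report = report;
        }
    }

    /** Step 1: the registered entity, which owns its evidence history. */
    static final class IntellectualEntity {
        final String title, reference, annotation;
        final Date creationDate;
        private final List<EvidenceRecord> history = new ArrayList<>();
        IntellectualEntity(String title, Date creationDate, String reference, String annotation) {
            this.title = title; this.creationDate = creationDate;
            this.reference = reference; this.annotation = annotation;
        }
        void capture(EvidenceRecord er) { history.add(er); }
        List<EvidenceRecord> history() { return Collections.unmodifiableList(history); }
    }

    public static void main(String[] args) {
        // All values below are made-up sample data.
        IntellectualEntity ie = new IntellectualEntity(
            "Sample dataset", new Date(), "urn:example:1", "demo");
        ie.capture(new EvidenceRecord(
            new Agent("a1", "PERSON"),
            new Transformation("t1", "ingest-tool", "INGESTION"),
            new Representation("r1", "text/csv", "SUBMITTED"),
            new Report("first ingest", "sha256:placeholder", Arrays.asList("row count"))));
        System.out.println(ie.title + " has " + ie.history().size() + " evidence record(s)");
    }
}
```

Keeping the history inside the entity mirrors the slide's ordering (register first, then gather records); the real toolkit may organise this differently.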
How HAPPI works
Browse the History of Data: Timeline mode
How HAPPI works
Browse the History of Data: Graph mode
How HAPPI works
• The Archive Manager can add specific significant properties, to support later authenticity assessment
• A Reference is applied to Intellectual Entities and evidence items (i.e. Agent, Transformation, Representation)
  • Organisation – who assigns the reference
  • Type – type of reference (e.g. URI, DOI, Protocol, …)
  • Value – value of reference
• Types of Transformations
  • AGGREGATION
  • CAPTURE
  • CHANGEOFCUSTODY
  • EXTRACTION
  • INGESTION
  • MIGRATION
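The Reference triple and the transformation types listed above map naturally onto a small value class and an enum. The sketch below is an assumption about how they could be represented, not the HAPPI classes themselves; `ReferenceSketch`, `parseType` and the sample DOI are hypothetical.

```java
import java.util.Locale;

// Hypothetical representation of the Reference triple and the transformation types.
public class ReferenceSketch {

    /** The six transformation types named on the slide. */
    enum TransformationType {
        AGGREGATION, CAPTURE, CHANGEOFCUSTODY, EXTRACTION, INGESTION, MIGRATION
    }

    /** Organisation (who assigns it), type (URI, DOI, ...) and value. */
    static final class Reference {
        final String organisation, type, value;
        Reference(String organisation, String type, String value) {
            this.organisation = organisation; this.type = type; this.value = value;
        }
        @Override public String toString() {
            return type + ":" + value + " (assigned by " + organisation + ")";
        }
    }

    /** Maps free text such as "capture" to the enum; throws IllegalArgumentException if unknown. */
    static TransformationType parseType(String text) {
        return TransformationType.valueOf(text.trim().toUpperCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        // Made-up sample values.
        Reference ref = new Reference("Example Org", "DOI", "10.0000/example");
        System.out.println(ref);
        System.out.println(parseType("ingestion"));
    }
}
```

An enum rejects misspelled types at the boundary, which matters if the types are later extended as planned.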
HAPPI Technology
[Figure: architecture diagram with the components HAPPI-SERVER, HAPPI-LOGIC and GraphDB (Neo4j)]
How to integrate HAPPI
HAPPI-LOGIC-1.5.0 provides two managers.
IntellectualEntityManager (IEManager)
+ addIntellectualEntity(ie)
+ getIntellectualEntity(label)
+ getAllIntellectualEntities()
+ getIntellectualEntitiesBy(from, to, keyword)
EvidenceHistoryManager (EHManager)
+ addEvidenceRecord(er, eh)
+ getEvidenceRecord(label, eh)
+ getAllEvidenceRecords(eh)
+ getEvidenceRecordHistory(label, eh)
+ getLastEvidenceRecords(eh)
+ importEvidenceHistory(eh, gxmlFile)
+ exportEvidenceHistory(eh, gxmlFile)
[Figure labels: IEManager, EHManager, IntellectualEntity, EvidenceHistory/Record]
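To make the EvidenceHistoryManager operations more concrete, here is a minimal in-memory sketch of one evidence history. It is not the HAPPI implementation (which sits on a graph database): the class, the `previousLabel` link and the semantics chosen for "history" (the chain back to the first record) and "last records" (records without a successor) are all assumptions.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.LinkedList;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical in-memory stand-in for one evidence history.
public class InMemoryEvidenceHistory {

    static final class EvidenceRecord {
        final String label;
        final String previousLabel; // null for the first record
        EvidenceRecord(String label, String previousLabel) {
            this.label = label; this.previousLabel = previousLabel;
        }
    }

    // label -> record, kept in insertion order
    private final Map<String, EvidenceRecord> records = new LinkedHashMap<>();

    void addEvidenceRecord(EvidenceRecord er) {
        if (records.containsKey(er.label))
            throw new IllegalArgumentException("duplicate label: " + er.label);
        if (er.previousLabel != null && !records.containsKey(er.previousLabel))
            throw new IllegalArgumentException("unknown predecessor: " + er.previousLabel);
        records.put(er.label, er);
    }

    EvidenceRecord getEvidenceRecord(String label) { return records.get(label); }

    List<EvidenceRecord> getAllEvidenceRecords() { return new ArrayList<>(records.values()); }

    /** Assumed meaning: the chain from the first record up to the given one. */
    List<EvidenceRecord> getEvidenceRecordHistory(String label) {
        LinkedList<EvidenceRecord> chain = new LinkedList<>();
        EvidenceRecord cur = records.get(label);
        while (cur != null) {
            chain.addFirst(cur);
            cur = cur.previousLabel == null ? null : records.get(cur.previousLabel);
        }
        return chain;
    }

    /** Assumed meaning: records that no other record names as its predecessor. */
    List<EvidenceRecord> getLastEvidenceRecords() {
        Set<String> predecessors = new HashSet<>();
        for (EvidenceRecord er : records.values())
            if (er.previousLabel != null) predecessors.add(er.previousLabel);
        List<EvidenceRecord> last = new ArrayList<>();
        for (EvidenceRecord er : records.values())
            if (!predecessors.contains(er.label)) last.add(er);
        return last;
    }

    public static void main(String[] args) {
        InMemoryEvidenceHistory eh = new InMemoryEvidenceHistory();
        eh.addEvidenceRecord(new EvidenceRecord("er1", null));
        eh.addEvidenceRecord(new EvidenceRecord("er2", "er1"));
        System.out.println("history length of er2: " + eh.getEvidenceRecordHistory("er2").size());
    }
}
```

Because a predecessor must already exist when a record is added, the chain cannot contain cycles and the walk in `getEvidenceRecordHistory` always terminates.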
How to integrate HAPPI
// HAPPI imports are omitted: the package names are not given on the slide
import java.util.Date;

// obtain the IntellectualEntityManager
IntellectualEntityManager iemanager =
    ManagerFactory.getInstance().getIntellectualEntityManager();

// create the reference (organisation, type, value)
Reference sampleRef = new Reference("Piktochart", "URI",
    "https://magic.piktochart.com/output/3098625-untitled-report");

/**
 * create the intellectual entity, which is composed of
 * reference, label, title, annotation and date of creation.
 */
IntellectualEntity ie1 = new IntellectualEntity(sampleRef, null,
    "HAPPI Infographics", "SCIDIP-ES HAPPI Infographics", new Date());

// add the intellectual entity through iemanager
iemanager.addIntellectualEntity(ie1);
http://sourceforge.net/p/digitalpreserve/code/HEAD/tree/SCIDIP-ES/
How to integrate HAPPI
// obtain the EvidenceHistoryManager
EvidenceHistoryManager ehmanager =
    ManagerFactory.getInstance().getEvidenceHistoryManager();

// get the evidence history of the intellectual entity
EvidenceHistory eh1 = ie1.getEvidenceHistory();

/**
 * create the first evidence record with sample data, by
 * using the buildSampleRecord utility method.
 */
EvidenceRecord er1 = buildSampleRecord("Luigi Briguglio", "capture", "origin", "er1", null);

// add the evidence record to its history
eh1.addEvidenceRecord(er1);
ehmanager.addEvidenceRecord(er1, eh1);

// add a second record to the history, after the first one
EvidenceRecord er2 = buildSampleRecord("Luigi Briguglio", "ingestion", "submitted", "er2", er1);
eh1.addEvidenceRecord(er2);
ehmanager.addEvidenceRecord(er2, eh1);
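The slide does not show the buildSampleRecord helper. A possible shape for it is sketched below, assuming that the five arguments are the agent name, the transformation type, the type of the resulting representation, the record label and the preceding record; the simplified EvidenceRecord class and all field names are hypothetical.

```java
// Hypothetical reconstruction of the helper used on the slide; not the HAPPI source.
public class BuildSampleRecordSketch {

    /** Simplified stand-in for HAPPI's EvidenceRecord. */
    static final class EvidenceRecord {
        String label;
        String agentName;
        String transformationType;
        String representationType;
        EvidenceRecord previous; // null for the first record in a history
    }

    /** Fills a record with sample data; the argument meanings are assumptions. */
    static EvidenceRecord buildSampleRecord(String agentName, String transformationType,
            String representationType, String label, EvidenceRecord previous) {
        EvidenceRecord er = new EvidenceRecord();
        er.agentName = agentName;
        er.transformationType = transformationType;
        er.representationType = representationType;
        er.label = label;
        er.previous = previous;
        return er;
    }

    public static void main(String[] args) {
        EvidenceRecord er1 = buildSampleRecord("Luigi Briguglio", "capture", "origin", "er1", null);
        EvidenceRecord er2 = buildSampleRecord("Luigi Briguglio", "ingestion", "submitted", "er2", er1);
        System.out.println(er2.label + " follows " + er2.previous.label); // prints "er2 follows er1"
    }
}
```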
Current Status
• HAPPI 1.5.0 instances are running for validation in
Current Status
• “A Modular Infrastructure for the Management of Authenticity and Persistent Identifiers in Long Term Digital Preservation Repositories”, Int. J. of Knowledge and Learning, forthcoming, http://www.inderscience.com/info/ingeneral/forthcoming.php?jcode=ijkl
• Thesis: “Analisi Progettazione e Sviluppo di un Prototipo per la Gestione della Provenienza nel Processo di Conservazione Digitale” (Analysis, Design and Development of a Prototype for Provenance Management in the Digital Preservation Process), Tor Vergata Univ., October 2013
• “Modelling Data Value in Digital Preservation”, iPRES2013 Conference Proceedings, September 2013, http://purl.pt/24107/1/iPres2013_PDF/Modelling%20Data%20Value%20in%20Digital%20Preservation.pdf
• “Preserving Authenticity Evidence to Assess Provenance and Integrity of Digital Resources”, ECLAP 2013 Conference Proceedings, LNCS issue no. 7990, April 2013, http://link.springer.com/chapter/10.1007%2F978-3-642-40050-6_7
Next Steps
• Extend the predefined types of transformations (external transformation, e.g. extraction and aggregation)
• Add integrity checks on the provenance graph
• Add management of distributed repositories
• Improve performance
• Extend the model to support «Context Information» and «Rights Management» (OAIS:PDI)
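As an illustration of what an integrity check on a provenance chain could look like, the sketch below links each entry to its predecessor with a SHA-256 digest, so that any later change to an earlier entry is detected. This is one common technique chosen for illustration, not the design planned for HAPPI; all names are hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical hash-chained provenance log; not part of HAPPI.
public class ProvenanceChainSketch {

    static final class Entry {
        String payload;      // non-final so the demo can simulate tampering
        final String digest; // SHA-256 over (previous digest + "|" + payload)
        Entry(String payload, String digest) { this.payload = payload; this.digest = digest; }
    }

    static String sha256Hex(String text) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] hash = md.digest(text.getBytes(StandardCharsets.UTF_8));
            StringBuilder sb = new StringBuilder();
            for (byte b : hash) sb.append(String.format("%02x", b));
            return sb.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 must be supported by every Java platform implementation
            throw new IllegalStateException(e);
        }
    }

    /** Appends an entry whose digest also covers the previous entry's digest. */
    static void append(List<Entry> chain, String payload) {
        String prev = chain.isEmpty() ? "" : chain.get(chain.size() - 1).digest;
        chain.add(new Entry(payload, sha256Hex(prev + "|" + payload)));
    }

    /** Recomputes every digest; returns false at the first mismatch. */
    static boolean verify(List<Entry> chain) {
        String prev = "";
        for (Entry e : chain) {
            if (!e.digest.equals(sha256Hex(prev + "|" + e.payload))) return false;
            prev = e.digest;
        }
        return true;
    }

    public static void main(String[] args) {
        List<Entry> chain = new ArrayList<>();
        append(chain, "er1: capture");
        append(chain, "er2: ingestion");
        System.out.println("intact: " + verify(chain));
        chain.get(0).payload = "er1: migration"; // simulate tampering
        System.out.println("after tampering: " + verify(chain));
    }
}
```

A linear chain is the simplest case; a provenance graph with branches would need each node's digest to cover all of its predecessors.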
Next Steps: Extending to Context and Rights
• Add relationships to other digital objects, in order to document an object's context
• Document rights and their evolution
[Figure: OAIS:PDI components (Provenance, Reference, Context, Fixity, Rights) and EH]