toward semantic representation of science in electronic laboratory notebooks (elns)
TRANSCRIPT
![Page 1: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/1.jpg)
Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)
Stuart J. Chalk
Department of Chemistry, University of North Florida
CINF Paper 50 – 251st ACS Meeting Spring 2016
#ACSCINFDataSummit
![Page 2: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/2.jpg)
- Utopia: A Global Research Network
- What is an Electronic Notebook?
- The Semantics of Semantics
- What Needs to be Semantically Represented?
- Current lay of the land
  - ELN Item Manifest
  - P-PLAN Ontology
  - VIVO-ISF Ontology
  - Chemical Analysis Metadata Platform
  - HCLS Community Profiles
  - Electronic Notebook Ontology
  - A generic scientific data model
  - Experimental information for LD (ExptLD)
- Take Home
- Conclusion

Outline
![Page 3: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/3.jpg)
- “Big Data” and the “Semantic Web” are the current buzzwords du jour, but what do they mean for chemistry?
  - Lots of heterogeneous data and metadata, with even more “semantic” data to represent it
- Look at what we want rather than what we have… We want chemical data that is:
  - Easy to share, find, and compare
  - Freely available but with provenance
  - Globally sourced and without IP restrictions on reuse

Utopia: A Global Research Network
![Page 4: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/4.jpg)
- An electronic way to record data…
  - …equivalent to a laboratory notebook
- But ELNs should not be thought of so lowly…
- An ELN must:*
  - Keep track of research data
  - Reference resources used in research and…
  - …capture the story of research

What is an Electronic Laboratory Notebook?

* Insight from Tony Williams
![Page 5: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/5.jpg)
- The interface should mirror a laboratory notebook
- Behind the scenes, though, it should use state-of-the-art software, data formats, data/metadata practices, and web technologies to manage data generation, workflows, remote data access, authentication, etc.
- As a result, it needs to speak the same language as other data sources and store data in a format that others can read and reuse
- Foundational building block of a Global Research Network

What should an ELN be?
![Page 6: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/6.jpg)
- Semantics is the study of meaning
  - We need to give meaning to what is created in an ELN
- Described in computers using the Resource Description Framework (RDF), which:
  - Makes statements about objects…
  - …and their relationships to other objects…
  - …using subject-predicate-object “triples”
- RDF allows knowledge representation
- Meaning is represented by using one or more ontologies

The Semantics of Semantics
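The subject-predicate-object idea can be sketched in a few lines of Python. This is only an illustration: the `http://example.org/eln/` namespace and the `recordedBy` and `title` terms are hypothetical, not taken from any published ontology, and the serializer escapes only backslashes and double quotes.

```python
from typing import NamedTuple

# Hypothetical namespace for ELN terms, used for illustration only.
EX = "http://example.org/eln/"


class Triple(NamedTuple):
    subject: str             # IRI of the thing being described
    predicate: str           # IRI naming the relationship or property
    obj: str                 # an IRI or a plain string literal
    obj_is_iri: bool = True  # False when obj is a literal value


def to_ntriples(triple: Triple) -> str:
    """Serialize one triple as an N-Triples line (minimal escaping only)."""
    if triple.obj_is_iri:
        obj = f"<{triple.obj}>"
    else:
        escaped = triple.obj.replace("\\", "\\\\").replace('"', '\\"')
        obj = f'"{escaped}"'
    return f"<{triple.subject}> <{triple.predicate}> {obj} ."


statements = [
    # An experiment related to another object (a person) ...
    Triple(EX + "experiment/42", EX + "recordedBy", EX + "person/1"),
    # ... and the same experiment described by a literal value.
    Triple(EX + "experiment/42", EX + "title", "pH titration", obj_is_iri=False),
]

for statement in statements:
    print(to_ntriples(statement))
```

Each printed line is one statement; an ontology would supply agreed meanings for the predicate IRIs in place of the made-up ones here.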
![Page 7: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/7.jpg)
RDF in JSON-LD

    {
      "@context": {
        "name": "http://schema.org/name",
        "isAlive": "http://example.org/isAlive",
        "age": "http://example.org/age",
        "height": "http://schema.org/height",
        "@base": "http://www.unf.edu/chemistry/stuart_chalk.aspx"
      },
      "@id": "",
      "name": "Stuart Chalk",
      "isAlive": true,
      "age": 49,
      "height": 188.0
    }

http://json-ld.org/playground/
![Page 8: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/8.jpg)
    <http://www.unf.edu/chemistry/stuart_chalk.aspx> <http://example.org/age> "49"^^<http://www.w3.org/2001/XMLSchema#integer> .
    <http://www.unf.edu/chemistry/stuart_chalk.aspx> <http://example.org/isAlive> "true"^^<http://www.w3.org/2001/XMLSchema#boolean> .
    <http://www.unf.edu/chemistry/stuart_chalk.aspx> <http://schema.org/height> "188"^^<http://www.w3.org/2001/XMLSchema#integer> .
    <http://www.unf.edu/chemistry/stuart_chalk.aspx> <http://schema.org/name> "Stuart Chalk" .

RDF in JSON-LD
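To make the mapping from the JSON-LD document to the triples concrete, here is a minimal sketch in Python. It is not a JSON-LD processor: it assumes a single flat node, a context that maps each key directly to an IRI, and an empty `@id` that resolves to `@base`. The function names are made up for this example; a real conversion would use a full JSON-LD library or the playground linked on the earlier slide.

```python
import json

XSD = "http://www.w3.org/2001/XMLSchema#"

DOCUMENT = json.loads("""
{
  "@context": {
    "name": "http://schema.org/name",
    "isAlive": "http://example.org/isAlive",
    "age": "http://example.org/age",
    "height": "http://schema.org/height",
    "@base": "http://www.unf.edu/chemistry/stuart_chalk.aspx"
  },
  "@id": "",
  "name": "Stuart Chalk",
  "isAlive": true,
  "age": 49,
  "height": 188.0
}
""")


def literal(value):
    """Render a JSON value as an N-Triples literal (simple cases only)."""
    # bool is tested first because bool is a subclass of int in Python.
    if isinstance(value, bool):
        return f'"{str(value).lower()}"^^<{XSD}boolean>'
    # Numbers with no fractional part become xsd:integer, so 188.0 -> "188".
    if isinstance(value, int) or (isinstance(value, float) and value.is_integer()):
        return f'"{int(value)}"^^<{XSD}integer>'
    if isinstance(value, str):
        return json.dumps(value)  # quotes and escapes the string
    raise NotImplementedError(f"value type not covered by this sketch: {value!r}")


def flat_jsonld_to_ntriples(doc):
    """Turn one flat JSON-LD node into sorted N-Triples lines."""
    context = doc["@context"]
    subject = doc["@id"] or context["@base"]  # empty @id resolves to @base
    lines = [
        f"<{subject}> <{context[key]}> {literal(value)} ."
        for key, value in doc.items()
        if not key.startswith("@")
    ]
    return sorted(lines)


for line in flat_jsonld_to_ntriples(DOCUMENT):
    print(line)
```

The context does the semantic work: it replaces the short keys `name`, `age`, and so on with globally unique predicate IRIs, which is why the same JSON can be read as plain data or as RDF.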
![Page 9: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/9.jpg)
Everything!
What areas?
- Data, Results and Resources
- Models, Tools for Data Workup (Equations, Tests, Stats)
- General Workflows (Protocols and Procedures)
- The Research Story (What, Why, How)
- User discussion and annotation
- ELN usage timeline
- The Science (Area, Hypotheses, Theories)
- The People (Expertise, Provenance, Integrity, Eminence)

What Needs to be Semantically Represented?
![Page 10: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/10.jpg)
![Page 11: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/11.jpg)
The P-PLAN Ontology
http://purl.org/net/p-plan
Workflows
Implement in Kepler, Taverna, KNIME?
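As a rough illustration of how a laboratory protocol might be described with P-PLAN, the sketch below records a three-step plan as triples and recovers the step order. It assumes the P-PLAN terms `Plan`, `Step`, `isStepOfPlan`, and `isPrecededBy` under the `http://purl.org/net/p-plan#` namespace. The `http://example.org/eln/` IRIs and the titration steps are hypothetical, and the ordering function assumes a simple linear chain of steps.

```python
PPLAN = "http://purl.org/net/p-plan#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.org/eln/"  # hypothetical namespace for this example

PLAN = EX + "protocol/titration"
WEIGH, DISSOLVE, TITRATE = (EX + "step/" + name for name in ("weigh", "dissolve", "titrate"))

# Each entry is a (subject, predicate, object) triple.
triples = {
    (PLAN, RDF_TYPE, PPLAN + "Plan"),
    (WEIGH, RDF_TYPE, PPLAN + "Step"),
    (DISSOLVE, RDF_TYPE, PPLAN + "Step"),
    (TITRATE, RDF_TYPE, PPLAN + "Step"),
    (WEIGH, PPLAN + "isStepOfPlan", PLAN),
    (DISSOLVE, PPLAN + "isStepOfPlan", PLAN),
    (TITRATE, PPLAN + "isStepOfPlan", PLAN),
    (DISSOLVE, PPLAN + "isPrecededBy", WEIGH),
    (TITRATE, PPLAN + "isPrecededBy", DISSOLVE),
}


def ordered_steps(graph, plan):
    """Return the steps of a plan in execution order (linear plans only)."""
    steps = {s for s, p, o in graph if p == PPLAN + "isStepOfPlan" and o == plan}
    # Map each step to the step that follows it.
    followed_by = {
        o: s for s, p, o in graph
        if p == PPLAN + "isPrecededBy" and s in steps
    }
    preceded = set(followed_by.values())
    # The first step is the one that nothing precedes.
    current = next(step for step in steps if step not in preceded)
    order = []
    while current is not None:
        order.append(current)
        current = followed_by.get(current)
    return order


for step in ordered_steps(triples, PLAN):
    print(step)
```

A workflow engine would need far more detail (inputs, outputs, parameters), but even this small graph is enough to reconstruct the sequence of a protocol from its description.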
![Page 12: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/12.jpg)
People: The VIVO-ISF Ontology
https://wiki.duraspace.org/download/attachments/51052811/PeopleOrgsRolesGrants.2014-03-14.png
![Page 13: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/13.jpg)
The Chemical Analysis Metadata Platform (ChAMP)
http://champ-project.org/
- Identification of metadata related to chemical analysis and definition of an ontology to describe terms
- Examples in both XML and JSON-LD with associated XML Schema and JSON-LD context
  - Journal Article
  - Standard Method of Analysis
  - Reference Material

The Science: ChAMP (an example)
![Page 14: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/14.jpg)
ChAMP
![Page 15: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/15.jpg)
ChAMP
![Page 16: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/16.jpg)
The Healthcare and Life Science (HCLS) Community Profile is a Note from the Semantic Web HCLS Interest Group:

“Access to consistent, high-quality metadata is critical to finding, understanding, and reusing scientific data. This document describes a consensus among participating stakeholders in the Health Care and the Life Sciences domain on the description of datasets using the Resource Description Framework (RDF). This specification meets key functional requirements, reuses existing vocabularies to the extent that it is possible, and addresses elements of data description, versioning, provenance, discovery, exchange, query, and retrieval.”

Data Descriptions: HCLS Community Profile
http://www.w3.org/TR/hcls-dataset/
![Page 17: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/17.jpg)
Describes three levels for description of datasets
- Summary Level
  - Type declaration (rdf:type = dctypes:Dataset)
  - Title (dct:title = rdf:langString)
  - Description (dct:description = rdf:langString)
  - Publisher (dct:publisher = IRI)
- Version Level
  - Type declaration (rdf:type = dctypes:Dataset)
  - Title (dct:title = rdf:langString)
  - Description (dct:description = rdf:langString)
  - Creator (dct:creator = IRI)
  - Publisher (dct:publisher = IRI)
  - Version identifier (pav:version = xsd:string)
  - Version linking (dct:isVersionOf = IRI)
- Distribution Level
  - Type declaration (rdf:type = void:Dataset OR dcat:Distribution)
  - Title (dct:title = rdf:langString)
  - Description (dct:description = rdf:langString)
  - Creator (dct:creator = IRI)
  - Publisher (dct:publisher = IRI)
  - License (dct:license = IRI)

Data Descriptions: HCLS Community Profile
http://www.w3.org/TR/hcls-dataset/#datasetdescriptionlevels
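One way to use the three levels in an ELN is to check a dataset description against the properties listed for its level. The sketch below does this in Python with plain dictionaries keyed by prefixed property names. The property lists are the ones named on this slide (with the license expressed as `dct:license`); the HCLS note itself grades each property as MUST, SHOULD, or MAY, which this sketch does not model. The dataset title, description, and `http://example.org/` publisher IRI are invented for illustration.

```python
# Properties listed on the slide for each HCLS description level.
LEVEL_PROPERTIES = {
    "summary": [
        "rdf:type", "dct:title", "dct:description", "dct:publisher",
    ],
    "version": [
        "rdf:type", "dct:title", "dct:description", "dct:creator",
        "dct:publisher", "pav:version", "dct:isVersionOf",
    ],
    "distribution": [
        "rdf:type", "dct:title", "dct:description", "dct:creator",
        "dct:publisher", "dct:license",
    ],
}


def missing_properties(description, level):
    """List the properties for `level` that the description lacks."""
    return [prop for prop in LEVEL_PROPERTIES[level] if prop not in description]


# Hypothetical summary-level description of a dataset held in an ELN.
summary = {
    "rdf:type": "dctypes:Dataset",
    "dct:title": "Titration results",
    "dct:description": "Acid-base titration data recorded in an ELN.",
    "dct:publisher": "http://example.org/org/chemistry-lab",
}

print(missing_properties(summary, "summary"))   # nothing missing at this level
print(missing_properties(summary, "version"))   # what a version-level record would still need
```

The same description passes at the summary level but is incomplete at the version level, which shows how the levels build on one another.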
![Page 18: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/18.jpg)
Electronic Notebook Ontology (ENO)
![Page 19: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/19.jpg)
ENO
![Page 20: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/20.jpg)
ENO
![Page 21: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/21.jpg)
- Use a Generic Scientific Data Model
- Captures data and metadata about datasets and links to related data
- JSON-LD is an ideal file format

Data and Resources
![Page 22: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/22.jpg)
A specification (written in XML) that describes different data types of information recorded during the scientific process (http://exptml.sourceforge.net)
- Annotation, Api, Calculation, Chemical, Citation, Communication, Customer, Data, Dataset, Definition
- Element, Equipment, Event, Experiment, Group, Project, Protocol, Quote, Report, Result
- Sample, Solution, Space, Specimen, Substance, Task, Template, Timeline, User, Vendor

Experiment Markup Language (ExptML)
![Page 23: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/23.jpg)
Experimental Linked Data (ExptLD)
- Define data packets that capture the metadata of
  - Resources
  - Data
- Integrate with other ExptLD packets to create a SciData document
- Or convert to RDF and store in a triplestore
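The packet idea can be sketched as follows. This is a guess at the general shape only, not the actual ExptLD or SciData schema: the `ex:` vocabulary, the packet contents, and the function names are all hypothetical. Two packets (one resource, one data) are combined into a single JSON-LD document using `@graph`, and a small check confirms that every link between packets points at a packet that is present.

```python
import json

# Hypothetical vocabulary prefix; a real system would use published ontologies.
CONTEXT = {"ex": "http://example.org/exptld/"}

resource_packet = {
    "@id": "ex:resource/ph-meter-1",
    "@type": "ex:Equipment",
    "ex:name": "pH meter",
}

data_packet = {
    "@id": "ex:data/run-7",
    "@type": "ex:Dataset",
    "ex:usedResource": {"@id": "ex:resource/ph-meter-1"},  # link to the resource
    "ex:value": 7.02,
}


def combine_packets(packets, context):
    """Wrap independent packets in one JSON-LD document."""
    return {"@context": context, "@graph": list(packets)}


def unresolved_links(document):
    """Return @id references that do not match any packet in the document."""
    known = {node["@id"] for node in document["@graph"]}
    links = [
        value["@id"]
        for node in document["@graph"]
        for value in node.values()
        if isinstance(value, dict) and "@id" in value
    ]
    return [link for link in links if link not in known]


document = combine_packets([resource_packet, data_packet], CONTEXT)
print(json.dumps(document, indent=2))
print(unresolved_links(document))
```

Because the combined document is still JSON-LD, it could be handed to a JSON-LD processor to produce RDF for a triplestore, which is the alternative route named on the slide.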
![Page 24: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/24.jpg)
- A lot exists to semantically represent the scientific process that can be leveraged as part of an ELN system
- A data standard needs to be agreed upon
- Agreeing on implementation standards will take time because of the size of the user community
- Integration and coverage of ontologies will be necessary to fully implement a system that underpins a Global Research Network
- Domain-specific knowledge representation is needed in many areas

Take Home
![Page 25: Toward Semantic Representation of Science in Electronic Laboratory Notebooks (ELNs)](https://reader035.vdocuments.mx/reader035/viewer/2022062412/58ac4d341a28ab99028b6223/html5/thumbnails/25.jpg)
- [email protected]
- Phone: 904-620-1938
- Skype: stuartchalk
- LinkedIn/SlideShare: https://www.linkedin.com/in/stuchalk
- ORCID: http://orcid.org/0000-0002-0703-7776
- ResearcherID: http://www.researcherid.com/rid/D-8577-2013

Questions?