data and model management for systems biology

17
Data and Model Management for Systems Biology Dagmar Waltemath http://sems.uni-rostock.de Slides available from www.slideshare.net/dagwa/

Upload: university-of-rostock

Post on 12-Jan-2017

314 views

Category:

Science


1 download

TRANSCRIPT

Page 1: Data and Model Management  for Systems Biology

Data and Model Management for Systems Biology

Dagmar Waltemathhttp://sems.uni-rostock.de

Slides available from www.slideshare.net/dagwa/

Page 2: Data and Model Management  for Systems Biology

Developing a computational model

… may take a long time

… may involve different research groups

… may involve different tools

… may involve different modeling approaches

… may lead to different versions of a model

… will result in a bunch of files

… will result in a publication

Page 3: Data and Model Management  for Systems Biology

Journals ask for model code

The Physiome Model Repositoryhttps://models.cellml.org/ Yu et al (2011)

BioModelshttp://www.ebi.ac.uk/biomodels-main/ Li et al (2010)

Page 4: Data and Model Management  for Systems Biology

Journals ask for model code

Fig.: Curation and publication processes in BioModels (Figure courtesy Vijayalakshmi Chelliah)

Page 5: Data and Model Management  for Systems Biology

Journals ask for model code

Fig.: Curation and publication processes in BioModels (Figure courtesy Vijayalakshmi Chelliah)

Manage your data throughout(collaborative) projects

Prepare your model-related data for publication

Archive and documentmodel-related data

Get information about a model's evolution

Extract all data related to a modeling result

Find relevant models

Publish Reproduce & Reuse

Page 6: Data and Model Management  for Systems Biology

What data belongs to a model?

Figure courtesy M. Scharm

Page 7: Data and Model Management  for Systems Biology

What data belongs to a model?

Figure adapted from M. Scharm

Page 8: Data and Model Management  for Systems Biology

What data belongs to a model?

● the model(s) themselves● the semantic annotations describing the model and its

components● the simulation setups and model parametrisation● experimental data used to feed the model● result data (tables, figures)● reference publication

+ links between these files

See also: Henkel et al (2015) Combining computational models, … DATABASE

Page 9: Data and Model Management  for Systems Biology

Manage your data

Personal DataLocal Stores

ExternalDatabases

Articles

Standards

SOPs→ Hands on

Slide courtesy Wolgang Müller. See also: Wolstencraft et al (2015) BMC Systems Biology

Page 10: Data and Model Management  for Systems Biology

Prepare for publication

See also: Waltemath et al (2013) Reproducibility of model-based results in systems biology (Springer)

● Export your model in a standard format.

● Annotate model components with links to bio-ontologies.

– Use Gene Ontology, ChEBI etc for entities

– Use SBO for mathematical terms

– Use TEDDY for behavior

– Provide provenance information (model origin, author, creator, modification dates, software)

– Link to original publication

Page 11: Data and Model Management  for Systems Biology

See also: Waltemath et al (2013) Reproducibility of model-based results in systems biology (Springer)

● Provide the simulation recipe for each analysis you describe in the paper.– Simulation setup

– Model parametrisation

– Algorithm used in analysis (Software)

– Output

● Store the simulation setup in standard format.● Try to reproduce the results from the information intended to publish. ● Ask a colleague to try and reproduce the results.

– Every piece of information necessary for that person to reproduce the findings should be included in the supplementary information.

● Pack all necessary data files into an archive format and submit to a public database to receive a URI.

Prepare for publication

Page 12: Data and Model Management  for Systems Biology

Archive your results

See also: Bergmann et al (2014) COMBINE archive and OMEX format. BMC Bioinformatics

● COMBINE Archive

– Zip-like format to bundle all model related files in one place

– Eases the upload of a modeling study to BioModels

– Tells curators what to do with the files (manifest)

– Eases exchange of files with collaborators

→ Hands on

Page 13: Data and Model Management  for Systems Biology

Find models of interest

The safe way The experimental way– Use MaSyMoS to search for

models (through Cypher queries or M2CAT)

– Try ranked retrieval in PMR2

Figure courtesy Vijayalakshmi Chelliah

→ BioModels hands on

Page 14: Data and Model Management  for Systems Biology

Extract reproduciblesimulation experiments

The safe way– Use SEEK for

model management

– Export Research Objects

The experimental way– Create archives with the

COMBINE Archive Toolkit

– Extract archives from MaSyMoS using M2Cat

internet

internet

internet

SEARCHubiquitin

internet

RESULTSEXPORT

EXPORT

EXPORT

EXPORT

Query database for annotations, persons, simulation descriptions

Retrieve information about models, simulations, figures, documentation

Export simulation study as COMBINE archive

Download archive and open the study with your favourite simulation tool

Open archive in CAT to modify its contents and to share it with others

internet

API Commincations enrich your studies with simulation results

Simulate a Study with just a single click

Fig.: Waltemath and Scharm (2015) BTWFigure taken from http://researchobjects.org

Page 15: Data and Model Management  for Systems Biology

Get information about the model's evolution

The safe way– Use SEEK to look at

model versions

– Check releases of BioModels or exposures in PMR2

The experimental way– Compare model versions

using BiVeS

Fig.: Visualising differences between model versions in BiVeS. See also Scharm et al (2015) BIOINFORMATICS.

Fig.: Example of a model's history in PMR2.

Page 16: Data and Model Management  for Systems Biology

SEEK: Data and model management out of the box

● Questions we'll be answering in the afternoon hands on:– What kind of data can SEEK manage?

– How can we manage and simulate models in SEEK?

– What is the ISA-structure?

– How can we share models inside SEEK and how can we export them?

● Please keep the files you generate throughout the SABIO-RK, COPASI, and SYCAMORE tutorials

● Please register at our sandbox version of SEEK: https://denbi-school.fairdomhub.org/projects/3

Page 17: Data and Model Management  for Systems Biology

Thank you for your attention.

● See you later → Please register with SEEK at https://denbi-school.fairdomhub.org/projects/3

● Else → Check out http://sems.uni-rostock.de for further information on how to make your research reproducible.