building big data in food science

23
Building Big Data in Food Science Jan Top COMMIT /

Upload: surf

Post on 12-May-2015

308 views

Category:

Documents


6 download

DESCRIPTION

Presentation SURF Research and Innovation Event 2013 February 28, The Hague University of Applied Sciences Jan Top is Senior Scientist at Wageningen UR and Professor at VU University Amsterdam.

TRANSCRIPT

Page 1: Building Big Data in food science

Building Big Data in Food Science

Jan Top

COMMIT/

Page 2: Building Big Data in food science

Food data?

Page 3: Building Big Data in food science
Page 4: Building Big Data in food science

Data in Food Research

Health effects

Sensory effects

Physical properties

Genomics, metabolomics

Sustainable production

How to get high-quality, multidisciplinary, multi-location data in the first place?

Page 5: Building Big Data in food science

Traditional lab notes

Basically unstructured pages

Personal way of working

Chronologic, no erasing

Enable replication

Page 6: Building Big Data in food science

Datasets are scattered, hard to find, understand and combine

Emphasis on data processing

Structured registration of methods, materials, data and observations is part of good science

Modern lab notes

Page 7: Building Big Data in food science

New approach?

Page 8: Building Big Data in food science

Three lines of support

Research workflow - Tiffany

Linking data - Rosanne

Vocabularies – ROC+

Page 9: Building Big Data in food science

Three lines of support

Research workflow - Tiffany

Linking data - Rosanne

Vocabularies – ROC+

Page 10: Building Big Data in food science

Research output structured

Objectives

Activities

Products● Materials

● Methods

● Devices

● Data

● Models

● People

● ...

Page 11: Building Big Data in food science

method

device

e-note

person

method

dataset

person

paper

datasetmodel

presentation

material

my experiment

statistical analysis

conference

Network of activities

Page 12: Building Big Data in food science

Tiffany

Page 13: Building Big Data in food science
Page 14: Building Big Data in food science

Theme Council KM Platform June 16, 2011

Page 15: Building Big Data in food science

Three lines of support

Research workflow - Tiffany

Linking data - Rosanne

Vocabularies – ROC+

Page 16: Building Big Data in food science
Page 17: Building Big Data in food science

Rosanne

Page 18: Building Big Data in food science

Rosanne

Manual annotation

Heuristic annotation

RDF export

SPARQL-based selection and integration

Scientific Table: proposed addition to SDMX and RDF DataCube

Page 19: Building Big Data in food science

Three lines of support

Research workflow - Tiffany

Linking data - Rosanne

Vocabularies – ROC+

Page 20: Building Big Data in food science

Creating ontologies

Ontologies can be created from scratch → very time consuming

Ontologies can be downloaded → not optimally tuned to the application at hand

ROC+ allows domain experts to define an application-specific vocabulary by:

(i) Getting suggestions from existing ontologies

(ii) Getting suggestions from corpora

(iii) Structuring the identified terms

Page 21: Building Big Data in food science

ROC+ recipe

Start with a few characteristic terms

Add related terms from the suggestions

Iterate as long as you think is useful

Structure the terms

●Broader or narrower

●Synonym

●Related

Page 22: Building Big Data in food science

In a few words...

Food data available but scattered

Workflow approach puts scientific food data into context

Annotated tables support interpretation, selection and integration

Develop application-specific vocabularies

http://www.afsg.nl/InformationManagement/

Page 23: Building Big Data in food science

QUESTIONS, IDEAS?

COMMIT/