MAGE-OM and ArrayExpress database model

Ugis Sarkans, EBI

Outline

• what is MAGE-OM

• what is ArrayExpress

• what language is used for modeling

• MAGE-OM structure

• ArrayExpress status and future

• MAGE future developments

MAGE-OM

• MicroArray Gene Expression Object Model

– also: MAGE-ML (.. Markup Language), MAGE-STK (.. Software ToolKit)

• Merging of MAML (MicroArray Markup Language) and GEML (Gene Expression Markup Language)

MAGE: brief history

• December 2000 - initial submissions of proposals to OMG (Object Management Group):

– EBI (on behalf of MGED) - MAML

– Rosetta (on behalf of GEML community) - GEML + some IDLs

– NetGenics - IDLs

• Decision to proceed with a joint submission

• Decision to comply with Model Driven Architecture (MDA) principles

• October 2001 - joint submission to OMG (Rosetta and MGED)

Model Driven Architecture

• Platform Independent Model (UML)

– most of the time spent on this

• Platform Specific Models

– XML

• UML (refined from PIM)

• DTD (generated plus hand modifications)

– CORBA (not for MAGE)

• UML (refined from PIM)

• IDL (hopefully generated)

– ….
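
As a rough illustration of the "generated" step, a platform-specific artifact such as a DTD can be derived mechanically from a platform-independent description of classes. The sketch below assumes a toy model: the dictionary `PIM`, its class and attribute names, and the function `generate_dtd` are all hypothetical and far simpler than MAGE-OM. It is not the generator used for MAGE-ML.

```python
# Toy platform-independent model: class name -> attributes and child classes.
# All names here are illustrative assumptions, not real MAGE-OM content.
PIM = {
    "Experiment": {"attributes": ["identifier", "name"], "children": ["BioAssay"]},
    "BioAssay": {"attributes": ["identifier"], "children": []},
}


def generate_dtd(model):
    """Emit one ELEMENT and one ATTLIST declaration per class in the model."""
    lines = []
    for cls, spec in model.items():
        # Child classes become zero-or-more nested elements; no children -> EMPTY.
        if spec["children"]:
            content = "(" + ", ".join(child + "*" for child in spec["children"]) + ")"
        else:
            content = "EMPTY"
        lines.append(f"<!ELEMENT {cls} {content}>")
        # Every attribute is emitted as an optional character-data attribute.
        attrs = " ".join(f"{a} CDATA #IMPLIED" for a in spec["attributes"])
        lines.append(f"<!ATTLIST {cls} {attrs}>")
    return "\n".join(lines)


dtd = generate_dtd(PIM)
print(dtd)
```

Hand modifications, as mentioned on the slide, would then be applied to the generated text rather than to the model.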

ArrayExpress

• first version (object model) - 1999, in collaboration with German Cancer Research Centre (DKFZ)

• second version (object model) - end of 2000, prototype development funded by Incyte

ArrayExpress (2)

• implementation - first half of 2001 - Oracle schema, data loader (from MAML), prototype Web interface, a few datasets loaded

• decision to use MAGE-OM as basis for further development

• EU funding - 2002-2004, 8 new positions

ArrayExpress - features

• MIAME-compliant

• able to import MAML (MAGE-ML) formatted data

• can deal with both raw and processed data

• independence of:

– experimental platforms

– image analysis methods

– data normalization methods

• object model-based query mechanism

• supports upcoming OMG standard for expression data

Unified Modeling Language

• graphical language for describing software systems (and more ..)

• notation - yes

• methodology - no

UML diagram types

• class

• state

• collaboration

• sequence

• ……..

State diagram

Sequence diagram

Collaboration diagram

Class diagram

Class diagrams - notation

• classes

• attributes

– types

• operations

• relationships

– subclass relationship

– aggregate relationship

– association

• role names

• cardinalities

• navigation
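
To connect the notation to code, the sketch below shows one way the listed elements could look in a programming language. The classes `Contact`, `Person`, `Spot` and `Array` are hypothetical examples chosen for illustration; they are not taken from MAGE-OM.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Contact:
    # A class with typed attributes.
    name: str
    email: Optional[str] = None


@dataclass
class Person(Contact):
    # Subclass relationship: Person inherits the attributes of Contact.
    affiliation: str = ""


@dataclass
class Spot:
    row: int
    column: int


@dataclass
class Array:
    identifier: str
    # Aggregate relationship: an Array is made up of 0..N Spot objects.
    spots: List[Spot] = field(default_factory=list)
    # Association with role name "provider" and cardinality 0..1.
    # Navigation is one-way: an Array knows its provider, not the reverse.
    provider: Optional[Contact] = None

    def spot_count(self) -> int:
        # An operation defined on the class.
        return len(self.spots)


array = Array(
    "A-1",
    spots=[Spot(0, 0), Spot(0, 1)],
    provider=Person("J. Doe", affiliation="Some Lab"),
)
print(array.spot_count(), array.provider.name)
```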

[Figure: class diagram annotated with the notation elements: class, class from another package, attribute, aggregation, navigation, role name, cardinality, association name, inheritance]

Class diagram

Implementation issues

• Java, C++ - “easy”

• relational databases

– classes - tables

– 1:1, 1:N - foreign key

– N:M - table

– subclass relations

• all subclasses in the same table

• separate table for superclass and subclasses

• XML
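
The relational mapping rules above can be sketched with an in-memory SQLite database. The table and column names, and the choice of which relationship is 1:N or N:M, are assumptions made for illustration only; they do not reproduce the ArrayExpress schema. The subclass example uses the first option from the slide (all subclasses in the same table).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- class -> table
CREATE TABLE experiment (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL
);

-- 1:N association -> foreign key on the "many" side
CREATE TABLE bioassay (
    id            INTEGER PRIMARY KEY,
    name          TEXT NOT NULL,
    experiment_id INTEGER NOT NULL REFERENCES experiment(id)
);

-- subclass relations, option 1: all subclasses in one table,
-- with a discriminator column and nullable subclass-specific columns
CREATE TABLE design_element (
    id       INTEGER PRIMARY KEY,
    kind     TEXT NOT NULL CHECK (kind IN ('feature', 'reporter')),
    row_no   INTEGER,  -- used only by 'feature' rows
    sequence TEXT      -- used only by 'reporter' rows
);

-- N:M association -> separate link table
CREATE TABLE bioassay_design_element (
    bioassay_id       INTEGER NOT NULL REFERENCES bioassay(id),
    design_element_id INTEGER NOT NULL REFERENCES design_element(id),
    PRIMARY KEY (bioassay_id, design_element_id)
);
""")

conn.execute("INSERT INTO experiment (id, name) VALUES (1, 'demo experiment')")
conn.executemany(
    "INSERT INTO bioassay (id, name, experiment_id) VALUES (?, ?, 1)",
    [(1, "assay A"), (2, "assay B")],
)
conn.execute("INSERT INTO design_element (id, kind, row_no) VALUES (1, 'feature', 7)")
conn.executemany(
    "INSERT INTO bioassay_design_element VALUES (?, 1)", [(1,), (2,)]
)

# Follow the N:M link table to find every assay that uses design element 1.
rows = conn.execute("""
    SELECT b.name
    FROM bioassay b
    JOIN bioassay_design_element l ON l.bioassay_id = b.id
    WHERE l.design_element_id = 1
    ORDER BY b.name
""").fetchall()
print(rows)
```

The second option on the slide (separate tables for superclass and subclasses) would instead split `design_element` into a shared parent table plus one table per subclass, each keyed by the parent's `id`.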

Tools

• Rational Rose

– bad graphical capabilities

– forward/reverse engineering

– API (VB-based)

• open source

– ArgoUML

UML Packages

[Figure: package diagram showing BSANE, BQS, Description, Protocol, Measurement, Audit, Treatment, Transformation, BioEvent, Experiment, ArrayDesign, BioMaterial, BioAssayData, BioAssay, DesignElement, HigherLevelAnalysis, BioSequence, ArrayManufacture, QuantitationType]

Top level structure

BioAssay

BioMaterial

ArrayDesign

DesignElement

DesignElement

DesignElement mapping

Data

BioSequence

ArrayManufacture

Quantitations

HigherLevelAnalysis

BioEvent

Protocol

Description

AuditAndSecurity

Measurement

ArrayExpress: current status

• Object model (MAGE-OM) - stable

• Database schema - generated (standard SQL, we run under Oracle)

• Data loader from MAGE-ML - generated

• Web interface (queries, browsing) - under development
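
For readers unfamiliar with what a data loader does, here is a minimal hand-written sketch that reads an XML document and inserts its contents into relational tables. The XML below is a hypothetical, heavily simplified stand-in and should not be read as valid MAGE-ML; the table layout and the function `load` are likewise assumptions. The actual ArrayExpress loader is described above as generated, not written by hand like this.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical, simplified input; real MAGE-ML documents are much richer.
DOCUMENT = """
<Experiment identifier="E-1" name="demo">
  <BioAssay identifier="BA-1"/>
  <BioAssay identifier="BA-2"/>
</Experiment>
"""

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE experiment (identifier TEXT PRIMARY KEY, name TEXT);
CREATE TABLE bioassay (
    identifier            TEXT PRIMARY KEY,
    experiment_identifier TEXT REFERENCES experiment(identifier)
);
""")


def load(xml_text, connection):
    """Parse the XML and insert one row per Experiment and BioAssay element."""
    root = ET.fromstring(xml_text)
    connection.execute(
        "INSERT INTO experiment VALUES (?, ?)",
        (root.get("identifier"), root.get("name")),
    )
    for assay in root.findall("BioAssay"):
        # Each nested BioAssay is linked back to its parent by foreign key.
        connection.execute(
            "INSERT INTO bioassay VALUES (?, ?)",
            (assay.get("identifier"), root.get("identifier")),
        )
    connection.commit()


load(DOCUMENT, conn)
count = conn.execute("SELECT COUNT(*) FROM bioassay").fetchone()[0]
print(count)
```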

Near future developments

• Dedicated hardware for ArrayExpress

• Good quality data coming from collaborators (annotation tools needed)

• Data uploading and Web interface made public

Future developments

• Integration with existing tools (Expression Profiler)

• New analytical tools

• Links with other databases

• Data curation, liaison with data providers

ArrayExpress architecture

[Figure: architecture diagram; labels: central database (experiment-centred), data warehouse, application server (Java servlets), Web server, image server, ArrayExpress, curation, MAGE-ML, API, curation tool database]

MAGE schedule

• OMG meeting, Dublin, November 12-16 - specification hopefully adopted

• Mechanism for incorporating changes and user feedback

• MAGE programming jamboree, EBI, December 6-11: API development, parser generation, annotation tools (MAGE-STK)

Resources

• Web site

– links to documents

• presentations

• UML models

– also HTML version and PNG image files of diagrams

– http://www.geml.org/omg.htm

• Mailing list

– [email protected]

– to subscribe, send the following to [email protected]

subscribe lsr-ge <yourEmailAddress>

Acknowledgements

• Doug Bassett (Rosetta)

• Alvis Brazma (EBI)

• Steve Chervitz (Affymetrix)

• Francisco De La Vega (Applied Biosystems)

• Michael Dickson (NetGenics)

• David Frankel (IONA)

• Scott Markel (NetGenics)

• Michael Miller (Rosetta)

• Dave Nellesen (Incyte)

• Alan Robinson (EBI)

• Martin Senger (EBI)

• Paul Spellman (Lawrence Berkeley Lab)

• Jason Stewart (NCGR)

• Charles Troup (Agilent)