DAS Advance Search and its prototype implementation in MyDas

Gustavo Adolfo Salazar Orejuela

Supervised by: Nicola Mulder, Henning Hermjakob

DAS workshop - 2011


DAS: Distributed Annotation System

Problem Definition

??

Proposed Solution: Advance query

query: a new argument should be added to the features command, so the request for this command is now defined as:

SERVER/das/DSN/features?
    [;segment=RANGE]
    [;type=TYPE] [;type=TYPE]
    [;category=CATEGORY]
    [;feature_id=ID]
    [;maxbins=BINS]
    [;query=DASQUERY]
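
A client could assemble such a request roughly as sketched below. This is only an illustration: the helper build_features_url and the example.org server are hypothetical, and the sketch assumes that arguments are joined with semicolons as in the template above and that the query value is percent-encoded except for ':' and '*'.

```python
from urllib.parse import quote


def build_features_url(server, dsn, segments=(), types=(), query=None):
    """Build a features request URL (hypothetical helper, not part of MyDas)."""
    args = [f"segment={quote(s, safe=':,')}" for s in segments]
    args += [f"type={quote(t)}" for t in types]
    if query is not None:
        # Keep ':' and '*' readable; spaces become %20.
        args.append(f"query={quote(query, safe=':*')}")
    return f"{server}/das/{dsn}/features?" + ";".join(args)


url = build_features_url(
    "http://example.org",          # assumed server, for illustration only
    "uniprot2probes",
    query="typeLabel:affy_mouse430a_2 AND featureId:1416176*",
)
print(url)
```

Existing arguments such as segment and type can still be passed; the query argument is simply appended when present.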

Proposed Solution: DAS Query Language

Based on Lucene, a query is broken into terms and operators:

Terms
• E.g. "alpha helix"

Fields
• E.g. type

Condition
• E.g. type:"alpha helix"

Term modifiers
• E.g. type:alpha*

Operators
• E.g. typeCvId:CV:00001 AND featureLabel:"one Feature"

Grouping
• E.g. (typeCvId:CV:00001 AND featureLabel:"one Feature") OR typeId:twoFeatureTypeIdOne
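
To make the pieces of the language concrete, the sketch below splits a query string into grouping symbols, operators, conditions and bare terms. It is a simplified illustration written for this transcript, not Lucene's parser: the function name tokenize is hypothetical, only AND and OR are recognised as operators, and escaping is ignored.

```python
import re

# One alternative per kind of token; whitespace between tokens is skipped.
TOKEN = re.compile(r'''
    (?P<group>[()])
  | (?P<operator>\b(?:AND|OR)\b)
  | (?P<field>\w+):(?P<value>"[^"]*"|[^\s()]+)
  | (?P<term>"[^"]*"|[^\s()]+)
''', re.VERBOSE)


def tokenize(query):
    """Return a flat list of tuples describing the tokens of a query."""
    tokens = []
    for m in TOKEN.finditer(query):
        if m.group("group"):
            tokens.append(("group", m.group("group")))
        elif m.group("operator"):
            tokens.append(("operator", m.group("operator")))
        elif m.group("field"):
            # A condition is a field name, a colon, and a term or phrase.
            tokens.append(("condition", m.group("field"),
                           m.group("value").strip('"')))
        else:
            tokens.append(("term", m.group("term").strip('"')))
    return tokens


example = ('(typeCvId:CV:00001 AND featureLabel:"one Feature") '
           'OR typeId:twoFeatureTypeIdOne')
for token in tokenize(example):
    print(token)
```

Only the first colon separates the field from its value, so typeCvId:CV:00001 is read as the field typeCvId with the value CV:00001.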

Proposed Solution: DAS Query Language

Defined Fields: featureId, featureLabel, segmentId, segmentLabel, segmentStart, segmentStop, typeId, typeCvId, typeLabel, typeCategory, type, methodId, methodCvId, methodLabel, method, start, stop, score, orientation, phase, note, link, target, parent, part, all

Reporting capability: <capability type="das1:advanced-search" />
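
Since the set of fields is fixed, a client or server could check a query for undefined fields before running it. The following is a minimal sketch of such a check; the helper names fields_in and unknown_fields are hypothetical, and the field extraction is deliberately naive (it ignores escaping and looks only at the text before the first colon of each token).

```python
import re

# Field names as listed under "Defined Fields".
DEFINED_FIELDS = {
    "featureId", "featureLabel", "segmentId", "segmentLabel",
    "segmentStart", "segmentStop", "typeId", "typeCvId", "typeLabel",
    "typeCategory", "type", "methodId", "methodCvId", "methodLabel",
    "method", "start", "stop", "score", "orientation", "phase", "note",
    "link", "target", "parent", "part", "all",
}


def fields_in(query):
    """List the field names used in a query (naive extraction)."""
    # Empty out quoted phrases so colons inside them are not misread.
    unquoted = re.sub(r'"[^"]*"', '""', query)
    tokens = re.split(r'[\s()]+', unquoted)
    return [t.split(":", 1)[0] for t in tokens if ":" in t]


def unknown_fields(query):
    """Return the fields of a query that are not defined, sorted."""
    return sorted(set(fields_in(query)) - DEFINED_FIELDS)


print(unknown_fields('typeCvId:CV:00001 AND featureLabel:"one Feature"'))
print(unknown_fields('colour:red OR type:alpha*'))
```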

Proposed Solution: Response

The document returned by the features request does not need to be extended, because it already supports more than one segment.

From the RELAX NG schema:

<oneOrMore>
  <element name="SEGMENT">…</element>
</oneOrMore>
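
A client reading the result of an advanced search therefore only has to iterate over every SEGMENT element in the response. The sketch below shows one way to do that; the sample document is invented for illustration (its identifiers are not real data), and the element names DASGFF, GFF, SEGMENT and FEATURE are assumed to follow the usual DAS features response.

```python
import xml.etree.ElementTree as ET

# Invented sample response containing two segments.
SAMPLE = """<DASGFF>
  <GFF href="http://example.org/das/demo/features">
    <SEGMENT id="SEG_A" start="1" stop="100">
      <FEATURE id="feat1" label="feat1"/>
    </SEGMENT>
    <SEGMENT id="SEG_B" start="1" stop="200">
      <FEATURE id="feat2" label="feat2"/>
      <FEATURE id="feat3" label="feat3"/>
    </SEGMENT>
  </GFF>
</DASGFF>"""


def features_by_segment(xml_text):
    """Map each segment id to the ids of the features it contains."""
    root = ET.fromstring(xml_text)
    return {
        segment.get("id"): [f.get("id") for f in segment.findall("FEATURE")]
        for segment in root.iter("SEGMENT")
    }


print(features_by_segment(SAMPLE))
```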

Proposed Solution: Prototype Implementation

MyDas has been extended to support this capability, but it is still a beta version, downloadable only through SVN: SNAPSHOT 1.6.4

Lucene is used to create an index of the data source, which provides the advanced search capability.

The entry_point capability is required in order to go through all the features of each entry point.

Lucene is also used to construct the data source. This may cause synchronization errors with the data, but it avoids requiring the feature_id capability.
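
The indexing idea can be pictured as follows: walk over every entry point, fetch its features, and record each field value so that it can later be looked up directly. The sketch is not the MyDas code; a plain Python dictionary stands in for Lucene, and the class ToyDataSource, the functions build_index and search, and the demo data are all hypothetical.

```python
from collections import defaultdict


class ToyDataSource:
    """Hypothetical stand-in for a data source with entry points."""

    def __init__(self, data):
        self._data = data

    def entry_points(self):
        return list(self._data)

    def features(self, segment_id):
        return self._data[segment_id]


def build_index(source):
    """Index every feature of every entry point by (field, value)."""
    index = defaultdict(set)
    for segment_id in source.entry_points():
        for feature in source.features(segment_id):
            doc = (segment_id, feature["featureId"])
            index[("segmentId", segment_id)].add(doc)
            for field, value in feature.items():
                index[(field, value)].add(doc)
    return index


def search(index, field, value):
    """Return the (segment, feature) pairs matching one exact condition."""
    return sorted(index.get((field, value), set()))


# Invented demo data, for illustration only.
demo = ToyDataSource({
    "SEG_A": [{"featureId": "f1", "typeLabel": "t1"}],
    "SEG_B": [{"featureId": "f2", "typeLabel": "t1"},
              {"featureId": "f3", "typeLabel": "t2"}],
})
demo_index = build_index(demo)
print(search(demo_index, "typeLabel", "t1"))
```

Because answers come from the index and not from the underlying data, the index has to be rebuilt when the data changes, which is the synchronization risk mentioned above.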


Proposed Solution: Data Source - Uniprot 2 Probes

It maps microarray probes to UniProtKB accession numbers.

http://tomcat-18:21510/das-srv/das/uniprot2probes/features?segment=Q58EV5
http://tomcat-18:21510/das-srv/das/uniprot2probes/features?query=segmentId:Q58EV5
http://tomcat-18:21510/das-srv/das/uniprot2probes/features?feature_id=234173_s_at.affy_hg_u133b
http://tomcat-18:21510/das-srv/das/uniprot2probes/features?query=featureId:234173_s_at.affy_hg_u133b
http://tomcat-18:21510/das-srv/das/uniprot2probes/features?query=typeLabel:affy_mouse430a_2
http://tomcat-18:21510/das-srv/das/uniprot2probes/features?query=typeLabel:affy_mouse430a_2%20AND%20featureId:1416176*
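
The first four requests pair a classic argument with a query on the matching field. A small sketch of that correspondence is given below; the helper as_query_argument is hypothetical, it covers only the two pairs shown in the examples, and it assumes plain identifiers (no segment ranges). Whether both forms return identical documents is not stated here and is not assumed by the code.

```python
from urllib.parse import quote

# Pairs taken from the example requests; other arguments are not covered.
ARGUMENT_TO_FIELD = {"segment": "segmentId", "feature_id": "featureId"}


def as_query_argument(name, value):
    """Rewrite a classic argument as a query argument on the matching field."""
    field = ARGUMENT_TO_FIELD[name]
    return "query=" + quote(f"{field}:{value}", safe=":*")


BASE = "http://tomcat-18:21510/das-srv/das/uniprot2probes/features?"
print(BASE + as_query_argument("segment", "Q58EV5"))
print(BASE + as_query_argument("feature_id", "234173_s_at.affy_hg_u133b"))
```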

Future work

Pagination of the features command

Clients using this capability.

• JsDas + Advance Search + Uniprot2probes

Proserver implementation?

Acknowledgments

Supervisors: Doctor Nicola Mulder, Henning Hermjakob

University of Cape Town: CBIO laboratory

EBI: Rafael Jimenez, Andy Jenkinson

DAS Community: Jonathan Warren, [email protected], http://lists.open-bio.org/mailman/listinfo/das


Questions??