challenges & issues of an rsp-ql...

22
Challenges & Issues Of an RSP-QL Implementation Riccardo Tommasini and Emanuele Della Valle Politecnico di Milano, DEIB, Milan, Italy

Upload: others

Post on 12-Jul-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

Challenges & Issues Of an RSP-QL

ImplementationRiccardo Tommasini and Emanuele Della Valle

Politecnico di Milano, DEIB, Milan, Italy

Page 2: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

Who I AM

• Ph.D. Student at Politecnico di Milano

• Advisor: Emanuele Della Valle

• Work on:

• RSP Benchmarking

• Expressive Stream Reasoning

2

Page 3: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

The Story

3

RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing Engines.

Prototypes have a central role to support and foster Semantic Web and Stream Reasoning research.

The issues discovered realising system*enable more foundational science.

Why Implementing RSP-QL

Page 4: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

The Story

4

TheoryChallengesCode

Looking behind the implementation curtains

Page 5: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

The Theory

• RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing Engines.

• It extends SPARQL 1.1 to include RDF Streams and the necessary operators to process them.

• It extends SPARQL 1.1 evaluation to include time.

5

Page 6: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

The Theory

6

RSP-QL in a Nutshell

Page 7: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

The Code

7

Yasper 1.0

SDS

Instantaneous Graph

Time-Varying Graph

Windowing

Window Operators

Stream

Stream Item

Query Response

R2S

ContinuousExecution

Formatter

Quering

Reasoning

Page 8: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

The CodeYasper 1.0

• Generic Data Model, e.g., Triples and Named Graphs

• Configurable Time Model:

• Externally Controlled Time

• Ingestion Time vs Event Time

• Multi Stream Queries and SDS Consolidation & Maintenance

8

Page 9: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

The Challenges

• (Q1) What is the best way to model the stream content?

• (Q2) How does the time model impact the processing?

• (Q3) What does define the current SDS at time t?

• (Q4) Is there an efficient way to maintain the SDS?

9

Page 10: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing
Page 11: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing
Page 12: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing
Page 13: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

Stream Content

13

Generic Data Model

• Enables optimisation related to the query language

• Requires query rewriting to reduce overhead

RDF Triples

• Adopted my the majority of the existing RSP Engines

• Lower latency on simple queries

Named Graphs

• RSP-QL model of choice

• Better throughput on complex queries

Page 14: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

Stream Ordering

• In a web environment, which is distributed and federated, out-of-order arrivals matters

• RSP-QL partially relaxed the assumption of totally ordered RDF stream

• Event-time vs Ingestion-time might influence the correctness of the RSP Engine

14

Page 15: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

SDS Consolidation & Maintenance

• Assumption: Instantaneous Graph can be generated by a Time-Varying Graph for any instant t for which a window operator is defined.

• Consequence 1: the SDS content is always up-to-date

• Consequence 2: the RSP engine to maintain sub-windows accessible.

15

Page 16: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

SDS Consolidation & Maintenance

16

W1 W2SDS(30) x1 y1,y2SDS(35) x2,x3 y3,y4SDS(36) x4 y4

Page 17: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

Closed Active SDS

It considers only the instantaneous RDF graphs that were derived from closed active windows.

17

W1 W2SDS(30) x1SDS(35) x2,x3 y3,y4SDS(36)

Page 18: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

Cached Active SDS

It considers the instantaneous RDF graphs from the latest closed active window, which was cached.

18

W1 W2SDS(30) x1 y1SDS(35) x2,x3 y3,y4SDS(36) x2,x3 y3,y4

Page 19: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

The SDS Maintenance

• Deciding the semantic of the active SDS is not the only problem.

• How do we maintain the SDS consistent with frequent updates might influence the performance.

• Can I do it efficiently (incrementally)?

19

Page 20: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

Opportunity: Descriptive Study

• Use Architectural Variants to define features for a descriptive study

• Generate a set of statical hypothesis out of the scientific Hypotheses we formulated for each challenge (In the paper)

20

Page 21: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

Opportunity: Syntax

• RSP-QL is very expressive

• Maybe too expressive to be captured by a single declarative QL.

• Proposal: a syntax which is more usable yet less expressive.

21

Page 22: Challenges & Issues Of an RSP-QL Implementationstreamreasoning.org/slides/2017/10/wsp2017-03.pdf · RSP-QL is a reference model to explain the semantics of exiting RDF Stream Processing

Question?

• Can you fill this slide for me?

22