ideas 2013 presentation

32
Copyright 2010 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute www.deri.ie On-The-Fly Generation of Multidimensional Data Cubes for Web of Things Muntazir Mehdi (DERI, TU Kaiserslautern) [email protected] http://www.StefanDecker.org/

Upload: muntazir-mehdi

Post on 28-Jan-2015

105 views

Category:

Education


0 download

DESCRIPTION

7th International Database Engineering & Applications Symposium, Universitat Politècnica de Catalunya, Barcelona

TRANSCRIPT

Page 1: IDEAS 2013 Presentation

Copyright 2010 Digital Enterprise Research Institute. All rights reserved.

Digital Enterprise Research Institute www.deri.ie

On-The-Fly Generation of

Multidimensional Data Cubes for

Web of Things

Muntazir Mehdi

(DERI, TU Kaiserslautern)

[email protected]

http://www.StefanDecker.org/

Page 2: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Agenda

Motivation and Background

• Problem statement, Use case, Linked Data, WoT

Processing Metadata for Cube Creation

• Capturing and Publishing Sensor data, Event Registration

Cube Generation

• EDWH Agent, An example Scenario

Other Potential Use Cases

Results and Evaluation

Conclusion

Page 3: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Motivation & Background

Enterprises producing huge amounts of data

making data management, exchange and decision

making complex.

Use Case (Smart Buildings)

1. Rely on Sensor data for decision making

2. Heterogeneous and Big Data Management

3. Event Processing can be applied to sustain decision making

4. Limited support for decision making with event processing

techniques

5. Controlling supply / demand based on statistical data

6. Identify meaningful event and deal with them asap

Page 4: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Motivation & Background (continued)

Heterogeneous Data Management

1. Different Data generated from different applications within one or

more smart environments.

2. For example: A smart city relying on combined data from different

smart buildings.

3. Linked data: A set of best practices to represent data into RDF and

link, relate or connect to other RDF data.

4. Linked Open Data (LOD) Cloud: A huge openly available cloud of

linked data from different domains.

Page 5: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Motivation & Background (continued)

Big Data Management

1. A fast response to complex queries to support event processing.

2. Huge amounts of sensor data as RDF.

3. Generation of real-time multidimensional and contextual data cubes

to sustain fast responses to complex queries.

4. An event data-warehouse.

5. Multidimensional shape of data in data-warehouse = A data cube =

Structuring information into dimensions and facts or measures.

Page 6: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Why Data-warehouse for events?

1. Data characteristics:

• Logged once, never updated

• Flat data, no need to normalize

• Incoming data: temporal (based on time)

2. Objective characteristics:

• Reporting, Analysis, Prediction, Mining, Pattern Identification……

• To use a data model to speed up querying unlike transactional processing system

• To provide with a historical repository containing features as per interest

• Support Complex Event Processing

Motivation & Background (continued)

Page 7: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Motivation & Background (continued)

Web of Things

1. Extending the Web to easily blend real-world objects like electronic

appliances, sensors and embedded devices etc.

2. Even though we are limited to sensor data in our use case, the

approach can be easily extended.

3. CoAP (Constrained Application Protocol): A Web transfer protocol for

request/response model.

Page 8: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Related Work

Antoniades, Athos, et al. "Linked2Safety: A secure linked

data medical information space for semantically-

interconnecting EHRs advancing patients' safety in medical

research." Bioinformatics & Bioengineering (BIBE), 2012 IEEE

12th International Conference on. IEEE, 2012.

Lefort, Laurent, et al. "A Linked Sensor Data Cube for a 100

Year Homogenised Daily Temperature Dataset." SSN. 2012.

ENERGIE VISIBLE

(http://www.webofthings.org/energievisible/)

Page 9: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Processing Metadata for Cube Generation

Involves two major steps:

1. Capturing and Publishing Sensor Data

2. Event Registration

Page 10: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Capturing and Publishing Sensor Data: An

example Scenario

JMS

SERVER

& publish on JMS Server

RDF

Oh wait,

I see a way of converting them into RDF,

add relevant metadata,

SSN

Event Stream

Event Stream

Event Stream

Page 11: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Capturing and Publishing Sensor Data:

Process

Filter

UDP Listeners

&

CoAP Clients

RDFizer

JMS Publisher Enricher

JMS Server Metadata

Knowledge Base

S1

S2

S3

Sn

Page 12: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Event Registration: EDWH Ontology

NamedCubeGraph

Configuration

Dimension

Measure

Source Event

JMSSource

Page 13: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Event Registration Process

Specify Event

Type

Specify Event

Source

Select

Measures

Select

Dimensions

Specify Graph

Details

EDWH Ontology Instance

Page 14: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Dimension Selection: Example

Page 15: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Measure Selection: Example

Page 16: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Cube Generation

1. Requires an event to be registered into the system.

2. Current implementation generates cubes based on

time dimension only. However, it can be easily

extended to attain other dimensions.

3. Critical component: EDWH Agent

Page 17: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Cube Generation: EDWH Agent Architecture

Page 18: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Cubes Generation: An example Scenario

JMS

SERVER

RDF

RDF

CUBES AS

RDF MEETS Mr. CUBES

EDWH Ontology

CUBE

Store

Page 19: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Cubes Generation: Our Use Case

Page 20: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Use Cases

CUBE

Store

Page 21: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Use Case: 1

The electricity usage at location X for duration Y for consumer Z

has been moderate as compared to previous duration W.

CUBE

Store

Page 22: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Use Case: 2

Historical Data suggests that the weather is going to be windy and

Rainy in Galway even after the Easter.

CUBE

Store

Page 23: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Use Case: 3

CUBE

Store

Some suspicious activity has been detected on your credit card!

Page 24: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Use Case: 4

Linked

CUBE

Stores

Each of these things

can be achieved from

one place

Page 25: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Evaluation

We evaluated our system in terms of

1. Total number of cubes generated

2. Size of each cube

3. Accuracy of generated cubes

4. Impact of adding and removing dimensions on size of cube

5. Performance of the system to generate cubes

6. Query Execution Time (QET)

Page 26: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Evaluation

0

2000

4000

6000

8000

10000

12000

14000

16000

Tim

e (m

illis

eco

nd

s)

Quarter Cube

Day Cube

Hour Cube

Page 27: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Evaluation: Size

Page 28: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Evaluation: Impact of dimensions

1 Dim

1 Dim

1 Dim

2 Dim

2 Dim

2 Dim

3 Dim

3 Dim

3 Dim

0

50

100

150

200

Quarter Hour Day

Sto

rgae

Siz

e p

er

Cu

be

(K

B)

1 Dim 2 Dim 3 Dim

Page 29: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Evaluation: Query Set for QET

Page 30: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Evaluation: QET Comparison

Page 31: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Conclusion

With the approach presented, we were able to enrich

events with necessary metadata, and process

enriched events to generate on-the-fly data cubes.

After looking at performance chart shown in previous

slides, it is safe to conclude that our approach

provides a good way of generating data cubes on-the-

fly in a real-time sensor network.

Page 32: IDEAS 2013 Presentation

Digital Enterprise Research Institute www.deri.ie

Questions