interactive visualization of streaming data powered by spark

15
INTERACTIVE VISUALIZATION OF STREAMING DATA POWERED BY SPARK

Upload: spark-summit

Post on 08-Jan-2017

442 views

Category:

Data & Analytics


0 download

TRANSCRIPT

Page 1: Interactive Visualization of Streaming Data Powered by Spark

INTERACTIVE VISUALIZATION OF STREAMING DATA POWERED BY SPARK

Page 2: Interactive Visualization of Streaming Data Powered by Spark

Streaming @Zoomdata

Visualizations react to new data delivered

Users start, stop, pause the stream

Users select a rolling window or pin a start time to capture cumulative metrics

Page 3: Interactive Visualization of Streaming Data Powered by Spark

Drivers for Streaming Data

Data Freshness Time to Analytic Business Context

Page 4: Interactive Visualization of Streaming Data Powered by Spark

Challenges• Time

• Frequency

• Retention

• Synchronization

• Order

• Updates

Page 5: Interactive Visualization of Streaming Data Powered by Spark

Addressing streaming @Zoomdata

Historical Revised

Receive Data JMS Kafka

Manipulate Stream Single JVM in Memory Spark Streaming

Hold Data in Buffer MongoDB Pluggable

Interact with Data Custom Code Pluggable

Page 6: Interactive Visualization of Streaming Data Powered by Spark

Technology Cast• The Stream - Kafka, Kinesis, JMS• Processing Fabric - Spark Streaming• Landing Area - MemSQL, Solr, Kudu, Others

Page 7: Interactive Visualization of Streaming Data Powered by Spark

How it looks

Page 8: Interactive Visualization of Streaming Data Powered by Spark

With the rest of the app

Page 9: Interactive Visualization of Streaming Data Powered by Spark

Scale Out

Page 10: Interactive Visualization of Streaming Data Powered by Spark

Let’s put it all together• Process fruit orders from all

over• Buyers make orders via an

API• API is SMS

– [fruit] [quantity] e.g., apple 10

Page 11: Interactive Visualization of Streaming Data Powered by Spark

Demo

Page 12: Interactive Visualization of Streaming Data Powered by Spark

Benefits

• Contextual Expressiveness with Streaming Data

• Independent scalability (scale-up, scale-around)

• Expressiveness powered by Spark -- using Windowing (dataframe API with stream)

• DR COOP, other Data management concerns

Page 13: Interactive Visualization of Streaming Data Powered by Spark

Future Work• Cross stream synchronization & fusion

• On-demand scale out and resource management via Mesos

• Schema evolution

• More extensible landing strategies

Page 14: Interactive Visualization of Streaming Data Powered by Spark

Questions

For more information contact:

[email protected]

Come visit Zoomdata at our booth H2!

Page 15: Interactive Visualization of Streaming Data Powered by Spark

Thank You