january 2016 flink community update & roadmap 2016

Community Update &

Roadmap 2016

Robert Metzger

@rmetzger_

rmetzger@apache.org

Berlin Apache Flink Meetup,January 26, 2016

January Community Update

What happened in the last month

What happened?

Google proposed Dataflow API to Apache

Incubator

Proposal discussions at the mailing list:

• SQL / Stream SQL support

• CEP (Complex Event Processing) library

Flink Kinesis Connector

Chengxiang Li added as committer

Discussions for releasing 1.0.0

Now merged to master (1.0-SNAPSOT)

Savepoints: Manual checkpoints for restarting jobs with state

Kafka 0.9.0.0 integration

Job submission through JobManager web interface

Checkpoint statistics in JobManager web interface

Streaming examples are now in the binary dist

Reading List

Benchmarking Streaming Computation

Engines at Yahoo!

Receiving metrics from Apache Flink

applications

Running Apache Flink on Amazon Elastic

Mapreduce

1. http://yahooeng.tumblr.com/post/135321837876/benchmarking-streaming-computation-engines-at

2. http://mnxfst.tumblr.com/post/136539620407/receiving-metrics-from-apache-flink-applications

3. http://themodernlife.github.io/scala/hadoop/hdfs/sclading/flink/streaming/realtime/emr/aws/2016/01/06/running-apache-flink-on-amazon-elastic-mapreduce/

Upcoming talks

FOSDEM Brussels (4 talks) (Jan 30-31)

Big Data Technology Summit Warsaw

(Feb. 25-26)

Qcon London (March 7-9)

Hadoop Summit Dublin (2 talks) (April 13-

Strata San Jose

Strata London

Global Meetup Community

Brazil-Sao Paulo Apache Flink Meetup

Apache Flink Taiwan User Group

Also new groups in Delhi, Phoenix and

Dallas

Github stats

900 Stars

Roadmap 2016

Whats next?

Overview

SQL / StreamSQL

CEP Library

Managed Operator State

Dynamic Scaling

Miscellaneous

SQL and StreamSQL

SQL / StreamSQL

Structured queries over data sets and

streams

Add support for SQL

• Standard SQL queries over (batch) data sets

• Continuous StreamSQL queries over data

streams

Keep and extend Table API as structured

query API on data sets and streams

Proposed Architecture

Table API(Batch) SQL

Query StreamSQL

te Standard SQL parser

CustomizedStreamSQL

parser

Optimizer

Logical Plan

DataSetProgram

DataStreamProgram

Internals

SQL integration into APIs

val stream : DataStream[(String, Double, Int)] = env.addSource(new FlinkKafkaConsumer(...))

val tabEnv = new TableEnvironment(env)tabEnv.registerStream(stream, “myStream”, (“ID”, “MEASURE”, “COUNT”))

val sqlQuery = tabEnv.sql(“SELECT ID, MEASURE FROM myStream WHERE

COUNT > 17”)

Define Kafka input stream

Define table environment

SQL Query

Complex Event Processing

CEP Library

Complex Event Processing: the analysis of

complex patterns such as correlations and

sequence detection from multiple sources

Most current systems are not distributed

(beyond multi-threading)

Goal: provide an easy to use API for CEP,

running on a distributed high-throughput, low

latency engine.

CEP Example

Realtime stock prices

15.1 15.3 15.2 15.5State

MachineAlerts

StartPrice drop by at least $.5

Ignore

Programming API for CEP

CEPStream<Event> cepStream = CEP.from(inputDataStream)

// groupingGroupedCEPStream<Event> grouped = cepStream.groupBy(“id”)

// windowsWindowedCEPStream windowed = grouped.timeWindow(Time.minutes(10), Time.minutes(1))WindowedCEPStream windowed = grouped.countWindow(10L, 1L)

// pattern matchingCEPStream<Result> resultStream = CEP.from(input).groupBy(0).pattern(

Pattern.<Event>next("e1").where( (evt) -> evt.id == 42 ).followedBy("e2").where( (evt) -> evt.id == 1337 ).within(Time.minutes(10))

).select( (Map<String, Event> patternElements) -> new Result(patternElements.get("e2").timestamp -

patternElements.get("e1").timestamp) )18

convert stream into CEPStream of Events

Window events

Define a pattern to match

DSL for CEP

select e1.id, e1.price from every e1 = Event(price > 10) → e2 = Event(date == 42) → e3 = Event(price == 10) within 10 seconds where e1.id == e2.id

No programming required

Potentially integrated with SQL

Managed Operator State

State in Flink

Operator

“count tweet impressions”

User Function

impression counts

Retrieve/set count for tweet it

State in Flink

Operator

User Function

impression counts

What happens if the job crashes?

Loss of data

Solution: Checkpoints

Operator

User Function

impression counts

Periodic checkpoints of state to HDFS

Restore from HDFS in case of failure

Solution: Checkpoints

Operator

User Function

impression counts

Periodic checkpoints of state to HDFS

This is the current state in Flink!

State on Steroids

Operator

User Function

impression counts

State on Steroids

Operator

User Function

impression counts

Spill to diskasync/incremental snapshots

What if stategrows too big?

State on Steroids

Operator

User Function

impression counts

Spill to disk

State on Steroids

Operator

User Function

impression counts

What if stategrows too big?

Checkpointing stalls processing!

State on Steroids

Operator

User Function

impression counts

Dealing with Dynamic

Resources

Streams with varying data rate

events

/second

With static resources: Provision for max. rate

Idle capacity

(1) Adjust Parallelism

Initialconfiguration

Scale Out(for load)

Scale In(save resources)

(1) Adjust Parallelism

Adjusting parallelism without (significantly) interrupting the program

Initial version:

• Checkpoint -> stop -> restart-with-different-parallelism

Stateless operators: Trivial

Stateful operators: Repartition state

• Transparent for key/value state and windows

• Consistent hashing simplifies state reorganization

(2) Dynamic Worker Pool

JobManager

ResourceManager

Pool of Cluster ResourcesYARN/Mesos/…

TaskManager

Miscellaneous

Support for Apache Mesos

Security• Over-the-wire encryption of RPC (akka) and data

transfers (netty)

More connectors• Apache Cassandra

• Amazon Kinesis

Enhance metrics• Throughput / Latencies

• Backpressure monitoring

• Spilling / Out of Core

january 2016 flink community update & roadmap 2016

Technology

unified stream & batch processing with apache flink (hadoop...

continuous processing with apache flink - strata london 2016

flink forward sf 2017: eron wright - introducing flink...

apache flink - overview

apache flink berlin meetup may 2016

flink and apache spark fernanda de camargo magano dylan...

apache flink

apache flink: the latest and greatestapache flink: the...

flink forward 2016

flink forward san francisco 2017 - flink meet dc/os

strategic roadmap january 2016...crcsi information...

codemotion 2016 - big data para javeros con apache flink

flink history, roadmap and vision

gradoop: scalable graph analytics with apache flink @ fosdem...

apache flink internals

apache flink: the latest and greatest...apache flink: the...

flink forward sf 2017: ted dunning - non-flink machine...

apache flink big data stream processing · pdf fileapache...

2016 roadmap of state highway safety laws -...

streaming data flow with apache flink @ paris flink meetup...