20150617 spark meetup zagreb

25
© 2015 IBM Corporation Meet-up group in Zagreb June 17, 2015

Upload: andrey-vykhodtsev

Post on 11-Aug-2015

139 views

Category:

Data & Analytics


2 download

TRANSCRIPT

© 2015 IBM Corporation

Meet-up group in Zagreb June 17, 2015

IBM Spark © 2015 IBM Corporation

IBM Spark © 2015 IBM Corporation

You are part of a global Spark community #SparkInsight

Denver

Washington DC

New York

Columbus

St. Louis

Chicago

Dublin

London

Brussels

Moscow

Bonn

Paris

Milan

Melbourne

Singapore

Bangalore

Sydney

Kuala Lumpur

Welcome from Around the Globe

Helsinki

Stockholm

Oslo

Madrid

Tel Aviv

Warsaw

Seattle

Minneapolis

Atlanta

Hartford

Dallas

Houston

Toronto

IBM Spark © 2015 IBM Corporation

IBM Announces Major Commitment

to Advance Apache® Spark™

…the Most Significant Open Source Project of the Next Decade…

IBM Spark © 2015 IBM Corporation

Open Source SystemML

Educate One Million Data Professionals

Establish Spark Technology Center

Founding Member of AMPLab

Contributing to the Core

Announcing

Our commitment to Spark

IBM Spark © 2015 IBM Corporation

•IBM will build Spark into the core of the company's analytics and

commerce platforms.

•IBM's Watson Health Cloud will leverage Spark

•IBM will offer Spark as a Cloud service on IBM Bluemix

•IBM will commit more than 3,500 researchers and developers to work

on Spark-related

IBM commitment to spark

IBM Spark © 2015 IBM Corporation

SystemML unifies the fractured machine learning environments

Gives the core Spark ecosystem a complete set of DML

Allows a data scientist to focus on the algorithm, not the implementation

Improves time to value for data science teams

Establish a de facto standard for reusable machine learning routines

We are Contributing SystemML

Our largest contribution to open source since Linux

IBM Spark © 2015 IBM Corporation

Big Data University MOOC

Spark Fundamentals I and II

Advanced Spark Development series

Foundational Methodology for Data Science

Partnerships with Databricks, AMPLab, DataCamp and MetiStream

Educate 1 Million Data Scientists and

Data Engineers

Our investment to grow skills

IBM Spark © 2015 IBM Corporation

Inspire the use of Spark to solve business problems

Encourage adoption through open and free educational assets

Demonstrate real world solutions to identify opportunities

Use the learning to improve Spark and its application

Spark Technology Center

Our goal is to be the #1 Spark contributor and adopter

IBM Spark © 2015 IBM Corporation

Our Partner Ecosystem

IBM Spark © 2015 IBM Corporation

Clients Have Started Innovating with IBM and Spark

IBM Spark © 2015 IBM Corporation

•Optibus

•Findability Services

•Independence Blue Cross

•IBM, NASA and SETI

Signature Spark projects

IBM Spark © 2015 IBM Corporation

the Analytics operating system

IBM Spark © 2015 IBM Corporation

An Apache Foundation open source project. Not a product.

An in-memory compute engine that works with data. Not a data store.

Enables highly iterative analysis on large volumes of data at scale

Unified environment for data scientists, developers and data engineers

Radically simplifies process of developing intelligent apps fueled by data.

What is Spark

IBM Spark © 2015 IBM Corporation

Spark is open so accelerates community innovation

Spark is fast 100x faster than Hadoop MapReduce

Spark is about all data for large scale data processing

Spark supports agile data science to iterate rapidly

Spark can be integrated with IBM solutions

Why Spark?

IBM Spark © 2015 IBM Corporation

Spark is at Work with our Analytics Platform

Spark

Discovery & Exploration

Content Analytics

Prescriptive Analytics

Streaming Analytics

Business Intelligence & Predictive Analytics

Data Management

Content Management

Hadoop Systems

Data Warehousing

Information Integration & Governance

Apache Spark as a Service on IBM Bluemix (beta)

IBM Open Platform with Apache Hadoop can use

Spark as alternative to MapReduce; supports all

Apache Spark components

IBM BigInsights modules intend to leverage Spark

Apply existing Spark models directly to IBM Streams

Java Code written on Spark runs on IBM Streams

Use same cluster for Spark & IBM Streams

Hadoop Systems

Streaming Analytics

IBM Spark © 2015 IBM Corporation

Data Scientist Workbench Preview

IBM Spark © 2015 IBM Corporation

Ipython that lives in cloud and runs R, Scala,

Python

IBM Spark © 2015 IBM Corporation

Start with Stampede to Accelerate

Your Outcome

1

2

3

4

5

Address Common Spark Use Cases & Intelligent Applications

Domain-specific value in one day or 2-3 weeks

Knowledge Transfer from IBM

Customize Reference Architecture & Roadmap

IP that can be leveraged for business impact

IBM Spark © 2015 IBM Corporation

Now

IBM Open Platform with Apache Hadoop

IBM InfoSphere Streams

IBM Platform Computing

Our Use of Spark at IBM

More than 30 IBM Research initiatives

100 incubated applications in 10 days

3,500 Researchers and Developers to Spark

Targeted for later in year

Apache Spark as a Service on IBM Bluemix (in beta)

IBM Watson Analytics

SPSS Modeler & Analytics Server

IBM DataWorks

IBM PureData Systems with Fluid Query

IBM Commerce

IBM Spark © 2015 IBM Corporation

Contact your IBM rep to schedule a deeper dive

Discover Visit IBM Big Data Hub to read the latest news

Learn Start with the “Spark Fundamentals” at Big Data University

Try Spark Sign up for Apache Spark as a Service on IBM Bluemix at www.spark.tc/beta

Try Spark with Hadoop Download at IBM.com/Hadoop

Engage Join the IBM Spark Technology Center at www.spark.tc

Converse #SparkInsight

Take Your Next Step with IBM

IBM Spark © 2015 IBM Corporation

•Big Data Essentials

•SQL on Hadoop

•R on Hadoop / Spark

•Stream computing

Possible topics for the next

meetup

© 2015 IBM Corporation

Power of data. Simplicity of design. Speed of innovation.

© 2015 IBM Corporation

Additional Background