spark and hadoop at production scale-(anil gadre, mapr)

17
© 2015 MapR Technologies 1 ® © 2015 MapR Technologies Taking Your Spark To Production Scale Anil Gadre, SVP Product Management, MapR Technologies June 15, 2015

Upload: spark-summit

Post on 21-Apr-2017

3.962 views

Category:

Data & Analytics


1 download

TRANSCRIPT

®© 2015 MapR Technologies 1

®

© 2015 MapR Technologies

Taking Your Spark To Production Scale Anil Gadre, SVP Product Management, MapR Technologies June 15, 2015

®© 2015 MapR Technologies 2

The Journey To Production Scale

Trials, science projects

Large mission-critical, operational

deployments

®© 2015 MapR Technologies 2

®© 2015 MapR Technologies 3

Companies with Spark & MapR in Production

GLOBAL TELECOM

HEALTHCARE

GLOBAL FINANCIAL SERVICES

®© 2015 MapR Technologies 4

Key Issues To Plan For

Spark stack

support?

Real-time?

Enterprise reliability &

security?

Open ended agility?

1

2

3

4

®© 2015 MapR Technologies 5

Global Managed Security Services

delivered on Hadoop

Spark Stream processing used to first check for known threats

Data next processed on Hadoop

using MLLib and GraphX

Additional SQL querying done via Spark SQL

Security Intelligence Operations

Delivers Lightning Fast Analytics for Clients

Building largest Hadoop cluster in Australia Real-time analytics using Spark on MapR–reducing data loading time from hours to minutes Leverage multi-tenancy, high-performance and reliability of MapR

®© 2015 MapR Technologies 7

Next-Gen Genomics

Develop flexible platform to keep up with fast changing research techniques POSIX file access lets bio-informaticians use existing tools with open source tools (Spark) Graph manipulations can be done reliably and at scale using Spark

®© 2015 MapR Technologies 8

Real-Time Customer Analytics

• MapR Data Lake stores both online and archive data

• Spark on MapR reduced ETL processing

• NFS moved data into the cluster seamlessly

• 1/10th Total Cost of Ownership vs. old way

• New customer onboarding cut from months to weeks

®© 2015 MapR Technologies 9

Databricks & MapR Strategic Partnership (since April 2014)

Support for the complete Spark stack

Engineering & roadmap collaboration

Back-end support +

®© 2015 MapR Technologies 10

The Most Complete Spark Environment

Spark SQL (SQL)

Spark Streaming (Streaming)

MLlib (Machine learning)

GraphX (Graph computation)

Foundation For Enterprise-Grade Spark

®© 2015 MapR Technologies 11

DB Operations

Real-Time and Actionable

Analytics

Operations + Analytics on One Hadoop Platform with SQL Access

Mobile application

server

Customer 360 dashboard

Churn analysis Product/service optimization and personalization

Real-time ad targeting

Web application server

Data exploration (SQL)

• User profiles and state • User interactions • Real-time location data

• Web and mobile session state • Comments/rankings

®© 2015 MapR Technologies 12

Spark + MapR = Ready For Production Success

World-record performance on disk High Performance

SLA-Driven Applications •  High availability •  Data protection •  Disaster recovery Reliability for Production

Strategic partnership with Databricks to ensure enterprise support for the entire stack

24/7 Best-in-class Global Support

MapR-DB + Spark = real-time analytics Operational Data Store

®© 2015 MapR Technologies 13

MapR Introduces 3 New Spark-Based Quick Start Solutions

Real-Time Security Log Analytics

Time Series Analytics

Genome Sequencing

®© 2015 MapR Technologies 14

Self-Service Data Exploration

Data Agility with Less IT Required

Single SQL Interface for Structured and Semi-Structured Data

®© 2015 MapR Technologies 15

Free On-Demand Training

www.mapr.com/training

®© 2015 MapR Technologies 16

Get Your Tattoo In The MapR Booth!

Show off your Kickstart My Heart skills and enter to win Xbox 360 & Guitar Hero

®© 2015 MapR Technologies 17

Top-Ranked NoSQL

Top-Ranked Hadoop Distribution

Top-Ranked SQL-on Hadoop Solution