gluecon miller horizon

NEARING THE EVENT HORIZON.HADOOP WAS PREDICTABLE, WHAT’S NEXT?

Mike Millermike@cloudant.com

@mlmilleratmitMay 23, 2012

Mike Miller, GlueCon May 2012

What I Am

Cloudant Founder, Chief Scientist(we’re hiring at all positions)

A!liate Assistant Professor, Particle Physics(UW)

Background: machine learning, analysis, big data, globally distributed systems

What I Am

A CDN for your Application Data

What I Am Not

didn’t see these comingSuper luminal neutrinosRed Sox epic collapse in SeptemberRed Wings losing in the first round...

But here I go anyway

My First Postulate of Big-Data

What matters for google...... matters for the internet......and therefore matters for the enterprise...... will therefore be re-architected by Apache...... and therefore matters to you.

Google Matters

Evidence

Business Week, 12/24/2007

Evidence

The Old Canon

• Google File System (the important one)http://labs.google.com/papers/gfs.html

• MapReduce (the big one)http://labs.google.com/papers/mapreduce.html

• BigTable (clone me!)http://labs.google.com/papers/bigtable.html

• Dynamo (ok, AWS. but masterless quorum) http://s3.amazonaws.com/AllThingsDistributed/sosp/amazon-dynamo-sosp2007.pdf

copy these. use these. print $$$

MapReduce: The Awesome• Approachable interface

“What do I do with a single piece of data?”

• Data ParallelDevelopers can basically forget about scatter-gather

• Fault TolerantFailure at scale is the norm!Protects both user and system operator

• IO OptimizedBuilt for sequential IOcommodity disks spinning forward at O(20 MB/sec) each

So... is that it?

http://gigaom.com/cloud/democratizing-big-data-is-hadoop-our-only-hope/

So... is that it?

http://gigaom.com/cloud/what-it-really-means-when-someone-says-hadoop/

So... is that it?

http://gigaom.com/cloud/what-it-really-means-when-someone-says-hadoop/

http://mackiemathew.com/2012/02/25/the-problems-in-hadoop-when-does-it-fail-to-deliver/

MapReduce: The not so Awesome

• Hadoop doesn’t power big data applicationsNot a transactional datastore. Slosh back and forth via ETL

• Processing latencyNon-incremental, must re-slurp entire dataset every pass

• Ad-Hoc queriesBare metal interface, data import

• GraphsOnly a handful of graph problems amenable to MRhttp://www.computer.org/portal/web/csdl/doi/10.1109/MCSE.2009.120

Mike Miller, GlueCon May 2012 11

To the Event Horizon

Enter The New Canon• Percolator

incremental processinghttp://research.google.com/pubs/pub36726.html

• Dremelad-hoc analysis querieshttp://research.google.com/pubs/pub36632.html

• PregelBig graphshttp://dl.acm.org/citation.cfm?id=1807184

Scalable, Fault Tolerant, Approachable

Percolator

Percolator: incremental processing• Replaced MapReduce as the tool to build search index

“However, reprocessing the entire web discards the work done in earlier runs and makes latency proportional to the size of the repository, rather than the size of the update.”

• Bigtable alone can’t do it“BigTable scales...but doesn’t provide tools to help programmers maintain data invariants in the face of concurrent updates.”

• ApplicabilityIncrementally updating dataComputational output can be broken down into small piecesComputation large in some dimension (data size, cpu, etc)

• Does it matter?“...Converting the indexing system to an incremental system ... reduced the averaging document processing latency by a factor of 100...”

Percolator: incremental processing• BigTable plus...

Multi-row ACID Transactionssnapshot isolation, lazy locksup to 10s write latencies

Timestamps

NotificationsDo not maintain invariants

Observer Frameworkyour code to be run upon notification of an update

Start Timestamp (read)

Commit Timestamp (write)

Percolator: incremental processing

Near Linear Scaling to 15k Cores

Percolator: incremental processing

Latency lower than MapReduce by 100x

Dremel

Dremel: ad-hoc Query• Scalable, interactive ad-hoc query system for read-only nested data

“...capable of running aggregation queries over trillion-row tables in seconds.”

• ... on nested data structures in situWeb and scientific data is often non-relationalnested data (protobu"s) underlies most structured data at Google

• UsageDEFINE TABLE t AS /path/to/data/*SELECT TOP(signal1,100), COUNT(*) FROM t

• ApplicabilityAnalysis of crawled documentsTracking of install data for apps on Android MarketCrash reportsSpam analysis...

Dream BI Tool

Dremel: ad-hoc Query

• IngredientsIn situ dataSQL like interfaceServing trees for query executionColumn striped data (3-10x)Analysis Catalogs

21Columns ~10x faster than Records

MapReduce (via Sawzall)

Dremel (via SQL)

Benchmark Data

Dremel ~100x Faster than Stock MR

Significant Optimization Possible

Most Production Queries Executed in <10 seconds

Pregel

Pregel: Big Graphs• Massively parallel processing of big graphs

billions of vertices, trillions of edges

• Bulk synchronous parallel modelsequence of vertex oriented iterationssend/receive messages from other vertex computationsread/modify state of vertex, outgoing edges, graph topology

• Expressive, easy to programdistribution details hidden behind abstract API

• Iterativecomputation continues until each vertex votes to terminate

• In productionPageRank 15 lines of code

Pregel: Big Graphs• Master “Name” node

connects processes for messaging

• Message Passingno remote procedures, reads

• Graph hashed across nodesvertex, outgoing edges stored in RAM

• Aggregators global mechanism for aggregationall but final reduce computed on node local data

• Checkpointing configurable, enables automatic recovery

Pregel: Big Graphs

29Near Linear Scaling to 1B nodes

Learn More• Incremental Processing

Incremental, in-database map/reduce in Cloudant’s BigCouchHBase 0.92 supports observers/coprocessors Stream processing via Storm, HStreaming, etc.

• Ad Hoc QueryGoogle BigQueryColumn stores (Vertica, etc)OpenDremel (stalled?)?

• Big GraphsGiraph on Hadoop (Apache Incubator)Golden Orb (stalled?)

Lessons Learned

• Hire Je! Dean and Sanjay Ghemawat

• GFS enables everything

• There is massive opportunity on the horizon

gluecon miller horizon

ho miller

coresmike miller

xmike miller

approachablemike miller

updatemike miller

anywaymike miller

percolatormike miller

dremelmike miller

Technology

apistrat workshop @gluecon 2015:swagger - extended code...

nosql session gluecon may 2010

gluecon 2010

coreos @ gluecon 2015

what's next for apis - gluecon 2011

swagger apis for humans and robots (gluecon)

storm: the real-time layer - gluecon 2012

gluecon 2013 keynote ravello systems

gluecon 2014 - bringing node.js to the jvm

autonomic management of cloud applications with tonomi,...

gluecon 2016 keynote: deploying and managing blockchain...

gluecon 2013 netflix api crash course

bigdoor's jeff malek gluecon presentation

architectural patterns for scaling microservices and apis -...

apache hadoop an introduction - todd lipcon - gluecon 2010

gaming aws with docker - gluecon 2014

enabling walk up contributions to your documentation at...

gluecon: faster feedback with feature flags

between the apps gluecon session 05 27-2010

midokura gluecon 2014 - level up your openstack neutron...