John Glendenning - Real-time Data Driven Services in the Cloud
DESCRIPTION
John Glendenning from DataStax's presentation from our Big Data breakfast conference
TRANSCRIPT
John Glendenning
DataStax
‘Real-time data driven services in the Cloud’
Real-time Data Driven Services in the Cloud
John Glendenning, DataStax VP & GM EMEA
Line of Business Manager: Adapt With Customers
“I have to move as fast as my market. I can’t get slowed down by people telling me this is going to take six months. It’s got to be ready, quickly. No matter what. And I need to adapt quickly with my customers.”
VP of IT: How Can I Scale Without Surprises?
“Given the explosion of data in the enterprise, how can I scale my IT investment to meet the demands of my lines of business, without taking on undue risk? (My choices are to spend $10 million to scale what I’ve got versus do something new)”
Nearly All Businesses Must Think Global
Datacenter
Cloud
About 1/2 of all sales will be online by the end of 2013
Source: (http://www.datastax.com/resources/whitepapers/bigdata)
24/7 monitoring demands
Global market demands
Localization deployment
Your Data Demands Can Change in an Instant
[Chart: fluctuating traffic demands, 2009 to 2012]
Major Changes: The Evolving Data Center
DataStax in the News
Big movies, big data: Netflix embraces NoSQL in the cloud
With billions of reads and writes daily, Netflix relies on NoSQL database Cassandra to replace a legacy Oracle deployment
May 02, 2013
(AP) The company chose Cassandra from DataStax for its flexibility to create and manage data clusters quickly, particularly in the cloud. Christos Kalantzis, Netflix's manager of cloud and platform engineering, explains that "solutions like Oracle don't run very well on virtualized hardware ... the architecture of Cassandra and the availability and consistency tuning and scalability made it a clear choice." To address these
Major Changes: The Evolving Data Center
LOB App
Oracle
LOB App
MySQL
LOB App
SQL Server
“What’s Happening?” Hyper Velocity, Transactional
NoSQL
Data Warehouse
Teradata/Exadata
“What Happened?” Massive Volume
Bit Bucket
Hadoop
Not Only SQL
What is a NoSQL Solution?
NoSQL is a broad class of next-generation database management systems that differ from the classic model of the relational database management system (RDBMS) in some significant ways. Most importantly, they:
• Are designed from the ground up to deal with the challenges of Big Data
• Scale massively at a fraction of the cost of a traditional RDBMS
• Offer a less rigid, more dynamic data model that drives flexibility and agility
• Can store structured, semi-structured and unstructured data
• Are not beholden to traditional RDBMS constraints such as ACID compliance
What is Apache Cassandra?
Apache Cassandra™ is a massively scalable distributed open source database.
Cassandra is designed to handle big data workloads across multiple data centres with no single point of failure, providing enterprises with continuous availability without compromising performance.
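As a rough illustration of how a distributed database can avoid a single point of failure, here is a minimal sketch of a masterless hash ring. It is a toy model written for this transcript, not Cassandra's actual partitioner: the `Ring` class, the node names and the use of MD5 as the hash are all assumptions made for the example.

```python
import bisect
import hashlib


def token(value: str) -> int:
    """Map a string to a ring position (MD5 is used here only as a stable hash)."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


class Ring:
    """Toy masterless ring: every node owns a token and no node is special."""

    def __init__(self, nodes, replication_factor=3):
        self.replication_factor = replication_factor
        # Sort nodes by token so the ring can be walked clockwise.
        self.entries = sorted((token(n), n) for n in nodes)

    def replicas(self, key: str):
        """Return the nodes that hold copies of `key`, walking clockwise from its token."""
        tokens = [t for t, _ in self.entries]
        start = bisect.bisect_right(tokens, token(key))
        count = min(self.replication_factor, len(self.entries))
        return [self.entries[(start + i) % len(self.entries)][1] for i in range(count)]

    def live_replicas(self, key: str, down):
        """Return the replicas of `key` that are still reachable."""
        return [n for n in self.replicas(key) if n not in down]


# Hypothetical five-node cluster keeping three copies of every key.
ring = Ring(["node-a", "node-b", "node-c", "node-d", "node-e"], replication_factor=3)
owners = ring.replicas("user:42")
survivors = ring.live_replicas("user:42", down={owners[0]})
print("replicas:", owners)
print("still reachable after losing one node:", survivors)
```

Because any node can work out which peers hold a key, losing one machine still leaves two copies reachable in this sketch.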
Cassandra Architecture Overview
• Fast / linear performance
• Elastic scalability
• No single point of failure
• Enterprise / multi-data center / cloud data distribution
• Location independence: read and write anywhere
• Tunable data consistency (per operation)
• Familiar SQL-like language: CQL
• Dynamic / flexible schema
• Can store structured, semi-structured and unstructured data
• Replication strategies from Amazon Dynamo paper
• Data structure and storage design from Google BigTable paper
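The "tunable data consistency (per operation)" point can be sketched with a little arithmetic. The level names ONE, QUORUM and ALL are Cassandra consistency levels; the two helper functions below are hypothetical and only illustrate the usual rule that a read is guaranteed to overlap the latest write when read replicas plus write replicas exceed the replication factor.

```python
def required_acks(level: str, replication_factor: int) -> int:
    """Number of replicas that must respond for a given consistency level."""
    levels = {
        "ONE": 1,
        "QUORUM": replication_factor // 2 + 1,  # a majority of replicas
        "ALL": replication_factor,
    }
    if level not in levels:
        raise ValueError(f"unsupported level in this sketch: {level}")
    return levels[level]


def read_overlaps_write(write_level: str, read_level: str, replication_factor: int) -> bool:
    """True when every read set must share at least one replica with every write set."""
    w = required_acks(write_level, replication_factor)
    r = required_acks(read_level, replication_factor)
    return w + r > replication_factor


# With three replicas, each operation can pick its own trade-off.
for write_level, read_level in [("QUORUM", "QUORUM"), ("ONE", "ONE"), ("ONE", "ALL")]:
    overlap = read_overlaps_write(write_level, read_level, replication_factor=3)
    print(f"write={write_level:<6} read={read_level:<6} overlap guaranteed: {overlap}")
```

Lower levels favour latency and availability, higher levels favour consistency, and the choice is made per request rather than per database.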
Apache Cassandra Leading in Performance
“In terms of scalability, there is a clear winner throughout our experiments. Cassandra achieves the highest throughput for the maximum number of nodes in all experiments with a linear increasing throughput.”
Solving Big Data Challenges for Enterprise Application Performance Management, Tilmann Rabl, et al., August 2013, p. 10. Benchmark paper presented at the Very Large Database Conference, 2013. http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2013.pdf
http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability-on.html
Netflix Cloud Benchmark…
End Point Independent NoSQL Benchmark
Highest in throughput…
Lowest in latency…
Who’s using Cassandra?
Why We Exist
“I can create a Cassandra cluster in any region of the world in 10 minutes. When marketing guys decide we want to move into a certain part of the world, we’re ready.”
Today’s applications must be always available and lightning fast as they scale to previously unimaginable levels.
Cassandra delivers both with a beautifully simple and elegant architecture.
What We Do Best
Cassandra was designed to do things that are impossible in other databases when it comes to availability and performance. Forget about losing a machine here or there -- Cassandra delivers a world where you can lose an entire datacenter and still perform as your customers expect.
“We have to be ready for disaster recovery all the time. It’s really great that Cassandra allows for active-active multiple data centers where we can read and write anywhere”
Jay Patel, Technical Architect at eBay (describing why they switched from legacy relational architecture)
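The claim that an active-active deployment can lose an entire data center and keep serving requests can be illustrated with a small availability check. This is a sketch under stated assumptions: the data center names and replica counts are hypothetical, and the functions only model the majority arithmetic behind a quorum within one data center (Cassandra's LOCAL_QUORUM level) versus a quorum across all replicas.

```python
def quorum(replica_count: int) -> int:
    """Majority of a replica set."""
    return replica_count // 2 + 1


def local_quorum_ok(dc: str, replicas: dict, live: dict) -> bool:
    """Can a client in `dc` reach a majority of the replicas in its own data center?"""
    return live.get(dc, 0) >= quorum(replicas[dc])


def global_quorum_ok(replicas: dict, live: dict) -> bool:
    """Can a client reach a majority of all replicas across every data center?"""
    total = sum(replicas.values())
    alive = sum(min(live.get(dc, 0), count) for dc, count in replicas.items())
    return alive >= quorum(total)


# Hypothetical layout: three replicas in each of two data centers.
replicas = {"london": 3, "virginia": 3}
# Simulate losing the whole "virginia" data center.
live = {"london": 3, "virginia": 0}

print("London clients, local quorum:", local_quorum_ok("london", replicas, live))
print("Any client, quorum across all replicas:", global_quorum_ok(replicas, live))
```

In this model, clients in the surviving data center keep reading and writing at a local quorum, while a request that insists on a majority of all six replicas cannot be satisfied until the lost data center returns.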
Without Breaking Your Budget
“To do what we need to do today without Cassandra would cost a couple million dollars more and would be significantly harder to manage operationally.”
DataStax: An Overview
• Founded in April 2010
• Home to Apache Cassandra Chair & most committers
• DataStax Enterprise – ‘Certified for Production’ Big Data platform
• 300+ customers
• 100+ employees
• Headquartered in San Francisco Bay area
• European HQ in London, UK
• Funded by prominent venture firms
DataStax Enterprise
Cassandra users come to DataStax for confidence and innovation
What Innovation?
• Production-certified Cassandra
• Round-the-clock support by the world’s experts
• Your big data system is easy to manage
• Satisfy your top security officer
• Search and analyze your hot data in context
Ask Different Things of Your Hot Data
[Diagram: DataStax Enterprise multi-data center cluster handling Read, Write, Analyze (Hadoop) and Search (Solr) workloads]
With the Security You Need
[Diagram: Read, Write, Analyze (Hadoop) and Search (Solr) workloads]
Into the Mainstream
“Security is very important to us, so we’re naturally very pleased to see all the new security features in DataStax Enterprise 3. Its scalability and performance are enabling us to develop an exciting financial data analytics platform that will create a better experience for our audience.”
Managed From a Single Pane
Provision, Monitor, Plan, Optimize, Recover
Cassandra Summit Europe 2013
London Barbican 2013
Two days, 30+ sessions, training day
Call for papers, sponsorship opportunity