avoid re-inventing the wheel when seeking big data bliss

11
Avoid Re-Inventing the Wheel When Seeking Big Data Bliss April 9 th , 2014

Post on 21-Oct-2014

240 views

Category:

Software


1 download

DESCRIPTION

For a webinar I did with BMC earlier this year. See the recording here: http://coteindustries.com/post/82218000239/avoid-re-inventing-the-wheel-when-seeking-big-data

TRANSCRIPT

Page 1: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

April 9th, 2014

Page 2: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

Michael CotéResearch Director, Infrastructure [email protected]@cote – http://cote.io

Responsible for systems management, application development, cloud software, and misc. “infrastructure software” agenda

Worked at Dell in corporate strategy, as an analyst for 6+ years, software developer for 10+ years

Joe GoldbergBMC Control-MSolutions [email protected]@GoldbergJoe

Joe is an IT professional with over 35 years of experience in the design, development, implementation, sales and marketing of enterprise solutions to Global 2000 organizations. Joe has been active in helping BMC products leverage new technology to deliver market-leading solutions

BMC slides were omitted from this presentation. See full presentation and recording here: https://www.brighttalk.com/webcast/9059/103135

Page 3: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

Usually, “Big Data” us a synonym for “Hadoop:” not so fastProcessing and analysis of very large data sets in their entirety

Massively parallel processing approaches

Both structured and multi-structured data

External (social) and corporate data

Schema-free and schema-on-read data storage/analysis

Predictive analytics as a fundamental BI tool

Reflection of collective intelligence

Identification of new patterns in data

Stream processing of sensor and machine-generated data

Native, SQL-based analysis of data in Hadoop and HBase

In-memory databases for rapid data ingestion

Real-time analysis of data prior to storage

TOTALDATA

Management alongside existing data technologies

Source: “Big data reconsidered: it's the economics, stupid,” 451 Research, Dec 2013.

Page 4: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

Another example: a provider of real-time information and analysis to the media and communications industries• Moved from storing 1% of data for 60 days in EDW @ $100,000/TB• To 100% of data for a year in Hadoop @ $900/TB• By migrating to Hadoop and open source databases the company identified over

$4m in cost savings over two years

Both companies have retained the use of traditional databases/warehousing, but Hadoop and other big data technologies add cost-effectiveness and flexibility

Big Data: “it’s the economics stupid”

“The price point that Hadoop comes in at is transformational. Hadoop has the ability to drive down operational cost and improve resource efficiency.”

Global Head of Architecture, Global Bank

Page 5: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

‘Big data’ not significant in core infrastructure yetAverage total storage capacity (TBs), and total storage footprint by workload illustrate the low level of adoption at today

Source: 2012: 451 Research The Info Pro Storage – Wave 16 | n=214 2013: 451 Research The Info Pro Storage – Wave 17 | n=200

2013

2012

0 1000 2000 3000 4000 5000 6000 7000 8000

DW and DBMSUnstructured fileVirtualized server/OSBackupArchiveOtherBig data/Hadoop

3%

3%

Page 6: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

Hadoop vs. EDW – not so much

Hadoop replacing data warehouse

Permanently migrating workloads to Hadoop

Temporaily offloading workloads to Hadoop

Hadoop for workloads not previously on DW

Hadoop not used

13.30%

31.60%

10.20%

37.80%

40.80%

Describe the relationship between Hadoop and the data warehouse within

your organization

Non-threatening, or additive

Threatening

Source: "Hadoop: a framework in search of a metaphor," 451 Survey conducted Sep/Oct 2013, sample=98.

Page 7: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

What’s big data good for?

• The processing and analysis of very large data sets in their entirety• Increased adoption of massively parallel processing approaches

• Storage and analysis of both structured and un-structured data• Integration of external (social) and corporate data for more complete perspective

• Ad hoc analytic approaches to identify new patterns in data• Interactive, native, SQL-based analysis of data in Hadoop and Hbase.

• Predictive analytics as a fundamental component of BI strategies• Machine-learning algorithms automate the reflection of collective intelligence

• Increased adoption of in-memory databases for rapid data ingestion• Stream processing of sensor and other machine-generated data/events

• Real-time analysis of data prior to storage within the data warehouse/Hadoop• “MR-ETL” – pre-processing data for EDW loads

Source: “Big data reconsidered: it's the economics, stupid,” 451 Research, Dec 2013.

Page 8: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

How to think strategically about big data

‘Big Data’ is the realization of competitive advantage by storing, processing and analyzing data that was previously ignored due to the cost and functional

limitations of traditional data management technologies to handle its volume, velocity and variety

Page 9: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

Zeroing in on Hadoop - barriers to Hadoop adoption

Hadoop is complex to configure, deploy and manage

Skilled staff are at a premium

Enterprises want to make the most of existing tools/skills

Enterprises are still trying tounderstand where Hadoop fits in their data management landscape

Page 10: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

Your homework…

1. What business problem are you solving? What questions will you ask The Data?

2. Baseline existing costs, monitor new costs – did you save?

3. Monitoring and managing your new grid

4. Bonus: self-service access for ad hoc analysts

Page 11: Avoid Re-Inventing the Wheel When Seeking Big Data Bliss

Thanks!@cote – [email protected] - http://cote.io