analyzing unstructured data in hadoop webinar

© 2014 Datameer, Inc. All rights reserved. Analyzing Unstructured Data in Hadoop

Upload: datameer

Post on 15-May-2015

DESCRIPTION

Unstructured data is growing 62% per year faster than structured data. According to Gartner, data volumes are set to grow 800% in aggregate over the next 5 years, and 80% of it will be unstructured data. This on-demand webinar will highlight and discuss:

•  How applying big data analytics to unstructured data can help you gain richer, deeper and more accurate insights for competitive advantage

•  The sources of unstructured data, which include email, social media platforms, CRM systems, call center platforms (including notes and speech-to-text transcripts), and web scrapes

•  How monitoring the communications of your customers and prospects enables you to make time-sensitive decisions and jump on new business opportunities

TRANSCRIPT

Page 1: Analyzing Unstructured Data in Hadoop Webinar


Analyzing Unstructured Data in Hadoop

Page 2: Analyzing Unstructured Data in Hadoop Webinar

View Recording

You can view the recording of this webinar at:

http://info.datameer.com/Online-Slideshare-Analyzing-Unstructured-Data-in-Hadoop-On-Demand.html

Page 3: Analyzing Unstructured Data in Hadoop Webinar

© 2013 Datameer, Inc. All rights reserved.

Matt Schumpert @datameer

Senior Director, Solutions Engineering

Matt has been working in the enterprise infrastructure software space for over 14 years in various capacities, including sales engineering, strategic alliances and consulting. Matt currently runs the pre-sales engineering team at Datameer, supporting all technical aspects of customer engagement from initial contact through roll-out of customers into production. Matt holds a BS in Computer Science from the University of Virginia.

#datameer @datameer

About Our Speaker

Page 4: Analyzing Unstructured Data in Hadoop Webinar

Agenda

•  Market & Data trends

•  Tuning into new channels

•  The good news

•  The rise of wrangling

•  Analytics requirements

•  Bringing order to chaos

•  Use Cases

Page 5: Analyzing Unstructured Data in Hadoop Webinar

What we learned in 2010… (or before)

Page 6: Analyzing Unstructured Data in Hadoop Webinar

Market & Data Trends

•  Data volumes will grow 800% in 5 years

•  Unstructured data is growing 62% faster than structured data

•  80% of all data will be unstructured in 2019

•  “Big Unstructured Data” requires new tech.

•  85% of the Fortune 500 will be unable to exploit Big Data for competitive advantage through 2015

Source: Gartner

Page 7: Analyzing Unstructured Data in Hadoop Webinar

Market & Data Trends

•  ‘Multi-structured’ is the word of the day

•  Mainstream IT tools broadening the base

•  Competitive advantage lies outside your firewall


Page 8: Analyzing Unstructured Data in Hadoop Webinar

Tuning Into New Channels

Page 9: Analyzing Unstructured Data in Hadoop Webinar

Tuning Into New Channels

•  Public & social data is available by the firehose

•  The new discipline: connecting, filtering, switching

•  Find the right keywords, dictionaries, segments

•  Learn from, but don’t emulate search engines

•  Beware of point solutions
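
The "connecting, filtering, switching" and "keywords, dictionaries, segments" bullets above can be illustrated with a small sketch. This is not Datameer functionality; the dictionary contents, segment names and sample posts are all hypothetical, and a real firehose would need far more careful tokenization.

```python
import re
from typing import Dict, Iterable, Iterator, Set, Tuple

# Hypothetical keyword dictionary: segment name -> terms to watch for.
DICTIONARY: Dict[str, Set[str]] = {
    "pricing": {"price", "discount", "expensive"},
    "support": {"refund", "broken", "helpdesk"},
}

# Very simple tokenizer: lowercase runs of letters, digits and apostrophes.
TOKEN = re.compile(r"[a-z0-9']+")


def tag_segments(text: str) -> Set[str]:
    """Return the segments whose terms appear in the text."""
    tokens = set(TOKEN.findall(text.lower()))
    return {seg for seg, terms in DICTIONARY.items() if tokens & terms}


def filter_stream(posts: Iterable[str]) -> Iterator[Tuple[str, Set[str]]]:
    """Keep only posts that match at least one segment."""
    for post in posts:
        segments = tag_segments(post)
        if segments:
            yield post, segments


# Hypothetical sample of public posts.
sample_posts = [
    "Love the new phone but the price is too high",
    "Great weather today",
    "My charger arrived broken, I want a refund",
]
matches = list(filter_stream(sample_posts))
```

Here `matches` keeps the first and third posts, tagged with the segments they hit, and drops the unrelated one. Routing each tagged post to a different downstream analysis would be the "switching" step.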

Page 10: Analyzing Unstructured Data in Hadoop Webinar

The Good News

•  All data has structure

•  Storage is cheap (Hadoop ~= $300 / TB)

•  Processing is cheap (“free”)

•  Unstructured data compresses well

•  Data APIs abound

•  Public data blossoming (data.gov, etc.)

Page 11: Analyzing Unstructured Data in Hadoop Webinar

The Rise of Wrangling

•  A ‘record’ is no longer a record

•  Event streams need different angles of attack

•  Explode, project, align, window, search

•  New companies/technologies specializing in it

Source: Gartner
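
Three of the wrangling verbs above (explode, project, window) can be sketched on a tiny event stream. The event layout, field names and 60-second window size are assumptions made for illustration, not something taken from the webinar.

```python
from collections import defaultdict

# Hypothetical raw events: one "record" holds several nested items.
events = [
    {"user": "a", "ts": 5, "items": ["tv", "cable"]},
    {"user": "b", "ts": 42, "items": ["phone"]},
    {"user": "a", "ts": 65, "items": ["tv"]},
]


def explode(records, field):
    """Emit one flat row per element of a list-valued field."""
    for rec in records:
        for value in rec[field]:
            row = dict(rec)
            row[field] = value
            yield row


def project(rows, fields):
    """Keep only the named fields of each row."""
    for row in rows:
        yield {f: row[f] for f in fields}


def window_counts(rows, size):
    """Count rows per tumbling window of `size` seconds, keyed by window start."""
    counts = defaultdict(int)
    for row in rows:
        counts[row["ts"] // size * size] += 1
    return dict(counts)


rows = list(project(explode(events, "items"), ["ts", "items"]))
per_minute = window_counts(rows, 60)
```

The three input records become four flat rows, which is one way to read "a record is no longer a record": the unit of analysis is chosen at wrangling time, not at ingest.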

Page 12: Analyzing Unstructured Data in Hadoop Webinar

Analytics Requirements (1)

•  A scalable Big Data foundation (Hadoop)

•  Schema-on-read

•  Data profiling & cleansing

•  Fast, visual iteration over samples

Source: Gartner
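
As a rough illustration of schema-on-read from the list above: raw lines are stored untouched, and a schema is applied only when the data is read. The schema, field names and sample lines below are hypothetical, and this sketch shows the general idea rather than how Hadoop or Datameer implement it.

```python
import json

# Raw lines stored as-is; nothing is validated at write time.
raw_lines = [
    '{"id": "1", "amount": "19.99", "note": "first order"}',
    '{"id": "2", "amount": "n/a"}',
    "not json at all",
]

# Hypothetical schema applied only at read time: field -> converter.
SCHEMA = {"id": int, "amount": float, "note": str}


def read_with_schema(lines, schema):
    """Parse each line and coerce fields; missing or bad values become None."""
    for line in lines:
        try:
            obj = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not JSON
        if not isinstance(obj, dict):
            continue  # skip JSON values that are not objects
        row = {}
        for field, convert in schema.items():
            try:
                row[field] = convert(obj[field])
            except (KeyError, ValueError, TypeError):
                row[field] = None
        yield row


records = list(read_with_schema(raw_lines, SCHEMA))
```

Counting the `None` values per field in `records` would be a simple form of the data profiling mentioned in the same list, and changing `SCHEMA` requires no rewrite of the stored lines.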

Page 13: Analyzing Unstructured Data in Hadoop Webinar

Analytics Requirements (2)

•  Text mining, without programming

•  Helper functions for semi/un-structured formats

•  Data connectors, new visualizations

•  Patience, and a culture of data discovery

Page 14: Analyzing Unstructured Data in Hadoop Webinar

Datameer: End-to-End Big Data Analytics

Page 15: Analyzing Unstructured Data in Hadoop Webinar

Enterprise Integration

Page 16: Analyzing Unstructured Data in Hadoop Webinar

Bringing Order to Chaos

•  ‘Big Data Visualization’ is an oxymoron

•  Rich, detailed summaries are the goal

•  ‘It’s the analytics, stupid’

Page 17: Analyzing Unstructured Data in Hadoop Webinar

Industry Use Cases

•  Retail: Competitive pricing through web scraping

•  MFG: Product sentiment through Twitter

•  FSI: Brand preferences from Facebook “likes”

•  Gov: Nefarious behavior through email seizure

Page 18: Analyzing Unstructured Data in Hadoop Webinar
Page 19: Analyzing Unstructured Data in Hadoop Webinar

For more information

http://www.datameer.com, @datameer, [email protected]
