analyzing unstructured data in hadoop webinar

Post on 15-May-2015

1.127 Views

Category:

Technology

1 Downloads

Preview:

Click to see full reader

DESCRIPTION

Unstructured data is growing 62% per year faster than structured data. According to Gartner, data volumes are set to grow 800% in aggregate over the next 5 years, and 80% of it will be unstructured data. This on-demand webinar will highlight and discuss: How applying big data analytics to unstructured data can help you gain richer, deeper and more accurate insights to gain competitive advantages The sources of unstructured data which include email, social media platforms, CRM systems, call center platforms (including notes and speech-to-text transcripts), and web scrapes How monitoring the communications of your customers and prospects enables you to make time-sensitive decisions and jump on new business opportunities

TRANSCRIPT

© 2014 Datameer, Inc. All rights reserved.

Analyzing Unstructured Data in Hadoop!

View Recording !! You can view the recording of this webinar

at:

http://info.datameer.com/Online-Slideshare-Analyzing-Unstructured-Data-in-Hadoop-

On-Demand.html

© 2013 Datameer, Inc. All rights reserved.

Matt Schumpert @datameer Senior Director, Solutions Engineering Matt has been working in the enterprise infrastructure software space for over 14 years in various capacities, including sales engineering, strategic alliances and consulting. Matt currently runs the pre-sales engineering team at Datameer, supporting all technical aspects of customer engagement from initial contact through roll-out of customers into production. Matt holds a BS in Computer Science from the University of Virginia. 

#datameer @datameer

About Our Speaker!

Agenda!•  Market & Data trends

•  Tuning into new channels

•  The good news

•  The rise of wrangling

•  Analytics requirements

•  Bringing order to chaos

•  Use Cases

What we learned in 2010… (or before)!

Market & Data Trends!

•  Data volumes will grow 800% in 5 years

•  Unstructured data is growing 62% faster

•  80% of all data will be unstructured in 2019

•  “Big Unstructured Data” requires new tech.

•  85% of the Fortune 500 will be unable to exploit Big Data for competitive advantage through 2015

Source: Gartner

Market & Data Trends!•  ‘Multi-structured’ is the word of the day

•  Mainstream IT tools broadening the base

•  Competitive advantage lies outside your firewall!

S U

Tuning Into New Channels!

Tuning Into New Channels!

•  Public & social data is available by the firehose

•  The new discipline: connecting, filtering, switching

•  Find the right keywords, dictionaries, segments

•  Learn from, but don’t emulate search engines

•  Beware of point solutions

The Good News!

•  All data has structure

•  Storage is cheap (Hadoop ~= $300 / TB)

•  Processing is cheap (“free”)

•  Unstructured data compresses well

•  Data APIs abound

•  Public data blossoming (data.gov, etc.)

The Rise of Wrangling!•  A ‘record’ is no longer a record

•  Event streams need different angles of attack

•  Explode, project, align, window, search

•  New companies/technologies specializing in it Source: Gartner

Analytics Requirements (1)!

•  A scalable Big Data foundation (Hadoop)

•  Schema-on-read

•  Data profiling & cleansing

•  Fast, visual iteration over samples

Source: Gartner

Analytics Requirements (2)!

•  Text mining, without programing

•  Helper functions for semi/un-structured formats

•  Data connectors, new visualizations

•  Patience, and a an culture of data discovery

Datameer:! End-to-End Big Data Analytics!

Enterprise Integration!

Bringing Order to Chaos!

•  ‘Big Data Visualization’ is an oxymoron

•  Rich, detailed summaries are the goal

•  ‘It’s the analytics, stupid’

Industry Use Cases!•  Retail: Competitive pricing through web scraping

•  MFG: Product sentiment through Twitter

•  FSI: Brand preferences from Facebook “likes”

•  Gov: Nefarious behavior through email seizure!

For more information!

http://www.datameer.com " @datameer " mschumpert@datameer.com

Learn more

Contact

#datameer @datameer

top related