HADOOP SELF-SERVICE DATA PREP FUELS ANALYTICS
Trifacta Data Wrangler Demonstration

Upload: senturus

Post on 15-Apr-2017

Category: Data & Analytics


TRANSCRIPT

Page 1: Hadoop Self-Service Data Prep Fuels Analytics

HADOOP SELF-SERVICE DATA PREP FUELS ANALYTICS
Trifacta Data Wrangler Demonstration

Page 2: Hadoop Self-Service Data Prep Fuels Analytics

Agenda

• Introduction
• Data Analytics Process
• Gap Between Raw Data & Analysis
• The Trifacta Solution
• Demo
• Senturus Overview
• Additional Resources

Copyright 2016 Senturus, Inc. All Rights Reserved.

Page 3: Hadoop Self-Service Data Prep Fuels Analytics

Presenters

Will Davis, Director of Product Marketing, Trifacta

Greg Herrera, President and Co-Founder, Senturus, Inc.

Connor Carreras, Customer Success Manager, Trifacta

Page 4: Hadoop Self-Service Data Prep Fuels Analytics

Hundreds of Free Resources: www.senturus.com

RESOURCE LIBRARY

An extensive, free library of past webinars, demonstrations, whitepapers, presentations, helpful hints, and more.

Page 5: Hadoop Self-Service Data Prep Fuels Analytics

Hear the Recording

This slide deck is from the webinar: Fuel Analytics with Self-Service Hadoop Data Prep

To listen to the FREE recording of the presentation or download this deck go to:

http://www.senturus.com/resources/fuel-analytics-self-service-hadoop-data-prep/

Page 6: Hadoop Self-Service Data Prep Fuels Analytics

FUEL ANALYTICS WITH SELF-SERVICE HADOOP DATA PREP

Page 7: Hadoop Self-Service Data Prep Fuels Analytics

The Data Analytics Process

QUESTION, ANALYZE, INSIGHT

DATA WRANGLING: DISCOVER, STRUCTURE, CLEANSE, ENRICH, VALIDATE, PUBLISH
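The six wrangling steps named on this slide can be sketched as a small pipeline. This is only an illustration in plain Python, not Trifacta code: the sample data, the `REGION_MANAGERS` lookup table, and every function name are assumptions made up for the example.

```python
import csv
import io
from collections import Counter

# Hypothetical raw extract and lookup table, invented for this sketch.
RAW = "id|amount|region\n1|10.5|west\n2||east\n3|7.25|west\n"
REGION_MANAGERS = {"west": "Kim", "east": "Lee"}


def discover(raw: str) -> Counter:
    """Profile the raw text: count how often each candidate delimiter occurs."""
    return Counter(ch for ch in raw if ch in "|,\t")


def structure(raw: str, delimiter: str) -> list:
    """Parse the delimited text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(raw), delimiter=delimiter))


def cleanse(rows: list) -> list:
    """Drop rows with a missing amount and convert the rest to float."""
    return [{**r, "amount": float(r["amount"])} for r in rows if r["amount"]]


def enrich(rows: list) -> list:
    """Add a manager column by joining against the lookup table."""
    return [{**r, "manager": REGION_MANAGERS[r["region"]]} for r in rows]


def validate(rows: list) -> None:
    """Fail loudly if any row breaks the expectations of the analysis."""
    assert all(r["amount"] >= 0 for r in rows), "negative amount found"


def publish(rows: list) -> str:
    """Write the analysis-ready rows as comma-separated text."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=list(rows[0]), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


# Pick the most frequent candidate delimiter, then run the steps in order.
delimiter = discover(RAW).most_common(1)[0][0]
rows = enrich(cleanse(structure(RAW, delimiter)))
validate(rows)
published = publish(rows)
```

Each step takes the previous step's output, so a step can be inspected or swapped on its own. A real tool would do this interactively and at far larger scale.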

Page 8: Hadoop Self-Service Data Prep Fuels Analytics

Self-Service Data Preparation Coverage

Page 9: Hadoop Self-Service Data Prep Fuels Analytics

The Gap Between Raw Data & Analysis

80% of the work in any data project is preparing the data for analysis

IT: BUSINESS SYSTEM DATA, MACHINE GENERATED DATA, THIRD PARTY DATA

BUSINESS: ANALYSIS & VISUALIZATION

Page 10: Hadoop Self-Service Data Prep Fuels Analytics

Back & Forth Between Business & IT

BUSINESS: How can I access the raw data?

IT: What do you want to analyze?

BUSINESS: I can’t tell you until I see the data. Let me see the data first.

IT: I can’t just point you to the raw data. You’ll need to tell me.

Page 11: Hadoop Self-Service Data Prep Fuels Analytics

What’s the Cause? Existing Tools…

• Hand-Coding
• Mapping-Based ETL

Page 12: Hadoop Self-Service Data Prep Fuels Analytics

Oh Yeah.... There’s Excel Too

Page 13: Hadoop Self-Service Data Prep Fuels Analytics

Hear the Recording

To view the FREE recording of the presentation or download this deck go to: http://www.senturus.com/resources/fuel-analytics-self-service-hadoop-data-prep/

The Senturus comprehensive library of recorded webinars, demos, white papers, presentations, and case studies is available on our website:

http://www.senturus.com/resources/

Page 14: Hadoop Self-Service Data Prep Fuels Analytics

Trifacta Overview

Page 15: Hadoop Self-Service Data Prep Fuels Analytics

What’s Required to Bridge the Gap?

Empower Users

Govern Data & Processes

Page 16: Hadoop Self-Service Data Prep Fuels Analytics

Trifacta’s Approach: Empower Users

Interact, Predict, Preview

Page 17: Hadoop Self-Service Data Prep Fuels Analytics

Predictive Transformation

MapReduce or Spark

Page 18: Hadoop Self-Service Data Prep Fuels Analytics

Maintain Governance While Empowering Users

ORGANIZATIONAL REQUIREMENTS

Metadata & Lineage

Operationalization

TECHNOLOGY REQUIREMENTS

Security

Page 19: Hadoop Self-Service Data Prep Fuels Analytics

Trifacta’s Data Wrangling Solution

Page 20: Hadoop Self-Service Data Prep Fuels Analytics

Trifacta Self-Service Data Preparation Editions

➔ Hadoop Based
➔ Data Lake Initiatives
➔ Unlimited Volume & Scalability
➔ Enterprise Support
➔ Subscription Fee

➔ Desktop
➔ Smaller Data Sets
➔ Community Support
➔ Free

www.trifacta.com/start-wrangling

Page 21: Hadoop Self-Service Data Prep Fuels Analytics

#1 Ranked End User Data Preparation Vendor

Used by More Than 3,000 Companies

Page 22: Hadoop Self-Service Data Prep Fuels Analytics

Demo

Page 23: Hadoop Self-Service Data Prep Fuels Analytics

Trifacta & Hadoop Workflow

1. Register Hadoop Data Sets in Trifacta (HDFS)

2. Visualize, Interact & Define Transformation Script (HDFS)

3. Execute Script on Entirety of Data Set at Scale in Hadoop (HDFS; Execution in MapReduce or Spark)

4. Select Transformation Output Format & Location

Hadoop: HDFS (Parquet or Avro), Table in HCatalog

Analytic Tools: Tableau, Zoomdata, Etc…
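The idea behind this workflow (define the transformation script against a sample, then run the same script over the whole data set) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not Trifacta's API: the in-memory list stands in for files registered from HDFS, a plain Python loop stands in for execution in MapReduce or Spark, and all column and function names are hypothetical.

```python
from typing import Callable, Dict, Iterable, Iterator, List

Row = Dict[str, str]
Step = Callable[[Row], Row]


def trim_whitespace(row: Row) -> Row:
    """Strip leading and trailing whitespace from every value."""
    return {key: value.strip() for key, value in row.items()}


def uppercase_state(row: Row) -> Row:
    """Normalize the hypothetical 'state' column to upper case."""
    return {**row, "state": row["state"].upper()}


def apply_script(script: List[Step], rows: Iterable[Row]) -> Iterator[Row]:
    """Run every step of the script, in order, over each row."""
    for row in rows:
        for step in script:
            row = step(row)
        yield row


# Step 1: the "registered" data set (in-memory stand-in for files in HDFS).
full_data = [
    {"customer": " Ann ", "state": "ca"},
    {"customer": "Bob", "state": " ny"},
    {"customer": " Cy", "state": "Tx "},
]

# Step 2: define the script while previewing its effect on a small sample.
script = [trim_whitespace, uppercase_state]
sample_preview = list(apply_script(script, full_data[:1]))

# Step 3: run the same script, unchanged, over the entire data set.
full_output = list(apply_script(script, full_data))
```

Because the script is just an ordered list of steps, the version previewed on the sample is exactly the version executed on the full data; only the input changes. Step 4, writing Parquet or Avro output, is left out of the sketch.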

Page 24: Hadoop Self-Service Data Prep Fuels Analytics

Summary

The great paradox of data analysis is that 80% of the analysis process is spent cleaning or preparing data, due to the complexity and diversity of the data.

Trifacta explained how the emergence of new self-service data prep solutions for Hadoop benefits organizations. We discussed how Trifacta’s data wrangling solutions are used to improve the efficiency of existing analytics processes and successfully execute new analytics initiatives.

We demonstrated Trifacta Data Wrangler and discussed:

• Challenges that self-service data prep solves and why it has quickly gained popularity
• Using self-service data prep to execute new types of analysis or augment existing processes
• Range of features of self-service data prep tools like Trifacta
• Case studies: PepsiCo and Royal Bank of Scotland

Page 26: Hadoop Self-Service Data Prep Fuels Analytics

WHO WE ARE

Business Analytics Consultants

Page 27: Hadoop Self-Service Data Prep Fuels Analytics

Bridging the Gap Between Data & Decision Making

DECISIONS & ACTIONS

Business Needs

Analysis Ready Data

Page 28: Hadoop Self-Service Data Prep Fuels Analytics

Business Analytics Architects

• Dashboards, Reporting & Visualizations
• Data Preparation & Modern Data Warehousing
• Self-Service Business Analytics
• Big Data & Advanced Analytics
• Planning & Forecasting Systems

Page 29: Hadoop Self-Service Data Prep Fuels Analytics

900+ Clients, 2000+ Projects, 16+ Years

Page 30: Hadoop Self-Service Data Prep Fuels Analytics

ADDITIONAL RESOURCES

Page 31: Hadoop Self-Service Data Prep Fuels Analytics

Free Trial of Trifacta Wrangler

This full version of Trifacta Wrangler allows you to:

• Accelerate the analytic process with an intelligent, guided approach
• Move beyond rigid ETL tools and traditional spreadsheet formulas
• Leverage new data sources once reserved for data scientists

https://www.trifacta.com/trifacta-wrangler/?utm_medium=referral&utm_source=senturus&utm_campaign=senturus+webinar

Page 32: Hadoop Self-Service Data Prep Fuels Analytics

Upcoming Events: www.senturus.com/events

Page 33: Hadoop Self-Service Data Prep Fuels Analytics

More Free Resources: www.senturus.com

Page 34: Hadoop Self-Service Data Prep Fuels Analytics

Thank You!

www.senturus.com [email protected]

888 601 6010

Copyright 2016 by Senturus, Inc. This entire presentation is copyrighted and may not be reused or distributed without the written consent of Senturus, Inc.
