hadoop self-service data prep fuels analytics
TRANSCRIPT
HADOOP SELF-SERVICE DATA PREP FUELS ANALYTICSTrifacta Data Wrangler Demonstration
• Introduction• Data Analytics Process• Gap Between Raw Data & Analysis• The Trifacta Solution• Demo• Senturus Overview• Additional Resources
Agenda
Copyright 2016 Senturus, Inc. All Rights Reserved.
Presenters
Copyright 2016 Senturus, Inc. All Rights Reserved.
Will DavisDirector of Product
MarketingTrifacta
Greg HerreraPresident and Co-
FounderSenturus, Inc.
Connor CarrerasCustomer Success
ManagerTrifacta
Hundreds of Free Resources: www.senturus.com
RESOURCE LIBRARYAn extensive, free library of past webinars, demonstrations, whitepapers, presentations, helpful hints, and more.
Copyright 2016 Senturus, Inc. All Rights Reserved.
This slide deck is from the webinar: Fuel Analytics withSelf-Service Hadoop Data Prep
To listen to the FREE recording of the presentation or download this deck go to:
http://www.senturus.com/resources/fuel-analytics-self-service-hadoop-data-prep/
Hear the Recording
Copyright 2016 Senturus, Inc. All Rights Reserved.
FUEL ANALYTICS WITHSELF-SERVICE HADOOP DATA PREP
DATA WRANGLING
The Data Analytics Process
QUESTION ANALYZE INSIGHT
DISCOVER STRUCTURE CLEANSE ENRICH VALIDATE PUBLISH
8
Self-Service Data Preparation Coverage
The Gap Between Raw Data & Analysis
of the work in any data project is preparing the data for analysis
ANALYSIS & VISUALIZATION
BUSINESS SYSTEM DATA
MACHINE GENERATED DATA
THIRD PARTY DATA
IT
BUSINESS
80%
Back & Forth Between Business & IT
How can I access the raw data?
What do you want to analyze?
I can’t tell you until I see the data – let me see the data
first.
I can’t just point you to the raw data – you’ll need to tell
me.
ITBUSINESS
What’s the Cause? Existing Tools…
Hand-Coding Mapping-Based ETL
Oh Yeah.... There’s Excel Too
To view the FREE recording of the presentation or download this deck go to: http://www.senturus.com/resources/fuel-analytics-self-service-hadoop-data-prep/
The Senturus comprehensive library of recorded webinars, demos, white papers, presentations, and case studies is available on our website:
http://www.senturus.com/resources/
Hear the Recording
Copyright 2016 Senturus, Inc. All Rights Reserved.
14
Trifacta Overview
15
What’s Required to Bridge the Gap?
Empower Users
Govern Data & Processes
Trifacta’s Approach: Empower Users
Interact Predict
Preview
PredictiveTransformation
MapReduce or Spark
18
Maintain Governance While Empowering Users
ORGANIZATIONAL REQUIREMENTS
Metadata & Lineage Operationalization
TECHNOLOGY REQUIREMENTS
Security
Trifacta’s Data Wrangling Solution
19
Trifacta Self-Service Data PreparationEditions
➔ Hadoop Based ➔ Data Lake Initiatives➔ Unlimited Volume & Scalability➔ Enterprise Support➔ Subscription Fee
➔ Desktop➔ Smaller Data Sets➔ Community Support➔ Free
www.trifacta.com/start-wrangling
#1 Ranked End User Data Preparation Vendor
Used by More Than 3,000 Companies
22
Demo
Trifacta & Hadoop Workflow
23
Register Hadoop Data Setsin Trifacta
1.
HDFS
Visualize, Interact & Define Transformation Script
2.
HDFS
Execute Script on Entirety of Data Set at Scale in Hadoop
3.
HDFSExecution in MapReduce or Spark
Select TransformationOutput Format & Location
4.
Analytic ToolsHadoop
HDFSParquet or Avro
Table in HCatalog
TableauZoomdata
Etc…
Analytic Tools
The great paradox of data analysis is that 80% of the analysis process is spent cleaning or preparing data, due to complexity and diversity of the data.
Trifacta explained how the emergence of new self-service data prep solutions for Hadoop benefit organizations. We discussed how Trifacta’s data wrangling solutions are used to improve the efficiency of existing analytics processes and successfully execute new analytics initiatives.
We demonstrated Trifacta Data Wrangler and discussed:• Challenges self-service data prep solutions solves and why it
has quickly gained in popularity• Using self-service data prep to execute new types of analysis
or augment existing processes• Range of features of self-service data prep tools like Trifacta• Case studies: PepsiCo and Royal Bank of Scotland
Summary
Copyright 2016 Senturus, Inc. All Rights Reserved.
To view the FREE recording of the presentation or download this deck go to: http://www.senturus.com/resources/fuel-analytics-self-service-hadoop-data-prep/
The Senturus comprehensive library of recorded webinars, demos, white papers, presentations, and case studies is available on our website:
http://www.senturus.com/resources/
Hear the Recording
Copyright 2016 Senturus, Inc. All Rights Reserved.
Business Analytics ConsultantsWHO WE ARE
Bridging the Gap Between Data & Decision Making
DECISIONS & ACTIONS
Business Needs
Analysis Ready Data
Analysis Ready Data
.
• Dashboards, Reporting & Visualizations• Data Preparation & Modern Data
Warehousing • Self-Service Business Analytics • Big Data & Advanced Analytics• Planning & Forecasting Systems
Business Analytics Architects
900+ Clients, 2000+ Projects, 16+ Years
29Copyright 2016 Senturus, Inc. All Rights Reserved.
ADDITIONAL RESOURCES
This full-version of the Trifacta Wrangler allows you to:• Accelerate the analytic process with an
intelligent, guided approach• Move beyond rigid ETL tools and traditional
spreadsheet formulas• Leverage new data sources once reserved for
data scientists
https://www.trifacta.com/trifacta-wrangler/?utm_medium=referral&utm_source=senturus&utm_campaign
=senturus+webinar
Free Trial of Trifacta Wrangler
Copyright 2016 Senturus, Inc. All Rights Reserved.
www.senturus.com/events Upcoming Events
Copyright 2016 Senturus, Inc. All Rights Reserved.
More Free Resources: www.senturus.com
Copyright 2016 Senturus, Inc. All Rights Reserved.
Thank You!
www.senturus.com [email protected]
888 601 6010
Copyright 2016 by Senturus, Inc. This entire presentation is copyrighted and may not be reused or distributed without the written consent of Senturus, Inc.
Copyright 2016 Senturus, Inc. All Rights Reserved.