big data discovery: unlock the potential in a big data reservoir

20

Upload: oracle-analytics

Post on 03-Jul-2015

2.492 views

Category:

Technology


2 download

DESCRIPTION

What if you could make data preparation 20 percent of your effort so you can focus 80 percent of your time on executing and improving your business? Come to this session to learn how you can easily use guided search across all Hadoop Distributed File System (HDFS) files with automated data enrichment; highlight which attributes are important, which data elements have statistical meaning, and which have quality issues; use visualization to identify multiple segments of data that matter most; and fix data quality problems and create new data elements—all this and more with big data discovery.

TRANSCRIPT

Page 1: Big Data Discovery: Unlock the Potential in a Big Data Reservoir
Page 2: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir

Gokula Mishra Premjith Balakrishnan Business Analytics Product Group September 29, 2014

Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |

Page 3: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Safe Harbor Statement

The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.

Oracle Confidential – Internal 3

Page 4: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Agenda

Oracle Confidential – Internal 4

Introduction to Big Data Discovery

Q&A

1

2

Page 5: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Agenda

Oracle Confidential – Internal 5

Introduction to Big Data Discovery

Q&A

1

2

Page 6: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Hadoop Data Reservoir Concept Gaining Momentum

Oracle Confidential – Internal 6

Data Warehouse Data Reservoir

Emerging Sources Existing Sources

Source: wikibon.org/wiki/v/Big_Data_Vendor_Revenue_and_Market_Forecast_2013-2017 Source: 451 Research – Total Data Warehousing: 2013-2018

Source: The Forrester WaveTM: Big Data Hadoop Solutions, Q1 2014

Page 7: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Not Easy to Get Analytic Value from Hadoop Data Reservoir

Oracle Confidential – Internal 7

? • Volume, Variety, Velocity = Complexity – Data not organized

– Complex, non-integrated tools

– Specialized skills required

• Impact: Lack of Analytic Agility – 80% effort spent on data

preparation vs. analytics

• Path to Production Unclear – Difficult to share with masses

– Hard to secure

– Lack of governance

• Impact: Poor Enterprise Adoption – Insights not widely leveraged

across the organization

Page 8: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Oracle Confidential – Internal 8

What if we could reverse: •80% - Data Preparation •20% - Analysis

The Big Data Opportunity

Page 9: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Requires a Fundamentally New Approach

Oracle Confidential – Internal 9

An intuitive, interactive and visual user interface

then share results for enterprise leverage

Data Warehouse

Business Intelligence

Advanced Analytics

Other Hadoop Tools

Explore

Transform Discover

Find

for anyone to quickly find, explore, transform and analyze data in Hadoop

Data Scientist

Business Analyst Business User

Increase Analytic Agility Maximize Enterprise Adoption

Page 10: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Oracle Confidential – Internal 10

Oracle Big Data Discovery. The Visual Face of Hadoop

Explore

Transform Discover

Find

Page 11: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

• Navigate a rich catalog of all data in the Hadoop cluster

• Familiar search and guided navigation for ease of use

• Access data set summaries, annotation and recommendations

• Provision your own data through self-service upload

• Browse personal big data projects and those shared by the community

Oracle Confidential – Internal 11

Easily Find Relevant Data Sets

Page 12: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

• Understand shape of the data. Visualize attributes by type

• Entropy based sorting by information potential

• View attribute statistics, data quality and outliers

• Use scratch pad to see statistical correlations between attribute combinations

• Evaluate whether a data set is worthy of further investment

Oracle Confidential – Internal 12

Explore the Data and Understand Potential

Page 13: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

• Intuitive user driven data wrangling

• Library of data transformations to replace values, convert types, collapse, reshape, pivot, group, custom tag, merge and much more

• Data enrichments for inferring location and language. Theme, entity and sentiment enrichments for text

• Preview results, undo, commit and replay transforms

• Run on sample data in memory or full data set in Hadoop

Oracle Confidential – Internal 13

Transform and Enrich Data to Make it Ready

Page 14: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

• Mash up different data sets for deeper perspectives

• Drag and drop from a rich library of interactive visualizations to compose discovery dashboards

• Filter through data with powerful search and intuitive guided navigation

• Publish blended data sets back to Hadoop

• Share projects, bookmarks and snapshots with team members for collaboration

Oracle Confidential – Internal 14

Analyze the Data to Discover New Insights

Page 15: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Share Results and Publish for Enterprise Leverage

Oracle Confidential – Internal 15

• Share and collaborate with the team

– Share projects, bookmarks and snapshots then collaborate and iterate

• Publish back to Hadoop

– Transforms and enrichments may be applied to original data sets in Hadoop

– Publish blended data sets back to HDFS

• Leverage results in other tools

– Publish data to Hadoop in format optimized for advanced analytic tools (e.g. ORAAH)

– Hadoop compliant BI tools (e.g. OBIFS) can burst out to the masses

– Leverage any native Hadoop tooling (e.g. Pig, Hive, Impala, Python, etc)

– Integrate BDD data sets with DWH to secure, govern and optimize for query performance (e.g. Oracle Big Data SQL)

Oracle Big Data Discovery plays well with the big data ecosystem

Explore

Transform Discover

Find

Share & Collaborate

raw data

transformed data

data reservoir

(HDFS)

Publish

data warehouse

business intelligence

advanced analytics

other hadoop tools

Leverage

Page 16: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Productize, Secure & Govern

Experiment, Prototype & Collaborate

Data Reservoir

Un

stru

ctu

red

D

ata

Data Warehouse

Oracle Database

Stru

ctu

red

Dat

a

Oracle Big Data Discovery

Oracle Big Data SQL

Hadoop (HDFS)

Oracle R for Hadoop

Oracle Advanced Analytics

Tables in Hadoop

Tables in DB

SQL join

In-Memory Appliance

Oracle BI Foundation Suite

Oracle SQL Queries

Exalytics

Exadata

BDA

Oracle’s Unified Big Data Management and Analytics Strategy

• Experiment, Prototype, Collaborate

– Quickly find, explore, transform, discover and share in BDD

– Publish results to HDFS

– Use to build predictive models with Oracle R for Hadoop

• Productize, Secure, Govern

– Connect published HDFS files to secure Oracle DB using Oracle Big Data SQL

– No data movement required

– Seamlessly extends existing DWH and BI investments with non-traditional data in Hadoop

• Available as Engineered Systems

Oracle Confidential – Internal

Page 17: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Oracle Confidential – Internal 17

Oracle Big Data Discovery. A Game Changing Platform

Benefits to the Business • Get Value Faster. Rapidly turn raw data into actionable

insights that can be leveraged across the enterprise

• Democratize Value from Big Data. Increase the size, diversify the skills, and improve the efficiency of Big Data project teams

Benefits to IT • Destroy Existing Technical Barriers. Run natively on

Hadoop cluster for maximum scalability and performance

• Share, Publish, Secure and Leverage. Integrate with Hadoop open standards and leverage the Oracle big data ecosystem

Page 18: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Agenda

Oracle Confidential – Internal 18

Introduction to Big Data Discovery

Q&A

1

2

Page 19: Big Data Discovery: Unlock the Potential in a Big Data Reservoir

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Page 20: Big Data Discovery: Unlock the Potential in a Big Data Reservoir