One Slide Overview: ORCL Big Data Integration and Governance


Upload: jeffrey-t-pollock

Post on 08-Jul-2015



TRANSCRIPT

Page 1: One Slide Overview: ORCL Big Data Integration and Governance

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

Staging

Oracle Big Data Integration and Lambda/Kappa Architecture

Oracle Big Data Integration and Governance

Sqoop

HDFS

Hive

Flume

Capture

Trail

Route

Deliver

Pump

Transformation

Model First Analytics

• Oracle BIEE
• SAS, Cognos / SPSS
• Business Objects
• MicroStrategy

Data Streaming

Discovery Sandbox/s

Kafka (MPP Pub/Sub)

Storm and Trident

Spark Streaming

Data First Analytics

• Oracle Endeca
• Tableau
• Qlik
• Spotfire

In-Motion Analytics & Data Services

• Vertical specific
• Internet of Things / Telematics
• Data monetization

HBase

R

Oracle GoldenGate

Oracle Data Integrator

Oracle Data Governance

Page 2: One Slide Overview: ORCL Big Data Integration and Governance


Comprehensive Oracle Data Integration & Governance

Oracle Company Confidential

FastLoad

Speed Layer

Batch Layer

Oracle Data Integrator (Transform)

Oracle GoldenGate (Move)

Data Service Integrator (Federate)

Data Governance Foundation

Enterprise Data Quality (Profile & Cleanse)

Enterprise Metadata Management & Business Glossary (Business Glossary, Data Lineage, Impact Analysis and Data Provenance)

Veridata (Verify)

Data Enrichment (Prepare)

Real-Time Data Movement
– Low impact capture, stage in Hadoop
– Continuous data availability

Data Transformation
– Bulk data movement
– Pushdown data processing

Data Federation
– Virtualized Data Services

Data Governance
– Prepare unstructured data
– Profile data with sampling
– Clean data in real time or batch
– Verify data for consistency
– Trace lineage of all data
– Define glossary of business terms
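The "Verify data for consistency" item can be illustrated with a small sketch. This is not how Veridata works internally and it uses no Oracle API; it only shows the general idea of comparing a source and a target by key and by a per-row digest. The function names (`row_digest`, `compare_tables`) and the sample rows are hypothetical.

```python
import hashlib


def row_digest(row):
    """Return a SHA-256 digest of a row's values (order-sensitive)."""
    return hashlib.sha256(repr(tuple(row)).encode("utf-8")).hexdigest()


def compare_tables(source, target):
    """Compare two {key: row_tuple} mappings and report out-of-sync keys."""
    src = {key: row_digest(row) for key, row in source.items()}
    tgt = {key: row_digest(row) for key, row in target.items()}
    shared = src.keys() & tgt.keys()
    return {
        "missing_in_target": sorted(src.keys() - tgt.keys()),
        "extra_in_target": sorted(tgt.keys() - src.keys()),
        "mismatched": sorted(key for key in shared if src[key] != tgt[key]),
    }


# Hypothetical sample data: key -> row values.
source_rows = {1: ("alice", 100), 2: ("bob", 250), 3: ("carol", 75)}
target_rows = {1: ("alice", 100), 2: ("bob", 999), 4: ("dave", 10)}

report = compare_tables(source_rows, target_rows)
print(report)
```

Comparing digests rather than full rows keeps the amount of data exchanged between the two sides small, which is one common reason to verify this way; a production tool would also have to handle rows that change while the comparison is running.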

Page 3: One Slide Overview: ORCL Big Data Integration and Governance


4 Business Patterns of Big Data Reservoir Success


Sandbox

ETL Offload

Staging

Deep Data Storage

Data First Analytics

Model First Analytics

In-Motion Analytics

Data Services

New Data Services:
– Stakeholder: Line of Business (LoB)
– Core Value: Monetizing data by reselling services
– Innovation: Streaming platforms correlate and analyze real-time data from devices and apps

Faster Analytics:
– Stakeholders: Line of Business (LoB) with IT
– Core Value: Faster access to business data, Faster time to value on Analytics
– Innovation: Schema-on-read empowers rapid data staging and true Data Discovery

ETL Offload:
– Stakeholder: Information Technology (IT)
– Core Value: Cost avoidance on DW/Marts
– Innovation: YARN/Hadoop empowers lower cost compute and lower cost storage

Deep Data Storage:
– Stakeholder: Risk / Compliance (LoB)
– Core Value: High fidelity aged data
– Innovation: SQL on Hadoop engines enable very low cost, queryable data access

Streaming
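The "schema-on-read" idea named under Faster Analytics can be sketched in a few lines: raw records are staged untouched, and each consumer applies its own schema only when reading. The sketch below is a plain-Python illustration, not Hive or any Oracle product; the record fields, the `read_with_schema` helper, and the sample lines are all hypothetical.

```python
import json

# Raw data is staged exactly as it arrived; no schema is enforced on write.
RAW_LINES = [
    '{"device": "t-100", "speed_kmh": "82", "ts": "2014-06-01T10:00:00"}',
    '{"device": "t-200", "ts": "2014-06-01T10:00:05", "fuel_pct": 41}',
    "not valid json",
]


def read_with_schema(lines, schema):
    """Project raw JSON lines onto `schema` ({field: caster}) at read time."""
    for line in lines:
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # unreadable lines stay in storage but are skipped on read
        if not isinstance(record, dict):
            continue
        row = {}
        for field, cast in schema.items():
            value = record.get(field)
            row[field] = cast(value) if value is not None else None
        yield row


# Two consumers read the same raw data through different schemas.
speed_view = list(read_with_schema(RAW_LINES, {"device": str, "speed_kmh": float}))
fuel_view = list(read_with_schema(RAW_LINES, {"device": str, "fuel_pct": int}))

print(speed_view)
print(fuel_view)
```

Because nothing is rejected at load time, staging is fast and new questions can be asked of old data; the trade-off is that type errors and missing fields surface when the data is read rather than when it is written.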

Page 4: One Slide Overview: ORCL Big Data Integration and Governance


Oracle Approach to Big Data Integration is Superior


Sandbox

ETL Offload

Staging

Deep Data Storage

Data First Analytics

Model First Analytics

In-Motion Analytics

Data Services

Oracle GoldenGate:
– Non-invasive data capture
– Low-latency data movement
– Full or partial records staging
– Most proven integration tool worldwide

Oracle Data Integrator:
– No ETL engine is required
– Logical design is separate from physical
– Deploys in Hadoop or off cluster
– Many options for movement

Data Governance:
– Data Preparation and Enrichment
– Data Profiling and Cleansing
– Data Verification
– Metadata Management
– Business Glossary

Streaming

Oracle Data Integrator (Transform)

Oracle GoldenGate (Move)

Data Service Integrator (Federate)

Data Governance Foundation

Enterprise Data Quality (Profile & Cleanse)

Enterprise Metadata Management & Business Glossary (Business Glossary, Data Lineage, Impact Analysis and Data Provenance)

Veridata (Verify)

Data Enrichment (Prepare)

Page 5: One Slide Overview: ORCL Big Data Integration and Governance


Oracle Does Big Data Better: Dynamic Data Movement

Oracle Confidential, under Non-Disclosure

HDFS (Files)

HBase (NoSQL)

Hive / Hive Streaming (SQL)

Flume & Storm (Streaming)

Kafka (MPP Pub/Sub)

Spark Streaming (Machine Learning)

Capture Database Transactions and Deliver to Big Data in Real-Time

Capture

Trail

Route

Deliver

Pump

GoldenGate
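The stages named on this slide (Capture, Trail, Pump, Deliver) can be pictured as a small change-data pipeline. The following is a conceptual sketch only: it does not use or reproduce any GoldenGate API, the function names and record layout are hypothetical, and the Route stage is folded into `pump` for brevity.

```python
import json


def capture(transaction_log):
    """Capture: turn committed source changes into serialized trail records."""
    for seq, (op, key, value) in enumerate(transaction_log, start=1):
        yield json.dumps({"seq": seq, "op": op, "key": key, "value": value})


def pump(trail, start_after=0):
    """Pump: read the trail past a checkpoint and forward records in order."""
    for line in trail:
        record = json.loads(line)
        if record["seq"] > start_after:
            yield record


def deliver(records, target, checkpoint=0):
    """Deliver: apply each change to the target; return the new checkpoint."""
    for record in records:
        if record["op"] == "delete":
            target.pop(record["key"], None)
        else:  # insert and update are both applied as an upsert
            target[record["key"]] = record["value"]
        checkpoint = record["seq"]
    return checkpoint


# Hypothetical source transactions: (operation, key, value).
log = [
    ("insert", "acct-1", 100),
    ("insert", "acct-2", 50),
    ("update", "acct-1", 120),
    ("delete", "acct-2", None),
]

trail = list(capture(log))  # the trail decouples capture from delivery
target = {}
checkpoint = deliver(pump(trail), target)
print(target, checkpoint)
```

Persisting changes to a trail before delivery is what lets the two ends run at different speeds: a delivery process that restarts can resume from its checkpoint (`pump(trail, start_after=checkpoint)`) instead of re-reading the source.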

Page 6: One Slide Overview: ORCL Big Data Integration and Governance


Oracle Does Big Data Better: Invented Pushdown Processing

ORCL Investments in ELT/Pushdown Tech

Scripted SQL

Stored Procs

Warehouse Builder

Data Integrator (Heterogeneous)

ODI for Columnar DBs

ODI for In-Memory DBs

ODI for Engineered Systems

ODI for Hadoop NoSQL

ODI for Hadoop Pig & Oozie

ODI for Spark

ODI for …

1990’s

Eon of Scripts and PL-SQL

Era of Native SQL

Big Data Revolution

• Oracle’s tool maturity and operational know-how for E-LT is unmatched
• 10x bigger footprint with E-LT than next closest competitor using “pushdown”
• Simple and easy way to blend Hadoop and SQL E-LT execution from one tool

ODI for Hadoop Hive
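The "pushdown" (E-LT) approach this slide refers to means loading raw data into the target platform first and then letting that platform's own engine run the transformation, rather than pulling rows through a separate ETL server. The sketch below illustrates the pattern with Python's built-in `sqlite3` module standing in for the target engine; it is not ODI-generated code, and the table and column names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for the target database engine

# Extract + Load: raw rows land in a staging table without transformation.
conn.executescript(
    """
    CREATE TABLE stg_orders (order_id INTEGER, region TEXT, amount REAL);
    CREATE TABLE fact_sales_by_region (
        region TEXT PRIMARY KEY, total REAL, orders INTEGER
    );
    """
)
conn.executemany(
    "INSERT INTO stg_orders VALUES (?, ?, ?)",
    [(1, "EMEA", 120.0), (2, "EMEA", 80.0), (3, "APAC", 200.0), (4, "APAC", None)],
)

# Transform: one set-based statement is pushed down to the engine, so no
# rows are pulled back into the client to be filtered or aggregated.
conn.execute(
    """
    INSERT INTO fact_sales_by_region (region, total, orders)
    SELECT region, SUM(amount), COUNT(*)
    FROM stg_orders
    WHERE amount IS NOT NULL
    GROUP BY region
    """
)
conn.commit()

result = conn.execute(
    "SELECT region, total, orders FROM fact_sales_by_region ORDER BY region"
).fetchall()
print(result)
```

The client here only issues statements; the filtering and aggregation happen where the data already lives. The same division of labor applies when the engine is Hive or another SQL-on-Hadoop system, although the SQL dialect and loading mechanics differ.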
