one slide overview: orcl big data integration and governance
DESCRIPTION
One Slide Overview: ORCL Big Data Integration and GovernanceTRANSCRIPT
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Staging
Oracle Big Data Integration and Lambda/Kappa Architecture
1Oracle Big Data Integration and Governance
Sqoop
HDFS
Hive
Flume
Cap
ture
Trai
l
Ro
ute
De
live
r
Pu
mp
Transformation
Model FirstAnalytics
• Oracle BIEE• SAS, Cognos / SPSS• Business Objects• Microstrategy
Data Streaming
Discovery Sandbox/s
Kafka (MPP Pub/Sub)
Storm and Trident
Spark Streaming
Data FirstAnalytics
• Oracle Endeca• Tableau• Cliq• Spotfire
In-Motion Analytics& Data Services
• Vertical specific• Internet of Things
/ Telematics• Data monetization
HBase
R
Oracle GoldenGate
Oracle Data Integrator
Oracle Data Governance
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Comprehensive Oracle Data Integration & Governance
Oracle Company Confidential 2
FastLoad
Speed Layer
Batch Layer
Oracle Data Integrator(Transform)
Oracle GoldenGate(Move)
Data Service Integrator(Federate)
DataGovernanceFoundation
Enterprise Data Quality(Profile & Cleanse)
Enterprise Metadata Management & Business Glossary(Business Glossary, Data Lineage, Impact Analysis and Data Provenance)
Veridata(Verify)
Data Enrichment(Prepare)
Real-Time Data Movement– Low impact capture, stage in Hadoop– Continuous data availability
Data Transformation– Bulk data movement– Pushdown data processing
Data Federation– Virtualized Data Services
Data Governance– Prepare unstructured data– Profile data with sampling– Clean data in real time or batch– Verify data for consistency– Trace lineage of all data– Define glossary of business terms
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
4 Business Patterns of Big Data Reservoir Success
Oracle Company Confidential 3
Sandbox
ETL Offload
Staging
Deep Data Storage
Data FirstAnalytics
Model FirstAnalytics
In-MotionAnalytics
DataServices
New Data Services:– Stakeholder: Line of Business (LoB)– Core Value: Monetizing data by reselling services– Innovation: Streaming platforms correlate and
analyze realtime data from devices and apps
Faster Analytics:– Stakeholders: Line of Business (LoB) with IT– Core Value: Faster access to business data, Faster
time to value on Analytics– Innovation: Schema-on-read empowers rapid
data staging and true Data Discovery
ETL Offload:– Stakeholder: Information Technology (IT)– Core Value: Cost avoidance on DW/Marts– Innovation: YARN/Hadoop empowers lower cost
compute and lower cost storage
Deep Data Storage:– Stakeholder: Risk / Compliance (LoB)– Core Value: High fidelity aged data– Innovation: SQL on Hadoop engines enable very
low cost, queryable data access
Streaming
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Oracle Approach to Big Data Integration is Superior
Oracle Company Confidential 4
Sandbox
ETL Offload
Staging
Deep Data Storage
Data FirstAnalytics
Model FirstAnalytics
In-MotionAnalytics
DataServices
Oracle GoldenGate:– Non-invasive data capture– Low-latency data movement– Full or partial records staging– Most proven integration tool worldwide
Oracle Data Integrator:– No ETL engine is required– Logical design is separate from physical– Deploys in Hadoop or off cluster– Many options for movement
Data Governance:– Data Preparation and Enrichment– Data Profiling and Cleansing– Data Verification– Metadata Management– Business Glossary
Streaming
Oracle Data Integrator(Transform)
Oracle GoldenGate(Move)
Data Service Integrator(Federate)
DataGovernanceFoundation
Enterprise Data Quality(Profile & Cleanse)
Enterprise Metadata Management & Business Glossary(Business Glossary, Data Lineage, Impact Analysis and Data Provenance)
Veridata(Verify)
Data Enrichment(Prepare)
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Oracle Does Big Data Better: Dynamic Data Movement
5Oracle Confidential, under Non-Disclosure
HDFS (Files)
HBase (NoSQL)
Hive / Hive Streaming (SQL)
Flume & Storm (Streaming)
Kafka (MPP Pub/Sub)
Spark Streaming (Machine Learning)
Capture Database Transactions and Deliver to Big Data in Real-Time
Cap
ture
Trai
l
Ro
ute
Del
iver
Pu
mp
GoldenGate
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Oracle Does Big Data Better: Invented Pushdown Processing
6
OR
CL
Inve
stm
ents
in E
LT/P
ush
do
wn
Tec
h
ScriptedSQL
StoredProcs
WarehouseBuilder
DataIntegrator
(Heterogeneous)
ODI forColumnar
DBs
ODI forIn-Memory
DBs
ODI forEngineered
Systems
ODI forHadoopNoSQL
ODI forHadoop
Pig & Oozie
ODI forSpark
ODI for …
1990’s
Eon of Scripts and PL-SQL Era of Native SQL Big Data Revolution
Oracle’s tool maturity and operational know-how for E-LT is unmatched 10x bigger footprint with E-LT than next closest competitor using “pushdown” Simple and easy way to blend Hadoop and SQL E-LT execution from one tool
ODI forHadoop
Hive
Oracle Confidential, under Non-Disclosure