Data Blending, Caching and Optimizing
Alma Martin
#Logi16
ALMA MARTIN, Product Manager, Logi [email protected]
Here is an interesting fact about myself few people know.
ABOUT ME
2 @soulety
WHAT WE ARE GOING TO LEARN TODAY
► The Data Problem
► How Logi addresses the Data Problem
► Logi DataHub Overview
The DATA Problem
Preparing Data for Analytics is Hard
Data is often the biggest challenge of self-service analytics
The Data Problem in Self Service Analytics
Accessing Data (RDBMS, Applications, Files)
Data lives in different places.
Organizations outsource applications to run their business (e.g. CRM, Sales, Marketing)
Half of the organizations are accessing external data sources*
*MQ Survey for BI and Analytic Platforms
Acquiring Data (RDBMS, Applications, Files)
Transactional systems are often not ready for analysis.
Need to blend data across sources to get a 360° view of the business.
Managing Data (RDBMS, Applications, Files)
Data needs to be refreshed and up to date for reporting.
Accessing and reporting on data needs to be performant.
OUR SOLUTION: Logi DataHub
Connect and acquire data, including files, databases, and cloud applications
Create, prepare, and manage dataviews for self-service analysis
Speed data prep with smart profiling, joining, and data enrichment
Accelerate performance for large data sets with a self-tuning, easy to maintain columnar data store
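As a rough illustration of why a columnar layout helps analytical queries, the sketch below pivots row-oriented records into one list per column, so an aggregate reads only the column it needs. This is a generic toy example, not DataHub's storage engine; the `to_columns` helper and the sample data are hypothetical.

```python
# Toy row-oriented data (hypothetical sample, not from the slides).
rows = [
    {"region": "East", "sales": 100},
    {"region": "West", "sales": 250},
    {"region": "East", "sales": 50},
]


def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists.

    Assumes every row has the same keys, so all columns stay aligned.
    """
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# An aggregate over one column touches only that column's values.
total_sales = sum(columns["sales"])
```

A real columnar store adds compression and on-disk layout on top of this idea; the sketch only shows the access pattern.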
Create and Manage Dataviews
Connect (Data Connectors)
• Applications
• Databases
• Files
Author (Dataview Authoring)
• Joining objects
• Blending data sources
• Filter objects
Cache (Data Repository)
• Columnar store
• Self-tuning
• Scheduled refresh
Prepare (Data Enrichment)
• DataSmart profiling
• Calculated columns
• Multi-part text
… For Self-Service (Logi Integration)
• Element in Logi Studio
• Info, SSM, Discovery
• Columnar store for Vision
Primary DataHub Use Cases
1. Offload transactional systems that are not optimized for analysis: Ensure transactional system is not overloaded with analytical requests
2. Blend data from multiple sources: Combine data from databases, applications and files into a single dataview
3. Support application data sources not included with Logi Info: Extends self-service analysis (SSM) to application sources
4. Create and manage dataviews for self-service analytics: Self-managed data repository that does not require DBAs to administer
Offload transactional systems from analytical requests
Transactional Application: Data is optimized for transactions (inserts / updates)
Info Analytic Application: Data is optimized for reporting and analysis
Offload transactional systems from analytical requests
Franchise Management Software
Concerns about overloading the transactional system with self-service reporting.
Healthcare Solutions
Managed and self-service solutions that require isolation of the transactional system.
Blend data from DBs, Cloud Applications, and Files
Sources shown: Sales & Marketing, Files (OFX), Databases, Finance / ERP
In App Data Blending Solutions Are Limited
• Salesforce Connect
Connects Salesforce data to external sources ✓
Recommended for big (external) datasets
Follows security rules defined by the company
Generates reports and charts from blended data
External data can be used in formulas
Extended Support for Application Sources in Info
Info Supported Sources | DataHub Supported Sources
Self-managed data repository that does not require DBAs to administer
• No need to tune/index DB for self-service demands
• Minimal involvement from DBAs
• Faster deployment
Using Logi DataHub
Data Authoring in 5 Steps
1. Create a Source
2. Build your Dataview
3. Enrich your Dataview
4. Define a Data Refresh Schedule
5. Connect to Logi Info
1. Create a Source
Establish data connectivity
Applications, Databases, Files (OFX)
2. Build a Dataview
Define and cache an optimized table that blends data across sources
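Conceptually, blending across sources amounts to joining records from different systems on a shared key. The sketch below shows a left join in plain Python under stated assumptions: the `blend` function, the `account_id` key, and the sample CRM and ERP rows are all hypothetical and are not part of DataHub's interface.

```python
def blend(primary_rows, secondary_rows, key):
    """Left-join secondary_rows onto primary_rows using a shared key.

    Every primary row is kept. When a secondary row with the same key
    exists, its fields are merged in; primary values win on name clashes.
    """
    secondary_by_key = {row[key]: row for row in secondary_rows}
    blended = []
    for row in primary_rows:
        match = secondary_by_key.get(row[key], {})
        blended.append({**match, **row})
    return blended


# Hypothetical rows from a CRM application and a finance database.
crm_accounts = [
    {"account_id": 1, "name": "Acme"},
    {"account_id": 2, "name": "Globex"},
]
erp_totals = [
    {"account_id": 1, "invoiced": 1200.0},
]

dataview = blend(crm_accounts, erp_totals, "account_id")
```

Here the account without a matching finance record is kept with no `invoiced` field, which mirrors left-join behavior; an inner join would drop it instead.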
3. Enrich your Dataview
Create calculated columns, adjust column names and types, etc.
Example added columns: New Col 1, New Col 2, New Col 3
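A calculated column can be thought of as a formula evaluated once per row and stored under a new name. The following is a minimal sketch of that idea; `add_calculated_column`, the `line_total` column, and the order data are hypothetical illustrations and say nothing about how DataHub defines its formulas.

```python
def add_calculated_column(rows, name, formula):
    """Return new rows with an extra column computed by formula(row).

    The input rows are left unmodified.
    """
    return [{**row, name: formula(row)} for row in rows]


# Hypothetical order lines.
orders = [
    {"quantity": 3, "unit_price": 10.0},
    {"quantity": 1, "unit_price": 4.5},
]

enriched = add_calculated_column(
    orders,
    "line_total",
    lambda row: row["quantity"] * row["unit_price"],
)
```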
4. Schedule Data Cache Refresh
Full Replace or Incremental Append
[Figure: Source Data and Dataview tables keyed by ID, showing IDs 100 to 103 and new IDs 104 to 106]
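The two refresh modes can be sketched as follows, reusing the IDs 100 to 106 from the slide. This is an illustration under assumptions, not DataHub's actual refresh logic: the function names are hypothetical, and the incremental version assumes the `ID` key only ever increases in the source.

```python
def full_replace(source_rows):
    """Discard the cached copy and rebuild it from the source."""
    return [dict(row) for row in source_rows]


def incremental_append(cache_rows, source_rows, key="ID"):
    """Append only source rows whose key is greater than any cached key.

    Assumes keys increase monotonically, so updated or deleted source
    rows are NOT picked up by this mode.
    """
    last_seen = max((row[key] for row in cache_rows), default=None)
    new_rows = [
        dict(row)
        for row in source_rows
        if last_seen is None or row[key] > last_seen
    ]
    return cache_rows + new_rows


# Source now holds IDs 100..106; the cache was last refreshed at 103.
source = [{"ID": i} for i in range(100, 107)]
cache = [{"ID": i} for i in range(100, 104)]

refreshed = incremental_append(cache, source)
```

Incremental append moves less data per refresh, while full replace is the safer choice when existing source rows can change.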
5. Connect to Logi Info
Use Dataviews for Self Service reporting and custom Logi Apps
Interactive Dashboards & Reports Data Analysis Sharing Data Query Authoring Discovery
BRINGING IT ALL TOGETHER
Logi Analytics for Self-Service
Learn more with the Gartner 2016 Critical Capabilities Report for BI and Analytics Platforms