smart data lakes: revolutionizing enterprise analytics
TRANSCRIPT
©2016 Cambridge Semantics Inc. All rights reserved.
Smart Data Lakes®: Revolutionizing Enterprise Analytics
Marty LoughlinVice President
Cambridge Semantics, Inc.
Strata+Hadoop World September 2016
©2016 Cambridge Semantics Inc. All rights reserved.
Any Questions?
©2016 Cambridge Semantics Inc. All rights reserved.
Business Questions
Which traders traded Tesla in their personal account in the 24
hours before a news story broke?
What is our exposure to Lehman Brothers?
Who is the best investigator for a phase II trial of an injectable liver
cancer drug?
©2016 Cambridge Semantics Inc. All rights reserved.
©2016 Cambridge Semantics Inc. All rights reserved.5
©2016 Cambridge Semantics Inc. All rights reserved.
©2016 Cambridge Semantics Inc. All rights reserved.
©2016 Cambridge Semantics Inc. All rights reserved.
©2016 Cambridge Semantics Inc. All rights reserved.
Linking and Contextualizing Information
On Tuesday, Drugs123 Inc. announced phase 1 development of their newest sleep aid therapeutic, Narcoleptol.
On Tuesday, Drugs123 Inc. announced phase 1 development of their newest sleep aid therapeutic, Narcoleptol.
Company Website Mkt Cap
Bio Corp biocorp.com $2.2B
Drugs123 drugs123.com $930M
… … …
Competitive Intelligence database
Company
Drugs123
930,000,000
name
marketcap
drugs123.com
website
Web news
Drug Development
1
developmentstage
activityDrug
developing
Insomnia
indication
Narcoleptol
brandname
CRM System
Note
about
3/7/2012
Initial safety signals are …
when
note
©2016 Cambridge Semantics Inc. All rights reserved.
Cambridge Semantics(Illustrative Pharma Company Use Case))
©2016 Cambridge Semantics Inc. All rights reserved.
Anzo Smart Data Lake® 4.0Unified Data Lake Offering
Data Landscape
Smart Data Discovery
Enterprise Data Lake
Smart Data Discovery
©2016 Cambridge Semantics Inc. All rights reserved.
What Data Makes Sense in a Smart Data Lake?
Data Sets
Data Sources
Few
Many
Small Large
Simple Data Big Data
Diverse Data Complex Data Smart Data Lakes unrivaled value• Multiple sources
• Many entity types & relationships• Structured and unstructured
Limited data sources with small data sets – historically the bulk of enterprise data harmonization efforts
Large data sets but which originated from a limited number of data sources (i.e. a few tables)
• Large data sets that• Multiple, disparate structured &
unstructured sources
As Data Sources and Data Sets continue to grow, the need and value of Smart Data Lakes increases
©2016 Cambridge Semantics Inc. All rights reserved.
Watch the video of this presentation