agile data management with a multi-model database · ecommerce /pricedynamically: slide: 17 31 may...

34
31 May 2018 © MARKLOGIC CORPORATION Justin Makeig Senior Director, Product Management @featurebacklog Agile Data Management with a Multi-Model Database

Upload: others

Post on 04-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

31 May 2018© MARKLOGIC CORPORATION

Justin MakeigSenior Director, Product Management

@featurebacklog

Agile Data Management with a Multi-Model Database

Page 2: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 2 31 May 2018© MARKLOGIC CORPORATION

Integrated data, Not data integration

Page 3: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 3 31 May 2018© MARKLOGIC CORPORATION

Alternative ApproachesSILO BUSTIN’

DATA WAREHOUSES POINT-TO-POINT DATA LAKE

360º view and operational MaybePartial view, not in real-time

NoNo single source of truth

NoNot easily accessible

Agility to handle changes NoRigid schemas, ETL

MaybeO(N2) complexity

YesVery little structure

Enterprise governance YesGranular controls

MaybePoor data lineage/auditing

NoNo granularity

Page 4: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 4 31 May 2018© MARKLOGIC CORPORATION

Heightened Urgency New competition: Non-traditional threats,

lower barriers to entry

More data: Everything is software, everything is measurable

Tougher rules: More and more comprehensive regulations

Page 5: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 5 31 May 2018© MARKLOGIC CORPORATION

DOWNSTREAM SYSTEMS

TRANSACTIONAL APPS

OPERATIONAL APPS

Operational Data HubAGILE DATA MANAGEMENT

Page 6: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 6 31 May 2018© MARKLOGIC CORPORATION

KEY PRINCIPLES

Enabling Agility API First

Expect and Embrace Change

Governed by Default

Deploy Anywhere

Page 7: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 7 31 May 2018© MARKLOGIC CORPORATION

DATA AGILITY

API First Minimize up-front work by focusing on

business value and working back to data

Reduce execution risk with aggressive scoping, frequently iterations, continuous feedback

Increase returns on cumulative data

Direct analogue and enabler for Agile software development

Page 8: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 8 31 May 2018© MARKLOGIC CORPORATION

Leading with Business ValueAPI FIRST

1. Define and Flesh Out API 2. Model and Govern

3. Load, Discover, Validate4. Map and Harmonize

1. Define and Assemble API

Page 9: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 9 31 May 2018© MARKLOGIC CORPORATION

Return on InvestmentLEADING WITH BUSINESS VALUE

Business Value

Level of Effort

High impactPredict fraud

Flag fraud

Pricing

Page 10: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 11 31 May 2018© MARKLOGIC CORPORATION

Run Code Close to the DataDATA LOGIC

C++ built-ins

JavaScript runtimeXQuery runtime V8

XQuery libraries JS libraries

User JavaScriptUser XQuery

Compiled engine

Scripting environment

HTTP Server External interface

Page 11: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 12 31 May 2018© MARKLOGIC CORPORATION

New sources Messy or unexpected data Ambiguous or conflicting

definitions

New regulations and enforcement Increased threats Sharing, not hoarding

New opportunities enabled by creative reuse Get value sooner Experiment with less risk, cost

Expect and Embrace Change

Quality and Meaning

Business Requirements

UPSTREAM DOWNSTREAM

Compliance and Governance

EVERYWHERE

Page 12: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 13 31 May 2018© MARKLOGIC CORPORATION

ERP

/priceDynamically

Customer

Collection:/acme/customers

Page 13: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 14 31 May 2018© MARKLOGIC CORPORATION

Hierarchical, sparse, high cardinality

Precise structure to free text

Change the data, change the schema

Standard JSON or XML, text, binary

Documents Represent Data More Naturally

DATA MODEL

Page 14: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 15 31 May 2018© MARKLOGIC CORPORATION

ERP

Customer

Collection:/acme/customers

Document Data Model Load as-is Universal indexing

Values Full text Structure Scalar ranges Geospatial

Schema on read Organize by collections,

directories

/priceDynamically

Page 15: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 16 31 May 2018© MARKLOGIC CORPORATION

Customer

CRM ERP eCommerce

/priceDynamically

Page 16: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION

Customer

prov:derivedFrom

prov:generatedBy

rdf:type

rdf:type

prov:wasRevisionOf

/priceDynamically

e7e9879a…

?

Page 17: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 18 31 May 2018© MARKLOGIC CORPORATION

Semantic RelationshipsGRAPHS

Entities === Documents

Relationships as triples

- Entities related to Entities

- Entities related to Facts

- Facts related to Facts

Infer new relationships

Customer

Order

Product

Purchased

Places

Includes

Page 18: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 19 31 May 2018© MARKLOGIC CORPORATION

Semantic RelationshipsGRAPHS

Entities === Documents

Relationships as triples

- Entities related to Entities

- Entities related to Facts

- Facts related to Facts

Derived fromType

Type

PII

SSN

Page 19: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 20 31 May 2018© MARKLOGIC CORPORATION

Semantic RelationshipsGRAPHS

Entities === Documents

Relationships as triples

- Entities related to Entities

- Entities related to Facts

- Facts related to Facts

73fa4dc0…

Is Concept

Same As

SKOS

Page 20: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 21 31 May 2018© MARKLOGIC CORPORATION

Customer

30d623ff…

,

Order Product

acme:includesacme:places

73fa4dc0…

isConcept

rdf:type

rdf:type

acme:purchased

e7e9879a…

acme:powerOfAttorney

0.

1.

rdf:type

prov:generatedBy

Page 21: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 22 31 May 2018© MARKLOGIC CORPORATION

Governed by DefaultPUTTING THE “MS” BACK IN DBMS

Manage policy along with the data and metadata that it governs

Query that policy just like data to make enforcement model-driven

Automatically enforce policy in the database

Track lineage as data and policy change

Page 22: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 23 31 May 2018© MARKLOGIC CORPORATION

Developer

Huh?

Domain Expert

Page 23: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 24 31 May 2018© MARKLOGIC CORPORATION

MODEL-DRIVEN

Data, Metadata, and Policy Model important business concepts as

needed (and not before)

Manage policy along with the data and the metadata it governs

Drive business processes and configuration from queryable policy definitions

Domain Expert

Page 24: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 25 31 May 2018© MARKLOGIC CORPORATION

Customer

e7e9879a…Secure by DesignGOVERNED BY DEFAULT

Confidentiality: Role-based access control and encryption at rest, in motion

Integrity: Transactional consistency and auditable trustworthiness

Availability: Elastic scale out and HA/DR

Page 25: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 26 31 May 2018© MARKLOGIC CORPORATION

Deploy AnywhereDeploy AnywhereAGILE INFRASTRUCTURE

Align infrastructure costs with SLAs using elastic scaling

Avoid lock-in with flexible cloud and on premise deployment

Reduce risk with automation and componentization

https://upload.wikimedia.org/wikipedia/commons/6/67/Inside_Suite.jpg

Page 26: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 27 31 May 2018© MARKLOGIC CORPORATION

Elastic Scaling and High AvailabilityDEPLOY ANYWHERE

DE/D

Page 27: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 28 31 May 2018© MARKLOGIC CORPORATION

Elastic Scaling and High AvailabilityDEPLOY ANYWHERE

E

D

https://images.pexels.com/photos/38136/pexels-photo-38136.jpeg

Page 28: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 29 31 May 2018© MARKLOGIC CORPORATION

Elastic Scaling and High AvailabilityDEPLOY ANYWHERE

E

D

E

Page 29: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 30 31 May 2018© MARKLOGIC CORPORATION

Elastic Scaling and High AvailabilityDEPLOY ANYWHERE

E

D

E

D

Page 30: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 31 31 May 2018© MARKLOGIC CORPORATION

Elastic Scaling and High AvailabilityDEPLOY ANYWHERE

E E E E

D D D DZo

ne 1

Zone

2

Page 31: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 32 31 May 2018© MARKLOGIC CORPORATION

MarkLogic in the CloudCLOUD NEUTRAL

Proven in the cloud

– Private, hybrid, or public cloud

– AWS, Azure, and Google Cloud

– Deployment automation

Page 32: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

You’ve always had the power, my dear. You just had to learn it for yourself.”

Glinda, Good Witch of the East

Page 33: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 34 31 May 2018© MARKLOGIC CORPORATION

KEY PRINCIPLES

Enabling Agility API First

Expect and Embrace Change

Governed by Default

Deploy Anywhere

https://patentimages.storage.googleapis.com/pages/US2415012-2.png

Page 34: Agile Data Management with a Multi-Model Database · eCommerce /priceDynamically: SLIDE: 17 31 May 2018© MARKLOGIC CORPORATION: ... Best Practices for World-Class Search: Tuesday,

SLIDE: 35 31 May 2018© MARKLOGIC CORPORATION

Technical breakouts for further exploration this week and beyondLearn More

Using MarkLogic Semantics for Data Integration Tuesday, 11:30 am

MarkLogic Data Hub Framework™ – A New Hope for Data Harmonization Tuesday, 3:00 pm

Deep Dive: How to Secure and Share Sensitive Information Tuesday, 2:00 pm

Best Practices for World-Class Search Tuesday, 3:00 pm

Master Data In Minutes With Smart Mastering Tuesday at 4:15

How to Govern Integrated Data (And Prove It) Wednesday, 11:00 am

MarkLogic as a Real-Time Data Hub Wednesday, 1:00 pm

MarkLogic and Microsoft Azure Accelerate Digital Transformation Wednesday, 1:00 pm

Cloud Neutral: Running MarkLogic on AWS or Azure Wednesday, 2:00 pm

Esri and MarkLogic: Location Analytics, Multi-Model Data Wednesday, 3:15 pm