Automating Data Governance and Stewardship To Build Data Trust Pieter De Leenheer, PhD Founder & VP, Research and Education June 2016

Upload: pieter-de-leenheer

Post on 15-Apr-2017


Page 1: Automating Data Governance and Stewardship to Build Data Trust

Automating Data Governance and Stewardship

To Build Data Trust

Pieter De Leenheer, PhD

Founder & VP, Research and Education

June 2016

Page 2: Automating Data Governance and Stewardship to Build Data Trust

2©Collibra 2016 Collibra Data Citizens Conference | #collibracitizens

Misconceptions of Data Governance

• A published repository of common definitions

• Concern of - hence managed by - IT

• Just Data quality management or MDM

• Siloed Islands

• No ownership, no process hence no trust in data

• Lack of data citizen participation

Who approved this?

I wish these guys spoke our language

I can’t understand this report!

I’ve never seen this product code! Who introduced this?

Are we sure this definition of ‘customer’ is correct?

The Problem

This data quality rule is differently implemented in our department!

Are we allowed to share this customer data with analysts?

Page 3: Automating Data Governance and Stewardship to Build Data Trust

• Commonalities and differences in definitions for reports, terms, policies, etc.

• Business Traceability

• Business Data Lineage

• Technical Data Lineage

Understand & Explain

Data Governance is a holistic lens on your ever-expanding data universe

• Onboarding and approval of CDEs

• Report Certification and Watermarking

• Helpdesk and Issue Management

• Data Access and Usage Agreements

• …

Monitor & Predict

Through a Data Collaboration Platform

Page 4: Automating Data Governance and Stewardship to Build Data Trust

Customers in Higher Education

Page 5: Automating Data Governance and Stewardship to Build Data Trust

Data Governance Framework

• Three Tiers
– DG Operating Model
– Stewardship Applications
– Integrations
• 1 single platform
• N steward applications
• Education and Certification

university.collibra.com

https://compass.collibra.com/display/COOK/Collibra+Body+of+Knowledge

Page 6: Automating Data Governance and Stewardship to Build Data Trust

Data Governance Platform Demo

Page 7: Automating Data Governance and Stewardship to Build Data Trust

Search and Filter Reports

Page 8: Automating Data Governance and Stewardship to Build Data Trust

Report Definition, Attributes and Relations

Page 9: Automating Data Governance and Stewardship to Build Data Trust

Report Ownership

Page 10: Automating Data Governance and Stewardship to Build Data Trust

Traceability vs Lineage

Page 11: Automating Data Governance and Stewardship to Build Data Trust

Workflows, Statuses and Roles
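
The idea of workflows that move an asset between statuses, with each step reserved for certain roles, can be sketched as a small state machine. This is only an illustration under stated assumptions: the status names, role names, the `TRANSITIONS` table and the `move` helper are hypothetical and are not taken from the Collibra platform or from the slide.

```python
from dataclasses import dataclass, field

# Hypothetical statuses and roles, chosen only for illustration.
# Each allowed transition maps to the set of roles that may perform it.
TRANSITIONS = {
    ("Candidate", "Under Review"): {"Steward"},
    ("Under Review", "Approved"): {"Owner"},
    ("Under Review", "Rejected"): {"Owner"},
    ("Rejected", "Candidate"): {"Steward"},
}


@dataclass
class Asset:
    name: str
    status: str = "Candidate"
    history: list = field(default_factory=list)


def move(asset, new_status, user, role):
    """Apply a status change if the workflow allows it for this role."""
    allowed_roles = TRANSITIONS.get((asset.status, new_status))
    if allowed_roles is None:
        raise ValueError(f"no transition {asset.status} -> {new_status}")
    if role not in allowed_roles:
        raise PermissionError(f"role {role} may not perform this transition")
    # Keep an audit trail so "Who approved this?" can be answered later.
    asset.history.append((asset.status, new_status, user))
    asset.status = new_status
    return asset


report = Asset("Enrollment Report")
move(report, "Under Review", user="ann", role="Steward")
move(report, "Approved", user="bob", role="Owner")
```

Keeping the transitions in a table rather than in branching code makes the workflow itself something a data steward can read and review.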

Page 12: Automating Data Governance and Stewardship to Build Data Trust

The Rise of the CDO, Business Data Authority

Data governance & stewardship provide the right level of control and trust in data

Data Consumers (Business)
LEADERSHIP: CEO, CFO, VP, Marketing
ROLES: Data Scientist, Business Analyst
TECHNOLOGY: Visualization, Self-service BI
NEED: Data Authority

Data Infrastructure (IT)
LEADERSHIP: CIO
ROLES: Information Manager, Data Architect, Data Modeler
TECHNOLOGY: Hadoop, Databases, Data Integration

Data Authority
LEADERSHIP: Chief Data Officer
ROLES: Data Governance Manager, Data Steward
TECHNOLOGY: Data Stewardship Platform

Page 13: Automating Data Governance and Stewardship to Build Data Trust

• Collaboration: inwards / outwards

• Data Space: traditional data / big data

• Value Impact: service / strategy

• MIT Sloan & Collibra: http://www.iscdo.org/

Full Text: http://www.mitcdoiq.org/wp-content/uploads/2014/01/Lee-et-al.-A-Cubic-Framework-for-the-CDO-MISQE-Forthcoming-2014-copy.pdf

CDO Roles

Page 14: Automating Data Governance and Stewardship to Build Data Trust

Stanford University Data Stewardship (SUDS)

• All materials available here: dg.stanford.edu
• Establish foundation for Institutional Research
• Data Quality
– How many faculty do we have?
• Context and Meaning
– What does faculty mean in which context?
– How is faculty data structured and where is it stored?
• Data Usage Request
– Am I allowed to use faculty or student name and age for external reporting?

Page 15: Automating Data Governance and Stewardship to Build Data Trust

SUDS: Approach

• Decentralized
– 1 DG coordinator (also show vacancy)
– Project staff
– Cross-functional working groups: natural scope and resources
– Focus on BI reporting, with input from above projects
– Sign-off by DG coordinator and end user through usage (full cycle)
• Step by step; success by success

Page 16: Automating Data Governance and Stewardship to Build Data Trust

SUDS: First Success in OBIEE reporting

REST / JSON / CSV / Excel

Page 17: Automating Data Governance and Stewardship to Build Data Trust

What attribute- and relation-types do we want to capture?

• https://stanford.app.box.com/CollibraQuickReference
• https://stanford.box.com/UsingCollibraFields

Page 18: Automating Data Governance and Stewardship to Build Data Trust

How to execute and monitor? From Best Practice to Auto-Validation Rules

http://web.stanford.edu/dept/pres-provost/cgi-bin/dg/wordpress/?p=577

(generic example – not from SUDS)
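
Turning a written best practice into an auto-validation rule can be sketched as a list of small checks that run against each asset and report what is missing. The sketch below is generic and hedged: the asset fields (`name`, `definition`, `owner`), the rule wording and the `validate` helper are assumptions made for illustration, not the rules used at Stanford or built into any product.

```python
def has_definition(asset):
    # Best practice: every business term carries a non-empty definition.
    return bool((asset.get("definition") or "").strip())


def has_owner(asset):
    # Best practice: every asset has someone accountable for it.
    return bool(asset.get("owner"))


def definition_not_circular(asset):
    # A definition that merely repeats the term name explains nothing.
    name = (asset.get("name") or "").strip().lower()
    definition = (asset.get("definition") or "").strip().lower()
    return definition != name


# Hypothetical rule set: (message shown on failure, check function).
RULES = [
    ("definition is missing", has_definition),
    ("owner is missing", has_owner),
    ("definition only repeats the term name", definition_not_circular),
]


def validate(asset):
    """Return the messages of all rules the asset fails."""
    return [message for message, rule in RULES if not rule(asset)]


term = {"name": "Faculty", "definition": "Faculty", "owner": None}
problems = validate(term)
```

Running such checks on every save, rather than during a periodic review, is what moves a guideline from something stewards are asked to follow to something that is monitored automatically.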

Page 19: Automating Data Governance and Stewardship to Build Data Trust

George Washington University

• GW is the largest institution of higher education in the District of Columbia.
• More than 20,000 students study a rich range of disciplines: from forensic science and creative writing to international affairs and computer engineering, as well as medicine, public health, law and public policy.
• The university is currently ranked in the top 100 universities in the country.

Page 20: Automating Data Governance and Stewardship to Build Data Trust

GWU Data Governance & Stewardship Vision

People + Process / Policy + Technology = Data Governance Center

Ensuring the highest quality data is delivered throughout the university providing valuable information serving individual and organizational needs

Data governance at GW focuses on improving data quality, protecting access to data, establishing business definitions, maintaining metadata, documenting data policies and setting the foundation for analytics and reporting.

• Policy – The What
• Process – The How

Page 21: Automating Data Governance and Stewardship to Build Data Trust

Everyone has a seat at the table

Academics | Advancement | Finance | Research | Human Resources | Services & Resources

The Data Governance Committee meets once a month to review data quality issues, discuss proposed business terms, review policies and discuss other institutional data-related topics. This committee is composed of functional data stewards from across all functions and departments of the university.

GWU Data Governance Vision

Page 22: Automating Data Governance and Stewardship to Build Data Trust

GWU - Technology – The game changer

Technology is helping GWU to achieve their vision of commonly understood, consistent, trusted and high-quality data throughout GW.

• Making data transparent
– Serves as single source of truth of all our data governance and stewardship activities
– Makes business terms visible and searchable by all
– Commonly agreed-upon business terms and data assets
– Provides traceability between business and technical assets, policies and rules
• Data Quality
– Allows us to assess the integrity of data and resolve Data Quality issues.
• Analytics and Reporting
– Enables portfolios to define reports and visualizations
– Provides workflow to share data
– Provides workflow to certify reports and visualizations
• Bonus - Provides metrics and KPIs to track progress and maturity
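
As a rough illustration of what metrics and KPIs for tracking progress could look like, the sketch below computes simple coverage ratios over a list of assets. The asset records, the field names (`owner`, `status`) and the two indicators are hypothetical examples, not GWU's actual metrics or data.

```python
def coverage(assets, predicate):
    """Share of assets (0.0 to 1.0) for which the predicate holds."""
    if not assets:
        return 0.0
    return sum(1 for asset in assets if predicate(asset)) / len(assets)


# Hypothetical inventory of governed assets.
assets = [
    {"name": "Enrollment Report", "owner": "registrar", "status": "Approved"},
    {"name": "Faculty Headcount", "owner": "hr", "status": "Approved"},
    {"name": "Gift Total", "owner": "advancement", "status": "Candidate"},
    {"name": "Grant Spend", "owner": None, "status": "Candidate"},
]

# Two example indicators: ownership coverage and certification progress.
owned = coverage(assets, lambda a: bool(a["owner"]))
approved = coverage(assets, lambda a: a["status"] == "Approved")
```

Tracked over time, ratios like these give the committee a simple view of whether stewardship coverage is growing.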

Page 23: Automating Data Governance and Stewardship to Build Data Trust

How can we make YOU visible on this thriving new competency market?

Collibra university, COMPASS and OUR certification program

Page 24: Automating Data Governance and Stewardship to Build Data Trust

Introducing Collibra University

• Free Guided Self Learning
• Delivers the knowledge you need to become a high value data governance professional
• Best place to learn Collibra’s thought-leading technology and how to apply it to implement data governance
• Choose your own level: Steward, Community Manager, Developer, Ranger, etc.
• Sample courses: <<ADD LINKS>>
– Report Cert., DHD, Good Definitions

https://university.collibra.com/shared/start/key:ZLBDNHRK

Page 25: Automating Data Governance and Stewardship to Build Data Trust

Introducing the Collibra Certification

• https://compass.collibra.com/display/COOK/Collibra+University+Certifications
• Respond to the Challenge
• Objective standard
– for skills and competence in practical data governance applying and integrating Collibra technologies
– against which to measure quality of implementations

Prove your value today : [email protected]

[Sample certificate: COLLIBRA RANGER (Collibra University Certified Ranger). "This is to certify that Jane Smith has been formally evaluated for demonstrated experience, knowledge and performance of the Collibra Data Governance Software and is hereby bestowed the global credential." Certificate number: CUR.2015.007. Original grant date: August 21st, 2015. "In testimony whereof, we have subscribed our signatures": Ram Naresh Pratti, Director of Professional Services; Dr. Pieter De Leenheer, Co-founder & Collibra University Dean. Tagline: DATA DRIVEN . BUSINESS . DRIVEN DATA. http://university.collibra.com]

Collibra NV, Oorlogskruisenlaan 116, 1120 Brussels, Belgium

Collibra Inc, 25 Broadway, NY, 10004 New York, United States

Page 26: Automating Data Governance and Stewardship to Build Data Trust

Customer Success supported by communities

Self-paced learning platform for all customers and partners. Ranger certifications awarded upon completion.

university.collibra.com

Knowledge repository with BOK, documentation, questions and answers, use-cases, integrations, and more.

compass.collibra.com

CUSTOMER COMPASS

Page 27: Automating Data Governance and Stewardship to Build Data Trust

Thank You

Page 28: Automating Data Governance and Stewardship to Build Data Trust