reimagine data governance with azure purview

Post on 08-Feb-2022

6 Views

Category:

Documents

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Unified | Hybrid | Open

Reimagine data governance with Azure Purview

FranckMercier

Data governance is becoming increasingly

interdisciplinary

What data do I have?

Where did the data originate?

Can I trust it?

DISCOVERY

What’s my exposure to risk?

Is my usage compliant?

How do I control access & use?

What is required by regulation X?

COMPLIANCE

Data-related users

Security officers

Elements of successful data governance

Manage growing

data landscape

Overcome

operational silos

Increase

data agility

Comply with

industry regulations

• Fully managed, serverless, PaaS service

• Automate discovery of data in on-premises,

multicloud and SaaS sources

• Classify data at scale to specify sensitivity,

compliance, industry, business and company-

specific value

• Know where data came from and what was derived

from it with data lineage

• Deliver a curated and consistent glossary of

business terms and definitions

Reimagine data governance in the cloud

Azure PurviewUNIFIED DATA GOVERNANCE

Data Map

• Automate and manage metadata at scale

Data Catalog

• Enable effortless discovery for data

consumers

Data Insights

• Assess data usage across your

organization

Data MapMulticloudOn-prem

Data Insights

Azure Purview

Data Catalog

SaaS

“Data Map” = Data Assets | Lineage | Classifications

On-prem & Multicloud Operational, Analytical, SaaS

Open APIsAutomated Scanning & Classification

Azure Purview Power BI

SQL Server on-prem

Azure Synapse

Azure Data Services

M365 Compliance Center

(Apache Atlas 2.0)

Data Catalog Data Insights

Search LineageBusiness Glossary Data use reports

Azure Purview: Unified Data Governance

Publish, Discover & Curate Data

Unified Experience

Unified Platform

Azure Purview Features at Public Preview

Azure Purview Platform

Azure Purview Studio

Azure Purview Catalog (C1)

Automated Scanning & Classification

• Dedicated per customer on shared infra• Provisioned default capacity with option to add-on capacity

Data Map

• Serverless, pay per use • Includes connectors, scanning of sources, processing into data assets, lineage capture, classification

• Search, browse, asset details • Automated meta-data and lineage extraction• Automated classification based on content inspection

• Private Endpoint • Management center

On-prem & Multi-cloud* Operational, Analytical, SaaS*

Azure Purview Data Insights (D1)

* Power BI, SQL Sever on-prem, Azure Data Services including Synapse, Cosmos DB & Storage, Non-Microsoft systems including SAP ECC, SAP S4 HANA & Teradata, Multi-cloud systems including AWS S3

• Business Glossary templates• Lineage visualization & workflows

Azure Purview Catalog included with Platform (C0)

• Catalog Insights (Asset, Scan, Glossary)• Sensitive Information Types & Labeling insights

Data Producers &

Consumers

Data Officers &

Security Officers Power BI

SQL Server on-prem

Azure Synapse

Azure Data Services

M365 Compliance Center

Open APIs

(Apache Atlas 2.0)

The home page quick access buttons (tiles) depend on the role assigned to the user.

•For data curator, the buttons are Knowledge Center, Browse Assets, Manage Glossary and View Insights.•For data reader, the featured buttons are Knowledge Center, Browse Assets, View Glossary, and View Insights.•For data source administrator + data curator, the featured buttons are Knowledge Center, Register Data Sources, Browse Assets, and Manage Glossary.•For data source administrator + data reader, the featured buttons are Knowledge Center, Register Data Sources, Browse Assets, and View Glossary.•For data source administrator, no access to Purview Studio.

*Note: Only Owners and User Access Administrators can assign roles for Purview Studio in Azure portal.

Home Page ActivitiesHome page key activities based on assigned user roles and selected purview tiers (C0, C1 & D1)

Power BI Integration

• Native out-of-the box connector

Power BI Integration

• Quickly find Power BI assets:

• Workspaces

• Reports

• Datasets

• Dashboards

Power BI Integration

Power BI Integration

• Inherit Microsoft Information Protection (MIP)

labels after Azure Purview Scanning

• Report and Goals created on top of labeled

dataset inherit label

• Announcing Power BI inheritance of MIP

labels from Azure Synapse Analytics

(Public Preview) | Microsoft Power BI Blog

| Microsoft Power BI

Classified as Microsoft Confidential

DEMO

Azure PurviewStudio(Unified Experience)

Purview StudioA single, centralized place that provides unified experience for data producers, data consumers, data & security officers

Sources

Map your data to manage an enriched metadata map of operational and transactional data no matter where it lives

Benefits

• Automated scanning of on-prem, multicloud, SaaS data

• Discover Azure data sources, PowerBI, SQL better. Leverage turnkey integrations with Power BI, SQL (on-prem, azure, MI) and key Azure Data Services such as Azure Synapse, Cosmos DB, ADLS.

• Manage metadata and scale understanding of data with automated, fully managed, serverless metadata management capability

• Leverage Apache Atlas Open APIs to programmatically publish metadata and lineage from a wide range open-source data systems

Sources

Automated scanning & classification of on-prem, multicloud and SaaS data

Sources

Automated scanning & classification of on-prem, multicloud and SaaS data

Scan Sources

Select files types for scanning. Define custom file types

Custom File Type

Scan Sources

100+ built in classifiers, define your own custom classifiers

100 + out of box classifiers

Custom Classifiers

Scan Sources

Run the scan one-time or on a schedule

Purview Catalog Browse & Search(Effortless discovery of trusted & accurate data)

Browse & Search

Discover your data based on relevance using signals derived from scanning, classification, business context

Benefits

• Empower business and technical data analysts via a catalog to find and interpret data

• Provide intelligent recommendations based on data relationships, business context, search history

• Power data scientists and engineers with business context to drive BI, Analytics, AI and ML initiatives

Browse & Search

Search results by relevance

Benefits

• Return relevant results without writing complex queries or applying advanced filters

• Semantic search by understanding the context of every single word in search query and the intent (searching for one asset or exploration) of the user

• Support for spell check, keyword suggestions, query expansion (synonyms, semantics) and content expansion (matching the keyword with things like glossary, classification and asset name)

Browse & Search

Filter search results by business terms, classifications, contacts

Asset Overview

Discover operational, semantic and business information about a specific dataset

Operational Metadata

Semantic Metadata

Business Metadata

Asset Schema

Discover technical, semantic and business information about a specific dataset

Asset Lineage

Trace lineage of data assets across the data estate

Benefits

• Ensure data provenance with a visual representation of owners, sources, transformation, and lifecycle

• Leverage support of Apache Atlas’s open-source Lineage APIs and built-in integrations with solutions such as Azure Data Factory, Azure Data Share and Power BI

• Analyze impact of changes to data and understand dependencies visually.

• Root cause analysis of failures by inspecting dependencies upstream and determine downstream impact

Asset Contacts

Identify experts and owners of the data asset

Asset Related

Browse assets by hierarchy. Works for unstructured, semi-structured and unstructured data

Structured Data

Browse Hierarchy

Unstructured Data

Browse Hierarchy

Purview Catalog Business Glossary(Search & Browse your data estate from a business lens)

Business Glossary

Consistent and curated understanding of business terms and definitions

Benefits

• Understand business context associated with data in the organization

• Bulk Import glossary terms from existing data dictionaries easily

• Flexible business terms definition with custom attributes per business domain

• Browse & Search your data estate from a business lens

Business Glossary

Vocabulary for business users attached to assets in the catalog

Business Glossary

Understand business context associated with data in the organization

Business Glossary

Bulk import business terms from existing dictionaries

Business Glossary

Flexible business terms definition per business domain

Classified as Microsoft Confidential

Purview Catalog Costs

Understanding Azure Purview Pricing

Data Map

Data Catalogue

Scanning & Classification

On Prem & Multi-Cloud Operational, Analytical, SaaS

Data Privacy

Data Producers & Consumers Data Engineers & SMEs Data Officers

• Business Glossary templates• Lineage visualization & workflows

Synapse

Open APIs

Power BI

AML

Sentinel

M365

SQL Server

Synapse

and more

• Catalog Insights (Asset, Scan, Glossary)• Sensitive Information Types & Labeling

insights

Data Discovery Data Use Governance

• Search, browse, asset details • Automated meta-data and lineage extraction• Automated classification based on content inspection

• Private Endpoint • Management center

Purview Data Catalog Base (C0)

* Power BI, SQL Sever on-prem, Azure Data Services including Synapse, Cosmos DB & Storage, Non-Microsoft systems including SAP ECC, SAP S4 HANA & Teradata, Multi-cloud systems including AWS S3

Purview Data Catalog (C1) Purview Data Insights (D1)

Purview Studio

Purview PlatformAutomated Scanning & Classification

• Serverless, pay per use • Includes connectors, scanning of sources, processing into

data assets, lineage capture, classification

Purview Data Map

• Dedicated per customer on shared infrastructure• Provisioned default capacity with option to add-on capacity

Azure Purview

Total Azure Purview Cost

•Preview Offer : 4 free capacity units per month through May 31, 2021•Preview Offer : Data Map Metadata storage (3GB/CU) free

top related