semantic-aware business intelligence · transforming and summarizing available business data from...
TRANSCRIPT
![Page 2: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/2.jpg)
Contents
Motivation Traditional BI Vs. Exploratory BI On the Need of Semantic-Aware BI Systems
Semantic-Aware Formalisms A Characterization of BI Systems
Current State of the Art Discussion and Future Research Trends
An Open-Access Semantic-Aware BI System Architecture Modules AMDO, GEM, ORE and COAL
2 09/07/2013 Semantic-Aware Business Intelligence
![Page 3: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/3.jpg)
The Fact
3
“The amount of information in the world doubles every 20 months and the size and number of databases are increasing even faster”
Small amounts of data will be ever considered by a human Automatic Knowledge Discovery is badly NEEDED
Data growth rate
“We are drowning in information but starved for knowledge” John Naisbitt, “Megatrends” (1982)
09/07/2013 Semantic-Aware Business Intelligence
![Page 4: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/4.jpg)
The Definition
• Business intelligence is “the ability to apprehend the interrelationships of presented facts in such a way as to guide action towards a desired goal”
4
H. P. Luhn A Business Intelligence System IBM Journal of Research and Development. Vol. 2(4). 1958
09/07/2013 Semantic-Aware Business Intelligence
![Page 5: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/5.jpg)
The Definition
• Business intelligence is “an umbrella term that includes the applications, infrastructure and tools, and best practices that enable access to and analysis of information to improve and optimize decisions and performance”
5
Gartner Reports, IT Glossary, 2013
09/07/2013 Semantic-Aware Business Intelligence
![Page 6: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/6.jpg)
The Definition
• [Business] Business intelligence is “the process of collecting business data and turning it into information that is meaningful and actionable towards a strategic goal”
• [IT] Business Intelligence is “aimed at gathering, transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making tasks”
6
Data Analytics: Reporting, OLAP, data mining, prediction (what-if analysis), etc. Data Modeling: Data warehousing and the multidimensional model 09/07/2013 Semantic-Aware Business Intelligence
![Page 7: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/7.jpg)
The Definition
• [IT] Business Intelligence is “aimed at gathering, transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making tasks”
7
Data Modeling: Data warehousing and the multidimensional model 09/07/2013 Semantic-Aware Business Intelligence
![Page 8: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/8.jpg)
The Market
• BI has become a huge industrial and scientific domain and a major economic driver
• Some numbers: – In 2014, the overall BI market value will be $100 billion (the Economist) – It is expected to have the largest increment in data management and
analytics, at almost 10% each year, roughly twice as fast as the software business as a whole.
– The market growth is estimated in $14 billion in 2014, up from $8.8 billion in 2008 (Forrester and the Economist)
– BI remains unaffected by the economic crisis and it is mentioned among the top priorities of Chief Information Officers worldwide (ranging from 1st to 5th according to the source)
8 09/07/2013 Semantic-Aware Business Intelligence
![Page 9: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/9.jpg)
Traditional Vs. Exploratory BI
Joint work with: Alberto Abelló (UPC) Torben Bach Pedersen (AAU) Rafael Berlanga (UJI) Victoria Nebot (UJI) M.J. Aramburu (UJI) Alkis Simitsis (HP Labs)
![Page 10: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/10.jpg)
Traditional BI
10 09/07/2013 Semantic-Aware Business Intelligence
![Page 11: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/11.jpg)
Lessons Learnt
11
Requirements
1. Requirement Elicitation
Designer End-Users
09/07/2013 Semantic-Aware Business Intelligence
![Page 12: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/12.jpg)
Lessons Learnt
12
2. Design and Deployment of the DW and ETL flows
Sources ETL Flows DW
09/07/2013 Semantic-Aware Business Intelligence
![Page 13: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/13.jpg)
Lessons Learnt
13
3. Flexible and Dynamic Query Tools
DW End-Users
09/07/2013 Semantic-Aware Business Intelligence
![Page 14: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/14.jpg)
New Requirements
• From the point of view of the end-user:
14
Right-time Analysis
Fresh data
Ad-hoc Requirements
09/07/2013 Semantic-Aware Business Intelligence
![Page 15: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/15.jpg)
New Requirements
• From the point of view of the end-user:
15
Analyze any source
Extraction
Transformation
Load / Visualization
09/07/2013 Semantic-Aware Business Intelligence
![Page 16: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/16.jpg)
Exploratory BI
16 09/07/2013 Semantic-Aware Business Intelligence
Materialized Data
![Page 17: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/17.jpg)
Towards Exploratory BI
09/07/2013 Semantic-Aware Business Intelligence 17
Loosely Coupled Tightly Coupled
Advanced Optimization
Weak Optimization
ETQ
ETL
Live BI
Ad-hoc BI Open BI
On-demand BI
Fusion Cubes
Data Fusion
Situational BI
Semantic Cockpit
![Page 18: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/18.jpg)
Challenge
• From the IT point of view:
09/07/2013 Semantic-Aware Business Intelligence 18
End-user Requirement Sources
MD Schema
ETL Flows
MACHINE PROCESSABLE
METADATA
SUPERVISION
![Page 19: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/19.jpg)
Semantic-Aware Formalisms
Joint work with: Alberto Abelló (UPC) Torben Bach Pedersen (AAU) Rafael Berlanga (UJI) Victoria Nebot (UJI) M.J. Aramburu (UJI) Alkis Simitsis (HP Labs)
![Page 20: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/20.jpg)
Basic Semantic Web Architecture
09/07/2013 Semantic-Aware Business Intelligence 20
Data Format: RDF
RDF(S)
OWL(2)
Query:
SPARQL
Ontology:
Source: Sven Groppe. Data Management and Query Processing in Semantic Web Databases
![Page 21: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/21.jpg)
RDF Data
• RDF model
• Several serialization formats – N3, Turtle, RDF/XML, etc.
09/07/2013 Semantic-Aware Business Intelligence 21
![Page 22: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/22.jpg)
Ontologies: RDF(S)
• Describe the schema within the same framework – Extends RDF to allow triples to be defined over
classes and properties
• Semantics based on type systems – Classes (rdfs:Class / rdf:type) – Subclasses (rdfs:subClassOf) – Typed properties (rdfs:domain / rdfs:range)
09/07/2013 Semantic-Aware Business Intelligence 22
![Page 23: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/23.jpg)
Ontologies
09/07/2013 Semantic-Aware Business Intelligence 23
Datalog Description Logics
Focus Instances Knowledge
Approach Centralized Decentralized
Reasoning Closed-world assumption
Open-world assumption
Unique name Unique name assumption
Non-unique name assumption
![Page 24: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/24.jpg)
Description Logics
09/07/2013 Semantic-Aware Business Intelligence 24
TBox (Terminology)
ABox (Assertions)
C(a)
R(a,b)
Source: DL Handbook
![Page 25: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/25.jpg)
Description Logics
09/07/2013 Semantic-Aware Business Intelligence 25
TBox (Terminology)
ABox (Assertions)
C(a)
R(a,b)
Source: DL Handbook
![Page 26: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/26.jpg)
Reasoning
• Traditional Reasoning Inferences – Concept satisfiability: Is a concept non-
contradictory? • Example: Concept correctness
– Subsumption: Is an ontology concept C more general than another concept D?
• Example: Taxonomies and equivalences
– Query Answering: All asserted instances that satisfy a concept description
• Example: Arbitrary queries
09/07/2013 Semantic-Aware Business Intelligence 26
![Page 27: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/27.jpg)
Open-World Assumption
• Something evaluates false only if it contradicts other information in the ontology
09/07/2013 Semantic-Aware Business Intelligence 27
hasSon(Iokaste,Oedipus) hasSon(Iokaste,Polyneikes) hasSon(Oedipus,Polyneikes) hasSon(Polyneikes,Thersandros) patricide(Oedipus) ¬patricide(Thersandros)
Oedipus
Iokaste
Polyneikes
Thersandros Query≡∃hasSon.(patricide ⊓ ∃hasSon.¬patricide) ABox ⊨ Query(Iokaste)?
Source: DL Handbook
![Page 28: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/28.jpg)
OWL
09/07/2013 Semantic-Aware Business Intelligence 28
Source: Sven Groppe. Data Management and Query Processing in Semantic Web Databases
OWL DL
OWL Lite
RDFS Subproperties, domain, range,
subclasses, individuals
Intersection, equality, cardinality (0/1), datatypes,
inverse, transitive, symmetric, hasValue, someValuesFrom,
allValuesFrom
unionOf, negation Arbitraty cardinality Enumerated types
![Page 29: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/29.jpg)
OWL 2 Profiles
09/07/2013 Semantic-Aware Business Intelligence 29
OWL 2 EL:
Based on EL++
Large number of properties / classes Reasoning: Polynomial with regard to the ontology TBOX
OWL 2 QL:
Based on DL-Lite Captures (most of) ER and UML expressive power Reasoning: Reducible to LOGSPACE (i.e., DBs)
OWL 2 RL: Based on Description Logic programs Scalable reasoning without sacrificing much expressivity Reasoning: Polynomial with regard to the size of the ontology
![Page 30: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/30.jpg)
A Characterization of BI Systems
Joint work with: Alberto Abelló (UPC) Torben Bach Pedersen (AAU) Rafael Berlanga (UJI) Victoria Nebot (UJI) M.J. Aramburu (UJI) Alkis Simitsis (HP Labs)
![Page 31: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/31.jpg)
A Characterization of BI Systems
31 09/07/2013 Semantic-Aware Business Intelligence
Materialized Data
![Page 32: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/32.jpg)
BI Categorization Criteria
09/07/2013 Semantic-Aware Business Intelligence 32
![Page 33: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/33.jpg)
SW Categorization Criteria
09/07/2013 Semantic-Aware Business Intelligence 33
![Page 34: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/34.jpg)
State of the Art
09/07/2013 Semantic-Aware Business Intelligence 34
![Page 35: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/35.jpg)
Trends Identified
• Data Schema Design (Trend I)
09/07/2013 Semantic-Aware Business Intelligence 35
Reasoning: Concept Satisfiability
FDs MD Ids
![Page 36: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/36.jpg)
Trends Identified
• Data Schema Design (Trend II)
09/07/2013 Semantic-Aware Business Intelligence 36
Reasoning: Fds ID Discovery
UML2DL UML2Datalog
![Page 37: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/37.jpg)
Common Features
• Data Schema Design
09/07/2013 Semantic-Aware Business Intelligence 37
Legend:
![Page 38: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/38.jpg)
Trends Identified
• Data Schema Design (Trend III)
09/07/2013 Semantic-Aware Business Intelligence 38
V: Logic rules
Q: Ad hoc alg.
![Page 39: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/39.jpg)
Common Features
09/07/2013 Semantic-Aware Business Intelligence 39
Legend:
![Page 40: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/40.jpg)
Trends Identified
• Data Provisioning
09/07/2013 Semantic-Aware Business Intelligence 40
UML2DL UML2Datalog
![Page 41: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/41.jpg)
Common Features
09/07/2013 Semantic-Aware Business Intelligence 41
Legend:
![Page 42: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/42.jpg)
Common Features
09/07/2013 Semantic-Aware Business Intelligence 42
Legend:
![Page 43: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/43.jpg)
Conclusions and Future Research Trends
• Small steps towards exploratory BI – Far from being mature!
• Expressivity and Computation is correlated (expected!) – Non-standard reasoning services not considered
• Reasoning at the instance level is prohibitive – Size matters for data complexity – Current solutions for ETL yield combined complexity – Datalog and Ontology-Based Data Access
• Reasoning at the schema level is feasible... – Combined complexity
… but exploratory systems demand further advances – Aggregations have been completely overlooked – LaV Vs. GaV
09/07/2013 Semantic-Aware Business Intelligence 43
![Page 44: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/44.jpg)
Conclusions and Future Research Trends
• The multidimensional model as a mature modeling paradigm for exploratory BI – Specific ontology languages (RDF Data Cube
Vocabulary) – Detecting MD expressions
• MD optimization • Expressiveness Vs. Reasoning Vs. Kind of Queries
– Cloud and parallel computing (TROWL) • The Web and Data Silos / Data Niches
– Autonomy Vs. Consistency – Registries (e.g., Freebase, DBPedia)
09/07/2013 Semantic-Aware Business Intelligence 44
![Page 45: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/45.jpg)
A Semantic-Aware Business Intelligence System
Joint work with: Alberto Abelló (UPC) Petar Jovanovic (UPC) Alkis Simitsis (HP Labs)
![Page 46: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/46.jpg)
Our Vision
• Towards Exploratory BI:
09/07/2013 Semantic-Aware Business Intelligence 46
MULTIDIMENSIONAL MODEL AND BPMN
![Page 47: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/47.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 47
![Page 48: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/48.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 48
takings
![Page 49: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/49.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 49
takings
Up-to-date sources
![Page 50: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/50.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 50
takings
Up-to-date sources
![Page 51: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/51.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 51
takings
Up-to-date sources
![Page 52: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/52.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 52
takings
Up-to-date sources
![Page 53: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/53.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 53
takings
Up-to-date sources
+ slicers (e.g., year > 2000)
![Page 54: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/54.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 54
takings
Up-to-date sources
![Page 55: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/55.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 55
A: 1/1/2013 B: Today C: Today
takings
Up-to-date sources
![Page 56: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/56.jpg)
09/07/2013 Semantic-Aware Business Intelligence 56
![Page 57: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/57.jpg)
09/07/2013 Semantic-Aware Business Intelligence 57
Can ETLnew be rewritten in terms of ETLref?
![Page 58: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/58.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 58
![Page 59: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/59.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 59
![Page 60: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/60.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 60
income?
![Page 61: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/61.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 61
M(movie, income, director) RDBMS
income?
![Page 62: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/62.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 62
M(movie, income, director) RDBMS
SELECT movie, income FROM M
income?
![Page 63: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/63.jpg)
A Toy Example
09/07/2013 Semantic-Aware Business Intelligence 63
![Page 64: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/64.jpg)
Contextualization
09/07/2013 Semantic-Aware Business Intelligence 64
Materialized Data
![Page 65: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/65.jpg)
Contextualization
09/07/2013 Semantic-Aware Business Intelligence 65
![Page 66: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/66.jpg)
The AMDO Module
![Page 67: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/67.jpg)
AMDO: The MD Integrity Constraints
09/07/2013 Semantic-Aware Business Intelligence 67
![Page 68: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/68.jpg)
AMDO: The MD Integrity Constraints
09/07/2013 Semantic-Aware Business Intelligence 68
D1 (year)
D2 (product)
F (profit)
Summarizability • Disjointness • Completeness • Compatibility (2013)
Spain (t-shirt)
The MD Space
![Page 69: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/69.jpg)
AMDO: Overview
09/07/2013 Semantic-Aware Business Intelligence 69
![Page 70: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/70.jpg)
AMDO: Dimension Discovery
• IC: Facts are related to concepts by means of to-one relationships
• Definition: A dimensional concept is defined by an ending concept and a path of properties. The path must be considered because it adds relevant semantics.
09/07/2013 Semantic-Aware Business Intelligence 70
![Page 71: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/71.jpg)
AMDO: Measure Discovery
• IC: Measures are numerical attributes enabling correct data aggregation (datatype)
• Definition: A measure is defined by a datatype and a path of properties (i.e., a composite property)
09/07/2013 Semantic-Aware Business Intelligence 71
![Page 72: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/72.jpg)
AMDO: Fact Likeliness
• Fact Estimation – f(#D, #M) – Other parameters could be considered
09/07/2013 Semantic-Aware Business Intelligence 72
![Page 73: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/73.jpg)
The GEM Module
![Page 74: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/74.jpg)
GEM: Overview
09/07/2013 Semantic-Aware Business Intelligence 74
![Page 75: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/75.jpg)
GEM: Requirement Validation
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 75
![Page 76: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/76.jpg)
GEM: Requirement Validation
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 76
![Page 77: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/77.jpg)
GEM: Requirement Validation
Concept’s tagging… •Identifying concepts in the ontology
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 77
![Page 78: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/78.jpg)
GEM: Requirement Validation
Concept’s tagging… •Identifying concepts in the ontology •Tagging the concepts with appropriate MD roles
L
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 78
![Page 79: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/79.jpg)
GEM: Requirement Validation
Concept’s tagging… •Identifying concepts in the ontology •Tagging the concepts with appropriate MD roles
L
M
D
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 79
![Page 80: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/80.jpg)
GEM: Requirement Validation
Concept available?
L
M
D
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 80
![Page 81: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/81.jpg)
GEM: Requirement Validation
Concept available?
L
M
D
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 81
![Page 82: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/82.jpg)
GEM: Requirement Validation
Concept available? If no source can supply that concept:
L
M
D
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 82
![Page 83: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/83.jpg)
GEM: Requirement Validation
Concept available? If no source can supply that concept:
• Search for taxonomies or synonyms • Operations
L
M
D
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 83
![Page 84: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/84.jpg)
GEM: Requirement Validation
UNION
Concept available? If no source can supply that concept:
• Search for taxonomies or synonyms • Operations
L
M
D
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 84
![Page 85: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/85.jpg)
GEM: Requirement Validation
UNION
L
M
D
Building the initial ETL structure…
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 85
![Page 86: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/86.jpg)
GEM: Requirement Validation
UNION
L
M
D
EXTRACTION
EXTRACTION
EXTRACTION EXTRACTION
UNION
Building the initial ETL structure…
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 86
![Page 87: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/87.jpg)
GEM: Requirement Validation
L
M
D
Concepts are identified in the ontology, tagged with a MD role and it is guaranteed at least one source can supply them
Requirement: Revenue related to Serbian customers.
09/07/2013 Semantic-Aware Business Intelligence 87
![Page 88: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/88.jpg)
L
D
M Identifying paths… • Paths between tagged concepts
GEM: Requirement Completion
09/07/2013 Semantic-Aware Business Intelligence 88
![Page 89: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/89.jpg)
L
D
M
Paths between the required concepts are identified, concepts are partially tagged with MD roles
GEM: Requirement Completion
09/07/2013 Semantic-Aware Business Intelligence 89
![Page 90: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/90.jpg)
L
D
C L
M Cm
L
L
All concepts are appropriately tagged and generated schemas are validated according to MD integrity constraints. Then, the schema transformations are lowered to the instance level by means of ETL operators (Operator Library)
GEM: MD Validation and ETL Generation
09/07/2013 Semantic-Aware Business Intelligence 90
![Page 91: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/91.jpg)
GEM Output: Multidimensional Schema
09/07/2013 Semantic-Aware Business Intelligence 91
![Page 92: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/92.jpg)
GEM Output: ETL Flows
09/07/2013 Semantic-Aware Business Intelligence 92
![Page 93: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/93.jpg)
The ORE Module
![Page 94: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/94.jpg)
ORE: TPC-H Example
09/07/2013 Semantic-Aware Business Intelligence 94
Example information requirements: IR1: The total quantity of the parts shipped from Spanish suppliers to French
customers. IR2: For each nation, the profit for all supplied parts, shipped after 01/01/2011. IR3: The total revenue of the parts supplied from East Europe. IR4: For German suppliers, the total available stock value of supplied parts. IR5: Shipping priority and total potential revenue of the parts ordered before certain
date and shipped after certain date to a customer of a given segment.
![Page 95: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/95.jpg)
ORE: TPC-H Example
IR1 IR2
MD Schema satisfying IR1 + IR2
Iteration 1:
09/07/2013 Semantic-Aware Business Intelligence 95
![Page 96: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/96.jpg)
ORE: TPC-H Example
MD Schema satisfying IR1- IR5
Final Output:
09/07/2013 Semantic-Aware Business Intelligence 96
![Page 97: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/97.jpg)
ORE: Overview
Inputs MD interpretations of requirements (GEM)
Stages 1. Matching Facts 2. Matching Dimensions 3. Complementing the MD Design 4. Integration
09/07/2013 Semantic-Aware Business Intelligence 97
![Page 98: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/98.jpg)
ORE: Matching Facts
• Two facts match if they produce an equivalent set of points in the MD space
• Alternative solutions with different costs
09/07/2013 Semantic-Aware Business Intelligence 98
![Page 99: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/99.jpg)
ORE: Matching Dimensions
09/07/2013 Semantic-Aware Business Intelligence 99
• Dimension - partially ordered set of individual levels (DAG) • We search for possible matchings among the individual levels • Match levels through the shortest path producing a valid MD
relation (=, 1-1, 1-* or *-1 ) between them • Alternative solutions with different costs
![Page 100: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/100.jpg)
ORE: Integration
09/07/2013 Semantic-Aware Business Intelligence 100
• Producing the final MD schema – Relaxing the final schema
from currently irrelevant information • Two phases
i. Partitioning grouping different concepts that:
• Produce a connected subgraph • Have the same MD interpretation
ii. Folding
• Consider only the concepts currently required by the user
• All the knowledge still preserved in TM for future integration steps
IR1 + IR2
![Page 101: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/101.jpg)
The COAL Module
![Page 102: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/102.jpg)
COAL: TPC-H Example
09/07/2013 Semantic-Aware Business Intelligence 102
![Page 103: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/103.jpg)
COAL: TPC-H Example
09/07/2013 Semantic-Aware Business Intelligence 103
![Page 104: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/104.jpg)
COAL: TPC-H Example
Semantic-Aware Business Intelligence 09/07/2013 104
![Page 105: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/105.jpg)
COAL: Goals and Challenges
• WHAT? – Incremental integration of ETLs – Consolidation Algorithm
• Apply equivalence rules • Maximize matching area
– Consider partial matching
– Consider execution costs • HOW?
– Consider the ETL as DAG 1. Match the inputs 2. Match paths from the same input(s)
– Consider a pair of operations can match a. Fully b. Partially
– Two operations may match if their inputs coincide
Semantic-Aware Business Intelligence 09/07/2013 105
![Page 106: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/106.jpg)
COAL: Equivalence Rules
• Unary Operations: – Swap – Distribute – Factorize
• Binary Operations: – Associate – Distribute
Semantic-Aware Business Intelligence 09/07/2013 106
![Page 107: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/107.jpg)
COAL: Equivalence Rules
09/07/2013 Semantic-Aware Business Intelligence 107
![Page 108: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/108.jpg)
COAL: Operator Matching
Semantic-Aware Business Intelligence 09/07/2013 108
![Page 109: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/109.jpg)
COAL: Operator Matching
• Filter
• Schema modification
• Join • Union
Semantic-Aware Business Intelligence 09/07/2013 109
![Page 110: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/110.jpg)
COAL: Main Idea
1. Match leaves-sources → queue 2. While queue is not empty
I. Try to match next (topological) operation II. Else for each pair of operations
i. Reorder both graphs ii. If full match then queue iii. Elsif partial match then add to output list
III. If no match then add to output list
3. Choose the lowest cost in output list
09/07/2013 Semantic-Aware Business Intelligence 110
![Page 111: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/111.jpg)
COAL: Main Idea
• Invariants – One pair of operations matches – A new match is only considered if inputs match
• Rule – Compare operations before trying to move
09/07/2013 Semantic-Aware Business Intelligence 111
Matched Matched
Oref
O’ref
Onew
O’new
![Page 112: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/112.jpg)
COAL: Main Idea
• Invariants – One pair of operations matches – A new match is only considered if inputs match
• Rule – Compare operations before trying to move
09/07/2013 Semantic-Aware Business Intelligence 112
Matched Matched
Oref
O’ref
Onew
O’new
![Page 113: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/113.jpg)
COAL: Main Idea
• Invariants – One pair of operations matches – A new match is only considered if inputs match
• Rule – Compare operations before trying to move
09/07/2013 Semantic-Aware Business Intelligence 113
Matched Matched
Oref
O’ref Onew
O’new
![Page 114: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/114.jpg)
COAL: Main Idea
• Invariants – One pair of operations matches – A new match is only considered if inputs match
• Rule – Compare operations before trying to move
09/07/2013 Semantic-Aware Business Intelligence 114
Matched Matched
Oref
O’ref Onew
O’new
![Page 115: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/115.jpg)
Conclusions
• BI Systems Characterization – Exploratory BI – State of the Art: Trends
• Technical Challenges – Semantic-Aware Systems – Autonomy Vs. Consistency – Automation
• Our Vision: A System to Enable Exploratory BI – AMDO – GEM – ORE – COAL
• Still a lot to do!
09/07/2013 Semantic-Aware Business Intelligence 115
![Page 116: Semantic-Aware Business Intelligence · transforming and summarizing available business data from available sources to generate analytical information suitable for decision-making](https://reader034.vdocuments.mx/reader034/viewer/2022050419/5f8f061e11c6c20c6a0e0a96/html5/thumbnails/116.jpg)
Gràcies per la seva Atenció! ¡Gracias por su Atención!
Thank you for your Attention!
Oscar Romero ([email protected])
EM Master IT4BI: http://it4bi.univ-tours.fr/
EM Joint Doctorate IT4BI-DC: http://it4bi-dc.ulb.ac.be/
Questions?