process mining reloaded: event structures as a unified representation of process models and event...

34
Marlon Dumas University of Tartu, Estonia Petri Nets 2015 | Brussels | 24 June 2015

Upload: marlon-dumas

Post on 13-Aug-2015

249 views

Category:

Education


1 download

TRANSCRIPT

Page 1: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Marlon Dumas

University of Tartu, Estonia

Petri Nets 2015 | Brussels | 24 June 2015

Page 2: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Process Mining

/

event log

discovered model

Discovery

Conformance

Deviance

Differencediagnostics

Performance

input model

Enhanced model

event log’

2

Page 3: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Automated Process Discovery

3

CID Task Time Stamp …

13219 Enter Loan Application 2007-11-09 T 11:20:10 -

13219 Retrieve Applicant Data 2007-11-09 T 11:22:15 -

13220 Enter Loan Application 2007-11-09 T 11:22:40 -

13219 Compute Installments 2007-11-09 T 11:22:45 -

13219 Notify Eligibility 2007-11-09 T 11:23:00 -

13219 Approve Simple Application 2007-11-09 T 11:24:30 -

13220 Compute Installements 2007-11-09 T 11:24:35 -

… … … …

Page 4: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Automated Process Discovery

• Relations-based– Alpha

4

Page 5: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Alpha Algorithm

• Direct successors:A > B, B > C, C > D, A > C, C > B, B > E, E > FC > E, E > GB > D

A B C D

A C B E F

• Causality:A B, C D, A C, B E, C E, E F, E G , B D

• Concurrency:B ║ C

• Exclusiveness: all other pairs

A B C E G

A

C

B

D

5

Page 6: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Alpha Relations Matrix

A B C D E F G

A # # # #

B # || # #

C || # # #

D # # # # #

E # # #

F # # # # # #

G # # # # # #

6

Page 7: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

A B C D E F G

A # # # # #

B # || # #

C || # # #

D # # # # #

E # # #

F # # # # # #

G # # # # # #

Alpha Algorithm – Patterns

7

a b, a c,b ║ c

Page 8: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Automated Process Discovery

• Relations-based– Alpha: lossy (Badouel, Petri Nets 2012)– Alpha++– Heuristics miner (frequency information)

• Genetic• Region theory• Petri net synthesis• Integer Linear Programming (ILP)• …

8

Page 9: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Automated Process Discovery

9

Page 10: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Conformance Checking

?

10

Page 11: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Alignment-Based ConformanceLog Model

A B C D EA B B C

Alignment

E

Fitness PrecisionHow much behavior of the log

is captured by the model?How accurate is the model

describing the log?Munoz-Gama et al. Petri nets 2013

11

Page 12: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Deviance Mining

12

T1 <e11[d111:v111, …, d11n:v11n] e12[d121:v121, …, d12m:v12m] … e1p[d1p1:v1p1, …, d1pm:v1pm]>…Tq <eq1[dq11:vq11, …, dq1n:vq1n] eq2[dq21:vq21, …, dq2m:vq2m] … eqp[dqp1:vqp1, …, dqpm:vqpm]>

T1 <e11[d111:v111, …, d11n:v11n] e12[d121:v121, …, d12m:v12m] … e1p[d1p1:v1p1, …, d1pm:v1pm]>…Tq <eq1[dq11:vq11, …, dq1n:vq1n] eq2[dq21:vq21, …, dq2m:vq2m] … eqp[dqp1:vqp1, …, dqpm:vqpm]>

Find a function F: Trace Boolean (or probability [0…1]) s.t.•F is an accurate approximation of the given labeling•F is explainable, e.g. set of simple predicates

Page 13: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Simple “timely” claims Simple “slow” claims

Deviance Mining via Model Delta Analysis

13Suriadi et al. Understanding Process Behaviours in a Large Insurance Company in Australia. CAiSE 2013

Page 14: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Deviance Mining via Model Delta Analysis

14

Page 15: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Deviance Mining via Sequence Classification

• Apply discriminative sequence mining methods to extract features characteristic of one class

• Build classification models (e.g. decision trees)• Extract difference diagnostics from classification model

C. Sun et al. Mining explicit rules for software process evaluation. ICSSP’2013.

15

Page 16: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

No Unified Foundation

≠ 16

Page 17: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

(Prime) Event Structures

• Model of concurrency based on events (occurrences of actions) and three relations– Causality– Conflict– Concurrency

17

Page 18: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Petri Nets Event Structures

18

Page 19: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Nets With Cycles Prefix Unfolding

21

Petri net NPetri net N

Complete prefix unfolding

Complete prefix unfolding

Causality-preserving prefix unfolding

Causality-preserving prefix unfolding

Page 20: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Comparison of Event Structures

22

?

ES1

ES2

Armas-Cervantes et al. Behavioral Comparison of Process Models Based on […] Event Structures. BPM’2014

Partially Synchronized Product (PSP)

Page 21: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

PSP Difference Statements

23

Page 22: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Comparison of Event Structures

24

In ES1, tasks C and B are mutually exclusive, while in ES2, B precedes C

In ES1, tasks C and B are mutually exclusive, while in ES2, B precedes C

?

ES1

ES2

Armas-Cervantes et al. Behavioral Comparison of Process Models Based on […] Event Structures. BPM’2014

Page 23: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

BP-Diff: BPMN model comparison

25http://diffbp-bpdiff.rhcloud.com/

Page 24: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Event Logs Event Structures

B || C

Concurrency Oracle

RunMerger

55 22 33

26

Page 25: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Event Structures for Log Delta Analysis

27van Beest et al. Log delta analysis: Interpretable differencing of business process event logs. BPM’2015

Page 26: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Event Structures for Log Delta Analysis

In L1, task C can be skipped after B, whereas in L2 it cannot

In L1, task C can be skipped after B, whereas in L2 it cannot

van Beest et al. Log delta analysis: Interpretable differencing of business process event logs. BPM’201528

Page 27: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Log Delta Analysis vs. Sequence Classification

448 cases7329 events

363 cases, 7496 events

Sequence classification 106-130 statements

IF |“NursingProgressNotes”| > 7.5 THEN L1IF |“Nursing Progress Notes”| ≤ 7.5 AND |“Nursing Assessment”| > 1.5 THEN L2…

Sequence classification 106-130 statements

IF |“NursingProgressNotes”| > 7.5 THEN L1IF |“Nursing Progress Notes”| ≤ 7.5 AND |“Nursing Assessment”| > 1.5 THEN L2…

Log delta analysis48 statements

In L1, “Nursing Primary Assessment” is repeated after “Medical Assign Start” and “Triage Request”, while in L2 it is not.…

Log delta analysis48 statements

In L1, “Nursing Primary Assessment” is repeated after “Medical Assign Start” and “Triage Request”, while in L2 it is not.… 29

van Beest et al. Log delta analysis: Interpretable differencing of business process event logs. BPM’2015

Page 28: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Event Structures for Conformance Checking

30

ABDEADBEACDEADCE

Page 29: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Event Structures for Conformance Checking

31

In the model, task C and B are in conflict, whereas in the log, B precedes C

In the model, task C and B are in conflict, whereas in the log, B precedes C

Page 30: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

… vs. alignment-based conformance checking

32

ABDEADBEACDEADCE

ABCDEABDCEADBCE

A B C D E

A C D E

A B D C E

A B D E A D B C E

A D C E

?

Page 31: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Event Structures for Process Discovery?

33

ABDEACDEACDF

Fold

Page 32: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Process Mining Reloaded

34

Page 33: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

The Road Ahead

• Developing more accurate concurrency oracles– Dealing with (short) loops in parallel branches

• Defining folding operators to generalize & simplify Petri nets synthesized from ES– Controlled generalization

• Extensions to events with data payloads

35

Page 34: Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs

Discovering concurrency

36