+ All Categories
Home > Documents > 33 Mine Your Own Business - 2013.secrus.org2013.secrus.org/2013/files/101_aalst.pdf]À ] Ç ^ t _ t...

33 Mine Your Own Business - 2013.secrus.org2013.secrus.org/2013/files/101_aalst.pdf]À ] Ç ^ t _ t...

Date post: 11-Jun-2018
Category:
Upload: trananh
View: 214 times
Download: 0 times
Share this document with a friend
66
33 PAGE 0 Mine Your Own Business Turning (Big) Data into Real Value using Process Mining prof.dr.ir. Wil van der Aalst Central and Eastern European Software Engineering Conference in Russia (CEE-SECR 2013), Moscow, October 25 th 2013
Transcript

33

PAGE 0

Mine Your Own Business

Turning (Big) Data into Real Value

using Process Mining

prof.dr.ir. Wil van der Aalst

Central and Eastern European Software Engineering

Conference in Russia (CEE-SECR 2013), Moscow,

October 25th 2013

PAGE 1

Season 1, Episode 4 (1969)

PAGE 2

How to get

started?

Evidence-

based BPM

and Auditing

Process

discovery

Process Mining:

The missing link

Big (Event)

Data

Aligning

reality and

model

PAGE 3

How to get

started?

Evidence-

based BPM

Process

discovery

Process Mining:

The missing link

Big (Event)

Data

Aligning

reality and

model

PAGE 4

Why do (larger)

organizations

have software?

PAGE 5

It is always about

supporting or improving

business processes

PAGE 6

Business process

problem or IT problem?

Business Process Management (BPM)

PAGE 7

PAGE 8 BPM efforts focus on models

PAGE 9

• enormous investments in process models

• large collections of "dead" process models

• not taken seriously, unrelated to reality

Models should be:

- descriptive,

- predictive, and/or

- prescriptive

PAGE 10

PAGE 11

How to get

started?

Evidence-

based BPM

and Auditing

Process

discovery

Process Mining:

The missing link

Big (Event)

Data

Aligning

reality and

model

Motivation: Increasing awareness of the

value of (Big) Data

• "In God we trust. All others must bring data"

(William Edwards Deming, statistician),

• "Data is a precious thing and will last longer

than the systems themselves" (Tim Berners-

Lee),

• "Statistics are like bikinis. What they reveal is

suggestive, but what they conceal is vital"

(Aaron Levenstein, statistician),

• "Every 2 days we create as much information as

we did up to 2003" (Eric Schmidt, Google CEO,

August 4, 2010).

PAGE 12

PAGE 13

In 10 years we will have 50

times as much data! (IDC)

PAGE 14

Motivation: Internet of Things

PAGE 15

Events

PAGE 16

PAGE 17

Big Data ?

Big … or fast and efficient?

www.solarteameindhoven.nl

PAGE 19

Process-awareness is an essential

but often forgotten ingredient when

converting big data into real value

PAGE 20

How to get

started?

Evidence-

based BPM

and Auditing

Process

discovery

Process Mining:

The missing link

Big (Event)

Data

Aligning

reality and

model

PAGE 21

process mining

data-oriented analysis (data mining, machine learning, business intelligence)

process model analysis (simulation, verification, optimization, gaming, etc.)

performance-oriented

questions, problems and

solutions

compliance-oriented

questions, problems and

solutions

PAGE 22

PAGE 24

let's play

Play-Out

PAGE 25

event logprocess model

A

B

C

DE

p2

end

p4

p3p1

start

Play-Out (Classical use of models)

PAGE 26

A B C D

A C B D A B C D

A E D

A C B D

A C B D

A E D

A E D

Let’s not worry about syntax (there is

difference between analysis and presentation)

PAGE 27

A

B

C

DE

p2

end

p4

p3p1

start

Play-In

PAGE 28

event log process model

A

B

C

DE

p2

end

p4

p3p1

start

Play-In

PAGE 29

A C B D A B C D

A E D

A C B D

A C B D

A E D

A E D A B C D

Example Process Discovery (Vestia, Dutch housing agency, 208 cases, 5987 events)

PAGE 30

Example Process Discovery (ASML, test process lithography systems, 154966 events)

PAGE 31

Example Process Discovery (AMC, 627 gynecological oncology patients, 24331 events)

PAGE 32

Replay

PAGE 33

event log process model

· extended model

showing times,

frequencies, etc.

· diagnostics

· predictions

· recommendations

A

B

C

DE

p2

end

p4

p3p1

start

Replay

PAGE 34

A B C D

A

B

C

DE

p2

end

p4

p3p1

start

Replay

PAGE 35

A E D

A

B

C

DE

p2

end

p4

p3p1

start

Replay can detect problems

PAGE 36

A C D

Problem!

missing token

Problem!

token left behind

Conformance Checking (WOZ objections Dutch municipality, 745 objections, 9583 event, f= 0.988)

PAGE 37

A

B

C

DE

p2

end

p4

p3p1

start

Replay can extract timing information

PAGE 38

A5 B8 C9 D13

5

8

9

13

3

4

5

4 3

2 6 5

8

7 6 4

7

7 4

3

PAGE 39

Performance Analysis Using Replay (WOZ objections Dutch municipality, 745 objections, 9583 event, f= 0.988)

ProM Demo: Fuzzy mining

PAGE 40

Disco Demo

PAGE 41

PAGE 42

Models are like the glasses required to see

and understand event data!

PAGE 43

How to get

started?

Evidence-

based BPM

and Auditing

Process

discovery

Process Mining:

The missing link

Big (Event)

Data

Aligning

reality and

model

Language identification in the limit

(Mark Gold 1967)

PAGE 44 Language identification in the limit by E Mark Gold, Information and Control, 10(5):447–474, 1967.

abc

abd

abc ?

ab(c|d) ?

ad

abbc

ac

… (ad)|(ab(c|d)) ?

ab*(c|d) ?

A language is learnable in

the limit if there exists a

perfect child that

generates only finitely

many hypotheses.

Learning is not easy …

• Even simple languages like

regular languages are not

learnable in the limit.

• Many settings: evil or well-

behaving mothers, with or

without negative examples,

frequencies, etc.

PAGE 45

sentence trace in event log

language process model

Process discovery algorithms (small selection)

PAGE 46

α algorithm

α++ algorithm

α# algorithm

language-based regions

state-based regions genetic mining

heuristic mining

hidden Markov models

neural networks

automata-based learning

stochastic task graphs

conformal process graph

mining block structures

multi-phase mining partial-order based mining

fuzzy mining

LTL mining

ILP mining

distributed genetic mining

ETM genetic algorithm Inductive Miner (infrequent)

PAGE 47

Mine your Map

PAGE 48

models are like maps, their usefulness

is determined by the intended use,

i.e., there is not a single "perfect map"

PAGE 49

How to get

started?

Evidence-

based BPM

and Auditing

Process

discovery

Process Mining:

The missing link

Big (Event)

Data

Aligning

reality and

model

Conformance Checking

PAGE 50

an activity that should

not happen happened

an activity that should

happen did not happen

an activity was executed

by the wrong person

an activity was

executed too late

two activities were

swapped

PAGE 51

• conformance checking to diagnose deviations

• squeezing reality into the model to do model-based

analysis

Alignments are essential!

PAGE 52

process

model event log

synchronous

move

move on

model only

move on log

only

Example: BPI Challenge 2012 (Dutch financial institute, doi:10.4121/uuid:3926db30-f712-4394-aebc-75976070e91f)

PAGE 53

“O_DECLINED” and “W_Wijzigen contractgegevens” are often skipped

Many moves on log of “O_CANCELLED”,

”O_CREATED”,”O_SELECTED”,

“O_SENT” occurred with the same

frequency value (i.e. 60) before parallel

branch

Many moves on log of “W_Afhandelen leads” ( > 2200 times) occurred in the end of traces

Loops of “W_Completeren aanvraag” and “W_Nabellen offertes” are often performed

Work of Arya Adriansyah (Replay project)

PAGE 54

“O_DECLINED” and “W_Wijzigen contractgegevens” are often skipped

Many moves on log of “O_CANCELLED”,

”O_CREATED”,”O_SELECTED”,

“O_SENT” occurred with the same

frequency value (i.e. 60) before parallel

branch

Many moves on log of “W_Afhandelen leads” ( > 2200 times) occurred in the end of traces

Loops of “W_Completeren aanvraag” and “W_Nabellen offertes” are often performed

Synchronous moves of “Completeren aanvraag”

Move on log of “Completeren aanvraag”

Moves on model towards end of traces

Move on log of “O_CANCELLED” and “A_CANCELLED”

Auditor's toolbox

PAGE 55

“O_ACCEPTED” has average sojourn time of 27.07 minutes, while “A_REGISTERED”, ”A_ACTIVATED”, and

“A_APPROVED” have average sojourn time of 29.56 minutes

Activity “W_Wijzigen contractgegevens” is the bottleneck, but it occured rarely (only 4 times)

The average waiting time for the input place of “W_Nabellen offertes+START” is very long (2.83 days) compares to the average waiting time of other places

Business analyst's

toolbox

PAGE 56

traffic jams are

projected on map

PAGE 57

Demand TomTom! Do not settle for restrictive

information systems and

static process models

predict: when

will I be home

recommend:

turn right

adapt: use real-

time traffic

information

PAGE 58

How to get

started?

Evidence-

based BPM

and Auditing

Process

discovery

Process Mining:

The missing link

Big (Event)

Data

Aligning

reality and

model

PAGE 59

How to get started?

Collect data: Events are everywhere!

• Minimal requirement:

events referring to an

activity name and a

process instance.

• Good to have:

timestamps, resource

information, additional

data elements.

• Challenges: scoping and

sometimes correlation.

60

databases, ERP systems (SAP etc.), WFM/BPM

logs, message logs, audit trails, etc.

Get at a process mining tool, e.g., ProM

61

Download from: www.processmining.org

600+ plug-ins available covering the

whole process mining spectrum

Commercial process mining tools

• Disco (Fluxicon)

• Perceptive Process Mining (before Futura Reflect and BPM|one)

• ARIS Process Performance Manager

• QPR ProcessAnalyzer

• Celonis Discovery

• Interstage Process Discovery (Fujitsu)

• Discovery Analyst (StereoLOGIC)

• XMAnalyzer (XMPro)

• …

Example of a dedicated process mining

consulting firm: ProcessGold AG.

62

Approach: Start simple

Questions:

• What kind problems

would you like to

address (cost, time,

risk, compliance,

service, etc.)?

• Related to discovery,

conformance,

enhancement?

• Iterative process: can

be “curiosity driven”

initially.

63

Stage 0: plan and justify

Stage 2: create control-flow model

and connect event log

Stage 1: extract

historic

data

handmade

models

objectives

(KPIs)questions

event log control-flow model

Stage 3: create integrated process

model

event log process model

data understanding business understanding

Stage 4: operational support

explore

discover

check

compare

promote

enhance

detect

predict

recommend

inte

rpre

t

diagnose

current data

redesign

adjust

intervene

support

Join our expedition: Mine your processes!

PAGE 64

process

mining

data-oriented analysis

(data mining, machine learning, business intelligence)

process model analysis

(simulation, verification, etc.)pe

rform

an

ce

-orie

nte

d q

ue

stio

ns

,

pro

ble

ms

an

d s

olu

tion

s

co

mp

lian

ce

-orie

nte

d q

ue

stio

ns

,

pro

ble

ms

an

d s

olu

tion

s

PAGE 65

Learn more?

http://www.youtube.com/watch?v=7oat7MatU_U

Process Mining Movie

http://pais.hse.ru/


Recommended