Alpha algorithm

This is an old revision of this page, as edited by Ɯ (talk | contribs) at 11:58, 13 April 2010. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.


The α-algorithm is an algorithm used in process mining. It was first put forward by Wil van der Aalst [1]. Several extensions or modifications of it have since been presented, which will be listed below.

It constructs P/T nets with special properties (workflow nets) from event logs (as might be collected by an ERP system). Each transition in the net corresponds to an observed task.

Short description

The algorithm takes a workflow log   as input and results in a workflow net being constructed.

It does so by examining causal relationships observed between tasks. For example, one specific task might always precede another specific task in every execution trace, which would be useful information.

It can be shown [2] that in the case of a complete workflow log generated by a sound SWF net, the net generating it can be reconstructed.

Definitions used

  • A workflow trace or execution trace is a string over an alphabet   of tasks.
  • A workflow log is a set of workflow traces.
  • A complete workflow log of a workflow net   is the set of firing sequences of   resulting in the end marking  .

Description

Declaratively, the algorithm can be presented as follows. Three sets of tasks are determined:

*   is the set of all tasks which occur in at least one trace
*   is the set of all tasks which occur trace-initially
*   is the set of all tasks which occur trace-terminally

Basic ordering relations are determined (  first, the latter three can be constructed therefrom)

*   iff b directly precedes a in some trace
*   iff  
*   iff  
*   iff   

Places are discovered. Each place is identified with a pair of sets of tasks, in order to keep the number of places low.

*   is the set of all maximal pairs   of sets of tasks such that
  * Neither   and   contain any members of   and
  *   is a subset of  
*   contains one place   for every member of  ,
  plus the input place   and the output place  

The flow relation   is the union of the following:

*  
*  
*  
*  

The result is

*  


Limitations

See [3]

Extensions

for example [4]

References

  1. ^ van der Aalst, W M P and Weijter, A J M M and Maruster, L (2003). "Workflow Mining: Discovering process models from event logs", IEEE Transactions on Knowledge and Data Engineering, vol 16
  2. ^ van der Aalst et al. 2003
  3. ^ A. de Medeiros, A K and van der Aalst, W M P and Weijters, A J M M (2003). "Workflow Mining: Current Status and Future Directions". in: "volume 2888 of Lecture Notes in Computer Science", Springer-Verlag
  4. ^ A. de Medeiros, A K and van Dongen, B F and van der Aalst, W M P and Weijters, A (2004). "Process mining: extending the α-algorithm to mine short loops"