Towards Scalable Process Mining Pipelines

Over the past two decades, process mining has proven to be a valuable approach to gain insights into or ganizations’ performance. The major sub-fields of discovery, conformance, and improvement have witnessed substantial de velopment. Contributions have covered the spectrum of better algorithms, ric...

Full description

Saved in:
Bibliographic Details
Main Author: Mohamed, Belal (author)
Other Authors: ElHelw, Mohamed (author), Awad, Ahmed (author)
Published: 2023
Subjects:
Online Access:https://bspace.buid.ac.ae/handle/1234/2937
https://ieeexplore.ieee.org/document/10361330
https://doi.org/10.1109/DASC/PiCom/CBDCom/Cy59711.2023.10361330.
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Over the past two decades, process mining has proven to be a valuable approach to gain insights into or ganizations’ performance. The major sub-fields of discovery, conformance, and improvement have witnessed substantial de velopment. Contributions have covered the spectrum of better algorithms, richer comparison metrics, and movement towards online analysis for process data. Mostly, these contributions were addressing process mining guidelines from the process mining manifesto. In this paper, we address the sixth guideline in the process mining manifesto. That is, process mining should be a continuous process. For this, we propose a pipelining approach that is: configurable, scalable, modular, and automated. We realize our proposal using Dask and evaluate it with different architectures, process discovery, and evaluation metrics.