What is it about?

This paper is about process mining, that is a research discipline that aims to discover, monitor and improve real processing using event logs. In this paper we tackle the problem of predicting the next activity given a sequence of activities. The proposed method can work on the the Spark parallel computation framework in order to process massive logs.

Featured Image

Why is it important?

This work is important since the prediction of the next activity is based on a scalable approach that can process big data, namely large event logs. This is possible thanks to a parallel and distributed solution (by exploiting the Spark parallel computation framework) which can make reasonable decisions in the absence of perfect models.

Perspectives

This work is a highly cited work in the area of process mining, defines a general approach, and faces a very practical problem. The background is a solid work on sequence mining.

Prof. Donato Malerba
Universita degli Studi di Bari Aldo Moro

Read the Original

This page is a summary of: Distributed Learning of Process Models for Next Activity Prediction, January 2018, ACM (Association for Computing Machinery),
DOI: 10.1145/3216122.3216125.
You can read the full text:

Read

Contributors

The following have contributed to this page