TY - JOUR
T1 - Interactive mining and retrieval from process traces
AU - Bottrighi, Alessio
AU - Canensi, Luca
AU - Leonardi, Giorgio
AU - Montani, Stefania
AU - Terenziani, Paolo
N1 - Publisher Copyright: © 2018 Elsevier Ltd
PY - 2018/11/15
Y1 - 2018/11/15
N2 - The traces of past process executions are maintained in many contexts, since they constitute a strategic source of information. Different tasks on such data can be supported. In particular, we focus on process model discovery, by proposing an approach that helps the analyst identify a good balance between overfitting and underfitting. To achieve this goal, we have designed SIM (Semantic Interactive Miner), an innovative interactive and incremental tool, which starts from a non-generalized model, and provides the user with a path retrieval facility to analyse the current model, and with semantic abstractions to build increasingly generalized models (through the selective merging of retrieved paths). Additionally, the tool exploits the path retrieval facility and an indexing strategy to support efficient trace retrieval. As a consequence, our framework represents the first contribution in the literature able to integrate process model discovery, path retrieval, and trace retrieval in a synergistic approach. We experimentally compare our tool to two well-known process mining algorithms, namely inductive miner (Leemans, Fahland, and van der Aalst, 2013) and heuristic miner (Weijters, van der Aalst, and de Medeiros, 2006). The comparison highlights the main innovative aspect of our approach, i.e., its ability to let the analyst directly use her/his domain knowledge to lead process model discovery, a feature that can be extremely advantageous in knowledge-rich applications, such as medical ones.
AB - The traces of past process executions are maintained in many contexts, since they constitute a strategic source of information. Different tasks on such data can be supported. In particular, we focus on process model discovery, by proposing an approach that helps the analyst identify a good balance between overfitting and underfitting. To achieve this goal, we have designed SIM (Semantic Interactive Miner), an innovative interactive and incremental tool, which starts from a non-generalized model, and provides the user with a path retrieval facility to analyse the current model, and with semantic abstractions to build increasingly generalized models (through the selective merging of retrieved paths). Additionally, the tool exploits the path retrieval facility and an indexing strategy to support efficient trace retrieval. As a consequence, our framework represents the first contribution in the literature able to integrate process model discovery, path retrieval, and trace retrieval in a synergistic approach. We experimentally compare our tool to two well-known process mining algorithms, namely inductive miner (Leemans, Fahland, and van der Aalst, 2013) and heuristic miner (Weijters, van der Aalst, and de Medeiros, 2006). The comparison highlights the main innovative aspect of our approach, i.e., its ability to let the analyst directly use her/his domain knowledge to lead process model discovery, a feature that can be extremely advantageous in knowledge-rich applications, such as medical ones.
KW - Business process management
KW - Information search and retrieval
KW - Knowledge representation and reasoning
UR - http://www.scopus.com/inward/record.url?scp=85047901740&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2018.05.041
DO - 10.1016/j.eswa.2018.05.041
M3 - Article
SN - 0957-4174
VL - 110
SP - 62
EP - 79
JO - Expert Systems with Applications
JF - Expert Systems with Applications
ER -