학술논문

STARDUST: A Novel Process Mining Approach to Discover Evolving Models From Trace Streams
Document Type
Periodical
Source
IEEE Transactions on Services Computing IEEE Trans. Serv. Comput. Services Computing, IEEE Transactions on. 16(4):2970-2984 Aug, 2023
Subject
Computing and Processing
General Topics for Engineers
Business
Data models
Data mining
Computational modeling
Feature extraction
Context modeling
Process control
Process discovery
stream data mining
concept drift
sampling
pareto's principle
Language
ISSN
1939-1374
2372-0204
Abstract
In this article we introduce ${{\sf STARDUST}}$STARDUST (event STream Analysis for pRocess Discovery Using Sampling sTragies), a process discovery approach that analyses a trace stream, in order to discover a process model that may change over time. The basic idea is to adopt a sampling technique to select the most representative trace variants to be considered for the process discovery, then to alert a concept drift as the trace variants to be sampled change over time and, finally, to trigger the discovery of a new process model as a drift is alerted. We formulate the proposed approach under the assumption that the trace distribution commonly follows the Pareto's principle (i.e., a few trace variants covers the majority of cases) which is commonly satisfied in several business processes. Experimental results on various benchmark event logs handled as streams show the effectiveness of the proposed approach also compared to a state-of-the- art concept drift detection approach.