Logo PTI Logo FedCSIS

Proceedings of the 20th Conference on Computer Science and Intelligence Systems (FedCSIS)

Annals of Computer Science and Information Systems, Volume 43

Segmentation and Process Assignment of Semi-Structured Event Logs

, , ,

DOI: http://dx.doi.org/10.15439/2025F4755

Citation: Proceedings of the 20th Conference on Computer Science and Intelligence Systems (FedCSIS), M. Bolanowski, M. Ganzha, L. Maciaszek, M. Paprzycki, D. Ślęzak (eds). ACSIS, Vol. 43, pages 231241 ()

Full text

Abstract. Process mining provides valuable insights by discovering process models from execution logs. However, its effectiveness depends heavily on high-quality, well-structured logs. Many real-world systems produce low-level, semi-structured logs lacking clear process identifiers, causing misalignment with their intended process models. This paper introduces a method for structuring raw event logs by segmenting event streams and mapping them to known processes. Using process traces from experienced users, we develop a model that infers process assignments in unstructured logs. Our approach is motivated by a modular enterprise system without predefined workflows, where dynamic processes generate low-level logs requiring interpretation. We validate our method on a semi-synthetic business dataset and a fully synthetic dataset from PLG2. Our results demonstrate that trace segmentation improves process discovery, aligns logs with meaningful structures, and significantly enhances process mining in unstructured environments.

References

  1. W. M. van der Aalst, B. F. van Dongen, J. Herbst, L. Maruster, G. Schimm, and A. J. Weijters, “Workflow mining: A survey of issues and approaches,” Data & knowledge engineering, vol. 47, no. 2, pp. 237–267, 2003.
  2. J. Claes and G. Poels, “Process mining and the prom framework: an exploratory survey,” in Business Process Management Workshops: BPM 2012 International Workshops, Tallinn, Estonia, September 3, 2012. Revised Papers 10. Springer, 2013, pp. 187–198.
  3. S. J. van Zelst, F. Mannhardt, M. de Leoni, and A. Koschmider, “Event abstraction in process mining: literature review and taxonomy,” Granular Computing, vol. 6, pp. 719–736, 2021.
  4. I. Helal and A. Awad, “Correlating unlabeled events at runtime,” arXiv preprint https://arxiv.org/abs/2004.09971, 2020.
  5. E. Brzychczy, T. Pełech-Pilichowski, and Z. Dworakowski, “Case id detection based on time series data – the mining use case,” 2024. [Online]. Available: https://arxiv.org/abs/2410.23846
  6. R. Andrews, C. G. van Dun, M. T. Wynn, W. Kratsch, M. K. Röglinger, and A. H. ter Hofstede, “Quality-informed semi-automated event log generation for process mining,” Decision Support Systems, vol. 132, p. 113265, 2020.
  7. D. Sanchez-Charles, J. Carmona, and R. Raventos, “Building event logs from relational databases in the XES format,” Applied Sciences, vol. 12, no. 21, p. 10832, 2022.
  8. C. Kecht, A. Egger, W. Kratsch, and M. Röglinger, “Event log construction from customer service conversations using natural language inference,” in 3rd International Conference on Process Mining (ICPM), 2021.
  9. Ł. Korzeniowski and K. Goczyła, “Discovering interactions between applications with log analysis,” in 2022 17th Conference on Computer Science and Intelligence Systems (FedCSIS). IEEE, 2022, pp. 861–869.
  10. C. W. Günther, A. Rozinat, and W. M. van der Aalst, “Activity mining by global trace segmentation,” in Business Process Management Workshops: BPM 2009 International Workshops, Ulm, Germany, September 7, 2009. Revised Papers 7. Springer, 2010, pp. 128–139.
  11. N. Tax, N. Sidorova, R. Haakma, and W. M. van der Aalst, “Event abstraction for process mining using supervised learning techniques,” in Proceedings of SAI Intelligent Systems Conference (IntelliSys) 2016: Volume 1. Springer, 2018, pp. 251–269.
  12. N. Martin, B. Depaire, and A. Caris, “The use of process mining in a business process simulation context: Overview and challenges,” in 2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM). IEEE, 2014, pp. 381–388.
  13. W. M. van der Aalst, “Object-centric process mining: dealing with divergence and convergence in event data,” in Software Engineering and Formal Methods: 17th International Conference, SEFM 2019, Oslo, Norway, September 18–20, 2019, Proceedings 17. Springer, 2019, pp. 3–25.
  14. W. M. van der Aalst and A. Berti, “Discovering object-centric Petri nets,” Fundamenta informaticae, vol. 175, no. 1-4, pp. 1–40, 2020.
  15. A. F. Ghahfarokhi, G. Park, A. Berti, and W. M. van der Aalst, “Ocel: a standard for object-centric event logs,” in European Conference on Advances in Databases and Information Systems. Springer, 2021, pp. 169–175.
  16. M. J. Jans, M. Alles, and M. A. Vasarhelyi, “Process mining of event logs in auditing: Opportunities and challenges,” International Journal of Accounting Information Systems, vol. 12, no. 1, pp. 1–20, 2011.
  17. A. Burattin, A. Sperduti, and W. M. van der Aalst, “Control-flow discovery from event streams,” in Proceedings of the IEEE Congress on Evolutionary Computation (CEC), 2014, pp. 2420–2427.
  18. D. Fahland, “Extracting and pre-processing event logs,” CoRR, vol. abs/2211.04338, 2022.
  19. W. M. van der Aalst, A. J. Weijters, and L. Maruster, “Workflow mining: Discovering process models from event logs,” IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 9, pp. 1128–1142, 2004.
  20. A. J. Weijters, W. M. van der Aalst, and A. de Medeiros, “Process mining with the heuristics miner algorithm,” in Technical Report WP, vol. 166, 2003, pp. 1–34.
  21. S. J. van Zelst and W. M. van der Aalst, “Log skeletons: A classification approach to process discovery,” in International Conference on Advanced Information Systems Engineering (CAiSE). Springer, 2018, pp. 309–324.
  22. A. Rozinat and W. M. van der Aalst, “Conformance checking of processes based on monitoring real behavior,” Information Systems, vol. 33, no. 1, pp. 64–95, 2008.
  23. A. Ilyas, J. M. da Trindade, R. Castro Fernandez, and S. Madden, “Extracting syntactical patterns from databases,” in Proceedings of the 2018 International Conference on Management of Data (SIGMOD). ACM, 2018, pp. 1773–1788.
  24. A. Burattin and W. M. van der Aalst, “PLG2: Multiperspective process randomization with online and offline simulations,” in International Conference on Business Process Management. Springer, 2015, pp. 214–219.