Locally hosted conversational process mining for production data via graph-based retrieval-augmented generation (GraphRAG)

Shunfann, Wu and Cheng, Yee Low and Jingye, Yee and C., Nicolaj Stache and Tobias, Schmieg and Manurung, Yupiter H. P. (2025) Locally hosted conversational process mining for production data via graph-based retrieval-augmented generation (GraphRAG). Journal of Mechanical Engineering (JMechE), 14 (SI): 3. pp. 39-59. ISSN e-ISSN: 2550-164X

Official URL: https://jmeche.uitm.edu.my/

Identification Number (DOI): 10.24191/jmeche.v14i1.5757

Abstract

A secure, on-premises conversational process mining chatbot that enables manufacturing companies to analyze production event logs with natural language queries is presented in this paper. The system adopts a graph-based retrieval-augmented generation (GraphRAG) approach to process mining: PM4Py discovers the process from logs, which is converted into concise activity-, path-, and variant-level facts and stores them with the process graph in a graph database. A hybrid retriever and a lightweight cross-encoder reranker select focused evidence for a compact large language model (LLM), enabling accurate answers about flows, bottlenecks, and variants. A key contribution is the fully local, open-source design covering the embedding model, graph database, reranker, and LLM, to ensure the privacy of sensitive and confidential data. The architecture is detailed using the Active Structure methodology, and the deployment is demonstrated with the Analytics Canvas in a representative use case. The result is a practical, private way for manufacturers to ask questions of their data and act on the insights.

Metadata

Item Type: Article
Creators:
Creators
Email / ID Num.
Shunfann, Wu
UNSPECIFIED
Cheng, Yee Low
UNSPECIFIED
Jingye, Yee
UNSPECIFIED
C., Nicolaj Stache
UNSPECIFIED
Tobias, Schmieg
UNSPECIFIED
Manurung, Yupiter H. P.
UNSPECIFIED
Subjects: Q Science > QA Mathematics > Instruments and machines > Electronic Computers. Computer Science > Database management
T Technology > T Technology (General) > Industrial engineering. Management engineering > Applied mathematics. Quantitative methods > Operations research. Systems analysis
Divisions: Universiti Teknologi MARA, Shah Alam > College of Engineering
Journal or Publication Title: Journal of Mechanical Engineering (JMechE)
UiTM Journal Collections: UiTM Journals > Journal of Mechanical Engineering (JMechE)
ISSN: e-ISSN: 2550-164X
Volume: 14
Number: SI
Page Range: pp. 39-59
Keywords: Process mining, Large language models (LLMs), Graph-based retrieval-augmented generation (GraphRAG), Intelligent manufacturing, Conversational chatbot
Date: November 2025
URI: https://ir.uitm.edu.my/id/eprint/126913
Edit Item
Edit Item

Download

[thumbnail of 126913.pdf] Text
126913.pdf

Download (556kB)

ID Number

126913

Indexing

Altmetric
PlumX
Dimensions

Statistic

Statistic details