Journal of South China University of Technology (Natural Science Edition) ›› 2017, Vol. 45 ›› Issue (1): 112-117.doi: 10.3969/j.issn.1000-565X.2017.01.016

• Computer Science & Technology • Previous Articles     Next Articles

Improved Log Data-Merging Method for Process Mining

XU Yang LIN Qi LI Dong   

  1. School of Software Engineering,South China University of Technology,Guangzhou 510006,Guangdong,China
  • Received:2016-03-02 Revised:2016-10-12 Online:2017-01-25 Published:2016-12-01
  • Contact: 李东( 1970-) ,男,教授,博士生导师,主要从事数据库、移动计算、大数据技术研究 E-mail:cslidong@scut.edu.cn
  • About author:徐杨( 1970-) ,男,博士,讲师,主要从事业务流程建模、并行与分布式计算研究.E-mail: xuyang@ scut.edu.cn
  • Supported by:
    Supported by the National Natural Science Foundation of China( 71090403) and the Science and Technology Planning Projects of Guangdong Province( 2014B090901001, 2015B010103002, 2016B090918062)

Abstract:

The existing process mining techniques and tools are on the basis of a single log file.In actual business process environment,however,a business process may be supported by different computer systems,so that actual process data will be recorded into multiple log files.Therefore,it is necessary to merge the multiple recorded data into one log file for further global process mining and analysis.In this paper,an automatic method is proposed to merge event logs by combining an artificial immune algorithm and simulated annealing.In the method,on the basis of the characteristics of the process logs of multiple IT systems,two operators,namely,the occurrence frequency of activity sequences and the time overlap area between mergeable cases,are taken into account in an affinity function,so as to improve the accuracy of matching cases and the practicality of the proposed method.Moreover,the simulated annealing selection is introduced into the evolution of populations so as to solve the problems of the premature and continuous degradation of artificial immune algorithm,and the immunological memory is also introduced to preserve the diversity of populations and avoid their local convergence.Experiment results show that the proposed method achieves a merging success rate of more than 90%,and it can ensure that process mining results are correct,and that,as compared with the traditional log data-merging method on the basis of artificial immunity,the proposed method speeds up convergence significantly and increases merging efficiency.

Key words: process mining, log data merging, artificial immune algorithm, simulated annealing