CPC G06F 16/116 (2019.01) | 6 Claims |
1. A computer-implemented method comprising:
receiving, by one or more processors of a computing system, at least two diverse log files each having varying structural characteristics, wherein each log file of the at least two diverse log files comprises at least one log entry with at least one time stamp and at least one message, and the at least two diverse log files differ from one another with respect to at least one distinctive criteria;
extracting, by the one or more processors, file paths of each log file to identify a computing unit which generated the log file, a program which generated the log file, and configuration information of the computing unit which generated the log file;
clustering, by the one or more processors, the at least one message of each log file for extracting a content of the at least one message, and for extracting invariable parts and variable parts of the at least one message to determine a log entry template;
combining, by the one or more processors, each log file of the at least two diverse log files with the at least one time stamp, the content of the at least one message, the computing unit which generated the log file, the program which generated the log file, the configuration information of the computing unit which generated the log file, and the log entry template into at least two processed log files, wherein the at least two processed log files comply with a coherent representation;
loading, by the one or more processors, the coherent representation of the at least two processed log files into a knowledge graph such that the at least two diverse log files are transformed into the knowledge graph; and
performing a combination of statistical and knowledge graph analytics using the knowledge graph for diagnosis and repair of problems in an industrial environment.
|