US 12,072,838 B2
Method for generating a coherent representation for at least two log files
Dmitriy Fradkin, Wayne, PA (US); André Scholz, Ansbach (DE); Matthias Loskyll, Neumarkt (DE); Georgia Olympia Brikis, Munich (DE); Rakebul Hasan, Munich (DE); Vladimir Lavrik, Hessen (DE); and Alexander Storl, Hetzles (DE)
Assigned to SIEMENS AKTIENGESELLSCHAFT, Munich (DE)
Appl. No. 17/635,203
Filed by Siemens Aktiengesellschaft, Munich (DE)
PCT Filed Aug. 20, 2020, PCT No. PCT/EP2020/073289
§ 371(c)(1), (2) Date Feb. 14, 2022,
PCT Pub. No. WO2021/032820, PCT Pub. Date Feb. 25, 2021.
Application 17/635,203 is a continuation of application No. 16/547,782, filed on Aug. 22, 2019, abandoned.
Prior Publication US 2022/0292053 A1, Sep. 15, 2022
Int. Cl. G06F 16/11 (2019.01)
CPC G06F 16/116 (2019.01) 6 Claims
OG exemplary drawing
 
1. A computer-implemented method comprising:
receiving, by one or more processors of a computing system, at least two diverse log files each having varying structural characteristics, wherein each log file of the at least two diverse log files comprises at least one log entry with at least one time stamp and at least one message, and the at least two diverse log files differ from one another with respect to at least one distinctive criteria;
extracting, by the one or more processors, file paths of each log file to identify a computing unit which generated the log file, a program which generated the log file, and configuration information of the computing unit which generated the log file;
clustering, by the one or more processors, the at least one message of each log file for extracting a content of the at least one message, and for extracting invariable parts and variable parts of the at least one message to determine a log entry template;
combining, by the one or more processors, each log file of the at least two diverse log files with the at least one time stamp, the content of the at least one message, the computing unit which generated the log file, the program which generated the log file, the configuration information of the computing unit which generated the log file, and the log entry template into at least two processed log files, wherein the at least two processed log files comply with a coherent representation;
loading, by the one or more processors, the coherent representation of the at least two processed log files into a knowledge graph such that the at least two diverse log files are transformed into the knowledge graph; and
performing a combination of statistical and knowledge graph analytics using the knowledge graph for diagnosis and repair of problems in an industrial environment.