US 11,868,937 B1
Automatic troubleshooting of clustered application infrastructure
Harry Culley, London (GB); Adam Fischer Mangum, Richmond, CA (US); and Bryan Nolan Seay, Cary, NC (US)
Assigned to Sysdig, Inc., San Francisco, CA (US)
Filed by Sysdig, Inc., San Francisco, CA (US)
Filed on Dec. 9, 2022, as Appl. No. 18/078,752.
Int. Cl. G06Q 10/0635 (2023.01); G06F 3/0483 (2013.01); G06F 3/0484 (2022.01); G06F 3/0482 (2013.01)
CPC G06Q 10/0635 (2013.01) [G06F 3/0482 (2013.01); G06F 3/0483 (2013.01); G06F 3/0484 (2013.01); G06F 2203/04803 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A computer-implemented method of troubleshooting issues occurring in a clustered application infrastructure, the method comprising:
collecting data from an execution environment including one or more hosts running a containerized application deployed onto the clustered application infrastructure, processing the collected data, and storing the processed data in at least one database;
performing, by a processor, queries on the at least one database to retrieve metrics about the clustered application infrastructure;
comparing, by the processor, the retrieved metrics against detection criteria for different categories of issues to detect issues occurring in the clustered application infrastructure;
determining, by the processor, for each issue category within a plurality of issue categories, one or more objects within the clustered application infrastructure that are affected by the issues of the issue category;
performing, by the processor, one or more queries on the at least one database to retrieve a set of data associated with the one or more affected objects and related to a set of remediation steps; and
generating, by the processor, a troubleshooting graphical user interface (GUI) to be presented on a user device, the troubleshooting GUI comprising a first area configured to present objects of the clustered application infrastructure as nodes of a hierarchy, and a second area configured to present the issue categories, wherein
each of the issue categories is associated with one or more levels of the hierarchy, and responsive to a selection of an issue category in the troubleshooting GUI, the troubleshooting GUI is updated to present the one or more objects within the clustered application infrastructure that are affected by the issues of the selected issue category.