| CPC G06F 11/0793 (2013.01) [G06F 11/0787 (2013.01)] | 16 Claims |

|
1. A system for averting possible disruptions to at least one software service, comprising:
data storage containing logged alerts from a plurality of software components and hardware components working together to provide the at least one software service;
one or more processors; and
non-transitory memory storing instructions that, when executed by the one or more processors, cause the one or more processors to:
query the data storage to identify a prevalence of errors associated with a particular software component or hardware component;
wherein a Context Layer is executed to supplement errors identified from the query with upstream and downstream contextual metadata and generate a profile of a current state of the plurality of software components and hardware components;
fuzzy match the generated profile with a past profile, the past profile being associated with a desired automated triage action;
determine, based on the matched past profile, that the prevalence of errors indicate potential disruption to the at least one software service; and
perform the automated triage action to prevent the potential disruption, wherein the automated triage action is selected from among: recycling a server, shutting down a server, redirecting traffic away from a server, and switching over a rail in a multi-rail blue-green architecture.
|