US 12,008,418 B2
Automated causal analysis of issues affecting workloads executing in an information technology infrastructure
Vindhya Gangaraju, Davanagere (IN); Vinay Sawal, Fremont, CA (US); and Jayanth Kumar Reddy Perneti, Bangalore (IN)
Assigned to Dell Products L.P., Round Rock, TX (US)
Filed by Dell Products L.P., Round Rock, TX (US)
Filed on Aug. 31, 2021, as Appl. No. 17/462,710.
Prior Publication US 2023/0066843 A1, Mar. 2, 2023
Int. Cl. G06F 9/50 (2006.01); G06F 11/14 (2006.01)
CPC G06F 9/5083 (2013.01) [G06F 11/1448 (2013.01); G06F 2201/805 (2013.01)] 20 Claims
OG exemplary drawing
 
1. An apparatus comprising:
at least one processing device comprising a processor coupled to a memory;
the at least one processing device being configured:
to execute a first set of a plurality of workloads on a set of storage systems of an information technology infrastructure, wherein executing the first set of the plurality of workloads produces a plurality of workload traces characterizing resource execution metrics associated with the execution of the first set of the plurality of workloads, wherein one of the resource execution metrics comprises an execution time of at least a subset of the first set of the plurality of workloads;
responsive to an occurrence of one or more issues causing the execution time of the subset of the first set of the plurality of workloads to exceed an expected execution time, to at least one of (i) expand at least one storage system in the set of storage systems of the information technology infrastructure, and (ii) deploy one or more additional storage systems in the set of storage systems of the information technology infrastructure; and
to execute a second set of the plurality of workloads on the set of storage systems in the information technology infrastructure;
wherein the occurrence of the one or more issues is based at least in part on one or more clusters of causal topics that comprise at least one causal topic that is part of a first set of causal topics and a second set of causal topics, the first set of causal topics being based at least in part on a first set of recurring patterns of relationships between different ones of the resource execution metrics in a first set of workload causality profiles, the first set of workload causality profiles being based at least in part on a dependence measurement model and the plurality of workload traces, the second set of causal topics being based at least in part on a second set of recurring patterns of relationships between different ones of the resource execution metrics in a second set of workload causality profiles, and the second set of workload causality profiles being based at least in part on a causal model and the plurality of workload traces.