CPC G06F 9/5005 (2013.01) [G06F 11/2051 (2013.01); G06N 20/00 (2019.01); G06F 2201/87 (2013.01)] | 11 Claims |
1. A method for managing workloads assigned to nodes in a data center, comprising:
assigning a primary node among a cluster of nodes, wherein each of the nodes are Information Handling Systems (IHS) hosting a plurality of virtual machines (VM), and wherein each workload is assigned to run on a selected VM in the cluster, the primary node monitoring a status of the nodes in the cluster;
identifying, by the primary node, nodes within the cluster that are in a quarantine mode, wherein a node is placed in quarantine mode when a potential node failure is detected but the node is still operating;
determining a remaining warranty time for the nodes in quarantine status by retrieving warranty information for each component in the node from a database within a vendor warranty system;
determining an expected duration for a selected workload;
determining a criticality of the selected workload by retrieving a workload tag assigned to the workload by a user of the cluster;
assigning, by the primary node, the selected workload to a node in the cluster based upon the remaining warranty time, the expected workload duration, and the workload criticality to ensure that the selected workload is completed within an assigned node's expected lifetime; and
executing the selected workload on the assigned node in the cluster.
|