CPC G06F 11/0793 (2013.01) [G06F 11/0721 (2013.01)] | 18 Claims |
1. A computer-implemented method for self-healing of clusters in a container orchestration system, the method being executed by one or more processors and comprising:
receiving, by a self-healing platform within the container orchestration system, fault data that is representative of two or more error events occurring within a cluster provisioned within the container orchestration system;
determining, by the self-healing platform, a set of actions to be executed in response to the two or more error events;
providing, by the self-healing platform, a priority value for each error event of the two or more error events; and
transmitting, by the self-healing platform, instructions to execute actions in the set of actions based on respective priority values of the two or more error events,
wherein execution of actions in the set of actions comprises draining a faulty node, providing a new node, and configuring the new node, wherein the draining comprises:
determining the faulty node based on the priority values of the two or more error events;
issuing instructions to the faulty node to execute draining based on the priority values; and
providing instruction for new node creation based on the issued instructions.
|