| CPC G06F 9/45558 (2013.01) [G06F 2009/45575 (2013.01); G06F 2009/45591 (2013.01)] | 20 Claims |

|
10. A method of monitoring health of a group of virtual machine nodes, the method comprising:
determining a leader virtual machine (VM) count for the group of VM nodes, the group of VM nodes being hosted on a plurality of computing devices;
selecting a number of VM nodes from the group to serve as leader VMs for the group, the number of the VM nodes selected corresponding to the leader VM count;
periodically performing a peer VM monitoring process, the peer VM monitoring process including:
storing health information for each of the VM nodes of the group in a data store;
accessing the health information of the VM nodes to identify unhealthy VMs using the leader VMs; and
automatically performing a healing process on the unhealthy VMs to improve performance of the unhealthy VMs; and
periodically performing a leader selection process comprising:
determining a current number of leader VMs;
determining a difference between the current number of leader VMs and the leader VM count;
in response to the difference being other than zero, identifying a VM node from the group having a predetermined health level; and
assigning the identified VM node as one of the leader VMs.
|