US 11,656,926 B1
Systems and methods for automatically applying configuration changes to computing clusters
Pratap Dande, Saint Johns, FL (US); Vinit Shah, Indian Trail, NC (US); Sarveshwar Kuncha, Frisco, TX (US); and Jedidiah Wallace, Charlotte, NC (US)
Assigned to Bank of America Corporation, Charlotte, NC (US)
Filed by BANK OF AMERICA CORPORATION, Charlotte, NC (US)
Filed on Jan. 26, 2022, as Appl. No. 17/585,028.
Int. Cl. G06F 11/00 (2006.01); G06F 11/07 (2006.01); G06F 11/34 (2006.01)
CPC G06F 11/0709 (2013.01) [G06F 11/0793 (2013.01); G06F 11/3476 (2013.01)]
17 Claims
 
1. A system comprising:
a computing cluster comprising a plurality of computing nodes configured to perform parallel computations;
a memory configured to store a database of historical configuration errors; and
a processor communicatively coupled to the memory, the processor configured to:
periodically retrieve one or more configuration logs from the computing cluster, each configuration log comprising a plurality of log messages associated with a plurality of services running on the computing cluster;
determine, by analyzing the one or more configuration logs, a particular service running on the computing cluster that has generated a plurality of errors within the plurality of log messages;
determine, by comparing a particular error of the plurality of errors generated by the particular service to a plurality of historical configuration errors in the database of historical configuration errors, whether the particular error has previously occurred;
in response to determining that the particular error has previously occurred, generate and send one or more commands to the computing cluster, the one or more commands operable to change a current configuration value for the particular service running on the computing cluster to a new configuration value, wherein the new configuration value is based on a historical value stored in the database of historical configuration errors; and
in response to determining that the particular error has not previously occurred, generate and send one or more other commands to the computing cluster, the one or more other commands operable to change the current configuration value for the particular service running on the computing cluster to a previous configuration value for the particular service running on the computing cluster.
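The decision logic recited in claim 1 can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation. The names `LogMessage`, `Command`, `noisiest_service`, and `plan_command` are hypothetical. So are the log fields, the use of exact error text as the lookup key into the historical database, and the threshold of two errors standing in for "a plurality of errors". Periodic retrieval of logs and delivery of commands to the cluster are left out; the sketch covers only the choice between applying a historical value and reverting to a previous value.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class LogMessage:
    service: str  # service that emitted the message (hypothetical field)
    level: str    # severity, e.g. "INFO" or "ERROR" (hypothetical field)
    text: str     # message text, used here as the error lookup key


@dataclass(frozen=True)
class Command:
    service: str  # service whose configuration should change
    key: str      # configuration key to set
    value: str    # new value for that key


def noisiest_service(messages: List[LogMessage], min_errors: int = 2) -> Optional[str]:
    """Return the service with the most ERROR messages, or None.

    Assumption: "a plurality of errors" is read as at least two.
    """
    counts = Counter(m.service for m in messages if m.level == "ERROR")
    if not counts:
        return None
    service, n = counts.most_common(1)[0]
    return service if n >= min_errors else None


def plan_command(
    messages: List[LogMessage],
    history: Dict[str, Tuple[str, str]],          # error text -> (key, historical value)
    previous_config: Dict[str, Tuple[str, str]],  # service -> (key, previous value)
) -> Optional[Command]:
    """Choose the configuration change for the most error-prone service."""
    service = noisiest_service(messages)
    if service is None:
        return None  # no service produced enough errors to act on

    # Take the first error from that service as the "particular error".
    error = next(
        m.text for m in messages if m.service == service and m.level == "ERROR"
    )

    if error in history:
        # Error has occurred before: apply the historical value.
        key, value = history[error]
    else:
        # Error is new: revert to the service's previous configuration value.
        key, value = previous_config[service]
    return Command(service, key, value)
```

With a hypothetical `history` of `{"container exceeded memory limit": ("memory_mb", "8192")}`, two matching errors from a service named `yarn` would yield a command that sets `memory_mb` to `8192`. With an empty `history`, the same logs would yield a command that restores the value recorded in `previous_config` for `yarn`.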