CPC G06F 3/0632 (2013.01) [G06F 3/0604 (2013.01); G06F 3/067 (2013.01); G06F 16/3344 (2019.01)] | 10 Claims |
1. A computer-implemented method of handling configuration drift in a data storage system having a plurality of devices, comprising:
obtaining a configuration tree for each device of the plurality of devices;
traversing all valid terminal paths of the configuration tree to create a list of unique sentences;
encoding, in an embedding step, every sentence as a vector in n-dimensional space;
determining distances between every pair of sentences to acquire a feature-level context of a respective sentence, and creating a probability distribution of all sentences in the n-dimensional space;
creating a probability distribution for each of a golden configuration of the storage system, and a current configuration of each device;
creating a distance matrix for each pair-wise distance between each point of the golden configuration and the current configurations;
identifying configuration drift using the distance matrices and using an optimal transport mechanism; and
normalizing and ranking identified configuration drifts along a scale to trigger a possible remedial action in any device having excessive configuration drift.
|
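The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and everything it introduces is an assumption made for illustration: the nested-dict form of the configuration tree, the bag-of-words embedding used in place of a learned sentence encoder, the uniform probability distributions over sentences, the Sinkhorn (entropy-regularised) solver chosen as the optimal transport mechanism, and the hypothetical names `terminal_paths`, `embed`, `cost_matrix`, `sinkhorn_cost`, and `drift_scores`.

```python
import math

def terminal_paths(tree, prefix=()):
    """Yield one 'sentence' per root-to-leaf path of a nested-dict config tree."""
    if isinstance(tree, dict):
        for key in sorted(tree):
            yield from terminal_paths(tree[key], prefix + (str(key),))
    else:
        yield " ".join(prefix + (str(tree),))

def embed(sentence, index):
    """Assumed stand-in for the embedding step: L2-normalised bag-of-words."""
    vec = [0.0] * len(index)
    for token in sentence.split():
        vec[index[token]] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def cost_matrix(xs, ys):
    """Pair-wise Euclidean distances between golden and device sentence vectors."""
    return [[math.dist(x, y) for y in ys] for x in xs]

def sinkhorn_cost(a, b, cost, reg=0.1, iters=200):
    """Entropy-regularised optimal transport cost between histograms a and b."""
    K = [[math.exp(-c / reg) for c in row] for row in cost]
    rows, cols = range(len(a)), range(len(b))
    u, v = [1.0] * len(a), [1.0] * len(b)
    for _ in range(iters):
        u = [a[i] / sum(K[i][j] * v[j] for j in cols) for i in rows]
        v = [b[j] / sum(K[i][j] * u[i] for i in rows) for j in cols]
    # Transport plan is diag(u) K diag(v); return its total cost.
    return sum(u[i] * K[i][j] * v[j] * cost[i][j] for i in rows for j in cols)

def drift_scores(golden, devices, reg=0.1):
    """Return (device, normalised drift) pairs, largest drift first."""
    gold_s = list(dict.fromkeys(terminal_paths(golden)))  # unique sentences
    dev_s = {n: list(dict.fromkeys(terminal_paths(c))) for n, c in devices.items()}
    vocab = sorted({t for group in [gold_s, *dev_s.values()]
                    for s in group for t in s.split()})
    index = {tok: i for i, tok in enumerate(vocab)}
    gold_v = [embed(s, index) for s in gold_s]
    raw = {}
    for name, sentences in dev_s.items():
        dev_v = [embed(s, index) for s in sentences]
        a = [1.0 / len(gold_v)] * len(gold_v)  # assumed uniform distribution
        b = [1.0 / len(dev_v)] * len(dev_v)
        raw[name] = sinkhorn_cost(a, b, cost_matrix(gold_v, dev_v), reg)
    top = max(raw.values()) or 1.0  # normalise onto a 0..1 scale
    return sorted(((n, s / top) for n, s in raw.items()),
                  key=lambda pair: pair[1], reverse=True)

# Hypothetical golden configuration and two device configurations.
golden = {"nfs": {"enabled": True, "version": 4}, "raid": {"level": 6}}
devices = {
    "dev_a": {"nfs": {"enabled": True, "version": 4}, "raid": {"level": 6}},
    "dev_b": {"nfs": {"enabled": False, "version": 3}, "raid": {"level": 5}},
}
ranked = drift_scores(golden, devices)
for name, score in ranked:
    print(f"{name}: {score:.3f}")
```

In this sketch the device whose settings differ from the golden configuration ranks first, and a threshold on the normalised score (an assumed policy, not one stated in the claim) could decide which devices receive remedial action. The sketch assumes non-empty configurations. A production system would more plausibly use a trained sentence encoder and a dedicated optimal transport library.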