US 12,366,980 B2
Handling configuration drift in backup storage systems
Jayanth K. Perneti, Bangalore (IN); Vinay Sawal, Fremont, CA (US); and Amitha Shetty, Bangalore (IN)
Assigned to EMC IP Holding Company LLC, Hopkinton, MA (US)
Filed by EMC IP Holding Company LLC, Hopkinton, MA (US)
Filed on Mar. 19, 2024, as Appl. No. 18/608,935.
Application 18/608,935 is a division of application No. 17/376,488, filed on Jul. 15, 2021, granted, now 11,966,601.
Claims priority of application No. 202111023664 (IN), filed on May 27, 2021.
Prior Publication US 2024/0220140 A1, Jul. 4, 2024
Int. Cl. G06F 3/06 (2006.01); G06F 16/334 (2025.01)
CPC G06F 3/0632 (2013.01) [G06F 3/0604 (2013.01); G06F 3/067 (2013.01); G06F 16/3344 (2019.01)]
10 Claims
 
1. A computer-implemented method of handling configuration drift in a data storage system having a plurality of devices, comprising:
obtaining a configuration tree for each device of the plurality of devices;
traversing all valid terminal paths of the configuration tree to create a list of unique sentences;
encoding, in an embedding step, every sentence as a vector in n-dimensional space;
determining distances between every pair of sentences to acquire a feature-level context of a respective sentence, and creating a probability distribution of all sentences in the n-dimensional space;
creating a probability distribution for each of a golden configuration of the storage system, and a current configuration of each device;
creating a distance matrix for each pair-wise distance between each point of the golden configuration and the current configurations;
identifying configuration drift using the distance matrices and using an optimal transport mechanism; and
normalizing and ranking identified configuration drifts along a scale to trigger a possible remedial action in any device having excessive configuration drift.
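
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and every name in it (terminal_paths, embed, distance_matrix, sinkhorn_cost, rank_drift) is hypothetical. It rests on several assumptions the claim does not state: configuration trees are nested dictionaries, the embedding is a toy normalised bag-of-tokens vector rather than a learned sentence encoder, each configuration's probability distribution is uniform over its unique sentences, and the optimal transport mechanism is approximated with entropy-regularised Sinkhorn iterations.

```python
import math
from collections import Counter

def terminal_paths(tree, prefix=()):
    """Yield one 'sentence' per root-to-leaf path of a nested-dict config tree."""
    if isinstance(tree, dict):
        for key in sorted(tree, key=str):
            yield from terminal_paths(tree[key], prefix + (str(key),))
    else:
        yield " ".join(prefix + (str(tree),))

def embed(sentence, vocab):
    """Toy embedding (assumption): L2-normalised token counts over a shared vocabulary."""
    counts = Counter(sentence.split())
    vec = [float(counts[token]) for token in vocab]
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def distance_matrix(xs, ys):
    """Pairwise Euclidean distances between two lists of vectors."""
    return [[math.dist(x, y) for y in ys] for x in xs]

def sinkhorn_cost(cost, reg=0.1, iters=200):
    """Approximate optimal transport cost between two uniform distributions."""
    n, m = len(cost), len(cost[0])
    a, b = [1.0 / n] * n, [1.0 / m] * m          # uniform marginals (assumption)
    kernel = [[math.exp(-c / reg) for c in row] for row in cost]
    u, v = [1.0] * n, [1.0] * m
    for _ in range(iters):
        u = [a[i] / sum(kernel[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [b[j] / sum(kernel[i][j] * u[i] for i in range(n)) for j in range(m)]
    # Transport plan entry is u[i] * kernel[i][j] * v[j]; weight it by the cost.
    return sum(u[i] * kernel[i][j] * v[j] * cost[i][j]
               for i in range(n) for j in range(m))

def rank_drift(golden, devices):
    """Return (device, drift) pairs scaled to [0, 1], largest drift first."""
    gold_sents = sorted(set(terminal_paths(golden)))
    dev_sents = {name: sorted(set(terminal_paths(cfg)))
                 for name, cfg in devices.items()}
    all_sents = gold_sents + [s for sents in dev_sents.values() for s in sents]
    vocab = sorted({token for s in all_sents for token in s.split()})
    gold_vecs = [embed(s, vocab) for s in gold_sents]
    raw = {}
    for name, sents in dev_sents.items():
        cost = distance_matrix(gold_vecs, [embed(s, vocab) for s in sents])
        raw[name] = sinkhorn_cost(cost)
    top = max(raw.values()) or 1.0               # normalise against the worst device
    return sorted(((name, score / top) for name, score in raw.items()),
                  key=lambda pair: pair[1], reverse=True)
```

Calling rank_drift(golden, devices) with a golden configuration tree and a dictionary mapping device names to their current trees returns the devices ordered from most to least drifted. A caller could then compare each normalised score against a threshold of its own choosing to decide whether remedial action is warranted; the claim does not specify such a threshold.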