US 12,287,709 B2
System and method for managing a backup of a large set of files using a file system analysis for data stored in a network attached storage
Sunil Yadav, Bangalore (IN); and Shelesh Chopra, Bangalore (IN)
Assigned to Dell Products L.P., Round Rock, TX (US)
Filed by Dell Products L.P., Round Rock, TX (US)
Filed on Jul. 25, 2022, as Appl. No. 17/872,627.
Prior Publication US 2024/0028474 A1, Jan. 25, 2024
Int. Cl. G06F 11/14 (2006.01); G06F 3/06 (2006.01)
CPC G06F 11/1464 (2013.01) [G06F 11/1451 (2013.01); G06F 3/065 (2013.01)] 17 Claims
OG exemplary drawing
 
1. A method for managing data, the method comprising:
obtaining, by a universal connector, a backup request for a backup of a set of files,
wherein the set of files corresponds to file data comprising at least one million files,
wherein the universal connector comprises a library of protocols,
wherein the library of protocols comprises: network file system (NFS), server message block (SMB), and common internet file system (CIFS), and
wherein the backup request is formatted using a protocol of the library of protocols;
in response to the backup request:
performing a file system analysis on the set of files to determine an ordering of files to be accessed for the backup,
wherein the file system analysis comprises identifying parent folders of each of the files in the set of files,
wherein the file system analysis further comprises identifying changes in file dependencies after a previous backup and storing the changes in cache, and wherein the ordering of files is based on the parent folders of the files in the set of files and in the changes in file dependencies such that a number of times the parent folders are opened is minimized;
generating a set of file read requests based on the ordering of files;
sending a data access request based on the set of file read requests,
wherein the data access request is in a format readable to a network attached storage (NAS) server, and
wherein the backup request is not in a format readable to the NAS server;
obtaining a response from the NAS server, wherein the response comprises data associated with the set of files; and
generating the backup using the data associated with the set of files.