US 12,292,989 B2
Dataset lifecycle management using monitoring and ACL control for content-based datasets
Adam Brenner, Mission Viejo, CA (US); Jehuda Shemer, Kfar Saba (IL); Steven Sadhwani, Round Rock, TX (US); Valerie Lotosh, Ramat-Gan (IL); and Erez Sharvit, Ramat-Gan (IL)
Assigned to Dell Products L.P., Round Rock, TX (US)
Filed by Dell Products L.P., Round Rock, TX (US)
Filed on Oct. 27, 2022, as Appl. No. 17/975,429.
Prior Publication US 2024/0143811 A1, May 2, 2024
Int. Cl. G06F 21/62 (2013.01); G06F 16/11 (2019.01)
CPC G06F 21/6218 (2013.01) [G06F 16/125 (2019.01); G06F 2221/2101 (2013.01); G06F 2221/2141 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method of managing a lifecycle of data processed through a plurality of stages in a system using content-based datasets, comprising:
identifying data objects of disparate file formats that are subject to same control rules in each stage of the lifecycle as grouped data, wherein the control rules provide access only to authorized users or perform only authorized operations on the grouped data based on a current stage of the lifecycle, and further wherein the data objects are protected by different data protection policies utilizing the control rules;
generating a dataset for the grouped data by scanning the data objects to identify metadata of the grouped data to be processed similarly within the lifecycle, and storing the identified metadata in the dataset, wherein the lifecycle includes a backup operation implementing the data protection policies;
iteratively processing the dataset to tag the data objects according to a native file format;
attaching multiple tags to the dataset to indicate that the data objects of the dataset are of different file types according to the disparate file formats;
merging the protection policies to back up the dataset under a merged protection policy;
associating the control rules to the grouped data as stage tags for the dataset;
monitoring actions performed on and by the data objects referenced by the dataset in each stage of the lifecycle; and
ensuring that the monitored actions comply with control rules using the stage tags of the dataset.