| CPC G06F 16/2365 (2019.01) [G06F 11/1451 (2013.01); G06F 2201/84 (2013.01)] | 20 Claims |

|
1. A method for performing a corruption detection operation that includes read operations in a data protection system, comprising:
receiving backups into the data protection system configured to store the backups, wherein the backups are associated with corresponding lineages, wherein each of the lineages is associated with a sample frequency that is independent of a backup cadence;
selecting, by a detection engine, lineages subject to the corruption detection operation based on the sample frequencies of the lineages and based on a read capacity associated with the data protection system;
determining, by the detection engine, whether a logical size of the most recent backup in each of the selected lineages is less than or equal to the read capacity;
in response to determining that the logical size exceeds the read capacity, pruning one or more of the selected lineages based on a priority level associated with the lineage or a storage tier associated with the backup; and
performing, by the detection engine, the corruption detection operation on a most recent backup in each of the selected lineages according to the associated sample frequency, the corruption detection operation comprising reading the backup and generating statistics, inferences, or probabilities related to malware or other unwanted content.
|
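The selecting, determining, and pruning steps of claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation: the `Lineage` fields, the cycle-based reading of "sample frequency", the aggregate-size comparison against the read capacity, and the lowest-priority-first pruning order are all assumptions made for illustration. The `byte_entropy` helper is likewise a hypothetical stand-in for the "statistics" generated while reading a backup.

```python
import math
from collections import Counter
from dataclasses import dataclass


@dataclass
class Lineage:
    # All fields are hypothetical names chosen for this sketch.
    name: str
    sample_every: int        # scan once every N cycles (independent of backup cadence)
    priority: int            # higher value = more important to scan
    latest_backup_size: int  # logical size of the most recent backup, in bytes


def select_lineages(lineages, cycle, read_capacity):
    """Pick lineages due in this cycle, then prune until they fit the read capacity."""
    # Selecting: keep lineages whose sample frequency makes them due this cycle.
    due = [l for l in lineages if cycle % l.sample_every == 0]
    # Order by priority so that the least important lineage sits at the end.
    selected = sorted(due, key=lambda l: l.priority, reverse=True)
    # Determining: compare the aggregate logical size with the read capacity
    # (assumption: the claim's comparison is interpreted as an aggregate).
    total = sum(l.latest_backup_size for l in selected)
    # Pruning: drop the lowest-priority lineage until the reads fit.
    while selected and total > read_capacity:
        dropped = selected.pop()
        total -= dropped.latest_backup_size
    return selected


def byte_entropy(data: bytes) -> float:
    """Shannon entropy in bits per byte, one possible statistic from reading a backup."""
    if not data:
        return 0.0
    n = len(data)
    return -sum((c / n) * math.log2(c / n) for c in Counter(data).values())


lineages = [
    Lineage("db", sample_every=1, priority=3, latest_backup_size=60),
    Lineage("mail", sample_every=2, priority=1, latest_backup_size=50),
    Lineage("files", sample_every=1, priority=2, latest_backup_size=30),
]
chosen = select_lineages(lineages, cycle=2, read_capacity=100)
print([l.name for l in chosen])
```

In this example all three lineages are due in cycle 2, their combined logical size (140) exceeds the read capacity (100), and the lowest-priority lineage is pruned. Pruning by storage tier, which the claim names as an alternative criterion, would only change the sort key.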