US 12,067,242 B2
Performing secondary copy operations based on deduplication performance
Bhavyan Bharatkumar Mehta, Mumbai (IN); Anand Vibhor, Manalapan, NJ (US); and Niteen Jain, Maharashtra (IN)
Assigned to Commvault Systems, Inc., Tinton Falls, NJ (US)
Filed by Commvault Systems, Inc., Tinton Falls, NJ (US)
Filed on May 22, 2023, as Appl. No. 18/200,514.
Application 18/200,514 is a continuation of application No. 16/358,404, filed on Mar. 19, 2019, granted, now 11,698,727.
Claims priority of provisional application 62/779,991, filed on Dec. 14, 2018.
Prior Publication US 2023/0376204 A1, Nov. 23, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/11 (2019.01); G06F 3/06 (2006.01); G06F 18/214 (2023.01); G06N 20/00 (2019.01); G06V 30/418 (2022.01)
CPC G06F 3/061 (2013.01) [G06F 3/0605 (2013.01); G06F 3/0608 (2013.01); G06F 3/0641 (2013.01); G06F 3/0653 (2013.01); G06F 3/067 (2013.01); G06F 16/125 (2019.01); G06F 18/214 (2023.01); G06N 20/00 (2019.01); G06V 30/418 (2022.01)] 20 Claims
OG exemplary drawing
 
1. A networked information management system configured to perform deduplication operations in accordance with a machine learning model output, the networked information management system comprising:
one or more computing devices configured to:
access a deduplication performance data,
wherein the deduplication performance data is associated with one or more secondary copy operations performed for primary data stored at one or more client computing devices,
wherein the primary data is assigned to a first information management policy,
wherein the first information management policy comprises a first set of parameters or settings for performing deduplication operations during secondary copy jobs of data assigned to the first information management policy,
wherein the deduplication performance data comprises data indicating an amount of the primary data that is deduplicated during the one or more secondary copy operations;
access a secondary copy operations metadata associated with the one or more secondary copy operations;
provide the deduplication performance data and the secondary copy operations metadata to a machine learning model for evaluating deduplication performance;
using output of the machine learning model, determine that deduplication performance associated with the primary data would be improved under, at least one or more, alternate settings or parameters for performing deduplication operations; and
in response to that determination:
assign a second information management policy to the primary data from the one or more client computing devices,
wherein the second information management policy comprises a second set of parameters or settings for performing deduplication operations during secondary copy jobs of data assigned to the second information management policy,
wherein the second set of parameters or settings for performing deduplication operations is different from the first set of parameters or settings for performing deduplication operation.