CPC G06F 3/061 (2013.01) [G06F 3/0605 (2013.01); G06F 3/0608 (2013.01); G06F 3/0641 (2013.01); G06F 3/0653 (2013.01); G06F 3/067 (2013.01); G06F 16/125 (2019.01); G06F 18/214 (2023.01); G06N 20/00 (2019.01); G06V 30/418 (2022.01)] | 20 Claims |
1. A networked information management system configured to perform deduplication operations in accordance with a machine learning model output, the networked information management system comprising:
one or more computing devices configured to:
access deduplication performance data,
wherein the deduplication performance data is associated with one or more secondary copy operations performed for primary data stored at one or more client computing devices,
wherein the primary data is assigned to a first information management policy,
wherein the first information management policy comprises a first set of parameters or settings for performing deduplication operations during secondary copy jobs of data assigned to the first information management policy,
wherein the deduplication performance data comprises data indicating an amount of the primary data that is deduplicated during the one or more secondary copy operations;
access secondary copy operations metadata associated with the one or more secondary copy operations;
provide the deduplication performance data and the secondary copy operations metadata to a machine learning model for evaluating deduplication performance;
using output of the machine learning model, determine that deduplication performance associated with the primary data would be improved under one or more alternate settings or parameters for performing deduplication operations; and
in response to that determination:
assign a second information management policy to the primary data from the one or more client computing devices,
wherein the second information management policy comprises a second set of parameters or settings for performing deduplication operations during secondary copy jobs of data assigned to the second information management policy,
wherein the second set of parameters or settings for performing deduplication operations is different from the first set of parameters or settings for performing deduplication operations.
|
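The flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the claimed implementation: the names `DedupPolicy`, `CopyJobRecord`, `dedup_ratio`, `toy_model`, and `select_policy`, the `block_size_kb` setting, the `data_type` metadata key, and the `min_gain` threshold are all hypothetical, and `toy_model` is a hand-written heuristic standing in for a trained machine learning model.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class DedupPolicy:
    """Hypothetical information management policy with one deduplication setting."""
    name: str
    block_size_kb: int


@dataclass
class CopyJobRecord:
    """Hypothetical record of one secondary copy operation."""
    bytes_scanned: int        # amount of primary data processed by the job
    bytes_deduplicated: int   # amount of that data that was deduplicated
    metadata: Dict[str, str]  # secondary copy operations metadata (assumed keys)


# Any callable that predicts a deduplication ratio for the given jobs under a policy.
Model = Callable[[List[CopyJobRecord], DedupPolicy], float]


def dedup_ratio(jobs: List[CopyJobRecord]) -> float:
    """Observed fraction of scanned primary data that was deduplicated."""
    total = sum(job.bytes_scanned for job in jobs)
    if total == 0:
        return 0.0
    return sum(job.bytes_deduplicated for job in jobs) / total


def toy_model(jobs: List[CopyJobRecord], policy: DedupPolicy) -> float:
    """Stand-in for a trained model. Assumes (for illustration only) that
    blocks of 64 KB or less raise the ratio for database data."""
    db_jobs = sum(1 for job in jobs if job.metadata.get("data_type") == "database")
    db_share = db_jobs / max(len(jobs), 1)
    bonus = 0.2 * db_share if policy.block_size_kb <= 64 else 0.0
    return min(1.0, dedup_ratio(jobs) + bonus)


def select_policy(
    jobs: List[CopyJobRecord],
    current: DedupPolicy,
    candidates: List[DedupPolicy],
    model: Model,
    min_gain: float = 0.05,
) -> DedupPolicy:
    """Return the policy to assign: the best candidate whose predicted ratio
    beats the observed ratio by at least min_gain, otherwise the current one."""
    baseline = dedup_ratio(jobs)
    best, best_score = current, baseline
    for candidate in candidates:
        if candidate == current:
            continue
        score = model(jobs, candidate)
        if score - baseline >= min_gain and score > best_score:
            best, best_score = candidate, score
    return best
```

In this sketch, reassigning the primary data to a second policy amounts to storing the value returned by `select_policy`; a returned policy that differs from `current` necessarily carries different deduplication settings, mirroring the final wherein clause.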