| CPC G06F 16/215 (2019.01) [G06F 16/1748 (2019.01); G06F 16/2471 (2019.01)] | 20 Claims |

|
1. A method, comprising:
in response to receiving a write request targeting a data unit, dividing the data unit into sub-units according to a sub-unit size;
determining, by a redirector, a number of deduplicator instances to instantiate for deduplicating the sub-units based upon a deduplication service policy specifying a threshold amount of data that can be processed by a single executing deduplicator instance;
creating a data unit manifest for the data unit with an indication of an order and count of the sub-units, wherein the data unit manifest is populated with paths to the sub-units according to a hierarchical namespace or a flat namespace where a path is a namespace identifier used to obtain data of a constituent sub-unit; and
requesting deduplication for the sub-units by the number of deduplicator instances using the paths within the data unit manifest.
|
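
The four steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. Every name in it (`Manifest`, `split`, `instance_count`, `build_manifest`, `handle_write`) is hypothetical, as are the path layout and the round-robin assignment of paths to deduplicator instances. The sketch assumes that the policy threshold is expressed in bytes and that the redirector takes the ceiling of the data unit size divided by that threshold.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Manifest:
    """Hypothetical data unit manifest: ordered sub-unit paths plus a count."""
    data_unit_id: str
    paths: list = field(default_factory=list)  # list order encodes sub-unit order

    @property
    def count(self) -> int:
        return len(self.paths)


def split(data: bytes, sub_unit_size: int) -> list:
    """Divide a data unit into sub-units of sub_unit_size (the last may be shorter)."""
    return [data[i:i + sub_unit_size] for i in range(0, len(data), sub_unit_size)]


def instance_count(total_bytes: int, threshold_bytes: int) -> int:
    """Assumed redirector rule: one instance per threshold's worth of data, minimum one."""
    return max(1, math.ceil(total_bytes / threshold_bytes))


def build_manifest(data_unit_id: str, n_sub_units: int, hierarchical: bool = True) -> Manifest:
    """Populate the manifest with one path per sub-unit.

    The separator is an assumption: "/" stands in for a hierarchical namespace,
    "_" for a flat one.
    """
    sep = "/" if hierarchical else "_"
    paths = [f"{data_unit_id}{sep}{i:06d}" for i in range(n_sub_units)]
    return Manifest(data_unit_id, paths)


def handle_write(data_unit_id: str, data: bytes, sub_unit_size: int, threshold_bytes: int):
    """Walk through the claimed steps for a single write request."""
    sub_units = split(data, sub_unit_size)
    n_instances = instance_count(len(data), threshold_bytes)
    manifest = build_manifest(data_unit_id, len(sub_units))
    # A dict stands in for the store that resolves a path to sub-unit data.
    store = dict(zip(manifest.paths, sub_units))
    # Assumed dispatch: hand the manifest paths to the instances round-robin.
    assignments = [manifest.paths[k::n_instances] for k in range(n_instances)]
    return manifest, store, assignments
```

For example, `handle_write("obj1", b"a" * 10, 4, 5)` produces three sub-units and two deduplicator instances. The first instance receives the paths of sub-units 0 and 2, the second the path of sub-unit 1. Because the manifest lists paths in order, the original data unit can be rebuilt by reading the paths in sequence.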