US 12,032,536 B2
Efficient method to optimize distributed segment processing mechanism in dedupe systems by leveraging the locality principle
Kalyan C. Gunda, Bangalore (IN); and Jagannathdas Rath, Karnataka (IN)
Assigned to EMC IP HOLDING COMPANY LLC, Hopkinton, MA (US)
Filed by EMC IP Holding Company LLC, Hopkinton, MA (US)
Filed on Mar. 3, 2021, as Appl. No. 17/191,403.
Prior Publication US 2022/0283997 A1, Sep. 8, 2022
Int. Cl. G06F 16/215 (2019.01); G06F 16/23 (2019.01)
CPC G06F 16/215 (2019.01) [G06F 16/2379 (2019.01)] 16 Claims
OG exemplary drawing
 
1. A method, comprising:
receiving at a dedupe system, from a client, a request that comprises a set of fingerprints, where each fingerprint in the set corresponds to a particular data segment;
filtering, at the dedupe system, the set of fingerprints into a set of unique fingerprints and a set of non-unique fingerprints;
determining a container in a set of containers in the dedupe system where copies of the non-unique fingerprints are stored;
reading, at the dedupe system, from the container, an additional set of non-unique fingerprints, which exist in the container and are additional to the set of non-unique fingerprints, wherein the fingerprints in the additional set of non-unique fingerprints are selected based on a likelihood that those fingerprints would be requested in a future request by the client;
sending, from the dedupe system to the client in response to the request comprising the set of fingerprints, a single response that comprises both the set of unique fingerprints and the additional set of non-unique fingerprints, wherein a number of fingerprints in the single response is predetermined; and
receiving from the client, at the dedupe system, data segments that respectively correspond to the unique fingerprints in the set of unique fingerprints, but no data segments corresponding to the non-unique fingerprints in the set of non-unique fingerprints are received by the dedupe system from the client.