| CPC G06F 16/215 (2019.01) [G06F 12/0804 (2013.01); G06F 16/2282 (2019.01); G06F 16/2322 (2019.01); G06F 16/235 (2019.01)] | 17 Claims |

|
1. A method for deduplication, comprising:
receiving, with one or more processors, a first request to write data, the first request including a first insert identifier uniquely identifying the data for determining whether the data is duplicate data;
comparing, with the one or more processors, the first insert identifier with other insert identifiers that have been stored in a table within a time window of predetermined duration moving relative to a current time, the other insert identifiers being stored in the table based on a timestamp associated with each of the other insert identifiers;
determining, with the one or more processors, that the data corresponding to the first insert identifier is not duplicate data based on the first insert identifier not being equivalent to any of the other insert identifiers;
storing, with the one or more processors, the first insert identifier with a timestamp in the table in response to determining that the data corresponding to the first insert identifier is not duplicate data; and
updating, with the one or more processors, the table to remove one or more insert identifiers of the other insert identifiers added before the time window of predetermined duration based on the timestamp associated with each of the other insert identifiers.
|