US 12,481,624 B2
System and methods for secure deduplication of compacted data
Joshua Cooper, Columbia, SC (US); and Aliasghar Riahi, Orinda, CA (US)
Assigned to ATOMBEAM TECHNOLOGIES INC., Moraga, CA (US)
Filed by AtomBeam Technologies Inc., Moraga, CA (US)
Filed on Aug. 16, 2023, as Appl. No. 18/450,402.
Application 18/450,402 is a continuation of application No. 17/578,476, filed on Jan. 19, 2022, granted, now 11,741,051.
Application 17/578,476 is a continuation in part of application No. 16/923,039, filed on Jul. 7, 2020, granted, now 11,232,076, issued on Jan. 25, 2022.
Application 16/923,039 is a continuation in part of application No. 16/716,098, filed on Dec. 16, 2019, granted, now 10,706,018, issued on Jul. 7, 2020.
Application 16/716,098 is a continuation of application No. 16/455,655, filed on Jun. 27, 2019, granted, now 10,509,771, issued on Dec. 17, 2019.
Application 16/455,655 is a continuation in part of application No. 16/200,466, filed on Nov. 26, 2018, granted, now 10,476,519, issued on Nov. 12, 2019.
Application 16/200,466 is a continuation in part of application No. 15/975,741, filed on May 9, 2018, granted, now 10,303,391, issued on May 28, 2019.
Claims priority of provisional application 63/027,166, filed on May 19, 2020.
Claims priority of provisional application 62/926,723, filed on Oct. 28, 2019.
Claims priority of provisional application 62/578,824, filed on Oct. 30, 2017.
Prior Publication US 2023/0401173 A1, Dec. 14, 2023
Int. Cl. G06F 16/174 (2019.01); G06F 3/06 (2006.01)
CPC G06F 16/1752 (2019.01) [G06F 3/0608 (2013.01); G06F 3/0641 (2013.01); G06F 3/067 (2013.01)] 6 Claims
OG exemplary drawing
 
1. A system for secure deduplication of compacted data, comprising:
at least one reference codebook comprising key-value pairs of data;
a library manager comprising at least a processor, a memory, and a plurality of programming instructions stored in the memory and operable on the processor of a computing device, wherein the plurality of programming instructions, when operating on the processor, cause the processor to:
receive a plurality of deconstructed sourceblocks from a data deconstruction engine;
perform secure data deduplication by comparing each of the plurality of deconstructed sourceblocks with sourceblocks already contained in the reference codebook, wherein:
the library manager uses machine learning algorithms to dynamically optimize sourceblock size based on data patterns and storage efficiency metrics;
access to both the reference codebook and a returned reference code is required to reconstruct to an original sourceblock; and
the reference codebook and the returned reference codes are stored separately from one another;
return the reference code to the data deconstruction engine, when the sourceblock received is a duplicate of an existing sourceblock in the reference codebook; and
for each received deconstructed sourceblock that is not present in the codebook:
create a new, unique reference code for the respective deconstructed sourceblock using machine learning algorithms that dynamically optimize reference code generation based on frequency analysis of previously stored souceblocks and predicted future data patterns;
store both the respective deconstructed sourceblock and the associated reference code in the reference codebook as a key-value pair; and
return the new reference code to the data deconstruction engine.