US 12,265,507 B2
Contextually irrelevant file segmentation
Raghuveer Prasad Nagar, Bangalore (IN); Sarbajit K. Rakshit, Kolkata (IN); Manjit Singh Sodhi, Bangalore (IN); and Sidharth Ullal, Bangalore (IN)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Sep. 1, 2021, as Appl. No. 17/463,938.
Prior Publication US 2023/0067574 A1, Mar. 2, 2023
Int. Cl. G06F 15/16 (2006.01); G06F 16/11 (2019.01); G06F 16/14 (2019.01); G06F 16/16 (2019.01); G06F 16/182 (2019.01); G06F 21/60 (2013.01)
CPC G06F 16/182 (2019.01) [G06F 16/122 (2019.01); G06F 16/148 (2019.01); G06F 16/164 (2019.01); G06F 21/602 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
receiving an electronic file to be stored;
determining a first context for content of the electronic file to be stored, the first context indicating a nature or subject of the content of the electronic file so as to enable a determination of whether the content of the electronic file contains confidential information;
based on the determined first context of the content of the electronic file, determining whether the content of the electronic file contains confidential information;
splitting the electronic file into multiple segments such that the first context of the electronic file is not discernible from any segment;
determining at least one first segment of the segments that is to be more secure than others of the segments;
distributing the multiple segments amongst multiple remote storage devices, wherein the first segment is distributed to a remote storage device on a private network;
generating a metadata file to indicate how to assemble the multiple segments to re-construct the electronic file;
splitting the metadata file indicating how to assemble the multiple segments of the electronic file into multiple metadata segments and distributing the multiple metadata segments amongst the multiple remote storage devices such that a merge engine compiles multiple metadata segments from the multiple remote storage devices to reconstruct the electronic file, wherein splitting the metadata file comprises storing a graphical map indicating relations between different metadata segments to provide instructions for re-assembling the metadata file, the metadata file then providing instructions for re-assembling the electronic file; and
altering a splitting parameter of the electronic file based on a change to a security characteristic of the electronic file, altering the splitting parameter including updating metadata associated with the electronic file, the metadata associated with the electronic file used to identify a splitting arrangement and how to assemble the multiple segments.