US 12,242,498 B2
Storing unstructured data in a structured framework
Debra J. Benjamin-Deckert, San Jose, CA (US); Neal E. Bohling, San Jose, CA (US); Elaine Lai, San Jose, CA (US); Lawrence L. Law, Saratoga, CA (US); Brian Lee, San Jose, CA (US); Terri A. Menendez, Richmond, MA (US); Gary Pizl, San Jose, CA (US); Roity Prieto Perez, San Jose, CA (US); and Tony Xu, Sunnyvale, CA (US)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Dec. 12, 2017, as Appl. No. 15/839,644.
Prior Publication US 2019/0179948 A1, Jun. 13, 2019
Int. Cl. G06F 16/25 (2019.01); G06F 16/2453 (2019.01); G06F 16/33 (2019.01); G06F 16/3332 (2025.01); G06F 16/81 (2019.01); G06F 16/84 (2019.01)
CPC G06F 16/258 (2019.01) [G06F 16/2453 (2019.01); G06F 16/3332 (2019.01); G06F 16/81 (2019.01); G06F 16/86 (2019.01)] 25 Claims
OG exemplary drawing
 
1. A method, comprising:
receiving an unstructured data record;
obtaining a first key pair, comprising a primary key-name associated with a first key-value;
hashing at least the first key-value to a predefined fixed length to obtain a hash value;
combining the hash value with metadata corresponding to the unstructured data record and the unstructured data record to produce a hashed identifier (ID) having the hash value, metadata following the hash value in the hashed ID, and data in the unstructured data record following the metadata in the hashed ID;
storing the hashed ID as a modified data record in a structured database;
updating an index of the structured database to include an entry for the modified data record, the entry including the hash value;
receiving a request to access the unstructured data record, the request including at least the first key-value;
hashing the received first key-value to create a second hash value;
querying the index of the structured database using the second hash value to determine a location of the modified data record in the structured database; and
outputting a portion of the modified data record that comprises the unstructured data record in response to receiving the request to access the unstructured data record.