US 12,235,803 B1
Vector embedding compression
Benjamin Kotvis, Sussex, WI (US)
Assigned to Insight Direct USA, Inc., Chandler, AZ (US)
Filed by Insight Direct USA, Inc., Chandler, AZ (US)
Filed on Apr. 10, 2024, as Appl. No. 18/631,541.
Int. Cl. G06F 16/00 (2019.01); G06F 16/174 (2019.01); G06F 16/22 (2019.01); G06F 16/2458 (2019.01)
CPC G06F 16/1744 (2019.01) [G06F 16/2237 (2019.01); G06F 16/2477 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method of database operations, the method comprising:
receiving, by a processor, a user query;
generating, by the processor, a query vector embedding representative of the user query;
querying, by the processor, a vector database using the query vector embedding, wherein:
the vector database comprises a plurality of database vectors representative of a plurality of data files;
each database vector is representative of one data file of the plurality of data files,
each data file of the plurality of data files belongs to one time-series data set of a plurality of time-series data sets, and
each data file of the plurality of data files corresponds to a first time;
retrieving, by the processor, a first database vector of the plurality of database vectors based on the query, the first database vector representative of a first data file belonging to a first time-series data set of the plurality of time-series data sets;
receiving, by the processor, a first plurality of delta encodings describing differences between vector representations of temporally-adjacent data files of the first time-series data set;
identifying, by the processor and based on the first plurality of delta encodings, a second data file of the first time-series data set having a second vector representation that differs from the first database vector, the second data file corresponding to a second time; and
retrieving, by the processor, the second data file of the first time-series data set from a database.