| CPC G06F 12/0862 (2013.01) | 20 Claims |

|
1. A Data Storage Device (DSD), comprising:
a first memory configured to store a plurality of reference files that have been used to derive a plurality of vector embeddings in a vector database stored in the DSD; and
circuitry configured to:
receive a query vector embedding from a host;
perform an Approximate Nearest Neighbor (ANN) search of the vector database to identify one or more vector embeddings that are close to the query vector embedding;
send the one or more vector embeddings to the host;
identify one or more reference files from which the one or more vector embeddings were derived using vector metadata for the one or more vector embeddings; and
store the one or more reference files in a second memory of the host or of the DSD, the second memory configured to provide access to data faster than the first memory.
|