US 12,443,534 B2
Reference file management for artificial intelligence models
Chao Sun, San Jose, CA (US)
Assigned to Sandisk Technologies, Inc., Milpitas, CA (US)
Filed by Sandisk Technologies, Inc., Milpitas, CA (US)
Filed on Feb. 13, 2024, as Appl. No. 18/440,263.
Prior Publication US 2025/0258774 A1, Aug. 14, 2025
Int. Cl. G06F 12/08 (2016.01); G06F 12/0862 (2016.01)
CPC G06F 12/0862 (2013.01) 20 Claims
OG exemplary drawing
 
1. A Data Storage Device (DSD), comprising:
a first memory configured to store a plurality of reference files that have been used to derive a plurality of vector embeddings in a vector database stored in the DSD; and
circuitry configured to:
receive a query vector embedding from a host;
perform an Approximate Nearest Neighbor (ANN) search of the vector database to identify one or more vector embeddings that are close to the query vector embedding;
send the one or more vector embeddings to the host;
identify one or more reference files from which the one or more vector embeddings were derived using vector metadata for the one or more vector embeddings; and
store the one or more reference files in a second memory of the host or of the DSD, the second memory configured to provide access to data faster than the first memory.