US 12,436,979 B1
Computing systems and methods for query expansion for use in information retrieval
Ilan Gofman, Toronto (CA); Jiapeng Wu, Toronto (CA); Jesse Cole Cresswell, Toronto (CA); Guangwei Yu, Toronto (CA); and Maksims Volkovs, Toronto (CA)
Assigned to The Toronto-Dominion Bank, Toronto (CA)
Filed by The Toronto-Dominion Bank, Toronto (CA)
Filed on Jul. 12, 2024, as Appl. No. 18/771,197.
Int. Cl. G06F 16/33 (2025.01); G06F 16/332 (2025.01); G06F 16/3332 (2025.01); G06F 16/334 (2025.01)
CPC G06F 16/3338 (2019.01) [G06F 16/332 (2019.01); G06F 16/3344 (2019.01)] 18 Claims
OG exemplary drawing
 
1. A system for performing query expansion, the system comprising:
a memory, a communication interface, and a processor operatively coupled to the memory and the communication interface;
the processor configured to:
for each document of a set of documents, use a large language model (LLM) to generate one or more synthetic queries related to the document;
select one or more synthetic queries related to a query;
dynamically generate an adaptive few-shot prompt to instruct the LLM to generate a response to the query, wherein the adaptive few-shot prompt comprises an example query-response pair for each of the selected one more synthetic queries;
provide the adaptive few-shot prompt to the LLM;
generate an amended query based on an output of the LLM in response to the adaptive few-shot prompt;
perform an information retrieval task on the set of documents based on the amended query using a zero-shot information retrieval system;
perform query expansion on the synthetic queries using a plurality of different query expansion methods to generate a plurality of amended synthetic queries for each synthetic query;
augment a training set used to train the zero-shot information retrieval system based on the amended synthetic queries; and
prior to performing a further information retrieval task, train the zero-shot information retrieval system using the augmented training set.