| CPC G06F 16/242 (2019.01) [G06F 16/24522 (2019.01); G06F 16/24578 (2019.01)] | 20 Claims |

|
1. A method comprising:
obtaining an input query at a retriever model, the retriever model comprising a reconfigurable sequence of one or more rankers selected from among a plurality of rankers, each ranker configured to identify a specified number of information chunks relevant to the input query;
providing one or more of the information chunks from the retriever model to a generative model;
using the generative model to create a response to the input query, the response based on the one or more information chunks; and
tuning the retriever model by determining the specified number of information chunks to be identified by each ranker in the reconfigurable sequence;
wherein the plurality of rankers comprises a bi-encoder, a cross-encoder, and a large language model (LLM)-ranker; and
wherein the specified number of information chunks to be identified by each ranker in the reconfigurable sequence is determined using a grid search.
|