US 12,253,973 B1
Intelligent information retrieval system and method
Shailesh Gavankar, Princeton, NJ (US); Afrid Mondal, Karnataka (IN); Keon Park, Yonkers, NY (US); Sanket Jain, Karnataka (IN); and Abhijit Naik, Maharashtra (IN)
Assigned to Morgan Stanley Services Group Inc., New York, NY (US)
Filed by Morgan Stanley Services Group Inc., New York, NY (US)
Filed on Aug. 21, 2024, as Appl. No. 18/811,016.
Int. Cl. G06F 16/14 (2019.01)
CPC G06F 16/152 (2019.01)
20 Claims
1. A computer-implemented method for information retrieval via artificial intelligence, the method comprising:
accessing, by at least one computing device configured by executing instructions stored on non-transitory processor readable media, content stored in documents in at least one database;
identifying, by the at least one computing device, contextual chunks of individual ones of the documents, wherein the contextual chunks include portions of content stored in the individual ones of the documents;
generating, by the at least one computing device, embeddings associated with the contextual chunks;
storing, by the at least one computing device, the embeddings in a vector database;
identifying, by the at least one computing device as a function of natural language processing, a plurality of relationships among at least some of the contextual chunks;
generating, by the at least one computing device, relational embeddings using the plurality of identified relationships;
storing, by the at least one computing device, the relational embeddings in a database;
receiving, by the at least one computing device from at least one other computing device, a query for information associated with at least some of the content;
generating, by the at least one computing device, an embedding representing the query;
transmitting, by the at least one computing device to at least one large language model, the embedding representing the query;
receiving, by the at least one computing device from the at least one large language model in response to the embedding representing the query, a response including at least some of the contextual chunks; and
transmitting, by the at least one computing device, the response to a computing device associated with the query.
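
Illustrative sketch (not part of the claim). The steps recited in claim 1 can be pictured as a small pipeline: chunk, embed, store, relate, then query. The Python below is a minimal sketch under loud assumptions and is not the patented implementation. Every name in it (chunk, embed, build_index, stub_llm, answer, DOCS, DIM) is hypothetical. Paragraph splitting stands in for identifying contextual chunks, a hashed bag-of-words vector stands in for a real embedding model, shared keywords stand in for relationships identified by natural language processing, plain dictionaries stand in for the vector database and the relational database, and a local ranking function stands in for the large language model.

```python
import math
import re
import zlib
from itertools import combinations

DIM = 256  # assumed embedding width for this toy example


def chunk(doc: str) -> list[str]:
    """Split a document into paragraphs (assumed stand-in for 'contextual chunks')."""
    return [p.strip() for p in doc.split("\n\n") if p.strip()]


def tokens(text: str) -> list[str]:
    return re.findall(r"[a-z0-9]+", text.lower())


def keywords(text: str) -> set[str]:
    """Tokens longer than three characters; a crude proxy for meaningful terms."""
    return {t for t in tokens(text) if len(t) > 3}


def embed(text: str) -> list[float]:
    """Toy hashed bag-of-words vector, L2-normalised; stands in for a real model."""
    vec = [0.0] * DIM
    for tok in tokens(text):
        vec[zlib.crc32(tok.encode()) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def cosine(a: list[float], b: list[float]) -> float:
    # Inputs are already unit length, so the dot product is the cosine.
    return sum(x * y for x, y in zip(a, b))


def build_index(docs: list[str]):
    """Chunk the documents, embed the chunks, and derive relational embeddings."""
    chunks = [c for d in docs for c in chunk(d)]
    vector_db = {i: embed(c) for i, c in enumerate(chunks)}
    relational_db = {}
    for i, j in combinations(range(len(chunks)), 2):
        # Assumed relationship rule: two chunks share at least two keywords.
        if len(keywords(chunks[i]) & keywords(chunks[j])) >= 2:
            # Assumed relational embedding: mean of the two chunk vectors.
            relational_db[(i, j)] = [
                (a + b) / 2 for a, b in zip(vector_db[i], vector_db[j])
            ]
    return chunks, vector_db, relational_db


def stub_llm(query_vec, chunks, vector_db, relational_db, k=1):
    """Stand-in for the LLM: rank chunks by similarity, then add related chunks.

    Only the keys of relational_db are used here; the stored relational
    vectors are kept to mirror the storing step of the claim.
    """
    ranked = sorted(vector_db, key=lambda i: cosine(query_vec, vector_db[i]),
                    reverse=True)
    top = ranked[:k]
    related = []
    for t in top:
        for i, j in relational_db:
            if t == i:
                related.append(j)
            elif t == j:
                related.append(i)
    ids = list(dict.fromkeys(top + related))  # de-duplicate, keep order
    return [chunks[i] for i in ids]


def answer(query: str, index) -> list[str]:
    """Embed the query, hand the embedding to the stub, return its response."""
    return stub_llm(embed(query), *index)


# Made-up sample documents, for illustration only.
DOCS = [
    "Margin accounts require a minimum equity balance.\n\n"
    "When the equity balance falls below the minimum, a margin call is issued.",
    "Wire transfers settle within one business day.",
]
index = build_index(DOCS)
print(answer("What is the minimum equity balance for margin accounts?", index))
```

In this sketch the relational store lets a query that matches one chunk also pull in chunks linked to it, which is one plausible reading of why the claim stores relationships separately from the chunk embeddings. A production system would replace each stand-in with a real embedding model, vector database, and language model.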