US 12,235,882 B1
Controlling execution of artificial intelligence pipelines for data retrieval through client applications
Ravish Chawla, Atlanta, GA (US); Roman Fedoruk, Cumming, GA (US); John Manton, Cumming, GA (US); Spencer Reagan, Marietta, GA (US); Gregory Roberts, Dunwoody, GA (US); and Erich Stuntebeck, Alpharetta, GA (US)
Assigned to Airia LLC, Alpharetta, GA (US)
Filed by Airia LLC, Alpharetta, GA (US)
Filed on Oct. 18, 2024, as Appl. No. 18/920,411.
Claims priority of provisional application 63/648,162, filed on May 15, 2024.
Int. Cl. G06F 16/33 (2019.01); G06F 16/28 (2019.01); G06F 16/334 (2025.01); G06F 16/338 (2019.01)
CPC G06F 16/3347 (2019.01) [G06F 16/285 (2019.01); G06F 16/338 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method for controlling execution of artificial intelligence (AI) pipelines for semantic data retrieval through client applications, comprising:
receiving, at an AI pipeline endpoint, an AI endpoint key and an AI pipeline input from a client application that executes on a user device, the user device being associated with a user;
causing an AI pipeline execution engine to perform stages including:
identifying a dataset associated with the AI pipeline input;
based on a management policy, determining whether access to the dataset is authorized for a user submitting the AI pipeline input;
causing, based on the AI pipeline input, generation of input vectors with an embedding model, wherein dataset vectors in a vector database were also generated with the embedding model, and wherein the dataset vectors correspond to data chunks of the identified dataset;
causing identification of similar vectors based on comparing the input vectors to the dataset vectors of the vector database;
identifying the data chunks that correspond to the identified similar vectors;
identifying a first AI model for use with the AI pipeline input based on an object selection rule;
generating prompts for use with the first AI model, wherein at least one of the generated prompts relates to formatting specific to the client application;
transmitting the generated prompts and the identified data chunks to the first AI model; and
receiving, from the first AI model, search results that correspond to the AI pipeline input;
adding a hyperlink to the search results, the hyperlink being generated based at least in part on a location of text that corresponds to the identified data chunks; and
causing the search results to be displayed on a user interface provided by the client application.