| CPC G06F 16/3329 (2019.01) [G06F 40/284 (2020.01)] | 30 Claims |

|
1. A system for answering queries using one or more families of large language models (h-LLMs) comprising:
a processor;
a communication device operable to transmit and receive digital communication;
one or more h-LLMs implemented as microservices in one or more cloud container environments, each h-LLM of the one or more h-LLMs being accessible via a cloud service application programming interface (API); and
a non-transitory computer-readable storage medium having stored thereon software that, when executed by the processor, is operable to
operate an input broker having a broker API operable to receive a user prompt from a user interface;
generate a plurality of derived prompts from the user prompt at the input broker;
transmit the plurality of derived prompts to the one or more h-LLMs via the cloud service API;
operate an output broker operable to receive a plurality of h-LLM results;
process the plurality of h-LLM results at the output broker to generate a result; and
transmit the result to the user interface via the broker API.
|
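
The data flow recited in claim 1 (user prompt, input broker, derived prompts, h-LLMs, output broker, result) can be illustrated with a minimal sketch. Everything named here is a hypothetical assumption for illustration and is not taken from the claim: the `InputBroker` and `OutputBroker` classes, the `derive_prompts` strategy, the `make_stub_hllm` helper, and the "pick the longest response" selection rule. The stub callables stand in for h-LLM microservices reached through a cloud service API; no real network call is made.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

# Hypothetical stand-in for one h-LLM exposed through a cloud service API:
# a callable that takes a prompt string and returns a response string.
HLLM = Callable[[str], str]


def make_stub_hllm(name: str) -> HLLM:
    """Build a fake h-LLM that echoes the prompt (assumption, no real API)."""
    def call(prompt: str) -> str:
        return f"[{name}] answer to: {prompt}"
    return call


class InputBroker:
    """Receives a user prompt, derives several prompts, sends them to the h-LLMs."""

    def __init__(self, hllms: List[HLLM]) -> None:
        self.hllms = hllms

    def derive_prompts(self, user_prompt: str) -> List[str]:
        # Assumed derivation strategy: simple rephrasings of the user prompt.
        return [
            user_prompt,
            f"Answer concisely: {user_prompt}",
            f"Answer step by step: {user_prompt}",
        ]

    def dispatch(self, user_prompt: str) -> List[str]:
        prompts = self.derive_prompts(user_prompt)
        # Pair every h-LLM with every derived prompt.
        jobs = [(hllm, prompt) for hllm in self.hllms for prompt in prompts]
        # Threads model concurrent API calls; map() preserves job order.
        with ThreadPoolExecutor(max_workers=4) as pool:
            return list(pool.map(lambda job: job[0](job[1]), jobs))


class OutputBroker:
    """Receives the h-LLM results and reduces them to a single result."""

    def process(self, results: List[str]) -> str:
        # Assumed selection rule for illustration: keep the longest response.
        return max(results, key=len)


def answer_query(user_prompt: str, hllms: List[HLLM]) -> str:
    """End-to-end flow: input broker -> h-LLMs -> output broker -> result."""
    results = InputBroker(hllms).dispatch(user_prompt)
    return OutputBroker().process(results)


if __name__ == "__main__":
    models = [make_stub_hllm("model-a"), make_stub_hllm("model-b")]
    print(answer_query("What is a container?", models))
```

In a deployment along the lines of the claim, each stub would presumably be replaced by an HTTP client for a containerized h-LLM service, and the output broker's selection rule by whatever ranking or merging logic the system actually applies.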