US 12,386,871 B2
Method and system for multi-level artificial intelligence supercomputer design
Vijay Madisetti, Alpharetta, GA (US); and Arshdeep Bahga, Chandigarh (IN)
Assigned to Vijay Madisetti, Alpharetta, GA (US)
Filed by Vijay Madisetti, Alpharetta, GA (US)
Filed on Nov. 20, 2024, as Appl. No. 18/953,247.
Application 18/953,247 is a continuation of application No. 18/786,130, filed on Jul. 26, 2024, granted, now 12,210,550.
Application 18/786,130 is a continuation of application No. 18/470,487, filed on Sep. 20, 2023, granted, now 12,147,461.
Application 18/470,487 is a continuation of application No. 18/348,692, filed on Jul. 7, 2023, granted, now 12,001,462, issued on Jun. 4, 2024.
Claims priority of provisional application 63/469,571, filed on May 30, 2023.
Claims priority of provisional application 63/463,913, filed on May 4, 2023.
Prior Publication US 2025/0077553 A1, Mar. 6, 2025
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/3329 (2025.01); G06F 40/284 (2020.01)
CPC G06F 16/3329 (2019.01) [G06F 40/284 (2020.01)]
30 Claims
 
1. A system for answering queries using one or more families of large language models (h-LLMs) comprising:
a processor;
a communication device operable to transmit and receive digital communication;
one or more h-LLMs implemented as microservices in one or more cloud container environments, each h-LLM of the one or more h-LLMs being accessible via a cloud service application programming interface (API); and
a non-transitory computer-readable storage medium having stored thereon software that, when executed by the processor, is operable to:
operate an input broker having a broker API operable to receive a user prompt from a user interface;
generate a plurality of derived prompts from the user prompt at the input broker;
transmit the plurality of derived prompts to the one or more h-LLMs via the cloud service API;
operate an output broker operable to receive a plurality of h-LLM results;
process the plurality of h-LLM results at the output broker to generate a result; and
transmit the result to the user interface via the broker API.
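
The data flow recited in claim 1 (user prompt, derived prompts, h-LLM results, single result) can be illustrated with a minimal Python sketch. Everything named here is a hypothetical stand-in rather than the patented implementation: the `HLLM` callable type takes the place of an h-LLM reached through a cloud service API, the prompt templates in `derive_prompts` are invented for illustration, and the majority vote in `combine` is an assumed policy, since the claim does not say how the output broker processes the results.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

# Hypothetical stand-in for an h-LLM reached through a cloud service API.
HLLM = Callable[[str], str]


def derive_prompts(user_prompt: str) -> List[str]:
    """Input broker step: build several derived prompts from one user prompt.

    The templates are illustrative assumptions, not taken from the patent.
    """
    templates = [
        "{p}",
        "Answer concisely: {p}",
        "Think step by step, then answer: {p}",
    ]
    return [t.format(p=user_prompt) for t in templates]


def dispatch(derived: List[str], hllms: List[HLLM]) -> List[str]:
    """Send every derived prompt to every h-LLM and collect the raw results."""
    jobs = [(model, prompt) for model in hllms for prompt in derived]
    with ThreadPoolExecutor(max_workers=4) as pool:
        return list(pool.map(lambda job: job[0](job[1]), jobs))


def combine(results: List[str]) -> str:
    """Output broker step: reduce many h-LLM results to one result.

    Majority vote over normalized strings is an assumed policy.
    """
    votes = Counter(r.strip().lower() for r in results)
    return votes.most_common(1)[0][0]


def answer(user_prompt: str, hllms: List[HLLM]) -> str:
    """Broker entry point: derive prompts, query the h-LLMs, return one result."""
    derived = derive_prompts(user_prompt)
    results = dispatch(derived, hllms)
    return combine(results)
```

In a deployment matching the claim, each `HLLM` callable would wrap a network call to a containerized microservice, and `answer` would sit behind the broker API that the user interface calls. Local callables are used here only so that the sketch runs without any external service.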