US 12,293,277 B1
	Multimodal generative AI model protection using sequential sidecars
Kenneth Yeung, Ottawa (CA); and Jason Martin, Beaverton, OR (US)
Assigned to HiddenLayer, Inc., Austin, TX (US)
Filed by HiddenLayer, Inc., Austin, TX (US)
Filed on Aug. 1, 2024, as Appl. No. 18/792,455.
This patent is subject to a terminal disclaimer.
Int. Cl. G06N 3/0475 (2023.01); G06N 3/045 (2023.01)

CPC G06N 3/0475 (2023.01) [G06N 3/045 (2023.01)]

25 Claims

1. A computer-implemented method comprising:

receiving, over a computer network from a requestor, data comprising multimodal input for ingestion by a first generative artificial intelligence (GenAI) model, the first GenAI model comprising one or more machine learning models, the multimodal input comprising input having two or more modalities;

inputting the received data into the first GenAI model to result in a first output;

inputting both of the received data and the first output into a second GenAI model to result in a second output, the second GenAI model comprising one or more machine learning models;

determining, based on content of the second output, whether the second output indicates that guardrails associated with the second GenAI model have been triggered;

returning, over the computer network, the first output to the requestor when it is determined that the second output indicates that guardrails associated with the second GenAI model have not been triggered; and

initiating one or more remediation actions in lieu of returning the first output to the requestor when it is determined that the second output indicates that guardrails associated with the second GenAI model have been triggered, the one or more remediation actions preventing the first GenAI model from behaving in an undesired manner.