| CPC G06N 3/0455 (2023.01) [G05B 19/4069 (2013.01)] | 20 Claims |

|
1. A system comprising:
one or more processors; and
one or more non-transitory computer-readable media storing instructions executable by the one or more processors, wherein the instructions, when executed, cause the one or more processors to perform operations comprising:
inputting, as a first input into a decoder of a variable autoencoder, map data representing an environment;
inputting, as a second input into the decoder, discrete latent variable data associated with a first object and a second object in the environment, the discrete latent variable data representing a first action of the first object and a second action of the second object, the second action different than the first action;
receiving, from the decoder and based at least in part on the first input and the second input, output data representing a first bounding box for the first object and a second bounding box for the second object, the first bounding box including a first orientation and the second bounding box including a second orientation; and
at least one of:
performing, based at least in part on the output data, a simulation between a vehicle, the first object, and the second object; or
controlling, based at least in part on the output data, the vehicle in the environment relative to the first object and the second object.
|