| CPC G06F 21/577 (2013.01) [G06F 21/566 (2013.01); G06F 2221/034 (2013.01)] | 21 Claims |

|
1. A method for simulating a spatial environment, the method performed by an adversarial reinforcement learning system comprising one or more hardware processors, the method comprising:
generating, by a first model, a threat mitigation input, wherein the threat mitigation input comprises instructions for controlling one or more simulated objects in the simulation, wherein the first model is configured to minimize one or more harm outcomes of the simulation;
generating, by a second model, a threat input, wherein the threat input comprises instructions for controlling one or more simulated objects in the simulation, wherein the second model is distinct from the first model and is configured to maximize one or more harm outcomes of the simulation; and
executing a first portion of the simulation based at least in part on the threat mitigation input and the threat input.
|
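The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the one-dimensional environment, the names `SimState`, `mitigation_model`, `threat_model`, and `execute_portion`, and the hand-written policies standing in for trained models are all assumptions made for illustration. It shows only the control flow: a first model emits instructions for one simulated object, a distinct second model emits instructions for another, and a portion of the simulation is executed using both inputs.

```python
from dataclasses import dataclass


@dataclass
class SimState:
    """Hypothetical 1-D spatial environment (all fields are illustrative)."""
    asset: int = 0     # position of the protected asset
    guard: int = 5     # simulated object controlled by the mitigation input
    intruder: int = 9  # simulated object controlled by the threat input
    harm: int = 0      # accumulated harm outcome


def step_toward(src: int, dst: int) -> int:
    """Return -1, 0, or +1: the unit move that brings src closer to dst."""
    return (dst > src) - (dst < src)


def mitigation_model(state: SimState) -> dict:
    """Stand-in for the first model: steer the guard to intercept the intruder.

    A trained model would be rewarded with -harm (it minimizes harm).
    """
    return {"guard": step_toward(state.guard, state.intruder)}


def threat_model(state: SimState) -> dict:
    """Stand-in for the second, distinct model: steer the intruder to the asset.

    A trained model would be rewarded with +harm (it maximizes harm).
    """
    return {"intruder": step_toward(state.intruder, state.asset)}


def execute_portion(state: SimState, mitigation_input: dict,
                    threat_input: dict, steps: int = 1) -> SimState:
    """Execute a portion of the simulation using both inputs."""
    for _ in range(steps):
        state.guard += mitigation_input["guard"]
        state.intruder += threat_input["intruder"]
        intercepted = state.guard == state.intruder
        if not intercepted and state.intruder == state.asset:
            state.harm += 1  # the intruder reached the asset unopposed
    return state


# One pass through the claimed steps.
state = SimState()
mitigation_input = mitigation_model(state)  # generated by the first model
threat_input = threat_model(state)          # generated by the second model
state = execute_portion(state, mitigation_input, threat_input, steps=1)
print(state)
```

In an actual adversarial reinforcement learning setup, the two hand-written policies would presumably be replaced by learned models trained against each other with opposing rewards on the same harm outcome; the sketch keeps them fixed so the flow of inputs is easy to follow.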