| CPC G05B 13/04 (2013.01) [G06F 16/50 (2019.01); G06T 3/00 (2013.01); G06T 7/50 (2017.01); G06T 2207/30252 (2013.01)] | 12 Claims |

|
1. A method comprising:
simulating a simulated agent's traversal of a simulated path in a real environment, including simulating an autonomous vehicle in a simulated traversal of a roadway;
prior to simulation of the simulated agent's traversal of the simulated path, obtaining a plurality of observations acquired by real entities as they traversed real paths, each observation including an image acquired when a corresponding real entity was in a location corresponding to said image on one of the real paths, wherein at least some of the real paths traverse the roadway, and wherein the real entities comprise a human-operated real vehicle that traversed one of the real paths;
wherein the simulating of the simulated agent's traversal comprises simulating a simulated agent's view to form simulated observations, including simulating a first image for a first location on the simulated path, the simulating of the first image including selecting a second image acquired by one of the real entities at a second location different than the first location, and transforming the second image to form the first image according to a relationship of the first location and the second location, wherein the second image comprises a two-dimensional image and the first image comprises a two-dimensional image, wherein transforming the second image to form the first image comprises forming a second three-dimensional image from the second two-dimensional image, forming a first three-dimensional image from the second three-dimensional image according to the relationship of the first location and the second location, and forming the first two-dimensional image from the first three-dimensional image;
wherein the simulating of the simulated agent's traversal further comprises processing the simulated observations according to a control policy to yield control actions, and causing the simulated agent to transverse the simulated path according to the control actions;
wherein the method further comprises updating the control policy according to the simulated agent's traversal of the simulated path, including updating the policy according to the simulated agent's traversal of a plurality of simulated paths according to simulations of the simulated agent's view in a plurality of environmental conditions.
|