CPC G06N 20/00 (2019.01) [G05B 13/0265 (2013.01); G05D 1/0088 (2013.01); G06N 7/01 (2023.01)] | 20 Claims |
1. A computer-implemented method for autonomous vehicle motion control, the method comprising:
obtaining sensor data for an environment comprising an object;
processing the sensor data with a machine-learned object motion prediction model to obtain a predicted location probability distribution for a future location of the object at one or more future times, wherein the machine-learned object motion prediction model has been trained by performing a REINFORCE gradient estimation technique to determine an approximate gradient of an expected loss that is a function of a non-differentiable prior knowledge reward function that encodes prior knowledge about motion of the object;
generating a vehicle trajectory based, at least in part, on the predicted location probability distribution for the future location of the object at the one or more future times; and
controlling motion of an autonomous vehicle based on the generated vehicle trajectory.
|