CPC G08G 1/08 (2013.01) [G06N 5/01 (2023.01); G06N 7/01 (2023.01); G06N 20/00 (2019.01); G08G 1/0129 (2013.01); G08G 1/082 (2013.01)] | 20 Claims |
1. A method for traffic signal control of a traffic network with a learned model, the traffic network comprising one or more intersections and sensors associated with the intersections to determine vehicle traffic approaching each intersection, the method comprising, for each timestep:
receiving sensor readings from the traffic network, the sensor readings comprising positions and speeds of vehicles approaching each intersection;
using a learned dynamics model that takes the sensor readings as input, predicting a plurality of possibilities for position and velocity of the vehicles approaching each intersection in a future timestep;
determining an action for the one or more intersections by performing a tree search on the plurality of possibilities and selecting the possibility with a highest action value; and
outputting the action to the traffic network for implementation as a traffic control action at the one or more intersections.
|