CPC G06T 11/00 (2013.01) [G06F 18/2431 (2023.01); G06N 20/00 (2019.01); G06T 11/20 (2013.01); G06V 10/25 (2022.01); G06V 10/764 (2022.01); G06V 20/64 (2022.01); G06T 2210/12 (2013.01)] | 22 Claims |
1. A computer vision system for object detection with reinforcement learning, comprising:
a memory storing at least one image; and
a processor in communication with the memory, the processor:
setting a plurality of reinforcement learning agent parameters;
retrieving the at least one image from the memory;
detecting a target object in the at least one image based on the reinforcement learning agent parameters;
determining a bounding box for the detected target object;
displaying the bounding box on the image;
performing reinforcement learning on a portion of the image appearing within the bounding box; and
when the reinforcement learning agent receives a positive terminal reward, performing at least one of: (1) evaluating whether an optimal number of rewards for the reinforcement learning agent can be improved; or (2) determining a better fitting bounding box.
|
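The bounding-box steps recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. It assumes a reward based on intersection-over-union (IoU) against a known reference box, as might be available during training, and a greedy action choice stands in for a trained reinforcement learning agent. The names `iou`, `apply_action`, `refine_box`, and `ACTIONS`, the one-pixel edge moves, and the 0.9 threshold are all hypothetical.

```python
from typing import Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) with x1 > x0 and y1 > y0

def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    inter_w = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

# Hypothetical action set: move exactly one edge of the box by one step.
ACTIONS = [
    (1, 0, 0, 0), (-1, 0, 0, 0),   # left edge
    (0, 1, 0, 0), (0, -1, 0, 0),   # top edge
    (0, 0, 1, 0), (0, 0, -1, 0),   # right edge
    (0, 0, 0, 1), (0, 0, 0, -1),   # bottom edge
]

def apply_action(box: Box, action: Box, step: int = 1) -> Box:
    """Return the moved box, or the original box if the move would collapse it."""
    x0, y0, x1, y1 = (c + step * d for c, d in zip(box, action))
    if x1 <= x0 or y1 <= y0:
        return box
    return (x0, y0, x1, y1)

def refine_box(start: Box, target: Box, step: int = 1,
               threshold: float = 0.9, max_steps: int = 200):
    """Greedily move box edges while the IoU reward improves.

    Returns (box, total_reward, done). `done` is True when the assumed
    positive terminal reward (+1.0) was granted for reaching `threshold`.
    """
    box, total = start, 0.0
    for _ in range(max_steps):
        current = iou(box, target)
        if current >= threshold:
            break
        # Greedy stand-in for a learned policy: pick the best-scoring move.
        best = max((apply_action(box, a, step) for a in ACTIONS),
                   key=lambda b: iou(b, target))
        gain = iou(best, target) - current
        if gain <= 0:
            break  # no move improves the fit
        total += gain  # intermediate reward: improvement in IoU
        box = best
    done = iou(box, target) >= threshold
    return box, total + (1.0 if done else 0.0), done

# Usage: tighten an oversized initial box around a reference object.
box, reward, done = refine_box((0, 0, 10, 10), (2, 2, 8, 8))
```

In this sketch the terminal reward marks the point at which a better fitting bounding box has been found. A learned policy would choose actions from image features inside the box rather than from the reference box directly.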