US 12,223,697 B1
System and method for unsupervised concept extraction from reinforcement learning agents
Praveen K. Pilly, West Hills, CA (US); and Nicholas A. Ketz, Topanga, CA (US)
Assigned to HRL LABORATORIES, LLC, Malibu, CA (US)
Filed by HRL Laboratories, LLC, Malibu, CA (US)
Filed on Sep. 1, 2022, as Appl. No. 17/901,703.
Application 17/901,703 is a continuation in part of application No. 17/590,726, filed on Feb. 1, 2022, granted, now 11,907,815.
Application 17/590,726 is a continuation in part of application No. 16/900,145, filed on Jun. 12, 2020, granted, now 11,420,655, issued on Aug. 23, 2022.
Claims priority of provisional application 63/290,031, filed on Dec. 15, 2021.
Claims priority of provisional application 63/146,314, filed on Feb. 5, 2021.
Claims priority of provisional application 62/906,269, filed on Sep. 26, 2019.
Int. Cl. B60W 60/00 (2020.01); B60W 40/08 (2012.01); G06F 18/21 (2023.01); G06F 18/214 (2023.01); G06F 30/20 (2020.01); G06N 20/00 (2019.01); G06V 10/74 (2022.01); G06V 10/762 (2022.01); G06V 10/774 (2022.01); G06V 10/776 (2022.01); G06V 20/70 (2022.01)
CPC G06V 10/776 (2022.01) [G06V 10/761 (2022.01); G06V 10/762 (2022.01); G06V 10/774 (2022.01); G06V 20/70 (2022.01)] 12 Claims
OG exemplary drawing
 
1. A system for performance of agent-based machine learning with unsupervised concept extraction from reinforcement learning agents, the system comprising:
one or more processors and a non-transitory computer-readable medium having executable instructions encoded thereon such that when executed, the one or more processors perform operations of:
training a reinforcement learning (RL) agent on an image processing task;
extracting a dataset of a plurality of states and corresponding actions from the RL agent, the plurality of states corresponding to a plurality of image frames;
perturbing each image frame by blurring each pixel in the image frame and focusing the blur around each pixel using a Gaussian filter centered on the pixel;
processing each image frame with the retrained RL agent to obtain an output;
comparing the output to the image frame prior to perturbation to determine a saliency score for each pixel in each image frame;
extracting, during action selection by the RL agent, a plurality of image patches of the input space based on the saliency score;
clustering, using an unsupervised clustering algorithm, portions of a first set of inputs based on similarity to the plurality of image patches, thereby generating a first set of clusters having cluster centers; and
providing non-semantic concept labels as distances to the cluster centers for each state in the dataset.