| CPC G06N 20/00 (2019.01) [G06F 16/951 (2019.01); G06N 20/10 (2019.01); G06N 20/20 (2019.01)] | 18 Claims |

|
1. A method for learning prototypical options for interpretable imitation learning, the method comprising:
initializing options by bottleneck state discovery, each of the options presented by an instance of trajectories generated by experts;
applying segmentation embedding learning to extract features to represent current states in segmentations by dividing the trajectories into a set of segmentations;
learning prototypical options for each segment of the set of segmentations to mimic expert policies by minimizing loss of a policy and projecting prototypes to the current states;
learning prototypical option embedding using an objective function:
![]() where LILloss is an imitation learning loss, fφ is, the second term is a segment representation function for a segment sν′m,νm from segment νm to segment νm′, ei and ej are embedded prototypes, K is a number of prototypes, M is a number of segments, dmin is a threshold value, and λ1, λ2, and λ3, are weighting parameters;
training option policy with imitation learning techniques to learn a conditional policy;
generating interpretable policies by comparing the current states in the segmentations to one or more prototypical option embeddings;
generating dosage options for a patient based on the interpretable policies;
displaying the dosage options on a user interface for a user; and
taking an action based on the dosage options.
|