CPC G06N 3/08 (2013.01) [G06N 20/00 (2019.01)] | 7 Claims |
1. A method for acquiring skills through imitation learning, the method comprising:
learning behaviors or tasks, by an agent, from state-action pairs of medical treatment for a given disease by:
learning to decompose the state-action pairs into segments, via a segmentation component, the segments corresponding to skills that are transferrable across different tasks;
learning relationships between the skills;
employing, via a graph generator, a graph neural network for learning implicit structures of the skills from the state-action pairs to define structured skills; and
generating policies from the structured skills to allow the agent to acquire the structured skills for application to one or more target tasks by optimizing an objective function:
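[equation image not reproduced in the source] wherein $p_{\pi_\theta}$ is a generated policy with parameters $\pi_\theta$, $p_{\pi_E}$ is an expert policy,
[image not reproduced in the source] providing a medication to a patient in accordance with the generated policies to treat the given disease.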
|
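The data flow of the first three claimed steps (segmenting state-action pairs, relating the resulting skills, and propagating information over a skill graph) can be illustrated with a toy sketch. This is not the claimed implementation: the fixed "split when the action changes" rule stands in for the learned segmentation component, transition counts stand in for learned skill relationships, and a single unweighted mean-aggregation round stands in for a graph neural network layer. The function names, the action labels `a1`, `a2`, `a3`, and the numeric states are all hypothetical. The policy-generation step is omitted because the objective function is not reproduced in the source.

```python
from collections import defaultdict


def segment(pairs):
    """Split state-action pairs into contiguous runs that share one action.

    Assumption: a fixed rule used only as a stand-in for the learned
    segmentation component named in the claim.
    """
    segments = []
    for state, action in pairs:
        if segments and segments[-1][-1][1] == action:
            segments[-1].append((state, action))
        else:
            segments.append([(state, action)])
    return segments


def skill_graph(segments):
    """Count transitions between the skills of consecutive segments.

    Each segment's skill is identified by its (shared) action label.
    """
    edges = defaultdict(int)
    for prev, nxt in zip(segments, segments[1:]):
        edges[(prev[0][1], nxt[0][1])] += 1
    return dict(edges)


def message_pass(features, edges):
    """One round of mean aggregation over incoming neighbours.

    No learned weights: each node keeps half of its own value and takes
    half of the mean of its predecessors' values (0.0 if it has none).
    """
    incoming = defaultdict(list)
    for src, dst in edges:
        incoming[dst].append(features[src])
    updated = {}
    for node, value in features.items():
        msgs = incoming.get(node, [])
        neighbour_mean = sum(msgs) / len(msgs) if msgs else 0.0
        updated[node] = 0.5 * value + 0.5 * neighbour_mean
    return updated


# Hypothetical trajectory of (state, action) pairs.
pairs = [(0.9, "a1"), (0.8, "a1"), (0.6, "a2"), (0.4, "a2"), (0.3, "a3")]

segments = segment(pairs)
edges = skill_graph(segments)
# Per-skill feature: mean state value inside the segment.
features = {seg[0][1]: sum(s for s, _ in seg) / len(seg) for seg in segments}
updated = message_pass(features, edges)
```

In a learned system each of these stand-ins would be a trainable component, and the segment boundaries, edge weights, and node updates would be fitted jointly from expert demonstrations rather than fixed by hand.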