| CPC B25J 9/163 (2013.01) [B25J 9/0081 (2013.01); B25J 9/1661 (2013.01); B25J 9/1697 (2013.01); G06N 20/00 (2019.01)] | 20 Claims |

|
1. A method performed by one or more computers, the method comprising:
receiving, from a user device by a skill template distribution system, characteristics of a robot and a selection of a skills template from a list of available skill templates compatible with the characteristics of the robot;
providing, by the skill template distribution system, to the user device, the selected skill template, wherein the skill template comprises information representing a state machine of one or more tasks, and wherein the skill template specifies which of the one or more tasks require adaptation using training data;
receiving, by the skill template distribution system, training data for a subtask of the skill template;
training a machine learning model for the subtask using the training data to generate learned parameter values; and
providing, to the user device, the learned parameter values for the machine learning model that causes the robot to implement a control policy that executes the subtask adapted using the training data.
|