| CPC G06N 3/08 (2013.01) [G06N 3/04 (2013.01); G09G 5/37 (2013.01)] | 20 Claims |

|
8. A method comprising:
generating, based at least on processing a representation of a current working observation of a canvas in a synthetic rendering environment using a natural media agent comprising one or more deep neural networks, a representation of at least one primitive graphic action;
generating an updated state of the canvas in the synthetic rendering environment based at least on the at least one primitive graphic action;
updating an accumulated reward, accumulated over a plurality of iterations of the natural media agent, based on a difference between at least a portion of the updated state of the canvas and a current training image of a set of training images; and
updating, in response to detecting a trigger, the one or more deep neural networks using the accumulated reward, wherein the trigger comprises at least one of a designated number of iterations of the natural media agent, a number of iterations of the natural media agent that increases between episodes of iterations during which the accumulated reward is updated, or a determination that a position of a media rendering instrument in the updated state of the canvas moved more than a threshold distance from a center of an ego-centric patch of the canvas.
|