CPC G06T 7/194 (2017.01) [G06V 10/774 (2022.01); G06V 10/82 (2022.01); G06V 20/49 (2022.01); G06T 2200/24 (2013.01); G06T 2207/10016 (2013.01); G06T 2207/10024 (2013.01); G06T 2207/20076 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01)] | 20 Claims |
1. A computing platform comprising:
at least one processor;
a communication interface communicatively coupled to the at least one processor; and
memory storing computer-readable instructions that, when executed by the at least one processor, cause the computing platform to:
receive a set of images and corresponding ground truth foreground masks;
train, using the set of images and corresponding ground truth foreground masks, a first neural network to distinguish between image foregrounds and backgrounds, wherein training the first neural network results in a first set of foreground masks each corresponding to an image of the set of images, wherein distinguishing between the image foregrounds and the backgrounds comprises separating an identified subject of a video from a corresponding background, and wherein training the first neural network comprises training the first neural network using a first subset of the set of images corresponding to a first resolution and a first component configuration and a second subset of the set of images corresponding to a second resolution and a second component configuration;
estimate, for each image of the set of images and based on the foreground mask inferred by the first neural network, and images from a sequence temporarily related to the image, a first background clean plate;
train, using the set of images, the first background clean plates, and a set of corresponding ground truth mask images, a second neural network, wherein training the second neural network configures the second neural network to output foreground masks based on video input information; and
deploy, to an implementation computing device, the second neural network.
|