CPC G06T 7/11 (2017.01) [G06F 18/214 (2023.01); G06F 18/217 (2023.01); G06T 5/002 (2013.01); G06T 11/00 (2013.01); G06V 40/10 (2022.01); G06T 2207/10016 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01); G06T 2207/30196 (2013.01)] | 20 Claims |
1. A method comprising:
receiving, by one or more processors, a monocular image that includes a depiction of a whole body of a user;
generating, by the one or more processors, a segmentation of the whole body of the user based on the monocular image;
accessing a video feed comprising a plurality of monocular images received prior to the monocular image;
predicting, based on the plurality of monocular images received prior to the monocular image, the segmentation of the whole body of the user that is generated based on the monocular image;
smoothing the segmentation of the whole body generated based on the monocular image based on predicting the segmentation to provide a smoothed segmentation, the smoothing comprising comparing predicted one or more segmentations of whole bodies provided by a second deep neural network with the segmentation of the whole body, in the received monocular image, generated by a first deep neural network; and
applying one or more visual effects to the monocular image based on the smoothed segmentation.
|
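The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation: the claim names a first and a second deep neural network but does not disclose their architectures or how the comparison produces the smoothed result. Every function name, the brightness threshold, the averaging predictor, the `tolerance` and `weight` parameters, and the tint effect below are hypothetical stand-ins chosen only to make the data flow concrete.

```python
import numpy as np

def segment_current(frame):
    """Hypothetical stand-in for the first deep neural network.

    Returns a per-pixel body mask (1.0 = body, 0.0 = background) by
    thresholding mean brightness; a real system would run a trained model.
    """
    return (frame.mean(axis=-1) > 0.5).astype(np.float32)

def predict_from_history(prior_frames):
    """Hypothetical stand-in for the second deep neural network.

    Predicts the current mask from the frames received earlier by
    averaging their individual masks.
    """
    prior_masks = [segment_current(f) for f in prior_frames]
    return np.mean(prior_masks, axis=0)

def smooth_segmentation(current_mask, predicted_mask, tolerance=0.5, weight=0.5):
    """Compare the predicted mask with the current mask (assumed rule).

    Where the two disagree by more than `tolerance`, blend them;
    elsewhere keep the current mask unchanged.
    """
    diff = np.abs(current_mask - predicted_mask)
    blended = weight * current_mask + (1.0 - weight) * predicted_mask
    return np.where(diff > tolerance, blended, current_mask)

def apply_effect(frame, mask, tint=(0.0, 0.0, 1.0)):
    """Example visual effect: replace the background with a flat tint."""
    alpha = mask[..., None]  # broadcast the mask over the colour channels
    return alpha * frame + (1.0 - alpha) * np.asarray(tint, dtype=frame.dtype)

def process_frame(frame, prior_frames):
    """Run the claimed steps in order on one monocular frame."""
    current = segment_current(frame)
    predicted = predict_from_history(prior_frames)
    smoothed = smooth_segmentation(current, predicted)
    return smoothed, apply_effect(frame, smoothed)
```

Under these assumptions, a pixel that drops out of the mask for a single frame is pulled back toward the value predicted from the earlier frames, which is one plausible reading of the temporal smoothing the claim describes.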