| CPC G06T 7/248 (2017.01) [G06T 7/11 (2017.01); G06T 7/194 (2017.01); G06T 7/74 (2017.01); G06V 10/62 (2022.01); G06V 10/764 (2022.01); G06V 10/806 (2022.01); G06V 20/70 (2022.01); G06T 2207/10016 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20221 (2013.01); G06V 2201/07 (2022.01)] | 30 Claims |

|
1. An apparatus of processing one or more frames, the apparatus comprising:
at least one memory; and
at least one processor coupled to the at least one memory and configured to:
determine first one or more features from a first frame, the first frame including a target object;
obtain a first mask associated with the first frame, the first mask including an indication of the target object;
generate, based on the first mask and the first one or more features, a representation of a foreground and a background of the first frame, wherein the foreground of the first frame is associated with the target object;
determine second one or more features from a second frame; and
determine, based on the representation of the foreground and the background of the first frame and the second one or more features, a location of the target object in the second frame.
|