| CPC G06T 5/70 (2024.01) [G06T 3/4007 (2013.01); G06T 3/60 (2013.01); G06T 5/20 (2013.01); G06T 7/20 (2013.01); G06T 7/60 (2013.01); G06T 2207/20032 (2013.01); G06T 2207/20044 (2013.01)] | 20 Claims |

|
1. A method of denoising an action dataset comprising video clips of actions comprising:
breaking the video clips into one or more fixed length clips of a predetermined number of frames;
determining the number of frames in the fixed number of frames that are noisy;
determining a ratio of the number of noisy frames to the total number of frames in the fixed length clip; and
removing the fixed length clip from the dataset when the ratio exceeds a predetermined noise threshold;
wherein skeletal representations of one or more persons in the fixed length clips are extracted from each frame of the fixed length clips, the skeletal representations comprising a set of joint coordinates defining locations of joints of the skeletal representation in a coordinate system defined in the context of each frame.
|
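The steps of claim 1 can be illustrated with a minimal sketch. The claim does not say how a frame is judged noisy, so the `is_noisy_frame` test below (no skeleton detected, or a joint coordinate missing) is an assumption, as are all function names, the representation of a frame as a list of `(x, y)` joint coordinates, and the choice to drop a trailing remainder shorter than the fixed clip length. It is not the patentee's implementation.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# Hypothetical representation: one skeleton per frame, given as a list of
# (x, y) joint coordinates in the frame's own coordinate system.
# None means no skeleton was extracted for that frame.
Joint = Tuple[Optional[float], Optional[float]]
Skeleton = Optional[List[Joint]]


def is_noisy_frame(skeleton: Skeleton) -> bool:
    """Assumed noise test: missing skeleton or any missing joint coordinate."""
    if skeleton is None:
        return True
    return any(coord is None for joint in skeleton for coord in joint)


def split_fixed_length(frames: Sequence[Skeleton], clip_len: int) -> List[Sequence[Skeleton]]:
    """Break a video into consecutive clips of exactly clip_len frames.

    Assumption: a trailing remainder shorter than clip_len is discarded.
    """
    if clip_len <= 0:
        raise ValueError("clip_len must be positive")
    last_start = len(frames) - clip_len
    return [frames[i:i + clip_len] for i in range(0, last_start + 1, clip_len)]


def noise_ratio(clip: Sequence[Skeleton],
                is_noisy: Callable[[Skeleton], bool] = is_noisy_frame) -> float:
    """Ratio of noisy frames to the total number of frames in the clip."""
    noisy = sum(1 for frame in clip if is_noisy(frame))
    return noisy / len(clip)


def denoise_dataset(videos: Sequence[Sequence[Skeleton]],
                    clip_len: int,
                    noise_threshold: float) -> List[Sequence[Skeleton]]:
    """Keep only fixed length clips whose noise ratio does not exceed the threshold."""
    kept = []
    for frames in videos:
        for clip in split_fixed_length(frames, clip_len):
            # A clip is removed when its ratio exceeds the threshold.
            if noise_ratio(clip) <= noise_threshold:
                kept.append(clip)
    return kept
```

For example, a nine-frame video split with `clip_len=4` yields two clips and drops the final frame; with `noise_threshold=0.5`, a clip containing one noisy frame out of four (ratio 0.25) is kept, while a clip containing three noisy frames out of four (ratio 0.75) is removed.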