| CPC G06V 20/46 (2022.01) [G06V 10/82 (2022.01); G06V 20/41 (2022.01); G06V 20/49 (2022.01); G06V 20/582 (2022.01); G06V 20/44 (2022.01); G06V 30/19173 (2022.01)] | 30 Claims |

|
1. A method, comprising:
segmenting an input stream to generate a plurality of frame sets, each frame set including a plurality of frames;
identifying, by a permutation invariant convolutional layer of a neural network, for each frame set from the plurality of frame sets, a frame with a highest likelihood of including one or more actions of a set of predefined actions;
generating a global representation of the input stream from pooled representations of the identified frames; and
classifying a long-range activity based on the global representation.
|