| CPC G06V 40/23 (2022.01) [G06T 3/4046 (2013.01); G06V 10/454 (2022.01); G06V 10/7715 (2022.01); G06V 10/82 (2022.01); G06V 20/41 (2022.01); G06V 20/46 (2022.01); G06N 3/045 (2023.01); G06N 3/048 (2023.01)] | 20 Claims |

|
1. A neural network system implemented by one or more electronic devices, comprising a target neural network block configured to be inserted between a first baseline neural network block and a second baseline neural network block of a baseline neural network, the target neural network block including:
at least one pooling unit to:
receive an input feature map outputted by the first baseline neural network block processing an image, and
temporally pool the input feature map into at least one intermediate feature map; and
at least one other processing unit to:
temporally process the at least one intermediate feature map into a residual feature map; and
generate an output feature map configured for inputting into the second baseline neural network block by combining the residual feature map with the input feature map;
wherein temporal information extracted by the target neural network block is different from temporal information extracted by the baseline neural network.
|
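The data flow recited in claim 1 (temporally pool, temporally process, then combine with the input as a residual) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the claim: the feature map is reduced to a `[time][channel]` list with spatial axes omitted, the pooling unit is a length-preserving sliding average, the other processing unit is a fixed per-channel 1-D temporal convolution, and the combination is element-wise addition. The names `temporal_avg_pool`, `temporal_conv`, and `target_block` are hypothetical.

```python
from typing import List, Sequence

# Assumed layout: feature_map[t][c]; spatial axes are omitted for brevity.
FeatureMap = List[List[float]]


def temporal_avg_pool(x: FeatureMap, window: int = 3) -> FeatureMap:
    """Average each channel over a sliding temporal window.

    The window is clipped at the sequence boundaries, so the output keeps
    the same number of time steps as the input.
    """
    t_len = len(x)
    half = window // 2
    pooled = []
    for t in range(t_len):
        lo, hi = max(0, t - half), min(t_len, t + half + 1)
        frames = x[lo:hi]
        pooled.append([sum(col) / len(frames) for col in zip(*frames)])
    return pooled


def temporal_conv(x: FeatureMap, kernel: Sequence[float]) -> FeatureMap:
    """Per-channel 1-D convolution along time with zero padding."""
    t_len = len(x)
    channels = len(x[0])
    half = len(kernel) // 2
    out = []
    for t in range(t_len):
        row = [0.0] * channels
        for k, w in enumerate(kernel):
            src = t + k - half
            if 0 <= src < t_len:  # positions outside the clip count as zero
                for c in range(channels):
                    row[c] += w * x[src][c]
        out.append(row)
    return out


def target_block(
    x: FeatureMap,
    window: int = 3,
    kernel: Sequence[float] = (0.25, 0.5, 0.25),
) -> FeatureMap:
    """Pool over time, process over time, then add the result to the input."""
    intermediate = temporal_avg_pool(x, window)   # intermediate feature map
    residual = temporal_conv(intermediate, kernel)  # residual feature map
    # Output has the input's shape, so it can feed the next baseline block.
    return [[a + r for a, r in zip(fx, fr)] for fx, fr in zip(x, residual)]


if __name__ == "__main__":
    features = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]  # 3 time steps, 2 channels
    print(target_block(features))
```

Because the residual is added to an unchanged copy of the input, an all-zero kernel makes the sketched block an identity mapping, which is one common reason a residual form is used when inserting a new block between existing ones.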