CPC G06V 30/242 (2022.01) [G06N 3/045 (2023.01); G06V 40/20 (2022.01)] | 19 Claims |
1. An action recognition method, performed by a data processing device, the method comprising:
determining, according to video data comprising an interactive object, node sequence information corresponding to video frames in the video data, the node sequence information of each video frame including position information of nodes in a node sequence, the nodes in the node sequence being nodes of the interactive object that are moved to implement a corresponding interactive action;
determining action categories corresponding to the video frames in the video data, comprising: determining, according to the node sequence information corresponding to N consecutive video frames in the video data, action categories respectively corresponding to the N consecutive video frames; and
determining, according to the action categories corresponding to the video frames in the video data, a target interactive action made by the interactive object in the video data,
wherein determining the node sequence information corresponding to the video frames in the video data comprises:
extracting an image feature of the video frames in the video data; and
determining, according to the image feature, the node sequence information corresponding to the video frames in the video data by using a node recognition model,
wherein the node recognition model is a neural network model that comprises a plurality of layers of prediction submodels, each layer of prediction submodel being configured to determine position information of nodes in the video frames and determine link information between nodes.
|
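For illustration only, the last two steps of claim 1 (assigning action categories to N consecutive video frames from their node sequence information, then deriving a target interactive action from the per-frame categories) might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the names `classify_windows`, `target_action`, and `toy_classifier` are hypothetical, the windows are taken as non-overlapping, the final action is chosen by majority vote (the claim does not specify how the categories are combined), and the toy rule stands in for whatever trained classifier would be used in practice.

```python
from collections import Counter
from typing import Callable, List, Optional, Sequence, Tuple

Node = Tuple[float, float]      # (x, y) position of one node in a frame
NodeSequence = List[Node]       # node sequence information for one video frame
WindowClassifier = Callable[[Sequence[NodeSequence]], List[str]]


def classify_windows(
    frames: Sequence[NodeSequence], n: int, classify: WindowClassifier
) -> List[str]:
    """Label frames in groups of n consecutive frames.

    Assumption: windows do not overlap, and trailing frames that do not
    fill a complete window are left unlabeled.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    categories: List[str] = []
    for start in range(0, len(frames) - n + 1, n):
        window = frames[start:start + n]
        # The classifier returns one category per frame in the window.
        categories.extend(classify(window))
    return categories


def target_action(categories: Sequence[str]) -> Optional[str]:
    """Pick the most frequent per-frame category (assumed majority vote)."""
    if not categories:
        return None
    return Counter(categories).most_common(1)[0][0]


def toy_classifier(window: Sequence[NodeSequence]) -> List[str]:
    """Hypothetical stand-in for a trained model.

    Node 0 is treated as a wrist and node 1 as a shoulder; in image
    coordinates a smaller y means higher up in the frame.
    """
    return [
        "raise_hand" if frame[0][1] < frame[1][1] else "idle"
        for frame in window
    ]


# Six frames: the wrist is above the shoulder in the first four.
raised = [(0.5, 0.2), (0.5, 0.4)]
lowered = [(0.5, 0.7), (0.5, 0.4)]
frames = [raised, raised, raised, raised, lowered, lowered]

per_frame = classify_windows(frames, n=2, classify=toy_classifier)
action = target_action(per_frame)
```

Labeling a whole window at once lets the classifier use motion across frames rather than a single pose; the vote over per-frame categories is only one of several possible ways to settle on a single target action.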