US 11,941,883 B2
Video classification method, model training method, device, and storage medium
Yongyi Tang, Shenzhen (CN); Lin Ma, Shenzhen (CN); and Wei Liu, Shenzhen (CN)
Assigned to TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED, Shenzhen (CN)
Filed by Tencent Technology (Shenzhen) Company Limited, Shenzhen (CN)
Filed on Apr. 14, 2021, as Appl. No. 17/230,778.
Application 17/230,778 is a continuation of application No. PCT/CN2020/077809, filed on Mar. 4, 2020.
Claims priority of application No. 201910168236.9 (CN), filed on Mar. 6, 2019.
Prior Publication US 2021/0232825 A1, Jul. 29, 2021
Int. Cl. G06V 20/40 (2022.01); G06N 20/00 (2019.01); G06V 10/56 (2022.01); G06V 10/62 (2022.01); G06V 10/75 (2022.01); G06V 10/82 (2022.01)
CPC G06V 20/41 (2022.01) [G06N 20/00 (2019.01); G06V 10/56 (2022.01); G06V 10/62 (2022.01); G06V 10/751 (2022.01); G06V 10/82 (2022.01)] 17 Claims
OG exemplary drawing
 
1. A video classification method, applicable to a computer device,
the method comprising:
obtaining an image frame sequence corresponding to a to-be-classified video file, the image frame sequence comprising T image frames, T being an integer greater than 1;
obtaining an appearance information feature sequence corresponding to the image frame sequence by applying the image frame sequence as an input to an image classification network model, the appearance information feature sequence comprising T appearance information features, each appearance information feature having a correspondence with one of the T image frames;
obtaining a motion information feature sequence corresponding to the appearance information feature sequence by applying the appearance information feature sequence as input to a motion prediction network model and predicting the motion information features using the appearance information features, the motion information feature sequence comprising T motion information features, each motion information feature having a correspondence with one of the T appearance information features; and
determining a video classification result of the to-be-classified video file according to the appearance information feature sequence and the motion information feature sequence.