US 11,954,591 B2
Picture set description generation method and apparatus, and computer device and storage medium
Bairui Wang, Shenzhen (CN); Lin Ma, Shenzhen (CN); and Wei Liu, Shenzhen (CN)
Assigned to TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED, Shenzhen (CN)
Filed by Tencent Technology (Shenzhen) Company Limited, Shenzhen (CN)
Filed on Aug. 11, 2020, as Appl. No. 16/990,877.
Application 16/990,877 is a continuation of application No. PCT/CN2019/090723, filed on Jun. 11, 2019.
Claims priority of application No. 201810732095.4 (CN), filed on Jul. 5, 2018.
Prior Publication US 2020/0387737 A1, Dec. 10, 2020
Int. Cl. G06N 3/08 (2023.01); G06F 16/58 (2019.01); G06F 16/583 (2019.01); G06F 18/25 (2023.01); G06V 10/40 (2022.01); G06V 10/764 (2022.01); G06V 10/77 (2022.01); G06V 10/82 (2022.01); G06V 20/00 (2022.01); G06V 20/40 (2022.01)
CPC G06N 3/08 (2013.01) [G06F 16/583 (2019.01); G06F 16/5866 (2019.01); G06F 18/253 (2023.01); G06V 10/40 (2022.01); G06V 10/764 (2022.01); G06V 10/77 (2022.01); G06V 10/82 (2022.01); G06V 20/35 (2022.01); G06V 20/47 (2022.01)] 20 Claims
OG exemplary drawing
 
1. A method of generating a description for a picture set, performed by a computer device having a processor and memory storing a plurality of programs to be executed by the processor, the method comprising:
acquiring a picture set to be processed;
performing picture feature extraction on each picture in the picture set to acquire a picture feature corresponding to the picture, and forming a picture feature sequence corresponding to the picture set by using the picture features corresponding to the picture set;
comparing any two adjacent picture features in the picture feature sequence corresponding to two adjacent pictures in the picture set to determine whether the adjacent picture features belong to the same scene;
forming a picture feature sub-sequence corresponding to one scene by using a plurality of consecutive picture features belonging to the same scene;
performing scene feature extraction on a respective picture feature sub-sequence corresponding to each scene within the picture feature sequence to acquire a scene feature corresponding to the scene, and forming a scene feature sequence corresponding to the picture set by using the scene features corresponding to the scenes; and
generating textual description information of the picture set according to the picture feature sequence and the scene feature sequence that correspond to the picture set.