US 12,243,268 B2
Panoramic video processing method and apparatus, and storage medium
Hequn Bai, Beijing (CN)
Assigned to BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., Beijing (CN)
Appl. No. 17/618,398
Filed by BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., Beijing (CN)
PCT Filed Jun. 5, 2020, PCT No. PCT/CN2020/094507
§ 371(c)(1), (2) Date Dec. 10, 2021,
PCT Pub. No. WO2020/248900, PCT Pub. Date Dec. 17, 2020.
Claims priority of application No. 201910497871.1 (CN), filed on Jun. 10, 2019.
Prior Publication US 2022/0277481 A1, Sep. 1, 2022
Int. Cl. G06T 7/73 (2017.01); G06T 7/246 (2017.01); G06T 7/66 (2017.01); H04N 5/265 (2006.01); H04S 7/00 (2006.01)
CPC G06T 7/75 (2017.01) [G06T 7/248 (2017.01); G06T 7/66 (2017.01); G06T 7/74 (2017.01); H04N 5/265 (2013.01); H04S 7/30 (2013.01); G06T 2207/10021 (2013.01); H04S 2400/11 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A method for processing a panoramic video, comprising:
determining a reference frame in video frames of the panoramic video, and determining a target object in the reference frame;
acquiring position information about a position of the target object in the reference frame, wherein the position information comprises angle information about the position of the target object in the panoramic video and a center distance about the position of the target object in the panoramic video;
identifying a plurality of feature points of the target object in the reference frame;
acquiring a pixel area of a region occupied by the plurality of feature points in the reference frame;
determining the center distance about the position of the target object in the panoramic video based on an inverse relationship between the pixel area and a square of the center distance, wherein the center distance is a distance between the position of the target object in the panoramic video and a center of a three-dimensional model corresponding to the reference frame;
acquiring a motion track of the target object in the panoramic video based on the position information; and
processing a to-be-processed audio frame based on the motion track to obtain a target audio frame capable of characterizing a position of the target object in the panoramic video, wherein the target audio frame is used to be synthesized with the video frames of the panoramic video to obtain a panoramic video file.