US 11,659,181 B2
Method and apparatus for determining region of interest
Xiaoming Wang, Beijing (CN); Huaifei Xing, Beijing (CN); Wenpeng Ding, Beijing (CN); Huifeng Shen, Beijing (CN); and Feifei Cao, Beijing (CN)
Assigned to BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD., Beijing (CN)
Filed by BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD., Beijing (CN)
Filed on Jun. 4, 2020, as Appl. No. 16/892,910.
Claims priority of application No. 201911314000.8 (CN), filed on Dec. 19, 2019.
Prior Publication US 2021/0192217 A1, Jun. 24, 2021
Int. Cl. G06K 9/00 (2022.01); H04N 19/124 (2014.01); G06T 7/70 (2017.01); G06V 20/40 (2022.01); G06V 10/25 (2022.01); G06V 40/10 (2022.01); G06F 18/22 (2023.01); G06V 10/80 (2022.01); H04N 19/102 (2014.01); H04N 19/167 (2014.01); G06V 10/22 (2022.01); G06V 10/764 (2022.01); H04N 19/169 (2014.01); H04N 19/115 (2014.01); G06V 10/28 (2022.01); H04N 19/17 (2014.01)
CPC H04N 19/124 (2014.11) [G06F 18/22 (2023.01); G06T 7/70 (2017.01); G06V 10/22 (2022.01); G06V 10/25 (2022.01); G06V 10/28 (2022.01); G06V 10/764 (2022.01); G06V 10/811 (2022.01); G06V 20/41 (2022.01); G06V 40/10 (2022.01); H04N 19/102 (2014.11); H04N 19/115 (2014.11); H04N 19/167 (2014.11); H04N 19/169 (2014.11); H04N 19/17 (2014.11); G06T 2207/10016 (2013.01)]
19 Claims
 
1. A method for processing a video, the method comprising:
acquiring object regions obtained by performing object detection on a target video frame, a type of an object in each of the object regions being a preset type;
determining, for an object region in the acquired object regions, in response to determining that the object region satisfies a preset condition, that the object region is a non-ROI (region of interest);
using object regions other than the non-ROI in the object regions of the target video frame as ROIs; and
acquiring a quantization parameter change corresponding to each of the ROIs, and encoding the target video frame based on the quantization parameter change corresponding to each of the ROIs;
wherein acquiring the quantization parameter change corresponding to each of the ROIs, comprises:
determining, for each of the ROIs in the target video frame, the quantization parameter change corresponding to each of the ROIs based on the type of the object in each of the ROIs, the type of the object in each of the ROIs being one of at least one preset type, by:
acquiring a maximum value of a sum of increase ratios of code rates of types of ROIs in the target video frame, wherein the maximum value of the sum of the increase ratios of the code rates is obtained relative to types of ROIs obtained by encoding based on a specified quantization parameter of an encoder;
acquiring a preset constant corresponding to the type of the object in each of the ROIs, wherein different preset types correspond to different preset constants, and the quantization parameter change is determined based on the preset constant and a constant coefficient; and
determining a quantization parameter change corresponding to each type of ROI in the target video frame based on the maximum value of the sum of the increase ratios of the code rates, the preset constant corresponding to each type of ROI, and a ratio of each type of ROI to the target video frame.
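
The steps of claim 1 can be illustrated with a minimal Python sketch. The claim does not state a formula, so everything below is an assumption made for illustration only: the names `ObjectRegion`, `select_rois`, `area_ratio_by_type`, `bitrate_increase` and `qp_changes` are hypothetical; the "ratio of each type of ROI to the target video frame" is read as an area ratio; the bitrate of a region is assumed to double roughly every time its quantization parameter drops by 6 (a common rule of thumb for H.264/HEVC-style encoders, not a statement from the patent); and the quantization parameter change for a type is modeled as the negative product of its preset constant, the constant coefficient, and a shared scale chosen so the summed bitrate increase stays within the given maximum.

```python
from dataclasses import dataclass


@dataclass
class ObjectRegion:
    kind: str  # preset object type, e.g. "face" or "text" (hypothetical labels)
    w: int     # region width in pixels
    h: int     # region height in pixels


def select_rois(regions, is_non_roi):
    """Keep every detected region that does not satisfy the non-ROI condition."""
    return [r for r in regions if not is_non_roi(r)]


def area_ratio_by_type(rois, frame_w, frame_h):
    """Sum, per object type, the fraction of the frame covered by its ROIs.

    Overlap between regions is ignored in this sketch.
    """
    ratios = {}
    for r in rois:
        ratios[r.kind] = ratios.get(r.kind, 0.0) + (r.w * r.h) / (frame_w * frame_h)
    return ratios


def bitrate_increase(ratios, qp_drop):
    """Assumed model: lowering QP by d multiplies a region's bits by 2 ** (d / 6)."""
    return sum(a * (2 ** (qp_drop[k] / 6) - 1) for k, a in ratios.items())


def qp_changes(ratios, preset, coeff, max_increase, iters=60):
    """Return an integer QP change (<= 0) per ROI type within the bitrate budget.

    The QP drop for type k is preset[k] * coeff * t, where the shared scale
    t in [0, 1] is found by bisection (the increase grows monotonically with t).
    """
    def increase(t):
        return bitrate_increase(ratios, {k: preset[k] * coeff * t for k in ratios})

    lo, hi = 0.0, 1.0
    if increase(hi) <= max_increase:
        lo = hi  # the full preset drop already fits the budget
    else:
        for _ in range(iters):
            mid = (lo + hi) / 2
            if increase(mid) <= max_increase:
                lo = mid
            else:
                hi = mid
    # Truncate toward zero so rounding never exceeds the budget.
    return {k: -int(preset[k] * coeff * lo) for k in ratios}


if __name__ == "__main__":
    detected = [ObjectRegion("face", 100, 100),
                ObjectRegion("text", 200, 100),
                ObjectRegion("face", 4, 4)]
    # Hypothetical preset condition: very small regions are treated as non-ROI.
    rois = select_rois(detected, lambda r: r.w * r.h < 64)
    ratios = area_ratio_by_type(rois, 1000, 1000)
    print(qp_changes(ratios, {"face": 2.0, "text": 1.0}, coeff=6.0, max_increase=0.01))
```

In this sketch a larger preset constant gives a type a larger quantization parameter decrease (higher quality), while the shared scale shrinks all decreases together when the ROIs cover more of the frame or the permitted bitrate increase is smaller.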