US 11,889,096 B2
Video codec assisted real-time video enhancement using deep learning
Chen Wang, San Jose, CA (US); Ximin Zhang, San Jose, CA (US); Huan Dou, Beijing (CN); Yi-Jen Chiu, San Jose, CA (US); and Sang-Hee Lee, San Jose, CA (US)
Assigned to Intel Corporation, Santa Clara, CA (US)
Filed by INTEL CORPORATION, Santa Clara, CA (US)
Filed on Jun. 26, 2020, as Appl. No. 16/914,083.
Prior Publication US 2020/0327702 A1, Oct. 15, 2020
Int. Cl. G06T 9/00 (2006.01); G06T 3/40 (2006.01); H04N 19/44 (2014.01); H04N 19/30 (2014.01); G06N 3/08 (2023.01); H04N 19/159 (2014.01); H04N 19/184 (2014.01); H04N 19/176 (2014.01); H04N 19/132 (2014.01); G06F 18/25 (2023.01); G06V 10/82 (2022.01)
CPC H04N 19/44 (2014.11) [G06F 18/251 (2023.01); G06N 3/08 (2013.01); G06T 3/4007 (2013.01); G06T 3/4053 (2013.01); G06T 9/002 (2013.01); G06V 10/82 (2022.01); H04N 19/132 (2014.11); H04N 19/159 (2014.11); H04N 19/176 (2014.11); H04N 19/184 (2014.11); H04N 19/30 (2014.11)] 20 Claims
OG exemplary drawing
 
1. A system for providing adaptive enhancement video processing comprising:
a memory to store at least a portion of a bitstream; and
one or more processors coupled to the memory, the one or more processors to:
decode the bitstream to generate a video frame comprising pixel values at a first resolution, wherein the video frame comprises first and second blocks of pixel values at the first resolution, and wherein the first and second blocks comprise first and second video coding modes, respectively;
apply a deep learning network to the first block in response to the first coding mode to generate a first output block comprising pixel values at a second resolution greater than the first resolution;
bypass application of the deep learning network to the second block and retrieve a prior second output block comprising pixel values at the second resolution for the second block in response to the second coding mode; and
generate an output video frame comprising pixel values at the second resolution corresponding to the video frame based at least on a merge of the first and second output blocks into the output video frame.