US 11,989,852 B2
AI downscaling apparatus and operating method thereof, and AI upscaling apparatus and operating method thereof
Wookhyung Kim, Suwon-si (KR); Ilhyun Cho, Suwon-si (KR); Jayoon Koo, Suwon-si (KR); and Namuk Kim, Suwon-si (KR)
Assigned to SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Appl. No. 17/312,276
Filed by SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
PCT Filed Jan. 11, 2021, PCT No. PCT/KR2021/000346
§ 371(c)(1), (2) Date Jun. 9, 2021,
PCT Pub. No. WO2021/210761, PCT Pub. Date Oct. 21, 2021.
Claims priority of application No. 10-2020-0045233 (KR), filed on Apr. 14, 2020.
Prior Publication US 2023/0177638 A1, Jun. 8, 2023
Int. Cl. G06T 3/40 (2006.01); G06T 3/4046 (2024.01); G06T 3/4053 (2024.01)
CPC G06T 3/4046 (2013.01) [G06T 3/4053 (2013.01)] 15 Claims
 
1. An artificial intelligence (AI) upscaling apparatus comprising:
a memory storing one or more instructions; and
a processor configured to execute the one or more instructions stored in the memory,
wherein the processor is configured to execute the one or more instructions to:
obtain a second image corresponding to a first image which is downscaled from an original image by an AI downscaling apparatus by using a first deep neural network (DNN); and
obtain a third image by upscaling the second image by using a second DNN corresponding to the first DNN, and
wherein the second DNN is trained to minimize a difference between a first restored image for an original training image and second restored images for translation images,
wherein the first restored image is obtained by performing downscaling by the first DNN on an image obtained by applying no pixel movement to the original training image, and performing upscaling by the second DNN on the downscaled image,
wherein the second restored images are obtained by performing downscaling by the first DNN on the translation images obtained by applying pixel movement to the original training image, performing upscaling by the second DNN on the downscaled translation images, and performing retranslation on the upscaled translation images,
wherein the translation images comprise a first translation image generated by applying pixel movement to the original training image in a first direction, and a second translation image generated by applying pixel movement to the original training image in a second direction vertical to the first direction,
wherein the upscaled translation images comprise a first upscaled translation image corresponding to the first translation image and a second upscaled translation image corresponding to the second translation image, and
wherein the retranslation comprises pixel movement applied to the first upscaled translation image in a direction reverse to the first direction of the pixel movement applied to the original training image, and pixel movement applied to the second upscaled translation image in a direction reverse to the second direction of the pixel movement applied to the original training image.
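
The training objective recited in claim 1 can be illustrated with a minimal sketch. The code below is not the patented implementation: the first and second DNNs are replaced by hypothetical fixed stand-ins (2x2 average pooling and nearest-neighbour repetition), pixel movement is assumed to be a circular shift via `np.roll`, and the "difference" is assumed to be a mean squared error, none of which is specified by the claim. The names `downscale`, `upscale`, `translate`, `consistency_loss`, and `SCALE` are illustrative only.

```python
import numpy as np

SCALE = 2  # hypothetical scale factor; the claim does not fix one


def downscale(img):
    """Stand-in for the first DNN: SCALE x SCALE average pooling."""
    h, w = img.shape
    return img.reshape(h // SCALE, SCALE, w // SCALE, SCALE).mean(axis=(1, 3))


def upscale(img):
    """Stand-in for the second DNN: nearest-neighbour repetition."""
    return np.repeat(np.repeat(img, SCALE, axis=0), SCALE, axis=1)


def translate(img, shift, axis):
    """Pixel movement, assumed here to be a circular shift."""
    return np.roll(img, shift, axis=axis)


def consistency_loss(original, shift=1):
    # First restored image: no pixel movement, then downscale and upscale.
    first_restored = upscale(downscale(translate(original, 0, axis=1)))

    losses = []
    # axis=1 is the first (horizontal) direction, axis=0 the second
    # (vertical) direction, perpendicular to the first.
    for axis in (1, 0):
        moved = translate(original, shift, axis)          # translation image
        restored = upscale(downscale(moved))              # upscaled translation image
        retranslated = translate(restored, -shift, axis)  # reverse pixel movement
        # Assumed difference measure: mean squared error.
        losses.append(np.mean((first_restored - retranslated) ** 2))
    return float(np.mean(losses))


image = np.arange(64, dtype=float).reshape(8, 8)
loss_one_pixel = consistency_loss(image, shift=1)
loss_two_pixels = consistency_loss(image, shift=2)
```

With these stand-ins, a shift equal to the scale factor commutes with the pooling, so `loss_two_pixels` is zero, while a one-pixel shift leaves a nonzero residual in `loss_one_pixel`. That residual is the kind of translation-dependent difference the claimed training is meant to minimize.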