US 11,989,940 B2
Movie detection system and movie detection method
Yen-Hsun Chu, Hsinchu (TW)
Assigned to REALTEK SEMICONDUCTOR CORP., Hsinchu (TW)
Filed by REALTEK SEMICONDUCTOR CORP., Hsinchu (TW)
Filed on Dec. 15, 2021, as Appl. No. 17/551,355.
Claims priority of application No. 110108955 (TW), filed on Mar. 12, 2021.
Prior Publication US 2022/0292824 A1, Sep. 15, 2022
Int. Cl. G06V 20/40 (2022.01); G06N 3/08 (2023.01); G06T 3/40 (2006.01); G06V 10/26 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01)
CPC G06V 20/41 (2022.01) [G06N 3/08 (2013.01); G06T 3/40 (2013.01); G06V 10/26 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01); G06V 20/49 (2022.01)] 14 Claims
OG exemplary drawing
 
1. A movie detection method, comprising:
configuring an electronic device to play an input video source;
configuring a processor of the electronic device to capture a current image of the input video source and store the current image in a memory, wherein the current image has a first image scale;
performing a pre-processing process on the current image, wherein the pre-processing process includes configuring the processor to:
perform an image scaling process on the current image to generate a zoomed current image having a second image scale, wherein the second image scale is smaller than the first image scale;
perform a cutting process on the zoomed current image, and only retain a top and a bottom of the zoomed current image; and
join the top and the bottom to generate a joined current image; and
configuring the processor to input the joined current image into a trained machine learning model to classify the joined current image as a movie image or a non-movie image,
wherein the trained machine learning model is generated by performing a training process on a machine learning model, and the training process is performed based on a plurality of joined training images,
wherein the plurality of joined training images are generated by performing the pre-processing process on a plurality of movie images with black bars and a plurality of non-movie images without black bars, and the plurality of joined training images are respectively marked as the movie images and the non-movie images to be used as expected outputs of the machine learning model in the training process.