US 11,989,940 B2
	Movie detection system and movie detection method
Yen-Hsun Chu, Hsinchu (TW)
Assigned to REALTEK SEMICONDUCTOR CORP., Hsinchu (TW)
Filed by REALTEK SEMICONDUCTOR CORP., Hsinchu (TW)
Filed on Dec. 15, 2021, as Appl. No. 17/551,355.
Claims priority of application No. 110108955 (TW), filed on Mar. 12, 2021.
Prior Publication US 2022/0292824 A1, Sep. 15, 2022
Int. Cl. G06V 20/40 (2022.01); G06N 3/08 (2023.01); G06T 3/40 (2006.01); G06V 10/26 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01)

CPC G06V 20/41 (2022.01) [G06N 3/08 (2013.01); G06T 3/40 (2013.01); G06V 10/26 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01); G06V 20/49 (2022.01)]

14 Claims

1. A movie detection method, comprising:

configuring an electronic device to play an input video source;

configuring a processor of the electronic device to capture a current image of the input video source and store the current image in a memory, wherein the current image has a first image scale;

performing a pre-processing process on the current image, wherein the pre-processing process includes configuring the processor to:

perform an image scaling process on the current image to generate a zoomed current image having a second image scale, wherein the second image scale is smaller than the first image scale;

perform a cutting process on the zoomed current image, and only retain a top and a bottom of the zoomed current image; and

join the top and the bottom to generate a joined current image; and

configuring the processor to input the joined current image into a trained machine learning model to classify the joined current image as a movie image or a non-movie image,

wherein the trained machine learning model is generated by performing a training process on a machine learning model, and the training process is performed based on a plurality of joined training images,

wherein the plurality of joined training images are generated by performing the pre-processing process on a plurality of movie images with black bars and a plurality of non-movie images without black bars, and the plurality of joined training images are respectively marked as the movie images and the non-movie images to be used as expected outputs of the machine learning model in the training process.