US 12,010,393 B2
Automatic appending of subtitles based on media context
Jian Dong Yin, Beijing (CN); Wen Wang, Beijing (CN); Zhuo Cai, Beijing (CN); Rong Fu, Ningbo (CN); Hao Sheng, Ningbo (CN); and Kang Zhang, Shanghai (CN)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Jul. 28, 2021, as Appl. No. 17/387,316.
Prior Publication US 2023/0030342 A1, Feb. 2, 2023
Int. Cl. H04N 21/488 (2011.01); G06V 20/20 (2022.01); G06V 20/52 (2022.01); H04N 21/439 (2011.01); H04N 21/44 (2011.01)
CPC H04N 21/4884 (2013.01) [G06V 20/20 (2022.01); G06V 20/52 (2022.01); H04N 21/4394 (2013.01); H04N 21/44008 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A system for automatically appending additional transcription in a media, the system comprising:
a memory; and
a processor in communication with the memory, the processor being configured to perform operations comprising:
extract dialog from a video fragment, wherein dialog is human voices;
determine one or more quality features from the extracted dialog;
identify a tone of the dialog based on the one or more quality features;
generate, automatically, one or more transcripts based on a media context, wherein the one or more transcripts are not a part of an existing transcript of the media;
append at least one of the one or more transcripts to the media, wherein appending the at least one of the one or more transcripts to the media includes relaying a mood associated with the identified tone of the dialog; and
modify the at least one of the one or more transcripts based on an adjustment to a weight factor.