US 11,670,284 B2
Systems and methods for adjusting dubbed speech based on context of a scene
Mario Sanchez, San Jose, CA (US); Ashleigh Miller, Denver, CO (US); and Paul T. Stathacopoulos, San Carlos, CA (US)
Assigned to Rovi Guides, Inc., San Jose, CA (US)
Filed by Rovi Guides, Inc., San Jose, CA (US)
Filed on Sep. 21, 2021, as Appl. No. 17/480,550.
Application 17/480,550 is a continuation of application No. 16/934,230, filed on Jul. 21, 2020, granted, now 11,151,980.
Application 16/934,230 is a continuation of application No. 16/610,225, granted, now 10,755,724, issued on Aug. 25, 2020, previously published as PCT/US2017/030971, filed on May 4, 2017.
Prior Publication US 2022/0005455 A1, Jan. 6, 2022
Int. Cl. H04N 5/93 (2006.01); G10L 13/033 (2013.01); G10L 15/18 (2013.01); G10L 15/22 (2006.01); G10L 25/51 (2013.01); H04N 9/802 (2006.01); G10L 21/02 (2013.01); G06V 40/16 (2022.01)
CPC G10L 13/033 (2013.01) [G06V 40/174 (2022.01); G10L 15/1815 (2013.01); G10L 15/22 (2013.01); G10L 21/02 (2013.01); G10L 25/51 (2013.01); H04N 9/802 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
analyzing metadata of a media content portion to determine a context of the media content portion, the media content portion comprising audio and video;
analyzing the media content portion to identify modifications to the audio in the media content portion to cause the audio in the media content portion to match the context of the media content portion;
modifying the audio in the media content portion to match the context of the media content portion using the identified modifications to the audio in the media content portion; and
generating for output the media content portion with the modified audio.