| CPC G10L 21/055 (2013.01) [G10L 15/02 (2013.01); G10L 15/22 (2013.01); G10L 15/26 (2013.01); H04N 21/43074 (2020.08); H04N 21/4394 (2013.01); H04N 21/4884 (2013.01); G10L 2015/025 (2013.01)] | 24 Claims |

|
20. A system comprising:
a sending device comprising:
one or more first processors; and
first memory storing first instructions that, when executed by the one or more first processors, cause the sending device to:
determine a first transcript of an audio portion of media content, wherein the first transcript comprises timing information synchronizing first words of the first transcript with the media content;
based on a correlation between first words of the first transcript and second words of a second transcript of the audio portion, determine an updated second transcript that comprises timing information synchronizing the second words, of the second transcript of the audio portion, with the media content; and
determine, based on the updated second transcript, caption data associated with the media content; and,
a receiving device comprising:
one or more second processors; and
second memory storing second instructions that, when executed by the one or more second processors, cause the receiving device to display the media content and captions of the caption data.
|
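As an illustration of the sending-device steps in claim 20, the sketch below carries timing from a timed first transcript over to an untimed second transcript by matching their words, then groups the result into caption cues. It is a minimal example under stated assumptions, not the claimed implementation: the claim does not say how the correlation is computed, so word-sequence matching with Python's `difflib.SequenceMatcher` stands in for it. The names `TimedWord`, `transfer_timing`, and `build_captions`, the carry-forward rule for unmatched words, and the fixed words-per-cue grouping are all hypothetical.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional, Tuple


@dataclass
class TimedWord:
    text: str
    start: Optional[float]  # seconds into the media content; None if unknown


def transfer_timing(first: List[TimedWord], second_words: List[str]) -> List[TimedWord]:
    """Return the second transcript with timing copied from matching first-transcript words.

    Assumption: "correlation" is approximated by case-insensitive exact word matching.
    """
    a = [w.text.lower() for w in first]
    b = [w.lower() for w in second_words]
    updated = [TimedWord(w, None) for w in second_words]

    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            updated[block.b + k].start = first[block.a + k].start

    # Assumption: an unmatched word inherits the start time of the preceding
    # timed word (or 0.0 if there is none yet).
    last = 0.0
    for word in updated:
        if word.start is None:
            word.start = last
        else:
            last = word.start
    return updated


def build_captions(words: List[TimedWord], words_per_cue: int = 3) -> List[Tuple[float, str]]:
    """Group timed words into (start_time, text) cues of a fixed size (hypothetical rule)."""
    cues = []
    for i in range(0, len(words), words_per_cue):
        chunk = words[i:i + words_per_cue]
        cues.append((chunk[0].start, " ".join(w.text for w in chunk)))
    return cues


# First transcript: timed, but with a recognition error ("brown" instead of "red").
first_transcript = [
    TimedWord("the", 0.0),
    TimedWord("quick", 0.4),
    TimedWord("brown", 0.8),
    TimedWord("fox", 1.2),
]
# Second transcript: accurate words, no timing.
second_transcript = ["The", "quick", "red", "fox"]

updated_second = transfer_timing(first_transcript, second_transcript)
captions = build_captions(updated_second, words_per_cue=2)
```

In this example the second transcript keeps its own wording ("red") while taking its timing from the first transcript. A receiving device would then display each cue at its start time alongside the media content.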