| CPC H04N 21/262 (2013.01) [H04N 21/2187 (2013.01); H04N 21/233 (2013.01); H04N 21/23418 (2013.01); H04N 21/251 (2013.01); H04N 21/44218 (2013.01); H04N 21/4788 (2013.01); H04N 21/812 (2013.01)] | 20 Claims |

|
1. A method comprising:
obtaining video data and chat log text, wherein the chat log text is aligned with a timeline of the video data;
encoding the video data and the chat log text using a fusion encoder model to obtain a combined feature vector representing the video data and the chat log text;
generating a moment importance score for a time of the video data by decoding the combined feature vector using a decoder of a machine learning model, wherein the moment importance score indicates a probability that the time of the video comprises a key moment; and
presenting content to a user at the time of the video data based on the moment importance score.
|