US 12,235,888 B2
Content summarization leveraging systems and processes for key moment identification and extraction
Piyush Saggi, Atlanta, GA (US); Nitesh Chhajwani, Atlanta, GA (US); and Thomas Ploetz, Atlanta, GA (US)
Assigned to SalesTing, Inc., Atlanta, GA (US)
Filed by SalesTing, Inc., Atlanta, GA (US)
Filed on Nov. 6, 2023, as Appl. No. 18/502,466.
Application 18/502,466 is a continuation of application No. 16/881,615, filed on May 22, 2020, granted, now 11,836,181.
Claims priority of provisional application 62/851,434, filed on May 22, 2019.
Prior Publication US 2024/0070187 A1, Feb. 29, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06V 20/40 (2022.01); G06F 16/45 (2019.01); G06F 16/483 (2019.01); G06N 20/00 (2019.01)
CPC G06F 16/45 (2019.01) [G06F 16/483 (2019.01); G06N 20/00 (2019.01); G06V 20/41 (2022.01); G06V 20/47 (2022.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
receiving multimedia content comprising a plurality of frames;
obtaining a transcript of the multimedia content;
generating a plurality of keywords based on the transcript;
mapping the plurality of keywords across each of the plurality of frames, frame by frame, wherein the mapping comprises:
excluding one or more sentences from the transcript that do not contain a keyword from the plurality of keywords;
annotating the transcript with one or more timestamps based on the plurality of keywords; and
generating a keyword mapping comprising the transcript with the one or more timestamps and without the one or more sentences;
computing, for each of the plurality of frames, an importance score based on the keyword mapping and a chapter score, wherein the chapter score measures transitions between semantically coherent units of the multimedia content;
generating a ranking of the plurality of frames based on the importance scores and the chapter scores;
determining one or more top-ranked frames from the ranking that satisfy an importance threshold;
merging the one or more top-ranked frames into one or more moments; and
aggregating the one or more moments into a summarization of the multimedia content.