CPC G10L 15/22 (2013.01) [G06F 40/295 (2020.01); G10L 15/26 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |
1. A method comprising:
accessing, by one or more processors, an annotation file for an audio file, the annotation file comprising a text transcription of the audio file;
determining, by the one or more processors, based on the text transcription, a topic for each segment of a plurality of segments of a first predetermined length;
determining, by the one or more processors, a confidence level of the topic for each segment of the plurality of segments;
determining, by the one or more processors, a topic for each larger segment of a plurality of larger segments, each larger segment comprising a predetermined number of consecutive component segments of the plurality of segments, the determining of the topic for a larger segment based on the topics of the component segments and the confidence levels of the topics of the component segments; and
modifying the annotation file to include the determined topics for the plurality of larger segments.
|