| CPC G10L 25/63 (2013.01) [G10L 15/04 (2013.01); G10L 15/22 (2013.01); G10L 25/30 (2013.01)] | 20 Claims |

|
1. A method, comprising:
at an electronic device:
receiving an audio file;
dividing the audio file into a plurality of segments;
automatically, without user input, determining, for each respective segment, a descriptor from a plurality of descriptors and a value of the descriptor for the segment, wherein determining the descriptor for each respective segment comprises:
applying a rolling time window to the respective segment to generate a set of descriptors, each descriptor in the set of descriptors corresponding to a respective time window of the respective segment, wherein the rolling time window is shorter than a length of the respective segment;
selecting one or more segments, less than all, of the plurality of segments, based on a comparison of the respective values of respective descriptors for respective segments and genre-specific criteria selected based on a genre of the audio file; and
generating a summarized version of the audio file by arranging the selected one or more segments.
|
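The steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not specify the descriptors, how a segment-level descriptor is derived from the per-window set, or the form of the genre-specific criteria. The sketch assumes a hypothetical RMS-energy label ("loud" or "quiet") for each time window, takes the most frequent label as the segment's descriptor and its share of windows as the value, and uses an invented `GENRE_CRITERIA` table. All function and parameter names are illustrative.

```python
from collections import Counter
from math import sqrt

# Hypothetical genre-specific criteria: (required descriptor, minimum value).
GENRE_CRITERIA = {
    "interview": ("loud", 0.5),
    "lecture": ("quiet", 0.5),
}


def window_descriptor(window, loud_threshold=0.3):
    """Label one time window by RMS energy (an assumed stand-in descriptor)."""
    rms = sqrt(sum(x * x for x in window) / len(window))
    return "loud" if rms >= loud_threshold else "quiet"


def segment_descriptor(segment, window_len, hop):
    """Apply a rolling window shorter than the segment.

    Returns the dominant per-window label and the share of windows carrying it.
    """
    if window_len >= len(segment):
        raise ValueError("rolling window must be shorter than the segment")
    labels = [
        window_descriptor(segment[i:i + window_len])
        for i in range(0, len(segment) - window_len + 1, hop)
    ]
    label, count = Counter(labels).most_common(1)[0]
    return label, count / len(labels)


def summarize(samples, genre, segment_len, window_len, hop):
    """Divide, describe, select fewer than all segments, and arrange them."""
    # Divide the audio into fixed-length segments; drop a tail too short to window.
    segments = [samples[i:i + segment_len]
                for i in range(0, len(samples), segment_len)]
    segments = [s for s in segments if len(s) > window_len]

    wanted, minimum = GENRE_CRITERIA[genre]
    chosen = []  # (segment index, descriptor value)
    for index, segment in enumerate(segments):
        label, value = segment_descriptor(segment, window_len, hop)
        if label == wanted and value >= minimum:
            chosen.append((index, value))

    # The claim selects fewer than all segments, so drop the weakest match
    # if every segment qualified (an assumed tie-breaking rule).
    if chosen and len(chosen) == len(segments):
        chosen.remove(min(chosen, key=lambda item: item[1]))

    # Arrange the selected segments in their original order.
    summary = []
    for index, _ in sorted(chosen):
        summary.extend(segments[index])
    return summary
```

For example, `summarize([0.0] * 8 + [1.0] * 8 + [0.0] * 8, "interview", segment_len=8, window_len=4, hop=2)` keeps only the middle segment, while the `"lecture"` criteria keep the two silent segments on either side of it. Keeping the segments in source order is one possible arrangement; the claim itself only requires that the selected segments be arranged.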