CPC G06F 16/41 (2019.01) [G06F 16/13 (2019.01); G06F 16/489 (2019.01); G10L 15/04 (2013.01); G10L 15/08 (2013.01); G10L 15/1822 (2013.01); G10L 2015/088 (2013.01); H04L 67/01 (2022.05)] | 15 Claims |
1. A computer-implemented method comprising:
as implemented by one or more computing devices configured with specific executable instructions,
obtaining a media file;
obtaining a second media file, wherein the second media file is a lower resolution version of the media file;
identifying a portion of an audio stream of the media file, wherein the portion of the audio stream is bounded by a first pause in speech at a start of the portion and a second pause in speech at an end of the portion, and wherein the start of the portion is mapped to a time code of the media file;
performing speech recognition at least on the portion to generate a transcript;
determining that the transcript comprises a keyword, wherein the keyword is a word describing content of the media file;
storing a mapping of the keyword and the portion to the time code in an index;
receiving from a user device an identification of the media file and the keyword;
querying the index to determine that the time code is mapped to the keyword and to identify a range of bytes corresponding to the time code;
receiving a request for the second media file from a user device;
transmitting the second media file to the user device;
receiving a second request for a portion of the media file, wherein the second request comprises the keyword;
querying the index to determine that the time code is mapped to the keyword and to identify a range of bytes in the media file corresponding to the time code; and
transmitting the range of bytes to the user device to satisfy the second request.
|