US 12,124,498 B2
Time code to byte indexer for partial object retrieval
Jeremiah Dunham, Portland, OR (US); Andrew Tunall, Portland, OR (US); Benjamin Schwartz, Portland, OR (US); Jason LaPier, Portland, OR (US); and Justin Abrahms, Portland, OR (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Jan. 9, 2020, as Appl. No. 16/738,808.
Application 16/738,808 is a continuation of application No. 15/274,865, filed on Sep. 23, 2016, granted, now 10,546,011.
Prior Publication US 2020/0151208 A1, May 14, 2020
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/41 (2019.01); G06F 16/13 (2019.01); G06F 16/48 (2019.01); G10L 15/04 (2013.01); G10L 15/08 (2006.01); G10L 15/18 (2013.01); H04L 67/01 (2022.01)
CPC G06F 16/41 (2019.01) [G06F 16/13 (2019.01); G06F 16/489 (2019.01); G10L 15/04 (2013.01); G10L 15/08 (2013.01); G10L 15/1822 (2013.01); G10L 2015/088 (2013.01); H04L 67/01 (2022.05)] 15 Claims
OG exemplary drawing
 
1. A computer-implemented method comprising:
as implemented by one or more computing devices configured with specific executable instructions,
obtaining a media file;
obtaining a second media file, wherein the second media file is a lower resolution version of the media file;
identifying a portion of an audio stream of the media file, wherein the portion of the audio stream is bounded by a first pause in speech at a start of the portion and a second pause in speech at an end of the portion, and wherein the start of the portion is mapped to a time code of the media file;
performing speech recognition at least on the portion to generate a transcript;
determining that the transcript comprises a keyword, wherein the keyword is a word describing content of the media file;
storing a mapping of the keyword and the portion to the time code in an index;
receiving from a user device an identification of the media file and the keyword;
querying the index to determine that the time code is mapped to the keyword and to identify a range of bytes corresponding to the time code;
receiving a request for the second media file from a user device;
transmitting the second media file to the user device;
receiving a second request for a portion of the media file, wherein the second request comprises the keyword;
querying the index to determine that the time code is mapped to the keyword and to identify a range of bytes in the media file corresponding to the time code; and
transmitting the range of bytes to the user device to satisfy the second request.