US 11,929,096 B1
Content-based adaptive speed playback
Mnatsakan Sharafyan, Yerevan (AM); Gor Matevosyan, Yerevan (AM); and Levon Khachatryan, Yerevan (AM)
Assigned to SAIMA, INC., Berkeley, CA (US)
Filed by INTED INC., Berkeley, CA (US)
Filed on Mar. 30, 2023, as Appl. No. 18/128,255.
Claims priority of provisional application 63/405,790, filed on Sep. 12, 2022.
Int. Cl. G11B 27/00 (2006.01); G10L 15/08 (2006.01)
CPC G11B 27/005 (2013.01) [G10L 15/08 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A computer-implemented method comprising: receiving media data, wherein the media data is associated with multimedia to be rendered at particular speed on a multimedia player on a computing device; determining a content density index value for a particular portion of the multimedia, the content density index value, representing a rate of content within the particular portion that a user of the multimedia player has to comprehend, at least, by determining whether each time interval period in a plurality of time interval periods of the particular portion contains speech, wherein the determining whether said each time interval period contains speech comprises: receiving voice-to-text textual data for the particular portion of the multimedia, or converting audio multimedia of the particular portion to the voice-to-text textual data, determining which respective segment of the voice-to-text textual data corresponds to said each time interval period in the plurality of time interval periods of the particular portion, based, at least in part, on existence of at least one lexical unit in the respective segment of the voice-to-text textual data that corresponds to said each time interval period, determining whether said each time interval period in the plurality of time interval periods of the particular portion contains speech, and generating one or more indications indicating one or more speech-time interval periods of the plurality of time interval periods of the particular portion as containing speech and one or more speech-free-time interval periods of the plurality of time interval periods as speech-free; receiving a desired content density comprehension index value associated with the user of the multimedia player, the desired content density comprehension index value representing a desired rate of content that the user has requested to comprehend; based, at least in part, on the content density index value for the particular portion and based, at least in part, on comparing the desired content density comprehension index value with the content density index value for the particular portion, determining a new speed for rendering at least a portion of the multimedia to match to the desired rate of content that the user has requested to comprehend; storing the new speed associated with the at least portion of the multimedia for rendering the at least portion of the multimedia at the new speed that is different from the particular speed.