US 12,300,233 B2
Media segment representation using fixed weights
Stephane Villette, San Diego, CA (US); Sen Li, San Diego, CA (US); and Daniel Jared Sinder, San Diego, CA (US)
Assigned to QUALCOMM Incorporated, San Diego, CA (US)
Filed by QUALCOMM Incorporated, San Diego, CA (US)
Filed on Oct. 18, 2022, as Appl. No. 18/047,562.
Prior Publication US 2024/0127809 A1, Apr. 18, 2024
Int. Cl. G10L 15/00 (2013.01); G10L 15/04 (2013.01); G10L 15/16 (2006.01); G10L 15/22 (2006.01); G10L 25/78 (2013.01)
CPC G10L 15/22 (2013.01) [G10L 15/04 (2013.01); G10L 15/16 (2013.01); G10L 25/78 (2013.01)] 28 Claims
OG exemplary drawing
 
1. A device comprising:
a memory configured to store a collection of sets of weights, each of the sets of weights representing a respective media segment;
one or more processors configured to:
detect a first input speech segment;
generate data representing the detected first input speech segment;
pass the data representing the detected first input speech segment into a collection of memory units, each memory unit of the collection of memory units including a set of weights from the collection of sets of weights, wherein each of the sets of weights represent one or more media parameters of the respective media segment associated with that set of weights, and wherein the one or more media parameters include at least one of: speech parameters including pulse code modulated (PCM) sample values associated with a respective memory unit, compressed representations of the PCM sample values associated with the respective memory unit, or acoustic features associated with the respective memory unit; and
generate a first estimate of an associated media segment that represents the detected first input speech segment, the associated media segment corresponding to a first memory unit in the collection of memory units.