US 11,949,946 B2
Dynamic insertion of supplemental audio content into audio recordings at request time
Rachel Hausmann, Mountain View, CA (US); and Collin Irwin, Mountain View, CA (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Appl. No. 16/621,334
Filed by Google LLC, Mountain View, CA (US)
PCT Filed Nov. 26, 2019, PCT No. PCT/US2019/063317
§ 371(c)(1), (2) Date Dec. 11, 2019,
PCT Pub. No. WO2021/107932, PCT Pub. Date Jun. 3, 2021.
Prior Publication US 2022/0286732 A1, Sep. 8, 2022
Int. Cl. H04N 21/439 (2011.01); G10L 15/18 (2013.01); G11B 27/029 (2006.01); H04N 21/233 (2011.01); H04N 21/81 (2011.01)
CPC H04N 21/4394 (2013.01) [G10L 15/1822 (2013.01); G11B 27/029 (2013.01); H04N 21/2335 (2013.01); H04N 21/8106 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A system to insert supplemental audio content into primary audio content via digital assistant applications, comprising:
a record indexer component executed on a data processing system having one or more processors to maintain, on a database, an audio recording of a content publisher and a content spot marker set by the content publisher, the content spot marker specifying a content spot defining a time or time window within the audio recording;
a natural language processor component executed on the data processing system to:
receive an audio data packet including an input audio signal that is detected by a sensor of a client device;
parse the input audio signal from the audio data packet to determine that the input audio signal corresponds to a request for the audio recording from the content publisher; and
identify, based on the request determined from the input audio signal, the audio recording of the content publisher from the database;
a content placement component executed on the data processing system to:
identify, responsive to the determination that the input audio signal corresponds to the request for the audio recording, an identifier associated with the client device as a content selection parameter; and
select, for the content spot of the audio recording, an audio content item of a content provider from a plurality of audio content items using the content selection parameter; and
an action handler component executed on the data processing system to:
insert the audio content item into the content spot of the audio recording specified by the content spot marker;
generate an action data structure including the audio recording inserted with audio content item at the time defined by the content spot marker; and
transmit the action data structure to the client device to present the audio recording inserted with the audio content item at the content spot.