US 11,722,731 B2
Integrating short-term context for content playback adaption
Victor Carbune, Zürich (CH); and Matthew Sharifi, Kilchberg (CH)
Assigned to Google LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Nov. 24, 2020, as Appl. No. 17/103,908.
Prior Publication US 2022/0167049 A1, May 26, 2022
Int. Cl. H04N 21/422 (2011.01); H04N 21/4223 (2011.01); H04N 21/433 (2011.01); H04N 21/439 (2011.01); H04N 21/442 (2011.01); H04N 21/458 (2011.01); G06F 3/16 (2006.01); G06N 3/08 (2023.01); H04N 21/45 (2011.01); H04N 21/466 (2011.01); H04N 21/472 (2011.01)
CPC H04N 21/458 (2013.01) [H04N 21/4396 (2013.01); H04N 21/44218 (2013.01); H04N 21/4532 (2013.01); H04N 21/466 (2013.01); H04N 21/47202 (2013.01); H04N 21/47217 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
receiving, at data processing hardware of an assistant-enabled device while the assistant-enabled device is playing back media content in an environment of the assistant-enabled device, a contextual signal representing the environment, the contextual signal comprising at least one of audio present in the environment detected by a microphone of the assistant-enabled device or image data representing an image of the environment captured by an image capture device of the assistant-enabled device;
executing, by the data processing hardware of the assistant-enabled device while the assistant-enabled device is playing back the media content, an event recognition routine to determine whether the received contextual signal is indicative of an event present in the environment that conflicts with the playback of the media content in the environment from the assistant-enabled device; and
in response to the event recognition routine determining that the received contextual signal is indicative of the event present in the environment that conflicts with the playback of the media content in the environment, automatically adjusting, by the data processing hardware of the assistant-enabled device, content playback settings of the assistant-enabled device while continuing to receive the contextual signal that is indicative of the event present in the environment that conflicts with the playback of the media content in the environment from the assistant-enabled device,
wherein executing the event recognition routine comprises executing a neural network-based classification model configured to receive the contextual signal as input and generate, as output, a classification result indicating whether the received contextual signal is indicative of the event present in the environment that conflicts with the playback of the media content in the environment from the assistant-enabled device.