US 12,283,269 B2
Intent inference in audiovisual communication sessions
Paul Bates, Seattle, WA (US)
Assigned to Sonos, Inc., Santa Barbara, CA (US)
Filed by Sonos, Inc., Santa Barbara, CA (US)
Filed on Oct. 14, 2021, as Appl. No. 17/450,925.
Claims priority of provisional application 63/092,686, filed on Oct. 16, 2020.
Prior Publication US 2022/0122583 A1, Apr. 21, 2022
Int. Cl. G10L 15/22 (2006.01); G10L 15/05 (2013.01); G10L 15/07 (2013.01); H04L 12/18 (2006.01)
CPC G10L 15/05 (2013.01) [G10L 15/07 (2013.01); G10L 15/22 (2013.01); H04L 12/1818 (2013.01); H04L 12/1831 (2013.01); G10L 2015/223 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A network microphone device comprising:
one or more microphones;
a network interface;
one or more processors;
data storage having instructions stored therein that, when executed by the one or more processors, cause the network microphone device to perform operations comprising:
capturing voice input from a first user via the one or more microphones during an ongoing communication session involving at least the first user, a second user, and a third user;
transmitting the voice input to one or more remote computing devices for the communication session;
analyzing the voice input to detect one or more utterances from the first user;
monitoring a context parameter of the communication session,
based on the one or more utterances detected during the ongoing communication session, inferring an intent of the first user; and
based on the inferred intent of the first user and the context parameter, causing a user prompt to be displayed via a first display device communicatively coupled to the network microphone device, the first display device associated with the second user, wherein the user prompt is not displayed via a second display device communicatively coupled to the network microphone device, the second display device associated with the third user.