US 11,736,665 B1
Custom and automated audio prompts for devices
Elliott Lemberger, Santa Monica, CA (US); John Modestine, Redondo Beach, CA (US); Kevin Park, Los Angeles, CA (US); Richard Carter Mosher, Atherton, CA (US); Trevor Grolle, Mesa, AZ (US); and Kirk David Bacon, Westminster, CA (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Sep. 3, 2021, as Appl. No. 17/466,959.
Application 17/466,959 is a continuation of application No. 16/359,520, filed on Mar. 20, 2019, granted, now 11,115,630.
Claims priority of provisional application 62/649,504, filed on Mar. 28, 2018.
Int. Cl. H04N 7/18 (2006.01); G06F 3/16 (2006.01); H04L 67/125 (2022.01); G06V 20/20 (2022.01); G06V 20/40 (2022.01); G06V 20/52 (2022.01)
CPC H04N 7/186 (2013.01) [G06F 3/167 (2013.01); G06V 20/20 (2022.01); G06V 20/40 (2022.01); G06V 20/52 (2022.01); H04L 67/125 (2013.01); G06V 20/44 (2022.01)] 20 Claims
OG exemplary drawing
 
1. A system comprising:
one or more processors; and
one or more computer-readable media storing instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
receiving video data representing a visitor detected by an electronic device;
receiving first audio data representing speech of the visitor;
determining, by performing a lookup on a lookup table based on the first audio data, one or more audio messages;
sending the video data to a user device;
sending, to the user device, audio message data indicating the one or more audio messages;
receiving, from the user device, selection data representing a selection of a first audio message of the one or more audio messages; and
based at least in part on the receiving of the selection data, sending, to the electronic device, an instruction to output the first audio message.