US 12,308,045 B2
Acoustic event detection
Qingming Tang, Cambridge, MA (US); Chieh-Chi Kao, Somerville, MA (US); Qin Zhang, Cambridge, MA (US); Ming Sun, Winchester, MA (US); Chao Wang, Newton, MA (US); Sumit Garg, Acton, MA (US); Rong Chen, Boston, MA (US); James Garnet Droppo, Carnation, WA (US); and Chia-Jung Chang, Cambridge, MA (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Sep. 8, 2023, as Appl. No. 18/243,804.
Application 18/243,804 is a continuation of application No. 17/547,644, filed on Dec. 10, 2021, granted, now 11,790,932.
Prior Publication US 2024/0071408 A1, Feb. 29, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 25/51 (2013.01); G06N 3/045 (2023.01); G06N 3/08 (2023.01); G10L 25/21 (2013.01); G10L 25/30 (2013.01); G10L 15/08 (2006.01); G10L 15/22 (2006.01)
CPC G10L 25/51 (2013.01) [G06N 3/045 (2023.01); G06N 3/08 (2013.01); G10L 25/21 (2013.01); G10L 25/30 (2013.01); G10L 15/08 (2013.01); G10L 2015/088 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
receiving first audio data representing occurrence of a first acoustic event;
determining first encoded data representing the first acoustic event;
receiving second audio data representing audio detected by a device;
determining, by processing the second audio data and the first encoded data using a first acoustic event detection (AED) component configured to detect occurrence of one or more acoustic events from a first set of acoustic events, first event detection data representing a first likelihood that at least one acoustic event from the first set of acoustic events is represented in the second audio data;
determining, by processing the first audio data using a second AED component configured to detect occurrence of one or more acoustic events from a second set of acoustic events, second event detection data representing a second likelihood that at least one acoustic event from the second set of acoustic events is represented in the second audio data;
determining, based at least in part on the first event detection data and the second event detection data, that at least one of the first acoustic event from the first set of acoustic events or a second acoustic event from the second set of acoustic events is represented in the second audio data; and
determining output data indicating that at least one of the first acoustic event or the second acoustic event occurred.