CPC G10L 25/51 (2013.01) [G06N 3/045 (2023.01); G06N 3/08 (2013.01); G10L 25/21 (2013.01); G10L 25/30 (2013.01); G10L 15/08 (2013.01); G10L 2015/088 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |
1. A computer-implemented method, comprising:
receiving first audio data representing occurrence of a first acoustic event;
determining first encoded data representing the first acoustic event;
receiving second audio data representing audio detected by a device;
determining, by processing the second audio data and the first encoded data using a first acoustic event detection (AED) component configured to detect occurrence of one or more acoustic events from a first set of acoustic events, first event detection data representing a first likelihood that at least one acoustic event from the first set of acoustic events is represented in the second audio data;
determining, by processing the first audio data using a second AED component configured to detect occurrence of one or more acoustic events from a second set of acoustic events, second event detection data representing a second likelihood that at least one acoustic event from the second set of acoustic events is represented in the second audio data;
determining, based at least in part on the first event detection data and the second event detection data, that at least one of the first acoustic event from the first set of acoustic events or a second acoustic event from the second set of acoustic events is represented in the second audio data; and
determining output data indicating that at least one of the first acoustic event or the second acoustic event occurred.
|