US 12,154,585 B2
Voice activity detection
Elie Bou Daher, Marlborough, MA (US); Vigneish Kathavarayan, Marlborough, MA (US); and Cristian Marius Hera, Lancaster, MA (US)
Assigned to Bose Corporation, Framingham, MA (US)
Filed by Bose Corporation, Framingham, MA (US)
Filed on Feb. 25, 2022, as Appl. No. 17/680,559.
Prior Publication US 2023/0274753 A1, Aug. 31, 2023
Int. Cl. G10L 25/84 (2013.01); G10L 21/0224 (2013.01); G10L 25/78 (2013.01); G10L 21/0208 (2013.01); G10L 21/0216 (2013.01)
CPC G10L 21/0224 (2013.01) [G10L 25/78 (2013.01); G10L 25/84 (2013.01); G10L 21/0208 (2013.01); G10L 2021/02166 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method of detecting voice activity, the method comprising:
receiving a primary signal representative of acoustic energy in a detection region, the primary signal configured to include a speech component representative of a user's speech when the user is speaking;
receiving a reference signal representative of acoustic energy in the detection region, the reference signal configured to include a reduced speech component relative to the primary signal;
detecting a condition of the detection region, wherein the detected condition is indicative of an acoustic energy level in the detection region;
selecting a threshold value from among two or more values for determining whether the user is speaking, the threshold value selected based upon the detected condition in the region;
comparing the primary signal to the reference signal with respect to the selected threshold value; and
providing a binary indication of whether the user is speaking based at least in part upon the comparison.