CPC G10L 25/78 (2013.01) [G02B 27/0093 (2013.01); G02B 27/017 (2013.01); G06F 3/011 (2013.01); G06F 3/017 (2013.01); G06F 17/18 (2013.01); G10L 25/51 (2013.01); H04R 3/005 (2013.01); H04R 3/04 (2013.01); H04R 5/04 (2013.01); G10L 2025/783 (2013.01)] | 20 Claims |
1. A system comprising:
a wearable head device comprising:
a frame;
a first microphone disposed on the frame, the first microphone configured to rest at a first distance from a user's mouth when the frame is worn by the user; and
a second microphone disposed on the frame, the second microphone configured to rest at a second distance from the user's mouth when the frame is worn by the user, the second distance unequal to the first distance; and
one or more processors configured to perform a method comprising:
receiving, via the first microphone, a first voice audio signal;
determining a first probability of voice activity based on the first voice audio signal;
receiving, via the second microphone, a second voice audio signal;
determining a second probability of voice activity based on the first voice audio signal and the second voice audio signal;
determining whether a first threshold of voice activity is met based on the first probability of voice activity and the second probability of voice activity;
in accordance with a determination that the first threshold of voice activity is met, determining that a voice onset has occurred; and
in accordance with a determination that the first threshold of voice activity is not met, forgoing determining that a voice onset has occurred.
|