CPC G10L 15/16 (2013.01) [G06N 3/04 (2013.01); G06N 3/08 (2013.01); G10L 15/02 (2013.01); G10L 15/22 (2013.01); G10L 2015/088 (2013.01)] | 20 Claims |
1. A device configured to detect keywords in sound samples, the device comprising:
one or more microphones;
one or more processors;
a dynamic random-access memory block; and
a computer-readable memory storing instructions that, when executed by the one or more processors, cause the device to perform operations comprising:
capturing a plurality of sound samples via the one or more microphones;
extracting features from the plurality of sound samples, the extraction comprising:
storing the sound samples in a first portion of the memory block;
performing first computations based on spectral coefficients of the sound samples using a second portion of the memory block; and
storing results of the first computations as extracted features in a third portion of the memory block;
designating as temporary memory the second and third portions of the memory block; and
processing the extracted features utilizing a neural network, the processing comprising:
storing the extracted features in a fourth portion of the memory block;
performing second computations on the extracted features using the temporary memory, the second computations comprising computing at least one layer of the neural network; and
storing an output of the neural network as a classification in the temporary memory.
|