CPC G10L 25/66 (2013.01) [G10L 15/22 (2013.01); G10L 15/30 (2013.01)] | 20 Claims |
1. A method, comprising:
obtaining a first sequence of reference-sample feature vectors that quantify acoustic features of different respective portions of at least one reference speech sample, which was produced by a subject at a first time while a physiological state of the subject was known;
obtaining a second sequence of test-sample feature vectors that quantify the acoustic features of different respective portions of at least one test speech sample, which was produced by the subject at a second time while the physiological state of the subject was unknown;
aligning the test speech sample with the reference speech sample, by mapping the test-sample feature vectors to respective ones of the reference-sample feature vectors, under predefined constraints, such that a total distance between the second sequence and the first sequence is minimized; and
in response to aligning the test speech sample with the reference speech sample, generating an output indicating the physiological state of the subject at the second time.
|
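The aligning step of claim 1 can be illustrated with dynamic time warping (DTW). The claim does not name DTW; it is used here only as one well-known way to map one sequence of feature vectors onto another under constraints so that the total distance is minimized. The sketch below is a minimal illustration, not the patented implementation. It makes several assumptions: the distance between feature vectors is Euclidean, the "predefined constraints" are the usual DTW boundary and monotonicity conditions, and the output is derived by comparing the total distance with a hypothetical `threshold`. The names `dtw_align` and `indicate_state` are also hypothetical.

```python
import math


def dtw_align(reference, test):
    """Map test-sample vectors to reference-sample vectors with minimal total distance.

    Assumed constraints (not taken from the claim): the mapping starts at the
    first pair, ends at the last pair, and never moves backwards in either
    sequence. Returns (total_distance, path), where path is a list of
    (reference_index, test_index) pairs.
    """
    n, m = len(reference), len(test)
    # cost[i][j] = minimal total distance aligning reference[:i] with test[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(reference[i - 1], test[j - 1])  # Euclidean distance (assumption)
            cost[i][j] = d + min(cost[i - 1][j - 1], cost[i - 1][j], cost[i][j - 1])

    # Backtrack from the end to recover which vectors were mapped to each other.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        ]
        _, i, j = min(candidates)
    path.reverse()
    return cost[n][m], path


def indicate_state(reference, test, threshold):
    """Generate an output from the alignment (hypothetical decision rule).

    A small total distance is read as "the test sample resembles the reference
    sample", whose physiological state is known; the threshold is illustrative.
    """
    total, _ = dtw_align(reference, test)
    if total <= threshold:
        return "consistent with reference state"
    return "differs from reference state"


# Toy one-dimensional feature vectors; real acoustic features would have many dimensions.
reference = [(0.0,), (1.0,), (2.0,)]
test = [(0.0,), (0.0,), (1.0,), (2.0,)]  # same content, first portion spoken more slowly
total, path = dtw_align(reference, test)
```

In this toy example the two test vectors at the start both map to the first reference vector, so a difference in speaking rate alone does not add to the total distance. That is the reason to align before comparing rather than comparing vectors position by position.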