US 11,929,077 B2
	Multi-stage speaker enrollment in voice authentication and identification
Michael M. Goodwin, Scotts Valley, CA (US); Teodora Ceanga, Brasov (RO); Eloy Geenjaar, Los Gatos, CA (US); Gadiel Seroussi, Cupertino, CA (US); and Brandon Smith, Kirkland, WA (US)
Assigned to DTS Inc., Calabasas, CA (US)
Filed by DTS, Inc., Calabasas, CA (US)
Filed on Dec. 22, 2020, as Appl. No. 17/131,702.
Claims priority of provisional application 62/952,926, filed on Dec. 23, 2019.
Prior Publication US 2021/0193150 A1, Jun. 24, 2021
Int. Cl. G10L 17/04 (2013.01); G06F 21/32 (2013.01); G10L 17/24 (2013.01)

CPC G10L 17/04 (2013.01) [G06F 21/32 (2013.01); G10L 17/24 (2013.01)]

17 Claims

1. A method for enrolling a user in a multi-stage enrollment voice authentication or identification system, comprising:

capturing a plurality of speech samples of the user's speech to obtain user speech samples';

categorizing the plurality of speech samples into the following sentence types: (1) a declarative sentence; (2) an imperative sentence; (3) an interrogative sentence; (4) an exclamatory sentence;

computing feature-space representations for each of the sentence types, wherein a boundary is determined for the feature-space representations for each of the sentence types;

generating a user enrollment voiceprint by aggregating the feature-space representations, wherein the aggregated feature-space representations include at least two different sentence types, wherein the aggregated feature-space representations have a modified boundary;

associating the user enrollment voiceprint with user information; and

storing the user enrollment voiceprint and associated user information in a database of enrolled users.