US 12,230,288 B2
Systems and methods for automated customized voice filtering
Jin Zhang, San Mateo, CA (US); Celeste Bean, San Mateo, CA (US); Sepideh Karimi, San Mateo, CA (US); and Sudha Krishnamurthy, San Mateo, CA (US)
Assigned to SONY INTERACTIVE ENTERTAINMENT LLC, San Mateo, CA (US); and SONY INTERACTIVE ENTERTAINMENT INC., Tokyo (JP)
Filed by SONY INTERACTIVE ENTERTAINMENT LLC, San Mateo, CA (US); and SONY INTERACTIVE ENTERTAINMENT INC., Tokyo (JP)
Filed on May 31, 2022, as Appl. No. 17/828,116.
Prior Publication US 2023/0410824 A1, Dec. 21, 2023
Int. Cl. G10L 19/02 (2013.01); G10L 15/187 (2013.01); G10L 15/22 (2006.01); G10L 21/013 (2013.01); G10L 25/51 (2013.01); G10L 25/90 (2013.01)
CPC G10L 21/013 (2013.01) [G10L 15/187 (2013.01); G10L 15/22 (2013.01); G10L 25/51 (2013.01); G10L 25/90 (2013.01)] 20 Claims
OG exemplary drawing
 
1. An apparatus for audio processing, the apparatus comprising:
at least one memory storing instructions; and
at least one processor that executes the instructions, wherein execution of the instructions by the at least one processor causes the at least one processor to:
receive audio content that includes a voice sample of a voice of a user saying at least one word, the at least one word including a plurality of characters;
analyze the voice sample to identify a sound type in the voice sample, wherein the sound type corresponds to a pronunciation by the user in the voice sample of at least one specified character of the plurality of characters;
generate a filtered voice sample using a personalized filter at least in part by filtering the voice sample to modify the sound type, wherein the personalized filter is customized to the voice of the user based on at least one additional voice sample of the voice of the user; and
output the filtered voice sample.