US 11,887,578 B2
	Automatic dubbing method and apparatus
Henry Gabryjelski, Redmond, WA (US); Jian Luan, Beijing (CN); and Dapeng Li, Beijing (CN)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Nov. 10, 2022, as Appl. No. 17/985,016.
Application 17/985,016 is a continuation of application No. 16/342,416, granted, now 11,514,885, previously published as PCT/CN2016/106554, filed on Nov. 21, 2016.
Prior Publication US 2023/0076258 A1, Mar. 9, 2023
Int. Cl. G10L 13/00 (2006.01); G06F 40/58 (2020.01); G10L 13/08 (2013.01); G10L 17/00 (2013.01)

CPC G10L 13/00 (2013.01) [G06F 40/58 (2020.01); G10L 13/086 (2013.01); G10L 17/00 (2013.01)]

20 Claims

1. An automatic dubbing method, comprising:

extracting speeches of a first voice from an audio portion of a media content;

receiving an audio input of a second voice of a user of a user device;

after receiving the audio input of the second voice of the user of the user device, generating a voice print model for the second voice including a set of phonemes of the second voice using the received audio input;

receiving a selection of the media content for playback on the user device by the user of the user device; and

responsive to receiving the selection of the media content for playback on the user device:

processing the extracted speeches of the first voice by utilizing the voice print model to generate replacement speeches, the replacement speeches generated using the set of phonemes of the second voice;

replacing the extracted speeches of the first voice with the generated replacement speeches in the audio portion of the media content; and

outputting the audio portion with the generated replacement speeches for playback on the user device.