CPC G10L 17/04 (2013.01) [G10L 17/00 (2013.01); G10L 17/06 (2013.01)] | 21 Claims |
1. An audio processing method for automatic speech recognition, ASR, said method comprising:
for each audio segment of multiple audio segments in an audio stream comprising audio data of multiple audio programs, each audio segment comprising speech of a single speaker:
obtaining a speaker identifier of a speaker of said audio segment;
determining a program domain identifier based on a media description, wherein the media description could be any information or data element comprising information and metadata of the audio program; and
associating said speaker identifier, said program domain identifier and a speaker domain identifier with said audio segment to enable generation of ASR adaptation parameters based on said speaker identifier, said program domain identifier and said speaker domain identifier,
wherein said speaker domain identifier for said audio segment is based on information or metadata associated with an audio program of said multiple audio programs and said audio segment comprises audio data of said audio program.
|