CPC G10L 19/02 (2013.01) [G10L 25/78 (2013.01); G10L 25/90 (2013.01); G10L 25/93 (2013.01)] | 20 Claims |
1. A speech coding method, executed by an electronic device, the method comprising:
obtaining a first to-be-encoded speech frame and a subsequent speech frame from an audio signal;
extracting a first speech frame feature corresponding to the first to-be-encoded speech frame, and calculating a first speech frame criticality level corresponding to the first to-be-encoded speech frame based on the first speech frame feature, wherein the first speech frame criticality level represents a level of contribution made by sound quality of the first speech frame to overall speech quality within a period that includes one or more speech frames before the first speech frame and one or more speech frames after the first speech frame;
extracting a second speech frame feature corresponding to the subsequent speech frame, and calculating a second speech frame criticality level corresponding to the subsequent speech frame based on the second speech frame feature, wherein the second speech frame criticality level represents a level of contribution made by sound quality of the second speech frame to the overall speech quality within a period that includes one or more speech frames before the second speech frame and one or more speech frames after the second speech frame;
obtaining a criticality trend feature based on the first speech frame criticality level and the second speech frame criticality level, and determining, using the criticality trend feature, an encoding bit rate corresponding to the first to-be-encoded speech frame, the encoding bit rate corresponding to each to-be-encoded speech frame being controlled adaptively based on criticality trend strength represented by the criticality trend feature; and
encoding the first to-be-encoded speech frame based on the encoding bit rate to obtain an encoding result of the audio signal.
|