| CPC H03M 7/3082 (2013.01) [G06N 3/0455 (2023.01); G06F 17/16 (2013.01); G06N 3/084 (2013.01)] | 20 Claims |

|
1. A computer-implemented system for encoding, comprising:
one or more computers; and
one or more computer memory devices interoperably coupled with the one or more computers and having tangible, non-transitory, machine-readable media storing one or more instructions that, when executed by the one or more computers, perform one or more operations for encoding, comprising:
encoding, using an encoding layer, a received first modal initial feature vector and a received second modal initial feature vector, to generate, respectively, a first modal feature vector and a second modal feature vector; and
performing, using at least one joint encoding unit, joint encoding on the first modal feature vector and the second modal feature vector, wherein the at least one joint encoding unit comprises an encoding module and a modal input switching module, and wherein:
processing, using the modal input switching module, the first modal feature vector and the second modal feature vector, to obtain, respectively, a first modal switching encoding vector and a second modal switching encoding vector, and
processing, using the encoding module, the first modal switching encoding vector and the second modal switching encoding vector, to generate, respectively, a first target modal fusion vector and a second target modal fusion vector.
|
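The data flow recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. The claim does not specify how the modal input switching module or the encoding module operate internally, so the sketch assumes that switching pairs each modality's vector with the other modality's vector by concatenation, and that each encoder is a single linear map followed by tanh. All function names, dimensions, and the random weights are hypothetical.

```python
import math
import random


def make_weights(rows, cols, rng):
    """Build a random rows x cols weight matrix (illustrative values only)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def linear_tanh(vec, weights):
    """Apply a weight matrix to a vector, then tanh (assumed encoder form)."""
    return [math.tanh(sum(w * x for w, x in zip(row, vec))) for row in weights]


def encoding_layer(initial_vec, weights):
    """Encoding layer: map an initial feature vector to a modal feature vector."""
    return linear_tanh(initial_vec, weights)


def modal_input_switching(first_vec, second_vec):
    """ASSUMPTION: switching is modeled as cross-wiring the two inputs.

    Each output places one modality's vector first and the other's second;
    the claim itself leaves this operation unspecified.
    """
    first_switched = first_vec + second_vec   # list concatenation
    second_switched = second_vec + first_vec
    return first_switched, second_switched


def joint_encoding_unit(first_vec, second_vec, weights):
    """Joint encoding unit: switching module followed by the encoding module."""
    first_sw, second_sw = modal_input_switching(first_vec, second_vec)
    # The same encoding module (same weights) processes both switched vectors.
    return linear_tanh(first_sw, weights), linear_tanh(second_sw, weights)


rng = random.Random(0)
hidden = 3  # hypothetical shared feature dimension

# Received initial feature vectors of different sizes (e.g. two modalities).
first_initial = [0.2, -0.1, 0.4, 0.7]
second_initial = [0.5, 0.3, -0.6, 0.1, 0.9]

first_feat = encoding_layer(first_initial, make_weights(hidden, 4, rng))
second_feat = encoding_layer(second_initial, make_weights(hidden, 5, rng))

joint_weights = make_weights(hidden, 2 * hidden, rng)
first_fused, second_fused = joint_encoding_unit(first_feat, second_feat, joint_weights)
```

Here `first_fused` and `second_fused` stand in for the first and second target modal fusion vectors. Because the claim recites "at least one" joint encoding unit, several such units could be chained by feeding each unit's two outputs into the next.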