CPC G06F 40/284 (2020.01) [G06F 40/30 (2020.01); G06N 3/084 (2013.01); G06N 7/01 (2023.01)] | 20 Claims |
1. A method comprising, by a computing device:
accessing at least one first set of tokens associated with a desired task and one or more modalities associated with a context of the desired task;
determining, for the one or more modalities, a second set of tokens using a classifier network associated with at least one modality;
generating a plurality of embedding vectors comprising a first set of embedding vectors mapped to the at least one first set of tokens and a second set of embedding vectors mapped to the second set of tokens, the at least one first set of tokens and the second set of tokens associated with the one or more modalities, wherein the first set of embedding vectors and the second set of embedding vectors are different and are mapped to an embedding space; and
producing a sequence of words addressing the desired task based on determining probability distributions of the words to determine whether to select the words for the sequence and based on processing the plurality of embedding vectors with an encoder-decoder network.
|
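The data flow recited in claim 1 can be illustrated with a minimal sketch. This is a toy illustration under stated assumptions, not the claimed implementation: the vocabulary `VOCAB`, the rule-based `classify_modality` (standing in for a trained classifier network), the randomly initialised embedding tables, the mean-pooling `encode`, and the dot-product `decode_step` are all hypothetical placeholders chosen so the example runs with the Python standard library only.

```python
import math
import random

# Hypothetical output vocabulary; "<eos>" ends the generated sequence.
VOCAB = ["<eos>", "a", "dog", "runs", "outdoors", "photo", "of"]


def classify_modality(features):
    """Toy stand-in for a classifier network: modality features -> second set of tokens."""
    subject = "dog" if features["mean_intensity"] > 0.5 else "photo"
    return [subject, "outdoors"]


def make_embedding_table(tokens, dim, seed):
    """Map each token to a pseudo-random vector (a placeholder for learned embeddings)."""
    rng = random.Random(seed)
    return {tok: [rng.uniform(-1.0, 1.0) for _ in range(dim)] for tok in tokens}


def softmax(scores):
    """Turn raw scores into a probability distribution."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def encode(vectors):
    """Toy encoder: mean-pool all embedding vectors into one context vector."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def decode_step(context, prev_vec, out_table):
    """Toy decoder step: probability distribution over VOCAB for the next word."""
    state = [c + p for c, p in zip(context, prev_vec)]
    scores = [sum(s * w for s, w in zip(state, out_table[word])) for word in VOCAB]
    return softmax(scores)


def generate(task_tokens, modality_features, dim=8, max_len=5):
    # Step 1: first set of tokens (task) is given; step 2: classifier yields the second set.
    modality_tokens = classify_modality(modality_features)

    # Step 3: two distinct sets of embedding vectors, both of dimension `dim`.
    task_table = make_embedding_table(task_tokens, dim, seed=0)
    modality_table = make_embedding_table(modality_tokens, dim, seed=1)
    vectors = [task_table[t] for t in task_tokens]
    vectors += [modality_table[t] for t in modality_tokens]

    # Step 4: encode, then greedily pick the most probable word at each decoder step.
    context = encode(vectors)
    out_table = make_embedding_table(VOCAB, dim, seed=2)
    words, prev = [], [0.0] * dim
    for _ in range(max_len):
        probs = decode_step(context, prev, out_table)
        best = max(range(len(VOCAB)), key=probs.__getitem__)
        if VOCAB[best] == "<eos>":
            break
        words.append(VOCAB[best])
        prev = out_table[VOCAB[best]]
    return words, modality_tokens
```

Calling `generate(["describe", "the", "image"], {"mean_intensity": 0.8})` returns the selected words together with the classifier-derived tokens. Because the embeddings are random rather than trained, the words carry no meaning; the sketch only shows how the two token sets, their separate embedding vectors, and the per-step probability distributions fit together.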