| CPC G06V 10/774 (2022.01) [G06N 3/0455 (2023.01); G06N 3/09 (2023.01); G06V 10/454 (2022.01); G06V 10/764 (2022.01); G06V 10/766 (2022.01); G06V 10/776 (2022.01); G06V 10/778 (2022.01); G06V 10/82 (2022.01); G06V 10/96 (2022.01); G06V 40/171 (2022.01); G06V 40/174 (2022.01)] | 20 Claims |

|
1. A computer system for multi-task joint training of a neural network including an encoder module and a multi-headed attention mechanism, the computer system comprising:
a processor coupled to a storage medium that stores instructions, which, upon execution by the processor, cause the processor to:
receive input data including a first set of labels and a second set of labels;
using the encoder module, extract features from the input data;
using a first task head of the multi-headed attention mechanism, compute a first training loss metric using the extracted features and the first set of labels;
using a second task head of the multi-headed attention mechanism, compute a second training loss metric using the extracted features and the second set of labels;
apply a first mask to filter the first training loss metric, wherein the first mask is computed based on the first set of labels;
apply a second mask to filter the second training loss metric, wherein the second mask is computed based on the second set of labels; and
compute a final training loss metric based on the filtered first training loss metric and the filtered second training loss metric.
|