CPC G06F 18/2431 (2023.01) [G06F 40/284 (2020.01); G06F 40/30 (2020.01); G06N 3/045 (2023.01); G06N 3/08 (2013.01); G06N 20/20 (2019.01); G06F 40/279 (2020.01); G06F 40/289 (2020.01); G06F 40/295 (2020.01); G10L 15/06 (2013.01); G10L 15/1807 (2013.01); G10L 15/1815 (2013.01); G10L 15/1822 (2013.01); G10L 25/30 (2013.01)] | 20 Claims |
1. A method comprising:
identifying a multimodal message comprising an image and a plurality of terms, the plurality of terms comprising an entity term and non-entity terms;
generating, using a convolutional neural network (CNN), an image representation from the image in the multimodal message;
generating, using a first recurrent neural network, a left representation from terms to the left of the entity term;
generating, using a second recurrent neural network, a target entity representation from the entity term;
generating, using a third recurrent neural network, a right representation from terms to the right of the entity term; and
generating a sentiment classification by combining the image representation, the left representation, the target entity representation, and the right representation.
|