US 11,756,567 B2
Autocreation of conversational image representation
Kimiko Wilson, South Melbourne (AU); Jorge Andres Moros Ortiz, Melbourne (AU); Suman Sedai, Hughesdale (AU); and Khoi-Nguyen Dao Tran, Southbank (AU)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Aug. 26, 2020, as Appl. No. 17/002,852.
Prior Publication US 2022/0068296 A1, Mar. 3, 2022
Int. Cl. G10L 21/10 (2013.01); G06T 13/20 (2011.01); G06Q 50/00 (2012.01); G06N 3/049 (2023.01); G06T 13/40 (2011.01)
CPC G10L 21/10 (2013.01) [G06N 3/049 (2013.01); G06Q 50/01 (2013.01); G06T 13/205 (2013.01); G06T 13/40 (2013.01)] 14 Claims
 
1. A computer-implemented method comprising:
detecting, by one or more computer processors, one or more utterances by a user, wherein utterances are either textual or acoustic;
identifying, by one or more computer processors, the user associated with the one or more detected utterances, further comprising:
retrieving, by one or more computer processors, one or more images associated with social media associated with the user;
creating, by one or more computer processors, a user model trained with the one or more retrieved images specific to the user; and
generating, by one or more computer processors, the avatar associated with the user utilizing the created user model, wherein the avatar is a realistic representation of the user covering a plurality of facial angles and expressions;
generating, by one or more computer processors, one or more image representations of the one or more detected utterances utilizing the generated avatar and a generative adversarial network restricted by one or more user privacy parameters, wherein the generative adversarial network is fed with an extracted sentiment, a generated avatar, an identified topic, an extracted location, and one or more user preferences; and
displaying, by one or more computer processors, the generated one or more image representations on one or more devices associated with one or more respective recipients of the one or more utterances.
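
The claim recites that the generative adversarial network is fed an extracted sentiment, a generated avatar, an identified topic, an extracted location, and user preferences, and that it is restricted by user privacy parameters. The following is a minimal sketch of how those inputs might be assembled and filtered before generation. It is not the patented implementation. Every name in it (`Conditioning`, `extract_sentiment`, `build_conditioning`, the `blocked` set, the keyword lexicon) is a hypothetical chosen for illustration, the sentiment step is a toy keyword lookup standing in for a real extractor, and the network itself is omitted.

```python
from dataclasses import dataclass, field, replace
from typing import Dict, FrozenSet, Optional


@dataclass(frozen=True)
class Conditioning:
    """The five inputs the claim lists as fed to the network (names are hypothetical)."""
    sentiment: str
    avatar_id: str
    topic: str
    location: Optional[str] = None
    preferences: Dict[str, str] = field(default_factory=dict)


# Toy lexicon standing in for a real sentiment extractor (assumption).
_POSITIVE = {"great", "love", "happy"}
_NEGATIVE = {"sad", "hate", "awful"}


def extract_sentiment(utterance: str) -> str:
    """Return 'positive', 'negative', or 'neutral'; positive wins on a tie."""
    words = {w.strip(".,!?").lower() for w in utterance.split()}
    if words & _POSITIVE:
        return "positive"
    if words & _NEGATIVE:
        return "negative"
    return "neutral"


def build_conditioning(
    utterance: str,
    avatar_id: str,
    topic: str,
    location: Optional[str],
    preferences: Dict[str, str],
    blocked: FrozenSet[str],
) -> Conditioning:
    """Assemble the network inputs, then apply the privacy restriction.

    `blocked` is a hypothetical encoding of the user privacy parameters:
    a set of input names the user does not allow to reach the generator.
    """
    cond = Conditioning(
        sentiment=extract_sentiment(utterance),
        avatar_id=avatar_id,
        topic=topic,
        location=location,
        preferences=dict(preferences),
    )
    # Drop the location entirely if the user has blocked it.
    if "location" in blocked:
        cond = replace(cond, location=None)
    # Drop any individual preference keys the user has blocked.
    allowed = {k: v for k, v in cond.preferences.items() if k not in blocked}
    return replace(cond, preferences=allowed)


example = build_conditioning(
    utterance="I love the beach!",
    avatar_id="avatar-123",
    topic="beach",
    location="Melbourne",
    preferences={"style": "cartoon", "background": "home"},
    blocked=frozenset({"location", "background"}),
)
```

In this sketch the privacy restriction is applied to the conditioning inputs before they reach the generator. That is only one possible reading of "restricted by one or more user privacy parameters"; the claim does not say where in the pipeline the restriction takes effect.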