CPC H04L 65/403 (2013.01) [G06T 13/40 (2013.01); G06T 17/20 (2013.01); G06T 19/20 (2013.01); G06V 10/56 (2022.01); G06V 40/172 (2022.01); G06V 40/176 (2022.01); G06V 40/19 (2022.01); H04L 51/10 (2013.01); H04L 65/1069 (2013.01); G06F 3/0482 (2013.01); G06T 2219/2016 (2013.01); G10L 15/26 (2013.01); H04N 7/157 (2013.01)] | 20 Claims |
1. A method comprising:
accessing image data at a client device responsive to an initiation of a communication session, the image data comprising facial tracking data;
detecting a trigger event based on the facial tracking data;
activating a microphone associated with the client device responsive to the trigger event;
capturing audio data at the client device via the microphone;
generating a transcription based on the audio data, the transcription comprising a text string; and
causing display of a presentation of the text string.
|