| CPC H04L 51/10 (2013.01) [G06F 16/44 (2019.01); G10L 15/22 (2013.01); H04N 7/15 (2013.01); H04N 21/4394 (2013.01); H04N 21/4788 (2013.01); G10L 15/005 (2013.01); G10L 15/16 (2013.01); G10L 2015/223 (2013.01); G10L 25/63 (2013.01)] | 20 Claims |

|
1. A computer-implemented method comprising:
receiving session video content during a video communication session between a first computing device and a second computing device;
detecting, in the session video content, a gesture performed by a user associated with the first computing device;
determining, with a trained machine-learning model and based on the gesture, that the user invoked a request for assistance, wherein the request comprises a request for media and wherein the trained machine-learning model outputs a confidence score associated with the determination;
in response to the confidence score meeting a threshold, outputting, by the trained machine-learning model, the media; and
sending a first command to at least one of the first computing device or the second computing device to display the media.
|
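The decision flow recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation, and every name in it (`Prediction`, `Command`, `handle_gesture`, `toy_model`, the gesture labels, and the media identifiers) is hypothetical. It assumes the gesture has already been detected in the session video and reduced to a label, that "meeting a threshold" means the confidence score is greater than or equal to the threshold, and that the trained model is any callable returning a request flag, a confidence score, and the media.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Prediction:
    """Hypothetical model output: request flag, confidence score, and media."""
    is_media_request: bool
    confidence: float
    media: Optional[str]


@dataclass
class Command:
    """Hypothetical command sent to a device taking part in the session."""
    target_device: str
    action: str
    media: str


def handle_gesture(
    gesture: str,
    model: Callable[[str], Prediction],
    threshold: float,
    target_device: str,
) -> Optional[Command]:
    """Return a display command only when the confidence score meets the threshold."""
    prediction = model(gesture)
    # No command unless the model decided the gesture is a request for media.
    if not prediction.is_media_request or prediction.media is None:
        return None
    # Assumption: "meeting a threshold" is read as confidence >= threshold.
    if prediction.confidence < threshold:
        return None
    return Command(target_device=target_device, action="display", media=prediction.media)


def toy_model(gesture: str) -> Prediction:
    """Stand-in for the trained machine-learning model (a fixed lookup table)."""
    table = {"thumbs_up": ("clip_42", 0.91), "wave": ("clip_7", 0.40)}
    if gesture not in table:
        return Prediction(is_media_request=False, confidence=0.0, media=None)
    media, confidence = table[gesture]
    return Prediction(is_media_request=True, confidence=confidence, media=media)


if __name__ == "__main__":
    print(handle_gesture("thumbs_up", toy_model, 0.8, "first_device"))
    print(handle_gesture("wave", toy_model, 0.8, "first_device"))
```

In this sketch the threshold check sits between the model's determination and the display command, so a low-confidence gesture produces no command for either device.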