| CPC G06F 3/167 (2013.01) [G10L 15/22 (2013.01); G10L 15/26 (2013.01); G10L 2015/223 (2013.01); G10L 2015/225 (2013.01)] | 87 Claims |

|
1. A method comprising:
at a computer system in communication with a display generation component and one or more input devices:
displaying, via the display generation component, a representation of a first user other than a user of the computer system;
while displaying the representation of the first user, detecting, via the one or more input devices, a first input that includes:
attention of the user of the computer system; and
speech input from the user while the attention of the user satisfies one or more first criteria;
in response to detecting the first input, in accordance with a determination that one or more second criteria are satisfied, displaying, via the display generation component, a text representation of the speech input in a message entry region associated with transmitting messages to the first user without providing a representation of the speech input to the first user;
while displaying the text representation of the speech input in the message entry region and after detecting an end of the speech input, determining that a first threshold amount of time has elapsed since detecting the end of the speech input; and
in response to determining that the first threshold amount of time has elapsed since detecting the end of the speech input:
in accordance with a determination that one or more third criteria are satisfied, including a first criterion that is satisfied when the attention of the user has been directed to the message entry region for a second threshold amount of time after detecting the end of the speech input, initiating transmission of a message including the text representation of the speech input to the first user.
|
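As an illustration only, the final determination step of claim 1 (whether to initiate transmission once the first threshold amount of time has elapsed) can be sketched as below. This is a minimal sketch under stated assumptions, not the claimed implementation: the names `DictationState` and `should_initiate_transmission`, the use of seconds as the time unit, and the modeling of attention as a single gaze-start timestamp are all hypothetical. The claim does not specify threshold values or how attention is measured.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DictationState:
    """Hypothetical snapshot of the dictation session (names are assumptions)."""
    text: str                          # text representation shown in the message entry region
    speech_end_time: float             # time (s) at which the end of the speech input was detected
    gaze_on_entry_since: Optional[float]  # time (s) attention began resting on the entry region,
                                          # or None if attention is currently elsewhere


def should_initiate_transmission(
    state: DictationState,
    now: float,
    first_threshold: float,
    second_threshold: float,
) -> bool:
    """Return True when the message would be sent under this sketch of claim 1."""
    # The check only applies once the first threshold has elapsed since speech ended.
    if now - state.speech_end_time < first_threshold:
        return False
    # First criterion: attention must be on the message entry region.
    if state.gaze_on_entry_since is None:
        return False
    # Only dwell time accumulated after the end of the speech input counts.
    dwell_start = max(state.gaze_on_entry_since, state.speech_end_time)
    return now - dwell_start >= second_threshold
```

In this sketch, attention that began before the speech ended is credited only from the end of the speech input onward, which is one reading of "for a second threshold amount of time after detecting the end of the speech input"; other readings are possible.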