| CPC G06F 3/017 (2013.01) [G06F 3/013 (2013.01); G06F 3/167 (2013.01); G06V 40/28 (2022.01)] | 21 Claims |

|
1. A method comprising:
at an electronic device in communication with a display and one or more input devices comprising at least one hand-tracking sensor:
displaying, via the display, a three-dimensional environment;
while displaying the three-dimensional environment, detecting, via the at least one hand-tracking sensor, a first input including a gesture by a hand; and
in response to the first input:
in accordance with a determination that the first input satisfies first criteria, the first criteria including a first criterion that is satisfied when the hand is oriented in a specified direction relative to the electronic device or within a threshold of the specified direction relative to the electronic device, when a dorsal aspect of the hand is facing the electronic device, and including a second criterion that is satisfied when a gaze is detected at or within a threshold distance of the hand or a representation of the hand in the three-dimensional environment, activating a digital assistant and displaying a visual representation of the digital assistant in the three-dimensional environment, wherein:
the digital assistant interprets natural language input and performs one or more actions based on the natural language input; and
in accordance with a determination that the digital assistant is activated:
obtaining audio data; and
changing the visual representation of the digital assistant based on the audio data while obtaining the audio data; and
in accordance with a determination that the first input fails to satisfy the first criteria, forgoing activating the digital assistant.
|
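
The decision logic recited in claim 1 can be illustrated with a minimal Python sketch. It is not taken from the patent and is not the claimed implementation: every name (`HandInput`, `DigitalAssistant`, `satisfies_first_criteria`, `handle_first_input`), the threshold values, the convention that the device sits at the coordinate origin, and the use of RMS audio level to scale the visual representation are assumptions chosen for illustration only.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple

Vec3 = Tuple[float, float, float]


def _dot(a: Vec3, b: Vec3) -> float:
    return sum(x * y for x, y in zip(a, b))


def _angle_deg(a: Vec3, b: Vec3) -> float:
    """Angle between two vectors in degrees (180 if either has zero length)."""
    na, nb = math.sqrt(_dot(a, a)), math.sqrt(_dot(b, b))
    if na == 0.0 or nb == 0.0:
        return 180.0
    cos = max(-1.0, min(1.0, _dot(a, b) / (na * nb)))
    return math.degrees(math.acos(cos))


@dataclass
class HandInput:
    """Hypothetical hand-tracking sample; the device is assumed to be at the origin."""
    pointing_direction: Vec3      # direction in which the hand is oriented
    dorsal_normal: Vec3           # outward normal of the back of the hand
    hand_position: Vec3           # hand location in device coordinates
    gaze_point: Optional[Vec3]    # where the gaze lands, or None if no gaze detected


class DigitalAssistant:
    """Hypothetical stand-in for the assistant and its visual representation."""

    def __init__(self) -> None:
        self.active = False
        self.visual_scale = 1.0

    def activate(self) -> None:
        self.active = True

    def on_audio(self, samples: Sequence[float]) -> None:
        # Change the visual representation based on the audio data (assumed: RMS level).
        if not self.active or not samples:
            return
        rms = math.sqrt(sum(s * s for s in samples) / len(samples))
        self.visual_scale = 1.0 + min(rms, 1.0)


def satisfies_first_criteria(
    inp: HandInput,
    specified_direction: Vec3 = (0.0, 1.0, 0.0),  # assumed value
    direction_threshold_deg: float = 20.0,        # assumed value
    gaze_threshold: float = 0.1,                  # assumed value, same units as positions
) -> bool:
    # First criterion: hand oriented within a threshold of the specified
    # direction while the dorsal aspect of the hand faces the device.
    oriented = _angle_deg(inp.pointing_direction, specified_direction) <= direction_threshold_deg
    hand_to_device = tuple(-c for c in inp.hand_position)
    dorsal_facing_device = _dot(inp.dorsal_normal, hand_to_device) > 0.0
    # Second criterion: gaze detected at or within a threshold distance of the hand.
    gaze_near_hand = (
        inp.gaze_point is not None
        and math.dist(inp.gaze_point, inp.hand_position) <= gaze_threshold
    )
    return oriented and dorsal_facing_device and gaze_near_hand


def handle_first_input(inp: HandInput, assistant: DigitalAssistant) -> bool:
    """Activate the assistant if the criteria are met; otherwise forgo activation."""
    if satisfies_first_criteria(inp):
        assistant.activate()
        return True
    return False
```

In this sketch, a caller would pass each detected gesture to `handle_first_input` and, while the assistant is active, feed incoming audio frames to `on_audio` so that the visual representation changes as audio is obtained. Treating the first criterion as a conjunction of orientation and dorsal-facing checks is one reading of the claim language, not a definitive construction of it.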