CPC G06F 3/167 (2013.01) [G06F 3/165 (2013.01); G06V 40/174 (2022.01); G10L 15/22 (2013.01); H04N 7/142 (2013.01); H04N 7/15 (2013.01)]
20 Claims
1. An electronic device comprising:
an image sensor;
a display;
an actuator;
at least one microphone;
a speaker; and
a processor interfacing with the image sensor, the display, the actuator, the at least one microphone, and the speaker, and configured to:
obtain a set of audio signals from the at least one microphone,
determine that the set of audio signals include speech spoken by a first person,
obtain video data from the image sensor,
determine, based on an analysis of the set of audio signals and the video data, that the first person is not within a field of view of the image sensor,
cause the actuator to rotate the display about an axis while a portion of the electronic device comprising the speaker does not rotate,
after causing the actuator to rotate the display about the axis, obtain second video data from the image sensor, and
determine, based on an analysis of the second video data, that the first person is within the field of view of the image sensor.
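
The sequence recited in claim 1 can be illustrated with a minimal Python sketch. It is a simulation under stated assumptions, not the claimed implementation: the claim does not say how the audio or video analysis is performed, so the sketch assumes the audio analysis yields an estimated bearing to the talker and the video analysis reduces to a simple visibility check. Every name here (`SimulatedDevice`, `rotate_display`, `face_visible`, `track_speaker`, the 60-degree field of view) is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SimulatedDevice:
    """Hypothetical stand-in for the claimed hardware (not from the claim text)."""
    display_heading_deg: float = 0.0  # direction the display and image sensor face
    speaker_heading_deg: float = 0.0  # portion holding the speaker; never rotated
    fov_deg: float = 60.0             # assumed horizontal field of view

    def rotate_display(self, delta_deg: float) -> None:
        # The actuator turns only the display; the speaker portion stays put.
        self.display_heading_deg = (self.display_heading_deg + delta_deg) % 360.0


def angular_offset(target_deg: float, reference_deg: float) -> float:
    """Signed smallest angle from reference to target, in [-180, 180)."""
    return (target_deg - reference_deg + 180.0) % 360.0 - 180.0


def face_visible(device: SimulatedDevice, person_bearing_deg: float) -> bool:
    """Simulated video analysis: is the person inside the sensor's field of view?"""
    offset = angular_offset(person_bearing_deg, device.display_heading_deg)
    return abs(offset) <= device.fov_deg / 2.0


def track_speaker(device: SimulatedDevice, speech_detected: bool,
                  audio_bearing_deg: float, person_bearing_deg: float) -> bool:
    """Follow the claimed order of steps; return True if the person ends up in view."""
    if not speech_detected:
        return False  # no speech by a first person, nothing to track
    # First video data: check whether the talker is already in view.
    if face_visible(device, person_bearing_deg):
        return True
    # Not in view: rotate the display toward the bearing assumed from the audio.
    device.rotate_display(angular_offset(audio_bearing_deg, device.display_heading_deg))
    # Second video data: confirm the talker is now in view.
    return face_visible(device, person_bearing_deg)


device = SimulatedDevice()
found = track_speaker(device, speech_detected=True,
                      audio_bearing_deg=120.0, person_bearing_deg=120.0)
print(found, device.display_heading_deg, device.speaker_heading_deg)
```

In this sketch only `display_heading_deg` changes during rotation, mirroring the limitation that the portion of the device comprising the speaker does not rotate. A real device would replace the bearing estimate and the visibility check with actual audio and video analysis.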