| CPC B60R 16/0373 (2013.01) [B60R 25/01 (2013.01); B60R 25/257 (2013.01); B60R 25/305 (2013.01); G05B 13/027 (2013.01); G06F 21/32 (2013.01); G06N 3/08 (2013.01); G06V 10/25 (2022.01); G06V 20/59 (2022.01); G10L 17/00 (2013.01); G10L 17/06 (2013.01); G10L 17/18 (2013.01)] | 20 Claims |

|
1. A method comprising:
receiving audio data generated using one or more microphones of a machine, the audio data representative of user speech from a first occupant of the machine;
processing the audio data to determine that the user speech from the first occupant indicates an identifier of a second occupant of the machine and an operation associated with a component of the machine, the second occupant being different than the first occupant;
receiving image data generated using one or more image sensors of the machine, the image data representative of an image depicting at least a portion of an interior of the machine;
based at least on the user speech from the first occupant indicating the identifier of the second occupant, determining, using at least the image data, that the image depicts the second occupant within a region of the machine that is associated with the component; and
causing the operation to be performed with respect to the component.
|