US 11,946,762 B2
Interactive voice navigation
Victor Carbune, Zurich (CH); Matthew Sharifi, Kilchberg (CH); and Blaise Aguera-Arcas, Seattle, WA (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Appl. No. 17/251,244
Filed by Google LLC, Mountain View, CA (US)
PCT Filed Aug. 12, 2020, PCT No. PCT/US2020/045909
§ 371(c)(1), (2) Date Dec. 11, 2020,
PCT Pub. No. WO2022/035428, PCT Pub. Date Feb. 17, 2022.
Prior Publication US 2023/0160710 A1, May 25, 2023
Int. Cl. G01C 21/36 (2006.01); G01C 21/34 (2006.01); G06F 3/16 (2006.01); G10L 15/22 (2006.01); G10L 15/30 (2013.01)
CPC G01C 21/3608 (2013.01) [G01C 21/3415 (2013.01); G06F 3/165 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01); G10L 2015/225 (2013.01); G10L 2015/228 (2013.01); G10L 15/30 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A computer-implemented method for interactive voice navigation, the method comprising:
providing, by a computing system including one or more processors, audio information including one or more navigation instructions to a user;
determining, by the computing system, a navigation difficulty value associated with the one or more navigation instructions based on one or more navigation difficulty factors;
determining, by the computing system, whether the navigation difficulty value exceeds a predetermined threshold;
in accordance with a determination that the navigation difficulty value exceeds a predetermined threshold, performing, by the computing system, a mitigation action;
activating, by the computing system, an audio sensor associated with the computing system;
collecting, by the computing system using the audio sensor, audio data associated with the user;
determining, by the computing system based on the audio data, whether the audio data is associated with the one or more navigation instructions;
in accordance with a determination that the audio data is associated with the one or more navigation instructions, determining, by the computing system, a context-appropriate audio response; and
providing, by the computing system, the context-appropriate audio response to the user.