US 12,334,049 B2
	Unstructured description-based chatbot development techniques
Asaf Aharoni, Ramat Hasharon (IL); Eyal Segalis, Tel Aviv (IL); Sasha Goldshtein, Tel Aviv (IL); Ofer Ron, Givatayim (IL); Yaniv Leviathan, New York, NY (US); and Yoav Tzur, Tel Aviv (IL)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Filed by GOOGLE LLC, Mountain View, CA (US)
Filed on Dec. 5, 2022, as Appl. No. 18/074,799.
Prior Publication US 2024/0185834 A1, Jun. 6, 2024
Int. Cl. G10L 13/08 (2013.01); G10L 15/22 (2006.01)

CPC G10L 13/08 (2013.01) [G10L 15/22 (2013.01); G10L 2015/221 (2013.01); G10L 2015/223 (2013.01)]

20 Claims

1. A method implemented by one or more processors of a client device, the method comprising:

receiving, at the client device, unstructured free-form natural language input from a user of the client device, the unstructured free-form natural language input including one or more tasks associated with an entity;

in response to receiving the unstructured free-form natural language input that includes the one or more tasks associated with the entity:

generating, based on at least the unstructured free-form natural language input and locally at the client device, a chatbot to perform the one or more tasks associated with the entity and on behalf of the user; and

causing the chatbot to perform the one or more tasks associated with the entity and on behalf of the user, wherein causing the chatbot to perform the one or more tasks associated with the entity and on behalf of the user comprises:

causing the chatbot to engage in a corresponding conversation with the entity;

during the corresponding conversation with the entity:

causing the chatbot to render a plurality of instances of synthesized speech audio data for presentation to a representative of the entity, wherein at least a given instance of synthesized speech audio data, from among the plurality of instances of synthesized speech audio data, conveys details of the one or more tasks associated with the entity; and

receiving responsive content that is responsive to at least the given instance of synthesized speech audio data; and

causing the responsive content to be provided for presentation to the user of the client device.