US 12,431,138 B2
	Arranging and/or clearing speech-to-text content without a user providing express instructions
Victor Carbune, Zurich (CH); Krishna Sapkota, Zurich (CH); Behshad Behzadi, Freienbach (CH); Julia Proskurnia, Zurich (CH); Jacopo Sannazzaro Natta, Berkeley, CA (US); Justin Lu, Zurich (CH); Magali Boizot-Roche, Zurich (CH); Marius Sajgalik, Zurich (CH); Nicolo D'Ercole, Oberrieden (CH); Zaheed Sabur, Baar (CH); and Luv Kothari, Sunnyvale, CA (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Filed by GOOGLE LLC, Mountain View, CA (US)
Filed on May 29, 2024, as Appl. No. 18/677,629.
Application 18/677,629 is a continuation of application No. 17/337,804, filed on Jun. 3, 2021, granted, now 12,033,637.
Claims priority of provisional application 63/189,386, filed on May 17, 2021.
Prior Publication US 2024/0321277 A1, Sep. 26, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 15/22 (2006.01); G10L 15/26 (2006.01)

CPC G10L 15/26 (2013.01) [G10L 15/22 (2013.01); G10L 2015/223 (2013.01)]

17 Claims

1. A method implemented by one or more processors, the method comprising:

receiving, at a computing device, a spoken utterance that is directed to a first application from a user,

wherein the spoken utterance corresponds to a request for the first application to perform a speech-to-text operation for incorporating text into a field of a second application;

generating, based on the spoken utterance, textual content data that characterizes textual content to be incorporated into the field of the second application,

wherein the second application is different from the first application;

generating, based on a type of application of the second application, content arrangement data that characterizes an arrangement, within the field of the second application, of a first portion of the textual content relative to a second portion of the textual content,

wherein the content arrangement data that characterizes the arrangement, within the field of the second application, of the first portion of the textual content relative to the second portion of the textual content and that is generated based on the type of the application differs based on different types of applications corresponding to the second application; and

causing, based on the textual content data and the content arrangement data, the textual content to be incorporated into a field of the second application according to the arrangement, in response to the spoken utterance.