| CPC G10L 15/26 (2013.01) [G10L 15/22 (2013.01); G10L 2015/223 (2013.01)] | 17 Claims | 

| 
               1. A method implemented by one or more processors, the method comprising: 
            receiving, at a computing device, a spoken utterance that is directed to a first application from a user, 
                wherein the spoken utterance corresponds to a request for the first application to perform a speech-to-text operation for incorporating text into a field of a second application; 
            generating, based on the spoken utterance, textual content data that characterizes textual content to be incorporated into the field of the second application, 
                wherein the second application is different from the first application; 
            generating, based on a type of application of the second application, content arrangement data that characterizes an arrangement, within the field of the second application, of a first portion of the textual content relative to a second portion of the textual content, 
                wherein the content arrangement data that characterizes the arrangement, within the field of the second application, of the first portion of the textual content relative to the second portion of the textual content and that is generated based on the type of the application differs based on different types of applications corresponding to the second application; and 
            causing, based on the textual content data and the content arrangement data, the textual content to be incorporated into a field of the second application according to the arrangement, in response to the spoken utterance. 
               |
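
For illustration only, the steps recited in claim 1 can be sketched in Python. This is a minimal sketch under stated assumptions, not the patented implementation: the utterance is assumed to be already transcribed, and the application types ("messaging", "email", "notes"), the separators, the sentence-based split into portions, and every function and class name below are hypothetical and do not come from the claim.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical mapping from the second application's type to the way one
# portion of text is placed relative to the next. The claim only requires
# that the arrangement differ by application type; these values are assumed.
SEPARATOR_BY_APP_TYPE: Dict[str, str] = {
    "messaging": " ",   # portions run together on one line
    "email": "\n\n",    # portions become separate paragraphs
    "notes": "\n",      # portions become separate lines
}


@dataclass
class ContentArrangement:
    """Content arrangement data: how portion N sits relative to portion N+1."""
    separator: str


def generate_textual_content(transcript: str) -> List[str]:
    """Split an already-transcribed utterance into sentence-like portions."""
    marked = transcript
    for end_mark in (".", "?", "!"):
        marked = marked.replace(end_mark, end_mark + "|")
    return [part.strip() for part in marked.split("|") if part.strip()]


def generate_arrangement(app_type: str) -> ContentArrangement:
    """Choose an arrangement from the type of the second application."""
    # Unknown types fall back to a single space (an assumption of this sketch).
    return ContentArrangement(SEPARATOR_BY_APP_TYPE.get(app_type, " "))


def incorporate(field: Dict[str, str], portions: List[str],
                arrangement: ContentArrangement) -> Dict[str, str]:
    """Write the arranged text into the (hypothetical) field of the second app."""
    field["text"] = arrangement.separator.join(portions)
    return field


def handle_dictation(transcript: str, app_type: str,
                     field: Dict[str, str]) -> Dict[str, str]:
    """Run the three steps in order: content, arrangement, incorporation."""
    portions = generate_textual_content(transcript)
    arrangement = generate_arrangement(app_type)
    return incorporate(field, portions, arrangement)
```

In this sketch the same transcript, for example "Hi Sam. See you at noon.", is written as two paragraphs when `app_type` is "email" and as a single line when it is "messaging", which mirrors the requirement that the arrangement differ with the type of the second application.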