CPC G10L 15/183 (2013.01) [G06F 16/5846 (2019.01); G06V 10/778 (2022.01); G06V 30/1456 (2022.01); G06V 30/153 (2022.01); G10L 15/22 (2013.01); G10L 15/30 (2013.01)] | 18 Claims |
13. A computer-implemented method for responding to queries about an image, the method comprising:
obtaining, by a computing system with one or more processors, an image, wherein the image depicts a first set of textual content;
determining, by the computing system, one or more characteristics of the first set of textual content, wherein the one or more characteristics of the first set of textual content includes a density of the first set of textual content;
determining, by the computing system, a response type from a plurality of response types based on the one or more characteristics, wherein the plurality of response types includes a summarization response, an explanation response, and a query response, wherein the determined response type is a summarization response, and wherein determining a response type from a plurality of response types based on the one or more characteristics further comprise:
determining the density for the first set of textual content within the image;
responsive to a determination that the density for the first set of textual content within the image satisfies a threshold, determining that the response type is a summarization response type; and
updating a user interface to include a summarize user interface element;
generating, by the computing system, a model input, wherein the model input comprises data descriptive of the first set of textual content and a prompt associated with the response type;
providing, by the computing system, the model input as an input to a machine-learned language model;
receiving, by the computing system, a second set of text as an output of the machine-learned language model as a result of the machine-learned language model processing the model input; and
providing, by the computing system, the second set of text for display to a user, wherein the second set of textual content is associated with the response type.
|