| CPC G10L 13/08 (2013.01) [G06F 3/0485 (2013.01); G10L 13/02 (2013.01)] | 20 Claims |

|
1. A computer-implemented method to perform audio playback of displayed textual content, the method comprising:
obtaining, by a computing system comprising one or more processors, data descriptive of a content item to provide for display, wherein the content item comprises a plurality of portions of textual content;
determining, by the computing system, positional data that indicates respective positions of one or more of the portions of textual content provided for display;
receiving, by the computing system, data indicative of a user input that adjusts content provided for display via a navigational scroll of the content item;
responsive to receiving the data indicative of the user input, determining, by the computing system, updated positional data that indicates respective updated positions of the one or more of the portions of textual content;
identifying, by the computing system and based at least in part on the updated positional data, that a first portion of textual content is positioned within a playback area of the display;
generating, by the computing system, a textual content card based on the first portion of textual content, wherein the textual content card comprises text associated with the first portion of textual content;
providing, by the computing system, the textual content card for display; and
providing, by the computing system, automatic playback of an audio signal that includes speech of at least a portion of the first portion of textual content, wherein the speech of at least a portion of the first portion of textual content is determined at least in part using a trained machine learned model.
|
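
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation; it only shows one way the position update, playback-area check, card generation, and playback hand-off might fit together. All names here (`Portion`, `PlaybackArea`, `on_scroll`, `synthesize`) are hypothetical, positions are assumed to be vertical pixel offsets, and "positioned within a playback area" is assumed to mean that the portion lies wholly inside the area. The trained machine-learned model that determines the speech is represented only by a caller-supplied `synthesize` function.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Portion:
    """One portion of textual content (hypothetical structure)."""
    text: str
    top: float     # offset from the start of the content item, in pixels
    height: float


@dataclass
class PlaybackArea:
    """Region of the display, in viewport coordinates (assumed vertical only)."""
    top: float
    bottom: float


def updated_positions(
    portions: List[Portion], scroll_offset: float
) -> List[Tuple[Portion, float, float]]:
    """Recompute each portion's viewport position after a navigational scroll."""
    return [
        (p, p.top - scroll_offset, p.top + p.height - scroll_offset)
        for p in portions
    ]


def find_portion_in_area(
    positions: List[Tuple[Portion, float, float]], area: PlaybackArea
) -> Optional[Portion]:
    """Return the first portion lying wholly inside the playback area, if any."""
    for portion, top, bottom in positions:
        if top >= area.top and bottom <= area.bottom:
            return portion
    return None


def make_card(portion: Portion) -> dict:
    """Build a textual content card carrying the portion's text."""
    return {"text": portion.text}


def on_scroll(
    portions: List[Portion],
    scroll_offset: float,
    area: PlaybackArea,
    synthesize: Callable[[str], bytes],
) -> Optional[Tuple[dict, bytes]]:
    """Handle a scroll input: update positions, pick a portion, build the
    card, and produce the audio signal to be played back automatically."""
    positions = updated_positions(portions, scroll_offset)
    portion = find_portion_in_area(positions, area)
    if portion is None:
        return None
    card = make_card(portion)
    # Stand-in for speech determined by a trained machine-learned model.
    audio = synthesize(portion.text)
    return card, audio
```

In this sketch a caller would invoke `on_scroll` each time scroll input arrives, display the returned card, and pass the returned audio to a player. Returning `None` when no portion qualifies is a design choice of the sketch, not something the claim specifies.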