US 12,277,925 B2
	Automatic audio playback of displayed textual content
Rachel Ilan Simpson, London (GB); Benedict Davies, London (GB); and Guillaume Boniface-Chang, London (GB)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Dec. 11, 2023, as Appl. No. 18/535,279.
Application 18/535,279 is a continuation of application No. 17/052,046, granted, now 11,887,581, previously published as PCT/US2019/061401, filed on Nov. 14, 2019.
Prior Publication US 2024/0127792 A1, Apr. 18, 2024
Int. Cl. G06F 3/0485 (2022.01); G10L 13/02 (2013.01); G10L 13/08 (2013.01)

CPC G10L 13/08 (2013.01) [G06F 3/0485 (2013.01); G10L 13/02 (2013.01)]

20 Claims

1. A computer-implemented method to perform audio playback of displayed textual content, the method comprising:

obtaining, by a computing system comprising one or more processors, data descriptive of a content item to provide for display, wherein the content item comprises a plurality of portions of textual content;

determining, by the computing system, positional data that indicates respective positions of one or more of the portions of textual content provided for display;

receiving, by the computing system, data indicative of a user input that adjusts content provided for display via a navigational scroll of the content item; and

responsive to receiving the data indicative of the user input, determining, by the computing system, updated positional data that indicates respective updated positions of the one or more of the portions of textual content;

identifying, by the computing system and based at least in part on the updated positional data, that a first portion of textual content is positioned within a playback area of the display;

generating, by the computing system, a textual content card based on the first portion of textual content, wherein the textual content card comprises text associated with the first portion of textual content;

providing, by the computing system, the textual content card for display; and

providing, by the computing system, automatic playback of an audio signal that includes speech of at least a portion of the first portion of textual content, wherein the speech of at least a portion of the first portion of textual content is determined at least in part using a trained machine learned model.