| CPC G10L 13/10 (2013.01) [G10L 13/0335 (2013.01); G10L 2013/105 (2013.01)] | 49 Claims |

|
1. A first electronic device, comprising:
a display;
one or more processors;
a memory; and
one or more programs, wherein the one or more programs are stored in the memory and are configured to be executed by the one or more processors,
the one or more programs including instructions for:
receiving, at the first electronic device, a user input requesting an audible output of an electronic document including text, wherein the user input is the first audio input;
invoking a digital assistant;
determining, based on the digital assistant, to provide the audible output of the electronic document;
in accordance with a determination to provide the audible output of the electronic document:
generating a media item based on the text of the electronic document; and
after generating the media item, audibly outputting the media item, wherein the audible output of the media item is based on a semantic structure of the electronic document;
while audibly outputting the media item, displaying the electronic document and a graphical representation of the media item concurrently;
receiving a second user input, wherein the second user input is an audio input;
in accordance with a determination that the second user input is associated with an intent to modify the audible output of the media item:
modifying, based on the second user input and the semantic structure of the electronic document, the audible output of the media item;
while audibly outputting the media item, receiving a user input corresponding to a request to cease display of the electronic document; and
in response to receiving the user input corresponding to the request to cease display of the electronic document:
ceasing to display the electronic document while continuing to audibly output the media item; and
continuing to display the graphical representation of the media item.
|