US 11,671,696 B2
	User interfaces for managing visual content in media
Grant Paul, San Francisco, CA (US); Francisco Alvaro Munoz, Cupertino, CA (US); Jeffrey A. Brasket, Honolulu, HI (US); Brandon J. Corey, Palo Alto, CA (US); Thomas Deselaers, Cupertino, CA (US); Nathan De Vries, San Francisco, CA (US); Ryan S. Dixon, Mountain View, CA (US); Craig M. Federighi, Los Altos Hills, CA (US); Vignesh Jagadeesh, Cupertino, CA (US); James N. Jones, San Francisco, CA (US); Nicholas Lupinetti, Oakland, CA (US); Behkish J. Manzari, San Francisco, CA (US); Vinay Sharma, Cupertino, CA (US); Xin Wang, San Jose, CA (US); and Marco Zuliani, San Jose, CA (US)
Assigned to Apple Inc., Cupertino, CA (US)
Filed by Apple Inc., Cupertino, CA (US)
Filed on Sep. 24, 2021, as Appl. No. 17/484,844.
Claims priority of provisional application 63/197,497, filed on Jun. 6, 2021.
Claims priority of provisional application 63/176,847, filed on Apr. 19, 2021.
Prior Publication US 2022/0337741 A1, Oct. 20, 2022
Int. Cl. H04N 5/232 (2006.01); G06V 30/14 (2022.01); G06V 30/148 (2022.01); G06F 40/58 (2020.01); G06V 20/62 (2022.01); G06F 3/0481 (2022.01); G06F 3/04842 (2022.01); G06F 3/04845 (2022.01); G06F 3/0484 (2022.01); G06F 3/04886 (2022.01)

CPC H04N 5/23218 (2018.08) [G06F 3/0481 (2013.01); G06F 3/0484 (2013.01); G06F 3/04842 (2013.01); G06F 3/04845 (2013.01); G06F 3/04886 (2013.01); G06F 40/58 (2020.01); G06V 20/63 (2022.01); G06V 30/1456 (2022.01); G06V 30/153 (2022.01); H04N 5/23216 (2013.01); H04N 5/232933 (2018.08); H04N 5/232935 (2018.08); H04N 5/232939 (2018.08); G06F 2203/04806 (2013.01)]

54 Claims

1. A computer system configured to communicate with a display generation component, comprising:

one or more processors; and

memory storing one or more programs configured to be executed by the one or more processors, the one or more programs including instructions for:

displaying, via the display generation component, a camera user interface that includes concurrently displaying a representation of media and a media capture affordance;

while concurrently displaying the representation of media and the media capture affordance:

in accordance with a determination that a respective set of criteria is satisfied, wherein the respective set of criteria includes a criterion that is satisfied when respective text is detected in the representation of media, displaying, via the display generation component, a first user interface object corresponding to one or more text management operations; and

in accordance with a determination that the respective set of criteria is not satisfied, forgoing displaying the first user interface object;

while displaying the representation of media, detecting a first input directed to the camera user interface; and

in response to detecting the first input directed to the camera user interface:

in accordance with a determination that the first input corresponds to selection of the media capture affordance, initiating capture of media to be added to a media library associated with the computer system; and

in accordance with a determination that the first input corresponds to selection of the first user interface object, displaying, via the display generation component, a plurality of options to manage the respective text, wherein after detecting the first input and in accordance with a determination that the first input corresponds to selection of the first user interface object, the representation of media includes the respective text and an indication that a first one or more portions of the respective text is selected, wherein the first one or more portions of the respective text is displayed at a first position in the camera user interface;

while displaying the first one or more portions of the respective text at the first position in the camera user interface, detecting a change in a physical environment that is within a field of view of one or more cameras in communication with the computer system; and

in response to detecting the change in the physical environment that is within the field of view of the one or more cameras, continuing to display the first one or more portions of the respective text at the first position in the camera user interface.