US 12,425,716 B2
Content capture with audio input feedback
Newar Husam Al Majid, New York, NY (US); Laurent Desserrey, Los Angeles, CA (US); Christie Marie Heikkinen, Santa Monica, CA (US); Nathaniel Parrott, Brooklyn, NY (US); and Jeremy Voss, Los Angeles, CA (US)
Assigned to SNAP INC., Santa Monica, CA (US)
Filed by Snap Inc., Santa Monica, CA (US)
Filed on Jun. 23, 2020, as Appl. No. 16/909,850.
Claims priority of provisional application 62/868,191, filed on Jun. 28, 2019.
Prior Publication US 2020/0412975 A1, Dec. 31, 2020
Int. Cl. H04N 23/63 (2023.01); G06F 3/0484 (2022.01); G06F 3/0487 (2013.01); G06F 3/04883 (2022.01); G06F 9/54 (2006.01); G06N 20/00 (2019.01); G10L 25/60 (2013.01)
CPC H04N 23/631 (2023.01) [G06F 3/0484 (2013.01); G06F 3/0487 (2013.01); G06F 3/04883 (2013.01); G06F 9/542 (2013.01); G06N 20/00 (2019.01); G10L 25/60 (2013.01)]
20 Claims
 
1. A computing device comprising:
a touchscreen;
a microphone;
one or more processors;
memory storing computer-readable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
causing display, on the touchscreen, a target interface that includes one or more user interface elements that are selectable to cause a modular camera interface to be displayed, the one or more user interface elements including a user interface element corresponding to a contact of a user of the computing device;
responsive to selection of the user interface element, causing display, on the touchscreen, of the modular camera interface comprising a content capture element and comprising one or more context-based display elements that correspond to the target interface, the one or more context-based display elements including a context-based display element corresponding to the contact of the user;
initiating capture of a first piece of content in response to a capture input touch action received at the content capture element, the capture input touch action comprising a pressure action;
processing first data from the microphone during the pressure action to identify a first audio stream associated with the first piece of content;
determining a first audio environment corresponding to the first audio stream;
responsive to determining that the first piece of content is included in a content collection that is accessible by a broad audience, analyzing first audio data corresponding to a plurality of time segments in the first audio stream in accordance with a first set of metrics to determine a first audio quality score from the first audio stream;
causing display of a first audio quality feedback indicator on the touchscreen during the pressure action, wherein the first audio quality feedback indicator is associated with the capture of the first piece of content and is based on the first audio quality score;
processing second data from the microphone to identify a second audio stream, wherein the second data corresponds to a second piece of content of an ephemeral chat interface, the second piece of content including video content that is deleted after a single viewing and is less than 10 seconds in length;
determining a second audio environment corresponding to the second audio stream, the second audio environment being different from the first audio environment;
responsive to determining that the second piece of content corresponds to the ephemeral chat interface, analyzing second audio data in the second audio stream in accordance with a second set of metrics to determine a second audio quality score from the second audio stream, the second set of metrics being different from the first set of metrics; and
causing display of a second audio quality feedback indicator on the touchscreen.
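The scoring steps recited in claim 1 (choosing a set of metrics based on where the content is destined, analyzing time segments of the audio stream, and deriving a feedback indicator from the resulting score) can be illustrated with a minimal sketch. This is not the patented implementation. The claim does not name specific metrics, so the two metrics here (`rms_level`, `clipping_headroom`), the context keys, the segment length, and the indicator thresholds are all hypothetical choices made for illustration only.

```python
import math
from typing import Callable, Dict, List, Sequence

# A metric maps one time segment of samples (floats in [-1, 1]) to a value in [0, 1].
Metric = Callable[[Sequence[float]], float]


def rms_level(segment: Sequence[float]) -> float:
    """Root-mean-square level of the segment (hypothetical loudness metric)."""
    if not segment:
        return 0.0
    return math.sqrt(sum(s * s for s in segment) / len(segment))


def clipping_headroom(segment: Sequence[float], threshold: float = 0.99) -> float:
    """Fraction of samples below the clipping threshold (hypothetical metric)."""
    if not segment:
        return 1.0
    clipped = sum(1 for s in segment if abs(s) >= threshold)
    return 1.0 - clipped / len(segment)


# Assumption: each destination context uses a different set of metrics,
# mirroring the "first set" and "second set" of metrics in the claim.
METRIC_SETS: Dict[str, List[Metric]] = {
    "broad_audience_collection": [rms_level, clipping_headroom],
    "ephemeral_chat": [rms_level],
}


def split_segments(samples: Sequence[float], segment_len: int) -> List[Sequence[float]]:
    """Split the stream into consecutive time segments of at most segment_len samples."""
    return [samples[i:i + segment_len] for i in range(0, len(samples), segment_len)]


def audio_quality_score(samples: Sequence[float], context: str, segment_len: int = 4) -> float:
    """Average the context's metrics over every time segment of the stream."""
    metrics = METRIC_SETS[context]
    segments = split_segments(samples, segment_len)
    if not segments:
        return 0.0
    per_segment = [sum(m(seg) for m in metrics) / len(metrics) for seg in segments]
    return sum(per_segment) / len(per_segment)


def feedback_indicator(score: float) -> str:
    """Map a score to a coarse indicator label (thresholds are arbitrary)."""
    if score >= 0.6:
        return "good"
    if score >= 0.3:
        return "fair"
    return "poor"


if __name__ == "__main__":
    stream = [0.5, -0.5] * 4  # eight synthetic samples
    for ctx in METRIC_SETS:
        score = audio_quality_score(stream, ctx)
        print(ctx, round(score, 3), feedback_indicator(score))
```

In this sketch the same stream can receive different scores depending on the context, which is the point of using distinct sets of metrics. A real system would presumably compute the score incrementally while the pressure action is ongoing, so that the indicator can be updated on the touchscreen during capture.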