US 11,790,611 B2
Visual editor for designing augmented-reality effects that utilize voice recognition
Stef Marc Smet, London (GB); Hannes Luc Herman Verlinde, Ruislip (GB); Michael Slater, Nottingham (GB); Benjamin Patrick Blackburne, Baldock (GB); Ram Kumar Hariharan, Kirkland, WA (US); Chunjie Jia, Bothell, WA (US); and Prakarn Nisarat, Seattle, WA (US)
Assigned to Meta Platforms, Inc., Menlo Park, CA (US)
Filed by Meta Platforms, Inc., Menlo Park, CA (US)
Filed on Dec. 30, 2020, as Appl. No. 17/138,096.
Prior Publication US 2022/0207833 A1, Jun. 30, 2022
Int. Cl. G06T 19/00 (2011.01); G06F 3/0481 (2022.01); G10L 15/22 (2006.01); G06F 3/04815 (2022.01)

CPC G06T 19/006 (2013.01) [G06F 3/04815 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01)]

18 Claims

1. A computer-implemented method, comprising, by an artificial-reality (AR) design tool:

receiving, through a user interface (UI) of the AR design tool, a selection of a configurable interface element to place the AR design tool and the UI into a configure phase to configure an AR effect;

receiving, through the UI of the AR design tool after the AR design tool and the UI are placed into the configure phase in response to selecting the configurable interface element, instructions to add a voice-command module to the AR effect, the voice-command module having an intent type and at least one slot, the slot associated with one or more entities;

establishing, according to instructions received through the UI, a logical connection between the slot and a logic module configured to generate the AR effect depending on a runtime value associated with the slot; and

generating, for the AR effect, an executable program configured to:

detect an utterance at runtime;

determine that the detected utterance at runtime corresponds to the intent type and includes one or more words associated with the slot;

select, based on the one or more words, one of the one or more entities as the runtime value for the slot;

send the runtime value to the logic module according to the logical connection; and

after detecting the utterance at runtime, generate the AR effect based on the runtime value, received over the logical connection, that is selected based on the one or more words that are included by the detected utterance at runtime.
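
The flow recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation, only one possible reading under stated assumptions: the utterance arrives as already-transcribed text (speech recognition is out of scope), intent matching is a simple trigger-word check, and every name here (Slot, VoiceCommandModule, TintLogic, trigger_words, the color entities) is hypothetical and does not come from the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Slot:
    """A slot associated with one or more entities (hypothetical structure)."""
    name: str
    # Assumption: each entity maps to the words that may refer to it.
    entities: Dict[str, List[str]]
    # Logic modules connected to this slot at design time.
    listeners: List[Callable[[str], None]] = field(default_factory=list)

    def connect(self, logic_module: Callable[[str], None]) -> None:
        """Establish the logical connection between the slot and a logic module."""
        self.listeners.append(logic_module)

    def resolve(self, words: List[str]) -> Optional[str]:
        """Select one entity as the runtime value, based on the utterance words."""
        for entity, synonyms in self.entities.items():
            if any(word in synonyms for word in words):
                return entity
        return None


@dataclass
class VoiceCommandModule:
    """A voice-command module with an intent type and one slot."""
    intent_type: str
    # Assumption: an utterance matches the intent if it contains a trigger word.
    trigger_words: List[str]
    slot: Slot

    def handle(self, utterance: str) -> Optional[str]:
        """Process a transcribed utterance; return the runtime value or None."""
        words = utterance.lower().split()
        if not any(word in self.trigger_words for word in words):
            return None  # utterance does not correspond to the intent type
        runtime_value = self.slot.resolve(words)
        if runtime_value is None:
            return None  # no words associated with the slot
        # Send the runtime value over each logical connection.
        for logic_module in self.slot.listeners:
            logic_module(runtime_value)
        return runtime_value


class TintLogic:
    """Stand-in logic module: records the effect it would generate."""

    def __init__(self) -> None:
        self.rendered: List[str] = []

    def __call__(self, runtime_value: str) -> None:
        self.rendered.append(f"tint:{runtime_value}")


# Design time: add the module, then connect its slot to a logic module.
color_slot = Slot("color", {"red": ["red", "crimson"], "blue": ["blue", "navy"]})
module = VoiceCommandModule("change_color", ["make", "turn"], color_slot)
logic = TintLogic()
color_slot.connect(logic)

# Runtime: a detected utterance drives the effect through the connection.
selected = module.handle("make it crimson")
```

In this sketch the word "crimson" selects the entity "red" as the runtime value, and the connected logic module records "tint:red" as a placeholder for generating the AR effect. An utterance with no trigger word, or with no word tied to the slot, produces no effect.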