US 12,083,690 B2
	Systems and methods for authoring and modifying presentation conversation files for multimodal interactive computing devices/artificial companions
Mario E Munich, La Canada, CA (US); Stefan Scherer, Santa Monica, CA (US); Paolo Pirjanian, Glendale, CA (US); and Craig Allen, Seattle, WA (US)
Assigned to Embodied, Inc., Pasadena, CA (US)
Appl. No. 17/621,224
Filed by Embodied, Inc., Pasadena, CA (US)
PCT Filed Feb. 27, 2021, PCT No. PCT/US2021/020128 § 371(c)(1), (2) Date Dec. 20, 2021, PCT Pub. No. WO2021/174147, PCT Pub. Date Sep. 2, 2021.
Claims priority of provisional application 62/983,593, filed on Feb. 29, 2020.
Claims priority of provisional application 63/154,664, filed on Feb. 26, 2021.
Prior Publication US 2022/0176565 A1, Jun. 9, 2022
Int. Cl. B25J 9/16 (2006.01); G06F 40/30 (2020.01); G06V 40/16 (2022.01); G10L 15/22 (2006.01)

CPC B25J 9/1697 (2013.01) [G06F 40/30 (2020.01); G06V 40/174 (2022.01); G10L 15/22 (2013.01)]

19 Claims

1. A method of authoring and modifying presentation conversation files, comprising:

accessing computer-readable instructions from one or more memory devices for execution by one or more processors of the computing device;

executing the computer-readable instructions accessed from the one or more memory devices by the one or more processors of the computing device; and

wherein executing the computer-readable instructions further comprising:

receiving, at a renderer module of the computing device, voice files, visual effect files, facial expression files and/or mobility files;

analyzing, by the language processor module of the computing device, whether the voice files, the visual effect files, the facial expression files and/or mobility files follow guidelines of a multimodal authoring system;

generating, by the renderer module, one or more presentation conversation files based at least in part on the received voice files, visual effect files, facial expression files and/or mobility files;

testing, at an automatic testing system, the one or more presentation conversation files to verify correct operation of a robot computing device that receives the one or more presentation conversation files as an input;

identifying, by a multimodal review module, changes to be made to the voice files, the visual effect files, the facial expression files and/or the mobility files

generating revised voice files, revised visual effect files, revised facial expression files and/or revised mobility files based at least in part on the identified changes;

verifying, by the language processor module, that the revised voice files, revised visual effect files, revised facial expression files and/or revised mobility files are aligned with the robot computing device's personality and operational characteristics;

communicating the revised voice files, the revised visual effect files, the revised facial expression files and the revised mobility files to the robot computing device, wherein the revised voice files cause the robot computing device to make sounds and the revised mobility files cause parts of the robot computing device to move; and

testing, at the automatic testing system, the one or more presentation conversation files on different device configuration files to simulate testing on different robot computing devices.