US 11,706,357 B2
	Multifunction peripheral natural language chatbot
Hung Tran, Mission Viejo, CA (US); and Halley Bradt, Laguna Hills, CA (US)
Assigned to Toshiba TEC Kabushiki Kaisha, Shinagawa-ku (JP)
Filed by Toshiba TEC Kabushiki Kaisha, Shinagawa-ku (JP)
Filed on Aug. 24, 2021, as Appl. No. 17/409,983.
Prior Publication US 2023/0060553 A1, Mar. 2, 2023
Int. Cl. H04N 1/00 (2006.01); G10L 15/26 (2006.01); G10L 13/04 (2013.01); H04L 51/02 (2022.01); G10L 15/22 (2006.01)

CPC H04N 1/00403 (2013.01) [G10L 13/04 (2013.01); G10L 15/22 (2013.01); G10L 15/26 (2013.01); H04L 51/02 (2013.01); H04N 1/00042 (2013.01); H04N 1/00488 (2013.01); G10L 2015/223 (2013.01)]

18 Claims

8. A method comprising:

generating, via a processor from voice input received from a user into a microphone, digitized human speech comprising an inquiry related to a current state of a document processing system;

storing, in a memory, text responses associatively with one or more requests;

storing, in the memory, a list of identified users wherein each identified user in the list is associated with historical request information corresponding to prior requests;

monitoring device states of a multifunction peripheral;

storing, in the memory, text strings corresponding to each of a plurality of multifunction peripheral device states;

a document processing engine including a scanner and a printer;

converting, via the processor, the digitized human speech received via the microphone to text to form inquiry text;

determining an identity of the user in accordance with the digitized human speech received from the user;

determining one two or more text responses corresponding to the inquiry text and one or more text strings corresponding to device states monitored by a hardware monitor and historical request information associated with the identified user;

the processor further configured to generate a ranking of each determined text response, the ranking being determined by one or more of a confidence level relative to the received inquiry, availability of a response to a particular interpretation of the received query, a satisfaction level associated with previous responses to a similar inquiry, and a timing associated with previous responses;

outputting, via a speaker, verbal responses from the speaker in accordance with the two or more determined text responses and the one or more text strings; and scanning or printing a document associated with the voice input.