US 12,001,801 B2
Question answering using dynamic question-answer database
Yuanxin Wang, Thornhill (CA); Anuprit Kale, El Cerrito, CA (US); Zachary Alexander, Berkeley, CA (US); and Na Cheng, Dublin, CA (US)
Assigned to Salesforce, Inc., San Francisco, CA (US)
Filed by salesforce.com, inc., San Francisco, CA (US)
Filed on Nov. 15, 2019, as Appl. No. 16/685,909.
Prior Publication US 2021/0149964 A1, May 20, 2021
Int. Cl. G06F 40/30 (2020.01); G06F 16/9032 (2019.01); G06F 18/22 (2023.01); G06F 40/205 (2020.01); G06N 20/00 (2019.01)
CPC G06F 40/30 (2020.01) [G06F 16/90332 (2019.01); G06F 18/22 (2023.01); G06F 40/205 (2020.01); G06N 20/00 (2019.01)]
20 Claims
 
1. A system comprising:
a database system implemented using a server system, the database system configurable to cause:
generating a question using a set of documents stored in at least one data source by at least:
identifying a first one or more of the documents as being of a designated type of customer record,
identifying a second one or more of the documents including a transcript of a chat between an agent and a customer associated with the customer record, and
processing both the identified first one or more documents and the identified second one or more documents to identify one or more questions stored in one or more designated fields of one or more of: the identified first one or more documents, or the identified second one or more documents;
comparing the generated question with a first plurality of questions within a question-answer repository such that a first plurality of similarity scores are generated, each of the first plurality of similarity scores representing a similarity between the generated question and a corresponding one of the first plurality of questions;
determining whether the generated question is semantically distinct from the first plurality of questions based, at least in part, on the similarity scores;
adding the generated question to the question-answer repository based, at least in part, on having determined that the generated question is semantically distinct from the first plurality of questions; and
updating the set of documents to include the generated question.
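
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (Document, extract_questions, similarity, process_question, the "case" and "chat_transcript" type labels, the "question" field, and the 0.5 threshold) is a hypothetical chosen for illustration. Token-level Jaccard overlap stands in for whatever semantic similarity measure an actual system would use, and the sketch assumes the set of documents is updated at the same time the question is added to the repository.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    # Hypothetical type labels: "case" for a customer record,
    # "chat_transcript" for an agent/customer chat.
    doc_type: str
    fields: dict = field(default_factory=dict)


def extract_questions(docs, record_type="case", question_field="question"):
    """Collect questions stored in a designated field of the identified
    customer records and chat transcripts; other document types are skipped."""
    records = [d for d in docs if d.doc_type == record_type]
    chats = [d for d in docs if d.doc_type == "chat_transcript"]
    return [d.fields[question_field]
            for d in records + chats
            if question_field in d.fields]


def similarity(a, b):
    """Token-level Jaccard overlap. Assumption: a simple placeholder for a
    real semantic similarity model, which the claim does not specify."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def process_question(question, repository, docs, threshold=0.5):
    """Score the question against every repository question, then add it
    only if all scores fall below the (hypothetical) threshold."""
    scores = [similarity(question, q) for q in repository]
    is_distinct = all(s < threshold for s in scores)
    if is_distinct:
        repository.append(question)
        # Update the document set to include the generated question.
        docs.append(Document("generated_question", {"question": question}))
    return scores, is_distinct
```

In this sketch, a question that repeats an existing repository entry scores 1.0 against it and is rejected, while a question sharing only a few common words with every entry falls below the threshold and is added to both the repository and the document set.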