US 12,079,224 B2
Database systems with adaptive automated metadata assignment
Zachary Alexander, Berkeley, CA (US); and Yixin Mao, San Francisco, CA (US)
Assigned to Salesforce, Inc.
Filed by Salesforce, Inc., San Francisco, CA (US)
Filed on Dec. 12, 2022, as Appl. No. 18/064,696.
Prior Publication US 2024/0193167 A1, Jun. 13, 2024
Int. Cl. G06F 7/00 (2006.01); G06F 16/00 (2019.01); G06F 16/242 (2019.01); G06F 16/2457 (2019.01); G06F 16/28 (2019.01)
CPC G06F 16/24573 (2019.01) [G06F 16/244 (2019.01); G06F 16/285 (2019.01)]
17 Claims
1. A method comprising:
identifying a set of out-of-domain conversations from among a plurality of unassigned conversations using a plurality of existing groups of semantically similar conversations, wherein the set of out-of-domain conversations comprises a first subset of the plurality of unassigned conversations;
determining, based upon the plurality of unassigned conversations, a new candidate group of semantically similar conversations comprising a subset of the plurality of unassigned conversations, the subset of the plurality of unassigned conversations comprising a second subset of the set of out-of-domain conversations, wherein the new candidate group is distinct from the plurality of existing groups of semantically similar conversations, wherein a respective existing group of the plurality of existing groups comprises a distinct subset of conversations assigned to the respective existing group based on a respective representative utterance associated with the respective conversation of the distinct subset of conversations;
determining a clustering performance metric associated with the new candidate group of semantically similar conversations based on a relationship between a reference semantic representation for the new candidate group of semantically similar conversations and respective semantic representations for the plurality of existing groups of semantically similar conversations; and
when the clustering performance metric is greater than a threshold:
automatically assigning one or more conversations of the plurality of unassigned conversations to the new candidate group based on a relationship between a respective representative utterance associated with a respective conversation of the one or more conversations and the reference semantic representation for the new candidate group; and
automatically updating, at a database system, one or more records associated with the one or more conversations to include metadata identifying the new candidate group.
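The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and everything below is an assumption made for illustration: conversations are represented by embedding vectors of their representative utterances, cosine similarity stands in for the "relationship" between semantic representations, a centroid stands in for the reference semantic representation, a seed-and-gather step stands in for the clustering, the clustering performance metric is taken to be one minus the highest similarity to any existing group, and a plain dictionary stands in for the database records. The function name, parameter names, and cutoff values are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def centroid(vectors):
    """Component-wise mean, used here as the reference representation."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def assign_new_group(unassigned, existing, records, group_id,
                     ood_cutoff=0.5, metric_threshold=0.3, member_cutoff=0.8):
    """Hypothetical sketch of the claimed flow.

    unassigned: conversation id -> embedding of its representative utterance
    existing:   existing group id -> semantic representation of that group
    records:    conversation id -> metadata dict (stand-in for database records)
    Returns the sorted ids assigned to the new group (empty if none).
    """
    def nearest_existing(vec):
        return max((cosine(vec, ref) for ref in existing.values()), default=0.0)

    # Identify out-of-domain conversations: far from every existing group.
    ood = sorted(cid for cid, vec in unassigned.items()
                 if nearest_existing(vec) < ood_cutoff)
    if not ood:
        return []

    # Determine a candidate group (assumed stand-in for real clustering):
    # gather every unassigned conversation close to one out-of-domain seed.
    seed = unassigned[ood[0]]
    members = [vec for vec in unassigned.values()
               if cosine(vec, seed) >= member_cutoff]
    reference = centroid(members)

    # Clustering performance metric (assumed): separation of the candidate's
    # reference representation from the existing groups' representations.
    metric = 1.0 - nearest_existing(reference)
    if metric <= metric_threshold:
        return []

    # Assign conversations close to the reference and tag their records.
    assigned = sorted(cid for cid, vec in unassigned.items()
                      if cosine(vec, reference) >= member_cutoff)
    for cid in assigned:
        records.setdefault(cid, {})["group"] = group_id
    return assigned


# Toy data: two existing groups and four unassigned conversations.
existing = {"billing": [1.0, 0.0, 0.0], "login": [0.0, 1.0, 0.0]}
unassigned = {
    "c1": [0.9, 0.1, 0.0],    # close to "billing"
    "c2": [0.0, 0.1, 1.0],    # far from both existing groups
    "c3": [0.1, 0.0, 0.9],    # far from both existing groups
    "c4": [0.0, 0.95, 0.05],  # close to "login"
}
records = {cid: {} for cid in unassigned}
assigned = assign_new_group(unassigned, existing, records, "new-topic")
print(assigned)  # ['c2', 'c3']
```

In this toy run only c2 and c3 receive the new group metadata, while c1 and c4 are left untouched because they sit close to an existing group. Raising `metric_threshold` close to 1.0 causes the candidate group to be rejected, so no record is updated, which mirrors the conditional "when the clustering performance metric is greater than a threshold" in the claim.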