US 12,013,903 B2
System and method for search discovery
Patrick Thomas Sidney Pidduck, Waterloo (CA)
Assigned to OPEN TEXT SA ULC, Waterloo (CA)
Filed by Open Text SA ULC, Halifax (CA)
Filed on May 13, 2020, as Appl. No. 15/931,351.
Application 15/931,351 is a continuation of application No. 14/086,420, filed on Nov. 21, 2013, granted, now 10,678,870.
Claims priority of provisional application 61/752,744, filed on Jan. 15, 2013.
Prior Publication US 2020/0342037 A1, Oct. 29, 2020
Int. Cl. G06F 16/951 (2019.01); G06F 16/36 (2019.01)
CPC G06F 16/951 (2019.01) [G06F 16/374 (2019.01)]
15 Claims
 
1. A method, comprising:
receiving, by an indexing engine executing on a processor, an indexing request to index an object containing a name value for the object;
creating, by the indexing engine from the name value for the object, tokens of the name value for the object by applying a first set of string parsing rules to the name value for the object;
creating, by the indexing engine from the name value for the object, supplementary terms, the creating supplementary terms including applying a second set of string parsing rules to the name value for the object to generate the supplementary terms, including variations of the name value and including partial repetition and overlapping data subsets of the name value, wherein creating supplementary terms further comprises:
identifying sets of characters from the name value for the object, wherein the supplementary terms comprise variations of the sets of characters, and wherein the second set of string parsing rules include a repetition rule, wherein the applying the second set of string parsing rules further comprises applying the repetition rule to the sets of characters, and wherein the supplementary terms include a partial repetition of a first set of the sets of characters;
storing the tokens in a first index;
storing the supplementary terms in a supplemental index; and
storing the object in a repository, wherein, responsive to a user search query containing input data terms, the object in the repository is searchable through the tokens in the first index and also searchable through variations of the name value for the object through the supplementary terms in the supplemental index against the input data items in the user search query.
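
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation; the claim does not say what the two sets of string parsing rules contain, so every rule below is an assumption chosen for illustration. The first rule set is assumed to lowercase the name value and split it on whitespace. The second rule set is assumed to identify "sets of characters" as maximal runs of letters or digits, to join adjacent sets into overlapping terms, and to read the repetition rule as repeating the leading part of each set. The names `tokenize`, `character_sets`, `supplementary_terms`, `Indexer`, and `min_prefix` are hypothetical.

```python
import re
from collections import defaultdict


def tokenize(name):
    # Assumed first rule set: lowercase, then split on whitespace only.
    return name.lower().split()


def character_sets(name):
    # Assumed: a "set of characters" is a maximal run of letters or of digits.
    return re.findall(r"[a-z]+|[0-9]+", name.lower())


def supplementary_terms(name, min_prefix=3):
    # Assumed second rule set, applied to the same name value.
    sets = character_sets(name)
    terms = set(sets)
    # Overlapping subsets: join each adjacent pair, so that
    # neighbouring terms share one set of characters.
    for left, right in zip(sets, sets[1:]):
        terms.add(left + right)
    # Assumed repetition rule: repeat the leading part of each set
    # (a partial repetition), from min_prefix characters upward.
    for chars in sets:
        for end in range(min_prefix, len(chars)):
            terms.add(chars[:end])
    return terms


class Indexer:
    def __init__(self):
        self.first_index = defaultdict(set)         # token -> object ids
        self.supplemental_index = defaultdict(set)  # term -> object ids
        self.repository = {}                        # object id -> name value

    def index(self, object_id, name):
        # Store tokens, supplementary terms, and the object itself.
        for token in tokenize(name):
            self.first_index[token].add(object_id)
        for term in supplementary_terms(name):
            self.supplemental_index[term].add(object_id)
        self.repository[object_id] = name

    def search(self, query):
        # Match each query term against both indexes.
        hits = set()
        for term in query.lower().split():
            hits |= self.first_index.get(term, set())
            hits |= self.supplemental_index.get(term, set())
        return sorted(hits)
```

Under these assumptions, indexing an object named "PO-2013_Final.pdf" places the single token "po-2013_final.pdf" in the first index, while the supplemental index holds terms such as "2013", "final", "2013final", and "fin". A query for "2013" or "fin" therefore finds the object through the supplemental index even though neither string is a token in the first index.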