CPC G06F 16/951 (2019.01) [G06F 16/374 (2019.01)] | 15 Claims |
1. A method, comprising:
receiving, by an indexing engine executing on a processor, an indexing request to index an object containing a name value for the object;
creating, by the indexing engine from the name value for the object, tokens of the name value for the object by applying a first set of string parsing rules to the name value for the object;
creating, by the indexing engine from the name value for the object, supplementary terms, the creating supplementary terms including applying a second set of string parsing rules to the name value for the object to generate the supplementary terms, including variations of the name value and including partial repetition and overlapping data subsets of the name value, wherein creating supplementary terms further comprises:
identifying sets of characters from the name value for the object, wherein the supplementary terms comprise variations of the sets of characters, and wherein the second set of string parsing rules include a repetition rule, wherein the applying the second set of string parsing rules further comprises applying the repetition rule to the sets of characters, and wherein the supplementary terms include a partial repetition of a first set of the sets of characters;
storing the tokens in a first index;
storing the supplementary terms in a supplemental index; and
storing the object in a repository, wherein, responsive to a user search query containing input data terms, the object in the repository is searchable through the tokens in the first index and also searchable through variations of the name value for the object through the supplementary terms in the supplemental index against the input data items in the user search query.
|
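The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not define the two rule sets, so every rule below is an assumption chosen for illustration. The first rule set is assumed to lowercase the name value and split it on non-alphanumeric characters. The "sets of characters" are assumed to be runs of letters or runs of digits. The overlapping data subsets are assumed to be sliding three-character windows. The repetition rule is assumed to append the first two characters of a set to itself. All names (`IndexingEngine`, `tokenize`, `character_sets`, `overlapping_subsets`, `partial_repetition`, `supplementary_terms`) are hypothetical.

```python
import re
from collections import defaultdict


def tokenize(name):
    """First rule set (assumed): lowercase, then split on non-alphanumerics."""
    return [t for t in re.split(r"[^0-9a-z]+", name.lower()) if t]


def character_sets(name):
    """Identify sets of characters (assumed): runs of letters or runs of digits."""
    return re.findall(r"[a-z]+|[0-9]+", name.lower())


def overlapping_subsets(chars, size=3):
    """Overlapping data subsets (assumed): sliding windows of `size` characters."""
    return [chars[i:i + size] for i in range(len(chars) - size + 1)]


def partial_repetition(chars, count=2):
    """Repetition rule (assumed): append the first `count` characters again."""
    return chars + chars[:count]


def supplementary_terms(name):
    """Second rule set: variations of each character set found in the name."""
    terms = set()
    for chars in character_sets(name):
        terms.update(overlapping_subsets(chars))
        terms.add(partial_repetition(chars))
    return terms


class IndexingEngine:
    def __init__(self):
        self.first_index = defaultdict(set)         # token -> object ids
        self.supplemental_index = defaultdict(set)  # supplementary term -> object ids
        self.repository = {}                        # object id -> object

    def index(self, object_id, obj):
        """Handle an indexing request for an object carrying a name value."""
        name = obj["name"]
        for token in tokenize(name):
            self.first_index[token].add(object_id)
        for term in supplementary_terms(name):
            self.supplemental_index[term].add(object_id)
        self.repository[object_id] = obj

    def search(self, query):
        """Match each query term against both indexes and return the objects."""
        hits = set()
        for term in tokenize(query):
            hits |= self.first_index.get(term, set())
            hits |= self.supplemental_index.get(term, set())
        return [self.repository[i] for i in sorted(hits)]


engine = IndexingEngine()
engine.index(1, {"name": "Q3Report2019"})
print(engine.search("q3report2019"))  # found through the first index
print(engine.search("por"))           # found through the supplemental index
```

Under these assumptions the name value "Q3Report2019" yields a single token, so a query for a fragment such as "por" matches only through the supplemental index. This is the situation the final storing step of the claim addresses.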