US 12,001,437 B2
Generating and using a semantic index
Eric Chris Wolfgang Sommerlade, Oxford (GB); Vivek Pradeep, Redmond, WA (US); Steven N. Bathiche, Bellevue, WA (US); and Nathan Luquetta-Fish, San Francisco, CA (US)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Sep. 26, 2022, as Appl. No. 17/953,048.
Prior Publication US 2024/0104103 A1, Mar. 28, 2024
Int. Cl. G06F 16/2457 (2019.01); G06F 16/22 (2019.01); G06F 40/30 (2020.01)
CPC G06F 16/24575 (2019.01) [G06F 16/2228 (2019.01); G06F 40/30 (2020.01)] 18 Claims
OG exemplary drawing
 
1. A method for generating a semantic index, the method comprising:
receiving content data, the content data comprising a plurality of subsets of content data;
labeling each of the plurality of subsets of content data, based on a semantic context corresponding to the content data;
generating a feature vector for each of the labels corresponding to a respective one of the plurality of subsets of content data;
storing the plurality of subsets of content data and their corresponding labels;
grouping the plurality of subsets of content data, based on their labels, thereby generating one or more groups of subsets of content data, wherein the grouping of the plurality of subsets of content data, based on their labels, comprises:
determining a distance between each of the feature vectors;
determining that one or more of the distances are less than a predetermined threshold; and
grouping together the feature vectors with distances therebetween that are less than the predetermined threshold, thereby grouping together the respective subsets of the plurality of subsets of content data to which the feature vectors correspond; and
adapting a computing device to perform an action, based on the one or more groups of subsets of content data.