CPC G06F 16/90332 (2019.01) [G06F 16/2425 (2019.01); G06F 16/243 (2019.01); G06F 16/24522 (2019.01); G06F 16/2465 (2019.01); G06F 16/248 (2019.01); G06F 16/26 (2019.01); G06F 16/90324 (2019.01); G06F 16/9035 (2019.01); G06F 40/216 (2020.01); G06F 40/284 (2020.01); G06F 40/30 (2020.01); G06N 5/02 (2013.01)] | 21 Claims |
1. A computer-implemented method comprising:
detecting, in a dataset, an insight, the insight comprising a fact about a portion of data in the dataset, the insight maintained in a text-only structured format comprising a plurality of vocabulary terms, each vocabulary term in the plurality of vocabulary terms comprising a field and a plurality of options for values of the field;
determining, by analyzing a natural language input, a context, the context identifying an intent and an entity referenced by the natural language input, the context maintained in the text-only structured format;
computing, for the insight, a relevance score, the relevance score comprising a measure of similarity between an encoded form of the insight and an encoded form of the context, the encoded form of the insight and the encoded form of the context generated according to the same encoding scheme, the encoded form of the insight comprising an encoding of a vocabulary term used in the insight, the encoded form of the context comprising an encoding of a vocabulary term used in the context, wherein the encoding scheme is a one-hot encoding scheme in which each option in the plurality of options for values of the field is represented by a dimension of a vector; and
presenting a selected insight, the selected insight selected from a set of insights including the insight, the selected insight selected based on the relevance score.
|