CPC G06F 16/90332 (2019.01) [G06F 3/04847 (2013.01); G06F 16/243 (2019.01); G06F 16/24522 (2019.01); G06F 16/248 (2019.01); G06F 18/23213 (2023.01); G06F 40/30 (2020.01)] | 19 Claims |
1. A method of using natural language for visual analysis of a dataset, comprising:
at a computer having a display, one or more processors, and memory storing one or more programs configured for execution by the one or more processors:
displaying, on the display, a data visualization application, including displaying a data visualization based on a dataset retrieved from a database using a set of one or more queries;
receiving, by the data visualization application, a user input to specify a natural language command related to the displayed data visualization;
based on the displayed data visualization, extracting one or more independent analytic phrases from the natural language command;
for a first analytic phrase of the one or more extracted analytic phrases:
computing semantic relatedness between the first analytic phrase and data field names of numeric data fields in the dataset;
identifying one or more of the numeric data fields whose data field names have highest computed semantic relatedness to the first analytic phrase; and
selecting a numerical function according to the first analytic phrase, wherein the numerical function (1) comprises a parameterized data selection criterion and (2) compares data values in the one or more numeric data fields to a threshold value;
retrieving an updated dataset, including (1) selecting an initial range for values of the parameters of the parameterized data selection criterion and (2) requerying the database using an updated set of one or more queries to filter the one or more identified numeric data fields according to the numerical function; and
displaying, on the display in the data visualization application, an updated data visualization using the updated dataset.
|