US 11,675,808 B2
Dataset analysis and dataset attribute inferencing to form collaborative datasets
Bryon Kristen Jacob, Austin, TX (US); David Lee Griffith, Austin, TX (US); Triet Minh Le, Austin, TX (US); Jon Loyens, Austin, TX (US); Brett A. Hurt, Austin, TX (US); and Arthur Albert Keen, Austin, TX (US)
Assigned to data.world, Inc., Austin, TX (US)
Filed by data.world, Inc., Austin, TX (US)
Filed on Jan. 31, 2022, as Appl. No. 17/589,884.
Application 17/589,884 is a continuation of application No. 17/114,377, filed on Dec. 7, 2020, granted, now 11,366,824.
Application 17/114,377 is a continuation of application No. 16/292,120, filed on Mar. 4, 2019, granted, now 10,860,601.
Application 16/292,120 is a continuation of application No. 16/271,263, filed on Feb. 8, 2019, granted, now 10,860,600.
Application 16/271,263 is a continuation of application No. 15/186,516, filed on Jun. 19, 2016, granted, now 10,452,677.
Application 16/292,120 is a continuation of application No. 15/186,516, filed on Jun. 19, 2016, granted, now 10,452,677.
Application 17/114,377 is a continuation of application No. 16/271,263, filed on Feb. 8, 2019, granted, now 10,860,600.
Application 17/114,377 is a continuation of application No. 16/271,687, filed on Feb. 8, 2019, granted, now 10,860,613.
Application 16/271,687 is a continuation of application No. 15/186,520, filed on Jun. 19, 2016, granted, now 10,346,429.
Application 17/114,377 is a continuation of application No. 16/292,135, filed on Mar. 4, 2019, granted, now 10,963,486.
Application 16/292,135 is a continuation of application No. 16/271,687, filed on Feb. 8, 2019, granted, now 10,860,613.
Prior Publication US 2022/0229847 A1, Jul. 21, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/25 (2019.01); G06N 20/00 (2019.01); G06F 16/901 (2019.01); G06N 5/022 (2023.01); G06F 16/178 (2019.01)
CPC G06F 16/258 (2019.01) [G06F 16/1794 (2019.01); G06F 16/9024 (2019.01); G06N 5/022 (2013.01); G06N 20/00 (2019.01)]
9 Claims
1. A method, comprising: receiving data associated with a query into a collaborative data consolidation system, the query being executed across a plurality of atomized datasets;
converting the data into one or more triples, the one or more triples being stored in one or more triplestores;
analyzing the query to classify a portion of the query to form classified query portions;
rewriting the query into a plurality of sub-queries based on the classified query portions, each of the plurality of sub-queries being formatted in a data type associated with at least one of the one or more triplestores;
transmitting each of the plurality of queries after the rewriting to one or more distributed data repositories formatted to the data type;
retrieving one or more query results in response to at least one of the plurality of sub-queries;
federating the one or more query results retrieved from the one or more distributed data repositories to generate a federated query result in response to the query;
transmitting each of the plurality of sub-queries to at least one of the one or more triplestores, each of the one or more triplestores being hosted on one or more distributed data repositories formatted according to the data type; and
retrieving one or more query results in response to at least one of the plurality of sub-queries, the one or more query results being transmitted in response to the query as the federated query result.
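
The steps recited in claim 1 can be illustrated with a minimal, in-memory sketch. It is not the patented implementation: the names `TripleStore`, `to_triples`, `classify`, and `federated_query`, the predicate-based routing table, and the sample data are all hypothetical, and a simple tuple pattern with `None` as a wildcard stands in for a real query language. The sketch assumes that each query portion is classified by the predicate it references and that each predicate lives in exactly one triplestore.

```python
from dataclasses import dataclass, field
from typing import Optional

Triple = tuple[str, str, str]
Pattern = tuple[Optional[str], Optional[str], Optional[str]]


def to_triples(row_id: str, row: dict) -> list[Triple]:
    """Convert one tabular row into (subject, predicate, object) triples."""
    return [(row_id, column, str(value)) for column, value in row.items()]


@dataclass
class TripleStore:
    """Hypothetical stand-in for one distributed triplestore."""
    name: str
    triples: set = field(default_factory=set)

    def add(self, triples: list[Triple]) -> None:
        self.triples.update(triples)

    def match(self, pattern: Pattern) -> list[Triple]:
        """Return triples matching the pattern; None acts as a wildcard."""
        return [
            t for t in self.triples
            if all(p is None or p == v for p, v in zip(pattern, t))
        ]


def classify(query: list[Pattern], routing: dict) -> dict:
    """Classify each query portion by the store that holds its predicate."""
    classified: dict = {}
    for pattern in query:
        classified.setdefault(routing[pattern[1]], []).append(pattern)
    return classified


def federated_query(query: list[Pattern], stores: dict, routing: dict) -> list[Triple]:
    """Rewrite the query into per-store sub-queries, run them, merge results."""
    results: list[Triple] = []
    for store_name, sub_query in classify(query, routing).items():
        for pattern in sub_query:
            results.extend(stores[store_name].match(pattern))
    # Federate: deduplicate and return one combined, ordered result.
    return sorted(set(results))


# Illustrative data only; none of these values come from the patent.
people = TripleStore("people")
people.add(to_triples("person:1", {"name": "Ada", "city": "Austin"}))
orders = TripleStore("orders")
orders.add(to_triples("order:7", {"buyer": "person:1", "total": 42}))

stores = {"people": people, "orders": orders}
routing = {"name": "people", "city": "people", "buyer": "orders", "total": "orders"}
query = [(None, "city", "Austin"), (None, "buyer", "person:1")]

result = federated_query(query, stores, routing)
print(result)
```

In this sketch the federation step is a plain union of the sub-query results; a real system would more likely join results on shared subjects or objects, which the claim language leaves open.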