US 12,032,476 B2
Data unit test-based data management system
Francisco Ferreira, London (GB); Edgar Gomes de Araujo, Almere (NL); and Jose Angel Riarola, London (GB)
Assigned to Palantir Technologies Inc., Denver, CO (US)
Filed by Palantir Technologies Inc., Denver, CO (US)
Filed on Feb. 25, 2022, as Appl. No. 17/681,639.
Application 17/681,639 is a continuation of application No. 16/693,063, filed on Nov. 22, 2019, granted, now 11,294,801.
Application 16/693,063 is a continuation of application No. 15/996,265, filed on Jun. 1, 2018, granted, now 10,496,529, issued on Dec. 3, 2019.
Claims priority of provisional application 62/659,598, filed on Apr. 18, 2018.
Prior Publication US 2022/0179779 A1, Jun. 9, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 11/36 (2006.01); G06F 8/30 (2018.01); G06F 8/41 (2018.01)
CPC G06F 11/3688 (2013.01) [G06F 8/30 (2013.01); G06F 8/436 (2013.01); G06F 11/3696 (2013.01)]
13 Claims
 
1. A method comprising:
asynchronous from and at least partially concurrent to operation of a first data transformation process performed by one or more data processors and operation of a second data transformation process performed by the one or more data processors, by one or more computer hardware processors separate from the one or more data processors:
loading, by the one or more computer hardware processors, a data set into memory, wherein the data set is generated by the first data transformation process executed by the one or more data processors and forwarded by the first data transformation process to the second data transformation process executed by the one or more data processors;
applying, by the one or more computer hardware processors, a first condition to the data set to obtain a subset of the data set;
executing, by the one or more computer hardware processors, a test function comprising unit test logic on the subset of the data set, wherein the unit test logic includes a matcher; and
determining, by the one or more computer hardware processors, that the subset of the data set is valid in response to execution of the test function on the subset of the data set;
wherein, in response to the determination that the data set is valid, the first and second data transformation processes by the one or more data processors are uninterrupted.
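
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. Every name in it (`greater_than`, `run_data_unit_test`, `first_transform`, `second_transform`, `run_pipeline`) is hypothetical, and a background thread stands in for the separate hardware processors of the claim. The claim does not say what happens when the subset is found invalid, so the sketch only reports the outcome.

```python
import threading
from typing import Any, Callable, Dict, List, Tuple

Row = Dict[str, Any]


def greater_than(threshold: float) -> Callable[[Any], bool]:
    """Hypothetical matcher: builds a predicate that checks value > threshold."""
    def matches(value: Any) -> bool:
        return value > threshold
    return matches


def run_data_unit_test(
    data_set: List[Row],
    condition: Callable[[Row], bool],
    column: str,
    matcher: Callable[[Any], bool],
) -> bool:
    """Apply a condition to get a subset, then run the matcher on that subset."""
    # "applying a first condition to the data set to obtain a subset"
    subset = [row for row in data_set if condition(row)]
    # "executing a test function comprising unit test logic ... includes a matcher"
    failures = [row for row in subset if not matcher(row[column])]
    # "determining that the subset of the data set is valid"
    return not failures


def first_transform(raw: List[Tuple[str, int]]) -> List[Row]:
    """Stand-in for the first data transformation process."""
    return [{"region": region, "amount": amount * 2} for region, amount in raw]


def second_transform(data_set: List[Row]) -> int:
    """Stand-in for the second data transformation process."""
    return sum(row["amount"] for row in data_set)


def run_pipeline(raw: List[Tuple[str, int]]) -> Tuple[int, bool]:
    data_set = first_transform(raw)
    outcome: Dict[str, bool] = {}

    def validate() -> None:
        # The data set is already in memory; the check only reads it.
        outcome["valid"] = run_data_unit_test(
            data_set,
            condition=lambda row: row["region"] == "EU",
            column="amount",
            matcher=greater_than(0),
        )

    # The check is started on its own thread, so the second transformation
    # below is never made to wait for it.
    checker = threading.Thread(target=validate)
    checker.start()
    total = second_transform(data_set)  # proceeds without waiting on the check
    checker.join()  # joined only so the sketch can return the verdict
    return total, outcome["valid"]


if __name__ == "__main__":
    print(run_pipeline([("EU", 1), ("US", 2), ("EU", 3)]))
```

Run as a script, the example prints `(12, True)`: the second transformation produces its total regardless of the check, and the check reports the EU subset as valid. A real deployment would place the check on separate hardware and read the data set from wherever the first transformation writes it. The thread here only shows that neither transformation waits on the test.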