| CPC G06F 16/215 (2019.01) [G06F 16/22 (2019.01); G06F 16/2365 (2019.01)] | 19 Claims |

|
1. A method for storing data in a data storage device, wherein the method comprises:
receiving raw data or access addresses for the raw data by a data processing and search system (DPS) system, wherein the raw data have different structures;
parsing the raw data by a plurality of different parsers in order to ascertain data objects each having one or more data values, and an object ID of each of the data objects, wherein one semantic concept is assigned to each of at least some of the data values;
automatically importing results of the parsing of the raw data by the DPS system;
automatically storing all of the results of the parsing as redundancy-free data value lists in the data storage device by the DPS system, wherein the redundancy-free lists comprise:
one or more concept lists, wherein each of the concept lists represents a particular semantic concept, and a non-redundant list selectively includes those of the imported data values to which the semantic concept of this concept list was assigned during parsing, wherein each of the data values in the concept list is assigned all of the object IDs of those data objects that include this data value, wherein the included data value is a representation of the semantic concept of this concept list;
a concept-free list, wherein the concept-free list selectively includes those of the imported data values to which no semantic concept was assigned during parsing, wherein each of the data values in the concept-free list is assigned all object IDs of those data objects that include this data value, wherein no semantic concept could be assigned to the included data value of this data object during parsing; and
providing the redundancy-free lists by the DPS system in order to respond to a search request and/or to perform a data analysis, wherein the search request and/or the data analysis is performed without access to the raw data, wherein the use of the redundancy-free lists to perform the requested search and/or the data analysis comprises,
performing a set operation on sets of object IDs assigned to data values in two or more of the redundancy-free lists,
wherein the set operation comprises a calculation of an intersection set, a union set, a difference set, or a symmetric difference set.
|