| CPC G06F 40/157 (2020.01) [G06F 40/284 (2020.01)] | 20 Claims |

|
1. A computer-based method of encoding tabular data with permutation invariance, the method comprising:
receiving input including tabular data and linearizing a column or row within the received tabular data;
automatically assigning an increasing sequence of position identifiers to each non-delimiting tokenized cell in the linearized column or row until a header delimiter is reached;
in response to reaching the header delimiter, automatically assigning a monotonically increasing sequence of position identifiers for each non-delimiting tokenized cell positioned after the header delimiter, restarting from an integer corresponding to 1 greater than the position identifier assigned to the header delimiter for each non-delimiting tokenized cell positioned after cell delimiters;
automatically assigning a static position identifier for each of the cell delimiters in the linearized column or row, the static position identifier being 1 greater than a highest position identifier assigned to the non-delimiting tokenized cells; and
automatically outputting an encoded permutation-invariant representation of the linearized column or row.
|