US 11,914,738 B2
Categorizing a sensitive data field in a dataset
Peter Petrus van Liesdonk, Eindhoven (NL); Daniel Pletea, Eindhoven (NL); and Paul Koster, Eindhoven (NL)
Assigned to Koninklijke Philips N.V., Eindhoven (NL)
Appl. No. 17/267,838
Filed by KONINKLIJKE PHILIPS N.V., Eindhoven (NL)
PCT Filed May 14, 2020, PCT No. PCT/EP2020/063561
§ 371(c)(1), (2) Date Feb. 11, 2021,
PCT Pub. No. WO2020/229644, PCT Pub. Date Nov. 19, 2020.
Claims priority of provisional application 62/848,180, filed on May 15, 2019.
Prior Publication US 2021/0248263 A1, Aug. 12, 2021
Int. Cl. G06F 21/62 (2013.01); G16H 70/60 (2018.01); G06F 21/60 (2013.01); H04L 9/00 (2022.01)
CPC G06F 21/6227 (2013.01) [G06F 21/602 (2013.01); G16H 70/60 (2018.01); H04L 9/008 (2013.01); H04L 2209/42 (2013.01)] 13 Claims
OG exemplary drawing
 
1. A categorization system for categorizing a sensitive data field in a dataset, the dataset comprising one or more records, each record of the dataset comprising a value of the sensitive data field, the system comprising a client device, a data provider device and a categorization device, wherein:
the client device is for obtaining the categorization, said device comprising:
a memory configured to store categories of the sensitive data field for records of the dataset;
a processor configured to:
determine categorization data, the categorization data comprising, for one or more possible values of the sensitive data field, a homomorphic encryption of the one or more possible values and data encoding a category of the one or more possible values;
provide the categorization data to the data provider device;
obtain from the categorization device data encoding a category of the sensitive data field for a record of the dataset of the one or more records, and store said category for the record in the memory;
the data provider device is for providing data to be categorized, said device comprising:
a memory configured to store the dataset;
a processor configured to:
obtain the categorization data from the client device, the categorization data comprising a homomorphic encryption of a possible value of the sensitive data field of the one or more possible values and data encoding a category of the possible value;
determine, for the record of the dataset, a homomorphic encryption indicative of a difference between said possible value and a value of the sensitive data field for the record based on the homomorphic encryption of the possible value;
provide, to the categorization device, data encoding the category of the possible value and the homomorphic encryption indicative of the difference;
the categorization device is for enabling said categorization, said device comprising:
a memory configured to store a decryption key for the homomorphic encryptions;
a processor configured to:
obtain, from the data provider device, data encoding a category of the possible value of the sensitive data field and a homomorphic encryption indicative of a difference between the possible value and a value of the sensitive data field for the record of the dataset;
determine if the value of the sensitive data field for the record matches the possible value from said homomorphic encryption using the decryption key, and in response to determining that the value of the sensitive data field for the record matches the possible value, associate said category to the record;
provide, to the client device, data encoding the category associated to the record.