US 12,271,702 B2
Semantic-aware feature engineering
Paulo César Gonçalves Marques, Braga (PT); Miguel Ramos de Araújo, Oporto (PT); Bruno Casal Laraña, Oporto (PT); Nuno Miguel Lourenço Diegues, Lisbon (PT); Pedro Cardoso Lessa e Silva, Oporto (PT); and Pedro Gustavo Santos Rodrigues Bizarro, Lisbon (PT)
Assigned to Feedzai—Consultadoria e Inovação Tecnológica, S.A., (PT)
Filed by Feedzai—Consultadoria e Inovação Tecnológica, S.A., Coimbra (PT)
Filed on Apr. 26, 2024, as Appl. No. 18/647,596.
Application 18/647,596 is a continuation of application No. 16/567,761, filed on Sep. 11, 2019, granted, now 12,001,800.
Claims priority of provisional application 62/730,985, filed on Sep. 13, 2018.
Prior Publication US 2024/0411995 A1, Dec. 12, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06Q 20/40 (2012.01); G06F 17/18 (2006.01); G06F 18/214 (2023.01); G06F 40/30 (2020.01); G06N 20/00 (2019.01); H04L 9/40 (2022.01)
CPC G06F 40/30 (2020.01) [G06F 17/18 (2013.01); G06F 18/2155 (2023.01); G06N 20/00 (2019.01); G06Q 20/4016 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
receiving, by a processor, semantic labels for data fields of training data, wherein each of the semantic labels is associated with a semantic meaning associated with a corresponding data field;
training, by the processor, a machine learning model including by:
determining at least one new feature to improve interpretability of the machine learning model as compared to not using the at least one new feature including by:
determining, by the processor, that a received semantic label of the received semantic labels meets a transformation condition specified by a transformation, wherein the transformation condition defines attributes of a label to which a corresponding transformation is applicable;
in response to the determination that the received semantic label meets the transformation condition, applying, by the processor, the transformation to the data fields based at least in part on the semantic labels to determine the at least one new feature including by building a combination of tagged fields by adding the received semantic label and associated data field to a group of tagged fields;
wherein a tagged field includes a data field and an associated semantic label, and the transformation condition reduces computer processing resources by limiting a transformation scope for the at least one new feature as compared to not considering the transformation condition; and
using, by the processor, the at least one new feature to train the machine learning model, wherein the trained machine learning model has improved interpretability and consumes reduced computer processing resources as compared to an untrained machine learning model; and
determining, by the processor, a profile characterizing behavior of at least one entity, wherein the profile includes at least one transformation configured to automatically generate the at least one new feature.