US 11,836,447 B2
Word embedding for non-mutually exclusive categorical data
Erina Ghosh, Boston, MA (US); Stephanie Lanius, Cambridge, MA (US); Emma Holdrich Schwager, Cambridge, MA (US); and Larry James Eshelman, Ossining, NY (US)
Assigned to Koninklijke Philips N.V., Eindhoven (NL)
Filed by KONINKLIJKE PHILIPS N.V., Eindhoven (NL)
Filed on Apr. 13, 2020, as Appl. No. 16/846,605.
Claims priority of provisional application 62/874,098, filed on Jul. 15, 2019.
Claims priority of provisional application 62/838,433, filed on Apr. 25, 2019.
Prior Publication US 2020/0342261 A1, Oct. 29, 2020
Int. Cl. G06K 9/62 (2022.01); G06N 20/00 (2019.01); G06F 40/216 (2020.01); G06F 18/214 (2023.01); G06F 18/24 (2023.01)
CPC G06F 40/216 (2020.01) [G06F 18/214 (2023.01); G06F 18/24765 (2023.01); G06N 20/00 (2019.01)] 16 Claims
OG exemplary drawing
 
1. A machine learning model, comprising:
a categorical input feature, having a defined set of values;
a plurality of non-categorical input features;
a word embedding layer configured to convert the categorical input feature into an output in a word space having two dimensions; and
a machine learning network configured to receive the output of the word embedding layer and the plurality of non-categorical input features and to produce a machine learning model output,
wherein the word embedding layer comprises coefficients that are determined by training the machine learning model, and
wherein converting the categorical input feature into an output in a word space having two dimensions further comprises: calculating K_i = W_i * X_i, where K_i is the output of the word embedding, W_i is the word embedding matrix, and X_i is the categorical input value.
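
The claimed conversion K_i = W_i * X_i, with X_i read as a one-hot encoding of the categorical value, is the standard embedding-lookup operation. The sketch below is not taken from the patent; the framework (PyTorch), the class and parameter names, and all sizes other than the two-dimensional word space are illustrative assumptions.

```python
# Minimal sketch (illustrative, not the patented implementation): a categorical
# feature is mapped through a 2-dimensional embedding (K_i = W_i * X_i, with X_i
# one-hot) and the resulting vector is concatenated with the non-categorical
# features before a small network produces the model output.
import torch
import torch.nn as nn

class EmbeddingModel(nn.Module):
    def __init__(self, num_categories: int, num_noncategorical: int):
        super().__init__()
        # Word embedding layer: its weight matrix W (num_categories x 2) holds the
        # coefficients that are learned when the whole model is trained.
        self.embedding = nn.Embedding(num_categories, 2)
        # Machine learning network that consumes the embedded categorical feature
        # together with the non-categorical input features.
        self.network = nn.Sequential(
            nn.Linear(2 + num_noncategorical, 16),
            nn.ReLU(),
            nn.Linear(16, 1),
        )

    def forward(self, categorical_idx: torch.Tensor, noncategorical: torch.Tensor) -> torch.Tensor:
        # Embedding lookup is equivalent to multiplying the embedding matrix by a
        # one-hot vector X_i, yielding the 2-dimensional output K_i.
        k = self.embedding(categorical_idx)            # shape: (batch, 2)
        x = torch.cat([k, noncategorical], dim=1)      # shape: (batch, 2 + num_noncategorical)
        return self.network(x)

# Illustrative usage with made-up sizes: 10 possible categorical values, 5 other features.
model = EmbeddingModel(num_categories=10, num_noncategorical=5)
out = model(torch.tensor([3, 7]), torch.randn(2, 5))   # two example input rows
```

Because the embedding weights are ordinary trainable parameters, training the model end-to-end determines the word embedding coefficients jointly with the network weights, corresponding to the "coefficients that are determined by training the machine learning model" limitation.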