US 12,112,254 B1
	Optimizing loss function during training of network
Steven L. Teig, Menlo Park, CA (US); and Eric A. Sather, Palo Alto, CA (US)
Assigned to PERCEIVE CORPORATION, San Jose, CA (US)
Filed by Perceive Corporation, San Jose, CA (US)
Filed on Feb. 3, 2020, as Appl. No. 16/780,843.
Application 16/780,843 is a continuation in part of application No. 16/453,622, filed on Jun. 26, 2019, granted, now 11,531,879.
Claims priority of provisional application 62/913,707, filed on Oct. 10, 2019.
Claims priority of provisional application 62/838,629, filed on Apr. 25, 2019.
Int. Cl. G06N 3/047 (2023.01); G06N 3/048 (2023.01); G06N 3/084 (2023.01)

CPC G06N 3/047 (2023.01) [G06N 3/048 (2023.01); G06N 3/084 (2013.01)]

20 Claims

1. A method for training a machine-trained (MT) network, the method comprising:

using a set of training inputs to train parameters of the MT network according to an initial loss function that is a first combination of a plurality of possible loss functions defined by initial values for a plurality of coefficients;

using a set of validation inputs to compute an error measure for the MT network as trained by the first set of training inputs;

modifying the loss function for subsequent training of the MT network based on the error measure computed using the set of validation inputs to generate a modified loss function that is a second combination of the plurality of possible loss functions defined by modified values for the plurality of coefficients, wherein the plurality of coefficients are continuously differentiable with respect to a description length score that accounts for (i) an amount of information required to modify the loss function and (ii) improvements to predictiveness of the MT network based on modifications to the loss function; and

using the set of training inputs to train the parameters of the MT network according to the loss function as modified based on the error measure computed using the set of validation inputs.