US 12,235,850 B2
Systems and methods for online adaptation for cross-domain streaming data
Luyu Yang, District of Columbia, DC (US); Mingfei Gao, Sunnyvale, CA (US); Zeyuan Chen, Palo Alto, CA (US); Ran Xu, Mountain View, CA (US); and Chetan Ramaiah, San Bruno, CA (US)
Assigned to Salesforce, Inc., San Francisco, CA (US)
Filed by Salesforce, Inc., San Francisco, CA (US)
Filed on Jan. 28, 2022, as Appl. No. 17/588,022.
Claims priority of provisional application 63/280,941, filed on Nov. 18, 2021.
Prior Publication US 2023/0153307 A1, May 18, 2023
Int. Cl. G06F 16/2455 (2019.01); G06F 16/242 (2019.01); G06N 20/00 (2019.01)
CPC G06F 16/24568 (2019.01) [G06F 16/2425 (2019.01); G06N 20/00 (2019.01)] 18 Claims
OG exemplary drawing
 
1. A method of training one or more neural network prediction models for image recognition via online adaptation for cross-domain data, the method comprising:
receiving, via a data interface, a training dataset of image samples in a source domain;
receiving, at a first training iteration of the one or more neural network prediction models, a first target image query from a streaming sequence of target image queries in a target domain;
generating, by a first neural network prediction model implemented on one or more hardware processors, a first prediction relating to at least one item in the first target image query;
generating, by a second neural network prediction model implemented on one or more hardware processors, a second prediction relating to the at least one item in the first target image query;
generating, by the first neural network prediction model and the second neural network prediction model, a first output and a second output each identifying a respective probability that the at least one item is in an input image sample from the training dataset, respectively;
in response to determining that the second output is greater than a pre-defined threshold, selecting the second prediction as a first pseudo label;
training at least the first neural network prediction model based on a weighted sum of (i) a first loss objective computed as a cross-entropy between the first output and a corresponding label from the training dataset, and (ii) a second loss objective computed as a cross-entropy between the first prediction and the selected second prediction as the first pseudo label;
erasing, from a memory, the first target query at an end of the first training iteration of the one or more neural network prediction models; and
performing, using the trained first neural network prediction model, image recognition of a testing image.