US 12,033,083 B2
	System and method for machine learning architecture for partially-observed multimodal data
Yu Gong, Vancouver (CA); Jiawei He, Vancouver (CA); Thibaut Durand, Vancouver (CA); Megha Nawhal, Vancouver (CA); Yanshuai Cao, Toronto (CA); Gregory Mori, Vancouver (CA); and Seyed Hossein Hajimirsadeghi, Vancouver (CA)
Assigned to ROYAL BANK OF CANADA, Toronto (CA)
Filed by ROYAL BANK OF CANADA, Toronto (CA)
Filed on May 22, 2020, as Appl. No. 16/882,074.
Claims priority of provisional application 62/851,444, filed on May 22, 2019.
Prior Publication US 2020/0372369 A1, Nov. 26, 2020
Int. Cl. G06N 3/088 (2023.01); G06N 3/042 (2023.01); G06N 3/045 (2023.01)

CPC G06N 3/088 (2013.01) [G06N 3/042 (2023.01); G06N 3/045 (2023.01)]

20 Claims

1. A computer implemented system for conducting machine learning using partially-observed data by using a variational selective auto-encoder (VSAE) machine learning model framework, the system including a processor operating in conjunction with computer memory, the system comprising:

the processor configured to provide:

a data receiver adapted to receive one or more data sets representative of the partially-observed data, each having a subset of observed data and a subset of unobserved data, the data receiver configured to extract a mask data structure from each data set of the one or more data sets representative of which modalities are observed and which modalities are unobserved; and

a machine learning data architecture engine adapted to:

maintain a attributive proposal network for processing the one or more data sets, the attributive proposal network including a set of individual encoders, each individual encoder adapted for a corresponding observed modality;

maintain a collective proposal network for processing the corresponding mask data structure, the collective proposal network including a collective encoder corresponding to all of the unobserved modalities, the mask data structure utilized for conditional selection of a proposal distribution for an unobserved modality; and

maintain a first generative network including a first set of one or more decoders, each decoder of the first set of the one or more decoders configured to generate output estimated data proposed by the attributive proposal network and the collective proposal network wherein, for the unobserved modalities, expectation over collective observation from the collective proposal network is applied as a corresponding proposal distribution as an approximation of a true posterior distribution based on the mask data structure such that a joint distribution of all attributes and mask data structure can be learned from the partially-observed data.