CPC G06F 16/243 (2019.01) [G06F 16/24573 (2019.01); G06F 16/248 (2019.01); G06F 16/9024 (2019.01)] | 17 Claims |
1. A method comprising:
obtaining data of at least two different modalities, the data comprising at least first data and second data, the first data and the second data stored in respective data sources, the at least two different modalities selected from a group consisting of image and text modalities;
determining an overlapping keyword correlating the first data and the second data by analyzing textual descriptions associated with the first data and second data and identifying the overlapping keyword as appearing in a textual description of the first data and in a textual description of the second data;
storing the first data and the second data in an associative manner based on the overlapping keyword, the associative manner generated using the overlapping keyword;
annotating the first data to obtain a first annotation result after storing the first data and the second data;
annotating the second data to obtain a second annotation result;
performing a correlation annotation on the first data and the second data to obtain annotated correlation information; and
training at least one machine learning (ML) model using the first data, the second data, and the overlapping keyword.
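For illustration only, the keyword-overlap and associative-storage steps recited above might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the stopword list, the dictionary-based records, the keyword-to-identifier index, and the names keywords, overlapping_keywords, and store_associatively are all hypothetical and do not appear in the claim.

```python
import re
from collections import defaultdict

# Hypothetical stopword list; the claim does not specify how keywords are extracted.
STOPWORDS = frozenset({"a", "an", "the", "of", "in", "on", "and", "with", "at", "is"})


def keywords(description):
    """Return the set of lowercase word tokens in a description, minus stopwords."""
    tokens = re.findall(r"[a-z0-9]+", description.lower())
    return {token for token in tokens if token not in STOPWORDS}


def overlapping_keywords(first_description, second_description):
    """Keywords that appear in both textual descriptions."""
    return keywords(first_description) & keywords(second_description)


def store_associatively(store, first_item, second_item):
    """Index both items under every keyword their descriptions share.

    `store` maps keyword -> list of item ids (an assumed storage layout).
    Returns the set of shared keywords.
    """
    shared = overlapping_keywords(first_item["description"], second_item["description"])
    for keyword in shared:
        for item in (first_item, second_item):
            if item["id"] not in store[keyword]:
                store[keyword].append(item["id"])
    return shared


# Hypothetical records: one image-modality item and one text-modality item,
# each carrying a textual description.
image_item = {"id": "img-001", "modality": "image",
              "description": "A red bicycle parked at the station"}
text_item = {"id": "txt-001", "modality": "text",
             "description": "Report on bicycle theft near the station"}

store = defaultdict(list)
shared = store_associatively(store, image_item, text_item)
```

In this sketch the two items end up linked through each shared keyword, so a lookup by keyword returns data of both modalities. The annotation and model-training steps are outside its scope.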