| CPC G06F 18/22 (2023.01) [G06F 16/90335 (2019.01); G06F 18/213 (2023.01); G06F 18/214 (2023.01); G06N 3/08 (2013.01); G06V 10/95 (2022.01); G06V 30/274 (2022.01); G06V 30/40 (2022.01); G16H 10/60 (2018.01); G16H 30/20 (2018.01); G06V 30/10 (2022.01)] | 13 Claims |

|
1. A data similarity determination method, comprising:
acquiring first data of a first object, wherein the first data comprises first sub-data of a first modality and third sub-data of a second modality;
mapping the first sub-data as a first semantic representation in a semantic comparison space, wherein the semantic comparison space enables a similarity between a semantic representation obtained by mapping data of the first modality to the semantic comparison space and a semantic representation obtained by mapping data of the second modality to the semantic comparison space to be computed;
acquiring second data of a second object, wherein the second data comprises second sub-data of the first modality and fourth sub-data of the second modality;
mapping the second sub-data as a second semantic representation in the semantic comparison space; and
calculating a similarity between the first data and the second data based on at least the first semantic representation and the second semantic representation;
wherein the first object comprises a first characteristic, the first sub-data comprises a first sub-semantic describing the first characteristic, and the third sub-data comprises a third sub-semantic describing the first characteristic;
the second object comprises a second characteristic, the second sub-data comprises a second sub-semantic describing the second characteristic, and the fourth sub-data comprises a fourth sub-semantic describing the second characteristic;
the method further comprises:
mapping the third sub-data as a third semantic representation in the semantic comparison space and mapping the fourth sub-data as a fourth semantic representation in the semantic comparison space; and
calculating the similarity between the first data and the second data based on at least the first semantic representation and the second semantic representation comprises:
acquiring the similarity between the first data and the second data based on the first semantic representation, the second semantic representation, the third semantic representation, and the fourth semantic representation;
wherein the similarity between the first data and the second data is equal to a weighted sum of a similarity between the first semantic representation and the fourth semantic representation, a similarity between the second semantic representation and the third semantic representation, a similarity between the first semantic representation and the second semantic representation, and a similarity between the third semantic representation and the fourth semantic representation.
|
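The weighted sum recited at the end of claim 1 can be illustrated with a minimal sketch. The claim does not specify the pairwise similarity measure, the weight values, or how the mapping into the semantic comparison space is performed, so the following are assumptions made purely for illustration: cosine similarity as the pairwise measure, equal weights of 0.25, and semantic representations supplied as plain lists of floats that are already in the shared space. The names `cosine_similarity` and `data_similarity` are hypothetical.

```python
import math
from typing import Sequence, Tuple


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors (an assumed pairwise measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)


def data_similarity(
    rep1: Sequence[float],  # first semantic representation (first object, first modality)
    rep2: Sequence[float],  # second semantic representation (second object, first modality)
    rep3: Sequence[float],  # third semantic representation (first object, second modality)
    rep4: Sequence[float],  # fourth semantic representation (second object, second modality)
    weights: Tuple[float, float, float, float] = (0.25, 0.25, 0.25, 0.25),  # assumed values
) -> float:
    """Weighted sum of the four pairwise similarities listed in the claim."""
    w14, w23, w12, w34 = weights
    return (
        w14 * cosine_similarity(rep1, rep4)    # cross-modal: first vs. fourth
        + w23 * cosine_similarity(rep2, rep3)  # cross-modal: second vs. third
        + w12 * cosine_similarity(rep1, rep2)  # same modality: first vs. second
        + w34 * cosine_similarity(rep3, rep4)  # same modality: third vs. fourth
    )


# Toy usage: the two objects agree within each modality, but the two
# modalities map to orthogonal directions, so only the same-modality
# terms contribute.
example = data_similarity([1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0])
```

With the toy vectors above, the two cross-modal terms are zero and the two same-modality terms are one, so the equal-weight sum is 0.5. In practice the weights and the pairwise measure would be chosen, or learned, for the application at hand.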