CPC G06F 18/256 (2023.01) [G06F 18/253 (2023.01); G06N 3/08 (2013.01); G06T 7/246 (2017.01); G06T 2207/20084 (2013.01); G06T 2207/30248 (2013.01); G06V 20/56 (2022.01)] | 20 Claims |
1. A multimodal perception system comprising:
a controller configured to,
receive a first signal from a first sensor, a second signal from a second sensor, and a third signal from a third sensor,
extract a first feature vector from the first signal,
extract a second feature vector from the second signal,
extract a third feature vector from the third signal,
determine an odd-one-out vector from the first, second, and third feature vectors via an odd-one-out network of a machine learning network, based on inconsistent modality prediction,
fuse, via a fusion network, the first, second, and third feature vectors and odd-one-out vector into a fused feature vector, and
output the fused feature vector.
|