CPC G06N 3/088 (2013.01) [G06N 3/045 (2023.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01); G06V 20/41 (2022.01); G06V 20/46 (2022.01); G06V 20/48 (2022.01)] | 24 Claims |
1. A method for audiovisual source separation processing, the method comprising:
receiving video data including images of a plurality of sound sources;
receiving an optical flow data of the video data, the optical flow data indicating motions of pixels between frames of the video data; and
encoding, by a generative adversarial network (GAN) system, the received video data into video localization data comprising information associating pixels in the frames of video data with different channels of sound; and
encoding, by the GAN system, the received optical flow data into video separation data.
|