US 12,445,799 B2
Surround sound to immersive audio upmixing based on video scene analysis
Allan Devantier, Newhall, CA (US); Sunil Bharitkar, Stevenson Ranch, CA (US); Seongnam Oh, Irvine, CA (US); and Carlos Tejeda Ocampo, Tuxtla Gutiérrez (MX)
Assigned to Samsung Electronics Co., Ltd., Suwon-si (KR)
Filed by SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Filed on Sep. 27, 2023, as Appl. No. 18/476,172.
Claims priority of provisional application 63/431,263, filed on Dec. 8, 2022.
Prior Publication US 2024/0196158 A1, Jun. 13, 2024
Int. Cl. H04S 7/00 (2006.01); G06V 20/40 (2022.01)
CPC H04S 7/305 (2013.01) [G06V 20/49 (2022.01)]
20 Claims
 
1. A method of audio upmixing, comprising:
performing video scene analysis by segmenting one or more visual objects from one or more video frames of a video;
performing audio analysis by extracting one or more audio signals from an audio corresponding to the video;
determining whether any of the audio signals correspond to any of the visual objects;
estimating a video-based trajectory of a visual object of the visual objects if the visual object is in motion and transitions from on-screen to off-screen, or vice versa, during the video; and
positioning an audio trajectory of an audio signal of the audio signals from at least one speaker associated with the display to at least one other speaker associated with providing surround sound, wherein the audio trajectory is automatically matched with the video, and the audio signal is delivered to the at least one speaker and the at least one other speaker for audio reproduction during presentation of the video.
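
The last two steps of claim 1 (trajectory estimation and audio positioning) can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not specify a tracking model or a panning law, so the linear extrapolation and the constant-power pan used here are stand-in techniques chosen for illustration. The names `Track`, `crosses_screen_edge`, `extrapolate`, and `pan_gains`, the normalized horizontal coordinate (0.0 at the left screen edge, 1.0 at the right), and the `reach` parameter are all hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Track:
    """Hypothetical per-object track: horizontal centre per frame.

    Positions are in screen widths: 0.0 is the left edge, 1.0 the right
    edge, and anything outside [0, 1] is treated as off-screen.
    """
    xs: list


def is_on_screen(x: float) -> bool:
    return 0.0 <= x <= 1.0


def crosses_screen_edge(track: Track) -> bool:
    """True if the object moves and is on-screen in some frames, off-screen in others."""
    moving = max(track.xs) != min(track.xs)
    states = {is_on_screen(x) for x in track.xs}
    return moving and len(states) == 2


def extrapolate(track: Track, n_extra: int) -> list:
    """Assumed model: continue at the velocity of the last two observed frames."""
    velocity = track.xs[-1] - track.xs[-2]
    last = track.xs[-1]
    return track.xs + [last + velocity * (k + 1) for k in range(n_extra)]


def pan_gains(x: float, reach: float = 1.0) -> tuple:
    """Constant-power (front, surround) gains for horizontal position x.

    On-screen positions go entirely to the front (display) speaker. Past
    either edge, the signal fades to the surround speaker over `reach`
    screen widths. `reach` is an illustrative assumption.
    """
    overshoot = max(0.0, -x, x - 1.0)      # distance beyond the nearest edge
    t = min(overshoot / reach, 1.0)        # 0 = at the edge, 1 = fully surround
    theta = t * math.pi / 2
    return math.cos(theta), math.sin(theta)


if __name__ == "__main__":
    # Object observed on-screen, then extrapolated past the right edge.
    observed = Track(xs=[0.6, 0.8, 1.0, 1.2])
    if crosses_screen_edge(observed):
        for x in extrapolate(observed, n_extra=4):
            front, surround = pan_gains(x)
            print(f"x={x:4.1f}  front={front:.3f}  surround={surround:.3f}")
```

A constant-power pan keeps the summed power of the two gains at 1, so perceived loudness stays roughly steady while the sound moves from the display speaker to the surround speaker. A real system would map positions to a specific speaker layout and would also need the audio-to-object correspondence step of the claim, which this sketch omits.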