| CPC H04N 5/265 (2013.01) [G06T 7/70 (2017.01); G06V 40/10 (2022.01); H04L 65/60 (2013.01); H04N 5/2622 (2013.01); H04N 5/2628 (2013.01); H04N 9/646 (2013.01); G06T 2207/10016 (2013.01); G06T 2207/30196 (2013.01); G06T 2207/30242 (2013.01)] | 22 Claims |

|
1. A method comprising, by an electronic device:
maintaining a video communication between two or more client devices, wherein each client device is associated with a respective video stream in the video communication, and wherein each video stream is associated with a respective match scale measured based on a height of frames of the video stream and a depth of subjects within the frames;
determining, for each of the video streams, a respective scaling factor and a respective cropping, wherein the respective scaling factor is calculated based on the match scale associated with the corresponding video stream and a target match scale determined from the match scales associated with the video streams associated with the two or more client devices, and wherein the respective cropping is determined based on a uniformity of positions of the subjects within the frames across the video streams associated with the two or more client devices;
generating, based on the respective scaling factor and cropping of each video stream, a merged video stream from each of the video streams for the video communication; and
sending, to one or more of the client devices, instructions for presenting the merged video stream.
|