US 12,177,597 B2
Eye contact optimization
Eric Edmond Thomasian, Los Angeles, CA (US); Shaun Paul Dunning, San Clemente, CA (US); and Amer Aref Hassan, Kirkland, WA (US)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Sep. 26, 2022, as Appl. No. 17/953,066.
Prior Publication US 2024/0106969 A1, Mar. 28, 2024
Int. Cl. H04N 5/262 (2006.01); G06T 7/73 (2017.01); G06V 20/40 (2022.01); G10L 17/00 (2013.01)
CPC H04N 5/2624 (2013.01) [G06T 7/73 (2017.01); G06V 20/41 (2022.01); G10L 17/00 (2013.01); G06T 2207/10016 (2013.01); G06T 2207/30201 (2013.01); G06T 2207/30244 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method for conducting a videoconference with a plurality of participants, each of the participants utilizing a device for transmitting a multimedia feed of a participant associated with the device and receiving multimedia feeds from devices associated with other participants in the videoconference, the method comprising:
receiving the multimedia feeds from the plurality of devices associated with each of the participants in the videoconference, the multimedia feeds including an audio component and a video component;
displaying tiles on a display screen of the device of each of the participants, each of the tiles showing the video component of one of the multimedia feeds, respectively;
analyzing at least one of the audio component and the video component of a plurality of the multimedia feeds to detect characteristics indicative of a first participant and a second participant having a first conversation with each other;
identifying a first camera position for a camera on a device of the first participant and a second camera position for a camera on a device of the second participant; and
responsive to identifying that the first participant and the second participant are having the first conversation with each other:
moving a first tile for the first participant on a display screen of the device of the second participant from a first location to a second location, the second location being closer to the second camera position than the first location; and
moving a second tile for the second participant on a display screen of the device of the first participant from a third location to a fourth location, the fourth location being closer to the first camera position than the third location.