US 12,223,766 B2
Proximity framing in a video system
Nhu Quynh Pham Nguyen, Austin, TX (US); Kishore Venkat Rao Goka, Leander, TX (US); Joshua Raglin, Austin, TX (US); Dhaval Patel, Austin, TX (US); Sakshi Gupta, Round Rock, TX (US); Venkateswarlu Lakkamraju, Tuam (IE); and Krishna Balusu, Round Rock, TX (US)
Assigned to Hewlett-Packard Development Company, L.P., Spring, TX (US)
Filed by Hewlett-Packard Development Company, L.P., Spring, TX (US)
Filed on Oct. 21, 2022, as Appl. No. 17/971,527.
Claims priority of provisional application 63/351,318, filed on Jun. 10, 2022.
Prior Publication US 2023/0401890 A1, Dec. 14, 2023
Int. Cl. G06V 40/16 (2022.01); G06V 10/22 (2022.01); G06V 20/40 (2022.01)
CPC G06V 40/161 (2022.01) [G06V 10/22 (2022.01); G06V 20/40 (2022.01)] 20 Claims
OG exemplary drawing
 
16. A system comprising:
a video processor (106) comprising:
a head detection model (112) configured to generate head detection information for an image of a video stream, wherein the head detection information identifies a plurality of heads detected in the image,
a frame generator (116) configured to generate a plurality of head frame definitions for an image of an input video stream, wherein generating the plurality of head frame definitions comprises:
obtaining, using a head detection model and for an image of a video stream, head detection information, wherein the head detection information identifies a plurality of heads detected in the image;
obtaining a plurality of buffer bounding boxes, wherein obtaining the plurality of buffer bounding boxes comprises:
obtaining a plurality of head buffer bounding boxes for the plurality of heads detected in the image, and
combining at least two of the plurality of head buffer bounding boxes into a proximity buffer bounding box;
identifying a set of templates based on the plurality of buffer bounding boxes;
creating, individually, the plurality of head frame definitions for the plurality of buffer bounding boxes using the set of templates; and
an image framing processor (114) configured to process the video stream using the image frame definition.