| CPC G06V 40/161 (2022.01) [G06V 10/22 (2022.01); G06V 20/40 (2022.01)] | 20 Claims |

|
16. A system comprising:
a video processor (106) comprising:
a head detection model (112) configured to generate head detection information for an image of a video stream, wherein the head detection information identifies a plurality of heads detected in the image,
a frame generator (116) configured to generate a plurality of head frame definitions for an image of an input video stream, wherein generating the plurality of head frame definitions comprises:
obtaining, using a head detection model and for an image of a video stream, head detection information, wherein the head detection information identifies a plurality of heads detected in the image;
obtaining a plurality of buffer bounding boxes, wherein obtaining the plurality of buffer bounding boxes comprises:
obtaining a plurality of head buffer bounding boxes for the plurality of heads detected in the image, and
combining at least two of the plurality of head buffer bounding boxes into a proximity buffer bounding box;
identifying a set of templates based on the plurality of buffer bounding boxes;
creating, individually, the plurality of head frame definitions for the plurality of buffer bounding boxes using the set of templates; and
an image framing processor (114) configured to process the video stream using the image frame definition.
|