US 12,244,891 B2
	Systems and methods for assessing viewer engagement
Inderbir Sidhu, Lexington, MA (US); Yanfeng Liu, Long Island City, NY (US); and Yun Fu, Brookline, MA (US)
Assigned to TVISION INSIGHTS, INC., Cambridge, MA (US)
Filed by TVision Insights, Inc., Cambridge, MA (US)
Filed on Oct. 18, 2022, as Appl. No. 18/047,513.
Application 17/650,193 is a division of application No. 15/702,229, filed on Sep. 12, 2017, abandoned.
Application 18/047,513 is a continuation of application No. 17/650,193, filed on Feb. 7, 2022, granted, now 11,509,956.
Application 15/702,229 is a continuation of application No. PCT/US2017/012531, filed on Jan. 6, 2017.
Claims priority of provisional application 62/275,699, filed on Jan. 6, 2016.
Prior Publication US 2023/0106115 A1, Apr. 6, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. H04N 21/442 (2011.01); H04H 60/33 (2008.01); H04H 60/45 (2008.01); H04N 21/258 (2011.01); H04N 21/422 (2011.01); H04N 21/4223 (2011.01); H04N 21/437 (2011.01); H04N 21/439 (2011.01); H04N 21/44 (2011.01); H04N 21/658 (2011.01); G06V 10/143 (2022.01); G06V 40/10 (2022.01); G06V 40/16 (2022.01); G06V 40/18 (2022.01); H04H 60/58 (2008.01); H04N 5/33 (2023.01); H04N 23/11 (2023.01); H04N 23/90 (2023.01)

CPC H04N 21/44218 (2013.01) [H04H 60/33 (2013.01); H04H 60/45 (2013.01); H04N 21/25883 (2013.01); H04N 21/42203 (2013.01); H04N 21/4223 (2013.01); H04N 21/437 (2013.01); H04N 21/4394 (2013.01); H04N 21/44008 (2013.01); H04N 21/44226 (2020.08); H04N 21/6582 (2013.01); G06V 10/143 (2022.01); G06V 40/103 (2022.01); G06V 40/161 (2022.01); G06V 40/171 (2022.01); G06V 40/172 (2022.01); G06V 40/174 (2022.01); G06V 40/178 (2022.01); G06V 40/18 (2022.01); H04H 60/58 (2013.01); H04N 5/33 (2013.01); H04N 23/11 (2023.01); H04N 23/90 (2023.01)]

12 Claims

6. A system for quantifying viewer engagement with content playing on a display, the system comprising:

at least one camera, disposed to image a viewing area in front of the display, to acquire image data of the viewing area, wherein the viewing area is a viewer's residence wherein the viewer's residence comprises one household of a plurality of households;

a microphone, disposed in proximity to the display, to acquire audio data representing a soundtrack of the video emitted by a speaker coupled to the display;

a memory, operably coupled to the at least one camera and the microphone, to store processor-executable instructions, the memory comprising a buffer to store the image data and the audio data; and

at least one processor, operably coupled to the at least one camera, the microphone, and the memory, wherein upon execution of the processor-executable instructions, the at least one processor performs a method comprising:

acquiring, with at least one camera, image data of a viewing area in front of the display while the video is being shown on the display;

estimating, with at least one processor and based at least in part on the image data, viewer data comprising a number of people present in the viewing area while the video is being shown on the display and a number of people engaged with the video in the viewing area;

acquiring, with a microphone, audio data representing a soundtrack of the video emitted by a speaker coupled to the display;

quantifying the viewer engagement with the video based at least in part on the viewer data wherein quantifying the viewer engagement of the video based at least in part on a number of people present in the viewing area while the video is being shown on the display and a number of people engaged with the video in the viewing area at each household in a plurality of households, wherein quantifying the viewer engagement comprises estimating an attention rate for the video, the attention rate representing a ratio of the total number of people attentive to a display over a total number of people in the households and determining an attention index based on the attention rate; and

estimating a viewer count and a positive duration ratio based on the image data and on demographic information about each household in the plurality of households, the viewer count representing the number of people engaged with each unique video and the positive duration ratio representing a ratio of total time spent by people in the plurality of households watching the unique video to a duration of the unique video, and wherein no changes are made to content presented to viewers based on viewer engagement quantification, and no inducement to make purchases is presented to viewers.