CPC H04N 21/84 (2013.01) [G06F 18/2411 (2023.01); G06F 18/41 (2023.01); G06V 10/764 (2022.01); G06V 10/7788 (2022.01); G06V 10/82 (2022.01); H04N 21/2187 (2013.01); H04N 21/23109 (2013.01); H04N 21/23418 (2013.01); H04N 21/26603 (2013.01); H04N 21/278 (2013.01); H04N 21/4223 (2013.01); H04N 21/42202 (2013.01); H04N 21/4312 (2013.01); H04N 21/44008 (2013.01); H04N 21/466 (2013.01); H04N 21/4666 (2013.01); H04N 21/482 (2013.01); H04N 21/80 (2013.01); H04N 21/8133 (2013.01)] | 19 Claims |
1. A system that comprises one or more processors programmed with computer program instructions that, when executed, cause the system to:
display, on a user interface, a video stream and a plurality of predicted concepts associated with the video stream, the plurality of predicted concepts generated by a prediction model based on the video stream, and the plurality of predicted concepts include a first set of predicted concepts and a second set of predicted concepts, the first set of predicted concepts being associated with first confidence values equal to or higher than a confidence threshold and the second set of predicted concepts being associated with second confidence values less than the confidence threshold, and the first and second confidence values indicate a confidence level that a particular predicted concept is present in a particular frame of the video stream;
receive, from an input device, a user selection of a concept related to a frame of the video stream, the selected concept being from among the plurality of predicted concepts;
receive, from the input device, a selection of a portion of the frame of the video stream;
determine an association between the selected portion of the frame and the selected concept; and
cause the prediction model to be trained based on the association.
|