US 11,810,350 B2
Processing of surveillance video streams using image classification and object detection
Shaomin Xiong, Fremont, CA (US); Toshiki Hirano, San Jose, CA (US); Damien Kah, San Jose, CA (US); Rajeev Nagabhirava, San Jose, CA (US); and David Berman, San Jose, CA (US)
Assigned to Western Digital Technologies, Inc., San Jose, CA (US)
Filed by Western Digital Technologies, Inc., San Jose, CA (US)
Filed on May 21, 2021, as Appl. No. 17/326,628.
Prior Publication US 2022/0374635 A1, Nov. 24, 2022
Int. Cl. G06T 7/70 (2017.01); G06V 20/40 (2022.01); G08B 13/196 (2006.01); G06V 20/52 (2022.01)
CPC G06V 20/41 (2022.01) [G06T 7/70 (2017.01); G06V 20/52 (2022.01); G08B 13/19608 (2013.01); G08B 13/19619 (2013.01); G08B 13/19663 (2013.01)] 20 Claims
 
1. A system, comprising:
a video image sensor; and
a controller comprising:
a plurality of image classifiers, wherein each image classifier of the plurality of image classifiers is configured for a different object type; and
a plurality of object detectors, wherein each object detector of the plurality of object detectors is configured for a different object type;
wherein the controller is configured to:
receive video data from the video image sensor, wherein the video data includes a time-dependent video stream of video frames captured by the video image sensor;
process the video data through the plurality of image classifiers to determine at least one object type for a first frame of the video data;
determine a corresponding object detector from the plurality of object detectors, the corresponding object detector configured to detect a first object type corresponding to the at least one object type determined by the plurality of image classifiers;
determine, using the corresponding object detector, position information for a detected object in the video data having the first object type; and
send, over a network, an event notification to a video surveillance application.
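
The controller steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: every name in it (`process_stream`, `make_classifier`, `make_detector`, `EventNotification`) is hypothetical, frames are stood in for by small integer grids, the classifiers and detectors are toy functions rather than trained models, and the network transmission is replaced by a caller-supplied `send` callback. The sketch also assumes a notification is sent once per detected object, which the claim does not specify.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

Frame = List[List[int]]          # hypothetical stand-in for a decoded video frame
Box = Tuple[int, int, int, int]  # position information as (x, y, width, height)


@dataclass
class EventNotification:
    object_type: str
    frame_index: int
    box: Box


def make_classifier(code: int) -> Callable[[Frame], bool]:
    """Toy per-type image classifier: true if any pixel carries this type's code."""
    def classify(frame: Frame) -> bool:
        return any(code in row for row in frame)
    return classify


def make_detector(code: int) -> Callable[[Frame], Optional[Box]]:
    """Toy per-type object detector: bounding box of pixels carrying this type's code."""
    def detect(frame: Frame) -> Optional[Box]:
        points = [(x, y) for y, row in enumerate(frame)
                  for x, value in enumerate(row) if value == code]
        if not points:
            return None
        xs = [p[0] for p in points]
        ys = [p[1] for p in points]
        return (min(xs), min(ys), max(xs) - min(xs) + 1, max(ys) - min(ys) + 1)
    return detect


def process_stream(
    frames: List[Frame],
    classifiers: Dict[str, Callable[[Frame], bool]],
    detectors: Dict[str, Callable[[Frame], Optional[Box]]],
    send: Callable[[EventNotification], None],
) -> List[EventNotification]:
    """Classify each frame, run only the matching detectors, and emit events."""
    sent: List[EventNotification] = []
    for index, frame in enumerate(frames):
        # Run every per-type classifier to find which object types are present.
        present = [name for name, classify in classifiers.items() if classify(frame)]
        for object_type in present:
            # Select the detector configured for the same object type.
            detect = detectors.get(object_type)
            if detect is None:
                continue
            # Use that detector to obtain position information.
            box = detect(frame)
            if box is None:
                continue
            event = EventNotification(object_type, index, box)
            send(event)  # assumption: stands in for the network send
            sent.append(event)
    return sent
```

In this arrangement the classifiers act as a gate: a detector runs only on frames where the classifier for its object type has already reported that type, so frames with no objects of interest never reach a detector.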