US 11,055,538 B2
Object re-identification with temporal context
Michal Koperski, Juan-les-Pins (FR); Slawomir W. Bak, Pittsburgh, PA (US); and G. Peter K. Carr, Allison Park, PA (US)
Assigned to Disney Enterprises, Inc., Burbank, CA (US)
Filed by Disney Enterprises, Inc., Burbank, CA (US)
Filed on Mar. 31, 2017, as Appl. No. 15/476,230.
Prior Publication US 2018/0286081 A1, Oct. 4, 2018
Int. Cl. G06K 9/62 (2006.01); G06K 9/00 (2006.01); G06T 7/292 (2017.01)
CPC G06K 9/00771 (2013.01) [G06T 7/292 (2017.01); G06K 9/6215 (2013.01); G06K 9/6218 (2013.01)]
20 Claims
 
1. A computer-implemented method of object identification, the computer-implemented method comprising:
capturing, with a first camera at a first point in time, a first image depicting a first scene;
capturing, with a second camera at a second point in time different from the first point in time, a second image depicting a second scene that does not overlap with the first scene;
extracting a first patch from the first image;
extracting a second patch from the second image;
extracting a first patch descriptor from the first patch;
extracting a second patch descriptor from the second patch;
mapping the first and second patch descriptors to a concatenated codeword in a clustered codebook learned via coupled clustering of a set of concatenated features, wherein at least one concatenated feature of the set of concatenated features is generated by concatenating corresponding patch descriptors from the first and second cameras, the concatenated codeword comprising dimensions of cluster centers of coupled clusters to which the first and second patch descriptors map;
computing an appearance cost comprising a measure of visual dissimilarity between the first and second images, based on the concatenated codeword and by operation of one or more computer processors;
computing a temporal cost between the first and second images, using a temporal context model and based on the first and second points in time;
combining the appearance cost and the temporal cost into a single cost function; and
determining whether the first and second images depict a common object, using the single cost function, after which an indication of whether the first and second images depict a common object is output.
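
The cost-combination steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not specify the form of the appearance cost, the temporal context model, or the combining function, so every concrete choice below is an assumption made for illustration. Specifically, the sketch assumes (a) the codebook is a plain list of concatenated cluster centers already learned elsewhere, (b) the appearance cost is the squared distance from the concatenated descriptor pair to its nearest coupled center, (c) the temporal context model is a Gaussian over camera-to-camera transit time with hypothetical parameters `mean_transit` and `std_transit`, and (d) the single cost function is a weighted sum compared against a hypothetical `threshold`. All function and parameter names are invented here.

```python
def map_to_codeword(desc_a, desc_b, codebook):
    """Return the coupled cluster center nearest to the concatenated pair.

    codebook: list of centers, each of length len(desc_a) + len(desc_b),
    assumed to have been learned by clustering concatenated descriptors
    from the two cameras.
    """
    joint = list(desc_a) + list(desc_b)

    def sq_dist(center):
        return sum((x - c) ** 2 for x, c in zip(joint, center))

    return min(codebook, key=sq_dist)


def appearance_cost(desc_a, desc_b, codebook):
    """Assumed dissimilarity: squared distance of the pair to its codeword."""
    joint = list(desc_a) + list(desc_b)
    codeword = map_to_codeword(desc_a, desc_b, codebook)
    return sum((x - c) ** 2 for x, c in zip(joint, codeword))


def temporal_cost(t_first, t_second, mean_transit, std_transit):
    """Assumed temporal model: Gaussian negative log-likelihood (no constant)
    of the elapsed time between the two captures."""
    elapsed = t_second - t_first
    return (elapsed - mean_transit) ** 2 / (2.0 * std_transit ** 2)


def combined_cost(app_cost, temp_cost, weight=0.5):
    """Assumed combination: appearance cost plus weighted temporal cost."""
    return app_cost + weight * temp_cost


def depicts_common_object(cost, threshold=1.0):
    """Decide a match when the single combined cost is below the threshold."""
    return cost < threshold
```

As a usage example under these assumptions, take a toy codebook `[[0, 0, 0, 0], [1, 1, 1, 1]]` and descriptors `[1, 1]` and `[1, 0.5]`. The appearance cost is 0.25. With captures 10 seconds apart, `mean_transit=10` and `std_transit=2`, the temporal cost is 0, the combined cost is 0.25, and the pair is accepted. With captures 20 seconds apart the temporal cost rises to 12.5, the combined cost to 6.5, and the same pair is rejected, which shows how the temporal term can veto a visually plausible match.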