US 12,106,487 B2
Feature prediction for efficient video processing
Mohsen Fayyaz, Bonn (DE); Hamidreza Vaezi Joze, Redmond, WA (US); and Eric Chris Wolfgang Sommerlade, Oxford (GB)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Nov. 24, 2021, as Appl. No. 17/535,551.
Prior Publication US 2023/0162372 A1, May 25, 2023
Int. Cl. G06K 9/00 (2022.01); G06N 3/08 (2023.01); G06T 7/20 (2017.01); G06V 10/40 (2022.01); G06V 10/94 (2022.01)
CPC G06T 7/20 (2013.01) [G06N 3/08 (2013.01); G06V 10/40 (2022.01); G06V 10/95 (2022.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01)]
20 Claims
 
1. A method for processing a stream of video frames, comprising:
obtaining a first frame of video content, the first frame being interpreted as a key frame;
converting the first frame into first feature information using an image analysis system;
caching the first feature information in a data store;
obtaining a second frame of video content, the second frame being interpreted as a predicted frame;
mapping the first frame and the second frame into first frame-change information using a motion-determining model, the first frame-change information expressing a change in video content from the first frame to the second frame;
converting the first frame-change information and the first feature information into second feature information using a prediction model; and
caching the second feature information in the data store,
the method interpreting every nth frame in the stream of video frames as a key frame, and frames between neighboring key frames as predicted frames, n being specified by a configuration parameter, and
the method switching between use of the image analysis system and use of the motion-determining model and the prediction model depending on whether a key frame is encountered or a predicted frame is encountered in the stream of video frames.
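
The control flow recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. The names process_stream, analyze, motion, and predict are hypothetical, and the three toy functions are simple arithmetic stand-ins for the image analysis system, the motion-determining model, and the prediction model. The sketch also assumes that each predicted frame is compared with the immediately preceding frame and its cached feature information, and that the first frame of the stream (index 0) is a key frame. Claim 1 itself only recites the first key frame and the predicted frame that follows it.

```python
from typing import Callable, Dict, List, Sequence

Frame = List[float]      # hypothetical stand-in for a decoded video frame
Features = List[float]   # hypothetical stand-in for feature information


def process_stream(
    frames: Sequence[Frame],
    n: int,
    analyze: Callable[[Frame], Features],
    motion: Callable[[Frame, Frame], List[float]],
    predict: Callable[[List[float], Features], Features],
) -> Dict[int, Features]:
    """Cache feature information for every frame, keyed by frame index."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    cache: Dict[int, Features] = {}   # plays the role of the data store
    prev_frame: Frame = []
    prev_features: Features = []
    for index, frame in enumerate(frames):
        if index % n == 0:
            # Key frame: run the (expensive) image analysis system.
            features = analyze(frame)
        else:
            # Predicted frame: derive frame-change information, then
            # update the cached features with the prediction model.
            change = motion(prev_frame, frame)
            features = predict(change, prev_features)
        cache[index] = features
        prev_frame, prev_features = frame, features
    return cache


# Toy stand-ins (assumptions, chosen so the arithmetic is easy to follow).
analyze_calls: List[int] = []


def toy_analyze(frame: Frame) -> Features:
    analyze_calls.append(1)               # count full analyses
    return [2.0 * x for x in frame]


def toy_motion(prev: Frame, cur: Frame) -> List[float]:
    return [c - p for p, c in zip(prev, cur)]


def toy_predict(change: List[float], feats: Features) -> Features:
    return [f + 2.0 * d for f, d in zip(feats, change)]


frames = [[1.0], [2.0], [4.0], [7.0]]
cache = process_stream(frames, 3, toy_analyze, toy_motion, toy_predict)
```

With the configuration parameter n set to 3, frames 0 and 3 are handled as key frames and frames 1 and 2 as predicted frames, so the stand-in for the image analysis system runs only twice for four frames.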