| CPC G06T 7/75 (2017.01) [G06T 2207/20084 (2013.01); G06T 2207/30196 (2013.01)] | 4 Claims |

|
1. A system for estimating a three dimensional pose of one or more persons in a scene, the system comprising:
one or more cameras, the one or more cameras configured to capture one or more images of the scene; and
a data processor including at least one hardware component, the data processor configured to execute computer executable instructions, the computer executable instructions comprising instructions for:
receiving the one or more images of the scene from the one or more cameras;
extracting features from the one or more images of the scene for providing inputs to a three dimensional pose estimation neural network;
generating, by using the three dimensional pose estimation neural network, vertices of a canonical human mesh model for the one or more images of the scene;
retrieving a particular weight matrix of a plurality of weight matrices from an annotation server that corresponds to a desired three dimensional keypoint set, the plurality of weight matrices corresponding to different applications of the system, the desired three dimensional keypoint set being a three dimensional keypoint set for a particular user-desired one of the different applications of the system;
generating the desired three dimensional keypoint set by multiplying the retrieved particular weight matrix with the vertices of the canonical human mesh model; and
wherein, during training of the system, the data processor is further configured to execute computer executable instructions for:
retrieving one or more canonical human mesh samples corresponding to the particular user-desired one of the different applications of the system from the annotation server, and displaying the one or more canonical human mesh samples to one or more human annotators using an annotation interface so that the one or more human annotators are able to annotate three dimensional keypoints that are located inside of the human mesh or on the human mesh to produce user-defined annotated three dimensional keypoint locations;
determining the particular weight matrix, which is used to generate the desired three dimensional keypoint set, from the user-defined annotated three dimensional keypoint locations and the vertices of the canonical human mesh model; and
storing the determined particular weight matrix on the annotation server; and
wherein the vertices of the canonical human mesh model are generated independently from the user-defined annotated three dimensional keypoint locations.
|
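The two weight-matrix steps recited in claim 1 can be illustrated with a short NumPy sketch. The claim states only that the keypoint set is produced by multiplying the retrieved weight matrix with the mesh vertices, and that the matrix is determined from the annotated keypoint locations and the vertices; it does not say how that determination is done. The least-squares fit below is therefore an assumption chosen for illustration, not the patented method. The function names, the array shapes, and the in-memory dictionary standing in for the annotation server are likewise hypothetical.

```python
import numpy as np


def fit_weight_matrix(mesh_samples, annotated_keypoints):
    """Fit W so that W @ V_s approximates K_s for every mesh sample s.

    ASSUMPTION: ordinary least squares; the claim does not name a fitting method.

    mesh_samples:        array (S, N, 3), vertices of S canonical mesh samples
    annotated_keypoints: array (S, K, 3), annotated keypoints for those samples
    returns:             array (K, N), one row of vertex weights per keypoint
    """
    V = np.asarray(mesh_samples, dtype=float)
    Y = np.asarray(annotated_keypoints, dtype=float)
    S, N, _ = V.shape
    K = Y.shape[1]
    # Each (sample, coordinate) pair contributes one linear equation per keypoint.
    A = V.transpose(0, 2, 1).reshape(3 * S, N)   # (3S, N)
    B = Y.transpose(0, 2, 1).reshape(3 * S, K)   # (3S, K)
    # Solve A @ W.T ~= B; lstsq returns the minimum-norm solution if underdetermined.
    W_T, *_ = np.linalg.lstsq(A, B, rcond=None)
    return W_T.T


def generate_keypoints(weight_matrix, vertices):
    """Multiply a (K, N) weight matrix with (N, 3) mesh vertices to get (K, 3) keypoints."""
    return np.asarray(weight_matrix, dtype=float) @ np.asarray(vertices, dtype=float)


# Hypothetical stand-in for the annotation server: one weight matrix per application.
weight_matrices = {}

# Synthetic data only; a real system would use meshes from the pose network.
rng = np.random.default_rng(0)
meshes = rng.normal(size=(10, 20, 3))       # 10 samples, 20 vertices each
annotations = meshes[:, :3, :] * 0.5        # 3 made-up keypoints per sample

# "Training": determine and store the matrix for one application.
weight_matrices["example_application"] = fit_weight_matrix(meshes, annotations)

# "Inference": retrieve the matrix and apply it to newly generated vertices.
new_vertices = rng.normal(size=(20, 3))
keypoints = generate_keypoints(weight_matrices["example_application"], new_vertices)
print(keypoints.shape)
```

Because the vertices are produced independently of the annotations, switching to a different keypoint set in this sketch means looking up a different matrix under another key, with no change to the mesh-generation step. With a realistic mesh of several thousand vertices and only a few annotated samples the system is underdetermined, so a practical fit would likely need regularization or sparsity constraints, which are not shown here.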