US 11,776,215 B1
Pre-labeling data with cuboid annotations
Chiao-Lun Cheng, San Francisco, CA (US); Elliot Branson, San Francisco, CA (US); Leigh Marie Braswell, San Francisco, CA (US); Daniel Havíř, San Francisco, CA (US); and Jeffrey Alan Anders, Denver, CO (US)
Assigned to SCALE AI, INC., San Francisco, CA (US)
Filed by Scale AI, Inc., San Francisco, CA (US)
Filed on Dec. 16, 2019, as Appl. No. 16/716,396.
Int. Cl. G06T 19/00 (2011.01); G06F 3/0482 (2013.01); G06F 3/04847 (2022.01); G06T 7/60 (2017.01); G06N 5/04 (2023.01); G06N 20/00 (2019.01)
CPC G06T 19/00 (2013.01) [G06F 3/0482 (2013.01); G06F 3/04847 (2013.01); G06N 5/04 (2013.01); G06N 20/00 (2019.01); G06T 7/60 (2013.01); G06T 2200/24 (2013.01); G06T 2207/10028 (2013.01); G06T 2207/20081 (2013.01); G06T 2219/004 (2013.01)] 17 Claims
OG exemplary drawing
 
1. A computer-implemented method for annotating point cloud data, the method comprising:
training a first machine learning (ML) model using a loss function evaluated by performing steps including:
predicting, using the first ML model, a first cuboid annotation;
determining a first distance between corners of a cuboid annotation included in training data and a second cuboid annotation that includes a size of the first cuboid annotation, a position of the cuboid annotation included in the training data, and a yaw of the cuboid annotation included in the training data;
determining a second distance between corners of the cuboid annotation included in the training data and a third cuboid annotation that includes a position of the first cuboid annotation, a size of the cuboid annotation included in the training data, and a yaw of the cuboid annotation included in the training data;
determining a third distance between corners of the cuboid annotation included in the training data and a fourth cuboid annotation that includes a yaw of the first cuboid annotation, a size of the cuboid annotation included in the training data, and a position of the cuboid annotation included in the training data; and
determining an average of the first distance, the second distance, and the third distance;
processing the point cloud data using the first ML model and a second ML model to generate cuboid annotations of objects in the point cloud data by performing steps including:
processing the point cloud data using the first ML model to determine cuboid tracks associated with the objects;
projecting regions of the point cloud data including the objects to a plurality of two-dimensional (2D) views; and
processing the plurality of 2D views and the cuboid tracks using the second ML model to determine updated cuboid tracks associated with the objects;
determining a confidence threshold based on a count of the cuboid annotations; and
causing to be displayed at least one of the cuboid annotations based on the confidence threshold, and at least one user interface element that permits a user to select, confirm, or modify the at least one of the cuboid annotations that is displayed.