| CPC G01S 17/89 (2013.01) [G01S 17/86 (2020.01); G06N 20/00 (2019.01)] | 31 Claims |

|
1. A computer-implemented method for training machine learning models to generate three-dimensional motions based on LiDAR point clouds, the method comprising:
encoding, by a computing system, a machine learning model representing an object in a scene; and
training, by the computing system, the machine learning model using a dataset comprising synchronous LiDAR point clouds captured by monocular LiDAR sensors and ground-truth three-dimensional motions obtained from Inertial Measurement Units (IMU) devices;
wherein the machine learning model is configured to generate a three-dimensional motion of the object based on an input of a plurality of point cloud frames captured by a monocular LiDAR sensor, and
the machine learning model comprises a feature learning network, a Gated Recurring Unit (GRU), a multiplayer perceptron (MLP) decoder, wherein the feature learning network is configured to extract a global descriptor from each point cloud frame, the GRU is configured to generate a plurality of hidden variables for the global descriptor, and the MLP decoder is configured to predict a plurality of joint locations based on the plurality of hidden variables.
|