| CPC G06T 7/181 (2017.01) [G06T 7/13 (2017.01); G06T 7/70 (2017.01); G06V 10/25 (2022.01); G06V 10/26 (2022.01); G06V 10/457 (2022.01); G06V 20/56 (2022.01); G06T 2207/30252 (2013.01); G06T 2210/12 (2013.01); G06V 2201/07 (2022.01)] | 20 Claims |

|
1. A system comprising:
one or more processors; and
one or more computer-readable media storing instructions executable by the one or more processors, wherein the instructions, when executed, cause the system to perform operations comprising:
receiving pixel data for a plurality of pixels corresponding to an object in an environment, wherein the pixel data comprises detection box parameters associated with individual pixels of the plurality of pixels;
generating, based at least in part on a detection box parameter, a cluster of pixels comprising a first pixel of the plurality of pixels and a second pixel of the plurality of pixels;
generating a detection box based at least in part on a first detection box parameter associated with the first pixel and a second detection box parameter associated with the second pixel; and
controlling a vehicle based at least in part on the detection box.
|
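The operations recited in claim 1 can be illustrated with a minimal sketch. The claim does not specify which detection box parameters are used, how pixels are clustered, or how the per-pixel parameters are combined, so everything below is an assumption made for illustration: the `PixelBox` and `Box` records, the center-distance threshold `max_center_dist`, the averaging of parameters, and the `plan_action` stand-in for vehicle control are all hypothetical names and choices, not the patented method.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class PixelBox:
    """Hypothetical per-pixel record: pixel location plus the box that pixel predicts."""
    px: int
    py: int
    cx: float  # predicted box center, x
    cy: float  # predicted box center, y
    w: float   # predicted box width
    h: float   # predicted box height


@dataclass(frozen=True)
class Box:
    cx: float
    cy: float
    w: float
    h: float


def cluster_pixels(pixels: List[PixelBox], max_center_dist: float) -> List[List[PixelBox]]:
    """Greedy clustering (an assumed strategy): a pixel joins the first cluster
    whose seed pixel predicts a box center within max_center_dist of its own."""
    clusters: List[List[PixelBox]] = []
    for p in pixels:
        for cluster in clusters:
            seed = cluster[0]
            dist = ((p.cx - seed.cx) ** 2 + (p.cy - seed.cy) ** 2) ** 0.5
            if dist <= max_center_dist:
                cluster.append(p)
                break
        else:
            # No existing cluster was close enough, so start a new one.
            clusters.append([p])
    return clusters


def box_from_cluster(cluster: List[PixelBox]) -> Box:
    """Combine the per-pixel box parameters of one cluster by simple averaging
    (an assumption; the claim only says the box is based on both parameters)."""
    n = len(cluster)
    return Box(
        cx=sum(p.cx for p in cluster) / n,
        cy=sum(p.cy for p in cluster) / n,
        w=sum(p.w for p in cluster) / n,
        h=sum(p.h for p in cluster) / n,
    )


def plan_action(box: Box, lane_min_x: float, lane_max_x: float) -> str:
    """Hypothetical stand-in for vehicle control: yield if the box overlaps
    the lane interval [lane_min_x, lane_max_x] along x, otherwise proceed."""
    left = box.cx - box.w / 2
    right = box.cx + box.w / 2
    return "yield" if right >= lane_min_x and left <= lane_max_x else "proceed"
```

Under these assumptions, two pixels whose predicted centers lie close together fall into one cluster, `box_from_cluster` merges their parameters into a single detection box, and `plan_action` turns that box into a coarse driving decision. A production system would likely weight pixels by confidence and use a full planner instead of a one-dimensional overlap check.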