US 12,456,229 B2
Systems and methods for compression of three-dimensional volumetric representations
Danhang Tang, San Francisco, CA (US); Saurabh Singh, Mountain View, CA (US); Cem Keskin, San Francisco, CA (US); Phillip Andrew Chou, Bellevue, WA (US); Christian Haene, Berkeley, CA (US); Mingsong Dou, Cupertino, CA (US); Sean Ryan Francesco Fanello, San Francisco, CA (US); Jonathan Taylor, San Francisco, CA (US); Andrea Tagliasacchi, Toronto (CA); Philip Lindsley Davidson, Arlington, MA (US); Yinda Zhang, Dale City, CA (US); Onur Gonen Guleryuz, San Francisco, CA (US); Shahram Izadi, Tiburon, CA (US); and Sofien Bouaziz, Los Gatos, CA (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Appl. No. 17/919,460
Filed by Google LLC, Mountain View, CA (US)
PCT Filed Apr. 17, 2020, PCT No. PCT/US2020/028689
§ 371(c)(1), (2) Date Oct. 17, 2022,
PCT Pub. No. WO2021/211133, PCT Pub. Date Oct. 21, 2021.
Prior Publication US 2023/0154051 A1, May 18, 2023
Int. Cl. G06T 9/00 (2006.01); G06T 17/00 (2006.01)
CPC G06T 9/001 (2013.01) [G06T 9/002 (2013.01)] 21 Claims
OG exemplary drawing
 
1. A computer-implemented method to perform compression of three-dimensional volumetric representations, the method comprising:
obtaining, by an encoding computing system comprising one or more computing devices, one or more voxel blocks from a three-dimensional volumetric representation of an object, wherein the three-dimensional volumetric representation comprises a plurality of voxels and a respectively associated plurality of textures, wherein each of the plurality of voxels comprises a magnitude value and a sign value, and wherein each of the one or more voxel blocks comprising a subset of the plurality of voxels;
encoding, by the encoding computing system, the one or more voxel blocks with a machine-learned voxel encoding model to obtain one or more encoded voxel blocks;
decoding, by the encoding computing system, the one or more encoded voxel blocks with a first instance of a machine-learned voxel decoding model to obtain a first instance of one or more reconstructed voxel blocks;
generating, by the encoding computing system, a first instance of a reconstructed mesh representation of the object based at least in part on the first instance of the one or more reconstructed voxel blocks;
encoding, by the encoding computing system, at least a portion of the plurality of textures according to an encoding scheme, wherein each texture of the at least the portion of the plurality of textures is encoded based at least in part on a spatial position of each of the one or more reconstructed voxel blocks in three-dimensional space to obtain a plurality of encoded textures;
encoding, by the encoding computing system and using an entropy encoder, the one or more encoded voxel blocks to obtain one or more entropy encoded voxel blocks; and
transmitting, by the encoding computing system, the one or more entropy encoded voxel blocks, a voxel block index, and the plurality of encoded textures to a decoding computing system that is remotely located from the encoding computing system, wherein the voxel block index describes the spatial position of each of the one or more reconstructed voxel blocks in three-dimensional space.