US 12,445,584 B2
	Supporting multi-view video operations with disocclusion atlas
Gregory John Ward, Berkeley, CA (US)
Assigned to Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
Appl. No. 18/009,905
Filed by Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
PCT Filed Jun. 16, 2021, PCT No. PCT/US2021/037527 § 371(c)(1), (2) Date Dec. 12, 2022, PCT Pub. No. WO2021/257639, PCT Pub. Date Dec. 23, 2021.
Claims priority of provisional application 63/039,595, filed on Jun. 16, 2020.
Claims priority of application No. 20180179 (EP), filed on Jun. 16, 2020.
Prior Publication US 2023/0224447 A1, Jul. 13, 2023
Int. Cl. H04N 13/161 (2018.01)

CPC H04N 13/161 (2018.05)

18 Claims

1. A method comprising:

sorting, in size, image fragments that are occluded in one or more reference images depicting a visual scene from one or more reference views and that become at least partly disoccluded in non-reference views adjacent to the one or more reference views, the image fragments including a first image fragment that is no less in size than any other image fragment in the image fragments;

generating a layout mask for a disocclusion atlas used to store the image fragments, the layout mask being covered with a quadtree that includes a first best fit node sized for covering the first image fragment, the disocclusion atlas being a combined image of minimal total area that contains multiple non-overlapping image fragments;

storing the sorted image fragments in a descending order into best fit nodes identified in the layout mask, wherein each of the best fit nodes is identified as a quadtree node of a minimum size for completely covering each of the respective image fragments, each image fragment in the sorted image fragments being stored in the respective best fit node, the best fit nodes including at least one best fit node that is obtained by iteratively dividing at least one node in the quadtree that covers the layout mask;

generating a volumetric video signal encoded with the one or more reference images, the volumetric video signal being further encoded with the image fragments in the disocclusion atlas, the one or more reference images for use by a recipient device of the volumetric video signal to synthesize a display image in a non-represented view for rendering on an image display, the image fragments in the disocclusion atlas for use by the recipient device to fill disoccluded image data in disoccluded spatial regions in the display image.