| CPC G06F 40/186 (2020.01) [G06F 40/106 (2020.01); G06V 30/1448 (2022.01); G06V 30/148 (2022.01); G06V 30/191 (2022.01); G06V 30/414 (2022.01); G06V 30/416 (2022.01)] | 20 Claims |

|
1. A method comprising:
receiving, by a processing device, a document for display in a user device, the document including an infographic image;
identifying, using a component extraction module, visual components of the infographic image, wherein the component extraction module includes an object detection model that generates bounding box data for candidate elements in the infographic image and an image segmentation algorithm that analyzes pixels of the infographic image to identify candidate regions of the infographic image, and
wherein for each candidate region of the candidate regions:
determining a candidate region maximally overlapping with one or more of the candidate elements of the infographic image, and
identifying the candidate region as a visual component of the infographic image;
determining, using an encoder-decoder network, an ordered sequence of the identified visual components;
rendering a modified visual representation of the infographic image based on the identified visual components and the determined ordered sequence of the identified visual components; and
presenting the document, including the modified visual representation of the infographic image in place of the infographic image, for display in a viewing pane of a user device, wherein the modified visual representation of the infographic image is resized to fit a width of the viewing pane of the user device.
|