CPC G06V 10/776 (2022.01) [G06V 10/26 (2022.01); G06V 10/764 (2022.01); G06V 10/774 (2022.01); G06V 10/82 (2022.01); G06V 20/17 (2022.01); G06V 20/70 (2022.01)] | 15 Claims |
1. A computer-implemented method comprising:
receiving an image, a ground truth annotation mask and a segment ID mask, wherein the image comprises a plurality of pixels, the ground truth annotation mask comprises for each pixel a label of a correct class, and the segment ID mask comprises for each pixel a segment identifier identifying a segment to which the pixel belongs;
feeding the image to a machine learning model for semantic segmentation and computing an outcome;
computing a per-segment value for each one of the plurality of segments from the information in the outcome and in the ground truth annotation mask related to pixels associated with the one of the plurality of segments in the segment ID mask;
calculating a segment-aware value from a plurality of per-segment values;
using the segment-aware value for training the machine learning model for semantic segmentation; and
updating a plurality of parameters of the semantic segmentation machine learning model using the segment-aware value,
wherein the step of computing the per-segment value further comprises
computing, for each one of the plurality of pixels, a per-pixel loss value using the ground truth annotation mask and the outcome, wherein the outcome is a per class probability map containing for each pixel a probability of belonging to each class, and
computing a per-segment value for each one of the plurality of segments using a plurality of per-pixel loss values of pixels associated with the one of the plurality of segments, thereby creating a per-segment loss, and wherein the step of calculating a segment-aware value further comprises computing a segment aware-loss value as an aggregation of the per-segment loss values.
|