| CPC G06V 10/82 (2022.01) [G06V 10/422 (2022.01); G06V 30/262 (2022.01)] | 17 Claims |

|
1. A system, said system comprising:
a memory; and
a processor in communication with said memory, said processor being configured to perform operations, said operations comprising:
receiving an input;
extracting features from said input, wherein extracting features from said input comprises:
using an attention network to extract textual features for a plurality of specific modules, the plurality of specific modules including at least a subject module, a location module, and a relation module, wherein the attention network parses different components of the input for each specific module, including parsing a subject component for the subject module, a location component for the location module, and a relation component for the relation module;
mining object relations using said features;
determining feature vectors using said object relations; and
generating, using said feature vectors, an output indicating a target region, wherein said target region corresponds to said input.
|