CPC G06F 21/552 (2013.01) [G06F 2221/034 (2013.01)] | 18 Claims |
1. A method for generating semantic representation of a document using an electronic device (100) to determine data security risk associated with the document, the method comprising:
receiving, by a document semantics controller (160) of the electronic device (100), a document in an electronic form, wherein the document comprises a plurality of content;
determining, by the document semantics controller (160) of the electronic device (100), raw text from the plurality of content;
generating, by the document semantics controller (160) of the electronic device (100), a plurality of sentence blocks of a predefined size using the raw text;
determining, by the document semantics controller (160) of the electronic device (100), at least one embedding for each of the plurality of sentence blocks;
determining, by the document semantics controller (160) of the electronic device (100), the semantic representation of the document based on the at least one embedding for each of the plurality of sentence blocks;
generating, by the document semantics controller (160) of the electronic device (100), the semantic representation of the document to determine the data security risk associated with the document;
determining, by the document semantics controller (160) of the electronic device (100), at least one attribute of a plurality of attributes associated with a user requesting access to the document, wherein at least one attribute indicates a user security risk profile;
determining, by the document semantics controller (160) of the electronic device (100), a document security risk profile based on the semantic representation of the document and semantic representation of neighboring documents; and
determining, by the document semantics controller (160) of the electronic device (100), whether the user security risk profile matches the document security risk profile to determine access to the document.
|
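The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and every name in it is hypothetical. The claim does not specify an embedding model, a pooling rule, or how risk profiles are compared, so the sketch assumes the following: a hashed bag-of-words vector stands in for a learned embedding, block embeddings are mean-pooled into the document representation, the document risk is the average integer risk level of the k most similar labelled neighboring documents, and a user profile "matches" when the user's clearance level is at least the document's risk level.

```python
import hashlib
import math
import re

DIM = 64  # hypothetical embedding width, chosen only for this sketch


def sentence_blocks(raw_text, block_size=2):
    """Split raw text into sentences and group them into blocks of a predefined size."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", raw_text) if s.strip()]
    return [" ".join(sentences[i:i + block_size])
            for i in range(0, len(sentences), block_size)]


def normalise(vec):
    """Scale a vector to unit length; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def embed(text):
    """Assumed stand-in for a learned model: hashed bag-of-words, unit length."""
    vec = [0.0] * DIM
    for token in re.findall(r"[a-z0-9]+", text.lower()):
        digest = hashlib.md5(token.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % DIM] += 1.0
    return normalise(vec)


def document_representation(raw_text, block_size=2):
    """Mean-pool the block embeddings into one document vector (assumed pooling rule)."""
    vectors = [embed(block) for block in sentence_blocks(raw_text, block_size)]
    if not vectors:
        return [0.0] * DIM
    return normalise([sum(column) / len(vectors) for column in zip(*vectors)])


def cosine(a, b):
    """Cosine similarity of two unit-length vectors, which is their dot product."""
    return sum(x * y for x, y in zip(a, b))


def document_risk(doc_vec, labelled_neighbours, k=3):
    """Average the risk levels of the k most similar (vector, risk_level) neighbours."""
    if not labelled_neighbours:
        raise ValueError("at least one labelled neighbouring document is required")
    ranked = sorted(labelled_neighbours,
                    key=lambda pair: cosine(doc_vec, pair[0]),
                    reverse=True)
    nearest = ranked[:k]
    return round(sum(risk for _, risk in nearest) / len(nearest))


def may_access(user_clearance, doc_risk_level):
    """Assumed matching rule: grant access when clearance covers the document risk."""
    return user_clearance >= doc_risk_level


if __name__ == "__main__":
    neighbours = [
        (document_representation("Payroll figures for March. Bank account numbers follow."), 3),
        (document_representation("Team picnic on Friday. Bring your own lunch."), 1),
    ]
    doc = document_representation("Bank account numbers for payroll. Figures for March follow.")
    risk = document_risk(doc, neighbours, k=1)
    print("document risk level:", risk)
    print("access for clearance 2:", may_access(2, risk))
```

In a real system the `embed` function would be replaced by whatever sentence-embedding model is in use, and the integer risk scale and the comparison in `may_access` would follow the organization's own access policy.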