US 12,001,518 B2
	Method for predicting matching degree between resume and post, and related device
Honghui Chen, Hunan (CN); Taihua Shao, Hunan (CN); Chengyu Song, Hunan (CN); Miao Jiang, Hunan (CN); Mengru Wang, Hunan (CN); Xin Zhang, Hunan (CN); Fei Cai, Hunan (CN); Dengfeng Liu, Hunan (CN); and Siyuan Wang, Hunan (CN)
Assigned to National University of Defense Technology, Changsha (CN)
Filed by National University of Defense Technology, Hunan (CN)
Filed on Dec. 16, 2022, as Appl. No. 18/083,211.
Claims priority of application No. 202111548891.0 (CN), filed on Dec. 17, 2021.
Prior Publication US 2023/0195850 A1, Jun. 22, 2023
Int. Cl. G06F 17/16 (2006.01); G06F 18/241 (2023.01); G06N 5/01 (2023.01); G06Q 10/1053 (2023.01)

CPC G06F 18/241 (2023.01) [G06F 17/16 (2013.01); G06N 5/01 (2023.01); G06Q 10/1053 (2013.01)]

16 Claims

1. A method for predicting matching degree between resume information and post information, implemented via a processor, comprising:

receiving, via input interface, the resume information and the post information from storage;

obtaining, via a processor, a first key and a first value of a respective semi-structured post attribute in the post information and a second key and a second value of a respective semi-structured resume attribute in the resume information, the first key, the first value, the second key and the second value being all expressed in text data; and

predicting, via the processor, the matching degree between the resume information and the post information by a prediction model including a cascaded pre-trained language model, a Transformer encoder and a single label classification model, based on the first key and the first value of the respective post attribute, a first source representation corresponding to the post information, the second key and the second value of the respective resume attribute, and a second source representation corresponding to the resume information;

wherein predicting the matching degree between the resume information and the post information comprises:

for the first key and the first value of the respective post attribute, respectively encoding the first key and the first value into a semantic space through the pre-trained language model so as to obtain a first key embedding and a first value embedding, and fusing the first key embedding and the first value embedding so as to obtain a first fused embedding of the post attribute;

encoding the first source representation into the semantic space through the pre-trained language model so as to obtain a first source embedding;

for the second key and the second value of the respective resume attribute, respectively encoding the second key and the second value into the semantic space through the pre-trained language model so as to obtain a second key embedding and a second value embedding, and fusing the second key embedding and the second value embedding so as to obtain a second fused embedding of the resume attribute;

encoding, via the processor, the second source representation into the semantic space through the pre-trained language model so as to obtain a second source embedding; and

performing, via the processor, internal interaction of a first matrix including the first fusion embedding of the respective post attribute so as to obtain a first internal-interaction-attribute embedding matrix and performing internal interaction of a second matrix including the second fusion embedding of the respective resume attribute so as to obtain a second internal-interaction-attribute embedding matrix, with the Transformer encoder;

wherein the fusing and embedding operations are performed on all attributes in custom character

and

to obtain matrix representations of custom character

and

X_J=[j₁^a;j₂^a; . . . ;j_m^a],

X_R=[r₁^a;r₂^a; . . . ;r_n^a],

where J is the post information, custom character

is the resume information, j_i^a, r_j^a∈ custom character

^d^_inare fused and embedded expressions of an i-th attribute in custom character

and a j-th attribute in custom character

, respectively, i∈{1,2, . . . , m}, j∈{1,2, . . . , n}, X_j∈ custom character

^m×d^_inis a first matrix and the matrix representation of custom character

, X_R∈

^n×d^_inis a second matrix and the matrix representation of custom character

, m and n are corresponding numbers of attributes in custom character

and

, and d_inis a vector dimension after an attribute value and an attribute key are fused;

wherein a multi-head self-attention matrix representation of custom character

, M_J, and a multi-head self-attention matrix representation of custom character

, M_R, are calculated according to:

where Q_J^h, Q_R^h∈ custom character

^d^_in^×d^_q, K_J^h, K_R^h∈ custom character

^d^_in^×d^_k, V_J^h, V_R^h∈ custom character

^d^_in^×d^_vkand O_J, O_R∈ custom character

^H·d^_v^×d^_inare trainable network parameters; h∈{1,2, . . . , H}, H represents a number of headers of the Transformer encoder, d_q=d_k=d_v=d_in/H, M_J^hand M_R^hare self-attention matrix representations of custom character

and

at an h-th head;

M_Jand M_Rare input into a feedforward layer of the Transformer encoder to obtain internal interaction representations of custom character

and

as follows:

where j_i^a′, r_j^a′∈ custom character

^d^_inare internal-interaction-attribute embeddings of the i-th attribute in custom character

and the j-th attribute in custom character

, i∈{1,2, . . . , m} and j∈{1,2, . . . , n}, respectively; and

wherein the prediction model is trained by minimizing a binary cross entropy loss according to:

=−1/NΣ_i=1^N( custom character

_ilog(

′_i)+(1−

_i)log(1−

′_i)),

where

_iis a true matching degree of an i-th training instance; custom character

′_iis a predicted matching degree of the i-th training instance generated by the model; N is a total number of training instances,

thus predicting an accurate matching degree between the resume information and the post information to obtain rich representations of the resume information and the post information; and

presenting, via output interface, rich representations of the resume information and the post information, which comprising the resume information, the post information and the accurate matching degree, wherein when the accurate matching degree is 1 indicates match, and when the accurate matching degree is 0 indicates no match.