基于图注意力网络和混合密度网络的概率化薪资预测 / Probabilistic Salary Prediction with Graph Attention Networks and a Mixture Density Network
1️⃣ 一句话总结
该论文提出了一种名为GAT-MDN的模型,通过同时挖掘职位属性(如地点、职业、行业)之间的层级和语义关联,并输出薪资的概率分布而非单一数值,从而更准确地预测薪资并反映其不确定性。
Accurate salary prediction is critical for bridging the information gap between employers and job seekers in modern labor markets. Existing approaches predominantly yield a single point estimate and treat job attributes such as location, occupation, and industry as independent categorical features, ignoring both the inherent uncertainty and multi-modality of real-world compensation data and the rich hierarchical and semantic-similarity relationships that govern pay norms. In this paper we propose GAT-MDN, a unified framework that addresses both limitations simultaneously. For each of the three attribute domains we construct a domain-specific graph whose edges encode (i) hierarchical parent-child containment and (ii) weighted similarity links derived from a pre-trained Sentence-Transformer. Parallel Graph Attention Networks (GATs) with edge-feature-aware attention learn rich, context-sensitive node representations from these multi-relational graphs. A priority-based hierarchical selection module then assembles a composite feature vector that gracefully handles missing or coarse attributes, and a Mixture Density Network (MDN) head maps this vector to the parameters of a Gaussian Mixture Model (GMM), yielding a full conditional salary distribution. Extensive experiments on a real-world Dutch job-posting dataset of over 1 million records demonstrate that GAT-MDN significantly outperforms a non-graph MLP-MDN baseline in both Negative Log-Likelihood (NLL) and Mean Squared Error (MSE).
基于图注意力网络和混合密度网络的概率化薪资预测 / Probabilistic Salary Prediction with Graph Attention Networks and a Mixture Density Network
该论文提出了一种名为GAT-MDN的模型,通过同时挖掘职位属性(如地点、职业、行业)之间的层级和语义关联,并输出薪资的概率分布而非单一数值,从而更准确地预测薪资并反映其不确定性。
源自 arXiv: 2606.11663