菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-03
📄 Abstract - UniSkill: A Dataset for Matching University Curricula to Professional Competencies

Skill extraction and recommendation systems have been studied from recruiter, applicant, and education perspectives. While AI applications in job advertisements have received broad attention, deficiencies in the instructed skills side remain a challenge. In this work, we address the scarcity of publicly available datasets by releasing both manually annotated and synthetic datasets of skills from the European Skills, Competences, Qualifications and Occupations (ESCO) taxonomy and university course pairs and publishing corresponding annotation guidelines. Specifically, we match graduate-level university courses with skills from the Systems Analysts and Management and Organization Analyst ESCO occupation groups at two granularities: course title with a skill, and course sentence with a skill. We train language models on this dataset to serve as a baseline for retrieval and recommendation systems for course-to-skill and skill-to-course matching. We evaluate the models on a portion of the annotated data. Our BERT model achieves 87% F1-score, showing that course and skill matching is a feasible task.

顶级标签: natural language processing data systems
详细标签: skill extraction curriculum matching dataset text classification information retrieval 或 搜索:

UniSkill:一个用于匹配大学课程与职业能力的数据集 / UniSkill: A Dataset for Matching University Curricula to Professional Competencies


1️⃣ 一句话总结

这篇论文发布了一个名为UniSkill的新数据集,用于将大学课程内容与职业所需技能进行匹配,并训练了一个高效的AI模型来支持课程与技能之间的双向推荐。

源自 arXiv: 2603.03134