匈牙利学生作文中的反思层次自动分类 / Automatic Reflection Level Classification in Hungarian Student Essays
1️⃣ 一句话总结
这篇论文首次系统研究了如何自动判断匈牙利语学生作文中反思性思维的深度,通过对比传统机器学习模型与匈牙利语专用Transformer模型,发现经典方法在资源有限时表现更好,而Transformer模型在处理少数类别时更具优势,为形态丰富语言的自动反思分析奠定了基础。
Reflective thinking is a key competency in education, but assessing reflective writing remains a time-consuming and subjective task for education experts. While automated reflective analysis has been explored in several languages, Hungarian language was not researched extensively. In this paper, we present the first comprehensive study on automatic reflection level classification in Hungarian student essays. We used a large, expert-annotated Hungarian dataset consisting of 1,954 reflective essays collected over multiple academic years and labeled on a four-level reflection scale. We investigate two approaches: (1) classical machine learning models using TF-IDF and semantic embedding features, and (2) Hungarian-specific transformer models fine-tuned for document-level reflection classification. To address the strong class imbalance in the dataset, we systematically examine class weighting, oversampling, data augmentation, and alternative loss functions. An extensive ablation study is conducted to analyze the contribution of each modeling and balancing strategy. Our results show that shallow machine learning models with appropriate feature engineering achieve strong overall performance, reaching up to 71% overall score averaged over accuracy, F1-score, and ROC AUC metrics, while transformer-based models achieve slightly lower overall score (68%) averaged over the same metrics, but demonstrate better generalization on minority reflection classes. These findings highlight the continued relevance of classical methods for low-resource settings and the robustness of transformer models for imbalanced classification. The proposed dataset and experimental insights provide a solid foundation for future research on automated reflective analysis in Hungarian and other morphologically rich languages.
匈牙利学生作文中的反思层次自动分类 / Automatic Reflection Level Classification in Hungarian Student Essays
这篇论文首次系统研究了如何自动判断匈牙利语学生作文中反思性思维的深度,通过对比传统机器学习模型与匈牙利语专用Transformer模型,发现经典方法在资源有限时表现更好,而Transformer模型在处理少数类别时更具优势,为形态丰富语言的自动反思分析奠定了基础。
源自 arXiv: 2605.02402