面向无标注大语言模型自蒸馏的神经元感知数据选择方法 / Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation
1️⃣ 一句话总结
本文提出一种无需人工标注或外部反馈的神经网络自蒸馏框架,通过分析模型内部神经元的激活模式来智能选择高质量训练数据并构建教师模型,从而在保持跨领域通用性的同时提升特定专业任务的表现,避免传统方法导致的性能下降和校准误差问题。
Post-training large language models (LLMs) without real-world interaction feedback or human-labeled supervision remains challenging, particularly in specialized domains where expert annotations are costly to obtain. Recent annotation-free self-evolution methods address this by using the model's own outputs as supervision signals, constructing a teacher via additional context and aggregating predictions across multiple rollouts through majority voting to produce pseudo-labels. However, these approaches are not without drawbacks: SFT- and GRPO-based variants suffer out-of-domain performance degradation, while reward-based on-policy RL inflates calibration error. In this paper, we propose Neuron On-Policy Self-Distillation (Neuron-OPSD), a data-centric framework for annotation-free self-distillation that leverages internal neuron activations to guide both training-data selection and teacher context construction. The model is then trained via on-policy distillation from the teacher distribution, requiring no ground-truth labels at any stage. Across specialized-domain benchmarks, Neuron-OPSD improves in-domain task performance while preserving cross-domain generalization and mitigating calibration collapse over prior annotation-free baselines. This framework is particularly relevant to settings where online interaction or external supervision is costly or infeasible, and is conceptually distinct from offline RL approaches that rely on logged, reward-labeled trajectories.
面向无标注大语言模型自蒸馏的神经元感知数据选择方法 / Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation
本文提出一种无需人工标注或外部反馈的神经网络自蒸馏框架,通过分析模型内部神经元的激活模式来智能选择高质量训练数据并构建教师模型,从而在保持跨领域通用性的同时提升特定专业任务的表现,避免传统方法导致的性能下降和校准误差问题。
源自 arXiv: 2607.02460