别跑题:基于带注意力机制的非线性朴素贝叶斯的主题连续性模型 / Don't Shoot The Breeze: Topic Continuity Model Using Nonlinear Naive Bayes With Attention
1️⃣ 一句话总结
这篇论文提出了一种新颖的模型,通过结合朴素贝叶斯、注意力机制和对数非线性变换,来评估大语言模型聊天机器人的回复是否与对话初始主题保持一致,从而提升用户体验和计算效率,并且该模型能处理任意长度的对话且易于解释。
Utilizing Large Language Models (LLM) as chatbots in diverse business scenarios often presents the challenge of maintaining topic continuity. Abrupt shifts in topics can lead to poor user experiences and inefficient utilization of computational resources. In this paper, we present a topic continuity model aimed at assessing whether a response aligns with the initial conversation topic. Our model is built upon the expansion of the corresponding natural language understanding (NLU) model into quantifiable terms using a Naive Bayes approach. Subsequently, we have introduced an attention mechanism and logarithmic nonlinearity to enhance its capability to capture topic continuity. This approach allows us to convert the NLU model into an interpretable analytical formula. In contrast to many NLU models constrained by token limits, our proposed model can seamlessly handle conversations of any length with linear time complexity. Furthermore, the attention mechanism significantly improves the model's ability to identify topic continuity in complex conversations. According to our experiments, our model consistently outperforms traditional methods, particularly in handling lengthy and intricate conversations. This unique capability offers us an opportunity to ensure the responsible and interpretable use of LLMs.
别跑题:基于带注意力机制的非线性朴素贝叶斯的主题连续性模型 / Don't Shoot The Breeze: Topic Continuity Model Using Nonlinear Naive Bayes With Attention
这篇论文提出了一种新颖的模型,通过结合朴素贝叶斯、注意力机制和对数非线性变换,来评估大语言模型聊天机器人的回复是否与对话初始主题保持一致,从而提升用户体验和计算效率,并且该模型能处理任意长度的对话且易于解释。
源自 arXiv: 2602.09312