KoRe:面向大型语言模型的紧凑知识表示方法 / KoRe: Compact Knowledge Representations for Large Language Models
1️⃣ 一句话总结
本文提出了一种名为KoRe的方法,将知识图谱中的1跳子图压缩成离散的知识令牌,并注入大语言模型中,从而在减少10倍令牌使用量的同时,保持了与现有方法相当的任务表现,有效解决了大模型知识不透明、难以更新和易产生幻觉的问题。
Modern Large Language Models (LLMs) have shown impressive performances in user-facing tasks such as question answering, as well as consistent improvements in reasoning capabilities. Still, the way these models encode knowledge seems inherently flawed: by design, LLMs encode world-knowledge within their parameters. This way of representing knowledge is inherently opaque, difficult to debug and update, and prone to hallucinations. On the other hand, Knowledge Graphs can provide human-readable and easily editable world knowledge representations, and their application in knowledge-intensive tasks has consistently proven beneficial to downstream performance. Nonetheless, current integration techniques require extensive retraining or finetuning. To overcome this issue, we introduce KoRe, a methodology to encode 1-hop sub-graphs into compact discrete knowledge tokens and inject them into a LLM backbone. We test the proposed approach on three established benchmarks, and report competitive performances coupled with a significant reduction (up to 10x) in token usage. Our results show that compact discrete KG representations can efficiently and effectively be used to ground modern LLMs.
KoRe:面向大型语言模型的紧凑知识表示方法 / KoRe: Compact Knowledge Representations for Large Language Models
本文提出了一种名为KoRe的方法,将知识图谱中的1跳子图压缩成离散的知识令牌,并注入大语言模型中,从而在减少10倍令牌使用量的同时,保持了与现有方法相当的任务表现,有效解决了大模型知识不透明、难以更新和易产生幻觉的问题。
源自 arXiv: 2605.20170