arXiv submission date: 2026-06-01

📄 Abstract - Multilinguality of Large Language Models From a Structural Perspective

Large language models (LLMs) have excelled in processing multiple languages through pre- and post-training on multilingual data, even though English dominates the training data. Prior work focusing on token representations has revealed how those LLMs process non-English text. Although these analyses have provided insightful findings, they fail to capture a structural view, which is an inherent property of language. In this study, we explore the multilinguality of LLMs through representational structural analysis. Our findings reveal that low-resource languages are structurally more different from English than high- and mid-resource languages, and that language-specific post-training alters their structures while preserving inter-language relationships.

Top-level tags: llm, natural language processing

Multilinguality of Large Language Models From a Structural Perspective

1️⃣ One-sentence summary

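By analyzing the structure of the language representations inside large language models, this paper finds that low-resource languages differ structurally from English more than high- and mid-resource languages do, and that language-specific post-training alters these structures without disrupting the relative relationships between languages.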

Source: arXiv: 2606.01800
