菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-27
📄 Abstract - CiMRAG: Cim-Aware Domain-Adaptive and Noise-Resilient Retrieval-Augmented Generation for Edge-Based LLMs

Personalized virtual assistants powered by large language models (LLMs) on edge devices are attracting growing attention, with Retrieval-Augmented Generation (RAG) emerging as a key method for personalization by retrieving relevant profile data and generating tailored responses. However, deploying RAG on edge devices faces efficiency hurdles due to the rapid growth of profile data, such as user-LLM interactions and recent updates. While Computing-in-Memory (CiM) architectures mitigate this bottleneck by eliminating data movement between memory and processing units via in-situ operations, they are susceptible to environmental noise that can degrade retrieval precision. This poses a critical issue in dynamic, multi-domain edge-based scenarios (e.g., travel, medicine, and law) where both accuracy and adaptability are paramount. To address these challenges, we propose Task-Oriented Noise-resilient Embedding Learning (TONEL), a framework that improves noise robustness and domain adaptability for RAG in noisy edge environments. TONEL employs a noise-aware projection model to learn task-specific embeddings compatible with CiM hardware constraints, enabling accurate retrieval under noisy conditions. Extensive experiments conducted on personalization benchmarks demonstrate the effectiveness and practicality of our methods relative to strong baselines, especially in task-specific noisy scenarios.

顶级标签: llm systems model training
详细标签: retrieval-augmented generation edge computing computing-in-memory noise robustness domain adaptation 或 搜索:

CiMRAG:面向边缘大语言模型、具备存内计算感知、领域自适应与噪声鲁棒性的检索增强生成方法 / CiMRAG: Cim-Aware Domain-Adaptive and Noise-Resilient Retrieval-Augmented Generation for Edge-Based LLMs


1️⃣ 一句话总结

这篇论文提出了一种名为TONEL的新方法,旨在解决在边缘设备上运行个性化大语言模型时,因用户数据增长和环境噪声干扰导致的检索精度下降问题,从而在嘈杂多变的应用场景中实现更准确、更自适应的个性化响应生成。

源自 arXiv: 2601.20041