菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-16
📄 Abstract - $p^2$RAG: Privacy-Preserving RAG Service Supporting Arbitrary Top-$k$ Retrieval

Retrieval-Augmented Generation (RAG) enables large language models to use external knowledge, but outsourcing the RAG service raises privacy concerns for both data owners and users. Privacy-preserving RAG systems address these concerns by performing secure top-$k$ retrieval, which typically is secure sorting to identify relevant documents. However, existing systems face challenges supporting arbitrary $k$ due to their inability to change $k$, new security issues, or efficiency degradation with large $k$. This is a significant limitation because modern long-context models generally achieve higher accuracy with larger retrieval sets. We propose $p^2$RAG, a privacy-preserving RAG service that supports arbitrary top-$k$ retrieval. Unlike existing systems, $p^2$RAG avoids sorting candidate documents. Instead, it uses an interactive bisection method to determine the set of top-$k$ documents. For security, $p^2$RAG uses secret sharing on two semi-honest non-colluding servers to protect the data owner's database and the user's prompt. It enforces restrictions and verification to defend against malicious users and tightly bound the information leakage of the database. The experiments show that $p^2$RAG is 3--300$\times$ faster than the state-of-the-art PRAG for $k = 16$--$1024$.

顶级标签: llm systems data
详细标签: privacy-preserving retrieval secure computation secret sharing retrieval-augmented generation top-k retrieval 或 搜索:

p²RAG:支持任意Top-k检索的隐私保护RAG服务 / $p^2$RAG: Privacy-Preserving RAG Service Supporting Arbitrary Top-$k$ Retrieval


1️⃣ 一句话总结

这篇论文提出了一种名为p²RAG的新系统,它能在保护用户提问和数据所有者知识库隐私的前提下,高效且灵活地检索任意数量的最相关文档,从而提升大语言模型回答的准确性。

源自 arXiv: 2603.14778