📄
Abstract - RAISE: RAG Design as an Architecture Search Problem
Retrieval-augmented generation (RAG) systems expose numerous design choices spanning query rewriting, chunking, retrieval depth, reranking, and context compression. In practice, these choices are often configured through heuristics, hindering systematic evaluation and reproducibility across settings. We argue that this challenge is best formulated as RAG architecture search. To support controlled and reproducible study of this problem, we introduce the RAG Intelligence Search Engine (RAISE), a comprehensive framework and benchmark for RAG hyperparameter optimization, which evaluates optimization methods for RAG pipelines under standardized search spaces and budgets. RAISE implements 13 search algorithms and evaluates them across seven public text and multimodal datasets using three random seeds. Our experiments show that optimization performance is highly task-dependent: methods that perform strongly on one dataset may not generalize consistently across others, cautioning against interpreting aggregate rankings as evidence of universally superior strategies. RAISE provides a common experimental substrate for fair, reproducible, and systematic research on RAG hyperparameter optimization.
RAISE:将RAG设计视为一个架构搜索问题 /
RAISE: RAG Design as an Architecture Search Problem
1️⃣ 一句话总结
本文指出当前检索增强生成(RAG)系统的众多设计参数(如查询改写、分块策略、检索深度、重排序与上下文压缩)通常靠经验设定,缺乏系统性和可复现性,因此将其形式化为一个架构搜索问题,并提出了一个名为RAISE的综合框架与基准,通过统一搜索空间和预算、集成13种搜索算法并在7个文本与多模态数据集上测试,揭示了最优参数高度依赖于具体任务,不存在通用的最佳策略。