菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-27
📄 Abstract - Continual Model Routing in Evolving Model Hubs

AI model hubs provide access to a rapidly growing collection of powerful pre-trained models, enabling off-the-shelf mixture-of-experts systems with different routing strategies. However, this rapid growth poses two fundamental challenges: scaling model selection across thousands of experts and continually updating routing mechanisms as new models and tasks are introduced. In this paper, we formalise this setting as Continual Model Routing (CMR) and propose CMRBench, a new large-scale benchmark simulating realistic hub expansion and including over 2,000 candidate models. Finally, we introduce CARvE, a contrastive embedding approach for efficient continual model routing via checkpoint-based anchoring and structured replay. Extensive empirical results and ablations show that CARvE significantly outperforms zero-shot retrieval, fine-tuning, and adapter-merging baselines in model, family, and domain-level accuracy.

顶级标签: systems model evaluation benchmark
详细标签: model routing continual learning model hubs contrastive embedding mixture-of-experts 或 搜索:

在持续演化的模型枢纽中的连续模型路由 / Continual Model Routing in Evolving Model Hubs


1️⃣ 一句话总结

本文提出了一种名为CMR的连续模型路由框架,用于应对AI模型枢纽中模型数量快速增长和任务不断更新的挑战,并开发了基于对比嵌入的高效路由方法CARvE,在超过2000个候选模型的测试中显著优于现有基准方法。

源自 arXiv: 2605.28577