← 返回列表

arXiv 提交日期: 2026-06-03

📄 Abstract - DLLG: Dynamic Logit-Level Gating of LLM Experts

Leveraging multiple specialized LLMs can combine complementary strengths, but existing approaches trade adaptability for stability: routing commits prematurely, heuristic ensembling depends on fragile proxies, and parameter merging introduces interference. We propose DLLG (Dynamic Logit-Level Gating), a dynamic logit-level ensembling framework that learns token-level expert fusion from sparse response-level supervision. A lightweight gating module predicts step-wise fusion weights, linking trajectory-level correctness to generation without token-level labels or expert retraining. Across diverse reasoning and code benchmarks, DLLG consistently outperforms strong routing, heuristic ensembling, and parameter-merging baselines across model scales, highlighting learned logit-level fusion as a robust and scalable paradigm for integrating specialized experts.

顶级标签: llm model training

DLLG：大语言模型专家动态逻辑层级门控融合 / DLLG: Dynamic Logit-Level Gating of LLM Experts

1️⃣ 一句话总结

本论文提出了一种名为DLLG的轻量级框架，它能像智能调音师一样，在生成每个词语时动态学习多个专用大语言模型的最佳融合比例，无需昂贵的人工标注或重新训练模型，从而在推理和代码生成等任务上稳定超越传统模型集成方法。

👋 没兴趣 ☆ 感兴趣 📌 待读

打开原文 PDF

源自 arXiv: 2606.04378

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要