菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-02
📄 Abstract - DCoPilot: Generative AI-Empowered Policy Adaptation for Dynamic Data Center Operations

Modern data centers (DCs) hosting artificial intelligence (AI)-dedicated devices operate at high power densities with rapidly varying workloads, making minute-level adaptation essential for safe and energy-efficient operation. However, manually designing piecewise deep reinforcement learning (DRL) agents cannot keep pace with frequent dynamics shifts and service-level agreement (SLA) changes of an evolving DC. This specification-to-policy lag causes a lack of timely, effective control policies, which may lead to service outages. To bridge the gap, we present DCoPilot, a hybrid framework for generative control policies in dynamic DC operation. DCoPilot synergizes two distinct generative paradigms, i.e., a large language model (LLM) that performs symbolic generation of structured reward forms, and a hypernetwork that conducts parametric generation of policy weights. DCoPilot operates through three coordinated phases: (i) simulation scale-up, which stress-tests reward candidates across diverse simulation-ready (SimReady) scenes; (ii) meta policy distillation, where a hypernetwork is trained to output policy weights conditioned on SLA and scene embeddings; and (iii) online adaptation, enabling zero-shot policy generation in response to updated specifications. Evaluated across five control task families spanning diverse DC components, DCoPilot achieves near-zero constraint violations and outperforms all baselines across specification variations. Ablation studies validate the effectiveness of LLM-based unified reward generation in enabling stable hypernetwork convergence.

顶级标签: systems agents model training
详细标签: data center optimization generative ai reinforcement learning policy adaptation large language models 或 搜索:

DCoPilot:面向动态数据中心运营的生成式AI赋能的策略自适应系统 / DCoPilot: Generative AI-Empowered Policy Adaptation for Dynamic Data Center Operations


1️⃣ 一句话总结

这篇论文提出了一个名为DCoPilot的智能框架,它结合了大语言模型和超网络两种生成式AI技术,能够根据数据中心不断变化的工作负载和服务协议,自动、快速地生成最优控制策略,从而保障数据中心的安全和高效运行。

源自 arXiv: 2602.02137