菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-08
📄 Abstract - Measurement of Generative AI Workload Power Profiles for Whole-Facility Data Center Infrastructure Planning

The rapid growth of generative artificial intelligence (AI) has introduced unprecedented computational demands, driving significant increases in the energy footprint of data centers. However, existing power consumption data is largely proprietary and reported at varying resolutions, creating challenges for estimating whole-facility energy use and planning infrastructure. In this work, we present a methodology that bridges this gap by linking high-resolution workload power measurements to whole-facility energy demand. Using NLR's high-performance computing data center equipped with NVIDIA H100 GPUs, we measure power consumption of AI workloads at 0.1-second resolution for AI training, fine-tuning and inference jobs. Workloads are characterized using MLCommons benchmarks for model training and fine-tuning, and vLLM benchmarks for inference, enabling reproducible and standardized workload profiling. The dataset of power consumption profiles is made publicly available. These power profiles are then scaled to the whole-facility-level using a bottom-up, event-driven, data center energy model. The resulting whole-facility energy profiles capture realistic temporal fluctuations driven by AI workloads and user-behavior, and can be used to inform infrastructure planning for grid connection, on-site energy generation, and distributed microgrids.

顶级标签: systems model training model evaluation
详细标签: power consumption data center energy modeling benchmark infrastructure planning 或 搜索:

生成式AI工作负载功耗特性测量及其对数据中心整体设施基础设施规划的启示 / Measurement of Generative AI Workload Power Profiles for Whole-Facility Data Center Infrastructure Planning


1️⃣ 一句话总结

本研究通过高精度测量生成式AI任务(训练、微调、推理)的实时功耗,并建立模型将其扩展至整个数据中心层面,为电网接入、现场发电等基础设施规划提供了关键的数据和方法支持。

源自 arXiv: 2604.07345