基于精度和延迟感知的用户激励实现AI推理绿色化 / Greening AI Inference with Accuracy and Latency-aware User Incentives
1️⃣ 一句话总结
本文提出了一种通过用户激励框架来减少AI推理服务碳排放的方法,该框架根据用户对推理精度和延迟的偏好以及环保意识,设计两级订阅服务(如提供折扣换取低碳时段降低服务质量),在保证用户体验的同时平衡碳排放与服务质量。
The widespread use of AI services has raised concerns for its environmental sustainability, towards which recent studies have identified carbon emissions of AI inference as the major contributor. This paper introduces a framework for designing AI inference incentives based on the users' valuation for inference quality and latency, together with their environmental consciousness, while accounting for the tradeoff between carbon emissions and the two QoE parameters. Our approach can accommodate different tradeoffs, that depend on the size and complexity of the AI models and the allocation of resources to serve inference requests. The incentives can be offered through a practical two-tier service subscription that offers users a discount in exchange for reduced carbon emissions. The discounted service option gives the AI provider the flexibility to serve some percentage of inference requests at a lower quality and higher latency during periods of high carbon intensity.
基于精度和延迟感知的用户激励实现AI推理绿色化 / Greening AI Inference with Accuracy and Latency-aware User Incentives
本文提出了一种通过用户激励框架来减少AI推理服务碳排放的方法,该框架根据用户对推理精度和延迟的偏好以及环保意识,设计两级订阅服务(如提供折扣换取低碳时段降低服务质量),在保证用户体验的同时平衡碳排放与服务质量。
源自 arXiv: 2605.27309