面向联邦AI即服务的高保真网络管理:跨域编排 / High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration
1️⃣ 一句话总结
这篇论文提出了一种新的管理框架,通过引入一种名为‘尾部风险包络’的可组合描述符,帮助通信服务商在多域联合环境下,确保AI即服务从网络传输到模型推理的端到端高性能和可靠性。
To support the emergence of AI-as-a-Service (AIaaS), communication service providers (CSPs) are on the verge of a radical transformation-from pure connectivity providers to AIaaS a managed network service (control-and-orchestration plane that exposes AI models). In this model, the CSP is responsible not only for transport/communications, but also for intent-to-model resolution and joint network-compute orchestration, i.e., reliable and timely end-to-end delivery. The resulting end-to-end AIaaS service thus becomes governed by communications impairments (delay, loss) and inference impairments (latency, error). A central open problem is an operational AIaaS control-and-orchestration framework that enforces high fidelity, particularly under multi-domain federation. This paper introduces an assurance-oriented AIaaS management plane based on Tail-Risk Envelopes (TREs): signed, composable per-domain descriptors that combine deterministic guardrails with stochastic rate-latency-impairment models. Using stochastic network calculus, we derive bounds on end-to-end delay violation probabilities across tandem domains and obtain an optimization-ready risk-budget decomposition. We show that tenant-level reservations prevent bursty traffic from inflating tail latency under TRE contracts. An auditing layer then uses runtime telemetry to estimate extreme-percentile performance, quantify uncertainty, and attribute tail-risk to each domain for accountability. Packet-level Monte-Carlo simulations demonstrate improved p99.9 compliance under overload via admission control and robust tenant isolation under correlated burstiness.
面向联邦AI即服务的高保真网络管理:跨域编排 / High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration
这篇论文提出了一种新的管理框架,通过引入一种名为‘尾部风险包络’的可组合描述符,帮助通信服务商在多域联合环境下,确保AI即服务从网络传输到模型推理的端到端高性能和可靠性。
源自 arXiv: 2602.15281