TraceCodec:一种面向有状态多流网络流量追踪的编译器支持型神经编解码器 / TraceCodec: A Compiler-Backed Neural Codec for Stateful Multi-Flow Network Traffic Traces
1️⃣ 一句话总结
这篇论文提出了一种名为TraceCodec的新方法,它将网络数据包转换为带有时间标记的动作表示,并用编译器将这些动作重建为准确的网络流量,从而解决了现有方法在生成真实网络数据包时常常导致状态混乱的问题。
Critical networking workflows require high-fidelity packet captures (PCAPs) for testing, security analysis, and protocol validation, not just statistical flow-level summaries. Recent packet generators have demonstrated protocol-constrained PCAP synthesis, but they universally decode directly to raw packet fields. That interface entangles learned behavioral choices with deterministic protocol consequences, which forces packet realization to depend on post-hoc heuristic repair. We identify this decode interface as the fundamental bottleneck and present TraceCodec, a state-aware neural codec for stateful multi-flow traces. TraceCodec lifts each packet into a timed packet action with explicit flow slots and transport cues, then learns a continuous per-packet latent. A deterministic compiler lowers decoded actions back to PCAPs, owning endpoint assignment, TCP state, legality constraints, and packet rendering. The latent layer exposes a generator-facing sequence space, so downstream traffic models can operate on packet-action latents rather than raw header fields. On CICIDS2017 Monday, TraceCodec matches packet count, protocol composition, and flow population to within 0.03%. Raw-field baselines under the same non-repair policy distort flow counts and TCP state by orders of magnitude. Structural diagnostics show that TraceCodec preserves TCP state transitions and multi-flow interleaving that raw-field decoders fragment. This work establishes a new foundation for high-fidelity packet-trace generation.
TraceCodec:一种面向有状态多流网络流量追踪的编译器支持型神经编解码器 / TraceCodec: A Compiler-Backed Neural Codec for Stateful Multi-Flow Network Traffic Traces
这篇论文提出了一种名为TraceCodec的新方法,它将网络数据包转换为带有时间标记的动作表示,并用编译器将这些动作重建为准确的网络流量,从而解决了现有方法在生成真实网络数据包时常常导致状态混乱的问题。
源自 arXiv: 2605.29941