📄
Abstract - BitTP: The Lightweight Trajectory Prediction Model with BitLLM for Edge-Devices
Trajectory prediction is a fundamental task for autonomous systems, requiring complex reasoning about multi-agent interactions and intents. Large language models (LLMs) have recently been adopted for this task, as they provide strong contextual reasoning and interpretable, language-based trajectory representations. However, these LLM-based predictors are extremely memory- and compute-intensive, making them difficult to deploy on resource-constrained edge devices such as on-board computers in autonomous robots. To bridge this gap, we propose BitTP, which converts an LLM-based trajectory predictor into a lightweight bitlinear architecture. We demonstrate that weight-only quantization to 1.58-bit (BitTP-Weight) is optimal. Crucially, activations must remain in full precision, as quantizing them leads to severe degradation and instability in spatio-temporal reasoning. Empirically, BitTP-Weight not only preserves but improves prediction quality over the full-precision (BF16) LLM baseline, reducing ADE by 14.29% and FDE by 20.97% on average, while simultaneously reducing memory usage and inference latency relative to other quantization methods. These results demonstrate that carefully designed quantization acts as an effective regularizer, enabling the practical deployment of sophisticated LLM-based reasoning on edge devices. Code is available at: this https URL.
BitTP:面向边缘设备的轻量级轨迹预测模型与比特大语言模型 /
BitTP: The Lightweight Trajectory Prediction Model with BitLLM for Edge-Devices
1️⃣ 一句话总结
该论文提出了一种名为BitTP的轻量级轨迹预测模型,通过将大型语言模型(LLM)的权重极致压缩至1.58比特(同时保留全精度激活值),在显著降低内存和计算需求的同时,反而提升了轨迹预测的准确性(平均轨迹误差降低14.29%,最终位移误差降低20.97%),使得原本计算密集的LLM推理能力能够在自动驾驶机器人等边缘设备上高效部署。