扩展热力学AI模型的规模 / Scaling Up Thermodynamic AI Models
1️⃣ 一句话总结
本文提出了一种基于反向传播的可扩展训练算法,使得基于伊辛模型的热力学计算设备能够高效执行深度卷积网络的图像分类任务,在CIFAR-10和CIFAR-100数据集上分别达到94.9%和76.0%的准确率,并揭示了推理成本与精度之间的可控权衡关系,为低功耗边缘AI推理提供了理论支撑和硬件开发指导。
Thermodynamic computing devices based on the Ising model show great promise for low-power AI inference and edge computing, but scalable methods for training large models for such hardware remain limited. Prior theory shows that the time-averaged behavior of high-temperature Gibbs-sampled Ising systems can implement feed-forward neural inference. We turn this theoretical correspondence into a scalable and purely backpropagation-based algorithm for training deep convolutional networks for thermodynamic inference on Ising machine hardware. Our image classification models achieve accuracies of 94.9% on CIFAR-10 and 76.0% on CIFAR-100 under binary Gibbs sampling. We then develop and experimentally validate a mathematical theory relating inference cost to accuracy and controlling autocorrelation times. Subsequently, we calculate asymptotic results showing that inference cost is bounded by a well-controlled tradeoff with performance and exhibit algorithms for computing optimal inference schedules. Finally, we discuss implications for hardware development and the future of high-temperature thermodynamic AI models.
扩展热力学AI模型的规模 / Scaling Up Thermodynamic AI Models
本文提出了一种基于反向传播的可扩展训练算法,使得基于伊辛模型的热力学计算设备能够高效执行深度卷积网络的图像分类任务,在CIFAR-10和CIFAR-100数据集上分别达到94.9%和76.0%的准确率,并揭示了推理成本与精度之间的可控权衡关系,为低功耗边缘AI推理提供了理论支撑和硬件开发指导。
源自 arXiv: 2607.00170