误差高速公路:将预测编码扩展到非常深的网络 / Error Highways: Scaling Predictive Coding to Very Deep Networks
1️⃣ 一句话总结
本文提出了一种名为“高速公路误差传播”的方法,通过在预测编码网络中为深层隐藏层直接引入输出误差的线性反馈,解决了传统预测编码在深层网络中学习信号快速衰减的问题,从而能够有效训练多达128层的神经网络,且精度不受深度影响。
Predictive coding networks (PCNs) offer a biologically-plausible, local-learning alternative to back-propagation of errors (backprop). Nevertheless, they have remained largely confined to shallow architectures and evaluated on simple machine intelligence benchmarks. A central obstacle to scaling PCNs is that the learning signal decays rapidly as it propagates away from the clamped boundaries, leaving interior layers effectively unchanged. To directly counter this problem, we propose highway error propagation (HEP), a scheme that augments the free energy function underlying predictive coding (PC) by altering its neural structure with feedback matrices $V_{L\to i}$ that couple selected hidden states directly to the clamped output error. Since this coupling is linear in the hidden state, the highway pathway delivers a correction at every inference step whose magnitude is independent of depth, in contrast to vanilla PC where the output error reaches the $i$-th hidden layer with attenuation that decays exponentially in depth. This bypasses the Jacobian chain while preserving the local PC synaptic update rule. On MNIST and Fashion-MNIST, we show that HEP effectively trains MLPs of up to 128 layers with accuracy that is robust with respect to depth.
误差高速公路:将预测编码扩展到非常深的网络 / Error Highways: Scaling Predictive Coding to Very Deep Networks
本文提出了一种名为“高速公路误差传播”的方法,通过在预测编码网络中为深层隐藏层直接引入输出误差的线性反馈,解决了传统预测编码在深层网络中学习信号快速衰减的问题,从而能够有效训练多达128层的神经网络,且精度不受深度影响。
源自 arXiv: 2606.22744