菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-15
📄 Abstract - A Complete Symmetry Classification of Shallow ReLU Networks

Parameter space is not function space for neural network architectures. This fact, investigated as early as the 1990s under terms such as ``reverse engineering," or ``parameter identifiability", has led to the natural question of parameter space symmetries\textemdash the study of distinct parameters in neural architectures which realize the same function. Indeed, the quotient space obtained by identifying parameters giving rise to the same function, called the \textit{neuromanifold}, has been shown in some cases to have rich geometric properties, impacting optimization dynamics. Thus far, techniques towards complete classifications have required the analyticity of the activation function, notably excising the important case of ReLU. Here, in contrast, we exploit the non-differentiability of the ReLU activation to provide a complete classification of the symmetries in the shallow case.

顶级标签: theory machine learning model training
详细标签: neural networks symmetry classification relu activation parameter identifiability shallow networks 或 搜索:

浅层ReLU网络对称性的完整分类 / A Complete Symmetry Classification of Shallow ReLU Networks


1️⃣ 一句话总结

这篇论文首次为使用ReLU激活函数的浅层神经网络,完整地分类了所有会导致相同网络功能的参数组合(即参数对称性),解决了之前方法因ReLU不可微而无法处理的难题。

源自 arXiv: 2604.14037