Expressivity of congruence-based architectures for DNNs on positive-definite matrices

📄 Abstract - Expressivity of congruence-based architectures for DNNs on positive-definite matrices

This work studies neural architectures for classifying symmetric positive-definite matrices, focusing on congruence-like layers, in which the input matrix is multiplied on the left and right by a (possibly rectangular) weight matrix $W$ and its transpose. Such layers lie at the core of the celebrated SPDNet and have also been employed independently for dimensionality reduction on positive-definite data. We show that the (semi)-orthogonality constraint commonly imposed on $W$ limits the expressivity of these layers: for certain activation functions, the resulting architecture collapses to a one-hidden-layer equivalent. This lack of expressivity follows from a loss of spectral diversity in congruence-like layers for semi-orthogonal $W$ and is a direct consequence of Poincaré's separation theorem. We then examine the choice of the final classifier, comparing several Riemannian classifiers and discussing their compatibility with the feature maps produced by congruence-like layers.

基于同余架构的深度神经网络在正定矩阵上的表达能力研究 / Expressivity of congruence-based architectures for DNNs on positive-definite matrices

1️⃣ 一句话总结

本文揭示了处理正定矩阵的神经网络中常用的一种层（同余层）在施加半正交约束后，会因损失特征值的多样性而导致网络表达能力大幅下降，甚至退化为单隐层网络，并探讨了不同分类器与这种特征提取方式的匹配问题。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要