菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-23
📄 Abstract - Zero-Shot Test-Time Canonicalization using Out-of-Distribution Scoring

Pretrained vision models often misclassify inputs that are rotated, scaled, or sheared, even though these affine transformations leave the object class unchanged. Robustness is usually restored either by building equivariance into the architecture or by retraining with augmentation, both of which require changing or retraining the model. Test-time canonicalization instead leaves the classifier untouched. It undoes the transformation of each input, mapping it to a canonical form near the training distribution before classification. Existing canonicalizers, however, rely on a narrow set of logit-based energy scores and bespoke search procedures, leaving the design space of scoring functions and optimizers unexplored. We reframe canonicalization as out-of-distribution (OOD) detection, which lets any OOD score serve as the energy minimized over transformations. Across benchmarks ranging from handwritten characters and sketches to natural images and 3D point clouds, we systematically evaluate around twenty OOD scores and nine search algorithms, finding that distance-based scores paired with random search and local refinement perform best overall. Because canonicalizing an already-aligned input can hurt accuracy, we add a gated mechanism that transforms an input only when its OOD score indicates this is needed, preserving most in-distribution accuracy while retaining the robustness gains on transformed inputs. Code is available at this http URL.

顶级标签: computer vision model evaluation
详细标签: test-time canonicalization out-of-distribution detection affine transformations robustness 或 搜索:

利用分布外评分实现零样本测试时规范化 / Zero-Shot Test-Time Canonicalization using Out-of-Distribution Scoring


1️⃣ 一句话总结

本文提出了一种无需重新训练或修改模型的方法,通过将图像变换问题转化为分布外检测任务,利用多种评分函数和搜索算法在测试时自动将旋转、缩放等变形的输入矫正为标准形态,从而提升预训练视觉模型对形变图像的分类鲁棒性。

源自 arXiv: 2606.24178