保証付き調整可能なソフト等価性

要旨

等価性はコンピュータビジョンモデルの基本的な性質であるが、実世界データでは厳密な等価性が満たされることは稀であり、モデルの性能を制限する要因となりうる。このため、等価性の度合いを制御することが望ましい。本研究では、モデルの重みを設計された部分空間へ射影することにより、ソフト等価性モデルを構築する汎用フレームワークを提案する。本手法は任意の事前学習済みアーキテクチャに適用可能であり、誘導される等価性誤差の理論的限界を提供する。実証実験では、ViTやResNetを含む複数の事前学習済みバックボーンにおいて、画像分類、セマンティックセグメンテーション、人間軌跡予測タスクにわたる手法の有効性を実証する。特に、競争的なImageNetベンチマークにおいて、等価性誤差を低減しつつ性能向上を達成した。

English

Equivariance is a fundamental property in computer vision models, yet strict equivariance is rarely satisfied in real-world data, which can limit a model's performance. Controlling the degree of equivariance is therefore desirable. We propose a general framework for constructing soft equivariant models by projecting the model weights into a designed subspace. The method applies to any pre-trained architecture and provides theoretical bounds on the induced equivariance error. Empirically, we demonstrate the effectiveness of our method on multiple pre-trained backbones, including ViT and ResNet, across image classification, semantic segmentation, and human-trajectory prediction tasks. Notably, our approach improves the performance while simultaneously reducing equivariance error on the competitive ImageNet benchmark.

保証付き調整可能なソフト等価性

Tunable Soft Equivariance with Guarantees

要旨

Support