ProtoGCD: 一般化カテゴリ発見のための統一的で偏りのないプロトタイプ学習

要旨

一般化カテゴリ発見（GCD）は実用的であるが未開拓の問題であり、モデルが既存クラスのラベル付きサンプルを活用して新規カテゴリを自動的にクラスタリングし発見することを要求する。課題は、未ラベルデータが既存クラスと新規クラスの両方を含むことである。擬似ラベリングとパラメトリック分類器を活用した初期の研究は、既存クラスと新規クラスを別々に扱い、それらの間で精度の不均衡を引き起こした。最近の手法では、コントラスティブ学習を採用しているが、潜在的なポジティブサンプルを見落とし、クラスタリング目的から切り離されているため、偏った表現と最適ではない結果をもたらしている。これらの問題に対処するため、我々は統一された偏りのないプロトタイプ学習フレームワーク、すなわちProtoGCDを導入し、既存クラスと新規クラスを共同プロトタイプと統一された学習目的でモデル化し、既存クラスと新規クラスの間で統一されたモデリングを可能にする。具体的には、確認バイアスを軽減するための二重レベルの適応型擬似ラベリングメカニズムを提案し、GCDに適した表現を学習するための2つの正則化項を組み合わせる。さらに、実用的な考慮から、新規クラスの数を推定する基準を考案した。また、ProtoGCDを拡張して未見の外れ値を検出し、タスクレベルの統一を達成する。包括的な実験により、ProtoGCDが汎用データセットと細粒度データセットの両方で最先端の性能を達成することが示された。コードはhttps://github.com/mashijie1028/ProtoGCDで公開されている。

English

Generalized category discovery (GCD) is a pragmatic but underexplored problem, which requires models to automatically cluster and discover novel categories by leveraging the labeled samples from old classes. The challenge is that unlabeled data contain both old and new classes. Early works leveraging pseudo-labeling with parametric classifiers handle old and new classes separately, which brings about imbalanced accuracy between them. Recent methods employing contrastive learning neglect potential positives and are decoupled from the clustering objective, leading to biased representations and sub-optimal results. To address these issues, we introduce a unified and unbiased prototype learning framework, namely ProtoGCD, wherein old and new classes are modeled with joint prototypes and unified learning objectives, {enabling unified modeling between old and new classes}. Specifically, we propose a dual-level adaptive pseudo-labeling mechanism to mitigate confirmation bias, together with two regularization terms to collectively help learn more suitable representations for GCD. Moreover, for practical considerations, we devise a criterion to estimate the number of new classes. Furthermore, we extend ProtoGCD to detect unseen outliers, achieving task-level unification. Comprehensive experiments show that ProtoGCD achieves state-of-the-art performance on both generic and fine-grained datasets. The code is available at https://github.com/mashijie1028/ProtoGCD.

ProtoGCD: 一般化カテゴリ発見のための統一的で偏りのないプロトタイプ学習

ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery

要旨

Support