Hugging Faceのモデルアトラスのチャート作成とナビゲーション

要旨

現在、数百万ものニューラルネットワークが公開されており、大規模なモデルリポジトリの検索と分析がますます重要になっています。これほど多くのモデルをナビゲートするにはアトラスが必要ですが、ほとんどのモデルは十分にドキュメント化されていないため、そのようなアトラスを作成するのは困難です。モデルリポジトリの潜在能力を探るため、私たちはHugging Faceのドキュメント化された部分を表す予備的なアトラスを作成しました。このアトラスは、モデルの風景と進化を驚くほど視覚化します。私たちは、このアトラスのいくつかの応用例を示します。例えば、モデルの属性（例：精度）を予測したり、コンピュータビジョンモデルのトレンドを分析したりします。しかし、現在のアトラスは不完全であるため、ドキュメント化されていない領域をマッピングする方法を提案します。具体的には、現実世界で支配的なモデルトレーニングの実践に基づいて、高信頼性の構造的プライアを特定します。これらのプライアを活用することで、私たちのアプローチは、以前にドキュメント化されていなかったアトラスの領域を正確にマッピングすることができます。私たちは、データセット、コード、およびインタラクティブなアトラスを公開します。

English

As there are now millions of publicly available neural networks, searching and analyzing large model repositories becomes increasingly important. Navigating so many models requires an atlas, but as most models are poorly documented charting such an atlas is challenging. To explore the hidden potential of model repositories, we chart a preliminary atlas representing the documented fraction of Hugging Face. It provides stunning visualizations of the model landscape and evolution. We demonstrate several applications of this atlas including predicting model attributes (e.g., accuracy), and analyzing trends in computer vision models. However, as the current atlas remains incomplete, we propose a method for charting undocumented regions. Specifically, we identify high-confidence structural priors based on dominant real-world model training practices. Leveraging these priors, our approach enables accurate mapping of previously undocumented areas of the atlas. We publicly release our datasets, code, and interactive atlas.

Hugging Faceのモデルアトラスのチャート作成とナビゲーション

Charting and Navigating Hugging Face's Model Atlas

要旨

Support