开放智能的经济学:模型生态系统中的权力与参与轨迹
Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem
November 27, 2025
作者: Shayne Longpre, Christopher Akiki, Campbell Lund, Atharva Kulkarni, Emily Chen, Irene Solaiman, Avijit Ghosh, Yacine Jernite, Lucie-Aimée Kaffee
cs.AI
摘要
自2019年起,Hugging Face模型库已成为全球共享开源权重AI模型的核心平台。通过发布涵盖完整历史周期的周度模型下载数据集(2020年6月至2025年8月)及模型元数据,本研究对开放模型经济中的集中度动态与演进特征展开了迄今最严谨的实证分析。研究覆盖85.1万个模型、单模型超200项聚合属性及22亿次下载数据。我们记录了经济力量的根本性重构:谷歌、Meta和OpenAI主导的美国开源权重产业霸权显著削弱,独立开发者、社区组织及至2025年崛起的中国产业力量(以DeepSeek和Qwen模型为代表)正成为新主导者,可能预示市场权力的重新整合。研究发现模型属性出现统计显著性变迁:平均模型规模增长17倍,多模态生成(3.4倍)、量化技术(5倍)与专家混合架构(7倍)快速普及,但数据透明度呈现令人担忧的下滑——2025年开源权重模型首次超越真正开源模型。我们还揭示出新兴开发者中间层正专注于基座模型的量化调优与艺术化适配。为持续推进研究与社会监督,我们同步开放完整数据集及交互式仪表板,助力实时监测开放模型经济的集中度动态与演进特征。
English
Since 2019, the Hugging Face Model Hub has been the primary global platform for sharing open weight AI models. By releasing a dataset of the complete history of weekly model downloads (June 2020-August 2025) alongside model metadata, we provide the most rigorous examination to-date of concentration dynamics and evolving characteristics in the open model economy. Our analysis spans 851,000 models, over 200 aggregated attributes per model, and 2.2B downloads. We document a fundamental rebalancing of economic power: US open-weight industry dominance by Google, Meta, and OpenAI has declined sharply in favor of unaffiliated developers, community organizations, and, as of 2025, Chinese industry, with DeepSeek and Qwen models potentially heralding a new consolidation of market power. We identify statistically significant shifts in model properties, a 17X increase in average model size, rapid growth in multimodal generation (3.4X), quantization (5X), and mixture-of-experts architectures (7X), alongside concerning declines in data transparency, with open weights models surpassing truly open source models for the first time in 2025. We expose a new layer of developer intermediaries that has emerged, focused on quantizing and adapting base models for both efficiency and artistic expression. To enable continued research and oversight, we release the complete dataset with an interactive dashboard for real-time monitoring of concentration dynamics and evolving properties in the open model economy.