RedOne：揭示社交網絡服務中領域特定大語言模型的後訓練機制

摘要

作爲現代信息傳播的主要媒介，社交網絡服務（SNS）經歷了快速增長，這對平臺內容管理和互動質量提升提出了重大挑戰。近年來，大型語言模型（LLMs）的發展提供了潛在的解決方案，但現有研究多聚焦於孤立任務，不僅在單一場景下數據擴展的效益遞減，且難以靈活適應多樣的現實情境。爲應對這些挑戰，我們引入了RedOne，這是一款專爲SNS設計的領域特定LLM，旨在突破單任務基線的性能瓶頸，並爲SNS建立一個全面的基礎。RedOne通過持續預訓練、監督微調和偏好優化的三階段訓練策略開發，利用大規模真實世界數據集。通過廣泛實驗，RedOne保持了強大的通用能力，在8項主要SNS任務上平均提升達14.02%，在SNS雙語評估基準上提升7.56%，相較於基礎模型。此外，通過在線測試，RedOne在有害內容檢測中的曝光率降低了11.23%，在帖子瀏覽搜索中的點擊頁面率提升了14.95%，相比於單任務微調的基線模型。這些結果確立了RedOne作爲一款針對SNS的強健領域特定LLM，展示了在各種任務上的優秀泛化能力以及在現實場景中的廣闊應用前景。

English

As a primary medium for modern information dissemination, social networking services (SNS) have experienced rapid growth, which has proposed significant challenges for platform content management and interaction quality improvement. Recently, the development of large language models (LLMs) has offered potential solutions but existing studies focus on isolated tasks, which not only encounter diminishing benefit from the data scaling within individual scenarios but also fail to flexibly adapt to diverse real-world context. To address these challenges, we introduce RedOne, a domain-specific LLM designed to break the performance bottleneck of single-task baselines and establish a comprehensive foundation for the SNS. RedOne was developed through a three-stage training strategy consisting of continue pretraining, supervised fine-tuning, and preference optimization, using a large-scale real-world dataset. Through extensive experiments, RedOne maintains strong general capabilities, and achieves an average improvement up to 14.02% across 8 major SNS tasks and 7.56% in SNS bilingual evaluation benchmark, compared with base models. Furthermore, through online testing, RedOne reduced the exposure rate in harmful content detection by 11.23% and improved the click page rate in post-view search by 14.95% compared with single-tasks finetuned baseline models. These results establish RedOne as a robust domain-specific LLM for SNS, demonstrating excellent generalization across various tasks and promising applicability in real-world scenarios.

RedOne：揭示社交網絡服務中領域特定大語言模型的後訓練機制

RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services

摘要

Support