RedOne: 소셜 네트워킹 서비스에서의 도메인 특화 LLM 사후 훈련 공개

초록

현대 정보 전달의 주요 매체로서 소셜 네트워크 서비스(SNS)는 급속한 성장을 경험하며 플랫폼 콘텐츠 관리와 상호작용 품질 향상에 상당한 도전을 제시해 왔다. 최근 대규모 언어 모델(LLM)의 발전이 잠재적인 해결책을 제공하고 있지만, 기존 연구는 단일 작업에 초점을 맞추고 있어 개별 시나리오 내 데이터 확장으로 인한 한계를 겪을 뿐만 아니라 다양한 실제 상황에 유연하게 적응하지 못하고 있다. 이러한 문제를 해결하기 위해, 본 연구는 단일 작업 기준선의 성능 병목 현상을 극복하고 SNS를 위한 포괄적인 기반을 마련하기 위해 도메인 특화 LLM인 RedOne을 소개한다. RedOne은 대규모 실제 데이터셋을 사용하여 지속적인 사전 학습, 지도 미세 조정, 선호 최적화로 구성된 3단계 학습 전략을 통해 개발되었다. 광범위한 실험을 통해 RedOne은 강력한 일반화 능력을 유지하며, 기본 모델 대비 8가지 주요 SNS 작업에서 평균 14.02%, SNS 이중 언어 평가 벤치마크에서 7.56%의 성능 향상을 달성했다. 또한, 온라인 테스트를 통해 RedOne은 유해 콘텐츠 탐지에서 노출률을 11.23% 감소시키고, 게시물 검색에서 클릭 페이지율을 14.95% 향상시켰다. 이러한 결과는 RedOne이 다양한 작업에서 우수한 일반화 능력을 보이며 실제 시나리오에서 유망한 적용 가능성을 입증하는 SNS 도메인 특화 LLM으로서의 견고함을 입증한다.

English

As a primary medium for modern information dissemination, social networking services (SNS) have experienced rapid growth, which has proposed significant challenges for platform content management and interaction quality improvement. Recently, the development of large language models (LLMs) has offered potential solutions but existing studies focus on isolated tasks, which not only encounter diminishing benefit from the data scaling within individual scenarios but also fail to flexibly adapt to diverse real-world context. To address these challenges, we introduce RedOne, a domain-specific LLM designed to break the performance bottleneck of single-task baselines and establish a comprehensive foundation for the SNS. RedOne was developed through a three-stage training strategy consisting of continue pretraining, supervised fine-tuning, and preference optimization, using a large-scale real-world dataset. Through extensive experiments, RedOne maintains strong general capabilities, and achieves an average improvement up to 14.02% across 8 major SNS tasks and 7.56% in SNS bilingual evaluation benchmark, compared with base models. Furthermore, through online testing, RedOne reduced the exposure rate in harmful content detection by 11.23% and improved the click page rate in post-view search by 14.95% compared with single-tasks finetuned baseline models. These results establish RedOne as a robust domain-specific LLM for SNS, demonstrating excellent generalization across various tasks and promising applicability in real-world scenarios.

RedOne: 소셜 네트워킹 서비스에서의 도메인 특화 LLM 사후 훈련 공개

RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services

초록

Support