대규모 언어 모델을 위한 저작권 보호: 방법, 과제 및 동향에 대한 조사

초록

대규모 언어 모델(LLM)에 대한 저작권 보호는 그 막대한 개발 비용, 독점적 가치, 그리고 오용 가능성으로 인해 매우 중요합니다. 기존 연구들은 주로 LLM 생성 콘텐츠의 추적 기술, 즉 텍스트 워터마킹에 초점을 맞추어 왔으며, 모델 자체를 보호하는 방법(예: 모델 워터마킹 및 모델 핑거프린팅)에 대한 체계적인 탐구는 아직 부족한 상태입니다. 더욱이, 텍스트 워터마킹, 모델 워터마킹, 그리고 모델 핑거프린팅 간의 관계와 차이점이 포괄적으로 명확히 정리되지 않았습니다. 본 연구는 LLM 저작권 보호 기술의 현재 상태를 종합적으로 조사하며, 특히 모델 핑거프린팅에 초점을 맞추어 다음과 같은 측면을 다룹니다: (1) 텍스트 워터마킹에서 모델 워터마킹 및 핑거프린팅으로의 개념적 연결을 명확히 하고, 모델 워터마킹을 더 넓은 핑거프린팅 프레임워크에 통합하는 통일된 용어를 채택; (2) 다양한 텍스트 워터마킹 기술의 개요 및 비교를 제공하며, 이러한 방법이 모델 핑거프린팅으로 기능할 수 있는 사례를 강조; (3) LLM 저작권 보호를 위한 기존 모델 핑거프린팅 접근법을 체계적으로 분류 및 비교; (4) 최초로 핑거프린트 전이 및 제거 기술을 소개; (5) 모델 핑거프린트의 평가 지표(효과성, 무해성, 견고성, 은밀성, 신뢰성 등)를 요약; (6) 개방형 과제 및 향후 연구 방향을 논의. 본 조사는 연구자들에게 LLM 시대의 텍스트 워터마킹 및 모델 핑거프린팅 기술에 대한 깊은 이해를 제공함으로써, 그들의 지적 재산권 보호를 위한 추가 발전을 촉진하고자 합니다.

English

Copyright protection for large language models is of critical importance, given their substantial development costs, proprietary value, and potential for misuse. Existing surveys have predominantly focused on techniques for tracing LLM-generated content-namely, text watermarking-while a systematic exploration of methods for protecting the models themselves (i.e., model watermarking and model fingerprinting) remains absent. Moreover, the relationships and distinctions among text watermarking, model watermarking, and model fingerprinting have not been comprehensively clarified. This work presents a comprehensive survey of the current state of LLM copyright protection technologies, with a focus on model fingerprinting, covering the following aspects: (1) clarifying the conceptual connection from text watermarking to model watermarking and fingerprinting, and adopting a unified terminology that incorporates model watermarking into the broader fingerprinting framework; (2) providing an overview and comparison of diverse text watermarking techniques, highlighting cases where such methods can function as model fingerprinting; (3) systematically categorizing and comparing existing model fingerprinting approaches for LLM copyright protection; (4) presenting, for the first time, techniques for fingerprint transfer and fingerprint removal; (5) summarizing evaluation metrics for model fingerprints, including effectiveness, harmlessness, robustness, stealthiness, and reliability; and (6) discussing open challenges and future research directions. This survey aims to offer researchers a thorough understanding of both text watermarking and model fingerprinting technologies in the era of LLMs, thereby fostering further advances in protecting their intellectual property.

대규모 언어 모델을 위한 저작권 보호: 방법, 과제 및 동향에 대한 조사

Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends

초록

Support