大規模言語モデルの著作権保護：手法、課題、トレンドに関するサーベイ

要旨

大規模言語モデルの著作権保護は、その開発コストの高さ、独自の価値、そして悪用の可能性を考慮すると極めて重要である。既存の調査は主に、LLM生成コンテンツの追跡技術、すなわちテキスト透かしに焦点を当てており、モデル自体を保護する方法（モデル透かしやモデルフィンガープリンティング）の体系的な探求はまだ行われていない。さらに、テキスト透かし、モデル透かし、モデルフィンガープリンティングの間の関係と区別は包括的に明らかにされていない。本論文は、モデルフィンガープリンティングに焦点を当て、LLM著作権保護技術の現状を包括的に調査し、以下の側面をカバーする：(1) テキスト透かしからモデル透かしおよびフィンガープリンティングへの概念的つながりを明確にし、モデル透かしをより広範なフィンガープリンティングフレームワークに統合する統一用語を採用する；(2) 多様なテキスト透かし技術の概要と比較を提供し、そのような手法がモデルフィンガープリンティングとして機能する場合を強調する；(3) LLM著作権保護のための既存のモデルフィンガープリンティング手法を体系的に分類・比較する；(4) 初めて、フィンガープリント転送とフィンガープリント除去の技術を提示する；(5) モデルフィンガープリントの評価指標（有効性、無害性、堅牢性、秘匿性、信頼性）をまとめる；(6) 未解決の課題と今後の研究方向性について議論する。本調査は、LLM時代におけるテキスト透かしとモデルフィンガープリンティング技術の理解を研究者に提供し、それらの知的財産保護のさらなる進展を促進することを目的としている。

English

Copyright protection for large language models is of critical importance, given their substantial development costs, proprietary value, and potential for misuse. Existing surveys have predominantly focused on techniques for tracing LLM-generated content-namely, text watermarking-while a systematic exploration of methods for protecting the models themselves (i.e., model watermarking and model fingerprinting) remains absent. Moreover, the relationships and distinctions among text watermarking, model watermarking, and model fingerprinting have not been comprehensively clarified. This work presents a comprehensive survey of the current state of LLM copyright protection technologies, with a focus on model fingerprinting, covering the following aspects: (1) clarifying the conceptual connection from text watermarking to model watermarking and fingerprinting, and adopting a unified terminology that incorporates model watermarking into the broader fingerprinting framework; (2) providing an overview and comparison of diverse text watermarking techniques, highlighting cases where such methods can function as model fingerprinting; (3) systematically categorizing and comparing existing model fingerprinting approaches for LLM copyright protection; (4) presenting, for the first time, techniques for fingerprint transfer and fingerprint removal; (5) summarizing evaluation metrics for model fingerprints, including effectiveness, harmlessness, robustness, stealthiness, and reliability; and (6) discussing open challenges and future research directions. This survey aims to offer researchers a thorough understanding of both text watermarking and model fingerprinting technologies in the era of LLMs, thereby fostering further advances in protecting their intellectual property.

大規模言語モデルの著作権保護：手法、課題、トレンドに関するサーベイ

Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends

要旨

Support