大規模言語モデルにおける知識編集に関する包括的研究

要旨

大規模言語モデル（LLMs）は、人間のコミュニケーションに極めて近いテキストの理解と生成において驚異的な能力を示しています。しかし、その主要な制約は、膨大なパラメータ化に起因するトレーニング時の多大な計算資源の要求にあります。この課題は、世界のダイナミックな性質によってさらに深刻化しており、LLMsを頻繁に更新して古くなった情報を修正したり、新しい知識を統合したりする必要があります。これにより、LLMsの継続的な関連性が保たれます。多くのアプリケーションでは、トレーニング後のモデルの継続的な調整が求められ、欠陥や望ましくない動作に対処する必要があることに留意してください。その場でのモデル修正を効率的かつ軽量に行う方法に対する関心が高まっています。この目的のために、近年、LLMsの知識編集技術が急速に発展しており、特定のドメイン内でLLMsの動作を効率的に修正しつつ、様々な入力に対する全体的な性能を維持することを目指しています。本論文では、まず知識編集問題を定義し、次に最先端のアプローチを包括的にレビューします。教育および認知研究の理論からインスピレーションを得て、知識編集手法を外部知識の利用、モデルへの知識の統合、内在する知識の編集の3つのグループに分類する統一的な基準を提案します。さらに、代表的な知識編集アプローチを包括的に評価するための新しいベンチマーク、KnowEditを紹介します。加えて、知識の位置付けに関する詳細な分析を提供し、LLMsに内在する知識構造をより深く理解するための洞察を提供します。最後に、知識編集のいくつかの潜在的な応用について議論し、その広範で影響力のある意義を概説します。

English

Large Language Models (LLMs) have shown extraordinary capabilities in understanding and generating text that closely mirrors human communication. However, a primary limitation lies in the significant computational demands during training, arising from their extensive parameterization. This challenge is further intensified by the dynamic nature of the world, necessitating frequent updates to LLMs to correct outdated information or integrate new knowledge, thereby ensuring their continued relevance. Note that many applications demand continual model adjustments post-training to address deficiencies or undesirable behaviors. There is an increasing interest in efficient, lightweight methods for on-the-fly model modifications. To this end, recent years have seen a burgeoning in the techniques of knowledge editing for LLMs, which aim to efficiently modify LLMs' behaviors within specific domains while preserving overall performance across various inputs. In this paper, we first define the knowledge editing problem and then provide a comprehensive review of cutting-edge approaches. Drawing inspiration from educational and cognitive research theories, we propose a unified categorization criterion that classifies knowledge editing methods into three groups: resorting to external knowledge, merging knowledge into the model, and editing intrinsic knowledge. Furthermore, we introduce a new benchmark, KnowEdit, for a comprehensive empirical evaluation of representative knowledge editing approaches. Additionally, we provide an in-depth analysis of knowledge location, which can provide a deeper understanding of the knowledge structures inherent within LLMs. Finally, we discuss several potential applications of knowledge editing, outlining its broad and impactful implications.

大規模言語モデルにおける知識編集に関する包括的研究

A Comprehensive Study of Knowledge Editing for Large Language Models

要旨

Support