대규모 언어 모델을 위한 지식 편집에 관한 포괄적 연구

초록

대형 언어 모델(LLMs)은 인간의 의사소통을 매우 정교하게 이해하고 생성하는 탁월한 능력을 보여주고 있습니다. 그러나 이러한 모델의 주요 한계점은 방대한 파라미터화로 인해 학습 과정에서 발생하는 상당한 계산 자원 요구에 있습니다. 이러한 문제는 세상이 끊임없이 변화한다는 점에서 더욱 심화되는데, 이는 LLMs가 최신 정보를 반영하거나 새로운 지식을 통합하기 위해 빈번한 업데이트를 필요로 하여 지속적인 관련성을 유지해야 하기 때문입니다. 또한, 많은 응용 분야에서는 학습 후 모델의 결함이나 바람직하지 않은 행동을 해결하기 위해 지속적인 조정이 요구됩니다. 이에 따라 실시간 모델 수정을 위한 효율적이고 경량화된 방법에 대한 관심이 점차 증가하고 있습니다. 이를 위해 최근 몇 년간 LLMs의 특정 영역에서의 행동을 효율적으로 수정하면서도 다양한 입력에 대한 전반적인 성능을 유지하는 지식 편집 기술이 급속도로 발전해 왔습니다. 본 논문에서는 먼저 지식 편집 문제를 정의하고, 최첨단 접근 방식에 대한 포괄적인 리뷰를 제공합니다. 교육 및 인지 연구 이론에서 영감을 얻어, 우리는 지식 편집 방법을 외부 지식 활용, 모델 내 지식 통합, 내재적 지식 편집이라는 세 가지 그룹으로 분류하는 통일된 기준을 제안합니다. 더불어, 대표적인 지식 편집 접근법을 종합적으로 평가하기 위한 새로운 벤치마크인 KnowEdit를 소개합니다. 또한, LLMs 내에 내재된 지식 구조를 더 깊이 이해할 수 있도록 지식 위치에 대한 심층 분석을 제공합니다. 마지막으로, 지식 편집의 잠재적 응용 분야를 논의하며 그 광범위하고 중요한 함의를 제시합니다.

English

Large Language Models (LLMs) have shown extraordinary capabilities in understanding and generating text that closely mirrors human communication. However, a primary limitation lies in the significant computational demands during training, arising from their extensive parameterization. This challenge is further intensified by the dynamic nature of the world, necessitating frequent updates to LLMs to correct outdated information or integrate new knowledge, thereby ensuring their continued relevance. Note that many applications demand continual model adjustments post-training to address deficiencies or undesirable behaviors. There is an increasing interest in efficient, lightweight methods for on-the-fly model modifications. To this end, recent years have seen a burgeoning in the techniques of knowledge editing for LLMs, which aim to efficiently modify LLMs' behaviors within specific domains while preserving overall performance across various inputs. In this paper, we first define the knowledge editing problem and then provide a comprehensive review of cutting-edge approaches. Drawing inspiration from educational and cognitive research theories, we propose a unified categorization criterion that classifies knowledge editing methods into three groups: resorting to external knowledge, merging knowledge into the model, and editing intrinsic knowledge. Furthermore, we introduce a new benchmark, KnowEdit, for a comprehensive empirical evaluation of representative knowledge editing approaches. Additionally, we provide an in-depth analysis of knowledge location, which can provide a deeper understanding of the knowledge structures inherent within LLMs. Finally, we discuss several potential applications of knowledge editing, outlining its broad and impactful implications.

대규모 언어 모델을 위한 지식 편집에 관한 포괄적 연구

A Comprehensive Study of Knowledge Editing for Large Language Models

초록

Support