PEFT의 스케일링에 관하여: 조 단위 매개변수의 백만 개 개인 모델을 향하여

초록

파라미터 효율적 미세 조정(PEFT)은 일반적으로 전체 미세 조정의 저렴한 대안으로 취급된다. 본 연구에서는 더 넓은 역할을 탐구한다: 강력한 공유 기반 모델 위에 지속적 로컬 상태로서의 작은 학습 가능한 어댑터이다. 이러한 프레임워크에서 기본 모델은 공유된 능력을 제공하는 반면, 어댑터는 선호도, 기술, 도구 사용 습관 및 메모리 유사 업데이트와 같은 인스턴스별 행동을 전달한다. 우리는 세 가지 확장 축을 중심으로 문제를 구성한다: 더 강력한 공유 사전 지식이 작은 로컬 업데이트를 더 유용하게 만드는 '확대(Scale Up)', 어댑터가 얼마나 작아질 수 있으면서도 신뢰성을 유지하는지 연구하는 '축소(Scale Down)', 그리고 많은 지속적인 적응 인스턴스가 공존하는 '확장(Scale Out)'이다. MinT는 어댑터의 정체성, 버전 관리, 출처, 평가 및 서비스 상주 관리를 위한 하나의 인프라 사례를 제공한다. 이러한 결과들은 종합적으로 PEFT가 예산 대체 수단으로서의 전체 미세 조정에 국한되지 않고, 지속적인 개인 모델을 위한 소형 기반(substrate)이 될 수 있음을 시사한다.

English

Parameter-efficient fine-tuning (PEFT) is usually treated as a cheaper alternative to full fine-tuning. We study a broader role: small trainable adapters as persistent local state on top of strong shared foundation models. In this framing, the base model provides shared competence while adapters carry instance-specific behavior such as preferences, skills, tool habits, and memory-like updates. We organize the problem around three scaling axes: Scale Up, where stronger shared priors make small local updates more useful; Scale Down, where we study how small adapters can be while remaining reliable; and Scale Out, where many persistent adapted instances coexist. MinT provides one infrastructure example for managing adapter identity, revision, provenance, evaluation, and serving residency. Together, the results suggest that PEFT can be a compact substrate for persistent personal models rather than only a budget substitute for full fine-tuning.