FileGram: ファイルシステム行動履跡に基づくエージェントパーソナライゼーションの基盤

要旨

ローカルファイルシステム内で動作する協働型AIエージェントは、人間とAIの相互作用における新たなパラダイムとして急速に台頭している。しかし、厳格なプライバシー障壁やマルチモーダルな実世界行動痕跡の共同収集の困難さにより、拡張可能な訓練と評価が阻害され、効果的な個人化は深刻なデータ制約によって未だ限界がある。既存手法は相互作用中心であり、ファイルシステム操作における高密度な行動痕跡が見落とされている。この課題を解決するため、我々はファイルシステムの行動痕跡に基づいてエージェントのメモリと個人化を基礎付ける包括的フレームワーク「FileGram」を提案する。本フレームワークは3つの核心コンポーネントで構成される：(1) 現実的なワークフローをシミュレートし、細粒度のマルチモーダル行動序列を大規模生成するスケーラブルなペルソナ駆動型データエンジン「FileGramEngine」、(2) プロファイル再構築、痕跡分離、ペルソナドリフト検出、マルチモーダル接地の評価のため、ファイルシステム行動痕跡に基づく診断ベンチマーク「FileGramBench」、(3) 対話要約ではなく原子動作とコンテンツ差分から直接ユーザープロファイルを構築し、これらの痕跡を手続き的・意味的・エピソード的チャネルに符号化するボトムアップメモリアーキテクチャ「FileGramOS」。大規模な実験により、FileGramBenchが最新のメモリシステムにとって未だ困難な課題であり、FileGramEngineとFileGramOSが有効であることを示す。本フレームワークをオープンソース化することで、個人化されたメモリ中心のファイルシステムエージェントに関する将来の研究を支援したい。

English

Coworking AI agents operating within local file systems are rapidly emerging as a paradigm in human-AI interaction; however, effective personalization remains limited by severe data constraints, as strict privacy barriers and the difficulty of jointly collecting multimodal real-world traces prevent scalable training and evaluation, and existing methods remain interaction-centric while overlooking dense behavioral traces in file-system operations; to address this gap, we propose FileGram, a comprehensive framework that grounds agent memory and personalization in file-system behavioral traces, comprising three core components: (1) FileGramEngine, a scalable persona-driven data engine that simulates realistic workflows and generates fine-grained multimodal action sequences at scale; (2) FileGramBench, a diagnostic benchmark grounded in file-system behavioral traces for evaluating memory systems on profile reconstruction, trace disentanglement, persona drift detection, and multimodal grounding; and (3) FileGramOS, a bottom-up memory architecture that builds user profiles directly from atomic actions and content deltas rather than dialogue summaries, encoding these traces into procedural, semantic, and episodic channels with query-time abstraction; extensive experiments show that FileGramBench remains challenging for state-of-the-art memory systems and that FileGramEngine and FileGramOS are effective, and by open-sourcing the framework, we hope to support future research on personalized memory-centric file-system agents.

FileGram: ファイルシステム行動履跡に基づくエージェントパーソナライゼーションの基盤

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

要旨

Support