ClawKeeper：スキル、プラグイン、ウォッチャーによるOpenClawエージェントの包括的安全保護

要旨

OpenClawは、ツール連携、ローカルファイルアクセス、シェルコマンド実行などの強力な機能を備えたオープンソースの自律エージェントランタイムとして、急速に主要な地位を確立しています。しかし、このような広範な操作権限は重大なセキュリティ脆弱性を導入し、モデルの誤りが機密データ漏洩、権限昇格、悪意ある第三者のスキル実行といった実在のシステムレベル脅威に変換される危険性があります。OpenClawエコシステムにおける既存のセキュリティ対策は非常に断片的であり、エージェントライフサイクルの特定段階のみを対象とするもので、包括的な保護を提供していません。この課題を解決するため、我々は3つの相補的なアーキテクチャ層にわたる多次元保護メカニズムを統合したリアルタイムセキュリティフレームワーク「ClawKeeper」を提案します。(1) スキルベース保護は命令レベルで動作し、構造化されたセキュリティポリシーをエージェントコンテキストに直接注入することで、環境固有の制約とクロスプラットフォーム境界の強制を実現します。(2) プラグインベース保護は内部ランタイムエンフォーサーとして機能し、実行パイプライン全体において設定の強化、積極的脅威検出、継続的行動監視を提供します。(3) ウォッチャーベース保護は、エージェントの状態遷移を継続的に検証する新しい分離型システムレベルセキュリティミドルウェアを導入します。これはエージェントの内部ロジックに結合することなくリアルタイム実行介入を可能にし、高リスク行動の停止や人的確認の強制などの操作を支援します。我々はこのウォッチャーパラダイムが、次世代自律エージェントシステムを保護するための基礎的構成要素として強い可能性を秘めていると主張します。多様な脅威シナリオにおけるClawKeeperの有効性と堅牢性は、詳細な定性・定量評価により実証されています。我々はコードを公開します。

English

OpenClaw has rapidly established itself as a leading open-source autonomous agent runtime, offering powerful capabilities including tool integration, local file access, and shell command execution. However, these broad operational privileges introduce critical security vulnerabilities, transforming model errors into tangible system-level threats such as sensitive data leakage, privilege escalation, and malicious third-party skill execution. Existing security measures for the OpenClaw ecosystem remain highly fragmented, addressing only isolated stages of the agent lifecycle rather than providing holistic protection. To bridge this gap, we present ClawKeeper, a real-time security framework that integrates multi-dimensional protection mechanisms across three complementary architectural layers. (1) Skill-based protection operates at the instruction level, injecting structured security policies directly into the agent context to enforce environment-specific constraints and cross-platform boundaries. (2) Plugin-based protection serves as an internal runtime enforcer, providing configuration hardening, proactive threat detection, and continuous behavioral monitoring throughout the execution pipeline. (3) Watcher-based protection introduces a novel, decoupled system-level security middleware that continuously verifies agent state evolution. It enables real-time execution intervention without coupling to the agent's internal logic, supporting operations such as halting high-risk actions or enforcing human confirmation. We argue that this Watcher paradigm holds strong potential to serve as a foundational building block for securing next-generation autonomous agent systems. Extensive qualitative and quantitative evaluations demonstrate the effectiveness and robustness of ClawKeeper across diverse threat scenarios. We release our code.