AutoMIA：エージェント的自律探索によるメンバーシップ推論攻撃のベースライン改善

要旨

メンバーシップ推論攻撃（MIA）は、機械学習モデルにおける訓練データ漏洩の評価を目的とした、基本的な監査ツールとして機能する。しかし、既存の手法は主に静的な人手設計のヒューリスティクスに依存しており、適応性に欠けるため、異なる大規模モデル間で転用した際にしばしば最適ではない性能となる。本研究では、メンバーシップ推論を自己探索と戦略進化の自動化プロセスとして再定義する、エージェント型フレームワークであるAutoMIAを提案する。高水準のシナリオ仕様が与えられると、AutoMIAは実行可能なロジットレベル戦略を生成し、閉ループ評価フィードバックを通じて段階的にそれらを洗練させることで、攻撃空間を自己探索する。抽象的な戦略推論を低水準の実行から分離することにより、本フレームワークはモデルに依存しない体系的な攻撃探索空間の走査を実現する。大規模な実験により、AutoMIAが手動の特徴量設計を不要としつつ、既存の最先端ベースライン手法に匹敵する、またはそれを上回る性能を一貫して発揮することを実証する。

English

Membership Inference Attacks (MIAs) serve as a fundamental auditing tool for evaluating training data leakage in machine learning models. However, existing methodologies predominantly rely on static, handcrafted heuristics that lack adaptability, often leading to suboptimal performance when transferred across different large models. In this work, we propose AutoMIA, an agentic framework that reformulates membership inference as an automated process of self-exploration and strategy evolution. Given high-level scenario specifications, AutoMIA self-explores the attack space by generating executable logits-level strategies and progressively refining them through closed-loop evaluation feedback. By decoupling abstract strategy reasoning from low-level execution, our framework enables a systematic, model-agnostic traversal of the attack search space. Extensive experiments demonstrate that AutoMIA consistently matches or outperforms state-of-the-art baselines while eliminating the need for manual feature engineering.

AutoMIA：エージェント的自律探索によるメンバーシップ推論攻撃のベースライン改善

AutoMIA: Improved Baselines for Membership Inference Attack via Agentic Self-Exploration

要旨

Support