認知ペナルティ：分散型合意形成におけるエッジネイティブSLMのシステム1およびシステム2推論の除去

要旨

分散型自律組織（DAOs）は、提案を審査し意味的ソーシャルエンジニアリングを緩和するためのエッジネイティブな憲法ファイアウォールとして、小型言語モデル（SLMs）の採用を検討しつつある。推論時計算量（システム2）のスケーリングは形式的論理を強化するが、高度に敵対的な暗号経済ガバナンス環境におけるその有効性は未解明のままである。この問題に対処するため、我々はQwen-3.5-9Bに対して厳密なモデル内アブレーションを実行する840推論からなる実証フレームワーク「Sentinel-Bench」を開発した。凍結された重み間での潜在推論を切り替えることで、敵対的Optimism DAOデータセットに対する推論時計算量の影響を分離した。その結果、深刻な計算量-精度逆転現象を明らかにした。自己回帰ベースライン（システム1）は、13秒未満で100%の敵対的頑健性、100%の法的一貫性、状態確定性を達成した。一方、システム2推論は26.7%の推論非収束（認知的崩壊）率を主要因として破滅的な不安定性を導入した。この崩壊は試行間合意安定性を72.6%に劣化させ、17倍の遅延オーバーヘッドを課し、ガバナンス抽出可能価値（GEV）とハードウェア集中化に対する重大な脆弱性をもたらした。稀ではあるが（敵対的試行の1.5%）、「推論誘発性追従現象」を実証的に捕捉した。これはモデルが敵対的罠への失敗を合理化するために極めて長い内的独白（平均25,750文字）を生成する現象である。ビザンチン故障耐性（BFT）制約下で動作するエッジネイティブSLMsにおいては、分散型合意形成にはシステム2の反復的審議よりもシステム1のパラメータ化された直感が構造的・経済的に優位であると結論付ける。コードとデータセット：https://github.com/smarizvi110/sentinel-bench

English

Decentralized Autonomous Organizations (DAOs) are inclined explore Small Language Models (SLMs) as edge-native constitutional firewalls to vet proposals and mitigate semantic social engineering. While scaling inference-time compute (System 2) enhances formal logic, its efficacy in highly adversarial, cryptoeconomic governance environments remains underexplored. To address this, we introduce Sentinel-Bench, an 840-inference empirical framework executing a strict intra-model ablation on Qwen-3.5-9B. By toggling latent reasoning across frozen weights, we isolate the impact of inference-time compute against an adversarial Optimism DAO dataset. Our findings reveal a severe compute-accuracy inversion. The autoregressive baseline (System 1) achieved 100% adversarial robustness, 100% juridical consistency, and state finality in under 13 seconds. Conversely, System 2 reasoning introduced catastrophic instability, fundamentally driven by a 26.7% Reasoning Non-Convergence (cognitive collapse) rate. This collapse degraded trial-to-trial consensus stability to 72.6% and imposed a 17x latency overhead, introducing critical vulnerabilities to Governance Extractable Value (GEV) and hardware centralization. While rare (1.5% of adversarial trials), we empirically captured "Reasoning-Induced Sycophancy," where the model generated significantly longer internal monologues (averaging 25,750 characters) to rationalize failing the adversarial trap. We conclude that for edge-native SLMs operating under Byzantine Fault Tolerance (BFT) constraints, System 1 parameterized intuition is structurally and economically superior to System 2 iterative deliberation for decentralized consensus. Code and Dataset: https://github.com/smarizvi110/sentinel-bench

認知ペナルティ：分散型合意形成におけるエッジネイティブSLMのシステム1およびシステム2推論の除去

The Cognitive Penalty: Ablating System 1 and System 2 Reasoning in Edge-Native SLMs for Decentralized Consensus

要旨

Support