평형 추론기: 학습된 끌개를 통한 확장 가능한 추론

초록

잠재 상태를 반복적으로 업데이트하는 방식으로 테스트 시간 연산을 확장하는 것은 추론을 위한 강력한 패러다임으로 부상하고 있다. 그러나 이러한 반복 모델이 암기된 패턴을 넘어 일반화할 수 있게 하는 내부 메커니즘은 여전히 불분명하다. 본 연구에서는 일반화 가능한 추론이 과제 조건부 어트랙터(attractor), 즉 안정적인 고정점이 유효한 해결책에 해당하는 잠재 동역학계를 학습함으로써 발생한다고 가정한다. 우리는 이 과정을 평형 추론기(Equilibrium Reasoner, EqR)를 통해 공식화하며, 이는 외부 검증기나 과제 특화 사전 정보 없이도 테스트 시간 확장을 가능하게 한다. EqR은 내부 동역학을 두 축을 따라 확장한다: 더 많은 반복을 실행함으로써 깊이를, 그리고 여러 초기화로부터의 확률적 궤적을 집계함으로써 폭을 확장한다. 실험적으로, 테스트 시간 확장으로 인한 이득은 해결책 정렬 어트랙터로의 더 강한 수렴과 밀접하게 연관되어 있다. 이러한 어트랙터 관점은 신경망이 과제 난이도에 따라 테스트 시간 연산을 적응적으로 할당할 수 있게 한다. 간단한 경우는 1~5회 반복 단계 내에 수렴하는 반면, 더 어려운 경우는 대규모 테스트 시간 확장의 이점을 얻는다. 최대 40,000개 층에 해당하는 전개를 통해, 확장 가능한 잠재 추론은 순방향 모델의 2.6% 정확도를 Sudoku-Extreme에서 99% 이상으로 향상시킨다. 이러한 결과는 학습된 어트랙터 경관이 반복적 잠재 모델에서의 확장 가능한 추론을 이해하기 위한 유용한 메커니즘적 렌즈를 제공함을 시사한다.

English

Scaling test-time compute by iteratively updating a latent state has emerged as a powerful paradigm for reasoning. Yet the internal mechanisms that enable these iterative models to generalize beyond memorized patterns remain unclear. We hypothesize that generalizable reasoning arises from learning task-conditioned attractors: latent dynamical systems whose stable fixed points correspond to valid solutions. We formalize this process through Equilibrium Reasoners (EqR), which enable test-time scaling without external verifiers or task-specific priors. EqR scales internal dynamics along two axes: depth, by running more iterations, and breadth, by aggregating stochastic trajectories from multiple initializations. Empirically, gains from test-time scaling are tightly coupled with stronger convergence toward solution-aligned attractors. This attractor perspective allows neural networks to adaptively allocate test-time compute based on task difficulty. While simple cases converge within 1 to 5 iteration steps, harder cases benefit from massive test-time scaling. By unrolling up to the equivalent of 40,000 layers, scalable latent reasoning boosts accuracy from 2.6% for feedforward models to over 99% on Sudoku-Extreme. These results suggest that learned attractor landscapes provide a useful mechanistic lens for understanding scalable reasoning in iterative latent models.