Analysing Chain-of-Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

Abstract

Recent work has demonstrated that Chain-of-Thought (CoT) often yields limited gains for soft-reasoning problems such as analytical and commonsense reasoning. CoT can also be unfaithful to a model's actual reasoning. We investigate the dynamics and faithfulness of CoT in soft-reasoning tasks across instruction-tuned, reasoning, and reasoning-distilled models. Our findings reveal differences in how these models rely on CoT, and show that CoT influence and faithfulness are not always aligned.