思考の風景：大規模言語モデルの推論プロセスの可視化

要旨

大規模言語モデル（LLMs）の数多くの応用は、段階的な推論を実行する能力に依存しています。しかし、LLMsの推論行動は十分に理解されておらず、研究、開発、安全性において課題を生んでいます。このギャップを埋めるため、我々は「ランドスケープ・オブ・シンクス」を導入します。これは、任意の多肢選択データセットにおけるチェーン・オブ・シンクス（CoT）およびその派生手法の推論パスをユーザーが視覚的に検査するための初めてのツールです。具体的には、推論パス内の状態を、すべての回答選択肢までの距離を定量化する特徴ベクトルとして表現します。これらの特徴は、t-SNEを用いて二次元プロットに可視化されます。ランドスケープ・オブ・シンクスを用いた定性的および定量的分析は、強力なモデルと弱いモデル、正解と不正解、異なる推論タスクを効果的に区別します。また、一貫性の低さや不確実性の高さといった望ましくない推論パターンも明らかにします。さらに、ユーザーはこのツールを、観察したい特性を予測するモデルに適応させることができます。我々はこの利点を、推論パスの正しさを評価する軽量な検証器にツールを適応させることで示します。コードは以下のURLで公開されています：https://github.com/tmlr-group/landscape-of-thoughts。

English

Numerous applications of large language models (LLMs) rely on their ability to perform step-by-step reasoning. However, the reasoning behavior of LLMs remains poorly understood, posing challenges to research, development, and safety. To address this gap, we introduce landscape of thoughts-the first visualization tool for users to inspect the reasoning paths of chain-of-thought and its derivatives on any multi-choice dataset. Specifically, we represent the states in a reasoning path as feature vectors that quantify their distances to all answer choices. These features are then visualized in two-dimensional plots using t-SNE. Qualitative and quantitative analysis with the landscape of thoughts effectively distinguishes between strong and weak models, correct and incorrect answers, as well as different reasoning tasks. It also uncovers undesirable reasoning patterns, such as low consistency and high uncertainty. Additionally, users can adapt our tool to a model that predicts the property they observe. We showcase this advantage by adapting our tool to a lightweight verifier that evaluates the correctness of reasoning paths. The code is publicly available at: https://github.com/tmlr-group/landscape-of-thoughts.