注意シフト：大規模言語モデルを用いたFOMC声明からの金融政策スタンスの解読

要旨

連邦公開市場委員会（FOMC）声明は金融政策情報の主要な情報源であり、その文言の微妙な変化さえも世界の金融市場を動かし得る。したがって、これらのテキストが伝えるタカ派的・ハト派的なスタンスを測定することが中心的な課題となる。既存のアプローチでは通常、スタンス検出を標準的な分類問題として扱い、各声明を個別にラベル付けする。しかし、金融政策コミュニケーションの解釈は本質的に相対的である：市場反応は声明のトーンだけでなく、会合間でのトーンの変化の仕方にも依存する。本研究では、デルタ一貫性スコアリング（DCS）を提案する。これは、絶対的スタンスと相対的な会合間の変化を同時にモデル化することで、固定化された大規模言語モデル（LLM）の表現を連続的なスタンススコアに写像する、教師データを必要としないフレームワークである。DCSは手動によるタカ派・ハト派ラベルに依存する代わりに、連続する会合を自己教師学習の情報源として利用する。各声明に対する絶対的スタンススコアと、連続する声明間の相対的変化スコアを学習する。デルタ一貫性目的関数は、絶対的スコアの変化が相対的変化と整合することを促す。これにより、DCSは手動ラベルなしで時間的に一貫したスタンスの軌跡を復元することが可能となる。4つのLLMバックボーンにおいて、DCSは教師ありプローブやLLM-as-judgeベースラインを一貫して上回り、文レベルのタカ派・ハト派分類で最大71.1%の精度を達成した。また、得られた会合レベルのスコアは経済的にも意味があり、インフレ指標と強く相関し、国債利回りの変動と有意に関連していた。全体として、これらの結果は、LLMの表現が、相対的な時間構造を通じて復元可能な金融政策シグナルを内包していることを示唆している。

English

Federal Open Market Committee (FOMC) statements are a major source of monetary-policy information, and even subtle changes in their wording can move global financial markets. A central task is therefore to measure the hawkish--dovish stance conveyed in these texts. Existing approaches typically treat stance detection as a standard classification problem, labeling each statement in isolation. However, the interpretation of monetary-policy communication is inherently relative: market reactions depend not only on the tone of a statement, but also on how that tone shifts across meetings. We introduce Delta-Consistent Scoring (DCS), an annotation-free framework that maps frozen large language model (LLM) representations to continuous stance scores by jointly modeling absolute stance and relative inter-meeting shifts. Rather than relying on manual hawkish--dovish labels, DCS uses consecutive meetings as a source of self-supervision. It learns an absolute stance score for each statement and a relative shift score between consecutive statements. A delta-consistency objective encourages changes in absolute scores to align with the relative shifts. This allows DCS to recover a temporally coherent stance trajectory without manual labels. Across four LLM backbones, DCS consistently outperforms supervised probes and LLM-as-judge baselines, achieving up to 71.1% accuracy on sentence-level hawkish--dovish classification. The resulting meeting-level scores are also economically meaningful: they correlate strongly with inflation indicators and are significantly associated with Treasury yield movements. Overall, the results suggest that LLM representations encode monetary-policy signals that can be recovered through relative temporal structure.

注意シフト：大規模言語モデルを用いたFOMC声明からの金融政策スタンスの解読

Mind the Shift: Decoding Monetary Policy Stance from FOMC Statements with Large Language Models

要旨

Support