주의해야 할 변화: 대규모 언어 모델을 활용한 FOMC 성명서의 통화정책 기조 해독

초록

연방공개시장위원회(FOMC) 성명은 통화정책 정보의 주요 원천이며, 그 문장 구성의 미세한 변화조차도 글로벌 금융 시장을 움직일 수 있습니다. 따라서 이 텍스트가 전달하는 강경-완화 스탠스를 측정하는 것이 핵심 과제입니다. 기존 접근법은 일반적으로 스탠스 탐지를 표준 분류 문제로 취급하며 각 성명을 개별적으로 라벨링합니다. 그러나 통화정책 커뮤니케이션의 해석은 본질적으로 상대적입니다: 시장 반응은 성명의 어조뿐만 아니라 회의 간 그 어조가 어떻게 변화하는지에도 의존합니다. 본 연구에서는 절대적 스탠스와 회의 간 상대적 변화를 함께 모델링하여 고정된 대형 언어 모델(LLM) 표현을 연속적인 스탠스 점수로 매핑하는 라벨링 불필요 프레임워크인 델타 일관성 스코어링(DCS)을 소개합니다. DCS는 수동으로 붙인 강경-완화 라벨에 의존하기보다 연속적인 회의를 자기 지도 학습의 원천으로 활용합니다. 이는 각 성명에 대한 절대적 스탠스 점수와 연속 성명 간의 상대적 변화 점수를 학습합니다. 델타 일관성 목적 함수는 절대 점수의 변화가 상대적 변화와 일관되도록 유도합니다. 이를 통해 DCS는 수동 라벨 없이도 시간적으로 일관된 스탠스 궤적을 복원할 수 있습니다. 4가지 LLM 백본에 걸쳐 DCS는 지도형 탐지기(supervised probe) 및 LLM-as-judge 기준선을 지속적으로 능가하며, 문장 수준 강경-완화 분류에서 최대 71.1%의 정확도를 달성했습니다. 그 결과 도출된 회의 수준 점수는 경제적으로도 의미가 있습니다: 이는 인플레이션 지표와 강한 상관관계를 보이며, 국채 수익률 변동과도 유의미하게 연관되었습니다. 전반적으로, 이러한 결과는 LLM 표현이 상대적 시간 구조를 통해 복원 가능한 통화정책 신호를 내포하고 있음을 시사합니다.

English

Federal Open Market Committee (FOMC) statements are a major source of monetary-policy information, and even subtle changes in their wording can move global financial markets. A central task is therefore to measure the hawkish--dovish stance conveyed in these texts. Existing approaches typically treat stance detection as a standard classification problem, labeling each statement in isolation. However, the interpretation of monetary-policy communication is inherently relative: market reactions depend not only on the tone of a statement, but also on how that tone shifts across meetings. We introduce Delta-Consistent Scoring (DCS), an annotation-free framework that maps frozen large language model (LLM) representations to continuous stance scores by jointly modeling absolute stance and relative inter-meeting shifts. Rather than relying on manual hawkish--dovish labels, DCS uses consecutive meetings as a source of self-supervision. It learns an absolute stance score for each statement and a relative shift score between consecutive statements. A delta-consistency objective encourages changes in absolute scores to align with the relative shifts. This allows DCS to recover a temporally coherent stance trajectory without manual labels. Across four LLM backbones, DCS consistently outperforms supervised probes and LLM-as-judge baselines, achieving up to 71.1% accuracy on sentence-level hawkish--dovish classification. The resulting meeting-level scores are also economically meaningful: they correlate strongly with inflation indicators and are significantly associated with Treasury yield movements. Overall, the results suggest that LLM representations encode monetary-policy signals that can be recovered through relative temporal structure.

주의해야 할 변화: 대규모 언어 모델을 활용한 FOMC 성명서의 통화정책 기조 해독

Mind the Shift: Decoding Monetary Policy Stance from FOMC Statements with Large Language Models

초록

Support