LLaTiSA: 시각적 인지에서 의미론에 이르는 난이도 계층화 시계열 추론을 향하여

초록

대규모 언어 모델(LLM)의 시계열 데이터에 대한 포괄적 이해는 여전히 큰 과제로 남아 있습니다. 현재 연구는 단편적인 과제 정의와 본질적 모호성을 지닌 벤치마크로 인해 제한되어 있어, 엄격한 평가와 통합적 시계열 추론 모델(TSRM) 개발을 저해하고 있습니다. 이러한 격차를 해소하기 위해 본 연구는 인지적 복잡성이 증가하는 4단계 분류 체계를 통해 시계열 추론(TSR)을 체계화합니다. 우리는 다양한 과제 조합과 검증된 사고 연쇄(CoT) 경로를 포함한 83,000개 샘플로 구성된 계층적 시계열 추론 데이터셋인 HiTSR을 소개합니다. HiTSR을 활용하여 시각-언어 모델(VLM)의 시간적 인식을 강화하기 위해 시각화된 패턴과 정밀 보정된 수치 테이블을 통합한 강력한 TSRM인 LLaTiSA를 제안합니다. 다단계 교욱 과정 미세 조정 전략을 통해 LLaTiSA는 우수한 성능을 달성하고 다양한 TSR 과제와 실제 시나리오에서 견고한 분포 외 일반화 능력을 보여줍니다. 우리의 코드는 https://github.com/RainingNovember/LLaTiSA에서 확인할 수 있습니다.

English

Comprehensive understanding of time series remains a significant challenge for Large Language Models (LLMs). Current research is hindered by fragmented task definitions and benchmarks with inherent ambiguities, precluding rigorous evaluation and the development of unified Time Series Reasoning Models(TSRMs). To bridge this gap, we formalize Time Series Reasoning (TSR) via a four-level taxonomy of increasing cognitive complexity. We introduce HiTSR, a hierarchical time series reasoning dataset comprising 83k samples with diverse task combinations and verified Chain-of-Thought (CoT) trajectories. Leveraging HiTSR, we propose LLaTiSA, a strong TSRM that integrates visualized patterns with precision-calibrated numerical tables to enhance the temporal perception of Vision-Language Models (VLMs). Through a multi-stage curriculum fine-tuning strategy, LLaTiSA achieves superior performance and exhibits robust out-of-distribution generalization across diverse TSR tasks and real-world scenarios. Our code is available at https://github.com/RainingNovember/LLaTiSA.

LLaTiSA: 시각적 인지에서 의미론에 이르는 난이도 계층화 시계열 추론을 향하여

LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics

초록

Support