Diepgaand Tabelonderzoek via Continue Ervaringsgestuurde Uitvoering

Samenvatting

Grote taalmodellen worstelen vaak met complexe analytische taken met een lange horizon over ongestructureerde tabellen, die doorgaans hiërarchische en bidirectionele koppen en niet-standaard lay-outs hebben. Wij formaliseren deze uitdaging als Deep Tabular Research (DTR), wat multi-stap redeneren vereist over onderling afhankelijke tabelregio's. Om DTR aan te pakken, stellen we een nieuw agent-gebaseerd raamwerk voor dat tabelredenering behandelt als een closed-loop besluitvormingsproces. We ontwerpen zorgvuldig een gekoppeld vraag- en tabelbegrip voor padbesluitvorming en operationele uitvoering. Specifiek: (i) DTR construeert eerst een hiërarchische metagraaf om bidirectionele semantiek vast te leggen, waarbij natuurlijke taalvragen worden gemapt naar een operationele zoekruimte; (ii) Om door deze ruimte te navigeren, introduceren we een verwachtingsbewust selectiebeleid dat uitvoeringspaden met hoge bruikbaarheid prioriteert; (iii) Cruciaal is dat historische uitvoeringsresultaten worden samengevoegd tot een siamese gestructureerd geheugen, d.w.z. geparametriseerde updates en geabstraheerde teksten, wat continue verfijning mogelijk maakt. Uitgebreide experimenten op uitdagende ongestructureerde tabelbenchmarks verifiëren de effectiviteit en benadrukken de noodzaak om strategische planning te scheiden van low-level uitvoering voor tabelredenering met een lange horizon.

English

Large language models often struggle with complex long-horizon analytical tasks over unstructured tables, which typically feature hierarchical and bidirectional headers and non-canonical layouts. We formalize this challenge as Deep Tabular Research (DTR), requiring multi-step reasoning over interdependent table regions. To address DTR, we propose a novel agentic framework that treats tabular reasoning as a closed-loop decision-making process. We carefully design a coupled query and table comprehension for path decision making and operational execution. Specifically, (i) DTR first constructs a hierarchical meta graph to capture bidirectional semantics, mapping natural language queries into an operation-level search space; (ii) To navigate this space, we introduce an expectation-aware selection policy that prioritizes high-utility execution paths; (iii) Crucially, historical execution outcomes are synthesized into a siamese structured memory, i.e., parameterized updates and abstracted texts, enabling continual refinement. Extensive experiments on challenging unstructured tabular benchmarks verify the effectiveness and highlight the necessity of separating strategic planning from low-level execution for long-horizon tabular reasoning.

Diepgaand Tabelonderzoek via Continue Ervaringsgestuurde Uitvoering

Deep Tabular Research via Continual Experience-Driven Execution

Samenvatting

Support