AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

Abstract

We introduce the AI co-mathematician, a workbench that lets mathematicians interactively use AI agents to pursue open-ended research. The AI co-mathematician is optimized to support the exploratory and iterative reality of mathematical workflows as a whole, including ideation, literature search, computational exploration, theorem proving, and theory building. By providing an asynchronous, stateful workspace that manages uncertainty, refines user intent, tracks failed hypotheses, and outputs native mathematical artifacts, the system mirrors human collaborative workflows. In early tests, the AI co-mathematician helped researchers solve open problems, identify new research directions, and uncover overlooked literature references. Besides demonstrating a highly interactive paradigm for AI-assisted mathematical discovery, the AI co-mathematician also achieves state-of-the-art results on hard problem-solving benchmarks, including a score of 48% on FrontierMath Tier 4, a new high score among all AI systems evaluated.
