ChatPaper.aiChatPaper

### 半自主形式化的弗拉索夫-麦克斯韦-朗道平衡系统

Semi-Autonomous Formalization of the Vlasov-Maxwell-Landau Equilibrium

March 16, 2026
作者: Vasily Ilin
cs.AI

摘要

我们完成了Vlasov-Maxwell-Landau(VML)系统中平衡态特征的完整Lean 4形式化,该系统描述了带电等离子体的运动。本项目展示了全流程AI辅助数学研究闭环:AI推理模型(Gemini DeepThink)根据猜想生成证明,智能编码工具(Claude Code)通过自然语言指令将其转化为Lean代码,专用证明器(Aristotle)完成了111条引理的证明,最终由Lean内核验证结果。整个流程由一名数学家耗时10天监督完成,成本200美元,且未编写任何代码。 项目开发过程完全公开:所有229条人类指令与213次git提交均存档于代码库。我们详细总结了AI的失效模式(假设蔓延、定义对齐错误、智能体规避行为)与成功经验(抽象/具体证明分离、对抗性自审、人类对关键定义与定理陈述的评审作用)。值得注意的是,形式化工作先于对应数学论文终稿的完成。
English
We present a complete Lean 4 formalization of the equilibrium characterization in the Vlasov-Maxwell-Landau (VML) system, which describes the motion of charged plasma. The project demonstrates the full AI-assisted mathematical research loop: an AI reasoning model (Gemini DeepThink) generated the proof from a conjecture, an agentic coding tool (Claude Code) translated it into Lean from natural-language prompts, a specialized prover (Aristotle) closed 111 lemmas, and the Lean kernel verified the result. A single mathematician supervised the process over 10 days at a cost of \$200, writing zero lines of code. The entire development process is public: all 229 human prompts, and 213 git commits are archived in the repository. We report detailed lessons on AI failure modes -- hypothesis creep, definition-alignment bugs, agent avoidance behaviors -- and on what worked: the abstract/concrete proof split, adversarial self-review, and the critical role of human review of key definitions and theorem statements. Notably, the formalization was completed before the final draft of the corresponding math paper was finished.
PDF122March 19, 2026