### 半自主形式化的弗拉索夫-麦克斯韦-朗道平衡系统

摘要

我们完成了Vlasov-Maxwell-Landau（VML）系统中平衡态特征的完整Lean 4形式化，该系统描述了带电等离子体的运动。本项目展示了全流程AI辅助数学研究闭环：AI推理模型（Gemini DeepThink）根据猜想生成证明，智能编码工具（Claude Code）通过自然语言指令将其转化为Lean代码，专用证明器（Aristotle）完成了111条引理的证明，最终由Lean内核验证结果。整个流程由一名数学家耗时10天监督完成，成本200美元，且未编写任何代码。项目开发过程完全公开：所有229条人类指令与213次git提交均存档于代码库。我们详细总结了AI的失效模式（假设蔓延、定义对齐错误、智能体规避行为）与成功经验（抽象/具体证明分离、对抗性自审、人类对关键定义与定理陈述的评审作用）。值得注意的是，形式化工作先于对应数学论文终稿的完成。

English

We present a complete Lean 4 formalization of the equilibrium characterization in the Vlasov-Maxwell-Landau (VML) system, which describes the motion of charged plasma. The project demonstrates the full AI-assisted mathematical research loop: an AI reasoning model (Gemini DeepThink) generated the proof from a conjecture, an agentic coding tool (Claude Code) translated it into Lean from natural-language prompts, a specialized prover (Aristotle) closed 111 lemmas, and the Lean kernel verified the result. A single mathematician supervised the process over 10 days at a cost of \$200, writing zero lines of code. The entire development process is public: all 229 human prompts, and 213 git commits are archived in the repository. We report detailed lessons on AI failure modes -- hypothesis creep, definition-alignment bugs, agent avoidance behaviors -- and on what worked: the abstract/concrete proof split, adversarial self-review, and the critical role of human review of key definitions and theorem statements. Notably, the formalization was completed before the final draft of the corresponding math paper was finished.

### 半自主形式化的弗拉索夫-麦克斯韦-朗道平衡系统

Semi-Autonomous Formalization of the Vlasov-Maxwell-Landau Equilibrium

摘要

Support