ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
May 20th, 2025
Chain-of-Model Learning for Language Model
Kaitao Song, Xiaohua Wang, Xu Tan, Huiqiang Jiang, Chengruidong Zhang, Yongliang Shen, Cen LU, Zihao Li, Zifan Song, Caihua Shan, Yansen Wang, Kan Ren, Xiaoqing Zheng, Tao Qin, Yuqing Yang, Dongsheng Li, Lili Qiu
•
May 17, 2025
•
67
2
AdaptThink: Reasoning Models Can Learn When to Think
Jiajie Zhang, Nianyi Lin, Lei Hou, Ling Feng, Juanzi Li
•
May 19, 2025
•
56
1
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning
Chenwei Lou, Zewei Sun, Xinnian Liang, Meng Qu, Wei Shen, Wenqi Wang, Yuntao Li, Qingping Yang, Shuangzhi Wu
•
May 17, 2025
•
43
1
Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction
Jeffrey Willette, Heejun Lee, Sung Ju Hwang
•
May 16, 2025
•
35
1
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Tianbao Xie, Jiaqi Deng, Xiaochuan Li, Junlin Yang, Haoyuan Wu, Jixuan Chen, Wenjing Hu, Xinyuan Wang, Yuhui Xu, Zekun Wang, Yiheng Xu, Junli Wang, Doyen Sahoo, Tao Yu, Caiming Xiong
•
May 19, 2025
•
34
2
Thinkless: LLM Learns When to Think
Gongfan Fang, Xinyin Ma, Xinchao Wang
•
May 19, 2025
•
25
1
Faster Video Diffusion with Trainable Sparse Attention
Peiyuan Zhang, Haofeng Huang, Yongqi Chen, Will Lin, Zhengzhong Liu, Ion Stoica, Eric P. Xing, Hao Zhang
•
May 19, 2025
•
24
1
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space
Hengli Li, Chenxi Li, Tong Wu, Xuekai Zhu, Yuxuan Wang, Zhaoxin Yu, Eric Hanchen Jiang, Song-Chun Zhu, Zixia Jia, Ying Nian Wu, Zilong Zheng
•
May 19, 2025
•
23
3
Model Merging in Pre-training of Large Language Models
Yunshui Li, Yiyuan Ma, Shen Yan, Chaoyi Zhang, Jing Liu, Jianqiao Lu, Ziwen Xu, Mengzhao Chen, Minrui Wang, Shiyi Zhan, Jin Ma, Xunhao Lai, Yao Luo, Xingyan Bin, Hongbin Ren, Mingji Han, Wenhao Hao, Bairen Yi, LingJun Liu, Bole Ma, Xiaoying Jia, Zhou Xun, Liang Xiang, Yonghui Wu
•
May 17, 2025
•
23
4
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision
Lingxiao Du, Fanqing Meng, Zongkai Liu, Zhixiang Zhou, Ping Luo, Qiaosheng Zhang, Wenqi Shao
•
May 19, 2025
•
20
1
Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation
Seungjun Oh, Younggeun Lee, Hyejin Jeon, Eunbyung Park
•
May 19, 2025
•
20
1
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
Seanie Lee, Sangwoo Park, Dong Bok Lee, Dominik Wagner, Haebin Seong, Tobias Bocklet, Juho Lee, Sung Ju Hwang
•
May 19, 2025
•
20
2
CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models
Zongkai Liu, Fanqing Meng, Lingxiao Du, Zhixiang Zhou, Chao Yu, Wenqi Shao, Qiaosheng Zhang
•
May 18, 2025
•
20
1
Fractured Chain-of-Thought Reasoning
Baohao Liao, Hanze Dong, Yuhui Xu, Doyen Sahoo, Christof Monz, Junnan Li, Caiming Xiong
•
May 19, 2025
•
16
1
ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models
Liyan Tang, Grace Kim, Xinyu Zhao, Thom Lake, Wenxuan Ding, Fangcong Yin, Prasann Singhal, Manya Wadhwa, Zeyu Leo Liu, Zayne Sprague, Ramya Namuduri, Bodun Hu, Juan Diego Rodriguez, Puyuan Peng, Greg Durrett
•
May 19, 2025
•
15
2
Neuro-Symbolic Query Compiler
Yuyao Zhang, Zhicheng Dou, Xiaoxi Li, Jiajie Jin, Yongkang Wu, Zhonghua Li, Qi Ye, Ji-Rong Wen
•
May 17, 2025
•
14
2
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
Minghan Chen, Guikun Chen, Wenguan Wang, Yi Yang
•
May 18, 2025
•
13
2
VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning
Yuqi Liu, Tianyuan Qu, Zhisheng Zhong, Bohao Peng, Shu Liu, Bei Yu, Jiaya Jia
•
May 17, 2025
•
13
1
Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images
Elisei Rykov, Kseniia Petrushina, Kseniia Titova, Anton Razzhigaev, Alexander Panchenko, Vasily Konovalov
•
May 12, 2025
•
13
2
ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models
Matteo Merler, Nicola Dainese, Minttu Alakuijala, Giovanni Bonetta, Pietro Ferrazzi, Yu Tian, Bernardo Magnini, Pekka Marttinen
•
May 19, 2025
•
11
1
When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research
Guijin Son, Jiwoo Hong, Honglu Fan, Heejeong Nam, Hyunwoo Ko, Seungwon Lim, Jinyeop Song, Jinha Choi, Gonçalo Paulo, Youngjae Yu, Stella Biderman
•
May 17, 2025
•
8
1
Accelerate TarFlow Sampling with GS-Jacobi Iteration
Ben Liu, Zhen Qin
•
May 19, 2025
•
7
1
R3: Robust Rubric-Agnostic Reward Models
David Anugraha, Zilu Tang, Lester James V. Miranda, Hanyang Zhao, Mohammad Rifqi Farhansyah, Garry Kuwanto, Derry Wijaya, Genta Indra Winata
•
May 19, 2025
•
6
1
Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation
Vincent Koc
•
May 17, 2025
•
6
2
FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance
Dian Shao, Mingfei Shi, Shengda Xu, Haodong Chen, Yongle Huang, Binglu Wang
•
May 19, 2025
•
4
1
MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation
Yanbo Ding, Xirui Hu, Zhizhi Guo, Yali Wang
•
May 15, 2025
•
4
1
ExTrans: Multilingual Deep Reasoning Translation via Exemplar-Enhanced Reinforcement Learning
Jiaan Wang, Fandong Meng, Jie Zhou
•
May 19, 2025
•
3
1
HISTAI: An Open-Source, Large-Scale Whole Slide Image Dataset for Computational Pathology
Dmitry Nechaev, Alexey Pchelnikov, Ekaterina Ivanova
•
May 17, 2025
•
3
1
QVGen: Pushing the Limit of Quantized Video Generative Models
Yushi Huang, Ruihao Gong, Jing Liu, Yifu Ding, Chengtao Lv, Haotong Qin, Jun Zhang
•
May 16, 2025
•
3
1
SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning
Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao
•
May 16, 2025
•
3
1
From Grunts to Grammar: Emergent Language from Cooperative Foraging
Maytus Piriyajitakonkij, Rujikorn Charakorn, Weicheng Tao, Wei Pan, Mingfei Sun, Cheston Tan, Mengmi Zhang
•
May 19, 2025
•
2
1
MedCaseReasoning: Evaluating and learning diagnostic reasoning from clinical case reports
Kevin Wu, Eric Wu, Rahul Thapa, Kevin Wei, Angela Zhang, Arvind Suresh, Jacqueline J. Tao, Min Woo Sun, Alejandro Lozano, James Zou
•
May 16, 2025
•
2
1
HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages
Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Hoo-Chang Shin, Felipe Soares, Alexander Bukharin, Ellie Evans, Yi Dong, Oleksii Kuchaiev
•
May 16, 2025
•
2
1
A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone
Jitai Hao, Qiang Huang, Hao Liu, Xinyan Xiao, Zhaochun Ren, Jun Yu
•
May 19, 2025
•
1
1
LLM Context Conditioning and PWP Prompting for Multimodal Validation of Chemical Formulas
Evgeny Markhasin
•
May 18, 2025
•
1
1
TechniqueRAG: Retrieval Augmented Generation for Adversarial Technique Annotation in Cyber Threat Intelligence Text
Ahmed Lekssays, Utsav Shukla, Husrev Taha Sencar, Md Rizwan Parvez
•
May 17, 2025
•
1
1
Learned Lightweight Smartphone ISP with Unpaired Data
Andrei Arhire, Radu Timofte
•
May 15, 2025
•
1
1
AI-Driven Scholarly Peer Review via Persistent Workflow Prompting, Meta-Prompting, and Meta-Reasoning
Evgeny Markhasin
•
May 6, 2025
•
1
1
Fast, Not Fancy: Rethinking G2P with Rich Data and Rule-Based Models
Mahta Fetrat Qharabagh, Zahra Dehghanian, Hamid R. Rabiee
•
May 19, 2025
•
0
1
Creating General User Models from Computer Use
Omar Shaikh, Shardul Sapkota, Shan Rizvi, Eric Horvitz, Joon Sung Park, Diyi Yang, Michael S. Bernstein
•
May 16, 2025
•
0
1