ChatPaper.ai


Daily AI Research Paper Picks

A daily selection of AI research papers, with translations

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

Alan Arazi, Eilam Shapira, Roi Reichart•May 23, 2025•1004

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Fanqi Wan, Weizhou Shen, Shengyi Liao, Yingcheng Shi, Chenliang Li, Ziyi Yang, Ji Zhang, Fei Huang, Jingren Zhou, Ming Yan•May 23, 2025•733

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Roberto L. Castro, Andrei Panferov, Soroush Tabesh, Oliver Sieberling, Jiale Chen, Mahdi Nikdan, Saleh Ashkboos, Dan Alistarh•May 20, 2025•692

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Minki Kang, Jongwon Jeong, Seanie Lee, Jaewoong Cho, Sung Ju Hwang•May 23, 2025•645

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Doohyuk Jang, Yoonjeon Kim, Chanjae Park, Hyun Ryu, Eunho Yang•May 22, 2025•582

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Yan Ma, Linge Du, Xuyang Shen, Shaoxiang Chen, Pengfei Li, Qibing Ren, Lizhuang Ma, Yuchao Dai, Pengfei Liu, Junjie Yan•May 23, 2025•522

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Hui Shen, Taiqiang Wu, Qi Han, Yunta Hsieh, Jizhou Wang, Yuyue Zhang, Yuxin Cheng, Zijian Hao, Yuansheng Ni, Xin Wang, Zhongwei Wan, Kai Zhang, Wendong Xu, Jing Xiong, Ping Luo, Wenhu Chen, Chaofan Tao, Zhuoqing Mao, Ngai Wong•May 21, 2025•454

QwenLong-CPRS: Towards ∞-LLMs with Dynamic Context Optimization

Weizhou Shen, Chenliang Li, Fanqi Wan, Shengyi Liao, Shaopeng Lai, Bo Zhang, Yingcheng Shi, Yuning Wu, Gang Fu, Zhansheng Li, Bin Yang, Ji Zhang, Fei Huang, Jingren Zhou, Ming Yan•May 23, 2025•393

Scaling Image and Video Generation via Test-Time Evolutionary Search

Haoran He, Jiajun Liang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Ling Pan•May 23, 2025•372

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

Wanhao Liu, Zonglin Yang, Jue Wang, Lidong Bing, Di Zhang, Dongzhan Zhou, Yuqiang Li, Houqiang Li, Erik Cambria, Wanli Ouyang•May 23, 2025•283

Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model

Kwanyoung Kim, Sanghyun Kim•May 23, 2025•273

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

Zigeng Chen, Xinyin Ma, Gongfan Fang, Ruonan Yu, Xinchao Wang•May 23, 2025•222

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

Kai Li, Can Shen, Yile Liu, Jirui Han, Kelong Zheng, Xuechao Zou, Zhe Wang, Xingjian Du, Shun Zhang, Hanjun Luo, Yingbin Jin, Xinxin Xing, Ziyang Ma, Yue Liu, Xiaojun Jia, Yifan Zhang, Junfeng Fang, Kun Wang, Yibo Yan, Haoyang Li, Yiming Li, Xiaobin Zhuang, Yang Liu, Haibo Hu, Zhuo Chen, Zhizheng Wu, Xiaolin Hu, Eng-Siong Chng, XiaoFeng Wang, Wenyuan Xu, Wei Dong, Xinfeng Li•May 22, 2025•172

Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models

Menschikov Mikhail, Alexander Kharitonov, Maiia Kotyga, Vadim Porvatov, Anna Zhukovskaya, David Kagramanyan, Egor Shvetsov, Evgeny Burnaev•May 22, 2025•162

Diffusion Classifiers Understand Compositionality, but Conditions Apply

Yujin Jeong, Arnas Uselis, Seong Joon Oh, Anna Rohrbach•May 23, 2025•143

Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

Shuang Wu, Youtian Lin, Feihu Zhang, Yifei Zeng, Yikang Yang, Yajie Bao, Jiachen Qian, Siyu Zhu, Philip Torr, Xun Cao, Yao Yao•May 23, 2025•142

s3: You Don't Need That Much Data to Train a Search Agent via RL

Pengcheng Jiang, Xueqiang Xu, Jiacheng Lin, Jinfeng Xiao, Zifeng Wang, Jimeng Sun, Jiawei Han•May 20, 2025•142

Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection

Shrey Pandit, Ashwin Vinod, Liu Leqi, Ying Ding•May 23, 2025•132

FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow

Haoyu Sun, Huichen Will Wang, Jiawei Gu, Linjie Li, Yu Cheng•May 23, 2025•132

Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities

Jinyang Wu, Chonghua Liao, Mingkuan Feng, Shuai Zhang, Zhengqi Wen, Pengpeng Shao, Huazhe Xu, Jianhua Tao•May 21, 2025•132

Time-R1: Towards Comprehensive Temporal Reasoning in LLMs

Zijia Liu, Peixuan Han, Haofei Yu, Haoru Li, Jiaxuan You•May 16, 2025•133

Clear Nights Ahead: Towards Multi-Weather Nighttime Image Restoration

Yuetong Liu, Yunqiu Xu, Yang Wei, Xiuli Bi, Bin Xiao•May 22, 2025•112

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

Meng-Hao Guo, Xuanyu Chu, Qianrui Yang, Zhe-Han Mo, Yiqing Shen, Pei-lin Li, Xinjie Lin, Jinnian Zhang, Xin-Sheng Chen, Yi Zhang, Kiyohiro Nakayama, Zhengyang Geng, Houwen Peng, Han Hu, Shi-Nin Hu•May 22, 2025•103

Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning

Shuzheng Si, Haozhe Zhao, Cheng Gao, Yuzhuo Bai, Zhitong Wang, Bofei Gao, Kangyang Luo, Wenhao Li, Yufei Huang, Gang Chen, Fanchao Qi, Minjia Zhang, Baobao Chang, Maosong Sun•May 22, 2025•105

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Xuchen Pan, Yanxi Chen, Yushuo Chen, Yuchang Sun, Daoyuan Chen, Wenhao Zhang, Yuexiang Xie, Yilun Huang, Yilei Zhang, Dawei Gao, Yaliang Li, Bolin Ding, Jingren Zhou•May 23, 2025•92

Speechless: Speech Instruction Training Without Speech for Low Resource Languages

Alan Dao, Dinh Bach Vu, Huy Hoang Ha, Tuan Le Duc Anh, Shreyas Gopal, Yue Heng Yeo, Warren Keng Hoong Low, Eng Siong Chng, Jia Qi Yip•May 23, 2025•92

ScanBot: Towards Intelligent Surface Scanning in Embodied Robotic Systems

Zhiling Chen, Yang Zhang, Fardin Jalil Piran, Qianyu Zhou, Jiong Tang, Farhad Imani•May 22, 2025•92

Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study

DongGeon Lee, Joonwon Jang, Jihae Jeong, Hwanjo Yu•May 21, 2025•82

Synthetic Data RL: Task Definition Is All You Need

Yiduo Guo, Zhen Guo, Chuanwei Huang, Zi-Ang Wang, Zekai Zhang, Haofei Yu, Huishuai Zhang, Yikang Shen•May 18, 2025•82

Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

Jiaru Zou, Yikun Ban, Zihao Li, Yunzhe Qi, Ruizhong Qiu, Ling Yang, Jingrui He•May 22, 2025•62

DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation

Junhao Chen, Mingjin Chen, Jianjin Xu, Xiang Li, Junting Dong, Mingze Sun, Puhua Jiang, Hongxiang Li, Yuhang Yang, Hao Zhao, Xiaoxiao Long, Ruqi Huang•May 23, 2025•52

RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning

Mingrui Wu, Lu Wang, Pu Zhao, Fangkai Yang, Jianjin Zhang, Jianfeng Liu, Yuefeng Zhan, Weihao Han, Hao Sun, Jiayi Ji, Xiaoshuai Sun, Qingwei Lin, Weiwei Deng, Dongmei Zhang, Feng Sun, Qi Zhang, Rongrong Ji•May 23, 2025•52

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Yang Yuan, Quanquan Gu, Andrew C Yao•May 23, 2025•52

Interactive Post-Training for Vision-Language-Action Models

Shuhan Tan, Kairan Dou, Yue Zhao, Philipp Krähenbühl•May 22, 2025•52

ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection

Jeonghye Kim, Sojeong Rhee, Minbeom Kim, Dohyung Kim, Sangmook Lee, Youngchul Sung, Kyomin Jung•May 21, 2025•52

Large Language Models Implicitly Learn to See and Hear Just By Reading

Prateek Verma, Mert Pilanci•May 20, 2025•53

Value-Guided Search for Efficient Chain-of-Thought Reasoning

Kaiwen Wang, Jin Peng Zhou, Jonathan Chang, Zhaolin Gao, Nathan Kallus, Kianté Brantley, Wen Sun•May 23, 2025•32

Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

Jingcong Liang, Siyuan Wang, Miren Tian, Yitong Li, Duyu Tang, Zhongyu Wei•May 21, 2025•32

Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering

Hwan Chang, Yumin Kim, Yonghyun Jun, Hwanhee Lee•May 21, 2025•32

Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks

Giyeong Oh, Woohyun Cho, Siyeol Kim, Suhwan Choi, Younjae Yu•May 17, 2025•32

FREESON: Retriever-Free Retrieval-Augmented Reasoning via Corpus-Traversing MCTS

Chaeeun Kim, Seungone Kim•May 22, 2025•22

Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA

Rishabh Maheshwary, Masoud Hashemi, Khyati Mahajan, Shiva Krishna Reddy Malay, Sai Rajeswar, Sathwik Tejaswi Madhusudhan, Spandana Gella, Vikas Yadav•May 22, 2025•22

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Wei Liu, Siya Qi, Xinyu Wang, Chen Qian, Yali Du, Yulan He•May 21, 2025•24

TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios

Shaohang Wei, Wei Li, Feifan Song, Wen Luo, Tianyi Zhuang, Haochen Tan, Zhijiang Guo, Houfeng Wang•May 19, 2025•22

NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities

Abdellah El Mekki, Houdaifa Atou, Omer Nacar, Shady Shehata, Muhammad Abdul-Mageed•May 23, 2025•11

FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation

Shaolin Zhu, Tianyu Dong, Bo Li, Deyi Xiong•May 20, 2025•12

Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing

Zijie Qiu, Jiaqi Wei, Xiang Zhang, Sheng Xu, Kai Zou, Zhi Jin, Zhiqiang Gao, Nanqing Dong, Siqi Sun•May 23, 2025•02