ChatPaper.ai
Daily AI Research Paper Picks

Daily curated AI research papers with translations

Emerging Properties in Unified Multimodal Pretraining

Chaorui Deng, Deyao Zhu, Kunchang Li, Chenhui Gou, Feng Li, Zeyu Wang, Shu Zhong, Weihao Yu, Xiaonan Nie, Ziang Song, Guang Shi, Haoqi Fan•May 20, 2025•671

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Jintao Zhang, Jia Wei, Pengle Zhang, Xiaoming Xu, Haofeng Huang, Haoxu Wang, Kai Jiang, Jun Zhu, Jianfei Chen•May 16, 2025•341

VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank

Tianhe Wu, Jian Zou, Jie Liang, Lei Zhang, Kede Ma•May 20, 2025•212

Visual Agentic Reinforcement Fine-Tuning

Ziyu Liu, Yuhang Zang, Yushan Zou, Zijian Liang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang•May 20, 2025•191

The Aloe Family Recipe for Open and Specialized Healthcare LLMs

Dario Garcia-Gasulla, Jordi Bayarri-Planas, Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo Agustin Martin-Torres, Marta Gonzalez-Mallo, Sergio Alvarez-Napagao, Eduard Ayguadé-Parra, Ulises Cortés•May 7, 2025•181

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Penghui Qi, Zichen Liu, Tianyu Pang, Chao Du, Wee Sun Lee, Min Lin•May 19, 2025•161

Neurosymbolic Diffusion Models

Emile van Krieken, Pasquale Minervini, Edoardo Ponti, Antonio Vergari•May 19, 2025•161

Latent Flow Transformer

Yen-Chen Wu, Feng-Ting Liao, Meng-Hsi Chen, Pei-Chen Ho, Farhang Nabiei, Da-shan Shiu•May 20, 2025•141

Exploring Federated Pruning for Large Language Models

Pengxin Guo, Yinong Wang, Wei Li, Mengting Liu, Ming Li, Jinkai Zheng, Liangqiong Qu•May 19, 2025•121

Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning

Jiaer Xia, Yuhang Zang, Peng Gao, Yixuan Li, Kaiyang Zhou•May 20, 2025•111

General-Reasoner: Advancing LLM Reasoning Across All Domains

Xueguang Ma, Qian Liu, Dongfu Jiang, Ge Zhang, Zejun Ma, Wenhu Chen•May 20, 2025•111

Reasoning Models Better Express Their Confidence

Dongkeun Yoon, Seungone Kim, Sohee Yang, Sunkyoung Kim, Soyeon Kim, Yongil Kim, Eunbi Choi, Yireun Kim, Minjoon Seo•May 20, 2025•111

Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning

Jiwon Song, Dongwon Jo, Yulhwa Kim, Jae-Joon Kim•May 20, 2025•101

Training-Free Watermarking for Autoregressive Image Generation

Yu Tong, Zihao Pan, Shuai Yang, Kaiyang Zhou•May 20, 2025•91

VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation

Wentao Ma, Weiming Ren, Yiming Jia, Zhuofeng Li, Ping Nie, Ge Zhang, Wenhu Chen•May 20, 2025•91

CS-Sum: A Benchmark for Code-Switching Dialogue Summarization and the Limits of Large Language Models

Sathya Krishnan Suresh, Tanmay Surana, Lim Zhi Hao, Eng Siong Chng•May 19, 2025•92

NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search

Sunhao Dai, Wenjie Wang, Liang Pang, Jun Xu, See-Kiong Ng, Ji-Rong Wen, Tat-Seng Chua•May 20, 2025•81

Think Only When You Need with Large Hybrid-Reasoning Models

Lingjie Jiang, Xun Wu, Shaohan Huang, Qingxiu Dong, Zewen Chi, Li Dong, Xingxing Zhang, Tengchao Lv, Lei Cui, Furu Wei•May 20, 2025•81

Reward Reasoning Model

Jiaxin Guo, Zewen Chi, Li Dong, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei•May 20, 2025•71

Fine-tuning Quantized Neural Networks with Zeroth-order Optimization

Sifeng Shang, Jiayi Zhou, Chenyu Lin, Minxian Li, Kaiyang Zhou•May 19, 2025•71

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Xiaoyu Tian, Yunjie Ji, Haotian Wang, Shuaiting Chen, Sitong Zhao, Yiping Peng, Han Zhao, Xiangang Li•May 20, 2025•61

Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

Ruihuang Li, Caijin Zhou, Shoujian Zheng, Jianxiang Lu, Jiabin Huang, Comi Chen, Junshu Tang, Guangzheng Xu, Jiale Tao, Hongmei Wang, Donghao Li, Wenqing Yu, Senbo Wang, Zhimin Li, Yetshuan Shi, Haoyu Yang, Yukun Wang, Wenxun Dai, Jiaqi Li, Linqing Wang, Qixun Wang, Zhiyong Xu, Yingfang Zhang, Jiangfeng Xiong, Weijie Kong, Chao Zhang, Hongxin Zhang, Qiaoling Zheng, Weiting Guo, Xinchi Deng, Yixuan Li, Renjia Wei, Yulin Jian, Duojun Huang, Xuhua Ren, Sihuan Lin, Yifu Sun, Yuan Zhou, Joey Wang, Qin Lin, Jingmiao Yu, Jihong Zhang, Caesar Zhong, Di Wang, Yuhong Liu, Linus, Jie Jiang, Longhuang Wu, Shuai Shao, Qinglin Lu•May 20, 2025•61

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Yang Liu, Ming Ma, Xiaomin Yu, Pengxiang Ding, Han Zhao, Mingyang Sun, Siteng Huang, Donglin Wang•May 18, 2025•61

Lessons from Defending Gemini Against Indirect Prompt Injections

Chongyang Shi, Sharon Lin, Shuang Song, Jamie Hayes, Ilia Shumailov, Itay Yona, Juliette Pluto, Aneesh Pappu, Christopher A. Choquette-Choo, Milad Nasr, Chawin Sitawarin, Gena Gibson, Andreas Terzis, John "Four" Flynn•May 20, 2025•51

Towards eliciting latent knowledge from LLMs with mechanistic interpretability

Bartosz Cywiński, Emil Ryd, Senthooran Rajamanoharan, Neel Nanda•May 20, 2025•51

Warm Up Before You Train: Unlocking General Reasoning in Resource-Constrained Settings

Safal Shrestha, Minwu Kim, Aadim Nepal, Anubhav Shrestha, Keith Ross•May 19, 2025•51

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

Mengru Wang, Xingyu Chen, Yue Wang, Zhiwei He, Jiahao Xu, Tian Liang, Qiuzhi Liu, Yunzhi Yao, Wenxuan Wang, Ruotian Ma, Haitao Mi, Ningyu Zhang, Zhaopeng Tu, Xiaolong Li, Dong Yu•May 20, 2025•41

Fixing 7,400 Bugs for 1$: Cheap Crash-Site Program Repair

Han Zheng, Ilia Shumailov, Tianqi Fan, Aiden Hall, Mathias Payer•May 19, 2025•41

Truth Neurons

Haohang Li, Yupeng Cao, Yangyang Yu, Jordan W. Suchow, Zining Zhu•May 18, 2025•41

Phare: A Safety Probe for Large Language Models

Pierre Le Jeune, Benoît Malézieux, Weixuan Xiao, Matteo Dora•May 16, 2025•41

MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8

Linbo Liu, Xinle Liu, Qiang Zhou, Lin Chen, Yihan Liu, Hoan Nguyen, Behrooz Omidvar-Tehrani, Xi Shen, Jun Huan, Omer Tripp, Anoop Deoras•May 14, 2025•41

CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition

Nam V. Nguyen, Huy Nguyen, Quang Pham, Van Nguyen, Savitha Ramasamy, Nhat Ho•May 19, 2025•31

Solve-Detect-Verify: Inference-Time Scaling with Flexible Generative Verifier

Jianyuan Zhong, Zeju Li, Zhijian Xu, Xiangyu Wen, Kezhi Li, Qiang Xu•May 17, 2025•31

Tokenization Constraints in LLMs: A Study of Symbolic and Arithmetic Reasoning Limits

Xiang Zhang, Juntai Cao, Jiaqi Wei, Yiwei Xu, Chenyu You•May 20, 2025•21

To Bias or Not to Bias: Detecting bias in News with bias-detector

Himel Ghosh, Ahmed Mosharafa, Georg Groh•May 19, 2025•21

Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection

Yuwei Zhang, Wenhao Yu, Shangbin Feng, Yifan Zhu, Letian Peng, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang•May 18, 2025•21

Masking in Multi-hop QA: An Analysis of How Language Models Perform with Context Permutation

Wenyu Huang, Pavlos Vougiouklis, Mirella Lapata, Jeff Z. Pan•May 16, 2025•11

Incorporating brain-inspired mechanisms for multimodal learning in artificial intelligence

Xiang He, Dongcheng Zhao, Yang Li, Qingqun Kong, Xin Yang, Yi Zeng•May 15, 2025•11

Understanding Gen Alpha Digital Language: Evaluation of LLM Safety Systems for Content Moderation

Manisha Mehta, Fausto Giunchiglia•May 14, 2025•11

Object-Centric Representations Improve Policy Generalization in Robot Manipulation

Alexandre Chapin, Bruno Machado, Emmanuel Dellandrea, Liming Chen•May 16, 2025•01