ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
June 10th, 2025
强化预训练
Reinforcement Pre-Training
Qingxiu Dong, Li Dong, Yao Tang, Tianzhu Ye, Yutao Sun, Zhifang Sui, Furu Wei
•
Jun 9, 2025
•
143
4
灵枢:面向统一多模态医学理解与推理的通用基础模型
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
LASA Team, Weiwen Xu, Hou Pong Chan, Long Li, Mahani Aljunied, Ruifeng Yuan, Jianyu Wang, Chenghao Xiao, Guizhen Chen, Chaoqun Liu, Zhaodonghui Li, Yu Sun, Junao Shen, Chaojun Wang, Jie Tan, Deli Zhao, Tingyang Xu, Hao Zhang, Yu Rong
•
Jun 8, 2025
•
80
2
Saffron-1:迈向大语言模型安全保证的推理扩展范式
Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance
Ruizhong Qiu, Gaotang Li, Tianxin Wei, Jingrui He, Hanghang Tong
•
Jun 6, 2025
•
60
1
MiniCPM4:终端设备上的超高效大型语言模型
MiniCPM4: Ultra-Efficient LLMs on End Devices
MiniCPM Team, Chaojun Xiao, Yuxuan Li, Xu Han, Yuzhuo Bai, Jie Cai, Haotian Chen, Wentong Chen, Xin Cong, Ganqu Cui, Ning Ding, Shengdan Fan, Yewei Fang, Zixuan Fu, Wenyu Guan, Yitong Guan, Junshao Guo, Yufeng Han, Bingxiang He, Yuxiang Huang, Cunliang Kong, Qiuzuo Li, Siyuan Li, Wenhao Li, Yanghao Li, Yishan Li, Zhen Li, Dan Liu, Biyuan Lin, Yankai Lin, Xiang Long, Quanyu Lu, Yaxi Lu, Peiyan Luo, Hongya Lyu, Litu Ou, Yinxu Pan, Zekai Qu, Qundong Shi, Zijun Song, Jiayuan Su, Zhou Su, Ao Sun, Xianghui Sun, Peijun Tang, Fangzheng Wang, Feng Wang, Shuo Wang, Yudong Wang, Yesai Wu, Zhenyu Xiao, Jie Xie, Zihao Xie, Yukun Yan, Jiarui Yuan, Kaihuo Zhang, Lei Zhang, Linyue Zhang, Xueren Zhang, Yudi Zhang, Hengyu Zhao, Weilin Zhao, Weilun Zhao, Yuanqian Zhao, Zhi Zheng, Ge Zhou, Jie Zhou, Wei Zhou, Zihan Zhou, Zixuan Zhou, Zhiyuan Liu, Guoyang Zeng, Chao Jia, Dahai Li, Maosong Sun
•
Jun 9, 2025
•
55
1
OneIG-Bench:面向图像生成的全维度精细评估基准
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation
Jingjing Chang, Yixiao Fang, Peng Xing, Shuhan Wu, Wei Cheng, Rui Wang, Xianfang Zeng, Gang Yu, Hai-Bao Chen
•
Jun 9, 2025
•
37
1
SpatialLM:面向结构化室内建模的大规模语言模型训练
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Yongsen Mao, Junhao Zhong, Chuan Fang, Jia Zheng, Rui Tang, Hao Zhu, Ping Tan, Zihan Zhou
•
Jun 9, 2025
•
29
1
图像重建作为特征分析的工具
Image Reconstruction as a Tool for Feature Analysis
Eduard Allakhverdov, Dmitrii Tarasov, Elizaveta Goncharova, Andrey Kuznetsov
•
Jun 9, 2025
•
24
1
Astra:通过分层多模态学习迈向通用移动机器人
Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning
Sheng Chen, Peiyu He, Jiaxin Hu, Ziyang Liu, Yansheng Wang, Tao Xu, Chi Zhang, Chongchong Zhang, Chao An, Shiyu Cai, Duo Cao, Kangping Chen, Shuai Chu, Tianwei Chu, Mingdi Dan, Min Du, Weiwei Fang, Pengyou Fu, Junkai Hu, Xiaowei Jiang, Zhaodi Jiang, Fuxuan Li, Jun Li, Minghui Li, Mingyao Li, Yanchang Li, Zhibin Li, Guangming Liu, Kairui Liu, Lihao Liu, Weizhi Liu, Xiaoshun Liu, Yufei Liu, Yunfei Liu, Qiang Lu, Yuanfei Luo, Xiang Lv, Hongying Ma, Sai Ma, Lingxian Mi, Sha Sa, Hongxiang Shu, Lei Tian, Chengzhi Wang, Jiayu Wang, Kaijie Wang, Qingyi Wang, Renwen Wang, Tao Wang, Wei Wang, Xirui Wang, Chao Wei, Xuguang Wei, Zijun Xia, Zhaohao Xiao, Tingshuai Yan, Liyan Yang, Yifan Yang, Zhikai Yang, Zhong Yin, Li Yuan, Liuchun Yuan, Chi Zhang, Jinyang Zhang, Junhui Zhang, Linge Zhang, Zhenyi Zhang, Zheyu Zhang, Dongjie Zhu, Hang Li, Yangang Zhang
•
Jun 6, 2025
•
20
1
预训练的大型语言模型在上下文中学习隐马尔可夫模型
Pre-trained Large Language Models Learn Hidden Markov Models In-context
Yijia Dai, Zhaolin Gao, Yahya Satter, Sarah Dean, Jennifer J. Sun
•
Jun 8, 2025
•
16
2
重新思考多模态扩散变换器中的跨模态交互
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
Zhengyao Lv, Tianlin Pan, Chenyang Si, Zhaoxi Chen, Wangmeng Zuo, Ziwei Liu, Kwan-Yee K. Wong
•
Jun 9, 2025
•
14
1
GTR-CoT:图遍历作为分子结构识别的视觉思维链
GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
Jingchao Wang, Haote Yang, Jiang Wu, Yifan He, Xingjian Wei, Yinfan Wang, Chengjin Liu, Lingli Ge, Lijun Wu, Bin Wang, Dahua Lin, Conghui He
•
Jun 9, 2025
•
12
1
BitVLA:面向机器人操作的1比特视觉-语言-动作模型
BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation
Hongyu Wang, Chuyan Xiong, Ruiping Wang, Xilin Chen
•
Jun 9, 2025
•
12
1
可争议的智能:通过辩论演讲评估大语言模型法官的基准测试
Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation
Noy Sternlicht, Ariel Gera, Roy Bar-Haim, Tom Hope, Noam Slonim
•
Jun 5, 2025
•
11
1
穿越低谷:小型语言模型实现高效长链思维训练之路
Through the Valley: Path to Effective Long CoT Training for Small Language Models
Renjie Luo, Jiaxi Li, Chen Huang, Wei Lu
•
Jun 9, 2025
•
10
1
思维假象:通过问题复杂性视角理解推理模型的优势与局限
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Parshin Shojaee, Iman Mirzadeh, Keivan Alizadeh, Maxwell Horton, Samy Bengio, Mehrdad Farajtabar
•
Jun 7, 2025
•
10
1
多模态基础模型中从动力学模型引导世界模型的构建
Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models
Yifu Qiu, Yftah Ziser, Anna Korhonen, Shay B. Cohen, Edoardo M. Ponti
•
Jun 6, 2025
•
10
1
视觉Transformer无需训练寄存器
Vision Transformers Don't Need Trained Registers
Nick Jiang, Amil Dravid, Alexei Efros, Yossi Gandelsman
•
Jun 9, 2025
•
9
1
CCI4.0:提升大语言模型推理能力的双语预训练数据集
CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models
Guang Liu, Liangdong Wang, Jijie Li, Yang Yu, Yao Xu, Jiabei Chen, Yu Bai, Feng Liao, Yonghua Lin
•
Jun 9, 2025
•
8
1
ConfQA:仅在确信时作答
ConfQA: Answer Only If You Are Confident
Yin Huang, Yifan Ethan Xu, Kai Sun, Vera Yan, Alicia Sun, Haidar Khan, Jimmy Nguyen, Mohammad Kachuee, Zhaojiang Lin, Yue Liu, Aaron Colak, Anuj Kumar, Wen-tau Yih, Xin Luna Dong
•
Jun 8, 2025
•
8
1
从条件数视角看模型免疫
Model Immunization from a Condition Number Perspective
Amber Yijia Zheng, Cedar Site Bai, Brian Bullins, Raymond A. Yeh
•
May 29, 2025
•
8
1
以玩促泛化:通过游戏玩法学习推理
Play to Generalize: Learning to Reason Through Game Play
Yunfei Xie, Yinsong Ma, Shiyi Lan, Alan Yuille, Junfei Xiao, Chen Wei
•
Jun 9, 2025
•
7
2
GUI-反思:通过自我反思赋能多模态GUI模型行为
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior
Penghao Wu, Shengnan Ma, Bo Wang, Jiaheng Yu, Lewei Lu, Ziwei Liu
•
Jun 9, 2025
•
7
1
良好的开端是成功的一半:通过弱到强解码实现低资源偏好对齐
Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding
Feifan Song, Shaohang Wei, Wen Luo, Yuxuan Fan, Tianyu Liu, Guoyin Wang, Houfeng Wang
•
Jun 9, 2025
•
7
1
梦境之地:基于模拟器与生成模型的可控世界构建
Dreamland: Controllable World Creation with Simulator and Generative Models
Sicheng Mo, Ziyang Leng, Leon Liu, Weizhen Wang, Honglin He, Bolei Zhou
•
Jun 9, 2025
•
6
1
合成自我!诱导角色引导提示以构建大语言模型中的个性化奖励机制
SynthesizeMe! Inducing Persona-Guided Prompts for Personalized Reward Models in LLMs
Michael J Ryan, Omar Shaikh, Aditri Bhagirath, Daniel Frees, William Held, Diyi Yang
•
Jun 5, 2025
•
6
1
ExpertLongBench:基于结构化检查表的专家级长文本生成任务语言模型基准测试
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists
Jie Ruan, Inderjeet Nair, Shuyang Cao, Amy Liu, Sheza Munir, Micah Pollens-Dempsey, Tiffany Chiang, Lucy Kates, Nicholas David, Sihan Chen, Ruxin Yang, Yuqian Yang, Jasmine Gump, Tessa Bialek, Vivek Sankaran, Margo Schlanger, Lu Wang
•
Jun 2, 2025
•
6
1
所见即所存:知识冲突对大语言模型的颠覆性影响
What Is Seen Cannot Be Unseen: The Disruptive Effect of Knowledge Conflict on Large Language Models
Kaiser Sun, Fan Bai, Mark Dredze
•
Jun 6, 2025
•
5
1
卡带式表示:通过自学习实现的轻量级通用长上下文表征
Cartridges: Lightweight and general-purpose long context representations via self-study
Sabri Eyuboglu, Ryan Ehrlich, Simran Arora, Neel Guha, Dylan Zinsley, Emily Liu, Will Tennien, Atri Rudra, James Zou, Azalia Mirhoseini, Christopher Re
•
Jun 6, 2025
•
4
1
变革的推动者:面向战略规划的自进化大语言模型代理
Agents of Change: Self-Evolving LLM Agents for Strategic Planning
Nikolas Belle, Dakota Barnes, Alfonso Amayuelas, Ivan Bercovich, Xin Eric Wang, William Wang
•
Jun 5, 2025
•
4
1
τ^2-Bench:双控环境下对话代理的评估平台
τ^2-Bench: Evaluating Conversational Agents in a Dual-Control Environment
Victor Barres, Honghua Dong, Soham Ray, Xujie Si, Karthik Narasimhan
•
Jun 9, 2025
•
3
1
SAFEFLOW:一种确保可信与事务性自主代理系统的原则性协议
SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems
Peiran Li, Xinkai Zou, Zhuohang Wu, Ruifeng Li, Shuo Xing, Hanwen Zheng, Zhikai Hu, Yuping Wang, Haoxi Li, Qin Yuan, Yingmo Zhang, Zhengzhong Tu
•
Jun 9, 2025
•
3
1
学习强化学习所不能:针对最难题目的交错在线微调
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Lu Ma, Hao Liang, Meiyi Qiang, Lexiang Tang, Xiaochen Ma, Zhen Hao Wong, Junbo Niu, Chengyu Shen, Runming He, Bin Cui, Wentao Zhang
•
Jun 9, 2025
•
3
1
机器人学习中的自适应改进循环
Self-Adapting Improvement Loops for Robotic Learning
Calvin Luo, Zilai Zeng, Mingxi Jia, Yilun Du, Chen Sun
•
Jun 7, 2025
•
3
1
NetPress:面向网络应用的动态生成大语言模型基准测试
NetPress: Dynamically Generated LLM Benchmarks for Network Applications
Yajie Zhou, Jiajun Ruan, Eric S. Wang, Sadjad Fouladi, Francis Y. Yan, Kevin Hsieh, Zaoxing Liu
•
Jun 3, 2025
•
3
2
PolyVivid:跨模态交互与增强的多主体生动视频生成
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Teng Hu, Zhentao Yu, Zhengguang Zhou, Jiangning Zhang, Yuan Zhou, Qinglin Lu, Ran Yi
•
Jun 9, 2025
•
2
1
超频LLM推理:监控与调控大语言模型中的思维路径长度
Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs
Roy Eisenstadt, Itamar Zimerman, Lior Wolf
•
Jun 8, 2025
•
2
1
GeometryZero:通过群体对比策略优化提升大语言模型的几何解题能力
GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization
Yikun Wang, Yibin Wang, Dianyi Wang, Zimian Peng, Qipeng Guo, Dacheng Tao, Jiaqi Wang
•
Jun 8, 2025
•
2
1
MegaHan97K:面向超大类中文字符识别的大规模数据集,涵盖超过97,000个类别
MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories
Yuyi Zhang, Yongxin Shi, Peirong Zhang, Yixin Zhao, Zhenhua Yang, Lianwen Jin
•
Jun 5, 2025
•
2
1
通过动态目标边界的鲁棒偏好优化
Robust Preference Optimization via Dynamic Target Margins
Jie Sun, Junkang Wu, Jiancan Wu, Zhibo Zhu, Xingyu Lu, Jun Zhou, Lintao Ma, Xiang Wang
•
Jun 4, 2025
•
2
1
动态视图合成作为逆问题
Dynamic View Synthesis as an Inverse Problem
Hidir Yesiltepe, Pinar Yanardag
•
Jun 9, 2025
•
1
1
CyberV:视频理解中测试时自适应的控制论方法
CyberV: Cybernetics for Test-time Scaling in Video Understanding
Jiahao Meng, Shuyang Sun, Yue Tan, Lu Qi, Yunhai Tong, Xiangtai Li, Longyin Wen
•
Jun 9, 2025
•
1
1
通过概念感知微调提升大型语言模型性能
Improving large language models with concept-aware fine-tuning
Michael K. Chen, Xikun Zhang, Jiaxing Huang, Dacheng Tao
•
Jun 9, 2025
•
1
1
利用代理模型评估大语言模型在资源较少语言中的鲁棒性
Evaluating LLMs Robustness in Less Resourced Languages with Proxy Models
Maciej Chrabąszcz, Katarzyna Lorenc, Karolina Seweryn
•
Jun 9, 2025
•
1
1
通过正交匹配追踪实现无需训练的分词器移植
Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit
Charles Goddard, Fernando Fernandes Neto
•
Jun 7, 2025
•
1
1
隐于明处:探究多模态语言模型中的隐性推理能力
Hidden in Plain Sight: Probing Implicit Reasoning in Multimodal Language Models
Qianqi Yan, Hongquan Li, Shan Jiang, Yang Zhao, Xinze Guan, Ching-Chen Kuo, Xin Eric Wang
•
May 30, 2025
•
1
1
EVOREFUSE:面向大语言模型对伪恶意指令过度拒绝的评估与缓解的进化式提示优化
EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions
Xiaorui Wu, Xiaofeng Mao, Xin Zhang, Fei Li, Chong Teng, Yuxiang Peng, Li Zheng, Donghong Ji, Zhuang Li
•
May 29, 2025
•
1
1
元自适应提示蒸馏用于少样本视觉问答
Meta-Adaptive Prompt Distillation for Few-Shot Visual Question Answering
Akash Gupta, Amos Storkey, Mirella Lapata
•
Jun 7, 2025
•
0
1
基于流式第一人称视频的主动式助手对话生成
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos
Yichi Zhang, Xin Luna Dong, Zhaojiang Lin, Andrea Madotto, Anuj Kumar, Babak Damavandi, Joyce Chai, Seungwhan Moon
•
Jun 6, 2025
•
0
1