ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers with translations
June 2nd, 2025
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
Mingjie Liu, Shizhe Diao, Ximing Lu, Jian Hu, Xin Dong, Yejin Choi, Jan Kautz, Yi Dong
•
May 30, 2025
•
109
3
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Junyu Zhang, Runpei Dong, Han Wang, Xuying Ning, Haoran Geng, Peihao Li, Xialin He, Yutong Bai, Jitendra Malik, Saurabh Gupta, Huan Zhang
•
May 30, 2025
•
79
2
Time Blindness: Why Video-Language Models Can't See What Humans Can?
Ujjwal Upadhyay, Mukul Ranjan, Zhiqiang Shen, Mohamed Elhoseiny
•
May 30, 2025
•
71
3
Large Language Models for Data Synthesis
Yihong Tang, Menglin Kong, Lijun Sun
•
May 20, 2025
•
46
2
HardTests: Synthesizing High-Quality Test Cases for LLM Coding
Zhongmou He, Yee Man Choi, Kexun Zhang, Jiabao Ji, Junting Zhou, Dejia Xu, Ivan Bercovich, Aidan Zhang, Lei Li
•
May 30, 2025
•
40
2
Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation
Jiwan Chung, Junhyeok Kim, Siyeol Kim, Jaeyoung Lee, Min Soo Kim, Youngjae Yu
•
May 24, 2025
•
35
2
ViStoryBench: Comprehensive Benchmark Suite for Story Visualization
Cailin Zhuang, Ailin Huang, Wei Cheng, Jingwei Wu, Yaoqi Hu, Jiaqi Liao, Zhewei Huang, Hongyuan Wang, Xinyao Liao, Weiwei Cai, Hengyuan Xu, Xuanyang Zhang, Xianfang Zeng, Gang Yu, Chi Zhang
•
May 30, 2025
•
29
2
EXP-Bench: Can AI Conduct AI Research Experiments?
Patrick Tser Jern Kon, Jiachen Liu, Xinyi Zhu, Qiuyi Ding, Jingjia Peng, Jiarong Xing, Yibo Huang, Yiming Qiu, Jayanth Srinivasa, Myungjin Lee, Mosharaf Chowdhury, Matei Zaharia, Ang Chen
•
May 30, 2025
•
22
3
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models
Chenbin Pan, Wenbin He, Zhengzhong Tu, Liu Ren
•
May 29, 2025
•
22
3
Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents
Yaxin Luo, Zhaoyi Li, Jiacheng Liu, Jiacheng Cui, Xiaohan Zhao, Zhiqiang Shen
•
May 30, 2025
•
21
2
CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects
Huaijin Pi, Zhi Cen, Zhiyang Dou, Taku Komura
•
May 27, 2025
•
19
2
MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
Yiqing Liang, Jielin Qiu, Wenhao Ding, Zuxin Liu, James Tompkin, Mengdi Xu, Mengzhou Xia, Zhengzhong Tu, Laixi Shi, Jiacheng Zhu
•
May 30, 2025
•
18
3
Vision Language Models are Biased
An Vo, Khai-Nguyen Nguyen, Mohammad Reza Taesiri, Vy Tuong Dang, Anh Totti Nguyen, Daeyoung Kim
•
May 29, 2025
•
17
2
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs
Gabrielle Kaili-May Liu, Gal Yona, Avi Caciularu, Idan Szpektor, Tim G. J. Rudner, Arman Cohan
•
May 30, 2025
•
16
2
EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge
Ruskin Raj Manku, Yuzhi Tang, Xingjian Shi, Mu Li, Alex Smola
•
May 29, 2025
•
16
2
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
Yang-Tian Sun, Xin Yu, Zehuan Huang, Yi-Hua Huang, Yuan-Chen Guo, Ziyi Yang, Yan-Pei Cao, Xiaojuan Qi
•
May 30, 2025
•
13
2
CLaSp: In-Context Layer Skip for Self-Speculative Decoding
Longze Chen, Renke Shan, Huiming Wang, Lu Wang, Ziqiang Liu, Run Luo, Jiawei Wang, Hamid Alinejad-Rokny, Min Yang
•
May 30, 2025
•
13
6
More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models
Chengzhi Liu, Zhongxing Xu, Qingyue Wei, Juncheng Wu, James Zou, Xin Eric Wang, Yuyin Zhou, Sheng Liu
•
May 23, 2025
•
13
2
EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering
Runnan Lu, Yuxuan Zhang, Jailing Liu, Haifa Wang, Yiren Song
•
May 30, 2025
•
11
2
Large Language Models are Locally Linear Mappings
James R. Golden
•
May 30, 2025
•
11
4
Fork-Merge Decoding: Enhancing Multimodal Understanding in Audio-Visual Large Language Models
Chaeyoung Jung, Youngjoon Jang, Jongmin Choi, Joon Son Chung
•
May 27, 2025
•
10
2
ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL
Yu Zhang, Yunqi Li, Yifan Yang, Rui Wang, Yuqing Yang, Dai Qi, Jianmin Bao, Dongdong Chen, Chong Luo, Lili Qiu
•
May 30, 2025
•
9
2
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
Shuyao Xu, Cheng Peng, Jiangxuan Long, Weidi Xu, Wei Chu, Yuan Qi
•
May 30, 2025
•
9
3
DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation
Mengda Xu, Han Zhang, Yifan Hou, Zhenjia Xu, Linxi Fan, Manuela Veloso, Shuran Song
•
May 28, 2025
•
8
2
ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents
Feiteng Fang, Ting-En Lin, Yuchuan Wu, Xiong Liu, Xiang Huang, Dingwei Chen, Jing Ye, Haonan Zhang, Liang Zhu, Hamid Alinejad-Rokny, Min Yang, Fei Huang, Yongbin Li
•
May 29, 2025
•
7
2
Role-Playing Evaluation for Large Language Models
Yassine El Boudouri, Walter Nuninger, Julian Alvarez, Yvan Peter
•
May 19, 2025
•
7
2
Evaluating and Steering Modality Preferences in Multimodal Large Language Model
Yu Zhang, Jinlong Ma, Yongshuai Hou, Xuefeng Bai, Kehai Chen, Yang Xiang, Jun Yu, Min Zhang
•
May 27, 2025
•
6
2
Harnessing Large Language Models for Scientific Novelty Detection
Yan Liu, Zonglin Yang, Soujanya Poria, Thanh-Son Nguyen, Erik Cambria
•
May 30, 2025
•
5
2
un^2CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP
Yinqi Li, Jiahe Zhao, Hong Chang, Ruibing Hou, Shiguang Shan, Xilin Chen
•
May 30, 2025
•
5
2
Fine-Tune an SLM or Prompt an LLM? The Case of Generating Low-Code Workflows
Orlando Marquez Ayala, Patrice Bechard, Emily Chen, Maggie Baird, Jingfei Chen
•
May 30, 2025
•
5
2
Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts
Xuweiyi Chen, Wentao Zhou, Aruni RoyChowdhury, Zezhou Cheng
•
May 29, 2025
•
5
2
Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation
Zhenglun Kong, Zheng Zhan, Shiyue Hou, Yifan Gong, Xin Meng, Pengwei Sui, Peiyan Dong, Xuan Shen, Zifeng Wang, Pu Zhao, Hao Tang, Stratis Ioannidis, Yanzhi Wang
•
May 28, 2025
•
5
2
SiLVR: A Simple Language-based Video Reasoning Framework
Ce Zhang, Yan-Bo Lin, Ziyang Wang, Mohit Bansal, Gedas Bertasius
•
May 30, 2025
•
4
2
Revisiting Bi-Linear State Transitions in Recurrent Neural Networks
M. Reza Ebrahimi, Roland Memisevic
•
May 27, 2025
•
4
2
TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis
Xiaorui Wu, Xiaofeng Mao, Fei Li, Xin Zhang, Xuanhong Li, Chong Teng, Donghong Ji, Zhuang Li
•
May 30, 2025
•
3
2
GATE: General Arabic Text Embedding for Enhanced Semantic Textual Similarity with Matryoshka Representation Learning and Hybrid Loss Training
Omer Nacar, Anis Koubaa, Serry Sibaee, Yasser Al-Habashi, Adel Ammar, Wadii Boulila
•
May 30, 2025
•
3
2
Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks
Debargha Ganguly, Vikash Singh, Sreehari Sankar, Biyao Zhang, Xuecen Zhang, Srinivasan Iyengar, Xiaotian Han, Amit Sharma, Shivkumar Kalyanaraman, Vipin Chaudhary
•
May 26, 2025
•
3
2
The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets
Shenzhe Zhu, Jiao Sun, Yi Nian, Tobin South, Alex Pentland, Jiaxin Pei
•
May 29, 2025
•
2
3
OMNIGUARD: An Efficient Approach for AI Safety Moderation Across Modalities
Sahil Verma, Keegan Hines, Jeff Bilmes, Charlotte Siska, Luke Zettlemoyer, Hila Gonen, Chandan Singh
•
May 29, 2025
•
2
2
LegalSearchLM: Rethinking Legal Case Retrieval as Legal Elements Generation
Chaeeun Kim, Jinu Lee, Wonseok Hwang
•
May 28, 2025
•
2
1
Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings
Max Conti, Manuel Faysse, Gautier Viaud, Antoine Bosselut, Céline Hudelot, Pierre Colombo
•
May 30, 2025
•
1
2
The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It
Zheng-Xin Yong, Beyza Ermis, Marzieh Fadaee, Stephen H. Bach, Julia Kreutzer
•
May 30, 2025
•
1
2