ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
May 7th, 2025
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Yibin Wang, Zhimin Li, Yuhang Zang, Chunyu Wang, Qinglin Lu, Cheng Jin, Jiaqi Wang
May 6, 2025 • 69 • 2
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Andrew Zhao, Yiran Wu, Yang Yue, Tong Wu, Quentin Xu, Yang Yue, Matthieu Lin, Shenzhi Wang, Qingyun Wu, Zilong Zheng, Gao Huang
May 6, 2025 • 65 • 1
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale
Daniel Goldstein, Eric Alcaide, Janna Lu, Eugene Cheah
May 5, 2025 • 23 • 1
FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios
Shiyi Zhang, Junhao Zhuang, Zhaoyang Zhang, Ying Shan, Yansong Tang
May 6, 2025 • 21 • 1
RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference
Yaoqi Chen, Jinkai Zhang, Baotong Lu, Qianxi Zhang, Chengruidong Zhang, Jingjia Luo, Di Liu, Huiqiang Jiang, Qi Chen, Jing Liu, Bailu Ding, Xiao Yan, Jiawei Jiang, Chen Chen, Mingxing Zhang, Yuqing Yang, Fan Yang, Mao Yang
May 5, 2025 • 19 • 2
An Empirical Study of Qwen3 Quantization
Xingyu Zheng, Yuye Li, Haoran Chu, Yue Feng, Xudong Ma, Jie Luo, Jinyang Guo, Haotong Qin, Michele Magno, Xianglong Liu
May 4, 2025 • 15 • 1
Decoding Open-Ended Information Seeking Goals from Eye Movements in Reading
Cfir Avraham Hadar, Omer Shubi, Yoav Meiri, Yevgeni Berzak
May 4, 2025 • 14 • 2
Multi-Agent System for Comprehensive Soccer Understanding
Jiayuan Rao, Zifeng Li, Haoning Wu, Ya Zhang, Yanfeng Wang, Weidi Xie
May 6, 2025 • 11 • 1
Geospatial Mechanistic Interpretability of Large Language Models
Stef De Sabbata, Stefano Mizzaro, Kevin Roitero
May 6, 2025 • 7 • 1
HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation
Haiyang Zhou, Wangbo Yu, Jiawen Guan, Xinhua Cheng, Yonghong Tian, Li Yuan
Apr 30, 2025 • 7 • 1
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
Zuwei Long, Yunhang Shen, Chaoyou Fu, Heting Gao, Lijiang Li, Peixian Chen, Mengdan Zhang, Hang Shao, Jian Li, Jinlong Peng, Haoyu Cao, Ke Li, Rongrong Ji, Xing Sun
May 6, 2025 • 6 • 1
InfoVids: Reimagining the Viewer Experience with Alternative Visualization-Presenter Relationships
Ji Won Chung, Tongyu Zhou, Ivy Chen, Kevin Hsu, Ryan A. Rossi, Alexa Siu, Shunan Guo, Franck Dernoncourt, James Tompkin, Jeff Huang
May 6, 2025 • 5 • 1
SWE-smith: Scaling Data for Software Engineering Agents
John Yang, Kilian Leret, Carlos E. Jimenez, Alexander Wettig, Kabir Khandpur, Yanzhe Zhang, Binyuan Hui, Ofir Press, Ludwig Schmidt, Diyi Yang
Apr 30, 2025 • 5 • 1
Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation
Lu Ling, Chen-Hsuan Lin, Tsung-Yi Lin, Yifan Ding, Yu Zeng, Yichen Sheng, Yunhao Ge, Ming-Yu Liu, Aniket Bera, Zhaoshuo Li
May 5, 2025 • 3 • 1
Teaching Models to Understand (but not Generate) High-risk Data
Ryan Wang, Matthew Finlayson, Luca Soldaini, Swabha Swayamdipta, Robin Jia
May 5, 2025 • 2 • 1
Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering
Jihao Zhao, Chunlai Zhou, Biao Qin
May 5, 2025 • 2 • 1
Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems
Shaokun Zhang, Ming Yin, Jieyu Zhang, Jiale Liu, Zhiguang Han, Jingyang Zhang, Beibin Li, Chi Wang, Huazheng Wang, Yiran Chen, Qingyun Wu
Apr 30, 2025 • 2 • 1
Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant
Lei Shen, Xiaoyu Shen
Apr 25, 2025 • 2 • 1
Alpha Excel Benchmark
David Noever, Forrest McKee
May 7, 2025 • 0 • 1