ChatPaper.aiChatPaper.ai
首頁

arXiv

HuggingFace

定價賬戶工作台

•
•

•
•

•
•

•
•

•
•

Footer

Company name

ChatPaper.ai: Your advanced AI reading assistant.

Contact us: [email protected]

X (Twitter)

Products

  • AI Search
  • AI Mind Map
  • Arxiv Summary
  • Huggingface Summary

Support

  • FAQ
  • Contact

Company

  • Blog
  • Privacy Policy
  • Terms of Service

Available Languages

  • 🇬🇧English
  • 🇨🇳中文简体
  • 🇭🇰繁體中文
  • 🇯🇵日本語
  • 🇰🇷한국어
  • 🇩🇪Deutsch
  • 🇫🇷Français
  • 🇷🇺Русский
  • 🇪🇸Español

© 2025 chatpaper.ai All rights reserved.

AI研究論文每日精選

每日精選AI研究論文及翻譯

超越80/20法则:高熵少数词元驱动LLM推理的有效强化学习
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Shenzhi Wang, Le Yu, Chang Gao, Chujie Zheng, Shixuan Liu, Rui Lu, Kai Dang, Xionghui Chen, Jianxin Yang, Zhenru Zhang, Yuqiong Liu, An Yang, Andrew Zhao, Yang Yue, Shiji Song, Bowen Yu, Gao Huang, Junyang Lin•Jun 2, 2025•1283

SmolVLA:面向經濟高效機器人的視覺-語言-動作模型
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Mustafa Shukor, Dana Aubakirova, Francesco Capuano, Pepijn Kooijmans, Steven Palma, Adil Zouitine, Michel Aractingi, Caroline Pascal, Martino Russi, Andres Marafioti, Simon Alibert, Matthieu Cord, Thomas Wolf, Remi Cadene•Jun 2, 2025•7414

推理健身房:具可驗證獎勵的強化學習推理環境
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Zafir Stojanovski, Oliver Stanley, Joe Sharratt, Richard Jones, Abdulhakeem Adefioye, Jean Kaddour, Andreas Köpf•May 30, 2025•584

通過梯度分組調整學習率來馴服大型語言模型
Taming LLMs by Scaling Learning Rates with Gradient Grouping

Siyuan Li, Juanxi Tian, Zedong Wang, Xin Jin, Zicheng Liu, Wentao Zhang, Dan Xu•Jun 1, 2025•354

時序上下文微調技術:實現視頻擴散模型的多功能控制
Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models

Kinam Kim, Junha Hyung, Jaegul Choo•Jun 1, 2025•343

SRPO:通过反射感知强化学习增强多模态大语言模型的推理能力
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Zhongwei Wan, Zhihao Dou, Che Liu, Yu Zhang, Dongfei Cui, Qinjian Zhao, Hui Shen, Jing Xiong, Yi Xin, Yifan Jiang, Yangfan He, Mi Zhang, Shen Yan•Jun 2, 2025•302

ShapeLLM-Omni:一款原生多模态大語言模型,專注於3D生成與理解
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding

Junliang Ye, Zhengyi Wang, Ruowen Zhao, Shenghao Xie, Jun Zhu•Jun 2, 2025•272

ARIA:以意圖驅動的獎勵聚合訓練語言代理
ARIA: Training Language Agents with Intention-Driven Reward Aggregation

Ruihan Yang, Yikai Zhang, Aili Chen, Xintao Wang, Siyu Yuan, Jiangjie Chen, Deqing Yang, Yanghua Xiao•May 31, 2025•272

LoHoVLA:面向长程具身任务的统一视觉-语言-动作模型
LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks

Yi Yang, Jiaxuan Sun, Siqi Kou, Yihan Wang, Zhijie Deng•May 31, 2025•272

Jigsaw-R1:基於規則的視覺強化學習在拼圖遊戲中的研究
Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with Jigsaw Puzzles

Zifu Wang, Junyi Zhu, Bo Tang, Zhiyu Li, Feiyu Xiong, Jiaqian Yu, Matthew B. Blaschko•May 29, 2025•242

面向机器人操作的协作轨迹控制视频生成学习
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control

Xiao Fu, Xintao Wang, Xian Liu, Jianhong Bai, Runsen Xu, Pengfei Wan, Di Zhang, Dahua Lin•Jun 2, 2025•232

地球心智:迈向基于大型多模态模型的多粒度与多传感器地球观测
EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation with Large Multimodal Models

Yan Shu, Bin Ren, Zhitong Xiong, Danda Pani Paudel, Luc Van Gool, Begum Demir, Nicu Sebe, Paolo Rota•Jun 2, 2025•202

AReaL:面向语言推理的大规模异步强化学习系统
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

Wei Fu, Jiaxuan Gao, Xujie Shen, Chen Zhu, Zhiyu Mei, Chuyi He, Shusheng Xu, Guo Wei, Jun Mei, Jiashu Wang, Tongkai Yang, Binhang Yuan, Yi Wu•May 30, 2025•202

壓縮表徵的統一縮放定律
Unified Scaling Laws for Compressed Representations

Andrei Panferov, Alexandra Volkova, Ionut-Vlad Modoranu, Vage Egiazarian, Mher Safaryan, Dan Alistarh•Jun 2, 2025•172

MiCRo:基於混合建模與情境感知路由的個人化偏好學習
MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

Jingyan Shen, Jiarui Yao, Rui Yang, Yifan Sun, Feng Luo, Rui Pan, Tong Zhang, Han Zhao•May 30, 2025•152

激勵推理以實現大型語言模型的高級指令跟隨能力
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Yulei Qin, Gang Li, Zongyi Li, Zihan Xu, Yuchen Shi, Zhekai Lin, Xiao Cui, Ke Li, Xing Sun•Jun 2, 2025•142

IVY-FAKE:一個統一的圖像與視頻AIGC檢測可解釋性框架與基準
IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection

Wayne Zhang, Changjiang Jiang, Zhonghao Zhang, Chenyang Si, Fengchang Yu, Wei Peng•Jun 1, 2025•133

從令牌到行動:狀態機推理緩解信息檢索中的過度思考
From Token to Action: State Machine Reasoning to Mitigate Overthinking in Information Retrieval

Dohyeon Lee, Yeonseok Jeong, Seung-won Hwang•May 29, 2025•132

像經濟學家一樣推理:在經濟問題上的後續訓練促使大型語言模型實現策略性泛化
Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs

Yufa Zhou, Shaobo Wang, Xingyu Dong, Xiangqi Jin, Yifang Chen, Yue Min, Kexin Yang, Xingzhang Ren, Dayiheng Liu, Linfeng Zhang•May 31, 2025•112

Cora:基於少步擴散的對應感知圖像編輯
Cora: Correspondence-aware image editing using few step diffusion

Amirhossein Almohammadi, Aryan Mikaeili, Sauradip Nag, Negar Hassanpour, Andrea Tagliasacchi, Ali Mahdavi-Amiri•May 29, 2025•112

WebChoreArena:评估网页浏览代理在现实繁琐网页任务中的表现
WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks

Atsuyuki Miyai, Zaiying Zhao, Kazuki Egashira, Atsuki Sato, Tatsumi Sunada, Shota Onohara, Hiromasa Yamanishi, Mashiro Toyooka, Kunato Nishina, Ryoma Maeda, Kiyoharu Aizawa, Toshihiko Yamasaki•Jun 2, 2025•103

VisualSphinx:面向強化學習的大規模合成視覺邏輯謎題
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

Yichen Feng, Zhangchen Xu, Fengqing Jiang, Yuetai Li, Bhaskar Ramasubramanian, Luyao Niu, Bill Yuchen Lin, Radha Poovendran•May 29, 2025•92

OWSM v4:通过数据扩展与清洗提升开放式Whisper风格语音模型
OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning

Yifan Peng, Shakeel Muhammad, Yui Sudo, William Chen, Jinchuan Tian, Chyi-Jiunn Lin, Shinji Watanabe•May 31, 2025•82

從視頻中學習三維世界:利用三維視覺幾何先驗增強多模態大語言模型
Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors

Duo Zheng, Shijia Huang, Yanyang Li, Liwei Wang•May 30, 2025•82

壓力測試機器生成文本檢測:轉變語言模型寫作風格以欺騙檢測器
Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors

Andrea Pedrotti, Michele Papucci, Cristiano Ciaccio, Alessio Miaschi, Giovanni Puccetti, Felice Dell'Orletta, Andrea Esuli•May 30, 2025•82

CodeV-R1:推理增强型Verilog生成
CodeV-R1: Reasoning-Enhanced Verilog Generation

Yaoyu Zhu, Di Huang, Hanqi Lyu, Xiaoyun Zhang, Chongxiao Li, Wenxuan Shi, Yutong Wu, Jianan Mu, Jinghua Wang, Yang Zhao, Pengwei Jin, Shuyao Cheng, Shengwen Liang, Xishan Zhang, Rui Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen•May 30, 2025•82

DyePack:利用後門技術可證明地標記大型語言模型中的測試集污染
DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors

Yize Cheng, Wenxiao Wang, Mazda Moayeri, Soheil Feizi•May 29, 2025•82

歸一化注意力引導:擴散模型的通用負向引導
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model

Dar-Yen Chen, Hmrishav Bandyopadhyay, Kai Zou, Yi-Zhe Song•May 27, 2025•83

深奥语言模型
Esoteric Language Models

Subham Sekhar Sahoo, Zhihan Yang, Yash Akhauri, Johnna Liu, Deepansha Singh, Zhoujun Cheng, Zhengzhong Liu, Eric Xing, John Thickstun, Arash Vahdat•Jun 2, 2025•72

zip2zip:基於詞彙壓縮的語言模型推理時自適應詞表技術
zip2zip: Inference-Time Adaptive Vocabularies for Language Models via Token Compression

Saibo Geng, Nathan Ranchin, Yunzhen yao, Maxime Peyrard, Chris Wendler, Michael Gastpar, Robert West•Jun 1, 2025•72

達爾文哥德爾機:自我改進代理的開放式演化
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

Jenny Zhang, Shengran Hu, Cong Lu, Robert Lange, Jeff Clune•May 29, 2025•72

何時行動,何時等待:面向任務型對話中意圖觸發性的結構軌跡建模
WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue

Yaoyao Qian, Jindan Huang, Yuanli Wang, Simon Yu, Kyrie Zhixuan Zhou, Jiayuan Mao, Mingfu Liang, Hanhan Zhou•Jun 2, 2025•62

從注入到蒸餾:語言模型中的級聯對抗偏見
Cascading Adversarial Bias from Injection to Distillation in Language Models

Harsh Chaudhari, Jamie Hayes, Matthew Jagielski, Ilia Shumailov, Milad Nasr, Alina Oprea•May 30, 2025•62

VAU-R1:通過強化學習微調提升視頻異常理解能力
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning

Liyun Zhu, Qixiang Chen, Xi Shen, Xiaodong Cun•May 29, 2025•62

SATA-BENCH:多選題全選適用基準測試
SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions

Weijie Xu, Shixian Cui, Xi Fang, Chi Xue, Stephanie Eckman, Chandan Reddy•May 31, 2025•52

Pro3D-Editor:一種基於漸進視角的一致且精確的三維編輯方法
Pro3D-Editor : A Progressive-Views Perspective for Consistent and Precise 3D Editing

Yang Zheng, Mengqi Huang, Nan Chen, Zhendong Mao•May 31, 2025•52

步長自適應:面向預算迭代訓練的統一學習率調度方案
Stepsize anything: A unified learning rate schedule for budgeted-iteration training

Anda Tang, Yiming Dong, Yutao Zeng, zhou Xun, Zhouchen Lin•May 30, 2025•52

從指南到實踐:阿拉伯語語言模型評估的新範式
From Guidelines to Practice: A New Paradigm for Arabic Language Model Evaluation

Serry Sibaee, Omer Nacar, Adel Ammar, Yasser Al-Habashi, Abdulrahman Al-Batati, Wadii Boulila•Jun 2, 2025•43

從指南到實踐:阿拉伯語語言模型評估的新範式
From Guidelines to Practice: A New Paradigm for Arabic Language Model Evaluation

Serry Sibaee, Omer Nacar, Adel Ammar, Yasser Al-Habashi, Abdulrahman Al-Batati, Wadii Boulila•Jun 2, 2025•43

LLM在迴路中:構建PARADEHATE數據集以實現仇恨言論淨化
LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification

Shuzhou Yuan, Ercong Nie, Lukas Kouba, Ashish Yashwanth Kangen, Helmut Schmid, Hinrich Schutze, Michael Farber•Jun 2, 2025•43

稀有性:面向检索增强生成系统的检索感知鲁棒性评估
RARE: Retrieval-Aware Robustness Evaluation for Retrieval-Augmented Generation Systems

Yixiao Zeng, Tianyu Cao, Danqing Wang, Xinran Zhao, Zimeng Qiu, Morteza Ziyadi, Tongshuang Wu, Lei Li•Jun 1, 2025•42

ComposeAnything:面向文本到图像生成的复合对象先验
ComposeAnything: Composite Object Priors for Text-to-Image Generation

Zeeshan Khan, Shizhe Chen, Cordelia Schmid•May 30, 2025•43

全方位回應:在雙向互動中的線上多模態對話回應生成
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions

Cheng Luo, Jianghui Wang, Bing Li, Siyang Song, Bernard Ghanem•May 27, 2025•42

编程概念与神经元在代码语言模型中的共享机制
How Programming Concepts and Neurons Are Shared in Code Language Models

Amir Hossein Kargaran, Yihong Liu, François Yvon, Hinrich Schütze•Jun 1, 2025•32

SealQA:提升搜索增强语言模型推理能力的新标杆
SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

Thinh Pham, Nguyen Nguyen, Pratibha Zunjare, Weiyuan Chen, Yu-Min Tseng, Tu Vu•Jun 1, 2025•32

評估語言模型預測者時的陷阱
Pitfalls in Evaluating Language Model Forecasters

Daniel Paleka, Shashwat Goel, Jonas Geiping, Florian Tramèr•May 31, 2025•32

SenseFlow:面向流式文本至图像蒸馏的分布匹配扩展技术
SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation

Xingtong Ge, Xin Zhang, Tongda Xu, Yi Zhang, Xinjie Zhang, Yan Wang, Jun Zhang•May 31, 2025•32

MaskSearch:提升代理搜索能力的通用預訓練框架
MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability

Weiqi Wu, Xin Guan, Shen Huang, Yong Jiang, Pengjun Xie, Fei Huang, Jiuxin Cao, Hai Zhao, Jingren Zhou•May 26, 2025•32

再思考!测试时计算量对大语言模型偏好、观点及信念的影响
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models

George Kour, Itay Nakash, Ateret Anaby-Tavor, Michal Shmueli-Scheuer•May 26, 2025•32

將視覺語言模型助手與個性化情境認知對齊
Aligning VLM Assistants with Personalized Situated Cognition

Yongqi Li, Shen Zhou, Xiaohu Li, Xin Miao, Jintao Wen, Mayi Xu, Jianhao Chen, Birong Pan, Hankun Kang, Yuanyuan Zhu, Ming Zhong, Tieyun Qian•Jun 1, 2025•22

揭開真相的面紗:在面向推理的監督微調中,主權重於降維後浮現
LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning

Zihang Liu, Tianyu Pang, Oleg Balabanov, Chaoqun Yang, Tianjin Huang, Lu Yin, Yaoqing Yang, Shiwei Liu•Jun 1, 2025•22

城市之鏡:評測大型語言視覺模型於城市社會經濟感知之應用
CityLens: Benchmarking Large Language-Vision Models for Urban Socioeconomic Sensing

Tianhui Liu, Jie Feng, Hetian Pang, Xin Zhang, Tianjian Ouyang, Zhiyuan Zhang, Yong Li•May 31, 2025•22

利用雙語翻譯數據實現大規模多語言大型語言模型適應
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data

Shaoxiong Ji, Zihao Li, Jaakko Paavola, Indraneil Paul, Hengyu Luo, Jörg Tiedemann•May 31, 2025•22

MagiCodec:基於簡單遮罩高斯注入的編解碼器,實現高保真重建與生成
MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation

Yakun Song, Jiawei Chen, Xiaobin Zhuang, Chenpeng Du, Ziyang Ma, Jian Wu, Jian Cong, Dongya Jia, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xie Chen•May 31, 2025•22

Neuro2Semantic:一种用于人类颅内脑电图连续语言语义重建的迁移学习框架
Neuro2Semantic: A Transfer Learning Framework for Semantic Reconstruction of Continuous Language from Human Intracranial EEG

Siavash Shams, Richard Antonello, Gavin Mischler, Stephan Bickel, Ashesh Mehta, Nima Mesgarani•May 31, 2025•22

雙耳流:基於流匹配模型的高質量雙耳語音合成之因果與可串流方法
BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models

Susan Liang, Dejan Markovic, Israel D. Gebru, Steven Krenn, Todd Keebler, Jacob Sandakly, Frank Yu, Samuel Hassel, Chenliang Xu, Alexander Richard•May 28, 2025•22

R1-代碼解釋器:通過監督學習與強化學習訓練大型語言模型進行代碼推理
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning

Yongchao Chen, Yueying Liu, Junwei Zhou, Yilun Hao, Jingquan Wang, Yang Zhang, Chuchu Fan•May 27, 2025•22

Frankentext:將隨機文本片段縫合成長篇敘事
Frankentext: Stitching random text fragments into long-form narratives

Chau Minh Pham, Jenna Russell, Dzung Pham, Mohit Iyyer•May 23, 2025•22

规划与预算:大型语言模型推理中测试阶段的高效扩展策略
Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning

Junhong Lin, Xinyue Zeng, Jie Zhu, Song Wang, Julian Shun, Jun Wu, Dawei Zhou•May 22, 2025•22

像素與先驗:透過視覺反事實控制視覺-語言模型中的知識先驗
Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts

Michal Golovanevsky, William Rudman, Michael Lepori, Amir Bar, Ritambhara Singh, Carsten Eickhoff•May 21, 2025•22

MIKU-PAL:一種自動化且標準化的多模態語音副語言及情感標註方法
MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling

Yifan Cheng, Ruoyi Zhang, Jiatong Shi•May 21, 2025•22

基於置信度邊界加權偽標籤的Shuffle PatchMix增強方法,用於提升無源域適應性能
Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain Adaptation

Prasanna Reddy Pulakurthi, Majid Rabbani, Jamison Heard, Sohail Dianat, Celso M. de Melo, Raghuveer Rao•May 30, 2025•12

基於多模態擴散模型的離散-連續量子電路合成
Synthesis of discrete-continuous quantum circuits with multimodal diffusion models

Florian Fürrutter, Zohim Chandani, Ikko Hamamura, Hans J. Briegel, Gorka Muñoz-Gil•Jun 2, 2025•02