ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
May 26th, 2025
TabSTAR:具備語義目標感知表徵的基礎表格模型
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations
Alan Arazi, Eilam Shapira, Roi Reichart
•
May 23, 2025
•
103
4
QwenLong-L1:迈向基于强化学习的长上下文大型推理模型
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Fanqi Wan, Weizhou Shen, Shengyi Liao, Yingcheng Shi, Chenliang Li, Ziyi Yang, Ji Zhang, Fei Huang, Jingren Zhou, Ming Yan
•
May 23, 2025
•
75
3
將大型語言模型代理蒸餾至小型模型,並結合檢索與程式碼工具
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Minki Kang, Jongwon Jeong, Seanie Lee, Jaewoong Cho, Sung Ju Hwang
•
May 23, 2025
•
71
5
Quartet:原生FP4訓練對於大型語言模型而言可能是最佳選擇
Quartet: Native FP4 Training Can Be Optimal for Large Language Models
Roberto L. Castro, Andrei Panferov, Soroush Tabesh, Oliver Sieberling, Jiale Chen, Mahdi Nikdan, Saleh Ashkboos, Dan Alistarh
•
May 20, 2025
•
70
2
推理模型固執難改:診斷推理模型中的指令覆寫問題
Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models
Doohyuk Jang, Yoonjeon Kim, Chanjae Park, Hyun Ryu, Eunho Yang
•
May 22, 2025
•
59
2
一視同仁的強化學習:視覺三元統一強化學習
One RL to See Them All: Visual Triple Unified Reinforcement Learning
Yan Ma, Linge Du, Xuyang Shen, Shaoxiang Chen, Pengfei Li, Qibing Ren, Lizhuang Ma, Yuchao Dai, Pengfei Liu, Junjie Yan
•
May 23, 2025
•
55
2
PhyX:你的模型是否具備物理推理的「智慧」?
PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
Hui Shen, Taiqiang Wu, Qi Han, Yunta Hsieh, Jizhou Wang, Yuyue Zhang, Yuxin Cheng, Zijian Hao, Yuansheng Ni, Xin Wang, Zhongwei Wan, Kai Zhang, Wendong Xu, Jing Xiong, Ping Luo, Wenhu Chen, Chaofan Tao, Zhuoqing Mao, Ngai Wong
•
May 21, 2025
•
47
4
QwenLong-CPRS:邁向具備動態上下文優化的無限長語言模型
QwenLong-CPRS: Towards infty-LLMs with Dynamic Context Optimization
Weizhou Shen, Chenliang Li, Fanqi Wan, Shengyi Liao, Shaopeng Lai, Bo Zhang, Yingcheng Shi, Yuning Wu, Gang Fu, Zhansheng Li, Bin Yang, Ji Zhang, Fei Huang, Jingren Zhou, Ming Yan
•
May 23, 2025
•
39
3
透過測試時進化搜索實現圖像與視頻生成的規模化
Scaling Image and Video Generation via Test-Time Evolutionary Search
Haoran He, Jiajun Liang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Ling Pan
•
May 23, 2025
•
38
2
MOOSE-Chem3:透過模擬實驗反饋實現實驗引導的假設排序
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback
Wanhao Liu, Zonglin Yang, Jue Wang, Lidong Bing, Di Zhang, Dongzhan Zhou, Yuqiang Li, Houqiang Li, Erik Cambria, Wanli Ouyang
•
May 23, 2025
•
29
3
模型已知最佳噪聲:基於注意力機制的貝葉斯主動噪聲選擇於視頻擴散模型
Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model
Kwanyoung Kim, Sanghyun Kim
•
May 23, 2025
•
29
3
VeriThinker:學習驗證使推理模型更高效
VeriThinker: Learning to Verify Makes Reasoning Model Efficient
Zigeng Chen, Xinyin Ma, Gongfan Fang, Ruonan Yu, Xinchao Wang
•
May 23, 2025
•
23
2
擴散分類器理解組合性,但需滿足特定條件
Diffusion Classifiers Understand Compositionality, but Conditions Apply
Yujin Jeong, Arnas Uselis, Seong Joon Oh, Anna Rohrbach
•
May 23, 2025
•
18
3
AudioTrust:音頻大型語言模型多面向可信度的基準測試
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models
Kai Li, Can Shen, Yile Liu, Jirui Han, Kelong Zheng, Xuechao Zou, Zhe Wang, Xingjian Du, Shun Zhang, Hanjun Luo, Yingbin Jin, Xinxin Xing, Ziyang Ma, Yue Liu, Xiaojun Jia, Yifan Zhang, Junfeng Fang, Kun Wang, Yibo Yan, Haoyang Li, Yiming Li, Xiaobin Zhuang, Yang Liu, Haibo Hu, Zhuo Chen, Zhizheng Wu, Xiaolin Hu, Eng-Siong Chng, XiaoFeng Wang, Wenyuan Xu, Wei Dong, Xinfeng Li
•
May 22, 2025
•
17
2
Direct3D-S2:透過空間稀疏注意力實現簡易的千兆級3D生成
Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention
Shuang Wu, Youtian Lin, Feihu Zhang, Yifei Zeng, Yikang Yang, Yajie Bao, Jiachen Qian, Siyu Zhu, Philip Torr, Xun Cao, Yao Yao
•
May 23, 2025
•
16
2
不確定性位置:大型語言模型中位置偏差的跨語言研究
Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models
Menschikov Mikhail, Alexander Kharitonov, Maiia Kotyga, Vadim Porvatov, Anna Zhukovskaya, David Kagramanyan, Egor Shvetsov, Evgeny Burnaev
•
May 22, 2025
•
16
2
s3:訓練搜索代理無需大量數據,強化學習足矣
s3: You Don't Need That Much Data to Train a Search Agent via RL
Pengcheng Jiang, Xueqiang Xu, Jiacheng Lin, Jinfeng Xiao, Zifeng Wang, Jimeng Sun, Jiawei Han
•
May 20, 2025
•
15
2
全前端工程流程中的多模态大语言模型基准测试
FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow
Haoyu Sun, Huichen Will Wang, Jiawei Gu, Linjie Li, Yu Cheng
•
May 23, 2025
•
14
2
思維增強策略優化:橋接外部指導與內部能力
Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities
Jinyang Wu, Chonghua Liao, Mingkuan Feng, Shuai Zhang, Zhengqi Wen, Pengpeng Shao, Huazhe Xu, Jianhua Tao
•
May 21, 2025
•
14
2
Time-R1:邁向大型語言模型中的全面時間推理
Time-R1: Towards Comprehensive Temporal Reasoning in LLMs
Zijia Liu, Peixuan Han, Haofei Yu, Haoru Li, Jiaxuan You
•
May 16, 2025
•
14
3
以謊言教學:基於合成負例的課程式DPO用於幻覺檢測
Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection
Shrey Pandit, Ashwin Vinod, Liu Leqi, Ying Ding
•
May 23, 2025
•
13
2
晴朗之夜:邁向多天氣條件下的夜間圖像復原
Clear Nights Ahead: Towards Multi-Weather Nighttime Image Restoration
Yuetong Liu, Yunqiu Xu, Yang Wei, Xiuli Bi, Bin Xiao
•
May 22, 2025
•
11
2
無聲勝有聲:針對低資源語言的無語音語音指令訓練
Speechless: Speech Instruction Training Without Speech for Low Resource Languages
Alan Dao, Dinh Bach Vu, Huy Hoang Ha, Tuan Le Duc Anh, Shreyas Gopal, Yue Heng Yeo, Warren Keng Hoong Low, Eng Siong Chng, Jia Qi Yip
•
May 23, 2025
•
10
2
RBench-V:多模態輸出視覺推理模型的首選評估基準
RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs
Meng-Hao Guo, Xuanyu Chu, Qianrui Yang, Zhe-Han Mo, Yiqing Shen, Pei-lin Li, Xinjie Lin, Jinnian Zhang, Xin-Sheng Chen, Yi Zhang, Kiyohiro Nakayama, Zhengyang Geng, Houwen Peng, Han Hu, Shi-Nin Hu
•
May 22, 2025
•
10
3
透過合成任務與強化學習教導大型語言模型保持上下文忠實性
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning
Shuzheng Si, Haozhe Zhao, Cheng Gao, Yuzhuo Bai, Zhitong Wang, Bofei Gao, Kangyang Luo, Wenhao Li, Yufei Huang, Gang Chen, Fanchao Qi, Minjia Zhang, Baobao Chang, Maosong Sun
•
May 22, 2025
•
10
5
Trinity-RFT:一個通用且統一的強化微調框架,適用於大型語言模型
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models
Xuchen Pan, Yanxi Chen, Yushuo Chen, Yuchang Sun, Daoyuan Chen, Wenhao Zhang, Yuexiang Xie, Yilun Huang, Yilei Zhang, Dawei Gao, Yaliang Li, Bolin Ding, Jingren Zhou
•
May 23, 2025
•
9
2
ScanBot:邁向具身機器人系統中的智能表面掃描
ScanBot: Towards Intelligent Surface Scanning in Embodied Robotic Systems
Zhiling Chen, Yang Zhang, Fardin Jalil Piran, Qianyu Zhou, Jiong Tang, Farhad Imani
•
May 22, 2025
•
9
2
視覺語言模型在現實世界中安全嗎?基於迷因的基準測試研究
Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study
DongGeon Lee, Joonwon Jang, Jihae Jeong, Hwanjo Yu
•
May 21, 2025
•
8
2
合成數據強化學習:任務定義即為關鍵
Synthetic Data RL: Task Definition Is All You Need
Yiduo Guo, Zhen Guo, Chuanwei Huang, Zi-Ang Wang, Zekai Zhang, Haofei Yu, Huishuai Zhang, Yikang Shen
•
May 18, 2025
•
8
2
共舞時刻!身份保持型多人互動視頻生成
DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation
Junhao Chen, Mingjin Chen, Jianjin Xu, Xiang Li, Junting Dong, Mingze Sun, Puhua Jiang, Hongxiang Li, Yuhang Yang, Hao Zhao, Xiaoxiao Long, Ruqi Huang
•
May 23, 2025
•
6
2
RePrompt:基於強化學習的推理增強型重新提示技術在文本到圖像生成中的應用
RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning
Mingrui Wu, Lu Wang, Pu Zhao, Fangkai Yang, Jianjin Zhang, Jianfeng Liu, Yuefeng Zhan, Weihao Han, Hao Sun, Jiayi Ji, Xiaoshuai Sun, Qingwei Lin, Weiwei Deng, Dongmei Zhang, Feng Sun, Qi Zhang, Rongrong Ji
•
May 23, 2025
•
6
2
Transformer Copilot:基於LLM微調中的錯誤日誌進行學習
Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning
Jiaru Zou, Yikun Ban, Zihao Li, Yunzhe Qi, Ruizhong Qiu, Ling Yang, Jingrui He
•
May 22, 2025
•
6
2
關於基於KL正則化策略梯度算法的大語言模型推理設計
On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning
Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Yang Yuan, Quanquan Gu, Andrew C Yao
•
May 23, 2025
•
5
2
視覺-語言-動作模型的互動式後訓練
Interactive Post-Training for Vision-Language-Action Models
Shuhan Tan, Kairan Dou, Yue Zhao, Philipp Krähenbühl
•
May 22, 2025
•
5
2
ReflAct:基於目標狀態反思的LLM代理世界錨定決策
ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection
Jeonghye Kim, Sojeong Rhee, Minbeom Kim, Dohyung Kim, Sangmook Lee, Youngchul Sung, Kyomin Jung
•
May 21, 2025
•
5
2
大型語言模型僅透過閱讀就能隱含地學會視覺與聽覺理解
Large Language Models Implicitly Learn to See and Hear Just By Reading
Prateek Verma, Mert Pilanci
•
May 20, 2025
•
5
3
價值引導搜索:高效思維鏈推理
Value-Guided Search for Efficient Chain-of-Thought Reasoning
Kaiwen Wang, Jin Peng Zhou, Jonathan Chang, Zhaolin Gao, Nathan Kallus, Kianté Brantley, Wen Sun
•
May 23, 2025
•
4
2
並非所有模型都適合專家卸載:論混合專家模型的本地路由一致性
Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models
Jingcong Liang, Siyuan Wang, Miren Tian, Yitong Li, Duyu Tang, Zhongyu Wei
•
May 21, 2025
•
3
2
確保安全!在問答系統中對抗間接攻擊的大型語言模型上下文安全策略保持基準測試
Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering
Hwan Chang, Yumin Kim, Yonghyun Jun, Hwanhee Lee
•
May 21, 2025
•
3
2
重訪殘差連接:正交更新實現穩定高效的深度網絡
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks
Giyeong Oh, Woohyun Cho, Siyeol Kim, Suhwan Choi, Younjae Yu
•
May 17, 2025
•
3
2
FREESON:基於語料庫遍歷MCTS的無檢索器增強推理
FREESON: Retriever-Free Retrieval-Augmented Reasoning via Corpus-Traversing MCTS
Chaeeun Kim, Seungone Kim
•
May 22, 2025
•
2
2
透過動態筆記撰寫增強大型語言模型的推理能力以應對複雜問答
Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA
Rishabh Maheshwary, Masoud Hashemi, Khyati Mahajan, Shiva Krishna Reddy Malay, Sai Rajeswar, Sathwik Tejaswi Madhusudhan, Spandana Gella, Vikas Yadav
•
May 22, 2025
•
2
2
NOVER:基於無驗證器強化學習的語言模型激勵訓練
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning
Wei Liu, Siya Qi, Xinyu Wang, Chen Qian, Yali Du, Yulan He
•
May 21, 2025
•
2
5
TIME:大型語言模型在現實場景中多層次時間推理的基準測試
TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios
Shaohang Wei, Wei Li, Feifan Song, Wen Luo, Tianyi Zhuang, Haochen Tan, Zhijiang Guo, Houfeng Wang
•
May 19, 2025
•
2
2
尼羅河對話:邁向語言多樣性與文化意識的大語言模型,服務在地社群
NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities
Abdellah El Mekki, Houdaifa Atou, Omer Nacar, Shady Shehata, Muhammad Abdul-Mageed
•
May 23, 2025
•
1
2
伏羲MT:面向中文中心的多語言機器翻譯之大規模語言模型稀疏化
FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation
Shaolin Zhu, Tianyu Dong, Bo Li, Deyi Xiong
•
May 20, 2025
•
1
2
通用生物序列重排序提升全新肽段測序效能
Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing
Zijie Qiu, Jiaqi Wei, Xiang Zhang, Sheng Xu, Kai Zou, Zhi Jin, Zhiqiang Gao, Nanqing Dong, Siqi Sun
•
May 23, 2025
•
0
2