ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
May 9th, 2025
感知、推理、思考與規劃:大型多模態推理模型綜述
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Yunxin Li, Zhenyu Liu, Zitao Li, Xuanyu Zhang, Zhenran Xu, Xinyu Chen, Haoyuan Shi, Shenyuan Jiang, Xintong Wang, Jifang Wang, Shouzheng Huang, Xinping Zhao, Borui Jiang, Lanqing Hong, Longyue Wang, Zhuotao Tian, Baoxing Huai, Wenhan Luo, Weihua Luo, Zheng Zhang, Baotian Hu, Min Zhang
•
May 8, 2025
•
74
1
Flow-GRPO:通過線上強化學習訓練流匹配模型
Flow-GRPO: Training Flow Matching Models via Online RL
Jie Liu, Gongye Liu, Jiajun Liang, Yangguang Li, Jiaheng Liu, Xintao Wang, Pengfei Wan, Di Zhang, Wanli Ouyang
•
May 8, 2025
•
34
2
可擴展的思維鏈接:基於彈性推理
Scalable Chain of Thoughts via Elastic Reasoning
Yuhui Xu, Hanze Dong, Lei Wang, Doyen Sahoo, Junnan Li, Caiming Xiong
•
May 8, 2025
•
16
1
作為評判者的感知代理:評估大型語言模型中的高階社會認知
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
Bang Zhang, Ruotian Ma, Qingxuan Jiang, Peisong Wang, Jiaqi Chen, Zheng Xie, Xingyu Chen, Yue Wang, Fanghua Ye, Jian Li, Yifan Yang, Zhaopeng Tu, Xiaolong Li
•
May 1, 2025
•
16
3
三維場景生成:綜述
3D Scene Generation: A Survey
Beichen Wen, Haozhe Xie, Zhaoxi Chen, Fangzhou Hong, Ziwei Liu
•
May 8, 2025
•
10
1
FG-CLIP:細粒度視覺與文本對齊
FG-CLIP: Fine-Grained Visual and Textual Alignment
Chunyu Xie, Bin Wang, Fanjing Kong, Jincheng Li, Dawei Liang, Gengshen Zhang, Dawei Leng, Yuhui Yin
•
May 8, 2025
•
10
1
ICon:自動化數據選擇中的上下文貢獻
ICon: In-Context Contribution for Automatic Data Selection
Yixin Yang, Qingxiu Dong, Linli Yao, Fangwei Zhu, Zhifang Sui
•
May 8, 2025
•
9
1
從文本生成物理穩定且可建造的樂高設計
Generating Physically Stable and Buildable LEGO Designs from Text
Ava Pun, Kangle Deng, Ruixuan Liu, Deva Ramanan, Changliu Liu, Jun-Yan Zhu
•
May 8, 2025
•
7
1
StreamBridge:將您的離線視頻大型語言模型轉變為主動式串流助手
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant
Haibo Wang, Bo Feng, Zhengfeng Lai, Mingze Xu, Shiyu Li, Weifeng Ge, Afshin Dehghan, Meng Cao, Ping Huang
•
May 8, 2025
•
7
1
X-Reasoner:邁向跨模態與跨領域的通用推理能力
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
Qianchu Liu, Sheng Zhang, Guanghui Qin, Timothy Ossowski, Yu Gu, Ying Jin, Sid Kiblawi, Sam Preston, Mu Wei, Paul Vozila, Tristan Naumann, Hoifung Poon
•
May 6, 2025
•
7
2
LiftFeat:基於3D幾何感知的局部特徵匹配
LiftFeat: 3D Geometry-Aware Local Feature Matching
Yepeng Liu, Wenpeng Lai, Zhou Zhao, Yuxuan Xiong, Jinchi Zhu, Jun Cheng, Yongchao Xu
•
May 6, 2025
•
6
1
跨語言推理通過測試時縮放
Crosslingual Reasoning through Test-Time Scaling
Zheng-Xin Yong, M. Farid Adilazuarda, Jonibek Mansurov, Ruochen Zhang, Niklas Muennighoff, Carsten Eickhoff, Genta Indra Winata, Julia Kreutzer, Stephen H. Bach, Alham Fikri Aji
•
May 8, 2025
•
5
1
PlaceIt3D:語言引導的物體放置於真實3D場景中
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
Ahmed Abdelreheem, Filippo Aleotti, Jamie Watson, Zawar Qureshi, Abdelrahman Eldesokey, Peter Wonka, Gabriel Brostow, Sara Vicente, Guillermo Garcia-Hernando
•
May 8, 2025
•
5
1
WaterDrum:面向数据去学习度量的水印技术
WaterDrum: Watermarking for Data-centric Unlearning Metric
Xinyang Lu, Xinyuan Niu, Gregory Kang Ruey Lau, Bui Thi Cam Nhung, Rachael Hwee Ling Sim, Fanyu Wen, Chuan-Sheng Foo, See-Kiong Ng, Bryan Kian Hsiang Low
•
May 8, 2025
•
4
1
將價值重新注入強化學習:通過統一大型語言模型推理器與驗證器實現更好的測試時擴展
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers
Kusha Sareen, Morgane M Moss, Alessandro Sordoni, Rishabh Agarwal, Arian Hosseini
•
May 7, 2025
•
4
1
BrowseComp-ZH:大型語言模型中文網頁瀏覽能力基準測試
BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese
Peilin Zhou, Bruce Leon, Xiang Ying, Can Zhang, Yifan Shao, Qichen Ye, Dading Chong, Zhiling Jin, Chenxuan Xie, Meng Cao, Yuxin Gu, Sixin Hong, Jing Ren, Jian Chen, Chao Liu, Yining Hua
•
Apr 27, 2025
•
4
1
視覺-語言-行動模型:概念、進展、應用與挑戰
Vision-Language-Action Models: Concepts, Progress, Applications and Challenges
Ranjan Sapkota, Yang Cao, Konstantinos I. Roumeliotis, Manoj Karkee
•
May 7, 2025
•
3
1
SIMPLEMIX:在語言模型偏好學習中簡單混合離線與在線數據的簡易方法
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
Tianjian Li, Daniel Khashabi
•
May 5, 2025
•
3
1
鏈式思考標記是計算機程序中的變量。
Chain-of-Thought Tokens are Computer Program Variables
Fangwei Zhu, Peiyi Wang, Zhifang Sui
•
May 8, 2025
•
1
1