ChatPaper.ai
Daily Picks of AI Research Papers
A daily curated selection of AI research papers, with translations
March 31st, 2025
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
Jiakai Tang, Sunhao Dai, Teng Shi, Jun Xu, Xu Chen, Wen Chen, Wu Jian, Yuning Jiang
Mar 28, 2025
35
2
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics
Lee Chae-Yeon, Oh Hyun-Bin, Han EunGi, Kim Sung-Bin, Suekyeong Nam, Tae-Hyun Oh
Mar 26, 2025
22
3
MedAgent-Pro: Towards Multi-modal Evidence-based Medical Diagnosis via Reasoning Agentic Workflow
Ziyue Wang, Junde Wu, Chang Han Low, Yueming Jin
Mar 21, 2025
6
2
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
Yunhong Min, Daehyeon Choi, Kyeongmin Yeo, Jihyun Lee, Minhyuk Sung
Mar 28, 2025
24
3
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
Wei Shen, Guanlin Liu, Zheng Wu, Ruofei Zhu, Qingping Yang, Chao Xin, Yu Yue, Lin Yan
Mar 28, 2025
44
2
PHYSICS: Benchmarking Foundation Models on University-Level Physics Problem Solving
Kaiyue Feng, Yilun Zhao, Yixin Liu, Tianyu Yang, Chen Zhao, John Sous, Arman Cohan
Mar 26, 2025
17
2
Reconstructing Humans with a Biomechanically Accurate Skeleton
Yan Xia, Xiaowei Zhou, Etienne Vouga, Qixing Huang, Georgios Pavlakos
Mar 27, 2025
9
2
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
Xiaoye Qu, Yafu Li, Zhaochen Su, Weigao Sun, Jianhao Yan, Dongrui Liu, Ganqu Cui, Daizong Liu, Shuxian Liang, Junxian He, Peng Li, Wei Wei, Jing Shao, Chaochao Lu, Yue Zhang, Xian-Sheng Hua, Bowen Zhou, Yu Cheng
Mar 27, 2025
39
4
Your ViT is Secretly an Image Segmentation Model
Tommie Kerssies, Niccolò Cavagnero, Alexander Hermans, Narges Norouzi, Giuseppe Averta, Bastian Leibe, Gijs Dubbelman, Daan de Geus
Mar 24, 2025
21
2
Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging
Chongjie Ye, Yushuang Wu, Ziteng Lu, Jiahao Chang, Xiaoyang Guo, Jiaqing Zhou, Hao Zhao, Xiaoguang Han
Mar 28, 2025
11
2
Challenges and Paths Towards AI for Software Engineering
Alex Gu, Naman Jain, Wen-Ding Li, Manish Shetty, Yijia Shao, Ziyang Li, Diyi Yang, Kevin Ellis, Koushik Sen, Armando Solar-Lezama
Mar 28, 2025
4
2
OThink-MR1: Stimulating Multimodal Generalized Reasoning Capabilities via Dynamic Reinforcement Learning
Zhiyuan Liu, Yuting Zhang, Feng Liu, Changwang Zhang, Ying Sun, Jun Wang
Mar 20, 2025
26
3
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation
Itay Nakash, Nitay Calderon, Eyal Ben David, Elad Hoffer, Roi Reichart
Mar 25, 2025
75
2
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
Xianglong He, Zi-Xin Zou, Chia-Hao Chen, Yuan-Chen Guo, Ding Liang, Chun Yuan, Wanli Ouyang, Yan-Pei Cao, Yangguang Li
Mar 27, 2025
9
2
A Refined Analysis of Massive Activations in LLMs
Louis Owen, Nilabhra Roy Chowdhury, Abhay Kumar, Fabian Güra
Mar 28, 2025
14
3
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model
Jangho Park, Taesung Kwon, Jong Chul Ye
Mar 28, 2025
18
2
Segment Any Motion in Videos
Nan Huang, Wenzhao Zheng, Chenfeng Xu, Kurt Keutzer, Shanghang Zhang, Angjoo Kanazawa, Qianqian Wang
Mar 28, 2025
17
2
ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback
Taewon Yun, Jihwan Oh, Hyangsuk Min, Yuho Lee, Jihwan Bang, Jason Cai, Hwanjun Song
Mar 27, 2025
20
3
SWI: Speaking with Intent in Large Language Models
Yuwei Yin, EunJeong Hwang, Giuseppe Carenini
Mar 27, 2025
2
2
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
Tianqi Liu, Zihao Huang, Zhaoxi Chen, Guangcong Wang, Shoukang Hu, Liao Shen, Huiqiang Sun, Zhiguo Cao, Wei Li, Ziwei Liu
Mar 26, 2025
21
2
X^{2}-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction
Weihao Yu, Yuanhao Cai, Ruyi Zha, Zhiwen Fan, Chenxin Li, Yixuan Yuan
Mar 27, 2025
3
2
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding
Wenxuan Zhu, Bing Li, Cheng Zheng, Jinjie Mai, Jun Chen, Letian Jiang, Abdullah Hamdi, Sara Rojas Martinez, Chia-Wen Lin, Mohamed Elhoseiny, Bernard Ghanem
Mar 22, 2025
8
3
On Large Multimodal Models as Open-World Image Classifiers
Alessandro Conti, Massimiliano Mancini, Enrico Fini, Yiming Wang, Paolo Rota, Elisa Ricci
Mar 27, 2025
5
2