ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
March 3rd, 2025
DexGraspVLA:邁向通用靈巧抓取的視覺-語言-動作框架
DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping
Yifan Zhong, Xuchuan Huang, Ruochong Li, Ceyao Zhang, Yitao Liang, Yaodong Yang, Yuanpei Chen
•
Feb 28, 2025
•
9
2
DeepSolution:透過基於樹狀結構的探索與雙點思維提升複雜工程方案設計
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking
Zhuoqun Li, Haiyang Yu, Xuanang Chen, Hongyu Lin, Yaojie Lu, Fei Huang, Xianpei Han, Yongbin Li, Le Sun
•
Feb 28, 2025
•
40
4
SoS1:O1與R1類推理大型語言模型是平方和求解器
SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers
Kechen Li, Wenqi Zhu, Coralia Cartis, Tianbo Ji, Shiwei Liu
•
Feb 27, 2025
•
22
2
LiteASR:基於低秩逼近的高效自動語音辨識
LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
Keisuke Kamahori, Jungo Kasai, Noriyuki Kojima, Baris Kasikci
•
Feb 27, 2025
•
13
2
偏好學習釋放大語言模型的心理諮詢潛能
Preference Learning Unlocks LLMs' Psycho-Counseling Skills
Mian Zhang, Shaun M. Eack, Zhiyu Zoey Chen
•
Feb 27, 2025
•
7
2
告訴我原因:視覺基礎模型作為自我解釋的分類器
Tell me why: Visual foundation models as self-explainable classifiers
Hugues Turbé, Mina Bjelogrlic, Gianmarco Mengaldo, Christian Lovis
•
Feb 26, 2025
•
11
2
鏈式草稿:以少寫多,加速思考
Chain of Draft: Thinking Faster by Writing Less
Silei Xu, Wenhao Xie, Lingxiao Zhao, Pengcheng He
•
Feb 25, 2025
•
48
4
最佳腦細胞凋零
Optimal Brain Apoptosis
Mingyuan Sun, Zheng Fang, Jiaxu Wang, Junjie Jiang, Delei Kong, Chenming Hu, Yuetong Fang, Renjing Xu
•
Feb 25, 2025
•
10
2
我們能在多大程度上利用ImageNet進行文本到圖像的生成?
How far can we go with ImageNet for Text-to-Image generation?
L. Degeorge, A. Ghosh, N. Dufour, D. Picard, V. Kalogeiton
•
Feb 28, 2025
•
26
2
LettuceDetect:面向RAG應用的幻覺檢測框架
LettuceDetect: A Hallucination Detection Framework for RAG Applications
Ádám Kovács, Gábor Recski
•
Feb 24, 2025
•
11
2
HAIC:通過提升多模態大型語言模型的字幕質量來增進人類行為理解與生成
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Xiao Wang, Jingyun Hua, Weihong Lin, Yuanxing Zhang, Fuzheng Zhang, Jianlong Wu, Di Zhang, Liqiang Nie
•
Feb 28, 2025
•
2
2
預測性數據選擇:能預測的數據即是能教學的數據
Predictive Data Selection: The Data That Predicts Is the Data That Teaches
Kashun Shum, Yuzhen Huang, Hongjian Zou, Ding Qi, Yixuan Liao, Xiaoxin Chen, Qian Liu, Junxian He
•
Mar 2, 2025
•
57
2
MIGE:基於多模態指令的圖像生成與編輯統一框架
MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing
Xueyun Tian, Wei Li, Bingbing Xu, Yige Yuan, Yuanzhuo Wang, Huawei Shen
•
Feb 28, 2025
•
5
2
通過單步獎勵實現多輪代碼生成
Multi-Turn Code Generation Through Single-Step Rewards
Arnav Kumar Jain, Gonzalo Gonzalez-Pumariega, Wayne Chen, Alexander M Rush, Wenting Zhao, Sanjiban Choudhury
•
Feb 27, 2025
•
31
2
TeleRAG:具前瞻性檢索的高效檢索增強生成推理
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval
Chien-Yu Lin, Keisuke Kamahori, Yiyu Liu, Xiaoxiang Shi, Madhav Kashyap, Yile Gu, Rulin Shao, Zihao Ye, Kan Zhu, Stephanie Wang, Arvind Krishnamurthy, Rohan Kadekodi, Luis Ceze, Baris Kasikci
•
Feb 28, 2025
•
11
2
EgoNormia:物理社交規範理解的基準測試
EgoNormia: Benchmarking Physical Social Norm Understanding
MohammadHossein Rezaei, Yicheng Fu, Phil Cuvin, Caleb Ziems, Yanzhe Zhang, Hao Zhu, Diyi Yang
•
Feb 27, 2025
•
5
2
ViDoRAG:基於動態迭代推理代理的視覺文件檢索增強生成
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents
Qiuchen Wang, Ruixue Ding, Zehui Chen, Weiqi Wu, Shihang Wang, Pengjun Xie, Feng Zhao
•
Feb 25, 2025
•
20
2
基於視覺的人形機器人靈巧操作之模擬到現實強化學習
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids
Toru Lin, Kartik Sachdev, Linxi Fan, Jitendra Malik, Yuke Zhu
•
Feb 27, 2025
•
16
2