ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
April 29th, 2025
RepText:通過複製實現視覺文本渲染
RepText: Rendering Visual Text via Replicating
Haofan Wang, Yujia Xu, Yimeng Li, Junchen Li, Chaowei Zhang, Jing Wang, Kejia Yang, Zhibo Chen
•
Apr 28, 2025
•
27
3
LLM中的臨床知識並未轉化為人際互動能力
Clinical knowledge in LLMs does not translate to human interactions
Andrew M. Bean, Rebecca Payne, Guy Parsons, Hannah Rose Kirk, Juan Ciro, Rafael Mosquera, Sara Hincapié Monsalve, Aruna S. Ekanayaka, Lionel Tarassenko, Luc Rocher, Adam Mahdi
•
Apr 26, 2025
•
20
3
LLM驅動的GUI代理在手機自動化中的應用:進展與前景綜述
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
Guangyi Liu, Pengxiang Zhao, Liang Liu, Yaxuan Guo, Han Xiao, Weifeng Lin, Yuxiang Chai, Yue Han, Shuai Ren, Hao Wang, Xiaoyu Liang, Wenhao Wang, Tianze Wu, Linghao Li, Hao Wang, Guanjing Xiong, Yong Liu, Hongsheng Li
•
Apr 28, 2025
•
19
4
SPC:通過對抗性遊戲進化自我對弈評判器以提升大語言模型推理能力
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
Jiaqi Chen, Bang Zhang, Ruotian Ma, Peisong Wang, Xiaodan Liang, Zhaopeng Tu, Xiaolong Li, Kwan-Yee K. Wong
•
Apr 27, 2025
•
14
2
CipherBank:透過密碼學挑戰探索大型語言模型推理能力的邊界
CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges
Yu Li, Qizhi Pei, Mengyuan Sun, Honglin Lin, Chenlin Ming, Xin Gao, Jiang Wu, Conghui He, Lijun Wu
•
Apr 27, 2025
•
14
4
多模态数学推理基准测试:显式视觉依赖关系
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency
Zhikai Wang, Jiashuo Sun, Wenqi Zhang, Zhiqiang Hu, Xin Li, Fan Wang, Deli Zhao
•
Apr 24, 2025
•
8
2
MMInference:通過模態感知置換稀疏注意力加速長上下文視覺語言模型的預填充
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention
Yucheng Li, Huiqiang Jiang, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Amir H. Abdi, Dongsheng Li, Jianfeng Gao, Yuqing Yang, Lili Qiu
•
Apr 22, 2025
•
8
2
基於提示控制的通用歌曲生成框架
Versatile Framework for Song Generation with Prompt-based Control
Yu Zhang, Wenxiang Guo, Changhao Pan, Zhiyuan Zhu, Ruiqi Li, Jingyu Lu, Rongjie Huang, Ruiyuan Zhang, Zhiqing Hong, Ziyue Jiang, Zhou Zhao
•
Apr 27, 2025
•
7
2
群下采样与等變抗鋸齒
Group Downsampling with Equivariant Anti-aliasing
Md Ashiqur Rahman, Raymond A. Yeh
•
Apr 24, 2025
•
7
2
NORA:一個小型開源的通用視覺語言行動模型,專為具身任務設計
NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
Chia-Yu Hung, Qi Sun, Pengfei Hong, Amir Zadeh, Chuan Li, U-Xuan Tan, Navonil Majumder, Soujanya Poria
•
Apr 28, 2025
•
5
2
TrustGeoGen:可擴展且形式化驗證的數據引擎,用於可信賴的多模態幾何問題求解
TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
Daocheng Fu, Zijun Chen, Renqiu Xia, Qi Liu, Yuan Feng, Hongbin Zhou, Renrui Zhang, Shiyang Feng, Peng Gao, Junchi Yan, Botian Shi, Bo Zhang, Yu Qiao
•
Apr 22, 2025
•
5
2
Mem0:構建具備可擴展長期記憶的生產級AI代理
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
Prateek Chhikara, Dev Khant, Saket Aryan, Taranjeet Singh, Deshraj Yadav
•
Apr 28, 2025
•
3
2
ICL密碼:通過替換密碼量化上下文學習中的“學習”程度
ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers
Zhouxiang Fang, Aayush Mishra, Muhan Gao, Anqi Liu, Daniel Khashabi
•
Apr 28, 2025
•
3
2
ChiseLLM:釋放推理型大型語言模型在Chisel敏捷硬體開發中的潛力
ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development
Bowei Wang, Jiaran Gao, Yelai Feng, Renzhi Chen, Shanshan Li, Lei Wang
•
Apr 27, 2025
•
3
2