ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
April 29th, 2025
RepText:通过复制实现视觉文本渲染
RepText: Rendering Visual Text via Replicating
Haofan Wang, Yujia Xu, Yimeng Li, Junchen Li, Chaowei Zhang, Jing Wang, Kejia Yang, Zhibo Chen
•
Apr 28, 2025
•
25
3
大语言模型中的临床知识无法直接转化为人际互动能力
Clinical knowledge in LLMs does not translate to human interactions
Andrew M. Bean, Rebecca Payne, Guy Parsons, Hannah Rose Kirk, Juan Ciro, Rafael Mosquera, Sara Hincapié Monsalve, Aruna S. Ekanayaka, Lionel Tarassenko, Luc Rocher, Adam Mahdi
•
Apr 26, 2025
•
20
3
LLM驱动的GUI代理在手机自动化中的应用:进展与前景综述
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
Guangyi Liu, Pengxiang Zhao, Liang Liu, Yaxuan Guo, Han Xiao, Weifeng Lin, Yuxiang Chai, Yue Han, Shuai Ren, Hao Wang, Xiaoyu Liang, Wenhao Wang, Tianze Wu, Linghao Li, Hao Wang, Guanjing Xiong, Yong Liu, Hongsheng Li
•
Apr 28, 2025
•
19
4
SPC:通过对抗性游戏进化自博弈评判器以提升大语言模型推理能力
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
Jiaqi Chen, Bang Zhang, Ruotian Ma, Peisong Wang, Xiaodan Liang, Zhaopeng Tu, Xiaolong Li, Kwan-Yee K. Wong
•
Apr 27, 2025
•
14
2
CipherBank:通过密码学挑战探索大语言模型推理能力的边界
CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges
Yu Li, Qizhi Pei, Mengyuan Sun, Honglin Lin, Chenlin Ming, Xin Gao, Jiang Wu, Conghui He, Lijun Wu
•
Apr 27, 2025
•
14
4
多模态数学推理基准测试:显式视觉依赖关系
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency
Zhikai Wang, Jiashuo Sun, Wenqi Zhang, Zhiqiang Hu, Xin Li, Fan Wang, Deli Zhao
•
Apr 24, 2025
•
8
2
MMInference:通过模态感知排列稀疏注意力加速长上下文视觉语言模型的预填充
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention
Yucheng Li, Huiqiang Jiang, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Amir H. Abdi, Dongsheng Li, Jianfeng Gao, Yuqing Yang, Lili Qiu
•
Apr 22, 2025
•
8
2
基于等变性的群下采样与抗锯齿技术
Group Downsampling with Equivariant Anti-aliasing
Md Ashiqur Rahman, Raymond A. Yeh
•
Apr 24, 2025
•
6
2
NORA:一款面向具身任务的小型开源通用视觉语言动作模型
NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
Chia-Yu Hung, Qi Sun, Pengfei Hong, Amir Zadeh, Chuan Li, U-Xuan Tan, Navonil Majumder, Soujanya Poria
•
Apr 28, 2025
•
5
2
TrustGeoGen:面向可信多模态几何问题求解的可扩展形式化验证数据引擎
TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
Daocheng Fu, Zijun Chen, Renqiu Xia, Qi Liu, Yuan Feng, Hongbin Zhou, Renrui Zhang, Shiyang Feng, Peng Gao, Junchi Yan, Botian Shi, Bo Zhang, Yu Qiao
•
Apr 22, 2025
•
5
2
Mem0:构建具备可扩展长期记忆的生产级AI代理
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
Prateek Chhikara, Dev Khant, Saket Aryan, Taranjeet Singh, Deshraj Yadav
•
Apr 28, 2025
•
3
2
ICL密码:通过替换密码量化上下文学习中的“学习”能力
ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers
Zhouxiang Fang, Aayush Mishra, Muhan Gao, Anqi Liu, Daniel Khashabi
•
Apr 28, 2025
•
3
2
ChiseLLM:释放推理大语言模型潜力,助力Chisel敏捷硬件开发
ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development
Bowei Wang, Jiaran Gao, Yelai Feng, Renzhi Chen, Shanshan Li, Lei Wang
•
Apr 27, 2025
•
3
2
基于提示控制的通用歌曲生成框架
Versatile Framework for Song Generation with Prompt-based Control
Yu Zhang, Wenxiang Guo, Changhao Pan, Zhiyuan Zhu, Ruiqi Li, Jingyu Lu, Rongjie Huang, Ruiyuan Zhang, Zhiqing Hong, Ziyue Jiang, Zhou Zhao
•
Apr 27, 2025
•
2
2