ChatPaper.aiChatPaper.ai
首页

arXiv

HuggingFace

定价账户工作台

•
•

•
•

•
•

•
•

•
•

Footer

Company name

ChatPaper.ai: Your advanced AI reading assistant.

Contact us: [email protected]

X (Twitter)

Products

  • AI Search
  • AI Mind Map
  • Arxiv Summary
  • Huggingface Summary

Support

  • FAQ
  • Contact

Company

  • Blog
  • Privacy Policy
  • Terms of Service

Available Languages

  • 🇬🇧English
  • 🇨🇳中文简体
  • 🇭🇰繁體中文
  • 🇯🇵日本語
  • 🇰🇷한국어
  • 🇩🇪Deutsch
  • 🇫🇷Français
  • 🇷🇺Русский
  • 🇪🇸Español

© 2025 chatpaper.ai All rights reserved.

AI研究论文每日精选

每日精选AI研究论文及翻译

明日是否依然成立?多语言常青问题分类提升可信问答系统
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Sergey Pletenev, Maria Marina, Nikolay Ivanov, Daria Galimzianova, Nikita Krayko, Mikhail Salnikov, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii•May 27, 2025•834

利用自注意力机制实现大语言模型中的输入相关软提示
Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs

Ananth Muppidi, Abhilash Nandy, Sambaran Bandyopadhyay•Jun 5, 2025•281

FusionAudio-1.2M:迈向多模态上下文融合的细粒度音频描述
FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

Shunian Chen, Xinyuan Xie, Zheshu Chen, Liyan Zhao, Owen Lee, Zhan Su, Qilin Sun, Benyou Wang•Jun 1, 2025•272

MORSE-500:一个可编程控制的视频基准测试集,用于压力测试多模态推理能力
MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Zikui Cai, Andrew Wang, Anirudh Satheesh, Ankit Nakhawa, Hyunwoo Jae, Keenan Powell, Minghui Liu, Neel Jay, Sungbin Oh, Xiyao Wang, Yongyuan Liang, Tom Goldstein, Furong Huang•Jun 5, 2025•261

PartCrafter:基于组合式潜在扩散Transformer的结构化三维网格生成
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Yuchen Lin, Chenguo Lin, Panwang Pan, Honglei Yan, Yiqiang Feng, Yadong Mu, Katerina Fragkiadaki•Jun 5, 2025•212

Sentinel:防御提示注入攻击的顶尖模型
Sentinel: SOTA model to protect against prompt injections

Dror Ivry, Oran Nahum•Jun 5, 2025•191

扩展模态是实现全模态的正确路径吗?
Is Extending Modality The Right Path Towards Omni-Modality?

Tinghui Zhu, Kai Zhang, Muhao Chen, Yu Su•Jun 2, 2025•182

STARFlow:面向高分辨率图像生成的扩展型潜在归一化流
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

Jiatao Gu, Tianrong Chen, David Berthelot, Huangjie Zheng, Yuyang Wang, Ruixiang Zhang, Laurent Dinh, Miguel Angel Bautista, Josh Susskind, Shuangfei Zhai•Jun 6, 2025•151

音频感知大语言模型作为口语风格的评判者
Audio-Aware Large Language Models as Judges for Speaking Styles

Cheng-Han Chiang, Xiaofei Wang, Chung-Ching Lin, Kevin Lin, Linjie Li, Radu Kopetz, Yao Qian, Zhendong Wang, Zhengyuan Yang, Hung-yi Lee, Lijuan Wang•Jun 6, 2025•123

医疗世界模型:面向治疗规划的肿瘤演化生成式模拟
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning

Yijun Yang, Zhao-Yang Wang, Qiuping Liu, Shuwen Sun, Kang Wang, Rama Chellappa, Zongwei Zhou, Alan Yuille, Lei Zhu, Yu-Dong Zhang, Jieneng Chen•Jun 2, 2025•122

视角融合:基于第一人称与第三人称视觉的跨视角协同智能研究综述
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision

Yuping He, Yifei Huang, Guo Chen, Lidong Lu, Baoqi Pei, Jilan Xu, Tong Lu, Yoichi Sato•Jun 6, 2025•61

3DFlowAction:从3D流态世界中学习跨实体操作模型
3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model

Hongyan Zhi, Peihao Chen, Siyuan Zhou, Yubo Dong, Quanxi Wu, Lei Han, Mingkui Tan•Jun 6, 2025•51

CodeContests+:面向竞技编程的高质量测试用例生成
CodeContests+: High-Quality Test Case Generation for Competitive Programming

Zihan Wang, Siyao Liu, Yang Sun, Hongyan Li, Kai Shen•Jun 6, 2025•51

HASHIRU:面向混合智能资源利用的分层代理系统
HASHIRU: Hierarchical Agent System for Hybrid Intelligent Resource Utilization

Kunal Pai, Parth Shah, Harshil Patel•Jun 1, 2025•51

同行评审精度:基于DataSeeds标注图像构建视觉模型微调的基础数据集
Peer-Ranked Precision: Creating a Foundational Dataset for Fine-Tuning Vision Models from DataSeeds' Annotated Imagery

Sajjad Abdoli, Freeman Lewin, Gediminas Vasiliauskas, Fabian Schonholz•Jun 6, 2025•41

前缀分组器:通过共享前缀前向传播实现高效的GRPO训练
Prefix Grouper: Efficient GRPO Training through Shared-Prefix Forward

Zikang Liu, Tongtian Yue, Yepeng Tang, Longteng Guo, Junxian Cai, Qingbin Liu, Xi Chen, Jing Liu•Jun 5, 2025•41

MIRIAD:通过数百万医疗问答对增强大型语言模型
MIRIAD: Augmenting LLMs with millions of medical query-response pairs

Qinyue Zheng, Salman Abdullah, Sam Rawal, Cyril Zakka, Sophie Ostmeier, Maximilian Purk, Eduardo Reis, Eric J. Topol, Jure Leskovec, Michael Moor•Jun 6, 2025•31

当模型所知超越其解释能力:量化人机协作中的知识迁移
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration

Quan Shi, Carlos E. Jimenez, Shunyu Yao, Nick Haber, Diyi Yang, Karthik Narasimhan•Jun 5, 2025•31

当语义误导视觉:缓解大规模多模态模型在场景文本检测与理解中的幻觉问题
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Yan Shu, Hangui Lin, Yexin Liu, Yan Zhang, Gangyan Zeng, Yan Li, Yu Zhou, Ser-Nam Lim, Harry Yang, Nicu Sebe•Jun 5, 2025•31

物理场景的Splatting:从非完美机器人数据到端到端的真实到仿真转换
Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data

Ben Moran, Mauro Comi, Steven Bohez, Tom Erez, Zhibin Li, Leonard Hasenclever•Jun 4, 2025•32

少数中的真理:高效多模态推理的高价值数据选择
Truth in the Few: High-Value Data Selection for Efficient Multi-Modal Reasoning

Shenshen Li, Kaiyuan Deng, Lei Wang, Hao Yang, Chong Peng, Peng Yan, Fumin Shen, Heng Tao Shen, Xing Xu•Jun 5, 2025•21

GuideX:面向零样本信息抽取的引导式合成数据生成
GuideX: Guided Synthetic Data Generation for Zero-Shot Information Extraction

Neil De La Fuente, Oscar Sainz, Iker García-Ferrero, Eneko Agirre•May 31, 2025•22

稀疏化状态空间模型是高效的高速网络架构。
Sparsified State-Space Models are Efficient Highway Networks

Woomin Song, Jihoon Tack, Sangwoo Mo, Seunghyuk Oh, Jinwoo Shin•May 27, 2025•12

AssetOpsBench:工业资产运维任务自动化AI代理基准测试平台
AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance

Dhaval Patel, Shuxin Lin, James Rayfield, Nianjun Zhou, Roman Vaculin, Natalia Martinez, Fearghal O'donncha, Jayant Kalagnanam•Jun 4, 2025•02