ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
October 8th, 2024
SwiftKV:具有知识保留模型转换的快速预填充优化推断
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
Aurick Qiao, Zhewei Yao, Samyam Rajbhandari, Yuxiong He
•
Oct 4, 2024
•
2
2
选择:图像分类数据整理策略的大规模基准测试
SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Benjamin Feuer, Jiawei Xu, Niv Cohen, Patrick Yubeaton, Govind Mittal, Chinmay Hegde
•
Oct 7, 2024
•
7
2
像人类一样在数字世界中导航:GUI代理的通用视觉基础
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Boyu Gou, Ruohan Wang, Boyuan Zheng, Yanan Xie, Cheng Chang, Yiheng Shu, Huan Sun, Yu Su
•
Oct 7, 2024
•
19
2
MathHay:一种用于LLM中长文本数学推理的自动化基准测试
MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs
Lei Wang, Shan Dong, Yuhui Xu, Hanze Dong, Yalu Wang, Amrita Saha, Ee-Peng Lim, Caiming Xiong, Doyen Sahoo
•
Oct 7, 2024
•
13
3
快速!压缩步骤和层以加速音乐生成
Presto! Distilling Steps and Layers for Accelerating Music Generation
Zachary Novack, Ge Zhu, Jonah Casebeer, Julian McAuley, Taylor Berg-Kirkpatrick, Nicholas J. Bryan
•
Oct 7, 2024
•
18
4
LLaMA-Berry:O1级奥林匹克水平数学推理的成对优化
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Di Zhang, Jianbo Wu, Jingdi Lei, Tong Che, Jiatong Li, Tong Xie, Xiaoshui Huang, Shufei Zhang, Marco Pavone, Yuqiang Li, Wanli Ouyang, Dongzhan Zhou
•
Oct 3, 2024
•
55
4
临床实体识别基准数据集
Named Clinical Entity Recognition Benchmark
Wadood M Abdul, Marco AF Pimentel, Muhammad Umar Salman, Tathagata Raha, Clément Christophe, Praveen K Kanithi, Nasir Hayat, Ronnie Rajan, Shadab Khan
•
Oct 7, 2024
•
17
3
UniMuMo: 统一文本、音乐和动作生成
UniMuMo: Unified Text, Music and Motion Generation
Han Yang, Kun Su, Yutong Zhang, Jiaben Chen, Kaizhi Qian, Gaowen Liu, Chuang Gan
•
Oct 6, 2024
•
19
2
从文本指令中实现角色-场景自主交互合成
Autonomous Character-Scene Interaction Synthesis from Text Instruction
Nan Jiang, Zimo He, Zi Wang, Hongjie Li, Yixin Chen, Siyuan Huang, Yixin Zhu
•
Oct 4, 2024
•
7
2
GSM-Symbolic:理解大型语言模型中数学推理的局限性
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
Iman Mirzadeh, Keivan Alizadeh, Hooman Shahrokhi, Oncel Tuzel, Samy Bengio, Mehrdad Farajtabar
•
Oct 7, 2024
•
22
6
ScienceAgentBench:朝着数据驱动科学发现的语言代理严格评估的方向前进
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Ziru Chen, Shijie Chen, Yuting Ning, Qianheng Zhang, Boshi Wang, Botao Yu, Yifei Li, Zeyi Liao, Chen Wei, Zitong Lu, Vishal Dey, Mingyi Xue, Frazier N. Baker, Benjamin Burns, Daniel Adu-Ampratwum, Xuhui Huang, Xia Ning, Song Gao, Yu Su, Huan Sun
•
Oct 7, 2024
•
21
2
简要总结:大规模视觉语言模型的令牌级侦探奖励模型
TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Deqing Fu, Tong Xiao, Rui Wang, Wang Zhu, Pengchuan Zhang, Guan Pang, Robin Jia, Lawrence Chen
•
Oct 7, 2024
•
17
2
差动变压器
Differential Transformer
Tianzhu Ye, Li Dong, Yuqing Xia, Yutao Sun, Yi Zhu, Gao Huang, Furu Wei
•
Oct 7, 2024
•
178
35
在视频传播中重新定义时间建模:矢量化时间步进方法
Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach
Yaofang Liu, Yumeng Ren, Xiaodong Cun, Aitor Artola, Yang Liu, Tieyong Zeng, Raymond H. Chan, Jean-michel Morel
•
Oct 4, 2024
•
5
2
在多视角指代交流中对语言进行基础化
Grounding Language in Multi-Perspective Referential Communication
Zineng Tang, Lingjun Mao, Alane Suhr
•
Oct 4, 2024
•
4
2
在规模上合并模型时有哪些要点?
What Matters for Model Merging at Scale?
Prateek Yadav, Tu Vu, Jonathan Lai, Alexandra Chronopoulou, Manaal Faruqui, Mohit Bansal, Tsendsuren Munkhdalai
•
Oct 4, 2024
•
8
2
OmniBooth:使用多模态指导学习图像合成的潜在控制
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction
Leheng Li, Weichao Qiu, Xu Yan, Jing He, Kaiqiang Zhou, Yingjie Cai, Qing Lian, Bingbing Liu, Ying-Cong Chen
•
Oct 7, 2024
•
9
2
LLM知道的比它们展示的更多:关于LLM内在表示的幻觉
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
Hadas Orgad, Michael Toker, Zorik Gekhman, Roi Reichart, Idan Szpektor, Hadas Kotek, Yonatan Belinkov
•
Oct 3, 2024
•
49
5
FAN:傅立叶分析网络
FAN: Fourier Analysis Networks
Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang, Jia Li, Jing Su, Jun Zhang, Jingjing Xu
•
Oct 3, 2024
•
27
6
MonST3R:一种在运动存在的情况下估计几何形状的简单方法
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
Junyi Zhang, Charles Herrmann, Junhwa Hur, Varun Jampani, Trevor Darrell, Forrester Cole, Deqing Sun, Ming-Hsuan Yang
•
Oct 4, 2024
•
19
3
视频指导:通过教师指导改进视频扩散模型,无需训练
VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide
Dohun Lee, Bryan S Kim, Geon Yeong Park, Jong Chul Ye
•
Oct 6, 2024
•
30
3
TurtleBench:通过现实世界的是/否谜题评估顶级语言模型
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles
Qingchen Yu, Shichao Song, Ke Fang, Yunfeng Shi, Zifan Zheng, Hanyu Wang, Simin Niu, Zhiyu Li
•
Oct 7, 2024
•
10
2
SePPO:半策略偏好优化用于扩散对齐
SePPO: Semi-Policy Preference Optimization for Diffusion Alignment
Daoan Zhang, Guangchen Lan, Dong-Jun Han, Wenlin Yao, Xiaoman Pan, Hongming Zhang, Mingxiao Li, Pengcheng Chen, Yu Dong, Christopher Brinton, Jiebo Luo
•
Oct 7, 2024
•
5
2