ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
September 18th, 2024
傅里叶-科尔莫戈洛夫-阿诺德网络中的隐式神经表示
Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks
Ali Mehrabian, Parsa Mojarad Adi, Moein Heidari, Ilker Hacihaliloglu
•
Sep 14, 2024
•
5
2
全能生成:统一图像生成
OmniGen: Unified Image Generation
Shitao Xiao, Yueze Wang, Junjie Zhou, Huaying Yuan, Xingrun Xing, Ruiran Yan, Shuting Wang, Tiejun Huang, Zheng Liu
•
Sep 17, 2024
•
115
7
基于基础模型的类人情感认知
Human-like Affective Cognition in Foundation Models
Kanishk Gandhi, Zoe Lynch, Jan-Philipp Fränken, Kayla Patterson, Sharon Wambu, Tobias Gerstenberg, Desmond C. Ong, Noah D. Goodman
•
Sep 18, 2024
•
6
2
EzAudio:利用高效扩散Transformer增强文本转语音生成
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer
Jiarui Hai, Yong Xu, Hao Zhang, Chenxing Li, Helin Wang, Mounya Elhilali, Dong Yu
•
Sep 17, 2024
•
20
3
不连续地形中的敏捷连续跳跃
Agile Continuous Jumping in Discontinuous Terrains
Yuxiang Yang, Guanya Shi, Changyi Lin, Xiangyun Meng, Rosario Scalise, Mateo Guaman Castro, Wenhao Yu, Tingnan Zhang, Ding Zhao, Jie Tan, Byron Boots
•
Sep 17, 2024
•
12
2
关于基于代理的模型中代理能力的限制
On the limits of agency in agent-based models
Ayush Chopra, Shashank Kumar, Nurullah Giray-Kuru, Ramesh Raskar, Arnau Quera-Bofarull
•
Sep 14, 2024
•
14
2
PDMX:用于符号音乐处理的大规模公共领域MusicXML数据集
PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing
Phillip Long, Zachary Novack, Taylor Berg-Kirkpatrick, Julian McAuley
•
Sep 17, 2024
•
5
2
Prompt检索器:指令训练的检索器可以像语言模型一样被提示
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models
Orion Weller, Benjamin Van Durme, Dawn Lawrie, Ashwin Paranjape, Yuhao Zhang, Jack Hessel
•
Sep 17, 2024
•
24
2
SplatFields:用于稀疏3D和4D重建的神经高斯斑点
SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction
Marko Mihajlovic, Sergey Prokudin, Siyu Tang, Robert Maier, Federica Bogo, Tony Tung, Edmond Boyer
•
Sep 17, 2024
•
9
2
对量化指令调整的大型语言模型进行全面评估:高达405B的实验分析
A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B
Jemin Lee, Sihyeong Park, Jinse Kwon, Jihun Oh, Yongin Kwon
•
Sep 17, 2024
•
17
3
通过基于事实的归因和学习拒绝来衡量和增强RAG中LLMs的可信度。
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
Maojia Song, Shang Hong Sim, Rishabh Bhardwaj, Hai Leong Chieu, Navonil Majumder, Soujanya Poria
•
Sep 17, 2024
•
7
2
Phidias:一种用于从文本、图像和3D条件生成3D内容的生成模型,采用参考增强扩散。
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Zhenwei Wang, Tengfei Wang, Zexin He, Gerhard Hancke, Ziwei Liu, Rynson W. H. Lau
•
Sep 17, 2024
•
28
2
OSV:一步足以实现高质量图像到视频的生成
OSV: One Step is Enough for High-Quality Image to Video Generation
Xiaofeng Mao, Zhengkai Jiang, Fu-Yun Wang, Wenbing Zhu, Jiangning Zhang, Hao Chen, Mingmin Chi, Yabiao Wang
•
Sep 17, 2024
•
14
2
微调图像条件扩散模型比你想象的要容易
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Gonzalo Martin Garcia, Karim Abou Zeid, Christian Schmidt, Daan de Geus, Alexander Hermans, Bastian Leibe
•
Sep 17, 2024
•
31
2
单层可学习激活隐式神经表示(SL^{2}A-INR)
Single-Layer Learnable Activation for Implicit Neural Representation (SL^{2}A-INR)
Moein Heidari, Reza Rezaeian, Reza Azad, Dorit Merhof, Hamid Soltanian-Zadeh, Ilker Hacihaliloglu
•
Sep 17, 2024
•
5
2
NVLM:开放式前沿多模态LLM模型
NVLM: Open Frontier-Class Multimodal LLMs
Wenliang Dai, Nayeon Lee, Boxin Wang, Zhuoling Yang, Zihan Liu, Jon Barker, Tuomas Rintamaki, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping
•
Sep 17, 2024
•
75
2