ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
August 26th, 2024
构建和更好理解视觉-语言模型:见解与未来方向
Building and better understanding vision-language models: insights and future directions
Hugo Laurençon, Andrés Marafioti, Victor Sanh, Léo Tronchon
•
Aug 22, 2024
•
131
5
CustomCrafter:保留运动和概念组合能力的定制视频生成
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
Tao Wu, Yong Zhang, Xintao Wang, Xianpan Zhou, Guangcong Zheng, Zhongang Qi, Ying Shan, Xi Li
•
Aug 23, 2024
•
12
2
自信普通微分编辑
CODE: Confident Ordinary Differential Editing
Bastien van Delft, Tommaso Martorella, Alexandre Alahi
•
Aug 22, 2024
•
4
2
T3M:从语音引导的文本指导3D人体动作合成
T3M: Text Guided 3D Human Motion Synthesis from Speech
Wenshuo Peng, Kaipeng Zhang, Sai Qian Zhang
•
Aug 23, 2024
•
13
2
HiRED:面向资源受限环境的高分辨率视觉语言模型高效推理的注意力引导标记丢弃
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments
Kazi Hasan Ibn Arif, JinYi Yoon, Dimitrios S. Nikolopoulos, Hans Vandierendonck, Deepu John, Bo Ji
•
Aug 20, 2024
•
11
2
基于LLM自动化的联邦学习的网络解决方案
A Web-Based Solution for Federated Learning with LLM-Based Automation
Chamith Mawela, Chaouki Ben Issaid, Mehdi Bennis
•
Aug 23, 2024
•
10
1
MME-RealWorld:您的多模态LLM能否挑战对人类而言困难的高分辨率真实场景?
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
Yi-Fan Zhang, Huanyu Zhang, Haochen Tian, Chaoyou Fu, Shuangqing Zhang, Junfei Wu, Feng Li, Kun Wang, Qingsong Wen, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan
•
Aug 23, 2024
•
27
4
多层Transformer的梯度可以在几乎线性时间内进行近似计算。
Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time
Yingyu Liang, Zhizhou Sha, Zhenmei Shi, Zhao Song, Yufa Zhou
•
Aug 23, 2024
•
25
4
LayerPano3D:用于超沉浸式场景生成的分层3D全景
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang, Jing Tan, Mengchen Zhang, Tong Wu, Yixuan Li, Gordon Wetzstein, Ziwei Liu, Dahua Lin
•
Aug 23, 2024
•
27
2
FLoD:将灵活细节级别集成到3D高斯飞溅中,用于可定制渲染。
FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering
Yunji Seo, Young Sun Choi, Hyun Seung Son, Youngjung Uh
•
Aug 23, 2024
•
6
2
使用在线子空间下降进行高效内存的LLM训练
Memory-Efficient LLM Training with Online Subspace Descent
Kaizhao Liang, Bo Liu, Lizhang Chen, Qiang Liu
•
Aug 23, 2024
•
14
3
圆桌:利用动态模式和上下文自动完成提高表格问答中的查询精度
RoundTable: Leveraging Dynamic Schema and Contextual Autocomplete for Enhanced Query Precision in Tabular Question Answering
Pratyush Kumar, Kuber Vijaykumar Bellad, Bharat Vadlamudi, Aman Chadha
•
Aug 22, 2024
•
5
1