ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
August 26th, 2024
建構和更深入了解視覺語言模型:洞見與未來方向
Building and better understanding vision-language models: insights and future directions
Hugo Laurençon, Andrés Marafioti, Victor Sanh, Léo Tronchon
•
Aug 22, 2024
•
131
5
CustomCrafter:保留動作和概念組合能力的定制視頻生成
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
Tao Wu, Yong Zhang, Xintao Wang, Xianpan Zhou, Guangcong Zheng, Zhongang Qi, Ying Shan, Xi Li
•
Aug 23, 2024
•
12
2
代碼:自信普通微分編輯
CODE: Confident Ordinary Differential Editing
Bastien van Delft, Tommaso Martorella, Alexandre Alahi
•
Aug 22, 2024
•
4
2
T3M:從語音引導的文本指導下的3D人體動作合成
T3M: Text Guided 3D Human Motion Synthesis from Speech
Wenshuo Peng, Kaipeng Zhang, Sai Qian Zhang
•
Aug 23, 2024
•
13
2
HiRED:針對資源受限環境中高解析度視覺語言模型的高效推論,提出了基於注意力引導的標記丟棄方法。
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments
Kazi Hasan Ibn Arif, JinYi Yoon, Dimitrios S. Nikolopoulos, Hans Vandierendonck, Deepu John, Bo Ji
•
Aug 20, 2024
•
11
2
基於LLM自動化的聯邦學習網絡解決方案
A Web-Based Solution for Federated Learning with LLM-Based Automation
Chamith Mawela, Chaouki Ben Issaid, Mehdi Bennis
•
Aug 23, 2024
•
10
1
MME-RealWorld:您的多模態LLM是否能挑戰對人類而言困難的高解析度真實世界場景?
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
Yi-Fan Zhang, Huanyu Zhang, Haochen Tian, Chaoyou Fu, Shuangqing Zhang, Junfei Wu, Feng Li, Kun Wang, Qingsong Wen, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan
•
Aug 23, 2024
•
27
4
多層Transformer的梯度可以在幾乎線性的時間內進行近似。
Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time
Yingyu Liang, Zhizhou Sha, Zhenmei Shi, Zhao Song, Yufa Zhou
•
Aug 23, 2024
•
25
4
LayerPano3D:用於超沉浸式場景生成的分層3D全景
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang, Jing Tan, Mengchen Zhang, Tong Wu, Yixuan Li, Gordon Wetzstein, Ziwei Liu, Dahua Lin
•
Aug 23, 2024
•
27
2
FLoD:將靈活的細節層級整合到 3D 高斯飛濺中,以供自定義渲染。
FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering
Yunji Seo, Young Sun Choi, Hyun Seung Son, Youngjung Uh
•
Aug 23, 2024
•
6
2
具有線上子空間下降的記憶效率LLM訓練
Memory-Efficient LLM Training with Online Subspace Descent
Kaizhao Liang, Bo Liu, Lizhang Chen, Qiang Liu
•
Aug 23, 2024
•
14
3
圓桌:利用動態架構和情境自動完成提升表格問答中的查詢精確度
RoundTable: Leveraging Dynamic Schema and Contextual Autocomplete for Enhanced Query Precision in Tabular Question Answering
Pratyush Kumar, Kuber Vijaykumar Bellad, Bharat Vadlamudi, Aman Chadha
•
Aug 22, 2024
•
5
1