ChatPaper.ai
Daily Picks of AI Research Papers
Daily curated AI research papers with translations
February 29th, 2024
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Shuming Ma, Hongyu Wang, Lingxiao Ma, Lei Wang, Wenhui Wang, Shaohan Huang, Li Dong, Ruiping Wang, Jilong Xue, Furu Wei
Feb 27, 2024 • 618 • 143
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Linrui Tian, Qi Wang, Bang Zhang, Liefeng Bo
Feb 27, 2024 • 196 • 20
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Yixin Liu, Kai Zhang, Yuan Li, Zhiling Yan, Chujie Gao, Ruoxi Chen, Zhengqing Yuan, Yue Huang, Hanchi Sun, Jianfeng Gao, Lifang He, Lichao Sun
Feb 27, 2024 • 89 • 5
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
Raghav Kapoor, Yash Parag Butala, Melisa Russak, Jing Yu Koh, Kiran Kamble, Waseem Alshikh, Ruslan Salakhutdinov
Feb 27, 2024 • 26 • 6
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Biao Zhang, Zhongtao Liu, Colin Cherry, Orhan Firat
Feb 27, 2024 • 26 • 3
Training-Free Long-Context Scaling of Large Language Models
Chenxin An, Fei Huang, Jun Zhang, Shansan Gong, Xipeng Qiu, Chang Zhou, Lingpeng Kong
Feb 27, 2024 • 25 • 4
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model
Shyam Marjit, Harshit Singh, Nityanand Mathur, Sayak Paul, Chia-Mu Yu, Pin-Yu Chen
Feb 27, 2024 • 25 • 1
Video as the New Language for Real-World Decision Making
Sherry Yang, Jacob Walker, Jack Parker-Holder, Yilun Du, Jake Bruce, Andre Barreto, Pieter Abbeel, Dale Schuurmans
Feb 27, 2024 • 22 • 1
Evaluating Very Long-Term Conversational Memory of LLM Agents
Adyasha Maharana, Dong-Ho Lee, Sergey Tulyakov, Mohit Bansal, Francesco Barbieri, Yuwei Fang
Feb 27, 2024 • 20 • 3
Towards Optimal Learning of Language Models
Yuxian Gu, Li Dong, Yaru Hao, Qingxiu Dong, Minlie Huang, Furu Wei
Feb 27, 2024 • 18 • 1
Sora Generates Videos with Stunning Geometrical Consistency
Xuanyi Li, Daquan Zhou, Chenxu Zhang, Shaodong Wei, Qibin Hou, Ming-Ming Cheng
Feb 27, 2024 • 18 • 1
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
Yazhou Xing, Yingqing He, Zeyue Tian, Xintao Wang, Qifeng Chen
Feb 27, 2024 • 16 • 1
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation
Daiqing Li, Aleks Kamko, Ehsan Akhgari, Ali Sabet, Linmiao Xu, Suhail Doshi
Feb 27, 2024 • 12 • 1
Disentangled 3D Scene Generation with Layout Learning
Dave Epstein, Ben Poole, Ben Mildenhall, Alexei A. Efros, Aleksander Holynski
Feb 26, 2024 • 12 • 1
VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction
Jiaqi Lin, Zhihao Li, Xiao Tang, Jianzhuang Liu, Shiyong Liu, Jiayue Liu, Yangdi Lu, Xiaofei Wu, Songcen Xu, Youliang Yan, Wenming Yang
Feb 27, 2024 • 11 • 45