ChatPaper.ai
Daily Picks of AI Research Papers
Daily curated AI research papers with translations
September 27th, 2024
The Imperative of Conversation Analysis in the Era of LLMs: A Survey of Tasks, Techniques, and Trends
Xinghua Zhang, Haiyang Yu, Yongbin Li, Minzheng Wang, Longze Chen, Fei Huang
Sep 21, 2024 • 13 • 2
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He, Haodong Li, Wei Yin, Yixun Liang, Leheng Li, Kaiqiang Zhou, Hongbo Liu, Bingbing Liu, Ying-Cong Chen
Sep 26, 2024 • 34 • 2
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction
Zhenmei Shi, Yifei Ming, Xuan-Phi Nguyen, Yingyu Liang, Shafiq Joty
Sep 25, 2024 • 26 • 5
Pixel-Space Post-Training of Latent Diffusion Models
Christina Zhang, Simran Motwani, Matthew Yu, Ji Hou, Felix Juefei-Xu, Sam Tsai, Peter Vajda, Zijian He, Jialiang Wang
Sep 26, 2024 • 22 • 2
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Chenming Zhu, Tai Wang, Wenwei Zhang, Jiangmiao Pang, Xihui Liu
Sep 26, 2024 • 35 • 2
Reducing the Footprint of Multi-Vector Retrieval with Minimal Performance Impact via Token Pooling
Benjamin Clavié, Antoine Chaffin, Griffin Adams
Sep 23, 2024 • 11 • 2
Instruction Following without Instruction Tuning
John Hewitt, Nelson F. Liu, Percy Liang, Christopher D. Manning
Sep 21, 2024 • 31 • 4
Disco4D: Disentangled 4D Human Generation and Animation from a Single Image
Hui En Pang, Shuai Liu, Zhongang Cai, Lei Yang, Tianwei Zhang, Ziwei Liu
Sep 25, 2024 • 11 • 2
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Gongfan Fang, Hongxu Yin, Saurav Muralidharan, Greg Heinrich, Jeff Pool, Jan Kautz, Pavlo Molchanov, Xinchao Wang
Sep 26, 2024 • 48 • 3
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Kai Chen, Yunhao Gou, Runhui Huang, Zhili Liu, Daxin Tan, Jing Xu, Chunwei Wang, Yi Zhu, Yihan Zeng, Kuo Yang, Dingdong Wang, Kun Xiang, Haoyuan Li, Haoli Bai, Jianhua Han, Xiaohui Li, Weike Jin, Nian Xie, Yu Zhang, James T. Kwok, Hengshuang Zhao, Xiaodan Liang, Dit-Yan Yeung, Xiao Chen, Zhenguo Li, Wei Zhang, Qun Liu, Lanqing Hong, Lu Hou, Hang Xu
Sep 26, 2024 • 41 • 13
Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D Reconstruction
Justin Kerr, Chung Min Kim, Mingxuan Wu, Brent Yi, Qianqian Wang, Ken Goldberg, Angjoo Kanazawa
Sep 26, 2024 • 9 • 2
Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study
Zahra Sepasdar, Sushant Gautam, Cise Midoglu, Michael A. Riegler, Pål Halvorsen
Sep 26, 2024 • 9 • 2