ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
August 6th, 2024
MiniCPM-V:您手机上的GPT-4V级MLLM
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Yuan Yao, Tianyu Yu, Ao Zhang, Chongyi Wang, Junbo Cui, Hongji Zhu, Tianchi Cai, Haoyu Li, Weilin Zhao, Zhihui He, Qianyu Chen, Huarong Zhou, Zhensheng Zou, Haoye Zhang, Shengding Hu, Zhi Zheng, Jie Zhou, Jie Cai, Xu Han, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun
•
Aug 3, 2024
•
83
6
语言模型可以在说话的同时倾听。
Language Model Can Listen While Speaking
Ziyang Ma, Yakun Song, Chenpeng Du, Jian Cong, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xie Chen
•
Aug 5, 2024
•
42
6
RAG铸造厂:增强LLM以实现检索增强生成的框架
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation
Daniel Fleischer, Moshe Berchansky, Moshe Wasserblat, Peter Izsak
•
Aug 5, 2024
•
38
2
Lumina-mGPT:利用多模态生成预训练照亮灵活的逼真文本到图像生成
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu, Shitian Zhao, Le Zhuo, Weifeng Lin, Yu Qiao, Hongsheng Li, Peng Gao
•
Aug 5, 2024
•
36
2
MeshAnything V2:使用相邻网格生成由艺术家创建的网格的标记化
MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization
Yiwen Chen, Yikai Wang, Yihao Luo, Zhengyi Wang, Zilong Chen, Jun Zhu, Chi Zhang, Guosheng Lin
•
Aug 5, 2024
•
33
2
自学评估者
Self-Taught Evaluators
Tianlu Wang, Ilia Kulikov, Olga Golovneva, Ping Yu, Weizhe Yuan, Jane Dwivedi-Yu, Richard Yuanzhe Pang, Maryam Fazel-Zarandi, Jason Weston, Xian Li
•
Aug 5, 2024
•
30
4
释放数据海啸的力量:关于语言模型指导调整的数据评估和选择的综合调查
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin, Yuncheng Yang, Pengcheng Guo, Gang Li, Hang Shao, Yuchen Shi, Zihan Xu, Yun Gu, Ke Li, Xing Sun
•
Aug 4, 2024
•
19
4
VidGen-1M:用于文本到视频生成的大规模数据集
VidGen-1M: A Large-Scale Dataset for Text-to-video Generation
Zhiyu Tan, Xiaomeng Yang, Luozheng Qin, Hao Li
•
Aug 5, 2024
•
15
4
创造,不要复制!创造性生成的推动能量扩散
ProCreate, Dont Reproduce! Propulsive Energy Diffusion for Creative Generation
Jack Lu, Ryan Teehan, Mengye Ren
•
Aug 5, 2024
•
12
2
BioMamba:一种基于预训练的生物医学语言表示模型,利用Mamba
BioMamba: A Pre-trained Biomedical Language Representation Model Leveraging Mamba
Ling Yue, Sixue Xing, Yingzhou Lu, Tianfan Fu
•
Aug 5, 2024
•
11
2
GPUDrive:基于数据驱动的,每秒100万帧的多智能体驾驶模拟。
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS
Saman Kazemkhani, Aarav Pandya, Daphne Cornelisse, Brennan Shacklett, Eugene Vinitsky
•
Aug 2, 2024
•
10
2
ExoViP:具有外骨骼模块的逐步验证和探索的组合视觉推理
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Yuxuan Wang, Alan Yuille, Zhuowan Li, Zilong Zheng
•
Aug 5, 2024
•
9
2
超参数对大型语言模型推理性能的影响:对vLLM和HuggingFace管道的评估
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines
Matias Martinez
•
Aug 2, 2024
•
9
4
在注重隐私的助手中实现情境完整性
Operationalizing Contextual Integrity in Privacy-Conscious Assistants
Sahra Ghalebikesabi, Eugene Bagdasaryan, Ren Yi, Itay Yona, Ilia Shumailov, Aneesh Pappu, Chongyang Shi, Laura Weidinger, Robert Stanforth, Leonard Berrada, Pushmeet Kohli, Po-Sen Huang, Borja Balle
•
Aug 5, 2024
•
5
2