ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
August 6th, 2024
MiniCPM-V:在您的手機上運行的 GPT-4V 級別 MLLM
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Yuan Yao, Tianyu Yu, Ao Zhang, Chongyi Wang, Junbo Cui, Hongji Zhu, Tianchi Cai, Haoyu Li, Weilin Zhao, Zhihui He, Qianyu Chen, Huarong Zhou, Zhensheng Zou, Haoye Zhang, Shengding Hu, Zhi Zheng, Jie Zhou, Jie Cai, Xu Han, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun
•
Aug 3, 2024
•
83
6
語言模型可以在說話的同時聆聽。
Language Model Can Listen While Speaking
Ziyang Ma, Yakun Song, Chenpeng Du, Jian Cong, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xie Chen
•
Aug 5, 2024
•
42
6
RAG 鑄造廠:增強檢索增強生成的框架
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation
Daniel Fleischer, Moshe Berchansky, Moshe Wasserblat, Peter Izsak
•
Aug 5, 2024
•
38
2
Lumina-mGPT:具備多模態生成預訓練的靈活照片逼真文本到圖像生成
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu, Shitian Zhao, Le Zhuo, Weifeng Lin, Yu Qiao, Hongsheng Li, Peng Gao
•
Aug 5, 2024
•
36
2
MeshAnything V2:藝術家創建的具有相鄰網格標記化的網格生成
MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization
Yiwen Chen, Yikai Wang, Yihao Luo, Zhengyi Wang, Zilong Chen, Jun Zhu, Chi Zhang, Guosheng Lin
•
Aug 5, 2024
•
33
2
自我學習評估者
Self-Taught Evaluators
Tianlu Wang, Ilia Kulikov, Olga Golovneva, Ping Yu, Weizhe Yuan, Jane Dwivedi-Yu, Richard Yuanzhe Pang, Maryam Fazel-Zarandi, Jason Weston, Xian Li
•
Aug 5, 2024
•
30
4
釋放數據海嘯的力量:關於語言模型指導調整的數據評估和選擇的全面調查
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin, Yuncheng Yang, Pengcheng Guo, Gang Li, Hang Shao, Yuchen Shi, Zihan Xu, Yun Gu, Ke Li, Xing Sun
•
Aug 4, 2024
•
19
4
VidGen-1M:一個用於文本轉視頻生成的大規模數據集
VidGen-1M: A Large-Scale Dataset for Text-to-video Generation
Zhiyu Tan, Xiaomeng Yang, Luozheng Qin, Hao Li
•
Aug 5, 2024
•
15
4
創造,而非複製!用於創意生成的推進能量擴散
ProCreate, Dont Reproduce! Propulsive Energy Diffusion for Creative Generation
Jack Lu, Ryan Teehan, Mengye Ren
•
Aug 5, 2024
•
12
2
BioMamba:一個利用 Mamba 的預訓練生物醫學語言表示模型
BioMamba: A Pre-trained Biomedical Language Representation Model Leveraging Mamba
Ling Yue, Sixue Xing, Yingzhou Lu, Tianfan Fu
•
Aug 5, 2024
•
11
2
GPUDrive:以每秒100萬幀的速度進行數據驅動的多智能駕駛模擬。
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS
Saman Kazemkhani, Aarav Pandya, Daphne Cornelisse, Brennan Shacklett, Eugene Vinitsky
•
Aug 2, 2024
•
10
2
ExoViP:具有外骨骼模組的逐步驗證和探索,用於組合式視覺推理
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Yuxuan Wang, Alan Yuille, Zhuowan Li, Zilong Zheng
•
Aug 5, 2024
•
9
2
超參數對大型語言模型推論性能的影響:對vLLM和HuggingFace管線的評估
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines
Matias Martinez
•
Aug 2, 2024
•
9
4
在注重隱私的助理中實現情境完整性
Operationalizing Contextual Integrity in Privacy-Conscious Assistants
Sahra Ghalebikesabi, Eugene Bagdasaryan, Ren Yi, Itay Yona, Ilia Shumailov, Aneesh Pappu, Chongyang Shi, Laura Weidinger, Robert Stanforth, Leonard Berrada, Pushmeet Kohli, Po-Sen Huang, Borja Balle
•
Aug 5, 2024
•
5
2