ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
July 10th, 2024
Vision language models are blind
Pooyan Rahmanzadehgervi, Logan Bolton, Mohammad Reza Taesiri, Anh Totti Nguyen
Jul 9, 2024
83
17
AgentInstruct: Toward Generative Teaching with Agentic Flows
Arindam Mitra, Luciano Del Corro, Guoqing Zheng, Shweti Mahajan, Dany Rouhana, Andres Codas, Yadong Lu, Wei-ge Chen, Olga Vrousgos, Corby Rosset, Fillipe Silva, Hamed Khanpour, Yash Lara, Ahmed Awadallah
Jul 3, 2024
51
15
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun
Jul 9, 2024
28
4
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
Orr Zohar, Xiaohan Wang, Yonatan Bitton, Idan Szpektor, Serena Yeung-Levy
Jul 8, 2024
27
3
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Bowen Zhang, Yiji Cheng, Chunyu Wang, Ting Zhang, Jiaolong Yang, Yansong Tang, Feng Zhao, Dong Chen, Baining Guo
Jul 9, 2024
24
1
Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities
Shaltiel Shmidman, Avi Shmidman, Amir DN Cohen, Moshe Koppel
Jul 9, 2024
22
1
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
Xuan Ju, Yiming Gao, Zhaoyang Zhang, Ziyang Yuan, Xintao Wang, Ailing Zeng, Yu Xiong, Qiang Xu, Ying Shan
Jul 8, 2024
19
1
BM25S: Orders of magnitude faster lexical search via eager sparse scoring
Xing Han Lù
Jul 4, 2024
13
3
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James Glass
Jul 9, 2024
12
3
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts
Ruida Wang, Jipeng Zhang, Yizhen Jia, Rui Pan, Shizhe Diao, Renjie Pi, Tong Zhang
Jul 3, 2024
12
1
Knowledge Composition using Task Vectors with Learned Anisotropic Scaling
Frederic Z. Zhang, Paul Albert, Cristian Rodriguez-Opazo, Anton van den Hengel, Ehsan Abbasnejad
Jul 3, 2024
12
3
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions
Yu-Guan Hsieh, Cheng-Yu Hsieh, Shih-Ying Yeh, Louis Béthune, Hadi Pour Ansari, Pavan Kumar Anasosalu Vasu, Chun-Liang Li, Ranjay Krishna, Oncel Tuzel, Marco Cuturi
Jul 9, 2024
11
1
VIMI: Grounding Video Generation through Multi-modal Instruction
Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chien Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov
Jul 8, 2024
10
1
From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
Maor Ivgi, Ori Yoran, Jonathan Berant, Mor Geva
Jul 8, 2024
7
3
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions
Bojana Bašaragin, Adela Ljajić, Darija Medvecki, Lorenzo Cassano, Miloš Košprdić, Nikola Milošević
Jul 6, 2024
4
1
LETS-C: Leveraging Language Embedding for Time Series Classification
Rachneet Kaur, Zhen Zeng, Tucker Balch, Manuela Veloso
Jul 9, 2024
2
5