ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
July 10th, 2024
视觉语言模型是盲目的。
Vision language models are blind
Pooyan Rahmanzadehgervi, Logan Bolton, Mohammad Reza Taesiri, Anh Totti Nguyen
•
Jul 9, 2024
•
83
17
AgentInstruct:朝向具有主体流的生成式教学
AgentInstruct: Toward Generative Teaching with Agentic Flows
Arindam Mitra, Luciano Del Corro, Guoqing Zheng, Shweti Mahajan, Dany Rouhana, Andres Codas, Yadong Lu, Wei-ge Chen, Olga Vrousgos, Corby Rosset, Fillipe Silva, Hamed Khanpour, Yash Lara, Ahmed Awadallah
•
Jul 3, 2024
•
51
15
智能体的互联网:编织异构智能体的协作智能网络
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun
•
Jul 9, 2024
•
28
4
Video-STaR:自训练使视频指导调整具有任何监督
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
Orr Zohar, Xiaohan Wang, Yonatan Bitton, Idan Szpektor, Serena Yeung-Levy
•
Jul 8, 2024
•
27
3
RodinHD:使用扩散模型实现高保真度的3D头像生成
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Bowen Zhang, Yiji Cheng, Chunyu Wang, Ting Zhang, Jiaolong Yang, Yansong Tang, Feng Zhao, Dong Chen, Baining Guo
•
Jul 9, 2024
•
24
1
将LLM调整到希伯来语:揭示具有增强词汇量和指导能力的DictaLM 2.0
Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities
Shaltiel Shmidman, Avi Shmidman, Amir DN Cohen, Moshe Koppel
•
Jul 9, 2024
•
22
1
MiraData:一个具有长时长和结构化字幕的大规模视频数据集
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
Xuan Ju, Yiming Gao, Zhaoyang Zhang, Ziyang Yuan, Xintao Wang, Ailing Zeng, Yu Xiong, Qiang Xu, Ying Shan
•
Jul 8, 2024
•
19
1
BM25S:通过急切稀疏评分实现数量级更快的词汇搜索
BM25S: Orders of magnitude faster lexical search via eager sparse scoring
Xing Han Lù
•
Jul 4, 2024
•
13
3
回顾镜头:仅利用注意力图检测和减轻大型语言模型中的上下文幻觉
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James Glass
•
Jul 9, 2024
•
12
3
定理Llama:将通用LLM转化为Lean4专家
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts
Ruida Wang, Jipeng Zhang, Yizhen Jia, Rui Pan, Shizhe Diao, Renjie Pi, Tong Zhang
•
Jul 3, 2024
•
12
1
使用学习的各向异性缩放的任务向量进行知识组合
Knowledge Composition using Task Vectors with Learned Anisotropic Scaling
Frederic Z. Zhang, Paul Albert, Cristian Rodriguez-Opazo, Anton van den Hengel, Ehsan Abbasnejad
•
Jul 3, 2024
•
12
3
基于图的标题生成:通过相互连接区域描述来增强视觉描述
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions
Yu-Guan Hsieh, Cheng-Yu Hsieh, Shih-Ying Yeh, Louis Béthune, Hadi Pour Ansari, Pavan Kumar Anasosalu Vasu, Chun-Liang Li, Ranjay Krishna, Oncel Tuzel, Marco Cuturi
•
Jul 9, 2024
•
11
1
VIMI:通过多模态指导实现视频生成
VIMI: Grounding Video Generation through Multi-modal Instruction
Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chien Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov
•
Jul 8, 2024
•
10
1
从循环到失误:语言模型在不确定性下的回退行为
From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
Maor Ivgi, Ori Yoran, Jonathan Berant, Mor Geva
•
Jul 8, 2024
•
7
3
你是怎么知道的?教导生成式语言模型引用生物医学问题的答案
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions
Bojana Bašaragin, Adela Ljajić, Darija Medvecki, Lorenzo Cassano, Miloš Košprdić, Nikola Milošević
•
Jul 6, 2024
•
4
1
基于语言嵌入的时间序列分类方法 LETS-C
LETS-C: Leveraging Language Embedding for Time Series Classification
Rachneet Kaur, Zhen Zeng, Tucker Balch, Manuela Veloso
•
Jul 9, 2024
•
2
5