ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
June 25th, 2024
康布里亞-1:一個完全開放、以視覺為中心的多模態LLM探索
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
Shengbang Tong, Ellis Brown, Penghao Wu, Sanghyun Woo, Manoj Middepogu, Sai Charitha Akula, Jihan Yang, Shusheng Yang, Adithya Iyer, Xichen Pan, Austin Wang, Rob Fergus, Yann LeCun, Saining Xie
•
Jun 24, 2024
•
61
4
DreamBench++:一個針對個性化圖像生成的與人類對齊的基準。
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng, Yuxin Cui, Haomiao Tang, Zekun Qi, Runpei Dong, Jing Bai, Chunrui Han, Zheng Ge, Xiangyu Zhang, Shu-Tao Xia
•
Jun 24, 2024
•
57
4
BigCodeBench:使用多樣功能呼叫和複雜指令進行代碼生成基準測試
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Terry Yue Zhuo, Minh Chien Vu, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, Simon Brunner, Chen Gong, Thong Hoang, Armel Randy Zebaze, Xiaoheng Hong, Wen-Ding Li, Jean Kaddour, Ming Xu, Zhihan Zhang, Prateek Yadav, Naman Jain, Alex Gu, Zhoujun Cheng, Jiawei Liu, Qian Liu, Zijian Wang, David Lo, Binyuan Hui, Niklas Muennighoff, Daniel Fried, Xiaoning Du, Harm de Vries, Leandro Von Werra
•
Jun 22, 2024
•
47
8
評估部分標註在資訊檢索上的 D-MERIT
Evaluating D-MERIT of Partial-annotation on Information Retrieval
Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg
•
Jun 23, 2024
•
36
2
從語言到視覺的長距離上下文轉移
Long Context Transfer from Language to Vision
Peiyuan Zhang, Kaichen Zhang, Bo Li, Guangtao Zeng, Jingkang Yang, Yuanhan Zhang, Ziyue Wang, Haoran Tan, Chunyuan Li, Ziwei Liu
•
Jun 24, 2024
•
34
2
Video-Infinity:分散式長視頻生成
Video-Infinity: Distributed Long Video Generation
Zhenxiong Tan, Xingyi Yang, Songhua Liu, Xinchao Wang
•
Jun 24, 2024
•
30
2
VideoHallucer:評估大型視頻語言模型中的內在和外在幻覺
VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models
Yuxuan Wang, Yueqian Wang, Dongyan Zhao, Cihang Xie, Zilong Zheng
•
Jun 24, 2024
•
27
2
WARP:權重平均獎勵策略的優勢
WARP: On the Benefits of Weight Averaged Rewarded Policies
Alexandre Ramé, Johan Ferret, Nino Vieillard, Robert Dadashi, Léonard Hussenot, Pierre-Louis Cedoz, Pier Giuseppe Sessa, Sertan Girgin, Arthur Douillard, Olivier Bachem
•
Jun 24, 2024
•
23
1
線性複雜度語言模型的擴展定律
Scaling Laws for Linear Complexity Language Models
Xuyang Shen, Dong Li, Ruitao Leng, Zhen Qin, Weigao Sun, Yiran Zhong
•
Jun 24, 2024
•
23
4
朝向快速多語言LLM推論:推測解碼與專用起草者
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters
Euiin Yi, Taehyeon Kim, Hongseok Jeung, Du-Seong Chang, Se-Young Yun
•
Jun 24, 2024
•
20
3
通過緩解穩定性差距實現高效的持續預訓練
Efficient Continual Pre-training by Mitigating the Stability Gap
Yiduo Guo, Jie Fu, Huishuai Zhang, Dongyan Zhao, Yikang Shen
•
Jun 21, 2024
•
20
1
稀疏即快速,少即是多:長距離Transformer的高效稀疏注意力
Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers
Chao Lou, Zixia Jia, Zilong Zheng, Kewei Tu
•
Jun 24, 2024
•
19
1
語義熵探針:在LLM中實現堅固且經濟的幻覺檢測
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs
Jannik Kossen, Jiatong Han, Muhammed Razzak, Lisa Schut, Shreshth Malik, Yarin Gal
•
Jun 22, 2024
•
14
1
超越回合制遊戲:利用雙工模型實現即時對話
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
Xinrong Zhang, Yingfa Chen, Shengding Hu, Xu Han, Zihang Xu, Yuanwei Xu, Weilin Zhao, Maosong Sun, Zhiyuan Liu
•
Jun 22, 2024
•
14
2
毒性緩解的偏好調整在不同語言間具有普遍性
Preference Tuning For Toxicity Mitigation Generalizes Across Languages
Xiaochen Li, Zheng-Xin Yong, Stephen H. Bach
•
Jun 23, 2024
•
11
1
自動偵測:朝向大型語言模型自動弱點偵測的統一框架
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
Jiale Cheng, Yida Lu, Xiaotao Gu, Pei Ke, Xiao Liu, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang
•
Jun 24, 2024
•
10
2
語言模型中的信心調節神經元
Confidence Regulation Neurons in Language Models
Alessandro Stolfo, Ben Wu, Wes Gurnee, Yonatan Belinkov, Xingyi Song, Mrinmaya Sachan, Neel Nanda
•
Jun 24, 2024
•
10
1
需要多少參數才能換一顆燈泡?評估對話遊戲自我對弈的表現,並根據模型特性進行分析。
How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics
Nidhir Bhavsar, Jonathan Jordan, Sherzod Hakimov, David Schlangen
•
Jun 20, 2024
•
9
1
ClotheDreamer:使用3D高斯函數生成器的文本導向服裝生成
ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians
Yufei Liu, Junshu Tang, Chu Zheng, Shijie Zhang, Jinkun Hao, Junwei Zhu, Dongjin Huang
•
Jun 24, 2024
•
7
1
位於中間:校準位置注意偏差以提升長距離上下文利用
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization
Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister
•
Jun 23, 2024
•
6
1
IRASim:學習互動式真實機器人動作模擬器
IRASim: Learning Interactive Real-Robot Action Simulators
Fangqi Zhu, Hongtao Wu, Song Guo, Yuxiao Liu, Chilam Cheang, Tao Kong
•
Jun 20, 2024
•
6
1
視頻-SALMONN:語音增強的視聽大型語言模型
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang
•
Jun 22, 2024
•
5
1
少樣本學習在長文本中可行嗎?重複利用上下文生成示範
Can Few-shot Work in Long-Context? Recycling the Context to Generate Demonstrations
Arie Cattan, Alon Jacovi, Alex Fabrikant, Jonathan Herzig, Roee Aharoni, Hannah Rashkin, Dror Marcus, Avinatan Hassidim, Yossi Matias, Idan Szpektor, Avi Caciularu
•
Jun 19, 2024
•
5
1
排斥分數蒸餾用於擴散模型的多樣抽樣
Repulsive Score Distillation for Diverse Sampling of Diffusion Models
Nicolas Zilberstein, Morteza Mardani, Santiago Segarra
•
Jun 24, 2024
•
4
2
奧林匹克競技場獎牌排名:迄今為止最聰明的人工智慧是誰?
OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?
Zhen Huang, Zengzhi Wang, Shijie Xia, Pengfei Liu
•
Jun 24, 2024
•
2
2