ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
June 27th, 2024
Adam-mini:使用更少的學習率獲得更多
Adam-mini: Use Fewer Learning Rates To Gain More
Yushun Zhang, Congliang Chen, Ziniu Li, Tian Ding, Chenwei Wu, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun
•
Jun 24, 2024
•
69
4
八爪魚規劃器:用於計劃者-動作代理的設備端語言模型
Octo-planner: On-device Language Model for Planner-Action Agents
Wei Chen, Zhiyuan Li, Zhen Guo, Yikang Shen
•
Jun 26, 2024
•
49
5
CharXiv:在多模式LLM中對真實圖表理解的差距進行圖表化。
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Zirui Wang, Mengzhou Xia, Luxi He, Howard Chen, Yitao Liu, Richard Zhu, Kaiqu Liang, Xindi Wu, Haotian Liu, Sadhika Malladi, Alexis Chevalier, Sanjeev Arora, Danqi Chen
•
Jun 26, 2024
•
30
2
ChronoMagic-Bench:一個用於評估文本轉時間膠片視頻生成的變形評估基準。
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Shenghai Yuan, Jinfa Huang, Yongqi Xu, Yaoyang Liu, Shaofeng Zhang, Yujun Shi, Ruijie Zhu, Xinhua Cheng, Jiebo Luo, Li Yuan
•
Jun 26, 2024
•
21
3
深入探討大型語言模型中的專家混合模型
A Closer Look into Mixture-of-Experts in Large Language Models
Ka Man Lo, Zeyu Huang, Zihan Qiu, Zili Wang, Jie Fu
•
Jun 26, 2024
•
16
2
WildGuard:針對安全風險、越獄和LLM拒絕的一站式開放式監管工具
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
Seungju Han, Kavel Rao, Allyson Ettinger, Liwei Jiang, Bill Yuchen Lin, Nathan Lambert, Yejin Choi, Nouha Dziri
•
Jun 26, 2024
•
13
1
EHRCon:用於檢查電子健康記錄中非結構化註釋與結構化表格之間一致性的數據集
EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records
Yeonsu Kwon, Jiho Kim, Gyubok Lee, Seongsu Bae, Daeun Kyung, Wonchul Cha, Tom Pollard, Alistair Johnson, Edward Choi
•
Jun 24, 2024
•
13
7
符號學習使自我演化的智能體成為可能。
Symbolic Learning Enables Self-Evolving Agents
Wangchunshu Zhou, Yixin Ou, Shengwei Ding, Long Li, Jialong Wu, Tiannan Wang, Jiamin Chen, Shuai Wang, Xiaohua Xu, Ningyu Zhang, Huajun Chen, Yuchen Eleanor Jiang
•
Jun 26, 2024
•
12
1
比賽時間匹配:朝向自動足球比賽評論生成
MatchTime: Towards Automatic Soccer Game Commentary Generation
Jiayuan Rao, Haoning Wu, Chang Liu, Yanfeng Wang, Weidi Xie
•
Jun 26, 2024
•
12
4
Math-LLaVA:為多模態大型語言模型啟動數學推理
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
Wenhao Shi, Zhiqiang Hu, Yi Bin, Junhua Liu, Yang Yang, See-Kiong Ng, Lidong Bing, Roy Ka-Wei Lee
•
Jun 25, 2024
•
11
1
大规模野外协作:从野外越狱到(对抗性地)更安全的语言模型
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
Liwei Jiang, Kavel Rao, Seungju Han, Allyson Ettinger, Faeze Brahman, Sachin Kumar, Niloofar Mireshghallah, Ximing Lu, Maarten Sap, Yejin Choi, Nouha Dziri
•
Jun 26, 2024
•
9
1
深度強化學習的理解與診斷
Understanding and Diagnosing Deep Reinforcement Learning
Ezgi Korkmaz
•
Jun 23, 2024
•
9
1
多模式任務向量實現多樣本多模式上下文學習。
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Brandon Huang, Chancharik Mitra, Assaf Arbelle, Leonid Karlinsky, Trevor Darrell, Roei Herzig
•
Jun 21, 2024
•
9
1
MemServe:具彈性記憶體池的分散式LLM服務的上下文快取
MemServe: Context Caching for Disaggregated LLM Serving with Elastic Memory Pool
Cunchen Hu, Heyang Huang, Junhao Hu, Jiang Xu, Xusheng Chen, Tao Xie, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan
•
Jun 25, 2024
•
4
1