ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
February 26th, 2025
SWE-RL:透過開放軟體演化上的強化學習推進大型語言模型推理能力
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Yuxiang Wei, Olivier Duchenne, Jade Copet, Quentin Carbonneaux, Lingming Zhang, Daniel Fried, Gabriel Synnaeve, Rishabh Singh, Sida I. Wang
•
Feb 25, 2025
•
74
5
OmniAlign-V:邁向多模態大語言模型與人類偏好更佳對齊之路
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
Xiangyu Zhao, Shengyuan Ding, Zicheng Zhang, Haian Huang, Maosong Cao, Weiyun Wang, Jiaqi Wang, Xinyu Fang, Wenhai Wang, Guangtao Zhai, Haodong Duan, Hua Yang, Kai Chen
•
Feb 25, 2025
•
73
2
稀疏注意力机制:精準加速任何模型推理的解決方案
SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference
Jintao Zhang, Chendong Xiang, Haofeng Huang, Jia Wei, Haocheng Xi, Jun Zhu, Jianfei Chen
•
Feb 25, 2025
•
57
2
ART:匿名區域變換器用於可變多層透明圖像生成
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
Yifan Pu, Yiming Zhao, Zhicong Tang, Ruihong Yin, Haoxing Ye, Yuhui Yuan, Dong Chen, Jianmin Bao, Sirui Zhang, Yanbin Wang, Lin Liang, Lijuan Wang, Ji Li, Xiu Li, Zhouhui Lian, Gao Huang, Baining Guo
•
Feb 25, 2025
•
36
4
KV-Edit:無需訓練的圖像編輯技術,實現精確背景保留
KV-Edit: Training-Free Image Editing for Precise Background Preservation
Tianrui Zhu, Shiyi Zhang, Jiawei Shao, Yansong Tang
•
Feb 24, 2025
•
36
3
揭示大型語言模型下游性能的擴展規律:基於聚類的視角
Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective
Chengyin Xu, Kaiyuan Chen, Xiao Li, Ke Shen, Chenggang Li
•
Feb 24, 2025
•
20
2
Curie:朝向具嚴謹性和自動化的人工智慧代理科學實驗前進
Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents
Patrick Tser Jern Kon, Jiachen Liu, Qiuyi Ding, Yiming Qiu, Zhenning Yang, Yibo Huang, Jayanth Srinivasa, Myungjin Lee, Mosharaf Chowdhury, Ang Chen
•
Feb 22, 2025
•
19
5
K-LoRA:實現無需訓練即可融合任意主題與風格LoRA的關鍵技術
K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs
Ziheng Ouyang, Zhen Li, Qibin Hou
•
Feb 25, 2025
•
15
2
將視覺感知標記引入多模態大型語言模型
Introducing Visual Perception Token into Multimodal Large Language Model
Runpeng Yu, Xinyin Ma, Xinchao Wang
•
Feb 24, 2025
•
15
2
規模分佈解耦:實現大型語言模型的穩定高效訓練
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models
Ya Wang, Zhijian Zhuo, Yutao Zeng, Xun Zhou, Jian Yang, Xiaoqing Li
•
Feb 21, 2025
•
13
2
WebGames:挑戰通用型網頁瀏覽AI代理
WebGames: Challenging General-Purpose Web-Browsing AI Agents
George Thomas, Alex J. Chan, Jikun Kang, Wenqi Wu, Filippos Christianos, Fraser Greenlee, Andy Toulis, Marvin Purtorab
•
Feb 25, 2025
•
12
2
彩票大語言模型假說:重新思考大語言模型壓縮應保留哪些能力?
The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?
Zhenheng Tang, Xiang Liu, Qian Wang, Peijie Dong, Bingsheng He, Xiaowen Chu, Bo Li
•
Feb 24, 2025
•
8
2
MLLMs知其所見:無需訓練的多模態LLMs對細微視覺細節的感知能力
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs
Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski
•
Feb 24, 2025
•
7
2
提示到排行榜
Prompt-to-Leaderboard
Evan Frick, Connor Chen, Joseph Tennyson, Tianle Li, Wei-Lin Chiang, Anastasios N. Angelopoulos, Ion Stoica
•
Feb 20, 2025
•
7
3
尋找最佳平衡點:用於擴展偏好優化的偏好數據建構
Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization
Yao Xiao, Hai Ye, Linyao Chen, Hwee Tou Ng, Lidong Bing, Xiaoli Li, Roy Ka-wei Lee
•
Feb 24, 2025
•
6
2
LDGen:透過大型語言模型驅動的語言表徵提升文本至圖像合成
LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation
Pengzhi Li, Pengfei Yu, Zide Liu, Wei He, Xuhao Pan, Xudong Rao, Tao Wei, Wei Chen
•
Feb 25, 2025
•
5
2
AAD-LLM:基於神經注意力機制的聽覺場景理解
AAD-LLM: Neural Attention-Driven Auditory Scene Understanding
Xilin Jiang, Sukru Samet Dindar, Vishal Choudhari, Stephan Bickel, Ashesh Mehta, Guy M McKhann, Adeen Flinker, Daniel Friedman, Nima Mesgarani
•
Feb 24, 2025
•
5
3
統計學家視角下的大型語言模型綜述
An Overview of Large Language Models for Statisticians
Wenlong Ji, Weizhe Yuan, Emily Getzen, Kyunghyun Cho, Michael I. Jordan, Song Mei, Jason E Weston, Weijie J. Su, Jing Xu, Linjun Zhang
•
Feb 25, 2025
•
4
2
LaTIM:測量Mamba模型中的潛在令牌間交互作用
LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models
Hugo Pitorro, Marcos Treviso
•
Feb 21, 2025
•
4
2
Shakti-VLMs:面向企業AI的可擴展視覺語言模型
Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI
Syed Abdul Gaffar Shakhadri, Kruthika KR, Kartik Basavaraj Angadi
•
Feb 24, 2025
•
3
2
by Leveraging Word-in-Context Knowledge WiCkeD:一種利用上下文詞彙知識提升多選題基準測試難度的簡易方法
WiCkeD: A Simple Method to Make Multiple Choice Benchmarks More Challenging
Ahmed Elhady, Eneko Agirre, Mikel Artetxe
•
Feb 25, 2025
•
2
2
透過詞彙課程擴展大規模語言模型預訓練
Scaling LLM Pre-training with Vocabulary Curriculum
Fangyuan Yu
•
Feb 25, 2025
•
1
2