ChatPaper.ai
Daily Picks of AI Research Papers
Daily curated AI research papers with translations
February 13th, 2025
DPO-Shift: Shifting the Distribution of Direct Preference Optimization
Xiliang Yang, Feng Jiang, Qianen Zhang, Lei Zhao, Xiao Li
Feb 11, 2025 • 15 • 2
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
Jean Vassoyan, Nathanaël Beau, Roman Plaud
Feb 10, 2025 • 18 • 2
Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey
Bo Ni, Zheyuan Liu, Leyao Wang, Yongjia Lei, Yuying Zhao, Xueqi Cheng, Qingkai Zeng, Luna Dong, Yinglong Xia, Krishnaram Kenthapadi, Ryan Rossi, Franck Dernoncourt, Md Mehrab Tanjim, Nesreen Ahmed, Xiaorui Liu, Wenqi Fan, Erik Blasch, Yu Wang, Meng Jiang, Tyler Derr
Feb 8, 2025 • 8 • 2
WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation
Henry Hengyuan Zhao, Difei Gao, Mike Zheng Shou
Feb 12, 2025 • 27 • 4
TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation
Alex Jinpeng Wang, Dongxing Mao, Jiawei Zhang, Weiming Han, Zhuobai Dong, Linjie Li, Yiqi Lin, Zhengyuan Yang, Libo Qin, Fuwei Zhang, Lijuan Wang, Min Li
Feb 11, 2025 • 45 • 2
LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid
Weigao Sun, Disen Lan, Yiran Zhong, Xiaoye Qu, Yu Cheng
Feb 11, 2025 • 24 • 2
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
Yujie Zhou, Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Qidong Huang, Jinsong Li, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Anyi Rao, Jiaqi Wang, Li Niu
Feb 12, 2025 • 44 • 2
TransMLA: Multi-head Latent Attention Is All You Need
Fanxu Meng, Zengwei Yao, Muhan Zhang
Feb 11, 2025 • 49 • 9
PDE-Controller: LLMs for Autoformalization and Reasoning of PDEs
Mauricio Soroco, Jialin Song, Mengzhou Xia, Kye Emond, Weiran Sun, Wuyang Chen
Feb 3, 2025 • 16 • 2
MetaSC: Test-Time Safety Specification Optimization for Language Models
Víctor Gallego
Feb 11, 2025 • 3 • 2
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance
Lingfei Qian, Weipeng Zhou, Yan Wang, Xueqing Peng, Jimin Huang, Qianqian Xie
Feb 12, 2025 • 56 • 5
SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation
Zhiming Ma, Xiayang Xiao, Sihao Dong, Peidong Wang, HaiPeng Wang, Qingyun Pan
Feb 12, 2025 • 12 • 4
LLM Modules: Knowledge Transfer from a Large to a Small Model using Enhanced Cross-Attention
Konstantin Kolomeitsev
Feb 12, 2025 • 4 • 2
Distillation Scaling Laws
Dan Busbridge, Amitis Shidani, Floris Weers, Jason Ramapuram, Etai Littwin, Russ Webb
Feb 12, 2025 • 48 • 4
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Li Hu, Guangyuan Wang, Zhen Shen, Xin Gao, Dechao Meng, Lian Zhuo, Peng Zhang, Bang Zhang, Liefeng Bo
Feb 10, 2025 • 16 • 4
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models
Xu Huang, Wenhao Zhu, Hanxu Hu, Conghui He, Lei Li, Shujian Huang, Fei Yuan
Feb 11, 2025 • 54 • 2
Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing
Kunfeng Lai, Zhenheng Tang, Xinglin Pan, Peijie Dong, Xiang Liu, Haolan Chen, Li Shen, Bo Li, Xiaowen Chu
Feb 6, 2025 • 4 • 2
Next Block Prediction: Video Generation via Semi-Autoregressive Modeling
Shuhuai Ren, Shuming Ma, Xu Sun, Furu Wei
Feb 11, 2025 • 9 • 2
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation
Qinghe Wang, Yawen Luo, Xiaoyu Shi, Xu Jia, Huchuan Lu, Tianfan Xue, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai
Feb 12, 2025 • 43 • 2
NoLiMa: Long-Context Evaluation Beyond Literal Matching
Ali Modarressi, Hanieh Deilamsalehy, Franck Dernoncourt, Trung Bui, Ryan A. Rossi, Seunghyun Yoon, Hinrich Schütze
Feb 7, 2025 • 15 • 2
Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning
Yuting He, Boyu Wang, Rongjun Ge, Yang Chen, Guanyu Yang, Shuo Li
Feb 7, 2025 • 0 • 2
LLM Pretraining with Continuous Concepts
Jihoon Tack, Jack Lanchantin, Jane Yu, Andrew Cohen, Ilia Kulikov, Janice Lan, Shibo Hao, Yuandong Tian, Jason Weston, Xian Li
Feb 12, 2025 • 28 • 4