ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
March 7th, 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang, Fanqi Wan, Longguang Zhong, Canbin Huang, Guosheng Liang, Xiaojun Quan
Mar 6, 2025
15
3
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Sambal Shikhar, Mohammed Irfan Kurpath, Sahal Shaji Mullappilly, Jean Lahoud, Fahad Khan, Rao Muhammad Anwer, Salman Khan, Hisham Cholakkal
Mar 6, 2025
70
5
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
Sreyan Ghosh, Zhifeng Kong, Sonal Kumar, S Sakshi, Jaehyeon Kim, Wei Ping, Rafael Valle, Dinesh Manocha, Bryan Catanzaro
Mar 6, 2025
23
2
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation
Aoxiong Yin, Kai Shen, Yichong Leng, Xu Tan, Xinyu Zhou, Juncheng Li, Siliang Tang
Mar 6, 2025
9
1
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
Zhijian Zhuo, Yutao Zeng, Ya Wang, Sijun Zhang, Jian Yang, Xiaoqing Li, Xun Zhou, Jinwen Ma
Mar 6, 2025
20
8
Dedicated Feedback and Edit Models Empower Inference-Time Scaling for Open-Ended General-Domain Tasks
Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Daniel Egert, Ellie Evans, Hoo-Chang Shin, Felipe Soares, Yi Dong, Oleksii Kuchaiev
Mar 6, 2025
7
4
LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation
Jude Khouja, Karolina Korgul, Simi Hellsten, Lingyi Yang, Vlad Neacs, Harry Mayne, Ryan Kearns, Andrew Bean, Adam Mahdi
Mar 4, 2025
25
3
PokéChamp: an Expert-level Minimax Language Agent
Seth Karten, Andy Luu Nguyen, Chi Jin
Mar 6, 2025
12
2
IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval
Tingyu Song, Guo Gan, Mingsheng Shang, Yilun Zhao
Mar 6, 2025
21
2
Identifying Sensitive Weights via Post-quantization Integral
Yuezhou Hu, Weiyu Huang, Zichen Liang, Chang Chen, Jintao Zhang, Jun Zhu, Jianfei Chen
Feb 28, 2025
7
2
L^2M: Mutual Information Scaling Law for Long-Context Language Modeling
Zhuo Chen, Oriol Mayné i Comas, Zhuotao Jin, Di Luo, Marin Soljačić
Mar 6, 2025
20
2
LLM as a Broken Telephone: Iterative Generation Distorts Information
Amr Mohamed, Mingmeng Geng, Michalis Vazirgiannis, Guokan Shang
Feb 27, 2025
27
2
On the Acquisition of Shared Grammatical Representations in Bilingual Language Models
Catherine Arnett, Tyler A. Chang, James A. Michaelov, Benjamin K. Bergen
Mar 5, 2025
3
1
Token-Efficient Long Video Understanding for Multimodal LLMs
Jindong Jiang, Xiuyu Li, Zhijian Liu, Muyang Li, Guo Chen, Zhiqi Li, De-An Huang, Guilin Liu, Zhiding Yu, Kurt Keutzer, Sungjin Ahn, Jan Kautz, Hongxu Yin, Yao Lu, Song Han, Wonmin Byeon
Mar 6, 2025
94
2
How to Steer LLM Latents for Hallucination Detection?
Seongheon Park, Xuefeng Du, Min-Hsuan Yeh, Haobo Wang, Yixuan Li
Mar 1, 2025
11
2
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer
Yujiao Yang, Jing Lian, Linhui Li
Mar 4, 2025
8
4
EgoLife: Towards Egocentric Life Assistant
Jingkang Yang, Shuai Liu, Hongming Guo, Yuhao Dong, Xiamengwei Zhang, Sicheng Zhang, Pengyun Wang, Zitang Zhou, Binzhu Xie, Ziyue Wang, Bei Ouyang, Zhengyu Lin, Marco Cominelli, Zhongang Cai, Yuanhan Zhang, Peiyuan Zhang, Fangzhou Hong, Joerg Widmer, Francesco Gringoli, Lei Yang, Bo Li, Ziwei Liu
Mar 5, 2025
42
2
START: Self-taught Reasoner with Tools
Chengpeng Li, Mingfeng Xue, Zhenru Zhang, Jiaxi Yang, Beichen Zhang, Xiang Wang, Bowen Yu, Binyuan Hui, Junyang Lin, Dayiheng Liu
Mar 6, 2025
111
6
Understanding and Predicting Derailment in Toxic Conversations on GitHub
Mia Mohammad Imran, Robert Zita, Rebekah Copeland, Preetha Chatterjee, Rahat Rizvi Rahman, Kostadin Damevski
Mar 4, 2025
4
2
Combining Flow Matching and Transformers for Efficient Solution of Bayesian Inverse Problems
Daniil Sherki, Ivan Oseledets, Ekaterina Muravleva
Mar 3, 2025
5
2
Lost in Literalism: How Supervised Training Shapes Translationese in LLMs
Yafu Li, Ronghao Zhang, Zhilin Wang, Huajian Zhang, Leyang Cui, Yongjing Yin, Tong Xiao, Yue Zhang
Mar 6, 2025
5
2