ChatPaper.ai
Daily Selection of AI Research Papers
Daily curated AI research papers with translations
February 11th, 2025
MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents
Jiabin Tang, Tianyu Fan, Chao Huang
•
Feb 9, 2025
•
16
2
History-Guided Video Diffusion
Kiwhan Song, Boyuan Chen, Max Simchowitz, Yilun Du, Russ Tedrake, Vincent Sitzmann
•
Feb 10, 2025
•
12
2
Steel-LLM: From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM
Qingshui Gu, Shu Li, Tianyu Zheng, Zhaoxiang Zhang
•
Feb 10, 2025
•
4
2
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding
Xinyu Yang, Tianqi Chen, Beidi Chen
•
Feb 8, 2025
•
6
4
The Curse of Depth in Large Language Models
Wenfang Sun, Xinyuan Song, Pengxiang Li, Lu Yin, Yefeng Zheng, Shiwei Liu
•
Feb 9, 2025
•
39
5
Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding
Sukmin Cho, Sangjin Choi, Taeho Hwang, Jeongyeon Seo, Soyeong Jeong, Huije Lee, Hoyun Song, Jong C. Park, Youngjin Kwon
•
Feb 8, 2025
•
18
3
CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers
D. She, Mushui Liu, Jingxuan Pang, Jin Wang, Zhen Yang, Wanggui He, Guanghao Zhang, Yi Wang, Qihan Huang, Haobin Tang, Yunlong Yu, Siming Fu
•
Feb 10, 2025
•
11
2
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling
Runze Liu, Junqi Gao, Jian Zhao, Kaiyan Zhang, Xiu Li, Biqing Qi, Wanli Ouyang, Bowen Zhou
•
Feb 10, 2025
•
151
6
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE
Haiduo Huang, Fuwei Yang, Zhenhua Liu, Yixing Xu, Jinze Li, Yang Liu, Xuanwu Yin, Dong Li, Pengju Ren, Emad Barsoum
•
Feb 10, 2025
•
5
2
Towards Internet-Scale Training For Agents
Brandon Trabucco, Gunnar Sigurdsson, Robinson Piramuthu, Ruslan Salakhutdinov
•
Feb 10, 2025
•
8
2
Dual Caption Preference Optimization for Diffusion Models
Amir Saeidi, Yiran Luo, Agneet Chatterjee, Shamanthak Hegde, Bimsara Pathiraja, Yezhou Yang, Chitta Baral
•
Feb 9, 2025
•
9
2
LM2: Large Memory Models
Jikun Kang, Wenqi Wu, Filippos Christianos, Alex J. Chan, Fraser Greenlee, George Thomas, Marvin Purtorab, Andy Toulis
•
Feb 9, 2025
•
30
7
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
Zhenglin Zhou, Xiaobo Xia, Fan Ma, Hehe Fan, Yi Yang, Tat-Seng Chua
•
Feb 5, 2025
•
7
2
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
Chengqi Lyu, Songyang Gao, Yuzhe Gu, Wenwei Zhang, Jianfei Gao, Kuikun Liu, Ziyi Wang, Shuaibin Li, Qian Zhao, Haian Huang, Weihan Cao, Jiangning Liu, Hongwei Liu, Junnan Liu, Songyang Zhang, Dahua Lin, Kai Chen
•
Feb 10, 2025
•
61
6
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Bidipta Sarkar, Warren Xia, C. Karen Liu, Dorsa Sadigh
•
Feb 9, 2025
•
38
3
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Zhuowei Li, Haizhou Shi, Yunhe Gao, Di Liu, Zhenting Wang, Yuxiao Chen, Ting Liu, Long Zhao, Hao Wang, Dimitris N. Metaxas
•
Feb 5, 2025
•
12
3
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Hangliang Ding, Dacheng Li, Runlong Su, Peiyuan Zhang, Zhijie Deng, Ion Stoica, Hao Zhang
•
Feb 10, 2025
•
10
2
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates
Ling Yang, Zhaochen Yu, Bin Cui, Mengdi Wang
•
Feb 10, 2025
•
21
3
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators
Daniil Moskovskiy, Nikita Sushko, Sergey Pletenev, Elena Tutubalina, Alexander Panchenko
•
Feb 10, 2025
•
90
2
Matryoshka Quantization
Pranav Nair, Puranjay Datta, Jeff Dean, Prateek Jain, Aditya Kusupati
•
Feb 10, 2025
•
30
4
Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation
Chenkai Xu, Xu Wang, Zhenyi Liao, Yishun Li, Tianqi Hou, Zhijie Deng
•
Feb 8, 2025
•
22
2
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging
Md. Ashraful Islam, Mohammed Eunus Ali, Md Rizwan Parvez
•
Feb 8, 2025
•
23
3
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models
Haiwen Diao, Xiaotong Li, Yufeng Cui, Yueze Wang, Haoge Deng, Ting Pan, Wenxuan Wang, Huchuan Lu, Xinlong Wang
•
Feb 10, 2025
•
12
2
Embodied Red Teaming for Auditing Robotic Foundation Models
Sathwik Karnik, Zhang-Wei Hong, Nishant Abhangi, Yen-Chen Lin, Tsun-Hsuan Wang, Christophe Dupuy, Rahul Gupta, Pulkit Agrawal
•
Nov 27, 2024
•
2
2
Forbidden Science: Dual-Use AI Challenge Benchmark and Scientific Refusal Tests
David Noever, Forrest McKee
•
Feb 8, 2025
•
1
2
Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT
Dongyang Liu, Shicheng Li, Yutong Liu, Zhen Li, Kai Wang, Xinyue Li, Qi Qin, Yufei Liu, Yi Xin, Zhongyu Li, Bin Fu, Chenyang Si, Yuewen Cao, Conghui He, Ziwei Liu, Yu Qiao, Qibin Hou, Hongsheng Li, Peng Gao
•
Feb 10, 2025
•
14
2