ChatPaper.ai
Daily AI Research Paper Picks
Daily curated AI research papers with translations
February 14th, 2025
Typhoon T1: An Open Thai Reasoning Model
Pittawat Taveekitworachai, Potsawee Manakul, Kasima Tharnpipitchai, Kunat Pipatanakul
Feb 13, 2025
16
2
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles
Xintao Wang, Heng Wang, Yifei Zhang, Xinfeng Yuan, Rui Xu, Jen-tse Huang, Siyu Yuan, Haoran Guo, Jiangjie Chen, Wei Wang, Yanghua Xiao, Shuchang Zhou
Feb 13, 2025
29
2
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights
Jonathan Kahana, Or Nathan, Eliahu Horwitz, Yedid Hoshen
Feb 13, 2025
35
2
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models
Yung-Sung Chuang, Benjamin Cohen-Wang, Shannon Zejiang Shen, Zhaofeng Wu, Hu Xu, Xi Victoria Lin, James Glass, Shang-Wen Li, Wen-tau Yih
Feb 13, 2025
36
2
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data
Haonan Chen, Liang Wang, Nan Yang, Yutao Zhu, Ziliang Zhao, Furu Wei, Zhicheng Dou
Feb 12, 2025
13
2
Latent Radiance Fields with 3D-aware 2D Representations
Chaoyi Zhou, Xi Liu, Feng Luo, Siyu Huang
Feb 13, 2025
6
2
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging
Kunat Pipatanakul, Pittawat Taveekitworachai, Potsawee Manakul, Kasima Tharnpipitchai
Feb 13, 2025
32
4
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU
Heejun Lee, Geon Park, Jaduk Suh, Sung Ju Hwang
Feb 13, 2025
149
6
DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References
Xueyi Liu, Jianibieke Adalibieke, Qianwei Han, Yuzhe Qin, Li Yi
Feb 13, 2025
12
2
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
Yangguang Li, Zi-Xin Zou, Zexiang Liu, Dehu Wang, Yuan Liang, Zhipeng Yu, Xingchao Liu, Yuan-Chen Guo, Ding Liang, Wanli Ouyang, Yan-Pei Cao
Feb 10, 2025
41
4
Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation
Hoigi Seo, Wongi Jeong, Jae-sun Seo, Se Young Chun
Feb 12, 2025
44
2
Exploring the Potential of Encoder-free Architectures in 3D LMMs
Yiwen Tang, Zoey Guo, Zhuhao Wang, Ray Zhang, Qizhi Chen, Junli Liu, Delin Qu, Zhigang Wang, Dong Wang, Xuelong Li, Bin Zhao
Feb 13, 2025
26
2
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Rui Yang, Hanyang Chen, Junyu Zhang, Mark Zhao, Cheng Qian, Kangrui Wang, Qineng Wang, Teja Venkat Koripella, Marziyeh Movahedi, Manling Li, Heng Ji, Huan Zhang, Tong Zhang
Feb 13, 2025
36
2
Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges
Safal Shrestha, Minwu Kim, Keith Ross
Feb 12, 2025
11
2
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
Dongzhi Jiang, Renrui Zhang, Ziyu Guo, Yanwei Li, Yu Qi, Xinyan Chen, Liuhui Wang, Jianhan Jin, Claire Guo, Shen Yan, Bo Zhang, Chaoyou Fu, Peng Gao, Hongsheng Li
Feb 13, 2025
28
2
3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection
Enquan Yang, Peng Xing, Hanyang Sun, Wenbo Guo, Yuanwei Ma, Zechao Li, Dan Zeng
Feb 9, 2025
6
2
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
Xinyin Ma, Guangnian Wan, Runpeng Yu, Gongfan Fang, Xinchao Wang
Feb 13, 2025
14
2
Logical Reasoning in Large Language Models: A Survey
Hanmeng Liu, Zhizhang Fu, Mengru Ding, Ruoxi Ning, Chaoli Zhang, Xiaozhang Liu, Yue Zhang
Feb 13, 2025
23
5
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Mo Yu, Lemao Liu, Junjie Wu, Tsz Ting Chung, Shunchi Zhang, Jiangnan Li, Dit-Yan Yeung, Jie Zhou
Feb 13, 2025
194
3
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer
Xinyu Liu, Ailing Zeng, Wei Xue, Harry Yang, Wenhan Luo, Qifeng Liu, Yike Guo
Feb 9, 2025
8
2
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models
Daniel Fleischer, Moshe Berchansky, Gad Markovits, Moshe Wasserblat
Feb 13, 2025
16
2