ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
February 6th, 2025
HackerRank-ASTRA:評估大型語言模型在跨領域多文件項目問題上的正確性與一致性
HackerRank-ASTRA: Evaluating Correctness & Consistency of Large Language Models on cross-domain multi-file project problems
Jun Xing, Mayur Bhatia, Sahil Phulwani, Darshan Suresh, Rafik Matta
•
Jan 31, 2025
•
0
2
LayerTracer:透過擴散Transformer實現認知對齊的分層SVG合成
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer
Yiren Song, Danze Chen, Mike Zheng Shou
•
Feb 3, 2025
•
20
4
謎語:潛在的成員推斷與檢索增強生成
Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation
Ali Naseh, Yuefeng Peng, Anshuman Suri, Harsh Chaudhari, Alina Oprea, Amir Houmansadr
•
Feb 1, 2025
•
5
2
TwinMarket:用於金融市場的可擴展行為和社會模擬
TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets
Yuzhe Yang, Yifei Zhang, Minghao Wu, Kaidi Zhang, Yunmiao Zhang, Honghai Yu, Yan Hu, Benyou Wang
•
Feb 3, 2025
•
38
3
利用MCTS-自動化結構思維提升多模態推理
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking
Jinyang Wu, Mingkuan Feng, Shuai Zhang, Ruihan Jin, Feihu Che, Zengqi Wen, Jianhua Tao
•
Feb 4, 2025
•
22
4
大型語言模型引導的自我除錯程式碼生成
Large Language Model Guided Self-Debugging Code Generation
Muntasir Adnan, Zhiwei Xu, Carlos C. N. Kuhn
•
Feb 5, 2025
•
13
2
揭開LLM中的長篇推理之謎
Demystifying Long Chain-of-Thought Reasoning in LLMs
Edward Yeo, Yuxuan Tong, Morry Niu, Graham Neubig, Xiang Yue
•
Feb 5, 2025
•
59
3
關於語言模型蒸餾中的教師模型攻擊
On Teacher Hacking in Language Model Distillation
Daniil Tiapkin, Daniele Calandriello, Johan Ferret, Sarah Perrin, Nino Vieillard, Alexandre Ramé, Mathieu Blondel
•
Feb 4, 2025
•
18
2
基於機率推論的方法,利用基於粒子的蒙特卡羅方法對LLM進行推論時的縮放
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods
Isha Puri, Shivchander Sudalairaj, Guangxuan Xu, Kai Xu, Akash Srivastava
•
Feb 3, 2025
•
10
3
SmolLM2:當小型模型變得強大——小型語言模型的資料中心訓練
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Loubna Ben Allal, Anton Lozhkov, Elie Bakouch, Gabriel Martín Blázquez, Guilherme Penedo, Lewis Tunstall, Andrés Marafioti, Hynek Kydlíček, Agustín Piqueres Lajarín, Vaibhav Srivastav, Joshua Lochner, Caleb Fahlgren, Xuan-Son Nguyen, Clémentine Fourrier, Ben Burtenshaw, Hugo Larcher, Haojun Zhao, Cyril Zakka, Mathieu Morlon, Colin Raffel, Leandro von Werra, Thomas Wolf
•
Feb 4, 2025
•
228
6
LIMO:推理的精簡原則
LIMO: Less is More for Reasoning
Yixin Ye, Zhen Huang, Yang Xiao, Ethan Chern, Shijie Xia, Pengfei Liu
•
Feb 5, 2025
•
61
4
標記混合:混合潛在標記和文本標記以提升語言模型推理
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
DiJia Su, Hanlin Zhu, Yingchen Xu, Jiantao Jiao, Yuandong Tian, Qinqing Zheng
•
Feb 5, 2025
•
17
2
基於激活的大型語言模型合併
Activation-Informed Merging of Large Language Models
Amin Heyrani Nobari, Kaveh Alimohammadi, Ali ArjomandBigdeli, Akash Srivastava, Faez Ahmed, Navid Azizan
•
Feb 4, 2025
•
6
2
使用通用多提示進行越獄
Jailbreaking with Universal Multi-Prompts
Yu-Ling Hsu, Hsuan Su, Shang-Tse Chen
•
Feb 3, 2025
•
9
2