ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
February 24th, 2025
LLM顯微鏡:揭示標點符號在Transformer上下文記憶中的隱藏作用
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers
Anton Razzhigaev, Matvey Mikhalchuk, Temurbek Rahmatullaev, Elizaveta Goncharova, Polina Druzhinina, Ivan Oseledets, Andrey Kuznetsov
•
Feb 20, 2025
•
175
3
SurveyX:基於大型語言模型的學術問卷自動化系統
SurveyX: Academic Survey Automation via Large Language Models
Xun Liang, Jiawei Yang, Yezhaohui Wang, Chen Tang, Zifan Zheng, Simin Niu, Shichao Song, Hanyu Wang, Bo Tang, Feiyu Xiong, Keming Mao, Zhiyu li
•
Feb 20, 2025
•
100
5
Mol-LLaMA:邁向大規模分子語言模型中的分子通用理解
Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model
Dongki Kim, Wonbin Lee, Sung Ju Hwang
•
Feb 19, 2025
•
46
2
PhotoDoodle:從少量成對數據中學習藝術圖像編輯
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data
Shijie Huang, Yiren Song, Yuxuan Zhang, Hailong Guo, Xueyin Wang, Mike Zheng Shou, Jiaming Liu
•
Feb 20, 2025
•
42
6
MaskGWM:一種基於視頻遮罩重建的通用駕駛世界模型
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Jingcheng Ni, Yuxin Guo, Yichen Liu, Rui Chen, Lewei Lu, Zehuan Wu
•
Feb 17, 2025
•
40
2
SIFT:透過貼紙將大型語言模型的推理能力根植於情境中
SIFT: Grounding LLM Reasoning in Contexts via Stickers
Zihao Zeng, Xuyao Huang, Boxiu Li, Zhijie Deng
•
Feb 19, 2025
•
31
3
VLM^2-Bench:深入探討視覺語言模型如何隱含地連結顯式匹配的視覺線索
VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
Jianshu Zhang, Dongyu Yao, Renjie Pi, Paul Pu Liang, Yi R., Fung
•
Feb 17, 2025
•
30
2
LightThinker:逐步思考的壓縮技術
LightThinker: Thinking Step-by-Step Compression
Jintian Zhang, Yuqi Zhu, Mengshu Sun, Yujie Luo, Shuofei Qiao, Lun Du, Da Zheng, Huajun Chen, Ningyu Zhang
•
Feb 21, 2025
•
29
7
MoBA:長上下文大語言模型的區塊注意力混合機制
MoBA: Mixture of Block Attention for Long-Context LLMs
Enzhe Lu, Zhejun Jiang, Jingyuan Liu, Yulun Du, Tao Jiang, Chao Hong, Shaowei Liu, Weiran He, Enming Yuan, Yuzhi Wang, Zhiqi Huang, Huan Yuan, Suting Xu, Xinran Xu, Guokun Lai, Yanru Chen, Huabin Zheng, Junjie Yan, Jianlin Su, Yuxin Wu, Neo Y. Zhang, Zhilin Yang, Xinyu Zhou, Mingxing Zhang, Jiezhong Qiu
•
Feb 18, 2025
•
17
2
安全標準對所有人一視同仁嗎?大型語言模型的用戶特定安全評估
Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models
Yeonjun In, Wonjoong Kim, Kanghoon Yoon, Sungchul Kim, Mehrab Tanjim, Kibum Kim, Chanyoung Park
•
Feb 20, 2025
•
16
2
StructFlowBench:多輪指令跟蹤的結構化流程基準測試
StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following
Jinnan Li, Jinzhe Li, Yue Wang, Yi Chang, Yuan Wu
•
Feb 20, 2025
•
15
2
邁向全自動化材料發現:基於大規模合成數據集與專家級LLM評判機制
Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge
Heegyu Kim, Taeyang Jeon, Seungtaek Choi, Jihoon Hong, Dongwon Jeon, Sungbum Cho, Ga-Yeon Baek, Kyung-Won Kwak, Dong-Hee Lee, Sun-Jin Choi, Jisu Bae, Chihoon Lee, Yunseo Kim, Jinsung Park, Hyunsouk Cho
•
Feb 23, 2025
•
11
2
以韓國教育標準評估多模態生成式人工智慧
Evaluating Multimodal Generative AI with Korean Educational Standards
Sanghee Park, Geewook Kim
•
Feb 21, 2025
•
10
3
大型語言模型中推理與表現的關係——o3(迷你版)更深入地思考,而非更長時間
The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer
Marthe Ballon, Andres Algaba, Vincent Ginis
•
Feb 21, 2025
•
9
2
MedHallu:大型語言模型醫學幻覺檢測之全面基準測試
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models
Shrey Pandit, Jiawei Xu, Junyuan Hong, Zhangyang Wang, Tianlong Chen, Kaidi Xu, Ying Ding
•
Feb 20, 2025
•
9
2
FantasyID:基於面部知識增強的ID保持視頻生成
FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation
Yunpeng Zhang, Qiang Wang, Fan Jiang, Yaqi Fan, Mu Xu, Yonggang Qi
•
Feb 19, 2025
•
9
2
深入JSON思維:強化策略以嚴格遵循LLM架構
Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence
Bhavik Agarwal, Ishan Joshi, Viktoria Rojkova
•
Feb 18, 2025
•
9
2
KITAB-Bench:一個全面的多領域基準測試,專為阿拉伯語OCR與文件理解而設計
KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
Ahmed Heakl, Abdullah Sohail, Mukul Ranjan, Rania Hossam, Ghazi Ahmed, Mohamed El-Geish, Omar Maher, Zhiqiang Shen, Fahad Khan, Salman Khan
•
Feb 20, 2025
•
8
2
ReQFlow:用於高效高質量蛋白質骨架生成的校正四元數流
ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation
Angxiao Yue, Zichong Wang, Hongteng Xu
•
Feb 20, 2025
•
8
3
一步扩散模型与f-散度分布匹配
One-step Diffusion Models with f-Divergence Distribution Matching
Yilun Xu, Weili Nie, Arash Vahdat
•
Feb 21, 2025
•
7
2
InterFeedback:透過人類回饋揭示大型多模態模型的互動智能
InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback
Henry Hengyuan Zhao, Wenqi Pei, Yifei Tao, Haiyang Mei, Mike Zheng Shou
•
Feb 20, 2025
•
7
2
樹狀辯論法:多人格辯論樹激發批判性思維,助力科學比較分析
Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis
Priyanka Kargupta, Ishika Agarwal, Tal August, Jiawei Han
•
Feb 20, 2025
•
6
2
EgoSpeak:為真實場景中的自我中心對話代理學習何時發言
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
Junhyeok Kim, Min Soo Kim, Jiwan Chung, Jungbin Cho, Jisoo Kim, Sungwoong Kim, Gyeongbo Sim, Youngjae Yu
•
Feb 17, 2025
•
6
2
超級智能體帶來災難性風險:科學家AI能否提供更安全的路徑?
Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?
Yoshua Bengio, Michael Cohen, Damiano Fornasiere, Joumana Ghosn, Pietro Greiner, Matt MacDermott, Sören Mindermann, Adam Oberman, Jesse Richardson, Oliver Richardson, Marc-Antoine Rondeau, Pierre-Luc St-Charles, David Williams-King
•
Feb 21, 2025
•
5
2
mStyleDistance:多語言風格嵌入及其評估
mStyleDistance: Multilingual Style Embeddings and their Evaluation
Justin Qiu, Jiacheng Zhu, Ajay Patel, Marianna Apidianaki, Chris Callison-Burch
•
Feb 21, 2025
•
3
2
CrossOver:三維場景跨模態對齊
CrossOver: 3D Scene Cross-Modal Alignment
Sayan Deb Sarkar, Ondrej Miksik, Marc Pollefeys, Daniel Barath, Iro Armeni
•
Feb 20, 2025
•
3
3
PLDR-LLMs 學會了一種可泛化的張量運算元,能在推理階段替代其自身的深度神經網絡。
PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference
Burc Gokden
•
Feb 19, 2025
•
3
2
WHAC:基於世界場景的人類與相機研究
WHAC: World-grounded Humans and Cameras
Wanqi Yin, Zhongang Cai, Ruisi Wang, Fanzhou Wang, Chen Wei, Haiyi Mei, Weiye Xiao, Zhitao Yang, Qingping Sun, Atsushi Yamashita, Ziwei Liu, Lei Yang
•
Mar 19, 2024
•
3
2
罕見疾病大規模鑑別診斷與大型語言模型應用: 從腹部放線菌病到威爾森氏症
Rare Disease Differential Diagnosis with Large Language Models at Scale: From Abdominal Actinomycosis to Wilson's Disease
Elliot Schumacher, Dhruv Naik, Anitha Kannan
•
Feb 20, 2025
•
2
2
政治科學領域的大型語言模型基準測試:聯合國視角
Benchmarking LLMs for Political Science: A United Nations Perspective
Yueqing Liang, Liangwei Yang, Chen Wang, Congying Xia, Rui Meng, Xiongxiao Xu, Haoran Wang, Ali Payani, Kai Shu
•
Feb 19, 2025
•
2
2
學習發現用於基因表達預測的調控元件
Learning to Discover Regulatory Elements for Gene Expression Prediction
Xingyu Su, Haiyang Yu, Degui Zhi, Shuiwang Ji
•
Feb 19, 2025
•
2
2
UPCORE:面向平衡反學習的效用保持核心集選擇
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
Vaidehi Patil, Elias Stengel-Eskin, Mohit Bansal
•
Feb 20, 2025
•
1
2
JL1-CD:遙感變化檢測的新基準與穩健的多教師知識蒸餾框架
JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework
Ziyuan Liu, Ruifei Zhu, Long Gao, Yuanxiu Zhou, Jingyu Ma, Yuantao Gu
•
Feb 19, 2025
•
1
2
超越「拒絕」:量化AI的過度拒絕與情感依附邊界
Beyond No: Quantifying AI Over-Refusal and Emotional Attachment Boundaries
David Noever, Grant Rosario
•
Feb 20, 2025
•
0
3