ChatPaper.ai
Daily AI Research Papers
Daily curated AI research papers with translations
April 24th, 2025
PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models
Shi Qiu, Shaoyang Guo, Zhuo-Yang Song, Yunbo Sun, Zeyu Cai, Jiashen Wei, Tianyu Luo, Yixuan Yin, Haoxu Zhang, Yi Hu, Chenyang Wang, Chencheng Tang, Haoling Chang, Qi Liu, Ziheng Zhou, Tianyu Zhang, Jingtian Zhang, Zhangyi Liu, Minghao Li, Yuku Zhang, Boxuan Jing, Xianqi Yin, Yutong Ren, Zizhuo Fu, Weike Wang, Xudong Tian, Anqi Lv, Laifu Man, Jianxiang Li, Feiyu Tao, Qihua Sun, Zhou Liang, Yushu Mu, Zhongxuan Li, Jing-Jun Zhang, Shutao Zhang, Xiaotian Li, Xingqi Xia, Jiawei Lin, Zheyu Shen, Jiahang Chen, Qiuhao Xiong, Binran Wang, Fengyuan Wang, Ziyang Ni, Bohan Zhang, Fan Cui, Changkun Shao, Qing-Hong Cao, Ming-xing Luo, Muhan Zhang, Hua Xing Zhu
Apr 22, 2025 • 33 • 2
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning
Fulong Ye, Miao Hua, Pengze Zhang, Xinghui Li, Qichao Sun, Songtao Zhao, Qian He, Xinglong Wu
Apr 20, 2025 • 48 • 8
Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading
Qianjin Yu, Keyu Wu, Zihan Chen, Chushu Zhang, Manlin Mei, Lingjun Huang, Fang Tan, Yongsheng Du, Kunlin Liu, Yurui Zhu
Apr 16, 2025 • 12 • 3
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment
Kun Wang, Guibin Zhang, Zhenhong Zhou, Jiahao Wu, Miao Yu, Shiqian Zhao, Chenlong Yin, Jinhu Fu, Yibo Yan, Hanjun Luo, Liang Lin, Zhihao Xu, Haolang Lu, Xinye Cao, Xinyun Zhou, Weifei Jin, Fanci Meng, Junyuan Mao, Hao Wu, Minghe Wang, Fan Zhang, Junfeng Fang, Chengwei Liu, Yifan Zhang, Qiankun Li, Chongye Guo, Yalan Qin, Yi Ding, Donghai Hong, Jiaming Ji, Xinfeng Li, Yifan Jiang, Dongxia Wang, Yihao Huang, Yufei Guo, Jen-tse Huang, Yanwei Yue, Wenke Huang, Guancheng Wan, Tianlin Li, Lei Bai, Jie Zhang, Qing Guo, Jingyi Wang, Tianlong Chen, Joey Tianyi Zhou, Xiaojun Jia, Weisong Sun, Cong Wu, Jing Chen, Xuming Hu, Yiming Li, Xiao Wang, Ningyu Zhang, Luu Anh Tuan, Guowen Xu, Tianwei Zhang, Xingjun Ma, Xiang Wang, Bo An, Jun Sun, Mohit Bansal, Shirui Pan, Yuval Elovici, Bhavya Kailkhura, Bo Li, Yaodong Yang, Hongwei Li, Wenyuan Xu, Yizhou Sun, Wei Wang, Qing Li, Ke Tang, Yu-Gang Jiang, Felix Juefei-Xu, Hui Xiong, Xiaofeng Wang, Shuicheng Yan, Dacheng Tao, Philip S. Yu, Qingsong Wen, Yang Liu
Apr 22, 2025 • 13 • 2
Causal-Copilot: An Autonomous Causal Analysis Agent
Xinyue Wang, Kun Zhou, Wenyi Wu, Har Simrat Singh, Fang Nan, Songyao Jin, Aryan Philip, Saloni Patnaik, Hou Zhu, Shivam Singh, Parjanya Prashant, Qian Shen, Biwei Huang
Apr 17, 2025 • 5 • 2
Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA
Michał Turski, Mateusz Chiliński, Łukasz Borchmann
Apr 14, 2025 • 4 • 2
CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation
Anirudh Khatry, Robert Zhang, Jia Pan, Ziteng Wang, Qiaochu Chen, Greg Durrett, Isil Dillig
Apr 21, 2025 • 6 • 2
Decoupled Global-Local Alignment for Improving Compositional Understanding
Xiaoxing Hu, Kaicheng Yang, Jun Wang, Haoran Xu, Ziyong Feng, Yupei Wang
Apr 23, 2025 • 15 • 2
RePOPE: Impact of Annotation Errors on the POPE Benchmark
Yannic Neuhaus, Matthias Hein
Apr 22, 2025 • 8 • 2
I-Con: A Unifying Framework for Representation Learning
Shaden Alshammari, John Hershey, Axel Feldmann, William T. Freeman, Mark Hamilton
Apr 23, 2025 • 28 • 2
Tina: Tiny Reasoning Models via LoRA
Shangshang Wang, Julian Asilis, Ömer Faruk Akgül, Enes Burak Bilgin, Ollie Liu, Willie Neiswanger
Apr 22, 2025 • 50 • 4
Trillion 7B Technical Report
Sungjun Han, Juyoung Suk, Suyeong An, Hyungguk Kim, Kyuseok Kim, Wonsuk Yang, Seungtaek Choi, Jamin Shin
Apr 21, 2025 • 34 • 2
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
Jingchao Wang, Hong Wang, Wenlong Zhang, Kunhua Ji, Dingjiang Huang, Yefeng Zheng
Apr 22, 2025 • 2 • 2
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset
Ivan Moshkov, Darragh Hanley, Ivan Sorokin, Shubham Toshniwal, Christof Henkel, Benedikt Schifferer, Wei Du, Igor Gitman
Apr 23, 2025 • 18 • 2
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
Junshu Pan, Wei Shen, Shulin Huang, Qiji Zhou, Yue Zhang
Apr 22, 2025 • 18 • 2
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models
Weiye Xu, Jiahao Wang, Weiyun Wang, Zhe Chen, Wengang Zhou, Aijun Yang, Lewei Lu, Houqiang Li, Xiaohua Wang, Xizhou Zhu, Wenhai Wang, Jifeng Dai, Jinguo Zhu
Apr 21, 2025 • 71 • 2
DreamO: A Unified Framework for Image Customization
Chong Mou, Yanze Wu, Wenxu Wu, Zinan Guo, Pengze Zhang, Yufeng Cheng, Yiming Luo, Fei Ding, Shiwen Zhang, Xinghui Li, Mengtian Li, Songtao Zhao, Jian Zhang, Qian He, Xinglong Wu
Apr 23, 2025 • 19 • 2