ChatPaper.ai
메뉴 열기
홈
오늘의 논문
arXiv
HuggingFace
요금제
계정
작업공간
🇰🇷
한국어
Loading...
•
•
•
•
•
•
•
•
•
•
AI 연구 논문 데일리
번역이 포함된 일일 선별된 AI 연구 논문
April 22nd, 2025
EasyEdit2: 대규모 언어 모델 편집을 위한 사용자 친화적 조정 프레임워크
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models
Ziwen Xu, Shuxun Wang, Kewei Xu, Haoming Xu, Mengru Wang, Xinle Deng, Yunzhi Yao, Guozhou Zheng, Huajun Chen, Ningyu Zhang
•
Apr 21, 2025
•
21
2
LeetCodeDataset: 코드 LLM의 강건한 평가와 효율적 학습을 위한 시계열 데이터셋
LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs
Yunhui Xia, Wei Shen, Yan Wang, Jason Klein Liu, Huifeng Sun, Siyue Wu, Jian Hu, Xiaolong Xu
•
Apr 20, 2025
•
19
2
다른 관점에서 보기: MLLM의 다중 뷰 이해 능력 평가
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
Chun-Hsiao Yeh, Chenyu Wang, Shengbang Tong, Ta-Ying Cheng, Rouyu Wang, Tianzhe Chu, Yuexiang Zhai, Yubei Chen, Shenghua Gao, Yi Ma
•
Apr 21, 2025
•
22
2
InfiGUI-R1: 반응형 행위자에서 숙고형 추론자로 발전하는 멀티모달 GUI 에이전트
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
Yuhang Liu, Pengxiang Li, Congkai Xie, Xavier Hu, Xiaotian Han, Shengyu Zhang, Hongxia Yang, Fei Wu
•
Apr 19, 2025
•
13
2
LoftUp: 비전 파운데이션 모델을 위한 좌표 기반 특징 업샘플러 학습
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
Haiwen Huang, Anpei Chen, Volodymyr Havrylov, Andreas Geiger, Dan Zhang
•
Apr 18, 2025
•
4
2
RF-DETR 객체 탐지 대 YOLOv12: 복잡한 과수원 환경에서 라벨 모호성 하의 단일 클래스 및 다중 클래스 그린프룻 탐지를 위한 트랜스포머 기반과 CNN 기반 아키텍처 비교 연구
RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity
Ranjan Sapkota, Rahul Harsha Cheppally, Ajay Sharda, Manoj Karkee
•
Apr 17, 2025
•
4
2
FlowReasoner: 쿼리 수준 메타 에이전트 강화
FlowReasoner: Reinforcing Query-Level Meta-Agents
Hongcheng Gao, Yue Liu, Yufei He, Longxu Dou, Chao Du, Zhijie Deng, Bryan Hooi, Min Lin, Tianyu Pang
•
Apr 21, 2025
•
46
2
SilVar-Med: 의료 영상 내 이상 징후 탐지를 위한 설명 가능한 음성 기반 시각 언어 모델
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
Tan-Hanh Pham, Chris Ngo, Trong-Duong Bui, Minh Luu Quang, Tan-Huong Pham, Truong-Son Hy
•
Apr 14, 2025
•
2
2
NEMOTRON-CROSSTHINK: 수학적 추론을 넘어 자기 학습 확장하기
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning
Syeda Nahida Akter, Shrimai Prabhumoye, Matvei Novikov, Seungju Han, Ying Lin, Evelina Bakhturi, Eric Nyberg, Yejin Choi, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro
•
Apr 15, 2025
•
6
4
Eagle 2.5: 프론티어 비전-언어 모델을 위한 장문맥 사후 학습 강화
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
Guo Chen, Zhiqi Li, Shihao Wang, Jindong Jiang, Yicheng Liu, Lidong Lu, De-An Huang, Wonmin Byeon, Matthieu Le, Tuomas Rintamaki, Tyler Poon, Max Ehrlich, Tuomas Rintamaki, Tyler Poon, Tong Lu, Limin Wang, Bryan Catanzaro, Jan Kautz, Andrew Tao, Zhiding Yu, Guilin Liu
•
Apr 21, 2025
•
65
5
ToolRL: 도구 학습에 필요한 것은 보상뿐
ToolRL: Reward is All Tool Learning Needs
Cheng Qian, Emre Can Acikgoz, Qi He, Hongru Wang, Xiusi Chen, Dilek Hakkani-Tür, Gokhan Tur, Heng Ji
•
Apr 16, 2025
•
41
2
DRAGON: 분포적 보상을 통한 확산 생성 모델 최적화
DRAGON: Distributional Rewards Optimize Diffusion Generative Models
Yatong Bai, Jonah Casebeer, Somayeh Sojoudi, Nicholas J. Bryan
•
Apr 21, 2025
•
10
2
THOUGHTTERMINATOR: 추론 모델에서의 과도한 사고 벤치마킹, 보정 및 완화
THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models
Xiao Pu, Michael Saxon, Wenyue Hua, William Yang Wang
•
Apr 17, 2025
•
24
2
주사위를 굴리고 뛰기 전에 살펴보라: 다음 토큰 예측의 창의적 한계를 넘어서기
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction
Vaishnavh Nagarajan, Chen Henry Wu, Charles Ding, Aditi Raghunathan
•
Apr 21, 2025
•
2
2
오프-폴리시 가이던스 하에서 추론 학습하기
Learning to Reason under Off-Policy Guidance
Jianhao Yan, Yafu Li, Zican Hu, Zhi Wang, Ganqu Cui, Xiaoye Qu, Yu Cheng, Yue Zhang
•
Apr 21, 2025
•
77
4
RainbowPlus: 진화적 품질-다양성 탐색을 통한 적대적 프롬프트 생성 강화
RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search
Quy-Anh Dang, Chris Ngo, Truong-Son Hy
•
Apr 21, 2025
•
6
8
비디오 큐브의 강화된 압축을 통한 효율적인 비디오 이해를 위한 LMM
An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes
Ji Qi, Yuan Yao, Yushi Bai, Bin Xu, Juanzi Li, Zhiyuan Liu, Tat-Seng Chua
•
Apr 21, 2025
•
10
3
LookingGlass: 라플라시안 피라미드 워핑을 통한 생성적 아나모포시스
LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping
Pascal Chang, Sergio Sancho, Jingwei Tang, Markus Gross, Vinicius C. Azevedo
•
Apr 11, 2025
•
8
6
X-Teaming: 적응형 멀티 에이전트를 활용한 다중 턴 Jailbreak 공격 및 방어
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents
Salman Rahman, Liwei Jiang, James Shiffer, Genglin Liu, Sheriff Issaka, Md Rizwan Parvez, Hamid Palangi, Kai-Wei Chang, Yejin Choi, Saadia Gabriel
•
Apr 15, 2025
•
30
2
OTC: 강화 학습을 통한 최적의 도구 호출
OTC: Optimal Tool Calls via Reinforcement Learning
Hongru Wang, Cheng Qian, Wanjun Zhong, Xiusi Chen, Jiahao Qiu, Shijue Huang, Bowen Jin, Mengdi Wang, Kam-Fai Wong, Heng Ji
•
Apr 21, 2025
•
33
2
SphereDiff: 구형 잠재 표현을 통한 조정 불필요 전방위 파노라마 이미지 및 비디오 생성
SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation
Minho Park, Taewoong Kang, Jooyeol Yun, Sungwon Hwang, Jaegul Choo
•
Apr 19, 2025
•
28
2
CoMotion: 동시 다인 3D 모션
CoMotion: Concurrent Multi-person 3D Motion
Alejandro Newell, Peiyun Hu, Lahav Lipson, Stephan R. Richter, Vladlen Koltun
•
Apr 16, 2025
•
3
2
UFO2: 데스크톱 에이전트OS
UFO2: The Desktop AgentOS
Chaoyun Zhang, He Huang, Chiming Ni, Jian Mu, Si Qin, Shilin He, Lu Wang, Fangkai Yang, Pu Zhao, Chao Du, Liqun Li, Yu Kang, Zhao Jiang, Suzhen Zheng, Rujia Wang, Jiaxu Qian, Minghua Ma, Jian-Guang Lou, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang
•
Apr 20, 2025
•
27
3
Uni3C: 비디오 생성을 위한 정밀 3D 강화 카메라와 인간 동작 제어의 통합
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
Chenjie Cao, Jingkai Zhou, Shikai Li, Jingyun Liang, Chaohui Yu, Fan Wang, Xiangyang Xue, Yanwei Fu
•
Apr 21, 2025
•
18
2
TAPIP3D: 지속적인 3D 기하 구조에서 임의의 점 추적
TAPIP3D: Tracking Any Point in Persistent 3D Geometry
Bowei Zhang, Lei Ke, Adam W. Harley, Katerina Fragkiadaki
•
Apr 20, 2025
•
7
2
LearnAct: 통합 데모 벤치마크를 갖춘 Few-Shot 모바일 GUI 에이전트
LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark
Guangyi Liu, Pengxiang Zhao, Liang Liu, Zhiming Chen, Yuxiang Chai, Shuai Ren, Hao Wang, Shibo He, Wenchao Meng
•
Apr 18, 2025
•
11
2
StyleMe3D: 다중 인코더를 활용한 3D 가우시안의 분리된 사전 지식을 통한 스타일화
StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians
Cailin Zhuang, Yaoqi Hu, Xuanyang Zhang, Wei Cheng, Jiacheng Bao, Shengqi Liu, Yiying Yang, Xianfang Zeng, Gang Yu, Ming Li
•
Apr 21, 2025
•
23
2
PROMPTEVALS: 맞춤형 생산용 대규모 언어 모델 파이프라인을 위한 주장(Assertions)과 가드레일(Guardrails) 데이터셋
PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines
Reya Vir, Shreya Shankar, Harrison Chase, Will Fu-Hinthorn, Aditya Parameswaran
•
Apr 20, 2025
•
4
2