ChatPaper.ai

AI Research Papers Daily

Daily curated AI research papers with translations

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Namgyu Ho, Sangmin Bae, Taehyeon Kim, Hyunjik Jo, Yireun Kim, Tal Schuster, Adam Fisch, James Thorne, Se-Young Yun • Jun 4, 2024 • 411

Parrot: Multilingual Visual Instruction Tuning

Hai-Long Sun, Da-Wei Zhou, Yang Li, Shiyin Lu, Chao Yi, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, De-Chuan Zhan, Han-Jia Ye • Jun 4, 2024 • 392

Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration

Junyang Wang, Haiyang Xu, Haitao Jia, Xi Zhang, Ming Yan, Weizhou Shen, Ji Zhang, Fei Huang, Jitao Sang • Jun 3, 2024 • 352

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion

Hao Wen, Zehuan Huang, Yaohui Wang, Xinyuan Chen, Yu Qiao, Lu Sheng • Jun 5, 2024 • 222

Audio Mamba: Bidirectional State Space Model for Audio Representation Learning

Mehmet Hamza Erol, Arda Senocak, Jiu Feng, Joon Son Chung • Jun 5, 2024 • 211

PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM

Tao Yang, Yingmin Luo, Zhongang Qi, Yang Wu, Ying Shan, Chang Wen Chen • Jun 5, 2024 • 182

LiveSpeech: Low-Latency Zero-shot Text-to-Speech via Autoregressive Modeling of Audio Discrete Codes

Trung Dang, David Aponte, Dung Tran, Kazuhito Koishida • Jun 5, 2024 • 162

Searching Priors Makes Text-to-Video Synthesis Better

Haoran Cheng, Liang Peng, Linxuan Xia, Yuepeng Hu, Hengjia Li, Qinglin Lu, Xiaofei He, Boxi Wu • Jun 5, 2024 • 142

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Rafael Rafailov, Yaswanth Chittepu, Ryan Park, Harshit Sikchi, Joey Hejna, Bradley Knox, Chelsea Finn, Scott Niekum • Jun 5, 2024 • 140

Item-Language Model for Conversational Recommendation

Li Yang, Anushya Subbiah, Hardik Patel, Judith Yue Li, Yanwei Song, Reza Mirghaderi, Vikram Aggarwal • Jun 5, 2024 • 121

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs

Rongzhi Zhang, Jiaming Shen, Tianqi Liu, Haorui Wang, Zhen Qin, Feng Han, Jialu Liu, Simon Baumgartner, Michael Bendersky, Chao Zhang • Jun 5, 2024 • 111

Xmodel-LM Technical Report

Yichuan Wang, Yang Liu, Yu Yan, Xucheng Huang, Ling Jiang • Jun 5, 2024 • 111