ChatPaper.aiChatPaper.ai
홈

arXiv

HuggingFace

요금제계정작업공간

•
•

•
•

•
•

•
•

•
•

Footer

Company name

ChatPaper.ai: Your advanced AI reading assistant.

Contact us: [email protected]

X (Twitter)

Products

  • AI Search
  • AI Mind Map
  • Arxiv Summary
  • Huggingface Summary

Support

  • FAQ
  • Contact

Company

  • Blog
  • Privacy Policy
  • Terms of Service

Available Languages

  • 🇬🇧English
  • 🇨🇳中文简体
  • 🇭🇰繁體中文
  • 🇯🇵日本語
  • 🇰🇷한국어
  • 🇩🇪Deutsch
  • 🇫🇷Français
  • 🇷🇺Русский
  • 🇪🇸Español

© 2025 chatpaper.ai All rights reserved.

AI 연구 논문 데일리

번역이 포함된 일일 선별된 AI 연구 논문

BASE TTS: 100,000시간의 데이터로 10억 파라미터 텍스트-음성 변환 모델 구축에서 얻은 교훈
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Mateusz Łajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszyńska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman•Feb 12, 2024•629

링어텐션을 활용한 백만 길이 비디오와 언어에 대한 세계 모델
World Model on Million-Length Video And Language With RingAttention

Hao Liu, Wilson Yan, Matei Zaharia, Pieter Abbeel•Feb 13, 2024•405

전문가 혼합 모델이 딥 강화 학습을 위한 파라미터 스케일링의 문을 열다
Mixtures of Experts Unlock Parameter Scaling for Deep RL

Johan Obando-Ceron, Ghada Sokar, Timon Willi, Clare Lyle, Jesse Farebrother, Jakob Foerster, Gintare Karolina Dziugaite, Doina Precup, Pablo Samuel Castro•Feb 13, 2024•372

Lumos: 장면 텍스트 인식을 통해 멀티모달 LLM의 역량 강화
Lumos : Empowering Multimodal LLMs with Scene Text Recognition

Ashish Shenoy, Yichao Lu, Srihari Jayakumar, Debojeet Chatterjee, Mohsen Moslehpour, Pierce Chuang, Abhay Harpale, Vikas Bhardwaj, Di Xu, Shicong Zhao, Longfang Zhao, Ankit Ramchandani, Xin Luna Dong, Anuj Kumar•Feb 12, 2024•282

그래프 맘바: 상태 공간 모델을 활용한 그래프 학습을 향하여
Graph Mamba: Towards Learning on Graphs with State Space Models

Ali Behrouz, Farnoosh Hashemi•Feb 13, 2024•171

UFO: Windows OS 상호작용을 위한 UI 중심 에이전트
UFO: A UI-Focused Agent for Windows OS Interaction

Chaoyun Zhang, Liqun Li, Shilin He, Xu Zhang, Bo Qiao, Si Qin, Minghua Ma, Yu Kang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang•Feb 8, 2024•163

IM-3D: 고품질 3D 생성을 위한 반복적 다중 뷰 확산 및 재구성
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation

Luke Melas-Kyriazi, Iro Laina, Christian Rupprecht, Natalia Neverova, Andrea Vedaldi, Oran Gafni, Filippos Kokkinos•Feb 13, 2024•141

ChatCell: 자연어를 활용한 단일 세포 분석 지원
ChatCell: Facilitating Single-Cell Analysis with Natural Language

Yin Fang, Kangwei Liu, Ningyu Zhang, Xinle Deng, Penghui Yang, Zhuo Chen, Xiangru Tang, Mark Gerstein, Xiaohui Fan, Huajun Chen•Feb 13, 2024•144

텍스트-이미지 생성을 위한 연속적 3D 단어 학습
Learning Continuous 3D Words for Text-to-Image Generation

Ta-Ying Cheng, Matheus Gadelha, Thibault Groueix, Matthew Fisher, Radomir Mech, Andrew Markham, Niki Trigoni•Feb 13, 2024•124

추론 효율적인 대형 언어 모델을 위한 탠덤 트랜스포머
Tandem Transformers for Inference Efficient LLMs

Aishwarya P S, Pranav Ajit Nair, Yashas Samaga, Toby Boyd, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli•Feb 13, 2024•101

단일 시연을 통한 비전 기반 손동작 커스터마이제이션
Vision-Based Hand Gesture Customization from a Single Demonstration

Soroush Shahi, Cori Tymoszek Park, Richard Kang, Asaf Liberman, Oron Levy, Jun Gong, Abdelkareem Bedri, Gierad Laput•Feb 13, 2024•92

NeRF 유사성: NeRF를 위한 예제 기반 시각적 속성 전이
NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs

Michael Fischer, Zhengqin Li, Thu Nguyen-Phuoc, Aljaz Bozic, Zhao Dong, Carl Marshall, Tobias Ritschel•Feb 13, 2024•61