ChatPaper.ai
AI Research Papers Daily
Daily curated AI research papers with translations
April 9th, 2024
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Keen You, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeffrey Nichols, Yinfei Yang, Zhe Gan
•
Apr 8, 2024
•
83
3
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Shenghai Yuan, Jinfa Huang, Yujun Shi, Yongqi Xu, Ruijie Zhu, Bin Lin, Xinhua Cheng, Li Yuan, Jiebo Luo
•
Apr 7, 2024
•
35
2
SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
Jing Gu, Yilin Wang, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Xin Eric Wang
•
Apr 8, 2024
•
27
0
ByteEdit: Boost, Comply and Accelerate Generative Image Editing
Yuxi Ren, Jie Wu, Yanzuo Lu, Huafeng Kuang, Xin Xia, Xionghui Wang, Qianqian Wang, Yixing Zhu, Pan Xie, Shiyin Wang, Xuefeng Xiao, Yitong Wang, Min Zheng, Lean Fu
•
Apr 7, 2024
•
27
1
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Jiacheng Zhang, Jie Wu, Yuxi Ren, Xin Xia, Huafeng Kuang, Pan Xie, Jiashi Li, Xuefeng Xiao, Weilin Huang, Min Zheng, Lean Fu, Guanbin Li
•
Apr 8, 2024
•
26
1
SpatialTracker: Tracking Any 2D Pixels in 3D Space
Yuxi Xiao, Qianqian Wang, Shangzhan Zhang, Nan Xue, Sida Peng, Yujun Shen, Xiaowei Zhou
•
Apr 5, 2024
•
26
1
BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
Gwanghyun Kim, Hayeon Kim, Hoigi Seo, Dong Un Kang, Se Young Chun
•
Apr 6, 2024
•
24
0
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Bo He, Hengduo Li, Young Kyun Jang, Menglin Jia, Xuefei Cao, Ashish Shah, Abhinav Shrivastava, Ser-Nam Lim
•
Apr 8, 2024
•
23
0
PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations
Yang Zheng, Qingqing Zhao, Guandao Yang, Wang Yifan, Donglai Xiang, Florian Dubost, Dmitry Lagun, Thabo Beeler, Federico Tombari, Leonidas Guibas, Gordon Wetzstein
•
Apr 5, 2024
•
18
0
YaART: Yet Another ART Rendering Technology
Sergey Kastryulin, Artem Konev, Alexander Shishenya, Eugene Lyapustin, Artem Khurshudov, Alexander Tselousov, Nikita Vinokurov, Denis Kuznedelev, Alexander Markovich, Grigoriy Livshits, Alexey Kirillov, Anastasiia Tabisheva, Liubov Chubarova, Marina Kaminskaia, Alexander Ustyuzhanin, Artemii Shvetsov, Daniil Shlenskii, Valerii Startsev, Dmitrii Kornilov, Mikhail Romanov, Artem Babenko, Sergei Ovcharenko, Valentin Khrulkov
•
Apr 8, 2024
•
17
0
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
Kunpeng Song, Yizhe Zhu, Bingchen Liu, Qing Yan, Ahmed Elgammal, Xiao Yang
•
Apr 8, 2024
•
15
2
Aligning Diffusion Models by Optimizing Human Utility
Shufan Li, Konstantinos Kallidromitis, Akash Gokul, Yusuke Kato, Kazuki Kozuka
•
Apr 6, 2024
•
15
1
Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models
Zhengcong Fei, Mingyuan Fan, Changqian Yu, Debang Li, Junshi Huang
•
Apr 6, 2024
•
13
0
DATENeRF: Depth-Aware Text-based Editing of NeRFs
Sara Rojas, Julien Philip, Kai Zhang, Sai Bi, Fujun Luan, Bernard Ghanem, Kalyan Sunkavalli
•
Apr 6, 2024
•
11
0
Koala: Key frame-conditioned long video-LLM
Reuben Tan, Ximeng Sun, Ping Hu, Jui-hsien Wang, Hanieh Deilamsalehy, Bryan A. Plummer, Bryan Russell, Kate Saenko
•
Apr 5, 2024
•
7
2