ChatPaper.ai
메뉴 열기
홈
오늘의 논문
arXiv
HuggingFace
요금제
계정
작업공간
🇰🇷
한국어
Loading...
•
•
•
•
•
•
•
•
•
•
AI 연구 논문 데일리
번역이 포함된 일일 선별된 AI 연구 논문
July 17th, 2024
Qwen2-Audio 기술 보고서
Qwen2-Audio Technical Report
Yunfei Chu, Jin Xu, Qian Yang, Haojie Wei, Xipin Wei, Zhifang Guo, Yichong Leng, Yuanjun Lv, Jinzheng He, Junyang Lin, Chang Zhou, Jingren Zhou
•
Jul 15, 2024
•
60
7
NeedleBench: LLM이 100만 컨텍스트 윈도우에서 검색과 추론을 할 수 있을까?
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?
Mo Li, Songyang Zhang, Yunxin Liu, Kai Chen
•
Jul 16, 2024
•
45
3
확산 트랜스포머를 160억 파라미터로 확장하기
Scaling Diffusion Transformers to 16 Billion Parameters
Zhengcong Fei, Mingyuan Fan, Changqian Yu, Debang Li, Junshi Huang
•
Jul 16, 2024
•
27
2
Ref-AVS: 오디오-비주얼 장면에서 객체 참조 및 분할
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes
Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao Li, Honggang Zhang, Di Hu
•
Jul 15, 2024
•
25
5
Sibyl: 복잡한 현실 세계 추론을 위한 간단하지만 효과적인 에이전트 프레임워크
Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning
Yulong Wang, Tianhao Shen, Lifeng Liu, Jian Xie
•
Jul 15, 2024
•
18
4
VLMEvalKit: 대규모 다중 모달리티 모델 평가를 위한 오픈소스 툴킷
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models
Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen
•
Jul 16, 2024
•
14
3
DreamCatalyst: 편집성과 정체성 보존 제어를 통한 빠르고 고품질의 3D 편집
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
Jiwook Kim, Seonho Lee, Jaeyo Shin, Jiho Choi, Hyunjung Shim
•
Jul 16, 2024
•
12
2
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Yanqin Jiang, Chaohui Yu, Chenjie Cao, Fan Wang, Weiming Hu, Jin Gao
•
Jul 16, 2024
•
10
2
디노이즈된 신경 가중치를 활용한 효율적 학습
Efficient Training with Denoised Neural Weights
Yifan Gong, Zheng Zhan, Yanyu Li, Yerlan Idelbayev, Andrey Zharkov, Kfir Aberman, Sergey Tulyakov, Yanzhi Wang, Jian Ren
•
Jul 16, 2024
•
9
3
FIRE: 멀티모달 모델의 피드백 통합 및 개선 평가를 위한 데이터셋
FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models
Pengxiang Li, Zhi Gao, Bofei Zhang, Tao Yuan, Yuwei Wu, Mehrtash Harandi, Yunde Jia, Song-Chun Zhu, Qing Li
•
Jul 16, 2024
•
9
2
YouTube-SL-25: 대규모 오픈 도메인 다국어 수화 병렬 코퍼스
YouTube-SL-25: A Large-Scale, Open-Domain Multilingual Sign Language Parallel Corpus
Garrett Tanzer, Biao Zhang
•
Jul 15, 2024
•
9
4
EfficientQAT: 대규모 언어 모델을 위한 효율적인 양자화 인지 학습
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Mengzhao Chen, Wenqi Shao, Peng Xu, Jiahao Wang, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo
•
Jul 10, 2024
•
9
3
GaLore에서 WeLore로: 저랭크 그래디언트가 어떻게 비균일적으로 저랭크 가중치를 생성하는가
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients
Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu, Jiawei Zhao, Yuandong Tian, Zhangyang Wang
•
Jul 15, 2024
•
8
2
OmniBind: 바인딩 공간을 통한 대규범용 다중모달 표현
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
Zehan Wang, Ziang Zhang, Hang Zhang, Luping Liu, Rongjie Huang, Xize Cheng, Hengshuang Zhao, Zhou Zhao
•
Jul 16, 2024
•
7
3
시뮬레이션된 휴머노이드를 이용한 다양한 물체 파지
Grasping Diverse Objects with Simulated Humanoids
Zhengyi Luo, Jinkun Cao, Sammy Christen, Alexander Winkler, Kris Kitani, Weipeng Xu
•
Jul 16, 2024
•
5
2
Vibravox: 신체 전도 오디오 센서로 수집한 프랑스어 음성 데이터셋
Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors
Julien Hauret, Malo Olivier, Thomas Joubaud, Christophe Langrenne, Sarah Poirée, Véronique Zimpfer, Éric Bavu
•
Jul 16, 2024
•
4
2
Data-Juicer 샌드박스: 멀티모달 데이터-모델 공동 개발을 위한 포괄적 도구 모음
Data-Juicer Sandbox: A Comprehensive Suite for Multimodal Data-Model Co-development
Daoyuan Chen, Haibin Wang, Yilun Huang, Ce Ge, Yaliang Li, Bolin Ding, Jingren Zhou
•
Jul 16, 2024
•
4
2
Click-Gaussian: 3D 가우시안에 대한 인터랙티브 세그멘테이션
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Seokhun Choi, Hyeonseop Song, Jaechul Kim, Taehyeong Kim, Hoseok Do
•
Jul 16, 2024
•
3
3
불확실성은 취약하다: 대규모 언어 모델에서의 불확실성 조작
Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models
Qingcheng Zeng, Mingyu Jin, Qinkai Yu, Zhenting Wang, Wenyue Hua, Zihao Zhou, Guangyan Sun, Yanda Meng, Shiqing Ma, Qifan Wang, Felix Juefei-Xu, Kaize Ding, Fan Yang, Ruixiang Tang, Yongfeng Zhang
•
Jul 15, 2024
•
1
2