ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
August 7th, 2024
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Charlie Snell, Jaehoon Lee, Kelvin Xu, Aviral Kumar
Aug 6, 2024
63
3
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Fanqing Meng, Jin Wang, Chuanhao Li, Quanfeng Lu, Hao Tian, Jiaqi Liao, Xizhou Zhu, Jifeng Dai, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao
Aug 5, 2024
62
3
LLaVA-OneVision: Easy Visual Task Transfer
Bo Li, Yuanhan Zhang, Dong Guo, Renrui Zhang, Feng Li, Hao Zhang, Kaichen Zhang, Yanwei Li, Ziwei Liu, Chunyuan Li
Aug 6, 2024
61
2
An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion
Xingguang Yan, Han-Hung Lee, Ziyu Wan, Angel X. Chang
Aug 6, 2024
41
3
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Yunfei Xie, Ce Zhou, Lang Gao, Juncheng Wu, Xianhang Li, Hong-Yu Zhou, Sheng Liu, Lei Xing, James Zou, Cihang Xie, Yuyin Zhou
Aug 6, 2024
30
2
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts
Ciara Rowles, Shimon Vainer, Dante De Nigris, Slava Elizarov, Konstantin Kutsy, Simon Donné
Aug 6, 2024
23
2
CoverBench: A Challenging Benchmark for Complex Claim Verification
Alon Jacovi, Moran Ambar, Eyal Ben-David, Uri Shaham, Amir Feder, Mor Geva, Dror Marcus, Avi Caciularu
Aug 6, 2024
15
2
Diffusion Models as Data Mining Tools
Ioannis Siglidis, Aleksander Holynski, Alexei A. Efros, Mathieu Aubry, Shiry Ginosar
Jul 20, 2024
14
2
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Jiazhi Guan, Zhiliang Xu, Hang Zhou, Kaisiyuan Wang, Shengyi He, Zhanwang Zhang, Borong Liang, Haocheng Feng, Errui Ding, Jingtuo Liu, Jingdong Wang, Youjian Zhao, Ziwei Liu
Aug 6, 2024
11
2
Synthesizing Text-to-SQL Data from Weak and Strong LLMs
Jiaxi Yang, Binyuan Hui, Min Yang, Jian Yang, Junyang Lin, Chang Zhou
Aug 6, 2024
11
2
StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation
Boxi Cao, Mengjie Ren, Hongyu Lin, Xianpei Han, Feng Zhang, Junfeng Zhan, Le Sun
Aug 6, 2024
10
2
AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
Zili Wang, Qi Yang, Linsu Shi, Jiazhong Yu, Qinghua Liang, Fei Li, Shiming Xiang
Aug 3, 2024
4
2