ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
August 7th, 2024
在測試時間計算上,最佳地調整LLM的規模可能比調整模型參數更有效。
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Charlie Snell, Jaehoon Lee, Kelvin Xu, Aviral Kumar
•
Aug 6, 2024
•
63
3
MMIU:多模態多圖像理解用於評估大視覺語言模型
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Fanqing Meng, Jin Wang, Chuanhao Li, Quanfeng Lu, Hao Tian, Jiaqi Liao, Xizhou Zhu, Jifeng Dai, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao
•
Aug 5, 2024
•
62
3
LLaVA-OneVision:簡單的視覺任務轉移
LLaVA-OneVision: Easy Visual Task Transfer
Bo Li, Yuanhan Zhang, Dong Guo, Renrui Zhang, Feng Li, Hao Zhang, Kaichen Zhang, Yanwei Li, Ziwei Liu, Chunyuan Li
•
Aug 6, 2024
•
61
2
一個物件值得 64x64 像素:透過影像擴散生成 3D 物件
An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion
Xingguang Yan, Han-Hung Lee, Ziyu Wan, Angel X. Chang
•
Aug 6, 2024
•
41
3
MedTrinity-25M:一個具有多層次標註的大規模多模態醫學數據集
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Yunfei Xie, Ce Zhou, Lang Gao, Juncheng Wu, Xianhang Li, Hong-Yu Zhou, Sheng Liu, Lei Xing, James Zou, Cihang Xie, Yuyin Zhou
•
Aug 6, 2024
•
30
2
IPAdapter-Instruct:使用Instruct提示解決基於圖像條件的歧義
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts
Ciara Rowles, Shimon Vainer, Dante De Nigris, Slava Elizarov, Konstantin Kutsy, Simon Donné
•
Aug 6, 2024
•
23
2
CoverBench:一個針對複雜主張驗證的具挑戰性基準测试
CoverBench: A Challenging Benchmark for Complex Claim Verification
Alon Jacovi, Moran Ambar, Eyal Ben-David, Uri Shaham, Amir Feder, Mor Geva, Dror Marcus, Avi Caciularu
•
Aug 6, 2024
•
15
2
擴散模型作為資料探勘工具
Diffusion Models as Data Mining Tools
Ioannis Siglidis, Aleksander Holynski, Alexei A. Efros, Mathieu Aubry, Shiry Ginosar
•
Jul 20, 2024
•
14
2
ReSyncer:為統一的音視覺同步面部表演者重新連線風格生成器
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Jiazhi Guan, Zhiliang Xu, Hang Zhou, Kaisiyuan Wang, Shengyi He, Zhanwang Zhang, Borong Liang, Haocheng Feng, Errui Ding, Jingtuo Liu, Jingdong Wang, Youjian Zhao, Ziwei Liu
•
Aug 6, 2024
•
11
2
從弱和強LML中合成文本到SQL數據
Synthesizing Text-to-SQL Data from Weak and Strong LLMs
Jiaxi Yang, Binyuan Hui, Min Yang, Jian Yang, Junyang Lin, Chang Zhou
•
Aug 6, 2024
•
11
2
StructEval:透過結構化評估加深和擴展大型語言模型的評估
StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation
Boxi Cao, Mengjie Ren, Hongyu Lin, Xianpei Han, Feng Zhang, Junfeng Zhan, Le Sun
•
Aug 6, 2024
•
10
2
AVESFormer:針對即時音視覺分割設計的高效Transformer
AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
Zili Wang, Qi Yang, Linsu Shi, Jiazhong Yu, Qinghua Liang, Fei Li, Shiming Xiang
•
Aug 3, 2024
•
4
2