ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
July 9th, 2024
MJ-Bench:您的多模態獎勵模型真的是評估文本到圖像生成的良好標準嗎?
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Zhaorun Chen, Yichao Du, Zichen Wen, Yiyang Zhou, Chenhang Cui, Zhenzhen Weng, Haoqin Tu, Chaoqi Wang, Zhengwei Tong, Qinglan Huang, Canyu Chen, Qinghao Ye, Zhihong Zhu, Yuqing Zhang, Jiawei Zhou, Zhuokai Zhao, Rafael Rafailov, Chelsea Finn, Huaxiu Yao
•
Jul 5, 2024
•
57
5
LLaMAX:通過增強超過100種語言的翻譯能力,擴展LLM的語言範圍
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages
Yinquan Lu, Wenhao Zhu, Lei Li, Yu Qiao, Fei Yuan
•
Jul 8, 2024
•
38
2
關聯循環記憶轉換器
Associative Recurrent Memory Transformer
Ivan Rodkin, Yuri Kuratov, Aydar Bulatov, Mikhail Burtsev
•
Jul 5, 2024
•
37
2
從影片和模擬中學習以行動和推理為中心的圖像編輯
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations
Benno Krojer, Dheeraj Vattikonda, Luis Lara, Varun Jampani, Eva Portelance, Christopher Pal, Siva Reddy
•
Jul 3, 2024
•
32
2
ANOLE:一種開放、自回歸、本地的大型多模態模型,用於交錯的圖像-文本生成。
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation
Ethan Chern, Jiadi Su, Yan Ma, Pengfei Liu
•
Jul 8, 2024
•
23
4
評估語言模型上下文窗口:一項「工作記憶」測試和推論時間修正
Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction
Amanda Dsouza, Christopher Glaze, Changho Shin, Frederic Sala
•
Jul 4, 2024
•
17
1
UltraEdit:基於指令的大規模精細圖像編輯
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Haozhe Zhao, Xiaojian Ma, Liang Chen, Shuzheng Si, Rujie Wu, Kaikai An, Peiyu Yu, Minjia Zhang, Qing Li, Baobao Chang
•
Jul 7, 2024
•
15
1
Tailor3D:使用雙面圖像進行定制化3D資產編輯與生成
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
Zhangyang Qi, Yunhan Yang, Mengchen Zhang, Long Xing, Xiaoyang Wu, Tong Wu, Dahua Lin, Xihui Liu, Jiaqi Wang, Hengshuang Zhao
•
Jul 8, 2024
•
14
1
InverseCoder:透過Inverse-Instruct釋放指令調整的代碼LLM的潛力
InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct
Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang, Lingzhe Gao, Shihao Liu, Ziyuan Nan, Kaizhao Yuan, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Yewen Pu, Dawei Yin, Xing Hu, Yunji Chen
•
Jul 8, 2024
•
14
2
作為流均衡的組合式視頻生成
Compositional Video Generation as Flow Equalization
Xingyi Yang, Xinchao Wang
•
Jun 10, 2024
•
14
1
視覺語言模型中的多對象幻覺
Multi-Object Hallucination in Vision-Language Models
Xuweiyi Chen, Ziqiao Ma, Xuejun Zhang, Sihan Xu, Shengyi Qian, Jianing Yang, David F. Fouhey, Joyce Chai
•
Jul 8, 2024
•
12
2
PAS:高效且節省資料的即插即用提示增強系統
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System
Miao Zheng, Hao Liang, Fan Yang, Haoze Sun, Tianpeng Li, Lingchu Xiong, Yan Zhang, Yozhen Wu, Kun Li, Yanjun Sheng, Mingan Lin, Tao Zhang, Guosheng Dong, Yujing Qiao, Kun Fang, Weipeng Chen, Bin Cui, Wentao Zhang, Zenan Zhou
•
Jul 8, 2024
•
11
2
透過基於檢索的蒸餾訓練任務專家
Training Task Experts through Retrieval Based Distillation
Jiaxin Ge, Xueying Jia, Vijay Viswanathan, Hongyin Luo, Graham Neubig
•
Jul 7, 2024
•
10
1
透過複雜度的觀點理解視覺特徵依賴
Understanding Visual Feature Reliance through the Lens of Complexity
Thomas Fel, Louis Bethune, Andrew Kyle Lampinen, Thomas Serre, Katherine Hermann
•
Jul 8, 2024
•
7
1
PartCraft:透過零件製作創意物件
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang
•
Jul 5, 2024
•
6
2
LLMAEL:大型語言模型是實體鏈接的良好上下文增強器。
LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
Amy Xin, Yunjia Qi, Zijun Yao, Fangwei Zhu, Kaisheng Zeng, Xu Bin, Lei Hou, Juanzi Li
•
Jul 4, 2024
•
4
1
ANAH-v2:擴展大型語言模型的分析性幻覺標註
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen
•
Jul 5, 2024
•
3
3