ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
July 9th, 2024
MJ-Bench:您的多模态奖励模型真的是文本到图像生成的良好评判标准吗?
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Zhaorun Chen, Yichao Du, Zichen Wen, Yiyang Zhou, Chenhang Cui, Zhenzhen Weng, Haoqin Tu, Chaoqi Wang, Zhengwei Tong, Qinglan Huang, Canyu Chen, Qinghao Ye, Zhihong Zhu, Yuqing Zhang, Jiawei Zhou, Zhuokai Zhao, Rafael Rafailov, Chelsea Finn, Huaxiu Yao
•
Jul 5, 2024
•
57
5
LLaMAX:通过增强翻译能力拓展LLM的语言视野,超越100种语言
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages
Yinquan Lu, Wenhao Zhu, Lei Li, Yu Qiao, Fei Yuan
•
Jul 8, 2024
•
38
2
关联循环记忆变换器
Associative Recurrent Memory Transformer
Ivan Rodkin, Yuri Kuratov, Aydar Bulatov, Mikhail Burtsev
•
Jul 5, 2024
•
37
2
从视频和模拟中学习以行动和推理为中心的图像编辑
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations
Benno Krojer, Dheeraj Vattikonda, Luis Lara, Varun Jampani, Eva Portelance, Christopher Pal, Siva Reddy
•
Jul 3, 2024
•
32
2
ANOLE:一种开放的、自回归的、本地的大型多模态模型,用于交织的图像文本生成。
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation
Ethan Chern, Jiadi Su, Yan Ma, Pengfei Liu
•
Jul 8, 2024
•
23
4
评估语言模型上下文窗口:一项“工作记忆”测试和推理时间校正
Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction
Amanda Dsouza, Christopher Glaze, Changho Shin, Frederic Sala
•
Jul 4, 2024
•
17
1
UltraEdit:基于指令的大规模细粒度图像编辑
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Haozhe Zhao, Xiaojian Ma, Liang Chen, Shuzheng Si, Rujie Wu, Kaikai An, Peiyu Yu, Minjia Zhang, Qing Li, Baobao Chang
•
Jul 7, 2024
•
15
1
Tailor3D:使用双面图像定制3D资产编辑与生成
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
Zhangyang Qi, Yunhan Yang, Mengchen Zhang, Long Xing, Xiaoyang Wu, Tong Wu, Dahua Lin, Xihui Liu, Jiaqi Wang, Hengshuang Zhao
•
Jul 8, 2024
•
14
1
InverseCoder:通过Inverse-Instruct释放指令调整代码LLMs的力量
InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct
Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang, Lingzhe Gao, Shihao Liu, Ziyuan Nan, Kaizhao Yuan, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Yewen Pu, Dawei Yin, Xing Hu, Yunji Chen
•
Jul 8, 2024
•
14
2
作为流均衡的组合视频生成
Compositional Video Generation as Flow Equalization
Xingyi Yang, Xinchao Wang
•
Jun 10, 2024
•
14
1
视觉-语言模型中的多对象幻觉
Multi-Object Hallucination in Vision-Language Models
Xuweiyi Chen, Ziqiao Ma, Xuejun Zhang, Sihan Xu, Shengyi Qian, Jianing Yang, David F. Fouhey, Joyce Chai
•
Jul 8, 2024
•
12
2
PAS:数据高效即插即用提示增强系统
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System
Miao Zheng, Hao Liang, Fan Yang, Haoze Sun, Tianpeng Li, Lingchu Xiong, Yan Zhang, Yozhen Wu, Kun Li, Yanjun Sheng, Mingan Lin, Tao Zhang, Guosheng Dong, Yujing Qiao, Kun Fang, Weipeng Chen, Bin Cui, Wentao Zhang, Zenan Zhou
•
Jul 8, 2024
•
11
2
通过基于检索的蒸馏来训练任务专家
Training Task Experts through Retrieval Based Distillation
Jiaxin Ge, Xueying Jia, Vijay Viswanathan, Hongyin Luo, Graham Neubig
•
Jul 7, 2024
•
10
1
通过复杂性视角理解视觉特征依赖
Understanding Visual Feature Reliance through the Lens of Complexity
Thomas Fel, Louis Bethune, Andrew Kyle Lampinen, Thomas Serre, Katherine Hermann
•
Jul 8, 2024
•
7
1
PartCraft: 通过零件制作创意物体
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang
•
Jul 5, 2024
•
6
2
LLMAEL:大型语言模型是实体链接的良好上下文增强器
LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
Amy Xin, Yunjia Qi, Zijun Yao, Fangwei Zhu, Kaisheng Zeng, Xu Bin, Lei Hou, Juanzi Li
•
Jul 4, 2024
•
4
1
ANAH-v2:大型语言模型的分析性幻觉标注扩展
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen
•
Jul 5, 2024
•
3
3