ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
July 30th, 2024
SaulLM-54B和SaulLM-141B:扩展法律领域的领域自适应
SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain
Pierre Colombo, Telmo Pires, Malik Boudiaf, Rui Melo, Dominic Culver, Sofia Morgado, Etienne Malaboeuf, Gabriel Hautreux, Johanne Charpentier, Michael Desa
•
Jul 28, 2024
•
66
2
将大型语言模型集成到三模态架构中,用于自动抑郁症分类。
Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification
Santosh V. Patapati
•
Jul 27, 2024
•
59
9
SeaLLMs 3:面向东南亚语言的开放基础和聊天多语言大型语言模型
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages
Wenxuan Zhang, Hou Pong Chan, Yiran Zhao, Mahani Aljunied, Jianyu Wang, Chaoqun Liu, Yue Deng, Zhiqiang Hu, Weiwen Xu, Yew Ken Chia, Xin Li, Lidong Bing
•
Jul 29, 2024
•
58
6
FreeLong:使用SpectralBlend时域注意力实现无需训练的长视频生成
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention
Yu Lu, Yuanzhi Liang, Linchao Zhu, Yi Yang
•
Jul 29, 2024
•
52
2
Theia:为机器人学习提炼多样视觉基础模型
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Jinghuan Shang, Karl Schmeckpeper, Brandon B. May, Maria Vittoria Minniti, Tarik Kelestemur, David Watkins, Laura Herlant
•
Jul 29, 2024
•
48
3
心智搜索:模拟人类思维引发深度人工智能搜索者
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Zehui Chen, Kuikun Liu, Qiuchen Wang, Jiangning Liu, Wenwei Zhang, Kai Chen, Feng Zhao
•
Jul 29, 2024
•
44
4
MMAU:跨多领域代理能力的整体基准
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Guoli Yin, Haoping Bai, Shuang Ma, Feng Nan, Yanchao Sun, Zhaoyang Xu, Shen Ma, Jiarui Lu, Xiang Kong, Aonan Zhang, Dian Ang Yap, Yizhe zhang, Karsten Ahnert, Vik Kamath, Mathias Berglund, Dominic Walsh, Tobias Gindele, Juergen Wiest, Zhengfeng Lai, Xiaoming Wang, Jiulong Shan, Meng Cao, Ruoming Pang, Zirui Wang
•
Jul 18, 2024
•
41
4
扩散反馈有助于改善CLIP的视觉效果。
Diffusion Feedback Helps CLIP See Better
Wenxuan Wang, Quan Sun, Fan Zhang, Yepeng Tang, Jing Liu, Xinlong Wang
•
Jul 29, 2024
•
37
2
嵌套专家混合模型:视觉标记的自适应处理
Mixture of Nested Experts: Adaptive Processing of Visual Tokens
Gagan Jain, Nidhi Hegde, Aditya Kusupati, Arsha Nagrani, Shyamal Buch, Prateek Jain, Anurag Arnab, Sujoy Paul
•
Jul 29, 2024
•
37
4
通过直接优化偏好进行自我训练可改善思维链推理。
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Tianduo Wang, Shichen Li, Wei Lu
•
Jul 25, 2024
•
34
4
Cycle3D:通过生成-重建循环实现高质量和一致的图像到三维生成
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
Zhenyu Tang, Junwu Zhang, Xinhua Cheng, Wangbo Yu, Chaoran Feng, Yatian Pang, Bin Lin, Li Yuan
•
Jul 28, 2024
•
28
2
视觉谜题:一个针对大型视觉和语言模型的常识和世界知识挑战
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models
Nitzan Bitton-Guetta, Aviv Slobodkin, Aviya Maimon, Eliya Habba, Royi Rassin, Yonatan Bitton, Idan Szpektor, Amir Globerson, Yuval Elovici
•
Jul 28, 2024
•
23
2
城市场景理解的3D问答
3D Question Answering for City Scene Understanding
Penglei Sun, Yaoxian Song, Xiang Liu, Xiaofei Yang, Qiang Wang, Tiefeng Li, Yang Yang, Xiaowen Chu
•
Jul 24, 2024
•
22
5
ATHAR:用于古典阿拉伯语到英语翻译的高质量和多样化数据集
ATHAR: A High-Quality and Diverse Dataset for Classical Arabic to English Translation
Mohammed Khalil, Mohammed Sabry
•
Jul 29, 2024
•
21
1
元奖励语言模型:通过LLM作为元评判者实现自我改进的对齐
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Tianhao Wu, Weizhe Yuan, Olga Golovneva, Jing Xu, Yuandong Tian, Jiantao Jiao, Jason Weston, Sainbayar Sukhbaatar
•
Jul 28, 2024
•
21
2
ImagiNet:用于通过对比学习实现通用合成图像检测的多内容数据集
ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning
Delyan Boychev, Radostin Cholakov
•
Jul 29, 2024
•
20
2
利用大型语言模型进行立陶宛在线评论的情感分析
Sentiment Analysis of Lithuanian Online Reviews Using Large Language Models
Brigita Vileikytė, Mantas Lukoševičius, Lukas Stankevičius
•
Jul 29, 2024
•
12
1
弥合差距:从单目手机捕捉实现类似工作室的阿凡达创建
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
ShahRukh Athar, Shunsuke Saito, Zhengyu Yang, Stanislav Pidhorsky, Chen Cao
•
Jul 28, 2024
•
12
1
WalkTheDog:通过相位流形实现跨形态运动对齐
WalkTheDog: Cross-Morphology Motion Alignment via Phase Manifolds
Peizhuo Li, Sebastian Starke, Yuting Ye, Olga Sorkine-Hornung
•
Jul 11, 2024
•
12
2
VolDoGer:LLM辅助数据集用于视觉-语言任务中的领域泛化
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks
Juhwan Choi, Junehyoung Kwon, JungMin Yun, Seunguk Yu, YoungBin Kim
•
Jul 29, 2024
•
11
3
TAPTRv2:基于注意力的位置更新改进了跟踪任意点
TAPTRv2: Attention-based Position Update Improves Tracking Any Point
Hongyang Li, Hao Zhang, Shilong Liu, Zhaoyang Zeng, Feng Li, Tianhe Ren, Bohan Li, Lei Zhang
•
Jul 23, 2024
•
11
4