ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
July 30th, 2024
SaulLM-54B和SaulLM-141B:擴展法律領域的領域適應
SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain
Pierre Colombo, Telmo Pires, Malik Boudiaf, Rui Melo, Dominic Culver, Sofia Morgado, Etienne Malaboeuf, Gabriel Hautreux, Johanne Charpentier, Michael Desa
•
Jul 28, 2024
•
66
2
將大型語言模型整合到三模態架構中,用於自動化抑鬱分類。
Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification
Santosh V. Patapati
•
Jul 27, 2024
•
59
9
SeaLLMs 3:開放式基礎和多語言聊天大型語言模型,適用於東南亞語言
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages
Wenxuan Zhang, Hou Pong Chan, Yiran Zhao, Mahani Aljunied, Jianyu Wang, Chaoqun Liu, Yue Deng, Zhiqiang Hu, Weiwen Xu, Yew Ken Chia, Xin Li, Lidong Bing
•
Jul 29, 2024
•
58
6
FreeLong:使用SpectralBlend暫時關注實現無需訓練的長視頻生成
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention
Yu Lu, Yuanzhi Liang, Linchao Zhu, Yi Yang
•
Jul 29, 2024
•
52
2
Theia:為機器人學習提煉多元視覺基礎模型
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Jinghuan Shang, Karl Schmeckpeper, Brandon B. May, Maria Vittoria Minniti, Tarik Kelestemur, David Watkins, Laura Herlant
•
Jul 29, 2024
•
48
3
心靈搜尋:模仿人類思維引發深度人工智慧搜索者
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Zehui Chen, Kuikun Liu, Qiuchen Wang, Jiangning Liu, Wenwei Zhang, Kai Chen, Feng Zhao
•
Jul 29, 2024
•
44
4
MMAU:跨多個領域綜合評估智能體能力的基準
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Guoli Yin, Haoping Bai, Shuang Ma, Feng Nan, Yanchao Sun, Zhaoyang Xu, Shen Ma, Jiarui Lu, Xiang Kong, Aonan Zhang, Dian Ang Yap, Yizhe zhang, Karsten Ahnert, Vik Kamath, Mathias Berglund, Dominic Walsh, Tobias Gindele, Juergen Wiest, Zhengfeng Lai, Xiaoming Wang, Jiulong Shan, Meng Cao, Ruoming Pang, Zirui Wang
•
Jul 18, 2024
•
41
4
擴散反饋有助於提升 CLIP 的視覺能力
Diffusion Feedback Helps CLIP See Better
Wenxuan Wang, Quan Sun, Fan Zhang, Yepeng Tang, Jing Liu, Xinlong Wang
•
Jul 29, 2024
•
37
2
嵌套專家混合模型:視覺標記的適應處理
Mixture of Nested Experts: Adaptive Processing of Visual Tokens
Gagan Jain, Nidhi Hegde, Aditya Kusupati, Arsha Nagrani, Shyamal Buch, Prateek Jain, Anurag Arnab, Sujoy Paul
•
Jul 29, 2024
•
37
4
通過直接偏好優化的自我訓練改善了思維鏈推理。
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Tianduo Wang, Shichen Li, Wei Lu
•
Jul 25, 2024
•
34
4
Cycle3D:透過生成-重建循環實現高質量且一致的圖像到3D生成
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
Zhenyu Tang, Junwu Zhang, Xinhua Cheng, Wangbo Yu, Chaoran Feng, Yatian Pang, Bin Lin, Li Yuan
•
Jul 28, 2024
•
28
2
視覺謎題:針對大視覺和語言模型的常識和世界知識挑戰
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models
Nitzan Bitton-Guetta, Aviv Slobodkin, Aviya Maimon, Eliya Habba, Royi Rassin, Yonatan Bitton, Idan Szpektor, Amir Globerson, Yuval Elovici
•
Jul 28, 2024
•
23
2
城市場景理解的3D問答
3D Question Answering for City Scene Understanding
Penglei Sun, Yaoxian Song, Xiang Liu, Xiaofei Yang, Qiang Wang, Tiefeng Li, Yang Yang, Xiaowen Chu
•
Jul 24, 2024
•
22
5
ATHAR:用於古典阿拉伯文到英文翻譯的高質量和多樣化數據集
ATHAR: A High-Quality and Diverse Dataset for Classical Arabic to English Translation
Mohammed Khalil, Mohammed Sabry
•
Jul 29, 2024
•
21
1
元獎勵語言模型:透過以LLM為元評判者自我改進對齊
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Tianhao Wu, Weizhe Yuan, Olga Golovneva, Jing Xu, Yuandong Tian, Jiantao Jiao, Jason Weston, Sainbayar Sukhbaatar
•
Jul 28, 2024
•
21
2
ImagiNet:通過對比學習實現通用合成圖像檢測的多內容數據集
ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning
Delyan Boychev, Radostin Cholakov
•
Jul 29, 2024
•
20
2
利用大型語言模型進行立陶宛線上評論的情感分析
Sentiment Analysis of Lithuanian Online Reviews Using Large Language Models
Brigita Vileikytė, Mantas Lukoševičius, Lukas Stankevičius
•
Jul 29, 2024
•
12
1
填補空白:從單眼手機捕捉實現類似工作室的頭像創建
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
ShahRukh Athar, Shunsuke Saito, Zhengyu Yang, Stanislav Pidhorsky, Chen Cao
•
Jul 28, 2024
•
12
1
WalkTheDog:透過相位流形的跨形態運動對齊
WalkTheDog: Cross-Morphology Motion Alignment via Phase Manifolds
Peizhuo Li, Sebastian Starke, Yuting Ye, Olga Sorkine-Hornung
•
Jul 11, 2024
•
12
2
VolDoGer:在視覺-語言任務中協助領域泛化的LLM輔助數據集
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks
Juhwan Choi, Junehyoung Kwon, JungMin Yun, Seunguk Yu, YoungBin Kim
•
Jul 29, 2024
•
11
3
TAPTRv2:基於注意力的位置更新改善追蹤任意點
TAPTRv2: Attention-based Position Update Improves Tracking Any Point
Hongyang Li, Hao Zhang, Shilong Liu, Zhaoyang Zeng, Feng Li, Tianhe Ren, Bohan Li, Lei Zhang
•
Jul 23, 2024
•
11
4