ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
July 19th, 2024
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Chaofan Tao, Qian Liu, Longxu Dou, Niklas Muennighoff, Zhongwei Wan, Ping Luo, Min Lin, Ngai Wong
Jul 18, 2024 • 57 • 6
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Rulin Shao, Jacqueline He, Akari Asai, Weijia Shi, Tim Dettmers, Sewon Min, Luke Zettlemoyer, Pang Wei Koh
Jul 9, 2024 • 32 • 3
Shape of Motion: 4D Reconstruction from a Single Video
Qianqian Wang, Vickie Ye, Hang Gao, Jake Austin, Zhengqi Li, Angjoo Kanazawa
Jul 18, 2024 • 20 • 2
Scaling Granite Code Models to 128K Context
Matt Stallone, Vaibhav Saxena, Leonid Karlinsky, Bridget McGinn, Tim Bula, Mayank Mishra, Adriana Meza Soria, Gaoyuan Zhang, Aditya Prasad, Yikang Shen, Saptha Surendran, Shanmukha Guttula, Hima Patel, Parameswaran Selvam, Xuan-Hong Dang, Yan Koyfman, Atin Sood, Rogerio Feris, Nirmit Desai, David D. Cox, Ruchir Puri, Rameswar Panda
Jul 18, 2024 • 20 • 3
Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion
Boyang Deng, Richard Tucker, Zhengqi Li, Leonidas Guibas, Noah Snavely, Gordon Wetzstein
Jul 18, 2024 • 18 • 2
Understanding Reference Policies in Direct Preference Optimization
Yixin Liu, Pengfei Liu, Arman Cohan
Jul 18, 2024 • 17 • 3
Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Yichi Zhang, Yao Huang, Yitong Sun, Chang Liu, Zhe Zhao, Zhengwei Fang, Yifan Wang, Huanran Chen, Xiao Yang, Xingxing Wei, Hang Su, Yinpeng Dong, Jun Zhu
Jun 11, 2024 • 17 • 4
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
Longwen Zhang, Ziyu Wang, Qixuan Zhang, Qiwei Qiu, Anqi Pang, Haoran Jiang, Wei Yang, Lan Xu, Jingyi Yu
May 30, 2024 • 12 • 2
Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation
Damien Sileo
Jul 18, 2024 • 10 • 3
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Hongjin Su, Howard Yen, Mengzhou Xia, Weijia Shi, Niklas Muennighoff, Han-yu Wang, Haisu Liu, Quan Shi, Zachary S. Siegel, Michael Tang, Ruoxi Sun, Jinsung Yoon, Sercan O. Arik, Danqi Chen, Tao Yu
Jul 16, 2024 • 9 • 2
CodeV: Empowering LLMs for Verilog Generation through Multi-Level Summarization
Yang Zhao, Di Huang, Chongxiao Li, Pengwei Jin, Ziyuan Nan, Tianyun Ma, Lei Qi, Yansong Pan, Zhenxing Zhang, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen
Jul 15, 2024 • 9 • 3
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
To Eun Kim, Alireza Salemi, Andrew Drozdov, Fernando Diaz, Hamed Zamani
Jul 17, 2024 • 6 • 2
Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation
Yotam Perlitz, Ariel Gera, Ofir Arviv, Asaf Yehudai, Elron Bandel, Eyal Shnarch, Michal Shmueli-Scheuer, Leshem Choshen
Jul 18, 2024 • 5 • 3
A Comparative Study on Automatic Coding of Medical Letters with Explainability
Jamie Glen, Lifeng Han, Paul Rayson, Goran Nenadic
Jul 18, 2024 • 5 • 2
PM-LLM-Benchmark: Evaluating Large Language Models on Process Mining Tasks
Alessandro Berti, Humam Kourani, Wil M. P. van der Aalst
Jul 18, 2024 • 2 • 2