ChatPaper.ai
Daily AI Research Paper Picks
Curated AI research papers and translations, updated daily
July 19th, 2024
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Chaofan Tao, Qian Liu, Longxu Dou, Niklas Muennighoff, Zhongwei Wan, Ping Luo, Min Lin, Ngai Wong
Jul 18, 2024
57
6
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Rulin Shao, Jacqueline He, Akari Asai, Weijia Shi, Tim Dettmers, Sewon Min, Luke Zettlemoyer, Pang Wei Koh
Jul 9, 2024
32
3
Shape of Motion: 4D Reconstruction from a Single Video
Qianqian Wang, Vickie Ye, Hang Gao, Jake Austin, Zhengqi Li, Angjoo Kanazawa
Jul 18, 2024
20
2
Scaling Granite Code Models to 128K Context
Matt Stallone, Vaibhav Saxena, Leonid Karlinsky, Bridget McGinn, Tim Bula, Mayank Mishra, Adriana Meza Soria, Gaoyuan Zhang, Aditya Prasad, Yikang Shen, Saptha Surendran, Shanmukha Guttula, Hima Patel, Parameswaran Selvam, Xuan-Hong Dang, Yan Koyfman, Atin Sood, Rogerio Feris, Nirmit Desai, David D. Cox, Ruchir Puri, Rameswar Panda
Jul 18, 2024
20
3
Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion
Boyang Deng, Richard Tucker, Zhengqi Li, Leonidas Guibas, Noah Snavely, Gordon Wetzstein
Jul 18, 2024
18
2
Understanding Reference Policies in Direct Preference Optimization
Yixin Liu, Pengfei Liu, Arman Cohan
Jul 18, 2024
17
3
Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Yichi Zhang, Yao Huang, Yitong Sun, Chang Liu, Zhe Zhao, Zhengwei Fang, Yifan Wang, Huanran Chen, Xiao Yang, Xingxing Wei, Hang Su, Yinpeng Dong, Jun Zhu
Jun 11, 2024
17
4
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
Longwen Zhang, Ziyu Wang, Qixuan Zhang, Qiwei Qiu, Anqi Pang, Haoran Jiang, Wei Yang, Lan Xu, Jingyi Yu
May 30, 2024
12
2
Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation
Damien Sileo
Jul 18, 2024
10
3
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Hongjin Su, Howard Yen, Mengzhou Xia, Weijia Shi, Niklas Muennighoff, Han-yu Wang, Haisu Liu, Quan Shi, Zachary S. Siegel, Michael Tang, Ruoxi Sun, Jinsung Yoon, Sercan O. Arik, Danqi Chen, Tao Yu
Jul 16, 2024
9
2
CodeV: Empowering LLMs for Verilog Generation through Multi-Level Summarization
Yang Zhao, Di Huang, Chongxiao Li, Pengwei Jin, Ziyuan Nan, Tianyun Ma, Lei Qi, Yansong Pan, Zhenxing Zhang, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen
Jul 15, 2024
9
3
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
To Eun Kim, Alireza Salemi, Andrew Drozdov, Fernando Diaz, Hamed Zamani
Jul 17, 2024
6
2
Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation
Yotam Perlitz, Ariel Gera, Ofir Arviv, Asaf Yehudai, Elron Bandel, Eyal Shnarch, Michal Shmueli-Scheuer, Leshem Choshen
Jul 18, 2024
5
3
A Comparative Study on Automatic Coding of Medical Letters with Explainability
Jamie Glen, Lifeng Han, Paul Rayson, Goran Nenadic
Jul 18, 2024
5
2
PM-LLM-Benchmark: Evaluating Large Language Models on Process Mining Tasks
Alessandro Berti, Humam Kourani, Wil M. P. van der Aalst
Jul 18, 2024
2
2