ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
April 1st, 2024
Jamba: A Hybrid Transformer-Mamba Language Model
Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-Shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham
•
Mar 28, 2024
•
111
5
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Jinhyuk Lee, Zhuyun Dai, Xiaoqi Ren, Blair Chen, Daniel Cer, Jeremy R. Cole, Kai Hui, Michael Boratko, Rajvi Kapadia, Wen Ding, Yi Luan, Sai Meher Karthik Duddu, Gustavo Hernandez Abrego, Weiqiang Shi, Nithi Gupta, Aditya Kusupati, Prateek Jain, Siddhartha Reddy Jonnalagadda, Ming-Wei Chang, Iftekhar Naim
•
Mar 29, 2024
•
49
4
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Luchang Li, Sheng Qian, Jie Lu, Lunxi Yuan, Rui Wang, Qin Xie
•
Mar 29, 2024
•
35
3
ReALM: Reference Resolution As Language Modeling
Joel Ruben Antony Moniz, Soundarya Krishnan, Melis Ozyildirim, Prathamesh Saraf, Halim Cagri Ates, Yuan Zhang, Hong Yu, Nidhi Rajshree
•
Mar 29, 2024
•
22
2
InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds
Zhiwen Fan, Wenyan Cong, Kairun Wen, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang
•
Mar 29, 2024
•
19
2
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
Atsuyuki Miyai, Jingkang Yang, Jingyang Zhang, Yifei Ming, Qing Yu, Go Irie, Yixuan Li, Hai Li, Ziwei Liu, Kiyoharu Aizawa
•
Mar 29, 2024
•
16
2
Localizing Paragraph Memorization in Language Models
Niklas Stoehr, Mitchell Gordon, Chiyuan Zhang, Owen Lewis
•
Mar 28, 2024
•
15
1
DiJiang: Efficient Large Language Models through Compact Kernelization
Hanting Chen, Zhicheng Liu, Xutao Wang, Yuchuan Tian, Yunhe Wang
•
Mar 29, 2024
•
12
1
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Ali Behrouz, Michele Santacatterina, Ramin Zabih
•
Mar 29, 2024
•
12
1
Snap-it, Tap-it, Splat-it: Tactile-Informed 3D Gaussian Splatting for Reconstructing Challenging Surfaces
Mauro Comi, Alessio Tonioni, Max Yang, Jonathan Tremblay, Valts Blukis, Yijiong Lin, Nathan F. Lepora, Laurence Aitchison
•
Mar 29, 2024
•
10
1