ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
April 1st, 2024
Jamba:一個混合Transformer-Mamba語言模型
Jamba: A Hybrid Transformer-Mamba Language Model
Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-Shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham
•
Mar 28, 2024
•
111
5
Gecko:源自大型語言模型的多功能文字嵌入
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Jinhyuk Lee, Zhuyun Dai, Xiaoqi Ren, Blair Chen, Daniel Cer, Jeremy R. Cole, Kai Hui, Michael Boratko, Rajvi Kapadia, Wen Ding, Yi Luan, Sai Meher Karthik Duddu, Gustavo Hernandez Abrego, Weiqiang Shi, Nithi Gupta, Aditya Kusupati, Prateek Jain, Siddhartha Reddy Jonnalagadda, Ming-Wei Chang, Iftekhar Naim
•
Mar 29, 2024
•
49
4
Transformer-Lite:在行動電話GPU上高效部署大型語言模型
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Luchang Li, Sheng Qian, Jie Lu, Lunxi Yuan, Rui Wang, Qin Xie
•
Mar 29, 2024
•
35
3
ReALM:參考解析作為語言建模
ReALM: Reference Resolution As Language Modeling
Joel Ruben Antony Moniz, Soundarya Krishnan, Melis Ozyildirim, Prathamesh Saraf, Halim Cagri Ates, Yuan Zhang, Hong Yu, Nidhi Rajshree
•
Mar 29, 2024
•
22
2
InstantSplat:40秒內的無界稀疏視角無姿勢高斯Splatting
InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds
Zhiwen Fan, Wenyan Cong, Kairun Wen, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang
•
Mar 29, 2024
•
19
2
無法解決問題的檢測:評估視覺語言模型的可信度
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
Atsuyuki Miyai, Jingkang Yang, Jingyang Zhang, Yifei Ming, Qing Yu, Go Irie, Yixuan Li, Hai Li, Ziwei Liu, Kiyoharu Aizawa
•
Mar 29, 2024
•
16
2
在語言模型中的段落記憶本地化
Localizing Paragraph Memorization in Language Models
Niklas Stoehr, Mitchell Gordon, Chiyuan Zhang, Owen Lewis
•
Mar 28, 2024
•
15
1
DiJiang:透過緊湊核化實現高效的大型語言模型
DiJiang: Efficient Large Language Models through Compact Kernelization
Hanting Chen, Zhicheng Liu, Xutao Wang, Yuchuan Tian, Yunhe Wang
•
Mar 29, 2024
•
12
1
MambaMixer:具有雙令牌和通道選擇的高效選擇性狀態空間模型
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Ali Behrouz, Michele Santacatterina, Ramin Zabih
•
Mar 29, 2024
•
12
1
點擊、輕拍、濺射:用於重建具挑戰性表面的觸覺資訊3D高斯濺射
Snap-it, Tap-it, Splat-it: Tactile-Informed 3D Gaussian Splatting for Reconstructing Challenging Surfaces
Mauro Comi, Alessio Tonioni, Max Yang, Jonathan Tremblay, Valts Blukis, Yijiong Lin, Nathan F. Lepora, Laurence Aitchison
•
Mar 29, 2024
•
10
1