ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
September 17th, 2024
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Ye Bai, Haonan Chen, Jitong Chen, Zhuo Chen, Yi Deng, Xiaohong Dong, Lamtharn Hantrakul, Weituo Hao, Qingqing Huang, Zhongyi Huang, Dongya Jia, Feihu La, Duc Le, Bochen Li, Chumin Li, Hui Li, Xingxing Li, Shouda Liu, Wei-Tsung Lu, Yiqing Lu, Andrew Shaw, Janne Spijkervet, Yakun Sun, Bo Wang, Ju-Chiang Wang, Yuping Wang, Yuxuan Wang, Ling Xu, Yifeng Yang, Chao Yao, Shuo Zhang, Yang Zhang, Yilin Zhang, Hang Zhao, Ziyi Zhao, Dejian Zhong, Shicen Zhou, Pei Zou
Sep 13, 2024 • 54 • 3
Kolmogorov-Arnold Transformer
Xingyi Yang, Xinchao Wang
Sep 16, 2024 • 46 • 5
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
Di Liu, Meng Chen, Baotong Lu, Huiqiang Jiang, Zhenhua Han, Qianxi Zhang, Qi Chen, Chengruidong Zhang, Bailu Ding, Kai Zhang, Chen Chen, Fan Yang, Yuqing Yang, Lili Qiu
Sep 16, 2024 • 44 • 2
jina-embeddings-v3: Multilingual Embeddings With Task LoRA
Saba Sturua, Isabelle Mohr, Mohammad Kalim Akram, Michael Günther, Bo Wang, Markus Krimmel, Feng Wang, Georgios Mastrapas, Andreas Koukounas, Nan Wang, Han Xiao
Sep 16, 2024 • 32 • 6
One missing piece in Vision and Language: A Survey on Comics Understanding
Emanuele Vivoli, Andrey Barsky, Mohamed Ali Souibgui, Artemis LLabres, Marco Bertini, Dimosthenis Karatzas
Sep 14, 2024 • 26 • 2
Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models
Yao Shu, Wenyang Hu, See-Kiong Ng, Bryan Kian Hsiang Low, Fei Richard Yu
Sep 10, 2024 • 16 • 2
On the Diagram of Thought
Yifan Zhang, Yang Yuan, Andrew Chi-Chih Yao
Sep 16, 2024 • 14 • 2
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
Sreyan Ghosh, Sonal Kumar, Chandra Kiran Reddy Evuru, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha
Sep 13, 2024 • 13 • 2
Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types
Neelabh Sinha, Vinija Jain, Aman Chadha
Sep 14, 2024 • 9 • 2
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation
Wei Shen, Chuheng Zhang
Sep 11, 2024 • 6 • 2
Breaking reCAPTCHAv2
Andreas Plesner, Tobias Vontobel, Roger Wattenhofer
Sep 13, 2024 • 5 • 2
AudioBERT: Audio Knowledge Augmented Language Model
Hyunjong Ok, Suho Yoo, Jaeho Lee
Sep 12, 2024 • 5 • 2
Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records
Daeun Kyung, Junu Kim, Tackeun Kim, Edward Choi
Sep 11, 2024 • 4 • 2
beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems
Vojtěch Vančura, Pavel Kordík, Milan Straka
Sep 16, 2024 • 3 • 2
LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study
Mahta Fetrat Qharabagh, Zahra Dehghanian, Hamid R. Rabiee
Sep 13, 2024 • 3 • 1