ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
July 12th, 2024
Skywork-Math:大型語言模型中數學推理的數據縮放定律──故事繼續进行
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
Liang Zeng, Liangjun Zhong, Liang Zhao, Tianwen Wei, Liu Yang, Jujie He, Cheng Cheng, Rui Hu, Yang Liu, Shuicheng Yan, Han Fang, Yahui Zhou
•
Jul 11, 2024
•
53
5
透過獎勵梯度進行影片傳播對齊
Video Diffusion Alignment via Reward Gradients
Mihir Prabhudesai, Russell Mendonca, Zheyang Qin, Katerina Fragkiadaki, Deepak Pathak
•
Jul 11, 2024
•
51
2
多模式自我指導:使用語言模型進行合成抽象圖像和視覺推理指導
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
Wenqi Zhang, Zhenglin Cheng, Yuanyu He, Mengna Wang, Yongliang Shen, Zeqi Tan, Guiyang Hou, Mingqian He, Yanna Ma, Weiming Lu, Yueting Zhuang
•
Jul 9, 2024
•
47
3
MAVIS:數學視覺指導調校
MAVIS: Mathematical Visual Instruction Tuning
Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li
•
Jul 11, 2024
•
34
3
Q-GaLore:具有 INT4 投影和層適應低秩梯度的量化 GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients
Zhenyu Zhang, Ajay Jaiswal, Lu Yin, Shiwei Liu, Jiawei Zhao, Yuandong Tian, Zhangyang Wang
•
Jul 11, 2024
•
34
3
MambaVision:一個混合Mamba-Transformer視覺主幹
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ali Hatamizadeh, Jan Kautz
•
Jul 10, 2024
•
33
5
語言模型中的自我識別
Self-Recognition in Language Models
Tim R. Davidson, Viacheslav Surkov, Veniamin Veselovsky, Giuseppe Russo, Robert West, Caglar Gulcehre
•
Jul 9, 2024
•
27
2
種子故事:利用大型語言模型進行多模態長篇故事生成
SEED-Story: Multimodal Long Story Generation with Large Language Model
Shuai Yang, Yuying Ge, Yang Li, Yukang Chen, Yixiao Ge, Ying Shan, Yingcong Chen
•
Jul 11, 2024
•
26
5
你的模型真的是一個優秀的數學推理者嗎?使用檢查表評估數學推理
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Zihao Zhou, Shudong Liu, Maizhen Ning, Wei Liu, Jindong Wang, Derek F. Wong, Xiaowei Huang, Qiufeng Wang, Kaizhu Huang
•
Jul 11, 2024
•
23
4
DenseFusion-1M:整合視覺專家以實現全面多模態感知
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
Xiaotong Li, Fan Zhang, Haiwen Diao, Yueze Wang, Xinlong Wang, Ling-Yu Duan
•
Jul 11, 2024
•
19
2
GTA:一個通用工具智能體的基準測試
GTA: A Benchmark for General Tool Agents
Jize Wang, Zerun Ma, Yining Li, Songyang Zhang, Cailian Chen, Kai Chen, Xinyi Le
•
Jul 11, 2024
•
17
3
無向量量化的自回歸語音合成
Autoregressive Speech Synthesis without Vector Quantization
Lingwei Meng, Long Zhou, Shujie Liu, Sanyuan Chen, Bing Han, Shujie Hu, Yanqing Liu, Jinyu Li, Sheng Zhao, Xixin Wu, Helen Meng, Furu Wei
•
Jul 11, 2024
•
17
4
數據與多模態大型語言模型之間的協同作用:從共同發展的角度進行調查
The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective
Zhen Qin, Daoyuan Chen, Wenhao Zhang, Liuyi Yao, Yilun Huang, Bolin Ding, Yaliang Li, Shuiguang Deng
•
Jul 11, 2024
•
13
4
梯度提升強化學習
Gradient Boosting Reinforcement Learning
Benjamin Fuhrer, Chen Tessler, Gal Dalal
•
Jul 11, 2024
•
13
2
Live2Diff:透過視訊擴散模型中的單向注意力進行直播串流翻譯
Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models
Zhening Xing, Gereon Fox, Yanhong Zeng, Xingang Pan, Mohamed Elgharib, Christian Theobalt, Kai Chen
•
Jul 11, 2024
•
12
2
影片幀插補的通用隱式運動建模
Generalizable Implicit Motion Modeling for Video Frame Interpolation
Zujin Guo, Wei Li, Chen Change Loy
•
Jul 11, 2024
•
12
2
隨處地圖(MIA):利用大規模公共數據資源賦能鳥瞰式地圖製作
Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public Data
Cherie Ho, Jiaye Zou, Omar Alama, Sai Mitheran Jagadesh Kumar, Benjamin Chiang, Taneesh Gupta, Chen Wang, Nikhil Keetha, Katia Sycara, Sebastian Scherer
•
Jul 11, 2024
•
11
4
朝向建立具有系統1和系統2融合的專業通用人工智慧前進。
Towards Building Specialized Generalist AI with System 1 and System 2 Fusion
Kaiyan Zhang, Biqing Qi, Bowen Zhou
•
Jul 11, 2024
•
11
2
野生高斯:野外的三維高斯點陣化
WildGaussians: 3D Gaussian Splatting in the Wild
Jonas Kulhanek, Songyou Peng, Zuzana Kukelova, Marc Pollefeys, Torsten Sattler
•
Jul 11, 2024
•
10
2
OmniNOCS:用於將2D物體提升至3D的統一NOCS數據集與模型
OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects
Akshay Krishnan, Abhijit Kundu, Kevis-Kokitsi Maninis, James Hays, Matthew Brown
•
Jul 11, 2024
•
9
2
通過任務向量定制擴展個性化美學評估
Scaling Up Personalized Aesthetic Assessment via Task Vector Customization
Jooyeol Yun, Jaegul Choo
•
Jul 9, 2024
•
6
3