ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers with translations
August 21st, 2024
The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks
Niyar R Barman, Krish Sharma, Ashhar Aziz, Shashwat Bajpai, Shwetangshu Biswas, Vasu Sharma, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das
Aug 19, 2024
9
2
Audio Match Cutting: Finding and Creating Matching Audio Transitions in Movies and Videos
Dennis Fedorishin, Lie Lu, Srirangaraj Setlur, Venu Govindaraju
Aug 20, 2024
9
2
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Haoning Wu, Shaocheng Shen, Qiang Hu, Xiaoyun Zhang, Ya Zhang, Yanfeng Wang
Aug 20, 2024
12
2
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands
Yi Zhao, Le Chen, Jan Schneider, Quankai Gao, Juho Kannala, Bernhard Schölkopf, Joni Pajarinen, Dieter Büchler
Aug 20, 2024
4
2
PhysBERT: A Text Embedding Model for Physics Scientific Literature
Thorsten Hellert, João Montenegro, Andrea Pollastro
Aug 18, 2024
8
1
NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency
Valentinos Pariza, Mohammadreza Salehi, Gertjan Burghouts, Francesco Locatello, Yuki M. Asano
Aug 20, 2024
13
2
Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
Chenhan Yuan, Fei Huang, Ru Peng, Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou
Aug 20, 2024
9
2
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding
Jian Chen, Vashisth Tiwari, Ranajoy Sadhukhan, Zhuoming Chen, Jinyuan Shi, Ian En-Hsu Yen, Beidi Chen
Aug 20, 2024
13
3
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Chunting Zhou, Lili Yu, Arun Babu, Kushal Tirumala, Michihiro Yasunaga, Leonid Shamis, Jacob Kahn, Xuezhe Ma, Luke Zettlemoyer, Omer Levy
Aug 20, 2024
61
3
Recent Surge in Public Interest in Transportation: Sentiment Analysis of Baidu Apollo Go Using Weibo Data
Shiqi Wang, Zhouye Zhao, Yuhang Xie, Mingchuan Ma, Zirui Chen, Zeyu Wang, Bohao Su, Wenrui Xu, Tianyi Li
Aug 19, 2024
2
1
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering
Xianjie Wu, Jian Yang, Linzheng Chai, Ge Zhang, Jiaheng Liu, Xinrun Du, Di Liang, Daixin Shu, Xianfu Cheng, Tianzhen Sun, Guanglin Niu, Tongliang Li, Zhoujun Li
Aug 17, 2024
53
3
Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique
Tej Deep Pala, Vernon Y. H. Toh, Rishabh Bhardwaj, Soujanya Poria
Aug 20, 2024
12
2
To Code, or Not To Code? Exploring Impact of Code in Pre-training
Viraat Aryabumi, Yixuan Su, Raymond Ma, Adrien Morisot, Ivan Zhang, Acyr Locatelli, Marzieh Fadaee, Ahmet Üstün, Sara Hooker
Aug 20, 2024
43
2
MambaEVT: Event Stream based Visual Object Tracking using State Space Model
Xiao Wang, Chao Wang, Shiao Wang, Xixi Wang, Zhicheng Zhao, Lin Zhu, Bo Jiang
Aug 20, 2024
7
2
ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining
Qi Ma, Yue Li, Bin Ren, Nicu Sebe, Ender Konukoglu, Theo Gevers, Luc Van Gool, Danda Pani Paudel
Aug 20, 2024
3
2