ChatPaper.ai
Abrir Menu
Início
Artigos do Dia
arXiv
HuggingFace
Preços
Conta
Área de trabalho
🇬🇧
English
Loading...
•
•
•
•
•
•
•
•
•
•
Artigos de Pesquisa em IA Diários
Artigos de pesquisa em IA selecionados diariamente com traduções
April 24th, 2025
PHYBench: Avaliação Holística da Percepção e Raciocínio Físico em Modelos de Linguagem de Grande Escala
PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models
Shi Qiu, Shaoyang Guo, Zhuo-Yang Song, Yunbo Sun, Zeyu Cai, Jiashen Wei, Tianyu Luo, Yixuan Yin, Haoxu Zhang, Yi Hu, Chenyang Wang, Chencheng Tang, Haoling Chang, Qi Liu, Ziheng Zhou, Tianyu Zhang, Jingtian Zhang, Zhangyi Liu, Minghao Li, Yuku Zhang, Boxuan Jing, Xianqi Yin, Yutong Ren, Zizhuo Fu, Weike Wang, Xudong Tian, Anqi Lv, Laifu Man, Jianxiang Li, Feiyu Tao, Qihua Sun, Zhou Liang, Yushu Mu, Zhongxuan Li, Jing-Jun Zhang, Shutao Zhang, Xiaotian Li, Xingqi Xia, Jiawei Lin, Zheyu Shen, Jiahang Chen, Qiuhao Xiong, Binran Wang, Fengyuan Wang, Ziyang Ni, Bohan Zhang, Fan Cui, Changkun Shao, Qing-Hong Cao, Ming-xing Luo, Muhan Zhang, Hua Xing Zhu
•
Apr 22, 2025
•
33
2
DreamID: Troca de Rosto Baseada em Difusão de Alta Fidelidade e Rápida via Aprendizado de Grupo Triplo de ID
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning
Fulong Ye, Miao Hua, Pengze Zhang, Xinghui Li, Qichao Sun, Songtao Zhao, Qian He, Xinglong Wu
•
Apr 20, 2025
•
48
8
Repensando a Geração de Dados CoT de Alta Qualidade sob a Perspectiva da Classificação Adaptativa de Dificuldade de Questões para LLMs
Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading
Qianjin Yu, Keyu Wu, Zihan Chen, Chushu Zhang, Manlin Mei, Lingjun Huang, Fang Tan, Yongsheng Du, Kunlin Liu, Yurui Zhu
•
Apr 16, 2025
•
12
3
Uma Análise Abrangente sobre Segurança em Pilha Completa de LLM(-Agente): Dados, Treinamento e Implantação
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment
Kun Wang, Guibin Zhang, Zhenhong Zhou, Jiahao Wu, Miao Yu, Shiqian Zhao, Chenlong Yin, Jinhu Fu, Yibo Yan, Hanjun Luo, Liang Lin, Zhihao Xu, Haolang Lu, Xinye Cao, Xinyun Zhou, Weifei Jin, Fanci Meng, Junyuan Mao, Hao Wu, Minghe Wang, Fan Zhang, Junfeng Fang, Chengwei Liu, Yifan Zhang, Qiankun Li, Chongye Guo, Yalan Qin, Yi Ding, Donghai Hong, Jiaming Ji, Xinfeng Li, Yifan Jiang, Dongxia Wang, Yihao Huang, Yufei Guo, Jen-tse Huang, Yanwei Yue, Wenke Huang, Guancheng Wan, Tianlin Li, Lei Bai, Jie Zhang, Qing Guo, Jingyi Wang, Tianlong Chen, Joey Tianyi Zhou, Xiaojun Jia, Weisong Sun, Cong Wu, Jing Chen, Xuming Hu, Yiming Li, Xiao Wang, Ningyu Zhang, Luu Anh Tuan, Guowen Xu, Tianwei Zhang, Xingjun Ma, Xiang Wang, Bo An, Jun Sun, Mohit Bansal, Shirui Pan, Yuval Elovici, Bhavya Kailkhura, Bo Li, Yaodong Yang, Hongwei Li, Wenyuan Xu, Yizhou Sun, Wei Wang, Qing Li, Ke Tang, Yu-Gang Jiang, Felix Juefei-Xu, Hui Xiong, Xiaofeng Wang, Shuicheng Yan, Dacheng Tao, Philip S. Yu, Qingsong Wen, Yang Liu
•
Apr 22, 2025
•
13
2
Causal-Copilot: Um Agente Autônomo de Análise Causal
Causal-Copilot: An Autonomous Causal Analysis Agent
Xinyue Wang, Kun Zhou, Wenyi Wu, Har Simrat Singh, Fang Nan, Songyao Jin, Aryan Philip, Saloni Patnaik, Hou Zhu, Shivam Singh, Parjanya Prashant, Qian Shen, Biwei Huang
•
Apr 17, 2025
•
5
2
Desatendido e Negligenciado: Abordando o Ponto Cego de Caixas de Seleção em Modelos de Linguagem de Grande Escala com CheckboxQA
Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA
Michał Turski, Mateusz Chiliński, Łukasz Borchmann
•
Apr 14, 2025
•
4
2
CRUST-Bench: Um Benchmark Abrangente para Transpilação de C para Rust Seguro
CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation
Anirudh Khatry, Robert Zhang, Jia Pan, Ziteng Wang, Qiaochu Chen, Greg Durrett, Isil Dillig
•
Apr 21, 2025
•
6
2
Alinhamento Global-Local Desacoplado para Melhorar a Compreensão Composicional
Decoupled Global-Local Alignment for Improving Compositional Understanding
Xiaoxing Hu, Kaicheng Yang, Jun Wang, Haoran Xu, Ziyong Feng, Yupei Wang
•
Apr 23, 2025
•
15
2
RePOPE: Impacto dos Erros de Anotação no Benchmark POPE
RePOPE: Impact of Annotation Errors on the POPE Benchmark
Yannic Neuhaus, Matthias Hein
•
Apr 22, 2025
•
8
2
I-Con: Um Framework Unificador para Aprendizado de Representação
I-Con: A Unifying Framework for Representation Learning
Shaden Alshammari, John Hershey, Axel Feldmann, William T. Freeman, Mark Hamilton
•
Apr 23, 2025
•
28
2
Tina: Modelos de Raciocínio Compactos via LoRA
Tina: Tiny Reasoning Models via LoRA
Shangshang Wang, Julian Asilis, Ömer Faruk Akgül, Enes Burak Bilgin, Ollie Liu, Willie Neiswanger
•
Apr 22, 2025
•
50
4
Relatório Técnico do Trillion 7B
Trillion 7B Technical Report
Sungjun Han, Juyoung Suk, Suyeong An, Hyungguk Kim, Kyuseok Kim, Wonsuk Yang, Seungtaek Choi, Jamin Shin
•
Apr 21, 2025
•
34
2
Aprendizado Visual Progressivo Orientado por Linguagem para Fundamentação Visual Multitarefa
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
Jingchao Wang, Hong Wang, Wenlong Zhang, Kunhua Ji, Dingjiang Huang, Yefeng Zheng
•
Apr 22, 2025
•
2
2
Solução Vencedora do AIMO-2: Construindo Modelos de Raciocínio Matemático de Última Geração com o Conjunto de Dados OpenMathReasoning
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset
Ivan Moshkov, Darragh Hanley, Ivan Sorokin, Shubham Toshniwal, Christof Henkel, Benedikt Schifferer, Wei Du, Igor Gitman
•
Apr 23, 2025
•
18
2
Pré-DPO: Melhorando a Utilização de Dados na Otimização Direta de Preferências Usando um Modelo de Referência Orientador
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
Junshu Pan, Wei Shen, Shulin Huang, Qiji Zhou, Yue Zhang
•
Apr 22, 2025
•
18
2
VisuLogic: Um Benchmark para Avaliação do Raciocínio Visual em Modelos de Linguagem Multimodais de Grande Escala
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models
Weiye Xu, Jiahao Wang, Weiyun Wang, Zhe Chen, Wengang Zhou, Aijun Yang, Lewei Lu, Houqiang Li, Xiaohua Wang, Xizhou Zhu, Wenhai Wang, Jifeng Dai, Jinguo Zhu
•
Apr 21, 2025
•
71
2
DreamO: Um Framework Unificado para Personalização de Imagens
DreamO: A Unified Framework for Image Customization
Chong Mou, Yanze Wu, Wenxu Wu, Zinan Guo, Pengze Zhang, Yufeng Cheng, Yiming Luo, Fei Ding, Shiwen Zhang, Xinghui Li, Mengtian Li, Songtao Zhao, Jian Zhang, Qian He, Xinglong Wu
•
Apr 23, 2025
•
19
2