ChatPaper.aiChatPaper

Ежедневные статьи

ReasonIR: Обучение ретриверов для задач логического вывода
ReasonIR: Training Retrievers for Reasoning Tasks

Rulin Shao, Rui Qiao, Varsha Kishore, Niklas Muennighoff, Xi Victoria Lin, Daniela Rus, Bryan Kian Hsiang Low, Sewon Min, Wen-tau Yih, Pang Wei Koh, Luke ZettlemoyerApr 29, 2025161

Обучение с подкреплением для логического вывода в больших языковых моделях с одним обучающим примером
Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Yiping Wang, Qing Yang, Zhiyuan Zeng, Liliang Ren, Lucas Liu, Baolin Peng, Hao Cheng, Xuehai He, Kuan Wang, Jianfeng Gao, Weizhu Chen, Shuohang Wang, Simon Shaolei Du, Yelong ShenApr 29, 2025132

TesserAct: Обучение 4D-воплощённых моделей мира
TesserAct: Learning 4D Embodied World Models

Haoyu Zhen, Qiao Sun, Hongxin Zhang, Junyan Li, Siyuan Zhou, Yilun Du, Chuang GanApr 29, 202541