Cite
Notes
Only stored in your browser.
Attribution
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond
arXiv 2026
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
arXiv 2025
from 2 papers
Jiaya Jia
Bei Yu
Bin Xia
Fengyi Wu
Haokun Gui
Haoxuan Che
Hengshuang Zhao
Jiehui Huang
Jinhui Ye
Jize Zhang