Cite
Notes
Only stored in your browser.
Attribution
E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
arXiv 2026
from 1 papers
Shengjun Zhang
Yueqi Duan
Zhang Zhang