Attribution
Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs
arXiv 2025
from 1 paper
Chen Qian
Christopher Potts
Dan Klein
Dilara Soylu
grad-student
Isaac Miller
Kaiqiang Song
Karel D'Oosterlinck
Lakshya A. Agrawal
Matei Zaharia
founder
Meng Jiang