Attribution
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning
arXiv 2026
Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
arXiv 2025
Digi-Q: Learning Q-Value Functions for Training Device-Control Agents
from 3 papers
Aviral Kumar
Amrith Setlur
Yifei Zhou
Ameet Talwalkar
Diego Caples
Gene Yang
Ian Wu
Junhong Shen
Li Erran Li
Lunjun Zhang