DiJia Su
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models
arXiv 2025
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
arXiv 2025
Training Large Language Models to Reason in a Continuous Latent Space
arXiv 2024
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
arXiv 2024
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers