Demin Song
5 papers
Authored papers
CritiQ: Mining Data Quality Criteria from Human Preferences
arXiv 2025
Pre-Trained Policy Discriminators are General Reward Models
arXiv 2025
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data
arXiv 2024
OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection
arXiv 2024
Case2Code: Learning Inductive Reasoning with Synthetic Data
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers