Mingzhe Du
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model
arXiv 2026
CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models
arXiv 2026
GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning
arXiv 2025
Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
arXiv 2025
SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?
arXiv 2025
Mercury: A Code Efficiency Benchmark for Code Large Language Models
arXiv 2024
AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
arXiv 2024
Measuring the Influence of Incorrect Code on Test Generation
arXiv 2024
Affiliations
Frequent co-authors
10from 8 papers