Ermo Hua
Papers: 9
Authored papers
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
arXiv 2025
FlowRL: Matching Reward Distributions for LLM Reasoning
arXiv 2025
A Survey of Reinforcement Learning for Large Reasoning Models
arXiv 2025
UltraMedical: Building Specialized Generalists in Biomedicine
arXiv 2024
How to Synthesize Text Data without Model Collapse?
arXiv 2024
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
arXiv 2024
Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
arXiv 2024
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
arXiv 2024
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding
arXiv 2024
Affiliations
Frequent co-authors
10 from 9 papers