Ermo Hua

Papers: 9

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

FlowRL: Matching Reward Distributions for LLM Reasoning

arXiv 2025

A Survey of Reinforcement Learning for Large Reasoning Models

arXiv 2025

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

arXiv 2025

UltraMedical: Building Specialized Generalists in Biomedicine

arXiv 2024

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

arXiv 2024

Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation

arXiv 2024

Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding

arXiv 2024

How to Synthesize Text Data without Model Collapse?

arXiv 2024

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process

arXiv 2024

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Bowen Zhou

professor

9 shared papers

Kaiyan Zhang

9 shared papers

Ning Ding

researcher

Xingtai Lv

Biqing Qi

Xuekai Zhu

Che Jiang

Ganqu Cui

researcher

3 shared papers

Kai Tian

3 shared papers

Sihang Zeng

3 shared papers