Haoming Jiang
- Papers
- 7
Cite
Notes
Only stored in your browser.
Authored papers
7Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models
arXiv 2025
IHEval: Evaluating Language Models on Following the Instruction Hierarchy
arXiv 2025
Discriminative Finetuning of Generative Large Language Models without Reward Models and Human Preference Data
arXiv 2025
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
arXiv 2023
Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach
NAACL 2021 4
On the Variance of the Adaptive Learning Rate and Beyond
ICLR 2020 1
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization
smart-robust-and-efficient-fine-tuning-for-1
Affiliations
Frequent co-authors
10from 7 papers