Fei Mi
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL
arXiv 2026
Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents
arXiv 2025
The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
arXiv 2025
How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study
arXiv 2025
Rethinking Expert Trajectory Utilization in LLM Post-training
arXiv 2025
Aligning Large Language Models with Human: A Survey
arXiv 2023
Data Management For Training Large Language Models: A Survey
arXiv 2023
FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
arXiv 2023
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
arXiv 2023
ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue
arXiv 2023
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions
arXiv 2022
COLD: A Benchmark for Chinese Offensive Language Detection
arXiv 2022
Affiliations
Frequent co-authors
10from 12 papers