Fei Mi

Papers: 12

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

arXiv 2026

Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents

arXiv 2025

Rethinking Expert Trajectory Utilization in LLM Post-training

arXiv 2025

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs

arXiv 2025

How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study

arXiv 2025

Aligning Large Language Models with Human: A Survey

arXiv 2023

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

arXiv 2023

Data Management For Training Large Language Models: A Survey

arXiv 2023

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

arXiv 2023

ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue

arXiv 2023

COLD: A Benchmark for Chinese Offensive Language Detection

arXiv 2022

MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions

arXiv 2022

Affiliations

No known affiliations.

Frequent co-authors

from 12 papers

Lifeng Shang

Qun Liu

Minlie Huang

Xingshan Zeng

Qi Zhu

Wanjun Zhong

Xin Jiang

YuFei Wang

Zhexin Zhang

Baojun Wang