Mingli Song
- Papers: 18
Authored papers (18)
SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
arXiv 2025
Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense
arXiv 2025
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
arXiv 2025
GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies
arXiv 2025
Reasoning with Reinforced Functional Token Tuning
arXiv 2025
Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning
arXiv 2025
Holistic Semantic Representation for Navigational Trajectory Generation
arXiv 2025
Reinforced Model Merging
arXiv 2025
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning
arXiv 2025
VeriGUI: Verifiable Long-Chain GUI Dataset
arXiv 2025
PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation
CVPR 2025 1
Odyssey: Empowering Minecraft Agents with Open-World Skills
arXiv 2024
Decentralized SGD and Average-direction SAM are Asymptotically Equivalent
arXiv 2023
ModelGiF: Gradient Fields for Model Functional Distance
ICCV 2023 1
DepGraph: Towards Any Structural Pruning
CVPR 2023 1
DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes
arXiv 2023
Evaluation and Improvement of Interpretability for Self-Explainable Part-Prototype Networks
ICCV 2023 1
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
arXiv 2022
Frequent co-authors
10 from 18 papers