0

Mingli Song

Papers
18

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
18papers

Authored papers

18

SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data

arXiv 2025

2025

Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense

arXiv 2025

2025

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

arXiv 2025

2025

GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies

arXiv 2025

2025

Reasoning with Reinforced Functional Token Tuning

arXiv 2025

2025

Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning

arXiv 2025

2025

Holistic Semantic Representation for Navigational Trajectory Generation

arXiv 2025

2025

Reinforced Model Merging

arXiv 2025

2025

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning

arXiv 2025

2025

VeriGUI: Verifiable Long-Chain GUI Dataset

arXiv 2025

2025

PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation

CVPR 2025 1

2024

Odyssey: Empowering Minecraft Agents with Open-World Skills

arXiv 2024

2024

Decentralized SGD and Average-direction SAM are Asymptotically Equivalent

arXiv 2023

2023

ModelGiF: Gradient Fields for Model Functional Distance

ICCV 2023 1

2023

DepGraph: Towards Any Structural Pruning

CVPR 2023 1

2023

DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes

arXiv 2023

2023

Evaluation and Improvement of Interpretability for Self-Explainable Part-Prototype Networks

ICCV 2023 1

2022

A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 18 papers