Xufang Luo
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?
arXiv 2026
Understanding Reasoning in LLMs through Strategic Information Allocation under Uncertainty
arXiv 2026
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
arXiv 2025
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention
arXiv 2025
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs
arXiv 2025
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
arXiv 2025
$ΔL$ Normalization: Rethink Loss Aggregation in RLVR
arXiv 2025
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
CVPR 2025 1
VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
ICCV 2025
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
arXiv 2024
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention
arXiv 2024
Mitigate Position Bias in Large Language Models via Scaling a Single Dimension
arXiv 2024
Designing Network Algorithms via Large Language Models
arXiv 2024
LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation
arXiv 2024
Affiliations
Frequent co-authors
10from 14 papers