Haoli Bai
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving
arXiv 2026
Stabilizing Reinforcement Learning for Diffusion Language Models
arXiv 2026
MTR-DuplexBench: Towards a Comprehensive Evaluation of Multi-Round Conversations for Full-Duplex Speech Language Models
arXiv 2026
Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents
arXiv 2025
InSight-o3: Empowering Multimodal Foundation Models with Generalized Visual Search
arXiv 2025
Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models
arXiv 2025
The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
arXiv 2025
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
arXiv 2025
FlatQuant: Flatness Matters for LLM Quantization
arXiv 2024
Visually Guided Generative Text-Layout Pre-training for Document Intelligence
arXiv 2024
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025 1
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
arXiv 2024
Affiliations
Frequent co-authors
10from 12 papers