Wenxuan Huang
- Papers: 15
Authored papers (15)
Flow-OPD: On-Policy Distillation for Flow Matching Models
arXiv 2026
VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation
arXiv 2026
SkillFlow: Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents
arXiv 2026
SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation
arXiv 2026
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
arXiv 2026
Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis
arXiv 2026
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
arXiv 2026
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision
arXiv 2026
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
arXiv 2026
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
arXiv 2025
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning
arXiv 2025
Interleaving Reasoning for Better Text-to-Image Generation
arXiv 2025
Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models
arXiv 2025
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?
arXiv 2025
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
arXiv 2025
Affiliations
Frequent co-authors
10 from 15 papers