Jiaxing Huang
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
arXiv 2025
R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO
arXiv 2025
R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
arXiv 2025
Improving large language models with concept-aware fine-tuning
arXiv 2025
Reasoning with Reinforced Functional Token Tuning
arXiv 2025
VeriGUI: Verifiable Long-Chain GUI Dataset
arXiv 2025
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
arXiv 2024
Vision-Language Models for Vision Tasks: A Survey
arXiv 2023
Black-box Unsupervised Domain Adaptation with Bi-directional Atkinson-Shiffrin Memory
ICCV 2023 1
3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds
CVPR 2023 1
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision
arXiv 2022
Affiliations
Frequent co-authors
10from 11 papers