Zhang Zhang
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
arXiv 2026
How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing
arXiv 2026
Aligning Multimodal LLM with Human Preference: A Survey
arXiv 2025
Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner
arXiv 2025
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
arXiv 2025
SimScale: Learning to Drive via Real-World Simulation at Scale
arXiv 2025
Thyme: Think Beyond Images
arXiv 2025
MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models
arXiv 2025
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs
arXiv 2025
OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing
arXiv 2025
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
arXiv 2024
Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
arXiv 2024
Debiasing Multimodal Large Language Models
arXiv 2024
OneNet: Enhancing Time Series Forecasting Models under Concept Drift by Online Ensembling
onenet-enhancing-time-series-forecasting
AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation
arXiv 2023
Affiliations
Frequent co-authors
10from 15 papers