Jia Wang
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
arXiv 2026
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments
arXiv 2026
GEBench: Benchmarking Image Generation Models as GUI Environments
arXiv 2026
Forge-and-Quench: Enhancing Image Generation for Higher Fidelity in Unified Multimodal Models
arXiv 2026
STEP3-VL-10B Technical Report
arXiv 2026
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
arXiv 2025
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning
arXiv 2025
Step-GUI Technical Report
arXiv 2025
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
arXiv 2024
Slow Perception: Let's Perceive Geometric Figures Step-by-step
arXiv 2024
MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment
arXiv 2024
Affiliations
Frequent co-authors
10from 11 papers