Zhiyuan Zhao
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models
arXiv 2026
HunyuanImage 3.0 Technical Report
arXiv 2025
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
arXiv 2025
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
arXiv 2024
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
CVPR 2025 1
Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis
arXiv 2024
PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural Networks
arXiv 2023
Open-Source Reinforcement Learning Environments Implemented in MuJoCo with Franka Manipulator
arXiv 2023
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
arXiv 2023
MLLM-DataEngine: An Iterative Refinement Approach for MLLM
arXiv 2023
Affiliations
Frequent co-authors
10from 10 papers