Minho Park
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation
arXiv 2025
PHUMA: Physically-Grounded Humanoid Locomotion Dataset
arXiv 2025
ACG: Action Coherence Guidance for Flow-based VLA models
arXiv 2025
EgoX: Egocentric Video Generation from a Single Exocentric Video
arXiv 2025
Cross-Frame Representation Alignment for Fine-Tuning Video Diffusion Models
arXiv 2025
Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling
arXiv 2024
StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
CVPR 2024 1
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis
ICCV 2023 1
iColoriT: Towards Propagating Local Hint to the Right Region in Interactive Colorization by Leveraging Vision Transformer
arXiv 2022
Affiliations
Frequent co-authors
10from 9 papers