Harry Yang
- Papers
- 18
Cite
Notes
Only stored in your browser.
Authored papers
18D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models
arXiv 2026
Manifold-Aware Exploration for Reinforcement Learning in Video Generation
arXiv 2026
LoopViT: Scaling Visual ARC with Looped Transformers
arXiv 2026
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
arXiv 2025
Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View
arXiv 2025
TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
arXiv 2025
Go with Your Gut: Scaling Confidence for Autoregressive Image Generation
arXiv 2025
Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control
arXiv 2025
LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization
arXiv 2025
Distribution Matching Distillation Meets Reinforcement Learning
arXiv 2025
OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation
arXiv 2025
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models
ICCV 2025
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation
arXiv 2024
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks
arXiv 2024
Next Patch Prediction for Autoregressive Visual Generation
arXiv 2024
Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly
CVPR 2025 1
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
arXiv 2022
RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness
arXiv 2022
Affiliations
Frequent co-authors
10from 18 papers