Tae-Hyun Oh
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
ICCV 2025
Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment
arXiv 2024
DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
ICCV 2025
Noise Map Guidance: Inversion with Spatial Context for Real Image Editing
arXiv 2024
Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert
arXiv 2024
Prefix tuning for automated audio captioning
arXiv 2023
SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models
arXiv 2023
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers
arXiv 2022
FedPara: Low-Rank Hadamard Product for Communication-Efficient Federated Learning
fedpara-low-rank-hadamard-product-for
Affiliations
Frequent co-authors
10from 9 papers