Takashi Shibuya
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8HumanGif: Single-View Human Diffusion with Generative Prior
arXiv 2025
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
CVPR 2025 1
A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
arXiv 2024
MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation
arXiv 2024
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping
arXiv 2024
SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation
arXiv 2024
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
bigvsan-enhancing-gan-based-neural-vocoders
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer
arXiv 2023
Affiliations
Frequent co-authors
10from 8 papers