David Harwath
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Scaling Rich Style-Prompted Text-to-Speech Datasets
arXiv 2025
Rhapsody: A Dataset for Highlight Detection in Podcasts
arXiv 2025
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
arXiv 2024
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
arXiv 2024
Interface Design for Self-Supervised Speech Models
arXiv 2024
Textless Speech-to-Speech Translation With Limited Parallel Data
arXiv 2023
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model
arXiv 2022
Contrastive Audio-Visual Masked Autoencoder
arXiv 2022
Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality
arXiv 2022
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
ICCV 2021 10
Affiliations
Frequent co-authors
10from 10 papers