David Harwath

Papers: 10

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

10papers

Authored papers

Scaling Rich Style-Prompted Text-to-Speech Datasets

arXiv 2025

2025

Rhapsody: A Dataset for Highlight Detection in Podcasts

arXiv 2025

2025

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

arXiv 2024

2024

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

arXiv 2024

2024

Interface Design for Self-Supervised Speech Models

arXiv 2024

2024

Textless Speech-to-Speech Translation With Limited Parallel Data

arXiv 2023

2023

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model

arXiv 2022

2022

Contrastive Audio-Visual Masked Autoencoder

arXiv 2022

2022

Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality

arXiv 2022

2022

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos

ICCV 2021 10

2021

Affiliations

No known affiliations.

Frequent co-authors

from 10 papers

Anuj Diwan

Eunsol Choi

Yi-Jen Shih

Andrew Rouditchenko

Hilde Kuehne

Hung-Yi Lee

James Glass

Layne Berry

Puyuan Peng

Abdelrahman Mohamed