Cite
Notes
Only stored in your browser.
Attribution
Metis: A Foundation Speech Generation Model with Masked Generative Pre-training
arXiv 2025
TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
arXiv 2024
from 3 papers
Zhizheng Wu
Xueyao Zhang
Yuancheng Wang
Baichuan Zhou
Conghui He
Dahua Lin
Dekun Chen
Hengrui Kang
Honglin Lin
Huan Liao