Chao-Han Huck Yang
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13PRiSM: Benchmarking Phone Realization in Speech Models
arXiv 2026
How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation
arXiv 2026
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
arXiv 2025
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
arXiv 2025
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models
arXiv 2025
Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations
arXiv 2025
DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data
arXiv 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
arXiv 2024
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
arXiv 2024
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
arXiv 2024
Towards Neural Scaling Laws for Time Series Foundation Models
arXiv 2024
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
hyporadise-an-open-baseline-for-generative
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
arXiv 2023
Affiliations
Frequent co-authors
10from 13 papers