Po-Yao Huang

Attribution

5 papers

Authored papers

Perception Encoder: The best visual embeddings are not at the output of the network

arXiv 2025

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

arXiv 2024

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

arXiv 2023

CiT: Curation in Training for Effective Vision-Language Data

ICCV 2023

Masked Autoencoders that Listen

arXiv 2022

No known affiliations.

Co-authors from 5 papers

Christoph Feichtenhofer

Hu Xu

Chen Wei

Daniel Bolya

Abdelrahman Mohamed

Alexei Baevski

Andrea Madotto

Arkabandhu Chowdhury

Chaitanya Ryali

Daniel Li