Xu Tang
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14FireRed-Image-Edit-1.0 Techinical Report
arXiv 2026
FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System
arXiv 2026
FireRed-OCR Technical Report
arXiv 2026
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration
arXiv 2025
ASM-UNet: Adaptive Scan Mamba Integrating Group Commonalities and Individual Variations for Fine-Grained Segmentation
arXiv 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
arXiv 2025
InstantID: Zero-shot Identity-Preserving Generation in Seconds
arXiv 2024
Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance
arXiv 2024
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
arXiv 2024
Controllable Mind Visual Diffusion Model
arXiv 2023
Towards Open-Vocabulary Video Instance Segmentation
ICCV 2023 1
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
CVPR 2024 1
OvarNet: Towards Open-vocabulary Object Attribute Recognition
CVPR 2023 1
ZONE: Zero-Shot Instruction-Guided Local Editing
CVPR 2024 1
Affiliations
Frequent co-authors
10from 14 papers