Hongtao Xie
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Mask$^2$DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation
arXiv 2025
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
CVPR 2025 1
Test-Time Scaling with Reflective Generative Model
arXiv 2025
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
arXiv 2024
DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations
CVPR 2024 1
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
ICCV 2025
TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition
arXiv 2023
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
arXiv 2022
Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets
arXiv 2022
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
CVPR 2021 1
Affiliations
Frequent co-authors
10from 10 papers