Hongyu Li
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16Gen-Searcher: Reinforcing Agentic Search for Image Generation
arXiv 2026
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding
arXiv 2026
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
arXiv 2026
UMO: Unified In-Context Learning Unlocks Motion Foundation Model Priors
arXiv 2026
Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning
arXiv 2026
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
CVPR 2025 1
Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency
arXiv 2025
OneThinker: All-in-one Reasoning Model for Image and Video
arXiv 2025
2D Gaussian Splatting with Semantic Alignment for Image Inpainting
arXiv 2025
EditThinker: Unlocking Iterative Reasoning for Any Image Editor
arXiv 2025
OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation
arXiv 2025
OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation
arXiv 2025
Architecture Decoupling Is Not All You Need For Unified Multimodal Model
arXiv 2025
LongCat-Video Technical Report
arXiv 2025
Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation
arXiv 2025
DuReader_retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine
arXiv 2022
Affiliations
Frequent co-authors
10from 16 papers