Chenfei Wu
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14Qwen-Image-VAE-2.0 Technical Report
arXiv 2026
Qwen-Image Technical Report
arXiv 2025
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition
arXiv 2025
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
arXiv 2025
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model
arXiv 2025
LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models
arXiv 2024
LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models
arXiv 2023
ORES: Open-vocabulary Responsible Visual Synthesis
arXiv 2023
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
arXiv 2023
GameEval: Evaluating LLMs on Conversational Games
arXiv 2023
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
arXiv 2023
Low-code LLM: Graphical User Interface over Large Language Models
arXiv 2023
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
arXiv 2022
BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning
arXiv 2022
Affiliations
Frequent co-authors
10from 14 papers