Kaifu Zhang
- Papers: 15
Authored papers (15)
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation
arXiv 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
arXiv 2025
Diffusion-SDPO: Safeguarded Direct Preference Optimization for Diffusion Models
arXiv 2025
Ovis2.5 Technical Report
arXiv 2025
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application
arXiv 2025
Marco-Voice Technical Report
arXiv 2025
TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance
ICCV 2025
USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models
arXiv 2025
LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy
arXiv 2025
CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation
arXiv 2025
Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images
arXiv 2025
A Unified Agentic Framework for Evaluating Conditional Image Generation
arXiv 2025
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
arXiv 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
arXiv 2024
Parrot: Multilingual Visual Instruction Tuning
arXiv 2024
Affiliations
Frequent co-authors
10 from 15 papers