Weihua Luo
- Papers
- 17
Cite
Notes
Only stored in your browser.
Authored papers
17UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory
arXiv 2026
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation
arXiv 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
arXiv 2025
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
arXiv 2025
Diffusion-SDPO: Safeguarded Direct Preference Optimization for Diffusion Models
arXiv 2025
Ovis2.5 Technical Report
arXiv 2025
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application
arXiv 2025
Marco-Voice Technical Report
arXiv 2025
TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance
ICCV 2025
CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation
arXiv 2025
Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images
arXiv 2025
A Unified Agentic Framework for Evaluating Conditional Image Generation
arXiv 2025
LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy
arXiv 2025
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
arXiv 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
arXiv 2024
Parrot: Multilingual Visual Instruction Tuning
arXiv 2024
M3GIA: A Cognition Inspired Multilingual and Multimodal General Intelligence Ability Benchmark
arXiv 2024
Affiliations
Frequent co-authors
10from 17 papers