Weihua Luo

Authored papers

17

UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory

arXiv 2026

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

arXiv 2025

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

arXiv 2025

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

arXiv 2025

Diffusion-SDPO: Safeguarded Direct Preference Optimization for Diffusion Models

arXiv 2025

Ovis2.5 Technical Report

arXiv 2025

HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application

arXiv 2025

Marco-Voice Technical Report

arXiv 2025

TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance

ICCV 2025

CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation

arXiv 2025

Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images

arXiv 2025

A Unified Agentic Framework for Evaluating Conditional Image Generation

arXiv 2025

LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy

arXiv 2025

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

arXiv 2024

(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts

arXiv 2024

Parrot: Multilingual Visual Instruction Tuning

arXiv 2024

M3GIA: A Cognition Inspired Multilingual and Multimodal General Intelligence Ability Benchmark

arXiv 2024

Affiliations

No known affiliations.

Frequent co-authors

10 (from 17 papers)