0

Jie Wu

Papers
26

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
26papers

Authored papers

26

LoL: Longer than Longer, Scaling Video Generation to Hour

arXiv 2026

2026

AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents

arXiv 2026

2026

Enhancing Spatial Understanding in Image Generation via Reward Modeling

arXiv 2026

2026

Towards Long-horizon Agentic Multimodal Search

arXiv 2026

2026

AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward

arXiv 2026

2026

X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests

arXiv 2026

2026

Closing the Loop: Universal Repository Representation with RPG-Encoder

arXiv 2026

2026

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

arXiv 2025

2025

DanceGRPO: Unleashing GRPO on Visual Generation

arXiv 2025

2025

EpiCoder: Encompassing Diversity and Complexity in Code Generation

arXiv 2025

2025

IterPref: Focal Preference Learning for Code Generation via Iterative Debugging

arXiv 2025

2025

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

arXiv 2025

2025

OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning

arXiv 2025

2025

Step-Audio 2 Technical Report

arXiv 2025

2025

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

arXiv 2025

2025

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

arXiv 2025

2025

Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model

arXiv 2025

2025

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning

arXiv 2024

2024

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

arXiv 2024

2024

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

arXiv 2024

2024

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

arXiv 2024

2024

ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models

arXiv 2024

2024

AlignDet: Aligning Pre-training and Fine-tuning in Object Detection

ICCV 2023 1

2023

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration

ICCV 2023 1

2023

Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning

arXiv 2023

2023

Multi-Granularity Distillation Scheme Towards Lightweight Semi-Supervised Semantic Segmentation

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 26 papers