Hang Su
- Papers
- 35
Cite
Notes
Only stored in your browser.
Authored papers
35Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation
arXiv 2026
Safety Alignment as Continual Learning: Mitigating the Alignment Tax via Orthogonal Gradient Projection
arXiv 2026
GLEAN: Generalized Category Discovery with Diverse and Quality-Enhanced LLM Feedback
arXiv 2025
Visual Generation Without Guidance
arXiv 2025
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
arXiv 2025
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors
arXiv 2024
MicroDreamer: Efficient 3D Generation in $\sim$20 Seconds by Score-based Iterative Reconstruction
arXiv 2024
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
arXiv 2024
Noise Contrastive Alignment of Language Models with Explicit Rewards
arXiv 2024
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
arXiv 2024
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
arXiv 2024
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
arXiv 2024
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
arXiv 2024
Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
arXiv 2024
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights
arXiv 2024
UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs
arXiv 2024
Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency
arXiv 2024
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
arXiv 2023
PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs
arXiv 2023
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
arXiv 2023
Detection Transformer with Stable Matching
ICCV 2023 1
COCO-O: A Benchmark for Object Detectors under Natural Distribution Shifts
coco-o-a-benchmark-for-object-detectors-under
A Comprehensive Survey of Continual Learning: Theory, Method and Application
arXiv 2023
GNOT: A General Neural Operator Transformer for Operator Learning
arXiv 2023
Evil Geniuses: Delving into the Safety of LLM-based Agents
arXiv 2023
Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning
arXiv 2023
Score Regularized Policy Optimization through Diffusion Behavior
arXiv 2023
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
arXiv 2023
NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data
arXiv 2023
Rethinking Model Ensemble in Transfer-based Adversarial Attacks
arXiv 2023
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
arXiv 2023
Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning
arXiv 2023
Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation
overcoming-recency-bias-of-normalization
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
dino-detr-with-improved-denoising-anchor
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
dab-detr-dynamic-anchor-boxes-are-better
Affiliations
Frequent co-authors
10from 35 papers