Zhenguo Li
- Papers: 34
Authored papers (34)
SimVLA: A Simple VLA Baseline for Robotic Manipulation
arXiv 2026
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
arXiv 2025
Implicit Search via Discrete Diffusion: A Study on Chess
arXiv 2025
Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning
arXiv 2025
Self-Adjust Softmax
arXiv 2025
Mathesis: Towards Formal Theorem Proving from Natural Languages
arXiv 2025
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
arXiv 2024
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
arXiv 2024
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models
arXiv 2024
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data
arXiv 2024
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
CVPR 2025
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
arXiv 2024
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
arXiv 2024
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
arXiv 2024
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
QuickLLaMA: Query-aware Inference Acceleration for Large Language Models
arXiv 2024
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
arXiv 2024
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
arXiv 2024
Jailbreaking as a Reward Misspecification Problem
arXiv 2024
Editing Massive Concepts in Text-to-Image Diffusion Models
arXiv 2024
A Survey of Reasoning with Foundation Models
arXiv 2023
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation
ICCV 2023
MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation
arXiv 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
arXiv 2023
DDP: Diffusion Model for Dense Visual Prediction
ICCV 2023
Progressive-Hint Prompting Improves Reasoning in Large Language Models
arXiv 2023 (arXiv:2304.09797)
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
arXiv 2023
Beyond One-to-One: Rethinking the Referring Image Segmentation
ICCV 2023
Lyra: Orchestrating Dual Correction in Automated Theorem Proving
arXiv 2023
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
arXiv 2023
CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds
arXiv 2022
Generalizing Few-Shot NAS with Gradient Matching
DropNAS: Grouped Operation Dropout for Differentiable Architecture Search
arXiv 2022
Frequent co-authors
10 from 34 papers