Zhenguo Li

Papers
34

Authored papers

34

SimVLA: A Simple VLA Baseline for Robotic Manipulation

arXiv 2026

2026

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

arXiv 2025

2025

Implicit Search via Discrete Diffusion: A Study on Chess

arXiv 2025

2025

Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning

arXiv 2025

2025

Self-Adjust Softmax

arXiv 2025

2025

Mathesis: Towards Formal Theorem Proving from Natural Languages

arXiv 2025

2025

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

arXiv 2024

2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

arXiv 2024

2024

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

arXiv 2024

2024

Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

arXiv 2024

2024

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

CVPR 2025

2024

Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

arXiv 2024

2024

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

arXiv 2024

2024

DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

arXiv 2024

2024

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

CVPR 2025

2024

QuickLLaMA: Query-aware Inference Acceleration for Large Language Models

arXiv 2024

2024

Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding

arXiv 2024

2024

MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data

arXiv 2024

2024

Jailbreaking as a Reward Misspecification Problem

arXiv 2024

2024

Editing Massive Concepts in Text-to-Image Diffusion Models

arXiv 2024

2024

A Survey of Reasoning with Foundation Models

arXiv 2023

2023

UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation

ICCV 2023

2023

MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation

arXiv 2023

2023

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

arXiv 2023

2023

DDP: Diffusion Model for Dense Visual Prediction

ICCV 2023

2023

Progressive-Hint Prompting Improves Reasoning in Large Language Models

arXiv 2023 (arXiv:2304.09797)

2023

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

arXiv 2023

2023

Beyond One-to-One: Rethinking the Referring Image Segmentation

ICCV 2023

2023

Lyra: Orchestrating Dual Correction in Automated Theorem Proving

arXiv 2023

2023

SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models

2023

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

arXiv 2023

2023

CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds

arXiv 2022

2022

Generalizing Few-Shot NAS with Gradient Matching

2022

DropNAS: Grouped Operation Dropout for Differentiable Architecture Search

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 34 papers