0

Jianfei Chen

Papers
22

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
22papers

Authored papers

22

KernelBench-X: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

arXiv 2026

2026

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

arXiv 2025

2025

SageAttention2++: A More Efficient Implementation of SageAttention2

arXiv 2025

2025

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

arXiv 2025

2025

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

arXiv 2025

2025

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

arXiv 2025

2025

Task-Specific Zero-shot Quantization-Aware Training for Object Detection

ICCV 2025

2025

SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model Inference

arXiv 2025

2025

Visual Generation Without Guidance

arXiv 2025

2025

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

arXiv 2024

2024

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

arXiv 2024

2024

Diffusion Bridge Implicit Models

arXiv 2024

2024

Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization

arXiv 2024

2024

Pruning Large Language Models with Semi-Structural Adaptive Sparse Training

arXiv 2024

2024

1-Bit FQT: Pushing the Limit of Fully Quantized Training to 1-bit

arXiv 2024

2024

Efficient Backpropagation with Variance-Controlled Adaptive Sampling

arXiv 2024

2024

DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics

dpm-solver-v3-improved-diffusion-ode-solver

2023

Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning

arXiv 2023

2023

Training Transformers with 4-bit Integers

training-transformers-with-4-bit-integers

2023

Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs

arXiv 2023

2023

DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models

arXiv 2022

2022

GACT: Activation Compressed Training for Generic Network Architectures

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 22 papers