Cite
Notes
Only stored in your browser.
Attribution
Diffusion Language Models Know the Answer Before Decoding
arXiv 2025
Model Balancing Helps Low-data Training and Fine-tuning
arXiv 2024
A Three-regime Model of Network Pruning
arXiv 2023
from 3 papers
Yaoqing Yang
Arin Chang
Dilxat Muhtar
Li Shen
Lu Yin
Michael W. Mahoney
Pengxiang Li
Pu Ren
Shilin Yan
Shiwei Liu