0

Tuo Zhao

Papers
19

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
19papers

Authored papers

19

Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training

arXiv 2026

2026

COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs

arXiv 2025

2025

NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models

arXiv 2025

2025

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

arXiv 2025

2025

Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models

arXiv 2025

2025

Discriminative Finetuning of Generative Large Language Models without Reward Models and Human Preference Data

arXiv 2025

2025

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

arXiv 2024

2024

To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO

arXiv 2024

2024

RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning

arXiv 2024

2024

Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering

arXiv 2024

2024

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

arXiv 2023

2023

AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning

arXiv 2023

2023

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

arXiv 2023

2023

Deep Reinforcement Learning from Hierarchical Preference Design

arXiv 2023

2023

Machine Learning Force Fields with Data Cost Aware Training

arXiv 2023

2023

Less is More: Task-aware Layer-wise Distillation for Language Model Compression

arXiv 2022

2022

Taming Sparsely Activated Transformer with Stochastic Experts

taming-sparsely-activated-transformer-with-1

2021

Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach

NAACL 2021 4

2020

SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization

smart-robust-and-efficient-fine-tuning-for-1

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 19 papers