Qingru Zhang
6 papers
Authored papers
Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models
arXiv 2025
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
arXiv 2024
Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering
arXiv 2024
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning
arXiv 2023
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
arXiv 2023
Less is More: Task-aware Layer-wise Distillation for Language Model Compression
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10 from 6 papers