Distilling Feedback into Memory-as-a-Tool

A framework converts transient critiques into retrievable guidelines using a file-based memory system and agent-controlled tool calls, enabling LLMs to match test-time refinement performance with reduced inference costs.

Open

Preview
Year: 2026
Venue: arXiv 2026
ArXiv: arxiv.org/abs/2601.05960
Authors: 1
Hosting: Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2601.05960ARXIV-DEFAULT
TL;DR: Semantic Scholar

Attribution policy →

Abstract

We propose a framework that amortizes the cost of inference-time reasoning by converting transient critiques into retrievable guidelines, through a file-based memory system and agent-controlled tool calls. We evaluate this method on the Rubric Feedback Bench, a novel dataset for rubric-based learning. Experiments demonstrate that our augmented LLMs rapidly match the performance of test-time refinement pipelines while drastically reducing inference cost.

Authors

Víctor Gallego