Papers

Trending research and the full catalog - each paper linked to the benchmarks, methods, and models it introduces.

Filtered by domain: ReasoningClear

CAVEWOMAN: How Large Language Models Behave Under Linguistic Input and Output Compression

23 Jun 2026

"Talk short. Drop grammar. Save token." This caveman style is widely promoted as a way to cut inference cost, but whether it actually saves anything depends on which channel (the user's prompt or the model's response) is being compressed.

Language Modeling Reasoning

ReNIO: Reweighting Negative Trajectory Importance for LLM On-Policy Distillation

22 Jun 2026

On-policy distillation (OPD) improves LLM reasoning by training a student model on its own generated outputs, but standard OPD treats all student-generated outputs (SGOs) equally regardless of their informativeness.

Language Modeling Reasoning Reinforcement Learning

Do Thinking Tokens Help with Safety?

23 Jun 2026

Today's reasoning models use thinking tokens to attain stronger performance on benchmarks than their instruction-tuned counterparts. It is also generally believed that this more "deliberative" mode should improve alignment and safety, by providing the model a safe space…

Reasoning