Attribution
Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet
arXiv 2025
How Does Response Length Affect Long-Form Factuality
Automatic Model Selection with Large Language Models for Reasoning
arXiv 2023
from 3 papers
Bryan Hooi
See-Kiong Ng
Jimmy Z. J. Liu
Junxian He
Kenji Kawaguchi
Michael Qizhe Xie
Yuxi Xie