Kelvin Xu

Cite

Notes

Only stored in your browser.

Attribution

2papers

Authored papers

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

arXiv 2024

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

arXiv 2023

No known affiliations.

from 2 papers

Charlie Snell

Aviral Kumar

Charles Sun

Isadora White

Jaehoon Lee

researcher

Joey Hong

Marwa Abdulhai

Sergey Levine

professor

Yuexiang Zhai