Attribution
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
arXiv 2024
Using Large Language Models for Hyperparameter Optimization
arXiv 2023
Benchmarking Neural Network Training Algorithms
from 3 papers
Abel L. Peirson
Acyr Locatelli
Ankush Garg
Bilal Khan
Chandramouli Shama Sastry
Chris J. Maddison
Daniel Snider
Daniel Suo
Dwarak Talupuru
Edward Grefenstette