Kuan Wang
- Papers
- 3
Cite
Notes
Only stored in your browser.
3papers
Authored papers
3Reinforcement Learning for Reasoning in Large Language Models with One Training Example
arXiv 2025
ToolQA: A Dataset for LLM Question Answering with External Tools
toolqa-a-dataset-for-llm-question-answering
APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
apq-joint-search-for-network-architecture
Affiliations
No known affiliations.
Frequent co-authors
10from 3 papers