Cite
Notes
Only stored in your browser.
Attribution
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
arXiv 2025
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
from 3 papers
Chenxi Liu
Enshu Liu
Guohao Dai
Hanrong Ye
Heng Huang
Hongxu Yin
Huazhong Yang
Huizi Mao
Ka Chun Cheung
Kaishen Wang