Cite
Notes
Only stored in your browser.
Attribution
Towards Evaluating and Building Versatile Large Language Models for Medicine
arXiv 2024
Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity
arXiv 2023
Escaping saddle points in zeroth-order optimization: the power of two-point estimators
arXiv 2022
from 3 papers
Chaoyi Wu
Hongfei Gu
Jinxin Liu
Pengcheng Qiu
Runyu Zhang
Weidi Xie
Ya zhang
Yanfeng Wang
Yang Hu
Yujie Tang