Attribution
SocialEval: Evaluating Social Intelligence of Large Language Models
arXiv 2025
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
SafetyBench: Evaluating the Safety of Large Language Models
arXiv 2023
from 3 papers
Minlie Huang
Jie Tang
engineer
Aohan Zeng
Baoxu Wang
Bin Xu
Boyan Shi
Changyu Pang
Chenhui Zhang
Chong Long
Da Yin