Bosi Wen
- Papers
- 8
Cite
Notes
Only stored in your browser.
8papers
Authored papers
8GLM-5: from Vibe Coding to Agentic Engineering
arXiv 2026
IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation
arXiv 2026
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
arXiv 2025
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
arXiv 2024
CharacterBench: Benchmarking Character Customization of Large Language Models
arXiv 2024
CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
arXiv 2023
AlignBench: Benchmarking Chinese Alignment of Large Language Models
arXiv 2023
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 8 papers