Yidong Wang
- Papers: 17
Authored papers (17)
GLM-5: from Vibe Coding to Agentic Engineering
arXiv 2026
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
arXiv 2025
Masked Autoencoders Are Effective Tokenizers for Diffusion Models
arXiv 2025
TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them
arXiv 2025
RewardAnything: Generalizable Principle-Following Reward Models
arXiv 2025
AutoSurvey: Large Language Models Can Automatically Write Surveys
arXiv 2024
Apollo: A Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B People
arXiv 2024
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models
arXiv 2024
Reasoning Through Execution: Unifying Process and Outcome Rewards for Code Generation
arXiv 2024
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
arXiv 2024
CoderUJB: An Executable and Unified Java Benchmark for Practical Programming Scenarios
arXiv 2024
A Survey on Evaluation of Large Language Models
arXiv 2023
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization
arXiv 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
arXiv 2023
Supervised Knowledge Makes Large Language Models Better In-context Learners
arXiv 2023
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
arXiv 2022
USB: A Unified Semi-supervised Learning Benchmark for Classification
arXiv 2022
Frequent co-authors
10 from 17 papers