Yuxiao Dong
- Papers
- 52
Cite
Notes
Only stored in your browser.
Authored papers
52GLM-5: from Vibe Coding to Agentic Engineering
arXiv 2026
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
arXiv 2026
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
arXiv 2025
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search
arXiv 2025
AndroidGen: Building an Android Language Agent under Data Scarcity
arXiv 2025
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
arXiv 2025
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
ICCV 2025
LongSafety: Evaluating Long-Context Safety of Large Language Models
arXiv 2025
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling
arXiv 2025
Parameter-Efficient Fine-Tuning for Foundation Models
arXiv 2025
LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering
arXiv 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
arXiv 2024
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
arXiv 2024
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
arXiv 2024
AutoWebGLM: A Large Language Model-based Web Navigating Agent
arXiv 2024
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
arXiv 2024
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
arXiv 2024
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
arXiv 2024
CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
arXiv 2024
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
arXiv 2024
LongReward: Improving Long-context Large Language Models with AI Feedback
arXiv 2024
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments
arXiv 2024
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
arXiv 2024
AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models
arXiv 2024
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
arXiv 2024
CogVLM2: Visual Language Models for Image and Video Understanding
arXiv 2024
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
arXiv 2024
Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
arXiv 2024
GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot
arXiv 2024
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
arXiv 2024
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
arXiv 2024
LongAlign: A Recipe for Long Context Alignment of Large Language Models
arXiv 2024
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models
arXiv 2024
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
arXiv 2024
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
arXiv 2024
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024 1
AgentTuning: Enabling Generalized Agent Abilities for LLMs
arXiv 2023
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
arXiv 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
imagereward-learning-and-evaluating-human
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
arXiv 2023
GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation
arXiv 2023
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X
arXiv 2023
CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
arXiv 2023
AgentBench: Evaluating LLMs as Agents
arXiv 2023
AlignBench: Benchmarking Chinese Alignment of Large Language Models
arXiv 2023
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation
arXiv 2023
GLM-130B: An Open Bilingual Pre-trained Model
arXiv 2022
GraphMAE: Self-Supervised Masked Graph Autoencoders
arXiv 2022
OGB-LSC: A Large-Scale Challenge for Machine Learning on Graphs
arXiv 2021
GPT-GNN: Generative Pre-Training of Graph Neural Networks
arXiv 2020
GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training
arXiv 2020
Affiliations
Frequent co-authors
10from 52 papers