Xiaotao Gu
- Papers
- 23
Cite
Notes
Only stored in your browser.
Authored papers
23GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
arXiv 2026
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
arXiv 2025
Concat-ID: Towards Universal Identity-Preserving Video Synthesis
arXiv 2025
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
arXiv 2025
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
arXiv 2025
UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation
arXiv 2025
WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation
arXiv 2025
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
ICCV 2025
LongSafety: Evaluating Long-Context Safety of Large Language Models
arXiv 2025
GLM-TTS Technical Report
arXiv 2025
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
arXiv 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
arXiv 2024
CogVLM2: Visual Language Models for Image and Video Understanding
arXiv 2024
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
arXiv 2024
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
arXiv 2024
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
arXiv 2024
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
arXiv 2024
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
arXiv 2024
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
arXiv 2024
AlignBench: Benchmarking Chinese Alignment of Large Language Models
arXiv 2023
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
arXiv 2023
Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast
arXiv 2022
Affiliations
Frequent co-authors
10from 23 papers