0

Xiaotao Gu

Papers
23

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
23papers

Authored papers

23

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

arXiv 2026

2026

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

arXiv 2025

2025

Concat-ID: Towards Universal Identity-Preserving Video Synthesis

arXiv 2025

2025

SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

arXiv 2025

2025

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

arXiv 2025

2025

UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation

arXiv 2025

2025

WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation

arXiv 2025

2025

VPO: Aligning Text-to-Video Generation Models with Prompt Optimization

ICCV 2025

2025

LongSafety: Evaluating Long-Context Safety of Large Language Models

arXiv 2025

2025

GLM-TTS Technical Report

arXiv 2025

2025

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

arXiv 2024

2024

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

arXiv 2024

2024

CogVLM2: Visual Language Models for Image and Video Understanding

arXiv 2024

2024

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

arXiv 2024

2024

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

arXiv 2024

2024

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models

arXiv 2024

2024

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

arXiv 2024

2024

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

arXiv 2024

2024

LVBench: An Extreme Long Video Understanding Benchmark

ICCV 2025

2024

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

arXiv 2024

2024

AlignBench: Benchmarking Chinese Alignment of Large Language Models

arXiv 2023

2023

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

arXiv 2023

2023

Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 23 papers