Shiyu Huang
- Papers
- 8
Cite
Notes
Only stored in your browser.
8papers
Authored papers
8OmniGUI: Benchmarking GUI Agents in Omni-Modal Smartphone Environments
arXiv 2026
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
arXiv 2025
CogVLM2: Visual Language Models for Image and Video Understanding
arXiv 2024
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
arXiv 2024
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
arXiv 2024
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
arXiv 2023
Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 8 papers