Shiyu Huang

Papers: 8

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

OmniGUI: Benchmarking GUI Agents in Omni-Modal Smartphone Environments

arXiv 2026

2026

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

arXiv 2025

2025

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

arXiv 2024

2024

CogVLM2: Visual Language Models for Image and Video Understanding

arXiv 2024

2024

LVBench: An Extreme Long Video Understanding Benchmark

ICCV 2025

2024

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

arXiv 2024

2024

TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play

arXiv 2023

2023

Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Jie Tang

engineer

Xiaotao Gu

Yuxiao Dong

Bin Xu

Ming Ding

Weihan Wang

Wenyi Hong

Xiaohan Zhang

Yean Cheng

Da Yin