Sheng Zhou
Authored papers (8)
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
CVPR 2025 1
MP-GUI: Modality Perception with MLLMs for GUI Understanding
CVPR 2025 1
GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies
arXiv 2025
OThink-R1: Intrinsic Fast/Slow Thinking Mode Switching for Over-Reasoning Mitigation
arXiv 2025
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark
arXiv 2025
FocusedAD: Character-centric Movie Audio Description
arXiv 2025
PSL: Rethinking and Improving Softmax Loss from Pairwise Perspective for Recommendation
arXiv 2024
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
arXiv 2024
Frequent co-authors
10 from 8 papers