Meng Cao
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese
arXiv 2025
ScaleLong: A Multi-Timescale Benchmark for Long Video Understanding
arXiv 2025
IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs
arXiv 2025
"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models
arXiv 2025
Vision-Language Models Meet Meteorology: Developing Models for Extreme Weather Events Detection with Heatmaps
arXiv 2024
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
physgame-uncovering-physical-commonsense
How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?
arXiv 2024
ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
arXiv 2024
Systematic Rectification of Language Models via Dead-end Analysis
arXiv 2023
VeCLIP: Improving CLIP Training via Visual-enriched Captions
arXiv 2023
Improving Retrieval-Augmented Large Language Models via Data Importance Learning
arXiv 2023
Efficient ConvBN Blocks for Transfer Learning and Beyond
arXiv 2023
Affiliations
Frequent co-authors
10from 12 papers