Yuhao Wang
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification
arXiv 2026
VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?
arXiv 2026
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
arXiv 2025
U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking
arXiv 2025
ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph
arXiv 2025
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs
arXiv 2025
VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality Generation
arXiv 2025
TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework
arXiv 2025
VocalBench: Benchmarking the Vocal Conversational Abilities for Speech Interaction Models
arXiv 2025
ImageRAG: Enhancing Ultra High Resolution Remote Sensing Imagery Analysis with ImageRAG
arXiv 2024
G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models
arXiv 2024
Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification
arXiv 2024
LLMBox: A Comprehensive Library for Large Language Models
arXiv 2024
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation
arXiv 2024
Pre-train, Align, and Disentangle: Empowering Sequential Recommendation with Large Language Models
arXiv 2024
MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception
arXiv 2024
Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation
arXiv 2024
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering
arXiv 2024
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
arXiv 2023
Affiliations
Frequent co-authors
10from 19 papers