Yuhao Wang

Papers: 19

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

19papers

Authored papers

VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification

arXiv 2026

2026

VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?

arXiv 2026

2026

SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis

arXiv 2025

2025

VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality Generation

arXiv 2025

2025

TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework

arXiv 2025

2025

ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph

arXiv 2025

2025

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

arXiv 2025

2025

U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking

arXiv 2025

2025

VocalBench: Benchmarking the Vocal Conversational Abilities for Speech Interaction Models

arXiv 2025

2025

ImageRAG: Enhancing Ultra High Resolution Remote Sensing Imagery Analysis with ImageRAG

arXiv 2024

2024

G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models

arXiv 2024

2024

Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification

arXiv 2024

2024

REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering

arXiv 2024

2024

LLMBox: A Comprehensive Library for Large Language Models

arXiv 2024

2024

Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

arXiv 2024

2024

Pre-train, Align, and Disentangle: Empowering Sequential Recommendation with Large Language Models

arXiv 2024

2024

MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception

arXiv 2024

2024

Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation

arXiv 2024

2024

Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 19 papers

Xiangyu Zhao

Ji-Rong Wen

Wayne Xin Zhao

Heyang Liu

Pengyue Jia

Ruiyang Ren

Yanfeng Wang

Yu Wang

Derong Xu

Haochen Wang