0

Hao Wu

Papers
26

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
26papers

Authored papers

26

MiMo-V2-Flash Technical Report

arXiv 2026

2026

HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit

arXiv 2026

2026

FireRed-OCR Technical Report

arXiv 2026

2026

UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking

arXiv 2026

2026

Speak While Watching: Unleashing TRUE Real-Time Video Understanding Capability of Multimodal Large Language Models

arXiv 2026

2026

ViCA: Efficient Multimodal LLMs with Vision-Only Cross-Attention

arXiv 2026

2026

PRISM: Position-encoded Regressive Inverse Spectral Model for Multilayer Thin-Film Design

arXiv 2026

2026

MemOS: A Memory OS for AI System

arXiv 2025

2025

NeuralOM: Neural Ocean Model for Subseasonal-to-Seasonal Simulation

arXiv 2025

2025

Pretraining Large Language Models with NVFP4

arXiv 2025

2025

OneForecast: A Universal Framework for Global and Regional Weather Forecasting

arXiv 2025

2025

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

arXiv 2025

2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

arXiv 2025

2025

Step-GUI Technical Report

arXiv 2025

2025

GCPO: When Contrast Fails, Go Gold

arXiv 2025

2025

A Systematic Study of Code Obfuscation Against LLM-based Vulnerability Detection

arXiv 2025

2025

CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks

arXiv 2024

2024

One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models

ICCV 2025

2024

Learning Graph Quantized Tokenizers

arXiv 2024

2024

Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning

arXiv 2024

2024

A Closer Look at Few-shot Classification Again

arXiv 2023

2023

An Intelligent Remote Sensing Image Quality Inspection System

arXiv 2023

2023

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

arXiv 2022

2022

Contrastive Vision-Language Pre-training with Limited Resources

arXiv 2021

2021

XRJL-HKUST at SemEval-2021 Task 4: WordNet-Enhanced Dual Multi-head Co-Attention for Reading Comprehension of Abstract Meaning

SEMEVAL 2021

2021

Stochastic Normalizing Flows

NeurIPS 2020 12

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 26 papers