Xuming Hu
- Papers
- 28
Cite
Notes
Only stored in your browser.
Authored papers
28Panoramic Affordance Prediction
arXiv 2026
AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents
arXiv 2026
CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding
arXiv 2026
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference
arXiv 2025
UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models
arXiv 2025
Shifting AI Efficiency From Model-Centric to Data-Centric Compression
arXiv 2025
Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness
arXiv 2025
RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video
arXiv 2025
SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning
arXiv 2025
TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering
arXiv 2025
Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook
arXiv 2025
OneForecast: A Universal Framework for Global and Regional Weather Forecasting
arXiv 2025
Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning
arXiv 2025
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models
arXiv 2025
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment
arXiv 2025
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
arXiv 2024
Interpretable Contrastive Monte Carlo Tree Search Reasoning
arXiv 2024
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model
arXiv 2024
LongGenBench: Long-context Generation Benchmark
arXiv 2024
On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations
arXiv 2024
Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities
arXiv 2024
Multi-view Hypergraph-based Contrastive Learning Model for Cold-Start Micro-video Recommendation
arXiv 2024
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
arXiv 2024
LLaSA: Large Language and E-Commerce Shopping Assistant
arXiv 2024
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
arXiv 2024
A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability
arXiv 2023
A Semantic Invariant Robust Watermark for Large Language Models
arXiv 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
arXiv 2023
Affiliations
Frequent co-authors
10from 28 papers