Haoyu Wang
- Papers
- 36
Cite
Notes
Only stored in your browser.
Authored papers
36Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs
arXiv 2026
The DAWN of World-Action Interactive Models
arXiv 2026
Towards Automated Kernel Generation in the Era of LLMs
arXiv 2026
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
arXiv 2026
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue
arXiv 2026
Language-based Trial and Error Falls Behind in the Era of Experience
arXiv 2026
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves
arXiv 2025
UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios
arXiv 2025
Haystack Engineering: Context Engineering for Heterogeneous and Agentic Long-Context Evaluation
arXiv 2025
LLM$\times$MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources
arXiv 2025
DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities
arXiv 2025
Lifelong Safety Alignment for Language Models
arXiv 2025
Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents
arXiv 2025
Prompt-Free Conditional Diffusion for Multi-object Image Augmentation
arXiv 2025
AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
arXiv 2025
MedM-VL: What Makes a Good Medical LVLM?
arXiv 2025
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration
arXiv 2025
WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation
arXiv 2025
Large Language Models for Cyber Security: A Systematic Literature Review
arXiv 2024
MV-VTON: Multi-View Virtual Try-On with Diffusion Models
arXiv 2024
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
arXiv 2024
Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection
arXiv 2024
Arm-Constrained Curriculum Learning for Loco-Manipulation of the Wheel-Legged Robot
arXiv 2024
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline
CVPR 2025 1
OMGEval: An Open Multilingual Generative Evaluation Benchmark for Large Language Models
arXiv 2024
Models Are Codes: Towards Measuring Malicious Code Poisoning Attacks on Pre-trained Model Hubs
arXiv 2024
Melody-Guided Music Generation
arXiv 2024
Cross-video Identity Correlating for Person Re-identification Pre-training
arXiv 2024
RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning
arXiv 2024
SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks
arXiv 2023
SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images
arXiv 2023
Large Language Models for Software Engineering: A Systematic Literature Review
arXiv 2023
Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes
arXiv 2023
GLAD: Content-aware Dynamic Graphs For Log Anomaly Detection
arXiv 2023
Pitfalls in Language Models for Code Intelligence: A Taxonomy and Survey
arXiv 2023
Pose Flow: Efficient Online Pose Tracking
arXiv 2018
Affiliations
Frequent co-authors
10from 36 papers