0

Huaxiu Yao

Papers
41

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
41papers

Authored papers

41

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

arXiv 2026

2026

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

arXiv 2026

2026

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

arXiv 2026

2026

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

arXiv 2026

2026

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

arXiv 2026

2026

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning

arXiv 2026

2026

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

arXiv 2026

2026

ClawArena: Benchmarking AI Agents in Evolving Information Environments

arXiv 2026

2026

VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

arXiv 2026

2026

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

arXiv 2026

2026

SimpleMem: Efficient Lifelong Memory for LLM Agents

arXiv 2026

2026

MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding

arXiv 2025

2025

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

arXiv 2025

2025

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

arXiv 2025

2025

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

arXiv 2025

2025

Adapting Web Agents with Synthetic Supervision

arXiv 2025

2025

UQ: Assessing Language Models on Unsolved Questions

arXiv 2025

2025

Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails

arXiv 2025

2025

Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization

arXiv 2025

2025

Autoregressive Models in Vision: A Survey

arXiv 2024

2024

MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization

arXiv 2024

2024

SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation

arXiv 2024

2024

AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving

arXiv 2024

2024

WOMD-Reasoning: A Large-Scale Dataset for Interaction Reasoning in Driving

arXiv 2024

2024

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

arXiv 2024

2024

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

arXiv 2024

2024

Can Editing LLMs Inject Harm?

arXiv 2024

2024

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

arXiv 2024

2024

TrustLLM: Trustworthiness in Large Language Models

arXiv 2024

2024

GRAPE: Generalizing Robot Policy via Preference Alignment

arXiv 2024

2024

CREAM: Consistency Regularized Self-Rewarding Language Models

arXiv 2024

2024

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

arXiv 2024

2024

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

arXiv 2024

2024

Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

arXiv 2024

2024

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

arXiv 2024

2024

MEIT: Multi-Modal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation

arXiv 2024

2024

It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF

arXiv 2024

2024

Generalizing to Unseen Domains in Diabetic Retinopathy with Disentangled Representations

arXiv 2024

2024

Analyzing and Mitigating Object Hallucination in Large Vision-Language Models

arXiv 2023

2023

How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs

arXiv 2023

2023

Meta-Learning with Fewer Tasks through Task Interpolation

meta-learning-with-fewer-tasks-through-task-1

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 41 papers