Huan Zhang

Papers
25


Authored papers

25

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

arXiv 2026

2026

MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models

arXiv 2026

2026

FORTIS: Benchmarking Over-Privilege in Agent Skills

arXiv 2026

2026

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

arXiv 2025

2025

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

arXiv 2025

2025

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

arXiv 2025

2025

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

arXiv 2025

2025

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

arXiv 2025

2025

When Reasoning Meets Its Laws

arXiv 2025

2025

Clip-and-Verify: Linear Constraint-Driven Domain Clipping for Accelerating Neural Network Verification

arXiv 2025

2025

Rethinking Diverse Human Preference Learning through Principal Component Analysis

arXiv 2025

2025

TrustLLM: Trustworthiness in Large Language Models

arXiv 2024

2024

Neural Network Verification with Branch-and-Bound for General Nonlinearities

arXiv 2024

2024

Foundation Models for Music: A Survey

arXiv 2024

2024

COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

arXiv 2024

2024

A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement

arXiv 2024

2024

Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs

arXiv 2024

2024

Training-Free Bayesianization for Low-Rank Adapters of Large Language Models

arXiv 2024

2024

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

arXiv 2024

2024

Testing Neural Network Verifiers: A Soundness Benchmark with Hidden Counterexamples

arXiv 2024

2024

Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data

arXiv 2024

2024

Robust Mixture-of-Expert Training for Convolutional Neural Networks

ICCV 2023 1

2023

HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science

arXiv 2023

2023

Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation

arXiv 2022

2022

Fast Certified Robust Training with Short Warmup

NeurIPS 2021 12

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 25 papers