Huan Zhang
- Papers: 25
Authored papers (25)
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL
arXiv 2026
MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models
arXiv 2026
FORTIS: Benchmarking Over-Privilege in Agent Skills
arXiv 2026
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
arXiv 2025
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
arXiv 2025
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
arXiv 2025
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
arXiv 2025
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
arXiv 2025
When Reasoning Meets Its Laws
arXiv 2025
Clip-and-Verify: Linear Constraint-Driven Domain Clipping for Accelerating Neural Network Verification
arXiv 2025
Rethinking Diverse Human Preference Learning through Principal Component Analysis
arXiv 2025
TrustLLM: Trustworthiness in Large Language Models
arXiv 2024
Neural Network Verification with Branch-and-Bound for General Nonlinearities
arXiv 2024
Foundation Models for Music: A Survey
arXiv 2024
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
arXiv 2024
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement
arXiv 2024
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
arXiv 2024
Training-Free Bayesianization for Low-Rank Adapters of Large Language Models
arXiv 2024
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models
arXiv 2024
Testing Neural Network Verifiers: A Soundness Benchmark with Hidden Counterexamples
arXiv 2024
Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data
arXiv 2024
Robust Mixture-of-Expert Training for Convolutional Neural Networks
ICCV 2023 1
HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science
arXiv 2023
Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation
arXiv 2022
Fast Certified Robust Training with Short Warmup
NeurIPS 2021 12
Frequent co-authors
10 from 25 papers