Jieyu Zhao
- Papers: 12
Authored papers (12)
Experiential Reinforcement Learning
arXiv 2026
Video-Based Reward Modeling for Computer-Use Agents
arXiv 2026
The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents
arXiv 2026
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
arXiv 2025
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
arXiv 2025
Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base
arXiv 2025
VLMs as GeoGuessr Masters: Exceptional Performance, Hidden Biases, and Privacy Risks
arXiv 2025
TrustLLM: Trustworthiness in Large Language Models
arXiv 2024
CLIMB: A Benchmark of Clinical Bias in Large Language Models
arXiv 2024
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
arXiv 2023
Safer-Instruct: Aligning Language Models with Automated Preference Data
arXiv 2023
Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems
arXiv 2023
Frequent co-authors
10 from 12 papers