Taiwei Shi
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Experiential Reinforcement Learning
arXiv 2026
Video-Based Reward Modeling for Computer-Use Agents
arXiv 2026
The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents
arXiv 2026
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks
arXiv 2026
Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base
arXiv 2025
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
arXiv 2025
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
arXiv 2025
How Susceptible are Large Language Models to Ideological Manipulation?
arXiv 2024
CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation
arXiv 2023
Safer-Instruct: Aligning Language Models with Automated Preference Data
arXiv 2023
Affiliations
Frequent co-authors
10from 10 papers