Rahul Gupta
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
arXiv 2026
Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation
arXiv 2025
Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base
arXiv 2025
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models
arXiv 2024
Evaluating Large Language Models on Controlled Generation Tasks
arXiv 2023
Controlling the Extraction of Memorized Data from Large Language Models via Prompt-Tuning
arXiv 2023
BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation
arXiv 2021
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers