Robin Jia
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14Precise Debugging Benchmark: Is Your Model Debugging or Regenerating?
arXiv 2026
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks
arXiv 2026
Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions
arXiv 2025
When Do LLMs Admit Their Mistakes? Understanding the Role of Model Belief in Retraction
arXiv 2025
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning
arXiv 2025
Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries
arXiv 2025
Teaching Models to Understand (but not Generate) High-risk Data
arXiv 2025
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations
arXiv 2024
Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression
arXiv 2023
How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench
arXiv 2023
Estimating Large Language Model Capabilities without Labeled Test Data
arXiv 2023
With Little Power Comes Great Responsibility
EMNLP 2020 11
MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension
mrqa-2019-shared-task-evaluating-1
Delete, Retrieve, Generate: A Simple Approach to Sentiment and Style Transfer
delete-retrieve-generate-a-simple-approach-to-1
Affiliations
Frequent co-authors
10from 14 papers