Karolina Stańczak

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

SafeArena: Evaluating the Safety of Autonomous Web Agents

arXiv 2025

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

arXiv 2025

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

arXiv 2025

A Latent-Variable Model for Intrinsic Probing

arXiv 2022

No known affiliations.

from 4 papers

Arkil Patel

Nicholas Meade

Siva Reddy

Xing Han Lù

Alejandra Zambrano

Amirhossein Kazemnejad

Dongchan Shin

researcher

Ada Defne Tur

Adina Williams

Aditi Khandelwal