Yulia Tsvetkov
- Papers
- 27
Cite
Notes
Only stored in your browser.
Authored papers
27Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch
arXiv 2026
Cognitive Foundations for Reasoning and Their Manifestation in LLMs
arXiv 2025
RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
arXiv 2025
Spurious Rewards: Rethinking Training Signals in RLVR
arXiv 2025
PrefPalette: Personalized Preference Modeling with Latent Attributes
arXiv 2025
Don't Throw Away Your Pretrained Model
arXiv 2025
Medical Hallucinations in Foundation Models and Their Impact on Healthcare
arXiv 2025
BLAB: Brutally Long Audio Bench
arXiv 2025
Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning
arXiv 2025
MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation
arXiv 2025
Do Membership Inference Attacks Work on Large Language Models?
arXiv 2024
Tuning Language Models by Proxy
arXiv 2024
DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
arXiv 2024
MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning
arXiv 2024
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
arXiv 2024
Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration
arXiv 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks
arXiv 2024
Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
arXiv 2024
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting
arXiv 2023
Can Language Models Solve Graph Problems in Natural Language?
NeurIPS 2023 11
From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models
arXiv 2023
Assessing Language Model Deployment with Risk Cards
arXiv 2023
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
arXiv 2023
KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models
arXiv 2023
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
arXiv 2022
DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues
dialograph-incorporating-interpretable
Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings
black-is-to-criminal-as-caucasian-is-to-1
Affiliations
Frequent co-authors
10from 27 papers
Shangbin Feng
Shuyue Stella Li
Yejin Choi
professor
Vidhisha Balachandran
Pang Wei Koh
Tianxing He
Chan Young Park
Hannaneh Hajishirzi
professor
Niloofar Mireshghallah
Noah A. Smith