Emine Yilmaz
- Papers: 11
Authored papers
AgentSearchBench: A Benchmark for AI Agent Search in the Wild
arXiv 2026
InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem
arXiv 2026
PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants
arXiv 2025
Judging the Judges: A Collection of LLM-Generated Relevance Judgements
arXiv 2025
Attributing Response to Context: A Jensen-Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation
arXiv 2025
Machine-generated text detection prevents language model collapse
arXiv 2025
Benchmarking LLMs via Uncertainty Quantification
arXiv 2024
Instruction Tuning With Loss Over Instructions
arXiv 2024
LLMJudge: LLMs for Relevance Judgments
arXiv 2024
Enhancing Conversational Search: Large Language Model-Aided Informative Query Rewriting
arXiv 2023
Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations
Frequent co-authors
10 from 11 papers