Crystina Zhang
3 papers
Authored papers
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent
arXiv 2025
Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval
arXiv 2025
Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10 from 3 papers