Canyu Chen
- Papers
- 8
Cite
Notes
Only stored in your browser.
8papers
Authored papers
8From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
arXiv 2024
Introducing v0.5 of the AI Safety Benchmark from MLCommons
arXiv 2024
ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?
arXiv 2024
Can Large Language Model Agents Simulate Human Trust Behavior?
arXiv 2024
Can Editing LLMs Inject Harm?
arXiv 2024
Can Knowledge Editing Really Correct Hallucinations?
arXiv 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
arXiv 2024
Can LLM-Generated Misinformation Be Detected?
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 8 papers