Attribution
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly
arXiv 2025
A Multi-Faceted Evaluation Framework for Assessing Synthetic Data Generated by Large Language Models
arXiv 2024
Sources of Hallucination by Large Language Models on Inference Tasks
arXiv 2023
from 3 papers
Mark Steedman
Ginny Wong
Jipeng Zhang
Mark Johnson
Mohammad Javad Hosseini
Nick McKenna
Pasquale Minervini
Rohit Saxena
Simon See
Tianyi Li