INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations

Anil Ramakrishna, Rahul Gupta, Jens Lehmann, Morteza Ziyadi

Findings: Resources and Evaluation Findings Paper

Poster_Demo_Industry_Findings Virtual 7: Resources and Evaluation (Poster)
Conference Room: Virtual-Gathertown
Conference Time: December 10, 11:00-12:30 (+08) (Asia/Singapore)
Global Time: December 10, Poster_Demo_Industry_Findings Virtual 7 (03:00-04:30 UTC)
TLDR:
You can open the #paper-4970 channel in a separate window.
Abstract: