INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations
Anil Ramakrishna, Rahul Gupta, Jens Lehmann, Morteza Ziyadi
Findings: Resources and Evaluation Findings Paper
Poster_Demo_Industry_Findings Virtual 7: Resources and Evaluation (Poster)
Conference Room: Virtual-Gathertown
Conference Time: December 10, 11:00-12:30 (+08) (Asia/Singapore)
Global Time: December 10, Poster_Demo_Industry_Findings Virtual 7 (03:00-04:30 UTC)
TLDR:
You can open the
#paper-4970
channel in a separate window.
Abstract: