Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks
Andrea Sottana, Bin Liang, Kai Zou, Zheng Yuan
Main: Resources and Evaluation Main-poster Paper
Poster_Demo_Industry Hybrid 3: Resources and Evaluation (Poster)
Conference Room: East Foyer (Virtual)
Conference Time: December 08, 16:00-17:30 (+08) (Asia/Singapore)
Global Time: December 08, Poster_Demo_Industry Hybrid 3 (08:00-09:30 UTC)
Abstract: