Probing LLMs for hate speech detection: strengths and vulnerabilities

Add to Favorites

Poster_Demo_Industry_Findings Virtual 2: Interpretability, Interactivity, and Analysis of Models for NLP (Poster)

Conference Room: Virtual-Gathertown

Conference Time: December 08, 14:00-15:30 (+08) (Asia/Singapore)

Global Time: December 08, 06:00-07:30 UTC / 06:00-07:30 GMT

TLDR:

You can open the #paper-2016 channel in a separate window.

Abstract: