Probing LLMs for hate speech detection: strengths and vulnerabilities
Sarthak Roy, Ashish Harshvardhan, Animesh Mukherjee, Punyajoy Saha
Findings: Interpretability, Interactivity, and Analysis of Models for NLP Findings Paper
Poster_Demo_Industry_Findings Virtual 2: Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Conference Room: Virtual-Gathertown
Conference Time: December 08, 14:00-15:30 (+08) (Asia/Singapore)
Global Time: December 08, Poster_Demo_Industry_Findings Virtual 2 (06:00-07:30 UTC)
TLDR:
You can open the
#paper-2016
channel in a separate window.
Abstract: