SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Yi Dong, Zhilin Wang, Makesh Narsimhan Sreedhar, Xianchao Wu, Oleksii Kuchaiev
Findings: Theme Track: Large Language Models and the Future of NLP Findings Paper
Poster_Demo_Industry_Findings Virtual 1: Theme Track: Large Language Models and the Future of NLP (Poster)
Conference Room: Virtual-Gathertown
Conference Time: December 08, 11:00-12:30 (+08) (Asia/Singapore)
Global Time: December 08, Poster_Demo_Industry_Findings Virtual 1 (03:00-04:30 UTC)
TLDR:
You can open the
#paper-1165
channel in a separate window.
Abstract: